match no.target idtarget lengthalignment lengthprobabilityE-valuecoveragematch description
1pfam09002379335100.01E-33[-------------------------------------------------]DUF1887Domain of unknown function (DUF1887). This domain is found in a set of hypothetical bacterial proteins.
2pfam0967037910397.95.1E-05[   ---------------                               ]Cas_Cas02710CRISPR-associated protein (Cas_Cas02710). Members of this family are found, exclusively in the vicinity of CRISPR repeats and other CRISPR-associated (cas) genes, in Methanothermobacter thermautotrophicus (Methanobacterium thermoformicicum), Thermus thermophilus (Deinococcus-Thermus), Chloroflexus aurantiacus (Chloroflexi), and Thermomicrobium roseum (Thermomicrobia).
3cd0974737810397.87.3E-05[   ---------------                               ]Csx1_III-UCRISPR/Cas system-associated protein Csx1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; Some proteins could have an additional fusion with RecB-family nuclease domain; Core domain appears to have a Rossmann-like fold; loosely associated with CRISPR/Cas systems; also known as Cas02710 family
4TIGR027103809697.60.00014[   --------------                                ]TIGR02710CRISPR-associated protein, TIGR02710 family. Members of this family are found, exclusively in the vicinity of CRISPR repeats and other CRISPR-associated (cas) genes, in Methanothermobacter thermautotrophicus (Archaea), Thermus thermophilus (Deinococcus-Thermus), Chloroflexus aurantiacus (Chloroflexi), and Thermomicrobium roseum (Thermomicrobia).
5cd097023788696.70.0027[   ------------                                  ]Csx1_III-UCRISPR/Cas system-associated protein Csx1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; Some proteins could have an additional fusion with RecB-family nuclease domain; Core domain appears to have a Rossmann-like fold; loosely associated with CRISPR/Cas systems; also known as TIGR02710 family
6cd0972313211096.40.027[---------------                                  ]Csx1_III-UCRISPR/Cas system-associated protein Csx1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; Some proteins could have an additional fusion with RecB-family nuclease domain; Core domain appears to have a Rossmann-like fold; loosely associated with CRISPR/Cas systems; also known as csx13 family
7TIGR036421248895.60.13[   ------------                                  ]cas_csx14CRISPR-associated protein, Csx14 family. This model describes a protein N-terminal protein sequence domain strictly associated with CRISPR and CRISPR-associated protein systems. This model and TIGR02584 identify two separate clades from a larger homology domain family, both CRISPR-associated, while other homologs are found that may not be. Members are found in bacteria that include Pelotomaculum thermopropionicum SI, Thermoanaerobacter tengcongensis MB4, and Roseiflexus sp. RS-1, and in archaea that include Thermoplasma volcanium, Picrophilus torridus, and Methanospirillum hungatei. The molecular function is unknown.
8pfam096511369995.40.31[   --------------                                ]Cas_APE2256CRISPR-associated protein (Cas_APE2256). This entry represents a conserved region of about 150 amino acids found in at least five archaeal and three bacterial species. These species all contain CRISPRs (Clustered Regularly Interspaced Short Palindromic Repeats). In six of eight species, the protein is encoded the vicinity of a CRISPR/Cas locus.
9cd0974218310795.20.26[   ---------------                               ]Csm6_III-ACRISPR/Cas system-associated protein Csm6. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; loosely associated with CRISPR/Cas systems; also known as APE2256 family
10pfam0962322513182.613[-----------------                                ]Cas_NE0113CRISPR-associated protein NE0113 (Cas_NE0113). Members of this minor CRISPR-associated (Cas) protein family are encoded in cas gene clusters in Vibrio vulnificus YJ016, Nitrosomonas europaea ATCC 19718, Mannheimia succiniciproducens MBEL55E, and Verrucomicrobium spinosum.
11cd096551988881.818[   ------------                                  ]CasRa_I-ACRISPR/Cas system-associated transcriptional regulator CasRa. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Predicted transcriptional regulator of CRISPR/Cas system
12cd097412194377.815[           ------                                ]Csx1_III-UCRISPR/Cas system-associated protein Csx1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; Some proteins could have an additional fusion with RecB-family nuclease domain; Core domain appears to have a Rossmann-like fold; loosely associated with CRISPR/Cas systems; also known as NE0113 family
13pfam07788705477.85.1[                                    -------      ]DUF1626Protein of unknown function (DUF1626). This is a family consisting of sequences from hypothetical proteins of unknown function expressed by certain species of archaebacteria. One member is thought to be similar to tropomyosin.
14TIGR018842038674.432[   ------------                                  ]cas_HTHCRISPR locus-related DNA-binding protein. Most but not all examples of this family are associated with CRISPR loci, a combination of DNA repeats and characteristic proteins encoded near the repeat cluster. The C-terminal region of this protein is homologous to DNA-binding helix-turn-helix domains with predicted transcriptional regulatory activity.
15TIGR0261914910773.534[   --------------                                ]TIGR02619putative CRISPR-associated protein, APE2256 family. This model represents a conserved domain of about 150 amino acids found in at least five archaeal species and three bacterial species, exclusively in species with CRISPR (Clustered Regularly Interspaced Short Palidromic Repeats). In six of eight species, the member of this family is in the vicinity of a CRISPR/Cas locus.
16PRK146771076071.08.1[                                    -------      ]PRK14677hypothetical protein; Provisional
17COG40062785164.649[           -------                               ]COG4006CRISPR/Cas system-associated protein Csm6, COG1517 family
18cd097322219363.765[  -------------                                  ]Csx1_III-UCRISPR/Cas system-associated protein Csx1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; Some proteins could have an additional fusion with RecB-family nuclease domain; Core domain appears to have a Rossmann-like fold; loosely associated with CRISPR/Cas systems; also known as TM1812 family
19PRK146831222362.37.9[                                    --           ]PRK14683hypothetical protein; Provisional
20PRK124971192458.511[                                   ---           ]PRK12497hypothetical protein; Reviewed
21cd0969418110458.591[   ---------------                               ]Csm6_III-ACRISPR/Cas system-associated protein Csm6. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; loosely associated with CRISPR/Cas systems
22pfam03008952355.313[                                    --           ]DUF234Archaea bacterial proteins of unknown function.
23pfam01870893352.718[                                    ----         ]HjcArchaeal holliday junction resolvase (hjc). This family of archaebacterial proteins are holliday junction resolvases (hjc gene). The Holliday junction is an essential intermediate of homologous recombination. This protein is the archaeal equivalent of RuvC but is not sequence similar.
24pfam135881105451.936[                                   --------      ]HSDR_N_2Type I restriction enzyme R protein N terminus (HSDR_N). This family consists of a number of N terminal regions found in type I restriction enzyme R (HSDR) proteins. Restriction and modification (R/M) systems are found in a wide variety of prokaryotes and are thought to protect the host bacterium from the uptake of foreign DNA. Type I restriction and modification systems are encoded by three genes: hsdR, hsdM, and hsdS. The three polypeptides, HsdR, HsdM, and HsdS, often assemble to give an enzyme (R2M2S1) that modifies hemimethylated DNA and restricts unmethylated DNA.
25pfam083781112445.119[                                   ---           ]NERDNuclease-related domain. The nuclease-related domain (NERD) is found in a range of bacterial as well as archaeal and plant proteins. It has distant similarity to endonucleases (hence its name) and its predicted secondary structure is helix - sheet - sheet - sheet - sheet - weak sheet/long loop - helix - sheet - sheet. The majority of NERD-containing proteins are single-domain, but in several cases proteins containing NERD have additional domains which in 75% of cases are involved in DNA processing.
26cd005231233944.822[                                   -----         ]archeal_HJRHolliday junction resolvases (HJRs) are endonucleases that specifically resolve Holliday junction DNA intermediates during homologous recombination. HJR's occur in archaea, bacteria, and in the mitochondria of certain eukaryotes, however this CD includes only the archeal HJR's. The bacterial and archeal HJRs perform a similar function but differ in both sequence and structure. Structural similarity does however, exist between the archeal HJRs and type II restriction endonucleases, such as EcoRV, BglII, and Fok, and this similarity includes their active site configurations.
27COG07921142439.730[                                   ---           ]YraNPredicted endonuclease distantly related to archaeal Holliday junction resolvase
28pfam094553729837.998[  -------------                                  ]Cas_DxTHGCRISPR-associated (Cas) DxTHG family. CRISPR is a term for Clustered Regularly Interspaced Short Palidromic Repeats. A number of protein families appear only in association with these repeats and are designated Cas (CRISPR associated) proteins. The family describes Cas proteins of about 400 residues that include the motif
29PRK146861192437.735[                                   ---           ]PRK14686hypothetical protein; Provisional
30COG15911372337.632[                                    --           ]COG1591Holliday junction resolvase, archaeal type
31COG334022412036.42.4E+02[------------------                               ]PepEPeptidase E
32pfam136351542535.048[                                   ---           ]DUF4143Domain of unknown function (DUF4143). This domain is almost always found C-terminal to an ATPase core family.
33cd0854933211034.32.9E+02[-----------------                                ]G1PDH_relatedGlycerol-1-phosphate_dehydrogenase and related proteins. Bacterial and archeal glycerol-1-phosphate dehydrogenase-like oxidoreductases. The proteins have similarity with glycerol-1-phosphate dehydrogenase (G1PDH). G1PDH plays a role in the synthesis of phosphoglycerolipids in gram-positive bacterial species. It catalyzes the reversibly reduction of dihydroxyacetone phosphate (DHAP) to glycerol-1-phosphate (G1P) in a NADH-dependent manner. Its activity requires Ni++ ion. It also contains archaeal Sn-glycerol-1-phosphate dehydrogenase (Gro1PDH) that plays an important role in the formation of the enantiomeric configuration of the glycerophosphate backbone (sn-glycerol-1-phosphate) of archaeal ether lipids.
34PRK146811582133.855[                                    --           ]PRK14681hypothetical protein; Provisional
35PRK146781202333.268[                                    --           ]PRK14678hypothetical protein; Provisional
36pfam043131423833.197[                                   -----         ]HSDR_NType I restriction enzyme R protein N terminus (HSDR_N). This family consists of a number of N terminal regions found in type I restriction enzyme R (HSDR) proteins. Restriction and modification (R/M) systems are found in a wide variety of prokaryotes and are thought to protect the host bacterium from the uptake of foreign DNA. Type I restriction and modification systems are encoded by three genes: hsdR, hsdM, and hsdS. The three polypeptides, HsdR, HsdM, and HsdS, often assemble to give an enzyme (R2M2S1) that modifies hemimethylated DNA and restricts unmethylated DNA.
37pfam069561839132.886[  -------------                                  ]RtcRRegulator of RNA terminal phosphate cyclase. RtcR is a sigma54-dependent enhancer binding protein that activates transcription of the rtcBA operon. The product of the rtcA gene is an RNA 3'-terminal phosphate cyclase. This domain is found at the N terminus of the RtcR sequence. RtcR, and other sigma54-dependent activators, contain pfam00158 in the central region of the protein sequence.
38cd01719704528.41.4E+02[                               -------           ]Sm_GSm protein G. The eukaryotic Sm proteins (B/B', D1, D2, D3, E, F and G) assemble into a hetero-heptameric ring around the Sm site of the 2,2,7-trimethyl guanosine (m3G) capped U1, U2, U4 and U5 snRNAs (Sm snRNAs) forming the core of the snRNP particle. The snRNP particle, in turn, assembles with other components onto the pre-mRNA to form the spliceosome which is responsible for the excision of introns and the ligation of exons. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm subunit G binds subunits E and F to form a trimer which then assembles onto snRNA along with the D1/D2 and D3/B heterodimers forming a seven-membered ring structure.
39PRK146891242027.057[                                    --           ]PRK14689hypothetical protein; Provisional
40PRK146821172326.176[                                    --           ]PRK14682hypothetical protein; Provisional
41cd063563349625.459[--------------                                   ]PBP1_Amide_Urea_BP_likePeriplasmic component (FmdD) of an active transport system for short-chain amides and urea (FmdDEF). This group includes the type I periplasmic-binding proteins that are predicted to have a function similar to that of an active transport system for short chain amides and/or urea in bacteria and Archaea, by sequence comparison and phylogenetic analysis.
42PRK146801342522.894[                                   ---           ]PRK14680hypothetical protein; Provisional
43cd04895721321.955[                               --                ]ACT_ACR_1ACT domain-containing protein which is composed almost entirely of four ACT domain repeats (the "ACR" protein). This CD includes the N-terminal ACT domain, of a novel type of ACT domain-containing protein which is composed almost entirely of four ACT domain repeats (the "ACR" protein). ACR proteins, found only in Arabidopsis and Oryza, as yet, are proposed to function as novel regulatory or sensor proteins in plants. Nine ACR gene products have been described (ACR1-8 in Arabidopsis and OsARC1-9 in Oryza) and are represented in this CD. Members of this CD belong to the superfamily of ACT regulatory domains.
44pfam095722512521.11.3E+02[                                   ---           ]RE_XamIXamI restriction endonuclease. This family includes the XamI (recognizes GTCGAC but cleavage site unknown) restriction endonuclease.
45COG54932315620.71.3E+02[                                   --------      ]COG5493Uncharacterized protein
46PRK146751252220.61E+02[                                    --           ]PRK14675hypothetical protein; Provisional