match no. | target id | target length | alignment length | probability | E-value | coverage | match description |
1 | cd09747 | 378 | 354 | 100.0 | 1.3E-47 | [ ---------------------------------------- ] | Csx1_III-U | CRISPR/Cas system-associated protein Csx1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; Some proteins could have an additional fusion with RecB-family nuclease domain; Core domain appears to have a Rossmann-like fold; loosely associated with CRISPR/Cas systems; also known as Cas02710 family |
2 | pfam09670 | 379 | 354 | 100.0 | 2.6E-47 | [ ---------------------------------------- ] | Cas_Cas02710 | CRISPR-associated protein (Cas_Cas02710). Members of this family are found, exclusively in the vicinity of CRISPR repeats and other CRISPR-associated (cas) genes, in Methanothermobacter thermautotrophicus (Methanobacterium thermoformicicum), Thermus thermophilus (Deinococcus-Thermus), Chloroflexus aurantiacus (Chloroflexi), and Thermomicrobium roseum (Thermomicrobia). |
3 | TIGR02710 | 380 | 354 | 100.0 | 2E-41 | [ ---------------------------------------- ] | TIGR02710 | CRISPR-associated protein, TIGR02710 family. Members of this family are found, exclusively in the vicinity of CRISPR repeats and other CRISPR-associated (cas) genes, in Methanothermobacter thermautotrophicus (Archaea), Thermus thermophilus (Deinococcus-Thermus), Chloroflexus aurantiacus (Chloroflexi), and Thermomicrobium roseum (Thermomicrobia). |
4 | cd09702 | 378 | 355 | 100.0 | 5.6E-40 | [ ---------------------------------------- ] | Csx1_III-U | CRISPR/Cas system-associated protein Csx1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; Some proteins could have an additional fusion with RecB-family nuclease domain; Core domain appears to have a Rossmann-like fold; loosely associated with CRISPR/Cas systems; also known as TIGR02710 family |
5 | pfam09659 | 382 | 294 | 99.9 | 1.7E-22 | [ -------------------------------- ] | Cas_Csm6 | CRISPR-associated protein (Cas_Csm6). Clusters of short DNA repeats with nonhomologous spacers, which are found at regular intervals in the genomes of phylogenetically distinct prokaryotic species, comprise a family with recognisable features. This family is known as CRISPR (short for Clustered, Regularly Interspaced Short Palindromic Repeats). A number of protein families appear only in association with these repeats and are designated Cas (CRISPR-Associated) proteins. |
6 | cd09699 | 360 | 311 | 99.9 | 2.4E-22 | [ ----------------------------------- ] | Csm6_III-A | CRISPR/Cas system-associated protein Csm6. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; loosely associated with CRISPR/Cas systems |
7 | cd09746 | 382 | 293 | 99.9 | 5.5E-22 | [ -------------------------------- ] | Csm6_III-A | CRISPR/Cas system-associated protein Csm6. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; loosely associated with CRISPR/Cas systems |
8 | TIGR02672 | 362 | 311 | 99.9 | 3.6E-22 | [ ----------------------------------- ] | cas_csm6 | CRISPR type III-A/MTUBE-associated protein Csm6. Members of this family as found in CRISPR-associated (cas) gene regions in Streptococcus thermophilus CNRZ1066, Staphylococcus epidermidis RP62A, and Mycobacterium tuberculosis (strains CDC1551 and H37Rv), as part of Mtube-type CRISPR/Cas systems. CRISPR is a widespread form of direct repeat found in archaea and bacteria, with distinctive subtypes each of which has a characteristic sporadic distribution. |
9 | cd09723 | 132 | 124 | 97.8 | 0.00037 | [------------------ ] | Csx1_III-U | CRISPR/Cas system-associated protein Csx1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; Some proteins could have an additional fusion with RecB-family nuclease domain; Core domain appears to have a Rossmann-like fold; loosely associated with CRISPR/Cas systems; also known as csx13 family |
10 | pfam09623 | 225 | 136 | 97.7 | 0.00037 | [------------------ ] | Cas_NE0113 | CRISPR-associated protein NE0113 (Cas_NE0113). Members of this minor CRISPR-associated (Cas) protein family are encoded in cas gene clusters in Vibrio vulnificus YJ016, Nitrosomonas europaea ATCC 19718, Mannheimia succiniciproducens MBEL55E, and Verrucomicrobium spinosum. |
11 | cd09741 | 219 | 57 | 97.4 | 0.003 | [ ------- ] | Csx1_III-U | CRISPR/Cas system-associated protein Csx1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; Some proteins could have an additional fusion with RecB-family nuclease domain; Core domain appears to have a Rossmann-like fold; loosely associated with CRISPR/Cas systems; also known as NE0113 family |
12 | cd09742 | 183 | 106 | 97.2 | 0.0043 | [ ------------ ] | Csm6_III-A | CRISPR/Cas system-associated protein Csm6. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; loosely associated with CRISPR/Cas systems; also known as APE2256 family |
13 | pfam09651 | 136 | 103 | 97.0 | 0.011 | [ ------------ ] | Cas_APE2256 | CRISPR-associated protein (Cas_APE2256). This entry represents a conserved region of about 150 amino acids found in at least five archaeal and three bacterial species. These species all contain CRISPRs (Clustered Regularly Interspaced Short Palindromic Repeats). In six of eight species, the protein is encoded the vicinity of a CRISPR/Cas locus. |
14 | TIGR03642 | 124 | 101 | 96.9 | 0.021 | [ ------------- ] | cas_csx14 | CRISPR-associated protein, Csx14 family. This model describes a protein N-terminal protein sequence domain strictly associated with CRISPR and CRISPR-associated protein systems. This model and TIGR02584 identify two separate clades from a larger homology domain family, both CRISPR-associated, while other homologs are found that may not be. Members are found in bacteria that include Pelotomaculum thermopropionicum SI, Thermoanaerobacter tengcongensis MB4, and Roseiflexus sp. RS-1, and in archaea that include Thermoplasma volcanium, Picrophilus torridus, and Methanospirillum hungatei. The molecular function is unknown. |
15 | pfam09002 | 379 | 94 | 96.9 | 0.011 | [ ------------- ] | DUF1887 | Domain of unknown function (DUF1887). This domain is found in a set of hypothetical bacterial proteins. |
16 | cd09732 | 221 | 100 | 95.6 | 0.15 | [ ----------- ] | Csx1_III-U | CRISPR/Cas system-associated protein Csx1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; Some proteins could have an additional fusion with RecB-family nuclease domain; Core domain appears to have a Rossmann-like fold; loosely associated with CRISPR/Cas systems; also known as TM1812 family |
17 | pfam06956 | 183 | 127 | 94.8 | 0.55 | [ --------------- ] | RtcR | Regulator of RNA terminal phosphate cyclase. RtcR is a sigma54-dependent enhancer binding protein that activates transcription of the rtcBA operon. The product of the rtcA gene is an RNA 3'-terminal phosphate cyclase. This domain is found at the N terminus of the RtcR sequence. RtcR, and other sigma54-dependent activators, contain pfam00158 in the central region of the protein sequence. |
18 | pfam09455 | 372 | 94 | 93.5 | 6.5 | [ ---------- ] | Cas_DxTHG | CRISPR-associated (Cas) DxTHG family. CRISPR is a term for Clustered Regularly Interspaced Short Palidromic Repeats. A number of protein families appear only in association with these repeats and are designated Cas (CRISPR associated) proteins. The family describes Cas proteins of about 400 residues that include the motif |
19 | TIGR02619 | 149 | 104 | 92.8 | 1.3 | [ ------------ ] | TIGR02619 | putative CRISPR-associated protein, APE2256 family. This model represents a conserved domain of about 150 amino acids found in at least five archaeal species and three bacterial species, exclusively in species with CRISPR (Clustered Regularly Interspaced Short Palidromic Repeats). In six of eight species, the member of this family is in the vicinity of a CRISPR/Cas locus. |
20 | TIGR02221 | 218 | 42 | 92.4 | 3.1 | [ ----- ] | cas_TM1812 | CRISPR-associated protein, TM1812 family. CRISPR is a term for Clustered Regularly Interspaced Short Palidromic Repeats. A number of protein families appear only in association with these repeats and are designated Cas (CRISPR associated) proteins. This family, represented by TM1812 of Thermotoga maritima, is found also in Vibrio vulnificus YJ016, Nitrosomonas europaea ATCC 19718, a large plasmid of Synechocystis sp. PCC 6803, and Fibrobacter succinogenes S85. |
21 | COG4006 | 278 | 102 | 92.0 | 1.1 | [ ------------ ] | COG4006 | CRISPR/Cas system-associated protein Csm6, COG1517 family |
22 | TIGR01884 | 203 | 111 | 91.9 | 1.5 | [ -------------- ] | cas_HTH | CRISPR locus-related DNA-binding protein. Most but not all examples of this family are associated with CRISPR loci, a combination of DNA repeats and characteristic proteins encoded near the repeat cluster. The C-terminal region of this protein is homologous to DNA-binding helix-turn-helix domains with predicted transcriptional regulatory activity. |
23 | cd09694 | 181 | 103 | 91.1 | 2.1 | [ ------------ ] | Csm6_III-A | CRISPR/Cas system-associated protein Csm6. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; loosely associated with CRISPR/Cas systems |
24 | cd09668 | 214 | 16 | 91.0 | 2.5 | [ - ] | Csx1_III-U | CRISPR/Cas system-associated protein Csx1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; Some proteins could have an additional fusion with RecB-family nuclease domain; Core domain appears to have a Rossmann-like fold; loosely associated with CRISPR/Cas systems; also known as TM1812 family |
25 | TIGR02584 | 209 | 106 | 86.5 | 13 | [ ----------- ] | cas_NE0113 | CRISPR-associated protein, NE0113 family. Members of this minor CRISPR-associated (Cas) protein family are found in cas gene clusters in Vibrio vulnificus YJ016, Nitrosomonas europaea ATCC 19718, Mannheimia succiniciproducens MBEL55E, and Verrucomicrobium spinosum. |
26 | cd09655 | 198 | 113 | 84.4 | 9.8 | [ -------------- ] | CasRa_I-A | CRISPR/Cas system-associated transcriptional regulator CasRa. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Predicted transcriptional regulator of CRISPR/Cas system |
27 | cd09686 | 209 | 110 | 82.3 | 12 | [ ----------- ] | Csx1_III-U | CRISPR/Cas system-associated protein Csx1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; Some proteins could have an additional fusion with RecB-family nuclease domain; Core domain appears to have a Rossmann-like fold; loosely associated with CRISPR/Cas systems; also known as NE0113 family |
28 | cd09728 | 400 | 137 | 73.5 | 49 | [---------------- ] | Csx1_III-U | CRISPR/Cas system-associated protein Csx1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; Some proteins could have an additional fusion with RecB-family nuclease domain; Core domain appears to have a Rossmann-like fold; loosely associated with CRISPR/Cas systems; also known as DxTHG family |
29 | TIGR01897 | 410 | 100 | 68.6 | 64 | [ ----------- ] | cas_MJ1666 | CRISPR-associated protein, MJ1666 family. CRISPR is a term for Clustered, Regularly Interspaced Short Palidromic Repeats. A number of protein families appear only in association with these repeats and are designated Cas (CRISPR-Associated) proteins. This model describes a Cas protein about 400 residues in length, found mostly in the Archaea but also in Aquifex. |
30 | cd09660 | 394 | 114 | 66.4 | 73 | [ ------------- ] | Csx1_III-U | CRISPR/Cas system-associated protein Csx1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; Some proteins could have an additional fusion with RecB-family nuclease domain; Core domain appears to have a Rossmann-like fold; loosely associated with CRISPR/Cas systems; also known as MJ1666 family |
31 | COG4650 | 531 | 113 | 63.5 | 44 | [ ------------ ] | RtcR | Sigma54-dependent transcription regulator containing an AAA-type ATPase domain and a DNA-binding domain |
32 | cd00006 | 122 | 55 | 56.2 | 26 | [ ------ ] | PTS_IIA_man | PTS_IIA, PTS system, mannose/sorbose specific IIA subunit. The bacterial phosphoenolpyruvate: sugar phosphotransferase system (PTS) is a multi-protein system involved in the regulation of a variety of metabolic and transcriptional processes. This family is one of four structurally and functionally distinct group IIA PTS system cytoplasmic enzymes, necessary for the uptake of carbohydrates across the cytoplasmic membrane and their phosphorylation. IIA subunits receive phosphoryl groups from HPr and transfer them to IIB subunits, which in turn phosphorylate the substrate. |
33 | COG1448 | 396 | 138 | 51.1 | 32 | [ ---------------- ] | TyrB | Aspartate/tyrosine/aromatic aminotransferase |
34 | pfam05168 | 118 | 27 | 42.3 | 49 | [ --- ] | HEPN | HEPN domain. |
35 | pfam08459 | 154 | 46 | 40.9 | 70 | [ ----- ] | UvrC_HhH_N | UvrC Helix-hairpin-helix N-terminal. This domain is found in the C subunits of the bacterial and archaeal UvrABC system which catalyses nucleotide excision repair in a multi-step process. UvrC catalyses the first incision on the fourth or fifth phosphodiester bond 3' and on the eighth phosphodiester bond 5' from the damage that is to be excised. The domain described here is found to the N-terminus of a helix hairpin helix (pfam00633) motif and also co-occurs with the pfam01541 catalytic domain which is found at the N-terminus of the same proteins. |
36 | cd04502 | 171 | 32 | 34.2 | 2.2E+02 | [ ---- ] | SGNH_hydrolase_like_7 | Members of the SGNH-hydrolase superfamily, a diverse family of lipases and esterases. The tertiary fold of the enzyme is substantially different from that of the alpha/beta hydrolase family and unique among all known hydrolases; its active site closely resembles the Ser-His-Asp(Glu) triad from other serine hydrolases, but may lack the carboxlic acid. |
37 | COG0322 | 581 | 135 | 31.2 | 5.3E+02 | [ --------------- ] | UvrC | Excinuclease UvrABC, nuclease subunit |
38 | COG0519 | 315 | 51 | 31.1 | 1.3E+02 | [ ------ ] | GuaA2 | GMP synthase, PP-ATPase domain/subunit |
39 | cd04506 | 204 | 60 | 31.0 | 67 | [------ ] | SGNH_hydrolase_YpmR_like | Members of the SGNH-hydrolase superfamily, a diverse family of lipases and esterases. The tertiary fold of the enzyme is substantially different from that of the alpha/beta hydrolase family and unique among all known hydrolases; its active site closely resembles the Ser-His-Asp(Glu) triad from other serine hydrolases, but may lack the carboxlic acid. This subfamily contains sequences similar to Bacillus YpmR. |
40 | cd06067 | 136 | 33 | 30.2 | 40 | [ --- ] | H2MP_MemB-H2evol | Endopeptidases belonging to membrane-bound hydrogen evolving hydrogenase group. In hydrogenase 3 from E coli, the maturation of the large subunit (HycE) requires the cleavage of a C-terminal peptide by the endopeptidase HycI, before the final formation of the |
41 | PRK09257 | 396 | 134 | 30.1 | 1.8E+02 | [ ---------------- ] | PRK09257 | aromatic amino acid aminotransferase; Provisional |
42 | PRK00558 | 598 | 51 | 29.6 | 1E+02 | [ ------ ] | uvrC | excinuclease ABC subunit C; Validated |
43 | pfam07067 | 236 | 115 | 27.9 | 2.5E+02 | [ ------------- ] | DUF1340 | Protein of unknown function (DUF1340). This family consists of several hypothetical Streptococcus thermophilus bacteriophage proteins of around 235 residues in length. The function of this family is unknown. |
44 | TIGR02153 | 404 | 59 | 27.4 | 2.4E+02 | [ ------- ] | gatD_arch | glutamyl-tRNA(Gln) amidotransferase, subunit D. This peptide is found only in the Archaea. It is part of a heterodimer, with GatE (TIGR00134), that acts as an amidotransferase on misacylated Glu-tRNA(Gln) to produce Gln-tRNA(Gln). The analogous amidotransferase found in bacteria is the GatABC system, although GatABC homologs in the Archaea appear to act instead on Asp-tRNA(Asn). |
45 | COG2361 | 117 | 37 | 25.7 | 58 | [ ---- ] | COG2361 | Uncharacterized conserved protein, contains HEPN domain |
46 | PRK00074 | 511 | 53 | 25.5 | 1.7E+02 | [ ------ ] | guaA | GMP synthase; Reviewed |
47 | pfam01934 | 120 | 56 | 24.5 | 77 | [ ------ ] | DUF86 | Protein of unknown function DUF86. The function of members of this family is unknown. |
48 | TIGR03903 | 1266 | 93 | 24.2 | 4.2E+02 | [ ------------- ] | TOMM_kin_cyc | TOMM system kinase/cyclase fusion protein. This model represents proteins of 1350 in length, in multiple species of Burkholderia, in Acidovorax avenae subsp. citrulli AAC00-1 and Delftia acidovorans SPH-1, and in multiple copies in Sorangium cellulosum, in genomic neighborhoods that include a cyclodehydratase/docking scaffold fusion protein (TIGR03882) and a member of the thiazole/oxazole modified metabolite (TOMM) precursor family TIGR03795. It has a kinase domain in the N-terminal 300 amino acids, followed by a cyclase homology domain, followed by regions without named domain definitions. It is a probable bacteriocin-like metabolite biosynthesis protein. |
49 | PRK00035 | 333 | 76 | 24.1 | 5E+02 | [ --------- ] | hemH | ferrochelatase; Reviewed |
50 | cd08261 | 337 | 33 | 22.9 | 2E+02 | [ --- ] | Zn_ADH7 | Alcohol dehydrogenases of the MDR family. This group contains members identified as related to zinc-dependent alcohol dehydrogenase and other members of the MDR family. The medium chain dehydrogenases/reductase (MDR)/zinc-dependent alcohol dehydrogenase-like family, which contains the zinc-dependent alcohol dehydrogenase (ADH-Zn) and related proteins, is a diverse group of proteins related to the first identified member, class I mammalian ADH. MDRs display a broad range of activities and are distinguished from the smaller short chain dehydrogenases (~ 250 amino acids vs. the ~ 350 amino acids of the MDR). The MDR proteins have 2 domains: a C-terminal NAD(P)-binding Rossmann fold domain of a beta-alpha form and an N-terminal catalytic domain with distant homology to GroES. The MDR group includes various activities, including the founding alcohol dehydrogenase (ADH), quinone reductase, sorbitol dehydrogenase, formaldehyde dehydrogenase, butanediol DH, ketose reductase, cinnamyl reductase, and numerous others. The zinc-dependent alcohol dehydrogenases (ADHs) catalyze the NAD(P)(H)-dependent interconversion of alcohols to aldehydes or ketones. Active site zinc has a catalytic role, while structural zinc aids in stability. ADH-like proteins typically form dimers (typically higher plants, mammals) or tetramers (yeast, bacteria), and generally have 2 tightly bound zinc atoms per subunit. The active site zinc is coordinated by a histidine, two cysteines, and a water molecule. The second zinc seems to play a structural role, affects subunit interactions, and is typically coordinated by 4 cysteines. |
51 | pfam04536 | 120 | 98 | 21.2 | 4.6E+02 | [ ----------- ] | TPM | TLP18.3, Psb32 and MOLO-1 founding proteins of phosphatase. This family has a Rossmann-like fold. It has phosphatase activity. |
52 | cd09671 | 346 | 44 | 21.0 | 8.6E+02 | [ ----- ] | Csx1_III-U | CRISPR/Cas system-associated protein Csx1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; Some proteins could have an additional fusion with RecB-family nuclease domain; Core domain appears to have a Rossmann-like fold; loosely associated with CRISPR/Cas systems; also known as DxTHG family |
53 | PRK00919 | 307 | 51 | 20.5 | 2.4E+02 | [ ------ ] | PRK00919 | GMP synthase subunit B; Validated |