match no.target idtarget lengthalignment lengthprobabilityE-valuecoveragematch description
1cd09732221216100.06.6E-61[ --------------------------------                ]Csx1_III-UCRISPR/Cas system-associated protein Csx1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; Some proteins could have an additional fusion with RecB-family nuclease domain; Core domain appears to have a Rossmann-like fold; loosely associated with CRISPR/Cas systems; also known as TM1812 family
2pfam09455372316100.07.2E-54[ ----------------------------------              ]Cas_DxTHGCRISPR-associated (Cas) DxTHG family. CRISPR is a term for Clustered Regularly Interspaced Short Palidromic Repeats. A number of protein families appear only in association with these repeats and are designated Cas (CRISPR associated) proteins. The family describes Cas proteins of about 400 residues that include the motif
3TIGR02221218188100.04.5E-45[ --------------------                            ]cas_TM1812CRISPR-associated protein, TM1812 family. CRISPR is a term for Clustered Regularly Interspaced Short Palidromic Repeats. A number of protein families appear only in association with these repeats and are designated Cas (CRISPR associated) proteins. This family, represented by TM1812 of Thermotoga maritima, is found also in Vibrio vulnificus YJ016, Nitrosomonas europaea ATCC 19718, a large plasmid of Synechocystis sp. PCC 6803, and Fibrobacter succinogenes S85.
4cd09668214186100.07.4E-42[ --------------------                            ]Csx1_III-UCRISPR/Cas system-associated protein Csx1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; Some proteins could have an additional fusion with RecB-family nuclease domain; Core domain appears to have a Rossmann-like fold; loosely associated with CRISPR/Cas systems; also known as TM1812 family
5cd0967134627099.97.9E-21[ -------------------------------                 ]Csx1_III-UCRISPR/Cas system-associated protein Csx1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; Some proteins could have an additional fusion with RecB-family nuclease domain; Core domain appears to have a Rossmann-like fold; loosely associated with CRISPR/Cas systems; also known as DxTHG family
6TIGR02549212098.71.1E-09[             --                                  ]CRISPR_DxTHGCRISPR-associated DxTHG motif protein. This model describes a short region highly conserved between two otherwise substantially different CRISPR-associated (cas) proteins, TIGR02221 and TIGR01987. This region includes the motif
7TIGR0189741018498.52E-06[ -------------------                             ]cas_MJ1666CRISPR-associated protein, MJ1666 family. CRISPR is a term for Clustered, Regularly Interspaced Short Palidromic Repeats. A number of protein families appear only in association with these repeats and are designated Cas (CRISPR-Associated) proteins. This model describes a Cas protein about 400 residues in length, found mostly in the Archaea but also in Aquifex.
8cd0966039415098.51.5E-06[ ----------------                                ]Csx1_III-UCRISPR/Cas system-associated protein Csx1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; Some proteins could have an additional fusion with RecB-family nuclease domain; Core domain appears to have a Rossmann-like fold; loosely associated with CRISPR/Cas systems; also known as MJ1666 family
9cd0972840015398.53.1E-06[ ----------------                                ]Csx1_III-UCRISPR/Cas system-associated protein Csx1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; Some proteins could have an additional fusion with RecB-family nuclease domain; Core domain appears to have a Rossmann-like fold; loosely associated with CRISPR/Cas systems; also known as DxTHG family
10pfam0967037925597.80.027[    ------------------------------               ]Cas_Cas02710CRISPR-associated protein (Cas_Cas02710). Members of this family are found, exclusively in the vicinity of CRISPR repeats and other CRISPR-associated (cas) genes, in Methanothermobacter thermautotrophicus (Methanobacterium thermoformicicum), Thermus thermophilus (Deinococcus-Thermus), Chloroflexus aurantiacus (Chloroflexi), and Thermomicrobium roseum (Thermomicrobia).
11COG151740613397.50.00072[----------------                                 ]Csx1CRISPR/Cas system-associated protein Csx1, contains CARF domain
12cd0974737825597.40.035[    ------------------------------               ]Csx1_III-UCRISPR/Cas system-associated protein Csx1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; Some proteins could have an additional fusion with RecB-family nuclease domain; Core domain appears to have a Rossmann-like fold; loosely associated with CRISPR/Cas systems; also known as Cas02710 family
13pfam0900237912196.80.025[     ---------------                             ]DUF1887Domain of unknown function (DUF1887). This domain is found in a set of hypothetical bacterial proteins.
14TIGR0271038014295.70.34[    -----------------                            ]TIGR02710CRISPR-associated protein, TIGR02710 family. Members of this family are found, exclusively in the vicinity of CRISPR repeats and other CRISPR-associated (cas) genes, in Methanothermobacter thermautotrophicus (Archaea), Thermus thermophilus (Deinococcus-Thermus), Chloroflexus aurantiacus (Chloroflexi), and Thermomicrobium roseum (Thermomicrobia).
15cd0970237815095.40.11[    ------------------                           ]Csx1_III-UCRISPR/Cas system-associated protein Csx1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; Some proteins could have an additional fusion with RecB-family nuclease domain; Core domain appears to have a Rossmann-like fold; loosely associated with CRISPR/Cas systems; also known as TIGR02710 family
16TIGR0188420311290.62.4[   --------------                                ]cas_HTHCRISPR locus-related DNA-binding protein. Most but not all examples of this family are associated with CRISPR loci, a combination of DNA repeats and characteristic proteins encoded near the repeat cluster. The C-terminal region of this protein is homologous to DNA-binding helix-turn-helix domains with predicted transcriptional regulatory activity.
17pfam096511368790.52.5[     ----------                                  ]Cas_APE2256CRISPR-associated protein (Cas_APE2256). This entry represents a conserved region of about 150 amino acids found in at least five archaeal and three bacterial species. These species all contain CRISPRs (Clustered Regularly Interspaced Short Palindromic Repeats). In six of eight species, the protein is encoded the vicinity of a CRISPR/Cas locus.
18cd096551987788.53.3[        ---------                                ]CasRa_I-ACRISPR/Cas system-associated transcriptional regulator CasRa. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Predicted transcriptional regulator of CRISPR/Cas system
19cd0974218310187.54.1[    -----------                                  ]Csm6_III-ACRISPR/Cas system-associated protein Csm6. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; loosely associated with CRISPR/Cas systems; also known as APE2256 family
20pfam0962322512285.411[    -------------                                ]Cas_NE0113CRISPR-associated protein NE0113 (Cas_NE0113). Members of this minor CRISPR-associated (Cas) protein family are encoded in cas gene clusters in Vibrio vulnificus YJ016, Nitrosomonas europaea ATCC 19718, Mannheimia succiniciproducens MBEL55E, and Verrucomicrobium spinosum.
21TIGR0364212410584.64.3[   ------------                                  ]cas_csx14CRISPR-associated protein, Csx14 family. This model describes a protein N-terminal protein sequence domain strictly associated with CRISPR and CRISPR-associated protein systems. This model and TIGR02584 identify two separate clades from a larger homology domain family, both CRISPR-associated, while other homologs are found that may not be. Members are found in bacteria that include Pelotomaculum thermopropionicum SI, Thermoanaerobacter tengcongensis MB4, and Roseiflexus sp. RS-1, and in archaea that include Thermoplasma volcanium, Picrophilus torridus, and Methanospirillum hungatei. The molecular function is unknown.
22cd04223951674.11.6[             -                                   ]N2OR_CThe C-terminal cupredoxin domain of Nitrous-oxide reductase. Nitrous-oxide reductase participates in nitrogen metabolism and catalyzes the last step in dissimilatory nitrate reduction, the two-electron reduction of N2O to N2. It contains copper ions as cofactors in the form of a binuclear CuA center at the site of electron entry and a tetranuclear CuZ centre at the active site. The C-terminus of Nitrous-oxide reductase is a cupredoxin domain.
23cd0972313210673.316[   ------------                                  ]Csx1_III-UCRISPR/Cas system-associated protein Csx1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; Some proteins could have an additional fusion with RecB-family nuclease domain; Core domain appears to have a Rossmann-like fold; loosely associated with CRISPR/Cas systems; also known as csx13 family
24cd0974121911861.41E+02[    -------------                                ]Csx1_III-UCRISPR/Cas system-associated protein Csx1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; Some proteins could have an additional fusion with RecB-family nuclease domain; Core domain appears to have a Rossmann-like fold; loosely associated with CRISPR/Cas systems; also known as NE0113 family
25TIGR0135734413960.670[    ----------------                             ]aroB3-dehydroquinate synthase. This model represents 3-dehydroquinate synthase, the enzyme catalyzing the second of seven steps in the shikimate pathway of chorismate biosynthesis. Chorismate is the last common intermediate in the biosynthesis of all three aromatic amino acids.
26TIGR03504442745.633[                              --                 ]FimV_CtermFimV C-terminal domain. This protein is found at the extreme C-terminus of FimV from Pseudomonas aeruginosa, and of TspA of Neisseria meningitidis. Disruption of the former blocks twitching motility from type IV pili; Semmler, et al. suggest a role in peptidoglycan layer remodelling required by type IV fimbrial systems.
27COG078828712043.020[     -------------                               ]PurUFormyltetrahydrofolate hydrolase
28pfam13041502241.433[                              --                 ]PPR_2PPR repeat family. This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.
29PRK0602728611838.659[     -------------                               ]purUformyltetrahydrofolate deformylase; Reviewed
30TIGR026191499338.61.9E+02[     -----------                                 ]TIGR02619putative CRISPR-associated protein, APE2256 family. This model represents a conserved domain of about 150 amino acids found in at least five archaeal species and three bacterial species, exclusively in species with CRISPR (Clustered Regularly Interspaced Short Palidromic Repeats). In six of eight species, the member of this family is in the vicinity of a CRISPR/Cas locus.
31TIGR037063007136.01.5E+02[     ---------                                   ]exo_poly_onlyexopolyphosphatase. It appears that a single enzyme may act as both exopolyphosphatase (Ppx) and guanosine pentaphosphate phosphohydrolase (GppA) in a number of species. Members of the seed alignment use to define this exception-level model are encoded adjacent to a polyphosphate kinase 1 gene, and the trusted cutoff is set high enough (425) that no genome has a second hit. Therefore all members may be presumed to at least share exopolyphospatase activity, and may lack GppA activity. GppA acts in the stringent response.
32cd000061225534.71.5E+02[        ------                                   ]PTS_IIA_manPTS_IIA, PTS system, mannose/sorbose specific IIA subunit. The bacterial phosphoenolpyruvate: sugar phosphotransferase system (PTS) is a multi-protein system involved in the regulation of a variety of metabolic and transcriptional processes. This family is one of four structurally and functionally distinct group IIA PTS system cytoplasmic enzymes, necessary for the uptake of carbohydrates across the cytoplasmic membrane and their phosphorylation. IIA subunits receive phosphoryl groups from HPr and transfer them to IIB subunits, which in turn phosphorylate the substrate.
33cd0794127311533.438[  --------------                                 ]DRE_TIM_LeuA3Desulfobacterium autotrophicum LeuA3 and related proteins, N-terminal catalytic TIM barrel domain. Desulfobacterium autotrophicum LeuA3 is sequence-similar to alpha-isopropylmalate synthase (LeuA) but its exact function is unknown. Members of this family have an N-terminal TIM barrel domain that belongs to the DRE-TIM metallolyase superfamily. DRE-TIM metallolyases include 2-isopropylmalate synthase (IPMS), alpha-isopropylmalate synthase (LeuA), 3-hydroxy-3-methylglutaryl-CoA lyase, homocitrate synthase, citramalate synthase, 4-hydroxy-2-oxovalerate aldolase, re-citrate synthase, transcarboxylase 5S, pyruvate carboxylase, AksA, and FrbC. These members all share a conserved triose-phosphate isomerase (TIM) barrel domain consisting of a core beta(8)-alpha(8) motif with the eight parallel beta strands forming an enclosed barrel surrounded by eight alpha helices. The domain has a catalytic center containing a divalent cation-binding site formed by a cluster of invariant residues that cap the core of the barrel. In addition, the catalytic site includes three invariant residues - an aspartate (D), an arginine (R), and a glutamate (E) - which is the basis for the domain name "DRE-TIM"td>
34TIGR002461539533.12.5E+02[      -----------                                ]tRNA_RlmH_YbeArRNA large subunit m3Psi methyltransferase RlmH. This protein, in the SPOUT methyltransferase family, previously designated YbeA in E. coli, was shown to be responsible for a further modification, a methylation, to a pseudouridine base in ribosomal large subunit RNA.
35pfam102435065932.688[                         -------                 ]MIP-T3Microtubule-binding protein MIP-T3. This protein, which interacts with both microtubules and TRAF3 (tumor necrosis factor receptor-associated factor 3), is conserved from worms to humans. The N-terminal region is the microtubule binding domain and is well-conserved; the C-terminal 100 residues, also well-conserved, constitute the coiled-coil region which binds to TRAF3. The central region of the protein is rich in lysine and glutamic acid and carries KKE motifs which may also be necessary for tubulin-binding, but this region is the least well-conserved.
36TIGR042465781731.523[             -                                   ]nitrous_NosZ_Gpnitrous-oxide reductase, Sec-dependent. This model represents the nitrous-oxide reductase protein NosZ as characterized in Geobacillus thermodenitrificans. In contrast to the related form in Pseudomonas stutzeri, this version lacks a recognizable twin-arginine translocation (TAT) signal at the N-terminus. Consequently, its accessory protein may differ. Some members of this family have an additional cytochrome c-like domain at the C-terminus.
37COG02484927231.41.8E+02[     ---------                                   ]GppAExopolyphosphatase/pppGpp-phosphohydrolase
38pfam13374422930.81.3E+02[                              --                 ]TPR_10Tetratricopeptide repeat.
39pfam08588564129.250[             ----                                ]DUF1769Protein of unknown function (DUF1769). Family of fungal protein with unknown function.
40TIGR033022353128.52.8E+02[                             ---                 ]OM_YfiOouter membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
41TIGR00756352327.497[                              --                 ]PPRpentatricopeptide repeat domain (PPR motif). This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.
42PRK1495348612426.37.4E+02[    ------------------                           ]PRK14953DNA polymerase III subunits gamma and tau; Provisional
43pfam07719342625.71.5E+02[                              --                 ]TPR_2Tetratricopeptide repeat. This Pfam entry includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by pfam00515.
44PRK125442065025.655[       ------                                    ]PRK12544RNA polymerase sigma factor; Provisional
45pfam0410037511125.34.6E+02[                  -------------                  ]Vps53_NVps53-like, N-terminal. Vps53 complexes with Vps52 and Vps54 to form a multi- subunit complex involved in regulating membrane trafficking events.
46pfam01535312325.01.1E+02[                              --                 ]PPRPPR repeat. This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309, which may be involved in RNA stabilisation. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein that has a domain organisation similar to the human BRCA1 protein. The repeat has been called PPR.
47COG28931435324.71.7E+02[        ------                                   ]ManXPhosphotransferase system, mannose/fructose-specific component IIA
48COG41052543024.33E+02[                              --                 ]BamDOuter membrane protein assembly factor BamD, BamD/ComL family
49pfam12854342524.384[                              --                 ]PPR_1PPR repeat. This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.
50cd128312877223.65.3E+02[                       --------                  ]TmCorA-like_u2Uncharacterized bacterial subfamily of the Thermotoga maritima CorA-like family. This subfamily belongs to the Thermotoga maritima CorA (TmCorA)-like family of the MIT superfamily of essential membrane proteins involved in transporting divalent cations (uptake or efflux) across membranes. Members of the TmCorA-like family are found in all three kingdoms of life. It is a functionally diverse family which includes the CorA Co2+ transporter from the hyperthermophilic Thermotoga maritima, and three Saccharomyces cerevisiae proteins: two located in the plasma membrane: the Mg2+ transporter Alr1p/Swc3p and the putative Mg2+ transporter, Alr2p, and the vacuole membrane protein Mnr2p, a putative Mg2+ transporter. Thermotoga maritima CorA forms funnel-shaped homopentamers, the tip of the funnel is formed from two C-terminal transmembrane (TM) helices from each monomer, and the large opening of the funnel from the N-terminal cytoplasmic domains. The GMN signature motif of the MIT superfamily occurs just after TM1, mutation within this motif is known to abolish Mg2+ transport by a related protein, Saccharomyces cerevisiae Alr1p. Natural variants in this signature sequence may be associated with the transport of different divalent cations. The functional diversity of the MIT superfamily may also be due to minor structural differences regulating gating, substrate selection, and transport.
51cd154743527323.38.3E+02[                         -------                 ]Myo5p-like_CBD_fungalcargo binding domain of fungal myosin V -like proteins. Yeast myosin V travels along actin cables, actin filaments that are bundled by fimbrin, in the presence of tropomyosin. This is in contrast to the other vertebrate class V myosins. Like other class V myosins, fungal myosin 2 and 4 contain a C-terminal cargo binding domain. In case of Myo4 it has been shown to bind to the adapter protein She3p (Swi5p-dependent HO expression 3), which in turn anchors myosin 4 to its cargos, zip-coded mRNP (messenger ribonucleoprotein particles) and tER (tubular endoplasmic reticulum). Myo 2 binds to Vac17, vacuole-specific cargo adaptor, and Mmr1, mitochondria-specific cargo adaptor. Both adaptors bind competitivly at the same site.
52pfam01418774922.13E+02[                            ------               ]HTH_6Helix-turn-helix domain, rpiR family. This domain contains a helix-turn-helix motif. The best characterized member of this family is Escherichia coli rpiR, a regulator of the expression of rpiB gene.
53pfam13428442621.91.8E+02[                              --                 ]TPR_14Tetratricopeptide repeat.
54pfam154631397121.94.9E+02[                   --------                      ]ECM11Extracellular mutant protein 11. ECM11 is a family of largely fungal proteins. ECM11 interacts with Cdc6, an essential protein involved in the initiation of DNA replication, and is a nuclear protein involved in maintaining chromatin structure. It was previously identified as a protein involved in yeast cell wall biogenesis and organisation, but is also found to be required in meiosis where its function is related to DNA replication and crossing-over.
55TIGR02386114011921.72.7E+02[             --------------                      ]rpoC_TIGRDNA-directed RNA polymerase, beta' subunit, predominant form. Bacteria have a single DNA-directed RNA polymerase, with required subunits that include alpha, beta, and beta-prime. This model describes the predominant architecture of the beta-prime subunit in most bacteria. This model excludes from among the bacterial mostly sequences from the cyanobacteria, where RpoC is replaced by two tandem genes homologous to it but also encoding an additional domain.
56COG40062788721.53.9E+02[     ----------                                  ]COG4006CRISPR/Cas system-associated protein Csm6, COG1517 family
57pfam1511230712720.68.6E+02[                   -------------                 ]DUF4559Domain of unknown function (DUF4559). This family of proteins is functionally uncharacterized. This family of proteins is found in eukaryotes. This family includes human protein CXorf38.
58pfam019341204520.494[                                            -----]DUF86Protein of unknown function DUF86. The function of members of this family is unknown.