match no.target idtarget lengthalignment lengthprobabilityE-valuecoveragematch description
1TIGR025798371100.05.2E-36[    ---------------------------------------------]cas_csx3CRISPR-associated protein, Csx3 family. Members of this family are found encoded in CRISPR-associated (cas) gene clusters, near CRISPR repeats, in the genomes of several different thermophiles: Archaeoglobus fulgidus (archaeal), Aquifex aeolicus (Aquificae), Dictyoglomus thermophilum (Dictyoglomi), and a thermophilic Synechococcus (Cyanobacteria). It is not yet assigned to a specific CRISPR/cas subtype (hence the x designation csx3).
2cd096818371100.05.2E-36[    ---------------------------------------------]Csx3_III-UCRISPR/Cas system-associated protein Csx3. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Small protein in some cases fused to Csx1 (COG1517) family domains
3cd097408471100.01E-33[    ---------------------------------------------]Csx3_III-UCRISPR/Cas system-associated protein Csx3. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Small protein in some cases fused to Csx1 (COG1517) family domains
4pfam096208061100.01.3E-32[            -------------------------------------]Cas_csx3CRISPR-associated protein (Cas_csx3). This entry is encoded in CRISPR-associated (cas) gene clusters, near CRISPR repeats, in the genomes of several different thermophiles: Archaeoglobus fulgidus (archaeal), Aquifex aeolicus (Aquificae), Dictyoglomus thermophilum (Dictyoglomi), and a thermophilic Synechococcus (Cyanobacteria). It is not yet assigned to a specific CRISPR/cas subtype (hence the x designation csx3).
5PRK108874433582.31.6[                       --------------------------]glmMphosphoglucosamine mutase; Provisional
6cd058024343556.317[                       --------------------------]GlmMGlmM is a bacterial phosphoglucosamine mutase (PNGM) that belongs to the alpha-D-phosphohexomutase superfamily. It is required for the interconversion of glucosamine-6-phosphate and glucosamine-1-phosphate in the biosynthetic pathway of UDP-N-acetylglucosamine, an essential precursor to components of the cell envelope. In order to be active, GlmM must be phosphorylated, which can occur via autophosphorylation or by the Ser/Thr kinase StkP. GlmM functions in a classical ping-pong bi-bi mechanism with glucosamine-1,6-diphosphate as an intermediate. Other members of the alpha-D-phosphohexomutase superfamily include phosphoglucosamine mutase (PNGM), phosphoacetylglucosamine mutase (PAGM), the bacterial phosphomannomutase ManB, and the bifunctional phosphomannomutase/phosphoglucomutase (PMM/PGM). Each of these enzymes has four domains with a centrally located active site formed by four loops, one from each domain. All four domains are included in this alignment model.
7PRK143144503553.216[                       --------------------------]glmMphosphoglucosamine mutase; Provisional
8cd09887821449.19.3[                                         --------]NGN_ArchArchaeal N-Utilization Substance G (NusG) N-terminal (NGN) domain. The N-Utilization Substance G (NusG) protein and its eukaryotic homolog, Spt5, are involved in transcription elongation and termination. Transcription in archaea has a eukaryotic-type transcription apparatus, but contains bacterial-type transcription factors. NusG is one of the few archaeal transcription factors that has orthologs in both bacteria and eukaryotes. Archaeal NusG is similar to bacterial NusG, composed of an NGN domain and a Kyrpides Ouzounis and Woese (KOW) repeat. The eukaryotic ortholog, Spt5, is a large protein composed of an acidic N-terminus, an NGN domain, and multiple KOW motifs at its C-terminus. NusG was originally discovered as a N-dependent antitermination enhancing activity in Escherichia coli and has a variety of functions, such as being involved in RNA polymerase elongation and Rho-termination in bacteria. Archaeal NusG forms a complex with DNA-directed RNA polymerase subunit E (rpoE) that is similar to the Spt5-Spt4 complex in eukaryotes.
9pfam14147521148.99.4[                           ------                ]Spore_YhaLSporulation protein YhaL. This family of proteins is involved in sporulation. In B. subtilis its expression is regulated by the early mother-cell-specific transcription factor sigma-E.
10pfam028781383548.335[                       --------------------------]PGM_PMM_IPhosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain I.
11TIGR014554435345.435[           --------------------------------------]glmMphosphoglucosamine mutase. This model describes GlmM, phosphoglucosamine mutase, also designated in MrsA and YhbF E. coli, UreC in Helicobacter pylori, and femR315 or FemD in Staphlococcus aureus. It converts glucosamine-6-phosphate to glucosamine-1-phosphate as part of the pathway toward UDP-N-acetylglucosamine for peptidoglycan and lipopolysaccharides.
12cd04903712844.140[     ------------------                          ]ACT_LSDC-terminal ACT domain of the L-serine dehydratase (LSD), iron-sulfur-dependent, beta subunit. The C-terminal ACT domain of the L-serine dehydratase (LSD), iron-sulfur-dependent, beta subunit, found in various bacterial anaerobes such as Clostridium, Bacillis, and Treponema species. These enzymes catalyze the deamination of L-serine, producing pyruvate and ammonia. Unlike the eukaryotic L-serine dehydratase, which requires the pyridoxal-5'-phosphate (PLP) cofactor, the prokaryotic L-serine dehydratase contains an
13PRK138161311443.315[                                       --------  ]PRK13816ribosome-binding factor A; Provisional
14cd032262051340.112[                                          -------]ABC_cobalt_CbiO_domain2Second domain of the ATP-binding cassette component of cobalt transport system. Domain II of the ABC component of a cobalt transport family found in bacteria, archaea, and eukaryota. The transition metal cobalt is an essential component of many enzymes and must be transported into cells in appropriate amounts when needed. The CbiMNQO family ABC transport system is involved in cobalt transport in association with the cobalamin (vitamin B12) biosynthetic pathways. Most cobalt (Cbi) transport systems possess a separate CbiN component, the cobalt-binding periplasmic protein, and they are encoded by the conserved gene cluster cbiMNQO. Both the CbiM and CbiQ proteins are integral cytoplasmic membrane proteins, and the CbiO protein has the linker peptide and the Walker A and B motifs commonly found in the ATPase components of the ABC-type transport systems.
15COG11094643536.752[                       --------------------------]ManBPhosphomannomutase
16pfam061291092034.151[                                   ------------- ]Chordopox_G3Chordopoxvirus G3 protein. This family consists of several Chordopoxvirus specific G3 proteins. The function of this family is unknown.
17cd03711783931.627[      -----------------------                    ]Tet_CTet_C: C-terminus of ribosomal protection proteins Tet(M) and Tet(O). This domain has homology to the C terminal domains of the elongation factors EF-G and EF-2. Tet(M) and Tet(O) catalyze the release of tetracycline (Tc) from the ribosome in a GTP-dependent manner thereby mediating Tc resistance. Tcs are broad-spectrum antibiotics. Typical Tcs bind to the ribosome and inhibit the elongation phase of protein synthesis, by inhibiting the occupation of site A by aminoacyl-tRNA.
18pfam043981114930.523[              -------------------------------    ]DUF538Protein of unknown function, DUF538. This family consists of several plant proteins of unknown function.
19cd024201253329.039[       --------------------                      ]Peptidase_C39DA sub-family of peptidase family C39. Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria. The cysteine peptidases in family C39 cleave the "double-glycine" leader peptides from the precursors of various bacteriocins (mostly non-lantibiotic). The cleavage is mediated by the transporter as part of the secretion process. Bacteriocins are antibiotic proteins secreted by some species of bacteria that inhibit the growth of other bacterial species. The bacteriocin is synthesized as a precursor with an N-terminal leader peptide, and processing involves removal of the leader peptide by cleavage at a Gly-Gly bond, followed by translocation of the mature bacteriocin across the cytoplasmic membrane. Most endopeptidases of family C39 are N-terminal domains in larger proteins (ABC transporters) that serve both functions. The proposed protease active site is conserved in this sub-family.
20cd00212783228.841[    ----------------------                       ]PTS_IIB_glcPTS_IIB, PTS system, glucose/sucrose specific IIB subunit. The bacterial phosphoenolpyruvate: sugar phosphotransferase system (PTS) is a multi-protein system involved in the regulation of a variety of metabolic and transcriptional processes. This family is one of four structurally and functionally distinct group IIB PTS system cytoplasmic enzymes, necessary for the uptake of carbohydrates across the cytoplasmic membrane and their phosphorylation
21pfam053522812628.855[                               ---------------   ]Phage_connectorPhage Connector (GP10). The head-tail connector of bacteriophage 29 is composed of 12 36 kDa subunits with 12 fold symmetry. It is the central component of a rotary motor that packages the genomic dsDNA into pre-formed proheads. This motor consists of the head-tail connector, surrounded by a 29-encoded, 174-base, RNA and a viral ATPase protein.
22pfam11890812927.829[                    ----------------------       ]DUF3410Domain of unknown function (DUF3410). This domain is functionally uncharacterized. This domain is found in bacteria. This presumed domain is about 90 amino acids in length. This domain is found associated with pfam02826, pfam00389. This domain has a conserved RRE sequence motif.
23pfam133041441227.333[                                           ------]AAA_21AAA domain.
24pfam01788401627.212[                         ---------               ]PsbJPsbJ. This family consists of the photosystem II reaction centre protein PsbJ from plants and Cyanobacteria. In Synechocystis sp. PCC 6803 PsbJ regulates the number of photosystem II centres in thylakoid membranes, it is a predicted 4kDa protein with one membrane spanning domain.
25TIGR000992565527.12.6E+02[          -------------------------------------- ]Cof-subfamilyCof subfamily of IIB subfamily of haloacid dehalogenase superfamily. This subfamily of sequences falls within the Class-IIB subfamily (TIGR01484) of the Haloacid Dehalogenase superfamily of aspartate-nucleophile hydrolases. The use of the name "Cof" as an identifier here is arbitrary and refers to the E. coli Cof protein. This subfamily is notable for the large number of recent paralogs in many species. Listeria, for instance, has 12, Clostridium, Lactococcus and Streptococcus pneumoniae have 8 each, Enterococcus and Salmonella have 7 each, and Bacillus subtilus, Mycoplasma, Staphylococcus and E. coli have 6 each. This high degree of gene duplication is limited to the gamma proteobacteria and low-GC gram positive lineages. The profusion of genes in this subfamily is not coupled with a high degree of divergence, so it is impossible to determine an accurate phylogeny at the equivalog level. Considering the relationship of this subfamily to the other known members of the HAD-IIB subfamily (TIGR01484), sucrose and trehalose phosphatases and phosphomannomutase, it seems a reasonable hypothesis that these enzymes act on phosphorylated sugars. Possibly the diversification of genes in this subfamily represents the diverse sugars and polysaccharides that various bacteria find in their biological niches. The members of this subfamily are restricted almost exclusively to bacteria (one sequences from S. pombe scores above trusted, while another is between trusted and noise). It is notable that no archaea are found in this group, the closest relations to the archaea found here being two Deinococcus sequences.
26PRK091202755026.443[    ------------------------------               ]PRK09120p-hydroxycinnamoyl CoA hydratase/lyase; Validated
27PRK135392071125.941[                                           ------]PRK13539cytochrome c biogenesis protein CcmA; Provisional
28TIGR037912912225.734[                                     ------------]TTQ_mauGtryptophan tryptophylquinone biosynthesis enzyme MauG. Members of this protein family are the tryptophan tryptophylquinone biosynthesis (TTQ) enzyme MauG, as found in Methylobacterium extorquens and related species. This protein is required to complete the maturation of the TTQ cofactor in the methylamine dehydrogenase light (beta) chain.
29PRK143234403125.563[                         ------------------------]glmMphosphoglucosamine mutase; Provisional
30pfam106822203125.544[                             --------------------]UL40Glycoprotein of human cytomegalovirus HHV-5. This is glycoprotein UL40 from human cytomegalovirus or herpesvirus 5. The signal sequence of the UL40 polypeptide contains an HLA-E ligand identical with HLA-Cw*0304. The first 37 residues of UL40, including this ligand, are predicted to encode a signal peptide. The virus thus prevents the lysis by NK (natural killer) cells of the cell it has invaded.
31PRK083145462625.041[          ----------------                       ]PRK08314long-chain-fatty-acid--CoA ligase; Validated
32PRK143154483524.896[                       --------------------------]glmMphosphoglucosamine mutase; Provisional
33pfam10006693224.41.6E+02[                -------------------              ]DUF2249Uncharacterized conserved protein (DUF2249). Members of this family of hypothetical bacterial proteins have no known function.
34cd032211441223.436[                                          -------]ABCF_EF-3ATP-binding cassette domain of elongation factor 3, subfamily F. Elongation factor 3 (EF-3) is a cytosolic protein required by fungal ribosomes for in vitro protein synthesis and for in vivo growth. EF-3 stimulates the binding of the EF-1: GTP: aa-tRNA ternary complex to the ribosomal A site by facilitated release of the deacylated tRNA from the E site. The reaction requires ATP hydrolysis. EF-3 contains two ATP nucleotide binding sequence (NBS) motifs. NBSI is sufficient for the intrinsic ATPase activity. NBSII is essential for the ribosome-stimulated functions.
35PRK060722484523.21.4E+02[ ---------------------------                     ]PRK06072enoyl-CoA hydratase; Provisional
36TIGR036082061222.839[                                           ------]L_ocin_972_ABCputative bacteriocin export ABC transporter, lactococcin 972 group. A gene pair with a fairly wide distribution consists of a polypeptide related to the lactococcin 972 (see TIGR01653) and multiple-membrane-spanning putative immunity protein (see TIGR01654). This model represents a small clade within the ABC transporters that regularly are found adjacent to these bacteriocin system gene pairs and are likely serve as export proteins.
37cd138721411521.866[                    --------                     ]CuRO_2_AAO_like_1The second cupredoxin domain of plant pollen multicopper oxidase homologous to ascorbate oxidase. The proteins in this subfamily are expressed in plant pollen. They share homology to ascorbate oxidase and other members of the blue copper oxidase family. The expression of the protein is detected during germination and pollen tube growth. Ascorbate oxidase catalyzes the oxidation of ascorbic acid to dehydroascorbic acid. It is a member of the multicopper oxidase (MCO) family that couples oxidation of substrates with reduction of dioxygen to water. Although MCOs have diverse functions, majority of them have three cupredoxin domain repeats that include one mononuclear and one trinuclear copper center. The copper ions are bound in several sites: Type 1, Type 2, and/or Type 3. The ensemble of types 2 and 3 copper is called a trinuclear cluster. MCOs oxidize their substrate by accepting electrons at a mononuclear copper center and transferring them to the active site trinuclear copper center. The cupredoxin domain 2 of 3-domain MCOs has lost the ability to bind copper.
38PRK085591534421.554[                       --------------------------]nusGtranscription antitermination protein NusG; Validated