match no.target idtarget lengthalignment lengthprobabilityE-valuecoveragematch description
1pfam096529392100.07.8E-56[  -----------------------------------------------]Cas_VVA1548Putative CRISPR-associated protein (Cas_VVA1548). This entry represents a conserved region of about 95 amino acids found exclusively in species with CRISPRs (Clustered Regularly Interspaced Short Palindromic Repeats). In all bacterial species that contain this entry, the genes encoding the proteins are in the midst of a cluster of cas (CRISPR-associated) genes.
2cd097439090100.01.9E-55[  ---------------------------------------------- ]Csx16_III-UCRISPR/Cas system-associated protein Csx16. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Small protein often seen in proximity to Csx1 (COG1517) family; also known as VVA1548 family
3TIGR026209392100.02E-46[  -----------------------------------------------]cas_VVA1548putative CRISPR-associated protein, VVA1548 family. This model represents a conserved domain of about 95 amino acids exclusively in species with CRISPR (Clustered Regularly Interspaced Short Palidromic Repeats). In all bacterial species with members so far (Vibrio vulnificus YJ016, Mannheimia succiniciproducens MBEL55E, and Nitrosomonas europaea ATCC 19718) and but not in the archaeon Methanothermobacter thermautotrophicus str. Delta H, the gene for this protein is in the midst of a cluster of Cas protein gene near CRISPR repeats.
4cd096957676100.08.9E-34[          ---------------------------------------]Csx16_III-UCRISPR/Cas system-associated protein Csx16. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Small protein often seen in proximity to Csx1 (COG1517) family; also known as VVA1548 family
5pfam026391304388.60.8[                ---------------------------      ]DUF188Uncharacterized BCR, YaiI/YqxD family COG1671.
6pfam069531212969.51.3[     ---------------                             ]ArsDArsenical resistance operon trans-acting repressor ArsD. This family consists of several bacterial arsenical resistance operon trans-acting repressor ArsD proteins. ArsD is a trans-acting repressor of the arsRDABC operon that confers resistance to arsenicals and antimonials in Escherichia coli. It possesses two-pairs of vicinal cysteine residues, Cys(12)-Cys(13) and Cys(112)-Cys(113), that potentially form separate binding sites for the metalloids that trigger dissociation of ArsD from the operon. However, as a homodimer it has four vicinal cysteine pairs.
7pfam03807935166.022[ ----------------------------                    ]F420_oxidoredNADP oxidoreductase coenzyme F420-dependent.
8COG03941394865.04.9[    -------------------------                    ]WzbProtein-tyrosine-phosphatase
9pfam15508942158.13.9[                              -----------        ]NAAA-betabeta subunit of N-acylethanolamine-hydrolyzing acid amidase. NAAA-beta is a family of vertebral sequences that form the beta subunit of vertebral N-acylethanolamine-hydrolyzing acid amidase, a member of the choloylglycine hydrolase acid ceramidase family. The alpha subunit is represented by family CBAH, pfam02275.
10PRK085064723858.013[                 -------------------             ]PRK08506replicative DNA helicase; Provisional
11PRK004024183257.14.5[      ----------------                           ]PRK004023-isopropylmalate dehydratase large subunit; Reviewed
12pfam08220572956.85[    --------------                               ]HTH_DeoRDeoR-like helix-turn-helix domain.
13pfam107871491756.75.4[--------                                         ]YfmQUncharacterized protein from bacillus cereus group. This family is conserved in the Bacillus cereus group. Several members are called YfmQ but the function is not known.
14pfam009772294255.921[                    ----------------------       ]His_biosynthHistidine biosynthesis protein. Proteins involved in steps 4 and 6 of the histidine biosynthesis pathway are contained in this family. Histidine is formed by several complex and distinct biochemical reactions catalysed by eight enzymes. The enzymes in this Pfam entry are called His6 and His7 in eukaryotes and HisA and HisF in prokaryotes. The structure of HisA is known to be a TIM barrel fold. In some archaeal HisA proteins the TIM barrel is composed of two tandem repeats of a half barrel. This family belong to the common phosphate binding site TIM barrel family.
15cd057101204253.49[ ---------------------                           ]SIS_1A subgroup of the SIS domain. SIS (Sugar ISomerase) domains are found in many phosphosugar isomerases and phosphosugar binding proteins. SIS domains are also found in proteins that regulate the expression of genes involved in synthesis of phosphosugars.
16PRK062642106252.916[   --------------------------------              ]cbiCprecorrin-8X methylmutase; Validated
17COG56612101951.90.68[   ----------                                    ]COG5661Predicted secreted Zn-dependent protease
18pfam093282641851.86.1[                      ---------                  ]Phytochelatin_CDomain of unknown function (DUF1984). Members of this family of functionally uncharacterized domains are found at the C-terminus of plant phytochelatin synthases.
19TIGR031683033851.718[           --------------------                  ]1-PFKhexose kinase, 1-phosphofructokinase family. This family consists largely of 1-phosphofructokinases, but also includes tagatose-6-kinases and 6-phosphofructokinases.
20PRK104771772549.29.2[ -------------                                   ]PRK10477outer membrane lipoprotein Blc; Provisional
21PRK012547073348.48.5[ -----------------                               ]PRK01254hypothetical protein; Provisional
22PRK124766123547.18.7[      --------------------                       ]PRK12476putative fatty-acid--CoA ligase; Provisional
23pfam014511384946.914[    -------------------------                    ]LMWPcLow molecular weight phosphotyrosine protein phosphatase.
24COG16711504146.634[                 -------------------------       ]YaiIUncharacterized conserved protein YaiI, UPF0178 family
25pfam138831672145.31.3[  -----------                                    ]Pyrid_oxidase_2Pyridoxamine 5'-phosphate oxidase.
26cd001151414545.317[   -----------------------                       ]LMWPcSubstituted updates: Aug 22, 2001
27COG30401742544.513[ -------------                                   ]BlcBacterial lipocalin
28cd009842423943.932[                 --------------------            ]DnaB_CDnaB helicase C terminal domain. The hexameric helicase DnaB unwinds the DNA duplex at the chromosome replication fork. Although the mechanism by which DnaB both couples ATP hydrolysis to translocation along DNA and denatures the duplex is unknown, a change in the quaternary structure of the protein involving dimerization of the N-terminal domain has been observed and may occur during the enzymatic cycle. This C-terminal domain contains an ATP-binding site and is therefore probably the site of ATP hydrolysis.
29PRK141251032542.87.6[           -------------                         ]PRK14125cell division suppressor protein YneA; Provisional
30PRK007482333742.822[                    --------------------         ]PRK007481-(5-phosphoribosyl)-5-
31cd05789861042.714[                  -----                          ]S1_Rrp4S1_Rrp4: Rrp4 S1-like RNA-binding domain. S1-like RNA-binding domains are found in a wide variety of RNA-associated proteins. Rrp4 protein is a subunit of the exosome complex. The exosome plays a central role in 3' to 5' RNA processing and degradation in eukarytes and archaea. Its functions include the removal of incorrectly processed RNA and the maintenance of proper levels of mRNA, rRNA and a number of small RNA species. In Saccharomyces cerevisiae, the exosome includes nine core components, six of which are homologous to bacterial RNase PH. These form a hexameric ring structure. The other three subunits (RrP4, Rrp40, and Csl4) contain an S1 RNA binding domain and are part of the "S1 pore structure"td>
32COG00991213942.412[                        ------------------------ ]RpsMRibosomal protein S13
33pfam040161473239.671[                 -----------------               ]DUF364Domain of unknown function (DUF364). This domain of unknown function has a PLP-dependent transferase-like fold. Its genomic context suggests that it may have a role in anaerobic vitamin B12 biosynthesis. This domain is often found at the C-terminus of proteins containing DUF4213, pfam13938.
34cd09542631638.914[                                      --------   ]SAM_EPH-A1SAM domain of EPH-A1 subfamily of tyrosine kinase receptors. SAM (sterile alpha motif) domain of EPH-A1 subfamily of the receptor tyrosine kinases is a C-terminal protein-protein interaction domain. This domain is located in the cytoplasmic region of EPH-A1 receptors and appears to mediate cell-cell initiated signal transduction. Activation of these receptors leads to inhibition of cell spreading and migration in a RhoA-ROCK-dependent manner. EPH-A1 receptors are known to bind ILK (integrin-linked kinase) which is the mediator of interactions between integrin and the actin cytoskeleton. However SAM is not sufficient for this interaction; it rather plays an ancillary role. SAM domains of Eph-A1 receptors do not form homo/hetero dimers/oligomers. EphA1 gene was found expressed widely in differentiated epithelial cells. In a number of different malignant tumors EphA1 genes are downregulated. In breast carcinoma the downregulation is associated with invasive behavior of the cell.
35PRK051791224238.414[                       ------------------------- ]rpsM30S ribosomal protein S13; Validated
36PRK001241514238.444[                --------------------------       ]PRK00124hypothetical protein; Validated
37cd059436164638.116[           -----------------------               ]AACSAcetoacetyl-CoA synthetase (acetoacetate-CoA ligase, AACS). AACS is a cytosolic ligase that specifically activates acetoacetate to its coenzyme A ester by a two-step reaction. Acetoacetate first reacts with ATP to form an acyl-adenylate intermediate, which then reacts with CoA to produce an acyl-CoA ester. This is the first step of the mevalonate pathway of isoprenoid biosynthesis via isopentenyl diphosphate. Isoprenoids are a large class of compounds found in all living organisms. AACS is widely distributed in bacteria, archaea and eukaryotes. In bacteria, AACS is known to exhibit an important role in the metabolism of poly-b-hydroxybutyrate, an intracellular reserve of organic carbon and chemical energy by some microorganisms. In mammals, AACS influences the rate of ketone body utilization for the formation of physiologically important fatty acids and cholesterol.
38TIGR036311134237.415[                       ------------------------- ]uS13_bactribosomal protein uS13, bacterial form. This model describes bacterial ribosomal protein S13, to the exclusion of the homologous archaeal S13P and eukaryotic ribosomal protein S18. This model identifies some (but not all) instances of chloroplast and mitochondrial S13, which is of bacterial type.
39pfam021532585437.359[ -----------------------------                   ]PDHPrephenate dehydrogenase. Members of this family are prephenate dehydrogenases EC:1.3.1.12 involved in tyrosine biosynthesis.
40pfam020734043936.951[                 --------------------            ]Peptidase_M29Thermophilic metalloprotease (M29).
41pfam117392073236.48[         -----------------                       ]DctA-YdbHDicarboxylate transport. In certain bacterial families this protein is expressed from the ydbH gene, and there is a suggestion that this is a form of DctA or dicarboxylate transport protein. Dicarboxylate transport proteins are found in aerobic bacteria which grow on succinate or other C4-dicarboxylates.
42cd021962973335.417[               -----------------                 ]PurMPurM (Aminoimidazole Ribonucleotide
43PRK148211845734.66.9[-----------------------------------              ]PRK14821putative deoxyribonucleotide triphosphate pyrophosphatase; Provisional
44TIGR021983151734.44.8[                  ---------                      ]rfaE_dom_IrfaE bifunctional protein, domain I. RfaE is a protein involved in the biosynthesis of ADP-L-glycero-D-manno-heptose, a precursor for LPS inner core biosynthesis. RfaE is a bifunctional protein in E. coli, and separate proteins in some other genome. The longer, N-terminal domain I (this family) is suggested to act in D-glycero-D-manno-heptose 1-phosphate biosynthesis, while domain II (TIGR02199) adds ADP to yield ADP-D-glycero-D-manno-heptose.
45TIGR006316556334.286[ ---------------------------------               ]uvrbexcinuclease ABC, B subunit. All proteins in this family for wich functions are known are DNA helicases that function in the nucleotide excision repair and are endonucleases that make the 3' incision next to DNA damage. They are part of a pathway requiring UvrA, UvrB, UvrC, and UvrD homologs. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University)
46pfam037551592933.817[                            ---------------      ]YicC_NYicC-like family, N-terminal region. Family of bacterial proteins. Although poorly characterized, the members of this protein family have been demonstrated to play a role in stationary phase survival. These proteins are not essential during stationary phase.
47PRK035846553433.620[      ------------------                         ]PRK03584acetoacetyl-CoA synthetase; Provisional
48pfam08350641533.629[  -------                                        ]DUF1724Domain of unknown function (DUF1724). This domain of unknown function has so far only been found at the C-terminus of archaean proteins, including several transcriptional regulators of the ArsR family (see pfam01022).
49cd011642893833.447[           --------------------                  ]FruK_PfkB_like1-phosphofructokinase (FruK), minor 6-phosphofructokinase (pfkB) and related sugar kinases. FruK plays an important role in the predominant pathway for fructose utilisation.This group also contains tagatose-6-phophate kinase, an enzyme of the tagatose 6-phosphate pathway, which responsible for breakdown of the galactose moiety during lactose metabolism by bacteria such as L. lactis.
50pfam143191112032.830[                              ----------         ]Zn_Tnp_IS91Transposase zinc-binding domain. This domain is likely to be a zinc-binding domain. It is found at the N-terminus of transposases belonging to the IS91 family.
51PRK133524312832.738[    --------------                               ]PRK13352thiamine biosynthesis protein ThiC; Provisional
52PRK092874592532.625[         -------------                           ]PRK092876-phosphogluconate dehydrogenase; Validated
53cd047322343532.346[                    -------------------          ]HisAHisA. Phosphoribosylformimino-5-aminoimidazole carboxamide ribonucleotide (ProFAR) isomerase catalyzes the fourth step in histidine biosynthesis, an isomerisation of the aminoaldose moiety of ProFAR to the aminoketose of PRFAR (N-(5'-phospho-D-1'-ribulosylformimino)-5-amino-1-(5''-phospho-ribosyl)-4-imidazolecarboxamide). In bacteria and archaea, ProFAR isomerase is encoded by the HisA gene.
54pfam082841353831.525[              -------------------                ]RVP_2Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein; usually pol, more rarely gag. Retroviral proteases appear to be homologous to a single domain of the two-domain eukaryotic aspartyl proteases
55cd073682774331.43.2[     ----------------------                      ]PhnC_Bs_likePhnC is a Class III Extradiol ring-cleavage dioxygenase involved in the polycyclic aromatic hydrocarbon (PAH) catabolic pathway. This subfamily is composed of Burkholderia sp. PhnC and similar poteins. PhnC is one of nine protein products encoded by the phn locus. These proteins are involved in the polycyclic aromatic hydrocarbon (PAH) catabolic pathway. PhnC is a member of the class III extradiol dioxygenase family, a group os enzymes which use a non-heme Fe(II) to cleave aromatic rings between a hydroxylated carbon and an adjacent non-hydroxylated carbon. LigAB-like enzymes are usually composed of two subunits, designated A and B, which form a tetramer composed of two copies of each subunit. This model represents the catalytic subunit, B.
56COG14692891731.225[                               ---------         ]FolE2GTP cyclohydrolase FolE2
57TIGR039045593031.027[ ---------------                                 ]SAM_YgiQuncharacterized radical SAM protein YgiQ. Members of this family are fairly widespread uncharacterized radical SAM family proteins, many of which are designated YgiQ.
58cd064131911730.915[   --------                                      ]GH25_muramidase_1Uncharacterized bacterial muramidase containing a glycosyl hydrolase family 25 (GH25) catalytic domain. Endo-N-acetylmuramidases are lysozymes (also referred to as peptidoglycan hydrolases) that degrade bacterial cell walls by catalyzing the hydrolysis of 1,4-beta-linkages between N-acetylmuramic acid and N-acetyl-D-glucosamine residues.
59pfam138151183230.923[  -----------------                              ]Dzip-like_NIguana/Dzip1-like DAZ-interacting protein N-terminal. The DAZ gene-product - Deleted in Azoospermia - and a closely related sequence are required early in germ-cell development in order to maintain germ-cell populations. This family is the N-terminal region that is the only part of the protein in some fungi and lower metazoa.
60TIGR006654323830.556[                 -------------------             ]DnaBreplicative DNA helicase. This model describes the helicase DnaB, a homohexameric protein required for DNA replication. The homohexamer can form a ring around a single strand of DNA near a replication fork. An intein of > 400 residues is found at a conserved location in DnaB of Synechocystis PCC6803, Rhodothermus marinus (both experimentally confirmed), and Mycobacterium tuberculosis. The intein removes itself by a self-splicing reaction. The seed alignment contains inteins so that the model built from the seed alignment will model a low cost at common intein insertion sites.
61TIGR02194721730.227[   --------                                      ]GlrX_NrdHGlutaredoxin-like protein NrdH. NrdH-redoxin is a representative of a class of small redox proteins that contain a conserved CXXC motif and are characterized by a glutaredoxin-like amino acid sequence and thioredoxin-like activity profile. Unlike other the glutaredoxins to which it is most closely related, NrdH aparrently does not interact with glutathione/glutathione reductase, but rather with thioredoxin reductase to catalyze the reduction of ribonucleotide reductase.
62pfam028871171030.161[                  ----                           ]PK_CPyruvate kinase, alpha/beta domain. As well as being found in pyruvate kinase this family is found as an isolated domain in some bacterial proteins.
63pfam010482281529.423[               --------                          ]PNP_UDP_1Phosphorylase superfamily. Members of this family include: purine nucleoside phosphorylase (PNP) Uridine phosphorylase (UdRPase) 5'-methylthioadenosine phosphorylase (MTA phosphorylase)
64TIGR02607783828.914[    -------------------                          ]antidote_HigAaddiction module antidote protein, HigA family. Members of this family form a distinct clade within the larger family HTH_3 of helix-turn-helix proteins, described by pfam01381. Members of this clade are strictly bacterial and nearly always shorter than 110 amino acids. This family includes the characterized member HigA, without which the killer protein HigB cannot be cloned. The hig (host inhibition of growth) system is noted to be unusual in that killer protein is uncoded by the upstream member of the gene pair.
65cd121763042828.626[    --------------                               ]PGDH_3Phosphoglycerate dehydrogenases, NAD-binding and catalytic domains. Phosphoglycerate dehydrogenases (PGDHs) catalyze the initial step in the biosynthesis of L-serine from D-3-phosphoglycerate. PGDHs come in 3 distinct structural forms, with this first group being related to 2-hydroxy acid dehydrogenases, sharing structural similarity to formate and glycerate dehydrogenases. PGDH in E. coli and Mycobacterium tuberculosis form tetramers, with subunits containing a Rossmann-fold NAD binding domain. Formate/glycerate and related dehydrogenases of the D-specific 2-hydroxyacid dehydrogenase superfamily include groups such as formate dehydrogenase, glycerate dehydrogenase, L-alanine dehydrogenase, and S-adenosylhomocysteine hydrolase. Despite often low sequence identity, these proteins typically have a characteristic arrangement of 2 similar subdomains of the alpha/beta Rossmann fold NAD+ binding form. The NAD+ binding domain is inserted within the linear sequence of the mostly N-terminal catalytic domain, which has a similar domain structure to the internal NAD binding domain. Structurally, these domains are connected by extended alpha helices and create a cleft in which NAD is bound, primarily to the C-terminal portion of the 2nd (internal) domain. Some related proteins have similar structural subdomain but with a tandem arrangement of the catalytic and NAD-binding subdomains in the linear sequence.
66cd010963151927.848[  ---------                                      ]Alkanal_monooxygenaseAlkanal monooxygenase are flavin monoxygenases. Molecular oxygen is activated by reaction with reduced flavin mononucleotide (FMNH2) and reacts with an aldehyde to yield the carboxylic acid, oxidized flavin (FMN) and a blue-green light. Bacterial luciferases are heterodimers made of alpha and beta subunits which are homologous. The single activer center is on the alpha subunit. The alpha subunit has a stretch of 30 amino acid residues that is not present in the beta subunit. The beta subunit does not contain the active site and is required for the formation of the fully active heterodimer. The beta subunit does not contribute anything directly to the active site. Its role is probably to stabilize the high quantum yield conformation of the alpha subunit through interactionbs across the subunit interface.
67PRK113164731727.38.7[                  ---------                      ]PRK11316bifunctional heptose 7-phosphate kinase/heptose 1-phosphate adenyltransferase; Provisional
68PRK082452402326.957[                  -----------                    ]PRK08245hypothetical protein; Validated
69pfam07338561826.866[                        ---------                ]DUF1471Protein of unknown function (DUF1471). This family consists of several hypothetical Enterobacterial proteins of around 90 residues in length. Some members of this family are annotated as ydgH precursors and contain two copies of this region, one at the N-terminus and the other at the C-terminus. The function of this family is unknown.
70PRK053853274226.728[               ------------------------          ]PRK05385phosphoribosylaminoimidazole synthetase; Provisional
71COG48212433426.646[               -----------------                 ]COG4821Uncharacterized protein, contains SIS (Sugar ISomerase) phosphosugar binding domain
72pfam080301521626.447[ -------                                         ]NAD_binding_6Ferric reductase NAD binding domain.
73PRK134632031426.135[ -------                                         ]PRK13463phosphatase PhoE; Provisional
74PRK118834517525.923[  ----------------------------------------       ]PRK11883protoporphyrinogen oxidase; Reviewed
75cd132001092725.87.3[                     ---------------             ]FERM_C_KCBPFERM domain C-lobe of Kinesin-like calmodulin binding protein. KCBPs (also called KIPK/Kinesin-like Calmodulin-Binding Protein-Interacting Protein Kinase), a member of the Kinesin-14 family, is a C-terminal microtubule motor with three unique domains including a myosin tail homology region 4 (MyTH4), a talin-like domain, and a calmodulin-binding domain (CBD). Binding of the Ca2+-activated calmodulin to KCBP causes the motor to dissociate from microtubules. The microtubule binding of KCBP is controlled by the calcium binding protein KIC containing a single EF-hand motif. KCBPs are unique to land plants and green algae. The MyTH4 and talin-like domains are not found in other kinesins, while the CBD domain is also only found in Strongylocentrotus purpuratus kinesin-C (SpKinC). The FERM domain has a cloverleaf tripart structure composed of: (1) FERM_N (A-lobe or F1); (2) FERM_M (B-lobe, or F2); and (3) FERM_C (C-lobe or F3). The C-lobe/F3 within the FERM domain is part of the PH domain family. The FERM domain is found in the cytoskeletal-associated proteins such as ezrin, moesin, radixin, 4.1R, and merlin. These proteins provide a link between the membrane and cytoskeleton and are involved in signal transduction pathways. The FERM domain is also found in protein tyrosine phosphatases (PTPs), the tyrosine kinases FAK and JAK, in addition to other proteins involved in signaling. This domain is structurally similar to the PH and PTB domains and consequently is capable of binding to both peptides and phospholipids at different sites.
76COG12402612925.763[                  ---------------                ]ChlDMg-chelatase subunit ChlD
77PRK095211891425.128[                -------                          ]PRK09521exosome complex RNA-binding protein Csl4; Provisional
78PRK000262444725.144[    -------------------------                    ]trmDtRNA (guanine-N(1)-)-methyltransferase; Reviewed
79cd082673192825.015[          --------------                         ]MDR1Medium chain dehydrogenases/reductase (MDR)/zinc-dependent alcohol dehydrogenase-like family. This group is a member of the medium chain dehydrogenases/reductase (MDR)/zinc-dependent alcohol dehydrogenase-like family, but lacks the zinc-binding sites of the zinc-dependent alcohol dehydrogenases. The medium chain dehydrogenases/reductase (MDR)/zinc-dependent alcohol dehydrogenase-like family, which contains the zinc-dependent alcohol dehydrogenase (ADH-Zn) and related proteins, is a diverse group of proteins related to the first identified member, class I mammalian ADH. MDRs display a broad range of activities and are distinguished from the smaller short chain dehydrogenases (~ 250 amino acids vs. the ~ 350 amino acids of the MDR). The MDR proteins have 2 domains: a C-terminal NAD(P)-binding Rossmann fold domain of a beta-alpha form and an N-terminal catalytic domain with distant homology to GroES. The MDR group contains a host of activities, including the founding alcohol dehydrogenase (ADH), quinone reductase, sorbitol dehydrogenase, formaldehyde dehydrogenase, butanediol DH, ketose reductase, cinnamyl reductase, and numerous others. The zinc-dependent alcohol dehydrogenases (ADHs) catalyze the NAD(P)(H)-dependent interconversion of alcohols to aldehydes or ketones. Active site zinc has a catalytic role, while structural zinc aids in stability. ADH-like proteins typically form dimers (typically higher plants, mammals) or tetramers (yeast, bacteria), and generally have 2 tightly bound zinc atoms per subunit. The active site zinc is coordinated by a histidine, two cysteines, and a water molecule. The second zinc seems to play a structural role, affects subunit interactions, and is typically coordinated by 4 cysteines.
80COG01503452924.836[               ---------------                   ]PurMPhosphoribosylaminoimidazole (AIR) synthetase
81pfam032931601324.839[                  ------                         ]Pox_RNA_polPoxvirus DNA-directed RNA polymerase, 18 kD subunit.
82TIGR028193935124.519[               ---------------------------       ]fdhA_non_GSHformaldehyde dehydrogenase, glutathione-independent. Members of this family represent a distinct clade within the larger family of zinc-dependent dehydrogenases of medium chain alcohols, a family that also includes the so-called glutathione-dependent formaldehyde dehydrogenase. Members of this protein family have a tightly bound NAD that can act as a true cofactor, rather than a cosubstrate in dehydrogenase reactions, in dismutase reactions for some aldehydes. The name given to this family, however, is formaldehyde dehydrogenase, glutathione-independent.
83cd04454821024.542[                  -----                          ]S1_Rrp4_likeS1_Rrp4_like: Rrp4-like, S1-like RNA-binding domain. S1-like RNA-binding domains are found in a wide variety of RNA-associated proteins. Rrp4 protein, and Rrp40 and Csl4 proteins, also represented in this group, are subunits of the exosome complex. The exosome plays a central role in 3' to 5' RNA processing and degradation in eukarytes and archaea. Its functions include the removal of incorrectly processed RNA and the maintenance of proper levels of mRNA, rRNA and a number of small RNA species. In Saccharomyces cerevisiae, the exosome includes nine core components, six of which are homologous to bacterial RNase PH. These form a hexameric ring structure. The other three subunits (RrP4, Rrp40, and Csl4) contain an S1 RNA binding domain and are part of the "S1 pore structure"td>
84pfam109521401324.317[          ------                                 ]DUF2753Protein of unknown function (DUF2753). This bacterial family of proteins has no known function.
85pfam13146782724.268[ ---------------                                 ]TRLTRL-like protein family. This family includes the TRL protein, which is found in a locus that includes several tRNAs. The function of this protein is not known. The proteins in this family usually have a lipoprotein attachment site at their N-terminus.
86TIGR022281583623.555[              ------------------                 ]sigpep_I_archsignal peptidase I, archaeal type. This model represents signal peptidase I from most archaea, a subunit of the eukaryotic endoplasmic reticulum signal peptidase I complex, and an apparent signal peptidase I from a small number of bacteria. It is related to but does not overlap in hits with TIGR02227, the bacterial and mitochondrial signal peptidase I.
87TIGR038283043722.91E+02[            -------------------                  ]pfkB1-phosphofructokinase. This enzyme acts in concert with the fructose-specific phosphotransferase system (PTS) which imports fructose as fructose-1-phosphate. The action of 1-phosphofructokinase results in beta-D-fructose-1,6-bisphosphate and is an entry point into glycolysis (GenProp0688).
88pfam01476432922.974[       --------------                            ]LysMLysM domain. The LysM (lysin motif) domain is about 40 residues long. It is found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. The structure of this domain is known.
89pfam019644212822.873[    --------------                               ]ThiCThiC family. ThiC is found within the thiamine biosynthesis operon. ThiC is involved in pyrimidine biosynthesis. The precise catalytic function of ThiC is still not known. ThiC participates in the formation of 4-Amino-5-hydroxymethyl-2-methylpyrimidine from AIR, an intermediate in the de novo pyrimidine biosynthesis.
90PRK029472463422.650[               -----------------                 ]PRK02947hypothetical protein; Provisional
91PRK008091444122.586[---------------------                            ]PRK00809hypothetical protein; Provisional
92cd031111062822.579[                      --------------             ]CpaE_likeThis protein family consists of proteins similar to the cpaE protein of the Caulobacter pilus assembly and the orf4 protein of Actinobacillus pilus formation gene cluster. The function of these proteins are unkown. The Caulobacter pilus assembly contains 7 genes: pilA, cpaA, cpaB, cpaC, cpaD, cpaE and cpaF. These genes are clustered together on chromosome.
93TIGR012176523322.340[      -----------------                          ]ac_ac_CoA_synacetoacetyl-CoA synthase. This enzyme catalyzes the first step of the mevalonate pathway of IPP biosynthesis. Most bacteria do not use this pathway, but rather the deoxyxylulose pathway.
94TIGR036004204321.966[                 ----------------------          ]phage_DnaBphage replicative helicase, DnaB family, HK022 subfamily. Members of this family are phage (or prophage-region) homologs of the bacterial homohexameric replicative helicase DnaB. Some phage may rely on host DnaB, while others encode their own verions. This model describes the largest phage-specific clade among the close homologs of DnaB, but there are, or course, other DnaB homologs from phage that fall outside the scope of this model.
95pfam105021382221.885[                  -----------                    ]Peptidase_S26Signal peptidase, peptidase S26. This is a family of membrane signal serine endopeptidases which function in the processing of newly-synthesized secreted proteins. Peptidase S26 removes the hydrophobic, N-terminal, signal peptides as proteins are translocated across membranes. The active site residues take the form of a catalytic dyad that is Ser, Lys in subfamily S26A; the Ser is the nucleophile in catalysis, and the Lys is the general base.
96cd082823752221.819[                ------------                     ]PFDH_likePseudomonas putida aldehyde-dismutating formaldehyde dehydrogenase (PFDH). Formaldehyde dehydrogenase (FDH) is a member of the zinc-dependent/medium chain alcohol dehydrogenase family. Unlike typical FDH, Pseudomonas putida aldehyde-dismutating FDH (PFDH) is glutathione-independent. PFDH converts 2 molecules of aldehydes to corresponding carboxylic acid and alcohol. MDH family uses NAD(H) as a cofactor in the interconversion of alcohols and aldehydes, or ketones. Like the zinc-dependent alcohol dehydrogenases (ADH) of the medium chain alcohol dehydrogenase/reductase family (MDR), these tetrameric FDHs have a catalytic zinc that resides between the catalytic and NAD(H)binding domains and a structural zinc in a lobe of the catalytic domain. Unlike ADH, where NAD(P)(H) acts as a cofactor, NADH in FDH is a tightly bound redox cofactor (similar to nicotinamide proteins). The medium chain alcohol dehydrogenase family (MDR) has a NAD(P)(H)-binding domain in a Rossmann fold of an beta-alpha form. The N-terminal region typically has an all-beta catalytic domain. These proteins typically form dimers (typically higher plants, mammals) or tetramers (yeast, bacteria), and have 2 tightly bound zinc atoms per subunit.
97TIGR043114232221.824[    ------------                                 ]rSAM_Geo_metalputative metalloenzyme radical SAM/SPASM domain maturase. This model describes a family of radical SAM/SPASM enzymes largely from the deltaproteobacteria. The family is most closely related to radical SAM enzyme family regularly in archaea in the vicinity of tungsten-containing oxidoreductases. A single member of the family in archaea may correspond to multiple tungsten enzymes, e.g. five in Pyrococcus furiosus. Therefore, the lack of a conserved gene neighborhood for members of this family in deltaprotebacteria suggests members may be involved in the maturation of multiple metalloenzymes.
98COG14112293021.672[                      ----------------           ]COG1411Uncharacterized protein related to proFAR isomerase (HisA)
99cd077944705721.367[     ------------------------------              ]FGGY_GK_like_proteobactProteobacterial glycerol kinase-like proteins; belongs to the FGGY family of carbohydrate kinases. This subgroup corresponds to a small group of proteobacterial glycerol kinase (GK)-like proteins, including the glycerol kinase from Pseudomonas aeruginosa. Most bacteria, such as Escherichia coli, take up glycerol passively by facilitated diffusion. In contrast, P. aeruginosa may also utilize a binding protein-dependent active transport system to mediate glycerol transportation. The glycerol kinase subsequently phosphorylates the intracellular glycerol to glycerol 3-phosphate (G3P). GKs belong to the FGGY family of carbohydrate kinases, the monomers of which contain two large domains, which are separated by a deep cleft that forms the active site. This model includes both the N-terminal domain, which adopts a ribonuclease H-like fold, and the structurally related C-terminal domain.
100pfam11617311021.327[   -----                                         ]Cu-binding_MopEProtein metal binding site. This family of proteins represents a unique protein copper binding site that involves a tryptophan metabolite, kynurenine in the protein MopE. The production of kyneurenin by modification of tryptophan and its involvement in copper binding is an innate property of MopE.
101cd053043634721.21.2E+02[                ------------------------         ]Rubrum_tdhRubrum transdehydrogenase NAD-binding and catalytic domains. Transhydrogenases found in bacterial and inner mitochondrial membranes link NAD(P)(H)-dependent redox reactions to proton translocation. The energy of the proton electrochemical gradient (delta-p), generated by the respiratory electron transport chain, is consumed by transhydrogenase in NAD(P)+ reduction. Transhydrogenase is likely involved in the regulation of the citric acid cycle. Rubrum transhydrogenase has 3 components, dI, dII, and dIII. dII spans the membrane while dI and dIII protrude on the cytoplasmic/matrix side. DI contains 2 domains in Rossmann-like folds, linked by a long alpha helix, and contains a NAD binding site. Two dI polypeptides (represented in this sub-family) spontaneously form a heterotrimer with dIII in the absence of dII. In the heterotrimer, both dI chains may bind NAD, but only one is well-ordered. dIII also binds a well-ordered NADP, but in a different orientation than a classical Rossmann domain.
102cd084642004521.281[      ------------------------                   ]PBP2_DntR_like_2The C-terminal substrate binding domain of an uncharacterized LysR-type transcriptional regulator similar to DntR, which is involved in the catabolism of dinitrotoluene; contains the type 2 periplasmic binding fold. This CD includes an uncharacterized LysR-type transcriptional regulator similar to DntR, NahR, and LinR, which are involved in the degradation of aromatic compounds. The transcription of the genes encoding enzymes involved in such degradation is regulated and expression of these enzymes is enhanced by inducers, which are either an intermediate in the metabolic pathway or compounds to be degraded. This substrate-binding domain shows significant homology to the type 2 periplasmic binding proteins (PBP2), which are responsible for the uptake of a variety of substrates such as phosphate, sulfate, polysaccharides, lysine/arginine/ornithine, and histidine. The PBP2 bind their ligand in the cleft between these domains in a manner resembling a Venus flytrap. After binding their specific ligand with high affinity, they can interact with a cognate membrane transport complex comprised of two integral membrane domains and two cytoplasmically located ATPase domains. This interaction triggers the ligand translocation across the cytoplasmic membrane energized by ATP hydrolysis.
103pfam02699831121.156[                 -----                           ]YajCPreprotein translocase subunit. See.
104COG28704671721.113[                  ---------                      ]RfaEADP-heptose synthase, bifunctional sugar kinase/adenylyltransferase
105cd034831271821.186[                            --------             ]MutL_Trans_MLH1MutL_Trans_MLH1: transducer domain, having a ribosomal S5 domain 2-like fold, found in proteins similar to yeast and human MLH1 (MutL homologue 1). This transducer domain is homologous to the second domain of the DNA gyrase B subunit, which is known to be important in nucleotide hydrolysis and the transduction of structural signals from ATP-binding site to the DNA breakage/reunion regions of the enzymes. MLH1 forms heterodimers with PMS2, PMS1 and MLH3. These three complexes have distinct functions in meiosis. hMLH1-hPMS2 also participates in the repair of all DNA mismatch repair (MMR) substrates. Roles for hMLH1-hPMS1 or hMLH1-hMLH3 in MMR have not been established. Cells lacking hMLH1 have a strong mutator phenotype and display microsatellite instability (MSI). Mutation in hMLH1 causes predisposition to HNPCC, Muir-Torre syndrome and Turcot syndrome (HNPCC variant). Mutation in hMLH1 accounts for a large fraction of HNPCC families.
106PRK073243732720.759[           -------------                         ]PRK07324transaminase; Validated
107TIGR026911296520.784[    -----------------------------------          ]arsC_pI258_famarsenate reductase (thioredoxin). This family describes the well-studied thioredoxin-dependent arsenate reductase of Staphylococcus aureaus plasmid pI258 and other mechanistically similar arsenate reductases. The mechanism involves an intramolecular disulfide bond cascade, and aligned members of this family have four absolutely conserved Cys residues. This group of arsenate reductases belongs to the low-molecular weight protein-tyrosine phosphatase family (pfam01451), as does a group of glutathione/glutaredoxin type arsenate reductases (TIGR02689). At least two other, non-homologous groups of arsenate reductases involved in arsenical resistance are also known. This enzyme reduces arsenate to arsenite, which may be more toxic but which is more easily exported.
108COG04224323320.762[                  ------------------------       ]ThiCThiamine biosynthesis protein ThiC
109PRK064822765420.22.3E+02[ ----------------------------------              ]PRK06482short chain dehydrogenase; Provisional
110COG40431111520.041[                 -------                         ]ASCHASC-1 homology (ASCH) domain, predicted RNA-binding domain