match no.target idtarget lengthalignment lengthprobabilityE-valuecoveragematch description
1TIGR01884203170100.03.3E-29[------------------------------------------       ]cas_HTHCRISPR locus-related DNA-binding protein. Most but not all examples of this family are associated with CRISPR loci, a combination of DNA repeats and characteristic proteins encoded near the repeat cluster. The C-terminal region of this protein is homologous to DNA-binding helix-turn-helix domains with predicted transcriptional regulatory activity.
2cd0965519816899.91.4E-25[------------------------------------------       ]CasRa_I-ACRISPR/Cas system-associated transcriptional regulator CasRa. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Predicted transcriptional regulator of CRISPR/Cas system
3cd00090785798.92E-09[                           --------------        ]HTH_ARSRArsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors. ARSR subfamily of helix-turn-helix bacterial transcription regulatory proteins (winged helix topology). Includes several proteins that appear to dissociate from DNA in the presence of metal ions.
4pfam01022474698.88.8E-09[                           ------------          ]HTH_5Bacterial regulatory protein, arsR family. Members of this family contains a DNA binding 'helix-turn-helix' motif. This family includes other proteins which are not included in the Prosite definition.
5COG25122586298.88.8E-09[                          ---------------        ]COG2512Uncharacterized membrane protein
6pfam13412484598.61.4E-07[                           -----------           ]HTH_24Winged helix-turn-helix DNA-binding.
7pfam01978685398.52.6E-07[                           -------------         ]TrmBSugar-specific transcriptional regulator TrmB. One member of this family, TrmB, has been shown to be a sugar-specific transcriptional regulator of the trehalose/maltose ABC transporter in Thermococcus litoralis.
8COG18461266498.53.1E-07[                           ----------------      ]MarRDNA-binding transcriptional regulator, MarR family
9pfam12802615098.46.3E-07[                           ------------          ]MarR_2MarR family. The Mar proteins are involved in the multiple antibiotic resistance, a non-specific resistance system. The expression of the mar operon is controlled by a repressor, MarR. A large number of compounds induce transcription of the mar operon. This is thought to be due to the compound binding to MarR, and the resulting complex stops MarR binding to the DNA. With the MarR repression lost, transcription of the operon proceeds. The structure of MarR is known and shows MarR as a dimer with each subunit containing a winged-helix DNA binding motif.
10COG33551265498.43.8E-07[                          --------------         ]COG3355Predicted transcriptional regulator
11pfam12840615398.46.9E-07[                           -------------         ]HTH_20Helix-turn-helix domain. This domain represents a DNA-binding Helix-turn-helix domain found in transcriptional regulatory proteins.
12COG15221544998.45.9E-07[                          ------------           ]LrpDNA-binding transcriptional regulator, Lrp family
13pfam01047595098.41E-06[                           ------------          ]MarRMarR family. The Mar proteins are involved in the multiple antibiotic resistance, a non-specific resistance system. The expression of the mar operon is controlled by a repressor, MarR. A large number of compounds induce transcription of the mar operon. This is thought to be due to the compound binding to MarR, and the resulting complex stops MarR binding to the DNA. With the MarR repression lost, transcription of the operon proceeds. The structure of MarR is known and shows MarR as a dimer with each subunit containing a winged-helix DNA binding motif.
14pfam13404424198.31.2E-06[                           ----------            ]HTH_AsnC-typeAsnC-type helix-turn-helix domain.
15COG47422606398.27E-07[                           ----------------      ]COG4742Predicted transcriptional regulator, contains HTH domain
16pfam13463686098.18.5E-06[                           ---------------       ]HTH_27Winged helix DNA-binding domain.
17pfam09339524598.01.2E-05[                            -----------          ]HTH_IclRIclR helix-turn-helix domain.
18COG13211546498.07.6E-06[                           ----------------      ]MntRMn-dependent transcriptional regulator, DtxR family
19cd00092674497.92E-05[                              -----------        ]HTH_CRPhelix_turn_helix, cAMP Regulatory protein C-terminus; DNA binding domain of prokaryotic regulatory proteins belonging to the catabolite activator protein family.
20COG14142465597.83E-05[                            -------------        ]IclRDNA-binding transcriptional regulator, IclR family
21COG23452186797.83E-05[                          -----------------      ]COG2345Predicted transcriptional regulator, ArsR family
22pfam08279523997.78.7E-05[                             ---------           ]HTH_11HTH domain. This family includes helix-turn-helix domains in a wide variety of proteins.
23cd00569423897.79.2E-05[                           ---------             ]HTH_Hin_likeHelix-turn-helix domain of Hin and related proteins. This domain model summarizes a family of DNA-binding domains unique to bacteria and represented by the Hin protein of Salmonella. The basic HTH domain is a simple fold comprised of three core helices that form a right-handed helical bundle. The principal DNA-protein interface is formed by the third helix, the recognition helix, inserting itself into the major groove of the DNA. A diverse array of HTH domains participate in a variety of functions that depend on their DNA-binding properties. HTH_Hin represents one of the simplest versions of the HTH domains; the characterization of homologous relationships between various sequence-diverse HTH domain families remains difficult. The Hin recombinase induces the site-specific inversion of a chromosomal DNA segment containing a promoter, which controls the alternate expression of two genes by reversibly switching orientation. The Hin recombinase consists of a single polypeptide chain containing a C-terminal DNA-binding domain (HTH_Hin) and a catalytic domain.
24pfam08220574797.76.4E-05[                            ------------         ]HTH_DeoRDeoR-like helix-turn-helix domain.
25pfam13730554797.60.00018[                           -----------           ]HTH_36Helix-turn-helix domain.
26COG13492535097.50.00012[                            ------------         ]GlpRDNA-binding transcriptional regulator of sugar metabolism, DeoR/GlpR family
27COG17331205997.30.00035[                            ---------------      ]HxlRDNA-binding transcriptional regulator, HxlR family
28pfam01325584597.30.00047[                            -----------          ]Fe_dep_repressIron dependent repressor, N-terminal DNA binding domain. This family includes the Diphtheria toxin repressor. DNA binding is through a helix-turn-helix motif.
29COG06401105597.30.00079[                           -------------         ]ArsRDNA-binding transcriptional regulator, ArsR family
30pfam039651155797.20.00074[                           --------------        ]Penicillinase_RPenicillinase repressor. The penicillinase repressor negatively regulates expression of the penicillinase gene. The N-terminal region of this protein is involved in operator recognition, while the C-terminal is responsible for dimerization of the protein.
31pfam04967534397.20.0012[                           -----------           ]HTH_10HTH DNA binding domain.
32pfam14947775897.10.00077[                            ---------------      ]HTH_45Winged helix-turn-helix. This winged helix-turn-helix domain contains an extended C-terminal alpha helix which is responsible for dimerization of this domain.
33pfam03444796197.10.00079[                           ---------------       ]HrcA_DNA-bdgWinged helix-turn-helix transcription repressor, HrcA DNA-binding. This domain is always found with a pair of CBS domains pfam00571.
34PRK041724896497.10.00076[                           ----------------      ]pheSphenylalanyl-tRNA synthetase subunit alpha; Provisional
35pfam13936444197.10.0014[                           ----------            ]HTH_38Helix-turn-helix domain. This helix-turn-helix domain is often found in transferases and is likely to be DNA-binding.
36COG13782476097.00.0013[                          ---------------        ]TrmBSugar-specific transcriptional regulator TrmB
37COG07351455597.00.0014[                          --------------         ]FurFe2+ or Zn2+ uptake regulation protein
38pfam00126605496.90.0019[                            --------------       ]HTH_1Bacterial regulatory helix-turn-helix protein, lysR family.
39cd06170574096.90.0022[                           ----------            ]LuxR_C_likeC-terminal DNA-binding domain of LuxR-like proteins. This domain contains a helix-turn-helix motif and binds DNA. Proteins belonging to this group are response regulators; some act as transcriptional activators, others as transcriptional repressors. Many are active as homodimers. Many are two domain proteins in which the DNA binding property of the C-terminal DNA binding domain is modulated by modifications of the N-terminal domain. For example in the case of Lux R which participates in the regulation of gene expression in response to fluctuations in cell-population density (quorum-sensing), a signaling molecule, the pheromone Acyl HSL (N-acyl derivatives of homoserine lactone), binds to the N-terminal domain and leads to LuxR dimerization. For others phophorylation of the N-terminal domain leads to multimerization, for example Escherichia coli NarL and Sinorhizobium melilot FixJ. NarL controls gene expression of many respiratory-related operons when environmental nitrate or nitrite is present under anerobic conditions. FixJ is involved in the transcriptional activation of nitrogen fixation genes. The group also includes small proteins which lack an N-terminal signaling domain, such as Bacillus subtilis GerE. GerE is dimeric and acts in conjunction with sigmaK as an activator or a repressor modulating the expression of various genes in particular those encoding the spore-coat. These LuxR family regulators may share a similar organization of their target binding sites. For example the LuxR dimer binds the lux box, a 20bp inverted repeat, GerE dimers bind two 12bp consensus sequences in inverted orientation having the central four bases overlap, and the NarL dimer binds two 7bp inverted repeats separated by 2 bp.
40pfam01638915996.80.0019[                            ---------------      ]HxlRHxlR-like helix-turn-helix. HxlR, a member of this family, is a DNA-binding protein that acts as a positive regulator of the formaldehyde-inducible hxlAB operon in Bacillus subtilis.
41COG14972606296.80.0025[                           ---------------       ]COG1497Predicted transcriptional regulator
42COG41893086796.80.0025[                          -----------------      ]COG4189Predicted transcriptional regulator
43COG34132154796.80.0025[                          ------------           ]COG3413Predicted DNA binding protein, contains HTH domain
44cd071531164796.80.0017[                            ------------         ]Fur_likeFerric uptake regulator(Fur) and related metalloregulatory proteins; typically iron-dependent, DNA-binding repressors and activators. Ferric uptake regulator (Fur) and related metalloregulatory proteins are iron-dependent, DNA-binding repressors and activators mainly involved in iron metabolism. A general model for Fur repression under iron-rich conditions is that activated Fur (a dimer having one Fe2+ coordinated per monomer) binds to specific DNA sequences (Fur boxes) in the promoter region of iron-responsive genes, hindering access of RNA polymerase, and repressing transcription. Positive regulation by Fur can be direct or indirect, as in the Fur repression of an anti-sense regulatory small RNA. Some members sense metal ions other than Fe2+. For example, the zinc uptake regulator (Zur) responds to Zn2+, the manganese uptake regulator (Mur) responds to Mn2+, and the nickel uptake regulator (Nur) responds to Ni2+. Other members sense signals other than metal ions. For example, PerR, a metal-dependent sensor of hydrogen peroxide. PerR regulates DNA-binding activity through metal-based protein oxidation, and co-ordinates Mn2+ or Fe2+ at its regulatory site. Fur family proteins contain an N-terminal winged-helix DNA-binding domain followed by a dimerization domain; this CD spans both those domains.
45pfam13601805796.80.0017[                            --------------       ]HTH_34Winged helix DNA-binding domain.
46pfam13518523996.80.0039[                            ----------           ]HTH_28Helix-turn-helix domain. This helix-turn-helix domain is often found in transposases and is likely to be DNA-binding.
47COG33982405796.70.003[                           --------------        ]COG3398Predicted transcriptional regulator, containsd two HTH domains
48cd07377663196.60.0019[                               --------          ]WHTH_GntRWinged helix-turn-helix (WHTH) DNA-binding domain of the GntR family of transcriptional regulators. This CD represents the winged HTH DNA-binding domain of the GntR (named after the gluconate operon repressor in Bacillus subtilis) family of bacterial transcriptional regulators and their putative homologs found in eukaryota and archaea. The GntR family has over 6000 members distributed among almost all bacterial species, which is comprised of FadR, HutC, MocR, YtrA, AraR, PlmA, and other subfamilies for the regulation of the most varied biological process. The monomeric proteins of the GntR family are characterized by two function domains: a small highly conserved winged helix-turn-helix prokaryotic DNA binding domain in the N-terminus, and a very diverse regulatory ligand-binding domain in the C-terminus for effector-binding/oligomerization, which provides the basis for the subfamily classifications. Binding of the effector to GntR-like transcriptional regulators is presumed to result in a conformational change that regulates the DNA-binding affinity of the repressor. The GntR-like proteins bind as dimers, where each monomer recognizes a half-site of 2-fold symmetric DNA sequences.
49COG2771654796.60.0081[                           -----------           ]CsgDDNA-binding transcriptional regulator, CsgD family
50TIGR041761055996.60.0051[                           ---------------       ]MarR_EPSEPS-associated transcriptional regulator, MarR family. Members of this family of MarR-family transcriptional regulators are associated with long genomic loci consisting of genes encoding enzymes for the biosynthesis of exopolysaccharides. These genes include glycosyl transferases, sugar modifying enzymes (epimerases, isomerases, methyltransferases, aminotransferases, etc.), and exopolysaccharide polymerases (wzx, wzy). In Leptospira interrogans, borgpeterenii and biflexa, this gene is observed first in unidirectional EPS biosynthesis loci as long as 90 genes. MarR genes (pfam01407) are known to bind to DNA regions with palindromic or pseudopalindromic sequences as homodimers, and to bind small molecules as triggers for conformational changes controlling on/off states.
51pfam13384504096.50.01[                            ----------           ]HTH_23Homeodomain-like domain.
52pfam00196584796.50.0078[                           -----------           ]GerEBacterial regulatory proteins, luxR family.
53COG1654793396.40.0072[                              --------           ]BirABiotin operon repressor
54COG25242946696.40.0049[                           ----------------      ]COG2524Predicted transcriptional regulator, contains C-terminal CBS domains
55pfam13545704296.30.0065[                               ----------        ]HTH_Crp_2Crp-like helix-turn-helix domain. This family represents a crp-like helix-turn-helix domain that is likely to bind DNA.
56pfam09012694296.30.0079[                              ----------         ]FeoCFeoC like transcriptional regulator. This family contains several transcriptional regulators, including FeoC, which contain a HTH motif. FeoC acts as a
57PRK150902575496.30.0066[                            -------------        ]PRK15090DNA-binding transcriptional regulator KdgR; Provisional
58PRK002152055296.30.0083[                           -------------         ]PRK00215LexA repressor; Validated
59COG13861845996.30.0073[                          ----------------       ]ScpBChromosome segregation and condensation protein ScpB
60PRK118863195596.20.0071[                            -------------        ]PRK11886bifunctional biotin--
61pfam143941715396.20.0088[                             -------------       ]DUF4423Domain of unknown function (DUF4423). This presumed domain is functionally uncharacterized. This domain family is found in bacteria, and is approximately 170 amino acids in length.
62COG36821235496.10.017[                           -------------         ]COG3682Predicted transcriptional regulator
63pfam02796453996.00.012[                           ----------            ]HTH_7Helix-turn-helix domain of resolvase.
64pfam08281544395.80.026[                           ----------            ]Sigma70_r4_2Sigma-70, region 4. Region 4 of sigma-70 like sigma-factors are involved in binding to the -35 promoter element via a helix-turn-helix motif.
65TIGR003313376895.80.013[                           --------------------- ]hrcAheat shock gene repressor HrcA. HrcA represses the class I heat shock operons groE and dnaK; overproduction prevents induction of these operons by heat shock while deletion allows constitutive expression even at low temperatures. In Bacillus subtilis, hrcA is the first gene of the dnaK operon and so is itself a heat shock gene.
66pfam02082834195.80.017[                               ----------        ]Rrf2Transcriptional regulator. This family is related to pfam001022 and other transcription regulation families (personal obs: Yeats C).
67pfam08461665495.80.019[                             -------------       ]HTH_12Ribonuclease R winged-helix domain. This domain is found at the amino terminus of Ribonuclease R and a number of presumed transcriptional regulatory proteins from archaebacteria.
68pfam020021055595.80.025[                          --------------         ]TFIIE_alphaTFIIE alpha subunit. The general transcription factor TFIIE has an essential role in eukaryotic transcription initiation together with RNA polymerase II and other general factors. Human TFIIE consists of two subunits, TFIIE-alpha and TFIIE-beta, and joins the pre-initiation complex after RNA polymerase II and TFIIF. This family consists of the conserved amino terminal region of eukaryotic TFIIE-alpha and proteins from archaebacteria that are presumed to be TFIIE-alpha subunits.
69TIGR024312485495.70.018[                            --------------       ]pcaR_pcaUbeta-ketoadipate pathway transcriptional regulators, PcaR/PcaU/PobR family. Member of this family are IclR-type transcriptional regulators with similar DNA binding sites, able to bind at least three different metabolites related to protocatechuate metabolism. Beta-ketoadipate is the inducer for PcaR, p-hydroxybenzoate for PobR, and protocatechuate for PcaU.
70COG3432956395.70.0056[                           ----------------      ]COG3432Predicted transcriptional regulator
71COG28654679195.70.043[                -----------------------          ]COG2865Predicted transcriptional regulator, contains HTH domain
72PRK098342635895.70.012[                            --------------       ]PRK09834DNA-binding transcriptional activator MhpR; Provisional
73COG17251254495.70.014[                               -----------       ]YhcFDNA-binding transcriptional regulator YhcF, GntR family
74COG21972114695.70.022[                           -----------           ]CitBDNA-binding response regulator, NarL/FixJ family, contains REC and HTH domains
75pfam10007935495.60.036[                           -------------         ]DUF2250Uncharacterized protein conserved in archaea (DUF2250). Members of this family of hypothetical archaeal proteins have no known function.
76pfam00325323095.60.02[                               -------           ]CrpBacterial regulatory proteins, crp family.
77COG16933256195.50.026[                           ---------------       ]COG1693Repressor of nif and glnA expression
78pfam09202825995.40.044[                           ---------------       ]Rio2_NRio2, N-terminal. Members of this family are found in Rio2, and are structurally homologous to the winged helix (wHTH) domain. They adopt a structure consisting of four alpha helices followed by two beta strands and a fifth alpha helix. The domain confers DNA binding properties to the protein, as per other winged helix domains.
79PRK110501526395.40.029[                           ----------------      ]PRK11050manganese transport regulator MntR; Provisional
80PRK000823397195.30.022[                           --------------------- ]hrcAheat-inducible transcription repressor; Provisional
81PRK111791536595.30.0087[                           ----------------      ]PRK11179DNA-binding transcriptional regulator AsnC; Provisional
82COG41901446395.30.026[                         ---------------         ]COG4190Predicted transcriptional regulator
83COG19591504595.20.027[                               -----------       ]IscRDNA-binding transcriptional regulator, IscR family
84COG05832976295.20.015[                           ----------------      ]LysRDNA-binding transcriptional regulator, LysR family
85pfam14493913695.20.047[                             ---------           ]HTH_40Helix-turn-helix domain. This presumed domain is found at the C-terminus of a large number of helicase proteins.
86pfam096511366895.20.056[        -----------------                        ]Cas_APE2256CRISPR-associated protein (Cas_APE2256). This entry represents a conserved region of about 150 amino acids found in at least five archaeal and three bacterial species. These species all contain CRISPRs (Clustered Regularly Interspaced Short Palindromic Repeats). In six of eight species, the protein is encoded the vicinity of a CRISPR/Cas locus.
87pfam07848704095.10.028[                                ----------       ]PaaXPaaX-like protein. This family contains proteins that are similar to the product of the paaX gene of Escherichia coli. This protein is involved in the regulation of expression of a group of proteins known to participate in the metabolizm of phenylacetic acid. In fact, some members of this family are annotated by InterPro as containing a winged helix DNA-binding domain (Interpro:IPR009058).
88PRK062661784795.00.044[                           ------------          ]PRK06266transcription initiation factor E subunit alpha; Validated
89pfam014751206195.00.058[                          ---------------        ]FURFerric uptake regulator family. This family includes metal ion uptake regulator proteins, that bind to the operator DNA and controls transcription of metal ion-responsive genes. This family is also known as the FUR family.
90pfam040791589995.00.048[          -----------------------------          ]DUF387Putative transcriptional regulators (Ypuh-like). This family of conserved bacterial proteins are thought to possibly be helix-turn-helix type transcriptional regulators.
91COG33982405295.00.046[                           -------------         ]COG3398Predicted transcriptional regulator, containsd two HTH domains
92TIGR029371584394.90.047[                           ----------            ]sigma70-ECFRNA polymerase sigma factor, sigma-70 family. This model encompasses all varieties of the sigma-70 type sigma factors including the ECF subfamily. A number of sigma factors have names with a different number than 70 (i.e. sigma-38), but in fact, all except for the Sigma-54 family (TIGR02395) are included within this family. Several Pfam models hit segments of these sequences including Sigma-70 region 2 (pfam04542) and Sigma-70, region 4 (pfam04545), but not always above their respective trusted cutoffs.
93cd097421836494.90.084[          ----------------                       ]Csm6_III-ACRISPR/Cas system-associated protein Csm6. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; loosely associated with CRISPR/Cas systems; also known as APE2256 family
94pfam00392643494.90.027[                               ---------         ]GntRBacterial regulatory proteins, gntR family. This family of regulatory proteins consists of the N-terminal HTH region of GntR-like bacterial transcription factors. At the C-terminus there is usually an effector-binding/oligomerization domain. The GntR-like proteins include the following sub-families: MocR, YtrR, FadR, AraR, HutC and PlmA, DevA, DasR. Many of these proteins have been shown experimentally to be autoregulatory, enabling the prediction of operator sites and the discovery of cis/trans relationships. The DasR regulator has been shown to be a global regulator of primary metabolizm and development in Streptomyces coelicolor.
95pfam04545504194.80.095[                           ----------            ]Sigma70_r4Sigma-70, region 4. Region 4 of sigma-70 like sigma-factors are involved in binding to the -35 promoter element via a helix-turn-helix motif. Due to the way Pfam works, the threshold has been set artificially high to prevent overlaps with other helix-turn-helix families. Therefore there are many false negatives.
96COG31773484794.80.036[                            ------------         ]COG3177Fic family protein
97pfam04182744594.70.064[                           -----------           ]B-block_TFIIICB-block binding subunit of TFIIIC. Yeast transcription factor IIIC (TFIIIC) is a multi-subunit protein complex that interacts with two control elements of class III promoters called the A and B blocks. This family represents the subunit within TFIIIC involved in B-block binding.
98COG17772175894.70.09[                             --------------      ]COG1777Predicted transcriptional regulator
99TIGR023371186694.70.065[                          -----------------      ]HpaRhomoprotocatechuate degradation operon regulator, HpaR. This Helix-Turn-Helix transcriptional regulator is a member of the MarR family (pfam01047) and is found in association with operons for the degradation of 4-hydroxyphenylacetic acid via homoprotocatechuate.
100pfam01726634894.70.1[                           ------------          ]LexA_DNA_bindLexA DNA binding domain. This is the DNA binding domain of the LexA SOS regulon repressor which prevents expression of DNA repair proteins. The aligned region contains a variant form of the helix-turn-helix DNA binding motif. This domain is found associated with pfam00717 the auto-proteolytic domain of LexA EC:3.4.21.88.
101pfam05043874394.60.077[                            -----------          ]MgaMga helix-turn-helix domain. M regulator protein trans-acting positive regulator (Mga) is a DNA-binding protein that activates the expression of several important virulence genes in group A streptococcus in response to changing environmental conditions. This domain is found in the centre of the Mga proteins. This family also contains a number of bacterial RofA transcriptional regulators that seem to be largely restricted to streptococci. These proteins have been shown to regulate the expression of important bacterial adhesins. This is presumably a DNA-binding domain.
102cd06171554194.60.096[                           ----------            ]Sigma70_r4Sigma70, region (SR) 4 refers to the most C-terminal of four conserved domains found in Escherichia coli (Ec) sigma70, the main housekeeping sigma, and related sigma-factors (SFs). A SF is a dissociable subunit of RNA polymerase, it directs bacterial or plastid core RNA polymerase to specific promoter elements located upstream of transcription initiation points. The SR4 of Ec sigma70 and other essential primary SFs contact promoter sequences located 35 base-pairs upstream of the initiation point, recognizing a 6-base-pair -35 consensus TTGACA. Sigma70 related SFs also include SFs which are dispensable for bacterial cell growth for example Ec sigmaS, SFs which activate regulons in response to a specific signal for example heat-shock Ec sigmaH, and a group of SFs which includes the extracytoplasmic function (ECF) SFs and is typified by Ec sigmaE which contains SR2 and -4 only. ECF SFs direct the transcription of genes that regulate various responses including periplasmic stress and pathogenesis. Ec sigmaE SR4 also contacts the -35 element, but recognizes a different consensus (a 7-base-pair GGAACTT). Plant SFs recognize sigma70 type promoters and direct transcription of the major plastid RNA polymerase, plastid-encoded RNA polymerase (PEP).
103PRK111691644894.60.044[                           -----------           ]PRK11169leucine-responsive transcriptional regulator; Provisional
104PRK135092514694.60.048[                             -----------         ]PRK13509transcriptional repressor UlaR; Provisional
105pfam057321654294.50.043[                               -----------       ]RepLFirmicute plasmid replication protein (RepL). This family consists of Firmicute RepL proteins which are involved in plasmid replication.
106pfam06969665294.20.15[                             -------------       ]HemN_CHemN C-terminal domain. Members of this family are all oxygen-independent coproporphyrinogen-III oxidases (HemN). This enzyme catalyses the oxygen-independent conversion of coproporphyrinogen-III to protoporphyrinogen-IX, one of the last steps in haem biosynthesis. The function of this domain is unclear, but comparison to other proteins containing a radical SAM domain (pfam04055) suggest it may be a substrate binding domain.
107TIGR00122694194.10.053[                            ----------           ]birA_repr_regBirA biotin operon repressor domain. This model represents the amino-terminal helix-turn-helix repressor region of the biotin--acetyl-CoA-carboxylase ligase/biotin operon repressor bifunctional protein BirA. In many species, the biotin--acetyl-CoA-carboxylase ligase ortholog lacks this DNA-binding repressor region and therefore is not equivalent to the well-characterized BirA of E. coli. This model may recognize some other putative repressor proteins, such as DnrO of Streptomyces peucetius with scores below the noise cutoff but with significance shown by low E-value.
108COG15101773994.00.074[                              ----------         ]GbsRDNA-binding transcriptional regulator GbsR, MarR family
109TIGR027022034494.00.045[                            -----------          ]SufR_cyanoiron-sulfur cluster biosynthesis transcriptional regulator SufR. All members of this cyanobacterial protein family are the transcriptional regulator SufR and regulate the SUF system, which makes possible iron-sulfur cluster biosynthesis despite exposure to oxygen. In all cases, the sufR gene is encoded near SUF system genes but in the opposite direction. This DNA-binding protein belongs to the the DeoR family of helix-loop-helix proteins. All members also have a probable metal-binding motif C-X(12)-C-X(13)-C-X(14)-C near the C-terminus.
110COG456522412993.90.14[     ---------------------------------           ]CitBResponse regulator of citrate/malate metabolism
111TIGR00637996293.90.1[                           ----------------      ]ModE_repressModE molybdate transport repressor domain. ModE is a molybdate-activated repressor of the molybdate transport operon in E. coli. It consists of the domain represented by this model and two tandem copies of mop-like domain, where Mop proteins are a family of 68-residue molybdenum-pterin binding proteins of Clostridium pasteurianum. This model also represents the full length of a pair of archaeal proteins that lack Mop-like domains. PSI-BLAST analysis shows similarity to helix-turn-helix regulatory proteins.
112pfam03551753993.80.043[                                 ----------      ]PadRTranscriptional regulator PadR-like family. Members of this family are transcriptional regulators that appear to be related to the pfam01047 family. This family includes PadR, a protein that is involved in negative regulation of phenolic acid metabolizm.
113pfam05225453693.80.15[                            ---------            ]HTH_psqhelix-turn-helix, Psq domain. This DNA-binding motif is found in four copies in the pipsqueak protein of Drosophila melanogaster. In pipsqueak this domain binds to GAGA sequence.
114TIGR029441305993.50.16[                           ---------------       ]suf_reg_XanthoFeS assembly SUF system regulator, gammaproteobacterial. The SUF system is an oxygen-resistant iron-sulfur cluster assembly system found in both aerobes and facultative anaerobes. Its presence appears to be a marker of oxygen tolerance; strict anaerobes and microaerophiles tend to have different FeS cluster biosynthesis systems. Members of this protein family belong to the rrf2 family of transcriptional regulators and are found, typically, as the first gene of a SUF operon. It is found only in a subset of genomes that encode the SUF system, including the genus Xanthomonas. The conserved location suggests an autoregulatory role.
115COG13392144993.30.07[                               ------------      ]RfkArchaeal CTP-dependent riboflavin kinase
116COG16751765193.20.13[                           -------------         ]TFA1Transcription initiation factor IIE, alpha subunit
117pfam08222603493.10.094[                               --------          ]HTH_CodYCodY helix-turn-helix domain. This family consists of the C-terminal helix-turn-helix domain found in several bacterial GTP-sensing transcriptional pleiotropic repressor CodY proteins. CodY has been found to repress the dipeptide transport operon (dpp) of Bacillus subtilis in nutrient-rich conditions. The CodY protein also has a repressor effect on many genes in Lactococcus lactis during growth in milk.
118COG21882364693.00.086[                             -----------         ]MngRDNA-binding transcriptional regulator, GntR family
119TIGR007381324893.00.16[                              ------------       ]rrf2_superRrf2 family protein. This model represents a superfamily of probable transcriptional regulators. One member, RRF2 of Desulfovibrio vulgaris is an apparent regulatory protein experimentally (MEDLINE:97293189). The N-terminal region appears related to the DNA-binding biotin repressor region of the BirA bifunctional according to results after three rounds of PSI-BLAST with a fairly high stringency.
120TIGR002811865693.00.19[                           --------------        ]TIGR00281segregation and condensation protein B. Shown to be required for chromosome segregation and condensation in B. subtilis.
121pfam13542513493.00.25[                            ---------            ]HTH_Tnp_ISL3Helix-turn-helix domain of transposase family ISL3.
122PRK056384426592.90.12[                           ----------------      ]PRK05638threonine synthase; Validated
123COG14203464992.90.19[                           ------------          ]HrcATranscriptional regulator of heat shock response
124COG33881015692.60.06[                            --------------       ]COG3388Predicted transcriptional regulator
125pfam13309643792.50.072[                           ---------             ]HTH_22HTH domain. This domain is a helix-turn-helix domain that is likely to act as a DNA-binding domain.
126pfam020011004592.50.37[                          -----------            ]DUF134Protein of unknown function DUF134. This family of archaeal proteins has no known function.
127PRK141652175892.40.11[                            --------------       ]PRK14165winged helix-turn-helix domain-containing protein/riboflavin kinase; Provisional
128pfam135511093892.30.29[                             ---------           ]HTH_29Winged helix-turn helix. This helix-turn-helix domain is often found in transferases and is likely to be DNA-binding.
129COG33272914392.10.12[                                ----------       ]PaaXDNA-binding transcriptional regulator PaaX (phenylacetic acid degradation)
130COG34151383992.10.27[                             ---------           ]COG3415Transposase
131TIGR03879733392.10.083[                              --------           ]near_KaiC_domprobable regulatory domain. This model describes a common domain shared by two different families of proteins, each of which occurs regularly next to its corresponding partner family, a probable regulatory with homology to KaiC. By implication, this protein family likely is also involved in sensory transduction and/or regulation.
132COG53402694592.00.34[                              -----------        ]COG5340Transcriptional regulator, predicted component of viral defense system
133COG16951386292.00.15[                            ---------------      ]PadRDNA-binding transcriptional regulator, PadR family
134pfam14502483691.90.16[                               ---------         ]HTH_41Helix-turn-helix domain.
135pfam098621135891.90.34[                        --------------           ]DUF2089Protein of unknown function (DUF2089). This domain, found in various hypothetical prokaryotic proteins, has no known function. This domain is a zinc-ribbon.
136COG23903213391.90.28[                              --------           ]DeoRDNA-binding transcriptional regulator LsrR, DeoR family
137pfam12728522791.20.22[                               --------          ]HTH_17Helix-turn-helix domain. This domain is a DNA-binding helix-turn-helix domain.
138pfam04218534191.10.59[                           ----------            ]CENP-B_NCENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B dimers either bind two separate DNA molecules or alternatively, they may bind two CENP-B boxes on one DNA molecule, with the intervening stretch of DNA forming a loop structure. The CENP-B DNA-binding domain consists of two repeating domains, RP1 and RP2. This family corresponds to RP1 has been shown to consist of four helices in a helix-turn-helix structure.
139COG38883213391.10.43[                              --------           ]COG3888Predicted transcriptional regulator
140pfam079002204691.10.4[                           -----------           ]DUF1670Protein of unknown function (DUF1670). The hypothetical eukaryotic proteins found in this family are of unknown function.
141pfam087841034790.70.6[                           -----------           ]RPA_CReplication protein A C terminal. This domain corresponds to the C terminal of the single stranded DNA binding protein RPA (replication protein A). RPA is involved in many DNA metabolic pathways including DNA replication, DNA repair, recombination, cell cycle and DNA damage checkpoints.
142pfam08280593990.60.73[                            ---------            ]HTH_MgaM protein trans-acting positive regulator (MGA) HTH domain. Mga is a DNA-binding protein that activates the expression of several important virulence genes in group A streptococcus in response to changing environmental conditions.
143COG1342996790.41.2[                          -------------------    ]COG1342Predicted DNA-binding protein, UPF0251 family
144PRK092494536790.10.3[                          -----------------      ]PRK09249coproporphyrinogen III oxidase; Provisional
145cd01392521989.80.21[                                -----            ]HTH_LacIHelix-turn-helix (HTH) DNA binding domain of the LacI family of transcriptional regulators. HTH-DNA binding domain of the LacI (lactose operon repressor) family of bacterial transcriptional regulators and their putative homologs found in plants. The LacI family has more than 500 members distributed among almost all bacterial species. The monomeric proteins of the LacI family contain common structural features that include a small DNA-binding domain with a helix-turn-helix motif in the N-terminus, a regulatory ligand-binding domain which exhibits the type I periplasmic binding protein fold in the C-terminus for oligomerization and for effector binding, and an approximately 18-amino acid linker connecting these two functional domains. In LacI-like transcriptional regulators, the ligands are monosaccharides including lactose, ribose, fructose, xylose, arabinose, galactose/glucose, and other sugars, with a few exceptions. When the C-terminal domain of the LacI family repressor binds its ligand, it undergoes a conformational change which affects the DNA-binding affinity of the repressor. In Escherichia coli, LacI represses transcription by binding with high affinity to the lac operon at a specific operator DNA sequence until it interacts with the physiological inducer allolactose or a non-degradable analog IPTG (isopropyl-beta-D-thiogalactopyranoside). Induction of the repressor lowers its affinity for the operator sequence, thereby allowing transcription of the lac operon structural genes (lacZ, lacY, and LacA). The lac repressor occurs as a tetramer made up of two functional dimers. Thus, two DNA binding domains of a dimer are required to bind the inverted repeat sequences of the operator DNA binding sites.
146pfam02954423689.70.62[                            ---------            ]HTH_8Bacterial regulatory protein, Fis family.
147pfam076381854089.70.63[                          ----------             ]Sigma70_ECFECF sigma factor. These proteins are probably RNA polymerase sigma factors belonging to the extra-cytoplasmic function (ECF) subfamily and show sequence similarity to pfam04542 and pfam04545.
148COG21862414289.60.34[                              ----------         ]FadRDNA-binding transcriptional regulator, FadR family
149cd04761492789.60.39[                               --------          ]HTH_MerR-SFHelix-Turn-Helix DNA binding domain of transcription regulators from the MerR superfamily. Helix-turn-helix (HTH) transcription regulator MerR superfamily, N-terminal domain. The MerR family transcription regulators have been shown to mediate responses to stress including exposure to heavy metals, drugs, or oxygen radicals in eubacterial and some archaeal species. They regulate transcription of multidrug/metal ion transporter genes and oxidative stress regulons by reconfiguring the spacer between the -35 and -10 promoter elements. A typical MerR regulator is comprised of two distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, and organic substrates.
150PRK137771855189.60.24[                              -------------      ]PRK13777transcriptional regulator Hpr; Provisional
151COG20051306589.50.32[                          -----------------      ]ModEDNA-binding transcriptional regulator ModE (molybdenum-dependent)
152COG15951824389.30.74[                          -----------            ]RpoEDNA-directed RNA polymerase specialized sigma subunit, sigma24 family
153pfam13613532989.31.5[                              -------            ]HTH_Tnp_4Helix-turn-helix of DDE superfamily endonuclease. This domain is the probable DNA-binding region of transposase enzymes, necessary for efficient DNA transposition. Most of the members derive from the IS superfamily IS5 and rather fewer from IS4.
154cd047891024489.20.22[                               ----------------- ]HTH_CfaHelix-Turn-Helix DNA binding domain of the Cfa transcription regulator. Putative helix-turn-helix (HTH) MerR-like transcription regulator; the N-terminal domain of Cfa, a cyclopropane fatty acid synthase and other related methyltransferases. Based on sequence similarity, these proteins are predicted to function as transcription regulators that mediate responses to stress in eubacteria. They belong to the MerR superfamily of transcription regulators that promote transcription of various stress regulons by reconfiguring the operator sequence located between the -35 and -10 promoter elements. A typical MerR regulator is comprised of distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, and organic substrates.
155PRK093331505788.90.45[                             --------------      ]PRK0933330S ribosomal protein S19e; Provisional
156COG04783046388.81[                          ----------------       ]RIO2RIO-like serine/threonine protein kinase fused to N-terminal HTH domain
157COG56251135688.80.65[                          --------------         ]COG5625Predicted DNA-binding transcriptional regulator, contains HTH domain
158pfam13443633088.80.56[                             -------             ]HTH_26Cro/C1-type HTH DNA-binding domain. This is a helix-turn-helix domain that probably binds to DNA.
159pfam00356462388.40.43[                               ------            ]LacIBacterial regulatory proteins, lacI family.
160PRK104342564488.30.77[                            ----------           ]srlRDNA-bindng transcriptional repressor SrlR; Provisional
161COG18022304188.20.52[                             ----------          ]GntRDNA-binding transcriptional regulator, GntR family
162pfam041572184588.21[                           -----------           ]EAP30EAP30/Vps36 family. This family includes EAP30 as well as the Vps36 protein. Vps36 is involved in Golgi to endosome trafficking. EAP30 is a subunit of the ELL complex. The ELL is an 80-kDa RNA polymerase II transcription factor. ELL interacts with three other proteins to form the complex known as ELL complex. The ELL complex is capable of increasing that catalytic rate of transcription elongation, but is unable to repress initiation of transcription by RNA polymerase II as is the case of ELL. EAP30 is thought to lead to the derepression of ELL's transcriptional inhibitory activity.
163COG15683545787.80.59[                            ---------------      ]COG1568Predicted methyltransferase
164pfam01527754287.71.6[                           ----------            ]HTH_Tnp_1Transposase. Transposase proteins are necessary for efficient DNA transposition. This family consists of various Escherichia coli insertion elements and other bacterial transposases some of which are members of the IS3 family.
165pfam01418775087.41.1[                          ------------           ]HTH_6Helix-turn-helix domain, rpiR family. This domain contains a helix-turn-helix motif. The best characterized member of this family is Escherichia coli rpiR, a regulator of the expression of rpiB gene.
166COG41094324887.30.34[                              ------------       ]YtoIPredicted transcriptional regulator containing CBS domains
167PRK064741785187.10.97[                            ------------         ]PRK06474hypothetical protein; Provisional
168COG32846067687.02.5[                 --------------------            ]AcoRTranscriptional regulator of acetoin/glycerol metabolism
169TIGR004981995487.01.2[                           -------------         ]lexASOS regulatory protein LexA. LexA acts as a homodimer to repress a number of genes involved in the response to DNA damage (SOS response), including itself and RecA. RecA, in the presence of single-stranded DNA, acts as a co-protease to activate a latent autolytic protease activity (EC 3.4.21.88) of LexA, where the active site Ser is part of LexA. The autolytic cleavage site is an Ala-Gly bond in LexA (at position 84-85 in E. coli LexA; this sequence is replaced by Gly-Gly in Synechocystis). The cleavage leads to derepression of the SOS regulon and eventually to DNA repair. LexA in Bacillus subtilis is called DinR. LexA is much less broadly distributed than RecA.
170PRK036352946286.70.37[                           ----------------      ]PRK03635chromosome replication initiation inhibitor protein; Validated
171COG17101394586.61.3[                          -----------            ]COG1710Uncharacterized protein
172PRK106512165086.40.62[                           ------------          ]PRK10651transcriptional regulator NarL; Provisional
173COG23783114885.81.5[                            ------------         ]YafYPredicted DNA-binding transcriptional regulator YafY, contains an HTH and WYL domains
174TIGR036421244285.81.4[          ----------                             ]cas_csx14CRISPR-associated protein, Csx14 family. This model describes a protein N-terminal protein sequence domain strictly associated with CRISPR and CRISPR-associated protein systems. This model and TIGR02584 identify two separate clades from a larger homology domain family, both CRISPR-associated, while other homologs are found that may not be. Members are found in bacteria that include Pelotomaculum thermopropionicum SI, Thermoanaerobacter tengcongensis MB4, and Roseiflexus sp. RS-1, and in archaea that include Thermoplasma volcanium, Picrophilus torridus, and Methanospirillum hungatei. The molecular function is unknown.
175PRK099543625885.72[                            ---------------      ]PRK09954putative kinase; Provisional
176COG22381475685.60.58[                             --------------      ]RPS19ARibosomal protein S19E (S16A)
177pfam06971493285.41.7[                            --------             ]Put_DNA-bind_NPutative DNA-binding protein N-terminus. This family represents the N-terminus (approximately 50 residues) of a number of putative bacterial DNA-binding proteins.
178PRK116391696085.31.5[                          ---------------        ]PRK11639zinc uptake transcriptional repressor; Provisional
179COG13181823385.10.72[                              --------           ]COG1318Predicted transcriptional regulator
180cd005921002784.80.97[                               --------          ]HTH_MerR-likeHelix-Turn-Helix DNA binding domain of MerR-like transcription regulators. Helix-turn-helix (HTH) MerR-like transcription regulator, N-terminal domain. The MerR family transcription regulators have been shown to mediate responses to stress including exposure to heavy metals, drugs, or oxygen radicals in eubacterial and some archaeal species. They regulate transcription of multidrug/metal ion transporter genes and oxidative stress regulons by reconfiguring the spacer between the -35 and -10 promoter elements. A typical MerR regulator is comprised of two distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, and organic substrates.
181pfam07453372184.80.48[                               -----             ]NUMOD1NUMOD1 domain. This domain probably represents a DNA-binding helix-turn-helix based on its similarity to other families (Bateman A pers obs).
182pfam04703614884.62.2[                            ------------         ]FaeAFaeA-like protein. This family represents a number of fimbrial protein transcription regulators found in Gram-negative bacteria. These proteins are thought to facilitate binding of the leucine-rich regulatory protein to regulatory elements, possibly by inhibiting deoxyadenosine methylation of these elements by deoxyadenosine methylase.
183cd047751023284.50.63[                               ---------         ]HTH_Cfa-likeHelix-Turn-Helix DNA binding domain of Cfa-like transcription regulators. Putative helix-turn-helix (HTH) MerR-like transcription regulators; the HTH domain of Cfa, a cyclopropane fatty acid synthase, and other related methyltransferases, as well as, the N-terminal domain of a conserved, uncharacterized ~172 a.a. protein. Based on sequence similarity of the N-terminal domain, these proteins are predicted to function as transcription regulators that mediate responses to stress in eubacteria. They belong to the MerR superfamily of transcription regulators that promote transcription of various stress regulons by reconfiguring the operator sequence located between the -35 and -10 promoter elements. A typical MerR regulator is comprised of distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, and organic substrates.
184pfam00376382684.51.1[                               --------          ]MerRMerR family regulatory protein.
185PRK097754423584.41.3[                             --------            ]PRK09775putative DNA-binding transcriptional regulator; Provisional
186cd097231324284.31.8[          ----------                             ]Csx1_III-UCRISPR/Cas system-associated protein Csx1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; Some proteins could have an additional fusion with RecB-family nuclease domain; Core domain appears to have a Rossmann-like fold; loosely associated with CRISPR/Cas systems; also known as csx13 family
187COG07891243284.30.96[                               ---------         ]SoxRDNA-binding transcriptional regulator, MerR family
188PRK039021426084.31.8[                            --------------       ]PRK03902manganese transport transcriptional regulator; Provisional
189PRK133482946284.11.1[                           ----------------      ]PRK13348chromosome replication initiation inhibitor protein; Provisional
190cd04762492184.01.1[                               -----             ]HTH_MerR-truncHelix-Turn-Helix DNA binding domain of truncated MerR-like proteins. Proteins in this family mostly have a truncated helix-turn-helix (HTH) MerR-like domain. They lack a portion of the C-terminal region, called Wing 2 and the long dimerization helix that is typically present in MerR-like proteins. These truncated domains are found in response regulator receiver (REC) domain proteins (i.e., CheY), cytosine-C5 specific DNA methylases, IS607 transposase-like proteins, and RacA, a bacterial protein that anchors chromosomes to cell poles.
191COG15425936683.41.9[                          ----------------       ]COG1542Uncharacterized protein
192TIGR020101354583.22.4[                             -----------         ]IscRiron-sulfur cluster assembly transcription factor IscR. This model describes IscR, an iron-sulfur binding transcription factor of the ISC iron-sulfur cluster assembly system.
193pfam01371883683.21.7[                            ---------            ]Trp_repressorTrp repressor protein. This protein binds to tryptophan and represses transcription of the Trp operon.
194PRK117532119383.11.5[     ---------------------------------           ]PRK11753DNA-binding transcriptional dual regulator Crp; Provisional
195COG49011074783.01.7[                             -----------         ]RPS25Ribosomal protein S25
196COG56311996182.72.6[                           ---------------       ]COG5631Predicted transcription regulator, contains HTH domain, MarR family
197COG29642204182.51.1[                            ----------           ]YheOPredicted transcriptional regulator YheO, contains PAS and DNA-binding HTH domains
198TIGR003731585282.54.5[                          -------------          ]TIGR00373transcription factor E. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
199TIGR045433315782.21.8[                             --------------      ]ketoArg_3Met2-ketoarginine methyltransferase. This SAM-dependent C-methyltransferase performs the middle step of a three step conversion from arginine to beta-methylarginine. It performs a C-methylation at position 3 of 5-guanidino-2-oxopentanoic acid (keto-arginine). An aminotransferase converts arginine to 5-guanidino-2-oxopentanoic acid, and later converts 5-guanidino-3-methyl-2-oxopentanoic acid to beta-methylarginine.
200PRK119241794282.12.6[                          -----------            ]PRK11924RNA polymerase sigma factor; Provisional
201TIGR02607782382.01.4[                               -----             ]antidote_HigAaddiction module antidote protein, HigA family. Members of this family form a distinct clade within the larger family HTH_3 of helix-turn-helix proteins, described by pfam01381. Members of this clade are strictly bacterial and nearly always shorter than 110 amino acids. This family includes the characterized member HigA, without which the killer protein HigB cannot be cloned. The hig (host inhibition of growth) system is noted to be unusual in that killer protein is uncoded by the upstream member of the gene pair.
202cd00093582982.02.4[                             -------             ]HTH_XREHelix-turn-helix XRE-family like proteins. Prokaryotic DNA binding proteins belonging to the xenobiotic response element family of transcriptional regulators.
203TIGR018891095181.05[                           ------------          ]Staph_reg_Sarstaphylococcal accessory regulator family. This model represents a family of transcriptional regulatory proteins in Staphylococcus aureus and Staphylococcus epidermidis. Some members contain two tandem copies of this region. This family is related to the MarR transcriptional regulator family described by pfam01047.
204TIGR01764492281.01.7[                               -----             ]exciseDNA binding domain, excisionase family. An excisionase, or Xis protein, is a small protein that binds and promotes excisive recombination; it is not enzymatically active. This model represents a number of putative excisionases and related proteins from temperate phage, plasmids, and transposons, as well as DNA binding domains of other proteins, such as a DNA modification methylase. This model identifies mostly small proteins and N-terminal regions of large proteins, but some proteins appear to have two copies. This domain appears similar, in both sequence and predicted secondary structure (PSIPRED) to the MerR family of transcriptional regulators (pfam00376).
205pfam07022653080.82.4[                             -------             ]Phage_CI_reprBacteriophage CI repressor helix-turn-helix domain. This family consists of several phage CI repressor proteins and related bacterial sequences. The CI repressor is known to function as a transcriptional switch, determining whether transcription is lytic or lysogenic.
206COG29631165080.44.2[                            ------------         ]InsETransposase and inactivated derivatives
207COG16093332380.41.3[                               ------            ]PurRDNA-binding transcriptional regulator, LacI/PurR family
208pfam044585916680.02.7[                          ----------------       ]DUF505Protein of unknown function (DUF505). Family of uncharacterized prokaryotic proteins.
209COG28263184279.52.2[                           ----------            ]Tra8Transposase and inactivated derivatives, IS30 family
210TIGR026981305379.54.8[                           -------------         ]CopY_TcrYcopper transport repressor, CopY/TcrY family. This family includes metal-fist type transcriptional repressors of copper transport systems such as copYZAB of Enterococcus hirae and tcrYAZB (transferble copper resistance) of an Enterocuccus faecium plasmid. High levels of copper can displace zinc and prevent binding by the repressor, activating efflux by copper resistance transporters. The most closely related proteins excluded by this model are antibiotic resistance regulators including the methicillin resistance regulatory protein MecI.
211pfam044921005579.34.1[                          --------------         ]Phage_rep_OBacteriophage replication protein O. Replication protein O is necessary for the initiation of bacteriophage DNA replication. Protein O interacts with the lambda replication origin, and also with replication protein P to form an oligomer. It is speculated that the N-terminal half interacts with the replication origin while the C terminal half mediates protein-protein interaction.
212pfam096811214579.03.2[                              -----------        ]Phage_rep_org_NN-terminal phage replisome organizer (Phage_rep_org_N). This entry represents the N-terminal domain of a small family of phage proteins. The protein contains a region of low-complexity sequence that reflects DNA direct repeats able to function as an origin of phage replication. The region is N-terminal to the low-complexity region.
213PRK088983945579.01.4[                             --------------      ]PRK08898coproporphyrinogen III oxidase; Provisional
214COG17372815078.42.7[                          ------------           ]RpiRDNA-binding transcriptional regulator, MurR/RpiR family, contains HTH and SIS domains
215pfam045521603778.22.1[                               ------------      ]Sigma54_DBDSigma-54, DNA binding domain. This DNA binding domain is based on peptide fragmentation data. This domain is proximal to DNA in the promoter/holoenzyme complex. Furthermore this region contains a putative helix-turn-helix motif. At the C-terminus, there is a highly conserved region known as the RpoN box and is the signature of the sigma-54 proteins.
216COG24692846678.00.56[                          -----------------      ]COG2469Uncharacterized protein, contains HTH domain
217pfam034281773277.83.3[                                --------         ]RP-CReplication protein C N-terminal domain. Replication protein C is involved in the early stages of viral DNA replication.
218PRK115692744577.73.4[                            -----------          ]PRK11569transcriptional repressor IclR; Provisional
219COG38771225777.55.4[                          --------------         ]COG3877Uncharacterized protein, DUF2089 family
220TIGR029851614177.35.2[                           ----------            ]Sig70_bacteroi1RNA polymerase sigma-70 factor, Bacteroides expansion family 1. This group of sigma factors are members of the sigma-70 family (TIGR02937) and are found primarily in the genus Bacteroides. This family appears to have resulted from a lineage-specific expansion as B. thetaiotaomicron VPI-5482, Bacteroides forsythus ATCC 43037, Bacteroides fragilis YCH46 and Bacteroides fragilis NCTC 9343 contain 25, 12, 24 and 23 members, respectively. There are currentlyonly two known members of this family outside of the Bacteroides, in Rhodopseudomonas and Bradyrhizobium.
221pfam136602288577.13.6[       ------------------------------            ]DUF4147Domain of unknown function (DUF4147). This domain is frequently found at the N-terminus of proteins carrying the glycerate kinase-like domain MOFRL, pfam05161.
222PRK042171104377.03.9[                           ----------            ]PRK04217hypothetical protein; Provisional
223pfam08221624676.98.5[                           -----------           ]HTH_9RNA polymerase III subunit RPC82 helix-turn-helix domain. This family consists of several DNA-directed RNA polymerase III polypeptides which are related to the Saccharomyces cerevisiae RPC82 protein. RNA polymerase C (III) promotes the transcription of tRNA and 5S RNA genes. In Saccharomyces cerevisiae, the enzyme is composed of 15 subunits, ranging from 160 to about 10 kDa. This region is a probably DNA-binding helix-turn-helix.
224TIGR027872514876.83.8[                           ------------          ]codY_GposGTP-sensing transcriptional pleiotropic repressor CodY. This model represents the full length of CodY, a pleiotropic repressor in Bacillus subtilis and other Firmicutes (low-GC Gram-positive bacteria) that responds to intracellular levels of GTP and branched chain amino acids. The C-terminal helix-turn-helix DNA-binding region is modeled by pfam08222 in Pfam.
225pfam01381552576.72.9[                              ------             ]HTH_3Helix-turn-helix. This large family of DNA binding helix-turn helix proteins includes Cro and CI. Within the Neisseria gonorrhoeae phage associated protein NGO0477, the full protein fold incorporates a helix-turn-helix motif, but the function of this member is unlikely to be that of a DNA-binding regulator, the function of most other members, so is not necessarily characteristic of the whole family.
226COG06642143376.64.4[                               --------          ]CrpcAMP-binding domain of CRP or a regulatory subunit of cAMP-dependent protein kinases
227pfam010901406376.61.6[                           ---------------       ]Ribosomal_S19eRibosomal protein S19e.
228TIGR032982926276.62.7[                           ----------------      ]argPtranscriptional regulator, ArgP family. ArgP used to be known as IciA. ArgP is a positive regulator of argK. It is a negative autoregulator in presence of arginine. It competes with DnaA for oriC iteron (13-mer) binding. It activates dnaA and nrd transcription. It has been demonstrated to be part of the pho regulon (). ArgP mutants convey canavanine (an L-arginine structural homolog) sensitivity.
229pfam10668603276.53.8[                            --------             ]Phage_terminasePhage terminase small subunit. This family of small highly conserved proteins come from a subset of Firmicute species. Its putative function is as a phage terminase small subunit.
230TIGR033382123476.52.3[                                --------         ]phnR_burkphosphonate utilization associated transcriptional regulator. This family of proteins are members of the GntR family (pfam00392) containing an N-terminal helix-turn-helix (HTH) motif. This clade is found adjacent to or inside of operons for the degradation of 2-aminoethylphosphonate (AEP) in Polaromonas, Burkholderia, Ralstonia and Verminephrobacter.
231cd047811202776.32.7[                               --------          ]HTH_MerR-like_sg6Helix-Turn-Helix DNA binding domain of putative transcription regulators from the MerR superfamily. Putative helix-turn-helix (HTH) MerR-like transcription regulators (subgroup 6) with at least two conserved cysteines present in the C-terminal portion of the protein. Based on sequence similarity, these proteins are predicted to function as transcription regulators that mediate responses to stress in eubacteria. They belong to the MerR superfamily of transcription regulators that promote transcription of various stress regulons by reconfiguring the operator sequence located between the -35 and -10 promoter elements. A typical MerR regulator is comprised of two distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, and organic substrates.
232PRK101411176176.03.1[                          ---------------        ]PRK10141DNA-binding transcriptional repressor ArsR; Provisional
233pfam00440472775.62.8[                               ------            ]TetR_NBacterial regulatory proteins, tetR family.
234PRK133474536775.63.4[                          -----------------      ]PRK13347coproporphyrinogen III oxidase; Provisional
235pfam14338922974.91.2[                                    -------      ]Mrr_NMrr N-terminal domain. This domain is found at the N-terminus of the Mrr restriction endonuclease catalytic domain, pfam04471. Fold recognition analysis predicts that it is a diverged member of the winged helix variant of helix turn helix proteins. It may play a role in DNA sequence recognition.
236pfam02295614674.912[                             -----------         ]z-alphaAdenosine deaminase z-alpha domain. This family consists of the N-terminus and thus the z-alpha domain of double-stranded RNA-specific adenosine deaminase (ADAR), an RNA- editing enzyme. The z-alpha domain is a Z-DNA binding domain, and binding of this region to B-DNA has been shown to be disfavoured by steric hindrance.
237PRK068111892474.63.1[                               ------            ]PRK06811RNA polymerase factor sigma-70; Validated
238TIGR023252383274.63.6[                                --------         ]C_P_lyase_phnFphosphonates metabolism transcriptional regulator PhnF. All members of the seed alignment for this family are predicted helix-turn-helix transcriptional regulatory proteins of the broader gntR and are found associated with genes for the import and degradation of phosphonates and/or related compounds (e.g. phosphonites) with a direct C-P bond.
239COG45662024774.16.6[                          ------------           ]FixJTwo-component response regulator, FixJ family, consists of REC and HTH domains
240PRK09334864774.05.8[                             -----------         ]PRK0933430S ribosomal protein S25e; Provisional
241TIGR029991834374.06.1[                          -----------            ]Sig-70_X6RNA polymerase sigma factor, TIGR02999 family. This group of sigma factors are members of the sigma-70 family (TIGR02937) and are found in a variety of species including Rhodopirellula baltica which encodes a paralogous group of five.
242PRK001181044573.96.5[                          -----------            ]PRK00118putative DNA-binding protein; Validated
243PRK125151897473.77.1[                  ------------------             ]PRK12515RNA polymerase sigma factor; Provisional
244PRK125281612773.73.3[                              -------            ]PRK12528RNA polymerase sigma factor; Provisional
245PRK094923152673.51.8[                              ------             ]treRtrehalose repressor; Provisional
246pfam02650854573.511[                           -----------           ]HTH_WhiAWhiA C-terminal HTH domain. This domain is found at the C-terminus of the sporulation regulator WhiA. It is predicted to form a DNA-binding helix-turn-helix structure. The WhiA protein also contains two N-terminal domains that are distant homologues of LAGLIDADG homing endonucleases.
247COG25221193073.34.2[                             -------             ]COG2522Predicted transcriptional regulator
248COG23442113973.35.6[                            ----------           ]RexNADH/NAD ratio-sensing transcriptional regulator Rex
249COG33731087773.10.85[                          ---------------------  ]COG3373Predicted transcriptional regulator, contains HTH domain
250pfam032971054673.05[                             -----------         ]Ribosomal_S25S25 ribosomal protein.
251COG13092012772.83.1[                               ------            ]AcrRDNA-binding transcriptional regulator, AcrR family
252COG27391054572.65.7[                          -----------            ]YlxMPredicted DNA-binding protein YlxM, UPF0122 family
253TIGR029521704772.56.1[                          ------------           ]Sig70_famx2RNA polymerase sigma-70 factor, TIGR02952 family. This group of sigma factors are members of the sigma-70 family (TIGR02937). They and appear by homology, tree building, bidirectional best hits and one-to-a-genome distribution, to represent a conserved family. This family is found in a limited number of Gram-positive bacterial lineages.
254COG44652614872.55[                           ------------          ]CodYGTP-sensing pleiotropic transcriptional regulator CodY
255pfam00046573672.55.7[                           --------              ]HomeoboxHomeobox domain.
256PRK01381993672.52.4[                            ---------            ]PRK01381Trp operon repressor; Provisional
257pfam13411692972.24.1[                               --------          ]MerR_1MerR HTH family regulatory protein.
258pfam13556563171.98.2[                              --------           ]HTH_30PucR C-terminal helix-turn-helix domain. This helix-turn-helix domain is often found at the C-terminus of PucR-like transcriptional regulators and is likely to be DNA-binding.
259pfam138141915171.94.8[                              -------------      ]Replic_RelaxReplication-relaxation. This family includes proteins which are essential for plasmid replication and plasmid DNA relaxation.
260PRK056603785071.93.8[                              -------------      ]PRK05660HemN family oxidoreductase; Provisional
261COG11912474271.48.2[                           ----------            ]FliADNA-directed RNA polymerase specialized sigma subunit
262COG47541575871.18.5[                            ---------------      ]COG4754Uncharacterized protein
263TIGR020637095071.08[                              -------------      ]RNase_Rribonuclease R. This family consists of an exoribonuclease, ribonuclease R, also called VacB. It is one of the eight exoribonucleases reported in E. coli and is broadly distributed throughout the bacteria. In E. coli, double mutants of this protein and polynucleotide phosphorylase are not viable. Scoring between trusted and noise cutoffs to the model are shorter, divergent forms from the Chlamydiae, and divergent forms from the Campylobacterales (including Helicobacter pylori) and Leptospira interrogans.
264pfam01498723370.58.2[                             --------            ]HTH_Tnp_Tc3_2Transposase. Transposase proteins are necessary for efficient DNA transposition. This family includes the amino-terminal region of Tc1, Tc1A, Tc1B and Tc2B transposases of C.elegans. The region encompasses the specific DNA binding and second DNA recognition domains as well as an amino-terminal region of the catalytic domain of Tc3 as described in. Tc3 is a member of the Tc1/mariner family of transposable elements.
265cd00086593869.712[                          ----------             ]homeodomainHomeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
266PRK054722134068.97.3[                            ----------           ]PRK05472redox-sensing transcriptional repressor Rex; Provisional
267pfam12964964368.83.3[                              -------------      ]DUF3853Protein of unknown function (DUF3853). A family of uncharacterized proteins found by clustering human gut metagenomic sequences.
268pfam12116823468.58[                            --------             ]SpoIIIDStage III sporulation protein D. This stage III sporulation protein is a small DNA-binding family that is essential for gene expression of the mother-cell compartment during sporulation. The domain is found in bacteria and viruses, and is about 40 amino acids in length. It has a conserved RGG sequence motif.
269pfam13693783268.56.2[                             --------            ]HTH_35Winged helix-turn-helix DNA-binding.
270COG07583504968.58.6[                            ------------         ]SmfPredicted Rossmann fold nucleotide-binding protein DprA/Smf involved in DNA uptake
271pfam04760522968.46.1[                               -------           ]IF2_NTranslation initiation factor IF-2, N-terminal region. This conserved feature at the N-terminus of bacterial translation initiation factor IF2 has recently had its structure solved. It shows structural similarity to the tRNA anticodon Stem Contact Fold domains of the methionyl-tRNA and glutaminyl-tRNA synthetases, and a similar fold is also found in the B5 domain of the phenylalanine-tRNA synthetase.
272COG11674593268.05.4[                               --------          ]ARO8DNA-binding transcriptional regulator, MocR family, contains an aminotransferase domain
273TIGR022772804167.83.8[                                ----------       ]PaaX_trns_regphenylacetic acid degradation operon negative regulatory protein PaaX. This transcriptional regulator is always found in association with operons believed to be involved in the degradation of phenylacetic acid. The gene product has been shown to bind to the promoter sites and repress their transcription.
274PRK110133095867.78[                           ---------------       ]PRK11013DNA-binding transcriptional regulator LysR; Provisional
275pfam040791584867.712[                              ---------------    ]DUF387Putative transcriptional regulators (Ypuh-like). This family of conserved bacterial proteins are thought to possibly be helix-turn-helix type transcriptional regulators.
276pfam01316704467.112[                              ------------       ]Arg_repressorArginine repressor, DNA binding domain.
277TIGR023954293666.94.2[                               ------------      ]rpoN_sigmaRNA polymerase sigma-54 factor. A sigma factor is a DNA-binding protein protein that binds to the DNA-directed RNA polymerase core to produce the holoenzyme capable of initiating transcription at specific sites. Different sigma factors act in vegetative growth, heat shock, extracytoplasmic functions (ECF), etc. This model represents the clade of sigma factors called sigma-54, or RpoN (unrelated to sigma 70-type factors such as RpoD/SigA). RpoN is responsible for enhancer-dependent transcription, and its presence characteristically is associated with varied panels of activators, most of which are enhancer-binding proteins (but see Brahmachary, et al., ). RpoN may be responsible for transcription of nitrogen fixation genes, flagellins, pilins, etc., and synonyms for the gene symbol rpoN, such as ntrA, reflect these observations
278TIGR022212188166.815[--------------------                             ]cas_TM1812CRISPR-associated protein, TM1812 family. CRISPR is a term for Clustered Regularly Interspaced Short Palidromic Repeats. A number of protein families appear only in association with these repeats and are designated Cas (CRISPR associated) proteins. This family, represented by TM1812 of Thermotoga maritima, is found also in Vibrio vulnificus YJ016, Nitrosomonas europaea ATCC 19718, a large plasmid of Synechocystis sp. PCC 6803, and Fibrobacter succinogenes S85.
279pfam08535932566.77.1[                               ------            ]KorBKorB domain. This family consists of several KorB transcriptional repressor proteins. The korB gene is a major regulatory element in the replication and maintenance of broad host-range plasmid RK2. It negatively controls the replication gene trfA, the host-lethal determinants kilA and kilB, and the korA-korB operon. This domain includes the DNA-binding HTH motif.
280pfam09397674766.419[                             -----------         ]Ftsk_gammaFtsk gamma domain. This domain directs oriented DNA translocation and forms a winged helix structure. Mutated proteins with substitutions in the FtsK gamma DNA-recognition helix are impaired in DNA binding.
281PRK096451734666.111[                           -----------           ]PRK09645RNA polymerase sigma factor SigL; Provisional
282TIGR034182916066.09.4[                           ----------------      ]chol_sulf_TFputative choline sulfate-utilization transcription factor. Members of this protein family are transcription factors of the LysR family. Their genes typically are divergently transcribed from choline-sulfatase genes. That enzyme makes choline, a precursor to the osmoprotectant glycine-betaine, available by hydrolysis of choline sulfate.
283PRK094832175865.77.5[                          ----------------       ]PRK09483response regulator; Provisional
284COG3311702665.35.5[                               ------            ]AlpAPredicted DNA-binding transcriptional regulator AlpA
285PRK001351885065.214[                          -------------          ]scpBsegregation and condensation protein B; Reviewed
286COG43441754565.111[                            -----------          ]COG4344Predicted transciptional regulator, contains HTH domain
287TIGR005384556765.16.8[                          -----------------      ]hemNoxygen-independent coproporphyrinogen III oxidase. This model represents HemN, the oxygen-independent coproporphyrinogen III oxidase that replaces HemF function under anaerobic conditions. Several species, including E. coli, Helicobacter pylori, and Aquifex aeolicus, have both a member of this family and a member of another, closely related family for which there is no evidence of coproporphyrinogen III oxidase activity. Members of this family have a perfectly conserved motif PYRT
288PRK115121446365.07.9[                           ---------------       ]PRK11512DNA-binding transcriptional repressor MarR; Provisional
289COG37114913665.015[                             --------            ]BglGTranscriptional antiterminator
290PRK0414031710164.87.7[        ----------------------------             ]PRK04140hypothetical protein; Provisional
291cd06571902964.88.8[                              -------            ]Bac_DnaA_CC-terminal domain of bacterial DnaA proteins. The DNA-binding C-terminal domain of DnaA contains a helix-turn-helix motif that specifically interacts with the DnaA box, a 9-mer motif that occurs repetitively in the replication origin oriC. Multiple copies of DnaA, which is an ATPase, bind to 9-mers at the origin and form an initial complex in which the DNA strands are being separated in an ATP-dependent step.
292TIGR020363025164.710[                              -------------      ]dsdCD-serine deaminase transcriptional activator. This family, part of the LysR family of transcriptional regulators, activates transcription of the gene for D-serine deaminase, dsdA. Trusted members of this family so far are found adjacent to dsdA and only in Gammaproteobacteria, including E. coli, Vibrio cholerae, and Colwellia psychrerythraea.
293pfam042971014664.611[                          ------------           ]UPF0122Putative helix-turn-helix protein, YlxM / p13 like. Members of this family are predicted to contain a helix-turn-helix motif, for example residues 37-55 in Mycoplasma mycoides p13. Genes encoding family members are often part of operons that encode components of the SRP pathway, and this protein may regulate the expression of an operon related to the SRP pathway.
294PRK09726883564.612[                           ---------             ]PRK09726antitoxin HipB; Provisional
295cd14322392264.44.1[-----                                            ]UBA_LATSUBA domain found in serine/threonine-protein kinase LATS and similar proteins. The LATS proteins family consists of two isoforms, LATS1 and LATS2, both of which are mammalian homologs of the Drosophila tumor suppressor gene lats/warts. LATS1, also called large tumor suppressor homolog 1, or WARTS protein kinase (warts), is a serine/threonine-protein kinase that highly conserved from fly to human. LATS2, also called kinase phosphorylated during mitosis protein, or large tumor suppressor homolog 2, or serine/threonine-protein kinase KPM, or Warts-like kinase, inhibits the G1/S transition and is essential for embryonic development, proliferation control and genomic integrity. LATS proteins contain an N-terminal ubiquitin-associated (UBA) domain and a C-terminal protein kinase domain.
296TIGR024042333563.78.7[                               ---------         ]trehalos_R_Bsubtrehalose operon repressor, B. subtilis-type. This family consists of repressors of the GntR family typically associated with trehalose utilization operons. Trehalose is imported as trehalose-6-phosphate and then hydrolyzed by alpha,alpha-phosphotrehalase to glucose and glucose-6-P. This family includes repressors mostly from Gram-positive lineages and does not include the TreR from E. coli.
297pfam051583155663.610[                          --------------         ]RNA_pol_Rpc34RNA polymerase Rpc34 subunit. Subunit specific to RNA Pol III, the tRNA specific polymerase. The C34 subunit of yeast RNA Pol III is part of a subcomplex of three subunits which have no counterpart in the other two nuclear RNA polymerases. This subunit interacts with TFIIIB70 and is therefore participates in Pol III recruitment.
298TIGR0018018710163.58.4[           --------------------------            ]parB_partParB/RepB/Spo0J family partition protein. This model represents the most well-conserved core of a set of chromosomal and plasmid partition proteins related to ParB, including Spo0J, RepB, and SopB. Spo0J has been shown to bind a specific DNA sequence that, when introduced into a plasmid, can serve as partition site. Study of RepB, which has nicking-closing activity, suggests that it forms a transient protein-DNA covalent intermediate during the strand transfer reaction.
299PRK100863116663.013[                           ----------------      ]PRK10086DNA-binding transcriptional regulator DsdC; Provisional
300cd011091133262.97.8[                               ---------         ]HTH_YyaNHelix-Turn-Helix DNA binding domain of the MerR-like transcription regulators YyaN and YraB. Putative helix-turn-helix (HTH) MerR-like transcription regulators of Bacillus subtilis, YyaN and YraB, and related proteins; N-terminal domain. Based on sequence similarity, these proteins are predicted to function as transcription regulators that mediate responses to stress in eubacteria. They belong to the MerR superfamily of transcription regulators that promote transcription of various stress regulons by reconfiguring the operator sequence located between the -35 and -10 promoter elements. A typical MerR regulator is comprised of distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, and organic substrates.
301pfam14090705262.616[                            --------------       ]HTH_39Helix-turn-helix domain. This helix-turn-helix domain is often found in phage proteins and is likely to be DNA-binding.
302PRK104112404362.415[                            ----------           ]PRK10411DNA-binding transcriptional activator FucR; Provisional
303PRK039112603561.512[                              ---------          ]PRK03911heat-inducible transcription repressor; Provisional
304pfam053311144761.19[                           -----------           ]DUF742Protein of unknown function (DUF742). This family consists of several uncharacterized Streptomyces proteins as well as one from Mycobacterium tuberculosis. The function of these proteins is unknown.
305TIGR045481785160.715[                          ------------           ]DnaD_MollicutesDnaD family protein, Mollicutes type. This model describes the full length of a family of proteins in the Mollicutes (Mycoplasma, Spiroplasma, Mesoplasma, etc.) homologous to the N-terminal region of DnaD from Bacillus subtilis.
306PRK111392975860.515[                            ---------------      ]PRK11139DNA-binding transcriptional activator GcvA; Provisional
307TIGR021472714959.715[                              ------------       ]Fsuc_secondTIGR02147 family protein. This family consists of the 40 members of a paralogous protein family in the rumen anaerobe Fibrobacter succinogenes S85 and a smaller number in Bdellovibrio bacteriovorus HD100. Member proteins are about 270 residues long and appear to lack signal sequences and transmembrane helices. The only perfectly conserved residue is a glycine in an otherwise poorly conserved region, suggesting members are not enzymes. The family is not characterized.
308COG13733986859.511[                            -----------------    ]COG1373Predicted ATPase, AAA+ superfamily
309pfam11427504359.314[                          -----------            ]HTH_Tnp_Tc3_1Tc3 transposase. Tc3 is transposase with a specific DNA-binding domain which contains three alpha-helices, two of which form a helix-turn-helix motif which makes four base-specific contacts with the major groove. The N-terminus makes contacts with the minor groove. There is a base specific recognition between Tc3 and the transposon DNA. The DNA binding domain forms a dimer in which each monomer binds a separate transposon end. This implicates that the dimer has a role in synapsis and is necessary for the simultaneous cleavage of both transposon termini.
310pfam046451812959.211[                              -------            ]DUF603Protein of unknown function, DUF603. This family includes several uncharacterized proteins from Borrelia species.
311pfam05491755459.125[                           -------------         ]RuvB_CHolliday junction DNA helicase ruvB C-terminus. The RuvB protein makes up part of the RuvABC revolvasome which catalyses the resolution of Holliday junctions that arise during genetic recombination and DNA repair. Branch migration is catalysed by the RuvB protein that is targeted to the Holliday junction by the structure specific RuvA protein. This family consists of the C-terminal region of the RuvB protein which is thought to be helicase DNA-binding domain.
312PRK104032154458.913[                          -----------            ]PRK10403transcriptional regulator NarP; Provisional
313TIGR035412324458.515[                          -----------            ]reg_near_HchALuxR family transcriptional regulatory, chaperone HchA-associated. Members of this protein family belong to the LuxR transcriptional regulator family, and contain both autoinducer binding (pfam03472) and transcriptional regulator (pfam00196) domains. Members, however, occur only in a few members of the Gammaproteobacteria that have the chaperone/aminopeptidase HchA, and are always encoded by the adjacent gene.
314TIGR01321943658.512[                            ---------            ]TrpRtrp operon repressor, proteobacterial. This model represents TrpR, the repressor of the trp operon. It is found so far only in the gamma subdivision of the proteobacteria and in Chlamydia trachomatis. All members belong to species capable of tryptophan biosynthesis.
315pfam077501624358.013[                           -----------           ]GcrAGcrA cell cycle regulator. GcrA is a master cell cycle regulator that, together with CtrA (see pfam00072 and pfam00486), is involved in controlling cell cycle progression and asymmetric polar morphogenesis. During this process, there are temporal and spatial variations in the concentrations of GcrA and CtrA. The variation in concentration produces time and space dependent transcriptional regulation of modular functions that implement cell-cycle processes. More specifically, GcrA acts as an activator of components of the replisome and the segregation machinery.
316PRK103413125757.67.1[                            ---------------      ]PRK10341DNA-binding transcriptional activator TdcA; Provisional
317PRK126793166157.46[                            ---------------      ]cbltranscriptional regulator Cbl; Reviewed
318PRK138244042457.37.2[                                 -----           ]PRK13824replication initiation protein RepC; Provisional
319PRK041582564856.87.6[                           ------------          ]PRK04158transcriptional repressor CodY; Validated
320pfam06056583356.812[                              --------           ]Terminase_5Putative ATPase subunit of terminase (gpP-like). This family of proteins are annotated as ATPase subunits of phage terminase after. Terminases are viral proteins that are involved in packaging viral DNA into the capsid.
321pfam097561895056.320[                             --------------      ]DDRGKDDRGK domain. This is a family of proteins of approximately 300 residues, found in plants and vertebrates. They contain a highly conserved DDRGK motif.
322pfam090023794955.917[        ------------                             ]DUF1887Domain of unknown function (DUF1887). This domain is found in a set of hypothetical bacterial proteins.
323TIGR027163066055.815[                            ---------------      ]C20_methyl_CrtFC-20 methyltransferase BchU. Members of this protein family are the S-adenosylmethionine-depenedent C-20 methyltransferase BchU, part of the pathway of bacteriochlorophyll c production in photosynthetic green sulfur bacteria. The position modified by this enzyme represents the difference between bacteriochlorophylls c and d; strains lacking this protein can only produced bacteriochlorophyll d.
324cd097473785455.735[        ---------------                          ]Csx1_III-UCRISPR/Cas system-associated protein Csx1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; Some proteins could have an additional fusion with RecB-family nuclease domain; Core domain appears to have a Rossmann-like fold; loosely associated with CRISPR/Cas systems; also known as Cas02710 family
325pfam07381905955.28.3[                            ---------------      ]DUF1495Winged helix DNA-binding domain (DUF1495). This family consists of several hypothetical archaeal proteins of around 110 residues in length. The structure of this domain possesses a winged helix DNA-binding domain suggesting these proteins are bacterial transcription factors.
326COG12028303055.18.6[                                   --------      ]COG1202Superfamily II helicase, archaea-specific
327pfam00486774455.033[                           ----------            ]Trans_reg_CTranscriptional regulatory protein, C terminal.
328pfam00165422655.021[                              -------            ]HTH_AraCBacterial regulatory helix-turn-helix proteins, AraC family. In the absence of arabinose, the N-terminal arm of AraC binds to the DNA binding domain (pfam00165) and helps to hold the two DNA binding domains in a relative orientation that favours DNA looping. In the presence of arabinose, the arms bind over the arabinose on the dimerization domain, thus freeing the DNA-binding domains. The freed DNA-binding domains are then able to assume a conformation suitable for binding to the adjacent DNA sites that are utilized when AraC activates transcription, and hence AraC ceases looping the DNA when arabinose is added.
329TIGR024053112255.09.3[                               -----             ]trehalos_R_Ecoltrehalose operon repressor, proteobacterial. This family consists of repressors of the LacI family typically associated with trehalose utilization operons. Trehalose is imported as trehalose-6-phosphate and then hydrolyzed by alpha,alpha-phosphotrehalase to glucose and glucose-6-P. This family includes repressors mostly from Gammaproteobacteria and does not include the GntR family TreR of Bacillus subtilis
330cd00383954554.629[                          -----------            ]trans_reg_CEffector domain of response regulator. Bacteria and certain eukaryotes like protozoa and higher plants use two-component signal transduction systems to detect and respond to changes in the environment. The system consists of a sensor histidine kinase and a response regulator. The former autophosphorylates in a histidine residue on detecting an external stimulus. The phosphate is then transferred to an invariant aspartate residue in a highly conserved receiver domain of the response regulator. Phosphorylation activates a variable effector domain of the response regulator, which triggers the cellular response. The C-terminal effector domain contains DNA and RNA polymerase binding sites. Several dimers or monomers bind head to tail to small tandem repeats upstream of the genes. The RNA polymerase binding sites interact with the alpha or sigma subunite of RNA polymerase.
331pfam142771625554.48.4[                          -----------------      ]DUF4364Domain of unknown function (DUF4364). This family of proteins is found in bacteria and archaea. Proteins in this family are approximately 180 amino acids in length.
332COG23794228854.418[       ---------------------------------         ]GckAGlycerate-2-kinase
333pfam11994724754.235[                           -----------           ]DUF3489Protein of unknown function (DUF3489). This family of proteins is functionally uncharacterized. This protein is found in bacteria. Proteins in this family are typically between 84 to 211 amino acids in length. This protein has a single completely conserved residue W that may be functionally important.
334PRK059324553754.29[                               ------------      ]PRK05932RNA polymerase factor sigma-54; Reviewed
335cd096682147754.180[ -------------------                             ]Csx1_III-UCRISPR/Cas system-associated protein Csx1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; Some proteins could have an additional fusion with RecB-family nuclease domain; Core domain appears to have a Rossmann-like fold; loosely associated with CRISPR/Cas systems; also known as TM1812 family
336TIGR027191383753.25.2[                                 ---------       ]repress_PhaQpoly-beta-hydroxybutyrate-responsive repressor. Members of this family are transcriptional regulatory proteins found in the vicinity of poly-beta-hydroxybutyrate (PHB) operons in several species of Bacillus. This protein appears to have repressor activity modulated by PHB itself. This protein belongs to the larger PadR family (see pfam03551).
337pfam12844642952.920[                             -------             ]HTH_19Helix-turn-helix domain. Members of this family contains a DNA-binding helix-turn-helix domain.
338pfam05930512252.615[                               -----             ]Phage_AlpAProphage CP4-57 regulatory protein (AlpA). This family consists of several short bacterial and phage proteins which are related to the Escherichia coli protein AlpA. AlpA suppress two phenotypes of a delta lon protease mutant, overproduction of capsular polysaccharide and sensitivity to UV light. Several of the sequences in this family are thought to be DNA-binding proteins.
339PRK094801943452.517[                            ---------            ]slmAdivision inhibitor protein; Provisional
340TIGR01610954952.426[                          ------------           ]phage_O_Ntermphage replication protein O, N-terminal domain. This model represents the N-terminal region of the phage lambda replication protein O and homologous regions of other phage proteins.
341COG3423823252.218[                             --------            ]SfsBPredicted transcriptional regulator, lambda repressor-like DNA-binding domain
342PRK085582383152.215[                             -------             ]PRK08558adenine phosphoribosyltransferase; Provisional
343TIGR014532142351.512[                               ------            ]grpIintron_endogroup I intron endonuclease. This model represents one subfamily of endonucleases containing the endo/excinuclease amino terminal domain, pfam01541 at its amino end. A distinct subfamily includes excinuclease abc subunit c (uvrC). Members of pfam01541 are often termed GIY-YIG endonucleases after conserved motifs near the amino end. This subfamily in this model is found in open reading frames of group I introns in both phage and mitochondria. The closely related endonucleases of phage T4: segA, segB, segC, segD and segE, score below the trusted cutoff for the family.
344TIGR033392795751.412[                            ---------------      ]phn_lysRaminoethylphosphonate catabolism associated LysR family transcriptional regulator. This group of sequences represents a number of related clades with numerous examples of members adjacent to operons for the degradation of 2-aminoethylphosphonate (AEP) in Pseudomonas, Ralstonia, Bordetella and Burkholderia species. These are transcriptional regulators of the LysR family which contain a helix-turn-helix (HTH) domain (pfam00126) and a periplasmic substrate-binding protein-like domain (pfam03466).
345COG38295604551.116[                          -----------            ]RocRTranscriptional regulator containing PAS, AAA-type ATPase, and DNA-binding Fis domains
346cd04772993150.917[                               ---------         ]HTH_TioE_rpt1First Helix-Turn-Helix DNA binding domain of the regulatory protein TioE. Putative helix-turn-helix (HTH) regulatory protein, TioE, and related proteins. TioE is part of the thiocoraline gene cluster, which is involved in the biosynthesis of the antitumor thiocoraline from the marine actinomycete, Micromonospora. These proteins share the N-terminal DNA binding domain with other transcription regulators of the MerR superfamily that promote transcription by reconfiguring the spacer between the -35 and -10 promoter elements. Proteins in this family are unique within the MerR superfamily in that they are composed of just two adjacent MerR-like N-terminal domains; this CD contains the N-terminal or first repeat (rpt1) of these tandem MerR-like domain proteins.
347COG22071273250.544[                              --------           ]AraCAraC-type DNA-binding domain and AraC-containing proteins
348pfam13560632850.030[                             -------             ]HTH_31Helix-turn-helix domain. This domain is a helix-turn-helix domain that probably binds to DNA.
349TIGR042852558749.728[           --------------------------            ]nucleoid_nocnucleoid occlusion protein. This model describes nucleoid occlusion protein, a close homolog to ParB chromosome partitioning proteins including Spo0J in Bacillus subtilis. Its gene often is located near the gene for the Spo0J ortholog. This protein bind a specific DNA sequence and blocks cytokinesis from happening until chromosome segregation is complete.
350pfam06970762449.713[                               ------            ]RepA_NReplication initiator protein A (RepA) N-terminus. This of family of predicted proteins represents the N-terminus (approximately 80 residues) of replication initiator protein A (RepA), a DNA replication initiator in plasmids. Most proteins in this family are bacterial, but archaeal and eukaryotic members are also included.
351cd047731082649.717[                               -------           ]HTH_TioE_rpt2Second Helix-Turn-Helix DNA binding domain of the regulatory protein TioE. Putative helix-turn-helix (HTH) regulatory protein, TioE, and related proteins. TioE is part of the thiocoraline gene cluster, which is involved in the biosynthesis of the antitumor thiocoraline from the marine actinomycete, Micromonospora. These proteins share the N-terminal DNA binding domain with other transcription regulators of the MerR superfamily that promote transcription by reconfiguring the spacer between the -35 and -10 promoter elements. Proteins in this family are unique within the MerR superfamily in that they are composed of just two adjacent MerR-like N-terminal domains; this CD mainly contains the C-terminal or second repeat (rpt2) of these tandem MerR-like domain proteins.
352PRK126823096149.214[                            ---------------      ]PRK12682transcriptional regulator CysB-like protein; Reviewed
353pfam098211204349.215[                                -----------      ]AAA_assoc_CC-terminal AAA-associated domain. This had been thought to be an ATPase domain of ABC-transporter proteins. However, only one member has any trans-membrane regions. It is associated with an upstream ATP-binding cassette family, pfam00005.
354cd047761184449.03.7[                               ------------      ]HTH_GnyRHelix-Turn-Helix DNA binding domain of the regulatory protein GnyR. Putative helix-turn-helix (HTH) regulatory protein, GnyR, and other related proteins. GnyR belongs to the gnyRDBHAL cluster, which is involved in acyclic isoprenoid degradation in Pseudomonas aeruginosa. These proteins share the N-terminal DNA binding domain with other transcription regulators of the MerR superfamily that promote transcription by reconfiguring the spacer between the -35 and -10 promoter elements. A typical MerR regulator is comprised of distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their conserved N-terminal domains contain predicted winged HTH motifs that mediate DNA binding, while the dissimilar C-terminal domains bind specific coactivator molecules.
355pfam0967037910648.254[         ----------------------------            ]Cas_Cas02710CRISPR-associated protein (Cas_Cas02710). Members of this family are found, exclusively in the vicinity of CRISPR repeats and other CRISPR-associated (cas) genes, in Methanothermobacter thermautotrophicus (Methanobacterium thermoformicicum), Thermus thermophilus (Deinococcus-Thermus), Chloroflexus aurantiacus (Chloroflexi), and Thermomicrobium roseum (Thermomicrobia).
356PRK126843136148.215[                            ---------------      ]PRK12684transcriptional regulator CysB-like protein; Reviewed
357COG400627811947.41.1E+02[        --------------------------------         ]COG4006CRISPR/Cas system-associated protein Csm6, COG1517 family
358cd097412194247.148[           ----------                            ]Csx1_III-UCRISPR/Cas system-associated protein Csx1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; Some proteins could have an additional fusion with RecB-family nuclease domain; Core domain appears to have a Rossmann-like fold; loosely associated with CRISPR/Cas systems; also known as NE0113 family
359pfam09904903646.939[                              --------           ]HTH_43Winged helix-turn helix. This family, found in various hypothetical prokaryotic proteins, is a probable winged helix DNA-binding domain.
360cd047861313146.519[                               ---------         ]HTH_MerR-like_sg7Helix-Turn-Helix DNA binding domain of putative transcription regulators from the MerR superfamily. Putative helix-turn-helix (HTH) MerR-like transcription regulators (subgroup 7) with a conserved cysteine present in the C-terminal portion of the protein. Based on sequence similarity, these proteins are predicted to function as transcription regulators that mediate responses to stress in eubacteria. They belong to the MerR superfamily of transcription regulators that promote transcription of various stress regulons by reconfiguring the operator sequence located between the -35 and -10 promoter elements. A typical MerR regulator is comprised of two distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, and organic substrates.
361COG17772174246.224[                           -----------           ]COG1777Predicted transcriptional regulator
362cd032642117546.11E+02[        ----------------------                   ]ABC_drug_resistance_likeABC-type multidrug transport system, ATPase component. The biological function of this family is not well characterized, but display ABC domains similar to members of ABCA subfamily. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins.
363pfam023872793346.14[                               --------          ]IncFII_repAIncFII RepA protein family. This protein is plasmid encoded and found to be essential for plasmid replication.
364PRK15431783745.941[                             ---------           ]PRK15431ferrous iron transport protein FeoC; Provisional
365PRK131821752345.921[                               ------            ]racApolar chromosome segregation protein; Reviewed
366PRK146061884045.716[                           ----------            ]ruvAHolliday junction DNA helicase RuvA; Provisional
367TIGR017141194645.631[                               -----------       ]phage_rep_org_Nphage replisome organizer, putative, N-terminal region. This model represents the N-terminal domain of a small family of phage proteins. The protein contains a region of low-complexity sequence that reflects DNA direct repeats able to function as an origin of phage replication. The region covered by this model is N-terminal to the low-complexity region.
368pfam094553724345.532[          ----------                             ]Cas_DxTHGCRISPR-associated (Cas) DxTHG family. CRISPR is a term for Clustered Regularly Interspaced Short Palidromic Repeats. A number of protein families appear only in association with these repeats and are designated Cas (CRISPR associated) proteins. The family describes Cas proteins of about 400 residues that include the motif
369PRK125191943845.538[                           ---------             ]PRK12519RNA polymerase sigma factor; Provisional
370TIGR030202477445.336[                  -------------------            ]EpsAtranscriptional regulator EpsA. Proteins in this family include a C-terminal LuxR transcriptional regulator domain (pfam00196). These proteins are positioned proximal to either EpsH-containing exopolysaccharide biosynthesis operons of the Methylobacillus type, or the associated PEP-CTERM-containing genes.
371pfam099291185845.235[                           ---------------       ]DUF2161Uncharacterized conserved protein (DUF2161). This domain, found in various hypothetical prokaryotic proteins, has no known function.
372COG29731033645.223[                            ---------            ]TrpRTrp operon repressor
373PRK004113949545.119[                ------------------------         ]cdc6cell division control protein 6; Reviewed
374cd01104681944.120[                               -----             ]HTH_MlrA-CarAHelix-Turn-Helix DNA binding domain of the transcription regulators MlrA and CarA. Helix-turn-helix (HTH) transcription regulator MlrA (merR-like regulator A), N-terminal domain. The MlrA protein, also known as YehV, has been shown to control cell-cell aggregation by co-regulating the expression of curli and extracellular matrix production in Escherichia coli and Salmonella typhimurium. Its close homolog, CarA from Myxococcus xanthus, is involved in activation of the carotenoid biosynthesis genes by light. These proteins belong to the MerR superfamily of transcription regulators that promote expression of several stress regulon genes by reconfiguring the spacer between the -35 and -10 promoter elements. Their conserved N-terminal domains contain predicted HTH motifs that mediate DNA binding, while the dissimilar C-terminal domains bind specific coactivator molecules. Many MlrA- and CarA-like proteins in this group appear to lack the long dimerization helix seen in the N-terminal domains of typical MerR-like proteins.
375COG19742013144.122[                               --------          ]LexASOS-response transcriptional repressor LexA (RecA-mediated autopeptidase)
376TIGR040943834443.920[                           -----------           ]adjacent_YSIRKYSIRK-targeted surface antigen transcriptional regulator. Bacteria whose genomes encode only one protein with the YSIRK variant form of signal peptide (TIGR01168) were examined for conserved genes near that one tagged protein. This protein is found adjacent to at various classes of repetitive or low-complexity YSIRK proteins (whether unique in genome or not), in a range of species (Enterococcus faecalis X98, Ruminococcus torques, Coprobacillus sp. D7, Lysinibacillus fusiformis ZC1, Streptococcus equi subsp. equi 4047, etc). The affliated YSIRK proteins include Streptococcal protective antigen (see ) and proteins with the Rib/alpha/Esp surface antigen repeat (see TIGR02331). The last quarter of this protein has an AraC family helix-turn-helix (HTH)transcriptional regulator domain.
377PRK041724896843.644[                        -----------------        ]pheSphenylalanyl-tRNA synthetase subunit alpha; Provisional
378PRK074082563843.218[                           ----------            ]PRK07408RNA polymerase sigma factor SigF; Reviewed
379TIGR028462272243.224[                               -----             ]spore_sigmaKRNA polymerase sigma-K factor. The sporulation-specific transcription factor sigma-K (also called sigma-27) is expressed in the mother cell compartment of endospore-forming bacteria such as Bacillus subtilis. Like its close homolog sigma-E (sigma-29) (see TIGR02835), also specific to the mother cell compartment, it must be activated by a proteolytic cleavage. Note that in Bacillus subtilis (and apparently also Clostridium tetani), but not in other endospore forming species such as Bacillus anthracis, the sigK gene is generated by a non-germline (mother cell only) chromosomal rearrangement that recombines coding regions for the N-terminal and C-terminal regions of sigma-K.
380PRK119201534943.217[                            ------------         ]rirAiron-responsive transcriptional regulator; Reviewed
381PRK069301702742.835[                               ------            ]PRK06930positive control sigma-like factor; Validated
382PRK058032332242.324[                               -----             ]PRK05803sporulation sigma factor SigK; Reviewed
383TIGR02844803442.136[                            --------             ]spore_III_Dsporulation transcriptional regulator SpoIIID. Members of this protein are the transcriptional regulator SpoIIID, or stage III sporulation protein D. It is present in genomes if and only if the species is capable of endospore formation as occurs in the model species Bacillus subtilis. SpoIIID is a DNA binding protein that, in B. subtilis, downregulates many genes but also turns on ten genes.
384pfam136351546942.129[                            -----------------    ]DUF4143Domain of unknown function (DUF4143). This domain is almost always found C-terminal to an ATPase core family.
385pfam127931153241.930[                               --------          ]SgrR_NSugar transport-related sRNA regulator N-term. Small, non-coding RNA molecules play important regulatory roles in a variety of physiological processes in bacteria. SgrR_N is the N-terminus of a family of proteins which regulate the transcription of these sRNAs, in particular SgrS. SgrR_N contains a helix-turn-helix motif characteristic of winged-helix DNA-binding transcriptional regulators. SgrS is a small RNA required for recovery from glucose-phosphate stress in bacteria. In examining the regulation of sgrR expression it was found that SgrR negatively auto-regulates its own transcription in the presence and absence of stress, and thus SgrR coordinates the response to glucose-phosphate stress by binding specifically to sgrS promoter DNA.
386COG55661372541.826[                              -------            ]COG5566Transcriptional regulator, Middle operon regulator (Mor) family
387PRK102163194541.434[                          -----------            ]PRK10216DNA-binding transcriptional regulator YidZ; Provisional
388cd14400391441.318[                                   ----          ]UBA_Gts1p_likeUBA domain found in Saccharomyces cerevisiae protein GTS1 (Gts1p) and similar proteins. Gts1p, also called protein LSR1, is encoded by a pleiotropic gene GTS1 in budding yeast. The formation of Gts1p-mediated protein aggregates may induce reactive oxygen species (ROS) production and apoptosis. Gts1p also plays an important role in the regulation of heat and other stress responses under glucose-limited or -depleted conditions in either batch or continuous culture. Gts1p contains an N-terminal zinc finger motif similar to that of GATA-transcription factors, a ubiquitin-associated (UBA) domain and a C-terminal glutamine-rich strand. The zinc finger is responsible for the binding to the glycolytic enzyme glyceraldehydes-3-phosphate dehydrogenase (GAPDH) which is required for the maintenance of the metabolic oscillations of budding yeast. The polyglutamine sequence is indispensable for the pleiotropy and nuclear localization of Gts1p. It is essential for the transcriptional activation, whereas Gts1p lacks DNA binding activity.
389pfam096232252741.367[              -------                            ]Cas_NE0113CRISPR-associated protein NE0113 (Cas_NE0113). Members of this minor CRISPR-associated (Cas) protein family are encoded in cas gene clusters in Vibrio vulnificus YJ016, Nitrosomonas europaea ATCC 19718, Mannheimia succiniciproducens MBEL55E, and Verrucomicrobium spinosum.
390PRK035731444841.228[                              ------------       ]PRK03573transcriptional regulator SlyA; Provisional
391cd14435874041.152[                                ----------       ]SPO1_TF1_likeBacteriophage SPO1-encoded TF1 binds and bends DNA. This group contains proteins related to bacillus phage SPO1-encoded transcription factor 1 (TF1), a type II DNA-binding protein related to the DNA sequence specific (IHF) and non-specific (HU) domains. Type II DNA-binding proteins bind and bend DNA as dimers. Like IHF, TF1 binds DNA specifically and bends DNA sharply. Bacteriophage SPO1-encoded TF1 recognizes SPO1 phage DNA containing 5-(hydroxymethyl)-2'-deoxyuridine as opposed to thymine, Related family members includes integration host factor (IHF) and HU, also called type II DNA-binding proteins (DNABII), which are small dimeric proteins that specifically bind the DNA minor groove, inducing large bends in the DNA and serving as architectural factors in a variety of cellular processes such as recombination, initiation of replication/transcription and gene regulation. IHF binds DNA in a sequence specific manner while HU displays little or no sequence preference. IHF homologs are usually heterodimers, while HU homologs are typically homodimers (except HU heterodimers from E. coli and other enterobacteria). HU is highly basic and contributes to chromosomal compaction and maintenance of negative supercoiling, thus often referred to as histone-like protein. IHF is an essential cofactor in phage lambda site-specific recombination, having an architectural role during assembly of specialized nucleoprotein structures (snups).
392PRK108571643841.039[                              ----------         ]PRK10857DNA-binding transcriptional regulator IscR; Provisional
393pfam13565773840.760[                           ---------             ]HTH_32Homeodomain-like domain.
394COG15084443640.622[                               ------------      ]RpoNDNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog
395PRK098022694540.526[                           -----------           ]PRK09802DNA-binding transcriptional regulator AgaR; Provisional
396cd04780952540.429[                               -------           ]HTH_MerR-like_sg5Helix-Turn-Helix DNA binding domain of putative transcription regulators from the MerR superfamily. Putative helix-turn-helix (HTH) MerR-like transcription regulators (subgroup 5), N-terminal domain. Based on sequence similarity, these proteins are predicted to function as transcription regulators that mediate responses to stress in eubacteria. They belong to the MerR superfamily of transcription regulators that promote transcription of various stress regulons by reconfiguring the operator sequence located between the -35 and -10 promoter elements. A typical MerR regulator is comprised of two distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, and organic substrates.
397PRK125131943940.261[                           ---------             ]PRK12513RNA polymerase sigma factor; Provisional
398PRK134132004439.747[                          -----------            ]mpimultiple promoter invertase; Provisional
399pfam02319674339.152[                            -----------          ]E2F_TDPE2F/DP family winged-helix DNA-binding domain. This family contains the transcription factor E2F and its dimerization partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in association with TDP1/2, the heterodimer having increased binding efficiency. The crystal structure of an E2F4-DP2-DNA complex shows that the DNA-binding domains of the E2F and DP proteins both have a fold related to the winged-helix DNA-binding motif. Recognition of the central c/gGCGCg/c sequence of the consensus DNA-binding site is symmetric, and amino acids that contact these bases are conserved among all known E2F and DP proteins.
400TIGR04353734639.074[                           -----------           ]PqqD_rel_XPqqD family protein, HPr-rel-A system. Members of this protein show distant homology to PqqD, and belong to a three-gene cassette that included the HPr kinase related protein family of TIGR04352. The role of the cassette, and of this protein, are unknown.
401cd00591854538.984[                                -----------      ]HU_IHFDNA sequence specific (IHF) and non-specific (HU) domains. This family includes integration host factor (IHF) and HU, also called type II DNA-binding proteins (DNABII), which are small dimeric proteins that specifically bind the DNA minor groove, inducing large bends in the DNA and serving as architectural factors in a variety of cellular processes such as recombination, initiation of replication/transcription and gene regulation. IHF binds DNA in a sequence specific manner while HU displays little or no sequence preference. IHF homologs are usually heterodimers, while HU homologs are typically homodimers (except HU heterodimers from E. coli and other enterobacteria). HU is highly basic and contributes to chromosomal compaction and maintenance of negative supercoiling, thus often referred to as histone-like protein. IHF is an essential cofactor in phage lambda site-specific recombination, having an architectural role during assembly of specialized nucleoprotein structures (snups). Bacillus phage SPO1-encoded transcription factor 1 (TF1) is another related type II DNA-binding protein. Like IHF, TF1 binds DNA specifically and bends DNA sharply.
402PRK112022033738.824[                               ---------         ]PRK11202DNA-binding transcriptional repressor FabR; Provisional
403cd047701233138.726[                               ---------         ]HTH_HMRTRHelix-Turn-Helix DNA binding domain of Heavy Metal Resistance transcription regulators. Helix-turn-helix (HTH) heavy metal resistance transcription regulators (HMRTR): MerR1 (mercury), CueR (copper), CadR (cadmium), PbrR (lead), ZntR (zinc), and other related proteins. These transcription regulators mediate responses to heavy metal stress in eubacteria. They belong to the MerR superfamily of transcription regulators that promote transcription of various stress regulons by reconfiguring the operator sequence located between the -35 and -10 promoter elements. A typical MerR regulator is comprised of two distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, and organic substrates.
404PRK126833096138.720[                            ---------------      ]PRK12683transcriptional regulator CysB-like protein; Reviewed
405COG30931043837.351[                           ---------             ]VapIPlasmid maintenance system antidote protein VapI, contains XRE-type HTH domain
406pfam04539782737.165[                              -------            ]Sigma70_r3Sigma-70 region 3. Region 3 forms a discrete compact three helical domain within the sigma-factor. Region is not normally involved in the recognition of promoter DNA, but as some specific bacterial promoters containing an extended -10 promoter element, residues within region 3 play an important role. Region 3 primarily is involved in binding the core RNA polymerase in the holoenzyme.
407cd08780901836.928[                                  -----          ]Death_TRADDDeath Domain of Tumor Necrosis Factor Receptor 1-Associated Death Domain protein. Death domain (DD) of TRADD (TNF Receptor 1-Associated Death Domain or TNFRSF1A-associated via death domain) protein. TRADD is a central signaling adaptor for TNF-receptor 1 (TNFR1), mediating activation of Nuclear Factor -kappaB (NF-kB) and c-Jun N-terminal kinase (JNK), as well as caspase-dependent apoptosis. It also carries important immunological roles including germinal center formation, DR3-mediated T-cell stimulation, and TNFalpha-mediated inflammatory responses. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including CARD (Caspase activation and recruitment domain), DED (Death Effector Domain), and PYRIN. They serve as adaptors in signaling pathways and can recruit other proteins into signaling complexes.
408cd011081273036.834[                               ---------         ]HTH_CueRHelix-Turn-Helix DNA binding domain of CueR-like transcription regulators. Helix-turn-helix (HTH) transcription regulators CueR and ActP, copper efflux regulators. In Bacillus subtilis, copper induced CueR regulates the copZA operon, preventing copper toxicity. In Rhizobium leguminosarum, ActP controls copper homeostasis; it detects cytoplasmic copper stress and activates transcription in response to increasing copper concentrations. These proteins are comprised of two distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their conserved N-terminal domains contain winged HTH motifs that mediate DNA binding, while the C-terminal domains have two conserved cysteines that define a monovalent copper ion binding site. These proteins share the N-terminal DNA binding domain with other transcription regulators of the MerR superfamily that promote transcription by reconfiguring the spacer between the -35 and -10 promoter elements.
409TIGR020182304236.426[                             ----------          ]his_ut_represhistidine utilization repressor, proteobacterial. This model represents a proteobacterial histidine utilization repressor. It is usually found clustered with the enzymes HutUHIG so that it can regulate its own expression as well. A number of species have several paralogs and may fine-tune the regulation according to levels of degradation intermediates such as urocanate. This family belongs to the larger GntR family of transcriptional regulators.
410COG19612224836.361[                           -----------           ]PinESite-specific DNA recombinase related to the DNA invertase Pin
411TIGR02531873436.269[                            ---------            ]yecD_yerCTrpR-related protein YerC/YecD. This model represents a protein subfamily found mostly in the Firmicutes (Bacillus and allies). This family is similar in sequence to the trp operon repressor TrpR described by TIGR01321, and represents a distinct clade within the broader family described by pfam01371. At least one species, Xylella fastidiosa, in the Proteobacteria, has a member of both this family and TIGR01321. Several genomes with a member of this family do not synthesize tryptophan, and members of this family should not be considered trp operon repressors without new evidence.
412PRK039751413835.850[                          ----------             ]tfxputative transcriptional regulator; Provisional
413PRK094642542835.730[                                -------          ]pdhRtranscriptional regulator PdhR; Reviewed
414PRK101632715235.672[                            ---------------      ]PRK10163DNA-binding transcriptional repressor AllR; Provisional
415cd01105882735.642[                               --------          ]HTH_GlnR-likeHelix-Turn-Helix DNA binding domain of GlnR-like transcription regulators. Helix-turn-helix (HTH) transcription regulator GlnR and related proteins, N-terminal domain. The GlnR and TnrA (also known as ScgR) proteins have been shown to regulate expression of glutamine synthetase as well as several genes involved in nitrogen metabolism. These proteins share the N-terminal DNA binding domain with other transcription regulators of the MerR superfamily that promote transcription by reconfiguring the spacer between the -35 and -10 promoter elements. A typical MerR regulator is comprised of two distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their conserved N-terminal domains contain predicted winged HTH motifs that mediate DNA binding, while the dissimilar C-terminal domains bind specific coactivator molecules.
416pfam09114966335.433[                           ----------------      ]MotA_activTranscription factor MotA, activation domain. Members of this family of viral protein domains are implicated in transcriptional activation. They are almost completely alpha-helical, with five alpha-helices and a short, two-stranded, beta-ribbon. Four alpha helices (alpha1, alpha3, alpha4 and alpha5) are amphipathic and pack their hydrophobic surfaces around the central helix alpha2.
417pfam05584723435.333[                              ---------          ]Sulfolobus_pRNSulfolobus plasmid regulatory protein. This family consists of several plasmid regulatory proteins from the extreme thermophilic and acidophilic archaea Sulfolobus.
418PRK117162693835.327[                                  ---------      ]PRK11716DNA-binding transcriptional regulator IlvY; Provisional
419pfam05595912934.981[                              -------            ]DUF771Domain of unknown function (DUF771). Family of uncharacterized ORFs found in Bacteriophage and Lactococcus lactis.
420PRK054561724734.843[                               -------------     ]PRK05456ATP-dependent protease subunit HslV; Provisional
421COG44961003534.453[                            ---------            ]YerCPredicted DNA-binding transcriptional regulator YerC, contains ArsR-like HTH domain
422TIGR026191496834.11E+02[        -----------------                        ]TIGR02619putative CRISPR-associated protein, APE2256 family. This model represents a conserved domain of about 150 amino acids found in at least five archaeal species and three bacterial species, exclusively in species with CRISPR (Clustered Regularly Interspaced Short Palidromic Repeats). In six of eight species, the member of this family is in the vicinity of a CRISPR/Cas locus.
423PRK048419036833.942[                     -----------------           ]PRK04841transcriptional regulator MalT; Provisional
424PRK043383823233.851[                               -------           ]PRK04338N(2),N(2)-dimethylguanosine tRNA methyltransferase; Provisional
425COG31322153833.431[                                 ----------      ]YceHUncharacterized conserved protein YceH, UPF0502 family
426TIGR029891594432.81.1E+02[                           ----------            ]Sig-70_gvs1RNA polymerase sigma-70 factor, Rhodopirellula/Verrucomicrobium family. This group of sigma factors are members of the sigma-70 family (TIGR02937) and are abundantly found in the species Rhodopirellula baltica (11), and Verrucomicrobium spinosum (16) and to a lesser extent in Gemmata obscuriglobus (2).
427TIGR029591704632.556[                          -----------            ]SigZRNA polymerase sigma factor, SigZ family. This family of RNA polymerase sigma factors is a member of the Sigma-70 subfamily (TIGR02937). One of these is designated as SigZ in B. subtilis (Swiss_Prot: SIGZ_BACSU). Interestingly, this group has a very sporatic distribution, B. subtilis, for instance, being the only sequenced strain of Bacilli with a member. Dechloromonas aromatica RCB appears to have two of these sigma factors. A member appears on a plasmid found in Photobacterium profundum SS9 and Vibrio fischeri ES114 (where a second one is chromosomally encoded).
428cd096941817232.599[        ------------------                       ]Csm6_III-ACRISPR/Cas system-associated protein Csm6. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; loosely associated with CRISPR/Cas systems
429pfam071061695032.51.1E+02[                             --------------      ]TBPIPTat binding protein 1(TBP-1)-interacting protein (TBPIP). This family consists of several eukaryotic TBP-1 interacting protein (TBPIP) sequences. TBP-1 has been demonstrated to interact with the human immunodeficiency virus type 1 (HIV-1) viral protein Tat, then modulate the essential replication process of HIV. In addition, TBP-1 has been shown to be a component of the 26S proteasome, a basic multiprotein complex that degrades ubiquitinated proteins in an ATP-dependent fashion. Human TBPIP interacts with human TBP-1 then modulates the inhibitory action of human TBP-1 on HIV-Tat-mediated transactivation.
430COG3811852832.323[                                    -------      ]YjhXUncharacterized protein YjhX, UPF0386/DUF2084 family
431PRK139802653732.186[                             ---------           ]PRK13980NAD synthetase; Provisional
432TIGR003083744132.137[                            ----------           ]TRM1tRNA(guanine-26,N2-N2) methyltransferase. This enzyme is responsible for two methylations of a characteristic guanine of most tRNA molecules. The activity has been demonstrated for eukaryotic and archaeal proteins, which are active when expressed in E. coli, a species that lacks this enzyme. At least one Eubacterium, Aquifex aeolicus, has an ortholog, as do all completed archaeal genomes.
433pfam043371482331.845[                                 ------          ]DUF480Protein of unknown function, DUF480. This family consists of several proteins of uncharacterized function.
434PRK132392063631.898[                            ---------            ]PRK13239alkylmercury lyase; Provisional
435cd019131714731.843[                               -------------     ]protease_HslVProtease HslV and the ATPase/chaperone HslU are part of an ATP-dependent proteolytic system that is the prokaryotic homolog of the proteasome. HslV is a dimer of hexamers (a dodecamer) that forms a central proteolytic chamber with active sites on the interior walls of the cavity. HslV shares significant sequence and structural similarity with the proteasomal beta-subunit and both are members of the Ntn-family of hydrolases. HslV has a nucleophilic threonine residue at its N-terminus that is exposed after processing of the propeptide and is directly involved in active site catalysis.
436pfam12836653331.567[                              --------           ]HHH_3Helix-hairpin-helix motif. The HhH domain is a short DNA-binding domain.
437PRK102252574131.456[                                ----------       ]PRK10225DNA-binding transcriptional repressor UxuR; Provisional
438pfam00216904431.41.5E+02[                                -----------      ]Bac_DNA_bindingBacterial DNA-binding protein.
439PRK093912303031.337[                               -------           ]fixKtranscriptional regulator FixK; Provisional
440cd04766913530.81.3E+02[                          ----------             ]HTH_HspRHelix-Turn-Helix DNA binding domain of the HspR transcription regulator. Helix-turn-helix (HTH) transcription regulator HspR, N-terminal domain. Heat shock protein regulators (HspR) have been shown to regulate expression of specific regulons in response to high temperature or high osmolarity in Streptomyces and Helicobacter, respectively. These proteins share the N-terminal DNA binding domain with other transcription regulators of the MerR superfamily that promote transcription by reconfiguring the spacer between the -35 and -10 promoter elements. A typical MerR regulator is comprised of distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their conserved N-terminal domains contain predicted winged HTH motifs that mediate DNA binding, while the dissimilar C-terminal domains bind specific coactivator molecules.
441pfam087651073830.774[                            ---------            ]MorMor transcription activator family. Mor (Middle operon regulator) is a sequence specific DNA binding protein. It mediates transcription activation through its interactions with the C-terminal domains of the alpha and sigma subunits of bacterial RNA polymerase. The N terminal region of Mor is the dimerization region, and the C terminal contains a helix-turn-helix motif which binds DNA.
442TIGR036921714730.653[                               -------------     ]ATP_dep_HslVATP-dependent protease HslVU, peptidase subunit. The ATP-dependent protease HslVU, a complex of hexameric HslU active as a protein-unfolding ATPase and dodecameric HslV, the catalytic threonine protease.
443pfam05344653530.684[                             --------            ]DUF746Domain of Unknown Function (DUF746). This is a short conserved region found in some transposons. Structural modelling suggests this domain may bind nucleic acids.
444pfam09382923630.541[                                  ---------      ]RQCRQC domain. This DNA-binding domain is found in the RecQ helicase among others and has a helix-turn-helix structure. The RQC domain, found only in RecQ family enzymes, is a high affinity G4 DNA binding domain.
445PRK110743006030.543[                           ----------------      ]PRK11074putative DNA-binding transcriptional regulator; Provisional
446TIGR02647775330.567[                            --------------       ]DNATIGR02647 family protein. Members of this family are found, so far, only in the Gammaproteobacteria. The function is unknown. The location on the chromosome usually is not far from housekeeping genes rather than in what is clearly, say, a prophage region. Some members have been annotated in public databases as DNA-binding protein inhibitor Id-2-related protein, putative transcriptional regulator, or hypothetical DNA binding protein.
447pfam09079853130.377[                                -------          ]Cdc6_CCDC6, C terminal. The C terminal domain of CDC6 assumes a winged helix fold, with a five alpha-helical bundle (alpha15-alpha19) structure, backed on one side by three beta strands (beta6-beta8). It has been shown that this domain acts as a DNA-localisation factor, however its exact function is, as yet, unknown. Putative functions include: (1) mediation of protein-protein interactions and (2) regulation of nucleotide binding and hydrolysis. Mutagenesis studies have shown that this domain is essential for appropriate Cdc6 activity.
448cd14315431930.227[----                                             ]UBA1_UBAP1UBA1 domain found in vertebrate ubiquitin-associated protein 1 (UBAP-1). UBAP-1, also called nasopharyngeal carcinoma-associated gene 20 protein, is a ubiquitously expressed protein that may play an important role in the ubiquitin pathway and cell progression. It co-localizes with TDP-43 proteins in neuronal cytoplasmic inclusions and acts as a genetic risk factor for frontotemporal lobar degeneration (FTLD). Moreover, UBAP-1, together with VPS37A, forms an endosome-specific endosomal sorting complexes I required for transport (ESCRT-I) complex that displays a restricted cellular function, ubiquitin-dependent endosomal sorting and multivesicular body (MVB) biogenesis. UBAP-1 contains an N-terminal UBAP-1-MVB12-associated (UMA) domain, and two tandem ubiquitin-associated (UBA) domains that may be responsible for the binding of ubiquitin-conjugating enzymes. This model corresponds to UBA1 domain.
449pfam061631275430.21.7E+02[                          --------------         ]DUF977Bacterial protein of unknown function (DUF977). This family consists of several hypothetical bacterial proteins from Escherichia coli and Salmonella typhi. The function of this family is unknown.
450PRK101002164430.161[                          -----------            ]PRK10100DNA-binding transcriptional regulator CsgD; Provisional
451COG29962875230.01.3E+02[                          -------------          ]CvfBPredicted RNA-binding protein, contains S1 domains, virulence factor B family
452TIGR011112385429.81.1E+02[  --------------                                 ]mtrAN5-methyltetrahydromethanopterin:coenzyme M methyltransferase subunit A. This model describes N5-methyltetrahydromethanopterin: coenzyme M methyltransferase subunit A in methanogenic archaea. This methyltranferase is a membrane-associated enzyme complex that uses methyl-transfer reaction to drive sodium-ion pump. Archaea have evolved energy-yielding pathways marked by one-carbon biochemistry featuring novel cofactors and enzymes. This transferase (encoded by subunit A) is involved in the transfer of 'methyl' group from N5-methyltetrahydromethanopterin to coenzyme M. In an accompanying reaction, methane is produced by two-electron reduction of methyl-coenzyme M by another enzyme, methyl-coenzyme M reductase.
453TIGR034331005829.766[                            --------------       ]padR_acidobacttranscriptional regulator, Acidobacterial, PadR-family. Members of this protein family are putative transcriptional regulators of the PadR family, as found in species of the Acidobacteria. This family of proteins has expanded greatly in this lineage, and where it regularly is found in the vicinity of a putative transporter protein
454COG13861843929.71.4E+02[                            ---------            ]ScpBChromosome segregation and condensation protein ScpB
455pfam130221393529.469[                            --------             ]HTH_Tnp_1_2Helix-turn-helix of insertion element transposase. This is a family of largely phage proteins which are likely to be a helix-turn-helix insertion elements.
456pfam05155921929.446[                                  ----           ]Phage_XPhage X family. This family is the product of Gene X. The function of this protein is unknown.
457PRK082952084429.369[                          -----------            ]PRK08295RNA polymerase factor sigma-70; Validated
458TIGR007211373729.293[                          ----------             ]tfxDNA-binding protein, Tfx family. PSI-BLAST starting with one member of this family converges with significant hits only to other members of the family, which is restricted to the Archaea. Homology is strongest in the helix-turn-helix-containing N-terminal region. Tfx from Methanobacterium thermoautotrophicum is associated with the operon for molybdenum formyl-methanofuran dehydrogenase and binds a DNA sequence near its promoter.
459cd096603945128.81.5E+02[          -------------                          ]Csx1_III-UCRISPR/Cas system-associated protein Csx1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Protein of this family often fused to HTH domain; Some proteins could have an additional fusion with RecB-family nuclease domain; Core domain appears to have a Rossmann-like fold; loosely associated with CRISPR/Cas systems; also known as MJ1666 family
460cd13831865228.81.8E+02[                                --------------   ]HUhistone-like DNA-binding protein HU. This subfamily includes HU and HU-like domains. HU is a conserved nucleoid-associated protein (NAP) which binds non-specifically to duplex DNA with a particular preference for targeting nicked and bent DNA. It is highly basic and contributes to chromosomal compaction and maintenance of negative supercoiling, thus often referred to as histone-like protein. HU can induce DNA bends, condense DNA in a fiber and also interact with single stranded DNA. It contains two homologous subunits, alpha and beta, typically forming homodimers (alpha-alpha and beta-beta), except in E. coli and other enterobacteria, which form heterodimers (alpha-beta). In E. coli, HU binds uniformly to the chromosome, with a preference for damaged or distorted DNA structures and can introduce negative supercoils into closed circular DNA in the presence of topoisomerase I. Anabaena HU (AHU) shows preference for A/T-rich region in the center of its DNA binding site.
461COG48613455728.793[                          --------------         ]COG4861Uncharacterized protein
462cd056444154628.61.3E+02[                           -----------           ]M28_like_3M28 Zn-Peptidases. Peptidase family M28 (also called aminopeptidase Y family), uncharacterized subfamily. The M28 family contains aminopeptidases as well as carboxypeptidases. They typically have co-catalytic zinc ions; each zinc ion is tetrahedrally co-ordinated, with three amino acid ligands plus activated water; one aspartate residue binds both metal ions. Proteins in this subfamily conserve some of the metal-coordinating residues of the typically co-catalytic M28 family, and appear to bind a single metal (Zn) ion.
463pfam020053754028.555[                            ----------           ]TRMN2,N2-dimethylguanosine tRNA methyltransferase. This enzyme EC:2.1.1.32 used S-AdoMet to methylate tRNA. The TRM1 gene of Saccharomyces cerevisiae is necessary for the N2,N2-dimethylguanosine modification of both mitochondrial and cytoplasmic tRNAs. The enzyme is found in both eukaryotes and archaebacteria
464COG48001703628.41E+02[                           ---------             ]COG4800Predicted transcriptional regulator with an HTH domain
465TIGR04381493428.31.6E+02[                            ---------            ]HTH_TypRTyrR family helix-turn-helix domain. This model describes the C-terminal DNA-binding helix-turn-helix domain of several regulators of aromatic amino acid metabolism. Examples include TyrR in Escherichia coli and PhhR in Pseudomonas putida. Most members of this family have a sigma-54 interaction domain.
466pfam017101203928.381[                            ----------           ]HTH_Tnp_IS630Transposase. Transposase proteins are necessary for efficient DNA transposition. This family includes insertion sequences from Synechocystis PCC 6803 three of which are characterized as homologous to bacterial IS5- and IS4- and to several members of the IS630-Tc1-mariner superfamily.
467COG21501674928.260[                            ------------         ]COG2150Predicted regulator of amino acid metabolism, contains ACT domain
468PRK096381762328.244[                               -----             ]PRK09638RNA polymerase sigma factor SigY; Reviewed
469pfam013991053328.188[                              --------           ]PCIPCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15).
470cd04331806727.91.2E+02[                       ----------------------    ]RNAP_E_NRNAP_E_N: RpoE, N-terminal ribonucleoprotein (RNP) domain. RpoE (subunit E) is a subunit of the archaeal RNA polymerase (RNAP) that is homologous to Rpb7 of eukaryotic RNAP II, Rpc25 of eukaryotic RNAP III, and Rpa43 of eukaryotic RNAP I. RpoE heterodimerizes with RpoF, another RNA polymerase subunit. RpoE has an elongated two-domain structure that includes an N-terminal RNP domain and a C-terminal oligonucleotide-binding (OB) domain. Both domains of RpoE bind single-stranded RNA.
471PRK103601964427.942[                          -----------            ]PRK10360DNA-binding transcriptional activator UhpA; Provisional
472PRK038372415427.61.2E+02[                          --------------         ]PRK03837transcriptional regulator NanR; Provisional
473cd14270301527.536[---                                              ]UBAUBA domain found in proteins involved in ubiquitin-mediated proteolysis. The ubiquitin-associated (UBA) domains are commonly occurring sequence motifs found in proteins involved in ubiquitin-mediated proteolysis. They contribute to ubiquitin (Ub) binding or ubiquitin-like (UbL) domain binding. However, some kinds of UBA domains can only the bind UbL domain, but not the Ub domain. UBA domains are normally comprised of compact three-helix bundles which contain a conserved GF/Y-loop. They can bind polyubiquitin with high affinity. They also bind monoubiquitin and other proteins. Most UBA domain-containing proteins have one UBA domain, but some harbor two or three UBA domains.
474PRK096371815327.41.2E+02[                        -------------            ]PRK09637RNA polymerase sigma factor SigZ; Provisional
475cd13832854527.32.2E+02[                                -----------      ]IHFIntegration host factor (IHF) and similar proteins. This subfamily includes integration host factor (IHF) and IHF-like domains. IHF is a nucleoid-associated protein (NAP) that binds and sharply bends many DNA targets in a sequence specific manner. It is a heterodimeric protein composed of two highly homologous subunits IHFA (IHF-alpha) and IHFB (IHF-beta). It is known to act as a transcription factor at many gene regulatory regions in E. coli. IHF is an essential cofactor in phage lambda site-specific recombination, having an architectural role during assembly of specialized nucleoprotein structures (snups). IHF is also involved in formation as well as maintenance of bacterial biofilms since it is found in complex with extracellular DNA (eDNA) within the extracellular polymeric substances (EPS) matrix of many biofilms. This subfamily also includes the protein Hbb from tick-borne spirochete Borrelia burgdorferi, responsible for causing Lyme disease in humans. Hbb, a homodimer, shows DNA sequence preferences that are related, yet distinct from those of IHF.
476PRK009734467827.31.4E+02[ -------------------                             ]PRK00973glucose-6-phosphate isomerase; Provisional
477pfam06322642627.333[                                ------           ]Phage_NinHPhage NinH protein. This family consists of several phage NinH proteins. The function of this family is unknown.
478COG4519953327.368[                              --------           ]COG4519Uncharacterized protein
479PRK0961655213727.157[---------------------------------------          ]pheTphenylalanyl-tRNA synthetase subunit beta; Reviewed
480PRK119222312427.089[                               ------            ]PRK11922RNA polymerase sigma factor; Provisional
481PRK103393272227.038[                               -----             ]PRK10339DNA-binding transcriptional repressor EbgR; Provisional
482COG3655732526.959[                              ------             ]YozGDNA-binding transcriptional regulator, XRE family
483COG29098944126.871[                           ----------            ]MalTATP-, maltotriose- and DNA-dependent transcriptional regulator MalT
484cd037581933626.659[        ---------                                ]proteasome_beta_type_2proteasome beta type-2 subunit. The 20S proteasome, multisubunit proteolytic complex, is the central enzyme of nonlysosomal protein degradation in both the cytosol and nucleus. It is composed of 28 subunits arranged as four homoheptameric rings that stack on top of one another forming an elongated alpha-beta-beta-alpha cylinder with a central cavity. The proteasome alpha and beta subunits are members of the N-terminal nucleophile (Ntn)-hydrolase superfamily. Their N-terminal threonine residues are exposed as a nucleophile in peptide bond hydrolysis.Mammals have 7 alpha and 7 beta proteasome subunits while archaea have one of each.
485COG36771293926.390[                            ---------            ]InsATransposase
486pfam143371835026.282[                          -------------          ]DUF4393Domain of unknown function (DUF4393). This family of proteins is found in bacteria, archaea and viruses. Proteins in this family are typically between 254 and 285 amino acids in length.
487cd14397411226.241[---                                              ]UBA_LATS1UBA domain found in vertebrate serine/threonine-protein kinase LATS1. LATS1, also called large tumor suppressor homolog 1 or WARTS protein kinase (warts), is a serine/threonine-protein kinase that highly conserved from fly to human. It plays a crucial role in the prevention of tumor formation by controlling mitosis progression. Human LATS1 is the mammalian homologs of Drosophila lats/warts gene that could suppress tumor growth and rescue all developmental defects in flies, including embryonic lethality. It forms a regulatory complex with zyxin, a regulator of actin filament assembly. The LATS1/zyxin complex plays a role in controlling mitosis progression on mitotic apparatus. LATS1 is phosphorylated in a cell-cycle-dependent manner and complexes with CDC2 in early mitosis. It can negatively modulates tumor cell growth by inducing G(2)/M cell cycle transition or apoptosis. It also functions as a mitotic exit network kinase interacting with MOB1A, a protein whose homolog in budding yeast associates with kinases involved in mitotic exit. Moreover, LATS1 acts as a novel cytoskeleton regulator that affects cytokinesis by regulating actin polymerization through inhibiting LIMK1. LATS1 can also inhibit transcription regulation and transformation functions of oncogene YAP by inhibiting its nuclear translocation through phosphorylation. In addition, LATS1 can regulate the transcriptional activity of forkhead L2 (FOXL2) via phosphorylation. It also acts as an acting-binding protein that can negatively regulate the actin polymerization. LATS1 contains an N-terminal ubiquitin-associated (UBA) domain and a C-terminal protein kinase domain.
488pfam098241605825.91.4E+02[              --------------------               ]ArsRArsR transcriptional regulator. Members of this family of archaeal proteins are conserved transcriptional regulators belonging to the ArsR family.
489cd04774962725.577[                               --------          ]HTH_YfmPHelix-Turn-Helix DNA binding domain of the YfmP transcription regulator. Helix-turn-helix (HTH) transcription regulator, YfmP, and related proteins; N-terminal domain. YfmP regulates the multidrug efflux protein, YfmO, and indirectly regulates the expression of the Bacillus subtilis copZA operon encoding a metallochaperone, CopZ, and a CPx-type ATPase efflux protein, CopA. These proteins belong to the MerR superfamily of transcription regulators that promote expression of several stress regulon genes by reconfiguring the spacer between the -35 and -10 promoter elements. Their conserved N-terminal domains contain predicted winged HTH motifs that mediate DNA binding, while the dissimilar C-terminal domains bind specific coactivator molecules.
490COG17665458425.473[                 ------------------------        ]FliFFlagellar biosynthesis/type III secretory pathway M-ring protein FliF/YscJ
491pfam101411956125.359[                         ---------------         ]ssDNA-exonuc_CSingle-strand DNA-specific exonuclease, C terminal domain. Members of this set of prokaryotic domains are found in a set of single-strand DNA-specific exonucleases, including RecJ. Their exact function has not, as yet, been determined.
492PRK068403393125.255[                              ---------          ]PRK06840hypothetical protein; Validated
493COG08562032825.152[                              -------            ]PyrE2Orotate phosphoribosyltransferase homolog
494COG54051784625.071[                               -------------     ]HslVATP-dependent protease HslVU (ClpYQ), peptidase subunit
495COG14813084524.51.8E+02[                           -----------           ]WhiADNA-binding transcriptional regulator WhiA, involved in cell division
496PRK085993774924.51.4E+02[                              -------------      ]PRK08599coproporphyrinogen III oxidase; Provisional
497COG14381504724.21.5E+02[                            -------------        ]ArgRArginine repressor
498COG20421792824.047[                                    -------      ]Tsr3Ribosome biogenesis protein Tsr3 (rRNA maturation)
499cd14341522124.099[                                -----            ]UBA_MELKUBA domain found in maternal embryonic leucine zipper kinase (MELK) and similar proteins. MELK, also called protein kinase Eg3 (pEg3 kinase), protein kinase PK38 (PK38), or tyrosine-protein kinase MELK, is a cell cycle dependent protein kinase involved in diverse cell processes including stem cell renewal, cell cycle progression, cell proliferation, apoptosis and mRNA processing. It is expressed in normal tissues and especially in cancer cells. It is upregulated in cancer tissues and thus may act as potential anticancer target in diverse tumor entities. MELK comprises an N-terminal protein kinase catalytic domain, followed by an ubiquitin-associated (UBA) domain, and a C-terminal autoinhibitory domain of 5'-AMP-activated protein kinase (AMPK).
500pfam13413622523.71.2E+02[                              -------            ]HTH_25Helix-turn-helix domain. This domain is a helix-turn-helix domain that probably binds to DNA.