match no.target idtarget lengthalignment lengthprobabilityE-valuecoveragematch description
1COG495136119899.83.4E-19[           --------------------                  ]COG4951Uncharacterized protein
2pfam0189613110198.22.6E-06[                    ---------                    ]DNA_primase_SEukaryotic and archaeal DNA primase small subunit. DNA primase synthesizes the RNA primers for the Okazaki fragments in lagging strand DNA synthesis. DNA primase is a heterodimer of large and small subunits. This family also includes baculovirus late expression factor 1 or LEF-1 proteins. Baculovirus LEF-1 is a DNA primase enzyme. Bacterial DNA primase adopts a different fold to archaeal and eukaryotic primases.
3pfam13414696497.98.6E-05[   -----                                         ]TPR_11TPR repeat.
4pfam14559686097.69.7E-05[    -----                                        ]TPR_19Tetratricopeptide repeat.
5pfam13432655697.50.00056[    ----                                         ]TPR_16Tetratricopeptide repeat.
6pfam13424786396.90.0041[  ------                                         ]TPR_12Tetratricopeptide repeat.
7cd0052513610996.30.039[               --------------                    ]AE_Prim_S_likeAE_Prim_S_like: primase domain similar to that found in the small subunit of archaeal and eukaryotic (A/E) DNA primases. The replication machineries of A/Es are distinct from that of bacteria. Primases are DNA-dependent RNA polymerases which synthesis the short RNA primers required for DNA replication. In eukaryotes, this small catalytically active primase subunit (p50) and a larger primase subunit (p60), referred to jointly as the core primase, associate with the B subunit and the DNA polymerase alpha subunit in a complex, called Pol alpha-pri. In addition to its catalytic role in replication, eukaryotic DNA primase may play a role in coupling replication to DNA damage repair and in checkpoint control during S phase. Pfu41 and Pfu46 comprise the primase complex of the archaea Pyrococcus furiosus; these proteins have sequence identity to the eukaryotic p50 and p60 primase proteins respectively. Pfu41 preferentially uses dNTPs as substrate. Pfu46 regulates the primase activity of Pfu41. Also found in this group is the primase-polymerase (primpol) domain of replicases from archaeal plasmids including the ORF904 protein of pRN1 from Sulfolobus islandicus (pRN1 primpol). The pRN1 primpol domain exhibits DNA polymerase and primase activities; a cluster of active site residues (three acidic residues, and a histidine) is required for both these activities. The pRN1 primpol primase activity prefers dNTPs to rNTPs; however incorporation of dNTPs requires rNTP as cofactor. This group also includes the Pol domain of bacterial LigD proteins such Mycobacterium tuberculosis (Mt)LigD. MtLigD contains an N-terminal Pol domain, a central phosphoesterase module, and a C-terminal ligase domain. LigD Pol plays a role in non-homologous end joining (NHEJ)-mediated repair of DNA double-strand breaks (DSB) in vivo, perhaps by filling in short 5'-overhangs with ribonucleotides; the filled in termini would be sealed by the associated LigD ligase domain. The MtLigD Pol domain is stimulated by manganese, is error-prone, and prefers adding rNTPs to dNTPs in vitro.
8pfam13431343396.20.0044[    ---                                          ]TPR_17Tetratricopeptide repeat.
9pfam13428444195.80.023[   ---                                           ]TPR_14Tetratricopeptide repeat.
10pfam07719343494.80.042[     ---                                         ]TPR_2Tetratricopeptide repeat. This Pfam entry includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by pfam00515.
11TIGR025212349194.80.21[--------                                         ]type_IV_pilWtype IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF and PilW. This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
12cd0486023215094.20.087[               --------------                    ]AE_Prim_SAE_Prim_S: primase domain similar to that found in the small subunit of archaeal and eukaryotic (A/E) DNA primases. Primases are DNA-dependent RNA polymerases which synthesis the short RNA primers required for DNA replication. In addition to its catalytic role in replication, DNA primase may play a role in coupling replication to DNA damage repair and in checkpoint control during S phase. In eukaryotes, this small catalytically active primase subunit (p50) and a larger primase subunit (p60), referred to jointly as the core primase, associate with the B subunit and the DNA polymerase alpha subunit in a complex, called Pol alpha-pri. The function of the larger primase subunit is unclear. Included in this group are Pfu41 and Pfu46, these two proteins comprise the primase complex of the archaea Pyrococcus furiosus; Pfu41 and Pfu46 have sequence identity to the eukaryotic p50 and p60 primase proteins respectively. Pfu41 preferentially uses dNTPs as substrate. Pfu46 regulates the primase activity of Pfu41.
13TIGR025212346593.80.13[  ------                                         ]type_IV_pilWtype IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF and PilW. This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
14pfam13181343293.70.099[      --                                         ]TPR_8Tetratricopeptide repeat.
15COG30632508793.50.29[--------                                         ]PilFTfp pilus assembly protein PilF
16pfam00515343393.20.13[     ---                                         ]TPR_1Tetratricopeptide repeat.
17TIGR0291789910993.10.82[---------                                        ]PEP_TPR_lipoputative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
18pfam13371735892.70.34[    -----                                        ]TPR_9Tetratricopeptide repeat.
19pfam134292798492.01[--------                                         ]TPR_15Tetratricopeptide repeat.
20TIGR0277824510591.90.46[              -----------                        ]ligD_polDNA ligase D, polymerase domain. DNA repair of double-stranded breaks by non-homologous end joining (NHEJ) is accomplished by a two-protein system that is present in a minority of prokaryotes. One component is the Ku protein (see TIGR02772), which binds DNA ends. The other is a DNA ligase, a protein that is a multidomain polypeptide in most of those bacteria that have NHEJ, a permuted polypeptide in Mycobacterium tuberculosis and a few other species, and the product of tandem genes in some other bacteria. This model represents the polymerase domain.
21PRK117883896791.80.26[  ------                                         ]PRK11788tetratricopeptide repeat protein; Provisional
22pfam13428443790.90.38[      ---                                        ]TPR_14Tetratricopeptide repeat.
23COG30632506489.90.66[  ------                                         ]PilFTfp pilus assembly protein PilF
24cd0486122711089.70.75[              -----------                        ]LigD_Pol_likeLigD_Pol_like: Polymerase (Pol) domain of bacterial LigD proteins similar to Pseudomonas aeruginosa (Pae) LigD. The LigD Pol domain belongs to the archaeal/eukaryal primase (AEP) superfamily. In prokaryotes, LigD along with Ku is required for non-homologous end joining (NHEJ)-mediated repair of DNA double-strand breaks (DSB). NHEJ-mediated DNA DSB repair is error-prone. PaeLigD is monomeric, containing an N-terminal phosphoesterase module, a central polymerase (Pol) domain, and a C-terminal ATP-dependent ligase domain. Mycobacterium tuberculosis (Mt)LigD, also found in this group, is monomeric and contains the same modules but these are arranged differently: an N-terminal Pol domain, a central phosphoesterase module, and a C-terminal ligase domain. It has been suggested that LigD Pol contributes to NHEJ-mediated DNA DSB repair in vivo, by filling in short 5'-overhangs with ribonucleotides; the filled in termini would then be sealed by the associated LigD ligase domain, resulting in short stretches of RNA incorporated into the genomic DNA. The PaeLigD Pol domain in vitro, in a manganese-dependent fashion, catalyzes templated extensions of 5'-overhang duplex DNA, and nontemplated single-nucleotide additions to blunt-end duplex DNA; it preferentially adds single ribonucleotides at blunt DNA ends. PaeLigD Pol adds a correctly paired rNTP to the DNA primer termini more rapidly than it does a correctly paired dNTP; it has higher infidelity as an RNA polymerase than it does as a DNA polymerase, which is in keeping with the mutagenic property of NHEJ-mediated DNA DSB repair. The MtLigD Pol domain similarly is stimulated by manganese, is error-prone, and prefers adding rNTPs to dNTPs in vitro. The MtLigD Pol domain has been shown to prefer DNA gapped substrates containing a 5'-phosphate group at the gap.
25COG29563895188.71.7[    ----                                         ]YciMLipopolysaccharide biosynthesis regulator YciM, contains six TPR domains and a predicted metal-binding C-terminal domain
26COG42352877787.51.1[  --------                                       ]NrfGCytochrome c-type biogenesis protein CcmH/NrfG
27TIGR0393980010486.32[ ---------                                       ]PGA_TPR_OMPpoly-beta-1,6 N-acetyl-D-glucosamine export porin PgaA. Members of this protein family are the poly-beta-1,6 N-acetyl-D-glucosamine (PGA) export porin PgaA of Gram-negative bacteria. There is no counterpart in the poly-beta-1,6 N-acetyl-D-glucosamine biosynthesis systems of Gram-positive bacteria such as Staphylococcus epidermidis. The PGA polysaccharide adhesin is a critical determinant of biofilm formation. The conserved C-terminal domain of this outer membrane protein is preceded by a variable number of TPR repeats.
28pfam00515343186.31.3[   --                                            ]TPR_1Tetratricopeptide repeat.
29TIGR005403675985.90.96[  ------                                         ]TPR_hemY_coliheme biosynthesis-associated TPR protein. Members of this protein family are uncharacterized tetratricopeptide repeat (TPR) proteins invariably found in heme biosynthesis gene clusters. The absence of any invariant residues other than Ala argues against this protein serving as an enzyme per se. The gene symbol hemY assigned in E. coli is unfortunate in that an unrelated protein, protoporphyrinogen oxidase (HemG in E. coli) is designated HemY in Bacillus subtilis.
30PRK117883894785.51.1[    ----                                         ]PRK11788tetratricopeptide repeat protein; Provisional
31pfam13414696285.25.4[-----                                            ]TPR_11TPR repeat.
32COG32852997085.01.9[                  -------                        ]LigDEukaryotic-type DNA primase
33TIGR029178996984.84.5[   ------                                        ]PEP_TPR_lipoputative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
34pfam13174333183.91.5[      --                                         ]TPR_6Tetratricopeptide repeat.
35cd0486422811083.82.2[              -----------                        ]LigD_Pol_like_1LigD_Pol_like_1: Polymerase (Pol) domain of mostly bacterial LigD proteins similar to Pseudomonas aeruginosa (Pae) LigD, subgroup 1. The LigD Pol domain belongs to the archaeal/eukaryal primase (AEP) superfamily. In prokaryotes, LigD along with Ku is required for non-homologous end joining (NHEJ)-mediated repair of DNA double-strand breaks (DSB). NHEJ-mediated DNA DSB repair is error-prone. It has been suggested that LigD Pol contributes to NHEJ-mediated DNA DSB repair in vivo, by filling in short 5'-overhangs with ribonucleotides; the filled in termini would then be sealed by the associated LigD ligase domain, resulting in short stretches of RNA incorporated into the genomic DNA. The Pol domains of PaeLigD and Mycobacterium tuberculosis (Mt)LigD are stimulated by manganese, are error-prone, and prefer adding rNTPs to dNTPs in vitro; however PaeLigD and MtLigD belong to other subgroups, proteins in this subgroup await functional characterization.
36PRK111892969083.32.9[ --------                                        ]PRK11189lipoprotein NlpI; Provisional
37pfam13176362683.31.3[      --                                         ]TPR_7Tetratricopeptide repeat.
38TIGR027765527483.12.9[                  -------                        ]NHEJ_ligase_prkDNA ligase D. Members of this protein family are DNA ligases involved in the repair of DNA double-stranded breaks by non-homologous end joining (NHEJ). The system of the bacterial Ku protein (TIGR02772) plus this DNA ligase is seen in about 20 % of bacterial genomes to date and at least one archaeon (Archeoglobus fulgidus). This model describes a central and a C-terminal domain. These two domains may be permuted, as in genus Mycobacterium, or divided into tandem ORFs, and therefore not be identified by this model. An additional N-terminal 3'-phosphoesterase (PE) domain present in some but not all examples of this ligase is not included in the seed alignment for this model; it only represents the central ATP-dependent ligase domain and the C-terminal polymerase domain. Most examples of genes for this ligase are adjacent to the gene for Ku.
39pfam07719343282.92.3[   --                                            ]TPR_2Tetratricopeptide repeat. This Pfam entry includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by pfam00515.
40TIGR025521359582.98.6[--------                                         ]LcrH_SycDtype III secretion low calcium response chaperone LcrH/SycD. Genes in this family are found in type III secretion operons. LcrH, from Yersinia is believed to have a regulatory function in the low-calcium response of the secretion system. The same protein is also known as SycD (SYC = Specific Yop Chaperone) for its chaperone role. In Pseudomonas, where the homolog is known as PcrH, the chaperone role has been demonstrated and the regulatory role appears to be absent. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
41COG50102575782.52.6[    -----                                        ]TadDFlp pilus assembly protein TadD, contains TPR repeats
42COG42352878081.23.2[   -------                                       ]NrfGCytochrome c-type biogenesis protein CcmH/NrfG
43COG478348410081.13[ ---------                                       ]YfgCPutative Zn-dependent protease, contains TPR repeats
44cd0486222710780.33.5[              -----------                        ]PaeLigD_Pol_likePaeLigD_Pol_like: Polymerase (Pol) domain of bacterial LigD proteins similar to Pseudomonas aeruginosa (Pae) LigD. The LigD Pol domain belongs to the archaeal/eukaryal primase (AEP) superfamily. In prokaryotes, LigD along with Ku is required for non-homologous end joining (NHEJ)-mediated repair of DNA double-strand breaks (DSB). NHEJ-mediated DNA DSB repair is error-prone. PaeLigD is monomeric, containing an N-terminal phosphoesterase module, a central polymerase (Pol) domain, and a C-terminal ATP-dependent ligase domain. It has been suggested that LigD Pol contributes to NHEJ-mediated DNA DSB repair in vivo, by filling in short 5'-overhangs with ribonucleotides; the filled in termini would then be sealed by the associated LigD ligase domain, resulting in short stretches of RNA incorporated into the genomic DNA. The PaeLigD Pol domain in vitro, in a manganese-dependent fashion, catalyzes templated extensions of 5'-overhang duplex DNA, and nontemplated single-nucleotide additions to blunt-end duplex DNA; it preferentially adds single ribonucleotides at blunt DNA ends. PaeLigD Pol adds a correctly paired rNTP to the DNA primer termini more rapidly than it does a correctly paired dNTP; it has higher infidelity as an RNA polymerase than it does as a DNA polymerase, which is in keeping with the mutagenic property of NHEJ-mediated DNA DSB repair.
45COG22193637177.03.4[                                  ------         ]PRI2Eukaryotic-type DNA primase, large subunit
46pfam13181343175.75.2[   --                                            ]TPR_8Tetratricopeptide repeat.
47pfam12895814375.310[    ---                                          ]Apc3Anaphase-promoting complex, cyclosome, subunit 3. Apc3, otherwise known as Cdc27, is one of the subunits of the anaphase-promoting complex or cyclosome. The anaphase-promoting complex is a multiprotein subunit E3 ubiquitin ligase complex that controls segregation of chromosomes and exit from mitosis in eukaryotes. The protein members of this family contain TPR repeats just as those of Apc7 do, and it appears that these TPR units bind the C-termini of the APC co-activators CDH1 and CDC20.
48pfam134292798974.66.5[   ----------                                    ]TPR_15Tetratricopeptide repeat.
49pfam13432656373.721[-----                                            ]TPR_16Tetratricopeptide repeat.
50COG04572918673.535[ -------                                         ]TPRTetratricopeptide (TPR) repeat
51COG29762077072.16[   ------                                        ]YfgMPutative negative regulator of RcsB-dependent stress response
52COG04572915771.429[   -----                                         ]TPRTetratricopeptide (TPR) repeat
53COG501025710071.136[---------                                        ]TadDFlp pilus assembly protein TadD, contains TPR repeats
54PRK1144711576870.94.2[    -----                                        ]PRK11447cellulose synthase subunit BcsC; Provisional
55COG47852979069.49.8[ --------                                        ]NlpILipoprotein NlpI, contains TPR repeats
56COG49762875768.57.2[    -----                                        ]COG4976Predicted methyltransferase, contains TPR repeat
57pfam14559686168.318[------                                           ]TPR_19Tetratricopeptide repeat.
58cd048662238367.220[                 --------                        ]LigD_Pol_like_3LigD_Pol_like_3: Polymerase (Pol) domain of bacterial LigD proteins similar to Pseudomonas aeruginosa (Pae) LigD, subgroup 3. The LigD Pol domain belongs to the archaeal/eukaryal primase (AEP) superfamily. In prokaryotes, LigD along with Ku is required for non-homologous end joining (NHEJ)-mediated repair of DNA double-strand breaks (DSB). NHEJ-mediated DNA DSB repair is error-prone. It has been suggested that LigD Pol contributes to NHEJ-mediated repair DSB repair in vivo, by filling in short 5'-overhangs with ribonucleotides; the filled in termini would then be sealed by the associated LigD ligase domain, resulting in short stretches of RNA incorporated into the genomic DNA. The Pol domains of PaeLigD and Mycobacterium tuberculosis (Mt)LigD are stimulated by manganese, are error-prone, and prefer adding rNTPs to dNTPs in vitro; however PaeLigD and MtLigD belong to other subgroups, proteins in this subgroup await functional characterization.
59TIGR027951175666.321[    -----                                        ]tol_pal_ybgFtol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
60cd048652288965.114[                ---------                        ]LigD_Pol_like_2LigD_Pol_like_2: Polymerase (Pol) domain of bacterial LigD proteins similar to Pseudomonas aeruginosa (Pae) LigD, subgroup 2. The LigD Pol domain belongs to the archaeal/eukaryal primase (AEP) superfamily. In prokaryotes, LigD along with Ku is required for non-homologous end joining (NHEJ)-mediated repair of DNA double-strand breaks (DSB). NHEJ-mediated DNA DSB repair is error-prone. It has been suggested that LigD Pol contributes to NHEJ-mediated DNA DSB repair in vivo, by filling in short 5'-overhangs with ribonucleotides; the filled in termini would then be sealed by the associated LigD ligase domain, resulting in short stretches of RNA incorporated into the genomic DNA. The Pol domains of PaeLigD and Mycobacterium tuberculosis (Mt)LigD are stimulated by manganese, are error-prone, and prefer adding rNTPs to dNTPs in vitro; however PaeLigD and MtLigD belong to other subgroups, proteins in this subgroup await functional characterization.
61TIGR009906156565.011[   -----                                         ]3a0801s09mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
62TIGR039398008763.540[--------                                         ]PGA_TPR_OMPpoly-beta-1,6 N-acetyl-D-glucosamine export porin PgaA. Members of this protein family are the poly-beta-1,6 N-acetyl-D-glucosamine (PGA) export porin PgaA of Gram-negative bacteria. There is no counterpart in the poly-beta-1,6 N-acetyl-D-glucosamine biosynthesis systems of Gram-positive bacteria such as Staphylococcus epidermidis. The PGA polysaccharide adhesin is a critical determinant of biofilm formation. The conserved C-terminal domain of this outer membrane protein is preceded by a variable number of TPR repeats.
63pfam14561906462.024[    ------                                       ]TPR_20Tetratricopeptide repeat.
64cd058043554861.316[   -----                                         ]StaR_likeStaR_like; a well-conserved protein found in bacteria, plants, and animals. A family member from Streptomyces toyocaensis, StaR is part of a gene cluster involved in the biosynthesis of glycopeptide antibiotics (GPAs), specifically A47934. It has been speculated that StaR could be a flavoprotein hydroxylating a tyrosine sidechain. Some family members have been annotated as proteins containing tetratricopeptide (TPR) repeats, which may at least indicate mostly alpha-helical secondary structure.
65COG36292807061.022[ -------                                         ]DnrIDNA-binding transcriptional activator of the SARP family
66PRK059728605959.814[                    -----                        ]ligDATP-dependent DNA ligase; Reviewed
67TIGR025521355259.430[    ----                                         ]LcrH_SycDtype III secretion low calcium response chaperone LcrH/SycD. Genes in this family are found in type III secretion operons. LcrH, from Yersinia is believed to have a regulatory function in the low-calcium response of the secretion system. The same protein is also known as SycD (SYC = Specific Yop Chaperone) for its chaperone role. In Pseudomonas, where the homolog is known as PcrH, the chaperone role has been demonstrated and the regulatory role appears to be absent. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
68pfam125695166959.225[   ------                                        ]NARP1NMDA receptor-regulated protein 1. This domain family is found in eukaryotes, and is approximately 40 amino acids in length. The family is found in association with pfam07719, pfam00515. There is a single completely conserved residue L that may be functionally important. NARP1 is the mammalian homologue of a yeast N-terminal acetyltransferase that regulates entry into the G(0) phase of the cell cycle.
69pfam13424783256.514[     ---                                         ]TPR_12Tetratricopeptide repeat.
70COG14673412455.321[                     --                          ]PRI1Eukaryotic-type DNA primase, catalytic (small) subunit
71COG47002518555.199[--------                                         ]COG4700Uncharacterized protein
72pfam126881195554.519[    ----                                         ]TPR_5Tetratrico peptide repeat. BH0479 of Bacillus halodurans is a hypothetical protein which contains a tetratrico peptide repeat (TPR) structural motif. The TPR motif is often involved in mediating protein-protein interactions. This protein is likely to function as a dimer. The first 48 amino acids are not present in the clone construct. This Pfam entry includes tetratricopeptide-like repeats not detected by the pfam00515, pfam07719, pfam07720 and pfam07221 models.
73cd122121156154.534[    -----                                        ]Fis1Mitochondrial Fission Protein Fis1, cytosolic domain. Fis1, along with Dnm1 and Mdv1, is an essential protein in mediating mitochondrial fission. Dnm1 and Fis1 are highly conserved, with a common mechanism in disparate species. In mutants of these proteins, mitochondrial fission is impaired, resulting in networks of undivided mitochondria. The Fis1 N-terminus is cytosolic and tethered to the mitochondrial outer membrane via a C-terminal transmembrane domain. Fis1 appears to act via the recruitment of division complexes to the mitochondrial outer membrane, via interactions with Mdv1 or Caf4. Fis1 has tandem Tetratricopeptide repeat (TPR) motifs which are known to mediate protein-protein interactions.
74pfam07720343054.519[      --                                         ]TPR_3Tetratricopeptide repeat. This Pfam entry includes tetratricopeptide-like repeats found in the LcrH/SycD-like chaperones.
75pfam14853533852.717[      ---                                        ]Fis1_TPR_CFis1 C-terminal tetratricopeptide repeat. The mitochondrial fission protein Fis1 consists of two tetratricopeptide repeats. This domain is the C-terminal tetratricopeptide repeat
76PRK022493438352.622[                                 -------         ]PRK02249DNA primase large subunit; Validated
77pfam13374422852.426[     ---                                         ]TPR_10Tetratricopeptide repeat.
78pfam022593506850.145[  ------                                         ]FATFAT domain. The FAT domain is named after FRAP, ATM and TRRAP.
79cd058043557049.434[   ------                                        ]StaR_likeStaR_like; a well-conserved protein found in bacteria, plants, and animals. A family member from Streptomyces toyocaensis, StaR is part of a gene cluster involved in the biosynthesis of glycopeptide antibiotics (GPAs), specifically A47934. It has been speculated that StaR could be a flavoprotein hydroxylating a tyrosine sidechain. Some family members have been annotated as proteins containing tetratricopeptide (TPR) repeats, which may at least indicate mostly alpha-helical secondary structure.
80PRK142672534148.416[                                          ---    ]PRK14267phosphate ABC transporter ATP-binding protein; Provisional
81COG17292625647.958[    -----                                        ]YbgFPeriplasmic TolA-binding protein (function unknown)
82COG29563894747.524[    ----                                         ]YciMLipopolysaccharide biosynthesis regulator YciM, contains six TPR domains and a predicted metal-binding C-terminal domain
83cd02679793147.221[  ---                                            ]MIT_spastinMIT: domain contained within Microtubule Interacting and Trafficking molecules. This MIT domain sub-family is found in the AAA protein spastin, a probable ATPase involved in the assembly or function of nuclear protein complexes; spastins might also be involved in microtubule dynamics. The molecular function of the MIT domain is unclear.
84pfam00531812246.451[   --                                            ]DeathDeath domain.
85pfam006371384746.339[   -----                                         ]ClathrinRegion in Clathrin and VPS. Each region is about 140 amino acids long. The regions are composed of multiple alpha helical repeats. They occur in the arm region of the Clathrin heavy chain.
86pfam125695164546.136[    ----                                         ]NARP1NMDA receptor-regulated protein 1. This domain family is found in eukaryotes, and is approximately 40 amino acids in length. The family is found in association with pfam07719, pfam00515. There is a single completely conserved residue L that may be functionally important. NARP1 is the mammalian homologue of a yeast N-terminal acetyltransferase that regulates entry into the G(0) phase of the cell cycle.
87PRK026031725445.650[    ----                                         ]PRK02603photosystem I assembly protein Ycf3; Provisional
88cd158322787245.260[  -------                                        ]SNAPSoluble N-ethylmaleimide-sensitive factor (NSF) Attachment Protein family. Members of the soluble NSF attachment protein (SNAP) family are involved in intracellular membrane trafficking, including vesicular transport between the endoplasmic reticulum and Golgi apparatus. Higher eukaryotes contain three isoforms of SNAPs: alpha, beta, and gamma. Alpha-SNAP is universally present in eukaryotes and acts as an adaptor protein between SNARE (integral membrane SNAP receptor) and NSF for recruitment to the 20S complex. Beta-SNAP is brain-specific and shares high sequence identity (about 85%) with alpha-SNAP. Gamma-SNAP is weakly related (about 20-25% identity) to the two other isoforms, and is ubiquitous. It may help regulate the activity of the 20S complex. The X-ray structures of vertebrate gamma-SNAP and yeast Sec17, a SNAP family member, show similar all-helical structures consisting of an N-terminal extended twisted sheet of four Tetratricopeptide repeat (TPR)-like helical hairpins and a C-terminal helical bundle.
89COG17292625643.352[    -----                                        ]YbgFPeriplasmic TolA-binding protein (function unknown)
90cd079061906543.116[              -------                            ]Adenylation_DNA_ligase_LigD_LigCAdenylation domain of Mycobacterium tuberculosis LigD and LigC-like ATP-dependent DNA ligases. Bacterial DNA ligases are divided into two broad classes: NAD-dependent and ATP-dependent. All bacterial species have a NAD-dependent DNA ligase (LigA). Some bacterial genomes contain multiple genes for DNA ligases that are predicted to use ATP as their cofactor, including Mycobacterium tuberculosis LigB, LigC, and LigD. This group is composed of ATP-dependent DNA ligases similar to Mycobacterium tuberculosis LigC. ATP-dependent polynucleotide ligases catalyze phosphodiester bond formation using nicked nucleic acid substrates with the high energy nucleotide of ATP as a cofactor in a three step reaction mechanism. DNA ligases play a vital role in the diverse processes of DNA replication, recombination and repair. Members of this group contain adenylation and C-terminal oligonucleotide/oligosaccharide binding (OB)-fold domains, comprising a catalytic core unit that is common to all members of the ATP-dependent DNA ligase family. The adenylation domain binds ATP and contains many of the active-site residues. The common catalytic core unit comprises six conserved sequence motifs (I, III, IIIa, IV, V and VI) that define this family of related nucleotidyltransferases. LigD consists of a central ATP-dependent DNA ligase catalytic core unit fused to a C-terminal polymerase domain and an N-terminal 3'-phosphoesterase (PE) module. LigD catalyzes the end-healing and end-sealing steps during non-homologous end joining.
91TIGR005403673741.01.6E+02[     ---                                         ]TPR_hemY_coliheme biosynthesis-associated TPR protein. Members of this protein family are uncharacterized tetratricopeptide repeat (TPR) proteins invariably found in heme biosynthesis gene clusters. The absence of any invariant residues other than Ala argues against this protein serving as an enzyme per se. The gene symbol hemY assigned in E. coli is unfortunate in that an unrelated protein, protoporphyrinogen oxidase (HemG in E. coli) is designated HemY in Bacillus subtilis.
92PRK151796946839.971[   ------                                        ]PRK15179Vi polysaccharide biosynthesis protein TviE; Provisional
93COG31183049237.43.1E+02[--------                                         ]YbbNNegative regulator of GroEL, contains thioredoxin-like and TPR-like domains
94TIGR027951176037.097[   ------                                        ]tol_pal_ybgFtol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
95pfam099862145235.11.4E+02[    ----                                         ]DUF2225Uncharacterized protein conserved in bacteria (DUF2225). This domain, found in various hypothetical bacterial proteins, has no known function.
96TIGR033022357634.22.3E+02[  -------                                        ]OM_YfiOouter membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
97PRK111892965233.876[    ----                                         ]PRK11189lipoprotein NlpI; Provisional
98PRK100497658832.01.2E+02[  --------                                       ]pgaAouter membrane protein PgaA; Provisional
99TIGR009906155431.983[    ----                                         ]3a0801s09mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
100pfam12895814329.91.2E+02[    ----                                         ]Apc3Anaphase-promoting complex, cyclosome, subunit 3. Apc3, otherwise known as Cdc27, is one of the subunits of the anaphase-promoting complex or cyclosome. The anaphase-promoting complex is a multiprotein subunit E3 ubiquitin ligase complex that controls segregation of chromosomes and exit from mitosis in eukaryotes. The protein members of this family contain TPR repeats just as those of Apc7 do, and it appears that these TPR units bind the C-termini of the APC co-activators CDH1 and CDC20.
101cd048632317629.792[                  -------                        ]MtLigD_Pol_likeMtLigD_Pol_like: Polymerase (Pol) domain of bacterial LigD proteins similar to Mycobacterium tuberculosis (Mt)LigD. The LigD Pol domain belongs to the archaeal/eukaryal primase (AEP) superfamily. In prokaryotes, LigD along with Ku is required for non-homologous end joining (NHEJ)-mediated repair of DNA double-strand breaks (DSB). NHEJ-mediated DNA DSB repair is error-prone. MtLigD is monomeric and contains an N-terminal Pol domain, a central phosphoesterase module, and a C-terminal ligase domain. It has been suggested that LigD Pol contributes to NHEJ-mediated DNA DSB repair in vivo, by filling in short 5'-overhangs with ribonucleotides; the filled in termini would then be sealed by the associated LigD ligase domain, resulting in short stretches of RNA incorporated into the genomic DNA. The MtLigD Pol domain is stimulated by manganese, is error-prone, and prefers adding rNTPs to dNTPs in vitro. The MtLigD Pol domain has been shown to prefer DNA gapped substrates containing a 5'-phosphate group at the gap.
102pfam092953958229.43.2E+02[-------                                          ]ChAPsChAPs (Chs5p-Arf1p-binding proteins). ChAPs (Chs5p-Arf1p-binding proteins) are required for the export of specialized cargo from the Golgi. They physically interact with Chs3, Chs5 and the small GTPase Arf1, and they form also interactions with each other.
103pfam116901127329.476[                                         ------- ]DUF3287Protein of unknown function (DUF3287). This eukaryotic family of proteins has no known function.
104TIGR028411403928.263[                      ---                        ]spore_YyaCputative sporulation protein YyaC. A comparative genome analysis of all sequenced genomes of shows a number of proteins conserved strictly among the endospore-forming subset of the Firmicutes. This protein, also called YyaC, is a member of that panel and is otherwise uncharacterized. The second round of PSI-BLAST shows many similarities to the germination protease GPR, which is found in exactly the same set of organisms and has a known role in the sporulation/germination process.
105pfam092501602527.31.1E+02[                      --                         ]Prim-PolBifunctional DNA primase/polymerase, N-terminal. Members of this family adopt a structure consisting of a core of antiparallel beta sheets. They are found in various bacterial hypothetical proteins, and have been shown to harbour both primase and polymerase activities.
106PRK100497657627.31.4E+02[  -------                                        ]pgaAouter membrane protein PgaA; Provisional
107pfam13371736226.73.1E+02[------                                           ]TPR_9Tetratricopeptide repeat.
108cd01670795326.71.6E+02[   -----                                         ]DeathDeath Domain: a protein-protein interaction domain. Death Domains (DDs) are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including CARD (Caspase activation and recruitment domain), DED (Death Effector Domain), and PYRIN. Structural analysis of DD-DD complexes show that the domains interact with each other in many different ways. DD-containing proteins serve as adaptors in signaling pathways and they can recruit other proteins into signaling complexes. In mammals, they are prominent components of the programmed cell death (apoptosis) pathway and are found in a number of other signaling pathways. In invertebrates, they are involved in transcriptional regulation of zygotic patterning genes in insect embryogenesis, and are components of the ToII/NF-kappaB pathway, a conserved innate immune pathway in animal cells.
109pfam06967843926.734[                                         ---     ]Mo-nitro_CMo-dependent nitrogenase C-terminus. This family represents the C-terminus (approximately 80 residues) of a number of bacterial Mo-dependent nitrogenases. These are involved in nitrogen fixation in cyanobacteria. Note that many family members are hypothetical proteins.
110COG47834847326.62E+02[  -------                                        ]YfgCPutative Zn-dependent protease, contains TPR repeats
111cd087742254826.41.1E+02[    ----                                         ]14-3-314-3-3 domain. 14-3-3 domain is an essential part of 14-3-3 proteins, a ubiquitous class of regulatory, phosphoserine/threonine-binding proteins found in all eukaryotic cells, including yeast, protozoa and mammalian cells. 14-3-3 proteins play important roles in many biological processes that are regulated by phosphorylation, including cell cycle regulation, cell proliferation, protein trafficking, metabolic regulation and apoptosis. More than 300 binding partners of the 14-3-3 domain have been identified in all subcellular compartments and include transcription factors, signaling molecules, tumor suppressors, biosynthetic enzymes, cytoskeletal proteins and apoptosis factors. 14-3-3 binding can alter the conformation, localization, stability, phosphorylation state, activity as well as molecular interactions of a target protein. They function only as dimers, some preferring strictly homodimeric interaction, while others form heterodimers. Binding of the 14-3-3 domain to its target occurs in a phosphospecific manner where it binds to one of two consensus sequences of their target proteins; RSXpSXP (mode-1) and RXXXpSXP (mode-2). In some instances, 14-3-3 domain containing proteins are involved in regulation and signaling of a number of cellular processes in phosphorylation-independent manner. Many organisms express multiple isoforms: there are seven mammalian 14-3-3 family members (beta, gamma, eta, theta, epsilon, sigma, zeta), each encoded by a distinct gene, while plants contain up to 13 isoforms. The flexible C-terminal segment of 14-3-3 isoforms shows the highest sequence variability and may significantly contribute to individual isoform uniqueness by playing an important regulatory role by occupying the ligand binding groove and blocking the binding of inappropriate ligands in a distinct manner. Elevated amounts of 14-3-3 proteins are found in the cerebrospinal fluid of patients with Creutzfeldt-Jakob disease. In protozoa, like Plasmodium or Cryptosporidium parvum 14-3-3 proteins play an important role in key steps of parasite development.
112PRK103701985425.91.7E+02[    ----                                         ]PRK10370formate-dependent nitrite reductase complex subunit NrfG; Provisional
113pfam022593506625.73.8E+02[  ------                                         ]FATFAT domain. The FAT domain is named after FRAP, ATM and TRRAP.
114cd032101265825.187[                             ------              ]GST_C_PiC-terminal, alpha helical domain of Class Pi Glutathione S-transferases. Glutathione S-transferase (GST) C-terminal domain family, Class Pi subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. Class Pi GST is a homodimeric eukaryotic protein. The human GSTP1 is mainly found in erythrocytes, kidney, placenta and fetal liver. It is involved in stress responses and in cellular proliferation pathways as an inhibitor of JNK (c-Jun N-terminal kinase). Following oxidative stress, monomeric GSTP1 dissociates from JNK and dimerizes, losing its ability to bind JNK and causing an increase in JNK activity, thereby promoting apoptosis. GSTP1 is expressed in various tumors and is the predominant GST in a wide range of cancer cells. It has been implicated in the development of multidrug-resistant tumors.
115pfam099761455224.91.4E+02[   ----                                          ]TPR_21Tetratricopeptide repeat. TPR repeat
116pfam132813746524.688[-----                                            ]DUF4071Domain of unknown function (DUF4071). This domain is found at the N-terminus of many serine-threonine kinase-like proteins.
117cd008711756824.61.2E+02[                                   ------        ]PI4KaPhosphoinositide 4-kinase(PI4K), accessory domain (PIK domain); PIK domain is conserved in PI3 and PI4-kinases. Its role is unclear but it has been suggested to be involved in substrate presentation. PI4K phosphorylates hydroxylgroup at position 4 on the inositol ring of phosphoinositide, the first commited step in the phosphatidylinositol cycle.
118cd048591527823.687[               ---------                         ]Prim_PolPrim_Pol: Primase-polymerase (primpol) domain of the type found in bifunctional replicases from archaeal plasmids, including ORF904 protein of the crenarchaeal plasmid pRN1 from Sulfolobus islandicus (pRN1 primpol). These primpol domains belong to the archaeal/eukaryal primase (AEP) superfamily. This group includes archaeal plasmids and bacteriophage AEPs. The ORF904 protein is a multifunctional protein having ATPase, primase and DNA polymerase activity, and may play a role in the replication of the archaeal plasmid. The pRN1 primpol domain exhibits DNA polymerase and primase activities; a cluster of active site residues (three acidic residues, and a histidine) is required for both these activities. For pRN1 primpol, the primase activity prefers dNTPs to rNTPs; incorporation of dNTPs requires rNTP as cofactor. The pRN1 primpol contains an unusual zinc-binding stem, which is not conserved in other members of this group.
119PRK1144711575722.91.8E+02[   -----                                         ]PRK11447cellulose synthase subunit BcsC; Provisional
120PRK101535175522.81.3E+02[    -----                                        ]PRK10153DNA-binding transcriptional activator CadC; Provisional
121pfam099761458922.82.7E+02[--------                                         ]TPR_21Tetratricopeptide repeat. TPR repeat
122cd02679793122.31.1E+02[    ----                                         ]MIT_spastinMIT: domain contained within Microtubule Interacting and Trafficking molecules. This MIT domain sub-family is found in the AAA protein spastin, a probable ATPase involved in the assembly or function of nuclear protein complexes; spastins might also be involved in microtubule dynamics. The molecular function of the MIT domain is unclear.
123cd103951014321.875[               ----                              ]SH2_RIN3Src homology 2 (SH2) domain found in Ras and Rab interactor 3 (RIN3)-like proteins. RIN3, a member of the RIN (AKA Ras interaction/interference) family, have multifunctional domains including SH2 and proline-rich (PR) domains in the N-terminal region, and RIN-family homology (RH), VPS9 and Ras-association (RA) domains in the C-terminal region. RIN proteins function as Rab5-GEFs. RIN3 stimulated the formation of GTP-bound Rab31, a Rab5-subfamily GTPase, and formed enlarged vesicles and tubular structures, where it colocalized with Rab31. Transferrin appeared to be transported partly through the RIN3-positive vesicles to early endosomes. RIN3 interacts via its Pro-rich domain with amphiphysin II, which contains SH3 domain and participates in receptor-mediated endocytosis. RIN3, a Rab5 and Rab31 GEF, plays an important role in the transport pathway from plasma membrane to early endosomes. Mutations in the region between the SH2 and RH domain of RIN3 specifically abolished its GEF action on Rab31, but not Rab5. RIN3 was also found to partially translocate the cation-dependent mannose 6-phosphate receptor from the trans-Golgi network to peripheral vesicles and that this is dependent on its Rab31-GEF activity. These data indicate that RIN3 specifically acts as a GEF for Rab31. In general SH2 domains are involved in signal transduction. They typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
124PRK06423735021.684[                             ----                ]PRK06423phosphoribosylformylglycinamidine synthase; Provisional
125cd0104612310321.41E+02[  ---------                                      ]Rubrerythrin_likerubrerythrin-like, diiron-binding domain. Rubrerythrin-like domain, similar to rubrerythrin, a nonheme iron binding domain found in many air-sensitive bacteria and archaea, and member of a broad superfamily of ferritin-like diiron-carboxylate proteins. Rubrerythrin is thought to reduce hydrogen peroxide as part of an oxidative stress protection system. The rubrerythrin protein has two domains, a binuclear metal center located within a four-helix bundle of the rubrerythrin domain, and a rubredoxin domain. The Rubrerythrin-like domains in this CD are singular domains (no C-terminus rubredoxin domain) and are phylogenetically distinct from rubrerythrin domains of rubrerythrin-rubredoxin proteins.
126TIGR045108144621.31.5E+02[    ----                                         ]mod_pep_cycputative peptide modification system cyclase. Members of this family show homology to mononucleotidyl cyclases and to tetratricopeptide repeat (TPR) proteins. Members occur in next to two other markers of ribosomal peptide modification systems. One is a dehydrogenase related to SagB proteins from thiazole/oxazole modification systems. The other is the putative precursor, related to the nitrile hydratase-related leader peptide (NHLP) and nitrile hydratase alpha subunit families. These systems occur in many species of Xanthomonas and Stenotrophomonas, among others.
127pfam08966723220.949[               --                                ]DUF1882Domain of unknown function (DUF1882). This domain is found in a set of hypothetical bacterial proteins.
128cd12261733620.71.5E+02[               ---                               ]RRM1_3_MRN1RNA recognition motif 1 and 3 in RNA-binding protein MRN1 and similar proteins. This subfamily corresponds to the RRM1 and RRM3 of MRN1, also termed multicopy suppressor of RSC-NHP6 synthetic lethality protein 1, or post-transcriptional regulator of 69 kDa, which is an RNA-binding protein found in yeast. Although its specific biological role remains unclear, MRN1 might be involved in translational regulation. Members in this family contain four copies of conserved RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain).