match no. | target id | target length | alignment length | probability | E-value | coverage | match description |
1 | TIGR03555 | 325 | 284 | 100.0 | 2.1E-70 | [ ------------------------------------------------] | F420_mer | 5,10-methylenetetrahydromethanopterin reductase. Members of this protein family are 5,10-methylenetetrahydromethanopterin reductase, an F420-dependent enzyme of methanogenesis. It is restricted to the Archaea. |
2 | PRK02271 | 325 | 282 | 100.0 | 1.3E-65 | [-------------------------------------------------] | PRK02271 | methylenetetrahydromethanopterin reductase; Provisional |
3 | pfam00296 | 260 | 195 | 99.9 | 7.1E-27 | [ --------------------------------- ] | Bac_luciferase | Luciferase-like monooxygenase. |
4 | COG2141 | 336 | 256 | 99.9 | 7.7E-24 | [ ------------------------------------------ ] | SsuD | Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase) |
5 | TIGR03619 | 246 | 179 | 99.7 | 1.5E-17 | [ ------------------------------ ] | F420_Rv2161c | probable F420-dependent oxidoreductase, Rv2161c family. Coenzyme F420 has a limited phylogenetic distribution, including methanogenic archaea, Mycobacterium tuberculosis and related species, Colwellia psychrerythraea 34H, Rhodopseudomonas palustris HaA2, and others. Partial phylogenetic profiling identifies protein subfamilies, within the larger family called luciferase-like monooxygenanases (pfam00296), that appear only in F420-positive genomes and are likely to be F420-dependent. This model describes a domain found in a distinctive subset of bacterial luciferase homologs, found only in F420-biosynthesizing members of the Actinobacteria. |
6 | TIGR03560 | 227 | 176 | 99.5 | 1.1E-14 | [ ------------------------------ ] | F420_Rv1855c | probable F420-dependent oxidoreductase, Rv1855c family. Coenzyme F420 has a limited phylogenetic distribution, including methanogenic archaea, Mycobacterium tuberculosis and related species, Colwellia psychrerythraea 34H, Rhodopseudomonas palustris HaA2, and others. Partial phylogenetic profiling identifies protein subfamilies, within the larger family called luciferase-like monooxygenanases (pfam00296), that appear only in F420-positive genomes and are likely to be F420-dependent. This model describes one such subfamily, exemplified by Rv1855c from Mycobacterium tuberculosis. |
7 | TIGR03842 | 330 | 244 | 99.4 | 4.2E-13 | [ ------------------------------------------ ] | F420_CPS_4043 | F420-dependent oxidoreductase, CPS_4043 family. This model represents a family of putative F420-dependent oxidoreductases, fairly closely related to 5,10-methylenetetrahydromethanopterin reductase (mer, TIGR03555), both within the bacterial luciferase-like monoxygenase (LLM) family. A fairly deep split (to about 40 % sequence identity) in the present family separates a strictly Actinobacterial clade from an alpha/beta/gamma-proteobacterial clade, in which the member is often the only apparent F420-dependent LLM family member. The specific function, and whether Actinobacterial and Proteobacterial clades differ in function, are unknown. |
8 | cd01097 | 202 | 151 | 99.4 | 1.2E-12 | [---------------------------------- ] | Tetrahydromethanopterin_reductase | N5,N10-methylenetetrahydromethanopterin reductase (Mer) catalyzes the reduction of N5,N10-methylenetetrahydromethanopterin with reduced coenzyme F420 to N5-methyltetrahydromethanopterin and oxidized coenzyme F420. |
9 | TIGR03557 | 316 | 163 | 99.3 | 1.1E-12 | [ ---------------------------- ] | F420_G6P_family | F420-dependent oxidoreductase, G6PDH family. Members of this protein family include F420-dependent glucose-6-phosphate dehydrogenases (TIGR03554) and related proteins. All members of this family come from species that synthesize coenzyme F420, with the exception of those that belong to TIGR03885, a clade within this family in which cofactor binding may instead be directed to FMN. |
10 | TIGR03559 | 325 | 257 | 99.2 | 2E-11 | [ ----------------------------------------- ] | F420_Rv3520c | probable F420-dependent oxidoreductase, Rv3520c family. Members of this protein family are predicted to be oxidoreductases dependent on coenzyme F420. The family includes a single member in Mycobacterium tuberculosis (Rv3520c/MT3621) but four in Mycobacterium smegmatis. Prediction that this family is F420-dependent is based primarily on Partial Phylogenetic Profiling vs. F420 biosynthesis. |
11 | TIGR03857 | 329 | 179 | 99.2 | 2E-11 | [ ------------------------------ ] | F420_MSMEG_2249 | probable F420-dependent oxidoreductase, MSMEG_2249 family. Coenzyme F420 has a limited phylogenetic distribution, including methanogenic archaea, Mycobacterium tuberculosis and related species, Colwellia psychrerythraea 34H, Rhodopseudomonas palustris HaA2, and others. Partial phylogenetic profiling identifies protein subfamilies, within the larger family called luciferase-like monooxygenanases (pfam00296), that appear only in F420-positive genomes and are likely to be F420-dependent. This model describes a distinctive subfamily, found only in F420-biosynthesizing members of the Actinobacteria of the bacterial luciferase-like monooxygenase (LLM) superfamily. |
12 | TIGR04024 | 330 | 243 | 99.1 | 6.7E-11 | [ ------------------------------------------ ] | F420_NP1902A | coenzyme F420-dependent oxidoreductase, NP1902A family. This subfamily of the luciferase-like monooxygenases is restricted to the order Halobacteriales. SIMBAL analysis strongly suggests this oxidoreductase binds coenzyme F420 rather than FMN. Occasional annotations of members of this family as N5,N10-methylenetetrahydromethanopterin reductase appear to represent overly aggressive transfer of annotation. |
13 | TIGR04465 | 364 | 199 | 99.1 | 9.7E-11 | [ ---------------------------------- ] | ArgArg_F420 | TAT-translocated F420-dependent dehydrogenase, FGD2 family. Members of this family are F420-binding enzymes with a proven functional N-terminal twin-arginine translocation (TAT) signal. Members are homologous to the cytosolic F420-dependent glucose-6-phosphate dehydrogenase but do not share the same function. |
14 | TIGR04020 | 341 | 165 | 99.0 | 2.3E-10 | [ ---------------------------- ] | seco_metab_LLM | natural product biosynthesis luciferase-like monooxygenase domain. This model describes a subfamily within the bacterial luciferase-like monooxygenase (LLM) family that regularly occurs within large non-ribosomal protein synthases/polyketide synthases, but also as small proteins. The LLM family includes members that bind either FMN or F420, and FMN is more likely in this case because many members are from species that lack F420 biosynthesis capability. An example member is the MupA protein of mupirocin biosynthesis in Pseudomonas fluorescens NCIMB 10586. |
15 | TIGR04027 | 326 | 232 | 99.0 | 7.7E-11 | [ ----------------------------------------- ] | LLM_KPN_01858 | putative FMN-dependent luciferase-like monooxygenase, KPN_01858 family. This protein family consists of luciferase-like monooxygenases (LLM), and include KPN_01858 from Klebsiella pneumoniae as a representative member. Most are from species that lack F420 biosynthesis, so the family is likely to bind FMN as its cofactor. This family is closely associated with a binding protein-dependent ABC transporter, suggesting a role in catabolism. |
16 | TIGR03621 | 295 | 223 | 99.0 | 3.8E-10 | [ ------------------------------------------ ] | F420_MSMEG_2516 | probable F420-dependent oxidoreductase, MSMEG_2516 family. Coenzyme F420 is produced by methanogenic archaea, a number of the Actinomycetes (including Mycobacterium tuberculosis), and rare members of other lineages. The resulting information-rich phylogenetic profile identifies candidate F420-dependent oxidoreductases within the family of luciferase-like enzymes (pfam00296), where the species range for the subfamily encompasses many F420-positive genomes without straying beyond. This family is uncharacterized, and named for member MSMEG_2516 from Mycobacterium smegmatis. |
17 | TIGR03858 | 337 | 255 | 99.0 | 6.4E-10 | [ --------------------------------------------] | LLM_2I7G | probable oxidoreductase, LLM family. This model describes a highly conserved, somewhat broadly distributed family withing the luciferase-like monooxygenase (LLM) superfamily. Most members are from species incapable of synthesizing coenzyme F420, bound by some members of the LLM superfamily. Members, therefore, are more likely to use FMN as a cofactor. |
18 | TIGR03571 | 298 | 102 | 98.9 | 6.4E-10 | [ ----------------- ] | lucif_BA3436 | luciferase-type oxidoreductase, BA3436 family. This family is a distinct subgroup among members of the luciferase monooxygenase domain family. The larger family contains both FMN-binding enzymes (luciferase, alkane monooxygenase) and F420-binding enzymes (methylenetetrahydromethanopterin reductase, secondary alcohol dehydrogenase, glucose-6-phosphate dehydrogenase). Although some members of the domain family bind coenzyme F420 rather than FMN, members of this family are from species that lack the genes for F420 biosynthesis. A crystal structure, but not function, is known (but unpublished) for the member from Bacillus cereus, PDB|2B81. |
19 | TIGR03558 | 323 | 230 | 98.9 | 1.3E-09 | [ ------------------------------------------ ] | oxido_grp_1 | luciferase family oxidoreductase, group 1. The Pfam domain family pfam00296 is named for luciferase-like monooxygenases, but the family also contains several coenzyme F420-dependent enzymes. This protein family represents a well-resolved clade within family pfam00296 and shows no restriction to coenzyme F420-positive species, unlike some other clades within pfam00296. |
20 | TIGR03617 | 318 | 249 | 98.8 | 5.9E-09 | [ ----------------------------------------- ] | F420_MSMEG_2256 | probable F420-dependent oxidoreductase, MSMEG_2256 family. Coenzyme F420 has a limited phylogenetic distribution, including methanogenic archaea, Mycobacterium tuberculosis and related species, Colwellia psychrerythraea 34H, Rhodopseudomonas palustris HaA2, and others. Partial phylogenetic profiling identifies protein subfamilies, within the larger family called luciferase-like monooxygenanases (pfam00296), that appear only in F420-positive genomes and are likely to be F420-dependent. This model describes one such subfamily, exemplified by MSMEG_2256 from Mycobacterium smegmatis. |
21 | TIGR03554 | 331 | 173 | 98.6 | 6.2E-08 | [ ----------------------------- ] | F420_G6P_DH | glucose-6-phosphate dehydrogenase (coenzyme-F420). This family consists of the F420-dependent glucose-6-phosphate dehydrogenase of Mycobacterium and Nocardia. It shows homology to several other F420-dependent enzymes rather than to the NAD or NADP-dependent glucose-6-phosphate dehydrogenases. |
22 | TIGR04021 | 350 | 205 | 98.3 | 2.7E-06 | [ ---------------------------------- ] | LLM_DMSO2_sfnG | dimethyl sulfone monooxygenase SfnG. This family of FMNH2-dependent members of the luciferase-like monooxygenase (LLM) family includes SfnG, a monooxygenase that converts dimethylsulphone (DMSO2) to methanesulphonate. This step can be followed immediately by methanesulfonate sulfonatase (an alkanesulfonate monooxygenase - see TIGR03565) for the FMNH2-dependent conversion an inorganic form. |
23 | TIGR03564 | 265 | 243 | 98.3 | 1.1E-06 | [ ------------------------------------------ ] | F420_MSMEG_4879 | F420-dependent oxidoreductase, MSMEG_4879 family. Coenzyme F420 is produced by methanogenic archaea, a number of the Actinomycetes (including Mycobacterium tuberculosis), and rare members of other lineages. The resulting information-rich phylogenetic profile identifies candidate F420-dependent oxidoreductases within the family of luciferase-like enzymes (pfam00296), where the species range for the subfamily encompasses many F420-positive genomes without straying beyond. This family is uncharacterized, and named for member MSMEG_4879 from Mycobacterium smegmatis. |
24 | cd01094 | 244 | 175 | 98.3 | 7.5E-07 | [ ------------------------------ ] | Alkanesulfonate_monoxygenase | Alkanesulfonate monoxygenase is the monoxygenase of a two-component system that catalyzes the conversion of alkanesulfonates to the corresponding aldehyde and sulfite. Alkanesulfonate monoxygenase (SsuD) has an absolute requirement for reduced flavin mononucleotide (FMNH2), which is provided by the NADPH-dependent FMN oxidoreductase (SsuE). |
25 | TIGR03841 | 301 | 172 | 97.9 | 9.6E-06 | [ ----------------------------- ] | F420_Rv3093c | probable F420-dependent oxidoreductase, Rv3093c family. This model describes a small family of enzymes in the bacterial luciferase-like monooxygenase family, which includes F420-dependent enzymes such as N5,N10-methylenetetrahydromethanopterin reductase as well as FMN-dependent enzymes. All members of this family are from species that produce coenzyme F420; SIMBAL analysis suggests that members of this family bind F420 rather than FMN. |
26 | TIGR03854 | 290 | 174 | 97.9 | 2.3E-05 | [ ------------------------------ ] | F420_MSMEG_3544 | probable F420-dependent oxidoreductase, MSMEG_3544 family. Coenzyme F420 has a limited phylogenetic distribution, including methanogenic archaea, Mycobacterium tuberculosis and related species, Colwellia psychrerythraea 34H, Rhodopseudomonas palustris HaA2, and others. Partial phylogenetic profiling identifies protein subfamilies, within the larger family called luciferase-like monooxygenanases (pfam00296), that appear only in F420-positive genomes and are likely to be F420-dependent. This model describes a small family, closely related to other such families in the putative F420-binding region, exemplified by MSMEG_3544 in Mycobacterium smegmatis. |
27 | TIGR03565 | 346 | 250 | 97.8 | 9.6E-05 | [ -------------------------------------------- ] | alk_sulf_monoox | alkanesulfonate monooxygenase, FMNH(2)-dependent. Members of this protein family are monooxygenases that catalyze desulfonation of aliphatic sulfonates such as methane sulfonate. This enzyme uses reduced FMN, although various others members of the same luciferase-like monooxygenase family (pfam00296) are F420-dependent enzymes. |
28 | cd01096 | 315 | 233 | 97.6 | 0.00019 | [ ----------------------------------------- ] | Alkanal_monooxygenase | Alkanal monooxygenase are flavin monoxygenases. Molecular oxygen is activated by reaction with reduced flavin mononucleotide (FMNH2) and reacts with an aldehyde to yield the carboxylic acid, oxidized flavin (FMN) and a blue-green light. Bacterial luciferases are heterodimers made of alpha and beta subunits which are homologous. The single activer center is on the alpha subunit. The alpha subunit has a stretch of 30 amino acid residues that is not present in the beta subunit. The beta subunit does not contain the active site and is required for the formation of the fully active heterodimer. The beta subunit does not contribute anything directly to the active site. Its role is probably to stabilize the high quantum yield conformation of the alpha subunit through interactionbs across the subunit interface. |
29 | PRK00719 | 378 | 130 | 97.1 | 0.0018 | [ ------------------------ ] | PRK00719 | alkanesulfonate monooxygenase; Provisional |
30 | TIGR03856 | 249 | 129 | 96.3 | 0.0014 | [ ------------------------ ] | F420_MSMEG_2906 | probable F420-dependent oxidoreductase, MSMEG_2906 family. This model describes a small family of enzymes in the bacterial luciferase-like monooxygenase family, which includes F420-dependent enzymes such as N5,N10-methylenetetrahydromethanopterin reductase as well as FMN-dependent enzymes. All members of this family are from species that produce coenzyme F420; SIMBAL analysis suggests that members of this family bind F420 rather than FMN. |
31 | TIGR03885 | 315 | 159 | 95.6 | 0.034 | [ ------------------------------ ] | flavin_revert | probable non-F420 flavinoid oxidoreductase. This model represents a clade of proteins within the larger subfamily TIGR03557. The parent model includes the F420-dependent glucose-6-phosphate dehydrogenase (TIGR03554) and many other proteins. Excepting the members of this family, all members of TIGR03557 occur in species capable of synthesizing coenzyme F420. All members of the seed alignment for this model are from species that lack F420 biosynthesis. It is suggested that members of this family bind FMN, or FO, or a novel flavinoid cofactor, but not F420 per se. |
32 | pfam13424 | 78 | 61 | 95.2 | 0.068 | [ ---------- ] | TPR_12 | Tetratricopeptide repeat. |
33 | TIGR04036 | 355 | 227 | 93.9 | 0.037 | [ ------------------------------------------ ] | LLM_CE1758_fam | putative luciferase-like monooxygenase, FMN-dependent, CE1758 family. This tightly conserved subfamily of the bacterial luciferase-like monooxygenase (LLM) family, with members showing > 60 % pairwise sequence identity, includes proteins from both species with and species without the ability to make coenzyme F420. Therefore, the like cofactor is FMN rather than F420. The presence of three members in Kineococcus radiotolerans SRS30216 and two in Saccharopolyspora erythraea NRRL 2338 suggest closely related (subfamily) rather than exactly conserved (equivalog) function. Gene neighborhoods around members are not conserved. |
34 | TIGR03860 | 422 | 242 | 93.3 | 0.1 | [ ------------------------------------------ ] | FMN_nitrolo | FMN-dependent oxidoreductase, nitrilotriacetate monooxygenase family. This model represents a distinctive clade, in which all characterized members are FMN-binding, within the larger family of luciferase-like monooxygenases (LLM), among which there are both FMN- and F420-binding enzymes. A well-characterized member is nitrilotriacetate monooxygenase from Aminobacter aminovorans (Chelatobacter heintzii), where nitrilotriacetate is a chelating agent used in detergents. |
35 | pfam13424 | 78 | 73 | 93.2 | 0.31 | [ ----------- ] | TPR_12 | Tetratricopeptide repeat. |
36 | PRK10508 | 333 | 69 | 88.6 | 0.027 | [ ----------- ] | PRK10508 | hypothetical protein; Provisional |
37 | TIGR03620 | 278 | 133 | 86.5 | 0.39 | [ --------------------------- ] | F420_MSMEG_4141 | probable F420-dependent oxidoreductase, MSMEG_4141 family. Members of this protein family, related to F420-dependent oxidoreductases within the larger family of a bacterial luciferase (an FMN-dependent enzyme), occurs only within the small subset of species that synthesize F420. Most such proteins are from members of the Actinobacteria, but at least one species, Sphingomonas wittichii, belongs to the Alphaproteobacteria. |
38 | pfam13374 | 42 | 34 | 79.7 | 0.53 | [ ----- ] | TPR_10 | Tetratricopeptide repeat. |
39 | cd01095 | 358 | 99 | 77.9 | 1.5 | [ ----------------- ] | Nitrilotriacetate_monoxgenase | nitrilotriacetate monoxygenase oxidizes nitrilotriacetate utilizing reduced flavin mononucleotide (FMNH2) and oxygen. The FMNH2 is provided by an NADH:flavin mononucleotide (FMN) oxidorductase that uses NADH to reduce FMN to FMNH2. |
40 | pfam00515 | 34 | 27 | 69.6 | 1 | [ ---- ] | TPR_1 | Tetratricopeptide repeat. |
41 | cd00347 | 90 | 64 | 59.0 | 10 | [ ------------- ] | Flavin_utilizing_monoxygenases | Flavin-utilizing monoxygenases |
42 | pfam13176 | 36 | 30 | 57.9 | 3.9 | [ ----- ] | TPR_7 | Tetratricopeptide repeat. |
43 | cd15832 | 278 | 84 | 56.7 | 72 | [ ------------- ] | SNAP | Soluble N-ethylmaleimide-sensitive factor (NSF) Attachment Protein family. Members of the soluble NSF attachment protein (SNAP) family are involved in intracellular membrane trafficking, including vesicular transport between the endoplasmic reticulum and Golgi apparatus. Higher eukaryotes contain three isoforms of SNAPs: alpha, beta, and gamma. Alpha-SNAP is universally present in eukaryotes and acts as an adaptor protein between SNARE (integral membrane SNAP receptor) and NSF for recruitment to the 20S complex. Beta-SNAP is brain-specific and shares high sequence identity (about 85%) with alpha-SNAP. Gamma-SNAP is weakly related (about 20-25% identity) to the two other isoforms, and is ubiquitous. It may help regulate the activity of the 20S complex. The X-ray structures of vertebrate gamma-SNAP and yeast Sec17, a SNAP family member, show similar all-helical structures consisting of an N-terminal extended twisted sheet of four Tetratricopeptide repeat (TPR)-like helical hairpins and a C-terminal helical bundle. |
44 | cd07592 | 223 | 54 | 54.5 | 36 | [ ---------- ] | BAR_Endophilin_A | The Bin/Amphiphysin/Rvs (BAR) domain of Endophilin-A. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. Endophilins are accessory proteins, localized at synapses, which interact with the endocytic proteins, dynamin and synaptojanin. They are essential for synaptic vesicle formation from the plasma membrane. They interact with voltage-gated calcium channels, thus linking vesicle endocytosis to calcium regulation. They also play roles in virus budding, mitochondrial morphology maintenance, receptor-mediated endocytosis inhibition, and endosomal sorting. Endophilins contain an N-terminal N-BAR domain (BAR domain with an additional N-terminal amphipathic helix), followed by a variable region containing proline clusters, and a C-terminal SH3 domain. They are classified into two types, A and B. Vertebrates contain three endophilin-A isoforms. Endophilin-A proteins are enriched in the brain and play multiple roles in receptor-mediated endocytosis. They tubulate membranes and regulate calcium influx into neurons to trigger the activation of the endocytic machinery. They are also involved in the sorting of plasma membrane proteins, actin filament assembly, and the uncoating of clathrin-coated vesicles for fusion with endosomes. The BAR domains of endophilin-A1 and A3 form crescent-shaped dimers that can detect membrane curvature and drive membrane bending. |
45 | pfam13181 | 34 | 26 | 53.7 | 5.7 | [ ---- ] | TPR_8 | Tetratricopeptide repeat. |
46 | PRK09490 | 1229 | 59 | 49.7 | 21 | [ ---------- ] | metH | B12-dependent methionine synthase; Provisional |
47 | PRK13441 | 180 | 86 | 48.8 | 58 | [ -------------- ] | PRK13441 | F0F1 ATP synthase subunit delta; Provisional |
48 | cd07615 | 223 | 51 | 48.7 | 29 | [ --------- ] | BAR_Endophilin_A3 | The Bin/Amphiphysin/Rvs (BAR) domain of Endophilin-A3. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. Endophilins are accessory proteins localized at synapses that interacts with the endocytic proteins, dynamin and synaptojanin. They are essential for synaptic vesicle formation from the plasma membrane. They interact with voltage-gated calcium channels, thus linking vesicle endocytosis to calcium regulation. They also play roles in virus budding, mitochondrial morphology maintenance, receptor-mediated endocytosis inhibition, and endosomal sorting. Endophilins contain an N-terminal N-BAR domain (BAR domain with an additional N-terminal amphipathic helix), followed by a variable region containing proline clusters, and a C-terminal SH3 domain. They are classified into two types, A and B. Endophilin-A proteins are enriched in the brain and play multiple roles in receptor-mediated endocytosis. Endophilin-A3 (or endophilin-3) is also referred to as SH3P13 (SH3 domain containing protein 13) or SH3GL3 (SH3 domain containing Grb2-like protein 3). It regulates Arp2/3-dependent actin filament assembly during endocytosis. It binds N-WASP through its SH3 domain and enhances the ability of N-WASP to activate the Arp2/3 complex. Endophilin-A3 co-localizes with the vesicular glutamate transporter 1 (VGLUT1), and may play an important role in the synaptic release of glutamate. |
49 | pfam14938 | 282 | 85 | 48.4 | 1.5E+02 | [ ------------- ] | SNAP | Soluble NSF attachment protein, SNAP. The soluble NSF attachment protein (SNAP) proteins are involved in vesicular transport between the endoplasmic reticulum and Golgi apparatus. They act as adaptors between SNARE (integral membrane SNAP receptor) proteins and NSF (N-ethylmaleimide-sensitive factor). They are structurally similar to TPR repeats. |
50 | pfam04053 | 431 | 119 | 45.0 | 35 | [ -------------------- ] | Coatomer_WDAD | Coatomer WD associated region. This region is composed of WD40 repeats. |
51 | pfam12919 | 472 | 114 | 41.6 | 31 | [ ------------------------ ] | TcdA_TcdB | TcdA/TcdB catalytic glycosyltransferase domain. This domain represents the N-terminal glycosyltransferase from a set of toxins found in some bacteria. This domain in TcdB glycosylates the host RhoA protein. |
52 | TIGR03612 | 355 | 88 | 39.4 | 44 | [ ------------- ] | RutA | pyrimidine utilization protein A. This protein is observed in operons extremely similar to that characterized in E. coli K-12 responsible for the import and catabolism of pyrimidines, primarily uracil. This protein is a member of the luciferase family defined by pfam00296 and is likely a FMN-dependent monoxygenase. |
53 | PRK12381 | 406 | 63 | 38.8 | 55 | [ ---------- ] | PRK12381 | bifunctional succinylornithine transaminase/acetylornithine transaminase; Provisional |
54 | cd00840 | 186 | 65 | 37.5 | 67 | [ ----------- ] | MPP_Mre11_N | Mre11 nuclease, N-terminal metallophosphatase domain. Mre11 (also known as SbcD in Escherichia coli) is a subunit of the MRX protein complex. This complex includes: Mre11, Rad50, and Xrs2/Nbs1, and plays a vital role in several nuclear processes including DNA double-strand break repair, telomere length maintenance, cell cycle checkpoint control, and meiotic recombination, in eukaryotes. During double-strand break repair, the MRX complex is required to hold the two ends of a broken chromosome together. In vitro studies show that Mre11 has 3'-5' exonuclease activity on dsDNA templates and endonuclease activity on dsDNA and ssDNA templates. In addition to the N-terminal phosphatase domain, the eukaryotic MRE11 members of this family have a C-terminal DNA binding domain (not included in this alignment model). MRE11-like proteins are found in prokaryotes and archaea was well as in eukaryotes. Mre11 belongs to the metallophosphatase (MPP) superfamily. MPPs are functionally diverse, but all share a conserved domain with an active site consisting of two metal ions (usually manganese, iron, or zinc) coordinated with octahedral geometry by a cage of histidine, aspartate, and asparagine residues. The MPP superfamily includes: Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat debranching enzymes, YfcE-like phosphodiesterases, purple acid phosphatases (PAPs), YbbF-like UDP-2,3-diacylglucosamine hydrolases, and acid sphingomyelinases (ASMases). The conserved domain is a double beta-sheet sandwich with a di-metal active site made up of residues located at the C-terminal side of the sheets. This domain is thought to allow for productive metal coordination. |
55 | cd07985 | 235 | 120 | 37.2 | 87 | [ -------------------- ] | LPLAT_GPAT | Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: GPAT. Lysophospholipid acyltransferase (LPLAT) superfamily member: glycerol-3-phosphate 1-acyltransferase (GPAT, PlsB). LPLATs are acyltransferases of de novo and remodeling pathways of glycerophospholipid biosynthesis which catalyze the incorporation of an acyl group from either acylCoAs or acyl-acyl carrier proteins (acylACPs) into acceptors such as glycerol 3-phosphate, dihydroxyacetone phosphate or lyso-phosphatidic acid. This subgroup includes glycerol-3-phosphate 1-acyltransferase (GPAT, PlsB). |
56 | TIGR00959 | 428 | 91 | 36.9 | 58 | [ -------------- ] | ffh | signal recognition particle protein. This model represents Ffh (Fifty-Four Homolog), the protein component that forms the bacterial (and organellar) signal recognition particle together with a 4.5S RNA. Ffh is a GTPase homologous to eukaryotic SRP54 and also to the GTPase FtsY (TIGR00064) that is the receptor for the signal recognition particle. |
57 | pfam10140 | 359 | 26 | 36.4 | 39 | [ ----- ] | YukC | WXG100 protein secretion system (Wss), protein YukC. Members of this family of proteins include predicted membrane proteins homologous to YukC in B. subtilis. The YukC protein family would participate to the formation of a translocon required for the secretion of WXG100 proteins (pfam06013) in monoderm bacteria, the WXG100 protein secretion system (Wss). This family includes EssB in Staphylococcus aureus. |
58 | cd02678 | 75 | 55 | 35.1 | 99 | [ ----------- ] | MIT_VPS4 | MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in intracellular protein transport proteins of the AAA-ATPase family. The molecular function of the MIT domain is unclear. |
59 | pfam02236 | 85 | 28 | 33.5 | 45 | [ ---- ] | Viral_DNA_bi | Viral DNA-binding protein, all alpha domain. This family represents a domain of the viral DNA- binding protein, a multi functional protein involved in DNA replication and transcription control. |
60 | pfam14938 | 282 | 46 | 32.1 | 3.7E+02 | [ ------- ] | SNAP | Soluble NSF attachment protein, SNAP. The soluble NSF attachment protein (SNAP) proteins are involved in vesicular transport between the endoplasmic reticulum and Golgi apparatus. They act as adaptors between SNARE (integral membrane SNAP receptor) proteins and NSF (N-ethylmaleimide-sensitive factor). They are structurally similar to TPR repeats. |
61 | pfam11817 | 240 | 52 | 31.3 | 82 | [ -------- ] | Foie-gras_1 | Foie gras liver health family 1. Mutating the gene foie gras in zebrafish has been shown to affect development; the mutants develop large, lipid-filled hepatocytes in the liver, resembling those in individuals with fatty liver disease. Foie-gras protein is long and has several well-defined domains though none of them has a known function. We have annotated this one as the first. The C-terminus of this region contains TPR repeats. |
62 | TIGR03926 | 377 | 26 | 28.5 | 72 | [ ----- ] | T7_EssB | type VII secretion protein EssB. Members of this family are associated with type VII secretion of WXG100 family targets in the Firmicutes, but not in the Actinobacteria. This protein is designated YukC in Bacillus subtilis and EssB is Staphylococcus aureus. |
63 | TIGR01753 | 140 | 36 | 27.9 | 51 | [ ----- ] | flav_short | flavodoxin, short chain. Flavodoxins are small redox-active proteins with a flavin mononucleotide (FMN) prosthetic group. They can act in nitrogen fixation by nitrogenase, in sulfite reduction, and light-dependent NADP+ reduction in during photosynthesis, among other roles. This model describes the short chain type. Many of these are involved in sulfite reduction. |
64 | cd00777 | 280 | 51 | 27.7 | 48 | [--------- ] | AspRS_core | Asp tRNA synthetase (aspRS) class II core domain. Class II assignment is based upon its structure and the presence of three characteristic sequence motifs. AspRS is a homodimer, which attaches a specific amino acid to the 3' OH group of ribose of the appropriate tRNA. The catalytic core domain is primarily responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. AspRS in this family differ from those found in the AsxRS family by a GAD insert in the core domain. |
65 | PRK03103 | 409 | 41 | 27.1 | 25 | [ -------] | PRK03103 | DNA polymerase IV; Reviewed |
66 | cd03307 | 326 | 96 | 27.0 | 31 | [ ---------------- ] | Mta_CmuA_like | MtaA_CmuA_like family. MtaA/CmuA, also MtsA, or methyltransferase 2 (MT2) MT2-A and MT2-M isozymes, are methylcobamide:Coenzyme M methyltransferases, which play a role in metabolic pathways of methane formation from various substrates, such as methylated amines and methanol. Coenzyme M, 2-mercaptoethylsulfonate or CoM, is methylated during methanogenesis in a reaction catalyzed by three proteins. A methyltransferase methylates the corrinoid cofactor, which is bound to a second polypeptide, a corrinoid protein. The methylated corrinoid protein then serves as a substrate for MT2-A and related enzymes, which methylate CoM. |
67 | pfam12095 | 83 | 51 | 26.9 | 83 | [ ---------] | DUF3571 | Protein of unknown function (DUF3571). This family of proteins is functionally uncharacterized. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 85 to 97 amino acids in length. |
68 | pfam00448 | 195 | 58 | 26.7 | 99 | [ ---------- ] | SRP54 | SRP54-type protein, GTPase domain. This family includes relatives of the G-domain of the SRP54 family of proteins. |
69 | pfam00029 | 107 | 14 | 25.8 | 28 | [ -- ] | Connexin | Connexin. |
70 | cd09274 | 121 | 23 | 25.7 | 44 | [ ---- ] | RNase_HI_RT_Ty3 | Ty3/Gypsy family of RNase HI in long-term repeat retroelements. Ribonuclease H (RNase H) enzymes are divided into two major families, Type 1 and Type 2, based on amino acid sequence similarities and biochemical properties. RNase H is an endonuclease that cleaves the RNA strand of an RNA/DNA hybrid in a sequence non-specific manner in the presence of divalent cations. RNase H is widely present in various organisms, including bacteria, archaea and eukaryotes. RNase HI has also been observed as adjunct domains to the reverse transcriptase gene in retroviruses, in long-term repeat (LTR)-bearing retrotransposons and non-LTR retrotransposons. RNase HI in LTR retrotransposons perform degradation of the original RNA template, generation of a polypurine tract (the primer for plus-strand DNA synthesis), and final removal of RNA primers from newly synthesized minus and plus strands. The catalytic residues for RNase H enzymatic activity, three aspartatic acids and one glutamic acid residue (DEDD), are unvaried across all RNase H domains. Phylogenetic patterns of RNase HI of LTR retroelements is classified into five major families, Ty3/Gypsy, Ty1/Copia, Bel/Pao, DIRS1 and the vertebrate retroviruses. Ty3/Gypsy family widely distributed among the genomes of plants, fungi and animals. RNase H inhibitors have been explored as an anti-HIV drug target because RNase H inactivation inhibits reverse transcription. |
71 | pfam07719 | 34 | 25 | 25.6 | 32 | [ ---- ] | TPR_2 | Tetratricopeptide repeat. This Pfam entry includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by pfam00515. |
72 | pfam09325 | 236 | 96 | 25.2 | 1.6E+02 | [ --------------- ] | Vps5 | Vps5 C terminal like. Vps5 is a sorting nexin that functions in membrane trafficking. This is the C terminal dimerization domain. |
73 | cd03517 | 95 | 62 | 24.6 | 35 | [ --------- ] | Link_domain_CSPGs_modules_1_3 | Link_domain_CSPGs_modules_1_3; this extracellular link domain is found in the first and third link modules of the chondroitin sulfate proteoglycan core protein (CSPG) aggrecan. In addition, it is found in the first link module of three other CSPGs: versican, neurocan, and brevican. The link domain is a hyaluronan (HA)-binding domain. CSPGs are characterized by an N-terminal globular domain (G1 domain) containing two contiguous link modules (modules 1 and 2). Both link modules of the G1 domain of aggrecan are involved in interaction with HA. In addition, aggrecan contains a second globular domain (G2) which contains link modules 3 and 4. G2 appears to lack HA-binding activity. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues. Members of the vertebrate HPLN (hyaluronan/HA and proteoglycan binding link) protein family are physically linked adjacent to CSPG genes. |
74 | COG2264 | 300 | 60 | 23.4 | 1.2E+02 | [ ---------- ] | PrmA | Ribosomal protein L11 methylase PrmA |
75 | cd02649 | 306 | 70 | 22.5 | 16 | [ ----------- ] | nuc_hydro_CeIAG | nuc_hydro_CeIAG: Nucleoside hydrolases similar to the inosine-adenosine-guanosine-preferring nucleoside hydrolase from Caenorhabditis elegans. Nucleoside hydrolases cleave the N-glycosidic bond in nucleosides generating ribose and the respective base. These enzymes vary in their substrate specificity. This group contains eukaryotic, bacterial and archeal proteins similar to the purine-preferring nucleoside hydrolase (IAG-NH) from C. elegans and the salivary purine nucleosidase from Aedes aegypti. C. elegans IAG-NH exhibits a high affinity for the substrate analogue p-nitrophenylriboside (p-NPR). |
76 | cd06068 | 144 | 25 | 22.4 | 42 | [ ------ ] | H2MP_like-1 | Putative |
77 | pfam09840 | 190 | 46 | 22.3 | 2.6E+02 | [ ------- ] | DUF2067 | Uncharacterized protein conserved in archaea (DUF2067). This domain, found in various archaeal proteins, has no known function. |
78 | pfam09731 | 493 | 41 | 21.7 | 1.5E+02 | [ ------ ] | Mitofilin | Mitochondrial inner membrane protein. Mitofilin controls mitochondrial cristae morphology. Mitofilin is enriched in the narrow space between the inner boundary and the outer membranes, where it forms a homotypic interaction and assembles into a large multimeric protein complex. The first 78 amino acids contain a typical amino-terminal-cleavable mitochondrial presequence rich in positive-charged and hydroxylated residues and a membrane anchor domain. In addition, it has three centrally located coiled coil domains. |
79 | cd14391 | 36 | 27 | 21.7 | 71 | [ ----- ] | UBA_II_E2_UBCD4 | UBA domain found in Drosophila melanogaster ubiquitin-conjugating enzyme E2-22 kDa (UbcD4) and similar proteins. UbcD4, also called ubiquitin carrier protein or ubiquitin-protein ligase, is a class II E2 ubiquitin-conjugating enzyme encoded by Drosophila E2 gene which is only expressed in pole cells in embryos. It is a putative E2 enzyme homologous to the Huntingtin interacting protein-2 (HIP2) of human. UbcD4 specifically interacts with the polyubiquitin-binding subunit of the proteasome. It contains a C-terminal ubiquitin-associated (UBA) domain in addition to an N-terminal catalytic ubiquitin-conjugating enzyme E2 (UBCc) domain. |
80 | pfam07626 | 68 | 38 | 21.4 | 1E+02 | [ ------ ] | PSD3 | Protein of unknown function (DUF1587). A region of similarity shared by several Rhodopirellula baltica cytochrome-like proteins that are predicted to be secreted. These proteins also match pfam07624. |
81 | pfam04694 | 60 | 15 | 21.2 | 54 | [ -- ] | Corona_3 | Coronavirus ORF3 protein. |
82 | TIGR03246 | 397 | 72 | 20.8 | 1.7E+02 | [ ------------ ] | arg_catab_astC | succinylornithine transaminase family. Members of the seed alignment for this protein family are the enzyme succinylornithine transaminase (EC 2.6.1.81), which catalyzes the third of five steps in arginine succinyltransferase (AST) pathway, an ammonia-releasing pathway of arginine degradation. All seed alignment sequences are found within arginine succinyltransferase operons, and all proteins that score above 820.0 bits should function as succinylornithine transaminase. However, a number of sequences extremely closely related in sequence, found in different genomic contexts, are likely to act in different biological processes and may act on different substrates. This model is desigated subfamily rather than equivalog, pending further consideration, for this reason. |
83 | pfam04674 | 273 | 33 | 20.4 | 25 | [ ----- ] | Phi_1 | Phosphate-induced protein 1 conserved region. Family of conserved plant proteins. Conserved region identified in a phosphate-induced protein of unknown function. |
84 | pfam04752 | 177 | 54 | 20.1 | 1.6E+02 | [ -------- ] | ChaC | ChaC-like protein. The ChaC protein is thought to be associated with the putative ChaA Ca2+/H+ cation transport protein in Escherichia coli. Its function is not known. This family also includes homologues regions from several other bacterial and eukaryotic proteins. |
85 | COG0783 | 156 | 33 | 20.0 | 1.4E+02 | [ ----- ] | Dps | DNA-binding ferritin-like protein (oxidative damage protectant) |