match no.target idtarget lengthalignment lengthprobabilityE-valuecoveragematch description
1TIGR0369810795100.01.3E-28[   --------------------------------------------- ]clan_AA_DTGFclan AA aspartic protease, AF_0612 family. Members of this protein family are clan AA aspartic proteases, related to family TIGR02281. These proteins resemble retropepsins, pepsin-like proteases of retroviruses such as HIV. Members of this family are found in archaea and bacteria.
2COG55501259899.45.8E-13[   ----------------------------------------------]COG5550Predicted aspartyl protease
3cd05484918597.70.00086[ ----------------------------------------        ]retropepsin_like_LTR_2Retropepsins_like_LTR, pepsin-like aspartate proteases. Retropepsin of retrotransposons with long terminal repeats are pepsin-like aspartate proteases. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
4pfam13650896497.60.00055[ -------------------------------                 ]Asp_protease_2Aspartyl protease. This family consists of predicted aspartic proteases, typically from 180 to 230 amino acids in length, in MEROPS clan AA. This model describes the well-conserved 121-residue C-terminal region. The poorly conserved, variable length N-terminal region usually contains a predicted transmembrane helix.
5pfam00077999097.00.0034[ --------------------------------------------    ]RVPRetroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein; usually pol, more rarely gag. Retroviral proteases appear to be homologous to a single domain of the two-domain eukaryotic aspartyl proteases such as pepsins, cathepsins, and renins (pfam00026).
6cd00303927496.90.012[ ------------------------------------            ]retropepsin_likeRetropepsins; pepsin-like aspartate proteases. The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements, as well as eukaryotic dna-damage-inducible proteins (DDIs), and bacterial aspartate peptidases. While fungal and mammalian pepsins are bilobal proteins with structurally related N and C-terminals, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
7pfam13975725696.50.01[ ---------------------------                     ]gag-asp_proteasgag-polyprotein putative aspartyl protease. This family of putative aspartyl proteases is found pre-dominantly in retroviral proteins.
8TIGR022811219095.60.17[-------------------------------------------      ]clan_AA_DTGAclan AA aspartic protease, TIGR02281 family. This family consists of predicted aspartic proteases, typically from 180 to 230 amino acids in length, in MEROPS clan AA. This model describes the well-conserved 121-residue C-terminal region. The poorly conserved, variable length N-terminal region usually contains a predicted transmembrane helix. Sequences in the seed alignment and those scoring above the trusted cutoff are Proteobacterial; homologs scroing between trusted and noise are found in Pyrobaculum aerophilum str. IM2 (archaeal), Pirellula sp. (Planctomycetes), and Nostoc sp. PCC 7120 (Cyanobacteria).
9cd05483966394.80.28[ ------------------------------                  ]retropepsin_like_bacteriaBacterial aspartate proteases, retropepsin-like protease family. This family of bacteria aspartate proteases is a subfamily of retropepsin-like protease family, which includes enzymes from retrovirus and retrotransposons. While fungal and mammalian pepsin-like aspartate proteases are bilobal proteins with structurally related N- and C-termini, this family of bacteria aspartate proteases is half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate proteases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
10COG35772158993.90.36[-------------------------------------------      ]COG3577Predicted aspartyl protease
11cd05481936993.60.42[ ---------------------------------               ]retropepsin_like_LTR_1Retropepsins_like_LTR; pepsin-like aspartate protease from retrotransposons with long terminal repeats. Retropepsin of retrotransposons with long terminal repeats are pepsin-like aspartate proteases. While fungal and mammalian pepsins are bilobal proteins with structurally related N and C-terminals, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
12cd054791249093.50.49[ ----------------------------------------------- ]RP_DDIRP_DDI; retropepsin-like domain of DNA damage inducible protein. The family represents the retropepsin-like domain of DNA damage inducible protein. DNA damage inducible protein has a retropepsin-like domain and an amino-terminal ubiquitin-like domain and/or a UBA (ubiquitin-associated) domain. This CD represents the retropepsin-like domain of DDI.
13PRK074193041479.90.54[                ------                           ]PRK074191,4-dihydroxy-2-naphthoate octaprenyltransferase; Provisional
14TIGR022352851378.80.65[                ------                           ]menA_cyano-plnt1,4-dihydroxy-2-naphthoate phytyltransferase. This family of phytyltransferases, found in plants and cyanobacteria, are involved in the biosythesis of phylloquinone (Vitamin K1). Phylloquinone is a critical component of photosystem I. The closely related MenA enzyme from bacteria transfers a prenyl group (which only differs in the saturation of the isoprenyl groups) in the biosynthesis of menaquinone. Activity towards both substrates in certain organisms should be considered a possibility.
15pfam13975723770.76[                           ------------------    ]gag-asp_proteasgag-polyprotein putative aspartyl protease. This family of putative aspartyl proteases is found pre-dominantly in retroviral proteins.
16cd06095861859.27.8[  --------                                       ]RP_RTVL_H_likeRetropepsin of the RTVL_H family of human endogenous retrovirus-like elements. This family includes aspartate proteases from retroelements with LTR (long terminal repeats) including the RTVL_H family of human endogenous retrovirus-like elements. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
17pfam02070251558.64[                 ------                          ]NMUNeuromedin U.
18pfam087572985252.031[            --------------------------------     ]CotHCotH protein. Members of this family include the spore coat protein H (cotH).
19pfam096681248950.813[ --------------------------------------------    ]Asp_proteaseAspartyl protease. This family of eukaryotic aspartyl proteases have a fold similar to retroviral proteases which implies they function proteolytically during regulated protein turnover.
20cd05482878447.784[ ----------------------------------------        ]HIV_retropepsin_likeRetropepsins, pepsin-like aspartate proteases. This is a subfamily of retropepsins. The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
21pfam034192917944.91.2E+02[--------------------------------------           ]Peptidase_U4Sporulation factor SpoIIGA.
22TIGR03807272241.08.9[            ------------                         ]RR_fam_repeatputative cofactor-binding repeat. This model describes a small repeat found in a family of proteins that crosses the plasma membrane by twin-arginine translation, which usually signifies the presence of a bound cofactor. This repeat shows similarity to the beta-helical repeat, in which three beta-strands per repeat wind once per repeat around in a right-handed helical stack of parallel beta structure.
23cd07557923735.821[   ---------------------                         ]trimeric_dUTPaseTrimeric dUTP diphosphatases. Trimeric dUTP diphosphatases, or dUTPases, are the most common family of dUTPase, found in bacteria, eukaryotes, and archaea. They catalyze the hydrolysis of the dUTP-Mg complex (dUTP-Mg) into dUMP and pyrophosphate. This reaction is crucial for the preservation of chromosomal integrity as it removes dUTP and therefore reduces the cellular dUTP/dTTP ratio, and prevents dUTP from being incorporated into DNA. It also provides dUMP as the precursor for dTTP synthesis via the thymidylate synthase pathway. dUTPases are homotrimeric, except some monomeric viral dUTPases, which have been shown to mimic a trimer. Active sites are located at the subunit interface.
24cd137391686735.116[                -------------------------------- ]SPRY_PRY_TRIM1PRY/SPRY domain of tripartite motif-binding protein 1 (TRIM1) or MID2. This domain, consisting of the distinct N-terminal PRY subdomain followed by the SPRY subdomain, is found at the C-terminus of TRIM1 (also known as MID2 or midline 2). MID2 and its close homolog, TRIM18 (also known as MID1), both contain a B30.2-like domain at their C-terminus and a single fibronectin type III (FN3) motif between it and their N-terminal RBCC domain. MID2 and MID1 coiled-coil motifs mediate both homo- and heterodimerization, a prerequisite for association of the rapamycin-sensitive PP2A regulatory subunit Alpha 4 with microtubules. Mutations in MID1 have shown to cause Opitz syndrome, a disorder causing congenital anomalies such as cleft lip and palate as well as heart defects.
25pfam098561564134.635[         --------------------                    ]DUF2083Predicted transcriptional regulator (DUF2083). This domain is found in various prokaryotic transcriptional regulatory proteins belonging to the XRE family. Its exact function is, as yet, unknown.
26cd128921771534.413[                                         ------- ]SPRY_PRY_TRIM18PRY/SPRY domain of TRIM18/MID1, also known as FXY or RNF59. This domain, consisting of the distinct N-terminal PRY subdomain followed by the SPRY subdomain, is at the C-terminus of the overall domain architecture of MID1 (also known as FXY, RNF59, TRIM18) gene represented by a RING finger domain (RING), two B-box motifs (BBOX), coiled-coil C-terminal to Bbox domain (BBC) and fibronectin type 3 domain (FN3). Mutations in the human MID1 gene result in X-linked Opitz G/BBB syndrome (OS), a disorder affecting development of midline structures, causing craniofacial, urogenital, gastrointestinal and cardiovascular abnormalities. A unique MID1 gene mutation located in a variable loop in the SPRY domain alters conformation of the binding pocket and may affect the binding affinity to the PRY/SPRY domain.
27cd008932862633.612[      ----------------                           ]PI4Kc_IIICatalytic domain of Type III Phosphoinositide 4-kinase. PI4Ks catalyze the transfer of the gamma-phosphoryl group from ATP to the 4-hydroxyl of the inositol ring of D-myo-phosphatidylinositol (PtdIns) to generate PtdIns(4)P, the major precursor in the synthesis of other phosphoinositides including PtdIns(4,5)P2, PtdIns(3,4)P2, and PtdIns(3,4,5)P3. There are two types of PI4Ks, types II and III. Type II PI4Ks lack the characteristic catalytic kinase domain present in PI3Ks and type III PI4Ks, and are excluded from this family. Two isoforms of type III PI4K, alpha and beta, exist in most eukaryotes. The PI4K catalytic domain family is part of a larger superfamily that includes the catalytic domains of other kinases such as the typical serine/threonine/tyrosine protein kinases (PKs), aminoglycoside phosphotransferase, choline kinase, and RIO kinases.
28cd014152308033.269[    ---------------------------------------------]SAICAR_synt_PurCbacterial and archaeal 5-aminoimidazole-4-(N-succinylcarboxamide) ribonucleotide (SAICAR) synthase. A subfamily of SAICAR synthetases represented by the Thermotoga maritima (Tm) enzyme and E. coli PurC. SAICAR synthetase catalyzes the seventh step of the de novo biosynthesis of purine nucleotides (also reported as eighth step). It converts 5-aminoimidazole-4-carboxyribonucleotide (CAIR), ATP, and L-aspartate into 5-aminoimidazole-4-(N-succinylcarboxamide) ribonucleotide (SAICAR), ADP, and phosphate.
29pfam013241545633.229[       -----------------------------             ]Diphtheria_RDiphtheria toxin, R domain. C-terminal receptor binding (R) domain - binds to cell surface receptor, permitting the toxin to enter the cell by receptor mediated endocytosis.
30cd12280812829.675[         -------------                           ]RRM_FETRNA recognition motif in the FET family of RNA-binding proteins. This subfamily corresponds to the RRM of FET (previously TET) (FUS/TLS, EWS, TAF15) family of RNA-binding proteins. This ubiquitously expressed family of similarly structured proteins predominantly localizing to the nuclear, includes FUS (also known as TLS or Pigpen or hnRNP P2), EWS (also known as EWSR1), TAF15 (also known as hTAFII68 or TAF2N or RPB56), and Drosophila Cabeza (also known as SARFH). The corresponding coding genes of these proteins are involved in deleterious genomic rearrangements with transcription factor genes in a variety of human sarcomas and acute leukemias. All FET proteins interact with each other and are therefore likely to be part of the very same protein complexes, which suggests a general bridging role for FET proteins coupling RNA transcription, processing, transport, and DNA repair. The FET proteins contain multiple copies of a degenerate hexapeptide repeat motif at the N-terminus. The C-terminal region consists of a conserved nuclear import and retention signal (C-NLS), a putative zinc-finger domain, and a conserved RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), which is flanked by 3 arginine-glycine-glycine (RGG) boxes. FUS and EWS might have similar sequence specificity; both bind preferentially to GGUG-containing RNAs. FUS has also been shown to bind strongly to human telomeric RNA and to small low-copy-number RNAs tethered to the promoter of cyclin D1. To date, nothing is known about the RNA binding specificity of TAF15.
31cd059315475229.545[        -------------------------                ]FAALFatty acyl-AMP ligase (FAAL). FAAL belongs to the class I adenylate forming enzyme family and is homologous to fatty acyl-coenzyme A (CoA) ligases (FACLs). However, FAALs produce only the acyl adenylate and are unable to perform the thioester-forming reaction, while FACLs perform a two-step catalytic reaction; AMP ligation followed by CoA ligation using ATP and CoA as cofactors. FAALs have insertion motifs between the N-terminal and C-terminal subdomains that distinguish them from the FACLs. This insertion motif precludes the binding of CoA, thus preventing CoA ligation. It has been suggested that the acyl adenylates serve as substrates for multifunctional polyketide synthases to permit synthesis of complex lipids such as phthiocerol dimycocerosate, sulfolipids, mycolic acids, and mycobactin.
32pfam052041104429.187[                     --------------------------- ]Hom_endHoming endonuclease. Homing endonucleases are encoded by mobile DNA elements that are found inserted within host genes in all domains of life.
33pfam141911034128.472[             -----------------------             ]YodLYodL-like. The YodL-like protein family includes the B. subtilis YodL protein, which is functionally uncharacterized. This domain family is found in bacteria, and is approximately 100 amino acids in length. There are two completely conserved residues (Y and D) that may be functionally important.
34pfam131891742526.154[                                 ------------    ]Cytidylate_kin2Cytidylate kinase-like family. This family includes enzymes related to cytidylate kinase.
35pfam110271031326.031[        ------                                   ]DUF2615Protein of unknown function (DUF2615). This small. approximately 100 residue, family is conserved from worms to humans. It is cysteine-rich with a characteristic FDxCEC sequence motif. The function is not known.
36TIGR038402131625.414[                 -------                         ]TMPT_Se_Tethiopurine S-methyltransferase, Se/Te detoxification family. Members of this family are thiopurine S-methyltransferase from a branch in which at least some member proteins can perform selenium methylation as a means to detoxify selenium, or perform a related detoxification of tellurium. Note that the EC number definition does not specify a particular thiopurine, but rather represents a class of activity.
37cd027911222525.035[                   ------------                  ]MopB_CT_Nitrate-R-NapA-likeNitrate reductases, NapA (Nitrate-R-NapA), NasA, and NarB catalyze the reduction of nitrate to nitrite. Monomeric Nas is located in the cytoplasm and participates in nitrogen assimilation. Dimeric Nap is located in the periplasm and is coupled to quinol oxidation via a membrane-anchored tetraheme cytochrome. This CD (MopB_CT_Nitrate-R-Nap) is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs
38pfam115131103024.082[                         --------------          ]TA0956Thermoplasma acidophilum protein TA0956. TA0956 is a protein from Thermoplasma acidophilum which currently has no known function however the structure has been determined. The protein has a two-layered alpha/beta-sandwich topology and is a putative Elongation factor 1-alpha binding motif.
39PRK093622388023.986[    ---------------------------------------------]PRK09362phosphoribosylaminoimidazole-succinocarboxamide synthase; Reviewed
40pfam07582551523.830[             -------                             ]AP_endonuc_2_NAP endonuclease family 2 C terminus. This highly-conserved sequence is found at the C terminus of several apurinic/apyrimidinic (AP) endonucleases. in a range of Gram-positive and Gram-negative bacteria. See also pfam01261.
41cd06094897522.246[     -------------------------------------       ]RP_Saci_likeRP_Saci_like, retropepsin family. Retropepsin on retrotransposons with long terminal repeats (LTR) including Saci-1, -2 and -3 of Schistosoma mansoni. Retropepsins are related to fungal and mammalian pepsins. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
42pfam041341131521.128[         -------                                 ]DUF393Protein of unknown function, DUF393. Members of this family have two highly conserved cysteine residues near their N-terminus. The function of these proteins is unknown.
43COG04623144820.792[                     -------------------------   ]PrsAPhosphoribosylpyrophosphate synthetase