Definition | Akkermansia muciniphila ATCC BAA-835, complete genome. |
---|---|
Accession | NC_010655 |
Length | 2,664,102 |
Click here to switch to the map view.
The map label for this gene is yeaE [H]
Identifier: 187736262
GI number: 187736262
Start: 2160807
End: 2161637
Strand: Direct
Name: yeaE [H]
Synonym: Amuc_1775
Alternate gene names: 187736262
Gene position: 2160807-2161637 (Clockwise)
Preceding gene: 187736261
Following gene: 187736273
Centisome position: 81.11
GC content: 55.84
Gene sequence:
>831_bases ATGATGGAACATTCAGTAATTTTTCCGGACCATCGGAAGGTTTGCGCCCTGGGGCAGGGAACCTGGAAAATGGGAAAGTC AGCGTTGAGGGAAGCCGATGAAATAGATGCTTTGCGCGCCGGCATTGAGCTGGGCATGAGCGTAGTGGATACGGCGGAGA TGTATGGCAACGAGGAAATGGTGGGAGCGGCAATCCGCGGTTTGCGCGACCGTGTTTTCCTGGTTACCAAGGTGTTGCCC GGCAACGCCAGCAGGACAGGAACCAAAGCGGCCTGCGAACGGAGCCTGAACAGGCTGAAGACGGATTACGTGGATTTGTT TCTGCTTCACTGGGGAGGTCCCCATCCCATTGAAGATACGGTTGCTTCAATGATTGAGCTGCAGCAGGAGGGAAAAATAA AAGCATGGGGCGTCAGCAATATGGACGTTCCGGAAATGGAGCGGTTTTACGCCGTCCCGGGAGGAGTTTCCTGCGCCGCC AACCAGATTCTTTATAATCTGGCCCATCGAGGTGTGGAGTACGATTTGCTGCCATGGTGCCGGGAACGCCACCTTCCCGT GATGGCCTATTCGCCTGCGGATGAAGGGCGGCTTTCCCGTAATCCGGTGCTGATGGAGATTGCGCAGAAGCATGAGGCCA CCCCGGTGCAGATTGCCCTGGCCTGGATTTTGCGCTGCCCTGGAATGATTGCGATCCCCAAAGCCGGTTCCGTCACGCAT GTGCAGGAAAATTACCGGAGTTTGTCCATCAGGCTGACTGCGGAAGATGTGGATTTGCTGGATGGGGCTTTTCCGCCCCC GGTCAGAAAGGTGCATCTGGATTCCTGGTAA
Upstream 100 bases:
>100_bases GTTCAGGAAAAACGTCCTGATTTTTGAGCTTTTTTTTATCCTCTGCTGTCAAAGGTTTGTACGGCATGGAATGATGCGAT CAGGTAGAAAGAAGATATTA
Downstream 100 bases:
>100_bases ACCGTGATGCCGTCCGGCAGTTGGGGAGAAGCGGGACGCGCGGATTTTCCTGCCGCATCAGAATTGTTTCAAGCCTTTCA TCTGTCCGGGATGAAAGGGC
Product: aldo/keto reductase
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 276; Mature: 276
Protein sequence:
>276_residues MMEHSVIFPDHRKVCALGQGTWKMGKSALREADEIDALRAGIELGMSVVDTAEMYGNEEMVGAAIRGLRDRVFLVTKVLP GNASRTGTKAACERSLNRLKTDYVDLFLLHWGGPHPIEDTVASMIELQQEGKIKAWGVSNMDVPEMERFYAVPGGVSCAA NQILYNLAHRGVEYDLLPWCRERHLPVMAYSPADEGRLSRNPVLMEIAQKHEATPVQIALAWILRCPGMIAIPKAGSVTH VQENYRSLSIRLTAEDVDLLDGAFPPPVRKVHLDSW
Sequences:
>Translated_276_residues MMEHSVIFPDHRKVCALGQGTWKMGKSALREADEIDALRAGIELGMSVVDTAEMYGNEEMVGAAIRGLRDRVFLVTKVLP GNASRTGTKAACERSLNRLKTDYVDLFLLHWGGPHPIEDTVASMIELQQEGKIKAWGVSNMDVPEMERFYAVPGGVSCAA NQILYNLAHRGVEYDLLPWCRERHLPVMAYSPADEGRLSRNPVLMEIAQKHEATPVQIALAWILRCPGMIAIPKAGSVTH VQENYRSLSIRLTAEDVDLLDGAFPPPVRKVHLDSW >Mature_276_residues MMEHSVIFPDHRKVCALGQGTWKMGKSALREADEIDALRAGIELGMSVVDTAEMYGNEEMVGAAIRGLRDRVFLVTKVLP GNASRTGTKAACERSLNRLKTDYVDLFLLHWGGPHPIEDTVASMIELQQEGKIKAWGVSNMDVPEMERFYAVPGGVSCAA NQILYNLAHRGVEYDLLPWCRERHLPVMAYSPADEGRLSRNPVLMEIAQKHEATPVQIALAWILRCPGMIAIPKAGSVTH VQENYRSLSIRLTAEDVDLLDGAFPPPVRKVHLDSW
Specific function: Unknown
COG id: COG0656
COG function: function code R; Aldo/keto reductases, related to diketogulonate reductase
Gene ontology:
Cell location: Cytoplasmic
Metaboloic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
Organism=Homo sapiens, GI310109922, Length=269, Percent_Identity=28.996282527881, Blast_Score=105, Evalue=4e-23, Organism=Homo sapiens, GI223468663, Length=283, Percent_Identity=27.5618374558304, Blast_Score=101, Evalue=7e-22, Organism=Homo sapiens, GI24497585, Length=287, Percent_Identity=29.9651567944251, Blast_Score=99, Evalue=4e-21, Organism=Homo sapiens, GI45446745, Length=289, Percent_Identity=26.9896193771626, Blast_Score=96, Evalue=2e-20, Organism=Homo sapiens, GI4503285, Length=289, Percent_Identity=26.9896193771626, Blast_Score=96, Evalue=2e-20, Organism=Homo sapiens, GI310109920, Length=289, Percent_Identity=26.9896193771626, Blast_Score=96, Evalue=3e-20, Organism=Homo sapiens, GI5453543, Length=289, Percent_Identity=26.643598615917, Blast_Score=96, Evalue=3e-20, Organism=Homo sapiens, GI291291012, Length=260, Percent_Identity=26.9230769230769, Blast_Score=95, Evalue=6e-20, Organism=Homo sapiens, GI4502049, Length=297, Percent_Identity=28.956228956229, Blast_Score=88, Evalue=8e-18, Organism=Homo sapiens, GI24497577, Length=300, Percent_Identity=23.6666666666667, Blast_Score=84, Evalue=1e-16, Organism=Homo sapiens, GI5174391, Length=300, Percent_Identity=23.6666666666667, Blast_Score=84, Evalue=1e-16, Organism=Escherichia coli, GI1788081, Length=282, Percent_Identity=48.936170212766, Blast_Score=286, Evalue=8e-79, Organism=Escherichia coli, GI1786400, Length=251, Percent_Identity=31.0756972111554, Blast_Score=129, Evalue=2e-31, Organism=Escherichia coli, GI87082198, Length=250, Percent_Identity=31.6, Blast_Score=109, Evalue=2e-25, Organism=Escherichia coli, GI1788070, Length=305, Percent_Identity=28.8524590163934, Blast_Score=99, Evalue=4e-22, Organism=Escherichia coli, GI87081735, Length=275, Percent_Identity=28.7272727272727, Blast_Score=92, Evalue=5e-20, Organism=Escherichia coli, GI1787674, Length=255, Percent_Identity=27.843137254902, Blast_Score=85, Evalue=5e-18, Organism=Escherichia coli, GI48994888, Length=284, Percent_Identity=24.2957746478873, Blast_Score=67, Evalue=2e-12, Organism=Caenorhabditis elegans, GI17552492, Length=265, Percent_Identity=28.6792452830189, Blast_Score=120, Evalue=7e-28, Organism=Caenorhabditis elegans, GI17561298, Length=265, Percent_Identity=28.3018867924528, Blast_Score=105, Evalue=3e-23, Organism=Caenorhabditis elegans, GI71998625, Length=252, Percent_Identity=27.7777777777778, Blast_Score=97, Evalue=8e-21, Organism=Caenorhabditis elegans, GI17564128, Length=290, Percent_Identity=26.2068965517241, Blast_Score=87, Evalue=1e-17, Organism=Caenorhabditis elegans, GI17538386, Length=278, Percent_Identity=27.6978417266187, Blast_Score=86, Evalue=2e-17, Organism=Caenorhabditis elegans, GI17550248, Length=281, Percent_Identity=24.9110320284698, Blast_Score=86, Evalue=2e-17, Organism=Caenorhabditis elegans, GI17537077, Length=286, Percent_Identity=26.2237762237762, Blast_Score=80, Evalue=1e-15, Organism=Caenorhabditis elegans, GI17566692, Length=294, Percent_Identity=25.8503401360544, Blast_Score=79, Evalue=3e-15, Organism=Caenorhabditis elegans, GI17537075, Length=300, Percent_Identity=25.3333333333333, Blast_Score=78, Evalue=6e-15, Organism=Caenorhabditis elegans, GI17561300, Length=254, Percent_Identity=25.5905511811024, Blast_Score=76, Evalue=2e-14, Organism=Caenorhabditis elegans, GI17562292, Length=263, Percent_Identity=25.8555133079848, Blast_Score=75, Evalue=3e-14, Organism=Caenorhabditis elegans, GI17537079, Length=284, Percent_Identity=22.1830985915493, Blast_Score=67, Evalue=1e-11, Organism=Saccharomyces cerevisiae, GI6324694, Length=285, Percent_Identity=27.3684210526316, Blast_Score=100, Evalue=2e-22, Organism=Saccharomyces cerevisiae, GI6320576, Length=272, Percent_Identity=28.6764705882353, Blast_Score=91, Evalue=1e-19, Organism=Saccharomyces cerevisiae, GI6322556, Length=266, Percent_Identity=26.3157894736842, Blast_Score=84, Evalue=2e-17, Organism=Saccharomyces cerevisiae, GI6320079, Length=233, Percent_Identity=27.0386266094421, Blast_Score=75, Evalue=8e-15, Organism=Saccharomyces cerevisiae, GI6319958, Length=299, Percent_Identity=24.7491638795987, Blast_Score=70, Evalue=3e-13, Organism=Drosophila melanogaster, GI24644950, Length=299, Percent_Identity=28.0936454849498, Blast_Score=99, Evalue=4e-21, Organism=Drosophila melanogaster, GI24657054, Length=276, Percent_Identity=27.8985507246377, Blast_Score=94, Evalue=1e-19, Organism=Drosophila melanogaster, GI45553081, Length=281, Percent_Identity=27.7580071174377, Blast_Score=91, Evalue=1e-18, Organism=Drosophila melanogaster, GI24662789, Length=305, Percent_Identity=24.5901639344262, Blast_Score=89, Evalue=3e-18, Organism=Drosophila melanogaster, GI24663317, Length=290, Percent_Identity=27.2413793103448, Blast_Score=87, Evalue=1e-17, Organism=Drosophila melanogaster, GI20129731, Length=279, Percent_Identity=27.2401433691756, Blast_Score=85, Evalue=5e-17, Organism=Drosophila melanogaster, GI281366140, Length=254, Percent_Identity=27.5590551181102, Blast_Score=83, Evalue=2e-16,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001395 - InterPro: IPR020471 - InterPro: IPR023210 [H]
Pfam domain/function: PF00248 Aldo_ket_red [H]
EC number: NA
Molecular weight: Translated: 30655; Mature: 30655
Theoretical pI: Translated: 6.42; Mature: 6.42
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.8 %Cys (Translated Protein) 4.3 %Met (Translated Protein) 6.2 %Cys+Met (Translated Protein) 1.8 %Cys (Mature Protein) 4.3 %Met (Mature Protein) 6.2 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MMEHSVIFPDHRKVCALGQGTWKMGKSALREADEIDALRAGIELGMSVVDTAEMYGNEEM CCCCCEECCCCCCEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHCCHHH VGAAIRGLRDRVFLVTKVLPGNASRTGTKAACERSLNRLKTDYVDLFLLHWGGPHPIEDT HHHHHHHHHHHEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHEEEEEECCCCCCHHHH VASMIELQQEGKIKAWGVSNMDVPEMERFYAVPGGVSCAANQILYNLAHRGVEYDLLPWC HHHHHHHHHCCCEEEECCCCCCCCHHHHHEECCCCHHHHHHHHHHHHHHCCCCCCCCHHH RERHLPVMAYSPADEGRLSRNPVLMEIAQKHEATPVQIALAWILRCPGMIAIPKAGSVTH HHCCCCEEEECCCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHCCCEEEECCCCCCHH VQENYRSLSIRLTAEDVDLLDGAFPPPVRKVHLDSW HHHCCEEEEEEEEECCHHHHCCCCCCCCEEEECCCC >Mature Secondary Structure MMEHSVIFPDHRKVCALGQGTWKMGKSALREADEIDALRAGIELGMSVVDTAEMYGNEEM CCCCCEECCCCCCEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHCCHHH VGAAIRGLRDRVFLVTKVLPGNASRTGTKAACERSLNRLKTDYVDLFLLHWGGPHPIEDT HHHHHHHHHHHEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHEEEEEECCCCCCHHHH VASMIELQQEGKIKAWGVSNMDVPEMERFYAVPGGVSCAANQILYNLAHRGVEYDLLPWC HHHHHHHHHCCCEEEECCCCCCCCHHHHHEECCCCHHHHHHHHHHHHHHCCCCCCCCHHH RERHLPVMAYSPADEGRLSRNPVLMEIAQKHEATPVQIALAWILRCPGMIAIPKAGSVTH HHCCCCEEEECCCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHCCCEEEECCCCCCHH VQENYRSLSIRLTAEDVDLLDGAFPPPVRKVHLDSW HHHCCEEEEEEEEECCHHHHCCCCCCCCEEEECCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 9097040; 9278503 [H]