Definition Akkermansia muciniphila ATCC BAA-835, complete genome.
Accession NC_010655
Length 2,664,102

The map label for this gene is yghA [H]

Identifier: 187735280

GI number: 187735280

Start: 914942

End: 915841

Strand: Direct

Name: yghA [H]

Synonym: Amuc_0777

Alternate gene names: 187735280

Gene position: 914942-915841 (Clockwise)

Preceding gene: 187735278

Following gene: 187735281

Centisome position: 34.34
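
The coordinate fields above are consistent with one another if positions are 1-based and inclusive, and if the centisome position is taken as the gene start expressed as a percentage of the genome length. That definition is an assumption; the record does not state it. A minimal sketch, with hypothetical function names:

```python
# Values copied from the record's Length, Start and End fields.
GENOME_LENGTH = 2_664_102
START, END = 914_942, 915_841  # assumed 1-based, inclusive


def gene_length(start: int, end: int) -> int:
    """Number of bases covered by an inclusive 1-based interval."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(START, END))                     # 900
print(round(centisome(START, GENOME_LENGTH), 2))   # 34.34
```

Both results agree with the ">900_bases" header and the reported centisome position.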

GC content: 56.67

Gene sequence:

>900_bases
ATGAGCACAGAAGGTACTAACGAAATACAGGACCCGCGCGGTGAATTCCAACAGGAAAGATATGAAAAGCAGCAGCAGAA
AGCCCCCGGACTGCAATCCGGAATGAAGCCCGTGCCGGATTCCGGGGAACAAAGCTACCATGGCTGCAACAGGCTCCGTG
GACGAAAAGCCCTGGTCACAGGGGGAGACTCGGGAATAGGCCGGGCGGCGGCTATTGCCTACGCACGCGAAGGAGCCGAC
GTAGCCTTGAACTACCTTCCTGAAGAACAAAGCGATGCCGAAGAAGTGGCGGAACTCATCAGAAAAGAGGGCCGGAAGGC
CGTCCTGCTGCCGGGGGACCTCAGCGATGAAGTCTTTTGTAAAAAACTCGTCAGGGATGCCTTGGAAAAACTTGGCGGAC
TGGACATCATGGCCCTGGTAGCCGGTAAACAAGTGGCGGTGGAAGATATCCGGGACATTACCACGGAACAATTAACCAAG
ACATTTGAAGTAAACGTGTTCTCCCTGTTCTGGATTGCCAAGGAAGCCCTTCCGCATCTCAAGCCGGGGACGAGCATCAT
TACCTGCTCCTCCATTCAGGCCTACCAGCCCGGCAAGAACCTGGTGGACTATGCTTCCACAAAAGCGGCCATCATTGCGT
TCAGCCGCTCCCTGGCCAAGCAGGTTGCCCCCAAAGGCATCCGGGTCAACGTGGTGGCTCCCGGCCCCATCTGGACAGCC
CTGCAAGTAACCGGAGGCCAGCTGCAGAAGGATCTGCCGGAATTCGGACGGAAAACGCCTCTCAAAAGGGCAGGGCAACC
TGTGGAACTCTCCGGACTATACGTCTTTCTCGCCTCCCAGGAATCCAGCTTCATTACTGCGGAGGTCTTTGGCGTTACCG
GCGGCATGCATCTGGCCTGA
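
The GC content field can be checked against the gene sequence by counting G and C bases. The sketch below assumes the value is a plain percentage of G+C over the full gene length; `gc_content` is a hypothetical helper, not part of any tool named in this record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = "".join(seq.split()).upper()  # drop line breaks and whitespace
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Toy input; paste the 900-base gene sequence to check the record's value.
print(gc_content("ATGC"))  # 50.0
```

For a 900-base gene, a reported value of 56.67 would correspond to 510 G or C bases.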

Upstream 100 bases:

>100_bases
AATGAAGAATGCCGGAAAAGGGAATGCTGCCGGCTATTTCCCGCATGGCAGCGCTTTTCCCCTTCCGTCAACTCCGCATC
AAAATATGACTGACGAAAAA

Downstream 100 bases:

>100_bases
TGCGGAAACCCGCAATCCGGGAAAAGGGCCGGACTGACTTTCCGGAATCCCTTTCCGCAGCGCCACACCCCCGGATGACC
GGGAAAGCGGCCGGAAACGG
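
The upstream and downstream windows can be reproduced from the genome sequence and the gene coordinates. This is a sketch under stated assumptions: coordinates are 1-based and inclusive, the gene lies on the direct strand, and the chromosome is circular, so windows wrap around the origin. The function name `flanks` is hypothetical.

```python
def flanks(genome: str, start: int, end: int, width: int = 100):
    """Return (upstream, downstream) flanks of a direct-strand gene.

    start and end are 1-based inclusive coordinates. Indices are taken
    modulo the genome length so that flanks wrap on a circular genome.
    """
    n = len(genome)
    # 0-based index of the first gene base is start - 1.
    upstream = "".join(genome[(start - 1 - width + i) % n] for i in range(width))
    # 0-based index of the first base after the gene is end.
    downstream = "".join(genome[(end + i) % n] for i in range(width))
    return upstream, downstream


toy = "AAACCCGGGTTT"
print(flanks(toy, 4, 6, width=3))  # ('AAA', 'GGG')
```

For a gene on the complementary strand the two windows would swap and need reverse-complementing, which this sketch does not do.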

Product: short-chain dehydrogenase/reductase SDR

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 299; Mature: 298

Protein sequence:

>299_residues
MSTEGTNEIQDPRGEFQQERYEKQQQKAPGLQSGMKPVPDSGEQSYHGCNRLRGRKALVTGGDSGIGRAAAIAYAREGAD
VALNYLPEEQSDAEEVAELIRKEGRKAVLLPGDLSDEVFCKKLVRDALEKLGGLDIMALVAGKQVAVEDIRDITTEQLTK
TFEVNVFSLFWIAKEALPHLKPGTSIITCSSIQAYQPGKNLVDYASTKAAIIAFSRSLAKQVAPKGIRVNVVAPGPIWTA
LQVTGGQLQKDLPEFGRKTPLKRAGQPVELSGLYVFLASQESSFITAEVFGVTGGMHLA
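
The residue counts follow from the gene length: 900 bases give 300 codons, the last of which (TGA) is a stop, leaving 299 translated residues. The mature form listed in this record lacks the initiator methionine, giving 298. The sketch below translates a DNA string with the standard genetic code; the names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGAGCACAGAAGGTACTAACGAAATACAG"))  # MSTEGTNEIQ
print(translate("CATCTGGCCTGA"))                    # HLA
```

Both fragments match the start and end of the protein sequence shown above.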

Sequences:

>Translated_299_residues
MSTEGTNEIQDPRGEFQQERYEKQQQKAPGLQSGMKPVPDSGEQSYHGCNRLRGRKALVTGGDSGIGRAAAIAYAREGAD
VALNYLPEEQSDAEEVAELIRKEGRKAVLLPGDLSDEVFCKKLVRDALEKLGGLDIMALVAGKQVAVEDIRDITTEQLTK
TFEVNVFSLFWIAKEALPHLKPGTSIITCSSIQAYQPGKNLVDYASTKAAIIAFSRSLAKQVAPKGIRVNVVAPGPIWTA
LQVTGGQLQKDLPEFGRKTPLKRAGQPVELSGLYVFLASQESSFITAEVFGVTGGMHLA
>Mature_298_residues
STEGTNEIQDPRGEFQQERYEKQQQKAPGLQSGMKPVPDSGEQSYHGCNRLRGRKALVTGGDSGIGRAAAIAYAREGADV
ALNYLPEEQSDAEEVAELIRKEGRKAVLLPGDLSDEVFCKKLVRDALEKLGGLDIMALVAGKQVAVEDIRDITTEQLTKT
FEVNVFSLFWIAKEALPHLKPGTSIITCSSIQAYQPGKNLVDYASTKAAIIAFSRSLAKQVAPKGIRVNVVAPGPIWTAL
QVTGGQLQKDLPEFGRKTPLKRAGQPVELSGLYVFLASQESSFITAEVFGVTGGMHLA

Specific function: Unknown

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the short-chain dehydrogenases/reductases (SDR) family [H]

Homologues:

Organism=Homo sapiens, GI32483357, Length=250, Percent_Identity=31.2, Blast_Score=106, Evalue=3e-23,
Organism=Homo sapiens, GI5031737, Length=275, Percent_Identity=31.6363636363636, Blast_Score=102, Evalue=5e-22,
Organism=Homo sapiens, GI59889578, Length=197, Percent_Identity=32.994923857868, Blast_Score=93, Evalue=3e-19,
Organism=Homo sapiens, GI40254992, Length=244, Percent_Identity=31.1475409836066, Blast_Score=86, Evalue=3e-17,
Organism=Homo sapiens, GI15277342, Length=259, Percent_Identity=29.3436293436293, Blast_Score=86, Evalue=4e-17,
Organism=Homo sapiens, GI33667109, Length=192, Percent_Identity=34.8958333333333, Blast_Score=86, Evalue=4e-17,
Organism=Homo sapiens, GI19923817, Length=263, Percent_Identity=29.277566539924, Blast_Score=85, Evalue=7e-17,
Organism=Homo sapiens, GI126723750, Length=253, Percent_Identity=26.8774703557312, Blast_Score=80, Evalue=3e-15,
Organism=Homo sapiens, GI4758504, Length=269, Percent_Identity=26.0223048327138, Blast_Score=77, Evalue=3e-14,
Organism=Homo sapiens, GI10190704, Length=263, Percent_Identity=29.277566539924, Blast_Score=74, Evalue=2e-13,
Organism=Homo sapiens, GI66933014, Length=261, Percent_Identity=30.6513409961686, Blast_Score=72, Evalue=6e-13,
Organism=Homo sapiens, GI126723191, Length=186, Percent_Identity=29.0322580645161, Blast_Score=70, Evalue=3e-12,
Organism=Homo sapiens, GI210032110, Length=186, Percent_Identity=28.494623655914, Blast_Score=70, Evalue=3e-12,
Organism=Homo sapiens, GI31542939, Length=198, Percent_Identity=28.7878787878788, Blast_Score=67, Evalue=1e-11,
Organism=Homo sapiens, GI32455239, Length=181, Percent_Identity=29.2817679558011, Blast_Score=66, Evalue=3e-11,
Organism=Homo sapiens, GI5031765, Length=181, Percent_Identity=29.2817679558011, Blast_Score=66, Evalue=3e-11,
Organism=Homo sapiens, GI4504505, Length=198, Percent_Identity=29.2929292929293, Blast_Score=65, Evalue=6e-11,
Organism=Escherichia coli, GI1789378, Length=292, Percent_Identity=65.7534246575342, Blast_Score=404, Evalue=1e-114,
Organism=Escherichia coli, GI2367175, Length=249, Percent_Identity=34.9397590361446, Blast_Score=122, Evalue=2e-29,
Organism=Escherichia coli, GI2367365, Length=242, Percent_Identity=29.3388429752066, Blast_Score=102, Evalue=4e-23,
Organism=Escherichia coli, GI87082100, Length=260, Percent_Identity=30.3846153846154, Blast_Score=102, Evalue=4e-23,
Organism=Escherichia coli, GI1787905, Length=250, Percent_Identity=32.8, Blast_Score=101, Evalue=6e-23,
Organism=Escherichia coli, GI87082160, Length=250, Percent_Identity=28.8, Blast_Score=100, Evalue=2e-22,
Organism=Escherichia coli, GI1787335, Length=254, Percent_Identity=31.1023622047244, Blast_Score=99, Evalue=2e-22,
Organism=Escherichia coli, GI1789208, Length=250, Percent_Identity=29.2, Blast_Score=97, Evalue=2e-21,
Organism=Escherichia coli, GI1790717, Length=251, Percent_Identity=27.4900398406374, Blast_Score=89, Evalue=4e-19,
Organism=Escherichia coli, GI1788459, Length=247, Percent_Identity=27.1255060728745, Blast_Score=87, Evalue=1e-18,
Organism=Escherichia coli, GI1787545, Length=258, Percent_Identity=27.5193798449612, Blast_Score=77, Evalue=2e-15,
Organism=Escherichia coli, GI1786812, Length=265, Percent_Identity=27.9245283018868, Blast_Score=77, Evalue=2e-15,
Organism=Caenorhabditis elegans, GI17561402, Length=256, Percent_Identity=32.8125, Blast_Score=114, Evalue=9e-26,
Organism=Caenorhabditis elegans, GI17560676, Length=252, Percent_Identity=32.9365079365079, Blast_Score=109, Evalue=2e-24,
Organism=Caenorhabditis elegans, GI193204405, Length=268, Percent_Identity=33.955223880597, Blast_Score=107, Evalue=7e-24,
Organism=Caenorhabditis elegans, GI71994604, Length=253, Percent_Identity=34.7826086956522, Blast_Score=107, Evalue=9e-24,
Organism=Caenorhabditis elegans, GI71994600, Length=254, Percent_Identity=31.8897637795276, Blast_Score=100, Evalue=1e-21,
Organism=Caenorhabditis elegans, GI17531453, Length=268, Percent_Identity=32.089552238806, Blast_Score=98, Evalue=5e-21,
Organism=Caenorhabditis elegans, GI72000259, Length=264, Percent_Identity=29.9242424242424, Blast_Score=96, Evalue=2e-20,
Organism=Caenorhabditis elegans, GI17562906, Length=265, Percent_Identity=31.3207547169811, Blast_Score=93, Evalue=2e-19,
Organism=Caenorhabditis elegans, GI17551412, Length=265, Percent_Identity=29.811320754717, Blast_Score=91, Evalue=7e-19,
Organism=Caenorhabditis elegans, GI17544670, Length=267, Percent_Identity=28.4644194756554, Blast_Score=91, Evalue=9e-19,
Organism=Caenorhabditis elegans, GI17560150, Length=267, Percent_Identity=30.3370786516854, Blast_Score=91, Evalue=9e-19,
Organism=Caenorhabditis elegans, GI17562908, Length=267, Percent_Identity=29.9625468164794, Blast_Score=91, Evalue=1e-18,
Organism=Caenorhabditis elegans, GI17562904, Length=251, Percent_Identity=29.8804780876494, Blast_Score=90, Evalue=1e-18,
Organism=Caenorhabditis elegans, GI25147288, Length=251, Percent_Identity=31.0756972111554, Blast_Score=89, Evalue=2e-18,
Organism=Caenorhabditis elegans, GI17562910, Length=265, Percent_Identity=30.5660377358491, Blast_Score=89, Evalue=4e-18,
Organism=Caenorhabditis elegans, GI17555706, Length=276, Percent_Identity=30.7971014492754, Blast_Score=88, Evalue=4e-18,
Organism=Caenorhabditis elegans, GI17538486, Length=267, Percent_Identity=28.8389513108614, Blast_Score=87, Evalue=8e-18,
Organism=Caenorhabditis elegans, GI17538480, Length=275, Percent_Identity=29.4545454545455, Blast_Score=87, Evalue=1e-17,
Organism=Caenorhabditis elegans, GI17563726, Length=260, Percent_Identity=28.8461538461538, Blast_Score=85, Evalue=4e-17,
Organism=Caenorhabditis elegans, GI17560332, Length=269, Percent_Identity=28.6245353159851, Blast_Score=85, Evalue=5e-17,
Organism=Caenorhabditis elegans, GI17559104, Length=266, Percent_Identity=28.5714285714286, Blast_Score=84, Evalue=7e-17,
Organism=Caenorhabditis elegans, GI17565030, Length=273, Percent_Identity=29.3040293040293, Blast_Score=83, Evalue=2e-16,
Organism=Caenorhabditis elegans, GI17562990, Length=269, Percent_Identity=29.7397769516729, Blast_Score=83, Evalue=2e-16,
Organism=Caenorhabditis elegans, GI17560220, Length=266, Percent_Identity=30.0751879699248, Blast_Score=82, Evalue=4e-16,
Organism=Caenorhabditis elegans, GI17564282, Length=195, Percent_Identity=31.2820512820513, Blast_Score=79, Evalue=2e-15,
Organism=Caenorhabditis elegans, GI17538182, Length=202, Percent_Identity=29.7029702970297, Blast_Score=74, Evalue=7e-14,
Organism=Caenorhabditis elegans, GI17508651, Length=250, Percent_Identity=27.2, Blast_Score=74, Evalue=8e-14,
Organism=Caenorhabditis elegans, GI71982365, Length=200, Percent_Identity=27.5, Blast_Score=69, Evalue=2e-12,
Organism=Caenorhabditis elegans, GI115534694, Length=254, Percent_Identity=27.5590551181102, Blast_Score=69, Evalue=3e-12,
Organism=Caenorhabditis elegans, GI17568967, Length=198, Percent_Identity=27.7777777777778, Blast_Score=68, Evalue=5e-12,
Organism=Caenorhabditis elegans, GI17536651, Length=265, Percent_Identity=27.9245283018868, Blast_Score=65, Evalue=5e-11,
Organism=Saccharomyces cerevisiae, GI6324126, Length=256, Percent_Identity=28.515625, Blast_Score=70, Evalue=4e-13,
Organism=Saccharomyces cerevisiae, GI6323882, Length=257, Percent_Identity=25.6809338521401, Blast_Score=70, Evalue=6e-13,
Organism=Saccharomyces cerevisiae, GI6322226, Length=192, Percent_Identity=26.0416666666667, Blast_Score=63, Evalue=5e-11,
Organism=Drosophila melanogaster, GI21355319, Length=248, Percent_Identity=34.2741935483871, Blast_Score=114, Evalue=7e-26,
Organism=Drosophila melanogaster, GI24644339, Length=249, Percent_Identity=31.3253012048193, Blast_Score=105, Evalue=4e-23,
Organism=Drosophila melanogaster, GI21357041, Length=246, Percent_Identity=31.7073170731707, Blast_Score=101, Evalue=8e-22,
Organism=Drosophila melanogaster, GI23397609, Length=255, Percent_Identity=29.0196078431373, Blast_Score=100, Evalue=2e-21,
Organism=Drosophila melanogaster, GI28571526, Length=245, Percent_Identity=30.6122448979592, Blast_Score=97, Evalue=2e-20,
Organism=Drosophila melanogaster, GI24644337, Length=230, Percent_Identity=34.3478260869565, Blast_Score=86, Evalue=4e-17,
Organism=Drosophila melanogaster, GI24639444, Length=250, Percent_Identity=29.6, Blast_Score=82, Evalue=3e-16,
Organism=Drosophila melanogaster, GI24643142, Length=248, Percent_Identity=30.241935483871, Blast_Score=81, Evalue=8e-16,
Organism=Drosophila melanogaster, GI17737361, Length=266, Percent_Identity=25.5639097744361, Blast_Score=78, Evalue=7e-15,
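
Each homologue line is a comma-separated list of `key=value` fields plus a bare GI identifier, so the table can be loaded for filtering or sorting. A minimal sketch, assuming every line follows the layout shown above; `parse_homologue` is a hypothetical helper.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:
            continue  # skip the empty piece after the trailing comma
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]
    # Convert the numeric fields; E-values such as 1e-114 parse as floats.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


line = ("Organism=Escherichia coli, GI1789378, Length=292, "
        "Percent_Identity=65.7534246575342, Blast_Score=404, Evalue=1e-114,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 1))
```

Sorting the parsed records by `Blast_Score` puts the Escherichia coli hit GI1789378 (score 404) first; it is the only hit in the list above 35% identity.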

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR002198
- InterPro:   IPR002347
- InterPro:   IPR016040
- InterPro:   IPR020904 [H]

Pfam domain/function: PF00106 adh_short [H]

EC number: 1.-.-.- [C]

Molecular weight: Translated: 32300; Mature: 32169

Theoretical pI: Translated: 6.79; Mature: 6.79

Prosite motif: PS00061 ADH_SHORT

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.0 %Cys     (Translated Protein)
1.3 %Met     (Translated Protein)
2.3 %Cys+Met (Translated Protein)
1.0 %Cys     (Mature Protein)
1.0 %Met     (Mature Protein)
2.0 %Cys+Met (Mature Protein)
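
These percentages can be recomputed from the protein sequence in this record. The sketch assumes, as the listed sequences suggest, that the mature protein is the translated protein without its initiator methionine; `percent` is a hypothetical helper.

```python
# Translated protein sequence copied from this record (299 residues).
PROTEIN = (
    "MSTEGTNEIQDPRGEFQQERYEKQQQKAPGLQSGMKPVPDSGEQSYHGCNRLRGRKALVTGGDSGIGRAAAIAYAREGAD"
    "VALNYLPEEQSDAEEVAELIRKEGRKAVLLPGDLSDEVFCKKLVRDALEKLGGLDIMALVAGKQVAVEDIRDITTEQLTK"
    "TFEVNVFSLFWIAKEALPHLKPGTSIITCSSIQAYQPGKNLVDYASTKAAIIAFSRSLAKQVAPKGIRVNVVAPGPIWTA"
    "LQVTGGQLQKDLPEFGRKTPLKRAGQPVELSGLYVFLASQESSFITAEVFGVTGGMHLA"
)
MATURE = PROTEIN[1:]  # assumption: initiator Met removed


def percent(seq: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given residues."""
    hits = sum(seq.count(r) for r in residues)
    return round(100.0 * hits / len(seq), 1)


for label, seq in (("Translated", PROTEIN), ("Mature", MATURE)):
    print(label, percent(seq, "C"), percent(seq, "M"), percent(seq, "CM"))
```

The translated protein contains 3 Cys and 4 Met residues, which gives the 1.0, 1.3 and 2.3 values listed above; dropping the first Met gives 1.0, 1.0 and 2.0 for the mature form.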

Secondary structure:

>Translated Secondary Structure
MSTEGTNEIQDPRGEFQQERYEKQQQKAPGLQSGMKPVPDSGEQSYHGCNRLRGRKALVT
CCCCCCCCCCCCHHHHHHHHHHHHHHHCCCHHHCCCCCCCCCCHHHHHHHHHCCCEEEEE
GGDSGIGRAAAIAYAREGADVALNYLPEEQSDAEEVAELIRKEGRKAVLLPGDLSDEVFC
CCCCCCCHHHHHHHHHCCCCEEEECCCCCCCCHHHHHHHHHHCCCEEEEECCCCCHHHHH
KKLVRDALEKLGGLDIMALVAGKQVAVEDIRDITTEQLTKTFEVNVFSLFWIAKEALPHL
HHHHHHHHHHHCCCEEEEHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC
KPGTSIITCSSIQAYQPGKNLVDYASTKAAIIAFSRSLAKQVAPKGIRVNVVAPGPIWTA
CCCCCEEEECCCCCCCCCCHHHHHHCCHHHHHHHHHHHHHHHCCCCEEEEEEECCCCEEE
LQVTGGQLQKDLPEFGRKTPLKRAGQPVELSGLYVFLASQESSFITAEVFGVTGGMHLA
EEECCHHHHHHHHHHHCCCCHHHCCCCEEECEEEEEEECCCCCEEEEEEEECCCCCCCC
>Mature Secondary Structure 
STEGTNEIQDPRGEFQQERYEKQQQKAPGLQSGMKPVPDSGEQSYHGCNRLRGRKALVT
CCCCCCCCCCCHHHHHHHHHHHHHHHCCCHHHCCCCCCCCCCHHHHHHHHHCCCEEEEE
GGDSGIGRAAAIAYAREGADVALNYLPEEQSDAEEVAELIRKEGRKAVLLPGDLSDEVFC
CCCCCCCHHHHHHHHHCCCCEEEECCCCCCCCHHHHHHHHHHCCCEEEEECCCCCHHHHH
KKLVRDALEKLGGLDIMALVAGKQVAVEDIRDITTEQLTKTFEVNVFSLFWIAKEALPHL
HHHHHHHHHHHCCCEEEEHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC
KPGTSIITCSSIQAYQPGKNLVDYASTKAAIIAFSRSLAKQVAPKGIRVNVVAPGPIWTA
CCCCCEEEECCCCCCCCCCHHHHHHCCHHHHHHHHHHHHHHHCCCCEEEEEEECCCCEEE
LQVTGGQLQKDLPEFGRKTPLKRAGQPVELSGLYVFLASQESSFITAEVFGVTGGMHLA
EEECCHHHHHHHHHHHCCCCHHHCCCCEEECEEEEEEECCCCCEEEEEEEECCCCCCCC
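
The prediction strings use one letter per residue, read here as H for helix, E for strand and C for coil; the record does not define the alphabet, so that reading is an assumption. Under it, the share of each state can be summarised as below. The function name `ss_composition` is hypothetical.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of each secondary-structure state in a prediction string."""
    states = "".join(states.split()).upper()
    if not states:
        raise ValueError("empty prediction string")
    counts = Counter(states)
    return {state: 100.0 * n / len(states) for state, n in counts.items()}


# Toy input; concatenate the prediction lines above to profile this protein.
print(ss_composition("HHHHEECC"))  # {'H': 50.0, 'E': 25.0, 'C': 25.0}
```

A profile with substantial H and E fractions would be in line with the "Alpha Beta" structure class reported in this record.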

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796 [H]