Definition: Escherichia coli O157:H7 str. EC4115, complete genome.
Accession: NC_011353
Length: 5,572,075


The map label for this gene is mdoH

Identifier: 209399078

GI number: 209399078

Start: 1413620

End: 1416133

Strand: Direct

Name: mdoH

Synonym: ECH74115_1429

Alternate gene names: 209399078

Gene position: 1413620-1416133 (Clockwise)

Preceding gene: 209398428

Following gene: 209400634

Centisome position: 25.37
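
The gene length and centisome position can be reproduced from the coordinates above. This is a minimal sketch, assuming the centisome position is the start coordinate expressed as a percentage of the genome length (an assumption that happens to reproduce the listed value); the function names are illustrative and not part of the record.

```python
GENOME_LENGTH = 5_572_075           # Length field of NC_011353
START, END = 1_413_620, 1_416_133   # Start and End fields of mdoH


def gene_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the genome (assumed definition)."""
    return round(100 * start / genome_length, 2)


print(gene_length(START, END))          # 2514
print(centisome(START, GENOME_LENGTH))  # 25.37
```

The length of 2514 agrees with the ">2514_bases" header of the gene sequence.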

GC content: 54.77
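
GC content is the percentage of G and C bases in the sequence. The sketch below shows the calculation on a short toy string; the helper name is hypothetical, and the result for the full 2514-base gene sequence has not been recomputed here.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    seq = "".join(seq.split()).upper()  # drop line breaks from wrapped FASTA text
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100 * gc / len(seq), 2)


# Toy input: the first ten bases of the gene sequence (2 G + 4 C out of 10).
print(gc_content("ATGCCCATCG"))  # 60.0
```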

Gene sequence:

>2514_bases
ATGCCCATCGCCGCAAGCGAGAAAGCGGCATTGCCGAAGACTGATATCCGCGCCGTTCATCAGGCGCTGGATGCCGAACA
CCGCACCTGGGCGCGGGAGGATGACTCCCCGCAAGGCTCGGTAAAGGCGCGTCTGGAACAAGCCTGGCCAGATTCACTTG
CTGATGGACAGTTAATTAAAGATGACGAAGGGCGCGATCAGCTTAAGGCGATGCCAGAAGCAAAACGCTCCTCGATGTTT
CCCGACCCGTGGCGTACCAACCCGGTAGGCCGTTTCTGGGATCGCCTGCGTGGACGCGATGTAACACCGCGCTATCTGGC
TCGTTTGACCAAAGAAGAGCAGGAGAGCGAGCAAAAGTGGCGTACCGTCGGTACCATCCGCCGTTACATTCTGTTGATCC
TGACGCTCGCGCAAACTGTCGTCGCGACCTGGTATATGAAGACCATTCTTCCTTATCAGGGTTGGGCGCTGATTAATCCT
ATGGATATGGTTGGTCAGGATTTGTGGGTTTCCTTTATGCAGCTTCTGCCTTATATGCTGCAAACCGGTATCCTGATCCT
CTTTGCGGTACTGTTCTGTTGGGTGTCCGCCGGATTCTGGACGGCGTTAATGGGCTTCCTGCAACTACTTATTGGTCGCG
ATAAATACAGTATATCTGCGTCAACAGTTGGCGATGAACCATTAAACCCGGAGCATCGCACGGCGTTGATCATGCCTATC
TGTAACGAAGACGTGAACCGTGTTTTTGCTGGCTTGCGTGCAACGTGGGAATCAGTAAAAGCCACCGGGAATGCCAAACA
CTTTGATGTCTACATTCTGAGTGACAGTTACAACCCGGATATCTGCGTCGCAGAGCAAAAAGCCTGGATGGAGCTTATCG
CTGAAGTCGGTGGCGAAGGTCAGATTTTCTATCGCCGCCGCCGCCGTCGCGTGAAGCGTAAAAGCGGTAATATCGATGAC
TTCTGCCGTCGCTGGGGCAGCCAGTACAGCTATATGGTGGTGCTGGATGCTGACTCGGTAATGACCGGTGATTGTTTGTG
CGGCCTGGTGCGCCTGATGGAAGCCAACCCGAACGCCGGGATCATTCAGTCGTCGCCGAAAGCGTCCGGTATGGATACGC
TGTATGCACGCTGTCAGCAGTTCGCGACCCGCGTGTATGGGCCACTGTTTACAGCCGGTTTGCACTTCTGGCAACTTGGC
GAGTCGCACTACTGGGGGCATAACGCGATTATCCGCGTGAAACCGTTTATCGAGCACTGTGCACTGGCCCCGCTGCCGGG
CGAAGGTTCCTTTGCCGGTTCAATCCTGTCACATGACTTCGTGGAAGCGGCGTTGATGCGCCGTGCAGGTTGGGGAGTCT
GGATTGCTTACGATCTCCCGGGTTCTTATGAAGAATTACCGCCTAACTTGCTTGATGAGCTAAAACGTGACCGCCGCTGG
TGCCACGGTAACCTGATGAACTTCCGTCTGTTCCTGGTGAAGGGTATGCACCCGGTTCACCGTGCGGTATTCCTGACGGG
CGTGATGTCTTATCTCTCCGCTCCGCTGTGGTTTATGTTCCTCGCGCTCTCTACTGCATTGCAGGTAGTGCATGCGTTGA
CCGAACCGCAATACTTCCTGCAACCACGGCAGTTGTTCCCGGTGTGGCCGCAGTGGCGTCCTGAGCTGGCGATTGCACTT
TTTGCTTCGACCATGGTGCTGTTGTTCCTGCCAAAGCTATTGAGCATTTTGCTTATTTGGTGTAAAGGAACGAAAGAATA
TGGCGGCTTCTGGCGCGTTACATTATCGTTGCTGCTGGAAGTGCTGTTTTCCGTGCTGCTGGCTCCGGTACGCATGCTGT
TCCATACGGTCTTCGTCGTCAGCGCGTTCCTTGGCTGGGAAGTGGTGTGGAATTCACCGCAGCGTGATGATGACTCCACT
TCCTGGGGTGAAGCGTTCAAACGCCACGGCTCACAGCTGCTGTTAGGGTTAGTGTGGGCTGTCGGGATGGCGTGGTTGGA
TCTGCGTTTCCTGTTCTGGCTGGCACCGATTGTCTTCTCGTTGATCCTGTCACCGTTTGTTTCGGTGATTTCCAGCCGTG
CCACCGTTGGTCTGCGCACCAAACGCTGGAAACTGTTCCTGATCCCGGAAGAGTATTCACCGCCGCAGGTACTGGTTGAT
ACCGATCGGTTCCTTGAGATGAACCGTCAACGCTCCCTTGATGATGGTTTTATGCACGCAGTGTTTAACCCGTCATTTAA
CGCTCTGGCAACCGCAATGGCGACCGCGCGTCACCGCGCCAGTAAGGTGCTGGAAATCGCCCGTGACCGCCATGTTGAAC
AGGCGCTGAACGAGACGCCAGAGAAGCTGAATCGCGATCGTCGCCTTGTGCTGCTAAGCGATCCGGTGACGATGGCCCGT
CTGCATTTCCGCGTCTGGAATTCCCCTGAGAGATATTCTTCATGGGTGAGTTATTACGAAGGGATAAAGCTCAATCCACT
GGCATTGCGTAAACCGGATGCGGCTTCGCAATAA
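
The sequence above starts with ATG and ends with TAA, as expected for a complete coding region. A minimal sanity check along those lines might look like the following; it assumes an ATG start only (other bacterial start codons such as GTG are not handled) and the function name is illustrative.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def looks_like_orf(seq: str) -> bool:
    """True if seq is ATG...stop, a multiple of 3 long, with no internal stop."""
    seq = "".join(seq.split()).upper()  # remove whitespace from wrapped lines
    if len(seq) < 6 or len(seq) % 3 != 0:
        return False
    if not seq.startswith("ATG") or seq[-3:] not in STOP_CODONS:
        return False
    # Every codon except the last must be a sense codon.
    internal = (seq[i:i + 3] for i in range(0, len(seq) - 3, 3))
    return all(codon not in STOP_CODONS for codon in internal)


print(looks_like_orf("ATGGCATAA"))     # True
print(looks_like_orf("ATGTAAGCATAA"))  # False (internal stop)
```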

Upstream 100 bases:

>100_bases
GAAATGCGCGCTGCGCTGGTGAATGCCGATCAGACGTTGAGTGAAACCTGGAGCTACCAGTTACCTGCCAATGAATAAGA
CAACTGAGTACATTGACGCA

Downstream 100 bases:

>100_bases
AAACGTAGTTGCCTGATGCGCTATGCTTATCAGGCCTACATCGCTCCTGCAATGTGTTGATTTTGCAAGATTTTGTAGGT
CGGATAAGGCGTTCACGCCA

Product: glucosyltransferase MdoH

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 837; Mature: 836
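
Both counts follow from the gene length: 2514 bases make 838 codons, the last of which is the stop codon, leaving 837 residues. The mature count is one lower, which is consistent with removal of the initiator methionine (the mature sequence in this record lacks the leading M). A small sketch of that arithmetic, with an illustrative function name:

```python
GENE_LENGTH = 2514  # bases, including the stop codon


def residue_counts(gene_length: int) -> tuple:
    """Return (translated, mature) residue counts for a coding sequence."""
    if gene_length % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    translated = gene_length // 3 - 1  # the stop codon encodes no residue
    mature = translated - 1            # assumes only the initiator Met is removed
    return translated, mature


print(residue_counts(GENE_LENGTH))  # (837, 836)
```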

Protein sequence:

>837_residues
MPIAASEKAALPKTDIRAVHQALDAEHRTWAREDDSPQGSVKARLEQAWPDSLADGQLIKDDEGRDQLKAMPEAKRSSMF
PDPWRTNPVGRFWDRLRGRDVTPRYLARLTKEEQESEQKWRTVGTIRRYILLILTLAQTVVATWYMKTILPYQGWALINP
MDMVGQDLWVSFMQLLPYMLQTGILILFAVLFCWVSAGFWTALMGFLQLLIGRDKYSISASTVGDEPLNPEHRTALIMPI
CNEDVNRVFAGLRATWESVKATGNAKHFDVYILSDSYNPDICVAEQKAWMELIAEVGGEGQIFYRRRRRRVKRKSGNIDD
FCRRWGSQYSYMVVLDADSVMTGDCLCGLVRLMEANPNAGIIQSSPKASGMDTLYARCQQFATRVYGPLFTAGLHFWQLG
ESHYWGHNAIIRVKPFIEHCALAPLPGEGSFAGSILSHDFVEAALMRRAGWGVWIAYDLPGSYEELPPNLLDELKRDRRW
CHGNLMNFRLFLVKGMHPVHRAVFLTGVMSYLSAPLWFMFLALSTALQVVHALTEPQYFLQPRQLFPVWPQWRPELAIAL
FASTMVLLFLPKLLSILLIWCKGTKEYGGFWRVTLSLLLEVLFSVLLAPVRMLFHTVFVVSAFLGWEVVWNSPQRDDDST
SWGEAFKRHGSQLLLGLVWAVGMAWLDLRFLFWLAPIVFSLILSPFVSVISSRATVGLRTKRWKLFLIPEEYSPPQVLVD
TDRFLEMNRQRSLDDGFMHAVFNPSFNALATAMATARHRASKVLEIARDRHVEQALNETPEKLNRDRRLVLLSDPVTMAR
LHFRVWNSPERYSSWVSYYEGIKLNPLALRKPDAASQ

Sequences:

>Mature_836_residues
PIAASEKAALPKTDIRAVHQALDAEHRTWAREDDSPQGSVKARLEQAWPDSLADGQLIKDDEGRDQLKAMPEAKRSSMFP
DPWRTNPVGRFWDRLRGRDVTPRYLARLTKEEQESEQKWRTVGTIRRYILLILTLAQTVVATWYMKTILPYQGWALINPM
DMVGQDLWVSFMQLLPYMLQTGILILFAVLFCWVSAGFWTALMGFLQLLIGRDKYSISASTVGDEPLNPEHRTALIMPIC
NEDVNRVFAGLRATWESVKATGNAKHFDVYILSDSYNPDICVAEQKAWMELIAEVGGEGQIFYRRRRRRVKRKSGNIDDF
CRRWGSQYSYMVVLDADSVMTGDCLCGLVRLMEANPNAGIIQSSPKASGMDTLYARCQQFATRVYGPLFTAGLHFWQLGE
SHYWGHNAIIRVKPFIEHCALAPLPGEGSFAGSILSHDFVEAALMRRAGWGVWIAYDLPGSYEELPPNLLDELKRDRRWC
HGNLMNFRLFLVKGMHPVHRAVFLTGVMSYLSAPLWFMFLALSTALQVVHALTEPQYFLQPRQLFPVWPQWRPELAIALF
ASTMVLLFLPKLLSILLIWCKGTKEYGGFWRVTLSLLLEVLFSVLLAPVRMLFHTVFVVSAFLGWEVVWNSPQRDDDSTS
WGEAFKRHGSQLLLGLVWAVGMAWLDLRFLFWLAPIVFSLILSPFVSVISSRATVGLRTKRWKLFLIPEEYSPPQVLVDT
DRFLEMNRQRSLDDGFMHAVFNPSFNALATAMATARHRASKVLEIARDRHVEQALNETPEKLNRDRRLVLLSDPVTMARL
HFRVWNSPERYSSWVSYYEGIKLNPLALRKPDAASQ

Specific function: Involved in the biosynthesis of osmoregulated periplasmic glucans (OPGs)

COG id: COG2943

COG function: function code M; Membrane glycosyltransferase

Gene ontology:

Cell location: Cell inner membrane; Multi-pass membrane protein

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the glycosyltransferase 2 family. OpgH subfamily

Homologues:

Organism=Escherichia coli, GI1787287, Length=837, Percent_Identity=100, Blast_Score=1721, Evalue=0.0

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): OPGH_ECO57 (P62518)

Other databases:

- EMBL:   AE005174
- EMBL:   BA000007
- RefSeq:   NP_287183.1
- RefSeq:   NP_309454.1
- ProteinModelPortal:   P62518
- EnsemblBacteria:   EBESCT00000025315
- EnsemblBacteria:   EBESCT00000057024
- GeneID:   913619
- GeneID:   959381
- GenomeReviews:   AE005174_GR
- GenomeReviews:   BA000007_GR
- KEGG:   ece:Z1684
- KEGG:   ecs:ECs1427
- GeneTree:   EBGT00050000010387
- HOGENOM:   HBG668600
- OMA:   MSGECLT
- ProtClustDB:   PRK05454
- BioCyc:   ECOL83334:ECS1427-MONOMER
- HAMAP:   MF_01072
- InterPro:   IPR001173

Pfam domain/function: PF00535 Glycos_transf_2

EC number: 2.4.1.- [C]

Molecular weight: Translated: 95771; Mature: 95640
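
The two molecular weights differ by 131, which is close to the average mass of one methionine residue (about 131.2 Da) and so fits the one-residue difference between the translated and mature forms. A minimal check, where the tolerance and function name are assumptions made for illustration:

```python
MET_RESIDUE_MASS = 131.2  # average mass of a methionine residue, in Da


def consistent_with_met_removal(translated_mw: float, mature_mw: float,
                                tol: float = 1.0) -> bool:
    """True if the mass difference matches one Met residue within tol Da."""
    return abs((translated_mw - mature_mw) - MET_RESIDUE_MASS) <= tol


print(95771 - 95640)                              # 131
print(consistent_with_met_removal(95771, 95640))  # True
```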

Theoretical pI: Translated: 8.96; Mature: 8.96

Prosite motif: PS00213 LIPOCALIN

Important sites: NA

Signals:

None

Transmembrane regions:

Six regions listed; coordinates not available

Cys/Met content:

1.2 %Cys     (Translated Protein)
3.2 %Met     (Translated Protein)
4.4 %Cys+Met (Translated Protein)
1.2 %Cys     (Mature Protein)
3.1 %Met     (Mature Protein)
4.3 %Cys+Met (Mature Protein)
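
These percentages are residue counts divided by sequence length. The sketch below shows the calculation on a toy peptide; the helper name is hypothetical, and the values listed above have not been recomputed from the full sequences here.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the given one-letter residues in a protein string."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(protein.count(r) for r in residues.upper())
    return round(100 * hits / len(protein), 1)


toy = "MPIAASEKAC"  # toy peptide: 1 Met and 1 Cys in 10 residues
print(residue_percent(toy, "C"))   # 10.0
print(residue_percent(toy, "M"))   # 10.0
print(residue_percent(toy, "CM"))  # 20.0
```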

Secondary structure:

>Translated Secondary Structure
MPIAASEKAALPKTDIRAVHQALDAEHRTWAREDDSPQGSVKARLEQAWPDSLADGQLIK
CCCCCCCCCCCCHHHHHHHHHHHCHHHHHCCCCCCCCCCHHHHHHHHHCCCCCCCCCEEC
DDEGRDQLKAMPEAKRSSMFPDPWRTNPVGRFWDRLRGRDVTPRYLARLTKEEQESEQKW
CCCCHHHHHHCCHHHHCCCCCCCCCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHH
RTVGTIRRYILLILTLAQTVVATWYMKTILPYQGWALINPMDMVGQDLWVSFMQLLPYML
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEECHHHHHHHHHHHHHHHHHHHHH
QTGILILFAVLFCWVSAGFWTALMGFLQLLIGRDKYSISASTVGDEPLNPEHRTALIMPI
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCCCCCCCCEEEEEEC
CNEDVNRVFAGLRATWESVKATGNAKHFDVYILSDSYNPDICVAEQKAWMELIAEVGGEG
CCHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCCCCEEEECHHHHHHHHHHHCCCC
QIFYRRRRRRVKRKSGNIDDFCRRWGSQYSYMVVLDADSVMTGDCLCGLVRLMEANPNAG
HHHHHHHHHHHHHHCCCHHHHHHHHCCCCEEEEEEECCCCCCHHHHHHHHHHHHCCCCCC
IIQSSPKASGMDTLYARCQQFATRVYGPLFTAGLHFWQLGESHYWGHNAIIRVKPFIEHC
CEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEHHHHHH
ALAPLPGEGSFAGSILSHDFVEAALMRRAGWGVWIAYDLPGSYEELPPNLLDELKRDRRW
CCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCHHHCCHHHHHHHHHHHHH
CHGNLMNFRLFLVKGMHPVHRAVFLTGVMSYLSAPLWFMFLALSTALQVVHALTEPQYFL
HCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHC
QPRQLFPVWPQWRPELAIALFASTMVLLFLPKLLSILLIWCKGTKEYGGFWRVTLSLLLE
CCHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHCCHHHHHHHHHHH
VLFSVLLAPVRMLFHTVFVVSAFLGWEVVWNSPQRDDDSTSWGEAFKRHGSQLLLGLVWA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHH
VGMAWLDLRFLFWLAPIVFSLILSPFVSVISSRATVGLRTKRWKLFLIPEEYSPPQVLVD
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEECEEEEEECCCCCCCEEEEC
TDRFLEMNRQRSLDDGFMHAVFNPSFNALATAMATARHRASKVLEIARDRHVEQALNETP
CHHHHHHHHHCCCCCHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCH
EKLNRDRRLVLLSDPVTMARLHFRVWNSPERYSSWVSYYEGIKLNPLALRKPDAASQ
HHHCCCCEEEEEECCHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCEECCCCCCCCH
>Mature Secondary Structure 
PIAASEKAALPKTDIRAVHQALDAEHRTWAREDDSPQGSVKARLEQAWPDSLADGQLIK
CCCCCCCCCCCHHHHHHHHHHHCHHHHHCCCCCCCCCCHHHHHHHHHCCCCCCCCCEEC
DDEGRDQLKAMPEAKRSSMFPDPWRTNPVGRFWDRLRGRDVTPRYLARLTKEEQESEQKW
CCCCHHHHHHCCHHHHCCCCCCCCCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHH
RTVGTIRRYILLILTLAQTVVATWYMKTILPYQGWALINPMDMVGQDLWVSFMQLLPYML
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEECHHHHHHHHHHHHHHHHHHHHH
QTGILILFAVLFCWVSAGFWTALMGFLQLLIGRDKYSISASTVGDEPLNPEHRTALIMPI
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCCCCCCCCEEEEEEC
CNEDVNRVFAGLRATWESVKATGNAKHFDVYILSDSYNPDICVAEQKAWMELIAEVGGEG
CCHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCCCCEEEECHHHHHHHHHHHCCCC
QIFYRRRRRRVKRKSGNIDDFCRRWGSQYSYMVVLDADSVMTGDCLCGLVRLMEANPNAG
HHHHHHHHHHHHHHCCCHHHHHHHHCCCCEEEEEEECCCCCCHHHHHHHHHHHHCCCCCC
IIQSSPKASGMDTLYARCQQFATRVYGPLFTAGLHFWQLGESHYWGHNAIIRVKPFIEHC
CEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEHHHHHH
ALAPLPGEGSFAGSILSHDFVEAALMRRAGWGVWIAYDLPGSYEELPPNLLDELKRDRRW
CCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCHHHCCHHHHHHHHHHHHH
CHGNLMNFRLFLVKGMHPVHRAVFLTGVMSYLSAPLWFMFLALSTALQVVHALTEPQYFL
HCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHC
QPRQLFPVWPQWRPELAIALFASTMVLLFLPKLLSILLIWCKGTKEYGGFWRVTLSLLLE
CCHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHCCHHHHHHHHHHH
VLFSVLLAPVRMLFHTVFVVSAFLGWEVVWNSPQRDDDSTSWGEAFKRHGSQLLLGLVWA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHH
VGMAWLDLRFLFWLAPIVFSLILSPFVSVISSRATVGLRTKRWKLFLIPEEYSPPQVLVD
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEECEEEEEECCCCCCCEEEEC
TDRFLEMNRQRSLDDGFMHAVFNPSFNALATAMATARHRASKVLEIARDRHVEQALNETP
CHHHHHHHHHCCCCCHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCH
EKLNRDRRLVLLSDPVTMARLHFRVWNSPERYSSWVSYYEGIKLNPLALRKPDAASQ
HHHCCCCEEEEEECCHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCEECCCCCCCCH
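
The prediction strings above can be summarised as per-state percentages. This sketch assumes the common convention that H is helix, E is strand, and C is coil (the record does not define the letters); the function name is illustrative, and the input is a toy string rather than the full prediction.

```python
from collections import Counter


def ss_composition(ss: str) -> dict:
    """Percentage of H, E and C states in a secondary-structure string."""
    ss = "".join(ss.split()).upper()
    if not ss:
        raise ValueError("empty prediction string")
    counts = Counter(ss)
    return {state: round(100 * counts[state] / len(ss), 1) for state in "HEC"}


# Toy string: 6 H, 2 E and 4 C out of 12 positions.
print(ss_composition("CCCCHHHHHHEE"))  # {'H': 50.0, 'E': 16.7, 'C': 33.3}
```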

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796