Definition: Methanosphaera stadtmanae DSM 3091 chromosome, complete genome.
Accession: NC_007681
Length: 1,767,403

The map label for this gene is epsH [H]

Identifier: 84489043

GI number: 84489043

Start: 266307

End: 268895

Strand: Direct

Name: epsH [H]

Synonym: Msp_0215

Alternate gene names: 84489043

Gene position: 266307-268895 (Clockwise)

Preceding gene: 84489042

Following gene: 84489044

Centisome position: 15.07
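
The coordinate fields above are related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming that the centisome position is the start coordinate expressed as a percentage of the chromosome length (that definition is not stated in the record, although it reproduces the listed value). The function names are illustrative only.

```python
GENOME_LENGTH = 1_767_403  # chromosome length from the record header


def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the chromosome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


length = gene_length(266307, 268895)
print(length)                            # 2589 bases
print(length // 3 - 1)                   # 862 codons, excluding the stop codon
print(centisome(266307, GENOME_LENGTH))  # 15.07
```

The 2589 bases correspond to 863 codons, which is consistent with the 862 residues reported for the protein plus one stop codon.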

GC content: 25.34

Gene sequence:

>2589_bases
ATGCATCCTTTTATTAGTGTAATTATACCTGTTTATAATGTTCAATTCACAATAGAAAGTTGTTTTCTATCTTTAAAAAA
TCAAACTATAGGATTTGAAAATTTAGAAATAATCTTTGTTGATGATTGTTCAACAGATAATAGTCCAACTATAATTGGGA
ATTATGAAAAAAAATATGACAATGTTAAAGCAATCTATTCCAAAGAAAATAGTGGTGTTGCAGGAAAACCTAGAAACATG
GGTATGAATATTGCAACAGCAAAATATGTCATGTTTTTAGATCCTGATGATACTTTTACTGTTGATGCTTGTGAAGTATT
ATATAATGAAATAGAAAAAAGTAAAGCAGATATAGTTAGTGGACTACATTCTAAGAAAAATAAATTTAATAATAATGAAG
AGATATTTCCTGGTTTGATTATTAATACATTTTCAGATCCTGAAAAATCATGGTCTGATAGACAGGGTGATGTTGATGTT
TTCAAACAAAAATATCCTAAACACTTTTATATGGATTCAATTGAAGATAAATTTTCTGTTTTAGGAAATTTTGGGCTTTC
AAGTAAAATATTTAATCTTGATTTTATAAAATCAAATAATATTTCATTTCCAGAATATATTCCAGGTGAGGATTCAGTAT
TTCTTTTCAATGCATTGATAAATGCAAATGGAATTGCTTTTATAAATAAAATCATTTATTCTTACACAACATTTCGTGAT
GGAGATAATAAATCAGTAAGTTTCCAAGTAGATTTAGATAAAAATCTTGGTCGTATAAAAGCATATAGCCTCATGTTAGA
AATCAGTGAAGAAAAAGGTATTGTTGATGAATATGTTCACTATATTCTAGCAAATAAATTAACATATTTCTTAAAAAATT
TCATAATTAAACCAGAATATATTCCTGAAACTGATATGTCCTGTATTTTTGATAAGGGTTATGACTTATTTAATGAAGTT
AATAAAAGAGATAATGGTGTATTAAGTCCTGAATTTAAGAATATTATTGAAAATATTGCCAATAGAAATTATGATGAGAC
AATAACTGCTTGTTATGAATATAAGAGAAAATATTTTAGTAACATTGTTAAAGTTAATAAGAAAACTATTCCATTTGCAA
AGGATATGAATGTAGCTGTAGTATTAGATCCATTTACATATAATTCATATAGTAATGAATTCAATGCAATACCTGTTGAA
CCAGAAAACTGGCATGAAAAATTTGAAAGAGAGGATATAGATTTATTTTTCTGTGAATCAGCATTTAGTGGTGTTGGTGA
AGGTAATCTTGTAGAGGGAATAGCTGTTGAAAATAATTATTCTCCATGGGGTGGAAAAATAGGTGTTAACCTGATTCATG
GATGGGATAGTAGAAATCAATTAATGGATATATTAAAGTACTGTAAAGAACATGGAATTCCTACAATTTTTTGGAATAAA
GAGGATCCAACATCATTTGATAACCCTAATTACAACTTTATAGACACAGCTCTACATTTTGATTATATTTTTACAACAGA
TGAAGATAGTATAATAAGATATAGGGCAAGAGGACATGAAAATGTTCATGTACTCTTATTTGCTTCTCAAATAAATCTAT
TCAATCCTATTAGTACTAAACGTTCTAATGATATTATATTTGCAGGTAGTTGGTATAATCAATTCGAAAATCGTTGTAAA
ACAATGTGTGATTTTTTTGATAAGGTAATTAACAGTAAGTATGGCTTAAAAATATATAACAGGGCATCTGATTCAACTGC
TGAAAATAGAATGTTTCCAGCAAAATATGATCCATTCATATATCCAAAAGTATCCTTTGATAAAATGCCCTCAGTATATA
AAGAAAGTAAAATGGCATTAAATATTAACACAGTTACTGATTCACACACCATGTTTGCAAGACGAGTTTATGAATTAATG
TCTTCAAATACATTCATACTATCAAACTATTCTAAAGGAATATATGAACTTTTTAAAGATAATGTTATCTATTTAGATAG
AATAGATTCTTTGGATTTATCTGAAGAAGAAATATCCAAAATCTGTGAAGAAAATCTCTATGATGTTCTTCAAAATCATA
CATATAGTAATAGATTTAAATATATTTTAAATTGTATTGGTTTTAAATATAAAGAAAGTATTGAAGAAGTAAATATAATA
TATAAGGCAGAAAATGATGATGAAATAGATGAAATCATAGCAGATTTTAATTCTATTGACTATTATTACAAGAACTGTTT
TATTCTCTCTAAAAATGTAAATCTTAAAAAATCAGTTGAGAATAAAGATAATAATATAACATTTATGGACTATGAAAATA
TTTACTTCTTATCTGAAAATAGCTCAGATGAAAATTATTTCTTATTTAGAAATATGGATAATAAAATCAATTCTGACTTT
ATTAAAAAGGCATTGTTACATTATAAATATTTAGAAAATAACATAGGCATAAAAGAAAATAATGAAAAATATGTATTTAA
CAAAACAAGAGAATATGAAGATACATTATTTAATATGAGTCAATTTGATAATATAATTGAAGTTTTACTAAAATACAGAA
TAAATAAATTCTCTGTTTATAATATTTAG
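
The GC content field can in principle be recomputed from the gene sequence above. This is a minimal sketch, assuming GC content is the percentage of G and C bases over the full sequence length; `gc_content` is a hypothetical helper, not part of any tool used to build this record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and line breaks."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return round(100.0 * gc / len(seq), 2)


# Small illustration on the first ten bases of the gene.
print(gc_content("ATGCATCCTT"))  # 40.0
```

Pasting the full 2589-base sequence into `gc_content` should give a value comparable to the one listed in the record, provided the same definition was used there.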

Upstream 100 bases:

>100_bases
TTTGATACAAGAAATATTATAGATAAACATTCAATTCCAGATGATATTATATTATATAACTTTGGAAATTTATATGATGT
AGAATAGTGGTGACTTTTCA

Downstream 100 bases:

>100_bases
TTAAGGTGGGATTATTATTAAAAAGGAAATAAAAATTTCTCTCTGTTTATCAGTTTTTCCAGATAAAAAATCAACAGCAG
AGTGTTTTGAAAGCTTACTT

Product: glycosyltransferase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 862; Mature: 862

Protein sequence:

>862_residues
MHPFISVIIPVYNVQFTIESCFLSLKNQTIGFENLEIIFVDDCSTDNSPTIIGNYEKKYDNVKAIYSKENSGVAGKPRNM
GMNIATAKYVMFLDPDDTFTVDACEVLYNEIEKSKADIVSGLHSKKNKFNNNEEIFPGLIINTFSDPEKSWSDRQGDVDV
FKQKYPKHFYMDSIEDKFSVLGNFGLSSKIFNLDFIKSNNISFPEYIPGEDSVFLFNALINANGIAFINKIIYSYTTFRD
GDNKSVSFQVDLDKNLGRIKAYSLMLEISEEKGIVDEYVHYILANKLTYFLKNFIIKPEYIPETDMSCIFDKGYDLFNEV
NKRDNGVLSPEFKNIIENIANRNYDETITACYEYKRKYFSNIVKVNKKTIPFAKDMNVAVVLDPFTYNSYSNEFNAIPVE
PENWHEKFEREDIDLFFCESAFSGVGEGNLVEGIAVENNYSPWGGKIGVNLIHGWDSRNQLMDILKYCKEHGIPTIFWNK
EDPTSFDNPNYNFIDTALHFDYIFTTDEDSIIRYRARGHENVHVLLFASQINLFNPISTKRSNDIIFAGSWYNQFENRCK
TMCDFFDKVINSKYGLKIYNRASDSTAENRMFPAKYDPFIYPKVSFDKMPSVYKESKMALNINTVTDSHTMFARRVYELM
SSNTFILSNYSKGIYELFKDNVIYLDRIDSLDLSEEEISKICEENLYDVLQNHTYSNRFKYILNCIGFKYKESIEEVNII
YKAENDDEIDEIIADFNSIDYYYKNCFILSKNVNLKKSVENKDNNITFMDYENIYFLSENSSDENYFLFRNMDNKINSDF
IKKALLHYKYLENNIGIKENNEKYVFNKTREYEDTLFNMSQFDNIIEVLLKYRINKFSVYNI
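
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below assumes the standard codon assignments (the record does not state which translation table was used) and stops at the first stop codon; `translate` and the table construction are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard codon assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the gene sequence.
print(translate("ATGCATCCTTTTATTAGTGTAATT"))  # MHPFISVI
```

The output matches the first eight residues of the protein sequence listed above.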

Sequences:

>Translated_862_residues
MHPFISVIIPVYNVQFTIESCFLSLKNQTIGFENLEIIFVDDCSTDNSPTIIGNYEKKYDNVKAIYSKENSGVAGKPRNM
GMNIATAKYVMFLDPDDTFTVDACEVLYNEIEKSKADIVSGLHSKKNKFNNNEEIFPGLIINTFSDPEKSWSDRQGDVDV
FKQKYPKHFYMDSIEDKFSVLGNFGLSSKIFNLDFIKSNNISFPEYIPGEDSVFLFNALINANGIAFINKIIYSYTTFRD
GDNKSVSFQVDLDKNLGRIKAYSLMLEISEEKGIVDEYVHYILANKLTYFLKNFIIKPEYIPETDMSCIFDKGYDLFNEV
NKRDNGVLSPEFKNIIENIANRNYDETITACYEYKRKYFSNIVKVNKKTIPFAKDMNVAVVLDPFTYNSYSNEFNAIPVE
PENWHEKFEREDIDLFFCESAFSGVGEGNLVEGIAVENNYSPWGGKIGVNLIHGWDSRNQLMDILKYCKEHGIPTIFWNK
EDPTSFDNPNYNFIDTALHFDYIFTTDEDSIIRYRARGHENVHVLLFASQINLFNPISTKRSNDIIFAGSWYNQFENRCK
TMCDFFDKVINSKYGLKIYNRASDSTAENRMFPAKYDPFIYPKVSFDKMPSVYKESKMALNINTVTDSHTMFARRVYELM
SSNTFILSNYSKGIYELFKDNVIYLDRIDSLDLSEEEISKICEENLYDVLQNHTYSNRFKYILNCIGFKYKESIEEVNII
YKAENDDEIDEIIADFNSIDYYYKNCFILSKNVNLKKSVENKDNNITFMDYENIYFLSENSSDENYFLFRNMDNKINSDF
IKKALLHYKYLENNIGIKENNEKYVFNKTREYEDTLFNMSQFDNIIEVLLKYRINKFSVYNI
>Mature_862_residues
MHPFISVIIPVYNVQFTIESCFLSLKNQTIGFENLEIIFVDDCSTDNSPTIIGNYEKKYDNVKAIYSKENSGVAGKPRNM
GMNIATAKYVMFLDPDDTFTVDACEVLYNEIEKSKADIVSGLHSKKNKFNNNEEIFPGLIINTFSDPEKSWSDRQGDVDV
FKQKYPKHFYMDSIEDKFSVLGNFGLSSKIFNLDFIKSNNISFPEYIPGEDSVFLFNALINANGIAFINKIIYSYTTFRD
GDNKSVSFQVDLDKNLGRIKAYSLMLEISEEKGIVDEYVHYILANKLTYFLKNFIIKPEYIPETDMSCIFDKGYDLFNEV
NKRDNGVLSPEFKNIIENIANRNYDETITACYEYKRKYFSNIVKVNKKTIPFAKDMNVAVVLDPFTYNSYSNEFNAIPVE
PENWHEKFEREDIDLFFCESAFSGVGEGNLVEGIAVENNYSPWGGKIGVNLIHGWDSRNQLMDILKYCKEHGIPTIFWNK
EDPTSFDNPNYNFIDTALHFDYIFTTDEDSIIRYRARGHENVHVLLFASQINLFNPISTKRSNDIIFAGSWYNQFENRCK
TMCDFFDKVINSKYGLKIYNRASDSTAENRMFPAKYDPFIYPKVSFDKMPSVYKESKMALNINTVTDSHTMFARRVYELM
SSNTFILSNYSKGIYELFKDNVIYLDRIDSLDLSEEEISKICEENLYDVLQNHTYSNRFKYILNCIGFKYKESIEEVNII
YKAENDDEIDEIIADFNSIDYYYKNCFILSKNVNLKKSVENKDNNITFMDYENIYFLSENSSDENYFLFRNMDNKINSDF
IKKALLHYKYLENNIGIKENNEKYVFNKTREYEDTLFNMSQFDNIIEVLLKYRINKFSVYNI

Specific function: May be involved in the production of the exopolysaccharide (EPS) component of the extracellular matrix during biofilm formation. EPS is responsible for the adhesion of chains of cells into bundles. Required for biofilm maintenance [H]

COG id: COG0463

COG function: function code M; Glycosyltransferases involved in cell wall biogenesis

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the glycosyltransferase 2 family [H]

Homologues:

Organism=Escherichia coli, GI1790044, Length=223, Percent_Identity=29.5964125560538, Blast_Score=74, Evalue=4e-14,
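
Homologue entries use a comma-separated `key=value` layout, with the GI number given as a bare `GI…` token. A minimal parsing sketch follows, assuming every homologue line has this shape; `parse_homologue` is a hypothetical helper.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict with typed numeric fields."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:
            continue  # skips the empty field left by a trailing comma
        if "=" in field:
            key, value = field.split("=", 1)
            record[key.strip()] = value.strip()
        elif field.startswith("GI"):
            record["GI"] = field[2:]
    casts = {"Length": int, "Percent_Identity": float,
             "Blast_Score": float, "Evalue": float}
    for key, cast in casts.items():
        if key in record:
            record[key] = cast(record[key])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1790044, Length=223, "
    "Percent_Identity=29.5964125560538, Blast_Score=74, Evalue=4e-14,"
)
print(hit["Organism"], hit["GI"], hit["Length"], hit["Evalue"])
```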

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro: IPR001173 [H]

Pfam domain/function: PF00535 Glycos_transf_2 [H]

EC number: 2.-.-.- [C]

Molecular weight: Translated: 100757; Mature: 100757

Theoretical pI: Translated: 4.74; Mature: 4.74

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.4 %Cys     (Translated Protein)
2.1 %Met     (Translated Protein)
3.5 %Cys+Met (Translated Protein)
1.4 %Cys     (Mature Protein)
2.1 %Met     (Mature Protein)
3.5 %Cys+Met (Mature Protein)
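
The Cys/Met percentages are residue counts divided by the protein length. A minimal sketch, assuming one-letter amino acid codes and rounding to one decimal place as in the table above; `residue_percent` is an illustrative name.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(protein.count(r) for r in residues)
    return round(100.0 * hits / len(protein), 1)


# Toy ten-residue peptide with one Cys and one Met.
toy = "MCAAAAAAAA"
print(residue_percent(toy, "C"))   # 10.0
print(residue_percent(toy, "M"))   # 10.0
print(residue_percent(toy, "CM"))  # 20.0
```

Applied to the 862-residue sequence, values of 1.4 % and 2.1 % would correspond to roughly 12 cysteines and 18 methionines.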

Secondary structure:

>Translated Secondary Structure
MHPFISVIIPVYNVQFTIESCFLSLKNQTIGFENLEIIFVDDCSTDNSPTIIGNYEKKYD
CCCHHEEEEEHEEEEEEHHHHHHHHHCCCCCCCCEEEEEEECCCCCCCCEEEECCHHHHC
NVKAIYSKENSGVAGKPRNMGMNIATAKYVMFLDPDDTFTVDACEVLYNEIEKSKADIVS
CEEEEEECCCCCCCCCCCCCCCEEEEEEEEEEECCCCCEEHHHHHHHHHHHHHHHHHHHH
GLHSKKNKFNNNEEIFPGLIINTFSDPEKSWSDRQGDVDVFKQKYPKHFYMDSIEDKFSV
HHHHHHHCCCCCCCCCCCEEEECCCCCCCCCCCCCCCHHHHHHHCCCHHHHCCHHHHHHH
LGNFGLSSKIFNLDFIKSNNISFPEYIPGEDSVFLFNALINANGIAFINKIIYSYTTFRD
HHCCCCCCEEEEEEEEECCCCCCCCCCCCCCCEEEEEEHHCCCCHHHHHHHHHHHHEEEC
GDNKSVSFQVDLDKNLGRIKAYSLMLEISEEKGIVDEYVHYILANKLTYFLKNFIIKPEY
CCCCEEEEEEEECCCCCCEEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCC
IPETDMSCIFDKGYDLFNEVNKRDNGVLSPEFKNIIENIANRNYDETITACYEYKRKYFS
CCCCCCHHHHHCCHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHH
NIVKVNKKTIPFAKDMNVAVVLDPFTYNSYSNEFNAIPVEPENWHEKFEREDIDLFFCES
HHHHCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCEECCCCCHHHHHHCCCCCEEEEECH
AFSGVGEGNLVEGIAVENNYSPWGGKIGVNLIHGWDSRNQLMDILKYCKEHGIPTIFWNK
HHCCCCCCCEEEEEEEECCCCCCCCCEEEEEEECCCCHHHHHHHHHHHHHCCCCEEEECC
EDPTSFDNPNYNFIDTALHFDYIFTTDEDSIIRYRARGHENVHVLLFASQINLFNPISTK
CCCCCCCCCCCCEEEEEEEEEEEEECCCCCEEEEECCCCCCEEEEEEEECCCCCCCCCCC
RSNDIIFAGSWYNQFENRCKTMCDFFDKVINSKYGLKIYNRASDSTAENRMFPAKYDPFI
CCCCEEEECCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCHHCCCCCCCCCCEE
YPKVSFDKMPSVYKESKMALNINTVTDSHTMFARRVYELMSSNTFILSNYSKGIYELFKD
CCCCCHHHCCHHHHCCCEEEEEEECCCCHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHC
NVIYLDRIDSLDLSEEEISKICEENLYDVLQNHTYSNRFKYILNCIGFKYKESIEEVNII
CEEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCHHHHHHHHEEEE
YKAENDDEIDEIIADFNSIDYYYKNCFILSKNVNLKKSVENKDNNITFMDYENIYFLSEN
EECCCCCHHHHHHHHHHHCCEEEEEEEEEECCCCCHHHCCCCCCCEEEEEECCEEEEECC
SSDENYFLFRNMDNKINSDFIKKALLHYKYLENNIGIKENNEKYVFNKTREYEDTLFNMS
CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCHHHHHHCHH
QFDNIIEVLLKYRINKFSVYNI
HHHHHHHHHHHHCCCCEEEECC
>Mature Secondary Structure
MHPFISVIIPVYNVQFTIESCFLSLKNQTIGFENLEIIFVDDCSTDNSPTIIGNYEKKYD
CCCHHEEEEEHEEEEEEHHHHHHHHHCCCCCCCCEEEEEEECCCCCCCCEEEECCHHHHC
NVKAIYSKENSGVAGKPRNMGMNIATAKYVMFLDPDDTFTVDACEVLYNEIEKSKADIVS
CEEEEEECCCCCCCCCCCCCCCEEEEEEEEEEECCCCCEEHHHHHHHHHHHHHHHHHHHH
GLHSKKNKFNNNEEIFPGLIINTFSDPEKSWSDRQGDVDVFKQKYPKHFYMDSIEDKFSV
HHHHHHHCCCCCCCCCCCEEEECCCCCCCCCCCCCCCHHHHHHHCCCHHHHCCHHHHHHH
LGNFGLSSKIFNLDFIKSNNISFPEYIPGEDSVFLFNALINANGIAFINKIIYSYTTFRD
HHCCCCCCEEEEEEEEECCCCCCCCCCCCCCCEEEEEEHHCCCCHHHHHHHHHHHHEEEC
GDNKSVSFQVDLDKNLGRIKAYSLMLEISEEKGIVDEYVHYILANKLTYFLKNFIIKPEY
CCCCEEEEEEEECCCCCCEEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCC
IPETDMSCIFDKGYDLFNEVNKRDNGVLSPEFKNIIENIANRNYDETITACYEYKRKYFS
CCCCCCHHHHHCCHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHH
NIVKVNKKTIPFAKDMNVAVVLDPFTYNSYSNEFNAIPVEPENWHEKFEREDIDLFFCES
HHHHCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCEECCCCCHHHHHHCCCCCEEEEECH
AFSGVGEGNLVEGIAVENNYSPWGGKIGVNLIHGWDSRNQLMDILKYCKEHGIPTIFWNK
HHCCCCCCCEEEEEEEECCCCCCCCCEEEEEEECCCCHHHHHHHHHHHHHCCCCEEEECC
EDPTSFDNPNYNFIDTALHFDYIFTTDEDSIIRYRARGHENVHVLLFASQINLFNPISTK
CCCCCCCCCCCCEEEEEEEEEEEEECCCCCEEEEECCCCCCEEEEEEEECCCCCCCCCCC
RSNDIIFAGSWYNQFENRCKTMCDFFDKVINSKYGLKIYNRASDSTAENRMFPAKYDPFI
CCCCEEEECCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCHHCCCCCCCCCCEE
YPKVSFDKMPSVYKESKMALNINTVTDSHTMFARRVYELMSSNTFILSNYSKGIYELFKD
CCCCCHHHCCHHHHCCCEEEEEEECCCCHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHC
NVIYLDRIDSLDLSEEEISKICEENLYDVLQNHTYSNRFKYILNCIGFKYKESIEEVNII
CEEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCHHHHHHHHEEEE
YKAENDDEIDEIIADFNSIDYYYKNCFILSKNVNLKKSVENKDNNITFMDYENIYFLSEN
EECCCCCHHHHHHHHHHHCCEEEEEEEEEECCCCCHHHCCCCCCCEEEEEECCEEEEECC
SSDENYFLFRNMDNKINSDFIKKALLHYKYLENNIGIKENNEKYVFNKTREYEDTLFNMS
CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCHHHHHHCHH
QFDNIIEVLLKYRINKFSVYNI
HHHHHHHHHHHHCCCCEEEECC
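
The predicted secondary structure pairs each 60-residue sequence line with a line of state letters. The sketch below summarises such a state string, assuming the common convention H = helix, E = strand, C = coil (the record itself does not define the letters); `ss_composition` is a hypothetical helper.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of residues in each state (H, E, C), one decimal place."""
    states = "".join(states.split()).upper()
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {s: round(100.0 * counts.get(s, 0) / len(states), 1) for s in "HEC"}


print(ss_composition("HHEC"))  # {'H': 50.0, 'E': 25.0, 'C': 25.0}
```

Concatenating only the state lines (every second line of the block above) before calling the function gives a whole-protein summary that can be compared with the "Alpha Beta" structure class.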

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8969506; 9384377 [H]