Definition Akkermansia muciniphila ATCC BAA-835, complete genome.
Accession NC_010655
Length 2,664,102

The map label for this gene is epsH [H]

Identifier: 187735259

GI number: 187735259

Start: 887433

End: 888458

Strand: Reverse

Name: epsH [H]

Synonym: Amuc_0754

Alternate gene names: 187735259

Gene position: 888458-887433 (Counterclockwise)

Preceding gene: 187735260

Following gene: 187735258

Centisome position: 33.35
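
The coordinate fields above are related by simple arithmetic. The following is a minimal sketch, not the database's own code. The function names are hypothetical, and the centisome formula (position as a percentage of genome length) is an assumption, since the record does not state how the value is derived. Under that assumption the reported 33.35 is obtained from the higher coordinate (888458), which is where a reverse-strand gene begins.

```python
# Values copied from the record above.
GENOME_LENGTH = 2_664_102
START, END = 887_433, 888_458


def gene_length(start, end):
    """Length in bases of an inclusive, 1-based coordinate range."""
    return end - start + 1


def centisome(position, genome_length):
    """Position as a percentage of the genome length (assumed definition)."""
    return round(100.0 * position / genome_length, 2)


print(gene_length(START, END))        # 1026, matching the ">1026_bases" header
print(centisome(END, GENOME_LENGTH))  # 33.35, matching the record
```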

GC content: 57.31
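
The GC content value can be recomputed from the gene sequence given in this record. The sketch below assumes the usual definition (G plus C bases as a percentage of all bases). The helper name `gc_percent` is hypothetical and not part of the database. For a 1026-base gene, 57.31 corresponds to 588 G or C bases.

```python
def gc_percent(sequence):
    """Return G+C as a percentage of sequence length, rounded to 2 decimals."""
    # Drop line breaks and spaces so a wrapped FASTA body can be pasted in directly.
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# Example on the first 24 bases of the gene sequence in this record.
print(gc_percent("ATGACGGCTCCCGTAAACATTTCC"))
```

Passing the full 1026-base sequence to the same function should allow the reported figure to be checked.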

Gene sequence:

>1026_bases
ATGACGGCTCCCGTAAACATTTCCGTGCTGGTTCCGGTTTACAATGTGGAGCCGTACCTGGCCCAGTGCCTGGAAAGCAT
TTGCTCACAGACGCTCCGTGAGCTGGAAATAGTTTGCGTGGATGACGCTTCCACGGACGGTTCCCTGTCCATTCTGCGGG
AATTCGCGGAGCGGGACCCGCGGGTGAAGGTCGTGCAGGCTCCGGAAAACGGCGGCTTATCCCGCTCCCGGAATCTGGCG
ATGAGCCATGCTGTGGGAGAATATCTGTTTCTGGTGGATTCCGACGACTGGCTGGAAACGGATTTGCTGGAGGAGATGTA
CCGCCGTGCGAAGGCGCTGGATGCCGACAGGCTGGCATGCGGGTTCCGGTATTATTACGAGTCCGCCCCGGACCGGGAGG
ACCGGTTTCTGCCGGAGGACATGGCCCCTCCGGAAAAAGGGTGGCTTCCCTGCACTCCGGAGACCATTGGGAAAATACAT
CATGGAGCGGGCGGCATGATGATCAGGCGTTCCATTGTGGAAAAGCATGGCATCCGGTTCCCCGAGGGCGTTGCCTGTGA
AGACCTGTATTTCCATTACGCCGTTTTTCCATGGTGCAGGAGGGTTTGCGTCGTCAGCAGGGCGGCTTACGTTTACCGTA
AGCGGGCCGGATCCATTACCAGCGGTTTTGCGTCCGGCAGTTCCCTCCAGTCGCTGGATTACCTGACGGTGGCGGAACTG
GTGCTGAAGGAATGGAAAGAAGCCGGGATTCTTGAGGAATACAGGACGGCATTTTTGAAAATGCTGGTAATGGGCGTGAG
GAACATCCGCAAATATGCCCCTCATGCCGTCCAAAAGGAGGTTACCCGGAAGGTGACTGATATGCTCCGTCAGGAAAATC
TGTACCGTCCCGCAGAGGATGATGCCTGCCTGTCCCGCCGGGAAGGAAAATTACTGAAAGCCTGGATGGGCGGAAAATCC
GGCTTGGACTTTTCCTATTACTGGAAGAAAATGCGCAAGGCGGGAGCCCGGTTGCTGCGCCGCTGA

Upstream 100 bases:

>100_bases
AGGAAATTACAGTGGACCAGGTATGGTCCGCGCTGGAGCCCTTTCTGCCGCCGCAGGAAGGATTTGAACAAATAAGGGCG
GCAGATTCTCCGCATGGCTC

Downstream 100 bases:

>100_bases
CATCACCCGAATGAATAATCCGATGAAAAAGAATGAATTTGCCGTCGTCCTGGCCAGTGACAACAGGGGCATTCTACCTT
TGAGCGTTACTGTCTTTTCT

Product: glycosyl transferase family 2

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 341; Mature: 340
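
The two residue counts follow from the gene length: 1026 bases give 342 codons, the final codon (TGA) is a stop, leaving 341 translated residues, and the mature form lacks the initiator methionine, leaving 340. The sketch below illustrates this with a small translator. It assumes the standard codon assignments (which bacterial translation table 11 shares for elongation codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code laid out in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the gene sequence in this record.
print(translate("ATGACGGCTCCCGTAAACATTTCC"))  # MTAPVNIS

# Length arithmetic for the full gene.
codons = 1026 // 3       # 342 codons including the stop
translated = codons - 1  # 341 residues
mature = translated - 1  # 340 residues after removal of the initiator Met
print(codons, translated, mature)
```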

Protein sequence:

>341_residues
MTAPVNISVLVPVYNVEPYLAQCLESICSQTLRELEIVCVDDASTDGSLSILREFAERDPRVKVVQAPENGGLSRSRNLA
MSHAVGEYLFLVDSDDWLETDLLEEMYRRAKALDADRLACGFRYYYESAPDREDRFLPEDMAPPEKGWLPCTPETIGKIH
HGAGGMMIRRSIVEKHGIRFPEGVACEDLYFHYAVFPWCRRVCVVSRAAYVYRKRAGSITSGFASGSSLQSLDYLTVAEL
VLKEWKEAGILEEYRTAFLKMLVMGVRNIRKYAPHAVQKEVTRKVTDMLRQENLYRPAEDDACLSRREGKLLKAWMGGKS
GLDFSYYWKKMRKAGARLLRR

Sequences:

>Translated_341_residues
MTAPVNISVLVPVYNVEPYLAQCLESICSQTLRELEIVCVDDASTDGSLSILREFAERDPRVKVVQAPENGGLSRSRNLA
MSHAVGEYLFLVDSDDWLETDLLEEMYRRAKALDADRLACGFRYYYESAPDREDRFLPEDMAPPEKGWLPCTPETIGKIH
HGAGGMMIRRSIVEKHGIRFPEGVACEDLYFHYAVFPWCRRVCVVSRAAYVYRKRAGSITSGFASGSSLQSLDYLTVAEL
VLKEWKEAGILEEYRTAFLKMLVMGVRNIRKYAPHAVQKEVTRKVTDMLRQENLYRPAEDDACLSRREGKLLKAWMGGKS
GLDFSYYWKKMRKAGARLLRR
>Mature_340_residues
TAPVNISVLVPVYNVEPYLAQCLESICSQTLRELEIVCVDDASTDGSLSILREFAERDPRVKVVQAPENGGLSRSRNLAM
SHAVGEYLFLVDSDDWLETDLLEEMYRRAKALDADRLACGFRYYYESAPDREDRFLPEDMAPPEKGWLPCTPETIGKIHH
GAGGMMIRRSIVEKHGIRFPEGVACEDLYFHYAVFPWCRRVCVVSRAAYVYRKRAGSITSGFASGSSLQSLDYLTVAELV
LKEWKEAGILEEYRTAFLKMLVMGVRNIRKYAPHAVQKEVTRKVTDMLRQENLYRPAEDDACLSRREGKLLKAWMGGKSG
LDFSYYWKKMRKAGARLLRR

Specific function: May be involved in the production of the exopolysaccharide (EPS) component of the extracellular matrix during biofilm formation. EPS is responsible for the adhesion of chains of cells into bundles. Required for biofilm maintenance [H]

COG id: COG0463

COG function: function code M; Glycosyltransferases involved in cell wall biogenesis

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the glycosyltransferase 2 family [H]

Homologues:

Organism=Escherichia coli, GI1790044, Length=196, Percent_Identity=28.0612244897959, Blast_Score=83, Evalue=2e-17

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001173 [H]

Pfam domain/function: PF00535 Glycos_transf_2 [H]

EC number: 2.-.-.- [C]

Molecular weight: Translated: 38904; Mature: 38773

Theoretical pI: Translated: 8.13; Mature: 8.13

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

2.6 %Cys     (Translated Protein)
3.2 %Met     (Translated Protein)
5.9 %Cys+Met (Translated Protein)
2.6 %Cys     (Mature Protein)
2.9 %Met     (Mature Protein)
5.6 %Cys+Met (Mature Protein)
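
The percentages above can be recomputed by counting residues in the protein sequence. The sketch below is one way to do it, assuming each figure is a simple residue count divided by sequence length and rounded to one decimal. The helper name `residue_percent` is hypothetical. The sequence is the translated protein from this record, and the mature form is taken as the same sequence without its first residue.

```python
# Translated protein sequence copied from this record (341 residues).
PROTEIN = (
    "MTAPVNISVLVPVYNVEPYLAQCLESICSQTLRELEIVCVDDASTDGSLSILREFAERDPRVKVVQAPENGGLSRSRNLA"
    "MSHAVGEYLFLVDSDDWLETDLLEEMYRRAKALDADRLACGFRYYYESAPDREDRFLPEDMAPPEKGWLPCTPETIGKIH"
    "HGAGGMMIRRSIVEKHGIRFPEGVACEDLYFHYAVFPWCRRVCVVSRAAYVYRKRAGSITSGFASGSSLQSLDYLTVAEL"
    "VLKEWKEAGILEEYRTAFLKMLVMGVRNIRKYAPHAVQKEVTRKVTDMLRQENLYRPAEDDACLSRREGKLLKAWMGGKS"
    "GLDFSYYWKKMRKAGARLLRR"
)


def residue_percent(sequence, residues):
    """Percentage of positions in `sequence` occupied by any letter in `residues`."""
    count = sum(sequence.count(r) for r in residues)
    return round(100.0 * count / len(sequence), 1)


for label, seq in (("Translated", PROTEIN), ("Mature", PROTEIN[1:])):
    print(label,
          residue_percent(seq, "C"),
          residue_percent(seq, "M"),
          residue_percent(seq, "CM"))
```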

Secondary structure:

>Translated Secondary Structure
MTAPVNISVLVPVYNVEPYLAQCLESICSQTLRELEIVCVDDASTDGSLSILREFAERDP
CCCCCEEEEEEEEECCCHHHHHHHHHHHHHHHHHCEEEEEECCCCCCHHHHHHHHHHCCC
RVKVVQAPENGGLSRSRNLAMSHAVGEYLFLVDSDDWLETDLLEEMYRRAKALDADRLAC
CEEEEECCCCCCCCHHHHHHHHHHHCCEEEEECCCCHHHHHHHHHHHHHHHHCCHHHHHH
GFRYYYESAPDREDRFLPEDMAPPEKGWLPCTPETIGKIHHGAGGMMIRRSIVEKHGIRF
HHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHCCCC
PEGVACEDLYFHYAVFPWCRRVCVVSRAAYVYRKRAGSITSGFASGSSLQSLDYLTVAEL
CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHCCCCCCCHHHHHHHHHHHH
VLKEWKEAGILEEYRTAFLKMLVMGVRNIRKYAPHAVQKEVTRKVTDMLRQENLYRPAED
HHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCCCCCC
DACLSRREGKLLKAWMGGKSGLDFSYYWKKMRKAGARLLRR
HHHHHHHCCHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHCC
>Mature Secondary Structure 
TAPVNISVLVPVYNVEPYLAQCLESICSQTLRELEIVCVDDASTDGSLSILREFAERDP
CCCCEEEEEEEEECCCHHHHHHHHHHHHHHHHHCEEEEEECCCCCCHHHHHHHHHHCCC
RVKVVQAPENGGLSRSRNLAMSHAVGEYLFLVDSDDWLETDLLEEMYRRAKALDADRLAC
CEEEEECCCCCCCCHHHHHHHHHHHCCEEEEECCCCHHHHHHHHHHHHHHHHCCHHHHHH
GFRYYYESAPDREDRFLPEDMAPPEKGWLPCTPETIGKIHHGAGGMMIRRSIVEKHGIRF
HHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHCCCC
PEGVACEDLYFHYAVFPWCRRVCVVSRAAYVYRKRAGSITSGFASGSSLQSLDYLTVAEL
CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHCCCCCCCHHHHHHHHHHHH
VLKEWKEAGILEEYRTAFLKMLVMGVRNIRKYAPHAVQKEVTRKVTDMLRQENLYRPAED
HHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCCCCCC
DACLSRREGKLLKAWMGGKSGLDFSYYWKKMRKAGARLLRR
HHHHHHHCCHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 8969506; 9384377 [H]