Definition: Clostridium botulinum A2 str. Kyoto chromosome, complete genome.
Accession: NC_012563
Length: 4,155,278 bp

The map label for this gene is epsE [H]

Identifier: 226949381

GI number: 226949381

Start: 2377267

End: 2379888

Strand: Direct

Name: epsE [H]

Synonym: CLM_2306

Alternate gene names: 226949381

Gene position: 2377267-2379888 (Clockwise)

Preceding gene: 226949380

Following gene: 226949382

Centisome position: 57.21
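
The centisome position appears to be the gene start expressed as a percentage of the chromosome length. The sketch below reproduces the value from the Start and Length fields of this record; the function name `centisome_position` is hypothetical, and the formula is an assumption inferred from the numbers rather than a documented definition.

```python
def centisome_position(start: int, genome_length: int) -> float:
    """Return the gene start as a percentage of the chromosome length.

    Assumption: centisome = 100 * start / genome_length.
    """
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    return 100.0 * start / genome_length


# Start and Length values taken from this record.
position = centisome_position(2377267, 4155278)
print(f"{position:.2f}")  # prints 57.21
```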

GC content (%): 31.31

Gene sequence:

>2622_bases
ATGAAACCAGAGATTAGTGTTATTATGCCAGTATACAATTGTAAACAGTATATCTTTGAATCCATTAAAAGCATATGTAA
TCAAACTTTTGAAAACTGGGAACTTATTATTATTAATGATAATTCTATAGAAAATATTGAGGAAGAAATAAAAAAAATAC
AGGATGACAGAATTCATTATCATGCTTTTGTAGAACATGAAGGCTTATTTAATTCATTAGAATATGGGTTGCAGCAAGCA
CAGGGAGATTTTATCACATTTCATGATCCAGATGATATCAGTTCTCCTACCAGGTTTAATGAACAGCTTAATTACCTTAA
ATCCAATGATGATTTAGGGATGGTTTCATGCCTTATCAGATGTTTTACTAACGATACTAGTTATCGTAATGCCTGTACTT
TTATAGAGAAAATTCAAAATGCTTATATATCCAGAGAACAAATTGAAAATGCAATTATCAATAAATTTTCTCCAGTTATA
TTTCCTACAATAATGATGCGTAGAAGTCTATTAGATGGAATTGAATTCCATAAAGAAGAAAATGAACTAGAAGACTACTT
TCAGATATTTTTATATTTGCTTAAACAAGGTAGATTAGAAAAGGTCAATAGCGTTCTTTATTATTATAGAAGGCATAAAA
ATTCTTATCATATTCAAAATGAAAAAAATTATTCTGAAACAGTACAAGCTCAGCTAAGTAAAAGCGGAATACAAAATTTT
ATAAAATATAGGGAACTCTACAAAGATTTAAAAAAAGAGCAGTATATAGTCAGTAGATCGAAGAAAGACAGCCCCTTAAG
AATATTGATGCTTATAGATGCCCTAAATATTGGTGGAACAGAAATGTATGTATTAGAGCTTGCAAAATCGCTGGAAAAGT
TAGGGGCACATGTAGTTATTGGAACCTCTGGGGGCCCACTAGTAGAAGTATTCCAACATTATGGATTAAAAGTTGTAAAA
ATTCCTTTTACTAGCGACTATATTTCTAATAAAGACATTATGAAGCTAATTAAATTAACAAAGAAAATAATAGATGAAGA
AAAAATCAATTTGCTTCACTGCCACCTTTTTGCAAGTATGCGTTTAGGAAATGATATTTATAGGAGCTACAAGATCCCAT
ATATAGTCACGTTACATGGTTTATTTTATCCTAATGACGTACTATTTGAGTCCTGTATCAATGCAACTAAAATTATTGCA
GTAAGCAAGCCAATTAAAAAACTAATTGAATCAAAATTAGGCTCAAGAATAAGAGGAGAAATTATGGTGCTTCCAAATGG
AATTGATATGGAAAATTTTCATCCTCAGCATACAGTAAAAAGTTCAAAAAATCAGCTGGGTATACCTGAAAACTCTCAAA
TTATAACTTACTGTAGTCGTCTGGATTGGGGAAAAACTTTTGCTGCCGAAGCTTTTATTTTTGCATGTTTTACTTTAATG
ACTAAGAATGAACATCTACATGCTTTTGTTATTGGAGATGGAGCTGATAAAAATTTAATCACGCATGAAGTAAACATTCT
GAATAAAATATTAAAAAGAGATGCAATTCATGTAGTAGGTGCGAAATTTAACGTACTGCCTTATTATCAGAATGCAGATA
TTGTAGTTGGTACTGCAAGAGTGGCACTTGAGGCCATGAGTTGTGGTAAGCCAGTTATTGCCGTAGGGAATCATGGATAT
ACCGGAGTCATTAATCATAGATGTATGAACGAGCAATGGAATATGTATTTTGGTGATCATGATTCAATAAAAAAAGCAGA
TCCCTTAGTACTTGCAAAAGATTTAGATGGATTGCTACAAGATACCAAAGCATGTAAATCCTTGGGAAAATGGGGTAGAC
GCTGGTGTGAAGAAAAATTTGATAATCGACTTGTAGCAAAAGATATTTTTAACTTATACCAAGAAGTTTTATCCGAAAAA
GAAGTGGAAAATACAGATAAAGAAAATATGCCTAATAAGATAGAAACCGATATACAGACAAAAGAAAGCCTTCTTTTAGA
AAAAACTTCTTCCATAATTAGCAGAATTCCTGATGGGATTGAATTTACTCCAGAAATAAGTGAAGTAGTATTGGGTTCCA
ATAATGCACTTGCACGATACTGCACTCATTGTACCCATTGTAGATTTGACATTACGATGCCTTTTACTATAATATTGAAA
GATAAAAAAGATTCCTGTAAAACATTGACTGTGCCTGATTTGCTTCATATTGAGTTAAATAGCTATAACTATAATAATTG
CATTCACAAACAAGATTGTGAATGTGGAGCACTGGTAAATCTTATAAATAAGTGTGGAATGAGAATAGAAAATGAAATAA
AAAATAAACCTATTATTGACAAGCATAACCAGTATATTATATTTGAGATCATTACCAGGATATACGTTAACTTCTGTGAT
CCAGGTATACTTGATCTGGAAGCTGGAATTTATGGATCAGGTTCTATTATTGGTGAAGATAATCCTCTAGTTAAAGATAC
AGAAAAAGATGAATTTAAAAAAAATATCGCTGCCGAAGATTCTACCAATATACACCATGAGCCCTTTTATTGTTATAAAG
ATGATAAATCCGATTATGATGCTAACTCACTTATGAAAAGCTATCCTTCTAACAGACCTTAA
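
A GC content figure like the one listed for this gene can be recomputed from a nucleotide sequence. This is a minimal sketch, assuming GC content is the percentage of G and C among the A, C, G and T bases; the function name `gc_content` is hypothetical, and the sample input is a toy string, not the gene above.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Characters other than A, C, G and T (line breaks, ambiguity codes)
    are ignored.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Toy input: 2 of 4 bases are G or C.
print(gc_content("ATGC"))  # prints 50.0
```

To check the record, paste the full gene sequence (FASTA header removed) into `gc_content`; line breaks are skipped automatically.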

Upstream 100 bases:

>100_bases
GACTTTGAATTTATTTTTAATCTCCTGCACAACGGCTACAGAGTAGAGAATCTATCCGACATCCTATACTATCAATAGAT
AAAAAAGTAGGTGAATAATT

Downstream 100 bases:

>100_bases
TAAAAAATAAGTAAACTATATGAGAAAGTTGACAATGAGCAGTTATTGCACATTGTCAATTTTCAGTTTTAAGATACATC
CCCCGGGGTAATGCCATTAG

Product: group 2 family glycosyl transferase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 873; Mature: 873
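
The residue count is consistent with the gene coordinates: the inclusive span from Start to End gives the gene length in bases, and dividing by three and dropping the terminal stop codon gives the number of amino acids. A minimal sketch of that arithmetic follows; the function names are hypothetical.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given inclusive 1-based coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(n_bases: int) -> int:
    """Number of residues encoded by a coding sequence that ends in a stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return n_bases // 3 - 1  # the stop codon encodes no residue


# Start and End values taken from this record.
n_bases = gene_length(2377267, 2379888)
print(n_bases)                  # prints 2622
print(protein_length(n_bases))  # prints 873
```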

Protein sequence:

>873_residues
MKPEISVIMPVYNCKQYIFESIKSICNQTFENWELIIINDNSIENIEEEIKKIQDDRIHYHAFVEHEGLFNSLEYGLQQA
QGDFITFHDPDDISSPTRFNEQLNYLKSNDDLGMVSCLIRCFTNDTSYRNACTFIEKIQNAYISREQIENAIINKFSPVI
FPTIMMRRSLLDGIEFHKEENELEDYFQIFLYLLKQGRLEKVNSVLYYYRRHKNSYHIQNEKNYSETVQAQLSKSGIQNF
IKYRELYKDLKKEQYIVSRSKKDSPLRILMLIDALNIGGTEMYVLELAKSLEKLGAHVVIGTSGGPLVEVFQHYGLKVVK
IPFTSDYISNKDIMKLIKLTKKIIDEEKINLLHCHLFASMRLGNDIYRSYKIPYIVTLHGLFYPNDVLFESCINATKIIA
VSKPIKKLIESKLGSRIRGEIMVLPNGIDMENFHPQHTVKSSKNQLGIPENSQIITYCSRLDWGKTFAAEAFIFACFTLM
TKNEHLHAFVIGDGADKNLITHEVNILNKILKRDAIHVVGAKFNVLPYYQNADIVVGTARVALEAMSCGKPVIAVGNHGY
TGVINHRCMNEQWNMYFGDHDSIKKADPLVLAKDLDGLLQDTKACKSLGKWGRRWCEEKFDNRLVAKDIFNLYQEVLSEK
EVENTDKENMPNKIETDIQTKESLLLEKTSSIISRIPDGIEFTPEISEVVLGSNNALARYCTHCTHCRFDITMPFTIILK
DKKDSCKTLTVPDLLHIELNSYNYNNCIHKQDCECGALVNLINKCGMRIENEIKNKPIIDKHNQYIIFEIITRIYVNFCD
PGILDLEAGIYGSGSIIGEDNPLVKDTEKDEFKKNIAAEDSTNIHHEPFYCYKDDKSDYDANSLMKSYPSNRP

Sequences: the translated and mature sequences (873 residues each) are identical to the protein sequence above.

Specific function: May be involved in the production of the exopolysaccharide (EPS) component of the extracellular matrix during biofilm formation. EPS is responsible for the adhesion of chains of cells into bundles. Required for biofilm maintenance [H]

COG id: COG0463

COG function: function code M; Glycosyltransferases involved in cell wall biogenesis

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the glycosyltransferase 2 family [H]

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro: IPR001173 [H]

Pfam domain/function: PF00535 Glycos_transf_2 [H]

EC number: NA

Molecular weight (Da): Translated: 100440; Mature: 100440

Theoretical pI: Translated: 6.51; Mature: 6.51

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

2.6 %Cys     (Translated Protein)
2.2 %Met     (Translated Protein)
4.8 %Cys+Met (Translated Protein)
2.6 %Cys     (Mature Protein)
2.2 %Met     (Mature Protein)
4.8 %Cys+Met (Mature Protein)
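
Percentages of this kind can be recomputed from a protein sequence by counting residues. This is a minimal sketch, assuming each percentage is the residue count divided by the sequence length; the function name `residue_percent` is hypothetical, and the sample input is only the first 20 residues of this protein, so its values differ from the whole-protein figures listed above.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Return the percentage of positions in `protein` occupied by any of `residues`."""
    protein = protein.upper()
    if not protein:
        raise ValueError("protein sequence is empty")
    count = sum(1 for aa in protein if aa in residues.upper())
    return 100.0 * count / len(protein)


# First 20 residues of the protein in this record: one Cys and two Met.
sample = "MKPEISVIMPVYNCKQYIFE"
print(residue_percent(sample, "C"))   # prints 5.0
print(residue_percent(sample, "M"))   # prints 10.0
print(residue_percent(sample, "CM"))  # prints 15.0
```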

Secondary structure:

>Translated Secondary Structure
MKPEISVIMPVYNCKQYIFESIKSICNQTFENWELIIINDNSIENIEEEIKKIQDDRIHY
CCCCCEEEEEHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHHCCCEEE
HAFVEHEGLFNSLEYGLQQAQGDFITFHDPDDISSPTRFNEQLNYLKSNDDLGMVSCLIR
EEEEECCHHHHHHHHHHHHHCCCEEEEECCCCCCCCHHHHHHHHHHCCCCCHHHHHHHHH
CFTNDTSYRNACTFIEKIQNAYISREQIENAIINKFSPVIFPTIMMRRSLLDGIEFHKEE
HHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCH
NELEDYFQIFLYLLKQGRLEKVNSVLYYYRRHKNSYHIQNEKNYSETVQAQLSKSGIQNF
HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCEEECCCCCHHHHHHHHHHHHHHHHH
IKYRELYKDLKKEQYIVSRSKKDSPLRILMLIDALNIGGTEMYVLELAKSLEKLGAHVVI
HHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEHHCCCCCHHHHHHHHHHHHHCCCEEEE
GTSGGPLVEVFQHYGLKVVKIPFTSDYISNKDIMKLIKLTKKIIDEEKINLLHCHLFASM
ECCCCHHHHHHHHCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHCHHHHCEEEHHHHHHH
RLGNDIYRSYKIPYIVTLHGLFYPNDVLFESCINATKIIAVSKPIKKLIESKLGSRIRGE
HHCHHHHHHCCCCEEEEEECCCCCHHHHHHHHCCHHEEEEEHHHHHHHHHHHHCCCCCCE
IMVLPNGIDMENFHPQHTVKSSKNQLGIPENSQIITYCSRLDWGKTFAAEAFIFACFTLM
EEEECCCCCCCCCCCCHHHHCCCCCCCCCCCCCHHHHHHHCCCCHHHHHHHHHHHHHHHH
TKNEHLHAFVIGDGADKNLITHEVNILNKILKRDAIHVVGAKFNVLPYYQNADIVVGTAR
CCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHHCCEEEECCEEEEEEEECCCCEEEEHHH
VALEAMSCGKPVIAVGNHGYTGVINHRCMNEQWNMYFGDHDSIKKADPLVLAKDLDGLLQ
HHHHHHHCCCCEEEECCCCCEEEECCCCCCCCCCEEECCCCCCCCCCCEEEHHHHHHHHH
DTKACKSLGKWGRRWCEEKFDNRLVAKDIFNLYQEVLSEKEVENTDKENMPNKIETDIQT
HHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCHHCCCCHHHHHHHH
KESLLLEKTSSIISRIPDGIEFTPEISEVVLGSNNALARYCTHCTHCRFDITMPFTIILK
HHHHHHHHHHHHHHHCCCCCCCCCCHHHHEECCCCHHHHHHHHHHCCEEEEECCEEEEEE
DKKDSCKTLTVPDLLHIELNSYNYNNCIHKQDCECGALVNLINKCGMRIENEIKNKPIID
CCCCCCCEECCCCEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHCCCCHHHHHCCCCCCC
KHNQYIIFEIITRIYVNFCDPGILDLEAGIYGSGSIIGEDNPLVKDTEKDEFKKNIAAED
CCCCEEHHHHHHHHHHHHCCCCCEEECCCCCCCCCEECCCCCCCCCCCHHHHHHHCCCCC
STNIHHEPFYCYKDDKSDYDANSLMKSYPSNRP
CCCCCCCCEEEECCCCCCCCHHHHHHHCCCCCC
>Mature Secondary Structure
Identical to the translated secondary structure above.

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8969506; 9384377 [H]