| Definition | Clostridium botulinum A2 str. Kyoto chromosome, complete genome. |
|---|---|
| Accession | NC_012563 |
| Length | 4,155,278 |
The map label for this gene is epsE [H]
Identifier: 226949381
GI number: 226949381
Start: 2377267
End: 2379888
Strand: Direct
Name: epsE [H]
Synonym: CLM_2306
Alternate gene names: 226949381
Gene position: 2377267-2379888 (Clockwise)
Preceding gene: 226949380
Following gene: 226949382
Centisome position: 57.21
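The coordinate fields above can be cross-checked against one another. The sketch below is in Python and assumes that the centisome position is the start coordinate expressed as a percentage of the chromosome length; that formula is inferred from the reported values, not stated by the record, and the function names are illustrative.

```python
# Values copied from the record above.
GENOME_LENGTH = 4_155_278   # Length
GENE_START = 2_377_267      # Start
GENE_END = 2_379_888        # End


def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the chromosome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(GENE_START, GENE_END))                # 2622
print(round(centisome(GENE_START, GENOME_LENGTH), 2))   # 57.21
```

Both results agree with the record: the gene sequence is listed as 2622 bases and the centisome position as 57.21.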
GC content: 31.31
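The GC content is presumably the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows; the function name is hypothetical, and whitespace is stripped because the sequence in this record is broken into space-separated blocks.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Toy input, not the full gene.
print(gc_content("ATGC"))  # 50.0
```

Applied to the full 2622-base sequence listed under "Gene sequence", the result can be compared with the reported value of 31.31.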
Gene sequence:
>2622_bases ATGAAACCAGAGATTAGTGTTATTATGCCAGTATACAATTGTAAACAGTATATCTTTGAATCCATTAAAAGCATATGTAA TCAAACTTTTGAAAACTGGGAACTTATTATTATTAATGATAATTCTATAGAAAATATTGAGGAAGAAATAAAAAAAATAC AGGATGACAGAATTCATTATCATGCTTTTGTAGAACATGAAGGCTTATTTAATTCATTAGAATATGGGTTGCAGCAAGCA CAGGGAGATTTTATCACATTTCATGATCCAGATGATATCAGTTCTCCTACCAGGTTTAATGAACAGCTTAATTACCTTAA ATCCAATGATGATTTAGGGATGGTTTCATGCCTTATCAGATGTTTTACTAACGATACTAGTTATCGTAATGCCTGTACTT TTATAGAGAAAATTCAAAATGCTTATATATCCAGAGAACAAATTGAAAATGCAATTATCAATAAATTTTCTCCAGTTATA TTTCCTACAATAATGATGCGTAGAAGTCTATTAGATGGAATTGAATTCCATAAAGAAGAAAATGAACTAGAAGACTACTT TCAGATATTTTTATATTTGCTTAAACAAGGTAGATTAGAAAAGGTCAATAGCGTTCTTTATTATTATAGAAGGCATAAAA ATTCTTATCATATTCAAAATGAAAAAAATTATTCTGAAACAGTACAAGCTCAGCTAAGTAAAAGCGGAATACAAAATTTT ATAAAATATAGGGAACTCTACAAAGATTTAAAAAAAGAGCAGTATATAGTCAGTAGATCGAAGAAAGACAGCCCCTTAAG AATATTGATGCTTATAGATGCCCTAAATATTGGTGGAACAGAAATGTATGTATTAGAGCTTGCAAAATCGCTGGAAAAGT TAGGGGCACATGTAGTTATTGGAACCTCTGGGGGCCCACTAGTAGAAGTATTCCAACATTATGGATTAAAAGTTGTAAAA ATTCCTTTTACTAGCGACTATATTTCTAATAAAGACATTATGAAGCTAATTAAATTAACAAAGAAAATAATAGATGAAGA AAAAATCAATTTGCTTCACTGCCACCTTTTTGCAAGTATGCGTTTAGGAAATGATATTTATAGGAGCTACAAGATCCCAT ATATAGTCACGTTACATGGTTTATTTTATCCTAATGACGTACTATTTGAGTCCTGTATCAATGCAACTAAAATTATTGCA GTAAGCAAGCCAATTAAAAAACTAATTGAATCAAAATTAGGCTCAAGAATAAGAGGAGAAATTATGGTGCTTCCAAATGG AATTGATATGGAAAATTTTCATCCTCAGCATACAGTAAAAAGTTCAAAAAATCAGCTGGGTATACCTGAAAACTCTCAAA TTATAACTTACTGTAGTCGTCTGGATTGGGGAAAAACTTTTGCTGCCGAAGCTTTTATTTTTGCATGTTTTACTTTAATG ACTAAGAATGAACATCTACATGCTTTTGTTATTGGAGATGGAGCTGATAAAAATTTAATCACGCATGAAGTAAACATTCT GAATAAAATATTAAAAAGAGATGCAATTCATGTAGTAGGTGCGAAATTTAACGTACTGCCTTATTATCAGAATGCAGATA TTGTAGTTGGTACTGCAAGAGTGGCACTTGAGGCCATGAGTTGTGGTAAGCCAGTTATTGCCGTAGGGAATCATGGATAT ACCGGAGTCATTAATCATAGATGTATGAACGAGCAATGGAATATGTATTTTGGTGATCATGATTCAATAAAAAAAGCAGA TCCCTTAGTACTTGCAAAAGATTTAGATGGATTGCTACAAGATACCAAAGCATGTAAATCCTTGGGAAAATGGGGTAGAC GCTGGTGTGAAGAAAAATTTGATAATCGACTTGTAGCAAAAGATATTTTTAACTTATACCAAGAAGTTTTATCCGAAAAA 
GAAGTGGAAAATACAGATAAAGAAAATATGCCTAATAAGATAGAAACCGATATACAGACAAAAGAAAGCCTTCTTTTAGA AAAAACTTCTTCCATAATTAGCAGAATTCCTGATGGGATTGAATTTACTCCAGAAATAAGTGAAGTAGTATTGGGTTCCA ATAATGCACTTGCACGATACTGCACTCATTGTACCCATTGTAGATTTGACATTACGATGCCTTTTACTATAATATTGAAA GATAAAAAAGATTCCTGTAAAACATTGACTGTGCCTGATTTGCTTCATATTGAGTTAAATAGCTATAACTATAATAATTG CATTCACAAACAAGATTGTGAATGTGGAGCACTGGTAAATCTTATAAATAAGTGTGGAATGAGAATAGAAAATGAAATAA AAAATAAACCTATTATTGACAAGCATAACCAGTATATTATATTTGAGATCATTACCAGGATATACGTTAACTTCTGTGAT CCAGGTATACTTGATCTGGAAGCTGGAATTTATGGATCAGGTTCTATTATTGGTGAAGATAATCCTCTAGTTAAAGATAC AGAAAAAGATGAATTTAAAAAAAATATCGCTGCCGAAGATTCTACCAATATACACCATGAGCCCTTTTATTGTTATAAAG ATGATAAATCCGATTATGATGCTAACTCACTTATGAAAAGCTATCCTTCTAACAGACCTTAA
Upstream 100 bases:
>100_bases GACTTTGAATTTATTTTTAATCTCCTGCACAACGGCTACAGAGTAGAGAATCTATCCGACATCCTATACTATCAATAGAT AAAAAAGTAGGTGAATAATT
Downstream 100 bases:
>100_bases TAAAAAATAAGTAAACTATATGAGAAAGTTGACAATGAGCAGTTATTGCACATTGTCAATTTTCAGTTTTAAGATACATC CCCCGGGGTAATGCCATTAG
Product: group 2 family glycosyl transferase
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 873; Mature: 873
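The residue count follows from the gene length: 2622 bases are 874 codons, and the last one is a stop codon. The Python sketch below translates DNA using the standard codon assignments, written in the compact TCAG ordering. It is an illustration only and makes assumptions the record does not state: translation begins at the first base and stops at the first in-frame stop codon.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGAAACCAGAGATTAGTGTT"))  # MKPEISV (first 7 codons of the gene)
print(translate("AACAGACCTTAA"))           # NRP (last 4 codons, ending in TAA)
print(2622 // 3 - 1)                       # 873
```

The two fragments are the opening and closing codons of the gene sequence above, and their translations match the start (MKPEISV) and end (NRP) of the listed protein sequence.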
Protein sequence:
>873_residues MKPEISVIMPVYNCKQYIFESIKSICNQTFENWELIIINDNSIENIEEEIKKIQDDRIHYHAFVEHEGLFNSLEYGLQQA QGDFITFHDPDDISSPTRFNEQLNYLKSNDDLGMVSCLIRCFTNDTSYRNACTFIEKIQNAYISREQIENAIINKFSPVI FPTIMMRRSLLDGIEFHKEENELEDYFQIFLYLLKQGRLEKVNSVLYYYRRHKNSYHIQNEKNYSETVQAQLSKSGIQNF IKYRELYKDLKKEQYIVSRSKKDSPLRILMLIDALNIGGTEMYVLELAKSLEKLGAHVVIGTSGGPLVEVFQHYGLKVVK IPFTSDYISNKDIMKLIKLTKKIIDEEKINLLHCHLFASMRLGNDIYRSYKIPYIVTLHGLFYPNDVLFESCINATKIIA VSKPIKKLIESKLGSRIRGEIMVLPNGIDMENFHPQHTVKSSKNQLGIPENSQIITYCSRLDWGKTFAAEAFIFACFTLM TKNEHLHAFVIGDGADKNLITHEVNILNKILKRDAIHVVGAKFNVLPYYQNADIVVGTARVALEAMSCGKPVIAVGNHGY TGVINHRCMNEQWNMYFGDHDSIKKADPLVLAKDLDGLLQDTKACKSLGKWGRRWCEEKFDNRLVAKDIFNLYQEVLSEK EVENTDKENMPNKIETDIQTKESLLLEKTSSIISRIPDGIEFTPEISEVVLGSNNALARYCTHCTHCRFDITMPFTIILK DKKDSCKTLTVPDLLHIELNSYNYNNCIHKQDCECGALVNLINKCGMRIENEIKNKPIIDKHNQYIIFEIITRIYVNFCD PGILDLEAGIYGSGSIIGEDNPLVKDTEKDEFKKNIAAEDSTNIHHEPFYCYKDDKSDYDANSLMKSYPSNRP
Sequences:
>Translated_873_residues MKPEISVIMPVYNCKQYIFESIKSICNQTFENWELIIINDNSIENIEEEIKKIQDDRIHYHAFVEHEGLFNSLEYGLQQA QGDFITFHDPDDISSPTRFNEQLNYLKSNDDLGMVSCLIRCFTNDTSYRNACTFIEKIQNAYISREQIENAIINKFSPVI FPTIMMRRSLLDGIEFHKEENELEDYFQIFLYLLKQGRLEKVNSVLYYYRRHKNSYHIQNEKNYSETVQAQLSKSGIQNF IKYRELYKDLKKEQYIVSRSKKDSPLRILMLIDALNIGGTEMYVLELAKSLEKLGAHVVIGTSGGPLVEVFQHYGLKVVK IPFTSDYISNKDIMKLIKLTKKIIDEEKINLLHCHLFASMRLGNDIYRSYKIPYIVTLHGLFYPNDVLFESCINATKIIA VSKPIKKLIESKLGSRIRGEIMVLPNGIDMENFHPQHTVKSSKNQLGIPENSQIITYCSRLDWGKTFAAEAFIFACFTLM TKNEHLHAFVIGDGADKNLITHEVNILNKILKRDAIHVVGAKFNVLPYYQNADIVVGTARVALEAMSCGKPVIAVGNHGY TGVINHRCMNEQWNMYFGDHDSIKKADPLVLAKDLDGLLQDTKACKSLGKWGRRWCEEKFDNRLVAKDIFNLYQEVLSEK EVENTDKENMPNKIETDIQTKESLLLEKTSSIISRIPDGIEFTPEISEVVLGSNNALARYCTHCTHCRFDITMPFTIILK DKKDSCKTLTVPDLLHIELNSYNYNNCIHKQDCECGALVNLINKCGMRIENEIKNKPIIDKHNQYIIFEIITRIYVNFCD PGILDLEAGIYGSGSIIGEDNPLVKDTEKDEFKKNIAAEDSTNIHHEPFYCYKDDKSDYDANSLMKSYPSNRP >Mature_873_residues MKPEISVIMPVYNCKQYIFESIKSICNQTFENWELIIINDNSIENIEEEIKKIQDDRIHYHAFVEHEGLFNSLEYGLQQA QGDFITFHDPDDISSPTRFNEQLNYLKSNDDLGMVSCLIRCFTNDTSYRNACTFIEKIQNAYISREQIENAIINKFSPVI FPTIMMRRSLLDGIEFHKEENELEDYFQIFLYLLKQGRLEKVNSVLYYYRRHKNSYHIQNEKNYSETVQAQLSKSGIQNF IKYRELYKDLKKEQYIVSRSKKDSPLRILMLIDALNIGGTEMYVLELAKSLEKLGAHVVIGTSGGPLVEVFQHYGLKVVK IPFTSDYISNKDIMKLIKLTKKIIDEEKINLLHCHLFASMRLGNDIYRSYKIPYIVTLHGLFYPNDVLFESCINATKIIA VSKPIKKLIESKLGSRIRGEIMVLPNGIDMENFHPQHTVKSSKNQLGIPENSQIITYCSRLDWGKTFAAEAFIFACFTLM TKNEHLHAFVIGDGADKNLITHEVNILNKILKRDAIHVVGAKFNVLPYYQNADIVVGTARVALEAMSCGKPVIAVGNHGY TGVINHRCMNEQWNMYFGDHDSIKKADPLVLAKDLDGLLQDTKACKSLGKWGRRWCEEKFDNRLVAKDIFNLYQEVLSEK EVENTDKENMPNKIETDIQTKESLLLEKTSSIISRIPDGIEFTPEISEVVLGSNNALARYCTHCTHCRFDITMPFTIILK DKKDSCKTLTVPDLLHIELNSYNYNNCIHKQDCECGALVNLINKCGMRIENEIKNKPIIDKHNQYIIFEIITRIYVNFCD PGILDLEAGIYGSGSIIGEDNPLVKDTEKDEFKKNIAAEDSTNIHHEPFYCYKDDKSDYDANSLMKSYPSNRP
Specific function: May be involved in the production of the exopolysaccharide (EPS) component of the extracellular matrix during biofilm formation. EPS is responsible for the adhesion of chains of cells into bundles. Required for biofilm maintenance [H]
COG id: COG0463
COG function: function code M; Glycosyltransferases involved in cell wall biogenesis
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the glycosyltransferase 2 family [H]
Homologues:
None
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001173 [H]
Pfam domain/function: PF00535 Glycos_transf_2 [H]
EC number: NA
Molecular weight: Translated: 100440; Mature: 100440
Theoretical pI: Translated: 6.51; Mature: 6.51
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
Translated protein: 2.6% Cys, 2.2% Met, 4.8% Cys+Met
Mature protein: 2.6% Cys, 2.2% Met, 4.8% Cys+Met
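These percentages are presumably residue counts divided by the protein length. A small sketch of that calculation is given below; the function name is hypothetical, and rounding to one decimal place is an assumption chosen to match the precision of the reported figures.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    count = sum(1 for aa in seq if aa in residues)
    return round(100.0 * count / len(seq), 1)


# First 20 residues of the protein sequence above, used as a short example.
fragment = "MKPEISVIMPVYNCKQYIFE"
print(residue_percent(fragment, "C"))   # 5.0
print(residue_percent(fragment, "M"))   # 10.0
print(residue_percent(fragment, "CM"))  # 15.0
```

Run on the full 873-residue sequence, the same function gives values that can be checked against the 2.6, 2.2 and 4.8 reported here.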
Secondary structure:
>Translated Secondary Structure MKPEISVIMPVYNCKQYIFESIKSICNQTFENWELIIINDNSIENIEEEIKKIQDDRIHY CCCCCEEEEEHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHHCCCEEE HAFVEHEGLFNSLEYGLQQAQGDFITFHDPDDISSPTRFNEQLNYLKSNDDLGMVSCLIR EEEEECCHHHHHHHHHHHHHCCCEEEEECCCCCCCCHHHHHHHHHHCCCCCHHHHHHHHH CFTNDTSYRNACTFIEKIQNAYISREQIENAIINKFSPVIFPTIMMRRSLLDGIEFHKEE HHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCH NELEDYFQIFLYLLKQGRLEKVNSVLYYYRRHKNSYHIQNEKNYSETVQAQLSKSGIQNF HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCEEECCCCCHHHHHHHHHHHHHHHHH IKYRELYKDLKKEQYIVSRSKKDSPLRILMLIDALNIGGTEMYVLELAKSLEKLGAHVVI HHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEHHCCCCCHHHHHHHHHHHHHCCCEEEE GTSGGPLVEVFQHYGLKVVKIPFTSDYISNKDIMKLIKLTKKIIDEEKINLLHCHLFASM ECCCCHHHHHHHHCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHCHHHHCEEEHHHHHHH RLGNDIYRSYKIPYIVTLHGLFYPNDVLFESCINATKIIAVSKPIKKLIESKLGSRIRGE HHCHHHHHHCCCCEEEEEECCCCCHHHHHHHHCCHHEEEEEHHHHHHHHHHHHCCCCCCE IMVLPNGIDMENFHPQHTVKSSKNQLGIPENSQIITYCSRLDWGKTFAAEAFIFACFTLM EEEECCCCCCCCCCCCHHHHCCCCCCCCCCCCCHHHHHHHCCCCHHHHHHHHHHHHHHHH TKNEHLHAFVIGDGADKNLITHEVNILNKILKRDAIHVVGAKFNVLPYYQNADIVVGTAR CCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHHCCEEEECCEEEEEEEECCCCEEEEHHH VALEAMSCGKPVIAVGNHGYTGVINHRCMNEQWNMYFGDHDSIKKADPLVLAKDLDGLLQ HHHHHHHCCCCEEEECCCCCEEEECCCCCCCCCCEEECCCCCCCCCCCEEEHHHHHHHHH DTKACKSLGKWGRRWCEEKFDNRLVAKDIFNLYQEVLSEKEVENTDKENMPNKIETDIQT HHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCHHCCCCHHHHHHHH KESLLLEKTSSIISRIPDGIEFTPEISEVVLGSNNALARYCTHCTHCRFDITMPFTIILK HHHHHHHHHHHHHHHCCCCCCCCCCHHHHEECCCCHHHHHHHHHHCCEEEEECCEEEEEE DKKDSCKTLTVPDLLHIELNSYNYNNCIHKQDCECGALVNLINKCGMRIENEIKNKPIID CCCCCCCEECCCCEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHCCCCHHHHHCCCCCCC KHNQYIIFEIITRIYVNFCDPGILDLEAGIYGSGSIIGEDNPLVKDTEKDEFKKNIAAED CCCCEEHHHHHHHHHHHHCCCCCEEECCCCCCCCCEECCCCCCCCCCCHHHHHHHCCCCC STNIHHEPFYCYKDDKSDYDANSLMKSYPSNRP CCCCCCCCEEEECCCCCCCCHHHHHHHCCCCCC >Mature Secondary Structure MKPEISVIMPVYNCKQYIFESIKSICNQTFENWELIIINDNSIENIEEEIKKIQDDRIHY CCCCCEEEEEHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHHCCCEEE 
HAFVEHEGLFNSLEYGLQQAQGDFITFHDPDDISSPTRFNEQLNYLKSNDDLGMVSCLIR EEEEECCHHHHHHHHHHHHHCCCEEEEECCCCCCCCHHHHHHHHHHCCCCCHHHHHHHHH CFTNDTSYRNACTFIEKIQNAYISREQIENAIINKFSPVIFPTIMMRRSLLDGIEFHKEE HHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCH NELEDYFQIFLYLLKQGRLEKVNSVLYYYRRHKNSYHIQNEKNYSETVQAQLSKSGIQNF HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCEEECCCCCHHHHHHHHHHHHHHHHH IKYRELYKDLKKEQYIVSRSKKDSPLRILMLIDALNIGGTEMYVLELAKSLEKLGAHVVI HHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEHHCCCCCHHHHHHHHHHHHHCCCEEEE GTSGGPLVEVFQHYGLKVVKIPFTSDYISNKDIMKLIKLTKKIIDEEKINLLHCHLFASM ECCCCHHHHHHHHCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHCHHHHCEEEHHHHHHH RLGNDIYRSYKIPYIVTLHGLFYPNDVLFESCINATKIIAVSKPIKKLIESKLGSRIRGE HHCHHHHHHCCCCEEEEEECCCCCHHHHHHHHCCHHEEEEEHHHHHHHHHHHHCCCCCCE IMVLPNGIDMENFHPQHTVKSSKNQLGIPENSQIITYCSRLDWGKTFAAEAFIFACFTLM EEEECCCCCCCCCCCCHHHHCCCCCCCCCCCCCHHHHHHHCCCCHHHHHHHHHHHHHHHH TKNEHLHAFVIGDGADKNLITHEVNILNKILKRDAIHVVGAKFNVLPYYQNADIVVGTAR CCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHHCCEEEECCEEEEEEEECCCCEEEEHHH VALEAMSCGKPVIAVGNHGYTGVINHRCMNEQWNMYFGDHDSIKKADPLVLAKDLDGLLQ HHHHHHHCCCCEEEECCCCCEEEECCCCCCCCCCEEECCCCCCCCCCCEEEHHHHHHHHH DTKACKSLGKWGRRWCEEKFDNRLVAKDIFNLYQEVLSEKEVENTDKENMPNKIETDIQT HHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCHHCCCCHHHHHHHH KESLLLEKTSSIISRIPDGIEFTPEISEVVLGSNNALARYCTHCTHCRFDITMPFTIILK HHHHHHHHHHHHHHHCCCCCCCCCCHHHHEECCCCHHHHHHHHHHCCEEEEECCEEEEEE DKKDSCKTLTVPDLLHIELNSYNYNNCIHKQDCECGALVNLINKCGMRIENEIKNKPIID CCCCCCCEECCCCEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHCCCCHHHHHCCCCCCC KHNQYIIFEIITRIYVNFCDPGILDLEAGIYGSGSIIGEDNPLVKDTEKDEFKKNIAAED CCCCEEHHHHHHHHHHHHCCCCCEEECCCCCCCCCEECCCCCCCCCCCHHHHHHHCCCCC STNIHHEPFYCYKDDKSDYDANSLMKSYPSNRP CCCCCCCCEEEECCCCCCCCHHHHHHHCCCCCC
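The prediction above is laid out as alternating blocks: a stretch of residues followed by a stretch of per-residue states of the same length. The sketch below splits such interleaved text and summarises the state composition. It assumes the usual three-state convention (H for helix, E for strand, C for coil), which the record itself does not define, and the function names are illustrative.

```python
from collections import Counter


def split_interleaved(text: str) -> tuple[str, str]:
    """Separate alternating residue and state blocks into two strings."""
    tokens = text.split()
    if len(tokens) % 2 != 0:
        raise ValueError("expected an even number of blocks")
    residues = "".join(tokens[0::2])
    states = "".join(tokens[1::2])
    if len(residues) != len(states):
        raise ValueError("residue and state strings differ in length")
    return residues, states


def state_fractions(states: str) -> dict[str, float]:
    """Percentage of positions assigned to each of the states H, E and C."""
    counts = Counter(states)
    return {s: round(100.0 * counts[s] / len(states), 1) for s in "HEC"}


# Toy input in the same layout, not the full prediction.
residues, states = split_interleaved("MKPEIS CCCCCE VIMP EEEE")
print(residues)                 # MKPEISVIMP
print(state_fractions(states))  # {'H': 0.0, 'E': 50.0, 'C': 50.0}
```

The header token (for example ">Translated Secondary Structure") would need to be removed before passing the text to `split_interleaved`.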
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 8969506; 9384377 [H]