Definition: Clostridium botulinum A str. ATCC 3502, complete genome.
Accession: NC_009495
Length: 3,886,916 bp

The map label for this gene is yomE [H]

Identifier: 148379214

GI number: 148379214

Start: 1350684

End: 1351703

Strand: Direct

Name: yomE [H]

Synonym: CBO1231

Alternate gene names: 148379214

Gene position: 1350684-1351703 (Clockwise)

Preceding gene: 148379213

Following gene: 148379215

Centisome position: 34.75
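
The centisome position appears to be the gene start expressed as a percentage of the genome length. A minimal sketch of that arithmetic, assuming this definition (the record does not state the formula) and using the Start, End and Length values listed above; the function names are illustrative:

```python
GENOME_LENGTH = 3_886_916  # Length field of NC_009495
GENE_START = 1_350_684     # Start field
GENE_END = 1_351_703       # End field


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of genome length (assumed definition)."""
    return 100.0 * position / genome_length


def gene_length(start: int, end: int) -> int:
    """Length of a gene given 1-based inclusive coordinates."""
    return end - start + 1


print(round(centisome(GENE_START, GENOME_LENGTH), 2))  # 34.75
print(gene_length(GENE_START, GENE_END))               # 1020
```

Under this assumption the result agrees with the reported 34.75, and the coordinate span agrees with the 1020-base gene sequence.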

GC content: 30.69

Gene sequence:

>1020_bases
ATGTCTAAAAAGAGACAAAAGGAAATAAAAAGAAGAAAAAAAAGAAAAAAGAAAAAAAAATCATTTATAGGAAGACTATT
TTTGTTTTTAGTTTATGAAGTTATAGTAGGTGGAATTTTTTCTTTATTGATTGCCTTTTATGGACCTTTTGATAATGTAA
AAAGTACATTAGTGGGAACAGCTATGGCTACATATAAACATCAGTATATTGCCACTACTTTTTTATCTAAGGATGAAATA
AATAAAATTTTAAATAAGGATAAAGGAATAAGTAATTCAAGTTTAAAAGAAAATTATGGCGATATAAAAATAAGAAACAA
ATATGGTAATTCAGTAGAAAGATATGATATAAATACAGCTAAATTTGATGGCTATATATTAGAAATTAAAAATCCCCAAA
AAGTAAAAATAGGATATACAAAGTATATGGGAAAAATGGGTGAAAGAACTAGTAAAATGGCTGAGAGACATGGAGCCGTA
GCTGCTGTAAATGGTGGTGGATTTAGGGATGTATCGTCCACAGGCAAACTTTGGACAGGCACCGGAGCCTATCCAGAAGG
ACTTGTAATATCTAATGGTAAAGTTATTTACAATGATTTTAAGTCTGGACAAAAGGTTAACGTTACAGCATTTACAAAGG
AAGGATTATTAGTTGTAGGAGATCATACGGTAGATGAACTTTTAAAAATGGGAGTAGTAGAAGCTTTGTCTTTTAGAAAT
ACATTAATAATTAATGGAAAGCCTATACCTTATAATGAAGGTATAAATCCTAGAACTGCTATAGGACAAAAACAAGATGG
AACTATAGTCTTATTAGTTATAGATGGGAGAAGAGGGATAAAACAAGGAGCTACTCTAGAAGAGGTAGAAAATATACTGC
TGCAAAGAGGAGTAGTGAATGCTAGTAATTTAGATGGAGGATCCTCTTCAACTATGTATTATAAAGGAAAAGTTATAAAT
AGACCTTGTAATTGGGATGGAGAAAGAACAGTAGCCACCTCCATATATGTAGAACCTTAA
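
GC content is conventionally the percentage of G and C bases in a sequence. A small sketch of that calculation, assuming the GC content field above follows the usual definition; `gc_content` is an illustrative name, and the sample input is only the first 15 bases of the gene sequence, not the full gene:

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = "".join(sequence.split()).upper()  # drop line breaks and whitespace
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return 100.0 * gc / len(seq)


# First 15 bases of the gene sequence: ATG TCT AAA AAG AGA (3 G + 1 C)
print(round(gc_content("ATGTCTAAAAAGAGA"), 2))  # 26.67
```

Applied to the full 1020-base gene sequence, the same function should reproduce the reported value if the record uses this definition.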

Upstream 100 bases:

>100_bases
GTTTTTATATTTGTAACTCATTTGTTACAAATATAATGTATAATTACAATATAATTTATATGTTAATCATAAATAATTTT
TAAGTAAAAAAGGAGTATCT

Downstream 100 bases:

>100_bases
AGGAGAAGACTTACTTATGAAAAAAATTAAAATTATATTTGTCTGGACACTAATTGCTATTGGTTTATCCTTCGCAGCTC
TGCTTTTCGTGGATAAAGTT

Product: membrane protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 339; Mature: 338
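
These residue counts follow from the gene length: 1020 bases are 340 codons, the last of which is the stop codon TAA, leaving 339 translated residues. The mature count of 338 corresponds to the translated sequence without its initial methionine, as the mature sequence in this record shows. A minimal sketch of the translation step, assuming the standard codon assignments (the record does not name a translation table); `translate` is an illustrative helper, and the inputs are only the first 15 and last 6 bases of the gene:

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard genetic code;
# NCBI table 11 uses the same assignments and differs only in start codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTCTAAAAAGAGA"))  # MSKKR, the first five residues
print(translate("CCTTAA"))           # P; the TAA stop codon ends translation

# Residue counts implied by the coordinates in this record
codons = (1351703 - 1350684 + 1) // 3  # 340 codons including the stop
translated = codons - 1                # 339 residues
mature = translated - 1                # 338 residues, initial Met removed
print(codons, translated, mature)
```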

Protein sequence:

>339_residues
MSKKRQKEIKRRKKRKKKKKSFIGRLFLFLVYEVIVGGIFSLLIAFYGPFDNVKSTLVGTAMATYKHQYIATTFLSKDEI
NKILNKDKGISNSSLKENYGDIKIRNKYGNSVERYDINTAKFDGYILEIKNPQKVKIGYTKYMGKMGERTSKMAERHGAV
AAVNGGGFRDVSSTGKLWTGTGAYPEGLVISNGKVIYNDFKSGQKVNVTAFTKEGLLVVGDHTVDELLKMGVVEALSFRN
TLIINGKPIPYNEGINPRTAIGQKQDGTIVLLVIDGRRGIKQGATLEEVENILLQRGVVNASNLDGGSSSTMYYKGKVIN
RPCNWDGERTVATSIYVEP

Sequences:

>Mature_338_residues
SKKRQKEIKRRKKRKKKKKSFIGRLFLFLVYEVIVGGIFSLLIAFYGPFDNVKSTLVGTAMATYKHQYIATTFLSKDEIN
KILNKDKGISNSSLKENYGDIKIRNKYGNSVERYDINTAKFDGYILEIKNPQKVKIGYTKYMGKMGERTSKMAERHGAVA
AVNGGGFRDVSSTGKLWTGTGAYPEGLVISNGKVIYNDFKSGQKVNVTAFTKEGLLVVGDHTVDELLKMGVVEALSFRNT
LIINGKPIPYNEGINPRTAIGQKQDGTIVLLVIDGRRGIKQGATLEEVENILLQRGVVNASNLDGGSSSTMYYKGKVINR
PCNWDGERTVATSIYVEP

Specific function: Unknown

COG id: COG4632

COG function: function code G; Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR018711
- InterPro:   IPR012334
- InterPro:   IPR011050 [H]

Pfam domain/function: PF09992 DUF2233 [H]

EC number: NA

Molecular weight: Translated: 37648; Mature: 37517

Theoretical pI: Translated: 10.38; Mature: 10.38

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.3 %Cys     (Translated Protein)
2.1 %Met     (Translated Protein)
2.4 %Cys+Met (Translated Protein)
0.3 %Cys     (Mature Protein)
1.8 %Met     (Mature Protein)
2.1 %Cys+Met (Mature Protein)
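
These percentages can be checked by counting residues. A rough sketch using the translated sequence from this record; the mature sequence is taken to be the same sequence without its first methionine, and `residue_percent` is an illustrative name:

```python
TRANSLATED = (
    "MSKKRQKEIKRRKKRKKKKKSFIGRLFLFLVYEVIVGGIFSLLIAFYGPFDNVKSTLVGTAMATYKHQYIATTFLSKDEI"
    "NKILNKDKGISNSSLKENYGDIKIRNKYGNSVERYDINTAKFDGYILEIKNPQKVKIGYTKYMGKMGERTSKMAERHGAV"
    "AAVNGGGFRDVSSTGKLWTGTGAYPEGLVISNGKVIYNDFKSGQKVNVTAFTKEGLLVVGDHTVDELLKMGVVEALSFRN"
    "TLIINGKPIPYNEGINPRTAIGQKQDGTIVLLVIDGRRGIKQGATLEEVENILLQRGVVNASNLDGGSSSTMYYKGKVIN"
    "RPCNWDGERTVATSIYVEP"
)
MATURE = TRANSLATED[1:]  # assumption: only the initial Met is removed


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given residue letters."""
    count = sum(protein.count(r) for r in residues)
    return round(100.0 * count / len(protein), 1)


for label, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(
        label,
        residue_percent(seq, "C"),
        residue_percent(seq, "M"),
        residue_percent(seq, "CM"),
    )
```

With the sequence as listed (1 Cys and 7 Met in the translated protein), this gives 0.3, 2.1 and 2.4 for the translated protein and 0.3, 1.8 and 2.1 for the mature protein, matching the values above.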

Secondary structure:

>Translated Secondary Structure
MSKKRQKEIKRRKKRKKKKKSFIGRLFLFLVYEVIVGGIFSLLIAFYGPFDNVKSTLVGT
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH
AMATYKHQYIATTFLSKDEINKILNKDKGISNSSLKENYGDIKIRNKYGNSVERYDINTA
HHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCCHHHHCCCCEEEECCCCCCEEEEECCEE
KFDGYILEIKNPQKVKIGYTKYMGKMGERTSKMAERHGAVAAVNGGGFRDVSSTGKLWTG
EECCEEEEECCCCEEEEEHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCEEEC
TGAYPEGLVISNGKVIYNDFKSGQKVNVTAFTKEGLLVVGDHTVDELLKMGVVEALSFRN
CCCCCCCEEEECCEEEEEECCCCCEEEEEEEECCCEEEECCCCHHHHHHHHHHHHHHCCC
TLIINGKPIPYNEGINPRTAIGQKQDGTIVLLVIDGRRGIKQGATLEEVENILLQRGVVN
EEEECCCCCCCCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHCCCCC
ASNLDGGSSSTMYYKGKVINRPCNWDGERTVATSIYVEP
CCCCCCCCCCEEEEECEEECCCCCCCCCEEEEEEEEECC
>Mature Secondary Structure
SKKRQKEIKRRKKRKKKKKSFIGRLFLFLVYEVIVGGIFSLLIAFYGPFDNVKSTLVGT
CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH
AMATYKHQYIATTFLSKDEINKILNKDKGISNSSLKENYGDIKIRNKYGNSVERYDINTA
HHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCCHHHHCCCCEEEECCCCCCEEEEECCEE
KFDGYILEIKNPQKVKIGYTKYMGKMGERTSKMAERHGAVAAVNGGGFRDVSSTGKLWTG
EECCEEEEECCCCEEEEEHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCEEEC
TGAYPEGLVISNGKVIYNDFKSGQKVNVTAFTKEGLLVVGDHTVDELLKMGVVEALSFRN
CCCCCCCEEEECCEEEEEECCCCCEEEEEEEECCCEEEECCCCHHHHHHHHHHHHHHCCC
TLIINGKPIPYNEGINPRTAIGQKQDGTIVLLVIDGRRGIKQGATLEEVENILLQRGVVN
EEEECCCCCCCCCCCCCHHHCCCCCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHCCCCC
ASNLDGGSSSTMYYKGKVINRPCNWDGERTVATSIYVEP
CCCCCCCCCCEEEEECEEECCCCCCCCCEEEEEEEEECC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 9384377 [H]