Definition Clostridium botulinum A str. Hall, complete genome.
Accession NC_009698
Length 3,760,560

Click here to switch to the map view.

The map label for this gene is ymxG [H]

Identifier: 153937094

GI number: 153937094

Start: 2385468

End: 2386769

Strand: Reverse

Name: ymxG [H]

Synonym: CLC_2258

Alternate gene names: 153937094

Gene position: 2386769-2385468 (Counterclockwise)

Preceding gene: 153934955

Following gene: 153936559

Centisome position: 63.47

GC content: 26.57

Gene sequence:

>1302_bases
GTGTATAATTTATTTACATTAGATAATGGACTAAGAGTGGTTCTAGAGAATATAGATTATGTGAAATCTGTAAGTGTAGG
ACTTTGGATAGAAAATGGTTCAAGAAATGAGAATTTAAAAAATAATGGTATTTCTCATTTTATAGAGCATATGATGTTTA
AAGGCACAGAAAATAGAAGTGCACTACAAATAGCAGAATGTATAGAAGATGTAGGTGGACAAATAAATGCATTTACAGGT
AAAGAAGCCACTTGTTATTATATAAAAATACTAAATTCTCATATAGAATTGGCTTTAGAAGTTTTATCTGATATGTTATT
TAATAGTAAATTTAAAGAAGAGGACATAGAAAAAGAAAAGGGAGTAATAATTGAAGAAATAAGTATGACTGAAGATTCTC
CGGAGGATGTACTATCAGATTTACATTGCAAGGCTATATGGGGAGATGATTCTATTTCCTACCCTATTTTAGGAACAGTA
GAAACTGTAAAATCCTTTAAAAGAAAAGATATAGTAGATTACATAAATAAATATTATATTCCAGAAAATTCTGTTATATC
TATATGTGGTAATTTTGATATAAATGAATTAGAAAAATTAATAAATAAATATTTTGGTAATTGGAATAGCGGTGAAAATA
AAAACATAACTGTTTATTCCAAACCTAAAATAGAAAATAACCACTTATTTAAAAATAAAAATATAGAACAACTTCATATA
AGTTTAGGTTTTGAGGGATTAGAATTAGGAAATGATGATGCGTATCCTCTTATATTACTTAGTAATGTGTTGGGTGGAGG
AGCTTCATCTATACTATTTCAAAAGATAAGAGAAGAAAAAGGATTATGCTATAGTATATATTCTTATATGTCTTCCTTTA
ATAAAACAGGTGCGGTAAGCATTTATACAGGATTAAATCCAGCCTATACAGAAGACACTATAACCTTAATAAAAAAGGTA
GTAAATGATTTTTCAAAGGAAGGTATAAATAAAGAAAAATTAATAAAATCAAAAGAGCAGTTAAAGGGAAGTTATATATT
AGGCTTAGAAAGCACTAGTACTAGAATGTTTAATAATGGTAAGTCTGTGCTATTTCTAAATAGAATAAATGATCCAGAAA
TAATAATGAAAAAAATAGATAAAATAACTGAAGATAAATTACAGGAAATAATGGATAGAACCTTTGGGGCTGGAATAAAA
AATTCAGCCTTTGTAGGAGAAAAATTAAATTTAGAAAATGTAAAAAATATTCTAGATAGGAACCAAAGAGCTTTCAAAGA
AGCTAAATCAAAATTAATATAA

Upstream 100 bases:

>100_bases
ACAGAGTGCTGATTTGGTGATAATCGCTTTCATAGAGTGACATGGGAGTATTAGTAATGGTAACAATCGCAGAAATACAG
GGTTTAGGAGGAATTAATTA

Downstream 100 bases:

>100_bases
TATATTTTGTAATAAGCTCTCACTATTTTTCATAATATTGAATATGAATATTATGAGAAGTAATGGGAGGATTATTATGG
AAGAAAATATAAAACGCTAT

Product: M16 family peptidase

Products: NA

Alternate protein names: ORFP [H]

Number of amino acids: Translated: 433; Mature: 433

Protein sequence:

>433_residues
MYNLFTLDNGLRVVLENIDYVKSVSVGLWIENGSRNENLKNNGISHFIEHMMFKGTENRSALQIAECIEDVGGQINAFTG
KEATCYYIKILNSHIELALEVLSDMLFNSKFKEEDIEKEKGVIIEEISMTEDSPEDVLSDLHCKAIWGDDSISYPILGTV
ETVKSFKRKDIVDYINKYYIPENSVISICGNFDINELEKLINKYFGNWNSGENKNITVYSKPKIENNHLFKNKNIEQLHI
SLGFEGLELGNDDAYPLILLSNVLGGGASSILFQKIREEKGLCYSIYSYMSSFNKTGAVSIYTGLNPAYTEDTITLIKKV
VNDFSKEGINKEKLIKSKEQLKGSYILGLESTSTRMFNNGKSVLFLNRINDPEIIMKKIDKITEDKLQEIMDRTFGAGIK
NSAFVGEKLNLENVKNILDRNQRAFKEAKSKLI

Sequences:

>Translated_433_residues
MYNLFTLDNGLRVVLENIDYVKSVSVGLWIENGSRNENLKNNGISHFIEHMMFKGTENRSALQIAECIEDVGGQINAFTG
KEATCYYIKILNSHIELALEVLSDMLFNSKFKEEDIEKEKGVIIEEISMTEDSPEDVLSDLHCKAIWGDDSISYPILGTV
ETVKSFKRKDIVDYINKYYIPENSVISICGNFDINELEKLINKYFGNWNSGENKNITVYSKPKIENNHLFKNKNIEQLHI
SLGFEGLELGNDDAYPLILLSNVLGGGASSILFQKIREEKGLCYSIYSYMSSFNKTGAVSIYTGLNPAYTEDTITLIKKV
VNDFSKEGINKEKLIKSKEQLKGSYILGLESTSTRMFNNGKSVLFLNRINDPEIIMKKIDKITEDKLQEIMDRTFGAGIK
NSAFVGEKLNLENVKNILDRNQRAFKEAKSKLI
>Mature_433_residues
MYNLFTLDNGLRVVLENIDYVKSVSVGLWIENGSRNENLKNNGISHFIEHMMFKGTENRSALQIAECIEDVGGQINAFTG
KEATCYYIKILNSHIELALEVLSDMLFNSKFKEEDIEKEKGVIIEEISMTEDSPEDVLSDLHCKAIWGDDSISYPILGTV
ETVKSFKRKDIVDYINKYYIPENSVISICGNFDINELEKLINKYFGNWNSGENKNITVYSKPKIENNHLFKNKNIEQLHI
SLGFEGLELGNDDAYPLILLSNVLGGGASSILFQKIREEKGLCYSIYSYMSSFNKTGAVSIYTGLNPAYTEDTITLIKKV
VNDFSKEGINKEKLIKSKEQLKGSYILGLESTSTRMFNNGKSVLFLNRINDPEIIMKKIDKITEDKLQEIMDRTFGAGIK
NSAFVGEKLNLENVKNILDRNQRAFKEAKSKLI

Specific function: Unknown

COG id: COG0612

COG function: function code R; Predicted Zn-dependent peptidases

Gene ontology:

Cell location: Cytoplasm [C]

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the peptidase M16 family [H]

Homologues:

Organism=Homo sapiens, GI94538354, Length=423, Percent_Identity=30.0236406619385, Blast_Score=189, Evalue=5e-48,
Organism=Homo sapiens, GI46593007, Length=374, Percent_Identity=27.2727272727273, Blast_Score=146, Evalue=5e-35,
Organism=Homo sapiens, GI24308013, Length=462, Percent_Identity=22.7272727272727, Blast_Score=107, Evalue=3e-23,
Organism=Homo sapiens, GI50592988, Length=387, Percent_Identity=19.3798449612403, Blast_Score=80, Evalue=4e-15,
Organism=Homo sapiens, GI155969707, Length=248, Percent_Identity=24.1935483870968, Blast_Score=68, Evalue=2e-11,
Organism=Escherichia coli, GI1787770, Length=244, Percent_Identity=25, Blast_Score=80, Evalue=3e-16,
Organism=Escherichia coli, GI2367164, Length=225, Percent_Identity=26.2222222222222, Blast_Score=64, Evalue=2e-11,
Organism=Caenorhabditis elegans, GI71999683, Length=417, Percent_Identity=28.2973621103118, Blast_Score=182, Evalue=3e-46,
Organism=Caenorhabditis elegans, GI17553678, Length=414, Percent_Identity=24.8792270531401, Blast_Score=156, Evalue=2e-38,
Organism=Caenorhabditis elegans, GI17510601, Length=428, Percent_Identity=20.7943925233645, Blast_Score=85, Evalue=7e-17,
Organism=Saccharomyces cerevisiae, GI6323192, Length=438, Percent_Identity=28.5388127853881, Blast_Score=177, Evalue=4e-45,
Organism=Saccharomyces cerevisiae, GI6321813, Length=445, Percent_Identity=25.6179775280899, Blast_Score=143, Evalue=4e-35,
Organism=Saccharomyces cerevisiae, GI6319426, Length=273, Percent_Identity=24.1758241758242, Blast_Score=64, Evalue=3e-11,
Organism=Drosophila melanogaster, GI21357875, Length=424, Percent_Identity=28.5377358490566, Blast_Score=180, Evalue=2e-45,
Organism=Drosophila melanogaster, GI24646943, Length=424, Percent_Identity=28.5377358490566, Blast_Score=180, Evalue=2e-45,
Organism=Drosophila melanogaster, GI19921772, Length=446, Percent_Identity=23.7668161434978, Blast_Score=119, Evalue=5e-27,
Organism=Drosophila melanogaster, GI24641429, Length=205, Percent_Identity=25.3658536585366, Blast_Score=65, Evalue=9e-11,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR011249
- InterPro:   IPR011237
- InterPro:   IPR011765
- InterPro:   IPR001431
- InterPro:   IPR007863 [H]

Pfam domain/function: PF00675 Peptidase_M16; PF05193 Peptidase_M16_C [H]

EC number: 3.4.99.- [C]

Molecular weight: Translated: 49091; Mature: 49091

Theoretical pI: Translated: 5.29; Mature: 5.29

Prosite motif: PS00143 INSULINASE

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.2 %Cys     (Translated Protein)
2.1 %Met     (Translated Protein)
3.2 %Cys+Met (Translated Protein)
1.2 %Cys     (Mature Protein)
2.1 %Met     (Mature Protein)
3.2 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MYNLFTLDNGLRVVLENIDYVKSVSVGLWIENGSRNENLKNNGISHFIEHMMFKGTENRS
CCEEEEECCCHHHHHHHHHHHHHCEEEEEEECCCCCCCCCCCCHHHHHHHHHHCCCCCCH
ALQIAECIEDVGGQINAFTGKEATCYYIKILNSHIELALEVLSDMLFNSKFKEEDIEKEK
HHHHHHHHHHCCCCEEEECCCCCEEEEEHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHC
GVIIEEISMTEDSPEDVLSDLHCKAIWGDDSISYPILGTVETVKSFKRKDIVDYINKYYI
CEEEEEECCCCCCHHHHHHCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCC
PENSVISICGNFDINELEKLINKYFGNWNSGENKNITVYSKPKIENNHLFKNKNIEQLHI
CCCCEEEEECCCCHHHHHHHHHHHCCCCCCCCCCCEEEEECCCCCCCCEECCCCCEEEEE
SLGFEGLELGNDDAYPLILLSNVLGGGASSILFQKIREEKGLCYSIYSYMSSFNKTGAVS
EECCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCCEE
IYTGLNPAYTEDTITLIKKVVNDFSKEGINKEKLIKSKEQLKGSYILGLESTSTRMFNNG
EEECCCCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCEEEEECCCCHHHCCCC
KSVLFLNRINDPEIIMKKIDKITEDKLQEIMDRTFGAGIKNSAFVGEKLNLENVKNILDR
CEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHH
NQRAFKEAKSKLI
HHHHHHHHHHCCC
>Mature Secondary Structure
MYNLFTLDNGLRVVLENIDYVKSVSVGLWIENGSRNENLKNNGISHFIEHMMFKGTENRS
CCEEEEECCCHHHHHHHHHHHHHCEEEEEEECCCCCCCCCCCCHHHHHHHHHHCCCCCCH
ALQIAECIEDVGGQINAFTGKEATCYYIKILNSHIELALEVLSDMLFNSKFKEEDIEKEK
HHHHHHHHHHCCCCEEEECCCCCEEEEEHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHC
GVIIEEISMTEDSPEDVLSDLHCKAIWGDDSISYPILGTVETVKSFKRKDIVDYINKYYI
CEEEEEECCCCCCHHHHHHCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCC
PENSVISICGNFDINELEKLINKYFGNWNSGENKNITVYSKPKIENNHLFKNKNIEQLHI
CCCCEEEEECCCCHHHHHHHHHHHCCCCCCCCCCCEEEEECCCCCCCCEECCCCCEEEEE
SLGFEGLELGNDDAYPLILLSNVLGGGASSILFQKIREEKGLCYSIYSYMSSFNKTGAVS
EECCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCCEE
IYTGLNPAYTEDTITLIKKVVNDFSKEGINKEKLIKSKEQLKGSYILGLESTSTRMFNNG
EEECCCCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCEEEEECCCCHHHCCCC
KSVLFLNRINDPEIIMKKIDKITEDKLQEIMDRTFGAGIKNSAFVGEKLNLENVKNILDR
CEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHH
NQRAFKEAKSKLI
HHHHHHHHHHCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: Zn [C]

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: Hydrolase; Acting on peptide bonds (Peptidases); Endopeptidases of unknown catalytic mechanism [C]

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9384377; 8098035 [H]