Definition Clostridium botulinum A str. Hall, complete genome.
Accession NC_009698
Length 3,760,560

Click here to switch to the map view.

The map label for this gene is dapG [H]

Identifier: 153935207

GI number: 153935207

Start: 2383898

End: 2385103

Strand: Reverse

Name: dapG [H]

Synonym: CLC_2256

Alternate gene names: 153935207

Gene position: 2385103-2383898 (Counterclockwise)

Preceding gene: 153936559

Following gene: 153934679

Centisome position: 63.42

GC content: 31.26

Gene sequence:

>1206_bases
ATGAAAATCTTAATTCAAAAGTTTGGAGGAACATCTGTATCTACAGCTGAAAGAAGATCTTTAGTAGTAGATAAAATAGT
TAAGGCAAAAAAAGCAGGTTATTATCCAGTGGTGGTAGTATCTGCTATGGGTAGAAAAGGACAGCCTTATGCTACAGACA
CATTGAGATCTTTAGTTGAAGAAGATTTTTTAGATAAAAATACCTTAGCTGCTGATCTTTTAATGGGTTGCGGTGAATTA
ATAAGCACAGTAGTTATGAGTTCAGAACTTTTTAATAAGGGAATAGATGCAGTACCTTTAATGGGAGGACAGGCAGGTAT
TATAACAGATAATAATTTTAACAATGCCTCTGTACTTAGAGTAGAAAAAGATAGAATAGTAGATTTACTGAAAAAAGATA
AAATTCCTGTAGTGGCAGGTTTTCAAGGGAAAAGTGAAGATGGATATATAACTACCCTAGGAAGAGGTGGCAGTGATGTT
ACTGCCGCACTTTTAGGAACTGCATTAGAGGCTGAAAGTGTAGAAATATATACGGATGTAGATGGTATAATGACTGCAGA
TCCTAGAATAGTGGAGAATGCCTCCTTAATAAAAGAAATAAGTTATAATGAGGTGTTTCAATTTGCAGATCAAGGGGCTA
AGGTAATACATCCAAGGGCAGTAGAAATTGCTATGACGTCAAATATAAAACTTGTAATAAAAAACACTATGACAGATTGC
AAAGGTACTACTATAAATAATATAGGAATTAAAAATTCAAATAATGTTATAACAGGTATTACCCATATGAGTAATAGAAC
GCAAATTATAGTAGATGCAGAAGAAAATAAAGGAAATAAAAACTATACTAACCTATTAAATTCATTGGCAGAAAATTCTA
TTAGTATTGACCTTATAAATGTTTTTCCTAAAGAAAAAATATTCACTATAGATGAAAAAGATTTTAATGAGTTTAGTTCT
ATAATGGAAGGTTTAAAAATAAAATTTTCCTACTTAAAAGATTGTAGTAAAATAGCTATAATAGGTAGTAGAATGAGAGG
GATACCTGGGGTTATGGCTAAAATATTAAAGGCACTTGTAGAAAGAAATATAGAAGTGCTTCAAACAGCAGACTCACATA
CAACTATATGGTGTCTTGTTTCTAAGGAAGATACAGAAAAAGCTATAAAATCATTGCACTGTGAATTTAAGTTAGATTGT
ATATAA

Upstream 100 bases:

>100_bases
AGTGTCTTGGGACTCTGTAAAAAAAATAGGATCAAGAACTATAATAATTGATGCTGAAAAGAGAGAATTAAAAAAAATCC
GCTAGGGAGATGGTAGACAA

Downstream 100 bases:

>100_bases
ACAAAATTCTTAGCTTCAGGTGGAGTTTTAACTCAAATTTGAGTTAAGAAGTGCTTAGATAGGTAATATCAGACATTATC
TCTAACTTTATAAGTGGGAG

Product: aspartate kinase I

Products: NA

Alternate protein names: Aspartate kinase 1; Aspartokinase I [H]

Number of amino acids: Translated: 401; Mature: 401

Protein sequence:

>401_residues
MKILIQKFGGTSVSTAERRSLVVDKIVKAKKAGYYPVVVVSAMGRKGQPYATDTLRSLVEEDFLDKNTLAADLLMGCGEL
ISTVVMSSELFNKGIDAVPLMGGQAGIITDNNFNNASVLRVEKDRIVDLLKKDKIPVVAGFQGKSEDGYITTLGRGGSDV
TAALLGTALEAESVEIYTDVDGIMTADPRIVENASLIKEISYNEVFQFADQGAKVIHPRAVEIAMTSNIKLVIKNTMTDC
KGTTINNIGIKNSNNVITGITHMSNRTQIIVDAEENKGNKNYTNLLNSLAENSISIDLINVFPKEKIFTIDEKDFNEFSS
IMEGLKIKFSYLKDCSKIAIIGSRMRGIPGVMAKILKALVERNIEVLQTADSHTTIWCLVSKEDTEKAIKSLHCEFKLDC
I

Sequences:

>Translated_401_residues
MKILIQKFGGTSVSTAERRSLVVDKIVKAKKAGYYPVVVVSAMGRKGQPYATDTLRSLVEEDFLDKNTLAADLLMGCGEL
ISTVVMSSELFNKGIDAVPLMGGQAGIITDNNFNNASVLRVEKDRIVDLLKKDKIPVVAGFQGKSEDGYITTLGRGGSDV
TAALLGTALEAESVEIYTDVDGIMTADPRIVENASLIKEISYNEVFQFADQGAKVIHPRAVEIAMTSNIKLVIKNTMTDC
KGTTINNIGIKNSNNVITGITHMSNRTQIIVDAEENKGNKNYTNLLNSLAENSISIDLINVFPKEKIFTIDEKDFNEFSS
IMEGLKIKFSYLKDCSKIAIIGSRMRGIPGVMAKILKALVERNIEVLQTADSHTTIWCLVSKEDTEKAIKSLHCEFKLDC
I
>Mature_401_residues
MKILIQKFGGTSVSTAERRSLVVDKIVKAKKAGYYPVVVVSAMGRKGQPYATDTLRSLVEEDFLDKNTLAADLLMGCGEL
ISTVVMSSELFNKGIDAVPLMGGQAGIITDNNFNNASVLRVEKDRIVDLLKKDKIPVVAGFQGKSEDGYITTLGRGGSDV
TAALLGTALEAESVEIYTDVDGIMTADPRIVENASLIKEISYNEVFQFADQGAKVIHPRAVEIAMTSNIKLVIKNTMTDC
KGTTINNIGIKNSNNVITGITHMSNRTQIIVDAEENKGNKNYTNLLNSLAENSISIDLINVFPKEKIFTIDEKDFNEFSS
IMEGLKIKFSYLKDCSKIAIIGSRMRGIPGVMAKILKALVERNIEVLQTADSHTTIWCLVSKEDTEKAIKSLHCEFKLDC
I

Specific function: Catalyzes 2 nonconsecutive reactions in the common biosynthetic pathway leading from Asp to diaminopimelate and Lys, to Met, and to Thr and Ile. [C]

COG id: COG0527

COG function: function code E; Aspartokinases

Gene ontology:

Cell location: Cytoplasm [C]

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the aspartokinase family [H]

Homologues:

Organism=Escherichia coli, GI1786183, Length=462, Percent_Identity=27.0562770562771, Blast_Score=124, Evalue=1e-29,
Organism=Escherichia coli, GI1790455, Length=266, Percent_Identity=30.4511278195489, Blast_Score=94, Evalue=1e-20,
Organism=Escherichia coli, GI1790376, Length=177, Percent_Identity=29.3785310734463, Blast_Score=70, Evalue=2e-13,
Organism=Saccharomyces cerevisiae, GI6320893, Length=364, Percent_Identity=24.7252747252747, Blast_Score=104, Evalue=3e-23,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001048
- InterPro:   IPR005260
- InterPro:   IPR001341
- InterPro:   IPR018042 [H]

Pfam domain/function: PF00696 AA_kinase [H]

EC number: =2.7.2.4 [H]

Molecular weight: Translated: 43872; Mature: 43872

Theoretical pI: Translated: 6.19; Mature: 6.19

Prosite motif: PS00324 ASPARTOKINASE

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.5 %Cys     (Translated Protein)
3.0 %Met     (Translated Protein)
4.5 %Cys+Met (Translated Protein)
1.5 %Cys     (Mature Protein)
3.0 %Met     (Mature Protein)
4.5 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MKILIQKFGGTSVSTAERRSLVVDKIVKAKKAGYYPVVVVSAMGRKGQPYATDTLRSLVE
CEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCHHHHHHHHHH
EDFLDKNTLAADLLMGCGELISTVVMSSELFNKGIDAVPLMGGQAGIITDNNFNNASVLR
HHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCEEEEECCCCCCEEEEE
VEKDRIVDLLKKDKIPVVAGFQGKSEDGYITTLGRGGSDVTAALLGTALEAESVEIYTDV
ECHHHHHHHHHHCCCCEEECCCCCCCCCEEEEECCCCCHHHHHHHHHHCCCCCEEEEECC
DGIMTADPRIVENASLIKEISYNEVFQFADQGAKVIHPRAVEIAMTSNIKLVIKNTMTDC
CCEEECCCCHHHHHHHHHHCCHHHHHHHHHCCCEEECCCEEEEEEECCEEEEEECCCCCC
KGTTINNIGIKNSNNVITGITHMSNRTQIIVDAEENKGNKNYTNLLNSLAENSISIDLIN
CCCEEECCCCCCCCCEEEEEEECCCCEEEEEECCCCCCCCHHHHHHHHHHCCCEEEEEEE
VFPKEKIFTIDEKDFNEFSSIMEGLKIKFSYLKDCSKIAIIGSRMRGIPGVMAKILKALV
ECCCCEEEEECCCCHHHHHHHHHHHEEHHHHHHCCHHEEEEHHHHCCCHHHHHHHHHHHH
ERNIEVLQTADSHTTIWCLVSKEDTEKAIKSLHCEFKLDCI
HCCCEEEEECCCCEEEEEEEECCHHHHHHHHHCCCEEEECC
>Mature Secondary Structure
MKILIQKFGGTSVSTAERRSLVVDKIVKAKKAGYYPVVVVSAMGRKGQPYATDTLRSLVE
CEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCHHHHHHHHHH
EDFLDKNTLAADLLMGCGELISTVVMSSELFNKGIDAVPLMGGQAGIITDNNFNNASVLR
HHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCEEEEECCCCCCEEEEE
VEKDRIVDLLKKDKIPVVAGFQGKSEDGYITTLGRGGSDVTAALLGTALEAESVEIYTDV
ECHHHHHHHHHHCCCCEEECCCCCCCCCEEEEECCCCCHHHHHHHHHHCCCCCEEEEECC
DGIMTADPRIVENASLIKEISYNEVFQFADQGAKVIHPRAVEIAMTSNIKLVIKNTMTDC
CCEEECCCCHHHHHHHHHHCCHHHHHHHHHCCCEEECCCEEEEEEECCEEEEEECCCCCC
KGTTINNIGIKNSNNVITGITHMSNRTQIIVDAEENKGNKNYTNLLNSLAENSISIDLIN
CCCEEECCCCCCCCCEEEEEEECCCCEEEEEECCCCCCCCHHHHHHHHHHCCCEEEEEEE
VFPKEKIFTIDEKDFNEFSSIMEGLKIKFSYLKDCSKIAIIGSRMRGIPGVMAKILKALV
ECCCCEEEEECCCCHHHHHHHHHHHEEHHHHHHCCHHEEEEHHHHCCCHHHHHHHHHHHH
ERNIEVLQTADSHTTIWCLVSKEDTEKAIKSLHCEFKLDCI
HCCCEEEEECCCCEEEEEEEECCHHHHHHHHHCCCEEEECC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8098035; 9384377 [H]