Definition | Clostridium botulinum A str. ATCC 3502, complete genome. |
---|---|
Accession | NC_009495 |
Length | 3,886,916 |
The map label for this gene is dhaK [H]
Identifier: 148378967
GI number: 148378967
Start: 1074611
End: 1076371
Strand: Reverse
Name: dhaK [H]
Synonym: CBO0975
Alternate gene names: 148378967
Gene position: 1076371-1074611 (Counterclockwise)
Preceding gene: 148378968
Following gene: 148378966
Centisome position: 27.69
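The record does not say how the centisome position is derived. The following is a minimal sketch, assuming it is the gene's 5' coordinate (1076371 on the reverse strand) expressed as a percentage of the genome length (3,886,916). That assumption reproduces the listed value; the function name `centisome` is hypothetical.

```python
def centisome(position: int, genome_length: int) -> float:
    """Return a 1-based genome coordinate as a percentage of genome length."""
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    if not 1 <= position <= genome_length:
        raise ValueError("position must lie within the genome")
    return 100.0 * position / genome_length


GENOME_LENGTH = 3_886_916   # Length field of NC_009495
GENE_FIVE_PRIME = 1_076_371  # first coordinate in the Gene position field

print(round(centisome(GENE_FIVE_PRIME, GENOME_LENGTH), 2))  # 27.69
```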
GC content: 33.33
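A GC content value such as the one above can be recomputed from the gene sequence. The sketch below assumes GC content is the count of G and C bases divided by the total number of bases, as a percentage. It strips the whitespace that this record inserts into sequences; the function name `gc_percent` is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces, line breaks) is ignored; every other character
    counts toward the total length (an assumption of this sketch).
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Toy input, not the gene sequence from this record.
print(gc_percent("ATGC"))  # 50.0
```

To check the record, pass the text of the Gene sequence field (without the `>1761_bases` header) to `gc_percent`.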
Gene sequence:
>1761_bases ATGAAAAAGATAATAAATAGACCTGAGACTGTAGTTATGGAGATGTGTAATGGAATAGCTATGGCACATCCTGAACTAGA GTTTGTTAGAAAATATAAAGTAATGAAAAAAAAGAATATAAACAAAAATAAAGTTAGCTTAATTAGTGGTGGTGGAAGTG GTCACGAACCAGCTCATGCCGGATTTATTGGAAAAGGAATGTTAGATGCAGCAGTATGCGGAGACATATTCGCTTCTCCC TCTCAAATTCAAGTATATAAAGCTATAAAAGCTACAGCTAGCGAAAAAGGAACATTATTAATTATAAAAAACTATAGTGG AGATATAATGAATTTTAAAAATGCTGCCCATTTAGCAAGTGAGGATGGTATTAAAGTTGACTATGTTAAAGTTGATGACG ATATAGCTGTTGAAGATAGCTTATATACTGTAGGACGTAGAGGAGTTGCAGGTACTGTACTAGTTCATAAAATAGCAGGT GCAGCTGCAGAATTAGGCTTTTCATTAGAAAAAGTTAAATCAATTGCTGAAAAAGCTGTTTCTAATGTTAGAAGCTTAGG TTTTGCTTTCTCTTCTTGTACAGTTCCAGCTAAGAGAACTCCAACTTTTCAATTAGCTGAGGATGAAATGGAATTTGGTG TTGGAATTCATGGCGAACCTGGAATAGTAAGAGAAAAAGTTGCTACAGCAGATGAATTAGCTAAAAAAATTGTTGATTCC ATATTAAAAGACATGAAAATAGATGGATCAAATAATGAAGAAGTAGCGTTATTAATTAATGGTTTTGGTGCTACCCCTCT ACAAGAATTATATCTATTTAATAATTCAGTTACTGCTGAGCTAGCAAAGAAAAATATAAAAATAAATAGAACATTTGTAG GAAACTATATGACGAGTATAGATATGGAGGGAGCTTCAGTATCTATCATGAAATTAGATGATGAGCTTAAAGAATTACTA TCAAAAGAAAGTGATACTCCTGCATTTAAAGTTTCAGGACCAGTTGAATCAGTTGAATATATCAGCTTAGAAGATAATAA TGATATTGAAAATGAAGTTAGTTTTGATTTGGAAACTTGTGAGTGTCATAGCGAAATAAAAGATAAAAAAATCACTTTAG ATAATATGATTTATATTATAGATAAGATGAGTGAAGTAATAATAGCAAATGAAGTTCACTTCTGTGAATTGGATTCTCAT GCTGGTGATGGTGATTTCGGTATGAGTGTAGCAAAAGGCTTTAAACAATTAAAAAGAGAATGGAAACATATTCTTAAAGA AGATAATATGAATATTGGTAAATTCTTAAACGAATGCTCTTTAATAATTATGGAACATTGTGGTGGAGCATCAGGCCCAA TATGGGGATCAGCATTTAGATCCGCTGGAAAACAAGTTGGAGATAAAAAAGAATTATCAGTTTCAGATTTTGCTCAAATG ATGCAGGCAGCTGTAAAAGGAATTCAAGCTACAGGAGAACGTTCTTTTGGAAGAGGAGCAGTAGTTGGAGATAAAACTTT AATTGACGCATTAGTTCCTTGTGCTGATTTGTGGAGTGAGAGTGCTAATAACAATACTTCTATACATGAAGCCTTCCAAA AAGGTGCCGCTGCAGCAGTTAAAGGTGCAAAAATGACTGAAGAAATAGTAGCTCGTATGGGTAGAGCAGGTACAGTTGGT GAAAGAAGCATTGGTTACCCAGATGCAGGCGCTTATGGATTAGGAGTTATATTTACAGAAATATCAAATTCATTAAAATA A
Upstream 100 bases:
>100_bases TCTATGCCATTTAAGGTAACTCCAGAAGCAGTAGCAGCAGCAATAATAACAGCAGATACTTTAGGTAAAAAATATAAATC AAATAAATAGAGGTGTAAAC
Downstream 100 bases:
>100_bases GCCTTCTAATATTATAATAGCTCATACAATATTATAATTTAAAATTTAACCTAAAAACTCCAGTAAACATCTTATTTTTT ACTGGGGTTTTTATTTTTAC
Product: dihydroxyacetone kinase
Products: ADP; glycerone phosphate
Alternate protein names: NA
Number of amino acids: Translated: 586; Mature: 586
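The residue count can be cross-checked against the Start and End coordinates. This is a minimal sketch, assuming inclusive 1-based coordinates and a coding sequence that ends with one stop codon; both function names are hypothetical.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with inclusive 1-based coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(n_bases: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a coding sequence of n_bases."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = n_bases // 3
    return codons - 1 if has_stop_codon else codons


n_bases = gene_length(1_074_611, 1_076_371)
print(n_bases)                  # 1761
print(protein_length(n_bases))  # 586
```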
Protein sequence:
>586_residues MKKIINRPETVVMEMCNGIAMAHPELEFVRKYKVMKKKNINKNKVSLISGGGSGHEPAHAGFIGKGMLDAAVCGDIFASP SQIQVYKAIKATASEKGTLLIIKNYSGDIMNFKNAAHLASEDGIKVDYVKVDDDIAVEDSLYTVGRRGVAGTVLVHKIAG AAAELGFSLEKVKSIAEKAVSNVRSLGFAFSSCTVPAKRTPTFQLAEDEMEFGVGIHGEPGIVREKVATADELAKKIVDS ILKDMKIDGSNNEEVALLINGFGATPLQELYLFNNSVTAELAKKNIKINRTFVGNYMTSIDMEGASVSIMKLDDELKELL SKESDTPAFKVSGPVESVEYISLEDNNDIENEVSFDLETCECHSEIKDKKITLDNMIYIIDKMSEVIIANEVHFCELDSH AGDGDFGMSVAKGFKQLKREWKHILKEDNMNIGKFLNECSLIIMEHCGGASGPIWGSAFRSAGKQVGDKKELSVSDFAQM MQAAVKGIQATGERSFGRGAVVGDKTLIDALVPCADLWSESANNNTSIHEAFQKGAAAAVKGAKMTEEIVARMGRAGTVG ERSIGYPDAGAYGLGVIFTEISNSLK
Sequences:
>Translated_586_residues MKKIINRPETVVMEMCNGIAMAHPELEFVRKYKVMKKKNINKNKVSLISGGGSGHEPAHAGFIGKGMLDAAVCGDIFASP SQIQVYKAIKATASEKGTLLIIKNYSGDIMNFKNAAHLASEDGIKVDYVKVDDDIAVEDSLYTVGRRGVAGTVLVHKIAG AAAELGFSLEKVKSIAEKAVSNVRSLGFAFSSCTVPAKRTPTFQLAEDEMEFGVGIHGEPGIVREKVATADELAKKIVDS ILKDMKIDGSNNEEVALLINGFGATPLQELYLFNNSVTAELAKKNIKINRTFVGNYMTSIDMEGASVSIMKLDDELKELL SKESDTPAFKVSGPVESVEYISLEDNNDIENEVSFDLETCECHSEIKDKKITLDNMIYIIDKMSEVIIANEVHFCELDSH AGDGDFGMSVAKGFKQLKREWKHILKEDNMNIGKFLNECSLIIMEHCGGASGPIWGSAFRSAGKQVGDKKELSVSDFAQM MQAAVKGIQATGERSFGRGAVVGDKTLIDALVPCADLWSESANNNTSIHEAFQKGAAAAVKGAKMTEEIVARMGRAGTVG ERSIGYPDAGAYGLGVIFTEISNSLK
>Mature_586_residues MKKIINRPETVVMEMCNGIAMAHPELEFVRKYKVMKKKNINKNKVSLISGGGSGHEPAHAGFIGKGMLDAAVCGDIFASP SQIQVYKAIKATASEKGTLLIIKNYSGDIMNFKNAAHLASEDGIKVDYVKVDDDIAVEDSLYTVGRRGVAGTVLVHKIAG AAAELGFSLEKVKSIAEKAVSNVRSLGFAFSSCTVPAKRTPTFQLAEDEMEFGVGIHGEPGIVREKVATADELAKKIVDS ILKDMKIDGSNNEEVALLINGFGATPLQELYLFNNSVTAELAKKNIKINRTFVGNYMTSIDMEGASVSIMKLDDELKELL SKESDTPAFKVSGPVESVEYISLEDNNDIENEVSFDLETCECHSEIKDKKITLDNMIYIIDKMSEVIIANEVHFCELDSH AGDGDFGMSVAKGFKQLKREWKHILKEDNMNIGKFLNECSLIIMEHCGGASGPIWGSAFRSAGKQVGDKKELSVSDFAQM MQAAVKGIQATGERSFGRGAVVGDKTLIDALVPCADLWSESANNNTSIHEAFQKGAAAAVKGAKMTEEIVARMGRAGTVG ERSIGYPDAGAYGLGVIFTEISNSLK
Specific function: Dihydroxyacetone-binding subunit of dihydroxyacetone kinase, the enzyme that phosphorylates dihydroxyacetone [H]
COG id: COG2376
COG function: function code G; Dihydroxyacetone kinase
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 DhaK domain [H]
Homologues:
Organism | GI | Length | Percent identity | BLAST score | E-value |
---|---|---|---|---|---|
Homo sapiens | GI20149621 | 578 | 31.3148788927336 | 279 | 5e-75 |
Escherichia coli | GI87081857 | 352 | 43.1818181818182 | 284 | 1e-77 |
Escherichia coli | GI1787449 | 179 | 31.2849162011173 | 89 | 6e-19 |
Caenorhabditis elegans | GI17565018 | 583 | 31.0463121783877 | 273 | 2e-73 |
Saccharomyces cerevisiae | GI6321055 | 599 | 32.8881469115192 | 237 | 5e-63 |
Saccharomyces cerevisiae | GI6323570 | 593 | 32.8836424957842 | 233 | 5e-62 |
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR004006
- InterPro: IPR012736 [H]
Pfam domain/function: PF02733 Dak1 [H]
EC number: 2.7.1.29
Molecular weight: Translated: 63297; Mature: 63297
Theoretical pI: Translated: 5.45; Mature: 5.45
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.5% Cys (Translated Protein)
3.6% Met (Translated Protein)
5.1% Cys+Met (Translated Protein)
1.5% Cys (Mature Protein)
3.6% Met (Mature Protein)
5.1% Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure
MKKIINRPETVVMEMCNGIAMAHPELEFVRKYKVMKKKNINKNKVSLISGGGSGHEPAHA
CCCCCCCHHHHHHHHHCCCEECCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCC
GFIGKGMLDAAVCGDIFASPSQIQVYKAIKATASEKGTLLIIKNYSGDIMNFKNAAHLAS
CCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCHHHHHCC
EDGIKVDYVKVDDDIAVEDSLYTVGRRGVAGTVLVHKIAGAAAELGFSLEKVKSIAEKAV
CCCCEEEEEEECCCCEECCHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH
SNVRSLGFAFSSCTVPAKRTPTFQLAEDEMEFGVGIHGEPGIVREKVATADELAKKIVDS
HHHHHHHHHHHCCCCCCCCCCCEEECCHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHH
ILKDMKIDGSNNEEVALLINGFGATPLQELYLFNNSVTAELAKKNIKINRTFVGNYMTSI
HHHHCCCCCCCCCEEEEEEECCCCCHHHHHHHHCCCHHHHHHHCCCEEEEEEECCHHEEE
DMEGASVSIMKLDDELKELLSKESDTPAFKVSGPVESVEYISLEDNNDIENEVSFDLETC
ECCCCEEEEEECCHHHHHHHHCCCCCCEEEECCCCCCEEEEEECCCCCCCCHHCCCHHHH
ECHSEIKDKKITLDNMIYIIDKMSEVIIANEVHFCELDSHAGDGDFGMSVAKGFKQLKRE
HHHHHHCCCEEEHHHHHHHHHHHHHHHEECCEEEEEECCCCCCCCCHHHHHHHHHHHHHH
WKHILKEDNMNIGKFLNECSLIIMEHCGGASGPIWGSAFRSAGKQVGDKKELSVSDFAQM
HHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHCCCCCCCCHHHHHHH
MQAAVKGIQATGERSFGRGAVVGDKTLIDALVPCADLWSESANNNTSIHEAFQKGAAAAV
HHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHCHHHHH
KGAKMTEEIVARMGRAGTVGERSIGYPDAGAYGLGVIFTEISNSLK
HCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCC
>Mature Secondary Structure
MKKIINRPETVVMEMCNGIAMAHPELEFVRKYKVMKKKNINKNKVSLISGGGSGHEPAHA
CCCCCCCHHHHHHHHHCCCEECCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCC
GFIGKGMLDAAVCGDIFASPSQIQVYKAIKATASEKGTLLIIKNYSGDIMNFKNAAHLAS
CCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCHHHHHCC
EDGIKVDYVKVDDDIAVEDSLYTVGRRGVAGTVLVHKIAGAAAELGFSLEKVKSIAEKAV
CCCCEEEEEEECCCCEECCHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH
SNVRSLGFAFSSCTVPAKRTPTFQLAEDEMEFGVGIHGEPGIVREKVATADELAKKIVDS
HHHHHHHHHHHCCCCCCCCCCCEEECCHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHH
ILKDMKIDGSNNEEVALLINGFGATPLQELYLFNNSVTAELAKKNIKINRTFVGNYMTSI
HHHHCCCCCCCCCEEEEEEECCCCCHHHHHHHHCCCHHHHHHHCCCEEEEEEECCHHEEE
DMEGASVSIMKLDDELKELLSKESDTPAFKVSGPVESVEYISLEDNNDIENEVSFDLETC
ECCCCEEEEEECCHHHHHHHHCCCCCCEEEECCCCCCEEEEEECCCCCCCCHHCCCHHHH
ECHSEIKDKKITLDNMIYIIDKMSEVIIANEVHFCELDSHAGDGDFGMSVAKGFKQLKRE
HHHHHHCCCEEEHHHHHHHHHHHHHHHEECCEEEEEECCCCCCCCCHHHHHHHHHHHHHH
WKHILKEDNMNIGKFLNECSLIIMEHCGGASGPIWGSAFRSAGKQVGDKKELSVSDFAQM
HHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHCCCCCCCCHHHHHHH
MQAAVKGIQATGERSFGRGAVVGDKTLIDALVPCADLWSESANNNTSIHEAFQKGAAAAV
HHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHCHHHHH
KGAKMTEEIVARMGRAGTVGERSIGYPDAGAYGLGVIFTEISNSLK
HCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: ATP; glycerone
Specific reaction: ATP + glycerone = ADP + glycerone phosphate
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 11337471 [H]