| Definition | Deinococcus geothermalis DSM 11300 plasmid pDGEO01, complete sequence. |
|---|---|
| Accession | NC_008010 |
| Length | 574,127 |
The map label for this gene is gntK [H]
Identifier: 94972280
GI number: 94972280
Start: 211233
End: 212732
Strand: Direct
Name: gntK [H]
Synonym: Dgeo_2817
Alternate gene names: 94972280
Gene position: 211233-212732 (Clockwise)
Preceding gene: 94972281
Following gene: 94972279
Centisome position: 36.79
GC content: 67.73
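The gene length, centisome position and translated length reported in this record can be recomputed from the coordinates above. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and assuming that the centisome position is the start coordinate expressed as a percentage of the replicon length. Both are assumptions about how the database derives these values, and the function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, replicon_length: int) -> float:
    """Start position as a percentage of the replicon length (assumed definition)."""
    return 100.0 * start / replicon_length


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# Values taken from this record: Start, End and plasmid Length.
length = gene_length(211233, 212732)
position = centisome(211233, 574127)
residues = length // 3 - 1  # one of the codons is the stop codon

print(length, round(position, 2), residues)  # 1500 36.79 499
```

Applying `gc_content` to the gene sequence with the spaces removed should give a value comparable to the GC content field, although the exact rounding used by the database is not stated.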
Gene sequence:
>1500_bases ATGACTGGGCAGCATTTCACGCAGCCGATGCGCACACCTCCGCCCGTGATGCTGAGCCTTGATTTGGGAAGCAGCGGACT CAAAGGATCGGCCTTTGACACCCTGGGTCGCTCGCTGGCGGGGCTGGAGGCACATGCCCCAGTCGCGCTGCGCTATGCCC CCGGCGGGGTCGCAGAGGTGGACCTTCCAGCGCTGATCCAGGGTATCGAAGAGATCCTCGACCGACTACACCAGCGGCTG GGTCAGCGGCCAGTGCTGGGTGTGGCGCTGACCAGCTTTGTGAGCAGTCTGGTTGCGCTGGATACCGCTGACCGGCCGAT TGGCCCGGTGCTTTCCTATGCCGATACCCGTTCGGCCGGCGAGGTGGCCGCCGTCGCGCGGCGGGTGGATCCCGAACAGG TCGGGTGCCCCGCCTTCAGCGCCTACTGGCCCGCCCAGATCCGCTGGTGGCAGGCAGCCCATCCCCACCTCCACGCCGCA CGCTATTGCAGCGTGCCTGATTATCTGCTGCGGCGCTGGACCGGTGCGTGGGTGACCAGTTACTCGTTGGCATCGTGGAC TGGAATGCTAGACCGCTTCAGCCTGAGCTGGAACGCGGAGGCACTGGCCGCCGCCGAGGTGAGGGCCGACCAATTGCCCG AGTTGGCGGACTACGACCTCGCCTTTTCCCTGCGCCCAGAATTCCAAGCCCGTTGGCCAAAACTGGCGCATGTTCCCTTT TATCTGGGGGTTTCCGACGGCGCGACGGCAACGGTGGGAAGTGGCGCCCTCCTGCCCGGCCGCTTCGCCCTGACGGTCGG CAGTACCAGTGCCGTTCGGATGGCCCTCACGGGGCCACCTCCAGCGATTCCGCCGGGCTTGTGGTCGTACCGCATCACGC GGGAGATCCACCTTCTGGGCGGCGCGCTGACCGAGGGGGGCAATCTCTACAGCTGGCTAACCTCTACGCTGCAGCTGGGC GGCAAGGAGCTCGAAGAAGAACTGCTGGGCATTGCGCCCGACAGCCACGGTCTTACCTTTATTCCCTCCTTGGGCGGCAC CCGTAGTCCGGATTATGACCCGCACGCACGCGGCACAGTGCACGGCCTGAGCTATGCCACCACACCCGCACAGATCGCCC GTGCCGGGATGGAGGGGGTGGCCTGCCGCCTGGCTGACCTGGCCTGGCGCCTCCCCATCACCGATGACGCCGTGTTTATC GCCAGCGGCAAGGCGCTGCTGGCCTCACGCCCCTGGCAGCAGATGCTGGCCGATGCCCTGGGGCGTCCTTTGCTGCTGGA AGACCGCCCGGCTGGGGCGAGCGCGCGCGGCGCAGCCCTGCTGGCGCTGGCCGCCCAGGGCCATCCTCTTCCCACCGAGC CGGCCGCGCAGCGCCTGGTGGAACCGGTGCCCGACCATCACGAGCTGTACCGAGCGGCCACCCAGCGGATGCGGGTCCTG GGAGCCGCGTTGGACAGGTTGCGAAAGGCGCAGGAGGTGCCCACATGCGACCCCGCCTGA
Upstream 100 bases:
>100_bases TCGCCGCCGCCGCCGTCCTGATTACGCTGCCGATTATCCTACTCTCCTTGACGGTACAAAAATACATCGTCAAGGGGCTG ACTGCGGGTGGCGTAAAAGG
Downstream 100 bases:
>100_bases CCGACGTGCTGGCTGCTGACCGCCTGCTGCCCCTCTTTACACCGGAGCCAGGAGAGACGGCAGCTAGGCGCCTGACAATT CTGAAGCGAGCTGGGATCCG
Product: carbohydrate kinase, FGGY
Products: NA
Alternate protein names: Gluconate kinase [H]
Number of amino acids: Translated: 499; Mature: 498
Protein sequence:
>499_residues MTGQHFTQPMRTPPPVMLSLDLGSSGLKGSAFDTLGRSLAGLEAHAPVALRYAPGGVAEVDLPALIQGIEEILDRLHQRL GQRPVLGVALTSFVSSLVALDTADRPIGPVLSYADTRSAGEVAAVARRVDPEQVGCPAFSAYWPAQIRWWQAAHPHLHAA RYCSVPDYLLRRWTGAWVTSYSLASWTGMLDRFSLSWNAEALAAAEVRADQLPELADYDLAFSLRPEFQARWPKLAHVPF YLGVSDGATATVGSGALLPGRFALTVGSTSAVRMALTGPPPAIPPGLWSYRITREIHLLGGALTEGGNLYSWLTSTLQLG GKELEEELLGIAPDSHGLTFIPSLGGTRSPDYDPHARGTVHGLSYATTPAQIARAGMEGVACRLADLAWRLPITDDAVFI ASGKALLASRPWQQMLADALGRPLLLEDRPAGASARGAALLALAAQGHPLPTEPAAQRLVEPVPDHHELYRAATQRMRVL GAALDRLRKAQEVPTCDPA
Sequences:
>Translated_499_residues MTGQHFTQPMRTPPPVMLSLDLGSSGLKGSAFDTLGRSLAGLEAHAPVALRYAPGGVAEVDLPALIQGIEEILDRLHQRL GQRPVLGVALTSFVSSLVALDTADRPIGPVLSYADTRSAGEVAAVARRVDPEQVGCPAFSAYWPAQIRWWQAAHPHLHAA RYCSVPDYLLRRWTGAWVTSYSLASWTGMLDRFSLSWNAEALAAAEVRADQLPELADYDLAFSLRPEFQARWPKLAHVPF YLGVSDGATATVGSGALLPGRFALTVGSTSAVRMALTGPPPAIPPGLWSYRITREIHLLGGALTEGGNLYSWLTSTLQLG GKELEEELLGIAPDSHGLTFIPSLGGTRSPDYDPHARGTVHGLSYATTPAQIARAGMEGVACRLADLAWRLPITDDAVFI ASGKALLASRPWQQMLADALGRPLLLEDRPAGASARGAALLALAAQGHPLPTEPAAQRLVEPVPDHHELYRAATQRMRVL GAALDRLRKAQEVPTCDPA
>Mature_498_residues TGQHFTQPMRTPPPVMLSLDLGSSGLKGSAFDTLGRSLAGLEAHAPVALRYAPGGVAEVDLPALIQGIEEILDRLHQRLG QRPVLGVALTSFVSSLVALDTADRPIGPVLSYADTRSAGEVAAVARRVDPEQVGCPAFSAYWPAQIRWWQAAHPHLHAAR YCSVPDYLLRRWTGAWVTSYSLASWTGMLDRFSLSWNAEALAAAEVRADQLPELADYDLAFSLRPEFQARWPKLAHVPFY LGVSDGATATVGSGALLPGRFALTVGSTSAVRMALTGPPPAIPPGLWSYRITREIHLLGGALTEGGNLYSWLTSTLQLGG KELEEELLGIAPDSHGLTFIPSLGGTRSPDYDPHARGTVHGLSYATTPAQIARAGMEGVACRLADLAWRLPITDDAVFIA SGKALLASRPWQQMLADALGRPLLLEDRPAGASARGAALLALAAQGHPLPTEPAAQRLVEPVPDHHELYRAATQRMRVLG AALDRLRKAQEVPTCDPA
Specific function: Unknown
COG id: COG1070
COG function: function code G; Sugar (pentulose and hexulose) kinases
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the FGGY kinase family [H]
Homologues:
Organism=Homo sapiens, GI42794763, Length=403, Percent_Identity=21.3399503722084, Blast_Score=70, Evalue=6e-12
Organism=Escherichia coli, GI1789987, Length=450, Percent_Identity=27.3333333333333, Blast_Score=84, Evalue=2e-17
Organism=Caenorhabditis elegans, GI17535599, Length=412, Percent_Identity=23.0582524271845, Blast_Score=74, Evalue=1e-13
Organism=Saccharomyces cerevisiae, GI6321755, Length=328, Percent_Identity=26.219512195122, Blast_Score=69, Evalue=2e-12
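The homologue entries follow a regular `key=value` layout, so they can be loaded into structured records for sorting or filtering. The sketch below is one possible way to do this in Python. The regular expression is an assumption based only on the entries shown in this record, and `parse_homologues` is a hypothetical helper, not part of any database tool.

```python
import re

# Pattern for one homologue entry, inferred from the entries in this record.
ENTRY = re.compile(
    r"Organism=(?P<organism>[^,]+), GI(?P<gi>\d+), Length=(?P<length>\d+), "
    r"Percent_Identity=(?P<identity>[\d.]+), Blast_Score=(?P<score>\d+), "
    r"Evalue=(?P<evalue>[\deE.+-]+)"
)


def parse_homologues(text: str) -> list:
    """Return one dict per homologue entry found in the text."""
    hits = []
    for m in ENTRY.finditer(text):
        hits.append({
            "organism": m["organism"],
            "gi": m["gi"],
            "length": int(m["length"]),
            "identity": float(m["identity"]),
            "score": int(m["score"]),
            "evalue": float(m["evalue"]),
        })
    return hits


# Two of the entries from this record, copied verbatim.
sample = (
    "Organism=Homo sapiens, GI42794763, Length=403, "
    "Percent_Identity=21.3399503722084, Blast_Score=70, Evalue=6e-12, "
    "Organism=Escherichia coli, GI1789987, Length=450, "
    "Percent_Identity=27.3333333333333, Blast_Score=84, Evalue=2e-17,"
)
hits = parse_homologues(sample)
best = max(hits, key=lambda h: h["score"])
print(best["organism"], round(best["identity"], 1))  # Escherichia coli 27.3
```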
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000577
- InterPro: IPR018485
- InterPro: IPR018483
- InterPro: IPR018484
- InterPro: IPR006002 [H]
Pfam domain/function: PF02782 FGGY_C; PF00370 FGGY_N [H]
EC number: 2.7.1.12 [H]
Molecular weight: Translated: 53358; Mature: 53227
Theoretical pI: Translated: 6.75; Mature: 6.75
Prosite motif: PS00678 WD_REPEATS_1
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.8 %Cys (Translated Protein)
1.6 %Met (Translated Protein)
2.4 %Cys+Met (Translated Protein)
0.8 %Cys (Mature Protein)
1.4 %Met (Mature Protein)
2.2 %Cys+Met (Mature Protein)
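These percentages are simple residue counts divided by sequence length. The sketch below shows the calculation in Python, assuming that the mature protein differs from the translated one only by removal of the initiator methionine (which is what the two sequences in this record suggest) and that values are rounded to one decimal place. The helper names and the toy peptide are hypothetical.

```python
def mature_form(protein: str) -> str:
    """Drop the initiator methionine, if present (the only processing assumed)."""
    return protein[1:] if protein.startswith("M") else protein


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given residues."""
    count = sum(aa in residues for aa in protein)
    return round(100.0 * count / len(protein), 1)


toy = "MTGQHCAM"  # hypothetical 8-residue peptide, not taken from the record
for label, seq in (("Translated", toy), ("Mature", mature_form(toy))):
    print(label,
          residue_percent(seq, "C"),
          residue_percent(seq, "M"),
          residue_percent(seq, "CM"))
```

For the 499-residue translated sequence, the reported 0.8 %Cys and 1.6 %Met are consistent with 4 Cys and 8 Met residues. Removing the initiator Met leaves 7 Met in 498 residues, which matches the 1.4 % reported for the mature protein.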
Secondary structure:
>Translated Secondary Structure
MTGQHFTQPMRTPPPVMLSLDLGSSGLKGSAFDTLGRSLAGLEAHAPVALRYAPGGVAEV
CCCCCCCCCCCCCCCEEEEEECCCCCCCCCHHHHHHHHHHCCCCCCCEEEEECCCCCCCC
DLPALIQGIEEILDRLHQRLGQRPVLGVALTSFVSSLVALDTADRPIGPVLSYADTRSAG
CHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHCCCCCH
EVAAVARRVDPEQVGCPAFSAYWPAQIRWWQAAHPHLHAARYCSVPDYLLRRWTGAWVTS
HHHHHHHHCCHHHCCCCCCCCCCCCCEEEEECCCCCHHHHHHCCCHHHHHHHHCCCHHHH
YSLASWTGMLDRFSLSWNAEALAAAEVRADQLPELADYDLAFSLRPEFQARWPKLAHVPF
HHHHHHHHHHHHEECCCCCHHHHHHHHHHHCCCCCCCCCEEEEECCCHHHCCCCEECCEE
YLGVSDGATATVGSGALLPGRFALTVGSTSAVRMALTGPPPAIPPGLWSYRITREIHLLG
EEECCCCCCEECCCCCCCCCEEEEEECCCCEEEEEECCCCCCCCCCCCCEEHHHEEEEEC
GALTEGGNLYSWLTSTLQLGGKELEEELLGIAPDSHGLTFIPSLGGTRSPDYDPHARGTV
CHHCCCCCHHHHHHHHHHHCCHHHHHHHHCCCCCCCCCEEEECCCCCCCCCCCCCCCCCC
HGLSYATTPAQIARAGMEGVACRLADLAWRLPITDDAVFIASGKALLASRPWQQMLADAL
CCCCCCCCHHHHHHHCHHHHHHHHHHHHEECCCCCCEEEEECCCHHHHCCCHHHHHHHHC
GRPLLLEDRPAGASARGAALLALAAQGHPLPTEPAAQRLVEPVPDHHELYRAATQRMRVL
CCCEEEECCCCCCCCCCCEEEEEECCCCCCCCCHHHHHHHCCCCCHHHHHHHHHHHHHHH
GAALDRLRKAQEVPTCDPA
HHHHHHHHHHHCCCCCCCC
>Mature Secondary Structure
TGQHFTQPMRTPPPVMLSLDLGSSGLKGSAFDTLGRSLAGLEAHAPVALRYAPGGVAEV
CCCCCCCCCCCCCCEEEEEECCCCCCCCCHHHHHHHHHHCCCCCCCEEEEECCCCCCCC
DLPALIQGIEEILDRLHQRLGQRPVLGVALTSFVSSLVALDTADRPIGPVLSYADTRSAG
CHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHCCCCCH
EVAAVARRVDPEQVGCPAFSAYWPAQIRWWQAAHPHLHAARYCSVPDYLLRRWTGAWVTS
HHHHHHHHCCHHHCCCCCCCCCCCCCEEEEECCCCCHHHHHHCCCHHHHHHHHCCCHHHH
YSLASWTGMLDRFSLSWNAEALAAAEVRADQLPELADYDLAFSLRPEFQARWPKLAHVPF
HHHHHHHHHHHHEECCCCCHHHHHHHHHHHCCCCCCCCCEEEEECCCHHHCCCCEECCEE
YLGVSDGATATVGSGALLPGRFALTVGSTSAVRMALTGPPPAIPPGLWSYRITREIHLLG
EEECCCCCCEECCCCCCCCCEEEEEECCCCEEEEEECCCCCCCCCCCCCEEHHHEEEEEC
GALTEGGNLYSWLTSTLQLGGKELEEELLGIAPDSHGLTFIPSLGGTRSPDYDPHARGTV
CHHCCCCCHHHHHHHHHHHCCHHHHHHHHCCCCCCCCCEEEECCCCCCCCCCCCCCCCCC
HGLSYATTPAQIARAGMEGVACRLADLAWRLPITDDAVFIASGKALLASRPWQQMLADAL
CCCCCCCCHHHHHHHCHHHHHHHHHHHHEECCCCCCEEEEECCCHHHHCCCHHHHHHHHC
GRPLLLEDRPAGASARGAALLALAAQGHPLPTEPAAQRLVEPVPDHHELYRAATQRMRVL
CCCEEEECCCCCCCCCCCEEEEEECCCCCCCCCHHHHHHHCCCCCHHHHHHHHHHHHHHH
GAALDRLRKAQEVPTCDPA
HHHHHHHHHHHCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 3020045; 7584049; 9384377 [H]