Definition Escherichia coli HS, complete genome.
Accession NC_009800
Length 4,643,538

Click here to switch to the map view.

The map label for this gene is dkgB [H]

Identifier: 157159668

GI number: 157159668

Start: 228545

End: 229348

Strand: Direct

Name: dkgB [H]

Synonym: EcHS_A0211

Alternate gene names: 157159668

Gene position: 228545-229348 (Clockwise)

Preceding gene: 157159667

Following gene: 157159670

Centisome position: 4.92

GC content: 46.27

Gene sequence:

>804_bases
ATGGCTATCCCTGCATTTGGTTTAGGTACTTTCCGTCTGAAAGACGACGTTGTTATTTCATCTGTGAAAACGGCGCTTGA
ACTTGGTTATCGCGCAATTGATACCGCACAAATCTATGATAACGAAGCCGCAGTAGGTCAGGCGATTGCAGAAAGTGGCG
TGCCACGTCATGAACTCTACATCACCACTAAAATCTGGATTGAAAATCTCAGCAAAGACAAATTGATCCCAAGTCTGAAA
GAGAGCCTGCAAAAATTGCGTACCGATTATGTTGATCTGACGCTAATCCACTGGCCGTCACCAAACGATGAAGTCTCTGT
TGAAGAGTTTATGCAGGCGCTGCTGAAAGCCAAAAAACAAGGGCTGACGCGTGAGATCGGTATTTCCAACTTCACGATCC
CATTGATGGAAAAGGCGATTGCTGCTGTTGGCGCTGAAAACATCGCTACTAACCAGATTGAACTCTCTCCTTATCTGCAA
AACCGTAAAGTGGTTGCCTGGGCTAAACAGCACGGCATCCATATTACTTCCTATATGACGCTGGCGTATGGTAAGGCCCT
GAAAGATGAGGTTATTGCTCGTATCGCAGCTAAACACAATGCGACTCCGGCACAAGTGATTCTGGCGTGGGCTATGGGGG
AAGGTTACTCAGTAATTCCTTCTTCTACTAAACGTAAAAACCTGGAAAGTAATCTTAAGGCACAAAATTTACAGCTCGAT
GCCAAAGATAAAAAAGCGATCGCCGCCCTGGATTGCAACGACCGCCTGGTTAGCCCGAAAGGTCTGGCTCCTGAATGGGA
TTAA

Upstream 100 bases:

>100_bases
AGAATCGCAAAAATCCTCTGCATTTTACGCTCTTTTTCCTCAACAGTCTGAAGCCCATAATCACCTCAGTTAACGAAAAT
AGCATTAAAAGAGGCATATT

Downstream 100 bases:

>100_bases
GCCTCTCTGACAGCTCCTCCGGGAGCTGTTTTTACATGCTCGCTAAGGAAATCGATAAAAGCCCGGATGCGCGTACTTAC
CGCACGGTCGCTGTAATAGA

Product: 2,5-diketo-D-gluconate reductase B

Products: NA

Alternate protein names: 2,5-DKG reductase B; 2,5-DKGR B; 25DKGR-B; AKR5D [H]

Number of amino acids: Translated: 267; Mature: 266

Protein sequence:

>267_residues
MAIPAFGLGTFRLKDDVVISSVKTALELGYRAIDTAQIYDNEAAVGQAIAESGVPRHELYITTKIWIENLSKDKLIPSLK
ESLQKLRTDYVDLTLIHWPSPNDEVSVEEFMQALLKAKKQGLTREIGISNFTIPLMEKAIAAVGAENIATNQIELSPYLQ
NRKVVAWAKQHGIHITSYMTLAYGKALKDEVIARIAAKHNATPAQVILAWAMGEGYSVIPSSTKRKNLESNLKAQNLQLD
AKDKKAIAALDCNDRLVSPKGLAPEWD

Sequences:

>Translated_267_residues
MAIPAFGLGTFRLKDDVVISSVKTALELGYRAIDTAQIYDNEAAVGQAIAESGVPRHELYITTKIWIENLSKDKLIPSLK
ESLQKLRTDYVDLTLIHWPSPNDEVSVEEFMQALLKAKKQGLTREIGISNFTIPLMEKAIAAVGAENIATNQIELSPYLQ
NRKVVAWAKQHGIHITSYMTLAYGKALKDEVIARIAAKHNATPAQVILAWAMGEGYSVIPSSTKRKNLESNLKAQNLQLD
AKDKKAIAALDCNDRLVSPKGLAPEWD
>Mature_266_residues
AIPAFGLGTFRLKDDVVISSVKTALELGYRAIDTAQIYDNEAAVGQAIAESGVPRHELYITTKIWIENLSKDKLIPSLKE
SLQKLRTDYVDLTLIHWPSPNDEVSVEEFMQALLKAKKQGLTREIGISNFTIPLMEKAIAAVGAENIATNQIELSPYLQN
RKVVAWAKQHGIHITSYMTLAYGKALKDEVIARIAAKHNATPAQVILAWAMGEGYSVIPSSTKRKNLESNLKAQNLQLDA
KDKKAIAALDCNDRLVSPKGLAPEWD

Specific function: Catalyzes the reduction of 2,5-diketo-D-gluconic acid (25DKG) to 2-keto-L-gulonic acid (2KLG) [H]

COG id: COG0656

COG function: function code R; Aldo/keto reductases, related to diketogulonate reductase

Gene ontology:

Cell location: Cytoplasm [H]

Metaboloic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the aldo/keto reductase family [H]

Homologues:

Organism=Homo sapiens, GI310109922, Length=275, Percent_Identity=32, Blast_Score=143, Evalue=2e-34,
Organism=Homo sapiens, GI310109920, Length=292, Percent_Identity=30.4794520547945, Blast_Score=139, Evalue=2e-33,
Organism=Homo sapiens, GI45446745, Length=292, Percent_Identity=30.4794520547945, Blast_Score=139, Evalue=3e-33,
Organism=Homo sapiens, GI4503285, Length=292, Percent_Identity=30.4794520547945, Blast_Score=139, Evalue=3e-33,
Organism=Homo sapiens, GI5453543, Length=292, Percent_Identity=30.1369863013699, Blast_Score=137, Evalue=8e-33,
Organism=Homo sapiens, GI24497577, Length=289, Percent_Identity=29.757785467128, Blast_Score=137, Evalue=1e-32,
Organism=Homo sapiens, GI5174391, Length=289, Percent_Identity=29.757785467128, Blast_Score=137, Evalue=1e-32,
Organism=Homo sapiens, GI24497585, Length=291, Percent_Identity=29.553264604811, Blast_Score=135, Evalue=4e-32,
Organism=Homo sapiens, GI5174695, Length=293, Percent_Identity=30.7167235494881, Blast_Score=135, Evalue=5e-32,
Organism=Homo sapiens, GI24497583, Length=287, Percent_Identity=31.3588850174216, Blast_Score=134, Evalue=7e-32,
Organism=Homo sapiens, GI93277124, Length=299, Percent_Identity=30.1003344481605, Blast_Score=130, Evalue=1e-30,
Organism=Homo sapiens, GI223468663, Length=285, Percent_Identity=29.8245614035088, Blast_Score=127, Evalue=8e-30,
Organism=Homo sapiens, GI300116273, Length=269, Percent_Identity=30.4832713754647, Blast_Score=124, Evalue=7e-29,
Organism=Homo sapiens, GI4502049, Length=288, Percent_Identity=32.9861111111111, Blast_Score=124, Evalue=9e-29,
Organism=Homo sapiens, GI291291012, Length=273, Percent_Identity=28.9377289377289, Blast_Score=115, Evalue=4e-26,
Organism=Homo sapiens, GI300116271, Length=278, Percent_Identity=30.2158273381295, Blast_Score=115, Evalue=4e-26,
Organism=Homo sapiens, GI310109926, Length=114, Percent_Identity=37.719298245614, Blast_Score=79, Evalue=5e-15,
Organism=Homo sapiens, GI207028673, Length=114, Percent_Identity=37.719298245614, Blast_Score=79, Evalue=6e-15,
Organism=Homo sapiens, GI310109924, Length=155, Percent_Identity=29.0322580645161, Blast_Score=75, Evalue=8e-14,
Organism=Escherichia coli, GI1786400, Length=267, Percent_Identity=98.501872659176, Blast_Score=538, Evalue=1e-154,
Organism=Escherichia coli, GI87082198, Length=256, Percent_Identity=38.28125, Blast_Score=180, Evalue=9e-47,
Organism=Escherichia coli, GI1788081, Length=279, Percent_Identity=29.3906810035842, Blast_Score=108, Evalue=2e-25,
Organism=Escherichia coli, GI48994888, Length=271, Percent_Identity=29.1512915129151, Blast_Score=77, Evalue=1e-15,
Organism=Escherichia coli, GI1787674, Length=270, Percent_Identity=26.6666666666667, Blast_Score=70, Evalue=2e-13,
Organism=Caenorhabditis elegans, GI17550248, Length=271, Percent_Identity=35.4243542435424, Blast_Score=152, Evalue=1e-37,
Organism=Caenorhabditis elegans, GI17537075, Length=290, Percent_Identity=35.5172413793103, Blast_Score=143, Evalue=1e-34,
Organism=Caenorhabditis elegans, GI17537077, Length=288, Percent_Identity=35.4166666666667, Blast_Score=141, Evalue=4e-34,
Organism=Caenorhabditis elegans, GI71998625, Length=262, Percent_Identity=30.9160305343511, Blast_Score=137, Evalue=6e-33,
Organism=Caenorhabditis elegans, GI17561298, Length=276, Percent_Identity=30.4347826086957, Blast_Score=131, Evalue=3e-31,
Organism=Caenorhabditis elegans, GI17552492, Length=267, Percent_Identity=30.3370786516854, Blast_Score=130, Evalue=5e-31,
Organism=Caenorhabditis elegans, GI17538386, Length=281, Percent_Identity=31.3167259786477, Blast_Score=126, Evalue=1e-29,
Organism=Caenorhabditis elegans, GI17561300, Length=277, Percent_Identity=30.6859205776173, Blast_Score=125, Evalue=2e-29,
Organism=Caenorhabditis elegans, GI17566692, Length=262, Percent_Identity=32.0610687022901, Blast_Score=121, Evalue=4e-28,
Organism=Caenorhabditis elegans, GI17537079, Length=294, Percent_Identity=27.2108843537415, Blast_Score=119, Evalue=2e-27,
Organism=Caenorhabditis elegans, GI17564128, Length=292, Percent_Identity=27.0547945205479, Blast_Score=116, Evalue=1e-26,
Organism=Caenorhabditis elegans, GI17562292, Length=268, Percent_Identity=29.4776119402985, Blast_Score=114, Evalue=5e-26,
Organism=Caenorhabditis elegans, GI212645785, Length=140, Percent_Identity=30.7142857142857, Blast_Score=64, Evalue=7e-11,
Organism=Saccharomyces cerevisiae, GI6324694, Length=276, Percent_Identity=35.8695652173913, Blast_Score=158, Evalue=7e-40,
Organism=Saccharomyces cerevisiae, GI6320576, Length=282, Percent_Identity=36.5248226950355, Blast_Score=155, Evalue=7e-39,
Organism=Saccharomyces cerevisiae, GI6321896, Length=306, Percent_Identity=29.0849673202614, Blast_Score=126, Evalue=4e-30,
Organism=Saccharomyces cerevisiae, GI6322556, Length=259, Percent_Identity=26.2548262548263, Blast_Score=115, Evalue=6e-27,
Organism=Saccharomyces cerevisiae, GI6319625, Length=275, Percent_Identity=30.5454545454545, Blast_Score=108, Evalue=6e-25,
Organism=Saccharomyces cerevisiae, GI6320079, Length=258, Percent_Identity=30.2325581395349, Blast_Score=94, Evalue=2e-20,
Organism=Drosophila melanogaster, GI24662789, Length=293, Percent_Identity=31.0580204778157, Blast_Score=141, Evalue=4e-34,
Organism=Drosophila melanogaster, GI24644950, Length=294, Percent_Identity=31.9727891156463, Blast_Score=139, Evalue=2e-33,
Organism=Drosophila melanogaster, GI24657054, Length=288, Percent_Identity=31.25, Blast_Score=135, Evalue=4e-32,
Organism=Drosophila melanogaster, GI221378297, Length=314, Percent_Identity=30.5732484076433, Blast_Score=133, Evalue=1e-31,
Organism=Drosophila melanogaster, GI20129731, Length=286, Percent_Identity=31.1188811188811, Blast_Score=128, Evalue=4e-30,
Organism=Drosophila melanogaster, GI24663317, Length=285, Percent_Identity=29.8245614035088, Blast_Score=128, Evalue=5e-30,
Organism=Drosophila melanogaster, GI21356425, Length=286, Percent_Identity=30.7692307692308, Blast_Score=125, Evalue=3e-29,
Organism=Drosophila melanogaster, GI24662785, Length=289, Percent_Identity=33.5640138408304, Blast_Score=123, Evalue=1e-28,
Organism=Drosophila melanogaster, GI24662781, Length=290, Percent_Identity=33.1034482758621, Blast_Score=119, Evalue=3e-27,
Organism=Drosophila melanogaster, GI45553081, Length=284, Percent_Identity=28.169014084507, Blast_Score=113, Evalue=1e-25,
Organism=Drosophila melanogaster, GI281366140, Length=259, Percent_Identity=26.6409266409266, Blast_Score=94, Evalue=1e-19,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001395
- InterPro:   IPR018170
- InterPro:   IPR020471
- InterPro:   IPR023210 [H]

Pfam domain/function: PF00248 Aldo_ket_red [H]

EC number: =1.1.1.274 [H]

Molecular weight: Translated: 29449; Mature: 29318

Theoretical pI: Translated: 8.39; Mature: 8.39

Prosite motif: PS00798 ALDOKETO_REDUCTASE_1 ; PS00062 ALDOKETO_REDUCTASE_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.4 %Cys     (Translated Protein)
1.9 %Met     (Translated Protein)
2.2 %Cys+Met (Translated Protein)
0.4 %Cys     (Mature Protein)
1.5 %Met     (Mature Protein)
1.9 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MAIPAFGLGTFRLKDDVVISSVKTALELGYRAIDTAQIYDNEAAVGQAIAESGVPRHELY
CCCCCCCCCEEEECCHHHHHHHHHHHHHCHHHCCHHHHCCCHHHHHHHHHHCCCCCCEEE
ITTKIWIENLSKDKLIPSLKESLQKLRTDYVDLTLIHWPSPNDEVSVEEFMQALLKAKKQ
EEEEHHHHCCCCCCCCHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHC
GLTREIGISNFTIPLMEKAIAAVGAENIATNQIELSPYLQNRKVVAWAKQHGIHITSYMT
CCCEECCCCCCCHHHHHHHHHHHCCCCCCCCEEEECHHHCCCEEEEEHHHCCEEEEHHHH
LAYGKALKDEVIARIAAKHNATPAQVILAWAMGEGYSVIPSSTKRKNLESNLKAQNLQLD
HHHHHHHHHHHHHHHHHHCCCCHHHEEEEEECCCCCEECCCCHHHHHHHHCCCCCCEEEC
AKDKKAIAALDCNDRLVSPKGLAPEWD
CCCCCEEEEECCCCCEECCCCCCCCCC
>Mature Secondary Structure 
AIPAFGLGTFRLKDDVVISSVKTALELGYRAIDTAQIYDNEAAVGQAIAESGVPRHELY
CCCCCCCCEEEECCHHHHHHHHHHHHHCHHHCCHHHHCCCHHHHHHHHHHCCCCCCEEE
ITTKIWIENLSKDKLIPSLKESLQKLRTDYVDLTLIHWPSPNDEVSVEEFMQALLKAKKQ
EEEEHHHHCCCCCCCCHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHC
GLTREIGISNFTIPLMEKAIAAVGAENIATNQIELSPYLQNRKVVAWAKQHGIHITSYMT
CCCEECCCCCCCHHHHHHHHHHHCCCCCCCCEEEECHHHCCCEEEEEHHHCCEEEEHHHH
LAYGKALKDEVIARIAAKHNATPAQVILAWAMGEGYSVIPSSTKRKNLESNLKAQNLQLD
HHHHHHHHHHHHHHHHHHCCCCHHHEEEEEECCCCCEECCCCHHHHHHHHCCCCCCEEEC
AKDKKAIAALDCNDRLVSPKGLAPEWD
CCCCCEEEEECCCCCEECCCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796; 11108008 [H]