Definition | Escherichia coli HS, complete genome. |
---|---|
Accession | NC_009800 |
Length | 4,643,538 |
Click here to switch to the map view.
The map label for this gene is ytbE [H]
Identifier: 157159806
GI number: 157159806
Start: 367294
End: 368163
Strand: Direct
Name: ytbE [H]
Synonym: EcHS_A0353
Alternate gene names: 157159806
Gene position: 367294-368163 (Clockwise)
Preceding gene: 157159804
Following gene: 157159810
Centisome position: 7.91
GC content: 42.76
Gene sequence:
>870_bases GTGGAATTTTCCGTATTAAGTAACAATCTGAAAATGCCGATGATGGGATTTGGTGTTTTTCAGGTTACCGATAAAAACGT GTGCAAACAGTCAGTGTTGAACGCTATCCGCACAGGTTATCGACTCATCGACACTGCAGCGGTATACGGTAATGAAGATG CCGTCGGTGAAGCCGTTCGTGAAGCTATATCTGAAGGTTTATGTACCCGAGATGAGTTATTTATAACGTCTAAATTGTGG GTGCAGGATATGTTGAATCAGGATACGGCAGCAGCAGGTATTGAAGCATCATTAAAAAAATCGGGACTAGAGTACTTCGA CCTCTATTTATTGCACCAGGCTATGCGCGATTATTTCAGTGCGTGGCGTGCACTCGAAGATGCCTATGAAGAGGGCAAAT TAAAAGCAATTGGGGTTTCCAATTTCTATCCTCATGTTCTGGCGAACTTTTGTGAAACGGTAAGAGTTAAACCGATGGTC AACCAGGTCGAGTTGCATCCCTATTTTGCCCAACCAGAGGCGCTGGCAACCATGAAGTATTATAACGTGCAGCCTGAAGC ATGGGCTCCGTTAGGTGGTGGACGACATAAACCCTTTGAAAATAATCTGCTTCAGAGTATTGCAGATGCCCATCAAAAAT CGATTTCTCAAGTCATTCTGCGTTGGAATATTCAACGGGGAGTGGTCGTTATTCCGAAATCGACACATCAACAGCGTATC GAAGAAAATTTTGCTATCTGGGATTTCTCACTGACAGAGAAAGAAATGGCACAAATTAGTTCGCTTGATTTGGGTTATGT TGGGGAGTCGGTAAAACATTTTAATCCTGAATTTGTTCGTGGTTGTCTTGCTGTAAAAATACATGATTGA
Upstream 100 bases:
>100_bases TTAATGATTTTTTTGCATAAGTGATATCAAAATCCACGTACTAATTTGAGGTTACGTTTTAACGTAGACTCATTGTTCAT GCCTAATGGAGGGACTGACA
Downstream 100 bases:
>100_bases TATTAATCATATATTTTACCTGAGACGACAAGAATCTTTTAACAGGGGAGTGATATTGATCTTCACTCTGTCATATCTCC GGTAATATGGCGTCAGGCTT
Product: aldo/keto reductase family oxidoreductase
Products: 2-keto-L-gulonate; NADP [C]
Alternate protein names: NA
Number of amino acids: Translated: 289; Mature: 289
Protein sequence:
>289_residues MEFSVLSNNLKMPMMGFGVFQVTDKNVCKQSVLNAIRTGYRLIDTAAVYGNEDAVGEAVREAISEGLCTRDELFITSKLW VQDMLNQDTAAAGIEASLKKSGLEYFDLYLLHQAMRDYFSAWRALEDAYEEGKLKAIGVSNFYPHVLANFCETVRVKPMV NQVELHPYFAQPEALATMKYYNVQPEAWAPLGGGRHKPFENNLLQSIADAHQKSISQVILRWNIQRGVVVIPKSTHQQRI EENFAIWDFSLTEKEMAQISSLDLGYVGESVKHFNPEFVRGCLAVKIHD
Sequences:
>Translated_289_residues MEFSVLSNNLKMPMMGFGVFQVTDKNVCKQSVLNAIRTGYRLIDTAAVYGNEDAVGEAVREAISEGLCTRDELFITSKLW VQDMLNQDTAAAGIEASLKKSGLEYFDLYLLHQAMRDYFSAWRALEDAYEEGKLKAIGVSNFYPHVLANFCETVRVKPMV NQVELHPYFAQPEALATMKYYNVQPEAWAPLGGGRHKPFENNLLQSIADAHQKSISQVILRWNIQRGVVVIPKSTHQQRI EENFAIWDFSLTEKEMAQISSLDLGYVGESVKHFNPEFVRGCLAVKIHD >Mature_289_residues MEFSVLSNNLKMPMMGFGVFQVTDKNVCKQSVLNAIRTGYRLIDTAAVYGNEDAVGEAVREAISEGLCTRDELFITSKLW VQDMLNQDTAAAGIEASLKKSGLEYFDLYLLHQAMRDYFSAWRALEDAYEEGKLKAIGVSNFYPHVLANFCETVRVKPMV NQVELHPYFAQPEALATMKYYNVQPEAWAPLGGGRHKPFENNLLQSIADAHQKSISQVILRWNIQRGVVVIPKSTHQQRI EENFAIWDFSLTEKEMAQISSLDLGYVGESVKHFNPEFVRGCLAVKIHD
Specific function: Reduction [C]
COG id: COG0656
COG function: function code R; Aldo/keto reductases, related to diketogulonate reductase
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the aldo/keto reductase family [H]
Homologues:
Organism=Homo sapiens, GI5174695, Length=293, Percent_Identity=37.8839590443686, Blast_Score=184, Evalue=8e-47, Organism=Homo sapiens, GI223468663, Length=295, Percent_Identity=36.271186440678, Blast_Score=170, Evalue=2e-42, Organism=Homo sapiens, GI24497577, Length=289, Percent_Identity=34.2560553633218, Blast_Score=167, Evalue=1e-41, Organism=Homo sapiens, GI5174391, Length=289, Percent_Identity=34.2560553633218, Blast_Score=167, Evalue=1e-41, Organism=Homo sapiens, GI300116271, Length=273, Percent_Identity=37.3626373626374, Blast_Score=164, Evalue=8e-41, Organism=Homo sapiens, GI300116273, Length=276, Percent_Identity=36.231884057971, Blast_Score=163, Evalue=2e-40, Organism=Homo sapiens, GI310109922, Length=265, Percent_Identity=35.4716981132075, Blast_Score=161, Evalue=5e-40, Organism=Homo sapiens, GI45446745, Length=286, Percent_Identity=33.9160839160839, Blast_Score=160, Evalue=2e-39, Organism=Homo sapiens, GI4503285, Length=286, Percent_Identity=33.9160839160839, Blast_Score=160, Evalue=2e-39, Organism=Homo sapiens, GI310109920, Length=286, Percent_Identity=33.5664335664336, Blast_Score=159, Evalue=2e-39, Organism=Homo sapiens, GI24497583, Length=287, Percent_Identity=34.1463414634146, Blast_Score=159, Evalue=2e-39, Organism=Homo sapiens, GI24497585, Length=289, Percent_Identity=34.2560553633218, Blast_Score=159, Evalue=4e-39, Organism=Homo sapiens, GI5453543, Length=286, Percent_Identity=33.5664335664336, Blast_Score=159, Evalue=4e-39, Organism=Homo sapiens, GI291291012, Length=279, Percent_Identity=34.4086021505376, Blast_Score=150, Evalue=2e-36, Organism=Homo sapiens, GI93277124, Length=296, Percent_Identity=32.0945945945946, Blast_Score=148, Evalue=5e-36, Organism=Homo sapiens, GI4502049, Length=293, Percent_Identity=33.1058020477816, Blast_Score=146, Evalue=2e-35, Organism=Homo sapiens, GI310109924, Length=150, Percent_Identity=35.3333333333333, Blast_Score=94, Evalue=1e-19, Organism=Homo sapiens, GI207028673, Length=103, Percent_Identity=39.8058252427184, Blast_Score=78, Evalue=9e-15, Organism=Homo sapiens, GI310109926, Length=103, Percent_Identity=38.8349514563107, Blast_Score=77, Evalue=1e-14, Organism=Escherichia coli, GI87082198, Length=257, Percent_Identity=40.0778210116732, Blast_Score=199, Evalue=2e-52, Organism=Escherichia coli, GI1786400, Length=274, Percent_Identity=32.1167883211679, Blast_Score=133, Evalue=1e-32, Organism=Escherichia coli, GI1788081, Length=274, Percent_Identity=29.1970802919708, Blast_Score=86, Evalue=3e-18, Organism=Escherichia coli, GI48994888, Length=282, Percent_Identity=24.822695035461, Blast_Score=67, Evalue=1e-12, Organism=Caenorhabditis elegans, GI17550248, Length=284, Percent_Identity=36.9718309859155, Blast_Score=171, Evalue=3e-43, Organism=Caenorhabditis elegans, GI17564128, Length=292, Percent_Identity=33.9041095890411, Blast_Score=164, Evalue=3e-41, Organism=Caenorhabditis elegans, GI17537077, Length=288, Percent_Identity=35.0694444444444, Blast_Score=159, Evalue=2e-39, Organism=Caenorhabditis elegans, GI71998625, Length=269, Percent_Identity=34.2007434944238, Blast_Score=155, Evalue=2e-38, Organism=Caenorhabditis elegans, GI17537075, Length=291, Percent_Identity=34.3642611683849, Blast_Score=153, Evalue=8e-38, Organism=Caenorhabditis elegans, GI17566692, Length=282, Percent_Identity=32.9787234042553, Blast_Score=149, Evalue=1e-36, Organism=Caenorhabditis elegans, GI17561298, Length=274, Percent_Identity=31.3868613138686, Blast_Score=144, Evalue=4e-35, Organism=Caenorhabditis elegans, GI17537079, Length=297, Percent_Identity=28.956228956229, Blast_Score=130, Evalue=6e-31, Organism=Caenorhabditis elegans, GI17552492, Length=268, Percent_Identity=28.7313432835821, Blast_Score=127, Evalue=5e-30, Organism=Caenorhabditis elegans, GI17538386, Length=273, Percent_Identity=28.2051282051282, Blast_Score=121, Evalue=4e-28, Organism=Caenorhabditis elegans, GI17562292, Length=276, Percent_Identity=25.7246376811594, Blast_Score=112, Evalue=2e-25, Organism=Caenorhabditis elegans, GI17561300, Length=273, Percent_Identity=27.8388278388278, Blast_Score=107, Evalue=9e-24, Organism=Caenorhabditis elegans, GI17550246, Length=75, Percent_Identity=44, Blast_Score=70, Evalue=2e-12, Organism=Saccharomyces cerevisiae, GI6321896, Length=304, Percent_Identity=35.1973684210526, Blast_Score=187, Evalue=2e-48, Organism=Saccharomyces cerevisiae, GI6320576, Length=288, Percent_Identity=32.6388888888889, Blast_Score=154, Evalue=2e-38, Organism=Saccharomyces cerevisiae, GI6324694, Length=301, Percent_Identity=30.5647840531561, Blast_Score=148, Evalue=8e-37, Organism=Saccharomyces cerevisiae, GI6322556, Length=271, Percent_Identity=33.5793357933579, Blast_Score=142, Evalue=5e-35, Organism=Saccharomyces cerevisiae, GI6319625, Length=312, Percent_Identity=30.4487179487179, Blast_Score=136, Evalue=4e-33, Organism=Saccharomyces cerevisiae, GI6320079, Length=290, Percent_Identity=31.0344827586207, Blast_Score=120, Evalue=3e-28, Organism=Drosophila melanogaster, GI24662789, Length=293, Percent_Identity=38.2252559726962, Blast_Score=187, Evalue=7e-48, Organism=Drosophila melanogaster, GI24657054, Length=289, Percent_Identity=36.6782006920415, Blast_Score=178, Evalue=5e-45, Organism=Drosophila melanogaster, GI21356425, Length=289, Percent_Identity=33.9100346020761, Blast_Score=165, Evalue=3e-41, Organism=Drosophila melanogaster, GI24663317, Length=299, Percent_Identity=34.4481605351171, Blast_Score=164, Evalue=6e-41, Organism=Drosophila melanogaster, GI24644950, Length=300, Percent_Identity=33, Blast_Score=160, Evalue=1e-39, Organism=Drosophila melanogaster, GI20129731, Length=287, Percent_Identity=33.4494773519164, Blast_Score=157, Evalue=7e-39, Organism=Drosophila melanogaster, GI221378297, Length=320, Percent_Identity=30.9375, Blast_Score=152, Evalue=2e-37, Organism=Drosophila melanogaster, GI45553081, Length=288, Percent_Identity=29.5138888888889, Blast_Score=147, Evalue=7e-36, Organism=Drosophila melanogaster, GI24662785, Length=283, Percent_Identity=34.2756183745583, Blast_Score=144, Evalue=9e-35, Organism=Drosophila melanogaster, GI24662781, Length=277, Percent_Identity=33.9350180505415, Blast_Score=142, Evalue=2e-34, Organism=Drosophila melanogaster, GI281366140, Length=255, Percent_Identity=29.4117647058824, Blast_Score=127, Evalue=1e-29,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001395 - InterPro: IPR018170 - InterPro: IPR020471 - InterPro: IPR023210 [H]
Pfam domain/function: PF00248 Aldo_ket_red [H]
EC number: NA
Molecular weight: Translated: 32690; Mature: 32690
Theoretical pI: Translated: 5.86; Mature: 5.86
Prosite motif: PS00798 ALDOKETO_REDUCTASE_1 ; PS00063 ALDOKETO_REDUCTASE_3
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.4 %Cys (Translated Protein) 3.1 %Met (Translated Protein) 4.5 %Cys+Met (Translated Protein) 1.4 %Cys (Mature Protein) 3.1 %Met (Mature Protein) 4.5 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MEFSVLSNNLKMPMMGFGVFQVTDKNVCKQSVLNAIRTGYRLIDTAAVYGNEDAVGEAVR CCCCEECCCCCCCEECCCEEEECCHHHHHHHHHHHHHHCHHHHHHHHHCCCCHHHHHHHH EAISEGLCTRDELFITSKLWVQDMLNQDTAAAGIEASLKKSGLEYFDLYLLHQAMRDYFS HHHHCCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHH AWRALEDAYEEGKLKAIGVSNFYPHVLANFCETVRVKPMVNQVELHPYFAQPEALATMKY HHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHCCCCCHHHEEECCCCCCCHHHHHHHH YNVQPEAWAPLGGGRHKPFENNLLQSIADAHQKSISQVILRWNIQRGVVVIPKSTHQQRI CCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHEECCCCEEEECCCHHHHHH EENFAIWDFSLTEKEMAQISSLDLGYVGESVKHFNPEFVRGCLAVKIHD HCCCEEEEECCCHHHHHHHHCCCHHHHCCHHHHCCHHHHHCEEEEEECC >Mature Secondary Structure MEFSVLSNNLKMPMMGFGVFQVTDKNVCKQSVLNAIRTGYRLIDTAAVYGNEDAVGEAVR CCCCEECCCCCCCEECCCEEEECCHHHHHHHHHHHHHHCHHHHHHHHHCCCCHHHHHHHH EAISEGLCTRDELFITSKLWVQDMLNQDTAAAGIEASLKKSGLEYFDLYLLHQAMRDYFS HHHHCCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHH AWRALEDAYEEGKLKAIGVSNFYPHVLANFCETVRVKPMVNQVELHPYFAQPEALATMKY HHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHCCCCCHHHEEECCCCCCCHHHHHHHH YNVQPEAWAPLGGGRHKPFENNLLQSIADAHQKSISQVILRWNIQRGVVVIPKSTHQQRI CCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHEECCCCEEEECCCHHHHHH EENFAIWDFSLTEKEMAQISSLDLGYVGESVKHFNPEFVRGCLAVKIHD HCCCEEEEECCCHHHHHHHHCCCHHHHCCHHHHCCHHHHHCEEEEEECC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: 2,5-diketo-D-gluconate; NADPH [C]
Specific reaction: 2,5-diketo-D-gluconate + NADPH = 2-keto-L-gulonate + NADP [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 9387221; 9384377 [H]