Definition Hyphomonas neptunium ATCC 15444 chromosome, complete genome.
Accession NC_008358
Length 3,705,021

The map label for this gene is iolC [H]

Identifier: 114797675

GI number: 114797675

Start: 2272534

End: 2274447

Strand: Direct

Name: iolC [H]

Synonym: HNE_2182

Alternate gene names: 114797675

Gene position: 2272534-2274447 (Clockwise)

Preceding gene: 114798245

Following gene: 114798854

Centisome position: 61.34
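
The coordinates above can be cross-checked with a little arithmetic. The sketch below assumes that Start and End are 1-based inclusive positions and that the centisome position is the gene start expressed as a percentage of the chromosome length. The record does not state that definition, and the function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of chromosome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


# Values taken from the record: Start, End and chromosome Length.
length = gene_length(2272534, 2274447)
position = centisome(2272534, 3705021)
print(length, position)  # 1914 61.34
```

Under these assumptions the result agrees with the 1914-base gene sequence and the centisome value listed above. 1914 bases correspond to 638 codons, that is, 637 amino acids plus one stop codon.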

GC content: 58.41

Gene sequence:

>1914_bases
ATGACTGACCTTCGCCGCACACTTGATGTGATTACAATCGGTCGCGCGGGCGTAGACCTCTATGGACAGCAGGTTGGTGG
CAATCTGGAAGATGTTCACTCGTTCGCAAAATATCTCGGCGGATGCCCGGCCAACATCGCGGTGGGCGCCTCCCGGCTCG
GGCTTCGCTCTGGAATTATCACGCGGGTCGGTAGCGATCATATGGGACGTTTCCTGCAAACCGAGTTCCAGCGAGAGGGT
GTCAATACTGATGGCATTGTTACAGACAAGGAGCGGCTAACAGCGCTCGTGATTCTCGGAATTCAGGATGAAGACACTTT
CCCGCTCATATTCTACCGGGAGAACTGCGCCGATATGGCTCTGTGTGAAGCCGACATCGACACCTCATTCATAGCGCAAA
GCCGCAGCATCCTGATCACCGGAACTCATCTGTCGACGGAACAGACAAGGGCCACATCCCGCGCAGCCATTACAGCAGCA
AAAGCAAGCAATTGCAGCATCATCCTGGATATCGACTACAGACCCGTCCTCTGGGGACTGACATCGCGGGAATTGGGTGA
GGAACGCTTTGTTTCTGACGCCAAGGTCACAGCCACCCTTCAGGAATTCATCCCGGCGTGCGACCTTATCGTCGGCACCG
AGGAAGAACTGCACATCCTGGGGGGGACGACTGATACCATCGCGGCGATGCACGCTATTCGCGCCCTCTCGGACGCAGTG
ATCGTTTGCAAACGCGGCGCGTTGGGCTGCAGCGTCTTCCCGGAAACGATCCCCCAAACTCTGGATCAGGGCATCTCTGG
TCAGTCGTTCAAGGTTGAAGTGTTCAACGTTCTAGGCGCCGGCGACGCCTTCATGAGCGGCTTCCTGAGCGGCTGGCTCC
ACGATAAACCACTTGAGGAGTGTTGCCGCCTTGGCAACGCTTGCGGCGCGATCGTGGTCTCGCGCCACGGCTGCGCCCCG
GCTATTCCGACCGCAGAAGAACTCGCCTGGTTTCTGGAGAATGGTTCACCGCATCTGGCTCTGCGCAAGGACGCGGCCCT
GGAGCACATTCACTGGACCACAACCCGCAGCCCCCGCGAACGGGATCTGGCAGTCTTCGCCATTGATCACCGATCCCAGC
TAGAAGCCATCTCGCGGGAAGCCGCCGCCCCGCTTGAAAAAATCTCCGAATTCAAATCTCTCGCGCTCAGCGCCCTCCAG
ATGATGGATGTTTCTGATGTGGATCTTGGCATCCTATTGGACGGTCGATACGGAAGCCGGGCCTTGGCTACGGCCAGCGG
CCTGCCAATCTGGGTCGGCCGGGCGGTCGAACTGCCAGGCTCCCGGCCGCTTGACTTCGATAGCCAATCGACCGTCACGG
GCGAGATGCTGGAATGGCCCCGGTCCCAAGTCGCTAAATGCCTGTGTTTTTATCATCCGGATGATCCTTCCGAGATCAAA
AACATTCAGGAGCGCCGAATTCAGGAGGCCTTCAAGGCCAGCCGGGAAACACAACGAGAATTCCTGCTCGAAATTATTTG
TTCGCAAAACGGCGCTCTGGCCGACACCACCGTCAGCTCCGCCATGCGTCGCATTTATGAGCTGGGTGTCTTTCCTGACT
GGTGGAAACTCGAAGCCACCGCCAGCGCGCAAAGCTGGGTGGAGACATCTCGCGTGATCGACGAAATGGACCCGAGGTGC
CGCGGAATCCTACTTCTGGGACTTGCCGCGCCCGCCGAAGACGTGGTGGCATGCTTCAAGGTTGCCGCCGCCTTCCCCCT
TATTAAGGGATTTGCGGTTGGCAGAACAATATTCCAGGAGCCCGCGCGGCAATGGTTTAAGGGAGAAATCGACGATGGCG
AAGCCGTTTTCGCCCTCCGCCGCAATTTCAGCCAGCTGATTGACGGTTGGCGTCAGTCCCGGCAGGCGCCCTGA
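
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is the percentage of G and C bases over the full sequence length; `gc_content` is a hypothetical helper name, and only the first ten bases of the gene are used here to keep the example short.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = "".join(sequence.split()).upper()  # drop line breaks from FASTA-style text
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return round(100.0 * gc / len(seq), 2)


# First ten bases of the gene sequence above.
print(gc_content("ATGACTGACC"))  # 50.0
```

Passing the complete 1914-base sequence instead of the ten-base prefix gives the value to compare against the GC content field.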

Upstream 100 bases:

>100_bases
GGCCGCCGACCGCTCCCCATTTTATTTGCGAGATCTGATTGAAATATATTTTGCACTTGAATAAGAATTAGAAAATTTTT
GCAATAGGACTATTCAGTGC

Downstream 100 bases:

>100_bases
GCGAGAGGACACTGACGGATGGAAACGCTTAACCTGACAATGTCCCAGGCCCTGGTGCGCTATCTGGCCGCCCAGTATAT
GGTGATTGACGGGGAAGAAG

Product: kinase IolC

Products: NA

Alternate protein names: 2-deoxy-5-keto-D-gluconate kinase; DKG kinase [H]

Number of amino acids: Translated: 637; Mature: 636

Protein sequence:

>637_residues
MTDLRRTLDVITIGRAGVDLYGQQVGGNLEDVHSFAKYLGGCPANIAVGASRLGLRSGIITRVGSDHMGRFLQTEFQREG
VNTDGIVTDKERLTALVILGIQDEDTFPLIFYRENCADMALCEADIDTSFIAQSRSILITGTHLSTEQTRATSRAAITAA
KASNCSIILDIDYRPVLWGLTSRELGEERFVSDAKVTATLQEFIPACDLIVGTEEELHILGGTTDTIAAMHAIRALSDAV
IVCKRGALGCSVFPETIPQTLDQGISGQSFKVEVFNVLGAGDAFMSGFLSGWLHDKPLEECCRLGNACGAIVVSRHGCAP
AIPTAEELAWFLENGSPHLALRKDAALEHIHWTTTRSPRERDLAVFAIDHRSQLEAISREAAAPLEKISEFKSLALSALQ
MMDVSDVDLGILLDGRYGSRALATASGLPIWVGRAVELPGSRPLDFDSQSTVTGEMLEWPRSQVAKCLCFYHPDDPSEIK
NIQERRIQEAFKASRETQREFLLEIICSQNGALADTTVSSAMRRIYELGVFPDWWKLEATASAQSWVETSRVIDEMDPRC
RGILLLGLAAPAEDVVACFKVAAAFPLIKGFAVGRTIFQEPARQWFKGEIDDGEAVFALRRNFSQLIDGWRQSRQAP
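
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below assumes the standard genetic code (the bacterial table uses the same codon assignments apart from alternative start codons) and translates the direct strand from the first base; `translate` is an illustrative name, not part of any tool used to build this record.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGACTGACCTTCGCCGCACACTTGATGTG"))  # MTDLRRTLDV
print(translate("CAGGCGCCCTGA"))                    # QAP
```

Both fragments match the start and end of the protein sequence listed above.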

Sequences:

>Translated_637_residues
MTDLRRTLDVITIGRAGVDLYGQQVGGNLEDVHSFAKYLGGCPANIAVGASRLGLRSGIITRVGSDHMGRFLQTEFQREG
VNTDGIVTDKERLTALVILGIQDEDTFPLIFYRENCADMALCEADIDTSFIAQSRSILITGTHLSTEQTRATSRAAITAA
KASNCSIILDIDYRPVLWGLTSRELGEERFVSDAKVTATLQEFIPACDLIVGTEEELHILGGTTDTIAAMHAIRALSDAV
IVCKRGALGCSVFPETIPQTLDQGISGQSFKVEVFNVLGAGDAFMSGFLSGWLHDKPLEECCRLGNACGAIVVSRHGCAP
AIPTAEELAWFLENGSPHLALRKDAALEHIHWTTTRSPRERDLAVFAIDHRSQLEAISREAAAPLEKISEFKSLALSALQ
MMDVSDVDLGILLDGRYGSRALATASGLPIWVGRAVELPGSRPLDFDSQSTVTGEMLEWPRSQVAKCLCFYHPDDPSEIK
NIQERRIQEAFKASRETQREFLLEIICSQNGALADTTVSSAMRRIYELGVFPDWWKLEATASAQSWVETSRVIDEMDPRC
RGILLLGLAAPAEDVVACFKVAAAFPLIKGFAVGRTIFQEPARQWFKGEIDDGEAVFALRRNFSQLIDGWRQSRQAP
>Mature_636_residues
TDLRRTLDVITIGRAGVDLYGQQVGGNLEDVHSFAKYLGGCPANIAVGASRLGLRSGIITRVGSDHMGRFLQTEFQREGV
NTDGIVTDKERLTALVILGIQDEDTFPLIFYRENCADMALCEADIDTSFIAQSRSILITGTHLSTEQTRATSRAAITAAK
ASNCSIILDIDYRPVLWGLTSRELGEERFVSDAKVTATLQEFIPACDLIVGTEEELHILGGTTDTIAAMHAIRALSDAVI
VCKRGALGCSVFPETIPQTLDQGISGQSFKVEVFNVLGAGDAFMSGFLSGWLHDKPLEECCRLGNACGAIVVSRHGCAPA
IPTAEELAWFLENGSPHLALRKDAALEHIHWTTTRSPRERDLAVFAIDHRSQLEAISREAAAPLEKISEFKSLALSALQM
MDVSDVDLGILLDGRYGSRALATASGLPIWVGRAVELPGSRPLDFDSQSTVTGEMLEWPRSQVAKCLCFYHPDDPSEIKN
IQERRIQEAFKASRETQREFLLEIICSQNGALADTTVSSAMRRIYELGVFPDWWKLEATASAQSWVETSRVIDEMDPRCR
GILLLGLAAPAEDVVACFKVAAAFPLIKGFAVGRTIFQEPARQWFKGEIDDGEAVFALRRNFSQLIDGWRQSRQAP
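
The two entries above differ only in the first residue. A minimal sketch of that relationship, assuming the mature form in this record is simply the translated protein with the initiator methionine removed (the record does not describe any other processing); `mature_form` is a hypothetical helper.

```python
def mature_form(translated: str) -> str:
    """Remove a leading initiator Met, if present (assumed processing step)."""
    return translated[1:] if translated.startswith("M") else translated


# First 20 residues of the translated sequence above.
print(mature_form("MTDLRRTLDVITIGRAGVDL"))  # TDLRRTLDVITIGRAGVDL
```

This is consistent with the listed lengths of 637 (translated) and 636 (mature) residues.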

Specific function: Catalyzes the phosphorylation of 5-dehydro-2-deoxy-D-gluconate (2-deoxy-5-keto-D-gluconate or DKG) to 6-phospho-5-dehydro-2-deoxy-D-gluconate (DKGP) [H]

COG id: COG0524

COG function: function code G; Sugar kinases, ribokinase family

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the carbohydrate kinase pfkB family [H]

Homologues:

Organism=Escherichia coli, GI87081971, Length=346, Percent_Identity=26.3005780346821, Blast_Score=76, Evalue=8e-15
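
Homologue lines use a comma-separated key=value layout, with the GI identifier given as a bare token. The sketch below shows one way to read such a line into typed fields; the function name and the choice to store the bare token under an `"ID"` key are assumptions made for illustration.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict, tolerating a trailing comma."""
    fields = {}
    for part in line.split(","):
        part = part.strip()
        if not part:
            continue  # empty piece left by a trailing comma
        key, sep, value = part.partition("=")
        if sep:
            fields[key] = value
        else:
            fields["ID"] = key  # bare token such as GI87081971 (assumed to be the identifier)
    # Convert the numeric fields.
    fields["Length"] = int(fields["Length"])
    fields["Percent_Identity"] = float(fields["Percent_Identity"])
    fields["Blast_Score"] = int(fields["Blast_Score"])
    fields["Evalue"] = float(fields["Evalue"])
    return fields


hit = parse_homologue(
    "Organism=Escherichia coli, GI87081971, Length=346, "
    "Percent_Identity=26.3005780346821, Blast_Score=76, Evalue=8e-15,"
)
print(hit["Organism"], hit["ID"], round(hit["Percent_Identity"], 1))
```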

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR011611
- InterPro:   IPR002173
- InterPro:   IPR022841 [H]

Pfam domain/function: PF00294 PfkB [H]

EC number: 2.7.1.92 [H]

Molecular weight: Translated: 69767; Mature: 69636

Theoretical pI: Translated: 4.88; Mature: 4.88

Prosite motif: PS00435 PEROXIDASE_1 ; PS00584 PFKB_KINASES_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

2.5 %Cys     (Translated Protein)
1.6 %Met     (Translated Protein)
4.1 %Cys+Met (Translated Protein)
2.5 %Cys     (Mature Protein)
1.4 %Met     (Mature Protein)
3.9 %Cys+Met (Mature Protein)
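
The percentages above are residue counts divided by protein length. A minimal sketch of that calculation, assuming one-letter amino acid codes and rounding to one decimal place as in the record; `cys_met_content` is a hypothetical name, and the example uses only the first 20 residues of the translated protein.

```python
def cys_met_content(protein: str) -> dict:
    """Percent Cys, Met and Cys+Met in a one-letter protein sequence."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    cys = seq.count("C")
    met = seq.count("M")
    n = len(seq)
    return {
        "Cys": round(100.0 * cys / n, 1),
        "Met": round(100.0 * met / n, 1),
        "Cys+Met": round(100.0 * (cys + met) / n, 1),
    }


# First 20 residues of the translated sequence: one Met, no Cys.
print(cys_met_content("MTDLRRTLDVITIGRAGVDL"))
# {'Cys': 0.0, 'Met': 5.0, 'Cys+Met': 5.0}
```

Running it on the full translated and mature sequences gives the values to compare against the table above.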

Secondary structure:

>Translated Secondary Structure
MTDLRRTLDVITIGRAGVDLYGQQVGGNLEDVHSFAKYLGGCPANIAVGASRLGLRSGII
CCCHHHHHHEEEECCCCHHHHHHHHCCCHHHHHHHHHHHCCCCCHHHHCHHHHHHHHHHH
TRVGSDHMGRFLQTEFQREGVNTDGIVTDKERLTALVILGIQDEDTFPLIFYRENCADMA
HHCCCHHHHHHHHHHHHHCCCCCCCEEECHHHEEEEEEEEECCCCCCEEEEEECCCCCHH
LCEADIDTSFIAQSRSILITGTHLSTEQTRATSRAAITAAKASNCSIILDIDYRPVLWGL
HHHCCCCHHHHHCCCCEEEEECCCCCHHHHHHHHHHEEEECCCCCEEEEEECCCCEEECC
TSRELGEERFVSDAKVTATLQEFIPACDLIVGTEEELHILGGTTDTIAAMHAIRALSDAV
CHHHHCHHHHHCCHHHHHHHHHHCCHHHEEECCCCCEEEECCCHHHHHHHHHHHHHHHHH
IVCKRGALGCSVFPETIPQTLDQGISGQSFKVEVFNVLGAGDAFMSGFLSGWLHDKPLEE
EEECCCCCCCCCCHHHHHHHHHCCCCCCCEEEEEEEECCCCHHHHHHHHHHHCCCCCHHH
CCRLGNACGAIVVSRHGCAPAIPTAEELAWFLENGSPHLALRKDAALEHIHWTTTRSPRE
HHHHCCCCCEEEEECCCCCCCCCCHHHHHHHHHCCCCCEEEECCCHHHHEEEECCCCCCC
RDLAVFAIDHRSQLEAISREAAAPLEKISEFKSLALSALQMMDVSDVDLGILLDGRYGSR
CCEEEEEECCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCC
ALATASGLPIWVGRAVELPGSRPLDFDSQSTVTGEMLEWPRSQVAKCLCFYHPDDPSEIK
HHHHCCCCCEEECCEEECCCCCCCCCCCCCCCCHHHHHCCHHHHHHEEEEECCCCHHHHH
NIQERRIQEAFKASRETQREFLLEIICSQNGALADTTVSSAMRRIYELGVFPDWWKLEAT
HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEC
ASAQSWVETSRVIDEMDPRCRGILLLGLAAPAEDVVACFKVAAAFPLIKGFAVGRTIFQE
CCHHHHHHHHHHHHHCCCCCCEEEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
PARQWFKGEIDDGEAVFALRRNFSQLIDGWRQSRQAP
HHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCC
>Mature Secondary Structure 
TDLRRTLDVITIGRAGVDLYGQQVGGNLEDVHSFAKYLGGCPANIAVGASRLGLRSGII
CCHHHHHHEEEECCCCHHHHHHHHCCCHHHHHHHHHHHCCCCCHHHHCHHHHHHHHHHH
TRVGSDHMGRFLQTEFQREGVNTDGIVTDKERLTALVILGIQDEDTFPLIFYRENCADMA
HHCCCHHHHHHHHHHHHHCCCCCCCEEECHHHEEEEEEEEECCCCCCEEEEEECCCCCHH
LCEADIDTSFIAQSRSILITGTHLSTEQTRATSRAAITAAKASNCSIILDIDYRPVLWGL
HHHCCCCHHHHHCCCCEEEEECCCCCHHHHHHHHHHEEEECCCCCEEEEEECCCCEEECC
TSRELGEERFVSDAKVTATLQEFIPACDLIVGTEEELHILGGTTDTIAAMHAIRALSDAV
CHHHHCHHHHHCCHHHHHHHHHHCCHHHEEECCCCCEEEECCCHHHHHHHHHHHHHHHHH
IVCKRGALGCSVFPETIPQTLDQGISGQSFKVEVFNVLGAGDAFMSGFLSGWLHDKPLEE
EEECCCCCCCCCCHHHHHHHHHCCCCCCCEEEEEEEECCCCHHHHHHHHHHHCCCCCHHH
CCRLGNACGAIVVSRHGCAPAIPTAEELAWFLENGSPHLALRKDAALEHIHWTTTRSPRE
HHHHCCCCCEEEEECCCCCCCCCCHHHHHHHHHCCCCCEEEECCCHHHHEEEECCCCCCC
RDLAVFAIDHRSQLEAISREAAAPLEKISEFKSLALSALQMMDVSDVDLGILLDGRYGSR
CCEEEEEECCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCC
ALATASGLPIWVGRAVELPGSRPLDFDSQSTVTGEMLEWPRSQVAKCLCFYHPDDPSEIK
HHHHCCCCCEEECCEEECCCCCCCCCCCCCCCCHHHHHCCHHHHHHEEEEECCCCHHHHH
NIQERRIQEAFKASRETQREFLLEIICSQNGALADTTVSSAMRRIYELGVFPDWWKLEAT
HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEC
ASAQSWVETSRVIDEMDPRCRGILLLGLAAPAEDVVACFKVAAAFPLIKGFAVGRTIFQE
CCHHHHHHHHHHHHHCCCCCCEEEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
PARQWFKGEIDDGEAVFALRRNFSQLIDGWRQSRQAP
HHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCC
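
The secondary structure blocks alternate a line of residues with a line of predicted states. The sketch below joins such alternating lines and summarizes the state composition. It assumes the usual three-state convention (H = helix, E = strand, C = coil), which the record does not spell out, and the helper names are illustrative.

```python
from collections import Counter


def join_structure(lines: list) -> tuple:
    """Split alternating residue/state lines into two joined strings."""
    residues = "".join(line.strip() for line in lines[0::2])
    states = "".join(line.strip() for line in lines[1::2])
    if len(residues) != len(states):
        raise ValueError("residue and state lines differ in length")
    return residues, states


def state_composition(states: str) -> dict:
    """Percentage of each predicted state, rounded to one decimal place."""
    counts = Counter(states)
    return {s: round(100.0 * c / len(states), 1) for s, c in counts.items()}


# Toy input in the same alternating layout (not taken from the record).
residues, states = join_structure(["MTDL", "CCHH", "RRTL", "HEEC"])
print(residues, states)
print(state_composition(states))
```

A mix of H and E states in the full prediction would be in line with the "Alpha Beta" structure class given below.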

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 11058132 [H]