Definition Escherichia coli HS, complete genome.
Accession NC_009800
Length 4,643,538

The map label for this gene is gsk

Identifier: 157160004

GI number: 157160004

Start: 566190

End: 567494

Strand: Direct

Name: gsk

Synonym: EcHS_A0554

Alternate gene names: 157160004

Gene position: 566190-567494 (Clockwise)

Preceding gene: 157160002

Following gene: 157160007

Centisome position: 12.19
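
The coordinate fields above are related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a centisome position taken as the start coordinate expressed as a percentage of the genome length. The record does not state either convention; they are assumptions that happen to reproduce the listed values. The function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 4_643_538
GENE_START = 566_190
GENE_END = 567_494


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the genome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


print(gene_length(GENE_START, GENE_END))     # 1305
print(centisome(GENE_START, GENOME_LENGTH))  # 12.19
```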

GC content: 52.03

Gene sequence:

>1305_bases
ATGAAATTTCCCGGTAAACGTAAATCCAAACATTACTTCCCTGTAAATGCACGCGATCCGCTGCTTCAGCAGTTCCAGCC
AGAAAACGAAACCAGCGCCGCCTGGGTAGTGGGTATCGATCAAACGCTGGTCGATATTGAAGCGAAAGTGGATGACGAAT
TCATTGAGCGTTATGGATTAAGCGCCGGGCATTCACTGGTGATTGAGGATGACGTAGCCGAAGCGCTTTATCAGGAACTA
AAACAGAAAAACCTGATTACCCATCAGTTTGCGGGTGGCACTATTGGTAACACCATGCACAACTACTCGGTGCTCGCGGA
CGACCGTTCGGTGCTGCTGGGCGTCATGTGCAGCAATATTGAAATTGGCAGCTATGCCTATCGTTACCTGTGTAACACCT
CCAGCCGTACCGATCTTAACTATCTACAAGGCGTGGATGGTCCGATTGGTCGTTGCTTTACGCTGATTGGCGAGTCCGGG
GAACGTACCTTTGCTATCAGCCCTGGCCACATGAACCAGCTGCGGGCTGAAAGTATTCCGGAAGATGTGATTGCCGGAGC
CTCGGCACTGGTTCTCACCTCTTATCTGGTGCGTTGCAAGCCGGGTGAACCCATGCCGGAAGCAACCATGAAAGCCATTG
AGTACGCGAAGAAATATAACGTACCGGTGGTGCTGACGCTGGGAACTAAGTTTGTCATTGCCGAGAATCCGCAGTGGTGG
CAGCAATTCCTCAAAGACCACGTCTCTATCCTTGCGATGAACGAAGATGAAGCCGAAGCGTTGACCGGAGAAAGCGATCC
GTTGTTGGCATCTGACAAGGCGCTGGACTGGGTAGATCTGGTGCTGTGCACCGCCGGGCCAATCGGCTTGTATATGGCGG
GCTTTACCGAAGACGAAGCGAAACGTAAAACCCAGCATCCGCTGCTGCCGGGCGCTATAGCGGAATTCAACCAGTATGAG
TTTAGCCGCGCCATGCGCCACAAGGATTGCCAGAATCCGCTGCGTGTATATTCGCACATTGCGCCGTACATGGGCGGGCC
GGAAAAAATCATGAACACTAATGGAGCGGGGGATGGCGCATTGGCAGCGTTGCTGCATGACATTACCGCCAACAGCTACC
ATCGTAGCAACGTACCAAACTCCAGCAAACATAAATTCACCTGGTTAACTTATTCATCGTTAGCGCAGGTGTGTAAATAT
GCTAACCGTGTGAGCTATCAGGTACTGAACCAGCATTCACCTCGTTTAACGCGCGGCTTGCCGGAGCGTGAAGACAGCCT
GGAAGAGTCTTACTGGGATCGTTAA
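
The GC content reported above is presumably the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows; the function name is hypothetical, and the record does not say how ambiguous bases, if any, were handled, so this version simply counts G and C over the total length.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence, rounded to 2 places."""
    # Drop whitespace and line breaks so a multi-line FASTA body can be pasted in.
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# Small illustration; paste the 1305-base sequence above to check the record.
print(gc_percent("ATGC"))  # 50.0
```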

Upstream 100 bases:

>100_bases
TGCGATCCCGCCTGCTGATATTGAAACTGGCTGCGTCTCGCGCGCTCCCGTCAGATTGTGTTAACATTCGCCGCTCAGTT
AACCACCCGTAAAAACAACC

Downstream 100 bases:

>100_bases
GTTATCGTCGGTTCGTAGGCCAGATAAGGCGTTCACGCCGCATCTGGCATTTGGCTCTCGATGCCTGATGCGACGCTGGC
GCGTCTTATCATGCCTACAT

Product: inosine-guanosine kinase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 434; Mature: 434

Protein sequence:

>434_residues
MKFPGKRKSKHYFPVNARDPLLQQFQPENETSAAWVVGIDQTLVDIEAKVDDEFIERYGLSAGHSLVIEDDVAEALYQEL
KQKNLITHQFAGGTIGNTMHNYSVLADDRSVLLGVMCSNIEIGSYAYRYLCNTSSRTDLNYLQGVDGPIGRCFTLIGESG
ERTFAISPGHMNQLRAESIPEDVIAGASALVLTSYLVRCKPGEPMPEATMKAIEYAKKYNVPVVLTLGTKFVIAENPQWW
QQFLKDHVSILAMNEDEAEALTGESDPLLASDKALDWVDLVLCTAGPIGLYMAGFTEDEAKRKTQHPLLPGAIAEFNQYE
FSRAMRHKDCQNPLRVYSHIAPYMGGPEKIMNTNGAGDGALAALLHDITANSYHRSNVPNSSKHKFTWLTYSSLAQVCKY
ANRVSYQVLNQHSPRLTRGLPEREDSLEESYWDR
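
The protein sequence follows from the gene sequence by codon translation: 1305 bases give 435 codons, of which the last (TAA) is a stop, leaving 434 residues. The sketch below uses the standard codon assignments, which are also those of the bacterial genetic code; the table construction and the function name are illustrative and not taken from the record.

```python
# Standard codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 45 bases of the gene sequence above.
print(translate("ATGAAATTTCCCGGTAAACGTAAATCCAAACATTACTTCCCTGTA"))  # MKFPGKRKSKHYFPV
```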

Sequences:

>Translated_434_residues
MKFPGKRKSKHYFPVNARDPLLQQFQPENETSAAWVVGIDQTLVDIEAKVDDEFIERYGLSAGHSLVIEDDVAEALYQEL
KQKNLITHQFAGGTIGNTMHNYSVLADDRSVLLGVMCSNIEIGSYAYRYLCNTSSRTDLNYLQGVDGPIGRCFTLIGESG
ERTFAISPGHMNQLRAESIPEDVIAGASALVLTSYLVRCKPGEPMPEATMKAIEYAKKYNVPVVLTLGTKFVIAENPQWW
QQFLKDHVSILAMNEDEAEALTGESDPLLASDKALDWVDLVLCTAGPIGLYMAGFTEDEAKRKTQHPLLPGAIAEFNQYE
FSRAMRHKDCQNPLRVYSHIAPYMGGPEKIMNTNGAGDGALAALLHDITANSYHRSNVPNSSKHKFTWLTYSSLAQVCKY
ANRVSYQVLNQHSPRLTRGLPEREDSLEESYWDR
>Mature_434_residues
MKFPGKRKSKHYFPVNARDPLLQQFQPENETSAAWVVGIDQTLVDIEAKVDDEFIERYGLSAGHSLVIEDDVAEALYQEL
KQKNLITHQFAGGTIGNTMHNYSVLADDRSVLLGVMCSNIEIGSYAYRYLCNTSSRTDLNYLQGVDGPIGRCFTLIGESG
ERTFAISPGHMNQLRAESIPEDVIAGASALVLTSYLVRCKPGEPMPEATMKAIEYAKKYNVPVVLTLGTKFVIAENPQWW
QQFLKDHVSILAMNEDEAEALTGESDPLLASDKALDWVDLVLCTAGPIGLYMAGFTEDEAKRKTQHPLLPGAIAEFNQYE
FSRAMRHKDCQNPLRVYSHIAPYMGGPEKIMNTNGAGDGALAALLHDITANSYHRSNVPNSSKHKFTWLTYSSLAQVCKY
ANRVSYQVLNQHSPRLTRGLPEREDSLEESYWDR

Specific function: Unknown

COG id: COG0524

COG function: function code G; Sugar kinases, ribokinase family

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the carbohydrate kinase pfkB family

Homologues:

Organism=Escherichia coli, GI1786684, Length=434, Percent_Identity=100, Blast_Score=912, Evalue=0.0

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): INGK_ECO57 (P0AEW8)

Other databases:

- EMBL:   AE005174
- EMBL:   BA000007
- PIR:   B90695
- PIR:   F85545
- RefSeq:   NP_286218.1
- RefSeq:   NP_308557.1
- ProteinModelPortal:   P0AEW8
- EnsemblBacteria:   EBESCT00000024490
- EnsemblBacteria:   EBESCT00000057356
- GeneID:   914634
- GeneID:   957453
- GenomeReviews:   AE005174_GR
- GenomeReviews:   BA000007_GR
- KEGG:   ece:Z0596
- KEGG:   ecs:ECs0530
- GeneTree:   EBGT00050000011151
- HOGENOM:   HBG298340
- OMA:   IRNTNGA
- ProtClustDB:   PRK15074
- BioCyc:   ECOL83334:ECS0530-MONOMER
- InterPro:   IPR011611
- InterPro:   IPR002173

Pfam domain/function: PF00294 PfkB

EC number: 2.7.1.73

Molecular weight: Translated: 48449; Mature: 48449
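
A molecular weight of this kind is typically estimated by summing average residue masses and adding one water molecule for the free termini. The sketch below illustrates that approach; the record does not say which mass table or method produced its value, so the masses used here (common average residue masses in daltons) are an assumption and the result may differ slightly from the figure above.

```python
# Average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528


def molecular_weight(protein: str) -> float:
    """Estimated average molecular weight of an unmodified peptide chain."""
    seq = "".join(protein.split()).upper()
    return sum(RESIDUE_MASS[residue] for residue in seq) + WATER_MASS


# Sanity check on a dipeptide (glycylglycine, about 132.12 Da).
print(round(molecular_weight("GG"), 2))
```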

Theoretical pI: Translated: 5.59; Mature: 5.59

Prosite motif: PS00583 PFKB_KINASES_1; PS00584 PFKB_KINASES_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.6 %Cys     (Translated Protein)
2.5 %Met     (Translated Protein)
4.1 %Cys+Met (Translated Protein)
1.6 %Cys     (Mature Protein)
2.5 %Met     (Mature Protein)
4.1 %Cys+Met (Mature Protein)
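
The Cys/Met percentages can be reproduced by counting C and M residues in the 434-residue protein sequence listed earlier in this record. A minimal sketch, assuming the percentages are residue counts divided by sequence length and rounded to one decimal place; the function name is illustrative.

```python
# Protein sequence copied from the record (434 residues).
PROTEIN = (
    "MKFPGKRKSKHYFPVNARDPLLQQFQPENETSAAWVVGIDQTLVDIEAKVDDEFIERYGLSAGHSLVIEDDVAEALYQEL"
    "KQKNLITHQFAGGTIGNTMHNYSVLADDRSVLLGVMCSNIEIGSYAYRYLCNTSSRTDLNYLQGVDGPIGRCFTLIGESG"
    "ERTFAISPGHMNQLRAESIPEDVIAGASALVLTSYLVRCKPGEPMPEATMKAIEYAKKYNVPVVLTLGTKFVIAENPQWW"
    "QQFLKDHVSILAMNEDEAEALTGESDPLLASDKALDWVDLVLCTAGPIGLYMAGFTEDEAKRKTQHPLLPGAIAEFNQYE"
    "FSRAMRHKDCQNPLRVYSHIAPYMGGPEKIMNTNGAGDGALAALLHDITANSYHRSNVPNSSKHKFTWLTYSSLAQVCKY"
    "ANRVSYQVLNQHSPRLTRGLPEREDSLEESYWDR"
)


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given residues."""
    count = sum(protein.count(residue) for residue in residues)
    return round(100.0 * count / len(protein), 1)


print(residue_percent(PROTEIN, "C"))   # 1.6
print(residue_percent(PROTEIN, "M"))   # 2.5
print(residue_percent(PROTEIN, "CM"))  # 4.1
```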

Secondary structure:

>Translated Secondary Structure
MKFPGKRKSKHYFPVNARDPLLQQFQPENETSAAWVVGIDQTLVDIEAKVDDEFIERYGL
CCCCCCCCCCCCCCCCCCCHHHHHCCCCCCCCCEEEEECCCCEEEEHHHCCHHHHHHHCC
SAGHSLVIEDDVAEALYQELKQKNLITHQFAGGTIGNTMHNYSVLADDRSVLLGVMCSNI
CCCCEEEEEHHHHHHHHHHHHHCCCCCEECCCCCCCCCHHCCEEEECCHHHHHHHHHCCC
EIGSYAYRYLCNTSSRTDLNYLQGVDGPIGRCFTLIGESGERTFAISPGHMNQLRAESIP
EECCHHHHHEECCCCCCCHHHHHCCCCHHHHHHHHHCCCCCEEEEECCCCHHHHHHHCCC
EDVIAGASALVLTSYLVRCKPGEPMPEATMKAIEYAKKYNVPVVLTLGTKFVIAENPQWW
HHHHHCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCEEEEECCHHHH
QQFLKDHVSILAMNEDEAEALTGESDPLLASDKALDWVDLVLCTAGPIGLYMAGFTEDEA
HHHHHHHHEEEEECCCHHHHHCCCCCCCCCCCCCHHHHHHHHHCCCCHHHEEECCCCHHH
KRKTQHPLLPGAIAEFNQYEFSRAMRHKDCQNPLRVYSHIAPYMGGPEKIMNTNGAGDGA
HHHHCCCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCHHHHHCCCCCCHHH
LAALLHDITANSYHRSNVPNSSKHKFTWLTYSSLAQVCKYANRVSYQVLNQHSPRLTRGL
HHHHHHHHHCCCCCCCCCCCCCCCEEEEEEHHHHHHHHHHHHHHHHHHHHCCCCHHHCCC
PEREDSLEESYWDR
CCHHHHHHHHHCCC
>Mature Secondary Structure
MKFPGKRKSKHYFPVNARDPLLQQFQPENETSAAWVVGIDQTLVDIEAKVDDEFIERYGL
CCCCCCCCCCCCCCCCCCCHHHHHCCCCCCCCCEEEEECCCCEEEEHHHCCHHHHHHHCC
SAGHSLVIEDDVAEALYQELKQKNLITHQFAGGTIGNTMHNYSVLADDRSVLLGVMCSNI
CCCCEEEEEHHHHHHHHHHHHHCCCCCEECCCCCCCCCHHCCEEEECCHHHHHHHHHCCC
EIGSYAYRYLCNTSSRTDLNYLQGVDGPIGRCFTLIGESGERTFAISPGHMNQLRAESIP
EECCHHHHHEECCCCCCCHHHHHCCCCHHHHHHHHHCCCCCEEEEECCCCHHHHHHHCCC
EDVIAGASALVLTSYLVRCKPGEPMPEATMKAIEYAKKYNVPVVLTLGTKFVIAENPQWW
HHHHHCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCEEEEECCHHHH
QQFLKDHVSILAMNEDEAEALTGESDPLLASDKALDWVDLVLCTAGPIGLYMAGFTEDEA
HHHHHHHHEEEEECCCHHHHHCCCCCCCCCCCCCHHHHHHHHHCCCCHHHEEECCCCHHH
KRKTQHPLLPGAIAEFNQYEFSRAMRHKDCQNPLRVYSHIAPYMGGPEKIMNTNGAGDGA
HHHHCCCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCHHHHHCCCCCCHHH
LAALLHDITANSYHRSNVPNSSKHKFTWLTYSSLAQVCKYANRVSYQVLNQHSPRLTRGL
HHHHHHHHHCCCCCCCCCCCCCCCEEEEEEHHHHHHHHHHHHHHHHHHHHCCCCHHHCCC
PEREDSLEESYWDR
CCHHHHHHHHHCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796