Definition Prochlorococcus marinus str. MIT 9301, complete genome.
Accession NC_009091
Length 1,641,879


The map label for this gene is zwf [H]

Identifier: 126696519

GI number: 126696519

Start: 993146

End: 994669

Strand: Reverse

Name: zwf [H]

Synonym: P9301_11811

Alternate gene names: 126696519

Gene position: 994669-993146 (Counterclockwise)
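
The coordinates in this record behave as 1-based, inclusive positions: End - Start + 1 gives the 1524 bases of the gene sequence listed in this record. Because the gene lies on the reverse strand, its coding sequence is the reverse complement of the genome slice from Start to End. A minimal sketch of that bookkeeping follows. The function names and the toy genome string are hypothetical and are not part of the record.

```python
# Translation table for complementing DNA bases (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def reverse_complement(seq: str) -> str:
    """Complement every base, then reverse the string."""
    return seq.translate(_COMPLEMENT)[::-1]


def extract_gene(genome: str, start: int, end: int, strand: str) -> str:
    """Return the coding-strand sequence of a feature.

    `strand` uses the record's wording ("Reverse" or anything else for forward).
    """
    region = genome[start - 1:end]  # convert 1-based inclusive to a Python slice
    return reverse_complement(region) if strand == "Reverse" else region


print(gene_length(993146, 994669))                  # 1524
print(extract_gene("TTGACCATAA", 3, 6, "Reverse"))  # GGTC (toy genome)
```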

Preceding gene: 126696520

Following gene: 126696518

Centisome position: 60.58
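
The centisome value is consistent with the first base of the gene on the reverse strand (994669) expressed as a percentage of the genome length (1,641,879). The sketch below assumes that definition, since the record does not state the formula. The function name is hypothetical.

```python
def centisome(position: int, genome_length: int) -> float:
    """Position on a circular genome as a percentage of its length.

    Assumption: the record uses the gene's first transcribed base as `position`.
    """
    return round(100.0 * position / genome_length, 2)


print(centisome(994669, 1641879))  # 60.58
```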

GC content: 35.37

Gene sequence:

>1524_bases
ATGCCTTCAACTTTAAGTAATCCTCTAAGATTAGGCTTACGGCAGGAAAGAGTCATATCTCCACAATGCTTGGTAATATT
TGGTGCTAGTGGAGACCTTACTCATAGAAAATTAATACCAGCCTTATTTGAACTCTATTTACAAAGAAGAATTCCTAGTG
AATTTGGAATAGTTGGTTGTGCGAGAAGACCTTGGACTGATAATGAGTTTAGAGAAAAGATGAAAGTAAAGCTATCAAAT
AAAATATCTGGTAAAGAAACGGAGTGGGAACAATTTTCTAATTACCTTTTCTATGAACCTGTTGACTTACAACAAAGTGA
TCATGTCGTAAGGCTTTCTAAAAGATTAAATGAAATTGATAAAAAACAAGCTACTCATGGTAATAGAACATTTTATTTAT
CAGTATCTCCGAATTTCTATGCAAGTGGATGTAAAGCTCTTAAAGAAGCTGGCCTTTTAGATGACCCCAAGAAAAGTCGT
TTAGTGATTGAAAAACCTTTTGGAAGAGATTATTCAAGTGCAAAAAAATTGAATAAAATCGTCCAAAGTTGTGCTGAAGA
AAATCAGATTTATAGGATCGATCATTATTTAGGTAAAGAGACAGTTCAAAATATTCTTGTTTTGAGGTTTGCTAACACTA
TTTTTGAACCAATTTGGAACAGAAATTATATATCAAGTGTTCAAATTACTTCATCTGAAACAGTGGGTGTTGAAGATAGG
GCAGGTTATTACGAAAGCTCAGGTGCTTTAAGGGATATGCTTCAAAACCATATGACACAAATGCTTGCTGTTACTGCTAT
GGAACCTCCTGGGAAGTTTGAGCCAGAAGCAATAAGAAATGAAAAAGCTAAGGTTCTTCAAGCTTCAAAACTTGCTGACG
AAAATGAACCGTGGAATTGTTGCATAAGAGGTCAATATGGAGAGGGAGGGAATATTTCAAATCAACTGAAAGGATATAGG
CAGGAAGATGGTGTTAATTCCAATAGCACAACAGAAACTTATATCGCGACAAAAGTTTTCGTTGATAACTGGCGTTGGCA
AGGTGTTCCATTTTATTTGAGAACAGGTAAAAGACTGCCTAAAAGACTTGGAGAAATAGTCTTGACTTTTAAAGACGTTC
CTGTTCATTTATTTGAATCAACAATAATAAATCCTGCCCCAAATCAACTTATCCTTAGAATTCAGCCAAATGAAGGGGCG
ACTTTCAAGTTTGAGGTAAAATCTCCAGGTTCTGGAATGAAATCAAGACCTGTTGAAATGGAATTTTCTTATGATGAATC
ATTTGGAGAACCATCAGATGAAGGCTATGTAAGATTATTAGCGGATGCAATGCTTTCTGACCCAACTTTATTTACTCGAA
GCGATGAAGTAGAGGCAGCCTGGAAACTTTATACCCCATTAATAGAATTGATGGATAATTCTCCTTGGAAGTTACCTATT
TATAATTATGAATCCATGACGTGGGGACCTCCTGAATCTGATCAATTACTCTCAAAAGATAATATTTTCTGGCGTAGACC
TTAA
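
The GC content field above is the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation, using a hypothetical function name and only the first ten bases of the gene as input:

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace and line breaks are ignored."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# First ten bases of the gene: 4 of 10 are G or C.
print(gc_content("ATGCCTTCAA"))  # 40.0
```

Applied to the full 1524-base sequence, the result can be compared against the reported value of 35.37.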

Upstream 100 bases:

>100_bases
TTTAGATTTAAGTTTAACCTTCTAAATACTTTTTGGTTTAGACACAATATGTAGCTAATATTTGAAAAAAAGAGGATTTT
AAATAAAGAATAATAAAAAT

Downstream 100 bases:

>100_bases
AAATGAAACCTCAACTTACACTCCAAACCCCGCTAGAGCTGCCTTATGAGGAGATTTCTAATTACCTTAATAAATTATGG
GTTTCAGAAGATAGTGATAA

Product: glucose-6-phosphate 1-dehydrogenase

Products: NA

Alternate protein names: G6PD [H]

Number of amino acids: Translated: 507; Mature: 506
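
These counts follow from the 1524-base gene: it holds 508 codons, the last of which is the stop codon TAA, and the mature sequence listed in this record lacks the initiator methionine. A small sketch of the arithmetic, with hypothetical function names:

```python
def translated_length(n_bases: int) -> int:
    """Residues in the translated protein: all codons except the stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return n_bases // 3 - 1


def mature_length(n_translated: int) -> int:
    """Residues left after removal of the initiator methionine.

    Assumption: this is the only processing step, as the mature sequence
    in this record suggests.
    """
    return n_translated - 1


translated = translated_length(1524)
print(translated, mature_length(translated))  # 507 506
```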

Protein sequence:

>507_residues
MPSTLSNPLRLGLRQERVISPQCLVIFGASGDLTHRKLIPALFELYLQRRIPSEFGIVGCARRPWTDNEFREKMKVKLSN
KISGKETEWEQFSNYLFYEPVDLQQSDHVVRLSKRLNEIDKKQATHGNRTFYLSVSPNFYASGCKALKEAGLLDDPKKSR
LVIEKPFGRDYSSAKKLNKIVQSCAEENQIYRIDHYLGKETVQNILVLRFANTIFEPIWNRNYISSVQITSSETVGVEDR
AGYYESSGALRDMLQNHMTQMLAVTAMEPPGKFEPEAIRNEKAKVLQASKLADENEPWNCCIRGQYGEGGNISNQLKGYR
QEDGVNSNSTTETYIATKVFVDNWRWQGVPFYLRTGKRLPKRLGEIVLTFKDVPVHLFESTIINPAPNQLILRIQPNEGA
TFKFEVKSPGSGMKSRPVEMEFSYDESFGEPSDEGYVRLLADAMLSDPTLFTRSDEVEAAWKLYTPLIELMDNSPWKLPI
YNYESMTWGPPESDQLLSKDNIFWRRP

Sequences:

>Translated_507_residues
MPSTLSNPLRLGLRQERVISPQCLVIFGASGDLTHRKLIPALFELYLQRRIPSEFGIVGCARRPWTDNEFREKMKVKLSN
KISGKETEWEQFSNYLFYEPVDLQQSDHVVRLSKRLNEIDKKQATHGNRTFYLSVSPNFYASGCKALKEAGLLDDPKKSR
LVIEKPFGRDYSSAKKLNKIVQSCAEENQIYRIDHYLGKETVQNILVLRFANTIFEPIWNRNYISSVQITSSETVGVEDR
AGYYESSGALRDMLQNHMTQMLAVTAMEPPGKFEPEAIRNEKAKVLQASKLADENEPWNCCIRGQYGEGGNISNQLKGYR
QEDGVNSNSTTETYIATKVFVDNWRWQGVPFYLRTGKRLPKRLGEIVLTFKDVPVHLFESTIINPAPNQLILRIQPNEGA
TFKFEVKSPGSGMKSRPVEMEFSYDESFGEPSDEGYVRLLADAMLSDPTLFTRSDEVEAAWKLYTPLIELMDNSPWKLPI
YNYESMTWGPPESDQLLSKDNIFWRRP
>Mature_506_residues
PSTLSNPLRLGLRQERVISPQCLVIFGASGDLTHRKLIPALFELYLQRRIPSEFGIVGCARRPWTDNEFREKMKVKLSNK
ISGKETEWEQFSNYLFYEPVDLQQSDHVVRLSKRLNEIDKKQATHGNRTFYLSVSPNFYASGCKALKEAGLLDDPKKSRL
VIEKPFGRDYSSAKKLNKIVQSCAEENQIYRIDHYLGKETVQNILVLRFANTIFEPIWNRNYISSVQITSSETVGVEDRA
GYYESSGALRDMLQNHMTQMLAVTAMEPPGKFEPEAIRNEKAKVLQASKLADENEPWNCCIRGQYGEGGNISNQLKGYRQ
EDGVNSNSTTETYIATKVFVDNWRWQGVPFYLRTGKRLPKRLGEIVLTFKDVPVHLFESTIINPAPNQLILRIQPNEGAT
FKFEVKSPGSGMKSRPVEMEFSYDESFGEPSDEGYVRLLADAMLSDPTLFTRSDEVEAAWKLYTPLIELMDNSPWKLPIY
NYESMTWGPPESDQLLSKDNIFWRRP

Specific function: Pentose phosphate pathway; first step. [C]

COG id: COG0364

COG function: function code G; Glucose-6-phosphate 1-dehydrogenase

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the glucose-6-phosphate dehydrogenase family [H]

Homologues:

Organism=Homo sapiens, GI109389365, Length=495, Percent_Identity=35.5555555555556, Blast_Score=289, Evalue=5e-78
Organism=Homo sapiens, GI108773793, Length=495, Percent_Identity=35.5555555555556, Blast_Score=288, Evalue=6e-78
Organism=Homo sapiens, GI52145310, Length=491, Percent_Identity=27.9022403258656, Blast_Score=169, Evalue=6e-42
Organism=Escherichia coli, GI1788158, Length=490, Percent_Identity=38.5714285714286, Blast_Score=336, Evalue=2e-93
Organism=Caenorhabditis elegans, GI17538218, Length=484, Percent_Identity=34.297520661157, Blast_Score=300, Evalue=1e-81
Organism=Saccharomyces cerevisiae, GI6324088, Length=470, Percent_Identity=34.468085106383, Blast_Score=266, Evalue=8e-72
Organism=Drosophila melanogaster, GI24643350, Length=489, Percent_Identity=34.1513292433538, Blast_Score=281, Evalue=1e-75
Organism=Drosophila melanogaster, GI24643352, Length=489, Percent_Identity=34.1513292433538, Blast_Score=280, Evalue=1e-75
Organism=Drosophila melanogaster, GI221513548, Length=465, Percent_Identity=28.1720430107527, Blast_Score=190, Evalue=2e-48
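
Each homologue line is a comma-separated list of key=value fields plus a bare GI token. The sketch below shows one way such a line could be turned into a dictionary. The function name and the choice of numeric conversions are assumptions made for illustration and are not part of the record.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict; a trailing comma is tolerated."""
    record = {}
    for token in line.strip().rstrip(",").split(","):
        token = token.strip()
        if not token:
            continue
        if "=" in token:
            key, value = token.split("=", 1)
            record[key] = value
        elif token.startswith("GI"):
            record["GI"] = token[2:]  # keep the GI number as a string
    # Convert the numeric fields that are present.
    casts = {"Length": int, "Percent_Identity": float,
             "Blast_Score": int, "Evalue": float}
    for key, cast in casts.items():
        if key in record:
            record[key] = cast(record[key])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1788158, Length=490, "
    "Percent_Identity=38.5714285714286, Blast_Score=336, Evalue=2e-93,"
)
print(hit["Organism"], hit["GI"], hit["Blast_Score"])  # Escherichia coli 1788158 336
```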

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001282
- InterPro:   IPR019796
- InterPro:   IPR022675
- InterPro:   IPR022674
- InterPro:   IPR016040 [H]

Pfam domain/function: PF02781 G6PD_C; PF00479 G6PD_N [H]

EC number: 1.1.1.49 [H]

Molecular weight: Translated: 58122; Mature: 57990

Theoretical pI: Translated: 7.46; Mature: 7.46

Prosite motif: PS00069 G6P_DEHYDROGENASE

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.2 %Cys     (Translated Protein)
2.2 %Met     (Translated Protein)
3.4 %Cys+Met (Translated Protein)
1.2 %Cys     (Mature Protein)
2.0 %Met     (Mature Protein)
3.2 %Cys+Met (Mature Protein)
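
The percentages above are residue counts divided by protein length. The sketch below recomputes them from the translated sequence given in this record, assuming the mature protein is the translated one without its initiator methionine. The function and variable names are hypothetical.

```python
# Translated protein sequence as listed in this record (507 residues).
TRANSLATED = (
    "MPSTLSNPLRLGLRQERVISPQCLVIFGASGDLTHRKLIPALFELYLQRRIPSEFGIVGCARRPWTDNEFREKMKVKLSN"
    "KISGKETEWEQFSNYLFYEPVDLQQSDHVVRLSKRLNEIDKKQATHGNRTFYLSVSPNFYASGCKALKEAGLLDDPKKSR"
    "LVIEKPFGRDYSSAKKLNKIVQSCAEENQIYRIDHYLGKETVQNILVLRFANTIFEPIWNRNYISSVQITSSETVGVEDR"
    "AGYYESSGALRDMLQNHMTQMLAVTAMEPPGKFEPEAIRNEKAKVLQASKLADENEPWNCCIRGQYGEGGNISNQLKGYR"
    "QEDGVNSNSTTETYIATKVFVDNWRWQGVPFYLRTGKRLPKRLGEIVLTFKDVPVHLFESTIINPAPNQLILRIQPNEGA"
    "TFKFEVKSPGSGMKSRPVEMEFSYDESFGEPSDEGYVRLLADAMLSDPTLFTRSDEVEAAWKLYTPLIELMDNSPWKLPI"
    "YNYESMTWGPPESDQLLSKDNIFWRRP"
)
MATURE = TRANSLATED[1:]  # initiator methionine removed


def cys_met_percent(protein: str) -> tuple:
    """Return (%Cys, %Met, %Cys+Met), each rounded to one decimal place."""
    n = len(protein)
    n_cys = protein.count("C")
    n_met = protein.count("M")
    return (
        round(100.0 * n_cys / n, 1),
        round(100.0 * n_met / n, 1),
        round(100.0 * (n_cys + n_met) / n, 1),
    )


print(cys_met_percent(TRANSLATED))  # (1.2, 2.2, 3.4)
print(cys_met_percent(MATURE))      # (1.2, 2.0, 3.2)
```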

Secondary structure:

>Translated Secondary Structure
MPSTLSNPLRLGLRQERVISPQCLVIFGASGDLTHRKLIPALFELYLQRRIPSEFGIVGC
CCCCCCCHHHHCCCHHHCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHCCHHHCEEEC
ARRPWTDNEFREKMKVKLSNKISGKETEWEQFSNYLFYEPVDLQQSDHVVRLSKRLNEID
CCCCCCCHHHHHHHHHHHHCCCCCCCCCHHHHCCEEEECCCCCCCCCHHHHHHHHHHHHH
KKQATHGNRTFYLSVSPNFYASGCKALKEAGLLDDPKKSRLVIEKPFGRDYSSAKKLNKI
HHHCCCCCEEEEEEECCCHHHHHHHHHHHCCCCCCCCCCCEEEECCCCCCHHHHHHHHHH
VQSCAEENQIYRIDHYLGKETVQNILVLRFANTIFEPIWNRNYISSVQITSSETVGVEDR
HHHHHCCCCEEEEHHHHCHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCC
AGYYESSGALRDMLQNHMTQMLAVTAMEPPGKFEPEAIRNEKAKVLQASKLADENEPWNC
CCCCCCCHHHHHHHHHHHHHHHHHEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCEE
CIRGQYGEGGNISNQLKGYRQEDGVNSNSTTETYIATKVFVDNWRWQGVPFYLRTGKRLP
EEEECCCCCCCCHHHHCCCHHHCCCCCCCCCEEEEEEEEEEECCEECCCCCEECCCHHHH
KRLGEIVLTFKDVPVHLFESTIINPAPNQLILRIQPNEGATFKFEVKSPGSGMKSRPVEM
HHHHHEEEEECCCCHHHHHHHCCCCCCCEEEEEEECCCCCEEEEEECCCCCCCCCCCEEE
EFSYDESFGEPSDEGYVRLLADAMLSDPTLFTRSDEVEAAWKLYTPLIELMDNSPWKLPI
EECCCCCCCCCCCCCHHHHHHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHCCCCCEEEE
YNYESMTWGPPESDQLLSKDNIFWRRP
ECCCCCCCCCCCCCCHHCCCCEEEECC
>Mature Secondary Structure 
PSTLSNPLRLGLRQERVISPQCLVIFGASGDLTHRKLIPALFELYLQRRIPSEFGIVGC
CCCCCCHHHHCCCHHHCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHCCHHHCEEEC
ARRPWTDNEFREKMKVKLSNKISGKETEWEQFSNYLFYEPVDLQQSDHVVRLSKRLNEID
CCCCCCCHHHHHHHHHHHHCCCCCCCCCHHHHCCEEEECCCCCCCCCHHHHHHHHHHHHH
KKQATHGNRTFYLSVSPNFYASGCKALKEAGLLDDPKKSRLVIEKPFGRDYSSAKKLNKI
HHHCCCCCEEEEEEECCCHHHHHHHHHHHCCCCCCCCCCCEEEECCCCCCHHHHHHHHHH
VQSCAEENQIYRIDHYLGKETVQNILVLRFANTIFEPIWNRNYISSVQITSSETVGVEDR
HHHHHCCCCEEEEHHHHCHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCC
AGYYESSGALRDMLQNHMTQMLAVTAMEPPGKFEPEAIRNEKAKVLQASKLADENEPWNC
CCCCCCCHHHHHHHHHHHHHHHHHEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCEE
CIRGQYGEGGNISNQLKGYRQEDGVNSNSTTETYIATKVFVDNWRWQGVPFYLRTGKRLP
EEEECCCCCCCCHHHHCCCHHHCCCCCCCCCEEEEEEEEEEECCEECCCCCEECCCHHHH
KRLGEIVLTFKDVPVHLFESTIINPAPNQLILRIQPNEGATFKFEVKSPGSGMKSRPVEM
HHHHHEEEEECCCCHHHHHHHCCCCCCCEEEEEEECCCCCEEEEEECCCCCCCCCCCEEE
EFSYDESFGEPSDEGYVRLLADAMLSDPTLFTRSDEVEAAWKLYTPLIELMDNSPWKLPI
EECCCCCCCCCCCCCHHHHHHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHCCCCCEEEE
YNYESMTWGPPESDQLLSKDNIFWRRP
ECCCCCCCCCCCCCCHHCCCCEEEECC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 1643289 [H]