Definition Mesorhizobium sp. BNC1, complete genome.
Accession NC_008254
Length 4,412,446


The map label for this gene is zwf [H]

Identifier: 110632521

GI number: 110632521

Start: 194059

End: 195531

Strand: Reverse

Name: zwf [H]

Synonym: Meso_0159

Alternate gene names: 110632521

Gene position: 195531-194059 (Counterclockwise)

Preceding gene: 110632522

Following gene: 110632520

Centisome position: 4.43
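
The coordinate fields above are related by simple arithmetic, sketched here in Python. The function names are illustrative and not part of the record. One assumption is labeled in the code: the centisome value appears to be taken from the first coordinate of the gene position (195531, the gene's start on the reverse strand), since that reproduces the reported 4.43, whereas 194059 would give 4.40.

```python
GENOME_LENGTH = 4_412_446  # "Length" field of NC_008254


def gene_length(start: int, end: int) -> int:
    """Number of bases spanned by inclusive coordinates start..end."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int = GENOME_LENGTH) -> float:
    """Position expressed as a percentage of the genome length.

    Assumption: the record uses the gene's first coordinate on its own
    strand (195531 for this reverse-strand gene).
    """
    return round(100.0 * position / genome_length, 2)


length = gene_length(194059, 195531)  # 1473, matching ">1473_bases"
position = centisome(195531)          # 4.43, matching the record
```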

GC content: 59.67

Gene sequence:

>1473_bases
ATGACGAGCCAAACGATACCGGTAGAGCCTTTTGATCTCGTGGTCTTCGGGGCAACAGGTGACCTTTGCGAACGAAAGCT
GCTGCCCGCCCTCTACCAGCGCCAGCGTGCCGGCCAATTCAGCGAGCCGACACGCATTATCGGCTCTTCCCGCTCAGAGT
TGAGCGACGACCAATACCGCTCTTTCGCACTGGATGCGATCAAAGCGCATGTGGCCAAGGCCGAAATCGACAATCGTGAG
CTCGACCGGTTTCTTACCCGAGTTTCCTACGTGTCGGCGGACGCCACCACGGGCCGAGGCTTTAAAGAGCTGAAGGCGGC
AATCGGAGATAGCGCGAATGTCCGCGCATTCTATCTCGCCGTTGCTCCTTCCCTGTTCGGCAGCATCGTCGCCCAAATCG
ATGCTCACCATATCGCGACGCCGACCTCGCGCGTCATCGTTGAAAAGCCGATCGGACGAAGCCTGGAAACTGCGCGGGAG
GTCAACGATTCGATAGGCCGGGTTTTCGATGAGGCCCGGATTTTCCGCATAGACCATTATCTCGGCAAGGAGACGGTCCA
GAACCTGATGGCGCTGCGCTTTGCCAACGCGCTTTATGAGCCGCTCTGGAATTCGGCCCATGTGGATCACGTACAGATCA
CCGTCGCCGAATCCGTCGGCCTGGAAGGCAGAGCCGGCTACTACGACAAAGCCGGCGCCCTGCGCGATATGGTGCAGAAC
CATATGCTGCAGCTCCTCTGCCTCGTCGCTATGGAAGTGCCTGCCTCTCTGGATGCCGATGCCGTCCGCGACGAAAAGCT
GAAAGTGCTGCGTGCCTTGAGGCCGATCAATCAGCTCAATTCGGAGCGCGTGACGGTGCGCGGGCAATATCGGGCAGGTG
CTTCGAATGGCGGCGCGGTGAAAGGCTATTTGGAAGAGCTTGGGAATGGCGGGAGCGACACGGAAACCTTTGTCGCAGTG
AAGGCGGAGATCGGCAATTGGCGCTGGGCAGACGTGCCCTTTTATCTGCGCACCGGCAAACGCATGGCCACGCGCGTTTC
CGAAATCGTCATCCAGTTCAAGCCCATTCCCCACTCGATCTTCGGCGATGCGGCAGGCCGAATCTTTGCCAACCAGCTCG
TCATTCGCCTGCAGCCGGATGAAGGTGTCAAACAGTGGATCATGATCAAGGATCCCGGTCCGGGCGGCATGCGGCTGCGG
CAGATCCCGCTGGACATGAGCTTTGCGGAATCCTTTCACGAGCGCAATCCGGATGCCTATGAGAGGCTGATCATGGATGT
AATCCGCGGGAACCAGACACTCTTCATGCGCCGCGACGAGGTGGAAGCTGCCTGGCGGTGGATCGACCCAATCCATGCGG
CATGGGAGGAGCGTAGCCAGTCGGTGCAGCCCTATACCGCAGGCACTTGGGGCCCCTCAGCGTCGATCGCCCTCATCGAA
CGTGACGGTCGCACCTGGCATGAGAGCTTCTGA
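
A GC content figure such as the one reported for this gene is normally the percentage of G and C bases in the sequence. The following is a minimal Python sketch under that assumption; `gc_percent` is an illustrative name, and the two-decimal rounding is chosen only to match the precision shown in the record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and line breaks."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return round(100.0 * gc_count / len(seq), 2)
```

To use it, paste the FASTA body (the lines below the `>1473_bases` header) into a string and pass it to `gc_percent`.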

Upstream 100 bases:

>100_bases
GCCGTCTTATCGCGCCCCGGCAAAAAGCCGGCGGCAGGGCTCTGGCCTTATTAAATCGATTAGACTATGTAGCGACCATA
ACAAAGCGGACGCAAAGCCA

Downstream 100 bases:

>100_bases
TGGCCAAGGCAGTCTGGCATGAATTCGCCGCGAGCGAGGCTCTCGCGGAAGCCCTCGCCGAAGCGGTGGCGCAGCACCTT
GCGGCGGCTCTGGAAAAGCG

Product: glucose-6-phosphate 1-dehydrogenase

Products: NA

Alternate protein names: G6PD [H]

Number of amino acids: Translated: 490; Mature: 489

Protein sequence:

>490_residues
MTSQTIPVEPFDLVVFGATGDLCERKLLPALYQRQRAGQFSEPTRIIGSSRSELSDDQYRSFALDAIKAHVAKAEIDNRE
LDRFLTRVSYVSADATTGRGFKELKAAIGDSANVRAFYLAVAPSLFGSIVAQIDAHHIATPTSRVIVEKPIGRSLETARE
VNDSIGRVFDEARIFRIDHYLGKETVQNLMALRFANALYEPLWNSAHVDHVQITVAESVGLEGRAGYYDKAGALRDMVQN
HMLQLLCLVAMEVPASLDADAVRDEKLKVLRALRPINQLNSERVTVRGQYRAGASNGGAVKGYLEELGNGGSDTETFVAV
KAEIGNWRWADVPFYLRTGKRMATRVSEIVIQFKPIPHSIFGDAAGRIFANQLVIRLQPDEGVKQWIMIKDPGPGGMRLR
QIPLDMSFAESFHERNPDAYERLIMDVIRGNQTLFMRRDEVEAAWRWIDPIHAAWEERSQSVQPYTAGTWGPSASIALIE
RDGRTWHESF
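
The 1473-base gene corresponds to 491 codons, that is, 490 residues plus one stop codon. The sketch below shows how the protein sequence can be derived from the gene sequence with the standard codon assignments. It is an illustration, not the pipeline used to build this record, and `translate` is a hypothetical helper name.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First ten codons of the gene sequence listed in this record
print(translate("ATGACGAGCCAAACGATACCGGTAGAGCCT"))  # MTSQTIPVEP
```

The mature form listed in this record is the translated sequence without the initiator methionine, i.e. `translate(gene)[1:]`.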

Sequences:

>Translated_490_residues
MTSQTIPVEPFDLVVFGATGDLCERKLLPALYQRQRAGQFSEPTRIIGSSRSELSDDQYRSFALDAIKAHVAKAEIDNRE
LDRFLTRVSYVSADATTGRGFKELKAAIGDSANVRAFYLAVAPSLFGSIVAQIDAHHIATPTSRVIVEKPIGRSLETARE
VNDSIGRVFDEARIFRIDHYLGKETVQNLMALRFANALYEPLWNSAHVDHVQITVAESVGLEGRAGYYDKAGALRDMVQN
HMLQLLCLVAMEVPASLDADAVRDEKLKVLRALRPINQLNSERVTVRGQYRAGASNGGAVKGYLEELGNGGSDTETFVAV
KAEIGNWRWADVPFYLRTGKRMATRVSEIVIQFKPIPHSIFGDAAGRIFANQLVIRLQPDEGVKQWIMIKDPGPGGMRLR
QIPLDMSFAESFHERNPDAYERLIMDVIRGNQTLFMRRDEVEAAWRWIDPIHAAWEERSQSVQPYTAGTWGPSASIALIE
RDGRTWHESF
>Mature_489_residues
TSQTIPVEPFDLVVFGATGDLCERKLLPALYQRQRAGQFSEPTRIIGSSRSELSDDQYRSFALDAIKAHVAKAEIDNREL
DRFLTRVSYVSADATTGRGFKELKAAIGDSANVRAFYLAVAPSLFGSIVAQIDAHHIATPTSRVIVEKPIGRSLETAREV
NDSIGRVFDEARIFRIDHYLGKETVQNLMALRFANALYEPLWNSAHVDHVQITVAESVGLEGRAGYYDKAGALRDMVQNH
MLQLLCLVAMEVPASLDADAVRDEKLKVLRALRPINQLNSERVTVRGQYRAGASNGGAVKGYLEELGNGGSDTETFVAVK
AEIGNWRWADVPFYLRTGKRMATRVSEIVIQFKPIPHSIFGDAAGRIFANQLVIRLQPDEGVKQWIMIKDPGPGGMRLRQ
IPLDMSFAESFHERNPDAYERLIMDVIRGNQTLFMRRDEVEAAWRWIDPIHAAWEERSQSVQPYTAGTWGPSASIALIER
DGRTWHESF

Specific function: Pentose phosphate pathway; first step. [C]

COG id: COG0364

COG function: function code G; Glucose-6-phosphate 1-dehydrogenase

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the glucose-6-phosphate dehydrogenase family [H]

Homologues:

Organism=Homo sapiens, GI109389365, Length=487, Percent_Identity=38.3983572895277, Blast_Score=319, Evalue=3e-87
Organism=Homo sapiens, GI108773793, Length=487, Percent_Identity=38.3983572895277, Blast_Score=319, Evalue=3e-87
Organism=Homo sapiens, GI52145310, Length=492, Percent_Identity=28.4552845528455, Blast_Score=173, Evalue=3e-43
Organism=Escherichia coli, GI1788158, Length=487, Percent_Identity=48.6652977412731, Blast_Score=484, Evalue=1e-138
Organism=Caenorhabditis elegans, GI17538218, Length=489, Percent_Identity=37.6278118609407, Blast_Score=322, Evalue=3e-88
Organism=Saccharomyces cerevisiae, GI6324088, Length=476, Percent_Identity=38.0252100840336, Blast_Score=299, Evalue=7e-82
Organism=Drosophila melanogaster, GI24643352, Length=490, Percent_Identity=35.9183673469388, Blast_Score=294, Evalue=8e-80
Organism=Drosophila melanogaster, GI24643350, Length=490, Percent_Identity=35.9183673469388, Blast_Score=294, Evalue=9e-80
Organism=Drosophila melanogaster, GI221513548, Length=501, Percent_Identity=31.5369261477046, Blast_Score=213, Evalue=2e-55
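
The homologue lines above follow a loose comma-separated `key=value` layout in which the GI number appears as a bare `GI…` token. A small Python sketch for reading one such line into a dictionary follows. `parse_homologue` is an illustrative helper, and it assumes organism names contain no commas, which holds for the entries listed here. An optional trailing comma is tolerated.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict with typed values."""
    record = {}
    for part in line.strip().rstrip(",").split(","):
        part = part.strip()
        if not part:
            continue
        key, sep, value = part.partition("=")
        if sep:
            record[key] = value
        elif part.startswith("GI"):
            # Bare token such as "GI1788158": keep only the digits
            record["GI"] = part[2:]
    for key in ("Length", "Blast_Score"):
        if key in record:
            record[key] = int(record[key])
    for key in ("Percent_Identity", "Evalue"):
        if key in record:
            record[key] = float(record[key])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1788158, Length=487, "
    "Percent_Identity=48.6652977412731, Blast_Score=484, Evalue=1e-138"
)
print(hit["Organism"], hit["Blast_Score"])  # Escherichia coli 484
```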

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001282
- InterPro:   IPR019796
- InterPro:   IPR022675
- InterPro:   IPR022674
- InterPro:   IPR016040 [H]

Pfam domain/function: PF02781 G6PD_C; PF00479 G6PD_N [H]

EC number: 1.1.1.49 [H]

Molecular weight: Translated: 54814; Mature: 54683

Theoretical pI: Translated: 6.75; Mature: 6.75

Prosite motif: PS00069 G6P_DEHYDROGENASE

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.4 %Cys     (Translated Protein)
2.2 %Met     (Translated Protein)
2.7 %Cys+Met (Translated Protein)
0.4 %Cys     (Mature Protein)
2.0 %Met     (Mature Protein)
2.5 %Cys+Met (Mature Protein)
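
The Cys/Met percentages above are residue counts divided by sequence length. A minimal Python sketch of that calculation follows. `residue_percent` is an illustrative name, and rounding to one decimal is assumed only because the record reports one decimal.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    count = sum(1 for aa in seq if aa in residues)
    return round(100.0 * count / len(seq), 1)


# Toy 10-residue example; the mature form drops the leading Met
translated = "MCAAAAAAAA"
mature = translated[1:]
print(residue_percent(translated, "C"))   # 10.0
print(residue_percent(translated, "CM"))  # 20.0
print(residue_percent(mature, "M"))       # 0.0
```

Because the mature protein lacks the initiator methionine, its Met and Cys+Met percentages are slightly lower than those of the translated protein, as in the values reported above.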

Secondary structure:

>Translated Secondary Structure
MTSQTIPVEPFDLVVFGATGDLCERKLLPALYQRQRAGQFSEPTRIIGSSRSELSDDQYR
CCCCCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCHHHHCCCHHHCCHHHHH
SFALDAIKAHVAKAEIDNRELDRFLTRVSYVSADATTGRGFKELKAAIGDSANVRAFYLA
HHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHCCCCCCEEEEEH
VAPSLFGSIVAQIDAHHIATPTSRVIVEKPIGRSLETAREVNDSIGRVFDEARIFRIDHY
HHHHHHHHHHHHHHHHHCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHHHHHHHEEHHHH
LGKETVQNLMALRFANALYEPLWNSAHVDHVQITVAESVGLEGRAGYYDKAGALRDMVQN
HCHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEHHCCCCCCCCCCHHHHHHHHHHHH
HMLQLLCLVAMEVPASLDADAVRDEKLKVLRALRPINQLNSERVTVRGQYRAGASNGGAV
HHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCCCCHH
KGYLEELGNGGSDTETFVAVKAEIGNWRWADVPFYLRTGKRMATRVSEIVIQFKPIPHSI
HHHHHHHCCCCCCCCEEEEEEEECCCEEECCCCHHHHCCHHHHHHHHHHHEEECCCCHHH
FGDAAGRIFANQLVIRLQPDEGVKQWIMIKDPGPGGMRLRQIPLDMSFAESFHERNPDAY
HHHHHHHEEEEEEEEEECCCCCCEEEEEEECCCCCCCEEEECCCCHHHHHHHHHCCCHHH
ERLIMDVIRGNQTLFMRRDEVEAAWRWIDPIHAAWEERSQSVQPYTAGTWGPSASIALIE
HHHHHHHHCCCCEEEEECHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCEEEEEE
RDGRTWHESF
CCCCCCCCCC
>Mature Secondary Structure 
TSQTIPVEPFDLVVFGATGDLCERKLLPALYQRQRAGQFSEPTRIIGSSRSELSDDQYR
CCCCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCHHHHCCCHHHCCHHHHH
SFALDAIKAHVAKAEIDNRELDRFLTRVSYVSADATTGRGFKELKAAIGDSANVRAFYLA
HHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHCCCCCCEEEEEH
VAPSLFGSIVAQIDAHHIATPTSRVIVEKPIGRSLETAREVNDSIGRVFDEARIFRIDHY
HHHHHHHHHHHHHHHHHCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHHHHHHHEEHHHH
LGKETVQNLMALRFANALYEPLWNSAHVDHVQITVAESVGLEGRAGYYDKAGALRDMVQN
HCHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEHHCCCCCCCCCCHHHHHHHHHHHH
HMLQLLCLVAMEVPASLDADAVRDEKLKVLRALRPINQLNSERVTVRGQYRAGASNGGAV
HHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCCCCHH
KGYLEELGNGGSDTETFVAVKAEIGNWRWADVPFYLRTGKRMATRVSEIVIQFKPIPHSI
HHHHHHHCCCCCCCCEEEEEEEECCCEEECCCCHHHHCCHHHHHHHHHHHEEECCCCHHH
FGDAAGRIFANQLVIRLQPDEGVKQWIMIKDPGPGGMRLRQIPLDMSFAESFHERNPDAY
HHHHHHHEEEEEEEEEECCCCCCEEEEEEECCCCCCCEEEECCCCHHHHHHHHHCCCHHH
ERLIMDVIRGNQTLFMRRDEVEAAWRWIDPIHAAWEERSQSVQPYTAGTWGPSASIALIE
HHHHHHHHCCCCEEEEECHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCEEEEEE
RDGRTWHESF
CCCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 10400573; 11481430 [H]