Definition | Bacteroides thetaiotaomicron VPI-5482 chromosome, complete genome. |
---|---|
Accession | NC_004663 |
Length | 6,260,361 |
The map label for this gene is zwf [H]
Identifier: 29346631
GI number: 29346631
Start: 1518212
End: 1519708
Strand: Reverse
Name: zwf [H]
Synonym: BT_1221
Alternate gene names: 29346631
Gene position: 1519708-1518212 (Counterclockwise)
Preceding gene: 29346632
Following gene: 29346630
Centisome position: 24.28
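The coordinate fields above are related by simple arithmetic. The sketch below is a minimal illustration, assuming 1-based inclusive coordinates and assuming that the centisome position is the gene's first base (the End coordinate for a reverse-strand gene) expressed as a percentage of the chromosome length. The record does not state that definition; it is an assumption that happens to reproduce 24.28. The function names are hypothetical.

```python
GENOME_LENGTH = 6_260_361  # chromosome length from the record header


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


def centisome(first_base: int, genome_length: int = GENOME_LENGTH) -> float:
    """Position of the gene's first base as a percentage of the chromosome."""
    return round(100 * first_base / genome_length, 2)


print(gene_length(1518212, 1519708))  # 1497
print(centisome(1519708))             # 24.28
```

Using the Start coordinate (1518212) instead gives 24.25, which is why the sketch assumes the reverse-strand gene is anchored at its End coordinate.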
GC content: 48.9
Gene sequence:
>1497_bases ATGGATAAATTTGCAATGATTATTTTTGGTGCTTCGGGCGACTTGACTAAACGCAAGTTGATGCCCGCATTGTACTCTCT GTATCGGGATAAACGGCTGACGGGGAGTTTTACTGTTCTGGGAATTGGGCGTACTGTCTATTCTGATGAAGATTACCGTT CATATATACTTGGGGAGTTGCAGCAGTTTGTAAAGGCTGAAGAGCAGAATTTGGAGCTGATGTCTTCTTTTGTGTCTCAT TTGTATTATTTGCCGATGGATCCTGCGAAAGTGGAGGGATATTCGCAGTTGCGGGAGCGTTTGGTAGAGCTTACGAAGGA GGTCGATCCGGATAATTTGTTGTTCTATCTGGCTACTCCGCCTTCGCTGTATGGGGTGGTGCCTTTGCATTTGAAGGCGG CCGGGCTGAATACTCCTCATTCTCGTATTATTGTTGAGAAGCCGTTCGGGTATGATCTGGAGTCGGCGCTTGAGTTGAAT AAGATTTACTCTTCTGTGTTTGATGAGCATCAGATTTATCGTATCGATCATTTTCTGGGCAAGGAGACGGCGCAGAATGT GCTTGCTTTCCGTTTCGCCAATGGTATTTTTGAGCCGTTATGGAATCGTAATTATATTGATTATGTGGAGATTACTGCTG TAGAGAATCTGGGTATTGAGCAGCGCGGAGGTTTTTACGAGACGGCGGGGGCTTTGCGGGATATGGTGCAGAATCACCTG ATTCAGCTCGTAGCTCTTACGGCTATGGAGCCGCCGGCGGTATTCAATGCGGATAATTTCCGGAATGAGGTAGTGAAGGT TTATGAGTCGCTGACTCCGCTTACGGAGACGGACTTGAATGAACATATCGTTCGCGGACAATATACGGCTTCGGGCAATA AGAAGGGGTATCGTGAGGAAAAGGGAGTGGCTCCCGACTCGCGTACGGAGACTTATATTGCGATGAAGCTGGGCATCAGT AACTGGCGTTGGAGCGGGGTACCGTTCTACATCCGCACCGGTAAGCAGATGCCGACGAAGGTGACGGAAATTGTCGTTCA CTTCCGCGAGACGCCCCATCAGATGTTCCGCTGTTCCGGCGGTAACTGTCCGAGGGCTAATAAGTTGATCCTTCGTTTGC AACCAAACGAGGGTATTGTGTTGAAGATCGGAATGAAGGTTCCGGGTGCAGGTTTCGAAGTCCGTCAGGTGACGATGGAT TTCAGTTACGCACAGTTGGGCGGCGTGCCGAGCGGTGACGCTTATGCCCGTCTGATTGACGACTGCATTCAGGGAGATCC GACCTTATTTACTCGAAGTGATGCTGTAGAGGCTTCCTGGAACTTCTTTGATCCGGTCTTACGTTATTGGAAAGATAATC CGGACGCACCTTTGTACGGCTATCCGGCAGGTACGTGGGGACCTCTCGAAAGTGAAGCTATGATGCACGAGCATGGGGCA GACTGGACCAATCCGTGTAAGAATTTAACAAATACAGACCAATATTGCGAACTATGA
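The GC content field can be recomputed from a sequence like the one above. This is a minimal sketch, assuming GC content means the percentage of G and C bases over the full gene length, rounded to one decimal; the function name `gc_percent` is hypothetical. Whitespace is removed first because the sequence is printed in space-separated blocks.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence, ignoring whitespace."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100 * gc / len(seq), 1)


# First 30 bases of the gene sequence, used only as a short sample.
print(gc_percent("ATGGATAAAT TTGCAATGAT TATTTTTGGT"))
```

Passing the full 1497-base sequence in place of the sample is the way to check the value reported in the record.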
Upstream 100 bases:
>100_bases GGGCGGTTGGTTGAGATTGGTCGGTTTGTCGGTTGGTTGAGGCTGGTCGGTTTAGGTGCGGTGATGGATGGATAAAAGGT AAAAAGTTAAAAATACGACG
Downstream 100 bases:
>100_bases AACTATCTGTTTTCCCGTCGTCTATGGAGACAGCCCGTTCGTTGATATTCCACCTGGTGGATATCATGAATGCTGAACCG GACAAAACGTTCAACATAGC
Product: glucose-6-phosphate 1-dehydrogenase
Products: NA
Alternate protein names: G6PD [H]
Number of amino acids: Translated: 498; Mature: 498
Protein sequence:
>498_residues MDKFAMIIFGASGDLTKRKLMPALYSLYRDKRLTGSFTVLGIGRTVYSDEDYRSYILGELQQFVKAEEQNLELMSSFVSH LYYLPMDPAKVEGYSQLRERLVELTKEVDPDNLLFYLATPPSLYGVVPLHLKAAGLNTPHSRIIVEKPFGYDLESALELN KIYSSVFDEHQIYRIDHFLGKETAQNVLAFRFANGIFEPLWNRNYIDYVEITAVENLGIEQRGGFYETAGALRDMVQNHL IQLVALTAMEPPAVFNADNFRNEVVKVYESLTPLTETDLNEHIVRGQYTASGNKKGYREEKGVAPDSRTETYIAMKLGIS NWRWSGVPFYIRTGKQMPTKVTEIVVHFRETPHQMFRCSGGNCPRANKLILRLQPNEGIVLKIGMKVPGAGFEVRQVTMD FSYAQLGGVPSGDAYARLIDDCIQGDPTLFTRSDAVEASWNFFDPVLRYWKDNPDAPLYGYPAGTWGPLESEAMMHEHGA DWTNPCKNLTNTDQYCEL
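The residue count follows from the gene length: 1497 bases are 499 codons, and the last codon (TGA) is a stop, leaving 498 amino acids. The sketch below illustrates the translation step, assuming the standard codon assignments (the bacterial code shares them) and stopping at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG x TCAG x TCAG codon order; "*" marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1 up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGATAAATTTGCAATGATTATTTTTGGT"))  # MDKFAMIIFG
print(translate("TATTGCGAACTATGA"))                 # YCEL
```

The two samples are the first 30 and last 15 bases of the gene sequence; their translations match the start and end of the protein sequence.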
Sequences:
>Translated_498_residues MDKFAMIIFGASGDLTKRKLMPALYSLYRDKRLTGSFTVLGIGRTVYSDEDYRSYILGELQQFVKAEEQNLELMSSFVSH LYYLPMDPAKVEGYSQLRERLVELTKEVDPDNLLFYLATPPSLYGVVPLHLKAAGLNTPHSRIIVEKPFGYDLESALELN KIYSSVFDEHQIYRIDHFLGKETAQNVLAFRFANGIFEPLWNRNYIDYVEITAVENLGIEQRGGFYETAGALRDMVQNHL IQLVALTAMEPPAVFNADNFRNEVVKVYESLTPLTETDLNEHIVRGQYTASGNKKGYREEKGVAPDSRTETYIAMKLGIS NWRWSGVPFYIRTGKQMPTKVTEIVVHFRETPHQMFRCSGGNCPRANKLILRLQPNEGIVLKIGMKVPGAGFEVRQVTMD FSYAQLGGVPSGDAYARLIDDCIQGDPTLFTRSDAVEASWNFFDPVLRYWKDNPDAPLYGYPAGTWGPLESEAMMHEHGA DWTNPCKNLTNTDQYCEL
>Mature_498_residues MDKFAMIIFGASGDLTKRKLMPALYSLYRDKRLTGSFTVLGIGRTVYSDEDYRSYILGELQQFVKAEEQNLELMSSFVSH LYYLPMDPAKVEGYSQLRERLVELTKEVDPDNLLFYLATPPSLYGVVPLHLKAAGLNTPHSRIIVEKPFGYDLESALELN KIYSSVFDEHQIYRIDHFLGKETAQNVLAFRFANGIFEPLWNRNYIDYVEITAVENLGIEQRGGFYETAGALRDMVQNHL IQLVALTAMEPPAVFNADNFRNEVVKVYESLTPLTETDLNEHIVRGQYTASGNKKGYREEKGVAPDSRTETYIAMKLGIS NWRWSGVPFYIRTGKQMPTKVTEIVVHFRETPHQMFRCSGGNCPRANKLILRLQPNEGIVLKIGMKVPGAGFEVRQVTMD FSYAQLGGVPSGDAYARLIDDCIQGDPTLFTRSDAVEASWNFFDPVLRYWKDNPDAPLYGYPAGTWGPLESEAMMHEHGA DWTNPCKNLTNTDQYCEL
Specific function: Pentose phosphate pathway; first step. [C]
COG id: COG0364
COG function: function code G; Glucose-6-phosphate 1-dehydrogenase
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the glucose-6-phosphate dehydrogenase family [H]
Homologues:
Organism=Homo sapiens, GI108773793, Length=499, Percent_Identity=36.4729458917836, Blast_Score=292, Evalue=5e-79
Organism=Homo sapiens, GI109389365, Length=499, Percent_Identity=36.4729458917836, Blast_Score=292, Evalue=5e-79
Organism=Homo sapiens, GI52145310, Length=492, Percent_Identity=27.6422764227642, Blast_Score=159, Evalue=5e-39
Organism=Escherichia coli, GI1788158, Length=484, Percent_Identity=40.0826446280992, Blast_Score=372, Evalue=1e-104
Organism=Caenorhabditis elegans, GI17538218, Length=496, Percent_Identity=36.6935483870968, Blast_Score=317, Evalue=9e-87
Organism=Saccharomyces cerevisiae, GI6324088, Length=482, Percent_Identity=36.3070539419087, Blast_Score=293, Evalue=6e-80
Organism=Drosophila melanogaster, GI24643350, Length=491, Percent_Identity=34.4195519348269, Blast_Score=281, Evalue=6e-76
Organism=Drosophila melanogaster, GI24643352, Length=489, Percent_Identity=34.3558282208589, Blast_Score=281, Evalue=7e-76
Organism=Drosophila melanogaster, GI221513548, Length=476, Percent_Identity=30.4621848739496, Blast_Score=209, Evalue=5e-54
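Each homologue record is a run of comma-separated fields that begins with `Organism=`. The sketch below shows one way to turn such a listing into structured records and rank the hits; it is an illustration, not the database's own tooling, and the function name `parse_homologues` is hypothetical. The sample contains two records copied from the listing.

```python
def parse_homologues(text):
    """Split a flat homologue listing into one dict per BLAST hit."""
    converters = {"Length": int, "Percent_Identity": float,
                  "Blast_Score": int, "Evalue": float}
    hits = []
    # Every record starts with "Organism=", so split on that marker.
    for chunk in text.split("Organism=")[1:]:
        fields = [f.strip() for f in chunk.split(",") if f.strip()]
        hit = {"Organism": fields[0], "GI": fields[1][2:]}  # drop "GI" prefix
        for field in fields[2:]:
            key, _, value = field.partition("=")
            hit[key] = converters.get(key, str)(value)
        hits.append(hit)
    return hits


sample = ("Organism=Homo sapiens, GI108773793, Length=499, "
          "Percent_Identity=36.4729458917836, Blast_Score=292, Evalue=5e-79, "
          "Organism=Escherichia coli, GI1788158, Length=484, "
          "Percent_Identity=40.0826446280992, Blast_Score=372, Evalue=1e-104,")

hits = parse_homologues(sample)
best = max(hits, key=lambda h: h["Blast_Score"])
print(best["Organism"], best["Blast_Score"])  # Escherichia coli 372
```

Splitting on the `Organism=` marker rather than on line breaks means the same function handles the records whether they appear on one line or one per line.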
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001282
- InterPro: IPR019796
- InterPro: IPR022675
- InterPro: IPR022674
- InterPro: IPR016040 [H]
Pfam domain/function: PF02781 G6PD_C; PF00479 G6PD_N [H]
EC number: 1.1.1.49 [H]
Molecular weight: Translated: 56527; Mature: 56527
Theoretical pI: Translated: 5.37; Mature: 5.37
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.0 %Cys (Translated Protein)
2.8 %Met (Translated Protein)
3.8 %Cys+Met (Translated Protein)
1.0 %Cys (Mature Protein)
2.8 %Met (Mature Protein)
3.8 %Cys+Met (Mature Protein)
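The Cys/Met percentages are residue counts divided by the protein length. The sketch below is a minimal illustration, assuming values are rounded to one decimal; the function name `residue_percent` is hypothetical. With 498 residues, counts of 5 Cys and 14 Met would reproduce the reported 1.0, 2.8 and 3.8.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    protein = "".join(protein.split()).upper()
    count = sum(protein.count(r) for r in residues)
    return round(100 * count / len(protein), 1)


# Arithmetic check against the reported values for a 498-residue protein.
for label, count in (("Cys", 5), ("Met", 14), ("Cys+Met", 19)):
    print(label, round(100 * count / 498, 1))

# The last 18 residues of the protein, used only as a short sample.
print(residue_percent("DWTNPCKNLTNTDQYCEL", "C"))
```

Calling `residue_percent` with the full protein sequence and `"C"`, `"M"` or `"CM"` is the way to check the record's figures directly.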
Secondary structure:
>Translated Secondary Structure
MDKFAMIIFGASGDLTKRKLMPALYSLYRDKRLTGSFTVLGIGRTVYSDEDYRSYILGEL
CCCEEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEEECCEEECCCHHHHHHHHHH
QQFVKAEEQNLELMSSFVSHLYYLPMDPAKVEGYSQLRERLVELTKEVDPDNLLFYLATP
HHHHHHHHCCHHHHHHHHHHHEECCCCHHHCCCHHHHHHHHHHHHHCCCCCCEEEEEECC
PSLYGVVPLHLKAAGLNTPHSRIIVEKPFGYDLESALELNKIYSSVFDEHQIYRIDHFLG
CCCEEEEEEEEEECCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHCCCCEEEHHHHHC
KETAQNVLAFRFANGIFEPLWNRNYIDYVEITAVENLGIEQRGGFYETAGALRDMVQNHL
HHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEHHCCCCCCCCCHHHHHHHHHHHHHHH
IQLVALTAMEPPAVFNADNFRNEVVKVYESLTPLTETDLNEHIVRGQYTASGNKKGYREE
HHHHHHHCCCCCCEECCHHHHHHHHHHHHHHCCCCCCCCHHHHHCCEEECCCCCCCCCHH
KGVAPDSRTETYIAMKLGISNWRWSGVPFYIRTGKQMPTKVTEIVVHFRETPHQMFRCSG
CCCCCCCCCCEEEEEEECCCCCCCCCCCEEEECCCCCCHHHHHHHHHHHCCHHHHHEECC
GNCPRANKLILRLQPNEGIVLKIGMKVPGAGFEVRQVTMDFSYAQLGGVPSGDAYARLID
CCCCCCCEEEEEEECCCCEEEEECCCCCCCCCEEEEEEEECCHHHHCCCCCCHHHHHHHH
DCIQGDPTLFTRSDAVEASWNFFDPVLRYWKDNPDAPLYGYPAGTWGPLESEAMMHEHGA
HHHCCCCCEEECCCCCCCCCCHHHHHHHHHCCCCCCCEECCCCCCCCCCCHHHHHHHCCC
DWTNPCKNLTNTDQYCEL
CHHHHHHCCCCCHHHCCC
>Mature Secondary Structure
MDKFAMIIFGASGDLTKRKLMPALYSLYRDKRLTGSFTVLGIGRTVYSDEDYRSYILGEL
CCCEEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEEECCEEECCCHHHHHHHHHH
QQFVKAEEQNLELMSSFVSHLYYLPMDPAKVEGYSQLRERLVELTKEVDPDNLLFYLATP
HHHHHHHHCCHHHHHHHHHHHEECCCCHHHCCCHHHHHHHHHHHHHCCCCCCEEEEEECC
PSLYGVVPLHLKAAGLNTPHSRIIVEKPFGYDLESALELNKIYSSVFDEHQIYRIDHFLG
CCCEEEEEEEEEECCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHCCCCEEEHHHHHC
KETAQNVLAFRFANGIFEPLWNRNYIDYVEITAVENLGIEQRGGFYETAGALRDMVQNHL
HHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEHHCCCCCCCCCHHHHHHHHHHHHHHH
IQLVALTAMEPPAVFNADNFRNEVVKVYESLTPLTETDLNEHIVRGQYTASGNKKGYREE
HHHHHHHCCCCCCEECCHHHHHHHHHHHHHHCCCCCCCCHHHHHCCEEECCCCCCCCCHH
KGVAPDSRTETYIAMKLGISNWRWSGVPFYIRTGKQMPTKVTEIVVHFRETPHQMFRCSG
CCCCCCCCCCEEEEEEECCCCCCCCCCCEEEECCCCCCHHHHHHHHHHHCCHHHHHEECC
GNCPRANKLILRLQPNEGIVLKIGMKVPGAGFEVRQVTMDFSYAQLGGVPSGDAYARLID
CCCCCCCEEEEEEECCCCEEEEECCCCCCCCCEEEEEEEECCHHHHCCCCCCHHHHHHHH
DCIQGDPTLFTRSDAVEASWNFFDPVLRYWKDNPDAPLYGYPAGTWGPLESEAMMHEHGA
HHHCCCCCEEECCCCCCCCCCHHHHHHHHHCCCCCCCEECCCCCCCCCCCHHHHHHHCCC
DWTNPCKNLTNTDQYCEL
CHHHHHHCCCCCHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 7542800 [H]