Definition | Methylibium petroleiphilum PM1 chromosome, complete genome. |
---|---|
Accession | NC_008825 |
Length | 4,044,195 |
Click here to switch to the map view.
The map label for this gene is tktA [H]
Identifier: 124265452
GI number: 124265452
Start: 279373
End: 281415
Strand: Direct
Name: tktA [H]
Synonym: Mpe_A0259
Alternate gene names: 124265452
Gene position: 279373-281415 (Clockwise)
Preceding gene: 124265448
Following gene: 124265453
Centisome position: 6.91
GC content: 69.26
Gene sequence:
>2043_bases ATGGCTGCCCTCGAATCCCGCACGCCCGCCACCCCCGACACCTCGCTGATGGCCAACGCGATCCGCGCGCTGGCGATGGA CGCAGTGCAACAGGCCAATTCGGGCCACCCTGGCGCGCCGATGGGCATGGCCGACATCGCGGTCGCGCTGTGGGGCCGAC ACCTGCGCCACAACCCGGCCAACCCGGCCTGGGCCGACCGTGATCGCTTCGTGCTGAGCAACGGCCACGGATCGATGCTG ATCTATGCGCTGCTGCACCTGACCGGGTATGCGCTGCCGCTCGAGGAACTGCGCAACTTCCGCCAGCTTCACAGCAAGAC GCCGGGCCACCCCGAGGTCGGCATCACGCCCGGCGTCGAGACGACCACCGGCCCGCTGGGGCAGGGCCTCAGCAACGCGG TCGGCATGGCGCTGGCCGAGAAGCTGCTGGCGGCGGAGTTCAACCGCGACGGCTATGCCATCGTCGATCATCACACCTAT GTCTTCCTCGGCGACGGCTGCCTGATGGAGGGCATCAGCCACGAGGCCAGCGCGCTGGCGGGCGCCTGGAAGCTCGACAA GCTGATCGCGCTGTACGACGACAACGGGATCAGCATCGATGGCCAGGTGGCGCCGTGGTTCGTGGACGACACCGCGAAGC GCTTCGCGGCCTACGGCTGGAATGTGATCGGGCCGATCGACGGTCACGATGTCGATGCAGTCGACGCGGCGATCACGCAA GCCAAGCGCCATGTGGGCCAACCCAGTCTGATCGTGTGCAAGACCGCGATCGGCAAGGGCTCACCGAACCGCGCCAATAC CGCCAAGGCGCACGGCGAGCCGCTCGGCGCCGAGGAGATTGCGCTCACGCGGGCCACGATCGGCTGGCCGCATGCGCCAT TCGAGATCCCGGCCGAGGCCTACAACGACTGGGACGCCAAGGACGCCGGTGCCAACCTGCAGGCCGACTGGGAAGGCCGC TTCGAGGCCTACCGCGCCGACCATCCCGCCGCCGCGGCCGAGTTCGAACGCCGGATGGCGGGCGCGCTGCCGGAGAACTT TGCGCAGATCGCAGTGGACGCTGCCATCGCGGCACACGCCAAGGCCGAGACGGTCGCCAGCCGCAAGGCCAGCCAGCTCG CGCTGGAGGCCTTCACCGCTGCGTTGCCCGAACTGCTGGGCGGCAGTGCCGACCTCACCGGCTCGAATCTCACCAACACC AAGAGCACACCGGCGCTGCGCTTCGATGCGGCAGGTGCTTCGAACGGTGGCCGCCACATCAACTACGGCGTGCGCGAGTT CGGCATGGCGGCGTTGATGAACGGCATCGCGCTGCACGGCGGCTTCATCCCCTACGGCGGCACCTTCCTGACCTTCAGCG ACTACTCGCGCAACGCCATCCGCATGGCCGCGCTGATGAAGCAGCGCGTGGTGCATGTCTTCACCCACGACAGCATCGGC CTCGGCGAGGACGGCCCGACGCACCAGAGCGTGGAGCACGCCGCGAGCCTGCGGCTGATCCCGAACCTCGATGTCTGGCG TCCCGCCGACACGGCCGAGACCACGATCGCGTGGACCAAGGCGCTGCTGAACGTGGACCGCCCGACCGCGCTGCTGCTGT CGCGTCAGAACCTGCCCTACCTGCCGAAGGCCGGGCTCGACGAGGTGGCGCGCGGCGCCTATGTGCTGGCCGAGCCGGCC GAGGTGGGGCTGAAGAGGAAGGCCCAGGCAGTGCTCATCGCGACCGGGTCCGAGGTGCAGCTCGCGCTGCATGCGCAGCA GCAGCTCGCCGCCGCCGGCATCGCGGTGCGCGTGGTGTCGATGCCCAGCACCACCACCTTCGATCGCCAGGACGTGGCCT ACAAGCGCCGCGTGTTGCCCGACGGGCTGCCGCGGGTGGCGGTGGAGGCCGGCGTCACCGATGGCTGGTGGAAGTACGGT TGCGCAGCCGTCATCGGCCTCGACACCTACGGCGAGTCGGCCCCGGCCGGTGCGTTGTTCAAGCACTTCGGCTTCACGGC CGAGAACGTGGCCGCGACCGTGCGCCGGGTGCTGAAGAAGTGA
Upstream 100 bases:
>100_bases CAAATCGGCGGTGAGGTGCAGACCCTGCATGGGTGGCGCGACGGGGCTCGTAAAATCGTCGATTCTGCCGCTTCCGTGGC GCCCTTTTCCCCTGCAAACC
Downstream 100 bases:
>100_bases GGGCCCGCCCGCCGCTGCGAGGCTCCGTGACATCTGCCGCGGGGAGCACGGCCGGATGTCGGGGCCGGCGCGGCGGCGCG GGGCGTTTTGTCGTCGAATG
Product: transketolase
Products: NA
Alternate protein names: TK [H]
Number of amino acids: Translated: 680; Mature: 679
Protein sequence:
>680_residues MAALESRTPATPDTSLMANAIRALAMDAVQQANSGHPGAPMGMADIAVALWGRHLRHNPANPAWADRDRFVLSNGHGSML IYALLHLTGYALPLEELRNFRQLHSKTPGHPEVGITPGVETTTGPLGQGLSNAVGMALAEKLLAAEFNRDGYAIVDHHTY VFLGDGCLMEGISHEASALAGAWKLDKLIALYDDNGISIDGQVAPWFVDDTAKRFAAYGWNVIGPIDGHDVDAVDAAITQ AKRHVGQPSLIVCKTAIGKGSPNRANTAKAHGEPLGAEEIALTRATIGWPHAPFEIPAEAYNDWDAKDAGANLQADWEGR FEAYRADHPAAAAEFERRMAGALPENFAQIAVDAAIAAHAKAETVASRKASQLALEAFTAALPELLGGSADLTGSNLTNT KSTPALRFDAAGASNGGRHINYGVREFGMAALMNGIALHGGFIPYGGTFLTFSDYSRNAIRMAALMKQRVVHVFTHDSIG LGEDGPTHQSVEHAASLRLIPNLDVWRPADTAETTIAWTKALLNVDRPTALLLSRQNLPYLPKAGLDEVARGAYVLAEPA EVGLKRKAQAVLIATGSEVQLALHAQQQLAAAGIAVRVVSMPSTTTFDRQDVAYKRRVLPDGLPRVAVEAGVTDGWWKYG CAAVIGLDTYGESAPAGALFKHFGFTAENVAATVRRVLKK
Sequences:
>Translated_680_residues MAALESRTPATPDTSLMANAIRALAMDAVQQANSGHPGAPMGMADIAVALWGRHLRHNPANPAWADRDRFVLSNGHGSML IYALLHLTGYALPLEELRNFRQLHSKTPGHPEVGITPGVETTTGPLGQGLSNAVGMALAEKLLAAEFNRDGYAIVDHHTY VFLGDGCLMEGISHEASALAGAWKLDKLIALYDDNGISIDGQVAPWFVDDTAKRFAAYGWNVIGPIDGHDVDAVDAAITQ AKRHVGQPSLIVCKTAIGKGSPNRANTAKAHGEPLGAEEIALTRATIGWPHAPFEIPAEAYNDWDAKDAGANLQADWEGR FEAYRADHPAAAAEFERRMAGALPENFAQIAVDAAIAAHAKAETVASRKASQLALEAFTAALPELLGGSADLTGSNLTNT KSTPALRFDAAGASNGGRHINYGVREFGMAALMNGIALHGGFIPYGGTFLTFSDYSRNAIRMAALMKQRVVHVFTHDSIG LGEDGPTHQSVEHAASLRLIPNLDVWRPADTAETTIAWTKALLNVDRPTALLLSRQNLPYLPKAGLDEVARGAYVLAEPA EVGLKRKAQAVLIATGSEVQLALHAQQQLAAAGIAVRVVSMPSTTTFDRQDVAYKRRVLPDGLPRVAVEAGVTDGWWKYG CAAVIGLDTYGESAPAGALFKHFGFTAENVAATVRRVLKK >Mature_679_residues AALESRTPATPDTSLMANAIRALAMDAVQQANSGHPGAPMGMADIAVALWGRHLRHNPANPAWADRDRFVLSNGHGSMLI YALLHLTGYALPLEELRNFRQLHSKTPGHPEVGITPGVETTTGPLGQGLSNAVGMALAEKLLAAEFNRDGYAIVDHHTYV FLGDGCLMEGISHEASALAGAWKLDKLIALYDDNGISIDGQVAPWFVDDTAKRFAAYGWNVIGPIDGHDVDAVDAAITQA KRHVGQPSLIVCKTAIGKGSPNRANTAKAHGEPLGAEEIALTRATIGWPHAPFEIPAEAYNDWDAKDAGANLQADWEGRF EAYRADHPAAAAEFERRMAGALPENFAQIAVDAAIAAHAKAETVASRKASQLALEAFTAALPELLGGSADLTGSNLTNTK STPALRFDAAGASNGGRHINYGVREFGMAALMNGIALHGGFIPYGGTFLTFSDYSRNAIRMAALMKQRVVHVFTHDSIGL GEDGPTHQSVEHAASLRLIPNLDVWRPADTAETTIAWTKALLNVDRPTALLLSRQNLPYLPKAGLDEVARGAYVLAEPAE VGLKRKAQAVLIATGSEVQLALHAQQQLAAAGIAVRVVSMPSTTTFDRQDVAYKRRVLPDGLPRVAVEAGVTDGWWKYGC AAVIGLDTYGESAPAGALFKHFGFTAENVAATVRRVLKK
Specific function: Unknown
COG id: COG0021
COG function: function code G; Transketolase
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the transketolase family [H]
Homologues:
Organism=Homo sapiens, GI205277463, Length=709, Percent_Identity=26.2341325811001, Blast_Score=177, Evalue=3e-44, Organism=Homo sapiens, GI4507521, Length=709, Percent_Identity=26.2341325811001, Blast_Score=177, Evalue=3e-44, Organism=Homo sapiens, GI133778974, Length=597, Percent_Identity=26.8006700167504, Blast_Score=149, Evalue=7e-36, Organism=Homo sapiens, GI225637459, Length=603, Percent_Identity=23.5489220563847, Blast_Score=114, Evalue=2e-25, Organism=Homo sapiens, GI225637461, Length=507, Percent_Identity=24.0631163708087, Blast_Score=104, Evalue=3e-22, Organism=Homo sapiens, GI225637463, Length=507, Percent_Identity=24.0631163708087, Blast_Score=103, Evalue=5e-22, Organism=Escherichia coli, GI48994911, Length=667, Percent_Identity=60.8695652173913, Blast_Score=819, Evalue=0.0, Organism=Escherichia coli, GI1788808, Length=667, Percent_Identity=59.0704647676162, Blast_Score=791, Evalue=0.0, Organism=Caenorhabditis elegans, GI17539652, Length=607, Percent_Identity=28.171334431631, Blast_Score=174, Evalue=1e-43, Organism=Saccharomyces cerevisiae, GI6325331, Length=670, Percent_Identity=47.0149253731343, Blast_Score=584, Evalue=1e-167, Organism=Saccharomyces cerevisiae, GI6319593, Length=670, Percent_Identity=43.2835820895522, Blast_Score=536, Evalue=1e-153, Organism=Drosophila melanogaster, GI24666278, Length=687, Percent_Identity=24.3085880640466, Blast_Score=152, Evalue=6e-37, Organism=Drosophila melanogaster, GI45551847, Length=268, Percent_Identity=33.2089552238806, Blast_Score=131, Evalue=2e-30, Organism=Drosophila melanogaster, GI45550715, Length=268, Percent_Identity=33.2089552238806, Blast_Score=131, Evalue=2e-30, Organism=Drosophila melanogaster, GI24645119, Length=232, Percent_Identity=33.6206896551724, Blast_Score=110, Evalue=2e-24,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR009014 - InterPro: IPR015941 - InterPro: IPR005475 - InterPro: IPR005478 - InterPro: IPR020826 - InterPro: IPR005476 - InterPro: IPR005474 [H]
Pfam domain/function: PF02779 Transket_pyr; PF02780 Transketolase_C; PF00456 Transketolase_N [H]
EC number: =2.2.1.1 [H]
Molecular weight: Translated: 72305; Mature: 72173
Theoretical pI: Translated: 6.58; Mature: 6.58
Prosite motif: PS00801 TRANSKETOLASE_1
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.4 %Cys (Translated Protein) 2.1 %Met (Translated Protein) 2.5 %Cys+Met (Translated Protein) 0.4 %Cys (Mature Protein) 1.9 %Met (Mature Protein) 2.4 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MAALESRTPATPDTSLMANAIRALAMDAVQQANSGHPGAPMGMADIAVALWGRHLRHNPA CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHCCCC NPAWADRDRFVLSNGHGSMLIYALLHLTGYALPLEELRNFRQLHSKTPGHPEVGITPGVE CCCCCCCCCEEEECCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCC TTTGPLGQGLSNAVGMALAEKLLAAEFNRDGYAIVDHHTYVFLGDGCLMEGISHEASALA CCCCCHHCCHHHHHHHHHHHHHHHHHCCCCCCEEEECCEEEEECCCHHHHCCCHHHHHHH GAWKLDKLIALYDDNGISIDGQVAPWFVDDTAKRFAAYGWNVIGPIDGHDVDAVDAAITQ HHHHHCEEEEEECCCCEEECCCCCCEEECHHHHHHHHCCCCEECCCCCCCCHHHHHHHHH AKRHVGQPSLIVCKTAIGKGSPNRANTAKAHGEPLGAEEIALTRATIGWPHAPFEIPAEA HHHHCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHHHEEEEEECCCCCCCCCCCHHH YNDWDAKDAGANLQADWEGRFEAYRADHPAAAAEFERRMAGALPENFAQIAVDAAIAAHA CCCCCCCCCCCCCEECCCCCHHHHCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH KAETVASRKASQLALEAFTAALPELLGGSADLTGSNLTNTKSTPALRFDAAGASNGGRHI HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCEE NYGVREFGMAALMNGIALHGGFIPYGGTFLTFSDYSRNAIRMAALMKQRVVHVFTHDSIG ECCHHHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCHHHHHHHHHHHHHEEEEECCCCC LGEDGPTHQSVEHAASLRLIPNLDVWRPADTAETTIAWTKALLNVDRPTALLLSRQNLPY CCCCCCCHHHHHHHHHEEECCCCCCCCCCCCCHHHHHHHHHHHCCCCCCEEEEECCCCCC LPKAGLDEVARGAYVLAEPAEVGLKRKAQAVLIATGSEVQLALHAQQQLAAAGIAVRVVS CCCCCHHHHHCCCEEEECHHHHCHHCCCCEEEEEECCCEEEEHHHHHHHHHCCEEEEEEE MPSTTTFDRQDVAYKRRVLPDGLPRVAVEAGVTDGWWKYGCAAVIGLDTYGESAPAGALF CCCCCCCCHHHHHHHHHCCCCCCCCHHHCCCCCCCHHHHCCEEEEEECCCCCCCCHHHHH KHFGFTAENVAATVRRVLKK HHHCCCHHHHHHHHHHHHCC >Mature Secondary Structure AALESRTPATPDTSLMANAIRALAMDAVQQANSGHPGAPMGMADIAVALWGRHLRHNPA CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHCCCC NPAWADRDRFVLSNGHGSMLIYALLHLTGYALPLEELRNFRQLHSKTPGHPEVGITPGVE CCCCCCCCCEEEECCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCC TTTGPLGQGLSNAVGMALAEKLLAAEFNRDGYAIVDHHTYVFLGDGCLMEGISHEASALA CCCCCHHCCHHHHHHHHHHHHHHHHHCCCCCCEEEECCEEEEECCCHHHHCCCHHHHHHH GAWKLDKLIALYDDNGISIDGQVAPWFVDDTAKRFAAYGWNVIGPIDGHDVDAVDAAITQ HHHHHCEEEEEECCCCEEECCCCCCEEECHHHHHHHHCCCCEECCCCCCCCHHHHHHHHH AKRHVGQPSLIVCKTAIGKGSPNRANTAKAHGEPLGAEEIALTRATIGWPHAPFEIPAEA HHHHCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHHHEEEEEECCCCCCCCCCCHHH YNDWDAKDAGANLQADWEGRFEAYRADHPAAAAEFERRMAGALPENFAQIAVDAAIAAHA CCCCCCCCCCCCCEECCCCCHHHHCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH KAETVASRKASQLALEAFTAALPELLGGSADLTGSNLTNTKSTPALRFDAAGASNGGRHI HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCEE NYGVREFGMAALMNGIALHGGFIPYGGTFLTFSDYSRNAIRMAALMKQRVVHVFTHDSIG ECCHHHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCHHHHHHHHHHHHHEEEEECCCCC LGEDGPTHQSVEHAASLRLIPNLDVWRPADTAETTIAWTKALLNVDRPTALLLSRQNLPY CCCCCCCHHHHHHHHHEEECCCCCCCCCCCCCHHHHHHHHHHHCCCCCCEEEEECCCCCC LPKAGLDEVARGAYVLAEPAEVGLKRKAQAVLIATGSEVQLALHAQQQLAAAGIAVRVVS CCCCCHHHHHCCCEEEECHHHHCHHCCCCEEEEEECCCEEEEHHHHHHHHHCCEEEEEEE MPSTTTFDRQDVAYKRRVLPDGLPRVAVEAGVTDGWWKYGCAAVIGLDTYGESAPAGALF CCCCCCCCHHHHHHHHHCCCCCCCCHHHCCCCCCCHHHHCCEEEEEECCCCCCCCHHHHH KHFGFTAENVAATVRRVLKK HHHCCCHHHHHHHHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 7542800 [H]