Definition | Bacteroides thetaiotaomicron VPI-5482 chromosome, complete genome. |
---|---|
Accession | NC_004663 |
Length | 6,260,361 |
The map label for this gene is glgP [H]
Identifier: 29346703
GI number: 29346703
Start: 1610593
End: 1613157
Strand: Reverse
Name: glgP [H]
Synonym: BT_1293
Alternate gene names: 29346703
Gene position: 1613157-1610593 (Counterclockwise)
Preceding gene: 29346704
Following gene: 29346701
Centisome position: 25.77
GC content: 44.68
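The coordinate fields above are mutually consistent, and the arithmetic can be checked with a short sketch. The function names `gene_length` and `centisome` are illustrative and not part of the database. The sketch assumes coordinates are 1-based and inclusive, and that the centisome position is taken at the gene's first base, which for a reverse-strand gene is the End coordinate. That convention is inferred from the numbers in this record, not stated by it.

```python
# Values copied from the record above.
GENOME_LENGTH = 6_260_361            # Length field of NC_004663
START, END = 1_610_593, 1_613_157    # Start / End fields (reverse strand)


def gene_length(start: int, end: int) -> int:
    """Length of a gene given 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the chromosome length."""
    return round(100 * position / genome_length, 2)


length = gene_length(START, END)
# Assumption: for a reverse-strand gene the first base is the End coordinate.
centi = centisome(END, GENOME_LENGTH)

print(length)  # 2565, matching the ">2565_bases" header
print(centi)   # 25.77, matching the Centisome position field
```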
Gene sequence:
>2565_bases ATGAAAATTAAAGTTAGTAATGTGAATACTCCGAACTGGAAAGAGGTTACAGTGAAGTCACGTATCCCGGCTGAGTTGGA GAAATTGTCCGAGATTTCACGCAACATTTGGTGGGCATGGAATTTTGAAGCGACAGAACTATTTAGAGATCTAGATCCGG AACTTTGGAAAGAATGTGGCCAGAACCCTGTGTTGTTGCTGGAACGTATGAGCTATGAGAAGCTGGAAGCGTTGGCAAAA GACAAGGTGATTCTGAGGAGAATGAATGATGTTTATACAAAATTCAGAGATTACATGGATGTGAAGCCGGATGAAACCCG TCCGTCTGTAGCTTATTTCAGCATGGAATATGGTTTGAGCAGCGTCCTGAAAATATATTCCGGTGGTCTGGGTGTATTGG CCGGTGACTACTTGAAAGAAGCTTCTGACAGCAATGTAGATCTTTGTGCGGTAGGTTTCCTGTATCGTTACGGTTACTTT ACTCAGACGTTGTCTATGGATGGACAGCAGATTGCCAACTACGAAGCGCAGAACTTCGGCCAGCTTCCTATCGACCGTGT GATGGATGCGAATGGTCAGCCGATGGTGGTGGATGTTCCTTATCTGGATTATTATGTACATGCTAACGTATGGCGTGTAA ATGTAGGACGTATTTCTTTGTATCTGCTGGATACAGATAATGAAATGAACAGCGAGTTCGACCGTCCTATTACTCATCAG CTTTATGGTGGCGACTGGGAAAACCGTCTGAAACAGGAAATCCTGTTGGGTATCGGTGGTATCCTGACCCTGAAAGCATT GGGTATCAAAAAAGATGTTTATCATTGTAACGAAGGACATGCTGCATTGATCAATGTGCAGCGTATCTGCGACTATGTAG CTACCGGACTGACATTCGATCAGTCTATCGAGCTGGTTCGCGCTTCTTCTCTTTATACAGTTCATACTCCGGTTCCTGCC GGTCACGACTACTTCGACGAAGGTTTGTTCGGTAAGTACATGGGTGGTTATCCTGCTAGAATGGGTATCAGCTGGGACGA CCTGATGGATCTTGGACGTAACAATCCGGGTGACAAGGGCGAACGTTTCTGTATGTCGGTATTTGCCTGCAACACTTCTC AGGAAGTAAACGGTGTAAGCTGGCTGCACGGAAAAGTTTCTCAGGAGATGTTCTCTACTATCTGGAAAGGTTACTTCCCC GAAGAAATACATGTAGGTTATGTGACTAATGGTGTTCACTTCCCCACATGGAGTGCTACCGAATGGAAAGAACTGTACTT TAAATATTTCAACGAGAACTTCTGGTACGACCAGTCGAATCCTAAGATTTGGGAAGCCATCTATAATGTACCCGATGAAG AGATCTGGAAGACTCGTATGACGATGAAGAATAAGTTGGTGGATTATATCCGCAAATCATTCCGTGATACATGGTTGAAA AATCAGGGAGATCCTTCGCGCATCGTTTCATTGATGGACAAGATTAACCCGAATGCGTTGCTGATTGGTTTCGGTCGTCG TTTCGCTACTTACAAACGTGCGCACTTGTTGTTTACTGACTTGGAACGTCTTTCTAAGATTGTGAACAACCCCGATTATC CGGTACAGTTCCTGTTTACAGGTAAGGCTCATCCGCACGATGGAGCAGGACAGGGTCTGATCAAACGTATTATCGAAATC TCCCGTCGTCCGGAATTCCTGGGTAAGATTATCTTCCTCGAAAACTACGATATGCAGTTGGCGCGTCGTCTGGTTTCAGG CGTTGATATCTGGTTGAACACTCCGACACGTCCGTTGGAAGCATCCGGTACATCAGGTGAAAAGGCTTTGATGAACGGTG TTGTCAACTTCTCTGTATTGGACGGATGGTGGCTGGAAGGCTACCGTGAAGGTGCAGGATGGGCGTTGACTGAAAAACGT 
ACTTATCAGAATCAGGAACATCAGGATCAGTTGGATGCTGCTACTATCTACAGTATTCTTGAAACAGAAATCCTGCCGTT GTACTATGCTCGTAACAAGAAAGGCTACTCAGAAGGCTGGATCAAGGTAGTGAAGAATTCTATCGCTCAGATCGCTCCTC ACTATACGATGAAACGCCAGTTGGACGACTACTACAATAAGTTCTACAATAAGTTGGCAAAACGTTTCCATATGCTGTCT GCTAATGACAATGCAAAAGCAAAAGAAATTGCTGCATGGAAAGAAGAAGTCGTTGCCAAGTGGGATTCTATCGAAATCGT ATCTTGCGACAAGCTGGAAGATTTGAAAGCCGGTGATATCGAAAGCGGAAAAGAATATACTATTACTTACGTAATCGATG AAAAAGGCTTGAATGATGCTATAGGGCTTGAACTGGTAACTACTTATACAACTGCGGATGGTAAACAACACGTTTACTCT GTAGAACCGTTCAGCGTTATCAAGAAAGAAGGCGACCTTTACACATTCCAGGTTAAACATAGCCTGTCAAATGCCGGTAG CTTCAAGGTGTCTTACCGTATGTTCCCGAAGAATCCGGAACTTCCGCACCGTCAGGACTTCTGCTACGTGCGTTGGTTTA TCTGA
Upstream 100 bases:
>100_bases CAATATTATTACGAGGCTTATGATATTGCCTTGCGCAATGCCATGAAGCGTCAGTTAGGCTAAAGAAATATTTTATTATC AATAAGTAAACAAAAACATT
Downstream 100 bases:
>100_bases TAGCTTGACGAAGTAGGGAAGTGACGCTTCCCTGCTTGACGTCCCTATATAAAAGAAGCCCCTGCAATTCGAGCGATCGA AATTGCAGGGGCTATCTTTT
Product: alpha-glucan phosphorylase
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 854; Mature: 854
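The residue count follows from the gene length: 2565 bases form 855 codons, and the last codon (TGA in the gene sequence above) is a stop codon that is not translated. A minimal sketch of that arithmetic, with `protein_length` as a hypothetical helper name:

```python
GENE_LENGTH_BASES = 2565  # from the ">2565_bases" header of the gene sequence


def protein_length(n_bases: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by a complete coding sequence."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = n_bases // 3
    # The terminal stop codon does not contribute a residue.
    return codons - 1 if has_stop_codon else codons


print(protein_length(GENE_LENGTH_BASES))  # 854
```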
Protein sequence:
>854_residues MKIKVSNVNTPNWKEVTVKSRIPAELEKLSEISRNIWWAWNFEATELFRDLDPELWKECGQNPVLLLERMSYEKLEALAK DKVILRRMNDVYTKFRDYMDVKPDETRPSVAYFSMEYGLSSVLKIYSGGLGVLAGDYLKEASDSNVDLCAVGFLYRYGYF TQTLSMDGQQIANYEAQNFGQLPIDRVMDANGQPMVVDVPYLDYYVHANVWRVNVGRISLYLLDTDNEMNSEFDRPITHQ LYGGDWENRLKQEILLGIGGILTLKALGIKKDVYHCNEGHAALINVQRICDYVATGLTFDQSIELVRASSLYTVHTPVPA GHDYFDEGLFGKYMGGYPARMGISWDDLMDLGRNNPGDKGERFCMSVFACNTSQEVNGVSWLHGKVSQEMFSTIWKGYFP EEIHVGYVTNGVHFPTWSATEWKELYFKYFNENFWYDQSNPKIWEAIYNVPDEEIWKTRMTMKNKLVDYIRKSFRDTWLK NQGDPSRIVSLMDKINPNALLIGFGRRFATYKRAHLLFTDLERLSKIVNNPDYPVQFLFTGKAHPHDGAGQGLIKRIIEI SRRPEFLGKIIFLENYDMQLARRLVSGVDIWLNTPTRPLEASGTSGEKALMNGVVNFSVLDGWWLEGYREGAGWALTEKR TYQNQEHQDQLDAATIYSILETEILPLYYARNKKGYSEGWIKVVKNSIAQIAPHYTMKRQLDDYYNKFYNKLAKRFHMLS ANDNAKAKEIAAWKEEVVAKWDSIEIVSCDKLEDLKAGDIESGKEYTITYVIDEKGLNDAIGLELVTTYTTADGKQHVYS VEPFSVIKKEGDLYTFQVKHSLSNAGSFKVSYRMFPKNPELPHRQDFCYVRWFI
Sequences:
>Translated_854_residues MKIKVSNVNTPNWKEVTVKSRIPAELEKLSEISRNIWWAWNFEATELFRDLDPELWKECGQNPVLLLERMSYEKLEALAK DKVILRRMNDVYTKFRDYMDVKPDETRPSVAYFSMEYGLSSVLKIYSGGLGVLAGDYLKEASDSNVDLCAVGFLYRYGYF TQTLSMDGQQIANYEAQNFGQLPIDRVMDANGQPMVVDVPYLDYYVHANVWRVNVGRISLYLLDTDNEMNSEFDRPITHQ LYGGDWENRLKQEILLGIGGILTLKALGIKKDVYHCNEGHAALINVQRICDYVATGLTFDQSIELVRASSLYTVHTPVPA GHDYFDEGLFGKYMGGYPARMGISWDDLMDLGRNNPGDKGERFCMSVFACNTSQEVNGVSWLHGKVSQEMFSTIWKGYFP EEIHVGYVTNGVHFPTWSATEWKELYFKYFNENFWYDQSNPKIWEAIYNVPDEEIWKTRMTMKNKLVDYIRKSFRDTWLK NQGDPSRIVSLMDKINPNALLIGFGRRFATYKRAHLLFTDLERLSKIVNNPDYPVQFLFTGKAHPHDGAGQGLIKRIIEI SRRPEFLGKIIFLENYDMQLARRLVSGVDIWLNTPTRPLEASGTSGEKALMNGVVNFSVLDGWWLEGYREGAGWALTEKR TYQNQEHQDQLDAATIYSILETEILPLYYARNKKGYSEGWIKVVKNSIAQIAPHYTMKRQLDDYYNKFYNKLAKRFHMLS ANDNAKAKEIAAWKEEVVAKWDSIEIVSCDKLEDLKAGDIESGKEYTITYVIDEKGLNDAIGLELVTTYTTADGKQHVYS VEPFSVIKKEGDLYTFQVKHSLSNAGSFKVSYRMFPKNPELPHRQDFCYVRWFI >Mature_854_residues MKIKVSNVNTPNWKEVTVKSRIPAELEKLSEISRNIWWAWNFEATELFRDLDPELWKECGQNPVLLLERMSYEKLEALAK DKVILRRMNDVYTKFRDYMDVKPDETRPSVAYFSMEYGLSSVLKIYSGGLGVLAGDYLKEASDSNVDLCAVGFLYRYGYF TQTLSMDGQQIANYEAQNFGQLPIDRVMDANGQPMVVDVPYLDYYVHANVWRVNVGRISLYLLDTDNEMNSEFDRPITHQ LYGGDWENRLKQEILLGIGGILTLKALGIKKDVYHCNEGHAALINVQRICDYVATGLTFDQSIELVRASSLYTVHTPVPA GHDYFDEGLFGKYMGGYPARMGISWDDLMDLGRNNPGDKGERFCMSVFACNTSQEVNGVSWLHGKVSQEMFSTIWKGYFP EEIHVGYVTNGVHFPTWSATEWKELYFKYFNENFWYDQSNPKIWEAIYNVPDEEIWKTRMTMKNKLVDYIRKSFRDTWLK NQGDPSRIVSLMDKINPNALLIGFGRRFATYKRAHLLFTDLERLSKIVNNPDYPVQFLFTGKAHPHDGAGQGLIKRIIEI SRRPEFLGKIIFLENYDMQLARRLVSGVDIWLNTPTRPLEASGTSGEKALMNGVVNFSVLDGWWLEGYREGAGWALTEKR TYQNQEHQDQLDAATIYSILETEILPLYYARNKKGYSEGWIKVVKNSIAQIAPHYTMKRQLDDYYNKFYNKLAKRFHMLS ANDNAKAKEIAAWKEEVVAKWDSIEIVSCDKLEDLKAGDIESGKEYTITYVIDEKGLNDAIGLELVTTYTTADGKQHVYS VEPFSVIKKEGDLYTFQVKHSLSNAGSFKVSYRMFPKNPELPHRQDFCYVRWFI
Specific function: Phosphorylase is an important allosteric enzyme in carbohydrate metabolism. Enzymes from different sources differ in their regulatory mechanisms and in their natural substrates. However, all known phosphorylases share catalytic and structural properties [
COG id: COG0058
COG function: function code G; Glucan phosphorylase
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the glycogen phosphorylase family [H]
Homologues:
Organism=Caenorhabditis elegans, GI32566204, Length=495, Percent_Identity=24.040404040404, Blast_Score=74, Evalue=3e-13
Organism=Caenorhabditis elegans, GI17564550, Length=495, Percent_Identity=24.040404040404, Blast_Score=74, Evalue=4e-13
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR011834
- InterPro: IPR000811 [H]
Pfam domain/function: PF00343 Phosphorylase [H]
EC number: 2.4.1.1 [H]
Molecular weight: Translated: 98543; Mature: 98543
Theoretical pI: Translated: 6.26; Mature: 6.26
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.9 %Cys (Translated Protein)
2.6 %Met (Translated Protein)
3.5 %Cys+Met (Translated Protein)
0.9 %Cys (Mature Protein)
2.6 %Met (Mature Protein)
3.5 %Cys+Met (Mature Protein)
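Percentages like these are residue counts divided by the protein length. The sketch below shows one way to compute them. `residue_percent` is an illustrative name, and the input is only the first 80 residues of the protein sequence above, so the printed values will differ from the full-length figures in this record.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any letter in `residues`."""
    protein = protein.upper()
    if not protein:
        raise ValueError("empty protein sequence")
    hits = sum(protein.count(r) for r in residues.upper())
    return round(100 * hits / len(protein), 1)


# Excerpt only: the first 80 residues of the 854-residue sequence above.
excerpt = (
    "MKIKVSNVNTPNWKEVTVKSRIPAELEKLSEISRNIWWAWNFEATELFRD"
    "LDPELWKECGQNPVLLLERMSYEKLEALAK"
)

for label, letters in (("Cys", "C"), ("Met", "M"), ("Cys+Met", "CM")):
    print(f"{residue_percent(excerpt, letters)} %{label} (excerpt)")
```

Passing the full 854-residue sequence in place of `excerpt` gives values comparable to the fields above.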
Secondary structure:
>Translated Secondary Structure MKIKVSNVNTPNWKEVTVKSRIPAELEKLSEISRNIWWAWNFEATELFRDLDPELWKECG CEEEECCCCCCCCCEEEECCCCCHHHHHHHHHHCCEEEEECCHHHHHHHHCCHHHHHHHC QNPVLLLERMSYEKLEALAKDKVILRRMNDVYTKFRDYMDVKPDETRPSVAYFSMEYGLS CCCEEEHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEHHHHHH SVLKIYSGGLGVLAGDYLKEASDSNVDLCAVGFLYRYGYFTQTLSMDGQQIANYEAQNFG HHHHHHCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHCCCCCCCC QLPIDRVMDANGQPMVVDVPYLDYYVHANVWRVNVGRISLYLLDTDNEMNSEFDRPITHQ CCCHHHHHCCCCCEEEEECCCEEEEEEEEEEEEEECEEEEEEEECCCCCCHHHCCCCHHH LYGGDWENRLKQEILLGIGGILTLKALGIKKDVYHCNEGHAALINVQRICDYVATGLTFD HCCCCHHHHHHHHHHHHCCHHHHHHHHCCCHHHEECCCCCEEEEEHHHHHHHHHHCCCCH QSIELVRASSLYTVHTPVPAGHDYFDEGLFGKYMGGYPARMGISWDDLMDLGRNNPGDKG HHHHEEEECCEEEEECCCCCCCCHHHCCCCHHHCCCCCCCCCCCHHHHHHHCCCCCCCHH ERFCMSVFACNTSQEVNGVSWLHGKVSQEMFSTIWKGYFPEEIHVGYVTNGVHFPTWSAT HHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEEEECCEECCCCCCH EWKELYFKYFNENFWYDQSNPKIWEAIYNVPDEEIWKTRMTMKNKLVDYIRKSFRDTWLK HHHHHHHHHHCCCCEEECCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCC NQGDPSRIVSLMDKINPNALLIGFGRRFATYKRAHLLFTDLERLSKIVNNPDYPVQFLFT CCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEE GKAHPHDGAGQGLIKRIIEISRRPEFLGKIIFLENYDMQLARRLVSGVDIWLNTPTRPLE CCCCCCCCCCHHHHHHHHHHHCCHHHHEEEEEEECCCHHHHHHHHCCCEEEECCCCCCCC ASGTSGEKALMNGVVNFSVLDGWWLEGYREGAGWALTEKRTYQNQEHQDQLDAATIYSIL CCCCCCHHHHHHCCEEEEEECCHHHHCEECCCCEEEECHHHHCCCHHHHHHHHHHHHHHH ETEILPLYYARNKKGYSEGWIKVVKNSIAQIAPHYTMKRQLDDYYNKFYNKLAKRFHMLS HHHHHEEEEECCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCC ANDNAKAKEIAAWKEEVVAKWDSIEIVSCDKLEDLKAGDIESGKEYTITYVIDEKGLNDA CCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCCCCCC IGLELVTTYTTADGKQHVYSVEPFSVIKKEGDLYTFQVKHSLSNAGSFKVSYRMFPKNPE CCEEEEEEEECCCCCCCEEECCHHHHHHCCCCEEEEEEEECCCCCCCEEEEEEECCCCCC LPHRQDFCYVRWFI CCCCCCEEEEEEEC >Mature Secondary Structure MKIKVSNVNTPNWKEVTVKSRIPAELEKLSEISRNIWWAWNFEATELFRDLDPELWKECG CEEEECCCCCCCCCEEEECCCCCHHHHHHHHHHCCEEEEECCHHHHHHHHCCHHHHHHHC QNPVLLLERMSYEKLEALAKDKVILRRMNDVYTKFRDYMDVKPDETRPSVAYFSMEYGLS 
CCCEEEHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEHHHHHH SVLKIYSGGLGVLAGDYLKEASDSNVDLCAVGFLYRYGYFTQTLSMDGQQIANYEAQNFG HHHHHHCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHCCCCCCCC QLPIDRVMDANGQPMVVDVPYLDYYVHANVWRVNVGRISLYLLDTDNEMNSEFDRPITHQ CCCHHHHHCCCCCEEEEECCCEEEEEEEEEEEEEECEEEEEEEECCCCCCHHHCCCCHHH LYGGDWENRLKQEILLGIGGILTLKALGIKKDVYHCNEGHAALINVQRICDYVATGLTFD HCCCCHHHHHHHHHHHHCCHHHHHHHHCCCHHHEECCCCCEEEEEHHHHHHHHHHCCCCH QSIELVRASSLYTVHTPVPAGHDYFDEGLFGKYMGGYPARMGISWDDLMDLGRNNPGDKG HHHHEEEECCEEEEECCCCCCCCHHHCCCCHHHCCCCCCCCCCCHHHHHHHCCCCCCCHH ERFCMSVFACNTSQEVNGVSWLHGKVSQEMFSTIWKGYFPEEIHVGYVTNGVHFPTWSAT HHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEEEECCEECCCCCCH EWKELYFKYFNENFWYDQSNPKIWEAIYNVPDEEIWKTRMTMKNKLVDYIRKSFRDTWLK HHHHHHHHHHCCCCEEECCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCC NQGDPSRIVSLMDKINPNALLIGFGRRFATYKRAHLLFTDLERLSKIVNNPDYPVQFLFT CCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEE GKAHPHDGAGQGLIKRIIEISRRPEFLGKIIFLENYDMQLARRLVSGVDIWLNTPTRPLE CCCCCCCCCCHHHHHHHHHHHCCHHHHEEEEEEECCCHHHHHHHHCCCEEEECCCCCCCC ASGTSGEKALMNGVVNFSVLDGWWLEGYREGAGWALTEKRTYQNQEHQDQLDAATIYSIL CCCCCCHHHHHHCCEEEEEECCHHHHCEECCCCEEEECHHHHCCCHHHHHHHHHHHHHHH ETEILPLYYARNKKGYSEGWIKVVKNSIAQIAPHYTMKRQLDDYYNKFYNKLAKRFHMLS HHHHHEEEEECCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCC ANDNAKAKEIAAWKEEVVAKWDSIEIVSCDKLEDLKAGDIESGKEYTITYVIDEKGLNDA CCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCCCCCC IGLELVTTYTTADGKQHVYSVEPFSVIKKEGDLYTFQVKHSLSNAGSFKVSYRMFPKNPE CCEEEEEEEECCCCCCCEEECCHHHHHHCCCCEEEEEEEECCCCCCCEEEEEEECCCCCC LPHRQDFCYVRWFI CCCCCCEEEEEEEC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 12788972 [H]