Definition Bacteroides thetaiotaomicron VPI-5482 chromosome, complete genome.
Accession NC_004663
Length 6,260,361

The map label for this gene is glgP [H]

Identifier: 29346703

GI number: 29346703

Start: 1610593

End: 1613157

Strand: Reverse

Name: glgP [H]

Synonym: BT_1293

Alternate gene names: 29346703

Gene position: 1613157-1610593 (Counterclockwise)

Preceding gene: 29346704

Following gene: 29346701

Centisome position: 25.77
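
The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes (the record does not state the formula) that the centisome position is the gene's 5' coordinate expressed as a percentage of the genome length; for a reverse-strand gene that is the larger coordinate. The function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 6_260_361   # Length of NC_004663
GENE_START = 1_610_593      # Start
GENE_END = 1_613_157        # End (5' end of the gene, since Strand is Reverse)


def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given inclusive 1-based coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length (assumed definition)."""
    return round(100.0 * position / genome_length, 2)


length_bp = gene_length(GENE_START, GENE_END)
# One codon is the stop codon, so the protein has one residue fewer than codons.
residues = length_bp // 3 - 1

print(length_bp)                              # 2565
print(residues)                               # 854
print(centisome(GENE_END, GENOME_LENGTH))     # 25.77
```

Under this assumption the results agree with the 2565-base gene sequence, the 854-residue protein, and the listed centisome position. Using the smaller coordinate (1610593) instead gives 25.73, which suggests the record measures from the 5' end.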

GC content: 44.68

Gene sequence:

>2565_bases
ATGAAAATTAAAGTTAGTAATGTGAATACTCCGAACTGGAAAGAGGTTACAGTGAAGTCACGTATCCCGGCTGAGTTGGA
GAAATTGTCCGAGATTTCACGCAACATTTGGTGGGCATGGAATTTTGAAGCGACAGAACTATTTAGAGATCTAGATCCGG
AACTTTGGAAAGAATGTGGCCAGAACCCTGTGTTGTTGCTGGAACGTATGAGCTATGAGAAGCTGGAAGCGTTGGCAAAA
GACAAGGTGATTCTGAGGAGAATGAATGATGTTTATACAAAATTCAGAGATTACATGGATGTGAAGCCGGATGAAACCCG
TCCGTCTGTAGCTTATTTCAGCATGGAATATGGTTTGAGCAGCGTCCTGAAAATATATTCCGGTGGTCTGGGTGTATTGG
CCGGTGACTACTTGAAAGAAGCTTCTGACAGCAATGTAGATCTTTGTGCGGTAGGTTTCCTGTATCGTTACGGTTACTTT
ACTCAGACGTTGTCTATGGATGGACAGCAGATTGCCAACTACGAAGCGCAGAACTTCGGCCAGCTTCCTATCGACCGTGT
GATGGATGCGAATGGTCAGCCGATGGTGGTGGATGTTCCTTATCTGGATTATTATGTACATGCTAACGTATGGCGTGTAA
ATGTAGGACGTATTTCTTTGTATCTGCTGGATACAGATAATGAAATGAACAGCGAGTTCGACCGTCCTATTACTCATCAG
CTTTATGGTGGCGACTGGGAAAACCGTCTGAAACAGGAAATCCTGTTGGGTATCGGTGGTATCCTGACCCTGAAAGCATT
GGGTATCAAAAAAGATGTTTATCATTGTAACGAAGGACATGCTGCATTGATCAATGTGCAGCGTATCTGCGACTATGTAG
CTACCGGACTGACATTCGATCAGTCTATCGAGCTGGTTCGCGCTTCTTCTCTTTATACAGTTCATACTCCGGTTCCTGCC
GGTCACGACTACTTCGACGAAGGTTTGTTCGGTAAGTACATGGGTGGTTATCCTGCTAGAATGGGTATCAGCTGGGACGA
CCTGATGGATCTTGGACGTAACAATCCGGGTGACAAGGGCGAACGTTTCTGTATGTCGGTATTTGCCTGCAACACTTCTC
AGGAAGTAAACGGTGTAAGCTGGCTGCACGGAAAAGTTTCTCAGGAGATGTTCTCTACTATCTGGAAAGGTTACTTCCCC
GAAGAAATACATGTAGGTTATGTGACTAATGGTGTTCACTTCCCCACATGGAGTGCTACCGAATGGAAAGAACTGTACTT
TAAATATTTCAACGAGAACTTCTGGTACGACCAGTCGAATCCTAAGATTTGGGAAGCCATCTATAATGTACCCGATGAAG
AGATCTGGAAGACTCGTATGACGATGAAGAATAAGTTGGTGGATTATATCCGCAAATCATTCCGTGATACATGGTTGAAA
AATCAGGGAGATCCTTCGCGCATCGTTTCATTGATGGACAAGATTAACCCGAATGCGTTGCTGATTGGTTTCGGTCGTCG
TTTCGCTACTTACAAACGTGCGCACTTGTTGTTTACTGACTTGGAACGTCTTTCTAAGATTGTGAACAACCCCGATTATC
CGGTACAGTTCCTGTTTACAGGTAAGGCTCATCCGCACGATGGAGCAGGACAGGGTCTGATCAAACGTATTATCGAAATC
TCCCGTCGTCCGGAATTCCTGGGTAAGATTATCTTCCTCGAAAACTACGATATGCAGTTGGCGCGTCGTCTGGTTTCAGG
CGTTGATATCTGGTTGAACACTCCGACACGTCCGTTGGAAGCATCCGGTACATCAGGTGAAAAGGCTTTGATGAACGGTG
TTGTCAACTTCTCTGTATTGGACGGATGGTGGCTGGAAGGCTACCGTGAAGGTGCAGGATGGGCGTTGACTGAAAAACGT
ACTTATCAGAATCAGGAACATCAGGATCAGTTGGATGCTGCTACTATCTACAGTATTCTTGAAACAGAAATCCTGCCGTT
GTACTATGCTCGTAACAAGAAAGGCTACTCAGAAGGCTGGATCAAGGTAGTGAAGAATTCTATCGCTCAGATCGCTCCTC
ACTATACGATGAAACGCCAGTTGGACGACTACTACAATAAGTTCTACAATAAGTTGGCAAAACGTTTCCATATGCTGTCT
GCTAATGACAATGCAAAAGCAAAAGAAATTGCTGCATGGAAAGAAGAAGTCGTTGCCAAGTGGGATTCTATCGAAATCGT
ATCTTGCGACAAGCTGGAAGATTTGAAAGCCGGTGATATCGAAAGCGGAAAAGAATATACTATTACTTACGTAATCGATG
AAAAAGGCTTGAATGATGCTATAGGGCTTGAACTGGTAACTACTTATACAACTGCGGATGGTAAACAACACGTTTACTCT
GTAGAACCGTTCAGCGTTATCAAGAAAGAAGGCGACCTTTACACATTCCAGGTTAAACATAGCCTGTCAAATGCCGGTAG
CTTCAAGGTGTCTTACCGTATGTTCCCGAAGAATCCGGAACTTCCGCACCGTCAGGACTTCTGCTACGTGCGTTGGTTTA
TCTGA
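
The GC content field can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over the full gene sequence; `gc_content` is an illustrative name, not part of any database tooling.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks (as in the wrapped FASTA block) are ignored.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = cleaned.count("G") + cleaned.count("C")
    return 100.0 * gc / len(cleaned)


# Toy example; paste the full 2565-base sequence to check the record's value.
print(round(gc_content("ATGC"), 2))
```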

Upstream 100 bases:

>100_bases
CAATATTATTACGAGGCTTATGATATTGCCTTGCGCAATGCCATGAAGCGTCAGTTAGGCTAAAGAAATATTTTATTATC
AATAAGTAAACAAAAACATT

Downstream 100 bases:

>100_bases
TAGCTTGACGAAGTAGGGAAGTGACGCTTCCCTGCTTGACGTCCCTATATAAAAGAAGCCCCTGCAATTCGAGCGATCGA
AATTGCAGGGGCTATCTTTT

Product: alpha-glucan phosphorylase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 854; Mature: 854

Protein sequence:

>854_residues
MKIKVSNVNTPNWKEVTVKSRIPAELEKLSEISRNIWWAWNFEATELFRDLDPELWKECGQNPVLLLERMSYEKLEALAK
DKVILRRMNDVYTKFRDYMDVKPDETRPSVAYFSMEYGLSSVLKIYSGGLGVLAGDYLKEASDSNVDLCAVGFLYRYGYF
TQTLSMDGQQIANYEAQNFGQLPIDRVMDANGQPMVVDVPYLDYYVHANVWRVNVGRISLYLLDTDNEMNSEFDRPITHQ
LYGGDWENRLKQEILLGIGGILTLKALGIKKDVYHCNEGHAALINVQRICDYVATGLTFDQSIELVRASSLYTVHTPVPA
GHDYFDEGLFGKYMGGYPARMGISWDDLMDLGRNNPGDKGERFCMSVFACNTSQEVNGVSWLHGKVSQEMFSTIWKGYFP
EEIHVGYVTNGVHFPTWSATEWKELYFKYFNENFWYDQSNPKIWEAIYNVPDEEIWKTRMTMKNKLVDYIRKSFRDTWLK
NQGDPSRIVSLMDKINPNALLIGFGRRFATYKRAHLLFTDLERLSKIVNNPDYPVQFLFTGKAHPHDGAGQGLIKRIIEI
SRRPEFLGKIIFLENYDMQLARRLVSGVDIWLNTPTRPLEASGTSGEKALMNGVVNFSVLDGWWLEGYREGAGWALTEKR
TYQNQEHQDQLDAATIYSILETEILPLYYARNKKGYSEGWIKVVKNSIAQIAPHYTMKRQLDDYYNKFYNKLAKRFHMLS
ANDNAKAKEIAAWKEEVVAKWDSIEIVSCDKLEDLKAGDIESGKEYTITYVIDEKGLNDAIGLELVTTYTTADGKQHVYS
VEPFSVIKKEGDLYTFQVKHSLSNAGSFKVSYRMFPKNPELPHRQDFCYVRWFI
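
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. A minimal sketch of that translation follows, assuming the standard codon assignments (which the bacterial genetic code shares for sense and stop codons); the table is built from the conventional TCAG ordering, and `translate` is an illustrative helper.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE.get(dna[pos:pos + 3], "X")  # X = unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAAAATTAAAGTTAGTAATGTGAATACT"))  # MKIKVSNVNT
```

The output matches the first ten residues of the protein sequence, and the final bases of the gene (TGGTTTATCTGA) translate to WFI followed by a stop, matching its last three residues.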

Sequences:

>Translated_854_residues
MKIKVSNVNTPNWKEVTVKSRIPAELEKLSEISRNIWWAWNFEATELFRDLDPELWKECGQNPVLLLERMSYEKLEALAK
DKVILRRMNDVYTKFRDYMDVKPDETRPSVAYFSMEYGLSSVLKIYSGGLGVLAGDYLKEASDSNVDLCAVGFLYRYGYF
TQTLSMDGQQIANYEAQNFGQLPIDRVMDANGQPMVVDVPYLDYYVHANVWRVNVGRISLYLLDTDNEMNSEFDRPITHQ
LYGGDWENRLKQEILLGIGGILTLKALGIKKDVYHCNEGHAALINVQRICDYVATGLTFDQSIELVRASSLYTVHTPVPA
GHDYFDEGLFGKYMGGYPARMGISWDDLMDLGRNNPGDKGERFCMSVFACNTSQEVNGVSWLHGKVSQEMFSTIWKGYFP
EEIHVGYVTNGVHFPTWSATEWKELYFKYFNENFWYDQSNPKIWEAIYNVPDEEIWKTRMTMKNKLVDYIRKSFRDTWLK
NQGDPSRIVSLMDKINPNALLIGFGRRFATYKRAHLLFTDLERLSKIVNNPDYPVQFLFTGKAHPHDGAGQGLIKRIIEI
SRRPEFLGKIIFLENYDMQLARRLVSGVDIWLNTPTRPLEASGTSGEKALMNGVVNFSVLDGWWLEGYREGAGWALTEKR
TYQNQEHQDQLDAATIYSILETEILPLYYARNKKGYSEGWIKVVKNSIAQIAPHYTMKRQLDDYYNKFYNKLAKRFHMLS
ANDNAKAKEIAAWKEEVVAKWDSIEIVSCDKLEDLKAGDIESGKEYTITYVIDEKGLNDAIGLELVTTYTTADGKQHVYS
VEPFSVIKKEGDLYTFQVKHSLSNAGSFKVSYRMFPKNPELPHRQDFCYVRWFI
>Mature_854_residues
MKIKVSNVNTPNWKEVTVKSRIPAELEKLSEISRNIWWAWNFEATELFRDLDPELWKECGQNPVLLLERMSYEKLEALAK
DKVILRRMNDVYTKFRDYMDVKPDETRPSVAYFSMEYGLSSVLKIYSGGLGVLAGDYLKEASDSNVDLCAVGFLYRYGYF
TQTLSMDGQQIANYEAQNFGQLPIDRVMDANGQPMVVDVPYLDYYVHANVWRVNVGRISLYLLDTDNEMNSEFDRPITHQ
LYGGDWENRLKQEILLGIGGILTLKALGIKKDVYHCNEGHAALINVQRICDYVATGLTFDQSIELVRASSLYTVHTPVPA
GHDYFDEGLFGKYMGGYPARMGISWDDLMDLGRNNPGDKGERFCMSVFACNTSQEVNGVSWLHGKVSQEMFSTIWKGYFP
EEIHVGYVTNGVHFPTWSATEWKELYFKYFNENFWYDQSNPKIWEAIYNVPDEEIWKTRMTMKNKLVDYIRKSFRDTWLK
NQGDPSRIVSLMDKINPNALLIGFGRRFATYKRAHLLFTDLERLSKIVNNPDYPVQFLFTGKAHPHDGAGQGLIKRIIEI
SRRPEFLGKIIFLENYDMQLARRLVSGVDIWLNTPTRPLEASGTSGEKALMNGVVNFSVLDGWWLEGYREGAGWALTEKR
TYQNQEHQDQLDAATIYSILETEILPLYYARNKKGYSEGWIKVVKNSIAQIAPHYTMKRQLDDYYNKFYNKLAKRFHMLS
ANDNAKAKEIAAWKEEVVAKWDSIEIVSCDKLEDLKAGDIESGKEYTITYVIDEKGLNDAIGLELVTTYTTADGKQHVYS
VEPFSVIKKEGDLYTFQVKHSLSNAGSFKVSYRMFPKNPELPHRQDFCYVRWFI

Specific function: Phosphorylase is an important allosteric enzyme in carbohydrate metabolism. Enzymes from different sources differ in their regulatory mechanisms and in their natural substrates. However, all known phosphorylases share catalytic and structural properties.

COG id: COG0058

COG function: function code G; Glucan phosphorylase

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the glycogen phosphorylase family [H]

Homologues:

Organism=Caenorhabditis elegans, GI32566204, Length=495, Percent_Identity=24.040404040404, Blast_Score=74, Evalue=3e-13,
Organism=Caenorhabditis elegans, GI17564550, Length=495, Percent_Identity=24.040404040404, Blast_Score=74, Evalue=4e-13,
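
Each homologue line is a comma-separated list of `key=value` fields plus a bare GI identifier. A minimal sketch of how such a line might be parsed is shown below; the function name and the choice to store the identifier under a `"GI"` key are assumptions made for illustration.

```python
def parse_homologue(line: str) -> dict:
    """Split one homologue line into a dict of string fields."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            # The GI identifier has no key in the record, so one is assigned here.
            record["GI"] = field[2:]
    return record


hit = parse_homologue(
    "Organism=Caenorhabditis elegans, GI32566204, Length=495, "
    "Percent_Identity=24.040404040404, Blast_Score=74, Evalue=3e-13,"
)
print(hit["Organism"], hit["GI"], float(hit["Evalue"]))
```

Values are kept as strings so that nothing is lost in conversion; numeric fields such as `Evalue` can be converted with `float` where needed.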

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR011834
- InterPro:   IPR000811 [H]

Pfam domain/function: PF00343 Phosphorylase [H]

EC number: 2.4.1.1 [H]

Molecular weight: Translated: 98543; Mature: 98543

Theoretical pI: Translated: 6.26; Mature: 6.26

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.9 %Cys     (Translated Protein)
2.6 %Met     (Translated Protein)
3.5 %Cys+Met (Translated Protein)
0.9 %Cys     (Mature Protein)
2.6 %Met     (Mature Protein)
3.5 %Cys+Met (Mature Protein)
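
The Cys/Met percentages can be recomputed from the protein sequence. The sketch below assumes each figure is the count of the named residues divided by the total number of residues, rounded to one decimal place; `residue_percent` is an illustrative name.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    cleaned = "".join(protein.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    count = sum(cleaned.count(r) for r in residues.upper())
    return round(100.0 * count / len(cleaned), 1)


# Toy example; pass the 854-residue sequence to compare with the record.
toy = "MKCA"
print(residue_percent(toy, "C"), residue_percent(toy, "M"), residue_percent(toy, "CM"))
```

Because the translated and mature proteins in this record are identical, both sets of percentages would come out the same.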

Secondary structure:

>Translated Secondary Structure
MKIKVSNVNTPNWKEVTVKSRIPAELEKLSEISRNIWWAWNFEATELFRDLDPELWKECG
CEEEECCCCCCCCCEEEECCCCCHHHHHHHHHHCCEEEEECCHHHHHHHHCCHHHHHHHC
QNPVLLLERMSYEKLEALAKDKVILRRMNDVYTKFRDYMDVKPDETRPSVAYFSMEYGLS
CCCEEEHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEHHHHHH
SVLKIYSGGLGVLAGDYLKEASDSNVDLCAVGFLYRYGYFTQTLSMDGQQIANYEAQNFG
HHHHHHCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHCCCCCCCC
QLPIDRVMDANGQPMVVDVPYLDYYVHANVWRVNVGRISLYLLDTDNEMNSEFDRPITHQ
CCCHHHHHCCCCCEEEEECCCEEEEEEEEEEEEEECEEEEEEEECCCCCCHHHCCCCHHH
LYGGDWENRLKQEILLGIGGILTLKALGIKKDVYHCNEGHAALINVQRICDYVATGLTFD
HCCCCHHHHHHHHHHHHCCHHHHHHHHCCCHHHEECCCCCEEEEEHHHHHHHHHHCCCCH
QSIELVRASSLYTVHTPVPAGHDYFDEGLFGKYMGGYPARMGISWDDLMDLGRNNPGDKG
HHHHEEEECCEEEEECCCCCCCCHHHCCCCHHHCCCCCCCCCCCHHHHHHHCCCCCCCHH
ERFCMSVFACNTSQEVNGVSWLHGKVSQEMFSTIWKGYFPEEIHVGYVTNGVHFPTWSAT
HHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEEEECCEECCCCCCH
EWKELYFKYFNENFWYDQSNPKIWEAIYNVPDEEIWKTRMTMKNKLVDYIRKSFRDTWLK
HHHHHHHHHHCCCCEEECCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCC
NQGDPSRIVSLMDKINPNALLIGFGRRFATYKRAHLLFTDLERLSKIVNNPDYPVQFLFT
CCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEE
GKAHPHDGAGQGLIKRIIEISRRPEFLGKIIFLENYDMQLARRLVSGVDIWLNTPTRPLE
CCCCCCCCCCHHHHHHHHHHHCCHHHHEEEEEEECCCHHHHHHHHCCCEEEECCCCCCCC
ASGTSGEKALMNGVVNFSVLDGWWLEGYREGAGWALTEKRTYQNQEHQDQLDAATIYSIL
CCCCCCHHHHHHCCEEEEEECCHHHHCEECCCCEEEECHHHHCCCHHHHHHHHHHHHHHH
ETEILPLYYARNKKGYSEGWIKVVKNSIAQIAPHYTMKRQLDDYYNKFYNKLAKRFHMLS
HHHHHEEEEECCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCC
ANDNAKAKEIAAWKEEVVAKWDSIEIVSCDKLEDLKAGDIESGKEYTITYVIDEKGLNDA
CCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCCCCCC
IGLELVTTYTTADGKQHVYSVEPFSVIKKEGDLYTFQVKHSLSNAGSFKVSYRMFPKNPE
CCEEEEEEEECCCCCCCEEECCHHHHHHCCCCEEEEEEEECCCCCCCEEEEEEECCCCCC
LPHRQDFCYVRWFI
CCCCCCEEEEEEEC
>Mature Secondary Structure
MKIKVSNVNTPNWKEVTVKSRIPAELEKLSEISRNIWWAWNFEATELFRDLDPELWKECG
CEEEECCCCCCCCCEEEECCCCCHHHHHHHHHHCCEEEEECCHHHHHHHHCCHHHHHHHC
QNPVLLLERMSYEKLEALAKDKVILRRMNDVYTKFRDYMDVKPDETRPSVAYFSMEYGLS
CCCEEEHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEHHHHHH
SVLKIYSGGLGVLAGDYLKEASDSNVDLCAVGFLYRYGYFTQTLSMDGQQIANYEAQNFG
HHHHHHCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHCCCCCCCC
QLPIDRVMDANGQPMVVDVPYLDYYVHANVWRVNVGRISLYLLDTDNEMNSEFDRPITHQ
CCCHHHHHCCCCCEEEEECCCEEEEEEEEEEEEEECEEEEEEEECCCCCCHHHCCCCHHH
LYGGDWENRLKQEILLGIGGILTLKALGIKKDVYHCNEGHAALINVQRICDYVATGLTFD
HCCCCHHHHHHHHHHHHCCHHHHHHHHCCCHHHEECCCCCEEEEEHHHHHHHHHHCCCCH
QSIELVRASSLYTVHTPVPAGHDYFDEGLFGKYMGGYPARMGISWDDLMDLGRNNPGDKG
HHHHEEEECCEEEEECCCCCCCCHHHCCCCHHHCCCCCCCCCCCHHHHHHHCCCCCCCHH
ERFCMSVFACNTSQEVNGVSWLHGKVSQEMFSTIWKGYFPEEIHVGYVTNGVHFPTWSAT
HHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEEEECCEECCCCCCH
EWKELYFKYFNENFWYDQSNPKIWEAIYNVPDEEIWKTRMTMKNKLVDYIRKSFRDTWLK
HHHHHHHHHHCCCCEEECCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCC
NQGDPSRIVSLMDKINPNALLIGFGRRFATYKRAHLLFTDLERLSKIVNNPDYPVQFLFT
CCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEE
GKAHPHDGAGQGLIKRIIEISRRPEFLGKIIFLENYDMQLARRLVSGVDIWLNTPTRPLE
CCCCCCCCCCHHHHHHHHHHHCCHHHHEEEEEEECCCHHHHHHHHCCCEEEECCCCCCCC
ASGTSGEKALMNGVVNFSVLDGWWLEGYREGAGWALTEKRTYQNQEHQDQLDAATIYSIL
CCCCCCHHHHHHCCEEEEEECCHHHHCEECCCCEEEECHHHHCCCHHHHHHHHHHHHHHH
ETEILPLYYARNKKGYSEGWIKVVKNSIAQIAPHYTMKRQLDDYYNKFYNKLAKRFHMLS
HHHHHEEEEECCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCC
ANDNAKAKEIAAWKEEVVAKWDSIEIVSCDKLEDLKAGDIESGKEYTITYVIDEKGLNDA
CCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCCCCCC
IGLELVTTYTTADGKQHVYSVEPFSVIKKEGDLYTFQVKHSLSNAGSFKVSYRMFPKNPE
CCEEEEEEEECCCCCCCEEECCHHHHHHCCCCEEEEEEEECCCCCCCEEEEEEECCCCCC
LPHRQDFCYVRWFI
CCCCCCEEEEEEEC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 12788972 [H]