Definition Streptococcus pneumoniae Taiwan19F-14, complete genome.
Accession NC_012469
Length 2,112,148

Click here to switch to the map view.

The map label for this gene is malP [H]

Identifier: 225861921

GI number: 225861921

Start: 1966586

End: 1968844

Strand: Reverse

Name: malP [H]

Synonym: SPT_2114

Alternate gene names: 225861921

Gene position: 1968844-1966586 (Counterclockwise)

Preceding gene: 225861922

Following gene: 225861920

Centisome position: 93.22

GC content: 41.39

Gene sequence:

>2259_bases
ATGTTATCACTACAAGAATTTGTACAAAATCGTTACAATAAAACCATTGCAGAATGTAGCAATGAAGAGCTTTACCTTGC
TCTTCTTAACTACAGCAAGCTTGCAAGCAGCCAAAAACCAGTCAACACTGGTAAGAAAAAAGTTTACTACATCTCAGCTG
AGTTCTTGATTGGTAAACTCTTGTCAAACAACTTGATTAACCTTGGTCTTTACGACGATGTTAAAAAAGAACTTGCAGCT
GCAGGTAAAGACTTGATCGAAGTTGAAGAAGTTGAATTGGAACCATCTCTTGGTAATGGTGGTTTGGGACGTTTGGCTGC
CTGCTTTATCGACTCAATTGCTACTCTTGGTTTGAATGGTGACGGTGTTGGTCTTAACTACCACTTTGGTCTTTTCCAAC
AAGTTCTTAAAAACAACCAACAAGAAACAATTCCAAATGCATGGTTGACAGAGCAAAACTGGTTGGTTCGCTCAAGCCGT
AGCTACCAAGTACCATTTGCAGACTTTACTTTGACATCAACTCTTTACGATATTGATGTTACTGGTTATGAAACAGCGAC
TAAAAACCGCTTGCGTTTGTTTGACTTGGATTCAGTTGATTCTTCTATTATTAAAGATGGTATCAACTTTGACAAGACAG
ATATCGCTCGCAACTTAACTCTCTTCCTTTACCCAGATGATAGTGACCGTCAAGGTGAATTGCTCCGTATCTTCCAACAA
TACTTCATGGTTTCAAACGGTGCGCAATTGATCATCGACGAAGCAATCGAAAAAGGAAGCAACTTGCATGACCTTGCTGA
CTACGCAGTTGTCCAAATCAACGATACTCACCCATCAATGGTGATTCCTGAATTGATTCGTCTTTTGACTGCACGTGGTA
TCGAGCTTGACGAAGCAATCTCAATTGTTCGTAGCATGACTGCCTACACTAACCACACAATCCTTGCTGAGGCGCTTGAA
AAATGGCCTCTTGAATTCTTGCAAGAAGTGGTTCCTCACTTGGTACCAATCATCGAAGAATTGGACCGTCGTGTGAAGGC
AGAGTACAAAGATCCAGCTGTTCAAATCATCGATGAGAGCGGACGTGTTCACATGGCTCACATGGATATCCACTACGGAT
ACAGTGTTAACGGGGTTGCAGCACTCCATACTGAAATCTTGAAAAATTCTGAGTTGAAAGCCTTCTACGACCTTTACCCA
GAAAAGTTCAACAACAAAACAAACGGTATCACTTTCCGTCGTTGGCTTATGCATGCTAACCCAAGATTGTCTCACTACTT
GGATGAGATTCTTGGAGATGGTTGGCACCATGAAGCAGATGAGCTTGAAAAACTGTTGTCTTATGAAGACAAAGCAGCTG
TCAAAGAAAAATTGGAAAGCATCAAGGCTCACAACAAACGTAAATTGGCTCGTCACTTGAAAGAACACCAAGGTGTGGAA
ATCAATCCAAATTCTATCTTTGATATCCAAATCAAACGTCTTCACGAGTACAAACGCCAACAAATGAACGCTTTGTACGT
GATCCACAAATACCTTGACATCAAAGCTGGTAACATCCCTGCTCGTCCAATCACAATCTTCTTTGGTGGTAAAGCAGCTC
CAGCCTACACAATCGCTCAAGATATCATTCACTTGATCCTTTGCATGTCAGAAGTTATTGCTAACGATCCAGCAGTAGCT
CCACACTTGCAAGTAGTTATGGTTGAAAACTACAACGTTACTGCAGCAAGCTTCCTTATCCCAGCATGTGATATCTCAGA
ACAAATCTCACTTGCTTCTAAAGAAGCTTCAGGTACTGGTAACATGAAATTCATGTTGAACGGAGCTTTGACACTTGGTA
CTATGGACGGTGCTAACGTGGAAATCGCTGAGTTGGTTGGAGAAGAAAACATCTACATCTTCGGTGAAGATTCAGAAACT
GTTATCGACCTTTACGCAAAAGCAGCTTACAAATCAAGCGAATTCTACGCTCGTGAAGCTATCAAACCATTGGTTGACTT
CATCGTTAGTGATGCAGTTCTTGCAGCTGGAAACAAAGAGCGCTTGGAACGTCTTTACAATGAATTGATCAACAAAGACT
GGTTCATGACTCTTCTTGACTTGGAAGACTACATCAAAGTCAAAGAGCAAATGCTTGCTGACTACGAAGACCGTGACGCA
TGGTTGGATAAAGTCATCGTTAACATTTCTAAAGCAGGATTCTTCTCATCTGACCGTACAATCGCTCAGTATAACGAAGA
CATCTGGCACTTGAACTAA

Upstream 100 bases:

>100_bases
GTCGAGGAAGGTTTGCTTGACTTGACAACAATTTATCGCCGAATTAATGAAAATTTGGTAGATTTAAAGAAATAAGACAA
TAATCAGGAGACAACTAAAC

Downstream 100 bases:

>100_bases
TACTCTTCGAAAATCTCTTCAAACTACGTCAGCTTTATCTGCAACCTCAAAGCGGTGCTTTGAGCAACCTGCGGCTAGCT
TCCTAGTTTGCTCTTTGATT

Product: glycogen phosphorylase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 752; Mature: 752

Protein sequence:

>752_residues
MLSLQEFVQNRYNKTIAECSNEELYLALLNYSKLASSQKPVNTGKKKVYYISAEFLIGKLLSNNLINLGLYDDVKKELAA
AGKDLIEVEEVELEPSLGNGGLGRLAACFIDSIATLGLNGDGVGLNYHFGLFQQVLKNNQQETIPNAWLTEQNWLVRSSR
SYQVPFADFTLTSTLYDIDVTGYETATKNRLRLFDLDSVDSSIIKDGINFDKTDIARNLTLFLYPDDSDRQGELLRIFQQ
YFMVSNGAQLIIDEAIEKGSNLHDLADYAVVQINDTHPSMVIPELIRLLTARGIELDEAISIVRSMTAYTNHTILAEALE
KWPLEFLQEVVPHLVPIIEELDRRVKAEYKDPAVQIIDESGRVHMAHMDIHYGYSVNGVAALHTEILKNSELKAFYDLYP
EKFNNKTNGITFRRWLMHANPRLSHYLDEILGDGWHHEADELEKLLSYEDKAAVKEKLESIKAHNKRKLARHLKEHQGVE
INPNSIFDIQIKRLHEYKRQQMNALYVIHKYLDIKAGNIPARPITIFFGGKAAPAYTIAQDIIHLILCMSEVIANDPAVA
PHLQVVMVENYNVTAASFLIPACDISEQISLASKEASGTGNMKFMLNGALTLGTMDGANVEIAELVGEENIYIFGEDSET
VIDLYAKAAYKSSEFYAREAIKPLVDFIVSDAVLAAGNKERLERLYNELINKDWFMTLLDLEDYIKVKEQMLADYEDRDA
WLDKVIVNISKAGFFSSDRTIAQYNEDIWHLN

Sequences:

>Translated_752_residues
MLSLQEFVQNRYNKTIAECSNEELYLALLNYSKLASSQKPVNTGKKKVYYISAEFLIGKLLSNNLINLGLYDDVKKELAA
AGKDLIEVEEVELEPSLGNGGLGRLAACFIDSIATLGLNGDGVGLNYHFGLFQQVLKNNQQETIPNAWLTEQNWLVRSSR
SYQVPFADFTLTSTLYDIDVTGYETATKNRLRLFDLDSVDSSIIKDGINFDKTDIARNLTLFLYPDDSDRQGELLRIFQQ
YFMVSNGAQLIIDEAIEKGSNLHDLADYAVVQINDTHPSMVIPELIRLLTARGIELDEAISIVRSMTAYTNHTILAEALE
KWPLEFLQEVVPHLVPIIEELDRRVKAEYKDPAVQIIDESGRVHMAHMDIHYGYSVNGVAALHTEILKNSELKAFYDLYP
EKFNNKTNGITFRRWLMHANPRLSHYLDEILGDGWHHEADELEKLLSYEDKAAVKEKLESIKAHNKRKLARHLKEHQGVE
INPNSIFDIQIKRLHEYKRQQMNALYVIHKYLDIKAGNIPARPITIFFGGKAAPAYTIAQDIIHLILCMSEVIANDPAVA
PHLQVVMVENYNVTAASFLIPACDISEQISLASKEASGTGNMKFMLNGALTLGTMDGANVEIAELVGEENIYIFGEDSET
VIDLYAKAAYKSSEFYAREAIKPLVDFIVSDAVLAAGNKERLERLYNELINKDWFMTLLDLEDYIKVKEQMLADYEDRDA
WLDKVIVNISKAGFFSSDRTIAQYNEDIWHLN
>Mature_752_residues
MLSLQEFVQNRYNKTIAECSNEELYLALLNYSKLASSQKPVNTGKKKVYYISAEFLIGKLLSNNLINLGLYDDVKKELAA
AGKDLIEVEEVELEPSLGNGGLGRLAACFIDSIATLGLNGDGVGLNYHFGLFQQVLKNNQQETIPNAWLTEQNWLVRSSR
SYQVPFADFTLTSTLYDIDVTGYETATKNRLRLFDLDSVDSSIIKDGINFDKTDIARNLTLFLYPDDSDRQGELLRIFQQ
YFMVSNGAQLIIDEAIEKGSNLHDLADYAVVQINDTHPSMVIPELIRLLTARGIELDEAISIVRSMTAYTNHTILAEALE
KWPLEFLQEVVPHLVPIIEELDRRVKAEYKDPAVQIIDESGRVHMAHMDIHYGYSVNGVAALHTEILKNSELKAFYDLYP
EKFNNKTNGITFRRWLMHANPRLSHYLDEILGDGWHHEADELEKLLSYEDKAAVKEKLESIKAHNKRKLARHLKEHQGVE
INPNSIFDIQIKRLHEYKRQQMNALYVIHKYLDIKAGNIPARPITIFFGGKAAPAYTIAQDIIHLILCMSEVIANDPAVA
PHLQVVMVENYNVTAASFLIPACDISEQISLASKEASGTGNMKFMLNGALTLGTMDGANVEIAELVGEENIYIFGEDSET
VIDLYAKAAYKSSEFYAREAIKPLVDFIVSDAVLAAGNKERLERLYNELINKDWFMTLLDLEDYIKVKEQMLADYEDRDA
WLDKVIVNISKAGFFSSDRTIAQYNEDIWHLN

Specific function: Phosphorylase is an important allosteric enzyme in carbohydrate metabolism. Enzymes from different sources differ in their regulatory mechanisms and in their natural substrates. However, all known phosphorylases share catalytic and structural properties [

COG id: COG0058

COG function: function code G; Glucan phosphorylase

Gene ontology:

Cell location: Cytoplasm [C]

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the glycogen phosphorylase family [H]

Homologues:

Organism=Homo sapiens, GI5032009, Length=750, Percent_Identity=41.0666666666667, Blast_Score=494, Evalue=1e-139,
Organism=Homo sapiens, GI71037379, Length=755, Percent_Identity=39.8675496688742, Blast_Score=469, Evalue=1e-132,
Organism=Homo sapiens, GI21361370, Length=753, Percent_Identity=39.1766268260292, Blast_Score=465, Evalue=1e-131,
Organism=Homo sapiens, GI255653002, Length=695, Percent_Identity=38.9928057553957, Blast_Score=446, Evalue=1e-125,
Organism=Homo sapiens, GI257900462, Length=604, Percent_Identity=42.3841059602649, Blast_Score=438, Evalue=1e-123,
Organism=Escherichia coli, GI48994936, Length=793, Percent_Identity=39.21815889029, Blast_Score=539, Evalue=1e-154,
Organism=Escherichia coli, GI2367228, Length=758, Percent_Identity=40.2374670184697, Blast_Score=505, Evalue=1e-144,
Organism=Caenorhabditis elegans, GI32566204, Length=789, Percent_Identity=38.4030418250951, Blast_Score=508, Evalue=1e-144,
Organism=Caenorhabditis elegans, GI17564550, Length=789, Percent_Identity=38.4030418250951, Blast_Score=507, Evalue=1e-144,
Organism=Saccharomyces cerevisiae, GI6325418, Length=797, Percent_Identity=35.2572145545797, Blast_Score=404, Evalue=1e-113,
Organism=Drosophila melanogaster, GI78706832, Length=778, Percent_Identity=39.4601542416452, Blast_Score=496, Evalue=1e-140,
Organism=Drosophila melanogaster, GI24581010, Length=778, Percent_Identity=39.4601542416452, Blast_Score=496, Evalue=1e-140,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR011833
- InterPro:   IPR000811 [H]

Pfam domain/function: PF00343 Phosphorylase [H]

EC number: =2.4.1.1 [H]

Molecular weight: Translated: 85155; Mature: 85155

Theoretical pI: Translated: 4.81; Mature: 4.81

Prosite motif: PS00102 PHOSPHORYLASE

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.5 %Cys     (Translated Protein)
2.0 %Met     (Translated Protein)
2.5 %Cys+Met (Translated Protein)
0.5 %Cys     (Mature Protein)
2.0 %Met     (Mature Protein)
2.5 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MLSLQEFVQNRYNKTIAECSNEELYLALLNYSKLASSQKPVNTGKKKVYYISAEFLIGKL
CCCHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHCCCCCCCCCCCEEEEEEHHHHHHHH
LSNNLINLGLYDDVKKELAAAGKDLIEVEEVELEPSLGNGGLGRLAACFIDSIATLGLNG
HHCCCEEECCHHHHHHHHHHCCCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCC
DGVGLNYHFGLFQQVLKNNQQETIPNAWLTEQNWLVRSSRSYQVPFADFTLTSTLYDIDV
CCCEEEEHHHHHHHHHHCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCEEEEHEEEEEEE
TGYETATKNRLRLFDLDSVDSSIIKDGINFDKTDIARNLTLFLYPDDSDRQGELLRIFQQ
CCCCCCCCCCEEEEECCCCHHHHHHCCCCCCHHHHHCCCEEEEECCCCCCCHHHHHHHHH
YFMVSNGAQLIIDEAIEKGSNLHDLADYAVVQINDTHPSMVIPELIRLLTARGIELDEAI
HHHHCCCCHHEEHHHHHCCCCHHHHCCEEEEEEECCCCCHHHHHHHHHHHHCCCCHHHHH
SIVRSMTAYTNHTILAEALEKWPLEFLQEVVPHLVPIIEELDRRVKAEYKDPAVQIIDES
HHHHHHHHHHCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECC
GRVHMAHMDIHYGYSVNGVAALHTEILKNSELKAFYDLYPEKFNNKTNGITFRRWLMHAN
CCEEEEEEEEEECCCCCCHHHHHHHHHCCCCCHHHHHHCHHHHCCCCCCCHHHHHHHHCC
PRLSHYLDEILGDGWHHEADELEKLLSYEDKAAVKEKLESIKAHNKRKLARHLKEHQGVE
CHHHHHHHHHHCCCCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCE
INPNSIFDIQIKRLHEYKRQQMNALYVIHKYLDIKAGNIPARPITIFFGGKAAPAYTIAQ
ECCCCEEEEHHHHHHHHHHHHHHHHHEEHHHHHEECCCCCCCEEEEEECCCCCCHHHHHH
DIIHLILCMSEVIANDPAVAPHLQVVMVENYNVTAASFLIPACDISEQISLASKEASGTG
HHHHHHHHHHHHHCCCCCCCCCEEEEEEECCCEEHHHHEECCCCCHHHHHHHHCCCCCCC
NMKFMLNGALTLGTMDGANVEIAELVGEENIYIFGEDSETVIDLYAKAAYKSSEFYAREA
CEEEEEECEEEEECCCCCCEEHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCHHHHHHHH
IKPLVDFIVSDAVLAAGNKERLERLYNELINKDWFMTLLDLEDYIKVKEQMLADYEDRDA
HHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCEEHHHHHHHHHHHHHHHHCCCCHHHH
WLDKVIVNISKAGFFSSDRTIAQYNEDIWHLN
HHHHHHHHHHHCCCCCCCCCHHHHCCCCCCCC
>Mature Secondary Structure
MLSLQEFVQNRYNKTIAECSNEELYLALLNYSKLASSQKPVNTGKKKVYYISAEFLIGKL
CCCHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHCCCCCCCCCCCEEEEEEHHHHHHHH
LSNNLINLGLYDDVKKELAAAGKDLIEVEEVELEPSLGNGGLGRLAACFIDSIATLGLNG
HHCCCEEECCHHHHHHHHHHCCCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCC
DGVGLNYHFGLFQQVLKNNQQETIPNAWLTEQNWLVRSSRSYQVPFADFTLTSTLYDIDV
CCCEEEEHHHHHHHHHHCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCEEEEHEEEEEEE
TGYETATKNRLRLFDLDSVDSSIIKDGINFDKTDIARNLTLFLYPDDSDRQGELLRIFQQ
CCCCCCCCCCEEEEECCCCHHHHHHCCCCCCHHHHHCCCEEEEECCCCCCCHHHHHHHHH
YFMVSNGAQLIIDEAIEKGSNLHDLADYAVVQINDTHPSMVIPELIRLLTARGIELDEAI
HHHHCCCCHHEEHHHHHCCCCHHHHCCEEEEEEECCCCCHHHHHHHHHHHHCCCCHHHHH
SIVRSMTAYTNHTILAEALEKWPLEFLQEVVPHLVPIIEELDRRVKAEYKDPAVQIIDES
HHHHHHHHHHCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECC
GRVHMAHMDIHYGYSVNGVAALHTEILKNSELKAFYDLYPEKFNNKTNGITFRRWLMHAN
CCEEEEEEEEEECCCCCCHHHHHHHHHCCCCCHHHHHHCHHHHCCCCCCCHHHHHHHHCC
PRLSHYLDEILGDGWHHEADELEKLLSYEDKAAVKEKLESIKAHNKRKLARHLKEHQGVE
CHHHHHHHHHHCCCCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCE
INPNSIFDIQIKRLHEYKRQQMNALYVIHKYLDIKAGNIPARPITIFFGGKAAPAYTIAQ
ECCCCEEEEHHHHHHHHHHHHHHHHHEEHHHHHEECCCCCCCEEEEEECCCCCCHHHHHH
DIIHLILCMSEVIANDPAVAPHLQVVMVENYNVTAASFLIPACDISEQISLASKEASGTG
HHHHHHHHHHHHHCCCCCCCCCEEEEEEECCCEEHHHHEECCCCCHHHHHHHHCCCCCCC
NMKFMLNGALTLGTMDGANVEIAELVGEENIYIFGEDSETVIDLYAKAAYKSSEFYAREA
CEEEEEECEEEEECCCCCCEEHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCHHHHHHHH
IKPLVDFIVSDAVLAAGNKERLERLYNELINKDWFMTLLDLEDYIKVKEQMLADYEDRDA
HHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCEEHHHHHHHHHHHHHHHHCCCCHHHH
WLDKVIVNISKAGFFSSDRTIAQYNEDIWHLN
HHHHHHHHHHHCCCCCCCCCHHHHCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 11463916; 6297760 [H]