Definition: Prochlorococcus marinus str. NATL1A, complete genome.
Accession: NC_008819
Length: 1,864,731

The map label for this gene is glyA

Identifier: 124025049

GI number: 124025049

Start: 310495

End: 311730

Strand: Reverse

Name: glyA

Synonym: NATL1_03361

Alternate gene names: 124025049

Gene position: 311730-310495 (Counterclockwise)

Preceding gene: 124025052

Following gene: 124025048

Centisome position: 16.72
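
The gene length and centisome position can be re-derived from the coordinates listed above. The following is a minimal sketch, not the database's own method: it assumes the centisome position is 100 times the gene's first listed coordinate (311730, the 5' end of this reverse-strand gene) divided by the genome length. That definition is an assumption; it happens to reproduce the listed value. The function names are illustrative.

```python
# Coordinates copied from the record (1-based, inclusive).
GENOME_LENGTH = 1_864_731
START, END = 310_495, 311_730


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of genome length (assumed definition)."""
    return 100.0 * position / genome_length


length = gene_length(START, END)
# The gene lies on the reverse strand, so its 5' end is the larger coordinate.
position = centisome(END, GENOME_LENGTH)
print(length, round(position, 2))  # 1236 16.72
```

Using the smaller coordinate (310495) instead gives about 16.65, so the listed 16.72 is consistent with the 5' end being used.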

GC content: 38.27

Gene sequence:

>1236_bases
ATGAAATGTGATCCAAGTATTGCGAAATTAATAAACAATGAATTATCAAGACAAGAAACTCATTTAGAGCTTATCGCAAG
TGAGAATTTTGCCTCTAAGGCCGTAATGGAAGCCCAAGGATCAGTCCTAACAAATAAATATGCTGAAGGTCTCCCTAACA
AACGCTATTACGGAGGATGTGAGTATATCGACGGAATTGAGCAACTAGCAATAGATAGAGCAAAAAACCTTTTTGGGGCC
AACTGGGCAAACGTCCAACCTCACAGCGGAGCTCAAGCTAACTTTGCAGTTTTCCTTAGCCTTCTAAAGCCGGGGGACAC
AATTATGGGAATGGACTTATCTCATGGAGGTCACCTCACTCATGGTTCACCTGTAAATGTAAGCGGCAAATGGTTTAAAA
CTTGCCATTACGAAGTTGATAAAAAGACTGAAATGCTCGATATGGATGCAATAAGAAAAAAAGCAATTGAAAATCAACCT
AAATTGATTATCTGTGGATTCTCTGCCTATCCTCGAAAAATTGACTTCAAAGCTTTCAGATCAATAGCTGATGAGGTAAA
TGCTTATTTATTAGCTGATATTGCTCATATTGCTGGTTTAGTAGCAAGTGGACTTCACCCAAGTCCAATCCCATATTGTG
ATGTAGTTACAACAACCACTCACAAAACTCTTAGAGGGCCAAGGGGTGGACTAATCCTCTCAAAAGATGAGGAGATAGGA
AAAAAACTTGATAAAGCAGTATTTCCTGGCACCCAAGGAGGTCCTTTAGAACATGTAATCGCAGCCAAGGCTGTTGCATT
CCAAGAAGCTTCTGCACCCGAATTCAAGATTTATAGCCAAAAAGTAATCTCAAATGCACAAGTTCTTTCTAATCAACTTC
AAAAAAGAGGAATTTCAATTGTAAGCAAAGGAACTGACAATCATATAGTTCTTCTTGACCTTAGAAGCATTGGTATGACA
GGTAAAGTTGCTGATCAATTAGTAAGTGATATTAAAATAACCGCGAACAAAAACACTGTACCTTTTGACCCCGAGTCCCC
ATTTGTTACTAGTGGCCTAAGGCTAGGTTCAGCAGCCCTTACGACTAGAGGTTTTAATGAACAAGCCTTTGAAGATGTTG
GTAATATCATTGCAGATAGACTACTTAACCCTAACGATGAAGATATAAAGGAAAATTCAATCAATAAAGTATCTGAACTT
TGCAATAAGTTTCCTTTATATAGTGAAAACATCTAA
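
The GC content listed above (38.27) is the percentage of G and C bases in this gene sequence. A minimal sketch of that calculation follows; `gc_percent` is an illustrative helper name, and the demonstration string is only the first 30 bases of the gene, not the full sequence.

```python
def gc_percent(fasta_text: str) -> float:
    """Percentage of G and C bases, ignoring '>' header lines and whitespace."""
    bases = "".join(
        line.strip().upper()
        for line in fasta_text.splitlines()
        if not line.startswith(">")
    )
    if not bases:
        raise ValueError("no sequence data found")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# Only the first 30 bases of the gene, as a small demonstration.
example = ">first_30_bases\nATGAAATGTGATCCAAGTATTGCGAAATTA\n"
print(round(gc_percent(example), 2))  # 30.0
```

Applied to the full 1236-base sequence, a value of 38.27 would correspond to 473 G or C bases (473 / 1236 ≈ 0.3827).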

Upstream 100 bases:

>100_bases
ATAAACTGAAAATAAGAGAAAGTTTTCTCTCATTGGACTAAACATATGCTTACTATCACATCATAAATAAAAGAATAAAG
TGACTATTGAATCTTCTTTG

Downstream 100 bases:

>100_bases
AGAAATTACTATTAACAAAAAAAATAAATTTGGAGCAGAGATTTTATGCATTGGAAGTGAAATACTTCTAGGAAATATTG
TAAATACAAATTCTCAATGG

Product: serine hydroxymethyltransferase

Products: NA

Alternate protein names: SHMT; Serine methylase

Number of amino acids: Translated: 411; Mature: 411

Protein sequence:

>411_residues
MKCDPSIAKLINNELSRQETHLELIASENFASKAVMEAQGSVLTNKYAEGLPNKRYYGGCEYIDGIEQLAIDRAKNLFGA
NWANVQPHSGAQANFAVFLSLLKPGDTIMGMDLSHGGHLTHGSPVNVSGKWFKTCHYEVDKKTEMLDMDAIRKKAIENQP
KLIICGFSAYPRKIDFKAFRSIADEVNAYLLADIAHIAGLVASGLHPSPIPYCDVVTTTTHKTLRGPRGGLILSKDEEIG
KKLDKAVFPGTQGGPLEHVIAAKAVAFQEASAPEFKIYSQKVISNAQVLSNQLQKRGISIVSKGTDNHIVLLDLRSIGMT
GKVADQLVSDIKITANKNTVPFDPESPFVTSGLRLGSAALTTRGFNEQAFEDVGNIIADRLLNPNDEDIKENSINKVSEL
CNKFPLYSENI
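
The protein sequence above can be checked against the gene sequence by translation. The sketch below assumes the standard codon assignments and translates only the first 30 and last 36 bases of the gene; `translate` and the constant names are illustrative, not part of the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_30 = "ATGAAATGTGATCCAAGTATTGCGAAATTA"
last_36 = "TGCAATAAGTTTCCTTTATATAGTGAAAACATCTAA"
print(translate(first_30))  # MKCDPSIAKL
print(translate(last_36))   # CNKFPLYSENI
print(1236 // 3 - 1)        # 411 residues: 412 codons minus the stop codon
```

Both fragments match the start and end of the 411-residue sequence, and the residue count agrees with 1236 bases ending in a TAA stop codon.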

Sequences:

>Translated_411_residues
MKCDPSIAKLINNELSRQETHLELIASENFASKAVMEAQGSVLTNKYAEGLPNKRYYGGCEYIDGIEQLAIDRAKNLFGA
NWANVQPHSGAQANFAVFLSLLKPGDTIMGMDLSHGGHLTHGSPVNVSGKWFKTCHYEVDKKTEMLDMDAIRKKAIENQP
KLIICGFSAYPRKIDFKAFRSIADEVNAYLLADIAHIAGLVASGLHPSPIPYCDVVTTTTHKTLRGPRGGLILSKDEEIG
KKLDKAVFPGTQGGPLEHVIAAKAVAFQEASAPEFKIYSQKVISNAQVLSNQLQKRGISIVSKGTDNHIVLLDLRSIGMT
GKVADQLVSDIKITANKNTVPFDPESPFVTSGLRLGSAALTTRGFNEQAFEDVGNIIADRLLNPNDEDIKENSINKVSEL
CNKFPLYSENI
>Mature_411_residues
MKCDPSIAKLINNELSRQETHLELIASENFASKAVMEAQGSVLTNKYAEGLPNKRYYGGCEYIDGIEQLAIDRAKNLFGA
NWANVQPHSGAQANFAVFLSLLKPGDTIMGMDLSHGGHLTHGSPVNVSGKWFKTCHYEVDKKTEMLDMDAIRKKAIENQP
KLIICGFSAYPRKIDFKAFRSIADEVNAYLLADIAHIAGLVASGLHPSPIPYCDVVTTTTHKTLRGPRGGLILSKDEEIG
KKLDKAVFPGTQGGPLEHVIAAKAVAFQEASAPEFKIYSQKVISNAQVLSNQLQKRGISIVSKGTDNHIVLLDLRSIGMT
GKVADQLVSDIKITANKNTVPFDPESPFVTSGLRLGSAALTTRGFNEQAFEDVGNIIADRLLNPNDEDIKENSINKVSEL
CNKFPLYSENI

Specific function: Interconversion of serine and glycine

COG id: COG0112

COG function: function code E; Glycine/serine hydroxymethyltransferase

Gene ontology:

Cell location: Cytoplasm

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the SHMT family

Homologues:

Organism=Homo sapiens, GI19923315, Length=397, Percent_Identity=47.8589420654912, Blast_Score=350, Evalue=1e-96,
Organism=Homo sapiens, GI261862352, Length=397, Percent_Identity=47.8589420654912, Blast_Score=350, Evalue=1e-96,
Organism=Homo sapiens, GI261862350, Length=397, Percent_Identity=47.8589420654912, Blast_Score=350, Evalue=1e-96,
Organism=Homo sapiens, GI261862348, Length=397, Percent_Identity=47.8589420654912, Blast_Score=350, Evalue=1e-96,
Organism=Homo sapiens, GI261862346, Length=397, Percent_Identity=47.103274559194, Blast_Score=337, Evalue=1e-92,
Organism=Homo sapiens, GI22547186, Length=397, Percent_Identity=46.8513853904282, Blast_Score=325, Evalue=4e-89,
Organism=Homo sapiens, GI22547189, Length=383, Percent_Identity=44.9086161879896, Blast_Score=293, Evalue=1e-79,
Organism=Escherichia coli, GI1788902, Length=407, Percent_Identity=56.7567567567568, Blast_Score=457, Evalue=1e-130,
Organism=Caenorhabditis elegans, GI25144729, Length=398, Percent_Identity=46.9849246231156, Blast_Score=360, Evalue=1e-100,
Organism=Caenorhabditis elegans, GI25144732, Length=398, Percent_Identity=46.9849246231156, Blast_Score=360, Evalue=1e-100,
Organism=Saccharomyces cerevisiae, GI6319739, Length=431, Percent_Identity=42.2273781902552, Blast_Score=345, Evalue=1e-95,
Organism=Saccharomyces cerevisiae, GI6323087, Length=397, Percent_Identity=42.8211586901763, Blast_Score=325, Evalue=6e-90,
Organism=Drosophila melanogaster, GI24640005, Length=392, Percent_Identity=46.9387755102041, Blast_Score=360, Evalue=1e-100,
Organism=Drosophila melanogaster, GI221329721, Length=392, Percent_Identity=46.9387755102041, Blast_Score=360, Evalue=1e-99,
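
Each homologue line above is a comma-separated list of `key=value` fields plus a bare GI identifier. A minimal parsing sketch follows, assuming that layout holds for every line; `parse_homologue` is a hypothetical helper, not a tool associated with this record.

```python
def parse_homologue(line: str) -> dict:
    """Split one comma-separated homologue line into typed fields."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            # The GI identifier is the only field without a key.
            record["GI"] = field[2:]
    # Convert the numeric fields; the organism name and GI stay as strings.
    for key, cast in (("Length", int), ("Percent_Identity", float),
                      ("Blast_Score", int), ("Evalue", float)):
        if key in record:
            record[key] = cast(record[key])
    return record


line = ("Organism=Escherichia coli, GI1788902, Length=407, "
        "Percent_Identity=56.7567567567568, Blast_Score=457, Evalue=1e-130,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], hit["Length"], hit["Evalue"])
```

With the fields typed this way, the hits can be sorted by `Blast_Score` or filtered by `Evalue`.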

Paralogues:

None

Copy number:

- 3180 Molecules/Cell In: Growth Phase, Minimal Media (Based on E. coli)
- 300 Molecules/Cell In: Growth Phase, Minimal Media (Based on E. coli)
- 240 Molecules/Cell In: Growth Phase, Minimal Media (Based on E. coli)
- 12,000 Molecules/Cell In: Glucose minimal

Swissprot (AC and ID): GLYA_PROM1 (A2C090)

Other databases:

- EMBL:   CP000553
- RefSeq:   YP_001014165.1
- ProteinModelPortal:   A2C090
- SMR:   A2C090
- STRING:   A2C090
- GeneID:   4779415
- GenomeReviews:   CP000553_GR
- KEGG:   pme:NATL1_03361
- eggNOG:   COG0112
- HOGENOM:   HBG301263
- OMA:   ADVEANV
- ProtClustDB:   PRK00011
- BioCyc:   PMAR167555:NATL1_03361-MONOMER
- GO:   GO:0005737
- HAMAP:   MF_00051_B
- InterPro:   IPR015424
- InterPro:   IPR015421
- InterPro:   IPR015422
- InterPro:   IPR001085
- InterPro:   IPR019798
- Gene3D:   G3DSA:3.40.640.10
- Gene3D:   G3DSA:3.90.1150.10
- PANTHER:   PTHR11680
- PIRSF:   PIRSF000412

Pfam domain/function: PF00464 SHMT; SSF53383 PyrdxlP-dep_Trfase_major

EC number: 2.1.2.1

Molecular weight: Translated: 44825; Mature: 44825

Theoretical pI: Translated: 7.09; Mature: 7.09

Prosite motif: PS00096 SHMT

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.5 %Cys     (Translated Protein)
1.7 %Met     (Translated Protein)
3.2 %Cys+Met (Translated Protein)
1.5 %Cys     (Mature Protein)
1.7 %Met     (Mature Protein)
3.2 %Cys+Met (Mature Protein)
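
The percentages above can be reproduced by counting residues in the 411-residue protein sequence given earlier in this record. A minimal sketch, with `residue_percent` as an illustrative helper name:

```python
# Protein sequence copied from the record (411 residues).
PROTEIN = (
    "MKCDPSIAKLINNELSRQETHLELIASENFASKAVMEAQGSVLTNKYAEGLPNKRYYGGCEYIDGIEQLAIDRAKNLFGA"
    "NWANVQPHSGAQANFAVFLSLLKPGDTIMGMDLSHGGHLTHGSPVNVSGKWFKTCHYEVDKKTEMLDMDAIRKKAIENQP"
    "KLIICGFSAYPRKIDFKAFRSIADEVNAYLLADIAHIAGLVASGLHPSPIPYCDVVTTTTHKTLRGPRGGLILSKDEEIG"
    "KKLDKAVFPGTQGGPLEHVIAAKAVAFQEASAPEFKIYSQKVISNAQVLSNQLQKRGISIVSKGTDNHIVLLDLRSIGMT"
    "GKVADQLVSDIKITANKNTVPFDPESPFVTSGLRLGSAALTTRGFNEQAFEDVGNIIADRLLNPNDEDIKENSINKVSEL"
    "CNKFPLYSENI"
)


def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions in `sequence` occupied by any of `residues`."""
    hits = sum(sequence.count(r) for r in residues)
    return 100.0 * hits / len(sequence)


print(f"{residue_percent(PROTEIN, 'C'):.1f} %Cys")       # 1.5 %Cys
print(f"{residue_percent(PROTEIN, 'M'):.1f} %Met")       # 1.7 %Met
print(f"{residue_percent(PROTEIN, 'CM'):.1f} %Cys+Met")  # 3.2 %Cys+Met
```

The sequence contains 6 cysteines and 7 methionines. The translated and mature sequences are identical here, so both sets of percentages agree.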

Secondary structure:

>Translated Secondary Structure
MKCDPSIAKLINNELSRQETHLELIASENFASKAVMEAQGSVLTNKYAEGLPNKRYYGGC
CCCCHHHHHHHHHHHHHHHHHHHEEECCCHHHHHHHHHCCCHHHHHHHHCCCCCCCCCCH
EYIDGIEQLAIDRAKNLFGANWANVQPHSGAQANFAVFLSLLKPGDTIMGMDLSHGGHLT
HHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEEECCCCCCC
HGSPVNVSGKWFKTCHYEVDKKTEMLDMDAIRKKAIENQPKLIICGFSAYPRKIDFKAFR
CCCCCCCCCCEEEEEEEECCCCHHHHHHHHHHHHHHCCCCCEEEECCCCCCCCCCHHHHH
SIADEVNAYLLADIAHIAGLVASGLHPSPIPYCDVVTTTTHKTLRGPRGGLILSKDEEIG
HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCHHHHCCCCCCEEEECCHHHH
KKLDKAVFPGTQGGPLEHVIAAKAVAFQEASAPEFKIYSQKVISNAQVLSNQLQKRGISI
HHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEE
VSKGTDNHIVLLDLRSIGMTGKVADQLVSDIKITANKNTVPFDPESPFVTSGLRLGSAAL
EECCCCCEEEEEEHHHCCCCHHHHHHHHHCEEEEECCCCCCCCCCCCCHHCCCHHCCHHH
TTRGFNEQAFEDVGNIIADRLLNPNDEDIKENSINKVSELCNKFPLYSENI
HCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC
>Mature Secondary Structure
MKCDPSIAKLINNELSRQETHLELIASENFASKAVMEAQGSVLTNKYAEGLPNKRYYGGC
CCCCHHHHHHHHHHHHHHHHHHHEEECCCHHHHHHHHHCCCHHHHHHHHCCCCCCCCCCH
EYIDGIEQLAIDRAKNLFGANWANVQPHSGAQANFAVFLSLLKPGDTIMGMDLSHGGHLT
HHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEEECCCCCCC
HGSPVNVSGKWFKTCHYEVDKKTEMLDMDAIRKKAIENQPKLIICGFSAYPRKIDFKAFR
CCCCCCCCCCEEEEEEEECCCCHHHHHHHHHHHHHHCCCCCEEEECCCCCCCCCCHHHHH
SIADEVNAYLLADIAHIAGLVASGLHPSPIPYCDVVTTTTHKTLRGPRGGLILSKDEEIG
HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCHHHHCCCCCCEEEECCHHHH
KKLDKAVFPGTQGGPLEHVIAAKAVAFQEASAPEFKIYSQKVISNAQVLSNQLQKRGISI
HHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEE
VSKGTDNHIVLLDLRSIGMTGKVADQLVSDIKITANKNTVPFDPESPFVTSGLRLGSAAL
EECCCCCEEEEEEHHHCCCCHHHHHHHHHCEEEEECCCCCCCCCCCCCHHCCCHHCCHHH
TTRGFNEQAFEDVGNIIADRLLNPNDEDIKENSINKVSELCNKFPLYSENI
HCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC
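
The secondary structure blocks above alternate a line of residues with a line of per-residue states. The sketch below shows one way to separate the two and summarize the states. It assumes the conventional reading of H as helix, E as strand and C as coil, which the record itself does not define. The helper names and the short input lines are hypothetical toy data, not taken from the record.

```python
from collections import Counter


def split_interleaved(lines: list[str]) -> tuple[str, str]:
    """Separate alternating residue/state lines into two joined strings."""
    residues = "".join(lines[0::2])
    states = "".join(lines[1::2])
    if len(residues) != len(states):
        raise ValueError("residue and state strings differ in length")
    return residues, states


def state_fractions(states: str) -> dict[str, float]:
    """Fraction of positions assigned to each state letter."""
    counts = Counter(states)
    total = sum(counts.values())
    return {state: counts[state] / total for state in sorted(counts)}


# Toy input in the same alternating layout (not real data from the record).
toy_lines = ["MKCD", "CCHH", "PSIA", "HHEC"]
residues, states = split_interleaved(toy_lines)
print(residues, states)
print(state_fractions(states))
```

The same two functions could be applied to the lines under ">Translated Secondary Structure" once the header line is dropped.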

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA