Definition Streptococcus pyogenes M1 GAS chromosome, complete genome.
Accession NC_002737
Length 1,852,441

The map label for this gene is proS

Identifier: 15675760

GI number: 15675760

Start: 1633034

End: 1634890

Strand: Reverse

Name: proS

Synonym: SPy_1962

Alternate gene names: 15675760

Gene position: 1634890-1633034 (Counterclockwise)

Preceding gene: 15675761

Following gene: 15675759

Centisome position: 88.26
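Note: the coordinate fields above are related by simple arithmetic. The sketch below recomputes the gene length and the centisome position from the start, end and chromosome length given in this record. It assumes inclusive 1-based coordinates and that the centisome is taken from the first coordinate of the "Gene position" field (1634890 for this reverse-strand gene); the function names are illustrative only and are not part of the database.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given inclusive 1-based coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the chromosome length, to 2 decimals."""
    return round(100.0 * position / genome_length, 2)


if __name__ == "__main__":
    # Values copied from this record (Start, End, Length).
    print(gene_length(1633034, 1634890))   # 1857
    print(centisome(1634890, 1852441))     # 88.26
```

Under these assumptions the results agree with the ">1857_bases" header and the centisome value listed in the record.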

GC content: 43.4

Gene sequence:

>1857_bases
ATGAAACAAAGTAAACTGCTTATCCCAACCTTGCGCGAAATGCCAAGTGATGCCCAGGTTATCAGCCACGCGCTTATGGT
GCGTGCCGGTTATGTGCGCCAAGTTTCTGCTGGTATCTATGCTTATTTACCACTGGCAAATCGTACCATTGAGAAATTCA
AGACCATCATGCGTGAAGAGTTTGAAAAGATCGGTGCTGTTGAAATGTTGGCGCCAGCTCTTTTGACAGCTGATCTCTGG
CGTGAATCAGGCCGTTATGAGACCTATGGAGAGGACCTCTATAAGCTTAAAAACCGTGATAACTCAGACTTTATCTTGGG
TCCAACTCACGAAGAAACCTTTACGACTTTGGTGCGTGATGCGGTCAAATCTTACAAGCAATTGCCCTTAAACCTTTATC
AAATTCAGTCTAAGTATCGTGATGAAAAACGTCCGCGTAATGGCTTGCTTCGTACCCGTGAGTTTATTATGAAAGACGGC
TATAGTTTCCATCACAACTATGAAGATTTAGATGTGACCTACGAAGATTATCGTCAAGCTTACGAAGCTATTTTTACCAG
AGCTGGTTTAGATTTCAAAGGGATTATTGGAGATGGCGGTGCCATGGGGGGGAAAGATTCCCAAGAATTTATGGCCATTA
CACCAGCTCGTACGGATCTTGATCGCTGGGTGGTTCTTGACAAGTCTATTGCGTCAATGGATGATATTCCAAAAGAGGTC
TTAGAAGATATTAAGGCAGAATTGGCTGCTTGGATGATTTCAGGTGAAGATACTATTGCTTATTCAACAGAATCAAGCTA
CGCTGCCAACCTTGAGATGGCAACTAACGAATACAAACCGTCCTCAAAAGTAGCTGCTGAAGATGCCTTGGCAGAAGTTG
AGACACCACATTGCAAAACGATTGATGAAGTGGCTGCTTTTCTTTCAGTAGATGAAACACAAACCATCAAAACCTTGCTT
TTTGTGGCAGATAATGAACCTGTTGTTGCTTTGCTAGTTGGAAATGACCATATCAATACCGTTAAATTAAAAAACTATCT
AGCTGCTGATTTTTTAGAGCCAGCTAGTGAAGAAGAAGCCCGTGCTTTCTTTGGTGCAGGTTTTGGCTCACTTGGGCCTG
TCAACTTGGCACAAGGTAGTCGTATTGTGGCTGACCGCAAAGTGCAAAACCTTACCAATGCCGTTGCAGGAGCTAACAAG
GATGGTTTCCATATGACAGGAGTCAATCCAGGACGTGACTTCCAGGCTGAATATGTAGATATTCGTGAAGTCAAAGAAGG
GGAAATGTCTCCGGACGGTCATGGTGTTCTCCAGTTTGCGCGTGGTATCGAAGTCGGTCATATCTTCAAGTTGGGCACTC
GTTATTCAGACAGCATGGGAGCAACGATTCTTGATGAAAATGGCAGAACAGTTCCAATTGTGATGGGTTGTTATGGTATC
GGGGTTAGCCGCATTTTGTCTGCTGTCATTGAACAGCATGCCCGTCTCTTTGTGAACAAGACACCAAAAGGCGATTACCG
TTATGCTTGGGGTATTAACTTCCCTAAAGAATTAGCCCCATTTGACGTGCATTTGATTACAGTTAATGTCAAAGATCAAG
TGGCCCAAGACTTGACGGCGAAGTTAGAAGCTGACTTGATGGCTAAAGGGTATGATGTCTTGACAGATGACCGTAATGAA
CGCGTCGGTTCGAAATTCTCTGATAGCGATTTGATTGGTTTGCCAATTCGTGTCACTGTTGGTAAAAAAGCCGCTGAAGG
TATCGTGGAAATTAAAATCAAGGCAACAGGTGACAGCATTGAAGTTAATGCAGAAAACCTCATCGAAACCCTTGAAATTT
TAACAAAAGAACACTAA
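
Note: the "GC content" field is presumably the percentage of G and C bases in the gene sequence above. A minimal sketch of that calculation follows, assuming the sequence is pasted as plain text with the FASTA-style header line removed; the function name gc_content is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence, to 1 decimal.

    Whitespace and line breaks are ignored so that a multi-line
    sequence block can be passed in directly.
    """
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return round(100.0 * gc / len(seq), 1)


if __name__ == "__main__":
    # Toy input only; paste the 1857-base sequence to check the record.
    print(gc_content("GGCC\nAATT"))
```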

Upstream 100 bases:

>100_bases
AGGAATACACATTCAAATTATTGAATAATCCCGCTTCACTCTTACTTGATGTTTCCAGAAGGGTTTGAGGATAAGGGGAT
GTATTAAAACGGAATAATCT

Downstream 100 bases:

>100_bases
AAAAGAAATGAAGAGGCTGCCTGTTTAGAGTGGCCTCTTTTTGCTGTGAACTTGTCTGATACAGAGTGGCTTTTGCTTTA
CCAAAAGCGCTTGGCGATAA

Product: prolyl-tRNA synthetase

Products: NA

Alternate protein names: Proline--tRNA ligase; ProRS

Number of amino acids: Translated: 618; Mature: 618

Protein sequence:

>618_residues
MKQSKLLIPTLREMPSDAQVISHALMVRAGYVRQVSAGIYAYLPLANRTIEKFKTIMREEFEKIGAVEMLAPALLTADLW
RESGRYETYGEDLYKLKNRDNSDFILGPTHEETFTTLVRDAVKSYKQLPLNLYQIQSKYRDEKRPRNGLLRTREFIMKDG
YSFHHNYEDLDVTYEDYRQAYEAIFTRAGLDFKGIIGDGGAMGGKDSQEFMAITPARTDLDRWVVLDKSIASMDDIPKEV
LEDIKAELAAWMISGEDTIAYSTESSYAANLEMATNEYKPSSKVAAEDALAEVETPHCKTIDEVAAFLSVDETQTIKTLL
FVADNEPVVALLVGNDHINTVKLKNYLAADFLEPASEEEARAFFGAGFGSLGPVNLAQGSRIVADRKVQNLTNAVAGANK
DGFHMTGVNPGRDFQAEYVDIREVKEGEMSPDGHGVLQFARGIEVGHIFKLGTRYSDSMGATILDENGRTVPIVMGCYGI
GVSRILSAVIEQHARLFVNKTPKGDYRYAWGINFPKELAPFDVHLITVNVKDQVAQDLTAKLEADLMAKGYDVLTDDRNE
RVGSKFSDSDLIGLPIRVTVGKKAAEGIVEIKIKATGDSIEVNAENLIETLEILTKEH
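
Note: the protein sequence above corresponds to a codon-by-codon translation of the gene sequence (1857 bases = 618 codons plus one stop codon). The sketch below shows such a translation using the standard genetic code; it assumes the input is already the coding strand read 5' to 3', as the gene sequence in this record is. The function name translate is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":          # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First six codons and last four codons of the gene sequence.
    print(translate("ATGAAACAAAGTAAACTG"))  # MKQSKL
    print(translate("AAAGAACACTAA"))        # KEH
```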

Sequences:

>Translated_618_residues
MKQSKLLIPTLREMPSDAQVISHALMVRAGYVRQVSAGIYAYLPLANRTIEKFKTIMREEFEKIGAVEMLAPALLTADLW
RESGRYETYGEDLYKLKNRDNSDFILGPTHEETFTTLVRDAVKSYKQLPLNLYQIQSKYRDEKRPRNGLLRTREFIMKDG
YSFHHNYEDLDVTYEDYRQAYEAIFTRAGLDFKGIIGDGGAMGGKDSQEFMAITPARTDLDRWVVLDKSIASMDDIPKEV
LEDIKAELAAWMISGEDTIAYSTESSYAANLEMATNEYKPSSKVAAEDALAEVETPHCKTIDEVAAFLSVDETQTIKTLL
FVADNEPVVALLVGNDHINTVKLKNYLAADFLEPASEEEARAFFGAGFGSLGPVNLAQGSRIVADRKVQNLTNAVAGANK
DGFHMTGVNPGRDFQAEYVDIREVKEGEMSPDGHGVLQFARGIEVGHIFKLGTRYSDSMGATILDENGRTVPIVMGCYGI
GVSRILSAVIEQHARLFVNKTPKGDYRYAWGINFPKELAPFDVHLITVNVKDQVAQDLTAKLEADLMAKGYDVLTDDRNE
RVGSKFSDSDLIGLPIRVTVGKKAAEGIVEIKIKATGDSIEVNAENLIETLEILTKEH
>Mature_618_residues
MKQSKLLIPTLREMPSDAQVISHALMVRAGYVRQVSAGIYAYLPLANRTIEKFKTIMREEFEKIGAVEMLAPALLTADLW
RESGRYETYGEDLYKLKNRDNSDFILGPTHEETFTTLVRDAVKSYKQLPLNLYQIQSKYRDEKRPRNGLLRTREFIMKDG
YSFHHNYEDLDVTYEDYRQAYEAIFTRAGLDFKGIIGDGGAMGGKDSQEFMAITPARTDLDRWVVLDKSIASMDDIPKEV
LEDIKAELAAWMISGEDTIAYSTESSYAANLEMATNEYKPSSKVAAEDALAEVETPHCKTIDEVAAFLSVDETQTIKTLL
FVADNEPVVALLVGNDHINTVKLKNYLAADFLEPASEEEARAFFGAGFGSLGPVNLAQGSRIVADRKVQNLTNAVAGANK
DGFHMTGVNPGRDFQAEYVDIREVKEGEMSPDGHGVLQFARGIEVGHIFKLGTRYSDSMGATILDENGRTVPIVMGCYGI
GVSRILSAVIEQHARLFVNKTPKGDYRYAWGINFPKELAPFDVHLITVNVKDQVAQDLTAKLEADLMAKGYDVLTDDRNE
RVGSKFSDSDLIGLPIRVTVGKKAAEGIVEIKIKATGDSIEVNAENLIETLEILTKEH

Specific function: Catalyzes the attachment of proline to tRNA(Pro) in a two-step reaction: proline is first activated by ATP to form Pro-AMP and then transferred to the acceptor end of tRNA(Pro). As ProRS can inadvertently accommodate and process non-cognate amino acids su

COG id: COG0442

COG function: function code J; Prolyl-tRNA synthetase

Gene ontology:

Cell location: Cytoplasm

Metabolic importance: Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the class-II aminoacyl-tRNA synthetase family. ProS type 1 subfamily

Homologues:

Organism=Homo sapiens, GI34303926, Length=202, Percent_Identity=40.0990099009901, Blast_Score=160, Evalue=3e-39
Organism=Escherichia coli, GI1786392, Length=619, Percent_Identity=41.8416801292407, Blast_Score=457, Evalue=1e-129
Organism=Caenorhabditis elegans, GI115532348, Length=197, Percent_Identity=36.0406091370558, Blast_Score=135, Evalue=5e-32
Organism=Saccharomyces cerevisiae, GI6320931, Length=199, Percent_Identity=35.1758793969849, Blast_Score=136, Evalue=1e-32
Organism=Drosophila melanogaster, GI24656200, Length=226, Percent_Identity=32.7433628318584, Blast_Score=134, Evalue=2e-31

Paralogues:

None

Copy number: 800 Molecules/Cell In: Growth-Phase, Minimal-Media (Based on E. coli). [C]

Swissprot (AC and ID): SYP_STRP1 (Q99XY4)

Other databases:

- EMBL:   AE004092
- EMBL:   CP000017
- RefSeq:   NP_269934.1
- RefSeq:   YP_283036.1
- ProteinModelPortal:   Q99XY4
- SMR:   Q99XY4
- EnsemblBacteria:   EBSTRT00000001204
- EnsemblBacteria:   EBSTRT00000027672
- GeneID:   3571227
- GeneID:   901637
- GenomeReviews:   AE004092_GR
- GenomeReviews:   CP000017_GR
- KEGG:   spy:SPy_1962
- KEGG:   spz:M5005_Spy_1673
- GeneTree:   EBGT00050000026594
- HOGENOM:   HBG403504
- OMA:   DFVLGPT
- ProtClustDB:   PRK09194
- BioCyc:   SPYO160490:SPY1962-MONOMER
- BioCyc:   SPYO293653:M5005_SPY1673-MONOMER
- GO:   GO:0005737
- HAMAP:   MF_01569
- InterPro:   IPR002314
- InterPro:   IPR006195
- InterPro:   IPR004154
- InterPro:   IPR002316
- InterPro:   IPR004500
- InterPro:   IPR007214
- Gene3D:   G3DSA:3.40.50.800
- Gene3D:   G3DSA:3.90.960.10
- PRINTS:   PR01046
- TIGRFAMs:   TIGR00409

Pfam domain/function: PF03129 HGTP_anticodon; PF00587 tRNA-synt_2b; PF04073 YbaK; SSF52954 Anticodon_bd; SSF55826 YbaK/aa-tRNA-synth-assoc-reg

EC number: 6.1.1.15

Molecular weight: Translated: 68704; Mature: 68704

Theoretical pI: Translated: 4.81; Mature: 4.81

Prosite motif: PS50862 AA_TRNA_LIGASE_II

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.3 %Cys     (Translated Protein)
2.6 %Met     (Translated Protein)
2.9 %Cys+Met (Translated Protein)
0.3 %Cys     (Mature Protein)
2.6 %Met     (Mature Protein)
2.9 %Cys+Met (Mature Protein)
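
Note: the percentages above are residue counts divided by protein length. A minimal sketch of that calculation follows, assuming one-letter amino acid codes and rounding to one decimal place as in the record; the function name residue_percent is hypothetical.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`.

    `residues` is a string of one-letter codes, e.g. "C" or "CM".
    Whitespace in the sequence is ignored.
    """
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(1 for aa in protein if aa in residues.upper())
    return round(100.0 * count / len(protein), 1)


if __name__ == "__main__":
    # Toy input only; paste the 618-residue sequence to check the record.
    toy = "MCAA"
    print(residue_percent(toy, "C"))
    print(residue_percent(toy, "M"))
    print(residue_percent(toy, "CM"))
```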

Secondary structure:

>Translated Secondary Structure
MKQSKLLIPTLREMPSDAQVISHALMVRAGYVRQVSAGIYAYLPLANRTIEKFKTIMREE
CCCCCEECHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCEEEEECCHHHHHHHHHHHHHHH
FEKIGAVEMLAPALLTADLWRESGRYETYGEDLYKLKNRDNSDFILGPTHEETFTTLVRD
HHHCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHCCCCCCEEECCCCHHHHHHHHHH
AVKSYKQLPLNLYQIQSKYRDEKRPRNGLLRTREFIMKDGYSFHHNYEDLDVTYEDYRQA
HHHHHHHCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHH
YEAIFTRAGLDFKGIIGDGGAMGGKDSQEFMAITPARTDLDRWVVLDKSIASMDDIPKEV
HHHHHHHCCCCEEEEECCCCCCCCCCCCCEEEECCCCCCCCCEEEECHHHHHHHHHHHHH
LEDIKAELAAWMISGEDTIAYSTESSYAANLEMATNEYKPSSKVAAEDALAEVETPHCKT
HHHHHHHHHHHEECCCCEEEEECCCCCEEEEEEECCCCCCCCHHHHHHHHHHCCCCCCCC
IDEVAAFLSVDETQTIKTLLFVADNEPVVALLVGNDHINTVKLKNYLAADFLEPASEEEA
HHHHHHHHCCCCHHEEEEEEEEECCCCEEEEEECCCCCCEEEHHHHHHHHHCCCCCCHHH
RAFFGAGFGSLGPVNLAQGSRIVADRKVQNLTNAVAGANKDGFHMTGVNPGRDFQAEYVD
HHHHHCCCCCCCCCCCCCCCEEEHHHHHHHHHHHHHCCCCCCCEECCCCCCCCCCHHHHH
IREVKEGEMSPDGHGVLQFARGIEVGHIFKLGTRYSDSMGATILDENGRTVPIVMGCYGI
HHHHHCCCCCCCCCHHHHHHHCCCCCCEEEECCCCCCCCCCEEECCCCCEEEEEEEHHHC
GVSRILSAVIEQHARLFVNKTPKGDYRYAWGINFPKELAPFDVHLITVNVKDQVAQDLTA
CHHHHHHHHHHHHHHEEEECCCCCCEEEEECCCCCHHCCCCEEEEEEEECHHHHHHHHHH
KLEADLMAKGYDVLTDDRNERVGSKFSDSDLIGLPIRVTVGKKAAEGIVEIKIKATGDSI
HHHHHHHHCCCCEECCCCCHHCCCCCCCCCEEEEEEEEEECCCCCCCEEEEEEEECCCCE
EVNAENLIETLEILTKEH
EECHHHHHHHHHHHHCCC
>Mature Secondary Structure
MKQSKLLIPTLREMPSDAQVISHALMVRAGYVRQVSAGIYAYLPLANRTIEKFKTIMREE
CCCCCEECHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCEEEEECCHHHHHHHHHHHHHHH
FEKIGAVEMLAPALLTADLWRESGRYETYGEDLYKLKNRDNSDFILGPTHEETFTTLVRD
HHHCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHCCCCCCEEECCCCHHHHHHHHHH
AVKSYKQLPLNLYQIQSKYRDEKRPRNGLLRTREFIMKDGYSFHHNYEDLDVTYEDYRQA
HHHHHHHCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHH
YEAIFTRAGLDFKGIIGDGGAMGGKDSQEFMAITPARTDLDRWVVLDKSIASMDDIPKEV
HHHHHHHCCCCEEEEECCCCCCCCCCCCCEEEECCCCCCCCCEEEECHHHHHHHHHHHHH
LEDIKAELAAWMISGEDTIAYSTESSYAANLEMATNEYKPSSKVAAEDALAEVETPHCKT
HHHHHHHHHHHEECCCCEEEEECCCCCEEEEEEECCCCCCCCHHHHHHHHHHCCCCCCCC
IDEVAAFLSVDETQTIKTLLFVADNEPVVALLVGNDHINTVKLKNYLAADFLEPASEEEA
HHHHHHHHCCCCHHEEEEEEEEECCCCEEEEEECCCCCCEEEHHHHHHHHHCCCCCCHHH
RAFFGAGFGSLGPVNLAQGSRIVADRKVQNLTNAVAGANKDGFHMTGVNPGRDFQAEYVD
HHHHHCCCCCCCCCCCCCCCEEEHHHHHHHHHHHHHCCCCCCCEECCCCCCCCCCHHHHH
IREVKEGEMSPDGHGVLQFARGIEVGHIFKLGTRYSDSMGATILDENGRTVPIVMGCYGI
HHHHHCCCCCCCCCHHHHHHHCCCCCCEEEECCCCCCCCCCEEECCCCCEEEEEEEHHHC
GVSRILSAVIEQHARLFVNKTPKGDYRYAWGINFPKELAPFDVHLITVNVKDQVAQDLTA
CHHHHHHHHHHHHHHEEEECCCCCCEEEEECCCCCHHCCCCEEEEEEEECHHHHHHHHHH
KLEADLMAKGYDVLTDDRNERVGSKFSDSDLIGLPIRVTVGKKAAEGIVEIKIKATGDSI
HHHHHHHHCCCCEECCCCCHHCCCCCCCCCEEEEEEEEEECCCCCCCEEEEEEEECCCCE
EVNAENLIETLEILTKEH
EECHHHHHHHHHHHHCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 11296296