Definition Streptococcus pyogenes M1 GAS chromosome, complete genome.
Accession NC_002737
Length 1,852,441

The map label for this gene is trsA

Identifier: 15675940

GI number: 15675940

Start: 1840962

End: 1841984

Strand: Reverse

Name: trsA

Synonym: SPy_2207

Alternate gene names: 15675940

Gene position: 1841984-1840962 (Counterclockwise)

Preceding gene: 15675944

Following gene: 15675939

Centisome position: 99.44
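
The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes that the centisome position is the gene's first base in transcription order (1841984 on the reverse strand) expressed as a percentage of the genome length. The record does not state this definition; it is inferred because it reproduces the listed 99.44. The function names are illustrative only.

```python
# Values copied from the record; the formulas are assumptions (see lead-in).
GENOME_LENGTH = 1_852_441            # "Length" field
START, END = 1_840_962, 1_841_984    # "Start" / "End" fields


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, rounded to 2 places."""
    return round(100.0 * position / genome_length, 2)


if __name__ == "__main__":
    print(gene_length(START, END))            # 1023, matching ">1023_bases"
    print(centisome(END, GENOME_LENGTH))      # 99.44 (reverse-strand gene start)
    print(centisome(START, GENOME_LENGTH))    # 99.38 (forward-strand coordinate)
```

Under these assumptions only the reverse-strand start coordinate yields the value given in the record.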

GC content: 36.27
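
As a rough illustration of how a GC content figure like the one above is usually derived, the following sketch computes the percentage of G and C bases in a nucleotide string. It assumes the value is taken over the gene sequence alone; if so, 36.27 would correspond to 371 G or C bases out of 1023. The helper name `gc_content` is hypothetical and not part of any tool named in this record.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a DNA sequence, rounded to 2 places."""
    seq = "".join(sequence.split()).upper()   # drop line breaks and whitespace
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


if __name__ == "__main__":
    # Arithmetic check of the assumption stated in the lead-in.
    print(round(100.0 * 371 / 1023, 2))   # 36.27
    # Small worked example on the first ten bases of the gene.
    print(gc_content("ATGACAAAAC"))       # 30.0
```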

Gene sequence:

>1023_bases
ATGACAAAACCTATTATTTTAACTGGAGATAGACCAACTGGAAAATTACATTTAGGTCATTATGTCGGAAGTCTTAAAAA
TCGTGTCTTTTTACAAAATGAAAACAAGTATAAGATGTTTGTTTTCTTGGCCGATCAACAGGCACTGACGGATCATGCTA
AAGAATCCGAATTAATTCAAGAATCTATTGGAAATGTTGCCTTGGATTACCTCTCTGTCGGTTTGGATCCGAAGCAATCA
ACGATTTTTATTCAAAGTCAGATTCCAGAGCTAGCTGAATTGAGCATGTATTATATGAATCTGGTATCATTAGCACGTTT
GGAAAGAAATCCCACTGTTAAAACCGAAATTGCTCAGAAAGGTTTTGGCGAAAGTATTCCATCTGGTTTTTTGGTTTATC
CAGTATCACAGGCCGCTGATATTACAGCATTTAAAGCGAATTTAGTACCTGTAGGTAACGACCAGAAACCGATGATTGAA
CAAACACGTGAAATTGTGAGAAGTTTTAATCATACTTACCATACAGACTGTTTAGTAGAACCTGAAGGTATTTATCCAGA
AAATGAAAAAGCTGGACGCTTACCTGGTCTTGATGGCAATGCCAAAATGTCTAAGTCATTGGGAAATGGAATCTATCTCT
CAGATGATGCAGATACCGTTCGCAAAAAAGTGATGAGCATGTATACTGATCCAAATCATATTAAAATAGAAGATCCTGGT
CAAATTGAAGGGAATATGGTCTTTCATTATTTGGATATTTTTGCTAGAAAAGAAGATCAAGCTGATATCGAAGCAATGAA
AGAGCATTATCAAATAGGTGGTTTAGGAGATGTGAAAACGAAACGCTACCTTTTAGATATTTTAGAACGTGAATTAGCAC
CTATTCGTGAAAGACGTTTGGAGTACGCTAAAGATATGGGAGAGGTGTTCCGTATGTTACAAGAAGGTAGTCAAAAAGCA
AGAACTGTGGCAGCCAAGACTTTATCAGAAGTGAAGTCAGCAATGGGTATTAATTATTTTTAA

Upstream 100 bases:

>100_bases
GCTAATTGAGGTGGTACCGCGTATTACTTGTAATAACGCCCTCACGTTTTAATAGCGTGGGGACTTTTTGCTATATCAAT
AGATAGTTAGGGAGAAAATG

Downstream 100 bases:

>100_bases
TGTTTTAGAGAAAAAGCTAAGTTAGTAATGATAAGATAATTACTATTATCAAAACAGGACCTAATTTGTCATTTCTTTAC
AGGATAGTAGAGATTATGTT

Product: tryptophanyl-tRNA synthetase II

Products: NA

Alternate protein names: Tryptophan--tRNA ligase; TrpRS

Number of amino acids: Translated: 340; Mature: 339
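
The two residue counts follow from the gene length: 1023 bases give 341 codons, the last of which is the TAA stop codon, leaving 340 translated residues. The mature count of 339 is consistent with removal of the initiator methionine, which is an inference from the sequences listed here rather than something the record states. A minimal translation sketch, assuming the standard codon assignments, is shown below; `translate` is an illustrative helper, not a library function.

```python
from itertools import product

# Standard codon assignments, enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":          # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    print(translate("ATGACAAAACCTATTATTTTAACTGGAGAT"))  # MTKPIILTGD
    print(translate("GGTATTAATTATTTTTAA"))              # GINYF
    print(1023 // 3 - 1)                                # 340 translated residues
```

The two short inputs are the first 30 and last 18 bases of the gene sequence, and their translations match the start and end of the protein sequence in this record.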

Protein sequence:

>340_residues
MTKPIILTGDRPTGKLHLGHYVGSLKNRVFLQNENKYKMFVFLADQQALTDHAKESELIQESIGNVALDYLSVGLDPKQS
TIFIQSQIPELAELSMYYMNLVSLARLERNPTVKTEIAQKGFGESIPSGFLVYPVSQAADITAFKANLVPVGNDQKPMIE
QTREIVRSFNHTYHTDCLVEPEGIYPENEKAGRLPGLDGNAKMSKSLGNGIYLSDDADTVRKKVMSMYTDPNHIKIEDPG
QIEGNMVFHYLDIFARKEDQADIEAMKEHYQIGGLGDVKTKRYLLDILERELAPIRERRLEYAKDMGEVFRMLQEGSQKA
RTVAAKTLSEVKSAMGINYF

Sequences:

>Translated_340_residues
MTKPIILTGDRPTGKLHLGHYVGSLKNRVFLQNENKYKMFVFLADQQALTDHAKESELIQESIGNVALDYLSVGLDPKQS
TIFIQSQIPELAELSMYYMNLVSLARLERNPTVKTEIAQKGFGESIPSGFLVYPVSQAADITAFKANLVPVGNDQKPMIE
QTREIVRSFNHTYHTDCLVEPEGIYPENEKAGRLPGLDGNAKMSKSLGNGIYLSDDADTVRKKVMSMYTDPNHIKIEDPG
QIEGNMVFHYLDIFARKEDQADIEAMKEHYQIGGLGDVKTKRYLLDILERELAPIRERRLEYAKDMGEVFRMLQEGSQKA
RTVAAKTLSEVKSAMGINYF
>Mature_339_residues
TKPIILTGDRPTGKLHLGHYVGSLKNRVFLQNENKYKMFVFLADQQALTDHAKESELIQESIGNVALDYLSVGLDPKQST
IFIQSQIPELAELSMYYMNLVSLARLERNPTVKTEIAQKGFGESIPSGFLVYPVSQAADITAFKANLVPVGNDQKPMIEQ
TREIVRSFNHTYHTDCLVEPEGIYPENEKAGRLPGLDGNAKMSKSLGNGIYLSDDADTVRKKVMSMYTDPNHIKIEDPGQ
IEGNMVFHYLDIFARKEDQADIEAMKEHYQIGGLGDVKTKRYLLDILERELAPIRERRLEYAKDMGEVFRMLQEGSQKAR
TVAAKTLSEVKSAMGINYF

Specific function: Unknown

COG id: COG0180

COG function: function code J; Tryptophanyl-tRNA synthetase

Gene ontology:

Cell location: Cytoplasm

Metabolic importance: Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the class-I aminoacyl-tRNA synthetase family

Homologues:

Organism=Homo sapiens, GI7710154, Length=339, Percent_Identity=29.2035398230088, Blast_Score=117, Evalue=1e-26,
Organism=Homo sapiens, GI41352700, Length=172, Percent_Identity=32.5581395348837, Blast_Score=97, Evalue=2e-20,
Organism=Escherichia coli, GI1789786, Length=340, Percent_Identity=31.7647058823529, Blast_Score=150, Evalue=1e-37,
Organism=Caenorhabditis elegans, GI71982800, Length=348, Percent_Identity=28.735632183908, Blast_Score=133, Evalue=1e-31,
Organism=Caenorhabditis elegans, GI71982793, Length=348, Percent_Identity=28.735632183908, Blast_Score=133, Evalue=1e-31,
Organism=Saccharomyces cerevisiae, GI6320474, Length=350, Percent_Identity=26.8571428571429, Blast_Score=110, Evalue=4e-25,
Organism=Drosophila melanogaster, GI24666151, Length=336, Percent_Identity=31.8452380952381, Blast_Score=140, Evalue=1e-33,
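
Each homologue line above is a comma-separated list of `key=value` fields, with the GI number written without an equals sign and a trailing comma at the end. A minimal parsing sketch under that assumption follows; the function name `parse_homologue` and the choice of numeric types are illustrative.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict with typed numeric fields."""
    record = {}
    for part in line.strip().rstrip(",").split(","):
        key, sep, value = part.strip().partition("=")
        if not sep and key.startswith("GI"):
            # The GI field has no '=': e.g. "GI7710154".
            record["GI"] = key[2:]
        else:
            record[key] = value
    # Convert the numeric fields when present.
    for name, cast in (("Length", int), ("Percent_Identity", float),
                       ("Blast_Score", int), ("Evalue", float)):
        if name in record:
            record[name] = cast(record[name])
    return record


if __name__ == "__main__":
    line = ("Organism=Homo sapiens, GI7710154, Length=339, "
            "Percent_Identity=29.2035398230088, Blast_Score=117, Evalue=1e-26,")
    hit = parse_homologue(line)
    print(hit["Organism"], hit["GI"], hit["Length"], hit["Evalue"])
```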

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): SYW_STRP1 (Q99XH4)

Other databases:

- EMBL:   AE004092
- EMBL:   CP000017
- RefSeq:   NP_270114.1
- RefSeq:   YP_283221.1
- ProteinModelPortal:   Q99XH4
- SMR:   Q99XH4
- EnsemblBacteria:   EBSTRT00000001168
- EnsemblBacteria:   EBSTRT00000029051
- GeneID:   3571030
- GeneID:   901899
- GenomeReviews:   AE004092_GR
- GenomeReviews:   CP000017_GR
- KEGG:   spy:SPy_2207
- KEGG:   spz:M5005_Spy_1858
- GeneTree:   EBGT00050000027780
- HOGENOM:   HBG293263
- OMA:   TYLDAFH
- ProtClustDB:   PRK12282
- BioCyc:   SPYO160490:SPY2207-MONOMER
- BioCyc:   SPYO293653:M5005_SPY1858-MONOMER
- GO:   GO:0005737
- HAMAP:   MF_00140_B
- InterPro:   IPR001412
- InterPro:   IPR002305
- InterPro:   IPR014729
- InterPro:   IPR002306
- Gene3D:   G3DSA:3.40.50.620
- PANTHER:   PTHR10055
- PRINTS:   PR01039
- TIGRFAMs:   TIGR00233

Pfam domain/function: PF00579 tRNA-synt_1b

EC number: 6.1.1.2

Molecular weight: Translated: 38330; Mature: 38199

Theoretical pI: Translated: 6.12; Mature: 6.12

Prosite motif: PS00178 AA_TRNA_LIGASE_I

Important sites: BINDING 205-205

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.3 %Cys     (Translated Protein)
3.8 %Met     (Translated Protein)
4.1 %Cys+Met (Translated Protein)
0.3 %Cys     (Mature Protein)
3.5 %Met     (Mature Protein)
3.8 %Cys+Met (Mature Protein)
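
The percentages above are consistent with 1 Cys and 13 Met among the 340 translated residues, and 1 Cys and 12 Met among the 339 mature residues, as counted from the protein sequence in this record. The sketch below shows one way such figures might be computed, assuming simple residue counts divided by sequence length and rounded to one decimal place; `residue_percent` is a hypothetical helper.

```python
def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of the sequence made up of the given one-letter residues."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    count = sum(seq.count(r) for r in residues.upper())
    return round(100.0 * count / len(seq), 1)


if __name__ == "__main__":
    # Arithmetic check against the counts stated in the lead-in.
    print(round(100.0 * 1 / 340, 1))    # 0.3  %Cys, translated
    print(round(100.0 * 13 / 340, 1))   # 3.8  %Met, translated
    print(round(100.0 * 14 / 340, 1))   # 4.1  %Cys+Met, translated
    print(round(100.0 * 12 / 339, 1))   # 3.5  %Met, mature
    # Small worked example on a made-up ten-residue peptide.
    print(residue_percent("MCMAAAAAAA", "CM"))  # 30.0
```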

Secondary structure:

>Translated Secondary Structure
MTKPIILTGDRPTGKLHLGHYVGSLKNRVFLQNENKYKMFVFLADQQALTDHAKESELIQ
CCCCEEEECCCCCCEEEHHHHHHHHHCEEEEECCCCEEEEEEEECCHHHHHHHHHHHHHH
ESIGNVALDYLSVGLDPKQSTIFIQSQIPELAELSMYYMNLVSLARLERNPTVKTEIAQK
HHHHHHHHHHHHCCCCCCCCEEEEHHCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH
GFGESIPSGFLVYPVSQAADITAFKANLVPVGNDQKPMIEQTREIVRSFNHTYHTDCLVE
CCCCCCCCCEEEEECCCCCCEEEEEEEEEECCCCCCHHHHHHHHHHHHHCCCCCCCEEEC
PEGIYPENEKAGRLPGLDGNAKMSKSLGNGIYLSDDADTVRKKVMSMYTDPNHIKIEDPG
CCCCCCCCCCCCCCCCCCCCCHHHHHCCCCEEECCCHHHHHHHHHHHCCCCCEEEECCCC
QIEGNMVFHYLDIFARKEDQADIEAMKEHYQIGGLGDVKTKRYLLDILERELAPIRERRL
CCCCCCHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHH
EYAKDMGEVFRMLQEGSQKARTVAAKTLSEVKSAMGINYF
HHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHCCCCC
>Mature Secondary Structure 
TKPIILTGDRPTGKLHLGHYVGSLKNRVFLQNENKYKMFVFLADQQALTDHAKESELIQ
CCCEEEECCCCCCEEEHHHHHHHHHCEEEEECCCCEEEEEEEECCHHHHHHHHHHHHHH
ESIGNVALDYLSVGLDPKQSTIFIQSQIPELAELSMYYMNLVSLARLERNPTVKTEIAQK
HHHHHHHHHHHHCCCCCCCCEEEEHHCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH
GFGESIPSGFLVYPVSQAADITAFKANLVPVGNDQKPMIEQTREIVRSFNHTYHTDCLVE
CCCCCCCCCEEEEECCCCCCEEEEEEEEEECCCCCCHHHHHHHHHHHHHCCCCCCCEEEC
PEGIYPENEKAGRLPGLDGNAKMSKSLGNGIYLSDDADTVRKKVMSMYTDPNHIKIEDPG
CCCCCCCCCCCCCCCCCCCCCHHHHHCCCCEEECCCHHHHHHHHHHHCCCCCEEEECCCC
QIEGNMVFHYLDIFARKEDQADIEAMKEHYQIGGLGDVKTKRYLLDILERELAPIRERRL
CCCCCCHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHH
EYAKDMGEVFRMLQEGSQKARTVAAKTLSEVKSAMGINYF
HHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11296296