Definition: Streptococcus pyogenes M1 GAS chromosome, complete genome.
Accession: NC_002737
Length: 1,852,441

The map label for this gene is alr

Identifier: 15675637

GI number: 15675637

Start: 1494943

End: 1496043

Strand: Reverse

Name: alr

Synonym: SPy_1802

Alternate gene names: 15675637

Gene position: 1496043-1494943 (Counterclockwise)

Preceding gene: 15675638

Following gene: 15675636

Centisome position: 80.76
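
The coordinate fields above are related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming that the centisome value is computed from the first base of the gene on its own strand (1496043 for this reverse-strand gene). The function names are illustrative and are not part of the source database.

```python
GENOME_LENGTH = 1_852_441  # chromosome length listed for NC_002737


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int = GENOME_LENGTH) -> float:
    """Position expressed as a percentage of the chromosome length."""
    return round(100.0 * position / genome_length, 2)


print(gene_length(1494943, 1496043))  # 1101
print(centisome(1496043))             # 80.76
```

Both results agree with the record: the gene sequence is listed as 1101 bases and the centisome position as 80.76.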

GC content: 39.51

Gene sequence:

>1101_bases
ATGATTTCAAGTTTCCATCGCCCAACAGTTGCAAGGGTCAACTTACAAGCTATTAAGGAGAATGTTGCCAGTGTTCAAAA
GCATATTCCACTAGGGGTAAAAACGTATGCAGTTGTCAAGGCTGATGCTTATGGTCATGGTGCTGTCCAGGTGTCAAAAG
CACTCCTACCTCAAGTGGATGGGTACTGTGTGTCAAATCTTGATGAGGCTTTGCAATTACGTCAAGCAGGTATTGATAAA
GAGATTTTAATTCTTGGGGTTTTGCTGCCAAATGAATTAGAGTTAGCAGTTGCTAATGCTATTACTGTTACAATCGCTAG
TTTAGACTGGATAGCTTTAGCTAGACTGGAGAAAAAAGAATGTCAAGGCTTAAAAGTTCATGTAAAAGTTGATTCTGGTA
TGGGGCGGATCGGGCTTCGTTCTTCAAAAGAAGTCAATTTATTGATTGATAGTCTAAAAGAGTTGGGTGCTGATGTAGAA
GGTATTTTCACTCATTTTGCCACAGCTGATGAGGCAGATGATACTAAATTTAACCAGCAGTTACAGTTTTTTAAAAAGCT
GATAGCTGGACTTGAGGATAAGCCTCGTTTAGTACATGCTAGTAATTCAGCCACAAGTATCTGGCATAGTGATACCATTT
TTAATGCTGTTCGTTTAGGAATTGTCAGTTATGGTTTGAATCCAAGTGGTTCTGATCTAAGCTTACCGTTTCCACTGCAA
GAGGCTTTATCTCTAGAATCTAGCTTAGTGCATGTCAAGATGATTTCAGCTGGTGATACAGTCGGTTATGGAGCTACTTA
TACTGCCAAAAAGTCTGAATATGTAGGGACTGTCCCAATCGGTTATGCAGATGGCTGGACCAGGAACATGCAAGGCTTTT
CGGTGTTAGTTGATGGACAATTCTGCGAAATTATAGGGCGTGTATCGATGGATCAACTGACCATACGACTTCCCAAAGCA
TATCCTTTAGGAACAAAAGTCACTTTGATTGGCAGCAATCAGCAAAAAAATATTTCTACAACAGATATCGCAAATTACCG
TAATACAATCAATTATGAAGTTCTATGCCTTTTAAGTGACCGTATTCCTCGGATATATTAA
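
The GC content field can be recomputed from the gene sequence. This is a minimal sketch, assuming GC content means the percentage of G and C bases over the full sequence; `gc_content` is a hypothetical helper name. A value of 39.51 over 1101 bases corresponds to 435 G or C bases, so the example uses a stand-in sequence with that composition rather than the full gene.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases, rounded to two decimal places."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    gc = dna.count("G") + dna.count("C")
    return round(100.0 * gc / len(dna), 2)


# Stand-in with 435 G/C out of 1101 bases (assumed composition, see above).
stand_in = "G" * 435 + "A" * 666
print(gc_content(stand_in))  # 39.51
```

Passing the 1101-base gene sequence itself, with line breaks removed, works the same way.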

Upstream 100 bases:

>100_bases
GTCCCATTCTGACAAAATCTCCGTTTAAAGGAAATAGTTTTATTAGCATCTCACATAGTGGCAATTATGTACAGGCTAGT
GTTATTTTGGAGGATAAAAA

Downstream 100 bases:

>100_bases
GAAAATCATGTAAAAATAATTGCATTGTGCTTATCTTAAATGTTATAATAAAGTGAGGACATTAGAAAGAAGTGACTAAT
TAATATATGAATAAAAACAA

Product: alanine racemase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 366; Mature: 366

Protein sequence:

>366_residues
MISSFHRPTVARVNLQAIKENVASVQKHIPLGVKTYAVVKADAYGHGAVQVSKALLPQVDGYCVSNLDEALQLRQAGIDK
EILILGVLLPNELELAVANAITVTIASLDWIALARLEKKECQGLKVHVKVDSGMGRIGLRSSKEVNLLIDSLKELGADVE
GIFTHFATADEADDTKFNQQLQFFKKLIAGLEDKPRLVHASNSATSIWHSDTIFNAVRLGIVSYGLNPSGSDLSLPFPLQ
EALSLESSLVHVKMISAGDTVGYGATYTAKKSEYVGTVPIGYADGWTRNMQGFSVLVDGQFCEIIGRVSMDQLTIRLPKA
YPLGTKVTLIGSNQQKNISTTDIANYRNTINYEVLCLLSDRIPRIY
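
The protein sequence follows from the gene sequence by codon translation. The gene sequence is already listed 5' to 3' on the coding strand (it starts with ATG), so no reverse complement is needed. The sketch below assumes the standard codon assignments, which the bacterial genetic code shares; `translate` is an illustrative helper, not a function of the source database.

```python
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
# Build the 64-entry codon table from the compact layout above.
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


print(translate("ATGATTTCAAGTTTCCATCGCCCAACAGTT"))  # MISSFHRPTV
print(translate("CGTATTCCTCGGATATATTAA"))           # RIPRIY*
print(1101 // 3 - 1)                                # 366
```

The first call uses the first 30 bases of the gene and the second uses its last 21 bases. The 1101 bases form 367 codons, the last of which is the TAA stop codon, leaving 366 residues.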

Sequences:

>Translated_366_residues
MISSFHRPTVARVNLQAIKENVASVQKHIPLGVKTYAVVKADAYGHGAVQVSKALLPQVDGYCVSNLDEALQLRQAGIDK
EILILGVLLPNELELAVANAITVTIASLDWIALARLEKKECQGLKVHVKVDSGMGRIGLRSSKEVNLLIDSLKELGADVE
GIFTHFATADEADDTKFNQQLQFFKKLIAGLEDKPRLVHASNSATSIWHSDTIFNAVRLGIVSYGLNPSGSDLSLPFPLQ
EALSLESSLVHVKMISAGDTVGYGATYTAKKSEYVGTVPIGYADGWTRNMQGFSVLVDGQFCEIIGRVSMDQLTIRLPKA
YPLGTKVTLIGSNQQKNISTTDIANYRNTINYEVLCLLSDRIPRIY
>Mature_366_residues
MISSFHRPTVARVNLQAIKENVASVQKHIPLGVKTYAVVKADAYGHGAVQVSKALLPQVDGYCVSNLDEALQLRQAGIDK
EILILGVLLPNELELAVANAITVTIASLDWIALARLEKKECQGLKVHVKVDSGMGRIGLRSSKEVNLLIDSLKELGADVE
GIFTHFATADEADDTKFNQQLQFFKKLIAGLEDKPRLVHASNSATSIWHSDTIFNAVRLGIVSYGLNPSGSDLSLPFPLQ
EALSLESSLVHVKMISAGDTVGYGATYTAKKSEYVGTVPIGYADGWTRNMQGFSVLVDGQFCEIIGRVSMDQLTIRLPKA
YPLGTKVTLIGSNQQKNISTTDIANYRNTINYEVLCLLSDRIPRIY

Specific function: Provides the D-alanine required for cell wall biosynthesis

COG id: COG0787

COG function: function code M; Alanine racemase

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the alanine racemase family

Homologues:

Organism=Escherichia coli, GI1787439, Length=365, Percent_Identity=33.972602739726, Blast_Score=182, Evalue=2e-47
Organism=Escherichia coli, GI1790487, Length=365, Percent_Identity=32.3287671232877, Blast_Score=152, Evalue=3e-38

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): ALR_STRP1 (Q99Y98)

Other databases:

- EMBL:   AE004092
- EMBL:   CP000017
- RefSeq:   NP_269811.1
- RefSeq:   YP_282895.1
- ProteinModelPortal:   Q99Y98
- SMR:   Q99Y98
- EnsemblBacteria:   EBSTRT00000000052
- EnsemblBacteria:   EBSTRT00000028276
- GeneID:   3571377
- GeneID:   902028
- GenomeReviews:   AE004092_GR
- GenomeReviews:   CP000017_GR
- KEGG:   spy:SPy_1802
- KEGG:   spz:M5005_Spy_1532
- GeneTree:   EBGT00050000028408
- HOGENOM:   HBG712172
- OMA:   TRKEDAN
- ProtClustDB:   PRK00053
- BioCyc:   SPYO160490:SPY1802-MONOMER
- BioCyc:   SPYO293653:M5005_SPY1532-MONOMER
- HAMAP:   MF_01201
- InterPro:   IPR000821
- InterPro:   IPR009006
- InterPro:   IPR011079
- InterPro:   IPR001608
- InterPro:   IPR020622
- Gene3D:   G3DSA:2.40.37.10
- PRINTS:   PR00992
- TIGRFAMs:   TIGR00492

Pfam domain/function: PF00842 Ala_racemase_C; PF01168 Ala_racemase_N; SSF50621 Racem_decarbox_C

EC number: 5.1.1.1

Molecular weight: Translated: 39901; Mature: 39901

Theoretical pI: Translated: 7.23; Mature: 7.23

Prosite motif: PS00395 ALANINE_RACEMASE

Important sites: ACT_SITE 40-40; ACT_SITE 263-263

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.1 %Cys     (Translated Protein)
1.4 %Met     (Translated Protein)
2.5 %Cys+Met (Translated Protein)
1.1 %Cys     (Mature Protein)
1.4 %Met     (Mature Protein)
2.5 %Cys+Met (Mature Protein)
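
The percentages above can be reproduced by counting residues. The 366-residue protein sequence listed in this record contains 4 cysteines and 5 methionines. The sketch below assumes the values are rounded to one decimal place; `cys_met_content` is a hypothetical helper name.

```python
def cys_met_content(protein: str) -> dict:
    """Percent Cys, Met and Cys+Met, rounded to one decimal place."""
    protein = protein.upper()
    n = len(protein)
    if n == 0:
        raise ValueError("empty sequence")
    cys = protein.count("C")
    met = protein.count("M")
    return {
        "Cys": round(100.0 * cys / n, 1),
        "Met": round(100.0 * met / n, 1),
        "Cys+Met": round(100.0 * (cys + met) / n, 1),
    }


# Stand-in sequence with the same composition as the 366-residue protein
# (4 Cys, 5 Met); the real sequence can be passed in the same way.
stand_in = "C" * 4 + "M" * 5 + "A" * 357
print(cys_met_content(stand_in))
# {'Cys': 1.1, 'Met': 1.4, 'Cys+Met': 2.5}
```

Because the translated and mature proteins are identical in this record, both sets of values are the same.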

Secondary structure:

>Translated Secondary Structure
MISSFHRPTVARVNLQAIKENVASVQKHIPLGVKTYAVVKADAYGHGAVQVSKALLPQVD
CCCCCCCCEEEEEHHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCCCHHHHHHHHHCCCC
GYCVSNLDEALQLRQAGIDKEILILGVLLPNELELAVANAITVTIASLDWIALARLEKKE
CHHHHHHHHHHHHHHHCCCCEEEEEEEECCCCEEEEEEEEEEEEEEHHHHHHHHHHHHHH
CQGLKVHVKVDSGMGRIGLRSSKEVNLLIDSLKELGADVEGIFTHFATADEADDTKFNQQ
CCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHCCCCCCCCHHHHHH
LQFFKKLIAGLEDKPRLVHASNSATSIWHSDTIFNAVRLGIVSYGLNPSGSDLSLPFPLQ
HHHHHHHHHCCCCCCCEEEECCCCCEEECCHHHHHHHHHHHHHCCCCCCCCCCCCCCCHH
EALSLESSLVHVKMISAGDTVGYGATYTAKKSEYVGTVPIGYADGWTRNMQGFSVLVDGQ
HHHHHHHCEEEEEEEECCCCCCCCCEEECCCCCEEEEEEECCCCCCCCCCCCCEEEECCH
FCEIIGRVSMDQLTIRLPKAYPLGTKVTLIGSNQQKNISTTDIANYRNTINYEVLCLLSD
HHHHHHCCCCCCEEEECCCCCCCCCEEEEEECCCCCCCCHHHHHHHCCCCCEEEEEEECC
RIPRIY
CCCCCC
>Mature Secondary Structure
MISSFHRPTVARVNLQAIKENVASVQKHIPLGVKTYAVVKADAYGHGAVQVSKALLPQVD
CCCCCCCCEEEEEHHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCCCHHHHHHHHHCCCC
GYCVSNLDEALQLRQAGIDKEILILGVLLPNELELAVANAITVTIASLDWIALARLEKKE
CHHHHHHHHHHHHHHHCCCCEEEEEEEECCCCEEEEEEEEEEEEEEHHHHHHHHHHHHHH
CQGLKVHVKVDSGMGRIGLRSSKEVNLLIDSLKELGADVEGIFTHFATADEADDTKFNQQ
CCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHCCCCCCCCHHHHHH
LQFFKKLIAGLEDKPRLVHASNSATSIWHSDTIFNAVRLGIVSYGLNPSGSDLSLPFPLQ
HHHHHHHHHCCCCCCCEEEECCCCCEEECCHHHHHHHHHHHHHCCCCCCCCCCCCCCCHH
EALSLESSLVHVKMISAGDTVGYGATYTAKKSEYVGTVPIGYADGWTRNMQGFSVLVDGQ
HHHHHHHCEEEEEEEECCCCCCCCCEEECCCCCEEEEEEECCCCCCCCCCCCCEEEECCH
FCEIIGRVSMDQLTIRLPKAYPLGTKVTLIGSNQQKNISTTDIANYRNTINYEVLCLLSD
HHHHHHCCCCCCEEEECCCCCCCCCEEEEEECCCCCCCCHHHHHHHCCCCCEEEEEEECC
RIPRIY
CCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11296296