Definition | Streptococcus pyogenes M1 GAS chromosome, complete genome. |
---|---|
Accession | NC_002737 |
Length | 1,852,441 |
Click here to switch to the map view.
The map label for this gene is emm1 [H]
Identifier: 15675799
GI number: 15675799
Start: 1683787
End: 1685241
Strand: Reverse
Name: emm1 [H]
Synonym: SPy_2018
Alternate gene names: 15675799
Gene position: 1685241-1683787 (Counterclockwise)
Preceding gene: 15675800
Following gene: 15675798
Centisome position: 90.97
GC content: 39.31
Gene sequence:
>1455_bases ATGGCTAAAAATAACACGAATAGACACTATTCGCTTAGAAAATTAAAAACAGGAACGGCTTCAGTAGCGGTAGCTTTGAC TGTTTTAGGGGCAGGTTTTGCGAATCAAACAGAGGTTAAGGCTAACGGTGATGGTAATCCTAGGGAAGTTATAGAAGATC TTGCAGCAAACAATCCCGCAATACAAAATATACGTTTACGTTACGAAAACAAGGACTTAAAAGCGAGATTAGAGAATGCA ATGGAAGTTGCAGGAAGAGATTTTAAGAGAGCTGAAGAACTTGAAAAAGCAAAACAAGCCTTAGAAGACCAGCGTAAAGA TTTAGAAACTAAATTAAAAGAACTACAACAAGACTATGACTTAGCAAAGGAATCAACAAGTTGGGATAGACAAAGACTTG AAAAAGAGTTAGAAGAGAAAAAGGAAGCTCTTGAATTAGCGATAGACCAGGCAAGTCGGGACTACCATAGAGCTACCGCT TTAGAAAAAGAGTTAGAAGAGAAAAAGAAAGCTCTTGAATTAGCGATAGACCAAGCGAGTCAGGACTATAATAGAGCTAA CGTCTTAGAAAAAGAGTTAGAAACGATTACTAGAGAACAAGAGATTAATCGTAATCTTTTAGGCAATGCAAAACTTGAAC TTGATCAACTTTCATCTGAAAAAGAGCAGCTAACGATCGAAAAAGCAAAACTTGAGGAAGAAAAACAAATCTCAGACGCA AGTCGTCAAAGCCTTCGTCGTGACTTGGACGCATCACGTGAAGCTAAGAAACAGGTTGAAAAAGATTTAGCAAACTTGAC TGCTGAACTTGATAAGGTTAAAGAAGACAAACAAATCTCAGACGCAAGCCGTCAAGGCCTTCGCCGTGACTTGGACGCAT CACGTGAAGCTAAGAAACAGGTTGAAAAAGATTTAGCAAACTTGACTGCTGAACTTGATAAGGTTAAAGAAGAAAAACAA ATCTCAGACGCAAGCCGTCAAGGCCTTCGCCGTGACTTGGACGCATCACGTGAAGCTAAGAAACAAGTTGAAAAAGCTTT AGAAGAAGCAAACAGCAAATTAGCTGCTCTTGAAAAACTTAACAAAGAGCTTGAAGAAAGCAAGAAATTAACAGAAAAAG AAAAAGCTGAACTACAAGCAAAACTTGAAGCAGAAGCAAAAGCACTCAAAGAACAATTAGCGAAACAAGCTGAAGAACTT GCAAAACTAAGAGCTGGAAAAGCATCAGACTCACAAACCCCTGATACAAAACCAGGAAACAAAGCTGTTCCAGGTAAAGG TCAAGCACCACAAGCAGGTACAAAACCTAACCAAAACAAAGCACCAATGAAGGAAACTAAGAGACAGTTACCATCAACAG GTGAAACAGCTAACCCATTCTTCACAGCGGCAGCCCTTACTGTTATGGCAACAGCTGGAGTAGCAGCAGTTGTAAAACGC AAAGAAGAAAACTAA
Upstream 100 bases:
>100_bases CTTTACCTTTTGGCTTTTATTATTTACAATAGAATTATTAGAGTTAAACCCTGAAAATGAGGGTTTCTTCCTAAAAAATG ATAGCATAAGGAGCATAAAA
Downstream 100 bases:
>100_bases GCTATCACTTTGTAATACTGAGTGAACATCAAGAGAGAACCAGTCGGTTCTCTCTTTTATGTATAGAAGAATGAGGTTAA GGAGAGGTCACAAACTAAAC
Product: M protein type 1
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 484; Mature: 483
Protein sequence:
>484_residues MAKNNTNRHYSLRKLKTGTASVAVALTVLGAGFANQTEVKANGDGNPREVIEDLAANNPAIQNIRLRYENKDLKARLENA MEVAGRDFKRAEELEKAKQALEDQRKDLETKLKELQQDYDLAKESTSWDRQRLEKELEEKKEALELAIDQASRDYHRATA LEKELEEKKKALELAIDQASQDYNRANVLEKELETITREQEINRNLLGNAKLELDQLSSEKEQLTIEKAKLEEEKQISDA SRQSLRRDLDASREAKKQVEKDLANLTAELDKVKEDKQISDASRQGLRRDLDASREAKKQVEKDLANLTAELDKVKEEKQ ISDASRQGLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKEQLAKQAEEL AKLRAGKASDSQTPDTKPGNKAVPGKGQAPQAGTKPNQNKAPMKETKRQLPSTGETANPFFTAAALTVMATAGVAAVVKR KEEN
Sequences:
>Translated_484_residues MAKNNTNRHYSLRKLKTGTASVAVALTVLGAGFANQTEVKANGDGNPREVIEDLAANNPAIQNIRLRYENKDLKARLENA MEVAGRDFKRAEELEKAKQALEDQRKDLETKLKELQQDYDLAKESTSWDRQRLEKELEEKKEALELAIDQASRDYHRATA LEKELEEKKKALELAIDQASQDYNRANVLEKELETITREQEINRNLLGNAKLELDQLSSEKEQLTIEKAKLEEEKQISDA SRQSLRRDLDASREAKKQVEKDLANLTAELDKVKEDKQISDASRQGLRRDLDASREAKKQVEKDLANLTAELDKVKEEKQ ISDASRQGLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKEQLAKQAEEL AKLRAGKASDSQTPDTKPGNKAVPGKGQAPQAGTKPNQNKAPMKETKRQLPSTGETANPFFTAAALTVMATAGVAAVVKR KEEN >Mature_483_residues AKNNTNRHYSLRKLKTGTASVAVALTVLGAGFANQTEVKANGDGNPREVIEDLAANNPAIQNIRLRYENKDLKARLENAM EVAGRDFKRAEELEKAKQALEDQRKDLETKLKELQQDYDLAKESTSWDRQRLEKELEEKKEALELAIDQASRDYHRATAL EKELEEKKKALELAIDQASQDYNRANVLEKELETITREQEINRNLLGNAKLELDQLSSEKEQLTIEKAKLEEEKQISDAS RQSLRRDLDASREAKKQVEKDLANLTAELDKVKEDKQISDASRQGLRRDLDASREAKKQVEKDLANLTAELDKVKEEKQI SDASRQGLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKEQLAKQAEELA KLRAGKASDSQTPDTKPGNKAVPGKGQAPQAGTKPNQNKAPMKETKRQLPSTGETANPFFTAAALTVMATAGVAAVVKRK EEN
Specific function: Mediates the attachment of S.pyogenes to skin epithelial cells through the binding of the human membrane cofactor protein CD46. Also binds to the factor H and factor H-like protein 1. These interactions could contribute to the fact that the M6 protein pro
COG id: NA
COG function: NA
Gene ontology:
Cell location: Secreted, cell wall; Peptidoglycan-anchor (Potential) [H]
Metaboloic importance: NA
Operon status: Not Known
Operon components: None
Similarity: Belongs to the M protein family [H]
Homologues:
None
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR005877 - InterPro: IPR019948 - InterPro: IPR019950 - InterPro: IPR019931 - InterPro: IPR001899 [H]
Pfam domain/function: PF00746 Gram_pos_anchor; PF04650 YSIRK_signal [H]
EC number: NA
Molecular weight: Translated: 54221; Mature: 54090
Theoretical pI: Translated: 6.68; Mature: 6.68
Prosite motif: PS50847 GRAM_POS_ANCHORING
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.0 %Cys (Translated Protein) 0.8 %Met (Translated Protein) 0.8 %Cys+Met (Translated Protein) 0.0 %Cys (Mature Protein) 0.6 %Met (Mature Protein) 0.6 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MAKNNTNRHYSLRKLKTGTASVAVALTVLGAGFANQTEVKANGDGNPREVIEDLAANNPA CCCCCCCCCHHHHHHHCCHHHHHHHHHHHHCCCCCCCCEECCCCCCHHHHHHHHHCCCCC IQNIRLRYENKDLKARLENAMEVAGRDFKRAEELEKAKQALEDQRKDLETKLKELQQDYD HHHHEEEECCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH LAKESTSWDRQRLEKELEEKKEALELAIDQASRDYHRATALEKELEEKKKALELAIDQAS HHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH QDYNRANVLEKELETITREQEINRNLLGNAKLELDQLSSEKEQLTIEKAKLEEEKQISDA HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHCCHHHHHHHHHHHHHHHHHHHHH SRQSLRRDLDASREAKKQVEKDLANLTAELDKVKEDKQISDASRQGLRRDLDASREAKKQ HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH VEKDLANLTAELDKVKEEKQISDASRQGLRRDLDASREAKKQVEKALEEANSKLAALEKL HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH NKELEESKKLTEKEKAELQAKLEAEAKALKEQLAKQAEELAKLRAGKASDSQTPDTKPGN HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC KAVPGKGQAPQAGTKPNQNKAPMKETKRQLPSTGETANPFFTAAALTVMATAGVAAVVKR CCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH KEEN HCCC >Mature Secondary Structure AKNNTNRHYSLRKLKTGTASVAVALTVLGAGFANQTEVKANGDGNPREVIEDLAANNPA CCCCCCCCHHHHHHHCCHHHHHHHHHHHHCCCCCCCCEECCCCCCHHHHHHHHHCCCCC IQNIRLRYENKDLKARLENAMEVAGRDFKRAEELEKAKQALEDQRKDLETKLKELQQDYD HHHHEEEECCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH LAKESTSWDRQRLEKELEEKKEALELAIDQASRDYHRATALEKELEEKKKALELAIDQAS HHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH QDYNRANVLEKELETITREQEINRNLLGNAKLELDQLSSEKEQLTIEKAKLEEEKQISDA HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHCCHHHHHHHHHHHHHHHHHHHHH SRQSLRRDLDASREAKKQVEKDLANLTAELDKVKEDKQISDASRQGLRRDLDASREAKKQ HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH VEKDLANLTAELDKVKEEKQISDASRQGLRRDLDASREAKKQVEKALEEANSKLAALEKL HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH NKELEESKKLTEKEKAELQAKLEAEAKALKEQLAKQAEELAKLRAGKASDSQTPDTKPGN HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC KAVPGKGQAPQAGTKPNQNKAPMKETKRQLPSTGETANPFFTAAALTVMATAGVAAVVKR CCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH KEEN HCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA