Definition Streptococcus pyogenes M1 GAS chromosome, complete genome.
Accession NC_002737
Length 1,852,441

The map label for this gene is dppA [H]

Identifier: 15675787

GI number: 15675787

Start: 1666616

End: 1668244

Strand: Direct

Name: dppA [H]

Synonym: SPy_2000

Alternate gene names: 15675787

Gene position: 1666616-1668244 (Clockwise)

Preceding gene: 15675785

Following gene: 15675788

Centisome position: 89.97
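
The position fields above are related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming that the centisome position is the start coordinate expressed as a percentage of the chromosome length (an interpretation that reproduces the listed values). The function names are illustrative and not part of the record.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the chromosome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(1666616, 1668244))           # 1629
print(round(centisome(1666616, 1852441), 2))   # 89.97
```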

GC content: 39.9
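
A GC content figure of this kind can be recomputed from a nucleotide sequence. This is a rough sketch, assuming the value is the percentage of G and C bases in the gene sequence; the helper name gc_content is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    # Drop line breaks and spaces left over from FASTA-style wrapping.
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 20 bases of the gene sequence in this record.
print(round(gc_content("GTGTCAAAATACCTAAAATA"), 1))   # 25.0
```

Passing the full 1629-base gene sequence instead of the 20-base prefix would give the gene-wide value.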

Gene sequence:

>1629_bases
GTGTCAAAATACCTAAAATACTTCTCTATTATCACGTTATTTTTGACTGGGCTTATTTTAGTTGCATGTCAACAACAAAA
GCCTCAAACAAAAGAACGTCAGCGCAAACAACGTCCAAAAGACGAACTTGTCGTTTCTATGGGGGCAAAGCTCCCTCATG
AATTCGATCCAAAGGACCGTTATGGAGTCCACAATGAAGGGAATATCACTCATAGCACTCTATTGAAACGTTCTCCTGAA
CTAGATATAAAAGGAGAGCTTGCTAAAACATACCATCTCTCTGAAGATGGGCTGACTTGGTCGTTTGACTTGCATGATGA
TTTTAAATTCTCAAATGGTGAGCCTGTTACTGCTGATGATGTTAAGTTTACTTATGATATGTTGAAAGCAGATGGAAAGG
CTTGGGATCTAACCTTCATTAAGAACGTTGAAGTAGTTGGGAAAAATCAGGTCAATATCCATTTGACTGAGGCGCATTCG
ACATTTACAGCACAGTTGACTGAAATCCCAATCGTCCCTAAAAAACATTACAATGATAAGTATAAGAGCAATCCTATCGG
TTCAGGACCTTACATGGTAAAAGAATATAAGGCTGGAGAACAAGCTATTTTTGTTCGTAACCCTTATTGGCATGGGAAAA
AACCATACTTTAAAAAATGGACTTGGGTCTTACTTGATGAAAACACAGCACTAGCAGCTTTAGAATCTGGTGATGTTGAT
ATGATCTACGCAACGCCAGAACTTGCTGATAAAAAAGTCAAAGGCACCCGCCTCCTTGATATTCCATCAAATGATGTGCG
CGGCTTATCATTACCTTATGTGAAAAAGGGCGTCATCACTGATTCTCCTGATGGTTATCCTGTAGGAAATGATGTCACTA
GTGATCCAGCAATCCGAAAAGCCTTGACTATTGGTTTAAATAGGCAAAAAGTTCTCGATACGGTTTTAAATGGTTATGGT
AAACCAGCTTATTCAATTATTGATAAAACACCATTTTGGAATCCAAAAACAGCCATTAAAGATAATAAAGTAGCTAAAGC
TAAGCAATTATTGACAAAAGCGGGATGGAAAGAACAAGCAGACGGTAGCCGTAAAAAAGGTGACCTTGATGCAGCGTTTG
ATCTGTACTACCCTACTAATGATCAATTGCGAGCGAACTTAGCCGTTGAAGTAGCAGAGCAAGCCAAGGCCCTAGGGATT
ACTATTAAACTCAAAGCTAGTAACTGGGATGAAATGGCAACGAAGTCACATGACTCAGCCTTACTTTATGCCGGAGGACG
TCATCACGCGCAGCAATTTTATGAATCGCATCATCCAAGCCTAGCAGGGAAAGGTTGGACCAATATTACGTTTTATAACA
ATCCTACCGTGACTAAGTACCTTGACAAAGCAATGACATCTTCTGACCTTGATAAAGCTAACGAATATTGGAAGTTAGCG
CAGTGGGATGGCAAAACAGGTGCTTCTACTCTTGGAGATTTGCCAAATGTATGGTTGGTGAGCCTTAACCATACTTATAT
TGGTGATAAACGTATCAATGTAGGTAAACAAGGCGTCCACAGTCATGGTCATGATTGGTCATTATTGACTAACATTGCCG
AGTGGACTTGGGATGAATCAACTAAGTAA

Upstream 100 bases:

>100_bases
ATTAACCAGTTAAACAATTGCCCTCCTATTTGTTTATTGCTATAATGAGATGGAGTCGTTACTATTATTATTTATTAATT
AGAAATAAGGAGATTTGATT

Downstream 100 bases:

>100_bases
CGCGTTTAGTTAATACAGCGACTTTAAGTTTGGAGTCTGAGAAAACAAGATACACAAAACTGTGCTGATCGTTTCCTCGG
CTCCTTTTCTTATCGATTAA

Product: surface lipoprotein

Products: Ni2+ [Cytoplasm]; ADP; phosphate [C]

Alternate protein names: NA

Number of amino acids: Translated: 542; Mature: 541
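
The two residue counts follow from the 1629-base gene length. A minimal sketch, assuming the final codon is a stop codon and that the mature form differs from the translated form only by removal of the initiator methionine (consistent with the mature sequence in this record starting at S); the function name is illustrative.

```python
def residue_counts(n_bases: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = n_bases // 3
    translated = codons - 1   # the stop codon encodes no residue
    mature = translated - 1   # assumption: initiator Met is removed
    return translated, mature


print(residue_counts(1629))   # (542, 541)
```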

Protein sequence:

>542_residues
MSKYLKYFSIITLFLTGLILVACQQQKPQTKERQRKQRPKDELVVSMGAKLPHEFDPKDRYGVHNEGNITHSTLLKRSPE
LDIKGELAKTYHLSEDGLTWSFDLHDDFKFSNGEPVTADDVKFTYDMLKADGKAWDLTFIKNVEVVGKNQVNIHLTEAHS
TFTAQLTEIPIVPKKHYNDKYKSNPIGSGPYMVKEYKAGEQAIFVRNPYWHGKKPYFKKWTWVLLDENTALAALESGDVD
MIYATPELADKKVKGTRLLDIPSNDVRGLSLPYVKKGVITDSPDGYPVGNDVTSDPAIRKALTIGLNRQKVLDTVLNGYG
KPAYSIIDKTPFWNPKTAIKDNKVAKAKQLLTKAGWKEQADGSRKKGDLDAAFDLYYPTNDQLRANLAVEVAEQAKALGI
TIKLKASNWDEMATKSHDSALLYAGGRHHAQQFYESHHPSLAGKGWTNITFYNNPTVTKYLDKAMTSSDLDKANEYWKLA
QWDGKTGASTLGDLPNVWLVSLNHTYIGDKRINVGKQGVHSHGHDWSLLTNIAEWTWDESTK
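
The protein sequence starts with M although the gene sequence starts with GTG rather than ATG. In bacteria an alternative start codon such as GTG is still read as methionine at the initiation position. The sketch below illustrates this with the standard genetic code; the table construction and the function name translate_cds are illustrative choices, not something taken from the record.

```python
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        # Assumption: whatever the initiator codon is, it is read as Met.
        protein.append("M" if pos == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGTCAAAATACCTAAAATACTTCTCTATT"))   # MSKYLKYFSI
```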

Sequences:

>Translated_542_residues
MSKYLKYFSIITLFLTGLILVACQQQKPQTKERQRKQRPKDELVVSMGAKLPHEFDPKDRYGVHNEGNITHSTLLKRSPE
LDIKGELAKTYHLSEDGLTWSFDLHDDFKFSNGEPVTADDVKFTYDMLKADGKAWDLTFIKNVEVVGKNQVNIHLTEAHS
TFTAQLTEIPIVPKKHYNDKYKSNPIGSGPYMVKEYKAGEQAIFVRNPYWHGKKPYFKKWTWVLLDENTALAALESGDVD
MIYATPELADKKVKGTRLLDIPSNDVRGLSLPYVKKGVITDSPDGYPVGNDVTSDPAIRKALTIGLNRQKVLDTVLNGYG
KPAYSIIDKTPFWNPKTAIKDNKVAKAKQLLTKAGWKEQADGSRKKGDLDAAFDLYYPTNDQLRANLAVEVAEQAKALGI
TIKLKASNWDEMATKSHDSALLYAGGRHHAQQFYESHHPSLAGKGWTNITFYNNPTVTKYLDKAMTSSDLDKANEYWKLA
QWDGKTGASTLGDLPNVWLVSLNHTYIGDKRINVGKQGVHSHGHDWSLLTNIAEWTWDESTK
>Mature_541_residues
SKYLKYFSIITLFLTGLILVACQQQKPQTKERQRKQRPKDELVVSMGAKLPHEFDPKDRYGVHNEGNITHSTLLKRSPEL
DIKGELAKTYHLSEDGLTWSFDLHDDFKFSNGEPVTADDVKFTYDMLKADGKAWDLTFIKNVEVVGKNQVNIHLTEAHST
FTAQLTEIPIVPKKHYNDKYKSNPIGSGPYMVKEYKAGEQAIFVRNPYWHGKKPYFKKWTWVLLDENTALAALESGDVDM
IYATPELADKKVKGTRLLDIPSNDVRGLSLPYVKKGVITDSPDGYPVGNDVTSDPAIRKALTIGLNRQKVLDTVLNGYGK
PAYSIIDKTPFWNPKTAIKDNKVAKAKQLLTKAGWKEQADGSRKKGDLDAAFDLYYPTNDQLRANLAVEVAEQAKALGIT
IKLKASNWDEMATKSHDSALLYAGGRHHAQQFYESHHPSLAGKGWTNITFYNNPTVTKYLDKAMTSSDLDKANEYWKLAQ
WDGKTGASTLGDLPNVWLVSLNHTYIGDKRINVGKQGVHSHGHDWSLLTNIAEWTWDESTK

Specific function: Required for transport of an unidentified substrate [H]

COG id: COG0747

COG function: function code E; ABC-type dipeptide transport system, periplasmic component

Gene ontology:

Cell location: Cell membrane; Lipid-anchor (Probable) [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the bacterial solute-binding protein 5 family [H]

Homologues:

Organism=Escherichia coli, GI1789887, Length=389, Percent_Identity=27.2493573264781, Blast_Score=114, Evalue=2e-26,
Organism=Escherichia coli, GI1787762, Length=476, Percent_Identity=25.2100840336134, Blast_Score=106, Evalue=5e-24,
Organism=Escherichia coli, GI1787052, Length=466, Percent_Identity=23.8197424892704, Blast_Score=84, Evalue=3e-17,
Organism=Escherichia coli, GI1789966, Length=440, Percent_Identity=24.3181818181818, Blast_Score=78, Evalue=1e-15,
Organism=Escherichia coli, GI87081878, Length=449, Percent_Identity=25.6124721603563, Blast_Score=75, Evalue=1e-14,
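
The homologue lines above share a simple comma-separated key=value layout, with the GI number given as a bare token. A minimal parsing sketch under that assumption; the function name parse_hit and the "GI" dictionary key are hypothetical.

```python
def parse_hit(line: str) -> dict[str, str]:
    """Split one homologue line into a dictionary of its fields."""
    fields: dict[str, str] = {}
    for part in line.strip().rstrip(",").split(","):
        part = part.strip()
        if "=" in part:
            key, value = part.split("=", 1)
            fields[key] = value
        elif part.startswith("GI"):
            # The GI number appears without a key; store only the digits.
            fields["GI"] = part[2:]
    return fields


hit = parse_hit(
    "Organism=Escherichia coli, GI1789887, Length=389, "
    "Percent_Identity=27.2493573264781, Blast_Score=114, Evalue=2e-26,"
)
print(hit["Organism"], hit["GI"], round(float(hit["Percent_Identity"]), 1))
# Escherichia coli 1789887 27.2
```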

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000914 [H]

Pfam domain/function: PF00496 SBP_bac_5 [H]

EC number: NA

Molecular weight: Translated: 61048; Mature: 60916

Theoretical pI: Translated: 9.23; Mature: 9.23

Prosite motif: PS00013 PROKAR_LIPOPROTEIN

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.2 %Cys     (Translated Protein)
1.3 %Met     (Translated Protein)
1.5 %Cys+Met (Translated Protein)
0.2 %Cys     (Mature Protein)
1.1 %Met     (Mature Protein)
1.3 %Cys+Met (Mature Protein)
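
The percentages above can be reproduced by counting residues in the protein sequence from this record. A minimal sketch, assuming each value is a residue count divided by sequence length and rounded to one decimal place, and that the mature form is the translated form without its first residue; the names are illustrative.

```python
TRANSLATED = (
    "MSKYLKYFSIITLFLTGLILVACQQQKPQTKERQRKQRPKDELVVSMGAKLPHEFDPKDRYGVHNEGNITHSTLLKRSPE"
    "LDIKGELAKTYHLSEDGLTWSFDLHDDFKFSNGEPVTADDVKFTYDMLKADGKAWDLTFIKNVEVVGKNQVNIHLTEAHS"
    "TFTAQLTEIPIVPKKHYNDKYKSNPIGSGPYMVKEYKAGEQAIFVRNPYWHGKKPYFKKWTWVLLDENTALAALESGDVD"
    "MIYATPELADKKVKGTRLLDIPSNDVRGLSLPYVKKGVITDSPDGYPVGNDVTSDPAIRKALTIGLNRQKVLDTVLNGYG"
    "KPAYSIIDKTPFWNPKTAIKDNKVAKAKQLLTKAGWKEQADGSRKKGDLDAAFDLYYPTNDQLRANLAVEVAEQAKALGI"
    "TIKLKASNWDEMATKSHDSALLYAGGRHHAQQFYESHHPSLAGKGWTNITFYNNPTVTKYLDKAMTSSDLDKANEYWKLA"
    "QWDGKTGASTLGDLPNVWLVSLNHTYIGDKRINVGKQGVHSHGHDWSLLTNIAEWTWDESTK"
)
MATURE = TRANSLATED[1:]   # assumption: mature form lacks the initiator Met


def percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    return round(100.0 * sum(aa in residues for aa in seq) / len(seq), 1)


for label, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(label, percent(seq, "C"), percent(seq, "M"), percent(seq, "CM"))
# Translated 0.2 1.3 1.5
# Mature 0.2 1.1 1.3
```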

Secondary structure:

>Translated Secondary Structure
MSKYLKYFSIITLFLTGLILVACQQQKPQTKERQRKQRPKDELVVSMGAKLPHEFDPKDR
CCHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHCCHHHHHHHHCCCCCCCCCCCCC
YGVHNEGNITHSTLLKRSPELDIKGELAKTYHLSEDGLTWSFDLHDDFKFSNGEPVTADD
CCCCCCCCCHHHHHHHCCCCCCCCCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCC
VKFTYDMLKADGKAWDLTFIKNVEVVGKNQVNIHLTEAHSTFTAQLTEIPIVPKKHYNDK
CHHHHHHHHCCCCEEEEEEEECEEEECCCEEEEEEEECCCEEEEEHEECCCCCCCCCCCC
YKSNPIGSGPYMVKEYKAGEQAIFVRNPYWHGKKPYFKKWTWVLLDENTALAALESGDVD
CCCCCCCCCCEEEEEECCCCEEEEEECCCCCCCCCCCCCEEEEEEECCCEEEEEECCCEE
MIYATPELADKKVKGTRLLDIPSNDVRGLSLPYVKKGVITDSPDGYPVGNDVTSDPAIRK
EEEECHHHCCCCCCCCEEEECCCCCCCCCCCCHHHCCCCCCCCCCCCCCCCCCCCHHHHH
ALTIGLNRQKVLDTVLNGYGKPAYSIIDKTPFWNPKTAIKDNKVAKAKQLLTKAGWKEQA
HHHCCCCHHHHHHHHHHCCCCCHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCC
DGSRKKGDLDAAFDLYYPTNDQLRANLAVEVAEQAKALGITIKLKASNWDEMATKSHDSA
CCCCCCCCCCEEEEEEECCCCCEEHHHHHHHHHHHHEEEEEEEEECCCCHHHHHCCCCCE
LLYAGGRHHAQQFYESHHPSLAGKGWTNITFYNNPTVTKYLDKAMTSSDLDKANEYWKLA
EEEECCHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHEEEE
QWDGKTGASTLGDLPNVWLVSLNHTYIGDKRINVGKQGVHSHGHDWSLLTNIAEWTWDES
ECCCCCCCHHHCCCCCEEEEEECCEEECCCEECCCCCCCCCCCCCHHHHHHHHHHCCCCC
TK
CC
>Mature Secondary Structure 
SKYLKYFSIITLFLTGLILVACQQQKPQTKERQRKQRPKDELVVSMGAKLPHEFDPKDR
CHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHCCHHHHHHHHCCCCCCCCCCCCC
YGVHNEGNITHSTLLKRSPELDIKGELAKTYHLSEDGLTWSFDLHDDFKFSNGEPVTADD
CCCCCCCCCHHHHHHHCCCCCCCCCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCC
VKFTYDMLKADGKAWDLTFIKNVEVVGKNQVNIHLTEAHSTFTAQLTEIPIVPKKHYNDK
CHHHHHHHHCCCCEEEEEEEECEEEECCCEEEEEEEECCCEEEEEHEECCCCCCCCCCCC
YKSNPIGSGPYMVKEYKAGEQAIFVRNPYWHGKKPYFKKWTWVLLDENTALAALESGDVD
CCCCCCCCCCEEEEEECCCCEEEEEECCCCCCCCCCCCCEEEEEEECCCEEEEEECCCEE
MIYATPELADKKVKGTRLLDIPSNDVRGLSLPYVKKGVITDSPDGYPVGNDVTSDPAIRK
EEEECHHHCCCCCCCCEEEECCCCCCCCCCCCHHHCCCCCCCCCCCCCCCCCCCCHHHHH
ALTIGLNRQKVLDTVLNGYGKPAYSIIDKTPFWNPKTAIKDNKVAKAKQLLTKAGWKEQA
HHHCCCCHHHHHHHHHHCCCCCHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCC
DGSRKKGDLDAAFDLYYPTNDQLRANLAVEVAEQAKALGITIKLKASNWDEMATKSHDSA
CCCCCCCCCCEEEEEEECCCCCEEHHHHHHHHHHHHEEEEEEEEECCCCHHHHHCCCCCE
LLYAGGRHHAQQFYESHHPSLAGKGWTNITFYNNPTVTKYLDKAMTSSDLDKANEYWKLA
EEEECCHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHEEEE
QWDGKTGASTLGDLPNVWLVSLNHTYIGDKRINVGKQGVHSHGHDWSLLTNIAEWTWDES
ECCCCCCCHHHCCCCCEEEEEECCEEECCCEECCCCCCCCCCCCCHHHHHHHHHHCCCCC
TK
CC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: Ni2+ [Periplasm]; ATP; H2O [C]

Specific reaction: Ni2+ [Periplasm] + ATP + H2O = Ni2+ [Cytoplasm] + ADP + phosphate [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 3453116 [H]