Definition Streptococcus pyogenes MGAS5005 chromosome, complete genome.
Accession NC_007297
Length 1,838,554

Click here to switch to the map view.

The map label for this gene is int.1 [H]

Identifier: 71910865

GI number: 71910865

Start: 1020579

End: 1021667

Strand: Direct

Name: int.1 [H]

Synonym: M5005_Spy_1052

Alternate gene names: 71910865

Gene position: 1020579-1021667 (Clockwise)

Preceding gene: 71910864

Following gene: 71910866

Centisome position: 55.51

GC content: 33.43

Gene sequence:

>1089_bases
ATGAAAATTAAATCATATAAAAAGGAAAATGGTGAAACTGCTTATAAATTTCTTTTGTATGCCGGTTATGTTAATGGAAA
AAGAAAATATATTAGGCGAGAAGGTTTCAAAACTAAGCAGGCTGCAAGGGAAACCTTAATTAGTTTACAAGCTGAACTTG
ATAAACCTAAATCAAGTATGACATTTGGAGCATTGACAGATCAATGGCTAAAGGAATATGAAAAAACCGTTCAGTGCAGT
ACCTACTTAAAAACAGAAAGAAATATTAATAAACATATTTTGCCAAAACTTGATAAAGTGAAGATTGGAGACATCAACCC
ACTACTTATCCAGCGGCTTACTGAAGAATGGTGCAACGATTTAAAATATGGAGGAAAAATTCTTGGGCTTGTTAGGAATA
TCTTAAATCTAGCTGTTAGATACGGATATATCAATAACAATCCAGCTTTGCCAATTACACCTCCAAAAATAAAAAGGAAA
AGAAAAATGAATAATAATTTTTATACACTTGATCAACTTAAACAATTCCTTGAACTAGTTGAAAAAACTGACAACATTGA
AAAAATAGCCTTGTTTAGATTATTAGCATTTACTGGAATACGAAAAGGGGAGCTTCTGGCACTAACTTGGGATGATTTGA
ATGGTAATACTCTATCAATTAATAAAGCTGTCACACGTACTCAAGTTGGACTAGAAATAGATGTTACGAAGACAAAATCG
AGCGATAGATTAATCAGCTTAGATGATGAAACTTTGGAAATTTTACTAGAACTTCATGAAACTTTTCCTACTTCTACTCT
TATGTTCCAATCTGAATCAGGTGGAATTATGACGCCAAGTTTACCACGAAAATGGCTATTGCAAATTATCAAAGGGACAG
ACTTACCACAAATCACAATTCATGGTTTCAGGCACACTCATGCAAGCTTACTTTTCGAATCAGGTCTATCCTTGAAACAG
GTGCAACATAGATTAGGGCATGGAGATTTACAGACAACTATGAACGTATATACTCACATCACGCAATCGGCAATTGATGA
CATTGGAACTAAATTCAATCAATTTGTTACTAACAAGCAACTAGATTGA

Upstream 100 bases:

>100_bases
AGTTTGGCGACTCTGAGCGTGAGGCAAGACAGTATAAGAAACAACCATTAAAAAGGTCATTTTCTTGTACCTATTTTATC
AAATTGAAAGATGGTATGCA

Downstream 100 bases:

>100_bases
CAACTAATTCTCAACAAACGTTAATTTAACAACATTCAAGTAACTCCCACCAGCTCCATCAATGCTTACCGTAAGTAATC
ATAACTTACTAAAACCTTGT

Product: integrase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 362; Mature: 362

Protein sequence:

>362_residues
MKIKSYKKENGETAYKFLLYAGYVNGKRKYIRREGFKTKQAARETLISLQAELDKPKSSMTFGALTDQWLKEYEKTVQCS
TYLKTERNINKHILPKLDKVKIGDINPLLIQRLTEEWCNDLKYGGKILGLVRNILNLAVRYGYINNNPALPITPPKIKRK
RKMNNNFYTLDQLKQFLELVEKTDNIEKIALFRLLAFTGIRKGELLALTWDDLNGNTLSINKAVTRTQVGLEIDVTKTKS
SDRLISLDDETLEILLELHETFPTSTLMFQSESGGIMTPSLPRKWLLQIIKGTDLPQITIHGFRHTHASLLFESGLSLKQ
VQHRLGHGDLQTTMNVYTHITQSAIDDIGTKFNQFVTNKQLD

Sequences:

>Translated_362_residues
MKIKSYKKENGETAYKFLLYAGYVNGKRKYIRREGFKTKQAARETLISLQAELDKPKSSMTFGALTDQWLKEYEKTVQCS
TYLKTERNINKHILPKLDKVKIGDINPLLIQRLTEEWCNDLKYGGKILGLVRNILNLAVRYGYINNNPALPITPPKIKRK
RKMNNNFYTLDQLKQFLELVEKTDNIEKIALFRLLAFTGIRKGELLALTWDDLNGNTLSINKAVTRTQVGLEIDVTKTKS
SDRLISLDDETLEILLELHETFPTSTLMFQSESGGIMTPSLPRKWLLQIIKGTDLPQITIHGFRHTHASLLFESGLSLKQ
VQHRLGHGDLQTTMNVYTHITQSAIDDIGTKFNQFVTNKQLD
>Mature_362_residues
MKIKSYKKENGETAYKFLLYAGYVNGKRKYIRREGFKTKQAARETLISLQAELDKPKSSMTFGALTDQWLKEYEKTVQCS
TYLKTERNINKHILPKLDKVKIGDINPLLIQRLTEEWCNDLKYGGKILGLVRNILNLAVRYGYINNNPALPITPPKIKRK
RKMNNNFYTLDQLKQFLELVEKTDNIEKIALFRLLAFTGIRKGELLALTWDDLNGNTLSINKAVTRTQVGLEIDVTKTKS
SDRLISLDDETLEILLELHETFPTSTLMFQSESGGIMTPSLPRKWLLQIIKGTDLPQITIHGFRHTHASLLFESGLSLKQ
VQHRLGHGDLQTTMNVYTHITQSAIDDIGTKFNQFVTNKQLD

Specific function: Putative integrase that is involved in the insertion of the integrative and conjugative element ICEBs1. Required for the excision of ICEBs1 from the donor cell genome and subsequent integration in the recipient cell genome. Appears not to be transferred t

COG id: COG0582

COG function: function code L; Integrase

Gene ontology:

Cell location: Cytoplasm [C]

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the 'phage' integrase family [H]

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR011010
- InterPro:   IPR013762
- InterPro:   IPR002104
- InterPro:   IPR023109 [H]

Pfam domain/function: PF00589 Phage_integrase [H]

EC number: NA

Molecular weight: Translated: 41504; Mature: 41504

Theoretical pI: Translated: 10.05; Mature: 10.05

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
1.7 %Met     (Translated Protein)
2.2 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
1.7 %Met     (Mature Protein)
2.2 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MKIKSYKKENGETAYKFLLYAGYVNGKRKYIRREGFKTKQAARETLISLQAELDKPKSSM
CCCCCCCCCCCCHHHHHHHHHHCCCCHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCC
TFGALTDQWLKEYEKTVQCSTYLKTERNINKHILPKLDKVKIGDINPLLIQRLTEEWCND
CHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHCCCCCCEEECCCCHHHHHHHHHHHHHH
LKYGGKILGLVRNILNLAVRYGYINNNPALPITPPKIKRKRKMNNNFYTLDQLKQFLELV
HHHCHHHHHHHHHHHHHHHHHCEECCCCCCCCCCHHHHHHHHCCCCCEEHHHHHHHHHHH
EKTDNIEKIALFRLLAFTGIRKGELLALTWDDLNGNTLSINKAVTRTQVGLEIDVTKTKS
HHCCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCEEEECHHHHHHCCCEEEEEEECCC
SDRLISLDDETLEILLELHETFPTSTLMFQSESGGIMTPSLPRKWLLQIIKGTDLPQITI
CCCEEECCHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCHHHHHHHHCCCCCCEEEE
HGFRHTHASLLFESGLSLKQVQHRLGHGDLQTTMNVYTHITQSAIDDIGTKFNQFVTNKQ
ECCHHHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
LD
CC
>Mature Secondary Structure
MKIKSYKKENGETAYKFLLYAGYVNGKRKYIRREGFKTKQAARETLISLQAELDKPKSSM
CCCCCCCCCCCCHHHHHHHHHHCCCCHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCC
TFGALTDQWLKEYEKTVQCSTYLKTERNINKHILPKLDKVKIGDINPLLIQRLTEEWCND
CHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHCCCCCCEEECCCCHHHHHHHHHHHHHH
LKYGGKILGLVRNILNLAVRYGYINNNPALPITPPKIKRKRKMNNNFYTLDQLKQFLELV
HHHCHHHHHHHHHHHHHHHHHCEECCCCCCCCCCHHHHHHHHCCCCCEEHHHHHHHHHHH
EKTDNIEKIALFRLLAFTGIRKGELLALTWDDLNGNTLSINKAVTRTQVGLEIDVTKTKS
HHCCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCEEEECHHHHHHCCCEEEEEEECCC
SDRLISLDDETLEILLELHETFPTSTLMFQSESGGIMTPSLPRKWLLQIIKGTDLPQITI
CCCEEECCHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCHHHHHHHHCCCCCCEEEE
HGFRHTHASLLFESGLSLKQVQHRLGHGDLQTTMNVYTHITQSAIDDIGTKFNQFVTNKQ
ECCHHHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
LD
CC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9384377 [H]