| Definition | Clostridium botulinum A2 str. Kyoto chromosome, complete genome. |
|---|---|
| Accession | NC_012563 |
| Length | 4,155,278 |
Click here to switch to the map view.
The map label for this gene is int [H]
Identifier: 226948714
GI number: 226948714
Start: 1685563
End: 1686612
Strand: Reverse
Name: int [H]
Synonym: CLM_1614
Alternate gene names: 226948714
Gene position: 1686612-1685563 (Counterclockwise)
Preceding gene: 226948715
Following gene: 226948712
Centisome position: 40.59
GC content: 25.71
Gene sequence:
>1050_bases ATGGCTGTTTATAAAAATGAAGAAAGAGGAACTTATTTTTGTACTTTTTATTACACTGATTGGACAGGTAAAAAGAAAAG AAAAAAGAAAGAAGGTTTTAAAAGACAAAAAGATGCTAAAGACTATGAAAGAGATTTTTTAAATAAGCAAAAGAATGATC CAGGTATTAATTTCGAGAATTTAGTTAATATATATTTAGATGATATTAAAAATAAAATAAGATTTACTACATTTAGACAA AAGAAGCTTATAGTAGATTTAAAGATAGCCCCTTACTTTAAAGATATAAATTTAAATAATATAACCCCAAATCATATTAG GAAATGGCAAAACAAAATAATGGAAAATGATTATAGTGATACATATTTAAGAACTATTAATAATCAACTTAGTGCCATAT TTAACTTTGCTATTAGATATTATAATTTATCTAGCAACCCAGTGGTTAAAGCTGGTCCTATGGGTAAAAAGAACGCTGAC GTAATGCAGTTTTGGACAGTAGAAGAATTTAAAACATTTATAGAATATGTTAAGAAGCCAATTTATAAATTAGCTTTTAA AATTTTGTTCTGGACAGGTATAAGAAGTGGTGAATTATTAGCCTTAACATATAAAGATATAGATTTAGATAGAAAGATTA TAAATATAAATAAGAACTATGCAAGAATAAATAAAAAGGATATAATAAACCCGCCTAAAACTTTGAAGAGTAAAAGAGAA GTTACTATATCTGATTTCCTTTGTGAAGATATTAAAGAATATAAAAATAAGATATACAACTTAAATGAAAACGAAAGAAT TTTTACCATAGCTAAACAAAATATTAATGCACAACTTAATAGAACTTGTAAAAAAAGTGGTATTAAAAAAATACGTCTGC ATGATTTAAGACATTCCCACGCTTCGCTTTTAATAGAATTAGGATTTACTCCGCTTTTAATATCTGAAAGGCTTGGACAT GAAAACATTGAAACTACATTAAATACATATTCACACTTGTACCCAAACAAACACACTGAGGTAGCTAAAGAATTAGATAA ATTATATTAG
Upstream 100 bases:
>100_bases ATGACAATAGGGAAATTAAAAAAATTATCTCGCGCTTTACAAGTTCATCCTGTAAAGCTTTTAGAAATATTATTGAAAGA AGAAAGGAAAGGAAAAAGAA
Downstream 100 bases:
>100_bases TACGTTTCTAGTACGTTTAAAGATTATTCAATTTATATAACGTTGATATATAGGGATTTATTAAGATTTTCGTTATAATC CCACTTGAGTTATCATTATA
Product: phage integrase
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 349; Mature: 348
Protein sequence:
>349_residues MAVYKNEERGTYFCTFYYTDWTGKKKRKKKEGFKRQKDAKDYERDFLNKQKNDPGINFENLVNIYLDDIKNKIRFTTFRQ KKLIVDLKIAPYFKDINLNNITPNHIRKWQNKIMENDYSDTYLRTINNQLSAIFNFAIRYYNLSSNPVVKAGPMGKKNAD VMQFWTVEEFKTFIEYVKKPIYKLAFKILFWTGIRSGELLALTYKDIDLDRKIININKNYARINKKDIINPPKTLKSKRE VTISDFLCEDIKEYKNKIYNLNENERIFTIAKQNINAQLNRTCKKSGIKKIRLHDLRHSHASLLIELGFTPLLISERLGH ENIETTLNTYSHLYPNKHTEVAKELDKLY
Sequences:
>Translated_349_residues MAVYKNEERGTYFCTFYYTDWTGKKKRKKKEGFKRQKDAKDYERDFLNKQKNDPGINFENLVNIYLDDIKNKIRFTTFRQ KKLIVDLKIAPYFKDINLNNITPNHIRKWQNKIMENDYSDTYLRTINNQLSAIFNFAIRYYNLSSNPVVKAGPMGKKNAD VMQFWTVEEFKTFIEYVKKPIYKLAFKILFWTGIRSGELLALTYKDIDLDRKIININKNYARINKKDIINPPKTLKSKRE VTISDFLCEDIKEYKNKIYNLNENERIFTIAKQNINAQLNRTCKKSGIKKIRLHDLRHSHASLLIELGFTPLLISERLGH ENIETTLNTYSHLYPNKHTEVAKELDKLY >Mature_348_residues AVYKNEERGTYFCTFYYTDWTGKKKRKKKEGFKRQKDAKDYERDFLNKQKNDPGINFENLVNIYLDDIKNKIRFTTFRQK KLIVDLKIAPYFKDINLNNITPNHIRKWQNKIMENDYSDTYLRTINNQLSAIFNFAIRYYNLSSNPVVKAGPMGKKNADV MQFWTVEEFKTFIEYVKKPIYKLAFKILFWTGIRSGELLALTYKDIDLDRKIININKNYARINKKDIINPPKTLKSKREV TISDFLCEDIKEYKNKIYNLNENERIFTIAKQNINAQLNRTCKKSGIKKIRLHDLRHSHASLLIELGFTPLLISERLGHE NIETTLNTYSHLYPNKHTEVAKELDKLY
Specific function: Putative integrase that is involved in the insertion of the integrative and conjugative element ICEBs1. Required for the excision of ICEBs1 from the donor cell genome and subsequent integration in the recipient cell genome. Appears not to be transferred t
COG id: COG0582
COG function: function code L; Integrase
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the 'phage' integrase family [H]
Homologues:
Organism=Escherichia coli, GI1786748, Length=235, Percent_Identity=25.9574468085106, Blast_Score=78, Evalue=1e-15, Organism=Escherichia coli, GI1787607, Length=250, Percent_Identity=24.4, Blast_Score=76, Evalue=4e-15,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR011010 - InterPro: IPR013762 - InterPro: IPR002104 - InterPro: IPR023109 [H]
Pfam domain/function: PF00589 Phage_integrase [H]
EC number: NA
Molecular weight: Translated: 41546; Mature: 41415
Theoretical pI: Translated: 10.18; Mature: 10.18
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.9 %Cys (Translated Protein) 1.1 %Met (Translated Protein) 2.0 %Cys+Met (Translated Protein) 0.9 %Cys (Mature Protein) 0.9 %Met (Mature Protein) 1.7 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MAVYKNEERGTYFCTFYYTDWTGKKKRKKKEGFKRQKDAKDYERDFLNKQKNDPGINFEN CCCEECCCCCEEEEEEEEECCCCCHHHHHHHHHHHHHCHHHHHHHHHHCCCCCCCCCHHH LVNIYLDDIKNKIRFTTFRQKKLIVDLKIAPYFKDINLNNITPNHIRKWQNKIMENDYSD HHHHHHHHHHCHHEEEEEECCEEEEEEEECCEEECCCCCCCCHHHHHHHHHHHHCCCCCH TYLRTINNQLSAIFNFAIRYYNLSSNPVVKAGPMGKKNADVMQFWTVEEFKTFIEYVKKP HHHHHHHHHHHHHHHHHHHEEECCCCCEEECCCCCCCCCCHHHEEEHHHHHHHHHHHHHH IYKLAFKILFWTGIRSGELLALTYKDIDLDRKIININKNYARINKKDIINPPKTLKSKRE HHHHHHHHHHHHCCCCCCEEEEEECCCCCCCEEEECCCCHHHCCHHHCCCCCHHHHCCCC VTISDFLCEDIKEYKNKIYNLNENERIFTIAKQNINAQLNRTCKKSGIKKIRLHDLRHSH CHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCHHHHHHHHHCCCCEEHHHHHHHHH ASLLIELGFTPLLISERLGHENIETTLNTYSHLYPNKHTEVAKELDKLY HHEEEECCCCHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHCC >Mature Secondary Structure AVYKNEERGTYFCTFYYTDWTGKKKRKKKEGFKRQKDAKDYERDFLNKQKNDPGINFEN CCEECCCCCEEEEEEEEECCCCCHHHHHHHHHHHHHCHHHHHHHHHHCCCCCCCCCHHH LVNIYLDDIKNKIRFTTFRQKKLIVDLKIAPYFKDINLNNITPNHIRKWQNKIMENDYSD HHHHHHHHHHCHHEEEEEECCEEEEEEEECCEEECCCCCCCCHHHHHHHHHHHHCCCCCH TYLRTINNQLSAIFNFAIRYYNLSSNPVVKAGPMGKKNADVMQFWTVEEFKTFIEYVKKP HHHHHHHHHHHHHHHHHHHEEECCCCCEEECCCCCCCCCCHHHEEEHHHHHHHHHHHHHH IYKLAFKILFWTGIRSGELLALTYKDIDLDRKIININKNYARINKKDIINPPKTLKSKRE HHHHHHHHHHHHCCCCCCEEEEEECCCCCCCEEEECCCCHHHCCHHHCCCCCHHHHCCCC VTISDFLCEDIKEYKNKIYNLNENERIFTIAKQNINAQLNRTCKKSGIKKIRLHDLRHSH CHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCHHHHHHHHHCCCCEEHHHHHHHHH ASLLIELGFTPLLISERLGHENIETTLNTYSHLYPNKHTEVAKELDKLY HHEEEECCCCHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9384377 [H]