Definition | Bacillus thuringiensis str. Al Hakam chromosome, complete genome. |
---|---|
Accession | NC_008600 |
Length | 5,257,091 |
The map label for this gene is ina [H]
Identifier: 118476356
GI number: 118476356
Start: 709579
End: 711978
Strand: Direct
Name: ina [H]
Synonym: BALH_0614
Alternate gene names: 118476356
Gene position: 709579-711978 (Clockwise)
Preceding gene: 118476355
Following gene: 118476358
Centisome position: 13.5
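The centisome value can be reproduced from the Start and Length fields of this record. The sketch below assumes that the centisome position is the gene start expressed as a percentage of the chromosome length; the record does not state the formula, and the function name is illustrative only.

```python
def centisome_position(start: int, genome_length: int) -> float:
    """Return the gene start as a percentage of the chromosome length.

    Assumption: centisome = 100 * start / genome length.
    """
    return 100.0 * start / genome_length


# Values taken from this record: Start 709579, Length 5,257,091.
position = centisome_position(709579, 5257091)
print(f"{position:.1f}")  # prints 13.5
```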
GC content: 36.88
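GC content is conventionally the percentage of G and C bases in a nucleotide sequence. A minimal sketch of that calculation follows; the function name is hypothetical, and the example uses only the first ten bases of the gene sequence, so it is not expected to reproduce the 36.88 reported for the full gene.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C among the A/C/G/T bases of a sequence.

    Whitespace and any non-ACGT characters are ignored.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First ten bases of the gene sequence in this record.
print(gc_content("ATGAGAAGAA"))  # 3 G, 0 C out of 10 bases
```

To check the reported value, pass the full 2400-base gene sequence with its spaces left in; the function skips them.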
Gene sequence:
>2400_bases ATGAGAAGAAAAGCGCCACTTAAAGTGTTATCGTCATTAGCAATTGCGGCAATTATCGGATGTACATCTGTAATGAGTGC TCCATTAGCGTACGCAGAAACGCCAGCAAAAGAGAAAGAAAATGTATCTACAACACCAATTGATTACAATTTAATTCAAG AAGATCGTCTAGCGGAGGCGCTGAAAGAAAGAGGAACAATTAATCCAGCATCTTCTAAAGAAGAGACAAAAAAGGCTGTA GAGAAATATATTGAAAAGAAACAAGGAGACCAGGCAAATAAAGAAATTCTTCCAGCTGATACTGCTAAAGAGGCATCTGA TTTCGTGAAAAAAGTAAAAGAGAAAAAAATGGAAGAAAAGGAGAAAGTAAAGAAACCTGAAAAAAATGTTAGCCCTGAGC AAAAGCCTGAACCAAATAAAAAGCAATTGAATGGACAAGTTCCAACATCTAAAGCAAAGCAAGCGCCATATAAGGGGTCT GTTCGAACAGATAAAGTATTAGTATTACTCGTTGAATTTAGTGATTATAAACATAATAATATTGATCAAACACCAGGATA TATGTATTCGAATGACTTTAGTAGAGAGCATTATCAAAAGATGTTATTTGGTAATGAGCCGTACACATTATTTGATGGTT CAAAAGTAAAAACGTTTAAACAATATTATGAAGAGCAGTCTGGCGGTAGTTATACGACTGATGGATATGTAACAGAATGG TTAACTGTTCCAGGAAAAGCATCTGACTACGGTGCTGATGGTAGCAGTGGTCATGATAACAAAGGTCCAAAAGGCGCACG TGATTTAGTGAAAGAAGCTTTACATGCAGCTGCTGAGAAAGGTTTAGATTTATCTCAATTTGATCAGTTTGATAGATATG ATACAAATAGTGATGGAAATCAAAATGAACCTGACGGTGTAATTGATCATTTAATGGTAATCCATGCTGGTGTTGGTCAA GAAGCTGGTGGAGGTAAATTAGGTGATGATGCCATTTGGTCACATCGTTCAAAATTAGCAATAGATCCAGTAGCAATTGA AGGGACAAAATCAAAGGTAGATTACTTTGGTGGCAAAGTAGCAGCACATGATTACACAATTGAACCAGAAGATGGAGCAG TAGGTGTATTTGCGCATGAATTTGGACATGATCTTGGCTTACCAGATGAATATGATACGAAATATACTGGAACTGGTTCA CCTGTCGAAGCTTGGTCATTAATGAGTGGAGGTAGTTGGACAGGGAAAATTGCAGGAACAGAGCCAACTAGTTTTTCACC ACAAAATAAAGACTTCTTACAAAAGAATATGGGGGGCAACTGGGCAAAAATTTTAGAAGTAGATTACGATAAAATTAAGC GTGGTGTAGGAGTTCCTACATATATTGATCAAAGTGTTACGAAATCAAATCGTCCAGGCGTTGTACGTGTTAACTTACCA GGCAAAAGTGTTGAAACGATTAAACCGGAGTTTGGAAAGCATGCATATTATAGTACAAGAGGCGATGATATGCATACAAC ATTAGAAACACCGTTCTTTGATTTAACAAAAGGAACAAATGCAAAGTTTGATTATAAAGCAAATTATGAGTTAGAAGCAG AGTGCGATTTTGTTGAAGTTCACGCAGTAATAGAAGATGGAACGAAAACATTAATTGATAGACTTGGCGAAAAAGTAGTC CAAGGAGATAAAGACACAACAGACGGAAAATGGATTGATAAATCATACGATTTAAGTCAATTTAAAGGGAAGAAAGTGAA ACTACAATTCGACTATATTACAGATCCAGCTGTAACATATAAAGGTTTCGCGATGGATCATGTAAATGTAACTGTTGATG GACAAGTTGTATTTTCTGATGATGCAGAAGGACAGTCTAAAATGAATGTAAATGGTTTTGTTGTTTCTGATGGGACAGAG 
AAAAAAGCTCATTATTATTACTTAGAATGGAGAAACTATGCGGGATCAGATAATGGATTAAAAGCAGGAAAAGGTCCAGT GTATAATACAGGTCTTGTCGTTTGGTATGCAGATGATAGCTTTAAAGATAACTGGGTTGGGGTGCATCCAGGTGAAGGAT TCCTTGGGGTTGTAGACTCTCATCCAGAAGCATTTGTTGGCAATTTAAACGGAAAACCAACTTACGGTAACACAGGTATG CAAATTGCAGACGCTGCATTTTCATTTGATCAAACACCAGCATGGAGTGTAAATTCATTAACACGTGGACAGTTTAACTA TTCTGGATTACAAGGTGTTACCACTTTTGATGATTCAAAAGTATATAGTAACAACCAAATTGCAGACGCAGGAAGAAAAG TTCCGAACCTTGGACTTAAATTCCAAGTTGTTGGACAGGCAGATGATAAATCAGCAGGCGCTGTTTGGATTAAACGTTAA
Upstream 100 bases:
>100_bases GAATTATGCCTTTTTATATAGGGAGGGCAGAAAGTAGAGCTTTTGATTTAAATGCACTTATTTATCTGAATTTTAAGTTT TATAAAAGGAGAGAAATAGA
Downstream 100 bases:
>100_bases TAACAGAAAAGCACATTCTTCATATGAAGAGTGTGCTTTTTAAATTGTACAGTAAGTTATTATTTTAGCTTATGTAATAC AAGTGTCTGTTCATATGTAG
Product: immune inhibitor A
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 799; Mature: 799
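The residue count is consistent with the gene coordinates: positions 709579 to 711978 span 2400 bases, or 800 codons, and the last codon of the gene sequence (TAA) is a stop. A small sketch of that arithmetic, with a hypothetical function name:

```python
def protein_length(start: int, end: int, has_stop_codon: bool = True) -> int:
    """Return the number of amino acids encoded by an inclusive gene span."""
    n_bases = end - start + 1  # coordinates are inclusive
    if n_bases % 3 != 0:
        raise ValueError("gene length is not a multiple of three")
    codons = n_bases // 3
    # The stop codon is part of the gene but is not translated.
    return codons - 1 if has_stop_codon else codons


# Start and End values from this record.
print(protein_length(709579, 711978))  # prints 799
```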
Protein sequence:
>799_residues MRRKAPLKVLSSLAIAAIIGCTSVMSAPLAYAETPAKEKENVSTTPIDYNLIQEDRLAEALKERGTINPASSKEETKKAV EKYIEKKQGDQANKEILPADTAKEASDFVKKVKEKKMEEKEKVKKPEKNVSPEQKPEPNKKQLNGQVPTSKAKQAPYKGS VRTDKVLVLLVEFSDYKHNNIDQTPGYMYSNDFSREHYQKMLFGNEPYTLFDGSKVKTFKQYYEEQSGGSYTTDGYVTEW LTVPGKASDYGADGSSGHDNKGPKGARDLVKEALHAAAEKGLDLSQFDQFDRYDTNSDGNQNEPDGVIDHLMVIHAGVGQ EAGGGKLGDDAIWSHRSKLAIDPVAIEGTKSKVDYFGGKVAAHDYTIEPEDGAVGVFAHEFGHDLGLPDEYDTKYTGTGS PVEAWSLMSGGSWTGKIAGTEPTSFSPQNKDFLQKNMGGNWAKILEVDYDKIKRGVGVPTYIDQSVTKSNRPGVVRVNLP GKSVETIKPEFGKHAYYSTRGDDMHTTLETPFFDLTKGTNAKFDYKANYELEAECDFVEVHAVIEDGTKTLIDRLGEKVV QGDKDTTDGKWIDKSYDLSQFKGKKVKLQFDYITDPAVTYKGFAMDHVNVTVDGQVVFSDDAEGQSKMNVNGFVVSDGTE KKAHYYYLEWRNYAGSDNGLKAGKGPVYNTGLVVWYADDSFKDNWVGVHPGEGFLGVVDSHPEAFVGNLNGKPTYGNTGM QIADAAFSFDQTPAWSVNSLTRGQFNYSGLQGVTTFDDSKVYSNNQIADAGRKVPNLGLKFQVVGQADDKSAGAVWIKR
Sequences:
>Translated_799_residues MRRKAPLKVLSSLAIAAIIGCTSVMSAPLAYAETPAKEKENVSTTPIDYNLIQEDRLAEALKERGTINPASSKEETKKAV EKYIEKKQGDQANKEILPADTAKEASDFVKKVKEKKMEEKEKVKKPEKNVSPEQKPEPNKKQLNGQVPTSKAKQAPYKGS VRTDKVLVLLVEFSDYKHNNIDQTPGYMYSNDFSREHYQKMLFGNEPYTLFDGSKVKTFKQYYEEQSGGSYTTDGYVTEW LTVPGKASDYGADGSSGHDNKGPKGARDLVKEALHAAAEKGLDLSQFDQFDRYDTNSDGNQNEPDGVIDHLMVIHAGVGQ EAGGGKLGDDAIWSHRSKLAIDPVAIEGTKSKVDYFGGKVAAHDYTIEPEDGAVGVFAHEFGHDLGLPDEYDTKYTGTGS PVEAWSLMSGGSWTGKIAGTEPTSFSPQNKDFLQKNMGGNWAKILEVDYDKIKRGVGVPTYIDQSVTKSNRPGVVRVNLP GKSVETIKPEFGKHAYYSTRGDDMHTTLETPFFDLTKGTNAKFDYKANYELEAECDFVEVHAVIEDGTKTLIDRLGEKVV QGDKDTTDGKWIDKSYDLSQFKGKKVKLQFDYITDPAVTYKGFAMDHVNVTVDGQVVFSDDAEGQSKMNVNGFVVSDGTE KKAHYYYLEWRNYAGSDNGLKAGKGPVYNTGLVVWYADDSFKDNWVGVHPGEGFLGVVDSHPEAFVGNLNGKPTYGNTGM QIADAAFSFDQTPAWSVNSLTRGQFNYSGLQGVTTFDDSKVYSNNQIADAGRKVPNLGLKFQVVGQADDKSAGAVWIKR >Mature_799_residues MRRKAPLKVLSSLAIAAIIGCTSVMSAPLAYAETPAKEKENVSTTPIDYNLIQEDRLAEALKERGTINPASSKEETKKAV EKYIEKKQGDQANKEILPADTAKEASDFVKKVKEKKMEEKEKVKKPEKNVSPEQKPEPNKKQLNGQVPTSKAKQAPYKGS VRTDKVLVLLVEFSDYKHNNIDQTPGYMYSNDFSREHYQKMLFGNEPYTLFDGSKVKTFKQYYEEQSGGSYTTDGYVTEW LTVPGKASDYGADGSSGHDNKGPKGARDLVKEALHAAAEKGLDLSQFDQFDRYDTNSDGNQNEPDGVIDHLMVIHAGVGQ EAGGGKLGDDAIWSHRSKLAIDPVAIEGTKSKVDYFGGKVAAHDYTIEPEDGAVGVFAHEFGHDLGLPDEYDTKYTGTGS PVEAWSLMSGGSWTGKIAGTEPTSFSPQNKDFLQKNMGGNWAKILEVDYDKIKRGVGVPTYIDQSVTKSNRPGVVRVNLP GKSVETIKPEFGKHAYYSTRGDDMHTTLETPFFDLTKGTNAKFDYKANYELEAECDFVEVHAVIEDGTKTLIDRLGEKVV QGDKDTTDGKWIDKSYDLSQFKGKKVKLQFDYITDPAVTYKGFAMDHVNVTVDGQVVFSDDAEGQSKMNVNGFVVSDGTE KKAHYYYLEWRNYAGSDNGLKAGKGPVYNTGLVVWYADDSFKDNWVGVHPGEGFLGVVDSHPEAFVGNLNGKPTYGNTGM QIADAAFSFDQTPAWSVNSLTRGQFNYSGLQGVTTFDDSKVYSNNQIADAGRKVPNLGLKFQVVGQADDKSAGAVWIKR
Specific function: Neutral metalloprotease that is secreted to degrade antibacterial proteins (attacins and cecropins) produced by the insect host for its defense. It probably also degrades one or more unknown crucial proteins, since it is toxic when injected into insect larvae [H]
COG id: COG4412
COG function: function code S; Uncharacterized protein conserved in bacteria
Gene ontology:
Cell location: Secreted [H]
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: Belongs to the peptidase M6 family [H]
Homologues:
None
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR012300
- InterPro: IPR008757 [H]
Pfam domain/function: PF05547 Peptidase_M6 [H]
EC number: NA
Molecular weight: Translated: 87948; Mature: 87948
Theoretical pI: Translated: 5.60; Mature: 5.60
Prosite motif: PS00013 PROKAR_LIPOPROTEIN ; PS00142 ZINC_PROTEASE
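Zinc metalloproteases typically carry an HEXXH zinc-binding motif, and one occurrence is visible in this protein (HEFGH). The sketch below searches for that simplified motif with a regular expression. It is an assumption-laden illustration: the full PS00142 pattern is stricter than plain HEXXH, and the function name is hypothetical.

```python
import re

# Simplified zinc-binding motif: His, Glu, any two residues, His.
# This is looser than the full PROSITE PS00142 pattern.
HEXXH = re.compile(r"HE..H")


def find_hexxh(protein: str) -> list[tuple[int, str]]:
    """Return (0-based start, matched text) for each HEXXH occurrence."""
    return [(m.start(), m.group()) for m in HEXXH.finditer(protein.upper())]


# Fragment copied from the protein sequence in this record.
fragment = "AVGVFAHEFGHDLGLPDEY"
print(find_hexxh(fragment))  # prints [(6, 'HEFGH')]
```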
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.3 %Cys (Translated Protein)
1.5 %Met (Translated Protein)
1.8 %Cys+Met (Translated Protein)
0.3 %Cys (Mature Protein)
1.5 %Met (Mature Protein)
1.8 %Cys+Met (Mature Protein)
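These percentages are residue counts divided by the protein length. A minimal sketch, assuming one-decimal rounding as in the record and using a hypothetical function name; the example runs on only the first 30 residues of the protein, so its output differs from the full-length values above.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Return the percentage of the given residues in a protein, to one decimal."""
    protein = protein.upper()
    if not protein:
        raise ValueError("empty protein sequence")
    count = sum(protein.count(r) for r in residues.upper())
    return round(100.0 * count / len(protein), 1)


# First 30 residues of the protein sequence in this record.
n_terminus = "MRRKAPLKVLSSLAIAAIIGCTSVMSAPLA"
print(residue_percent(n_terminus, "C"))   # 1 Cys in 30 residues
print(residue_percent(n_terminus, "M"))   # 2 Met in 30 residues
print(residue_percent(n_terminus, "CM"))  # Cys and Met combined
```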
Secondary structure:
>Translated Secondary Structure MRRKAPLKVLSSLAIAAIIGCTSVMSAPLAYAETPAKEKENVSTTPIDYNLIQEDRLAEA CCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHH LKERGTINPASSKEETKKAVEKYIEKKQGDQANKEILPADTAKEASDFVKKVKEKKMEEK HHHCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHH EKVKKPEKNVSPEQKPEPNKKQLNGQVPTSKAKQAPYKGSVRTDKVLVLLVEFSDYKHNN HHHCCCCCCCCCCCCCCCCHHHCCCCCCCCHHHCCCCCCCCCCCEEEEEEEEECCCCCCC IDQTPGYMYSNDFSREHYQKMLFGNEPYTLFDGSKVKTFKQYYEEQSGGSYTTDGYVTEW CCCCCCEEECCCCCHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHHCCCCCCCCCEEEEE LTVPGKASDYGADGSSGHDNKGPKGARDLVKEALHAAAEKGLDLSQFDQFDRYDTNSDGN EECCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCCCCCC QNEPDGVIDHLMVIHAGVGQEAGGGKLGDDAIWSHRSKLAIDPVAIEGTKSKVDYFGGKV CCCCHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHCCCCEEECCEEECCCCHHHHHCCCEE AAHDYTIEPEDGAVGVFAHEFGHDLGLPDEYDTKYTGTGSPVEAWSLMSGGSWTGKIAGT EECCEEECCCCCEEEEEEHHHCCCCCCCCCCCCCCCCCCCCHHHHEECCCCCEEEEECCC EPTSFSPQNKDFLQKNMGGNWAKILEVDYDKIKRGVGVPTYIDQSVTKSNRPGVVRVNLP CCCCCCCCCHHHHHHCCCCCEEEEEECCHHHHHHCCCCCHHHHHHHHCCCCCCEEEEECC GKSVETIKPEFGKHAYYSTRGDDMHTTLETPFFDLTKGTNAKFDYKANYELEAECDFVEV CCCCHHCCCCCCCCCEEECCCCCCEEECCCCCEECCCCCCCEEEEECCCEEECCCCEEEE HAVIEDGTKTLIDRLGEKVVQGDKDTTDGKWIDKSYDLSQFKGKKVKLQFDYITDPAVTY EEEHHCCHHHHHHHHHHHHHCCCCCCCCCCEECCCCCHHHCCCCEEEEEEEEECCCCEEE KGFAMDHVNVTVDGQVVFSDDAEGQSKMNVNGFVVSDGTEKKAHYYYLEWRNYAGSDNGL CCEEEEEEEEEECCEEEECCCCCCCCEECCCEEEEECCCCCCEEEEEEEEECCCCCCCCC KAGKGPVYNTGLVVWYADDSFKDNWVGVHPGEGFLGVVDSHPEAFVGNLNGKPTYGNTGM CCCCCCCEECCEEEEEECCCCCCCEEEECCCCCCEEEECCCCCCEEECCCCCCCCCCCCC QIADAAFSFDQTPAWSVNSLTRGQFNYSGLQGVTTFDDSKVYSNNQIADAGRKVPNLGLK EEHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCHHHHCCCCCCCCEE FQVVGQADDKSAGAVWIKR EEEEECCCCCCCCEEEEEC >Mature Secondary Structure MRRKAPLKVLSSLAIAAIIGCTSVMSAPLAYAETPAKEKENVSTTPIDYNLIQEDRLAEA CCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHH LKERGTINPASSKEETKKAVEKYIEKKQGDQANKEILPADTAKEASDFVKKVKEKKMEEK HHHCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHH EKVKKPEKNVSPEQKPEPNKKQLNGQVPTSKAKQAPYKGSVRTDKVLVLLVEFSDYKHNN 
HHHCCCCCCCCCCCCCCCCHHHCCCCCCCCHHHCCCCCCCCCCCEEEEEEEEECCCCCCC IDQTPGYMYSNDFSREHYQKMLFGNEPYTLFDGSKVKTFKQYYEEQSGGSYTTDGYVTEW CCCCCCEEECCCCCHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHHCCCCCCCCCEEEEE LTVPGKASDYGADGSSGHDNKGPKGARDLVKEALHAAAEKGLDLSQFDQFDRYDTNSDGN EECCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCCCCCC QNEPDGVIDHLMVIHAGVGQEAGGGKLGDDAIWSHRSKLAIDPVAIEGTKSKVDYFGGKV CCCCHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHCCCCEEECCEEECCCCHHHHHCCCEE AAHDYTIEPEDGAVGVFAHEFGHDLGLPDEYDTKYTGTGSPVEAWSLMSGGSWTGKIAGT EECCEEECCCCCEEEEEEHHHCCCCCCCCCCCCCCCCCCCCHHHHEECCCCCEEEEECCC EPTSFSPQNKDFLQKNMGGNWAKILEVDYDKIKRGVGVPTYIDQSVTKSNRPGVVRVNLP CCCCCCCCCHHHHHHCCCCCEEEEEECCHHHHHHCCCCCHHHHHHHHCCCCCCEEEEECC GKSVETIKPEFGKHAYYSTRGDDMHTTLETPFFDLTKGTNAKFDYKANYELEAECDFVEV CCCCHHCCCCCCCCCEEECCCCCCEEECCCCCEECCCCCCCEEEEECCCEEECCCCEEEE HAVIEDGTKTLIDRLGEKVVQGDKDTTDGKWIDKSYDLSQFKGKKVKLQFDYITDPAVTY EEEHHCCHHHHHHHHHHHHHCCCCCCCCCCEECCCCCHHHCCCCEEEEEEEEECCCCEEE KGFAMDHVNVTVDGQVVFSDDAEGQSKMNVNGFVVSDGTEKKAHYYYLEWRNYAGSDNGL CCEEEEEEEEEECCEEEECCCCCCCCEECCCEEEEECCCCCCEEEEEEEEECCCCCCCCC KAGKGPVYNTGLVVWYADDSFKDNWVGVHPGEGFLGVVDSHPEAFVGNLNGKPTYGNTGM CCCCCCCEECCEEEEEECCCCCCCEEEECCCCCCEEEECCCCCCEEECCCCCCCCCCCCC QIADAAFSFDQTPAWSVNSLTRGQFNYSGLQGVTTFDDSKVYSNNQIADAGRKVPNLGLK EEHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCHHHHCCCCCCCCEE FQVVGQADDKSAGAVWIKR EEEEECCCCCCCCEEEEEC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 2089225; 6421577 [H]