Definition | Bacillus thuringiensis str. Al Hakam chromosome, complete genome. |
---|---|
Accession | NC_008600 |
Length | 5,257,091 |
The map label for this gene is lonB [H]
Identifier: 118479637
GI number: 118479637
Start: 4285671
End: 4287344
Strand: Reverse
Name: lonB [H]
Synonym: BALH_4064
Alternate gene names: 118479637
Gene position: 4287344-4285671 (Counterclockwise)
Preceding gene: 118479638
Following gene: 118479636
Centisome position: 81.55
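The coordinate fields above can be cross-checked with a few lines of arithmetic. This is a minimal sketch with illustrative function names. It assumes 1-based inclusive coordinates, and it assumes the centisome position is the first coordinate listed under Gene position (4287344 for this reverse-strand gene) expressed as a percentage of the chromosome length. The record states neither convention, but both reproduce the reported values.

```python
GENOME_LENGTH = 5_257_091           # "Length" field of NC_008600
START, END = 4_285_671, 4_287_344   # "Start" and "End" fields


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the chromosome length."""
    return 100.0 * position / genome_length


def protein_length(n_bases: int) -> int:
    """Residues encoded by a coding sequence, excluding the stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return n_bases // 3 - 1


if __name__ == "__main__":
    n = gene_length(START, END)
    print(n)                                        # 1674
    print(round(centisome(END, GENOME_LENGTH), 2))  # 81.55
    print(protein_length(n))                        # 557
```

The three printed values agree with the 1674-base gene sequence header, the Centisome position field, and the Number of amino acids field.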
GC content: 39.67
Gene sequence:
>1674_bases ATGATGAGCTGGACAAATATATTTTTACTTGTTCAACTTGTTTTTGGTGTAATTGTCGGATTGTATTTTTGGCATTTACT TCGAAATCAGCGAACACAAAAAGTTTCAATTGATCGAGAATCGAAAAAAGAATTAGAGCAGCTTCGTAAAATGCGTGAGA TTTCTTTAACAGAGCCGCTTGCGGAAAAGGTGCGTCCTACATCCTTTTTAGATATTGTAGGGCAAGAGGACGGGATTAAG TCGTTAAAAGCGGCTCTTTGTGGCCCGAACCCGCAACATGTAATTATATATGGCCCACCAGGTGTCGGAAAGACAGCGGC AGCACGTCTCGTATTAGAAGAAGCGAAACGAAATCCAAAATCTCCATTTCGCACAAATGCAACATTTATTGAACTGGATG CGACGACAGCTCGGTTTGATGAACGTGGTATTGCAGACCCTTTAATCGGTTCGGTGCATGATCCAATTTATCAAGGTGCT GGTGCGATGGGGCAAGCGGGTATTCCGCAACCAAAAAAAGGTGCGGTAACAGATGCACACGGTGGTATTTTGTTTATTGA TGAGATTGGTGAGCTACATCCGATTCAAATGAACAAAATGTTAAAGGTGCTAGAAGATCGAAAAGTGTTTTTGGAAAGTG CATACTACAGTGAAGAGAATACGATGATTCCAACGTATATACATGATATCTTTCAAAAAGGTTTACCAGCCGATTTTCGC TTAGTTGGGGCAACGACACGTTCGCCAGAGGAGATTCCTCCTGCTATTCGGTCGCGCTGTTTAGAAGTCTTCTTCCGTGA GTTAGATACAGAAGAAATTCAAAAAGTAGCGAAGAATGCAGCTGACAAAGTAGAAATGCAAATCGGTGAAAATGGTATTG AAATGATCGGGATGTATGCAAGAAATGGAAGAGAAGCGATTAATCTTGTACAAATTTCTGCTGGAATGGCAATAAATGAA GAACGTTCTTTCATTAAAGATGAAGATATTGAGTGGGTTGTTCACTCTAGTCAGCTTACACCGAAGTATGAAAAACACAT TTATCCTATTCCAAGAATCGGTCTTGTAAATGGACTTGCTGTATACGGACCAAATACAGGCGCGTTATTAGAAATTGAAG TAACAGCAATTAAGGCGAAAGATAAAGGATCGGTAAATGTTACCGGAATTGTTGAAGAGGAAAGTATTGGTAGTCAAACG AAATCAATTCGCCGTAAAAGTATGGCGAAAGGTTCTGTAGATAATGTATTAACAGTACTTCGATCTCTAGATGTGTTGCC AGAAGGATACGATATACATATTAATTTTCCAGGTGGTATCCCGATTGATGGGCCTTCAGCAGGAATTGCAATGGCAACAG GTGTGTATTCAGCAGTGCATCATACGTATGTGAATAATGAAGTGGCGATGACTGGTGAGATAAGTATACATGGAGAAGTG AAACCTATCGGTGGTGTGTACGCAAAAATAAAAGCTGCGAAAAAGGCGGGAGCTAAGAAAGTTATTATTCCTGCTGAAAA CATGCAACCGTTTTTGTACACAATAAAAGGAATTGAAATTATCCCTGTTCGTAAGTTAAAAGAAGTATTTGAGCTAACAT TTATGCAAGAAAATATGCATCGAGAGCTTGATATACATACTACAATAGACGAAACAGATGCACAATCAATGTGA
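The GC content field can be recomputed from the sequence line. The sketch below assumes the layout used in this record, a `>` header followed by space-separated sequence chunks on one line; the helper names are hypothetical and not part of any database tooling.

```python
from typing import Tuple


def parse_inline_fasta(record: str) -> Tuple[str, str]:
    """Split a one-line record such as '>1674_bases ATG... TGA' into
    (header, sequence), dropping the whitespace between chunks."""
    header, _, body = record.strip().partition(" ")
    if not header.startswith(">"):
        raise ValueError("record must start with '>'")
    return header[1:], "".join(body.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Toy input only; paste the full Gene sequence line to check the record.
    name, seq = parse_inline_fasta(">10_bases GAAGG AGAAA")
    print(name, seq, gc_content(seq))
```

Running `gc_content` on the full 1674-base sequence gives a value to compare against the GC content field (39.67); that comparison is not reproduced here.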
Upstream 100 bases:
>100_bases GAAGGAGAAACAGCAATTTTTATAATTGCTGTTTTTCTTTATGTTTAGCTATGTTCCTCCGCGGAAATACTAAGAGATAA AACATTGCGGGAGGAAATAA
Downstream 100 bases:
>100_bases GAATGAATGATAATAGTATTTGCCAGACTTCGTTTAAGTTTGTATGCTTAAGCGGAGTCTTTTTTCTTTTCGTAAGAGAG AAAAAAAGCGGGATTTACAG
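Because the gene lies on the reverse strand, its coding sequence is the reverse complement of the forward-strand interval. The sketch below shows one way to recover a gene and its upstream flank from a chromosome string. It assumes 1-based inclusive forward-strand coordinates and assumes that "upstream" is measured relative to the coding strand; the record states neither, and the function names are illustrative.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA string (A/C/G/T only)."""
    return seq.translate(_COMPLEMENT)[::-1]


def extract_gene(chromosome: str, start: int, end: int, strand: str) -> str:
    """Coding-strand sequence for a gene at 1-based inclusive [start, end]."""
    if not 1 <= start <= end <= len(chromosome):
        raise ValueError("coordinates out of range")
    segment = chromosome[start - 1:end]
    return reverse_complement(segment) if strand == "Reverse" else segment


def upstream(chromosome: str, start: int, end: int, strand: str, n: int) -> str:
    """Up to n bases preceding the gene, read on the coding strand."""
    if strand == "Reverse":
        # For a reverse-strand gene, upstream lies after `end` on the forward strand.
        return reverse_complement(chromosome[end:end + n])
    return chromosome[max(0, start - 1 - n):start - 1]


if __name__ == "__main__":
    toy = "TTTCACATGC"  # positions 3-8 hold TCACAT, the reverse complement of ATGTGA
    print(extract_gene(toy, 3, 8, "Reverse"))  # ATGTGA
    print(upstream(toy, 3, 8, "Reverse", 2))   # GC
```

With the real chromosome, the call would use Start 4285671, End 4287344, and strand "Reverse".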
Product: ATP-dependent protease La
Products: NA
Alternate protein names: ATP-dependent protease La 2 [H]
Number of amino acids: Translated: 557; Mature: 557
Protein sequence:
>557_residues MMSWTNIFLLVQLVFGVIVGLYFWHLLRNQRTQKVSIDRESKKELEQLRKMREISLTEPLAEKVRPTSFLDIVGQEDGIK SLKAALCGPNPQHVIIYGPPGVGKTAAARLVLEEAKRNPKSPFRTNATFIELDATTARFDERGIADPLIGSVHDPIYQGA GAMGQAGIPQPKKGAVTDAHGGILFIDEIGELHPIQMNKMLKVLEDRKVFLESAYYSEENTMIPTYIHDIFQKGLPADFR LVGATTRSPEEIPPAIRSRCLEVFFRELDTEEIQKVAKNAADKVEMQIGENGIEMIGMYARNGREAINLVQISAGMAINE ERSFIKDEDIEWVVHSSQLTPKYEKHIYPIPRIGLVNGLAVYGPNTGALLEIEVTAIKAKDKGSVNVTGIVEEESIGSQT KSIRRKSMAKGSVDNVLTVLRSLDVLPEGYDIHINFPGGIPIDGPSAGIAMATGVYSAVHHTYVNNEVAMTGEISIHGEV KPIGGVYAKIKAAKKAGAKKVIIPAENMQPFLYTIKGIEIIPVRKLKEVFELTFMQENMHRELDIHTTIDETDAQSM
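The protein sequence can be checked against the gene sequence by translating codons. The following is a minimal sketch using the standard codon assignments (the bacterial translation table 11 assigns the same amino acids and differs only in permitted start codons). The table construction and the `translate` helper are illustrative, not the database's own pipeline.

```python
from itertools import product

_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Codons enumerated in TCAG order line up with the amino-acid string above.
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 27 and last 9 bases of the Gene sequence field.
    print(translate("ATGATGAGCTGGACAAATATATTTTTA"))  # MMSWTNIFL
    print(translate("TCAATGTGA"))                    # SM
```

The two fragments reproduce the first nine residues (MMSWTNIFL) and the last two residues (SM) of the listed protein.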
Sequences:
>Translated_557_residues MMSWTNIFLLVQLVFGVIVGLYFWHLLRNQRTQKVSIDRESKKELEQLRKMREISLTEPLAEKVRPTSFLDIVGQEDGIK SLKAALCGPNPQHVIIYGPPGVGKTAAARLVLEEAKRNPKSPFRTNATFIELDATTARFDERGIADPLIGSVHDPIYQGA GAMGQAGIPQPKKGAVTDAHGGILFIDEIGELHPIQMNKMLKVLEDRKVFLESAYYSEENTMIPTYIHDIFQKGLPADFR LVGATTRSPEEIPPAIRSRCLEVFFRELDTEEIQKVAKNAADKVEMQIGENGIEMIGMYARNGREAINLVQISAGMAINE ERSFIKDEDIEWVVHSSQLTPKYEKHIYPIPRIGLVNGLAVYGPNTGALLEIEVTAIKAKDKGSVNVTGIVEEESIGSQT KSIRRKSMAKGSVDNVLTVLRSLDVLPEGYDIHINFPGGIPIDGPSAGIAMATGVYSAVHHTYVNNEVAMTGEISIHGEV KPIGGVYAKIKAAKKAGAKKVIIPAENMQPFLYTIKGIEIIPVRKLKEVFELTFMQENMHRELDIHTTIDETDAQSM >Mature_557_residues MMSWTNIFLLVQLVFGVIVGLYFWHLLRNQRTQKVSIDRESKKELEQLRKMREISLTEPLAEKVRPTSFLDIVGQEDGIK SLKAALCGPNPQHVIIYGPPGVGKTAAARLVLEEAKRNPKSPFRTNATFIELDATTARFDERGIADPLIGSVHDPIYQGA GAMGQAGIPQPKKGAVTDAHGGILFIDEIGELHPIQMNKMLKVLEDRKVFLESAYYSEENTMIPTYIHDIFQKGLPADFR LVGATTRSPEEIPPAIRSRCLEVFFRELDTEEIQKVAKNAADKVEMQIGENGIEMIGMYARNGREAINLVQISAGMAINE ERSFIKDEDIEWVVHSSQLTPKYEKHIYPIPRIGLVNGLAVYGPNTGALLEIEVTAIKAKDKGSVNVTGIVEEESIGSQT KSIRRKSMAKGSVDNVLTVLRSLDVLPEGYDIHINFPGGIPIDGPSAGIAMATGVYSAVHHTYVNNEVAMTGEISIHGEV KPIGGVYAKIKAAKKAGAKKVIIPAENMQPFLYTIKGIEIIPVRKLKEVFELTFMQENMHRELDIHTTIDETDAQSM
Specific function: ATP-dependent serine protease that mediates the selective degradation of mutant and abnormal proteins as well as certain short-lived regulatory proteins. Required for cellular homeostasis and for survival from DNA damage and developmental changes induced
COG id: COG1067
COG function: function code O; Predicted ATP-dependent protease
Gene ontology:
Cell location: Cytoplasm [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the peptidase S16 family [H]
Homologues:
Organism=Homo sapiens, GI21396489, Length=209, Percent_Identity=30.622009569378, Blast_Score=86, Evalue=8e-17
Organism=Homo sapiens, GI31377667, Length=171, Percent_Identity=35.0877192982456, Blast_Score=82, Evalue=2e-15
Organism=Escherichia coli, GI1786643, Length=198, Percent_Identity=37.3737373737374, Blast_Score=101, Evalue=1e-22
Organism=Caenorhabditis elegans, GI17505831, Length=471, Percent_Identity=25.2653927813163, Blast_Score=77, Evalue=3e-14
Organism=Caenorhabditis elegans, GI17556486, Length=86, Percent_Identity=40.6976744186046, Blast_Score=74, Evalue=3e-13
Organism=Saccharomyces cerevisiae, GI6319449, Length=85, Percent_Identity=43.5294117647059, Blast_Score=74, Evalue=4e-14
Organism=Drosophila melanogaster, GI221513036, Length=540, Percent_Identity=24.0740740740741, Blast_Score=87, Evalue=4e-17
Organism=Drosophila melanogaster, GI24666867, Length=540, Percent_Identity=24.0740740740741, Blast_Score=86, Evalue=6e-17
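The homologue entries follow a fixed `key=value` pattern, so they can be parsed into records for sorting or filtering. The sketch below is one possible parser; the regular expression and the dictionary keys are assumptions based on the field layout visible in this record, not a documented schema.

```python
import re
from typing import Dict, List, Union

_HIT = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<identity>[\d.]+),\s*Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_homologues(text: str) -> List[Dict[str, Union[str, int, float]]]:
    """Extract every BLAST hit from a Homologues field into a list of dicts."""
    hits = []
    for m in _HIT.finditer(text):
        hits.append({
            "organism": m["organism"].strip(),
            "gi": m["gi"],
            "length": int(m["length"]),
            "identity": float(m["identity"]),
            "score": int(m["score"]),
            "evalue": float(m["evalue"]),
        })
    return hits


if __name__ == "__main__":
    sample = (
        "Organism=Homo sapiens, GI21396489, Length=209, "
        "Percent_Identity=30.622009569378, Blast_Score=86, Evalue=8e-17, "
        "Organism=Escherichia coli, GI1786643, Length=198, "
        "Percent_Identity=37.3737373737374, Blast_Score=101, Evalue=1e-22,"
    )
    best = max(parse_homologues(sample), key=lambda h: h["score"])
    print(best["organism"], best["score"])  # Escherichia coli 101
```

Because the parser scans with `finditer`, it accepts the hits whether they are separated by commas on one line or listed one per line.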
Paralogues:
None
Copy number: 2,000 molecules/cell in glucose minimal media [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR003593
- InterPro: IPR003959
- InterPro: IPR008269
- InterPro: IPR008268
- InterPro: IPR001984
- InterPro: IPR020568
- InterPro: IPR014251 [H]
Pfam domain/function: PF00004 AAA; PF05362 Lon_C [H]
EC number: 3.4.21.53 [H]
Molecular weight: Translated: 61604; Mature: 61604
Theoretical pI: Translated: 6.41; Mature: 6.41
Prosite motif: PS00676 SIGMA54_INTERACT_2; PS50045 SIGMA54_INTERACT_4; PS01046 LON_SER
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.4 %Cys (Translated Protein)
3.2 %Met (Translated Protein)
3.6 %Cys+Met (Translated Protein)
0.4 %Cys (Mature Protein)
3.2 %Met (Mature Protein)
3.6 %Cys+Met (Mature Protein)
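The Cys/Met percentages are residue counts divided by the protein length. A minimal sketch of that calculation follows; the helper name and the one-decimal rounding are assumptions chosen to match the precision shown in the record.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty protein sequence")
    count = sum(protein.count(r) for r in set(residues.upper()))
    return round(100.0 * count / len(protein), 1)


if __name__ == "__main__":
    toy = "MMSWTNIFLC"  # toy 10-residue string, not the full protein
    print(residue_percent(toy, "C"))   # 10.0
    print(residue_percent(toy, "M"))   # 20.0
    print(residue_percent(toy, "CM"))  # 30.0
```

Applied to the full 557-residue sequence, `residue_percent(seq, "C")` and `residue_percent(seq, "M")` would be the values to compare against the 0.4 and 3.2 reported above.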
Secondary structure:
>Translated Secondary Structure
MMSWTNIFLLVQLVFGVIVGLYFWHLLRNQRTQKVSIDRESKKELEQLRKMREISLTEPL
CCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHCCCHHH
AEKVRPTSFLDIVGQEDGIKSLKAALCGPNPQHVIIYGPPGVGKTAAARLVLEEAKRNPK
HHHCCCCHHHEECCCCCCHHHHHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHHHHCCCC
SPFRTNATFIELDATTARFDERGIADPLIGSVHDPIYQGAGAMGQAGIPQPKKGAVTDAH
CCCCCCCEEEEECCHHHHHCCCCCCCHHHHHHCCHHHHCCCCCCCCCCCCCCCCCCEECC
GGILFIDEIGELHPIQMNKMLKVLEDRKVFLESAYYSEENTMIPTYIHDIFQKGLPADFR
CCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCCEE
LVGATTRSPEEIPPAIRSRCLEVFFRELDTEEIQKVAKNAADKVEMQIGENGIEMIGMYA
EEECCCCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHEEEEECCCCHHEEEEHH
RNGREAINLVQISAGMAINEERSFIKDEDIEWVVHSSQLTPKYEKHIYPIPRIGLVNGLA
CCCCCCEEEEEEECCCEECCCCCCCCCCCCEEEEECCCCCCCHHHCCCCCCCCHHHCCEE
VYGPNTGALLEIEVTAIKAKDKGSVNVTGIVEEESIGSQTKSIRRKSMAKGSVDNVLTVL
EECCCCCCEEEEEEEEEEECCCCCEEEEEEEECHHCCHHHHHHHHHHHHCCCHHHHHHHH
RSLDVLPEGYDIHINFPGGIPIDGPSAGIAMATGVYSAVHHTYVNNEVAMTGEISIHGEV
HHHHCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEEEEEEECCC
KPIGGVYAKIKAAKKAGAKKVIIPAENMQPFLYTIKGIEIIPVRKLKEVFELTFMQENMH
CCCCHHHHHHHHHHHCCCCEEEEECCCCCCEEEEECCEEEEEHHHHHHHHHHHHHHHHHC
RELDIHTTIDETDAQSM
CEEEEEECCCCCCCCCC
>Mature Secondary Structure
MMSWTNIFLLVQLVFGVIVGLYFWHLLRNQRTQKVSIDRESKKELEQLRKMREISLTEPL
CCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHCCCHHH
AEKVRPTSFLDIVGQEDGIKSLKAALCGPNPQHVIIYGPPGVGKTAAARLVLEEAKRNPK
HHHCCCCHHHEECCCCCCHHHHHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHHHHCCCC
SPFRTNATFIELDATTARFDERGIADPLIGSVHDPIYQGAGAMGQAGIPQPKKGAVTDAH
CCCCCCCEEEEECCHHHHHCCCCCCCHHHHHHCCHHHHCCCCCCCCCCCCCCCCCCEECC
GGILFIDEIGELHPIQMNKMLKVLEDRKVFLESAYYSEENTMIPTYIHDIFQKGLPADFR
CCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCCEE
LVGATTRSPEEIPPAIRSRCLEVFFRELDTEEIQKVAKNAADKVEMQIGENGIEMIGMYA
EEECCCCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHEEEEECCCCHHEEEEHH
RNGREAINLVQISAGMAINEERSFIKDEDIEWVVHSSQLTPKYEKHIYPIPRIGLVNGLA
CCCCCCEEEEEEECCCEECCCCCCCCCCCCEEEEECCCCCCCHHHCCCCCCCCHHHCCEE
VYGPNTGALLEIEVTAIKAKDKGSVNVTGIVEEESIGSQTKSIRRKSMAKGSVDNVLTVL
EECCCCCCEEEEEEEEEEECCCCCEEEEEEEECHHCCHHHHHHHHHHHHCCCHHHHHHHH
RSLDVLPEGYDIHINFPGGIPIDGPSAGIAMATGVYSAVHHTYVNNEVAMTGEISIHGEV
HHHHCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEEEEEEECCC
KPIGGVYAKIKAAKKAGAKKVIIPAENMQPFLYTIKGIEIIPVRKLKEVFELTFMQENMH
CCCCHHHHHHHHHHHCCCCEEEEECCCCCCEEEEECCEEEEEHHHHHHHHHHHHHHHHHC
RELDIHTTIDETDAQSM
CEEEEEECCCCCCCCCC
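The secondary-structure block alternates sequence chunks with prediction chunks of the same length. The sketch below separates the two tracks and summarises the prediction. It assumes the conventional reading of the letters (H helix, E strand, C coil), which the record does not define, and the function names are illustrative.

```python
from collections import Counter
from typing import Dict, List, Tuple


def split_interleaved(tokens: List[str]) -> Tuple[str, str]:
    """Even-indexed tokens are sequence chunks, odd-indexed are H/E/C chunks."""
    if len(tokens) % 2 != 0:
        raise ValueError("expected sequence/structure chunks in pairs")
    seq = "".join(tokens[0::2])
    ss = "".join(tokens[1::2])
    if len(seq) != len(ss):
        raise ValueError("sequence and structure tracks differ in length")
    return seq, ss


def ss_composition(ss: str) -> Dict[str, float]:
    """Percentage of residues in each predicted state (H, E, C)."""
    if not ss:
        raise ValueError("empty structure string")
    counts = Counter(ss)
    return {state: round(100.0 * counts.get(state, 0) / len(ss), 1)
            for state in "HEC"}


if __name__ == "__main__":
    # Toy chunks only; the real block would be split on whitespace after
    # removing the '>Translated Secondary Structure' header.
    seq, ss = split_interleaved("MMSW CCCH TNIF HHHE".split())
    print(seq, ss)             # MMSWTNIF CCCHHHHE
    print(ss_composition(ss))  # {'H': 50.0, 'E': 12.5, 'C': 37.5}
```

A mixed helix and strand composition on the full prediction would be in line with the Structure class field (Alpha Beta).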
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 8969504; 9384377; 7961402; 11325926 [H]