Definition | Mycobacterium bovis BCG str. Pasteur 1173P2, complete genome. |
---|---|
Accession | NC_008769 |
Length | 4,374,522 |
The map label for this gene is clpB
Identifier: 121636297
GI number: 121636297
Start: 490269
End: 492815
Strand: Reverse
Name: clpB
Synonym: BCG_0422c
Alternate gene names: 121636297
Gene position: 492815-490269 (Counterclockwise)
Preceding gene: 121636300
Following gene: 121636296
Centisome position: 11.27
GC content: 68.59
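The coordinate fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not the database's own method: the function names are hypothetical, and it assumes coordinates are 1-based and inclusive and that the centisome position is the gene's first base (the End coordinate for a reverse-strand gene) expressed as a percentage of the genome length. Under those assumptions the results match the Length, Centisome position, and Number of amino acids fields in this record.

```python
# Values copied from the record; the helper names are illustrative only.
GENOME_LENGTH = 4_374_522       # Length field for NC_008769
START, END = 490_269, 492_815   # Start / End fields (assumed 1-based, inclusive)


def gene_length(start: int, end: int) -> int:
    """Number of bases covered by an inclusive coordinate pair."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, rounded to 2 places."""
    return round(100 * position / genome_length, 2)


length = gene_length(START, END)   # bases in the gene, stop codon included
codons = length // 3               # total codons, stop codon included
residues = codons - 1              # amino acids once the stop codon is dropped

# Assumption: for a reverse-strand gene the first base is the End coordinate.
position = centisome(END, GENOME_LENGTH)

print(length, residues, position)
```

Using the Start coordinate instead would give a slightly lower centisome value, which is why the End coordinate is assumed here.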
Gene sequence:
>2547_bases GTGGACTCGTTTAACCCGACGACCAAGACGCAGGCGGCGCTAACCGCGGCGTTACAGGCGGCTTCGACCGCCGGCAATCC CGAGATCCGGCCCGCTCACCTGCTGATGGCGCTGCTGACCCAAAACGACGGTATCGCCGCACCGCTACTGGAGGCTGTCG GTGTCGAGCCCGCCACCGTCCGCGCCGAAACCCAGCGCCTGCTCGACCGTTTGCCGCAGGCGACTGGAGCCAGCACGCAG CCGCAGCTGTCCCGCGAGTCGTTAGCGGCGATCACCACCGCGCAGCAGCTGGCCACCGAGCTGGACGACGAGTACGTCTC CACCGAGCACGTGATGGTCGGGCTGGCCACCGGTGACTCCGACGTCGCCAAGCTGTTGACCGGCCACGGCGCCTCGCCGC AGGCGCTGCGGGAGGCGTTCGTCAAGGTGCGCGGCAGCGCCCGGGTCACCAGCCCCGAACCGGAGGCGACCTATCAGGCG CTGCAGAAGTACTCCACCGACCTGACCGCCCGCGCCCGCGAAGGCAAACTCGACCCGGTCATCGGCCGCGACAACGAGAT CCGCCGCGTGGTGCAGGTGCTGTCCCGTCGCACCAAGAACAACCCGGTGCTGATCGGTGAGCCCGGCGTCGGCAAGACCG CGATCGTGGAGGGCCTGGCGCAGCGCATCGTGGCCGGCGACGTGCCGGAGAGCTTGCGCGACAAGACCATCGTCGCGCTC GATCTCGGCTCGATGGTCGCCGGCTCCAAATACCGCGGCGAATTCGAGGAACGGCTCAAGGCCGTCCTCGACGACATCAA GAACTCGGCCGGCCAAATCATCACGTTCATCGACGAGCTGCACACCATCGTCGGCGCCGGCGCCACCGGCGAGGGGGCGA TGGACGCCGGCAACATGATCAAGCCGATGCTGGCCCGCGGCGAGTTACGGCTGGTCGGGGCGACCACGCTGGACGAATAC CGCAAGCACATCGAGAAGGACGCCGCGCTCGAGCGCCGTTTCCAACAGGTGTACGTCGGCGAGCCGTCGGTGGAGGACAC CATCGGCATCCTGCGCGGGCTCAAAGACCGCTACGAGGTGCACCACGGGGTGCGCATCACCGACTCGGCGCTGGTGGCAG CTGCCACTTTGAGCGACCGGTATATCACCGCCCGCTTCCTGCCCGACAAGGCCATCGACCTGGTCGACGAGGCGGCCAGC CGGCTGCGGATGGAGATCGACTCGCGGCCCGTCGAGATCGACGAGGTCGAGCGGCTGGTGCGCCGGCTGGAGATCGAAGA GATGGCGCTGTCCAAAGAAGAAGACGAGGCGTCGGCGGAGCGGTTGGCCAAGCTGCGCTCCGAGCTGGCCGACCAGAAAG AGAAGTTGGCCGAGCTCACCACCCGCTGGCAGAACGAGAAGAACGCGATCGAAATCGTCCGCGACCTCAAGGAGCAGCTG GAAGCCCTGCGCGGGGAATCCGAGCGGGCCGAACGCGACGGCGACCTGGCCAAGGCCGCCGAGCTGCGCTACGGACGCAT CCCCGAGGTGGAGAAGAAGCTCGACGCGGCGTTGCCGCAGGCGCAGGCCCGGGAGCAGGTGATGCTCAAGGAGGAGGTCG GTCCCGACGACATCGCCGACGTGGTGTCGGCGTGGACCGGCATCCCGGCCGGTCGGCTGCTGGAAGGCGAGACCGCCAAG CTGCTGCGCATGGAAGACGAGCTGGGCAAGCGGGTCATCGGGCAGAAGGCCGCGGTTACCGCAGTCTCTGATGCGGTGCG GCGCAGCCGGGCCGGGGTGTCCGACCCCAACCGGCCCACCGGGGCGTTCATGTTCCTCGGCCCGACCGGTGTCGGCAAGA CCGAGCTGGCCAAGGCGCTGGCCGACTTCCTGTTCGACGACGAGCGGGCGATGGTCCGCATCGACATGAGCGAGTACGGC 
GAGAAGCACACCGTGGCTCGGTTGATCGGCGCCCCGCCCGGCTATGTGGGATACGAGGCGGGCGGTCAGCTGACCGAGGC GGTGCGCCGGCGTCCCTACACGGTGGTGCTGTTCGACGAGATCGAGAAGGCGCACCCGGACGTGTTCGACGTGCTGCTGC AGGTCCTCGACGAGGGCCGGCTCACCGACGGGCACGGCCGCACGGTCGACTTCCGCAACACCATCTTGATCCTGACGTCC AACCTGGGGTCGGGTGGCAGCGCCGAGCAGGTGCTGGCCGCGGTGCGCGCTACGTTCAAGCCGGAGTTCATCAACCGGCT CGACGACGTGCTCATCTTTGAGGGTCTCAACCCCGAAGAGCTGGTGCGCATCGTCGACATCCAGCTGGCGCAGCTGGGCA AGCGGCTGGCGCAGCGGCGGCTGCAGCTGCAGGTCTCGCTGCCGGCCAAGCGCTGGTTGGCGCAGCGCGGATTCGACCCG GTGTACGGGGCGCGGCCGTTGCGCCGGCTGGTGCAGCAGGCCATCGGTGACCAGCTGGCCAAGATGCTGTTGGCCGGCCA GGTGCACGACGGCGATACCGTGCCGGTCAACGTCAGCCCCGACGCCGACTCGCTGATCCTGGGCTGA
Upstream 100 bases:
>100_bases GGCGGATGCACAAGTGGGTAAAATTGAGCGGAACAGACTCAACATTGACGGCGTTGAACAACCCGACAAGCATTTCGAAC GGACCCCGAATGGAGGTGTC
Downstream 100 bases:
>100_bases TTTGTCTGACGAGCAGACGCGGAATCGCACGCGTGAGGTCCGCGCAGTGCGATTCCGCGTCTGCTCGGCGTGGAAGGTGA CTGGGTTGTGACCTGGTCGC
Product: putative endopeptidase ATP binding protein (chain b) clpB
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 848; Mature: 848
Protein sequence:
>848_residues MDSFNPTTKTQAALTAALQAASTAGNPEIRPAHLLMALLTQNDGIAAPLLEAVGVEPATVRAETQRLLDRLPQATGASTQ PQLSRESLAAITTAQQLATELDDEYVSTEHVMVGLATGDSDVAKLLTGHGASPQALREAFVKVRGSARVTSPEPEATYQA LQKYSTDLTARAREGKLDPVIGRDNEIRRVVQVLSRRTKNNPVLIGEPGVGKTAIVEGLAQRIVAGDVPESLRDKTIVAL DLGSMVAGSKYRGEFEERLKAVLDDIKNSAGQIITFIDELHTIVGAGATGEGAMDAGNMIKPMLARGELRLVGATTLDEY RKHIEKDAALERRFQQVYVGEPSVEDTIGILRGLKDRYEVHHGVRITDSALVAAATLSDRYITARFLPDKAIDLVDEAAS RLRMEIDSRPVEIDEVERLVRRLEIEEMALSKEEDEASAERLAKLRSELADQKEKLAELTTRWQNEKNAIEIVRDLKEQL EALRGESERAERDGDLAKAAELRYGRIPEVEKKLDAALPQAQAREQVMLKEEVGPDDIADVVSAWTGIPAGRLLEGETAK LLRMEDELGKRVIGQKAAVTAVSDAVRRSRAGVSDPNRPTGAFMFLGPTGVGKTELAKALADFLFDDERAMVRIDMSEYG EKHTVARLIGAPPGYVGYEAGGQLTEAVRRRPYTVVLFDEIEKAHPDVFDVLLQVLDEGRLTDGHGRTVDFRNTILILTS NLGSGGSAEQVLAAVRATFKPEFINRLDDVLIFEGLNPEELVRIVDIQLAQLGKRLAQRRLQLQVSLPAKRWLAQRGFDP VYGARPLRRLVQQAIGDQLAKMLLAGQVHDGDTVPVNVSPDADSLILG
Sequences:
>Translated_848_residues MDSFNPTTKTQAALTAALQAASTAGNPEIRPAHLLMALLTQNDGIAAPLLEAVGVEPATVRAETQRLLDRLPQATGASTQ PQLSRESLAAITTAQQLATELDDEYVSTEHVMVGLATGDSDVAKLLTGHGASPQALREAFVKVRGSARVTSPEPEATYQA LQKYSTDLTARAREGKLDPVIGRDNEIRRVVQVLSRRTKNNPVLIGEPGVGKTAIVEGLAQRIVAGDVPESLRDKTIVAL DLGSMVAGSKYRGEFEERLKAVLDDIKNSAGQIITFIDELHTIVGAGATGEGAMDAGNMIKPMLARGELRLVGATTLDEY RKHIEKDAALERRFQQVYVGEPSVEDTIGILRGLKDRYEVHHGVRITDSALVAAATLSDRYITARFLPDKAIDLVDEAAS RLRMEIDSRPVEIDEVERLVRRLEIEEMALSKEEDEASAERLAKLRSELADQKEKLAELTTRWQNEKNAIEIVRDLKEQL EALRGESERAERDGDLAKAAELRYGRIPEVEKKLDAALPQAQAREQVMLKEEVGPDDIADVVSAWTGIPAGRLLEGETAK LLRMEDELGKRVIGQKAAVTAVSDAVRRSRAGVSDPNRPTGAFMFLGPTGVGKTELAKALADFLFDDERAMVRIDMSEYG EKHTVARLIGAPPGYVGYEAGGQLTEAVRRRPYTVVLFDEIEKAHPDVFDVLLQVLDEGRLTDGHGRTVDFRNTILILTS NLGSGGSAEQVLAAVRATFKPEFINRLDDVLIFEGLNPEELVRIVDIQLAQLGKRLAQRRLQLQVSLPAKRWLAQRGFDP VYGARPLRRLVQQAIGDQLAKMLLAGQVHDGDTVPVNVSPDADSLILG >Mature_848_residues MDSFNPTTKTQAALTAALQAASTAGNPEIRPAHLLMALLTQNDGIAAPLLEAVGVEPATVRAETQRLLDRLPQATGASTQ PQLSRESLAAITTAQQLATELDDEYVSTEHVMVGLATGDSDVAKLLTGHGASPQALREAFVKVRGSARVTSPEPEATYQA LQKYSTDLTARAREGKLDPVIGRDNEIRRVVQVLSRRTKNNPVLIGEPGVGKTAIVEGLAQRIVAGDVPESLRDKTIVAL DLGSMVAGSKYRGEFEERLKAVLDDIKNSAGQIITFIDELHTIVGAGATGEGAMDAGNMIKPMLARGELRLVGATTLDEY RKHIEKDAALERRFQQVYVGEPSVEDTIGILRGLKDRYEVHHGVRITDSALVAAATLSDRYITARFLPDKAIDLVDEAAS RLRMEIDSRPVEIDEVERLVRRLEIEEMALSKEEDEASAERLAKLRSELADQKEKLAELTTRWQNEKNAIEIVRDLKEQL EALRGESERAERDGDLAKAAELRYGRIPEVEKKLDAALPQAQAREQVMLKEEVGPDDIADVVSAWTGIPAGRLLEGETAK LLRMEDELGKRVIGQKAAVTAVSDAVRRSRAGVSDPNRPTGAFMFLGPTGVGKTELAKALADFLFDDERAMVRIDMSEYG EKHTVARLIGAPPGYVGYEAGGQLTEAVRRRPYTVVLFDEIEKAHPDVFDVLLQVLDEGRLTDGHGRTVDFRNTILILTS NLGSGGSAEQVLAAVRATFKPEFINRLDDVLIFEGLNPEELVRIVDIQLAQLGKRLAQRRLQLQVSLPAKRWLAQRGFDP VYGARPLRRLVQQAIGDQLAKMLLAGQVHDGDTVPVNVSPDADSLILG
Specific function: Part of a stress-induced multi-chaperone system, it is involved in the recovery of the cell from heat-induced damage, in cooperation with DnaK, DnaJ and GrpE. Acts before DnaK, in the processing of protein aggregates. Protein binding stimulates the ATPase
COG id: COG0542
COG function: function code O; ATPases with chaperone activity, ATP-binding subunit
Gene ontology:
Cell location: Cytoplasm (Probable)
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the clpA/clpB family
Homologues:
Organism=Homo sapiens, GI13540606, Length=328, Percent_Identity=35.3658536585366, Blast_Score=201, Evalue=2e-51
Organism=Escherichia coli, GI1788943, Length=847, Percent_Identity=55.2538370720189, Blast_Score=929, Evalue=0.0
Organism=Escherichia coli, GI1787109, Length=426, Percent_Identity=41.5492957746479, Blast_Score=292, Evalue=7e-80
Organism=Saccharomyces cerevisiae, GI6320464, Length=719, Percent_Identity=54.1029207232267, Blast_Score=780, Evalue=0.0
Organism=Saccharomyces cerevisiae, GI6323002, Length=863, Percent_Identity=41.7149478563152, Blast_Score=656, Evalue=0.0
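For downstream use, a homologue entry of the form shown above can be turned into a structured record. This is a hedged sketch only: `parse_homologue` is a hypothetical helper, and the field layout (comma-separated `key=value` pairs plus a bare `GI…` token) is inferred from this record rather than from any documented format.

```python
def parse_homologue(record: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=...' entry into a dict."""
    fields = {}
    for part in record.strip().strip(",").split(","):
        part = part.strip()
        if "=" in part:
            key, value = part.split("=", 1)
            fields[key] = value
        elif part.startswith("GI"):
            # The GI number appears as a bare token without '='.
            fields["GI"] = part[2:]
    # Convert the numeric fields; float() accepts forms such as '2e-51'.
    fields["Length"] = int(fields["Length"])
    fields["Percent_Identity"] = float(fields["Percent_Identity"])
    fields["Blast_Score"] = int(fields["Blast_Score"])
    fields["Evalue"] = float(fields["Evalue"])
    return fields


hit = parse_homologue(
    "Organism=Escherichia coli, GI1788943, Length=847, "
    "Percent_Identity=55.2538370720189, Blast_Score=929, Evalue=0.0"
)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 1))
```

Rounding the identity value at display time, as in the last line, keeps the stored number unchanged while making the output easier to read.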
Paralogues:
None
Copy number: 560 molecules/cell in glucose minimal media [C]
Swissprot (AC and ID): CLPB_MYCBO (P63287)
Other databases:
- EMBL: U73653
- EMBL: BX248335
- RefSeq: NP_854054.1
- ProteinModelPortal: P63287
- EnsemblBacteria: EBMYCT00000015456
- GeneID: 1091320
- GenomeReviews: BX248333_GR
- KEGG: mbo:Mb0391c
- GeneTree: EBGT00070000031953
- HOGENOM: HBG413133
- OMA: IQMGRLR
- ProtClustDB: CLSK790482
- BioCyc: MBOV233413:MB0391C-MONOMER
- GO: GO:0005737
- InterPro: IPR003593
- InterPro: IPR013093
- InterPro: IPR003959
- InterPro: IPR018368
- InterPro: IPR017730
- InterPro: IPR001270
- InterPro: IPR019489
- InterPro: IPR004176
- InterPro: IPR023150
- Gene3D: G3DSA:1.10.1780.10
- PRINTS: PR00300
- SMART: SM00382
- TIGRFAMs: TIGR03346
Pfam domain/function: PF00004 AAA; PF07724 AAA_2; PF02861 Clp_N; PF10431 ClpB_D2-small
EC number: NA
Molecular weight: Translated: 92570; Mature: 92570
Theoretical pI: Translated: 4.98; Mature: 4.98
Prosite motif: PS00870 CLPAB_1; PS00871 CLPAB_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.0 %Cys (Translated Protein)
1.8 %Met (Translated Protein)
1.8 %Cys+Met (Translated Protein)
0.0 %Cys (Mature Protein)
1.8 %Met (Mature Protein)
1.8 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MDSFNPTTKTQAALTAALQAASTAGNPEIRPAHLLMALLTQNDGIAAPLLEAVGVEPATV CCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHCCCCCHHHHHHHHCCCCHHH RAETQRLLDRLPQATGASTQPQLSRESLAAITTAQQLATELDDEYVSTEHVMVGLATGDS HHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCH DVAKLLTGHGASPQALREAFVKVRGSARVTSPEPEATYQALQKYSTDLTARAREGKLDPV HHHHHHHCCCCCHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHCCHHHHHHCCCCCCCC IGRDNEIRRVVQVLSRRTKNNPVLIGEPGVGKTAIVEGLAQRIVAGDVPESLRDKTIVAL CCCCHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHHHHHCCCCHHHCCCEEEEE DLGSMVAGSKYRGEFEERLKAVLDDIKNSAGQIITFIDELHTIVGAGATGEGAMDAGNMI ECCHHHCCCHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHH KPMLARGELRLVGATTLDEYRKHIEKDAALERRFQQVYVGEPSVEDTIGILRGLKDRYEV HHHHCCCCEEEEECHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHH HHGVRITDSALVAAATLSDRYITARFLPDKAIDLVDEAASRLRMEIDSRPVEIDEVERLV HCCCEEECHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHH RRLEIEEMALSKEEDEASAERLAKLRSELADQKEKLAELTTRWQNEKNAIEIVRDLKEQL HHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH EALRGESERAERDGDLAKAAELRYGRIPEVEKKLDAALPQAQAREQVMLKEEVGPDDIAD HHHCCCHHHHHCCCCHHHHHHHCCCCCCHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHH VVSAWTGIPAGRLLEGETAKLLRMEDELGKRVIGQKAAVTAVSDAVRRSRAGVSDPNRPT HHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCC GAFMFLGPTGVGKTELAKALADFLFDDERAMVRIDMSEYGEKHTVARLIGAPPGYVGYEA CEEEEECCCCCCHHHHHHHHHHHHCCCCCEEEEEEHHHCCCHHHHHHHHCCCCCCCCCCC GGQLTEAVRRRPYTVVLFDEIEKAHPDVFDVLLQVLDEGRLTDGHGRTVDFRNTILILTS CCHHHHHHHHCCCEEEEEHHHHHHCCHHHHHHHHHHHCCCCCCCCCCEEEECCEEEEEEE NLGSGGSAEQVLAAVRATFKPEFINRLDDVLIFEGLNPEELVRIVDIQLAQLGKRLAQRR CCCCCCCHHHHHHHHHHHCCHHHHHHHHHEEEECCCCHHHHHHHHHHHHHHHHHHHHHHH LQLQVSLPAKRWLAQRGFDPVYGARPLRRLVQQAIGDQLAKMLLAGQVHDGDTVPVNVSP HHEEEECCHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEECC DADSLILG CCCCCCCC >Mature Secondary Structure MDSFNPTTKTQAALTAALQAASTAGNPEIRPAHLLMALLTQNDGIAAPLLEAVGVEPATV CCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHCCCCCHHHHHHHHCCCCHHH RAETQRLLDRLPQATGASTQPQLSRESLAAITTAQQLATELDDEYVSTEHVMVGLATGDS 
HHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCH DVAKLLTGHGASPQALREAFVKVRGSARVTSPEPEATYQALQKYSTDLTARAREGKLDPV HHHHHHHCCCCCHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHCCHHHHHHCCCCCCCC IGRDNEIRRVVQVLSRRTKNNPVLIGEPGVGKTAIVEGLAQRIVAGDVPESLRDKTIVAL CCCCHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHHHHHCCCCHHHCCCEEEEE DLGSMVAGSKYRGEFEERLKAVLDDIKNSAGQIITFIDELHTIVGAGATGEGAMDAGNMI ECCHHHCCCHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHH KPMLARGELRLVGATTLDEYRKHIEKDAALERRFQQVYVGEPSVEDTIGILRGLKDRYEV HHHHCCCCEEEEECHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHH HHGVRITDSALVAAATLSDRYITARFLPDKAIDLVDEAASRLRMEIDSRPVEIDEVERLV HCCCEEECHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHH RRLEIEEMALSKEEDEASAERLAKLRSELADQKEKLAELTTRWQNEKNAIEIVRDLKEQL HHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH EALRGESERAERDGDLAKAAELRYGRIPEVEKKLDAALPQAQAREQVMLKEEVGPDDIAD HHHCCCHHHHHCCCCHHHHHHHCCCCCCHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHH VVSAWTGIPAGRLLEGETAKLLRMEDELGKRVIGQKAAVTAVSDAVRRSRAGVSDPNRPT HHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCC GAFMFLGPTGVGKTELAKALADFLFDDERAMVRIDMSEYGEKHTVARLIGAPPGYVGYEA CEEEEECCCCCCHHHHHHHHHHHHCCCCCEEEEEEHHHCCCHHHHHHHHCCCCCCCCCCC GGQLTEAVRRRPYTVVLFDEIEKAHPDVFDVLLQVLDEGRLTDGHGRTVDFRNTILILTS CCHHHHHHHHCCCEEEEEHHHHHHCCHHHHHHHHHHHCCCCCCCCCCEEEECCEEEEEEE NLGSGGSAEQVLAAVRATFKPEFINRLDDVLIFEGLNPEELVRIVDIQLAQLGKRLAQRR CCCCCCCHHHHHHHHHHHCCHHHHHHHHHEEEECCCCHHHHHHHHHHHHHHHHHHHHHHH LQLQVSLPAKRWLAQRGFDPVYGARPLRRLVQQAIGDQLAKMLLAGQVHDGDTVPVNVSP HHEEEECCHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEECC DADSLILG CCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: Hydrolase; Acting on peptide bonds (Peptidases); Serine endopeptidases [C]
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9692182; 12788972