Definition: Bacillus anthracis str. Sterne chromosome, complete genome.
Accession: NC_005945
Length: 5,228,663


The map label for this gene is clpX

Identifier: 49187364

GI number: 49187364

Start: 4281616

End: 4282875

Strand: Reverse

Name: clpX

Synonym: BAS4369

Alternate gene names: 49187364

Gene position: 4282875-4281616 (Counterclockwise)

Preceding gene: 49187365

Following gene: 49187363

Centisome position: 81.91
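
The gene length and centisome value can be reproduced from the coordinates listed above. This is a minimal sketch, assuming (the record does not say so) that the centisome position is the gene's 5' coordinate expressed as a percentage of the chromosome length given under Length. The function and constant names are illustrative only.

```python
# Values copied from this record.
GENOME_LENGTH = 5_228_663  # chromosome length
GENE_START = 4_281_616     # lower coordinate of the gene
GENE_END = 4_282_875       # higher coordinate; the 5' end on the reverse strand


def gene_length(start: int, end: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length (assumed definition)."""
    return round(100.0 * position / genome_length, 2)


length = gene_length(GENE_START, GENE_END)
position = centisome(GENE_END, GENOME_LENGTH)
print(length, position)
```

Under that assumption the sketch gives 1260 bases and 81.91, which agree with the gene sequence length and the centisome position listed in this record. 1260 bases correspond to 419 codons plus one stop codon.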

GC content: 37.94

Gene sequence:

>1260_bases
ATGTTTAAATTTAATGATGAAAAAGGGCAATTAAAATGTTCTTTCTGTGGTAAAACACAAACACAAGTCCGAAAGTTAGT
TGCAGGTCCAGGTGTTTACATTTGTGACGAGTGTATCGAACTTTGTACTGAAATTGTACAAGAGGAGCTTGCGAAAGACG
AAGAAGTAGAATTCAAAGATGTACCGAAACCGGTAGAAATTCGTGAAATTTTAGATGAGTATGTCATCGGACAAGATAAC
GCGAAAAAAGCACTAGCGGTAGCGGTATATAACCATTACAAACGCATTAATTCTAACAGCAAAATTGATGATGTAGAATT
AGCGAAGAGTAATATCGCACTTATCGGGCCAACAGGTAGTGGTAAAACATTACTGGCACAAACGTTAGCGCGTATTTTAA
ATGTTCCATTTGCAATCGCGGACGCAACATCTTTAACTGAAGCTGGATACGTTGGGGAAGATGTAGAAAACATCTTACTT
AAATTAATCCAAGCAGCGGATTATGATGTAGAAAAAGCGGAAAAAGGAATCATTTATATTGATGAGATTGATAAAGTGGC
ACGTAAGTCCGAAAATCCATCAATTACACGTGATGTATCTGGTGAAGGTGTGCAGCAGGCACTTCTGAAAATTTTAGAAG
GTACTGTAGCAAGCGTTCCACCTCAAGGTGGTCGTAAGCATCCGCACCAAGAGTTTATTCAAATTGATACAACGAATATC
TTATTCATCTGTGGTGGAGCGTTTGATGGCATCGAGCCAATTATTAAACGCCGTCTTGGTGAAAAGGTAATTGGATTTGG
TTCTGAGAAGAAAAATGCTGATGTAAATGAGAAGCATGTTTTATCTCACGTATTACCAGAAGACCTTTTAAGATTTGGTT
TAATTCCAGAGTTTATCGGTCGTCTTCCAGTTATTGCGAACCTAGAGCCACTTGATGAAGATGCTCTTGTTGATATTTTA
ACGAAACCGAAAAATGCACTTGTTAAGCAATTCCAAAAACTATTGGAGCTTGACGATGTTGAGTTAGAGTTTGAAGAAGG
TGCACTAATTGAAATTGCGAAAAAAGCAATTGAACGTAAAACAGGTGCTCGTGGACTTCGTTCTATTATTGAAGGCTTAA
TGCTTGAGGTAATGTTCGAGCTACCATCTCGCAAAGATATCGAGAAGTGTATTCTTACAAAAGAAACAGTAGCTGATAAT
GCAGCACCAAAATTGGTGTTACAAGACGGTACTGTACTTGATACAAAAACATCTGCATAA

Upstream 100 bases:

>100_bases
ATTTCTCGCCACGTAGTGACATAATATGTATTTACATACGATACATTTATCGTGTACATACGAAGGCAAGAGGTTAGCAC
TTTGTAAGGGGTGTGAAAAT

Downstream 100 bases:

>100_bases
TGCGAAGGAGAAACAGCAATTTTTATAATTGCTGTTTTTCTTTATGTTTAGCTATGTTCCTCCGCGGAAATACTAAGAGA
TAAAACATTGCGGGAGGAAA

Product: ATP-dependent protease ATP-binding subunit ClpX

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 419; Mature: 419

Protein sequence:

>419_residues
MFKFNDEKGQLKCSFCGKTQTQVRKLVAGPGVYICDECIELCTEIVQEELAKDEEVEFKDVPKPVEIREILDEYVIGQDN
AKKALAVAVYNHYKRINSNSKIDDVELAKSNIALIGPTGSGKTLLAQTLARILNVPFAIADATSLTEAGYVGEDVENILL
KLIQAADYDVEKAEKGIIYIDEIDKVARKSENPSITRDVSGEGVQQALLKILEGTVASVPPQGGRKHPHQEFIQIDTTNI
LFICGGAFDGIEPIIKRRLGEKVIGFGSEKKNADVNEKHVLSHVLPEDLLRFGLIPEFIGRLPVIANLEPLDEDALVDIL
TKPKNALVKQFQKLLELDDVELEFEEGALIEIAKKAIERKTGARGLRSIIEGLMLEVMFELPSRKDIEKCILTKETVADN
AAPKLVLQDGTVLDTKTSA
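
The protein sequence follows from the gene sequence by codon-by-codon translation. The sketch below is illustrative, not the pipeline that produced this record: it builds the standard codon table (the bacterial translation table 11 uses the same amino acid assignments) and translates the first ten and the last four codons of the gene sequence listed above. `translate` is a hypothetical helper name.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering;
# '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 bases of the gene sequence in this record.
first_codons = "ATGTTTAAATTTAATGATGAAAAAGGGCAA"
last_codons = "ACATCTGCATAA"
print(translate(first_codons))  # MFKFNDEKGQ
print(translate(last_codons))   # TSA
```

The two fragments match the start (MFKFNDEKGQ) and the end (TSA) of the protein sequence. Because the gene sequence is given in coding orientation, no reverse complement is needed even though the gene lies on the reverse strand.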

Sequences:

>Translated_419_residues
MFKFNDEKGQLKCSFCGKTQTQVRKLVAGPGVYICDECIELCTEIVQEELAKDEEVEFKDVPKPVEIREILDEYVIGQDN
AKKALAVAVYNHYKRINSNSKIDDVELAKSNIALIGPTGSGKTLLAQTLARILNVPFAIADATSLTEAGYVGEDVENILL
KLIQAADYDVEKAEKGIIYIDEIDKVARKSENPSITRDVSGEGVQQALLKILEGTVASVPPQGGRKHPHQEFIQIDTTNI
LFICGGAFDGIEPIIKRRLGEKVIGFGSEKKNADVNEKHVLSHVLPEDLLRFGLIPEFIGRLPVIANLEPLDEDALVDIL
TKPKNALVKQFQKLLELDDVELEFEEGALIEIAKKAIERKTGARGLRSIIEGLMLEVMFELPSRKDIEKCILTKETVADN
AAPKLVLQDGTVLDTKTSA
>Mature_419_residues
MFKFNDEKGQLKCSFCGKTQTQVRKLVAGPGVYICDECIELCTEIVQEELAKDEEVEFKDVPKPVEIREILDEYVIGQDN
AKKALAVAVYNHYKRINSNSKIDDVELAKSNIALIGPTGSGKTLLAQTLARILNVPFAIADATSLTEAGYVGEDVENILL
KLIQAADYDVEKAEKGIIYIDEIDKVARKSENPSITRDVSGEGVQQALLKILEGTVASVPPQGGRKHPHQEFIQIDTTNI
LFICGGAFDGIEPIIKRRLGEKVIGFGSEKKNADVNEKHVLSHVLPEDLLRFGLIPEFIGRLPVIANLEPLDEDALVDIL
TKPKNALVKQFQKLLELDDVELEFEEGALIEIAKKAIERKTGARGLRSIIEGLMLEVMFELPSRKDIEKCILTKETVADN
AAPKLVLQDGTVLDTKTSA

Specific function: ATP-dependent specificity component of the Clp protease that directs the protease to specific substrates. It can also perform chaperone functions in the absence of ClpP.

COG id: COG1219

COG function: function code O; ATP-dependent protease Clp, ATPase subunit

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the ClpX chaperone family

Homologues:

Organism=Homo sapiens, GI7242140, Length=333, Percent_Identity=51.0510510510511, Blast_Score=314, Evalue=9e-86,
Organism=Escherichia coli, GI1786642, Length=409, Percent_Identity=61.6136919315403, Blast_Score=513, Evalue=1e-147,
Organism=Escherichia coli, GI1790366, Length=103, Percent_Identity=45.6310679611651, Blast_Score=97, Evalue=2e-21,
Organism=Caenorhabditis elegans, GI71982908, Length=311, Percent_Identity=45.9807073954984, Blast_Score=275, Evalue=3e-74,
Organism=Caenorhabditis elegans, GI71982905, Length=311, Percent_Identity=45.9807073954984, Blast_Score=275, Evalue=4e-74,
Organism=Caenorhabditis elegans, GI71988663, Length=434, Percent_Identity=36.6359447004608, Blast_Score=248, Evalue=4e-66,
Organism=Caenorhabditis elegans, GI71988660, Length=260, Percent_Identity=38.8461538461538, Blast_Score=173, Evalue=1e-43,
Organism=Saccharomyces cerevisiae, GI6319704, Length=412, Percent_Identity=38.8349514563107, Blast_Score=260, Evalue=2e-70,
Organism=Drosophila melanogaster, GI24648291, Length=319, Percent_Identity=49.2163009404389, Blast_Score=296, Evalue=1e-80,
Organism=Drosophila melanogaster, GI24648289, Length=319, Percent_Identity=49.2163009404389, Blast_Score=296, Evalue=1e-80,
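
Each homologue line is a comma-separated list of `Key=value` fields, except for the GI identifier, which has no key. A minimal parsing sketch follows; `parse_hit` and the `GI` dictionary key are names chosen for illustration and are not part of the record format.

```python
def parse_hit(line: str) -> dict:
    """Parse one homologue line of this record into a dictionary."""
    hit = {}
    # Drop the trailing comma, then split into individual fields.
    for field in line.strip().rstrip(",").split(", "):
        if "=" in field:
            key, value = field.split("=", 1)
            hit[key] = value
        elif field.startswith("GI"):
            # The GI identifier is the only field written without a key.
            hit["GI"] = field[2:]
    # Convert the numeric fields from text.
    for key, cast in (
        ("Length", int),
        ("Percent_Identity", float),
        ("Blast_Score", int),
        ("Evalue", float),
    ):
        hit[key] = cast(hit[key])
    return hit


line = (
    "Organism=Escherichia coli, GI1786642, Length=409, "
    "Percent_Identity=61.6136919315403, Blast_Score=513, Evalue=1e-147,"
)
hit = parse_hit(line)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 2))
```

Parsed this way, the hits can be sorted by `Blast_Score` or filtered by `Evalue` without further string handling.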

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): CLPX_BACAA (C3P9F7)

Other databases:

- EMBL:   CP001598
- RefSeq:   YP_002868756.1
- ProteinModelPortal:   C3P9F7
- EnsemblBacteria:   EBBACT00000129234
- GeneID:   7851138
- GenomeReviews:   CP001598_GR
- KEGG:   bai:BAA_4722
- GeneTree:   EBGT00050000000359
- ProtClustDB:   PRK05342
- HAMAP:   MF_00175
- InterPro:   IPR003593
- InterPro:   IPR013093
- InterPro:   IPR019489
- InterPro:   IPR004487
- InterPro:   IPR010603
- PANTHER:   PTHR11262:SF4
- SMART:   SM00382
- TIGRFAMs:   TIGR00382

Pfam domain/function: PF07724 AAA_2; PF10431 ClpB_D2-small; PF06689 zf-C4_ClpX

EC number: 3.4.21.-

Molecular weight: Translated: 46200; Mature: 46200

Theoretical pI: Translated: 4.68; Mature: 4.68

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.7 %Cys     (Translated Protein)
0.7 %Met     (Translated Protein)
2.4 %Cys+Met (Translated Protein)
1.7 %Cys     (Mature Protein)
0.7 %Met     (Mature Protein)
2.4 %Cys+Met (Mature Protein)
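
The percentages above can be checked against the protein sequence. The sketch assumes (the record does not state the method) that each value is the residue count divided by the sequence length, rounded to one decimal place; `percent` is a hypothetical helper.

```python
# Protein sequence copied from this record (419 residues).
PROTEIN = (
    "MFKFNDEKGQLKCSFCGKTQTQVRKLVAGPGVYICDECIELCTEIVQEELAKDEEVEFKDVPKPVEIREILDEYVIGQDN"
    "AKKALAVAVYNHYKRINSNSKIDDVELAKSNIALIGPTGSGKTLLAQTLARILNVPFAIADATSLTEAGYVGEDVENILL"
    "KLIQAADYDVEKAEKGIIYIDEIDKVARKSENPSITRDVSGEGVQQALLKILEGTVASVPPQGGRKHPHQEFIQIDTTNI"
    "LFICGGAFDGIEPIIKRRLGEKVIGFGSEKKNADVNEKHVLSHVLPEDLLRFGLIPEFIGRLPVIANLEPLDEDALVDIL"
    "TKPKNALVKQFQKLLELDDVELEFEEGALIEIAKKAIERKTGARGLRSIIEGLMLEVMFELPSRKDIEKCILTKETVADN"
    "AAPKLVLQDGTVLDTKTSA"
)


def percent(residues: str, sequence: str) -> float:
    """Percentage of the sequence made up of the given one-letter residues."""
    count = sum(sequence.count(residue) for residue in residues)
    return round(100.0 * count / len(sequence), 1)


print(percent("C", PROTEIN), percent("M", PROTEIN), percent("CM", PROTEIN))
```

The sequence contains 7 cysteines and 3 methionines, so the sketch gives 1.7, 0.7 and 2.4, in agreement with the listed values. The translated and mature sequences in this record are identical, which is why both sets of percentages coincide.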

Secondary structure:

>Translated Secondary Structure
MFKFNDEKGQLKCSFCGKTQTQVRKLVAGPGVYICDECIELCTEIVQEELAKDEEVEFKD
CCCCCCCCCCEEEEECCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCHHC
VPKPVEIREILDEYVIGQDNAKKALAVAVYNHYKRINSNSKIDDVELAKSNIALIGPTGS
CCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHCCEEEECCCCC
GKTLLAQTLARILNVPFAIADATSLTEAGYVGEDVENILLKLIQAADYDVEKAEKGIIYI
CHHHHHHHHHHHHCCCCHHHCCHHHHHCCCCCHHHHHHHHHHHHHCCCCHHHHCCCEEEE
DEIDKVARKSENPSITRDVSGEGVQQALLKILEGTVASVPPQGGRKHPHQEFIQIDTTNI
HHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHCHHHCCCCCCCCCCCHHHHEEECCCEE
LFICGGAFDGIEPIIKRRLGEKVIGFGSEKKNADVNEKHVLSHVLPEDLLRFGLIPEFIG
EEEECCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCHHHHC
RLPVIANLEPLDEDALVDILTKPKNALVKQFQKLLELDDVELEFEEGALIEIAKKAIERK
CCCEEECCCCCCHHHHHHHHHCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHHH
TGARGLRSIIEGLMLEVMFELPSRKDIEKCILTKETVADNAAPKLVLQDGTVLDTKTSA
HHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCEEEEECCEEECCCCCC
>Mature Secondary Structure
MFKFNDEKGQLKCSFCGKTQTQVRKLVAGPGVYICDECIELCTEIVQEELAKDEEVEFKD
CCCCCCCCCCEEEEECCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCHHC
VPKPVEIREILDEYVIGQDNAKKALAVAVYNHYKRINSNSKIDDVELAKSNIALIGPTGS
CCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHCCEEEECCCCC
GKTLLAQTLARILNVPFAIADATSLTEAGYVGEDVENILLKLIQAADYDVEKAEKGIIYI
CHHHHHHHHHHHHCCCCHHHCCHHHHHCCCCCHHHHHHHHHHHHHCCCCHHHHCCCEEEE
DEIDKVARKSENPSITRDVSGEGVQQALLKILEGTVASVPPQGGRKHPHQEFIQIDTTNI
HHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHCHHHCCCCCCCCCCCHHHHEEECCCEE
LFICGGAFDGIEPIIKRRLGEKVIGFGSEKKNADVNEKHVLSHVLPEDLLRFGLIPEFIG
EEEECCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCHHHHC
RLPVIANLEPLDEDALVDILTKPKNALVKQFQKLLELDDVELEFEEGALIEIAKKAIERK
CCCEEECCCCCCHHHHHHHHHCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHHH
TGARGLRSIIEGLMLEVMFELPSRKDIEKCILTKETVADNAAPKLVLQDGTVLDTKTSA
HHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCEEEEECCEEECCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA