Definition Ehrlichia chaffeensis str. Arkansas, complete genome.
Accession NC_007799
Length 1,176,248

The map label for this gene is clpX [H]

Identifier: 88658550

GI number: 88658550

Start: 927610

End: 928830

Strand: Reverse

Name: clpX [H]

Synonym: ECH_0900

Alternate gene names: 88658550

Gene position: 928830-927610 (Counterclockwise)
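
The Start and End values appear to be 1-based and inclusive, since End - Start + 1 gives the 1221 bases reported for the gene sequence. Because the gene lies on the reverse strand, the listed sequence would be the reverse complement of the forward-strand slice. A minimal sketch of both steps follows; the helper names are hypothetical and not part of the record.

```python
# Hypothetical helpers for illustration; not part of the database record.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1

def reverse_complement(seq: str) -> str:
    """Complement each base, then reverse, to read the opposite strand."""
    return seq.translate(COMPLEMENT)[::-1]

# Coordinates taken from the Start and End fields of this record.
print(gene_length(927610, 928830))
```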

Preceding gene: 88658316

Following gene: 88657790

Centisome position: 78.97
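
The centisome value is consistent with the gene's first base on the reverse strand (928830) expressed as a percentage of the genome length (1,176,248). This relationship is inferred from the numbers in the record rather than stated by it, and the function name below is hypothetical.

```python
def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of genome length, rounded to two decimals."""
    return round(100.0 * position / genome_length, 2)

# Assumption: the reverse-strand gene start (the End coordinate) is the reference point.
print(centisome(928830, 1176248))
```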

GC content: 32.51
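
GC content is conventionally the percentage of G and C bases in a sequence. The sketch below assumes that convention applies here; the function name is hypothetical, and a short toy string is used in place of the full gene sequence.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)

# Toy input: the first ten bases of the gene sequence in this record.
print(gc_percent("ATGGCAGATA"))
```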

Gene sequence:

>1221_bases
ATGGCAGATAATGAAAAAAATTCTTGTAGCTGTTCTTTCTGCGGAAAGATTCATAGTGAAGTACGTAAGTTAATTGCTGG
GCCATCAGTTTTTATTTGTAATGAGTGTATTGATTTATGTAGTGGTATATTACAAGAAGAAAGTAGATCTTATAAAAAGA
CGGATACTCTTAAGTTAAAACCAAAGGAAATAAAGAAAGTTCTTGATGAGTATGTTATAGGGCAAGAGCACTCAAAAAAA
GTTTTATCAGTTGCTGTGTATAATCATTATAAACGTTTATCGAATTTAAGTGTTATTAGTGAAGTTGAGATTTCTAAGTC
AAATGTTTTGTTGATTGGACCTACTGGTTCTGGAAAAACATTATTAGCTCGTACTTTAGCTAGAGTTTTACAAGTTCCTT
TTGCGATGGCTGATGCTACTACTTTAACGGAAGCAGGGTATGTTGGAGAGGATGTAGAGAATATATTGTTAAAATTATTG
CAGGCAGCTAATTTTAATGTTGATGCAGCACAACGTGGCATAATTTATATTGATGAAGTAGATAAAATTTCTAGAAAGTC
TGAAAATACTTCTATTACTCGTGATGTATCTGGCGAAGGTGTTCAACAAGCTTTATTGAAAGTTATTGAAGGCACAGTTT
CTTCTGTCCCACCTCAAGGTGGTAGGAAGCATCCACATCAAGAGTTTATACAAATAAATACTGATAATATTTTATTTATA
TTTGGTGGTGCGTTTGATGGTTTAGATAAAATTATAGAATCTCGTCATAGAGGTAGTAGTATGGGGTTTGAAGCGAATGT
ACAAAAAGTATCAAAGAATAAAGATATTTTTTGTTACACTGAGCCAGAAGATTTAGTGAAGTTTGGTTTAATTCCGGAGT
TTGTTGGTAGAATTCCTGTTATTACATCTTTAGGTGAGCTTGATGAGAGTACTTTATGTCGTATTTTAGTTGAACCAAAA
AATTCTTTGGTTAAACAATACAAGAAACTTTTTGAAATGGATAATATTAATCTTCAGTTTGATGATAGTGCATTATCAGT
AATTGCAAAAAAGGCTGCTGTCAGGAAAACTGGTGCTAGAGGTTTAAGGGCTATTTTAGAAGCATTATTACTTGATTTAA
TGTTTGAAAGCCCTGGAAGTTCTGATGTGAATCAAGTAGTAATTAGTAAGGAGATGGTTGAAGAGTTGATGGTAAGCTCG
CACTTATTTTTAAAACATTAA

Upstream 100 bases:

>100_bases
AATTTTATGAGAGCTGAGAAAGCTAAAGATTTTGGTATTATTGATAAAGTTATTGAGAAGCGTCTCGATATTGGGGTAGA
GTAATTAAAATTGGAGAGAT

Downstream 100 bases:

>100_bases
AGGTATTATATGAAAAATAAAACTTTACTACCAGTACTCACGCTACGTGATACTATAGTATTTCCTCAAGTAGTAATACC
GTTATTTGTTGGAAGAGAAA

Product: ATP-dependent protease ATP-binding subunit ClpX

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 406; Mature: 405
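
The two residue counts follow from the 1221-base gene sequence if it includes the stop codon (the listed sequence ends in TAA) and if the mature form differs only by removal of the initiator methionine (the mature sequence in this record starts at ADNEK). A short sketch under those assumptions, with a hypothetical function name:

```python
def residue_counts(cds_bases: int):
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes the sequence includes one stop codon and that maturation
    removes only the initiator methionine.
    """
    if cds_bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = cds_bases // 3
    translated = codons - 1   # stop codon encodes no residue
    mature = translated - 1   # initiator Met removed
    return translated, mature

print(residue_counts(1221))
```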

Protein sequence:

>406_residues
MADNEKNSCSCSFCGKIHSEVRKLIAGPSVFICNECIDLCSGILQEESRSYKKTDTLKLKPKEIKKVLDEYVIGQEHSKK
VLSVAVYNHYKRLSNLSVISEVEISKSNVLLIGPTGSGKTLLARTLARVLQVPFAMADATTLTEAGYVGEDVENILLKLL
QAANFNVDAAQRGIIYIDEVDKISRKSENTSITRDVSGEGVQQALLKVIEGTVSSVPPQGGRKHPHQEFIQINTDNILFI
FGGAFDGLDKIIESRHRGSSMGFEANVQKVSKNKDIFCYTEPEDLVKFGLIPEFVGRIPVITSLGELDESTLCRILVEPK
NSLVKQYKKLFEMDNINLQFDDSALSVIAKKAAVRKTGARGLRAILEALLLDLMFESPGSSDVNQVVISKEMVEELMVSS
HLFLKH

Sequences:

>Translated_406_residues
MADNEKNSCSCSFCGKIHSEVRKLIAGPSVFICNECIDLCSGILQEESRSYKKTDTLKLKPKEIKKVLDEYVIGQEHSKK
VLSVAVYNHYKRLSNLSVISEVEISKSNVLLIGPTGSGKTLLARTLARVLQVPFAMADATTLTEAGYVGEDVENILLKLL
QAANFNVDAAQRGIIYIDEVDKISRKSENTSITRDVSGEGVQQALLKVIEGTVSSVPPQGGRKHPHQEFIQINTDNILFI
FGGAFDGLDKIIESRHRGSSMGFEANVQKVSKNKDIFCYTEPEDLVKFGLIPEFVGRIPVITSLGELDESTLCRILVEPK
NSLVKQYKKLFEMDNINLQFDDSALSVIAKKAAVRKTGARGLRAILEALLLDLMFESPGSSDVNQVVISKEMVEELMVSS
HLFLKH
>Mature_405_residues
ADNEKNSCSCSFCGKIHSEVRKLIAGPSVFICNECIDLCSGILQEESRSYKKTDTLKLKPKEIKKVLDEYVIGQEHSKKV
LSVAVYNHYKRLSNLSVISEVEISKSNVLLIGPTGSGKTLLARTLARVLQVPFAMADATTLTEAGYVGEDVENILLKLLQ
AANFNVDAAQRGIIYIDEVDKISRKSENTSITRDVSGEGVQQALLKVIEGTVSSVPPQGGRKHPHQEFIQINTDNILFIF
GGAFDGLDKIIESRHRGSSMGFEANVQKVSKNKDIFCYTEPEDLVKFGLIPEFVGRIPVITSLGELDESTLCRILVEPKN
SLVKQYKKLFEMDNINLQFDDSALSVIAKKAAVRKTGARGLRAILEALLLDLMFESPGSSDVNQVVISKEMVEELMVSSH
LFLKH

Specific function: ATP-dependent specificity component of the Clp protease. It directs the protease to specific substrates. Can perform chaperone functions in the absence of ClpP [H]

COG id: COG1219

COG function: function code O; ATP-dependent protease Clp, ATPase subunit

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the ClpX chaperone family [H]

Homologues:

Organism=Homo sapiens, GI7242140, Length=318, Percent_Identity=49.685534591195, Blast_Score=298, Evalue=6e-81,
Organism=Escherichia coli, GI1786642, Length=405, Percent_Identity=61.7283950617284, Blast_Score=504, Evalue=1e-144,
Organism=Escherichia coli, GI1790366, Length=225, Percent_Identity=33.3333333333333, Blast_Score=97, Evalue=1e-21,
Organism=Caenorhabditis elegans, GI71982908, Length=307, Percent_Identity=47.557003257329, Blast_Score=272, Evalue=2e-73,
Organism=Caenorhabditis elegans, GI71982905, Length=307, Percent_Identity=47.557003257329, Blast_Score=272, Evalue=3e-73,
Organism=Caenorhabditis elegans, GI71988663, Length=411, Percent_Identity=37.712895377129, Blast_Score=240, Evalue=1e-63,
Organism=Caenorhabditis elegans, GI71988660, Length=251, Percent_Identity=39.4422310756972, Blast_Score=161, Evalue=7e-40,
Organism=Saccharomyces cerevisiae, GI6319704, Length=429, Percent_Identity=39.1608391608392, Blast_Score=263, Evalue=5e-71,
Organism=Drosophila melanogaster, GI24648291, Length=312, Percent_Identity=47.4358974358974, Blast_Score=278, Evalue=7e-75,
Organism=Drosophila melanogaster, GI24648289, Length=312, Percent_Identity=47.4358974358974, Blast_Score=278, Evalue=7e-75,
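
Each homologue entry above is a comma-separated list of key=value fields plus a bare GI identifier, with a trailing comma. A minimal parsing sketch for that layout follows; the function name and the "GI" dictionary key are assumptions made for illustration.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict with typed numeric fields."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:
            continue  # skip the empty piece after the trailing comma
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare identifier such as GI1786642
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record

line = ("Organism=Escherichia coli, GI1786642, Length=405, "
        "Percent_Identity=61.7283950617284, Blast_Score=504, Evalue=1e-144,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 2))
```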

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003593
- InterPro:   IPR013093
- InterPro:   IPR019489
- InterPro:   IPR004487
- InterPro:   IPR010603 [H]

Pfam domain/function: PF07724 AAA_2; PF10431 ClpB_D2-small; PF06689 zf-C4_ClpX [H]

EC number: NA

Molecular weight: Translated: 44852; Mature: 44721

Theoretical pI: Translated: 6.71; Mature: 6.71

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

2.0 %Cys     (Translated Protein)
1.7 %Met     (Translated Protein)
3.7 %Cys+Met (Translated Protein)
2.0 %Cys     (Mature Protein)
1.5 %Met     (Mature Protein)
3.5 %Cys+Met (Mature Protein)
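
These percentages can be reproduced by counting C and M residues in the protein sequence listed in this record. The sketch below assumes the mature form is the translated sequence without its initiator methionine; the variable and function names are hypothetical.

```python
# Translated protein sequence copied from this record (406 residues).
TRANSLATED = (
    "MADNEKNSCSCSFCGKIHSEVRKLIAGPSVFICNECIDLCSGILQEESRSYKKTDTLKLKPKEIKKVLDEYVIGQEHSKK"
    "VLSVAVYNHYKRLSNLSVISEVEISKSNVLLIGPTGSGKTLLARTLARVLQVPFAMADATTLTEAGYVGEDVENILLKLL"
    "QAANFNVDAAQRGIIYIDEVDKISRKSENTSITRDVSGEGVQQALLKVIEGTVSSVPPQGGRKHPHQEFIQINTDNILFI"
    "FGGAFDGLDKIIESRHRGSSMGFEANVQKVSKNKDIFCYTEPEDLVKFGLIPEFVGRIPVITSLGELDESTLCRILVEPK"
    "NSLVKQYKKLFEMDNINLQFDDSALSVIAKKAAVRKTGARGLRAILEALLLDLMFESPGSSDVNQVVISKEMVEELMVSS"
    "HLFLKH"
)
MATURE = TRANSLATED[1:]  # assumption: only the initiator Met is removed

def cys_met_percent(protein: str):
    """Return (%Cys, %Met, %Cys+Met), each rounded to one decimal place."""
    n = len(protein)
    cys = protein.count("C")
    met = protein.count("M")
    pct = lambda count: round(100.0 * count / n, 1)
    return pct(cys), pct(met), pct(cys + met)

print(cys_met_percent(TRANSLATED))
print(cys_met_percent(MATURE))
```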

Secondary structure:

>Translated Secondary Structure
MADNEKNSCSCSFCGKIHSEVRKLIAGPSVFICNECIDLCSGILQEESRSYKKTDTLKLK
CCCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEC
PKEIKKVLDEYVIGQEHSKKVLSVAVYNHYKRLSNLSVISEVEISKSNVLLIGPTGSGKT
HHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCHHHHHEEECCCCEEEEECCCCCHH
LLARTLARVLQVPFAMADATTLTEAGYVGEDVENILLKLLQAANFNVDAAQRGIIYIDEV
HHHHHHHHHHHCCHHHHCHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCHHCCCEEEEECH
DKISRKSENTSITRDVSGEGVQQALLKVIEGTVSSVPPQGGRKHPHQEFIQINTDNILFI
HHHHHCCCCCCEECCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHEEECCCCEEEE
FGGAFDGLDKIIESRHRGSSMGFEANVQKVSKNKDIFCYTEPEDLVKFGLIPEFVGRIPV
ECCCHHHHHHHHHHHCCCCCCCCHHHHHHHCCCCCEEEEECHHHHHHHCCCHHHHCCCCH
ITSLGELDESTLCRILVEPKNSLVKQYKKLFEMDNINLQFDDSALSVIAKKAAVRKTGAR
HHHHHHCCHHHHHHEEECCHHHHHHHHHHHHHHCCCCEEECHHHHHHHHHHHHHHHHHHH
GLRAILEALLLDLMFESPGSSDVNQVVISKEMVEELMVSSHLFLKH
HHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCC
>Mature Secondary Structure 
ADNEKNSCSCSFCGKIHSEVRKLIAGPSVFICNECIDLCSGILQEESRSYKKTDTLKLK
CCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEC
PKEIKKVLDEYVIGQEHSKKVLSVAVYNHYKRLSNLSVISEVEISKSNVLLIGPTGSGKT
HHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCHHHHHEEECCCCEEEEECCCCCHH
LLARTLARVLQVPFAMADATTLTEAGYVGEDVENILLKLLQAANFNVDAAQRGIIYIDEV
HHHHHHHHHHHCCHHHHCHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCHHCCCEEEEECH
DKISRKSENTSITRDVSGEGVQQALLKVIEGTVSSVPPQGGRKHPHQEFIQINTDNILFI
HHHHHCCCCCCEECCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHEEECCCCEEEE
FGGAFDGLDKIIESRHRGSSMGFEANVQKVSKNKDIFCYTEPEDLVKFGLIPEFVGRIPV
ECCCHHHHHHHHHHHCCCCCCCCHHHHHHHCCCCCEEEEECHHHHHHHCCCHHHHCCCCH
ITSLGELDESTLCRILVEPKNSLVKQYKKLFEMDNINLQFDDSALSVIAKKAAVRKTGAR
HHHHHHCCHHHHHHEEECCHHHHHHHHHHHHHHCCCCEEECHHHHHHHHHHHHHHHHHHH
GLRAILEALLLDLMFESPGSSDVNQVVISKEMVEELMVSSHLFLKH
HHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCC
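
The prediction strings above pair each residue with a single state letter. Assuming the common three-state convention (H for helix, E for strand, C for coil), which the record does not spell out, the composition of a prediction string could be summarized as in this sketch; the function name is hypothetical and a toy string stands in for the full prediction.

```python
from collections import Counter

def state_composition(states: str) -> dict:
    """Percentage of each secondary-structure state letter in a prediction string."""
    if not states:
        raise ValueError("empty prediction string")
    counts = Counter(states)
    return {state: round(100.0 * n / len(states), 1) for state, n in counts.items()}

# Toy input; pass the concatenated state lines to summarize the real prediction.
print(state_composition("HHHHCCEE"))
```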

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA