Definition Chlamydia muridarum Nigg, complete genome.
Accession NC_002620
Length 1,072,950

Click here to switch to the map view.

The map label for this gene is Not Available

Identifier: 15835238

GI number: 15835238

Start: 746174

End: 748633

Strand: Direct

Name: Not Available

Synonym: TC0623

Alternate gene names: 15835238

Gene position: 746174-748633 (Clockwise)

Preceding gene: 15835232

Following gene: 15835239

Centisome position: 69.54

GC content: 41.79

Gene sequence:

>2460_bases
GTGAACTCGACGAATAATACAGACTCTCAAAATCTGGATCCGAATGCTTCAGAAGTCGAAAAACTTCTTGATGAATCCGC
TGAGGCAGAAGAGAAGACAGATGACCATACTCCCCCCTCAGAACTTTTTATTCTTCCCCTGAATAAACGTCCTTTCTTCC
CAGGCATGGCAGCCCCACTGCTAATAGAAGCAGGTCCTCATTATGAAGTGCTTACTCTGCTTGCTAAATCTTCCCAAAAA
CATATCGGTCTCGTTCTAACTAAAAAAGAAGATGCCAACACCTTAAAAATTGGGTTTAATCAACTTCATCGAGTCGGCGT
ATCTGCGCGCATTTTACGCATTATGCCTATCGAAGGAGGCAGCGCACAGGTATTATTGAGTATAGAGGACCGTATTCGGA
TCGTTAAACCCGTACAAGACAAATATCTAAAAGCCAAGGTCGCATACCATAAAGAAAATAAAGAACTGACTGAAGAGCTA
AAGGCTTACTCTATTAGCATCGTATCGATTATTAAGGACCTTTTGAAGCTCAACCCACTATTCAAAGAAGAGCTTCAGAT
TTTCTTGGGGCACTCCGATTTTACTGAACCTGGTAAGTTAGCAGATTTTTCAGTTGCTCTAACTACAGCAACACGAGAAG
AGCTTCAAGAAGTTTTAGAAACGACTGATATGCACGATCGCATAGACAAGGCTCTTGTTTTACTTAAAAAAGAACTTGAT
CTAAGTCGACTACAAAGCAGCATTAATCAAAAAATTGAAGCAACCATCACAAAAAGTCAAAAAGAGTTCTTCTTGAAAGA
ACAACTAAAGACTATAAAAAAAGAATTAGGTCTTGAAAAAGACGACCATGCTGTTGATCTCGAAAAATTTATGGAACGTT
TAAACAAACGAGATGTTCCTCAGTATGCGATGGATGTTATCCAAGATGAAATGGATAAACTACAGACATTGGAAACCTCT
TCAGCGGAATATGCTGTCTGCCGCAACTATCTCGATTGGTTAACCATTGTGCCTTGGGGAATCCAAACCAAGGAATACCA
TGATTTGAAAAAAGCTGAATCTATCTTAAACAAAGACCATTATGGTTTAGAAGATATCAAGCAGCGTATTCTTGAGTTAA
TCAGTGTAGGCAAGTTAGCGAACGGAATGAAAGGAAGCATCATTTGCCTAGTTGGTCCTCCAGGAGTAGGGAAGACTAGT
ATTGGTCGAAGCATTGCAAAAGTTCTTCACCGTAAATTCTTCCGATTTTCTGTAGGAGGAATGCGCGATGAAGCTGAGAT
TAAAGGCCATCGTCGAACCTATATTGGGGCCATGCCAGGAAAGCTTGTGCAAGCTTTGAAACAGAGTGCTATCATGAATC
CGGTCATTATGATTGATGAGGTCGATAAGATCGGTTCCAGCTATCATGGAGACCCTGCTTCTGCGCTTCTTGAAGTATTG
GATCCTGAGCAAAACAAAGATTTCTTAGATCATTATTTGGACGTTCGCGTTGATCTGTCCAACGTGCTTTTCATTTTGAC
AGCAAACGTATTGGATTCCATTCCAGACCCGTTATTGGACCGTATGGAAGTATTACGTTTATCCGGATACATTCTGGAAG
AAAAATTGCAGATAGCTACCAAATACCTTGTTCCGCGAGCAAGGAAAGAAATGGGACTTTCCGCACAAAACGTCTCCTTC
CAGCCTGAAGCTCTCAAACATATGATCAATAACTATGCTAGGGAAGCAGGCGTGCGTACACTTAATGAAAACATTAAGAA
AGTATTGCGAAAAGTGGCTCTAAAGATTGTTCAAAATCAAGAGAAAAATCCTTCTAAGAAATCTCGGTTTACAATTACGC
CAAAAAATCTACAGGATTATCTCGGCAAACCTATTTTTTCCAGTGATCGTTTCTACGAAAAAACTCCAGTAGGAGTCGCT
ACGGGCCTGGCCTGGACTTCTTTAGGAGGAGCCACCCTATACATAGAAAGCGTTCAAGTCCCCTCGTCATCAGGTAAAGC
TGACATGCATCTAACTGGTCAAGCTGGAGACGTCATGAAAGAGTCGTCACAGATAGCCTGGACGTACCTTCACAGCGCTC
TAGAACGTTATGCTCCAGGACGACCATTCTTTGAAAAATCTCAGGTCCATATTCACATTCCTGAGGGAGCTACTCCGAAG
GATGGCCCTTCTGCAGGGATCACTATGGTAACCTCGTTACTTTCTTTACTTTTGGATGTTCCCGTCCTGAATAACCTTGG
CATGACCGGGGAACTCACCCTAACGGGAAGAGTATTAGGCATAGGCGGTATACGAGAGAAACTCATTGCAGCAAGAAGAT
CTAAACTCAATGTTTTGATTTTCCCTGAAGATAATCGCCGAGATTATGACGAACTCCCTGCCTATCTTAAAAAAGGCTTG
AAGGTACACTTCGTTACGCACTATGACGATGTGTTCAAAATAGCTTTCCCTGGGGTCTAA

Upstream 100 bases:

>100_bases
GTTGTGATTTTGTAGTATATCAATAAGAAGAAGTAATATTAAAATGCGCACACAACCCTTGCTCTTCCAAGGATTTTGAA
CTAGTTGCCCAAGGATAATT

Downstream 100 bases:

>100_bases
ACACGGCTCTTAAATGGAGTTCTCTGAGTTATGAAACTCAGAGAACTCTTTCTATCCCTTTTCTCAACAAATTCAAATTT
AGTATAATTTGCCTTCAAAA

Product: Lon family protease

Products: NA

Alternate protein names: ATP-dependent protease La

Number of amino acids: Translated: 819; Mature: 819

Protein sequence:

>819_residues
MNSTNNTDSQNLDPNASEVEKLLDESAEAEEKTDDHTPPSELFILPLNKRPFFPGMAAPLLIEAGPHYEVLTLLAKSSQK
HIGLVLTKKEDANTLKIGFNQLHRVGVSARILRIMPIEGGSAQVLLSIEDRIRIVKPVQDKYLKAKVAYHKENKELTEEL
KAYSISIVSIIKDLLKLNPLFKEELQIFLGHSDFTEPGKLADFSVALTTATREELQEVLETTDMHDRIDKALVLLKKELD
LSRLQSSINQKIEATITKSQKEFFLKEQLKTIKKELGLEKDDHAVDLEKFMERLNKRDVPQYAMDVIQDEMDKLQTLETS
SAEYAVCRNYLDWLTIVPWGIQTKEYHDLKKAESILNKDHYGLEDIKQRILELISVGKLANGMKGSIICLVGPPGVGKTS
IGRSIAKVLHRKFFRFSVGGMRDEAEIKGHRRTYIGAMPGKLVQALKQSAIMNPVIMIDEVDKIGSSYHGDPASALLEVL
DPEQNKDFLDHYLDVRVDLSNVLFILTANVLDSIPDPLLDRMEVLRLSGYILEEKLQIATKYLVPRARKEMGLSAQNVSF
QPEALKHMINNYAREAGVRTLNENIKKVLRKVALKIVQNQEKNPSKKSRFTITPKNLQDYLGKPIFSSDRFYEKTPVGVA
TGLAWTSLGGATLYIESVQVPSSSGKADMHLTGQAGDVMKESSQIAWTYLHSALERYAPGRPFFEKSQVHIHIPEGATPK
DGPSAGITMVTSLLSLLLDVPVLNNLGMTGELTLTGRVLGIGGIREKLIAARRSKLNVLIFPEDNRRDYDELPAYLKKGL
KVHFVTHYDDVFKIAFPGV

Sequences:

>Translated_819_residues
MNSTNNTDSQNLDPNASEVEKLLDESAEAEEKTDDHTPPSELFILPLNKRPFFPGMAAPLLIEAGPHYEVLTLLAKSSQK
HIGLVLTKKEDANTLKIGFNQLHRVGVSARILRIMPIEGGSAQVLLSIEDRIRIVKPVQDKYLKAKVAYHKENKELTEEL
KAYSISIVSIIKDLLKLNPLFKEELQIFLGHSDFTEPGKLADFSVALTTATREELQEVLETTDMHDRIDKALVLLKKELD
LSRLQSSINQKIEATITKSQKEFFLKEQLKTIKKELGLEKDDHAVDLEKFMERLNKRDVPQYAMDVIQDEMDKLQTLETS
SAEYAVCRNYLDWLTIVPWGIQTKEYHDLKKAESILNKDHYGLEDIKQRILELISVGKLANGMKGSIICLVGPPGVGKTS
IGRSIAKVLHRKFFRFSVGGMRDEAEIKGHRRTYIGAMPGKLVQALKQSAIMNPVIMIDEVDKIGSSYHGDPASALLEVL
DPEQNKDFLDHYLDVRVDLSNVLFILTANVLDSIPDPLLDRMEVLRLSGYILEEKLQIATKYLVPRARKEMGLSAQNVSF
QPEALKHMINNYAREAGVRTLNENIKKVLRKVALKIVQNQEKNPSKKSRFTITPKNLQDYLGKPIFSSDRFYEKTPVGVA
TGLAWTSLGGATLYIESVQVPSSSGKADMHLTGQAGDVMKESSQIAWTYLHSALERYAPGRPFFEKSQVHIHIPEGATPK
DGPSAGITMVTSLLSLLLDVPVLNNLGMTGELTLTGRVLGIGGIREKLIAARRSKLNVLIFPEDNRRDYDELPAYLKKGL
KVHFVTHYDDVFKIAFPGV
>Mature_819_residues
MNSTNNTDSQNLDPNASEVEKLLDESAEAEEKTDDHTPPSELFILPLNKRPFFPGMAAPLLIEAGPHYEVLTLLAKSSQK
HIGLVLTKKEDANTLKIGFNQLHRVGVSARILRIMPIEGGSAQVLLSIEDRIRIVKPVQDKYLKAKVAYHKENKELTEEL
KAYSISIVSIIKDLLKLNPLFKEELQIFLGHSDFTEPGKLADFSVALTTATREELQEVLETTDMHDRIDKALVLLKKELD
LSRLQSSINQKIEATITKSQKEFFLKEQLKTIKKELGLEKDDHAVDLEKFMERLNKRDVPQYAMDVIQDEMDKLQTLETS
SAEYAVCRNYLDWLTIVPWGIQTKEYHDLKKAESILNKDHYGLEDIKQRILELISVGKLANGMKGSIICLVGPPGVGKTS
IGRSIAKVLHRKFFRFSVGGMRDEAEIKGHRRTYIGAMPGKLVQALKQSAIMNPVIMIDEVDKIGSSYHGDPASALLEVL
DPEQNKDFLDHYLDVRVDLSNVLFILTANVLDSIPDPLLDRMEVLRLSGYILEEKLQIATKYLVPRARKEMGLSAQNVSF
QPEALKHMINNYAREAGVRTLNENIKKVLRKVALKIVQNQEKNPSKKSRFTITPKNLQDYLGKPIFSSDRFYEKTPVGVA
TGLAWTSLGGATLYIESVQVPSSSGKADMHLTGQAGDVMKESSQIAWTYLHSALERYAPGRPFFEKSQVHIHIPEGATPK
DGPSAGITMVTSLLSLLLDVPVLNNLGMTGELTLTGRVLGIGGIREKLIAARRSKLNVLIFPEDNRRDYDELPAYLKKGL
KVHFVTHYDDVFKIAFPGV

Specific function: ATP-dependent serine protease that mediates the selective degradation of mutant and abnormal proteins as well as certain short-lived regulatory proteins. Required for cellular homeostasis and for survival from DNA damage and developmental changes induced

COG id: COG0466

COG function: function code O; ATP-dependent Lon protease, bacterial type

Gene ontology:

Cell location: Cytoplasm

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 Lon domain

Homologues:

Organism=Homo sapiens, GI21396489, Length=674, Percent_Identity=52.5222551928783, Blast_Score=733, Evalue=0.0,
Organism=Homo sapiens, GI31377667, Length=843, Percent_Identity=33.3333333333333, Blast_Score=458, Evalue=1e-128,
Organism=Escherichia coli, GI1786643, Length=777, Percent_Identity=40.5405405405405, Blast_Score=570, Evalue=1e-163,
Organism=Caenorhabditis elegans, GI17505831, Length=695, Percent_Identity=47.1942446043166, Blast_Score=650, Evalue=0.0,
Organism=Caenorhabditis elegans, GI17556486, Length=630, Percent_Identity=34.9206349206349, Blast_Score=403, Evalue=1e-112,
Organism=Saccharomyces cerevisiae, GI6319449, Length=717, Percent_Identity=47.1408647140865, Blast_Score=679, Evalue=0.0,
Organism=Drosophila melanogaster, GI221513036, Length=673, Percent_Identity=53.0460624071322, Blast_Score=719, Evalue=0.0,
Organism=Drosophila melanogaster, GI24666867, Length=673, Percent_Identity=53.0460624071322, Blast_Score=719, Evalue=0.0,

Paralogues:

None

Copy number: 2,000 Molecules/Cell In: Glucose minimal media [C]

Swissprot (AC and ID): LON_CHLMU (Q9PK50)

Other databases:

- EMBL:   AE002160
- PIR:   E81681
- RefSeq:   NP_296997.1
- ProteinModelPortal:   Q9PK50
- MEROPS:   S16.002
- GeneID:   1245983
- GenomeReviews:   AE002160_GR
- KEGG:   cmu:TC0623
- TIGR:   TC_0623
- HOGENOM:   HBG566281
- OMA:   LPEPNRG
- PhylomeDB:   Q9PK50
- ProtClustDB:   CLSK2459232
- BioCyc:   CMUR243161:TC_0623-MONOMER
- BRENDA:   3.4.21.53
- GO:   GO:0005737
- GO:   GO:0006508
- InterPro:   IPR003593
- InterPro:   IPR003959
- InterPro:   IPR008269
- InterPro:   IPR004815
- InterPro:   IPR003111
- InterPro:   IPR008268
- InterPro:   IPR001984
- InterPro:   IPR015947
- InterPro:   IPR020568
- PRINTS:   PR00830
- SMART:   SM00382
- SMART:   SM00464
- TIGRFAMs:   TIGR00763

Pfam domain/function: PF00004 AAA; PF02190 LON; PF05362 Lon_C; SSF88697 PUA-like; SSF54211 Ribosomal_S5_D2-typ_fold

EC number: =3.4.21.53

Molecular weight: Translated: 91845; Mature: 91845

Theoretical pI: Translated: 7.63; Mature: 7.63

Prosite motif: PS01046 LON_SER

Important sites: ACT_SITE 724-724 ACT_SITE 767-767

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.2 %Cys     (Translated Protein)
2.3 %Met     (Translated Protein)
2.6 %Cys+Met (Translated Protein)
0.2 %Cys     (Mature Protein)
2.3 %Met     (Mature Protein)
2.6 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MNSTNNTDSQNLDPNASEVEKLLDESAEAEEKTDDHTPPSELFILPLNKRPFFPGMAAPL
CCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHCCCCCCCCCCEEEEECCCCCCCCCCCCCE
LIEAGPHYEVLTLLAKSSQKHIGLVLTKKEDANTLKIGFNQLHRVGVSARILRIMPIEGG
EEECCCCHHHHHHHHHCCCCCEEEEEEECCCCCEEEHHHHHHHHHCCCEEEEEEEECCCC
SAQVLLSIEDRIRIVKPVQDKYLKAKVAYHKENKELTEELKAYSISIVSIIKDLLKLNPL
CCEEEEEECCCHHHHCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHCCCH
FKEELQIFLGHSDFTEPGKLADFSVALTTATREELQEVLETTDMHDRIDKALVLLKKELD
HHHHHHEEECCCCCCCCCCCCCEEEEEHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCC
LSRLQSSINQKIEATITKSQKEFFLKEQLKTIKKELGLEKDDHAVDLEKFMERLNKRDVP
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCCCH
QYAMDVIQDEMDKLQTLETSSAEYAVCRNYLDWLTIVPWGIQTKEYHDLKKAESILNKDH
HHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCC
YGLEDIKQRILELISVGKLANGMKGSIICLVGPPGVGKTSIGRSIAKVLHRKFFRFSVGG
CCHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHCCC
MRDEAEIKGHRRTYIGAMPGKLVQALKQSAIMNPVIMIDEVDKIGSSYHGDPASALLEVL
CCCHHHHCCCCEEEEECCCHHHHHHHHHHHHCCCEEEEECHHHHCCCCCCCHHHHHHHHH
DPEQNKDFLDHYLDVRVDLSNVLFILTANVLDSIPDPLLDRMEVLRLSGYILEEKLQIAT
CCCCCHHHHHHHHHHEECHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHH
KYLVPRARKEMGLSAQNVSFQPEALKHMINNYAREAGVRTLNENIKKVLRKVALKIVQNQ
HHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC
EKNPSKKSRFTITPKNLQDYLGKPIFSSDRFYEKTPVGVATGLAWTSLGGATLYIESVQV
CCCCCCCCEEEECHHHHHHHCCCCCCCCCCCCCCCCCCHHHCCHHHCCCCEEEEEEEEEC
PSSSGKADMHLTGQAGDVMKESSQIAWTYLHSALERYAPGRPFFEKSQVHIHIPEGATPK
CCCCCCCCEEEECCCCHHHHHCCHHHHHHHHHHHHHCCCCCCCCCCCEEEEECCCCCCCC
DGPSAGITMVTSLLSLLLDVPVLNNLGMTGELTLTGRVLGIGGIREKLIAARRSKLNVLI
CCCCCHHHHHHHHHHHHHHCHHHHCCCCCEEEEEEEEEEECCHHHHHHHHHHHCCCEEEE
FPEDNRRDYDELPAYLKKGLKVHFVTHYDDVFKIAFPGV
ECCCCCCCHHHHHHHHHCCCEEEEEEECCCEEEEECCCC
>Mature Secondary Structure
MNSTNNTDSQNLDPNASEVEKLLDESAEAEEKTDDHTPPSELFILPLNKRPFFPGMAAPL
CCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHCCCCCCCCCCEEEEECCCCCCCCCCCCCE
LIEAGPHYEVLTLLAKSSQKHIGLVLTKKEDANTLKIGFNQLHRVGVSARILRIMPIEGG
EEECCCCHHHHHHHHHCCCCCEEEEEEECCCCCEEEHHHHHHHHHCCCEEEEEEEECCCC
SAQVLLSIEDRIRIVKPVQDKYLKAKVAYHKENKELTEELKAYSISIVSIIKDLLKLNPL
CCEEEEEECCCHHHHCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHCCCH
FKEELQIFLGHSDFTEPGKLADFSVALTTATREELQEVLETTDMHDRIDKALVLLKKELD
HHHHHHEEECCCCCCCCCCCCCEEEEEHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCC
LSRLQSSINQKIEATITKSQKEFFLKEQLKTIKKELGLEKDDHAVDLEKFMERLNKRDVP
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCCCH
QYAMDVIQDEMDKLQTLETSSAEYAVCRNYLDWLTIVPWGIQTKEYHDLKKAESILNKDH
HHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCC
YGLEDIKQRILELISVGKLANGMKGSIICLVGPPGVGKTSIGRSIAKVLHRKFFRFSVGG
CCHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHCCC
MRDEAEIKGHRRTYIGAMPGKLVQALKQSAIMNPVIMIDEVDKIGSSYHGDPASALLEVL
CCCHHHHCCCCEEEEECCCHHHHHHHHHHHHCCCEEEEECHHHHCCCCCCCHHHHHHHHH
DPEQNKDFLDHYLDVRVDLSNVLFILTANVLDSIPDPLLDRMEVLRLSGYILEEKLQIAT
CCCCCHHHHHHHHHHEECHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHH
KYLVPRARKEMGLSAQNVSFQPEALKHMINNYAREAGVRTLNENIKKVLRKVALKIVQNQ
HHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC
EKNPSKKSRFTITPKNLQDYLGKPIFSSDRFYEKTPVGVATGLAWTSLGGATLYIESVQV
CCCCCCCCEEEECHHHHHHHCCCCCCCCCCCCCCCCCCHHHCCHHHCCCCEEEEEEEEEC
PSSSGKADMHLTGQAGDVMKESSQIAWTYLHSALERYAPGRPFFEKSQVHIHIPEGATPK
CCCCCCCCEEEECCCCHHHHHCCHHHHHHHHHHHHHCCCCCCCCCCCEEEEECCCCCCCC
DGPSAGITMVTSLLSLLLDVPVLNNLGMTGELTLTGRVLGIGGIREKLIAARRSKLNVLI
CCCCCHHHHHHHHHHHHHHCHHHHCCCCCEEEEEEEEEEECCHHHHHHHHHHHCCCEEEE
FPEDNRRDYDELPAYLKKGLKVHFVTHYDDVFKIAFPGV
ECCCCCCCHHHHHHHHHCCCEEEEEEECCCEEEEECCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 10684935