Definition | Clostridium difficile 630 chromosome, complete genome. |
---|---|
Accession | NC_009089 |
Length | 4,290,252 |
The map label for this gene is thlA1
Identifier: 126698643
GI number: 126698643
Start: 1251772
End: 1252947
Strand: Direct
Name: thlA1
Synonym: CD1059
Alternate gene names: 126698643
Gene position: 1251772-1252947 (Clockwise)
Preceding gene: 126698642
Following gene: 126698644
Centisome position: 29.18
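The gene length and centisome position can be rechecked from the coordinates in this record. The sketch below assumes that the centisome is the start coordinate expressed as a percentage of the chromosome length (4,290,252), which reproduces the listed 29.18, and that the gene coordinates are 1-based and inclusive. The function names are illustrative and not part of the database.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the chromosome length (assumed definition)."""
    return 100.0 * start / genome_length


# Values taken from this record.
length = gene_length(1251772, 1252947)
position = centisome(1251772, 4290252)
print(length, round(position, 2))  # 1176 29.18
```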
GC content: 36.05
Gene sequence:
>1176_bases ATGAGAGAAGTAGTAATTGCCAGTGCAGCTAGAACAGCAGTAGGAAGTTTTGGAGGAGCATTTAAATCAGTTTCAGCGGT AGAGTTAGGGGTAACAGCAGCTAAAGAAGCTATAAAAAGAGCTAACATAACTCCAGATATGATAGATGAATCTCTTTTAG GGGGAGTACTTACAGCAGGTCTTGGACAAAATATAGCAAGACAAATAGCATTAGGAGCAGGAATACCAGTAGAAAAACCA GCTATGACTATAAATATAGTTTGTGGTTCTGGATTAAGATCTGTTTCAATGGCATCTCAACTTATAGCATTAGGTGATGC TGATATAATGTTAGTTGGTGGAGCTGAAAACATGAGTATGTCTCCTTATTTAGTACCAAGTGCGAGATATGGTGCAAGAA TGGGTGATGCTGCTTTTGTTGATTCAATGATAAAAGATGGATTATCAGACATATTTAATAACTATCACATGGGTATTACT GCTGAAAACATAGCAGAGCAATGGAATATAACTAGAGAAGAACAAGATGAATTAGCTCTTGCAAGTCAAAATAAAGCTGA AAAAGCTCAAGCTGAAGGAAAATTTGATGAAGAAATAGTTCCTGTTGTTATAAAAGGAAGAAAAGGTGACACTGTAGTAG ATAAAGATGAATATATTAAGCCTGGCACTACAATGGAGAAACTTGCTAAGTTAAGACCTGCATTTAAAAAAGATGGAACA GTTACTGCTGGTAATGCATCAGGAATAAATGATGGTGCTGCTATGCTAGTAGTAATGGCTAAAGAAAAAGCTGAAGAACT AGGAATAGAGCCTCTTGCAACTATAGTTTCTTATGGAACAGCTGGTGTTGACCCTAAAATAATGGGATATGGACCAGTTC CAGCAACTAAAAAAGCTTTAGAAGCTGCTAATATGACTATTGAAGATATAGATTTAGTTGAAGCTAATGAGGCATTTGCT GCCCAATCTGTAGCTGTAATAAGAGACTTAAATATAGATATGAATAAAGTTAATGTTAATGGTGGAGCAATAGCTATAGG ACATCCAATAGGATGCTCAGGAGCAAGAATACTTACTACACTTTTATATGAAATGAAGAGAAGAGATGCTAAAACTGGTC TTGCTACACTTTGTATAGGTGGTGGAATGGGAACTACTTTAATAGTTAAGAGATAG
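The GC content field is the share of G and C bases in the gene sequence. A minimal sketch of that calculation follows; it assumes the input is the bare sequence (without the `>1176_bases` header) and ignores the spaces that separate the sequence chunks. `gc_percent` is a hypothetical helper name.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 20 bases of the gene sequence, as a small worked example.
print(gc_percent("ATGAGAGAAG TAGTAATTGC"))  # 35.0
```

Passing the full 1176-base sequence to the same function gives the gene-wide figure for comparison with the value listed in the record.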
Upstream 100 bases:
>100_bases ATTATAATAAATAAGAATTTGGGAATTAAAAGTTTAAATAAAATTGTTTAAAAAACAATTTCGGATATATGAAAAATCTA ATTTAATGGGGGTAATGAAT
Downstream 100 bases:
>100_bases TTTTAGATTATAATATTTACAAATTAAATATTTAATTATGAATTTAACCTGTAAGAATAACAAATAGTTATTCTTACAGG TTTTTATAGTTAAAAATAAT
Product: acetyl-CoA acetyltransferase
Products: NA
Alternate protein names: Acetoacetyl-CoA thiolase
Number of amino acids: Translated: 391; Mature: 391
Protein sequence:
>391_residues MREVVIASAARTAVGSFGGAFKSVSAVELGVTAAKEAIKRANITPDMIDESLLGGVLTAGLGQNIARQIALGAGIPVEKP AMTINIVCGSGLRSVSMASQLIALGDADIMLVGGAENMSMSPYLVPSARYGARMGDAAFVDSMIKDGLSDIFNNYHMGIT AENIAEQWNITREEQDELALASQNKAEKAQAEGKFDEEIVPVVIKGRKGDTVVDKDEYIKPGTTMEKLAKLRPAFKKDGT VTAGNASGINDGAAMLVVMAKEKAEELGIEPLATIVSYGTAGVDPKIMGYGPVPATKKALEAANMTIEDIDLVEANEAFA AQSVAVIRDLNIDMNKVNVNGGAIAIGHPIGCSGARILTTLLYEMKRRDAKTGLATLCIGGGMGTTLIVKR
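The amino acid count follows from the gene length: 1176 bases are 392 codons, and the final codon (TAG) is a stop, leaving 391 residues. The sketch below translates a DNA string with the standard codon assignments, on the assumption that these also apply to this bacterial gene. `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven and last four codons of the gene sequence in this record.
print(translate("ATGAGAGAAGTAGTAATTGCC"))  # MREVVIA
print(translate("GTTAAGAGATAG"))           # VKR
```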
Sequences:
>Translated_391_residues MREVVIASAARTAVGSFGGAFKSVSAVELGVTAAKEAIKRANITPDMIDESLLGGVLTAGLGQNIARQIALGAGIPVEKP AMTINIVCGSGLRSVSMASQLIALGDADIMLVGGAENMSMSPYLVPSARYGARMGDAAFVDSMIKDGLSDIFNNYHMGIT AENIAEQWNITREEQDELALASQNKAEKAQAEGKFDEEIVPVVIKGRKGDTVVDKDEYIKPGTTMEKLAKLRPAFKKDGT VTAGNASGINDGAAMLVVMAKEKAEELGIEPLATIVSYGTAGVDPKIMGYGPVPATKKALEAANMTIEDIDLVEANEAFA AQSVAVIRDLNIDMNKVNVNGGAIAIGHPIGCSGARILTTLLYEMKRRDAKTGLATLCIGGGMGTTLIVKR
>Mature_391_residues MREVVIASAARTAVGSFGGAFKSVSAVELGVTAAKEAIKRANITPDMIDESLLGGVLTAGLGQNIARQIALGAGIPVEKP AMTINIVCGSGLRSVSMASQLIALGDADIMLVGGAENMSMSPYLVPSARYGARMGDAAFVDSMIKDGLSDIFNNYHMGIT AENIAEQWNITREEQDELALASQNKAEKAQAEGKFDEEIVPVVIKGRKGDTVVDKDEYIKPGTTMEKLAKLRPAFKKDGT VTAGNASGINDGAAMLVVMAKEKAEELGIEPLATIVSYGTAGVDPKIMGYGPVPATKKALEAANMTIEDIDLVEANEAFA AQSVAVIRDLNIDMNKVNVNGGAIAIGHPIGCSGARILTTLLYEMKRRDAKTGLATLCIGGGMGTTLIVKR
Specific function: Short-chain fatty acid metabolism. [C]
COG id: COG0183
COG function: function code I; Acetyl-CoA acetyltransferase
Gene ontology:
Cell location: Cytoplasm
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the thiolase family
Homologues:
Organism | GI | Length | Percent_Identity | Blast_Score | Evalue |
---|---|---|---|---|---|
Homo sapiens | GI148539872 | 390 | 50.7692307692308 | 412 | 1e-115 |
Homo sapiens | GI167614485 | 391 | 43.2225063938619 | 346 | 2e-95 |
Homo sapiens | GI4557237 | 394 | 42.8934010152284 | 302 | 4e-82 |
Homo sapiens | GI4501853 | 397 | 42.0654911838791 | 282 | 3e-76 |
Homo sapiens | GI4504327 | 425 | 32 | 198 | 6e-51 |
Homo sapiens | GI194353979 | 391 | 30.9462915601023 | 156 | 4e-38 |
Escherichia coli | GI1788554 | 392 | 55.8673469387755 | 409 | 1e-115 |
Escherichia coli | GI87082165 | 391 | 50.1278772378517 | 392 | 1e-110 |
Escherichia coli | GI1787663 | 400 | 43 | 319 | 2e-88 |
Escherichia coli | GI48994986 | 404 | 43.5643564356436 | 275 | 3e-75 |
Escherichia coli | GI1788683 | 409 | 33.0073349633252 | 204 | 7e-54 |
Caenorhabditis elegans | GI133906874 | 389 | 46.7866323907455 | 360 | 1e-100 |
Caenorhabditis elegans | GI17535921 | 392 | 42.8571428571429 | 295 | 4e-80 |
Caenorhabditis elegans | GI25147385 | 390 | 40.5128205128205 | 290 | 9e-79 |
Caenorhabditis elegans | GI17551802 | 430 | 31.3953488372093 | 191 | 8e-49 |
Caenorhabditis elegans | GI17535917 | 397 | 30.4785894206549 | 183 | 2e-46 |
Saccharomyces cerevisiae | GI6325229 | 400 | 43.25 | 318 | 7e-88 |
Saccharomyces cerevisiae | GI6322031 | 399 | 39.3483709273183 | 248 | 1e-66 |
Drosophila melanogaster | GI24655093 | 391 | 53.7084398976982 | 431 | 1e-121 |
Drosophila melanogaster | GI17648125 | 391 | 44.2455242966752 | 333 | 1e-91 |
Drosophila melanogaster | GI24640423 | 391 | 40.920716112532 | 290 | 1e-78 |
Drosophila melanogaster | GI17137578 | 425 | 31.7647058823529 | 194 | 8e-50 |
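The long decimals in Percent_Identity appear to be plain ratios of identical positions to the listed Length. For the first Homo sapiens hit, 198 / 390 gives 50.769...%. The sketch below assumes that Length is the alignment length over which identity was computed, which is an inference from the numbers and is not stated in the record. Both function names are illustrative.

```python
def percent_identity(identical: int, aligned_length: int) -> float:
    """Identical positions as a percentage of the aligned length."""
    return 100.0 * identical / aligned_length


def identical_positions(percent: float, aligned_length: int) -> int:
    """Recover the count of identical positions from a reported percentage."""
    return round(percent * aligned_length / 100.0)


# First Homo sapiens hit (GI148539872): Length=390.
matches = identical_positions(50.7692307692308, 390)
print(matches)  # 198
```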
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): THLA_CLOD6 (Q18AR0)
Other databases:
- EMBL: AM180355
- RefSeq: YP_001087540.1
- ProteinModelPortal: Q18AR0
- SMR: Q18AR0
- STRING: Q18AR0
- GeneID: 4915191
- GenomeReviews: AM180355_GR
- KEGG: cdf:CD1059
- NMPDR: fig|1496.1.peg.3819
- eggNOG: COG0183
- HOGENOM: HBG370930
- OMA: AMTINIV
- ProtClustDB: CLSK2534674
- GO: GO:0005737
- InterPro: IPR002155
- InterPro: IPR016039
- InterPro: IPR016038
- InterPro: IPR020615
- InterPro: IPR020617
- InterPro: IPR020613
- InterPro: IPR020616
- Gene3D: G3DSA:3.40.47.10
- PANTHER: PTHR18919
- PIRSF: PIRSF000429
- TIGRFAMs: TIGR01930
Pfam domain/function: PF02803 Thiolase_C; PF00108 Thiolase_N; SSF53901 Thiolase-like
EC number: 2.3.1.9
Molecular weight: Translated: 40861; Mature: 40861
Theoretical pI: Translated: 4.94; Mature: 4.94
Prosite motif: PS00098 THIOLASE_1; PS00737 THIOLASE_2
Important sites: ACT_SITE 88-88; ACT_SITE 348-348; ACT_SITE 378-378
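The annotated active-site positions can be looked up directly in the protein sequence. The sketch below assumes that the positions are 1-based offsets into the 391-residue sequence given in this record. It returns Cys, His and Cys at positions 88, 348 and 378, which would be consistent with the catalytic residues usually described for thiolases. `residues_at` is a hypothetical helper.

```python
# Protein sequence copied from this record (391 residues).
PROTEIN = (
    "MREVVIASAARTAVGSFGGAFKSVSAVELGVTAAKEAIKRANITPDMIDESLLGGVLTAGLGQNIARQIALGAGIPVEKP"
    "AMTINIVCGSGLRSVSMASQLIALGDADIMLVGGAENMSMSPYLVPSARYGARMGDAAFVDSMIKDGLSDIFNNYHMGIT"
    "AENIAEQWNITREEQDELALASQNKAEKAQAEGKFDEEIVPVVIKGRKGDTVVDKDEYIKPGTTMEKLAKLRPAFKKDGT"
    "VTAGNASGINDGAAMLVVMAKEKAEELGIEPLATIVSYGTAGVDPKIMGYGPVPATKKALEAANMTIEDIDLVEANEAFA"
    "AQSVAVIRDLNIDMNKVNVNGGAIAIGHPIGCSGARILTTLLYEMKRRDAKTGLATLCIGGGMGTTLIVKR"
)


def residues_at(sequence: str, positions: list[int]) -> list[str]:
    """Return the residues at the given 1-based positions."""
    return [sequence[pos - 1] for pos in positions]


print(residues_at(PROTEIN, [88, 348, 378]))  # ['C', 'H', 'C']
```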
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.8 %Cys (Translated Protein)
4.6 %Met (Translated Protein)
5.4 %Cys+Met (Translated Protein)
0.8 %Cys (Mature Protein)
4.6 %Met (Mature Protein)
5.4 %Cys+Met (Mature Protein)
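The Cys/Met figures are residue counts divided by the protein length. The sequence in this record contains 3 Cys and 18 Met out of 391 residues. A minimal sketch of the calculation follows, assuming that the database rounds to one decimal place. `residue_percent` is an illustrative name.

```python
# Protein sequence copied from this record (391 residues).
PROTEIN = (
    "MREVVIASAARTAVGSFGGAFKSVSAVELGVTAAKEAIKRANITPDMIDESLLGGVLTAGLGQNIARQIALGAGIPVEKP"
    "AMTINIVCGSGLRSVSMASQLIALGDADIMLVGGAENMSMSPYLVPSARYGARMGDAAFVDSMIKDGLSDIFNNYHMGIT"
    "AENIAEQWNITREEQDELALASQNKAEKAQAEGKFDEEIVPVVIKGRKGDTVVDKDEYIKPGTTMEKLAKLRPAFKKDGT"
    "VTAGNASGINDGAAMLVVMAKEKAEELGIEPLATIVSYGTAGVDPKIMGYGPVPATKKALEAANMTIEDIDLVEANEAFA"
    "AQSVAVIRDLNIDMNKVNVNGGAIAIGHPIGCSGARILTTLLYEMKRRDAKTGLATLCIGGGMGTTLIVKR"
)


def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions holding any of the given residues, to one decimal."""
    count = sum(1 for aa in sequence if aa in residues)
    return round(100.0 * count / len(sequence), 1)


print(residue_percent(PROTEIN, "C"))   # 0.8
print(residue_percent(PROTEIN, "M"))   # 4.6
print(residue_percent(PROTEIN, "CM"))  # 5.4
```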
Secondary structure:
>Translated Secondary Structure
MREVVIASAARTAVGSFGGAFKSVSAVELGVTAAKEAIKRANITPDMIDESLLGGVLTAG
CCCEEEEHHHHHHHHHCCHHHHCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHH
LGQNIARQIALGAGIPVEKPAMTINIVCGSGLRSVSMASQLIALGDADIMLVGGAENMSM
HHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCHHHHHHHHHHHCCCCCEEEEECCCCCCC
SPYLVPSARYGARMGDAAFVDSMIKDGLSDIFNNYHMGITAENIAEQWNITREEQDELAL
CCEECCCHHHCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEHHHHHHHCCCCCCHHHHHHH
ASQNKAEKAQAEGKFDEEIVPVVIKGRKGDTVVDKDEYIKPGTTMEKLAKLRPAFKKDGT
HCCCHHHHHHHCCCCCCCEEEEEEECCCCCEEECCCCCCCCCCHHHHHHHHCCHHHCCCC
VTAGNASGINDGAAMLVVMAKEKAEELGIEPLATIVSYGTAGVDPKIMGYGPVPATKKAL
EEECCCCCCCCCCEEEEEEEHHHHHHCCCHHHHHHHHCCCCCCCCCEEECCCCCHHHHHH
EAANMTIEDIDLVEANEAFAAQSVAVIRDLNIDMNKVNVNGGAIAIGHPIGCSGARILTT
HHHCCEEECCEEEECCHHHHHHHHHHHEECCCCEEEEEECCCEEEECCCCCCCHHHHHHH
LLYEMKRRDAKTGLATLCIGGGMGTTLIVKR
HHHHHHHHHHHCCEEEEEECCCCCCEEEEEC
>Mature Secondary Structure
MREVVIASAARTAVGSFGGAFKSVSAVELGVTAAKEAIKRANITPDMIDESLLGGVLTAG
CCCEEEEHHHHHHHHHCCHHHHCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHH
LGQNIARQIALGAGIPVEKPAMTINIVCGSGLRSVSMASQLIALGDADIMLVGGAENMSM
HHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCHHHHHHHHHHHCCCCCEEEEECCCCCCC
SPYLVPSARYGARMGDAAFVDSMIKDGLSDIFNNYHMGITAENIAEQWNITREEQDELAL
CCEECCCHHHCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEHHHHHHHCCCCCCHHHHHHH
ASQNKAEKAQAEGKFDEEIVPVVIKGRKGDTVVDKDEYIKPGTTMEKLAKLRPAFKKDGT
HCCCHHHHHHHCCCCCCCEEEEEEECCCCCEEECCCCCCCCCCHHHHHHHHCCHHHCCCC
VTAGNASGINDGAAMLVVMAKEKAEELGIEPLATIVSYGTAGVDPKIMGYGPVPATKKAL
EEECCCCCCCCCCEEEEEEEHHHHHHCCCHHHHHHHHCCCCCCCCCEEECCCCCHHHHHH
EAANMTIEDIDLVEANEAFAAQSVAVIRDLNIDMNKVNVNGGAIAIGHPIGCSGARILTT
HHHCCEEECCEEEECCHHHHHHHHHHHEECCCCEEEEEECCCEEEECCCCCCCHHHHHHH
LLYEMKRRDAKTGLATLCIGGGMGTTLIVKR
HHHHHHHHHHHCCEEEEEECCCCCCEEEEEC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA