| Definition | Kosmotoga olearia TBF 19.5.1, complete genome. |
|---|---|
| Accession | NC_012785 |
| Length | 2,302,126 |
The map label for this gene is thlA [H]
Identifier: 239616998
GI number: 239616998
Start: 645969
End: 647174
Strand: Reverse
Name: thlA [H]
Synonym: Kole_0598
Alternate gene names: 239616998
Gene position: 647174-645969 (Counterclockwise)
Preceding gene: 239616999
Following gene: 239616997
Centisome position: 28.11
GC content: 45.11
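The coordinate-derived fields above can be checked with a little arithmetic. The sketch below assumes inclusive 1-based coordinates, a centisome position defined as the gene's 5' coordinate divided by the genome length (for this reverse-strand gene that is the End value), and a GC content expressed as a percentage of G and C bases. The record does not document these conventions, so treat the function names and definitions as illustrative only.

```python
GENOME_LENGTH = 2_302_126       # "Length" field of the record
START, END = 645_969, 647_174   # "Start" and "End" fields of the record


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming inclusive 1-based coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the genome length (assumed definition)."""
    return 100.0 * position / genome_length


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper().replace(" ", "")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


print(gene_length(START, END))                   # 1206
print(round(centisome(END, GENOME_LENGTH), 2))   # 28.11
```

Under these assumptions the gene length agrees with the 1206 bases listed for the gene sequence, and the centisome value agrees with the 28.11 given above. `gc_percent` can be applied to the gene sequence to compare against the listed GC content.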
Gene sequence:
>1206_bases GTGAAAAGAAGAGTTTTTATTGTTGGAGCTAAAAGAACGGCTATTGGTGTATTCGGTGGAAGCCTTAAGAGTATTTCTGC ACCAAAACTTGGTTCTATAGCGATTAAGGCGGCTATCGACCAGGCCGGTGTTGAACCTTCAGATATTAATGAAGTTATCG TTGGTAATGTTCTTATGGCAGGTCAGGGTATGGGTCCGGCCAGACAGTCTTCTATTTATGCGGGAATTCCTGTGGAAGTT CCAGCTTACACAGTTCATATGGTTTGTGGGAGCGGTATGAAGTCTATCATTATTGGTGCTAAAGATATCGCTTATGGTGA GGCCGATCTTGTGGTAGCAGGCGGTATGGAAAACATGTCTCAGGCACCGTATCTGGTCGATTATAAGGCGAGGTTTGGAG CTAAATTTGGCGATATGAAGATGACTGACCACATGGTGTTTGACGGTCTTACTGATATTTTTAACCAGGTTCACATGGGT ATAACCGCCGAAGAAATCGCTTCCCGGTTCGAAATATCTCGTGAAGAACAGGATGAATATGCCCTTGAAAGCCAGAATCG CGCTCGTGCGGCAATTGCTGCAGGGAAATTCAAAGACGAGATAGTACCCGTTGAGGTTGTTGAGAAAAAACAGACCAGGA TTTTTGATACCGACGAAGGTCCCAGAGAAACCAGCATTGAAAAACTTGCAAGGTTAAGACCTGCTTTCAAGAAAGATGGA ACAGTAACCGCTGGAAATTCCTCTACCATTAACGATGGAGCCAGTGCAGTTATTCTCGCAAGCGAGGATTACGTAAAAGC TCACGGTTTGAAACCTCTGGCGGAAGTTATTGCCTGGGGACAGGGTGGAGTCGATCCGATGGTTATGGGGCTTGGGCCGG TTCCCGCAACCGATAATGCCCTCAAATATGCAGGTCTAAAGTTCACAGATATCGATCTTATCGAGGCAAATGAAGCTTTT GCTGTGCAAACACTCGGGGTTATCCGTAAATGGAATGAAATGTATGGAGTCAGCAAAGATTATGTAATTGAAAGAGCAAA CGTCAATGGAGGTGCCATAGCTCTTGGACACCCGATAGGTTGTAGTGGAAACAGAATAGTTGTGACCTTGCTTTATGAAA TGATGAAACGCGGAGTTGAACTCGGCCTTGCCACGCTGTGTATTGGTGGTGGAATGGGAACAGCCATTATTATAAAGAGA ATCTGA
Upstream 100 bases:
>100_bases TTCTCATTCCGTAAACCTGATTTTGTCATGTACAAAAACGCCCTGATAGAATAAATTGAATAAAACTTTTCAAGTATCTA TAACAAATGGAGGGGTACAC
Downstream 100 bases:
>100_bases GTTAGTCCGTTGTTCCTGATAAGGATCTAAGTAGTCCTAACCATGACTTGGAATCCGAAGGGAGGATCTAAGGTGAAGGT CATCAAAAGTAAGGAAGTGG
Product: acetyl-CoA acetyltransferase
Products: NA
Alternate protein names: Acetoacetyl-CoA thiolase [H]
Number of amino acids: Translated: 401; Mature: 401
Protein sequence:
>401_residues MKRRVFIVGAKRTAIGVFGGSLKSISAPKLGSIAIKAAIDQAGVEPSDINEVIVGNVLMAGQGMGPARQSSIYAGIPVEV PAYTVHMVCGSGMKSIIIGAKDIAYGEADLVVAGGMENMSQAPYLVDYKARFGAKFGDMKMTDHMVFDGLTDIFNQVHMG ITAEEIASRFEISREEQDEYALESQNRARAAIAAGKFKDEIVPVEVVEKKQTRIFDTDEGPRETSIEKLARLRPAFKKDG TVTAGNSSTINDGASAVILASEDYVKAHGLKPLAEVIAWGQGGVDPMVMGLGPVPATDNALKYAGLKFTDIDLIEANEAF AVQTLGVIRKWNEMYGVSKDYVIERANVNGGAIALGHPIGCSGNRIVVTLLYEMMKRGVELGLATLCIGGGMGTAIIIKR I
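The relationship between the 1206-base gene sequence and the 401-residue protein can be sketched as a plain codon-by-codon translation. The example assumes the standard genetic code and the usual bacterial convention that an initiating GTG or TTG codon is read as methionine. The function name and the `bacterial_start` flag are hypothetical, not part of the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str, bacterial_start: bool = True) -> str:
    """Translate a coding sequence up to (not including) the first stop codon."""
    cds = cds.upper().replace(" ", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    # Alternative start codons are decoded as Met at the initiation site.
    if bacterial_start and protein and cds[:3] in ("ATG", "GTG", "TTG"):
        protein[0] = "M"
    return "".join(protein)


# First ten codons of the gene sequence listed in the record.
print(translate("GTGAAAAGAAGAGTTTTTATTGTTGGAGCT"))  # MKRRVFIVGA
```

The gene spans 1206 / 3 = 402 codons. The last one (TGA) is a stop codon, which leaves the 401 residues reported under "Number of amino acids".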
Sequences:
>Translated_401_residues MKRRVFIVGAKRTAIGVFGGSLKSISAPKLGSIAIKAAIDQAGVEPSDINEVIVGNVLMAGQGMGPARQSSIYAGIPVEV PAYTVHMVCGSGMKSIIIGAKDIAYGEADLVVAGGMENMSQAPYLVDYKARFGAKFGDMKMTDHMVFDGLTDIFNQVHMG ITAEEIASRFEISREEQDEYALESQNRARAAIAAGKFKDEIVPVEVVEKKQTRIFDTDEGPRETSIEKLARLRPAFKKDG TVTAGNSSTINDGASAVILASEDYVKAHGLKPLAEVIAWGQGGVDPMVMGLGPVPATDNALKYAGLKFTDIDLIEANEAF AVQTLGVIRKWNEMYGVSKDYVIERANVNGGAIALGHPIGCSGNRIVVTLLYEMMKRGVELGLATLCIGGGMGTAIIIKR I >Mature_401_residues MKRRVFIVGAKRTAIGVFGGSLKSISAPKLGSIAIKAAIDQAGVEPSDINEVIVGNVLMAGQGMGPARQSSIYAGIPVEV PAYTVHMVCGSGMKSIIIGAKDIAYGEADLVVAGGMENMSQAPYLVDYKARFGAKFGDMKMTDHMVFDGLTDIFNQVHMG ITAEEIASRFEISREEQDEYALESQNRARAAIAAGKFKDEIVPVEVVEKKQTRIFDTDEGPRETSIEKLARLRPAFKKDG TVTAGNSSTINDGASAVILASEDYVKAHGLKPLAEVIAWGQGGVDPMVMGLGPVPATDNALKYAGLKFTDIDLIEANEAF AVQTLGVIRKWNEMYGVSKDYVIERANVNGGAIALGHPIGCSGNRIVVTLLYEMMKRGVELGLATLCIGGGMGTAIIIKR I
Specific function: Unknown
COG id: COG0183
COG function: function code I; Acetyl-CoA acetyltransferase
Gene ontology:
Cell location: Cytoplasm [H]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the thiolase family [H]
Homologues:
| Organism | GI | Length | Percent identity | BLAST score | E-value |
|---|---|---|---|---|---|
| Homo sapiens | GI148539872 | 399 | 47.3684210526316 | 370 | 1e-102 |
| Homo sapiens | GI167614485 | 400 | 49 | 361 | 1e-100 |
| Homo sapiens | GI4557237 | 401 | 42.643391521197 | 317 | 9e-87 |
| Homo sapiens | GI4501853 | 403 | 38.9578163771712 | 236 | 3e-62 |
| Homo sapiens | GI4504327 | 426 | 33.5680751173709 | 192 | 3e-49 |
| Homo sapiens | GI194353979 | 397 | 29.4710327455919 | 137 | 1e-32 |
| Escherichia coli | GI87082165 | 401 | 52.1197007481297 | 394 | 1e-111 |
| Escherichia coli | GI1788554 | 400 | 54 | 393 | 1e-110 |
| Escherichia coli | GI1787663 | 406 | 39.9014778325123 | 277 | 1e-75 |
| Escherichia coli | GI48994986 | 409 | 39.3643031784841 | 243 | 1e-65 |
| Escherichia coli | GI1788683 | 409 | 32.2738386308068 | 170 | 1e-43 |
| Caenorhabditis elegans | GI133906874 | 398 | 44.9748743718593 | 343 | 9e-95 |
| Caenorhabditis elegans | GI25147385 | 399 | 41.6040100250627 | 310 | 1e-84 |
| Caenorhabditis elegans | GI17535921 | 401 | 41.3965087281795 | 285 | 3e-77 |
| Caenorhabditis elegans | GI17535917 | 409 | 32.7628361858191 | 201 | 7e-52 |
| Caenorhabditis elegans | GI17551802 | 422 | 30.8056872037915 | 187 | 7e-48 |
| Caenorhabditis elegans | GI17537653 | 390 | 23.8461538461538 | 67 | 2e-11 |
| Saccharomyces cerevisiae | GI6325229 | 407 | 43.4889434889435 | 303 | 4e-83 |
| Saccharomyces cerevisiae | GI6322031 | 404 | 37.1287128712871 | 212 | 7e-56 |
| Drosophila melanogaster | GI24655093 | 398 | 48.9949748743719 | 384 | 1e-107 |
| Drosophila melanogaster | GI24640423 | 399 | 43.1077694235589 | 325 | 2e-89 |
| Drosophila melanogaster | GI17648125 | 402 | 44.0298507462687 | 314 | 6e-86 |
| Drosophila melanogaster | GI17137578 | 427 | 31.3817330210773 | 181 | 1e-45 |
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR002155
- InterPro: IPR016039
- InterPro: IPR016038
- InterPro: IPR020615
- InterPro: IPR020610
- InterPro: IPR020617
- InterPro: IPR020613
- InterPro: IPR020616 [H]
Pfam domain/function: PF02803 Thiolase_C; PF00108 Thiolase_N [H]
EC number: 2.3.1.9 [H]
Molecular weight: Translated: 42705; Mature: 42705
Theoretical pI: Translated: 6.08; Mature: 6.08
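A molecular weight like the one above is typically estimated by summing average residue masses and adding one water molecule for the free termini. The sketch below uses commonly tabulated average masses. The record does not say which table or method was used, so the result for the 401-residue sequence may differ slightly from the listed 42705, and the function name is an assumption.

```python
# Average residue masses in daltons (amino acid minus one water),
# as commonly tabulated; treat the exact digits as an assumption.
AVG_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water added back for the N- and C-terminal groups


def molecular_weight(protein: str) -> float:
    """Approximate average molecular weight of an unmodified peptide chain."""
    protein = protein.upper().replace(" ", "")
    return sum(AVG_RESIDUE_MASS[aa] for aa in protein) + WATER


# Small sanity check on a dipeptide rather than the full protein.
print(round(molecular_weight("GG"), 2))  # 132.12
```

Passing the protein sequence from the record to `molecular_weight` gives a value that can be compared with the listed figure. Since the translated and mature sequences are identical here, both forms yield the same result.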
Prosite motif: PS00737 THIOLASE_2 ; PS00099 THIOLASE_3
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.7 %Cys (Translated Protein)
4.2 %Met (Translated Protein)
5.0 %Cys+Met (Translated Protein)
0.7 %Cys (Mature Protein)
4.2 %Met (Mature Protein)
5.0 %Cys+Met (Mature Protein)
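The Cys/Met percentages are simple residue frequencies. A minimal sketch, assuming the values are counts divided by the sequence length and reported to one decimal place (the helper name is hypothetical):

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given one-letter residues."""
    protein = protein.upper().replace(" ", "")
    hits = sum(protein.count(r) for r in residues.upper())
    return 100.0 * hits / len(protein)


# Toy input: the first ten residues of the protein listed in the record.
print(residue_percent("MKRRVFIVGA", "M"))   # 10.0
print(residue_percent("MKRRVFIVGA", "CM"))  # 10.0
```

The listed figures would be consistent with 3 Cys and 17 Met among 401 residues: 100 × 3 / 401 ≈ 0.7, 100 × 17 / 401 ≈ 4.2, and 100 × 20 / 401 ≈ 5.0.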
Secondary structure:
>Translated Secondary Structure
MKRRVFIVGAKRTAIGVFGGSLKSISAPKLGSIAIKAAIDQAGVEPSDINEVIVGNVLMA
CCCEEEEEECCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHCCCCCCCCCHHHCCCEEEE
GQGMGPARQSSIYAGIPVEVPAYTVHMVCGSGMKSIIIGAKDIAYGEADLVVAGGMENMS
CCCCCCCHHCCEEECCCCCCCCEEEEEECCCCCHHEEEECHHHCCCCCCEEEECCCCCCC
QAPYLVDYKARFGAKFGDMKMTDHMVFDGLTDIFNQVHMGITAEEIASRFEISREEQDEY
CCCEEEEEHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCHHHHH
ALESQNRARAAIAAGKFKDEIVPVEVVEKKQTRIFDTDEGPRETSIEKLARLRPAFKKDG
HHHCCCCHHHHHECCCCCCCCCCHHHHCCCCCEEECCCCCCCHHHHHHHHHHCCCCCCCC
TVTAGNSSTINDGASAVILASEDYVKAHGLKPLAEVIAWGQGGVDPMVMGLGPVPATDNA
CEECCCCCCCCCCCCEEEEECCCHHHHCCCCHHHHHHHCCCCCCCCEEEECCCCCCCCCC
LKYAGLKFTDIDLIEANEAFAVQTLGVIRKWNEMYGVSKDYVIERANVNGGAIALGHPIG
CEECCCEEEEEEEEECCCCHHHHHHHHHHHHHHHHCCCHHHEEEECCCCCCEEEECCCCC
CSGNRIVVTLLYEMMKRGVELGLATLCIGGGMGTAIIIKRI
CCCCEEHHHHHHHHHHHHHHHCEEHEEECCCCCEEEEEEEC
>Mature Secondary Structure
MKRRVFIVGAKRTAIGVFGGSLKSISAPKLGSIAIKAAIDQAGVEPSDINEVIVGNVLMA
CCCEEEEEECCCEEEEEECCCCCCCCCCCCCCEEEEEHHHHCCCCCCCCCHHHCCCEEEE
GQGMGPARQSSIYAGIPVEVPAYTVHMVCGSGMKSIIIGAKDIAYGEADLVVAGGMENMS
CCCCCCCHHCCEEECCCCCCCCEEEEEECCCCCHHEEEECHHHCCCCCCEEEECCCCCCC
QAPYLVDYKARFGAKFGDMKMTDHMVFDGLTDIFNQVHMGITAEEIASRFEISREEQDEY
CCCEEEEEHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCHHHHH
ALESQNRARAAIAAGKFKDEIVPVEVVEKKQTRIFDTDEGPRETSIEKLARLRPAFKKDG
HHHCCCCHHHHHECCCCCCCCCCHHHHCCCCCEEECCCCCCCHHHHHHHHHHCCCCCCCC
TVTAGNSSTINDGASAVILASEDYVKAHGLKPLAEVIAWGQGGVDPMVMGLGPVPATDNA
CEECCCCCCCCCCCCEEEEECCCHHHHCCCCHHHHHHHCCCCCCCCEEEECCCCCCCCCC
LKYAGLKFTDIDLIEANEAFAVQTLGVIRKWNEMYGVSKDYVIERANVNGGAIALGHPIG
CEECCCEEEEEEEEECCCCHHHHHHHHHHHHHHHHCCCHHHEEEECCCCCCEEEECCCCC
CSGNRIVVTLLYEMMKRGVELGLATLCIGGGMGTAIIIKRI
CCCCEEHHHHHHHHHHHHHHHCEEHEEECCCCCEEEEEEEC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 7867955; 11075929; 11466286; 1685080 [H]