Definition Sulfolobus solfataricus P2 chromosome, complete genome.
Accession NC_002754
Length 2,992,245

The map label for this gene is acaB-4 [H]

Identifier: 15899132

GI number: 15899132

Start: 2166513

End: 2167706

Strand: Reverse

Name: acaB-4 [H]

Synonym: SSO2377

Alternate gene names: 15899132

Gene position: 2167706-2166513 (Counterclockwise)

Preceding gene: 15899133

Following gene: 15899131

Centisome position: 72.44
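
The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes that gene length is End - Start + 1, and that the centisome position is the first coding base (2167706, since the gene is on the reverse strand) expressed as a percentage of the genome length. This record does not state the centisome formula; that definition is an assumption, chosen because it reproduces the listed value. The function names are illustrative only.

```python
# Values copied from the record; the formulas are assumptions, not the
# database's documented method.
GENOME_LENGTH = 2_992_245
START = 2_166_513
END = 2_167_706


def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive coordinate range."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of genome length, rounded to 2 decimals."""
    return round(100.0 * position / genome_length, 2)


length = gene_length(START, END)
codons = length // 3  # includes the stop codon

print(length, codons, centisome(END, GENOME_LENGTH))
```

Under these assumptions the gene spans 1194 bases, i.e. 398 codons: 397 amino acids plus the stop codon.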

GC content: 38.36
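
A GC figure of this kind is usually the percentage of G and C bases in the sequence. A minimal sketch, assuming the value is computed over the gene sequence itself and rounded to two decimals (the record does not say which region or rounding was used); `gc_content` is a hypothetical helper name.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G+C bases, ignoring whitespace and line breaks."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# Usage with a short fragment (the first nine bases of the gene):
print(gc_content("ATGCCCGAA"))
```

If the listed value was computed over the 1194-base gene, 38.36 corresponds to 458 G or C bases.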

Gene sequence:

>1194_bases
ATGCCCGAAAGTGTATATATTGCATCGGCTGTTAGAACGCCTATTGGAAAGTTTGGCGGTGCTTTAAGAAACCTCTCACC
AGTAGATTTAGGTTCAATAGTAATTAGAGAAGCGTTAAGAAGAGCTAATGTGGAACCTGGGAAGATAGATATGGCAATTA
TGGGGAACGTCCTAAGAGCTGGGCATGGCCAAGATATAGCCAGACAGTGCGCAATTAGTGCTGGGATTCCATTTGAGATA
GACGGATTTTCTGTGGATATGGTTTGCTCTTCAGGGATGATAAGCGTAATTACTGCTTCACAGATGATCAAATCTGGAGA
TGCTGATATAATAGTTGCGGGTGGAACGGAAAATATGAGCCAAGCAATGTTTGCCATAAAATCTGATATTAGATGGGGTG
TGAAAATGCTAATGAATAGAAATATTGAGCTCATTGATACTATGTTATACGATGGATTAACAGATCCCTTCCAGTATAAA
GTAATGGGACAAGAGGCTGATATGGTAGCAAAATCTCATAACATTTCAAGGAAGGAGTTGGATGAAGTTGCTTATCAAAG
TCATCTAAGAGCTCATAAGGCTACGGTTAATGGATACTTTAAGTCAGAAATTGTCGAAATTAAAGCTGATGGAAAGGTAA
TTAATACTGATGAGGGAATAAGAGCCGATACCAGTTTAGATAAGCTCTCTAGTTTACCTCCAGCATTTACTGATGATGGT
CTACACACTGCTGGAAATTCGTCTCAAATATCTGATGGTGCCGCAGCGTTGGTACTAGTGAGCGAAAAAGCTGCTAAGGA
ACTTAAAATAGAACCTATAGCCCGAATTCTAGGATATAGTTGGGTTGGTATAGAAAGTTGGAGGTTTACTGAAGCACCAA
TTTTTGCTATTAAAAAATTATTAAGCAAATTAGATACCGATATAAACCATTTCGATTATTTTGAGAATAATGAAGCATTT
GCCGTAAATAATGTTCTAATAAATAGATACCTAGGAATACCTTATGATAGACTTAACGTATTTGGAGGGGCTATAGCGTT
AGGTCACCCAATAGGTGCAAGTGGTGCTAGAATTATAGTAACTCTCTTGAACGTGTTATCTAAAATGCATGGAACAAGAG
GAATAGCCAGTATATGTCATGGAATTGGAGGATCAACTGCGATTGCGATCGAACTCTTGAAAGAGATGAAGTAA

Upstream 100 bases:

>100_bases
GAAAATGGTATGATGTTAAAATAACTGAAGCATCATTCTATGATTTGCGTGGGATTCTTGCTTAAAAAATTTAAACCCTA
TAGAGGAATGTATATTCAGT

Downstream 100 bases:

>100_bases
TTTTTTATGCATATATGAGAATAGACTATTGCAGGTGAATAAATAAGTTGCCTAAGAAAGATAGAGCGCAGGAAGCACCT
AGTAGAGATGTGCCAAGACC

Product: Acetyl-CoA C-acetyltransferase (acetoacetyl-CoA thiolase) (acaB-4)

Products: NA

Alternate protein names: Acetoacetyl-CoA thiolase [H]

Number of amino acids: Translated: 397; Mature: 396

Protein sequence:

>397_residues
MPESVYIASAVRTPIGKFGGALRNLSPVDLGSIVIREALRRANVEPGKIDMAIMGNVLRAGHGQDIARQCAISAGIPFEI
DGFSVDMVCSSGMISVITASQMIKSGDADIIVAGGTENMSQAMFAIKSDIRWGVKMLMNRNIELIDTMLYDGLTDPFQYK
VMGQEADMVAKSHNISRKELDEVAYQSHLRAHKATVNGYFKSEIVEIKADGKVINTDEGIRADTSLDKLSSLPPAFTDDG
LHTAGNSSQISDGAAALVLVSEKAAKELKIEPIARILGYSWVGIESWRFTEAPIFAIKKLLSKLDTDINHFDYFENNEAF
AVNNVLINRYLGIPYDRLNVFGGAIALGHPIGASGARIIVTLLNVLSKMHGTRGIASICHGIGGSTAIAIELLKEMK
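
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below assumes the standard genetic code and that the gene sequence is already given in coding orientation (it starts with ATG and ends with TAA). The "mature" form is assumed to be the translated form with the initiator Met removed, which matches the 397/396 residue counts. `translate` and `CODON_TABLE` are illustrative names, not part of any database tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 27 bases of the gene sequence in this record.
translated = translate("ATGCCCGAAAGTGTATATATTGCATCG")
mature = translated[1:]  # assumption: initiator Met removed
print(translated, mature)
```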

Sequences:

>Translated_397_residues
MPESVYIASAVRTPIGKFGGALRNLSPVDLGSIVIREALRRANVEPGKIDMAIMGNVLRAGHGQDIARQCAISAGIPFEI
DGFSVDMVCSSGMISVITASQMIKSGDADIIVAGGTENMSQAMFAIKSDIRWGVKMLMNRNIELIDTMLYDGLTDPFQYK
VMGQEADMVAKSHNISRKELDEVAYQSHLRAHKATVNGYFKSEIVEIKADGKVINTDEGIRADTSLDKLSSLPPAFTDDG
LHTAGNSSQISDGAAALVLVSEKAAKELKIEPIARILGYSWVGIESWRFTEAPIFAIKKLLSKLDTDINHFDYFENNEAF
AVNNVLINRYLGIPYDRLNVFGGAIALGHPIGASGARIIVTLLNVLSKMHGTRGIASICHGIGGSTAIAIELLKEMK
>Mature_396_residues
PESVYIASAVRTPIGKFGGALRNLSPVDLGSIVIREALRRANVEPGKIDMAIMGNVLRAGHGQDIARQCAISAGIPFEID
GFSVDMVCSSGMISVITASQMIKSGDADIIVAGGTENMSQAMFAIKSDIRWGVKMLMNRNIELIDTMLYDGLTDPFQYKV
MGQEADMVAKSHNISRKELDEVAYQSHLRAHKATVNGYFKSEIVEIKADGKVINTDEGIRADTSLDKLSSLPPAFTDDGL
HTAGNSSQISDGAAALVLVSEKAAKELKIEPIARILGYSWVGIESWRFTEAPIFAIKKLLSKLDTDINHFDYFENNEAFA
VNNVLINRYLGIPYDRLNVFGGAIALGHPIGASGARIIVTLLNVLSKMHGTRGIASICHGIGGSTAIAIELLKEMK

Specific function: Unknown

COG id: COG0183

COG function: function code I; Acetyl-CoA acetyltransferase

Gene ontology:

Cell location: Cytoplasm [H]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the thiolase family [H]

Homologues:

Organism=Homo sapiens, GI148539872, Length=399, Percent_Identity=42.6065162907268, Blast_Score=285, Evalue=6e-77,
Organism=Homo sapiens, GI167614485, Length=391, Percent_Identity=37.0843989769821, Blast_Score=251, Evalue=9e-67,
Organism=Homo sapiens, GI4557237, Length=404, Percent_Identity=37.6237623762376, Blast_Score=236, Evalue=3e-62,
Organism=Homo sapiens, GI4501853, Length=399, Percent_Identity=34.5864661654135, Blast_Score=197, Evalue=2e-50,
Organism=Homo sapiens, GI4504327, Length=425, Percent_Identity=30.3529411764706, Blast_Score=175, Evalue=8e-44,
Organism=Homo sapiens, GI194353979, Length=392, Percent_Identity=25.5102040816327, Blast_Score=107, Evalue=3e-23,
Organism=Escherichia coli, GI87082165, Length=393, Percent_Identity=40.2035623409669, Blast_Score=280, Evalue=1e-76,
Organism=Escherichia coli, GI1788554, Length=395, Percent_Identity=41.2658227848101, Blast_Score=262, Evalue=3e-71,
Organism=Escherichia coli, GI1787663, Length=400, Percent_Identity=34.5, Blast_Score=224, Evalue=6e-60,
Organism=Escherichia coli, GI48994986, Length=401, Percent_Identity=36.6583541147132, Blast_Score=212, Evalue=4e-56,
Organism=Escherichia coli, GI1788683, Length=410, Percent_Identity=27.0731707317073, Blast_Score=157, Evalue=1e-39,
Organism=Caenorhabditis elegans, GI133906874, Length=393, Percent_Identity=41.4758269720102, Blast_Score=278, Evalue=4e-75,
Organism=Caenorhabditis elegans, GI25147385, Length=393, Percent_Identity=35.6234096692112, Blast_Score=241, Evalue=4e-64,
Organism=Caenorhabditis elegans, GI17535921, Length=398, Percent_Identity=36.9346733668342, Blast_Score=228, Evalue=4e-60,
Organism=Caenorhabditis elegans, GI17551802, Length=423, Percent_Identity=30.7328605200946, Blast_Score=180, Evalue=1e-45,
Organism=Caenorhabditis elegans, GI17535917, Length=400, Percent_Identity=30.5, Blast_Score=167, Evalue=7e-42,
Organism=Saccharomyces cerevisiae, GI6325229, Length=402, Percent_Identity=39.8009950248756, Blast_Score=266, Evalue=6e-72,
Organism=Saccharomyces cerevisiae, GI6322031, Length=399, Percent_Identity=34.5864661654135, Blast_Score=196, Evalue=5e-51,
Organism=Drosophila melanogaster, GI24655093, Length=394, Percent_Identity=43.9086294416244, Blast_Score=303, Evalue=1e-82,
Organism=Drosophila melanogaster, GI24640423, Length=397, Percent_Identity=38.5390428211587, Blast_Score=234, Evalue=8e-62,
Organism=Drosophila melanogaster, GI17648125, Length=396, Percent_Identity=35.1010101010101, Blast_Score=223, Evalue=2e-58,
Organism=Drosophila melanogaster, GI17137578, Length=425, Percent_Identity=29.1764705882353, Blast_Score=165, Evalue=6e-41,
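
Each homologue line is a comma-separated list of `key=value` fields plus a bare GI identifier. A minimal parsing sketch, assuming every line follows the layout shown above; the `parse_homologue` name and the choice of output types are illustrative.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    converters = {
        "Length": int,
        "Percent_Identity": float,
        "Blast_Score": float,
        "Evalue": float,
    }
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = converters.get(key, str)(value)
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # keep the identifier as text
    return record


hit = parse_homologue(
    "Organism=Homo sapiens, GI148539872, Length=399, "
    "Percent_Identity=42.6065162907268, Blast_Score=285, Evalue=6e-77,"
)
print(hit["Organism"], hit["GI"], hit["Evalue"])
```

Parsed this way, the hits can be sorted or filtered, for example by E-value or percent identity.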

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR002155
- InterPro:   IPR016039
- InterPro:   IPR016038
- InterPro:   IPR020615
- InterPro:   IPR020610
- InterPro:   IPR020617
- InterPro:   IPR020613
- InterPro:   IPR020616 [H]

Pfam domain/function: PF02803 Thiolase_C; PF00108 Thiolase_N [H]

EC number: 2.3.1.9 [H]

Molecular weight: Translated: 42856; Mature: 42725

Theoretical pI: Translated: 6.43; Mature: 6.43

Prosite motif: PS00737 THIOLASE_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.8 %Cys     (Translated Protein)
3.8 %Met     (Translated Protein)
4.5 %Cys+Met (Translated Protein)
0.8 %Cys     (Mature Protein)
3.5 %Met     (Mature Protein)
4.3 %Cys+Met (Mature Protein)
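
These percentages are simple residue counts divided by sequence length. A minimal sketch, assuming one-decimal rounding as in the listing; `residue_percent` is a hypothetical helper name.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    hits = sum(seq.count(r) for r in set(residues.upper()))
    return round(100.0 * hits / len(seq), 1)


# Usage with a toy sequence (not the protein in this record):
toy = "MKCM"
print(residue_percent(toy, "C"), residue_percent(toy, "M"), residue_percent(toy, "CM"))
```

The listed figures are consistent with 3 Cys and 15 Met in the 397-residue translated form, and 3 Cys and 14 Met in the 396-residue mature form.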

Secondary structure:

>Translated Secondary Structure
MPESVYIASAVRTPIGKFGGALRNLSPVDLGSIVIREALRRANVEPGKIDMAIMGNVLRA
CCCCEEEHHHHHCCHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHC
GHGQDIARQCAISAGIPFEIDGFSVDMVCSSGMISVITASQMIKSGDADIIVAGGTENMS
CCCHHHHHHHHHCCCCCEEECCEEEEEECCCCHHHHHHHHHHHHCCCCCEEEECCCCHHH
QAMFAIKSDIRWGVKMLMNRNIELIDTMLYDGLTDPFQYKVMGQEADMVAKSHNISRKEL
HHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCEEEEEECCHHHHHHHHCCCCHHHH
DEVAYQSHLRAHKATVNGYFKSEIVEIKADGKVINTDEGIRADTSLDKLSSLPPAFTDDG
HHHHHHHHHHHHHHHHCCEECCEEEEEEECCEEECCCCCCCCCCCHHHHHCCCCCCCCCC
LHTAGNSSQISDGAAALVLVSEKAAKELKIEPIARILGYSWVGIESWRFTEAPIFAIKKL
CCCCCCCCCCCCCCEEEEEEECHHHHHCCHHHHHHHHCCCEECCCCCCCCCCHHHHHHHH
LSKLDTDINHFDYFENNEAFAVNNVLINRYLGIPYDRLNVFGGAIALGHPIGASGARIIV
HHHHCCCCCHHCCCCCCCEEEEHHHHHHHHCCCCHHHHHHHCCCEEECCCCCCCHHHHHH
TLLNVLSKMHGTRGIASICHGIGGSTAIAIELLKEMK
HHHHHHHHHCCCCHHHHHHHCCCCCHHHHHHHHHHCC
>Mature Secondary Structure
PESVYIASAVRTPIGKFGGALRNLSPVDLGSIVIREALRRANVEPGKIDMAIMGNVLRA
CCCEEEHHHHHCCHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHC
GHGQDIARQCAISAGIPFEIDGFSVDMVCSSGMISVITASQMIKSGDADIIVAGGTENMS
CCCHHHHHHHHHCCCCCEEECCEEEEEECCCCHHHHHHHHHHHHCCCCCEEEECCCCHHH
QAMFAIKSDIRWGVKMLMNRNIELIDTMLYDGLTDPFQYKVMGQEADMVAKSHNISRKEL
HHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCEEEEEECCHHHHHHHHCCCCHHHH
DEVAYQSHLRAHKATVNGYFKSEIVEIKADGKVINTDEGIRADTSLDKLSSLPPAFTDDG
HHHHHHHHHHHHHHHHCCEECCEEEEEEECCEEECCCCCCCCCCCHHHHHCCCCCCCCCC
LHTAGNSSQISDGAAALVLVSEKAAKELKIEPIARILGYSWVGIESWRFTEAPIFAIKKL
CCCCCCCCCCCCCCEEEEEEECHHHHHCCHHHHHHHHCCCEECCCCCCCCCCHHHHHHHH
LSKLDTDINHFDYFENNEAFAVNNVLINRYLGIPYDRLNVFGGAIALGHPIGASGARIIV
HHHHCCCCCHHCCCCCCCEEEEHHHHHHHHCCCCHHHHHHHCCCEEECCCCCCCHHHHHH
TLLNVLSKMHGTRGIASICHGIGGSTAIAIELLKEMK
HHHHHHHHHCCCCHHHHHHHCCCCCHHHHHHHHHHCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 7867955; 11075929; 11466286; 1685080 [H]