Definition Chloroflexus sp. Y-400-fl chromosome, complete genome.
Accession NC_012032
Length 5,268,950

The map label for this gene is fadA [H]

Identifier: 222526705

GI number: 222526705

Start: 4334335

End: 4335513

Strand: Reverse

Name: fadA [H]

Synonym: Chy400_3476

Alternate gene names: 222526705

Gene position: 4335513-4334335 (Counterclockwise)

Preceding gene: 222526706

Following gene: 222526704

Centisome position: 82.28
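
The coordinate fields above are related by simple arithmetic. The following is a minimal sketch, assuming that coordinates are 1-based and inclusive and that the centisome position is the gene's starting coordinate expressed as a percentage of the chromosome length. For this reverse-strand gene the starting coordinate is taken to be the higher one (4335513); that choice is an assumption, made because it reproduces the listed value. The function names are illustrative only.

```python
GENOME_LENGTH = 5_268_950   # "Length" field of the chromosome record
START, END = 4_334_335, 4_335_513  # "Start" and "End" fields


def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the chromosome length, to 2 decimals."""
    return round(100.0 * position / genome_length, 2)


if __name__ == "__main__":
    print(gene_length(START, END))        # matches the 1179-base gene sequence
    # Assumption: the reverse-strand gene is located by its higher coordinate.
    print(centisome(END, GENOME_LENGTH))
```

Using the lower coordinate (4334335) instead would give 82.26, so the listed 82.28 appears to follow the gene's 5' end on the reverse strand.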

GC content: 60.98
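
GC content is conventionally the percentage of G and C bases in a nucleotide sequence. A minimal sketch of that calculation follows; the function name is hypothetical, and whether the record's value was computed in exactly this way (for example, how ambiguous bases were treated) is an assumption.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence, to 2 decimals."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return round(100.0 * gc / len(seq), 2)


if __name__ == "__main__":
    # First ten bases of the gene sequence in this record.
    print(gc_content("ATGCGTGAGG"))
```

Applied to the full 1179-base gene sequence with line breaks removed, this would be expected to give a value comparable to the GC content field.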

Gene sequence:

>1179_bases
ATGCGTGAGGCAGTGATTGTTTCTGGAGTACGCACGGCAGTCGGCAAAGCCGGTCGAGGTGCACTGCGTACCGTTCGCCC
TGACGATCTGGCAGCAATCGTCGTGAAAGCGGCTATCGAACGGGCCGGCATTGATCCGGCATTGGTGGAAGACGTGATTA
TGGGCTGTGCAATGCCCGAAGGCGAACAGGGCCTGAACGTTGCCCGGATCGCCGCCCAACGCGCCGGCCTCCCCGACAGC
GTGTGTGGCGTTACCGTCAACCGGTTCTGCGCTTCGGGATTGCAGACGATTGCGATGGCGGCCTACCAGGTTATGTCAGG
GCAAAGTGATGTCGTGGTGGCCGGCGGTACCGAAAGCATGAGCATGGTACCGATGAGTGGCAACAAGTTCTCGCCGAACC
CCTATCTGGCCCAGCACGATCCGGCAGTGTACATGAGCATGGGCTTAACTGCCGAGCAGGTAGCGCGGCGCTTTGAAATT
GATCGTGAAGAGCAGGATGCTTTCGCCTTGCGGTCCCACCAGCGCGCACTGGCAGCACAGGCTGCCGGACTGTTCGACCG
GAGCATTGTGCCGGTAGAAGTGGAGCTAGTCGAGCCGGGAGCAGACGGTCGCCCACAGCGCCGTGTCATGGTCTTCGACC
GCGATGAGGGACCACGTGCCGACACCTCAGCCGAGGCCCTGGCGAAACTGAAGCCAGTCTTTGCCGCCGAAGGCACCGTC
ACCGCCGGCAACTCATCCCAGATGAGCGATGGTGCGGCTGCTGTTGTCGTGATGAGTGCTGAACGTGCGGCAGCGCTAGG
CTTGAAACCACGGGCACGCTTCGTGAGCTTCGCAGTCGGCGGCGTGGAGCCAGAAGTTATGGGGATTGGCCCGGTCGTTG
CCATACCCAAAGCGCTGAAGCTGGCCGGCCTGACCCTTGCCGACATTGATCTGATCGAACTCAATGAAGCCTTCGCCGCC
CAATCGATTGCCGTCATTCGTCAGCTCGATCTTGACGAAGAGCGGGTGAATGTCAACGGCGGTGCAATTGCGTTGGGACA
TCCGCTGGGATGCACCGGTGCCAAATTGACCGTCCAGATCCTCGATGAACTCGAGCGGCGCGGCGGTCGTTACGGCATGG
TGACGATGTGCATCGGGGGAGGTATGGGTGCCGCAGGCATTTTCGAGCGGATTAGCTAA

Upstream 100 bases:

>100_bases
CCGAGGCAACGAGATATGGGCCTGTTCCTTCGTAACTGAAACAGAAGTAAGACACATCCCCGGTTCTAATGGCACTCACG
ATGAACAAAGGAGTATCACA

Downstream 100 bases:

>100_bases
GGAACAACGCCTGTTGCAACCGGTGTGAACAACAACAGGGCGGGGGCAATGCCTCCGCCCTGTTGGTTGCTGGCCTTTTG
ATCACTGGCTACCTGCGACA

Product: acetyl-CoA acetyltransferase

Products: NA

Alternate protein names: Acetyl-CoA acyltransferase; Beta-ketothiolase [H]

Number of amino acids: Translated: 392; Mature: 392
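
The residue count follows from the gene length: 1179 bases are 393 codons, and the final codon (TAA) is a stop, leaving 392 amino acids. The sketch below illustrates this with a compact standard codon table. It assumes the standard genetic code applies to every internal codon and does not handle alternative start codons; the names `CODON_TABLE` and `translate` are illustrative only.

```python
BASES = "TCAG"
# Standard genetic code, ordered by first, second, then third base over TCAG.
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    print(translate("ATGCGTGAGGCAGTGATTGTT"))  # first seven codons of the gene
    print(translate("ATTAGCTAA"))              # last three codons of the gene
    print(1179 // 3 - 1)                       # codons minus the stop codon
```

The first call corresponds to the start of the protein sequence (MREAVIV) and the second to its last two residues (IS) followed by the stop codon.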

Protein sequence:

>392_residues
MREAVIVSGVRTAVGKAGRGALRTVRPDDLAAIVVKAAIERAGIDPALVEDVIMGCAMPEGEQGLNVARIAAQRAGLPDS
VCGVTVNRFCASGLQTIAMAAYQVMSGQSDVVVAGGTESMSMVPMSGNKFSPNPYLAQHDPAVYMSMGLTAEQVARRFEI
DREEQDAFALRSHQRALAAQAAGLFDRSIVPVEVELVEPGADGRPQRRVMVFDRDEGPRADTSAEALAKLKPVFAAEGTV
TAGNSSQMSDGAAAVVVMSAERAAALGLKPRARFVSFAVGGVEPEVMGIGPVVAIPKALKLAGLTLADIDLIELNEAFAA
QSIAVIRQLDLDEERVNVNGGAIALGHPLGCTGAKLTVQILDELERRGGRYGMVTMCIGGGMGAAGIFERIS

Sequences:

>Translated_392_residues
MREAVIVSGVRTAVGKAGRGALRTVRPDDLAAIVVKAAIERAGIDPALVEDVIMGCAMPEGEQGLNVARIAAQRAGLPDS
VCGVTVNRFCASGLQTIAMAAYQVMSGQSDVVVAGGTESMSMVPMSGNKFSPNPYLAQHDPAVYMSMGLTAEQVARRFEI
DREEQDAFALRSHQRALAAQAAGLFDRSIVPVEVELVEPGADGRPQRRVMVFDRDEGPRADTSAEALAKLKPVFAAEGTV
TAGNSSQMSDGAAAVVVMSAERAAALGLKPRARFVSFAVGGVEPEVMGIGPVVAIPKALKLAGLTLADIDLIELNEAFAA
QSIAVIRQLDLDEERVNVNGGAIALGHPLGCTGAKLTVQILDELERRGGRYGMVTMCIGGGMGAAGIFERIS
>Mature_392_residues
MREAVIVSGVRTAVGKAGRGALRTVRPDDLAAIVVKAAIERAGIDPALVEDVIMGCAMPEGEQGLNVARIAAQRAGLPDS
VCGVTVNRFCASGLQTIAMAAYQVMSGQSDVVVAGGTESMSMVPMSGNKFSPNPYLAQHDPAVYMSMGLTAEQVARRFEI
DREEQDAFALRSHQRALAAQAAGLFDRSIVPVEVELVEPGADGRPQRRVMVFDRDEGPRADTSAEALAKLKPVFAAEGTV
TAGNSSQMSDGAAAVVVMSAERAAALGLKPRARFVSFAVGGVEPEVMGIGPVVAIPKALKLAGLTLADIDLIELNEAFAA
QSIAVIRQLDLDEERVNVNGGAIALGHPLGCTGAKLTVQILDELERRGGRYGMVTMCIGGGMGAAGIFERIS

Specific function: Involved in the degradation of long-chain fatty acids [H]

COG id: COG0183

COG function: function code I; Acetyl-CoA acetyltransferase

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the thiolase family [H]

Homologues:

Organism=Homo sapiens, GI4501853, Length=390, Percent_Identity=45.6410256410256, Blast_Score=314, Evalue=9e-86
Organism=Homo sapiens, GI167614485, Length=401, Percent_Identity=40.8977556109726, Blast_Score=296, Evalue=2e-80
Organism=Homo sapiens, GI148539872, Length=399, Percent_Identity=42.3558897243108, Blast_Score=280, Evalue=2e-75
Organism=Homo sapiens, GI4557237, Length=403, Percent_Identity=40.6947890818859, Blast_Score=254, Evalue=9e-68
Organism=Homo sapiens, GI4504327, Length=437, Percent_Identity=35.9267734553776, Blast_Score=221, Evalue=9e-58
Organism=Homo sapiens, GI194353979, Length=241, Percent_Identity=35.2697095435685, Blast_Score=132, Evalue=5e-31
Organism=Escherichia coli, GI1787663, Length=412, Percent_Identity=47.0873786407767, Blast_Score=325, Evalue=3e-90
Organism=Escherichia coli, GI1788554, Length=406, Percent_Identity=47.7832512315271, Blast_Score=324, Evalue=6e-90
Organism=Escherichia coli, GI48994986, Length=397, Percent_Identity=46.8513853904282, Blast_Score=318, Evalue=3e-88
Organism=Escherichia coli, GI87082165, Length=402, Percent_Identity=44.0298507462687, Blast_Score=303, Evalue=2e-83
Organism=Escherichia coli, GI1788683, Length=421, Percent_Identity=33.729216152019, Blast_Score=196, Evalue=3e-51
Organism=Caenorhabditis elegans, GI133906874, Length=398, Percent_Identity=42.2110552763819, Blast_Score=299, Evalue=2e-81
Organism=Caenorhabditis elegans, GI17535921, Length=411, Percent_Identity=35.2798053527981, Blast_Score=222, Evalue=3e-58
Organism=Caenorhabditis elegans, GI25147385, Length=401, Percent_Identity=32.6683291770574, Blast_Score=218, Evalue=5e-57
Organism=Caenorhabditis elegans, GI17551802, Length=436, Percent_Identity=30.2752293577982, Blast_Score=181, Evalue=6e-46
Organism=Caenorhabditis elegans, GI17535917, Length=398, Percent_Identity=28.643216080402, Blast_Score=157, Evalue=1e-38
Organism=Saccharomyces cerevisiae, GI6322031, Length=400, Percent_Identity=40.5, Blast_Score=259, Evalue=7e-70
Organism=Saccharomyces cerevisiae, GI6325229, Length=409, Percent_Identity=36.1858190709046, Blast_Score=225, Evalue=9e-60
Organism=Drosophila melanogaster, GI24655093, Length=403, Percent_Identity=43.424317617866, Blast_Score=314, Evalue=5e-86
Organism=Drosophila melanogaster, GI17648125, Length=399, Percent_Identity=38.3458646616541, Blast_Score=251, Evalue=5e-67
Organism=Drosophila melanogaster, GI24640423, Length=404, Percent_Identity=39.3564356435644, Blast_Score=247, Evalue=1e-65
Organism=Drosophila melanogaster, GI17137578, Length=434, Percent_Identity=34.7926267281106, Blast_Score=210, Evalue=1e-54
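
Each homologue entry is a comma-separated list of key=value fields, with the GI number given as a bare GI-prefixed token. A minimal parsing sketch follows, assuming that organism names contain no commas and tolerating an optional trailing comma; `parse_homologue` is a hypothetical helper, not part of any database tooling.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue entry into a dictionary with typed values."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        key, sep, value = field.partition("=")
        if sep:
            record[key] = value
        elif field.startswith("GI"):
            # The GI number has no "=": it appears as e.g. "GI4501853".
            record["GI"] = field[2:]
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


if __name__ == "__main__":
    entry = ("Organism=Homo sapiens, GI4501853, Length=390, "
             "Percent_Identity=45.6410256410256, Blast_Score=314, Evalue=9e-86")
    hit = parse_homologue(entry)
    print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 2))
```

Parsed this way, the entries can be sorted by Blast_Score or filtered by Evalue without further string handling.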

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR002155
- InterPro:   IPR016039
- InterPro:   IPR016038
- InterPro:   IPR020615
- InterPro:   IPR020610
- InterPro:   IPR020617
- InterPro:   IPR020613
- InterPro:   IPR020616 [H]

Pfam domain/function: PF02803 Thiolase_C; PF00108 Thiolase_N [H]

EC number: 2.3.1.16 [H]

Molecular weight: Translated: 40869; Mature: 40869

Theoretical pI: Translated: 4.93; Mature: 4.93

Prosite motif: PS00098 THIOLASE_1; PS00737 THIOLASE_2; PS00099 THIOLASE_3

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.3 %Cys     (Translated Protein)
4.3 %Met     (Translated Protein)
5.6 %Cys+Met (Translated Protein)
1.3 %Cys     (Mature Protein)
4.3 %Met     (Mature Protein)
5.6 %Cys+Met (Mature Protein)
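
These percentages are residue counts divided by the protein length. A minimal sketch, assuming one-letter amino acid codes and rounding to one decimal place as in the figures above; `residue_percent` is an illustrative name.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    if not protein:
        raise ValueError("empty protein sequence")
    count = sum(protein.count(residue) for residue in residues)
    return round(100.0 * count / len(protein), 1)


if __name__ == "__main__":
    toy = "MCAAAAAAAA"  # toy 10-residue sequence, not from this record
    print(residue_percent(toy, "C"))
    print(residue_percent(toy, "M"))
    print(residue_percent(toy, "CM"))
```

For the 392-residue protein in this record, five cysteines correspond to 100 × 5 / 392, which rounds to the listed 1.3 %.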

Secondary structure:

>Translated Secondary Structure
MREAVIVSGVRTAVGKAGRGALRTVRPDDLAAIVVKAAIERAGIDPALVEDVIMGCAMPE
CCCEEEECHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCC
GEQGLNVARIAAQRAGLPDSVCGVTVNRFCASGLQTIAMAAYQVMSGQSDVVVAGGTESM
CCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCE
SMVPMSGNKFSPNPYLAQHDPAVYMSMGLTAEQVARRFEIDREEQDAFALRSHQRALAAQ
EEEECCCCCCCCCCCCCCCCCEEEEECCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHH
AAGLFDRSIVPVEVELVEPGADGRPQRRVMVFDRDEGPRADTSAEALAKLKPVFAAEGTV
HHHHHCCCCCEEEEEEECCCCCCCCCCEEEEEECCCCCCCCCHHHHHHHHCHHEECCCEE
TAGNSSQMSDGAAAVVVMSAERAAALGLKPRARFVSFAVGGVEPEVMGIGPVVAIPKALK
ECCCCCCCCCCCEEEEEEECCHHHHCCCCCHHHEEEEEECCCCCCEEECCCHHHHCHHHH
LAGLTLADIDLIELNEAFAAQSIAVIRQLDLDEERVNVNGGAIALGHPLGCTGAKLTVQI
HCCCEEECEEEEECCHHHHHHHHHHHHHCCCCHHHEECCCCEEEECCCCCCCCCHHHHHH
LDELERRGGRYGMVTMCIGGGMGAAGIFERIS
HHHHHHCCCCEEEEEEEECCCCCHHHHHHHCC
>Mature Secondary Structure
MREAVIVSGVRTAVGKAGRGALRTVRPDDLAAIVVKAAIERAGIDPALVEDVIMGCAMPE
CCCEEEECHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCC
GEQGLNVARIAAQRAGLPDSVCGVTVNRFCASGLQTIAMAAYQVMSGQSDVVVAGGTESM
CCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCE
SMVPMSGNKFSPNPYLAQHDPAVYMSMGLTAEQVARRFEIDREEQDAFALRSHQRALAAQ
EEEECCCCCCCCCCCCCCCCCEEEEECCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHH
AAGLFDRSIVPVEVELVEPGADGRPQRRVMVFDRDEGPRADTSAEALAKLKPVFAAEGTV
HHHHHCCCCCEEEEEEECCCCCCCCCCEEEEEECCCCCCCCCHHHHHHHHCHHEECCCEE
TAGNSSQMSDGAAAVVVMSAERAAALGLKPRARFVSFAVGGVEPEVMGIGPVVAIPKALK
ECCCCCCCCCCCEEEEEEECCHHHHCCCCCHHHEEEEEECCCCCCEEECCCHHHHCHHHH
LAGLTLADIDLIELNEAFAAQSIAVIRQLDLDEERVNVNGGAIALGHPLGCTGAKLTVQI
HCCCEEECEEEEECCHHHHHHHHHHHHHCCCCHHHEECCCCEEEECCCCCCCCCHHHHHH
LDELERRGGRYGMVTMCIGGGMGAAGIFERIS
HHHHHHCCCCEEEEEEEECCCCCHHHHHHHCC
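
The state strings above pair one predicted state with each residue. The sketch below summarises such a string as percentages, assuming the conventional three-state reading (H = helix, E = strand, C = coil); the record itself does not define the letters, so that mapping is an assumption, and `ss_composition` is a hypothetical helper.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of residues in each predicted state (H, E, C)."""
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {
        state: round(100.0 * counts.get(state, 0) / len(states), 1)
        for state in "HEC"
    }


if __name__ == "__main__":
    # First ten predicted states from the record (residues MREAVIVSGV).
    print(ss_composition("CCCEEEECHH"))
```

Concatenating the state lines (every second line of the block) before calling the function gives the composition for the whole protein.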

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9384377 [H]