Definition | Chloroflexus sp. Y-400-fl chromosome, complete genome. |
---|---|
Accession | NC_012032 |
Length | 5,268,950 |
The map label for this gene is 222523543
Identifier: 222523543
GI number: 222523543
Start: 295152
End: 296318
Strand: Reverse
Name: 222523543
Synonym: Chy400_0249
Alternate gene names: NA
Gene position: 296318-295152 (Counterclockwise)
Preceding gene: 222523544
Following gene: 222523542
Centisome position: 5.62
GC content: 60.67
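The Start, End, Length and Centisome position fields are related by simple arithmetic. The sketch below is a minimal illustration, assuming inclusive 1-based coordinates and assuming the centisome position is taken from the gene's 5' end (the End coordinate for this reverse-strand gene), which is the reading that reproduces the listed 5.62. The function names are illustrative and not part of the database. `gc_percent` shows how a GC content figure of this kind is typically computed; it is demonstrated on a short made-up sequence rather than on the gene itself.

```python
# Values copied from the record above.
GENOME_LENGTH = 5_268_950  # Length
GENE_START = 295_152       # Start
GENE_END = 296_318         # End


def gene_length(start: int, end: int) -> int:
    """Number of bases in an inclusive, 1-based interval."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the chromosome length."""
    return 100.0 * position / genome_length


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


length = gene_length(GENE_START, GENE_END)
# Assumption: the 5' end of a reverse-strand gene is its End coordinate.
position = round(centisome(GENE_END, GENOME_LENGTH), 2)
toy_gc = gc_percent("ATGC")  # hypothetical 4-base example, not the gene

print(length, position, toy_gc)
```

Using the Start coordinate instead would give roughly 5.60, so the choice of end matters at two decimal places.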
Gene sequence:
>1167_bases ATGACCAATGTCTATATTGCAGGGATAGGGGCAACCGCCGTTGGCGAACATTATCGCCGTGGGCTGGCCGATCTGGTGAG TGAGGCAGCGCGGGCGGCACTGGCGAGTGCGCCAGAGATTGCTCCGCATCAGATTGGAGCGCTCTATGTTGGCAGTGCCT TCAGCGAGGAACTCTACGGTCAAAGCCAGGCTGGTGCGTATCTGGCCAGCATCCTGGGGCTGTCACCGTCGATTCCGGCA TATCGAGTTGAGGCGGCTGGTGCCAGTGGGGCGCTGGCGTTGTACCAGGCGGTGCAGGCCGTGCAAAACGGCGTGGCGGT GGCACTCGTGATCGGGGTTGATAAGGTCACCGATCATCTCGAAGATGAGATCGAGGCTGCGCAGGCAATGGCTGCCGACA GCACTGAAGAGGCGTTGCACGGGGTGACATTGACGGCACAATGGGCGATGCTGATGCGCCGGTATATGCACGAGTATGGC TATACCGCCGATGCATTTGCGCCGTTTCCGATCAATGCTCACGCCAACGGTGTGCACAACCCGCTGGCGCTGTACCGCTT TGCGATTGATGCGAATAAATATCGGAAAGCCGCCCAGATCGCTTCACCGTTGAACATGCTCGATTGCAGTACGCTGGCCG ACGGTGCTGCTGCGCTGATCGTTGTGGGTGAGCAGATTGCCCGTGAACTCGACCGCCCACGGATTCGCATTGCCGGTTCG GCTGTTGCCACCGATCATCCGGCGCTCCATCGGCGGCGAAATCCGCTCGACTTGAGTGCAGCGCGGGCGAGTGCCCATAT TGCACTTGGCCGGGCGCATCTGGGTGTTGGTGATGTGCAGGTGTGGGAATTGACCGATCCGCACGGAATTGCCGCCACCC TTGCCTTAGAGGCAATTGGCTGCTACGAACCTGGAACAGCACCGCGCTATGCAGCCGAGGGCGCAATCACCCCAACCGGG AAGACGCCAATTGCTACCTTCGGCGGTTACAAAGCACGCGGTGATGTGGGCGGCGCCAGTGGGGTCTATCAGGTGATCGA ACTGACTCGCCAGCTTAGTGGGCAGGCCGGCCCGACCCAGGTGAGCAATGCCCGGATTGGTCTGAGCCAGTCACTCGGCG GGATCGGGGCGACTGCTGTCAGCCATGTTCTGATTCGCGAATCGTAA
Upstream 100 bases:
>100_bases TGTCGCGGCCTACCTGCAACGGGCAGTGATGATTGATTACGCCATCTATGCGAAGTGGCGCGGTAAGCTGGTGATGGGAT AGTCTATCAGGAGTGCAACA
Downstream 100 bases:
>100_bases GTTCACCCCTTCACAGAAATGGTTGCGTCAGGTACAATAAGGTTGTTTTCCGCCTGTTGTAACCAAGGAGCGCTCCCGTG ATTCAGGAAATGGTCAATGG
Product: acetyl-CoA acetyltransferase-like protein
Products: CoA; 3-oxoacyl-CoA
Alternate protein names: NA
Number of amino acids: Translated: 388; Mature: 387
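The two residue counts follow from the gene length. A minimal sketch, assuming the 1167-base gene sequence includes the stop codon and that the "Mature" count reflects removal of the initiator methionine (consistent with the mature sequence starting at T rather than M); `residue_counts` is a hypothetical helper name.

```python
GENE_LENGTH_BASES = 1167  # from the ">1167_bases" header of the gene sequence


def residue_counts(n_bases: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = n_bases // 3
    translated = codons - 1  # the stop codon encodes no residue
    mature = translated - 1  # assumption: initiator Met is removed
    return translated, mature


translated, mature = residue_counts(GENE_LENGTH_BASES)
print(translated, mature)
```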
Protein sequence:
>388_residues MTNVYIAGIGATAVGEHYRRGLADLVSEAARAALASAPEIAPHQIGALYVGSAFSEELYGQSQAGAYLASILGLSPSIPA YRVEAAGASGALALYQAVQAVQNGVAVALVIGVDKVTDHLEDEIEAAQAMAADSTEEALHGVTLTAQWAMLMRRYMHEYG YTADAFAPFPINAHANGVHNPLALYRFAIDANKYRKAAQIASPLNMLDCSTLADGAAALIVVGEQIARELDRPRIRIAGS AVATDHPALHRRRNPLDLSAARASAHIALGRAHLGVGDVQVWELTDPHGIAATLALEAIGCYEPGTAPRYAAEGAITPTG KTPIATFGGYKARGDVGGASGVYQVIELTRQLSGQAGPTQVSNARIGLSQSLGGIGATAVSHVLIRES
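The protein sequence is the codon-by-codon translation of the gene sequence. The sketch below assumes the standard genetic code for elongation codons and builds the codon table from the conventional TCAG-ordered amino-acid string. `translate` is an illustrative helper, not a database function. It is shown on the first five codons and the last six codons of the gene sequence listed above.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


n_terminus = translate("ATGACCAATGTCTAT")     # first five codons of the gene
c_terminus = translate("CTGATTCGCGAATCGTAA")  # last six codons, ending in TAA
print(n_terminus, c_terminus)
```

The two fragments correspond to the start (MTNVY) and end (LIRES) of the listed protein sequence.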
Sequences:
>Translated_388_residues MTNVYIAGIGATAVGEHYRRGLADLVSEAARAALASAPEIAPHQIGALYVGSAFSEELYGQSQAGAYLASILGLSPSIPA YRVEAAGASGALALYQAVQAVQNGVAVALVIGVDKVTDHLEDEIEAAQAMAADSTEEALHGVTLTAQWAMLMRRYMHEYG YTADAFAPFPINAHANGVHNPLALYRFAIDANKYRKAAQIASPLNMLDCSTLADGAAALIVVGEQIARELDRPRIRIAGS AVATDHPALHRRRNPLDLSAARASAHIALGRAHLGVGDVQVWELTDPHGIAATLALEAIGCYEPGTAPRYAAEGAITPTG KTPIATFGGYKARGDVGGASGVYQVIELTRQLSGQAGPTQVSNARIGLSQSLGGIGATAVSHVLIRES >Mature_387_residues TNVYIAGIGATAVGEHYRRGLADLVSEAARAALASAPEIAPHQIGALYVGSAFSEELYGQSQAGAYLASILGLSPSIPAY RVEAAGASGALALYQAVQAVQNGVAVALVIGVDKVTDHLEDEIEAAQAMAADSTEEALHGVTLTAQWAMLMRRYMHEYGY TADAFAPFPINAHANGVHNPLALYRFAIDANKYRKAAQIASPLNMLDCSTLADGAAALIVVGEQIARELDRPRIRIAGSA VATDHPALHRRRNPLDLSAARASAHIALGRAHLGVGDVQVWELTDPHGIAATLALEAIGCYEPGTAPRYAAEGAITPTGK TPIATFGGYKARGDVGGASGVYQVIELTRQLSGQAGPTQVSNARIGLSQSLGGIGATAVSHVLIRES
Specific function: Unknown
COG id: COG0183
COG function: function code I; Acetyl-CoA acetyltransferase
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
Organism=Homo sapiens, GI19923233, Length=405, Percent_Identity=26.4197530864197, Blast_Score=104, Evalue=1e-22
Organism=Homo sapiens, GI302344760, Length=331, Percent_Identity=26.2839879154079, Blast_Score=89, Evalue=5e-18
Organism=Homo sapiens, GI302344767, Length=325, Percent_Identity=26.1538461538462, Blast_Score=89, Evalue=6e-18
Organism=Homo sapiens, GI302344762, Length=234, Percent_Identity=27.3504273504274, Blast_Score=80, Evalue=2e-15
Organism=Caenorhabditis elegans, GI17537653, Length=390, Percent_Identity=26.9230769230769, Blast_Score=108, Evalue=6e-24
Organism=Drosophila melanogaster, GI19921506, Length=397, Percent_Identity=27.455919395466, Blast_Score=103, Evalue=2e-22
Organism=Drosophila melanogaster, GI24585051, Length=397, Percent_Identity=27.7078085642317, Blast_Score=103, Evalue=2e-22
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR002155
- InterPro: IPR016039
- InterPro: IPR016038
- InterPro: IPR020617
- InterPro: IPR020616 [H]
Pfam domain/function: PF02803 Thiolase_C; PF00108 Thiolase_N [H]
EC number: 2.3.1.16
Molecular weight: Translated: 40274; Mature: 40143
Theoretical pI: Translated: 6.32; Mature: 6.32
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.5 %Cys (Translated Protein)
1.5 %Met (Translated Protein)
2.1 %Cys+Met (Translated Protein)
0.5 %Cys (Mature Protein)
1.3 %Met (Mature Protein)
1.8 %Cys+Met (Mature Protein)
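The Cys/Met percentages are residue counts divided by protein length, each rounded to one decimal place. That is why the combined translated figure (2.1) is not the sum of the rounded parts (0.5 + 1.5). A minimal sketch, assuming one-decimal rounding; `cys_met_content` is an illustrative name. To keep the block short it uses a stand-in sequence with the same composition as the translated protein listed above (2 Cys and 6 Met in 388 residues, by manual count) rather than the sequence itself.

```python
def cys_met_content(protein: str) -> tuple[float, float, float]:
    """Return (%Cys, %Met, %Cys+Met), each rounded to one decimal place."""
    n = len(protein)
    cys = protein.count("C")
    met = protein.count("M")
    return (
        round(100.0 * cys / n, 1),
        round(100.0 * met / n, 1),
        round(100.0 * (cys + met) / n, 1),
    )


# Stand-in sequences (assumption): same Cys/Met counts and lengths as the
# translated and mature proteins, with alanine as filler for everything else.
translated_like = "M" * 6 + "C" * 2 + "A" * 380  # 388 residues
mature_like = translated_like[1:]                # initiator Met removed

translated_pct = cys_met_content(translated_like)
mature_pct = cys_met_content(mature_like)
print(translated_pct, mature_pct)
```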
Secondary structure:
>Translated Secondary Structure
MTNVYIAGIGATAVGEHYRRGLADLVSEAARAALASAPEIAPHQIGALYVGSAFSEELYG
CCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHCCEEEECCHHHHHHCC
QSQAGAYLASILGLSPSIPAYRVEAAGASGALALYQAVQAVQNGVAVALVIGVDKVTDHL
CCHHHHHHHHHHCCCCCCCCEEEECCCCCCHHHHHHHHHHHHCCCEEEEEEEHHHHHHHH
EDEIEAAQAMAADSTEEALHGVTLTAQWAMLMRRYMHEYGYTADAFAPFPINAHANGVHN
HHHHHHHHHHHCCCHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCC
PLALYRFAIDANKYRKAAQIASPLNMLDCSTLADGAAALIVVGEQIARELDRPRIRIAGS
HHHHHHHHHCHHHHHHHHHHHCCHHHHHHHHHCCCCEEEEEECHHHHHHCCCCEEEEECC
AVATDHPALHRRRNPLDLSAARASAHIALGRAHLGVGDVQVWELTDPHGIAATLALEAIG
CCCCCCHHHHHCCCCCCHHHHHHHHHEEECHHHCCCCCEEEEEECCCCCHHHHHHHHHHC
CYEPGTAPRYAAEGAITPTGKTPIATFGGYKARGDVGGASGVYQVIELTRQLSGQAGPTQ
CCCCCCCCCCCCCCCCCCCCCCCCEECCCEECCCCCCCCHHHHHHHHHHHHHCCCCCCCC
VSNARIGLSQSLGGIGATAVSHVLIRES
CCCCCCCHHHCCCCCHHHHHHHHHHCCC
>Mature Secondary Structure
TNVYIAGIGATAVGEHYRRGLADLVSEAARAALASAPEIAPHQIGALYVGSAFSEELYG
CEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHCCEEEECCHHHHHHCC
QSQAGAYLASILGLSPSIPAYRVEAAGASGALALYQAVQAVQNGVAVALVIGVDKVTDHL
CCHHHHHHHHHHCCCCCCCCEEEECCCCCCHHHHHHHHHHHHCCCEEEEEEEHHHHHHHH
EDEIEAAQAMAADSTEEALHGVTLTAQWAMLMRRYMHEYGYTADAFAPFPINAHANGVHN
HHHHHHHHHHHCCCHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCC
PLALYRFAIDANKYRKAAQIASPLNMLDCSTLADGAAALIVVGEQIARELDRPRIRIAGS
HHHHHHHHHCHHHHHHHHHHHCCHHHHHHHHHCCCCEEEEEECHHHHHHCCCCEEEEECC
AVATDHPALHRRRNPLDLSAARASAHIALGRAHLGVGDVQVWELTDPHGIAATLALEAIG
CCCCCCHHHHHCCCCCCHHHHHHHHHEEECHHHCCCCCEEEEEECCCCCHHHHHHHHHHC
CYEPGTAPRYAAEGAITPTGKTPIATFGGYKARGDVGGASGVYQVIELTRQLSGQAGPTQ
CCCCCCCCCCCCCCCCCCCCCCCCEECCCEECCCCCCCCHHHHHHHHHHHHHCCCCCCCC
VSNARIGLSQSLGGIGATAVSHVLIRES
CCCCCCCHHHCCCCCHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: acyl-CoA; acetyl-CoA
Specific reaction: acyl-CoA + acetyl-CoA = CoA + 3-oxoacyl-CoA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9371463 [H]