Definition Chloroflexus sp. Y-400-fl chromosome, complete genome.
Accession NC_012032
Length 5,268,950

The map label for this gene is yfdE [H]

Identifier: 222523485

GI number: 222523485

Start: 224481

End: 225767

Strand: Reverse

Name: yfdE [H]

Synonym: Chy400_0191

Alternate gene names: 222523485

Gene position: 225767-224481 (Counterclockwise)

Preceding gene: 222523489

Following gene: 222523484

Centisome position: 4.28
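
The gene length and centisome position can be cross-checked from the coordinates listed above. The sketch below is illustrative only: the function names are hypothetical, and it assumes the centisome position is the gene's first base (the End coordinate here, because the gene lies on the reverse strand) expressed as a percentage of the chromosome length. Under that assumption the values in this record are reproduced.

```python
def gene_length(start: int, end: int) -> int:
    """Number of bases spanned by inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the chromosome length, to 2 decimals."""
    return round(100.0 * position / genome_length, 2)


# Values copied from this record.
START, END, GENOME_LENGTH = 224481, 225767, 5268950

length_bp = gene_length(START, END)            # 1287
# Reverse strand: the gene begins at END (assumption, see text above).
centisome_pos = centisome(END, GENOME_LENGTH)  # 4.28
```

Using the Start coordinate instead would give 4.26, so the reported 4.28 is consistent with measuring from the 5' end of a reverse-strand gene.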

GC content: 58.51

Gene sequence:

>1287_bases
ATGCCCCCCACAGGAGAAGAACCATCAGGACACGCAGAATCAAAGCCGCCGGCCAGTGATCCGATGAGCACACCGGGCAC
CGGTCAGGAGCAGTTGCCGTTGAGTGGCATTCGGGTCATTGATGTAGGTAATTTTCTGGCCGGCCCGTATGCTGCTTCCA
TCCTGGGTGAATTCGGTGCCGAGGTGCTCAAGATCGAACACCCGCTGGGTGGCGATCCGATGCGTCGTTTCGGCACTGCA
ACTGCGCGCCACGATGCAACACTGGCCTGGCTGAGCGAGGCCCGTAACCGTAAGTCGGTCACGATTGATCTGCGTCAGCA
AGAGGGCGTTGCGCTCTTTCTGAAGCTGGTCGCCAAATCCGACATTCTGATTGAAAACTTTCGCCCCGGTACGATGGAAG
AATGGGGCTTGAGCTGGCCTGTTTTGCAGGCGACGAATCCCGGACTGATTATGCTGCGGGTGTCGGGCTATGGTCAGACC
GGGCCGTACCGTCGGCGTTCGGGGTTTGCCCATATTGCCCACGCTTTCAGCGGCCTCTCGTATCTGGCCGGGTTCCCCGG
CGAAACGCCAGTCTTGCCGGGAACGGCACCGCTCGGCGACTATATCGCCAGTCTGTTCGGGGCGATTGGGATTTTGATCG
CGCTGCGCCACAAAGAGCAGACCGGACGCGGGCAGTTGATCGATGTCGGGATTTACGAAGCGGTCTTCCGGATTCTGGAT
GAGATTGCCCCGGCTTACGGTCTGTTCGGCAAGATTCGTGAACGCGAAGGGGCCGGGAGTTTTATTGCTGTTCCGCATGG
CCATTTCCGCTCGAAGGACGGCAAGTGGGTTGCGATTGCCTGTACCACCGACAAGATGTTTGAACGGCTGGCCGAAGCAA
TGGAGCGCCCGGAACTGGCTTCGCCGGAACTGTACGGCGATCAACGCAAACGGCTGGCAGCACGCGATATTGTGAACCAG
ATCACGATTGAATGGGTCGGTTCGTTGACGCGCGACGAGGTGATGCGGCGTTGTCTGGAGAAGGAAGTTCCCGTTGGCCC
ACTCAACAGCATCGCCGATATGTTCAACGACGAACATTTTCTGGCTCGCGGCAACTTTGCCTGTATCGAAGCCGAGGGTA
TCGGCGAAGTGGTGGTTCCGAACGTGATCCCCAGACTGTCAGAAACACCGGGACGGGTGACCAACCTCGGCCCACCGCTG
GGGAATGCCACGTATGAGGTGTTGCGCGAGCTGCTTGATATTTCTGCCGAAGAGATCAAGCGTCTGCGCAGCCGCAAGAT
TATTTAG
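
The GC content field can be recomputed from the gene sequence above as the percentage of G and C bases. This is a minimal sketch with a hypothetical helper name; it assumes the sequence contains only A, C, G and T, and it ignores line breaks so the wrapped FASTA lines can be pasted in directly.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, rounded to 2 decimals."""
    seq = "".join(seq.split()).upper()  # drop whitespace and line breaks
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return round(100.0 * gc / len(seq), 2)


# Small illustrative input, not the gene itself.
example = gc_content("ATGC")  # 50.0
```

For the 1287-base gene, the reported 58.51 corresponds to about 753 G or C bases.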

Upstream 100 bases:

>100_bases
GCAAGGAGAGGGAAGAGGAATGGCAAAAGCGTCACGCCTGACCAGATCAACCGGTCAGCCAACGGAGGTGTCAGAAGGAC
AGGTCACCGGGACAAGCGAG

Downstream 100 bases:

>100_bases
CCATCCCGATCACAAGTTGCTACCTTGGTCACCACAGAGACACGGAGGACACCGAGGAACGGGTATAACCTGACTTACTG
CCTTGAGGCGTGACGTAGTG

Product: L-carnitine dehydratase/bile acid-inducible protein F

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 428; Mature: 427

Protein sequence:

>428_residues
MPPTGEEPSGHAESKPPASDPMSTPGTGQEQLPLSGIRVIDVGNFLAGPYAASILGEFGAEVLKIEHPLGGDPMRRFGTA
TARHDATLAWLSEARNRKSVTIDLRQQEGVALFLKLVAKSDILIENFRPGTMEEWGLSWPVLQATNPGLIMLRVSGYGQT
GPYRRRSGFAHIAHAFSGLSYLAGFPGETPVLPGTAPLGDYIASLFGAIGILIALRHKEQTGRGQLIDVGIYEAVFRILD
EIAPAYGLFGKIREREGAGSFIAVPHGHFRSKDGKWVAIACTTDKMFERLAEAMERPELASPELYGDQRKRLAARDIVNQ
ITIEWVGSLTRDEVMRRCLEKEVPVGPLNSIADMFNDEHFLARGNFACIEAEGIGEVVVPNVIPRLSETPGRVTNLGPPL
GNATYEVLRELLDISAEEIKRLRSRKII

Sequences:

>Translated_428_residues
MPPTGEEPSGHAESKPPASDPMSTPGTGQEQLPLSGIRVIDVGNFLAGPYAASILGEFGAEVLKIEHPLGGDPMRRFGTA
TARHDATLAWLSEARNRKSVTIDLRQQEGVALFLKLVAKSDILIENFRPGTMEEWGLSWPVLQATNPGLIMLRVSGYGQT
GPYRRRSGFAHIAHAFSGLSYLAGFPGETPVLPGTAPLGDYIASLFGAIGILIALRHKEQTGRGQLIDVGIYEAVFRILD
EIAPAYGLFGKIREREGAGSFIAVPHGHFRSKDGKWVAIACTTDKMFERLAEAMERPELASPELYGDQRKRLAARDIVNQ
ITIEWVGSLTRDEVMRRCLEKEVPVGPLNSIADMFNDEHFLARGNFACIEAEGIGEVVVPNVIPRLSETPGRVTNLGPPL
GNATYEVLRELLDISAEEIKRLRSRKII
>Mature_427_residues
PPTGEEPSGHAESKPPASDPMSTPGTGQEQLPLSGIRVIDVGNFLAGPYAASILGEFGAEVLKIEHPLGGDPMRRFGTAT
ARHDATLAWLSEARNRKSVTIDLRQQEGVALFLKLVAKSDILIENFRPGTMEEWGLSWPVLQATNPGLIMLRVSGYGQTG
PYRRRSGFAHIAHAFSGLSYLAGFPGETPVLPGTAPLGDYIASLFGAIGILIALRHKEQTGRGQLIDVGIYEAVFRILDE
IAPAYGLFGKIREREGAGSFIAVPHGHFRSKDGKWVAIACTTDKMFERLAEAMERPELASPELYGDQRKRLAARDIVNQI
TIEWVGSLTRDEVMRRCLEKEVPVGPLNSIADMFNDEHFLARGNFACIEAEGIGEVVVPNVIPRLSETPGRVTNLGPPLG
NATYEVLRELLDISAEEIKRLRSRKII
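
The Mature entry above appears to be the Translated entry with the leading methionine removed, which accounts for the 428 versus 427 residue counts. A minimal sketch of that relationship follows; the helper name is hypothetical, and it assumes no processing other than removal of the initiator methionine.

```python
def mature_sequence(translated: str) -> str:
    """Drop the initiator methionine if present (assumes no other processing)."""
    return translated[1:] if translated.startswith("M") else translated


prefix = "MPPTGEEPSG"                    # first 10 residues of the Translated entry
mature_prefix = mature_sequence(prefix)  # "PPTGEEPSG"
```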

Specific function: Unknown

COG id: COG1804

COG function: function code C; Predicted acyl-CoA transferases/carnitine dehydratase

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the CaiB/BaiF CoA-transferase family [H]

Homologues:

Organism=Homo sapiens, GI300863128, Length=396, Percent_Identity=29.80, Blast_Score=193, Evalue=3e-49
Organism=Homo sapiens, GI300863124, Length=422, Percent_Identity=28.44, Blast_Score=182, Evalue=4e-46
Organism=Homo sapiens, GI300863126, Length=396, Percent_Identity=26.52, Blast_Score=151, Evalue=1e-36
Organism=Homo sapiens, GI13376042, Length=422, Percent_Identity=26.07, Blast_Score=144, Evalue=2e-34
Organism=Homo sapiens, GI266456254, Length=410, Percent_Identity=28.54, Blast_Score=136, Evalue=3e-32
Organism=Homo sapiens, GI42794625, Length=410, Percent_Identity=28.54, Blast_Score=136, Evalue=3e-32
Organism=Homo sapiens, GI266458393, Length=269, Percent_Identity=29.74, Blast_Score=116, Evalue=5e-26
Organism=Homo sapiens, GI266458397, Length=265, Percent_Identity=30.19, Blast_Score=115, Evalue=8e-26
Organism=Homo sapiens, GI266458395, Length=147, Percent_Identity=34.01, Blast_Score=89, Evalue=6e-18
Organism=Homo sapiens, GI42822893, Length=147, Percent_Identity=34.01, Blast_Score=89, Evalue=9e-18
Organism=Escherichia coli, GI87082093, Length=342, Percent_Identity=32.75, Blast_Score=182, Evalue=4e-47
Organism=Escherichia coli, GI1788717, Length=419, Percent_Identity=27.68, Blast_Score=125, Evalue=4e-30
Organism=Escherichia coli, GI1786222, Length=402, Percent_Identity=27.11, Blast_Score=112, Evalue=4e-26
Organism=Caenorhabditis elegans, GI115535051, Length=360, Percent_Identity=25.28, Blast_Score=94, Evalue=1e-19
Organism=Caenorhabditis elegans, GI32564160, Length=265, Percent_Identity=27.17, Blast_Score=93, Evalue=3e-19
Organism=Drosophila melanogaster, GI24648431, Length=403, Percent_Identity=32.51, Blast_Score=191, Evalue=9e-49
Organism=Drosophila melanogaster, GI24585488, Length=400, Percent_Identity=27.50, Blast_Score=127, Evalue=2e-29

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003673 [H]

Pfam domain/function: PF02515 CoA_transf_3 [H]

EC number: NA

Molecular weight: Translated: 46723; Mature: 46592

Theoretical pI: Translated: 6.26; Mature: 6.26

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.7 %Cys     (Translated Protein)
2.1 %Met     (Translated Protein)
2.8 %Cys+Met (Translated Protein)
0.7 %Cys     (Mature Protein)
1.9 %Met     (Mature Protein)
2.6 %Cys+Met (Mature Protein)
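
The percentages above are residue counts divided by sequence length. A minimal sketch of that calculation follows, with a hypothetical helper name; it assumes one-letter amino acid codes and rounds to one decimal to match the precision used in this record.

```python
def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq holding any of the given residues."""
    seq = "".join(seq.split()).upper()  # drop whitespace and line breaks
    if not seq:
        raise ValueError("empty sequence")
    count = sum(1 for aa in seq if aa in residues)
    return round(100.0 * count / len(seq), 1)


# Small illustrative input, not the protein itself.
met = residue_percent("MACM", "M")       # 50.0
cys_met = residue_percent("MACM", "CM")  # 75.0
```

For this protein, a hand count gives 3 Cys and 9 Met in the 428-residue translated sequence (0.7 % and 2.1 %), and 3 Cys and 8 Met in the 427-residue mature sequence (0.7 % and 1.9 %), in agreement with the table.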

Secondary structure:

>Translated Secondary Structure
MPPTGEEPSGHAESKPPASDPMSTPGTGQEQLPLSGIRVIDVGNFLAGPYAASILGEFGA
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHCH
EVLKIEHPLGGDPMRRFGTATARHDATLAWLSEARNRKSVTIDLRQQEGVALFLKLVAKS
HHEEECCCCCCCHHHHHCCCHHHCCHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHHC
DILIENFRPGTMEEWGLSWPVLQATNPGLIMLRVSGYGQTGPYRRRSGFAHIAHAFSGLS
CCEEECCCCCCHHHHCCCCCEEECCCCCEEEEEECCCCCCCCCHHHCCHHHHHHHHHHHH
YLAGFPGETPVLPGTAPLGDYIASLFGAIGILIALRHKEQTGRGQLIDVGIYEAVFRILD
HHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHHHHH
EIAPAYGLFGKIREREGAGSFIAVPHGHFRSKDGKWVAIACTTDKMFERLAEAMERPELA
HHHHHHHHHHHHHHHCCCCCEEEECCCCCCCCCCCEEEEEECHHHHHHHHHHHHHCCCCC
SPELYGDQRKRLAARDIVNQITIEWVGSLTRDEVMRRCLEKEVPVGPLNSIADMFNDEHF
CCCCCCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCHHHHHHHHCCCCE
LARGNFACIEAEGIGEVVVPNVIPRLSETPGRVTNLGPPLGNATYEVLRELLDISAEEIK
EECCCEEEEECCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHH
RLRSRKII
HHHHCCCC
>Mature Secondary Structure 
PPTGEEPSGHAESKPPASDPMSTPGTGQEQLPLSGIRVIDVGNFLAGPYAASILGEFGA
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHCH
EVLKIEHPLGGDPMRRFGTATARHDATLAWLSEARNRKSVTIDLRQQEGVALFLKLVAKS
HHEEECCCCCCCHHHHHCCCHHHCCHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHHC
DILIENFRPGTMEEWGLSWPVLQATNPGLIMLRVSGYGQTGPYRRRSGFAHIAHAFSGLS
CCEEECCCCCCHHHHCCCCCEEECCCCCEEEEEECCCCCCCCCHHHCCHHHHHHHHHHHH
YLAGFPGETPVLPGTAPLGDYIASLFGAIGILIALRHKEQTGRGQLIDVGIYEAVFRILD
HHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHHHHH
EIAPAYGLFGKIREREGAGSFIAVPHGHFRSKDGKWVAIACTTDKMFERLAEAMERPELA
HHHHHHHHHHHHHHHCCCCCEEEECCCCCCCCCCCEEEEEECHHHHHHHHHHHHHCCCCC
SPELYGDQRKRLAARDIVNQITIEWVGSLTRDEVMRRCLEKEVPVGPLNSIADMFNDEHF
CCCCCCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCHHHHHHHHCCCCE
LARGNFACIEAEGIGEVVVPNVIPRLSETPGRVTNLGPPLGNATYEVLRELLDISAEEIK
EECCCEEEEECCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHH
RLRSRKII
HHHHCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9205837; 9278503; 8125343 [H]