Definition: Mycobacterium bovis BCG str. Pasteur 1173P2, complete genome.
Accession: NC_008769
Length: 4,374,522

The map label for this gene is yfdE [H]

Identifier: 121639161

GI number: 121639161

Start: 3605919

End: 3607103

Strand: Direct

Name: yfdE [H]

Synonym: BCG_3301

Alternate gene names: 121639161

Gene position: 3605919-3607103 (Clockwise)

Preceding gene: 121639159

Following gene: 121639162

Centisome position: 82.43

GC content: 65.23
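
The coordinate fields above are related by simple arithmetic. The Python sketch below assumes 1-based inclusive coordinates, and assumes that the centisome position is the start coordinate expressed as a percentage of the genome length. Both assumptions reproduce the values listed in this record (1185 bases, 82.43), but neither is stated by the record itself. The function names are illustrative only.

```python
GENOME_LENGTH = 4_374_522  # from the Length field of this record


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


def flanking_regions(start: int, end: int, size: int = 100):
    """Coordinates of the upstream and downstream flanks for a direct-strand gene."""
    upstream = (start - size, start - 1)
    downstream = (end + 1, end + size)
    return upstream, downstream


print(gene_length(3605919, 3607103))                # 1185
print(round(centisome(3605919, GENOME_LENGTH), 2))  # 82.43
print(flanking_regions(3605919, 3607103))
```

For a gene on the complementary strand the upstream and downstream flanks would swap sides; the sketch covers only the direct-strand case that applies here.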

Gene sequence:

>1185_bases
ATGCCGACCAGCAACCCCGCCAAACCACTTGACGGGTTTCGGGTATTGGATTTCACCCAGAACGTGGCCGGGCCGCTGGC
CGGGCAGGTGCTGGTCGACCTGGGGGCTGAAGTCATCAAGGTGGAGGCGCCCGGCGGTGAAGCGGCCCGTCAGATCACCT
CGGTGTTACCCGGACGCCCGCCCCTGGCCACCTACTTTCTGCCCAACAATCGTGGCAAGAAGTCGGTGACGGTGGACCTA
ACCACCGAGCAGGCCAAGCAGCAGATGCTGCGGCTCGCGGACACCGCCGACGTTGTCTTGGAGGCGTTTCGGCCCGGCAC
CATGGAAAAGCTGGGCCTAGGCCCTGATGACTTGCGCTCTCGTAACCCCAACCTGATCTACGCGCGCCTAACCGCTTACG
GCGGCAACGGCCCGCACGGCAGCCGGCCGGGAATCGACCTGGTGGTGGCCGCCGAGGCCGGCATGACCACCGGAATGCCC
ACGCCTGAGGGCAAGCCACAGATCATCCCATTTCAGCTCGTCGACAACGCCAGCGGTCACGTGCTGGCCCAGGCCGTGCT
GGCCGCGCTGCTGCACCGCGAGCGGAACGGGGTGGCCGACGTCGTCCAGGTCGCGATGTACGACGTCGCGGTGGGACTAC
AAGCCAACCAGCTGATGATGCATCTCAATCGGGCCGCTAGCGACCAGCCGAAGCCTGAACCGGCACCGAAGGCCAAGCGG
CGCAAGGGAGTCGGCTTCGCTACCCAGCCATCGGACGCGTTTCGCACCGCCGATAGGTACATCGTCATCAGCGCATATGT
GCCCAAACACTGGCAGAAGCTGTGCTACCTCATCGGCCGGCCTGACCTCGTTGAAGATCAACGATTTGCCGAACAACGCT
CCCGGTCGATCAACTACGCCGAGTTGACCGCCGAGTTGGAATTGGCACTGGCCAGCAAGACCGCCACCGAATGGGTCCAG
TTGCTGCAGGCAAACGGCCTCATGGCCTGCCTCGCCCATACCTGGAAACAGGTCGTCGACACCCCCCTTTTCGCCGAGAG
CGACCTCACCCTGGAAGTCGGTCGCGGGGCGGACACCATCACGGTGATCCGCACACCGGCGCGCTACGCCAGCTTCCGCG
CGGTCGTCACCGATCCCCCGCCCACCGCCGGCGAACACAATGCCGTGTTTCTGGCCCGGCCCTGA
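
As a rough illustration of how the GC content and protein sequence fields relate to the gene sequence above, the following Python sketch computes GC percentage and translates an open reading frame. It assumes the standard codon assignments (the bacterial translation table uses the same amino acid assignments and differs only in permitted start codons) and an ATG start, as in this gene. The helper names `gc_content` and `translate` are hypothetical and not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    dna = dna.upper()
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the gene sequence in this record.
print(translate("ATGCCGACCAGCAACCCCGCCAAACCA"))  # MPTSNPAKP
```

A 1185-base reading frame holds 395 codons; one of them is the terminal stop codon (TGA), which is consistent with the 394 translated residues reported in this record.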

Upstream 100 bases:

>100_bases
CCCGACGATGGTAGAGGCAAGACATGCCGGGCGGTCGCCGCGGCGTCGCGAACCCGTATGGTTCAGGGAGGATGCCGCAC
GCCAGGGAAGGTCACCACCG

Downstream 100 bases:

>100_bases
CGCTGTGACCATTCCGAGGAGTCAACACATGAGCACCGCAGTCAACAGCTGCACCGAGGCGCCCGCATCGCGATCACAGT
GGATGCTGGCTAATCTGCGG

Product: hypothetical protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 394; Mature: 393

Protein sequence:

>394_residues
MPTSNPAKPLDGFRVLDFTQNVAGPLAGQVLVDLGAEVIKVEAPGGEAARQITSVLPGRPPLATYFLPNNRGKKSVTVDL
TTEQAKQQMLRLADTADVVLEAFRPGTMEKLGLGPDDLRSRNPNLIYARLTAYGGNGPHGSRPGIDLVVAAEAGMTTGMP
TPEGKPQIIPFQLVDNASGHVLAQAVLAALLHRERNGVADVVQVAMYDVAVGLQANQLMMHLNRAASDQPKPEPAPKAKR
RKGVGFATQPSDAFRTADRYIVISAYVPKHWQKLCYLIGRPDLVEDQRFAEQRSRSINYAELTAELELALASKTATEWVQ
LLQANGLMACLAHTWKQVVDTPLFAESDLTLEVGRGADTITVIRTPARYASFRAVVTDPPPTAGEHNAVFLARP

Sequences:

>Translated_394_residues
MPTSNPAKPLDGFRVLDFTQNVAGPLAGQVLVDLGAEVIKVEAPGGEAARQITSVLPGRPPLATYFLPNNRGKKSVTVDL
TTEQAKQQMLRLADTADVVLEAFRPGTMEKLGLGPDDLRSRNPNLIYARLTAYGGNGPHGSRPGIDLVVAAEAGMTTGMP
TPEGKPQIIPFQLVDNASGHVLAQAVLAALLHRERNGVADVVQVAMYDVAVGLQANQLMMHLNRAASDQPKPEPAPKAKR
RKGVGFATQPSDAFRTADRYIVISAYVPKHWQKLCYLIGRPDLVEDQRFAEQRSRSINYAELTAELELALASKTATEWVQ
LLQANGLMACLAHTWKQVVDTPLFAESDLTLEVGRGADTITVIRTPARYASFRAVVTDPPPTAGEHNAVFLARP
>Mature_393_residues
PTSNPAKPLDGFRVLDFTQNVAGPLAGQVLVDLGAEVIKVEAPGGEAARQITSVLPGRPPLATYFLPNNRGKKSVTVDLT
TEQAKQQMLRLADTADVVLEAFRPGTMEKLGLGPDDLRSRNPNLIYARLTAYGGNGPHGSRPGIDLVVAAEAGMTTGMPT
PEGKPQIIPFQLVDNASGHVLAQAVLAALLHRERNGVADVVQVAMYDVAVGLQANQLMMHLNRAASDQPKPEPAPKAKRR
KGVGFATQPSDAFRTADRYIVISAYVPKHWQKLCYLIGRPDLVEDQRFAEQRSRSINYAELTAELELALASKTATEWVQL
LQANGLMACLAHTWKQVVDTPLFAESDLTLEVGRGADTITVIRTPARYASFRAVVTDPPPTAGEHNAVFLARP

Specific function: Unknown

COG id: COG1804

COG function: function code C; Predicted acyl-CoA transferases/carnitine dehydratase

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the CaiB/BaiF CoA-transferase family [H]

Homologues:

Organism=Homo sapiens, GI300863128, Length=395, Percent_Identity=29.1139240506329, Blast_Score=166, Evalue=3e-41,
Organism=Homo sapiens, GI300863124, Length=421, Percent_Identity=27.5534441805226, Blast_Score=154, Evalue=1e-37,
Organism=Homo sapiens, GI300863126, Length=392, Percent_Identity=28.0612244897959, Blast_Score=145, Evalue=9e-35,
Organism=Homo sapiens, GI13376042, Length=421, Percent_Identity=24.7030878859857, Blast_Score=113, Evalue=3e-25,
Organism=Homo sapiens, GI42794625, Length=334, Percent_Identity=28.7425149700599, Blast_Score=110, Evalue=2e-24,
Organism=Homo sapiens, GI266456254, Length=334, Percent_Identity=28.7425149700599, Blast_Score=110, Evalue=2e-24,
Organism=Homo sapiens, GI266458393, Length=281, Percent_Identity=29.5373665480427, Blast_Score=100, Evalue=3e-21,
Organism=Homo sapiens, GI266458397, Length=200, Percent_Identity=33.5, Blast_Score=99, Evalue=6e-21,
Organism=Homo sapiens, GI42822893, Length=168, Percent_Identity=35.1190476190476, Blast_Score=91, Evalue=2e-18,
Organism=Homo sapiens, GI266458395, Length=168, Percent_Identity=35.1190476190476, Blast_Score=91, Evalue=2e-18,
Organism=Escherichia coli, GI87082093, Length=354, Percent_Identity=30.225988700565, Blast_Score=159, Evalue=3e-40,
Organism=Escherichia coli, GI1788717, Length=409, Percent_Identity=25.9168704156479, Blast_Score=114, Evalue=1e-26,
Organism=Escherichia coli, GI1786222, Length=318, Percent_Identity=24.8427672955975, Blast_Score=74, Evalue=1e-14,
Organism=Caenorhabditis elegans, GI32564160, Length=325, Percent_Identity=24.6153846153846, Blast_Score=86, Evalue=3e-17,
Organism=Caenorhabditis elegans, GI115535051, Length=358, Percent_Identity=24.0223463687151, Blast_Score=81, Evalue=1e-15,
Organism=Drosophila melanogaster, GI24648431, Length=380, Percent_Identity=29.7368421052632, Blast_Score=171, Evalue=7e-43,
Organism=Drosophila melanogaster, GI24585488, Length=318, Percent_Identity=31.7610062893082, Blast_Score=124, Evalue=8e-29,
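
The homologue lines above follow a regular comma-separated layout, so they can be read into structured records. The Python sketch below is one possible parser; it assumes that organism names contain no commas and that the bare `GI…` token is the only field without a `key=value` form, which holds for every line in this record. The function name `parse_homologue` and the output keys are illustrative.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' homologue line."""
    fields = {}
    for part in line.split(","):
        part = part.strip()
        if not part:
            continue  # the trailing comma leaves an empty final field
        if "=" in part:
            key, value = part.split("=", 1)
            fields[key.strip()] = value.strip()
        elif part.startswith("GI"):
            fields["GI"] = part[2:]  # bare token such as GI24648431
    return {
        "organism": fields["Organism"],
        "gi": fields["GI"],
        "length": int(fields["Length"]),
        "percent_identity": float(fields["Percent_Identity"]),
        "blast_score": int(fields["Blast_Score"]),
        "evalue": float(fields["Evalue"]),
    }


line = ("Organism=Drosophila melanogaster, GI24648431, Length=380, "
        "Percent_Identity=29.7368421052632, Blast_Score=171, Evalue=7e-43,")
hit = parse_homologue(line)
print(hit["organism"], hit["blast_score"], hit["evalue"])
```

Parsed records can then be sorted by `evalue` or `blast_score` to rank the hits across organisms.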

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003673 [H]

Pfam domain/function: PF02515 CoA_transf_3 [H]

EC number: NA

Molecular weight: Translated: 42531; Mature: 42400

Theoretical pI: Translated: 8.21; Mature: 8.21

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.5 %Cys     (Translated Protein)
2.3 %Met     (Translated Protein)
2.8 %Cys+Met (Translated Protein)
0.5 %Cys     (Mature Protein)
2.0 %Met     (Mature Protein)
2.5 %Cys+Met (Mature Protein)
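
The percentages above can be recomputed from the protein sequence given in this record. The Python sketch below assumes that the mature protein differs from the translated protein only by removal of the initiator methionine, which matches the 394 and 393 residue counts listed earlier. The helper `residue_percent` is a hypothetical name.

```python
# Translated protein sequence as listed in this record (394 residues).
PROTEIN = (
    "MPTSNPAKPLDGFRVLDFTQNVAGPLAGQVLVDLGAEVIKVEAPGGEAARQITSVLPGRPPLATYFLPNNRGKKSVTVDL"
    "TTEQAKQQMLRLADTADVVLEAFRPGTMEKLGLGPDDLRSRNPNLIYARLTAYGGNGPHGSRPGIDLVVAAEAGMTTGMP"
    "TPEGKPQIIPFQLVDNASGHVLAQAVLAALLHRERNGVADVVQVAMYDVAVGLQANQLMMHLNRAASDQPKPEPAPKAKR"
    "RKGVGFATQPSDAFRTADRYIVISAYVPKHWQKLCYLIGRPDLVEDQRFAEQRSRSINYAELTAELELALASKTATEWVQ"
    "LLQANGLMACLAHTWKQVVDTPLFAESDLTLEVGRGADTITVIRTPARYASFRAVVTDPPPTAGEHNAVFLARP"
)


def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    hits = sum(seq.count(r) for r in residues)
    return round(100.0 * hits / len(seq), 1)


translated = PROTEIN
mature = PROTEIN[1:]  # assumption: only the initiator Met is removed

for label, seq in (("Translated", translated), ("Mature", mature)):
    print(label,
          residue_percent(seq, "C"),
          residue_percent(seq, "M"),
          residue_percent(seq, "CM"))
```

With 2 cysteines and 9 methionines in the translated sequence, this reproduces the 0.5, 2.3 and 2.8 percent figures, and 0.5, 2.0 and 2.5 percent for the mature form.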

Secondary structure:

>Translated Secondary Structure
MPTSNPAKPLDGFRVLDFTQNVAGPLAGQVLVDLGAEVIKVEAPGGEAARQITSVLPGRP
CCCCCCCCCCCCCEEEEHHHHHCCHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHCCCCC
PLATYFLPNNRGKKSVTVDLTTEQAKQQMLRLADTADVVLEAFRPGTMEKLGLGPDDLRS
CEEEEEECCCCCCEEEEEEECHHHHHHHHHHHHHHHHHHHHHHCCCCHHHCCCCHHHHHC
RNPNLIYARLTAYGGNGPHGSRPGIDLVVAAEAGMTTGMPTPEGKPQIIPFQLVDNASGH
CCCCEEEEEEEEECCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCEEEEEEECCCCCH
VLAQAVLAALLHRERNGVADVVQVAMYDVAVGLQANQLMMHLNRAASDQPKPEPAPKAKR
HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCCCCHHHH
RKGVGFATQPSDAFRTADRYIVISAYVPKHWQKLCYLIGRPDLVEDQRFAEQRSRSINYA
HCCCCCCCCCCHHHHCCCCEEEEEECCCHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCHH
ELTAELELALASKTATEWVQLLQANGLMACLAHTWKQVVDTPLFAESDLTLEVGRGADTI
HHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCEE
TVIRTPARYASFRAVVTDPPPTAGEHNAVFLARP
EEEECCHHHHEEEEEEECCCCCCCCCCEEEEECC
>Mature Secondary Structure 
PTSNPAKPLDGFRVLDFTQNVAGPLAGQVLVDLGAEVIKVEAPGGEAARQITSVLPGRP
CCCCCCCCCCCCEEEEHHHHHCCHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHCCCCC
PLATYFLPNNRGKKSVTVDLTTEQAKQQMLRLADTADVVLEAFRPGTMEKLGLGPDDLRS
CEEEEEECCCCCCEEEEEEECHHHHHHHHHHHHHHHHHHHHHHCCCCHHHCCCCHHHHHC
RNPNLIYARLTAYGGNGPHGSRPGIDLVVAAEAGMTTGMPTPEGKPQIIPFQLVDNASGH
CCCCEEEEEEEEECCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCEEEEEEECCCCCH
VLAQAVLAALLHRERNGVADVVQVAMYDVAVGLQANQLMMHLNRAASDQPKPEPAPKAKR
HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCCCCHHHH
RKGVGFATQPSDAFRTADRYIVISAYVPKHWQKLCYLIGRPDLVEDQRFAEQRSRSINYA
HCCCCCCCCCCHHHHCCCCEEEEEECCCHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCHH
ELTAELELALASKTATEWVQLLQANGLMACLAHTWKQVVDTPLFAESDLTLEVGRGADTI
HHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCEE
TVIRTPARYASFRAVVTDPPPTAGEHNAVFLARP
EEEECCHHHHEEEEEEECCCCCCCCCCEEEEECC
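
The secondary structure blocks above interleave rows of residues with rows of predicted states. The Python sketch below joins such a block back into two aligned strings and summarises the state composition. It assumes the common convention that H is helix, E is strand and C is coil; the record does not define the letters, so that reading is an assumption. Function names are illustrative.

```python
from collections import Counter


def parse_structure_block(lines):
    """Join alternating residue/state rows into two aligned strings."""
    rows = [ln.strip() for ln in lines if ln.strip() and not ln.startswith(">")]
    sequence = "".join(rows[0::2])  # rows 1, 3, 5, ... hold residues
    states = "".join(rows[1::2])    # rows 2, 4, 6, ... hold predicted states
    if len(sequence) != len(states):
        raise ValueError("residue and state rows are not the same length")
    return sequence, states


def state_composition(states: str) -> dict:
    """Percentage of positions in each state (H, E, C assumed)."""
    counts = Counter(states)
    return {s: round(100.0 * counts[s] / len(states), 1) for s in "HEC"}


# Toy block in the same layout as the record (not real prediction data).
toy = [">Toy Secondary Structure", "MPT", "CCH", "SN", "EE"]
seq, states = parse_structure_block(toy)
print(seq, states, state_composition(states))
```

Applied to the full block, a mix of substantial H and E fractions would be in line with the "Alpha Beta" structure class reported in this record.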

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9205837; 9278503; 8125343 [H]