Definition Clostridium botulinum A2 str. Kyoto chromosome, complete genome.
Accession NC_012563
Length 4,155,278

The map label for this gene is pflD [H]

Identifier: 226949401

GI number: 226949401

Start: 2398878

End: 2401418

Strand: Reverse

Name: pflD [H]

Synonym: CLM_2326

Alternate gene names: 226949401

Gene position: 2401418-2398878 (Counterclockwise)
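The coordinates above can be checked against the 2541-base gene sequence listed in this record. A minimal sketch, assuming the Start and End values are 1-based and inclusive (the record does not state its coordinate convention); the function name is illustrative only:

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature, assuming 1-based inclusive coordinates."""
    low, high = sorted((start, end))  # order-independent, so reverse-strand input also works
    return high - low + 1


print(gene_length(2398878, 2401418))  # 2541
```

Under that assumption the span agrees with the ">2541_bases" header of the gene sequence.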

Preceding gene: 226949402

Following gene: 226949400

Centisome position: 57.79
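The centisome value can be reproduced as a percentage of the chromosome length (4,155,278). A small sketch, assuming the position used is the higher coordinate, 2401418, which is where a reverse-strand gene begins; this convention is inferred from the numbers and is not stated in the record:

```python
GENOME_LENGTH = 4_155_278  # from the Length field of this record


def centisome(position: int, genome_length: int = GENOME_LENGTH) -> float:
    """Position expressed as a percentage of the chromosome length."""
    return 100.0 * position / genome_length


print(round(centisome(2401418), 2))  # 57.79
```

Using the lower coordinate (2398878) instead gives 57.73, so the listed 57.79 appears to refer to the 2401418 end.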

GC content: 35.1
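GC content is the percentage of G and C bases in the gene sequence. The sketch below shows one way to compute it; the function name is hypothetical, and the call uses only the first 20 bases of the gene sequence as a short illustration, not the full 2541 bases:

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First 20 bases of the gene sequence in this record (illustration only).
print(gc_percent("TTGGATATACGTGAATTTTC"))  # 30.0
```

Applying the same function to the complete gene sequence should give a value comparable to the 35.1 listed above.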

Gene sequence:

>2541_bases
TTGGATATACGTGAATTTTCAAATAAATTTATGGAAGCTACTAAAAATATGTCTGATGAAGAACGCGCTGGATTAATGAA
GATGTTTCAAAGTGTTTCTAATGAAATAACAAGAGAAGAACCAGCTACATCAAAGGTTGCGTGTGATAATAATGGTGAAA
TACCAGATGGAATGACAGAGCGTCTAGTTAAGTTAAAGGAAACTTATCTAAAACATGTTCCAACAATAACAACTCATAGA
GCAAGAGCTATTACTAAAATAGCAAAAGAAAACCCTGGTACTCCTAAATCAGTTTTACGTGGTAAATGTTTTAAACATTG
TTGCGAAACTGCACCACTTGTAATTCAAGACAATGAACTTATAGTTGGAGCACCAAATGGACAACCTCGTGCAGGAGCAT
TTTCTCCTGATATAGCTTGGAGATGGATGGTTGATGAAATTGATACAATAGGAACTCGTCCACAAGATCCATTCTATATA
TCAGAAGAAGATAAGAAAATCATGCGTGAGGAGTTATTCCCATATTGGGCGGGTAAATCCGTTGATGAATACTGCGAAGA
TCAATATCGTGAAGCAGGAGTATGGGAACTTTCAGGAGAATCTTTTGTTTCAGATTGTTCATACCACGCAGTAAATGGTG
GAGGAGACTCTAACCCAGGATATGACGTTGTATTAATGAAAAAAGGTATGCTTGATATAAAGAGAGAAGCAGAAGAAAAA
TTAGCTGAACTTAAATATGAAAATCCAGAGGATATAGATAAAATCTATTTCTACAAATCATTAATTGATACCGCTGAAGG
TGTTATGATATATGCTAAACGTATGTCAGATTATGCTGCTGAATTAGCTCAAAAAGAAACTAACCCTAAGCGTAAAGCAG
AACTTCTAAAGATTTCTGAAATAAATGCTAGAGTTCCAGCACATAAGCCAAGTACTTATTGGGAAGCAATTCAAGCAGTT
TGGACTATAGAATCATTACTTGTAGTTGAGGAAAATCAAACAGGTATGTCTATAGGACGTGTTGACCAATACATGTACCC
ATTCTACAAAGCTGATATTGAAGCTGGACGTATGACTGATTATGAAGCATTTGAATTATCAGGTTGTATGCTTATAAAAA
TGTCTGAAATGATGTGGATAACAAGTGAAGGTGGTTCTAAATTCTTCGCAGGTTATCAACCATTTGTAAATATGTGTGTA
GGTGGTGTTACTAGAGAAGGCCGTGATGCTACAAACGAATTAACATACCTTTTAATGGATGCAGTTCGTCATGTTAAAAT
ATATCAACCATCTTTAGCTTGCCGTATACACAAATCATCACCACAAAAATATCTTAAAAAGATAGTTGACGTTATTCGTG
CAGGTATGGGATTCCCAGCATGCCACTTTGACGATGTTCATATAAAAATTATGTTAGCTAAAGGTGTTTCTATAGAAGAC
GCAAGAGATTACTGCTTAATGGGTTGTGTTGAACCACAAAAATCCGGAAGACTATATCAATGGACTTCTACAGGATACAC
TCAATGGCCTATATGTATAGAACTTGTTTTAAACAATGGTGTACCATTATGGTATGGTAAACAAGTATGCCCAGATATGG
GAGACTTAAGCCAGTTTAAAACTTATGAACAATTTGAAGCAGCTGTTAAAGAGCAAATCAAATTCATTACTAAATGGACA
AGTGTTGCTACAGTAATTTCTCAGCGTGTTCATAAAGAACTTGCTCCAAAGCCACTTATGTCTATGATGTATGAAGGTTG
TATGGAAAATGGTAGAGGGGTAGAAGCCGGCGGAGCTATGTATAACTTCGGACCTGGAGTAGTATGGAGTGGACTTGCTA
CTTATGCAGATTCCATGGCAGCTATAAAGAAATTAGTATTTGAAGATAAAAAATATACTCTACAAGAAATGAACGAAGCT
TTAAAAGCAGATTTCGTTGGATATGAACAATTAAGAAAAGATTGTTTAGAAGCACCTAAGTATGGTAACGATGATGATTA
TGCAGATTTAATTGCTGCTGATTTAATTAACTTTACAGAACAAGAACATCGTAAATATAAAACATTATATTCAGTACTTA
GTCATGGTACTTTATCAATATCAAACAATACTCCATTTGGACAAATGACTGGAGCTACAGCAAATGGACGTAGAGCATGG
ATGCCTTTATCAGATGGTATAAGTCCATCACAAGGCGCTGATTTTAAAGGCCCTACTTCTATAATAAAATCTGTTTCTAA
GATGTCTTGTGAAGATATGAATATAGGTATGGTTCATAACTTTAAGTTAATAGCTGGTCTTCTTGATACACCAGAAGGAG
AACAAGGAATCATTACATTATTACGTAGTGCTTGTGCCCTTCAACTTGGAGAAGTTCAATTTAACTATTTAGACAACAAG
ACTTTAATAGAAGCTCAAAAACATCCAGATCAATATCGTGATTTAATTGTTCGTGTTGCTGGATACAGTGCATTCTTCGT
TGAGTTATGTAAAGATGTTCAAGATGAAATTATAAGTAGAACTATGCTTACACATTTCTAA

Upstream 100 bases:

>100_bases
TATAAGTTATTAGAGAGTATTTTAAAAAAAGTTCTACAGGACTTAAAATAAAAAATCATATTTAATATATGAAATACATT
AAAGAATGTGAGGGAATAAA

Downstream 100 bases:

>100_bases
TAAAAATTTGATGGAAAATATATAACTTATAGAAAATATATAACTTAGTCCCTTTAAAATAATTTATATAAAGTCTGCAA
TCTACAGCCTGTTTTGTAGA

Product: formate acetyltransferase

Products: NA

Alternate protein names: Pyruvate formate-lyase 2 [H]

Number of amino acids: Translated: 846; Mature: 846
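The residue count follows from the gene length: 2541 bases make 847 codons, and the final TAA stop codon is not translated, leaving 846 residues. The gene starts with TTG while the protein starts with M, which is consistent with TTG acting as an alternative start codon. A minimal sketch of that translation, assuming the bacterial genetic code (same codon assignments as the standard code) and a simplified, assumed set of start codons:

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments in TCAG order (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}
START_CODONS = {"ATG", "GTG", "TTG"}  # assumption: simplified set of bacterial starts


def translate(cds: str) -> str:
    """Translate a coding sequence, reading an alternative start codon as Met."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        codon = cds[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First 12 codons of the gene sequence in this record.
print(translate("TTGGATATACGTGAATTTTCAAATAAATTTATGGAA"))  # MDIREFSNKFME
print(2541 // 3 - 1)  # 846
```

The output matches the first twelve residues of the protein sequence listed below.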

Protein sequence:

>846_residues
MDIREFSNKFMEATKNMSDEERAGLMKMFQSVSNEITREEPATSKVACDNNGEIPDGMTERLVKLKETYLKHVPTITTHR
ARAITKIAKENPGTPKSVLRGKCFKHCCETAPLVIQDNELIVGAPNGQPRAGAFSPDIAWRWMVDEIDTIGTRPQDPFYI
SEEDKKIMREELFPYWAGKSVDEYCEDQYREAGVWELSGESFVSDCSYHAVNGGGDSNPGYDVVLMKKGMLDIKREAEEK
LAELKYENPEDIDKIYFYKSLIDTAEGVMIYAKRMSDYAAELAQKETNPKRKAELLKISEINARVPAHKPSTYWEAIQAV
WTIESLLVVEENQTGMSIGRVDQYMYPFYKADIEAGRMTDYEAFELSGCMLIKMSEMMWITSEGGSKFFAGYQPFVNMCV
GGVTREGRDATNELTYLLMDAVRHVKIYQPSLACRIHKSSPQKYLKKIVDVIRAGMGFPACHFDDVHIKIMLAKGVSIED
ARDYCLMGCVEPQKSGRLYQWTSTGYTQWPICIELVLNNGVPLWYGKQVCPDMGDLSQFKTYEQFEAAVKEQIKFITKWT
SVATVISQRVHKELAPKPLMSMMYEGCMENGRGVEAGGAMYNFGPGVVWSGLATYADSMAAIKKLVFEDKKYTLQEMNEA
LKADFVGYEQLRKDCLEAPKYGNDDDYADLIAADLINFTEQEHRKYKTLYSVLSHGTLSISNNTPFGQMTGATANGRRAW
MPLSDGISPSQGADFKGPTSIIKSVSKMSCEDMNIGMVHNFKLIAGLLDTPEGEQGIITLLRSACALQLGEVQFNYLDNK
TLIEAQKHPDQYRDLIVRVAGYSAFFVELCKDVQDEIISRTMLTHF

Sequences:

>Translated_846_residues
MDIREFSNKFMEATKNMSDEERAGLMKMFQSVSNEITREEPATSKVACDNNGEIPDGMTERLVKLKETYLKHVPTITTHR
ARAITKIAKENPGTPKSVLRGKCFKHCCETAPLVIQDNELIVGAPNGQPRAGAFSPDIAWRWMVDEIDTIGTRPQDPFYI
SEEDKKIMREELFPYWAGKSVDEYCEDQYREAGVWELSGESFVSDCSYHAVNGGGDSNPGYDVVLMKKGMLDIKREAEEK
LAELKYENPEDIDKIYFYKSLIDTAEGVMIYAKRMSDYAAELAQKETNPKRKAELLKISEINARVPAHKPSTYWEAIQAV
WTIESLLVVEENQTGMSIGRVDQYMYPFYKADIEAGRMTDYEAFELSGCMLIKMSEMMWITSEGGSKFFAGYQPFVNMCV
GGVTREGRDATNELTYLLMDAVRHVKIYQPSLACRIHKSSPQKYLKKIVDVIRAGMGFPACHFDDVHIKIMLAKGVSIED
ARDYCLMGCVEPQKSGRLYQWTSTGYTQWPICIELVLNNGVPLWYGKQVCPDMGDLSQFKTYEQFEAAVKEQIKFITKWT
SVATVISQRVHKELAPKPLMSMMYEGCMENGRGVEAGGAMYNFGPGVVWSGLATYADSMAAIKKLVFEDKKYTLQEMNEA
LKADFVGYEQLRKDCLEAPKYGNDDDYADLIAADLINFTEQEHRKYKTLYSVLSHGTLSISNNTPFGQMTGATANGRRAW
MPLSDGISPSQGADFKGPTSIIKSVSKMSCEDMNIGMVHNFKLIAGLLDTPEGEQGIITLLRSACALQLGEVQFNYLDNK
TLIEAQKHPDQYRDLIVRVAGYSAFFVELCKDVQDEIISRTMLTHF
>Mature_846_residues
MDIREFSNKFMEATKNMSDEERAGLMKMFQSVSNEITREEPATSKVACDNNGEIPDGMTERLVKLKETYLKHVPTITTHR
ARAITKIAKENPGTPKSVLRGKCFKHCCETAPLVIQDNELIVGAPNGQPRAGAFSPDIAWRWMVDEIDTIGTRPQDPFYI
SEEDKKIMREELFPYWAGKSVDEYCEDQYREAGVWELSGESFVSDCSYHAVNGGGDSNPGYDVVLMKKGMLDIKREAEEK
LAELKYENPEDIDKIYFYKSLIDTAEGVMIYAKRMSDYAAELAQKETNPKRKAELLKISEINARVPAHKPSTYWEAIQAV
WTIESLLVVEENQTGMSIGRVDQYMYPFYKADIEAGRMTDYEAFELSGCMLIKMSEMMWITSEGGSKFFAGYQPFVNMCV
GGVTREGRDATNELTYLLMDAVRHVKIYQPSLACRIHKSSPQKYLKKIVDVIRAGMGFPACHFDDVHIKIMLAKGVSIED
ARDYCLMGCVEPQKSGRLYQWTSTGYTQWPICIELVLNNGVPLWYGKQVCPDMGDLSQFKTYEQFEAAVKEQIKFITKWT
SVATVISQRVHKELAPKPLMSMMYEGCMENGRGVEAGGAMYNFGPGVVWSGLATYADSMAAIKKLVFEDKKYTLQEMNEA
LKADFVGYEQLRKDCLEAPKYGNDDDYADLIAADLINFTEQEHRKYKTLYSVLSHGTLSISNNTPFGQMTGATANGRRAW
MPLSDGISPSQGADFKGPTSIIKSVSKMSCEDMNIGMVHNFKLIAGLLDTPEGEQGIITLLRSACALQLGEVQFNYLDNK
TLIEAQKHPDQYRDLIVRVAGYSAFFVELCKDVQDEIISRTMLTHF

Specific function: Glucose metabolism (nonoxidative conversion). [C]

COG id: COG1882

COG function: function code C; Pyruvate-formate lyase

Gene ontology:

Cell location: Cytoplasm [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 pyruvate formate lyase domain [H]

Homologues:

Organism=Escherichia coli, GI1790388, Length=796, Percent_Identity=33.7939698492462, Blast_Score=436, Evalue=1e-123
Organism=Escherichia coli, GI1787044, Length=819, Percent_Identity=32.6007326007326, Blast_Score=406, Evalue=1e-114
Organism=Escherichia coli, GI48994926, Length=591, Percent_Identity=22.8426395939086, Blast_Score=128, Evalue=2e-30
Organism=Escherichia coli, GI1787131, Length=543, Percent_Identity=22.2836095764273, Blast_Score=119, Evalue=8e-28
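Each homologue entry is a comma-separated list of Key=Value fields, with the GI identifier given as a bare token. The sketch below shows one way such a line could be parsed; the function name and the "GI" key used for the bare token are illustrative choices, not part of the record:

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line of Key=Value fields into a dictionary."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        key, sep, value = field.partition("=")
        if sep:
            record[key] = value
        else:
            record["GI"] = field  # bare token such as GI1790388 (hypothetical key name)
    # Convert the numeric fields that appear in this record.
    casts = (("Length", int), ("Percent_Identity", float),
             ("Blast_Score", int), ("Evalue", float))
    for key, cast in casts:
        if key in record:
            record[key] = cast(record[key])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1790388, Length=796, "
    "Percent_Identity=33.7939698492462, Blast_Score=436, Evalue=1e-123,"
)
print(hit["Organism"], hit["Length"], round(hit["Percent_Identity"], 2))
# Escherichia coli 796 33.79
```

Rounding Percent_Identity on output keeps the stored value intact while making the hits easier to compare.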

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001150
- InterPro:   IPR019777
- InterPro:   IPR004184
- InterPro:   IPR010098 [H]

Pfam domain/function: PF01228 Gly_radical; PF02901 PFL [H]

EC number: 2.3.1.54 [H]

Molecular weight: Translated: 95431; Mature: 95431

Theoretical pI: Translated: 5.19; Mature: 5.19

Prosite motif: PS00850 GLY_RADICAL_1; PS51149 GLY_RADICAL_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

2.2 %Cys     (Translated Protein)
4.5 %Met     (Translated Protein)
6.7 %Cys+Met (Translated Protein)
2.2 %Cys     (Mature Protein)
4.5 %Met     (Mature Protein)
6.7 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MDIREFSNKFMEATKNMSDEERAGLMKMFQSVSNEITREEPATSKVACDNNGEIPDGMTE
CCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCEEECCCCCCCCHHHH
RLVKLKETYLKHVPTITTHRARAITKIAKENPGTPKSVLRGKCFKHCCETAPLVIQDNEL
HHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCEEEECCEE
IVGAPNGQPRAGAFSPDIAWRWMVDEIDTIGTRPQDPFYISEEDKKIMREELFPYWAGKS
EEECCCCCCCCCCCCCCCEEEEEHHHHHHCCCCCCCCEEECCHHHHHHHHHHCCCCCCCC
VDEYCEDQYREAGVWELSGESFVSDCSYHAVNGGGDSNPGYDVVLMKKGMLDIKREAEEK
HHHHHHHHHHHCCCEEECCCHHHHCCCCEEECCCCCCCCCEEEEEEECCCHHHHHHHHHH
LAELKYENPEDIDKIYFYKSLIDTAEGVMIYAKRMSDYAAELAQKETNPKRKAELLKISE
HHHCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH
INARVPAHKPSTYWEAIQAVWTIESLLVVEENQTGMSIGRVDQYMYPFYKADIEAGRMTD
CCCCCCCCCCHHHHHHHHHHHHHHHHEEEECCCCCCCHHHHHHHHCCHHHHCCCCCCCCC
YEAFELSGCMLIKMSEMMWITSEGGSKFFAGYQPFVNMCVGGVTREGRDATNELTYLLMD
CCHHCCCCEEEEEEECEEEEECCCCCCCCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHH
AVRHVKIYQPSLACRIHKSSPQKYLKKIVDVIRAGMGFPACHFDDVHIKIMLAKGVSIED
HHHHHEEECCCEEEEEECCCHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEEEECCCCCCC
ARDYCLMGCVEPQKSGRLYQWTSTGYTQWPICIELVLNNGVPLWYGKQVCPDMGDLSQFK
CCCHHHHCCCCCCCCCCEEEEECCCCCCCCHHEEEECCCCCCEEECHHHCCCCCCHHHHH
TYEQFEAAVKEQIKFITKWTSVATVISQRVHKELAPKPLMSMMYEGCMENGRGVEAGGAM
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCCCE
YNFGPGVVWSGLATYADSMAAIKKLVFEDKKYTLQEMNEALKADFVGYEQLRKDCLEAPK
EECCCCHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHCCC
YGNDDDYADLIAADLINFTEQEHRKYKTLYSVLSHGTLSISNNTPFGQMTGATANGRRAW
CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCEEE
MPLSDGISPSQGADFKGPTSIIKSVSKMSCEDMNIGMVHNFKLIAGLLDTPEGEQGIITL
CCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHHCCCCCCHHHHHH
LRSACALQLGEVQFNYLDNKTLIEAQKHPDQYRDLIVRVAGYSAFFVELCKDVQDEIISR
HHHHHHHHHCCEEEEECCCCEEEHHHCCCHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHH
TMLTHF
HHHHCC
>Mature Secondary Structure
MDIREFSNKFMEATKNMSDEERAGLMKMFQSVSNEITREEPATSKVACDNNGEIPDGMTE
CCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCEEECCCCCCCCHHHH
RLVKLKETYLKHVPTITTHRARAITKIAKENPGTPKSVLRGKCFKHCCETAPLVIQDNEL
HHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCEEEECCEE
IVGAPNGQPRAGAFSPDIAWRWMVDEIDTIGTRPQDPFYISEEDKKIMREELFPYWAGKS
EEECCCCCCCCCCCCCCCEEEEEHHHHHHCCCCCCCCEEECCHHHHHHHHHHCCCCCCCC
VDEYCEDQYREAGVWELSGESFVSDCSYHAVNGGGDSNPGYDVVLMKKGMLDIKREAEEK
HHHHHHHHHHHCCCEEECCCHHHHCCCCEEECCCCCCCCCEEEEEEECCCHHHHHHHHHH
LAELKYENPEDIDKIYFYKSLIDTAEGVMIYAKRMSDYAAELAQKETNPKRKAELLKISE
HHHCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH
INARVPAHKPSTYWEAIQAVWTIESLLVVEENQTGMSIGRVDQYMYPFYKADIEAGRMTD
CCCCCCCCCCHHHHHHHHHHHHHHHHEEEECCCCCCCHHHHHHHHCCHHHHCCCCCCCCC
YEAFELSGCMLIKMSEMMWITSEGGSKFFAGYQPFVNMCVGGVTREGRDATNELTYLLMD
CCHHCCCCEEEEEEECEEEEECCCCCCCCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHH
AVRHVKIYQPSLACRIHKSSPQKYLKKIVDVIRAGMGFPACHFDDVHIKIMLAKGVSIED
HHHHHEEECCCEEEEEECCCHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEEEECCCCCCC
ARDYCLMGCVEPQKSGRLYQWTSTGYTQWPICIELVLNNGVPLWYGKQVCPDMGDLSQFK
CCCHHHHCCCCCCCCCCEEEEECCCCCCCCHHEEEECCCCCCEEECHHHCCCCCCHHHHH
TYEQFEAAVKEQIKFITKWTSVATVISQRVHKELAPKPLMSMMYEGCMENGRGVEAGGAM
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCCCE
YNFGPGVVWSGLATYADSMAAIKKLVFEDKKYTLQEMNEALKADFVGYEQLRKDCLEAPK
EECCCCHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHCCC
YGNDDDYADLIAADLINFTEQEHRKYKTLYSVLSHGTLSISNNTPFGQMTGATANGRRAW
CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCEEE
MPLSDGISPSQGADFKGPTSIIKSVSKMSCEDMNIGMVHNFKLIAGLLDTPEGEQGIITL
CCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHHCCCCCCHHHHHH
LRSACALQLGEVQFNYLDNKTLIEAQKHPDQYRDLIVRVAGYSAFFVELCKDVQDEIISR
HHHHHHHHHCCEEEEECCCCEEEHHHCCCHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHH
TMLTHF
HHHHCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8265357; 9278503; 7773398 [H]