Definition | Clostridium botulinum A str. ATCC 3502, complete genome. |
---|---|
Accession | NC_009495 |
Length | 3,886,916 |
The map label for this gene is hbpA [H]
Identifier: 148378807
GI number: 148378807
Start: 922099
End: 924579
Strand: Reverse
Name: hbpA [H]
Synonym: CBO0815
Alternate gene names: 148378807
Gene position: 924579-922099 (Counterclockwise)
Preceding gene: 148378830
Following gene: 148378798
Centisome position: 23.79
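The coordinates above are enough to reproduce the gene length and the centisome value. The following is a minimal sketch, not the database's own code. It assumes that the centisome is the gene's 5' coordinate expressed as a percentage of the genome length, and that for a reverse-strand gene the 5' coordinate is the larger one (924579 here). That convention is inferred from the numbers in this record rather than stated by it, and the function names are hypothetical.

```python
GENOME_LENGTH = 3_886_916        # "Length" from the record header
START, END = 922_099, 924_579    # forward-strand coordinates of the gene
STRAND = "Reverse"

def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given inclusive, 1-based coordinates."""
    return end - start + 1

def centisome(start: int, end: int, strand: str, genome_length: int) -> float:
    """Position of the gene's 5' end as a percentage of the genome length.

    Assumption: the 5' end is `end` on the reverse strand, `start` otherwise.
    """
    five_prime = end if strand == "Reverse" else start
    return round(100.0 * five_prime / genome_length, 2)

print(gene_length(START, END))                        # 2481
print(centisome(START, END, STRAND, GENOME_LENGTH))   # 23.79
```

Under these assumptions both values agree with the record: 2481 bases and a centisome position of 23.79.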
GC content: 27.25
Gene sequence:
>2481_bases ATGTTTAATTTTAAATTAAAAAGATCTTCCATAGATAAAAATATTAATAGCTTAAATGAAAAAAATATAGAAAAGCCCTG TAATATTGCAATCTATGAAGATAACTTAAAAGTTCTAACTAATAACCAAAGAAAAATAGTAGACAAACTTGATAAAAAAA TAATAGAAACAGATTCTGTTACAGAATTACTAATTAAAATGACAAAGGATATATCTAATTATGTAGAAATGGAAATGGAT TCTATATCTAAAGTTACAGGAGAAATAAGTAACTATTCTGCCATAGCAGAGGAGGTTTTCTCCAGTACAGAAAATTCAAA GCAAATATCAGAAAGCACCATGGAAGTTGCTAAAGAAGGCAATGAAGCTGCTTTAAATTCCATAGAGGCTATGAAAGAAA TAGAAGGATCTATGTTATATTCTAAAACAGTAGTAAAGGATTTAAGTACTAAAGCTTTAGATATTAACAATATGCTAGAT GTAATAAAAGATATTGCAAACAACACTAATTTATTATCTTTAAATGCCTCTATTGAAGCTGCAAGAGCTGGCGAGGCTGG TAAAGGCTTCGCAGTAGTTGCACATGAAGTAAAAAAACTAGCAGAAAGAAGTATGGATTCCGTTGATTTTATAGGAAATA ATATAAAAGAAATAAATATTAGCATAGACAATGCTATAAAGGCTATAAATGAAACTATGAATAAAGTAAAAGAAGGTACC GAAATAGCTAATAAAACTATGGAAACCTTCAACAGCATAATATCTTCTATAAAAACTAGTACCTCTGTATCTGAAGAAAT TAATGATGCTATTACAAAACAAATTGGCCATTTAGAAAACGTAATTAACTCTACTGAAGAGATGAATACTACTTCTGAAA AGTTAATGTTTATAGTAGAATTAGCTTCTTTAAATACTCAATATACAAAAACTTCTTTAAAGGATTTATCCGAGGTTTCC CAAAACTTAAAATATATTAGTAATAATCTTTTAAATGAAATAGAAGTTGATTCTAAAGAAAATAATGTAATTTTAAATAC TTATATTAATGGTAGACCTCTATATTTAGACCCTGCTCTAAGCTATGAACTTAATAGTAGCCTTCTATTAAATAATATAC ATATAGGTCTTTTAACCATAAACTCCTATGGAGAAATATCTCCTGGAATAGCTAAAAGTTGGTATTTAGAAAAGGACAAT CTTACTTGGGTTTTTAATCTTAAAAAAGGAATAAAATTTCATAATGGTAAAGAGGTTACCTCAGAAGATGTAAAATTTTC ATTAGAAAGACTTTTAGATCCTAAACTTGACTCTCCCAACGGGTGGCTACTAGAAATTATAGAAGGATCTGAAGATTTTA AAAAAGGTGCAGCAAAGCATGTTTCTGGTATTAAGATATTGGATAAACATAGAATTTCTTTAACTTTATCCTATTCTTAC AGTGGATTTTTATTAAATCTTGGATTAGAACTTTGCGGTATAATAAATAAGGATTCTATAAATCAGGGGGATGTAGTAGG CTGTGGTCCTTATAAAATATCTGAATTTAATGATGAAGGATGCAAACTAGAGGCCTTCAAAGAATATTTTAATGGAGCCC CTTATATTGATATTATAAATATTAATTTTAAATCTGAATCTCCTATAGATGATTTCTTAAATAAAGCTTTAGATGTATTA ACCATAAATGATAAAAACGAATATACAACTTTATGTTCTAATAAAAATATAAATCTTATAGAGCAAGATTTACTAGCAAC TTACTATGCTAGTTTTAACATGAAATCTAACTCTATTTTTTCTAGGGATAAAGATGTAAGATATGCTCTTAACTTAGCCA TAGACAAGAATAGAATAATAAAGGATATATTAGGTGGATTGGGAGTTGAAGCAAAGGGTCCTTTCCCTCCAAGCATAATA 
CCTAATAATAAGTTAAGAGGTTTTTCTCATAATAAATCTAAAGCCAAAGAAATTCTTTCTAGAAGTGATTTTAATAGATC TCGAGATAAATTAAATATATTAATCCGAAAAGACGAGGATTCTTTATTTTCTAAGATAACAGAATATATATTAGAAGATT TAAAAAATATAGGAATAGATTGTATTGTAAAAGAAGTAAATTCTTCTGAATACTTAAACTTAGATAATATTTTAAAATGT GACATGGCTATAAGCAGATGGTGCGCTGATTCTGGAGATCCTGATAATTTCTTAGAACCAATATTTAATATAGAAAATGT ATCTAACATATCTAGGTATGACAATAAACTAGTAAATGAAAAATTAAAAAAAGCTAAAAACTTAATTAATCCTGAAAAAA GGAAAAAATTATATGAAGAAATACAGGAAATTATAGTAGAGGATGTTCCATGGATATTTTTATATCATCCTAAACTAGCT ATAGCAGTACAAAATAATATACTTGGATTGAATGCTAATCCTTTAGGACTTTTCAAATATGAAGATATAATAAAAAATTA G
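The GC content field can be recomputed from the gene sequence. The sketch below assumes GC content is the percentage of G and C bases over the full sequence length; `gc_content` is a hypothetical helper name. Whitespace is removed first because the sequence in this record is printed in space-separated chunks.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    # Drop spaces and line breaks, and normalise case.
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return round(100.0 * gc / len(seq), 2)

# Toy input, not taken from the record: 2 of 4 bases are G or C.
print(gc_content("ATGC"))  # 50.0
```

Passing the full 2481-base gene sequence to `gc_content` should give a value comparable to the 27.25 reported above, provided the database uses the same definition.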
Upstream 100 bases:
>100_bases TTTTAAAAATTATATTTAAATGAATATTTTTCATACTTTAACAAATTATTTAAATGAGTTAAAATGTTGGTAGTAACTAT TATTTTATAGGGGGATTGAA
Downstream 100 bases:
>100_bases ATACCTCATATTTTTATCTTGTGAAAAGGGTTGTCTCAAAATAGATTTAATTTTGAGACAACTTTTAGTTTTATTATATT AAAAATCCGCTGCTTGTGGA
Product: methyl-accepting chemotaxis protein / extracellular solute-binding protein, family 5
Products: ADP; phosphate; dipeptides [Cytoplasm] [C]
Alternate protein names: Hemin-binding lipoprotein [H]
Number of amino acids: Translated: 826; Mature: 826
Protein sequence:
>826_residues MFNFKLKRSSIDKNINSLNEKNIEKPCNIAIYEDNLKVLTNNQRKIVDKLDKKIIETDSVTELLIKMTKDISNYVEMEMD SISKVTGEISNYSAIAEEVFSSTENSKQISESTMEVAKEGNEAALNSIEAMKEIEGSMLYSKTVVKDLSTKALDINNMLD VIKDIANNTNLLSLNASIEAARAGEAGKGFAVVAHEVKKLAERSMDSVDFIGNNIKEINISIDNAIKAINETMNKVKEGT EIANKTMETFNSIISSIKTSTSVSEEINDAITKQIGHLENVINSTEEMNTTSEKLMFIVELASLNTQYTKTSLKDLSEVS QNLKYISNNLLNEIEVDSKENNVILNTYINGRPLYLDPALSYELNSSLLLNNIHIGLLTINSYGEISPGIAKSWYLEKDN LTWVFNLKKGIKFHNGKEVTSEDVKFSLERLLDPKLDSPNGWLLEIIEGSEDFKKGAAKHVSGIKILDKHRISLTLSYSY SGFLLNLGLELCGIINKDSINQGDVVGCGPYKISEFNDEGCKLEAFKEYFNGAPYIDIININFKSESPIDDFLNKALDVL TINDKNEYTTLCSNKNINLIEQDLLATYYASFNMKSNSIFSRDKDVRYALNLAIDKNRIIKDILGGLGVEAKGPFPPSII PNNKLRGFSHNKSKAKEILSRSDFNRSRDKLNILIRKDEDSLFSKITEYILEDLKNIGIDCIVKEVNSSEYLNLDNILKC DMAISRWCADSGDPDNFLEPIFNIENVSNISRYDNKLVNEKLKKAKNLINPEKRKKLYEEIQEIIVEDVPWIFLYHPKLA IAVQNNILGLNANPLGLFKYEDIIKN
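The protein sequence follows from the gene sequence by codon translation: 2481 bases make 827 codons, of which the last (TAG) is a stop codon, leaving 826 residues. The sketch below assumes the standard codon assignments (which the bacterial genetic code shares) and reads the sequence in frame from its first base; `translate` and `CODON_TABLE` are illustrative names, not part of the record.

```python
from itertools import product

# Standard codon table, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # remove spaces between printed chunks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons of the gene sequence in this record.
print(translate("ATGTTTAATTTTAAATTAAAAAGA"))  # MFNFKLKR
# Last three codons of the gene sequence, ending in the TAG stop.
print(translate("AAAAATTAG"))                 # KN
```

The two fragments reproduce the first eight and last two residues of the protein sequence listed above.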
Sequences:
>Translated_826_residues MFNFKLKRSSIDKNINSLNEKNIEKPCNIAIYEDNLKVLTNNQRKIVDKLDKKIIETDSVTELLIKMTKDISNYVEMEMD SISKVTGEISNYSAIAEEVFSSTENSKQISESTMEVAKEGNEAALNSIEAMKEIEGSMLYSKTVVKDLSTKALDINNMLD VIKDIANNTNLLSLNASIEAARAGEAGKGFAVVAHEVKKLAERSMDSVDFIGNNIKEINISIDNAIKAINETMNKVKEGT EIANKTMETFNSIISSIKTSTSVSEEINDAITKQIGHLENVINSTEEMNTTSEKLMFIVELASLNTQYTKTSLKDLSEVS QNLKYISNNLLNEIEVDSKENNVILNTYINGRPLYLDPALSYELNSSLLLNNIHIGLLTINSYGEISPGIAKSWYLEKDN LTWVFNLKKGIKFHNGKEVTSEDVKFSLERLLDPKLDSPNGWLLEIIEGSEDFKKGAAKHVSGIKILDKHRISLTLSYSY SGFLLNLGLELCGIINKDSINQGDVVGCGPYKISEFNDEGCKLEAFKEYFNGAPYIDIININFKSESPIDDFLNKALDVL TINDKNEYTTLCSNKNINLIEQDLLATYYASFNMKSNSIFSRDKDVRYALNLAIDKNRIIKDILGGLGVEAKGPFPPSII PNNKLRGFSHNKSKAKEILSRSDFNRSRDKLNILIRKDEDSLFSKITEYILEDLKNIGIDCIVKEVNSSEYLNLDNILKC DMAISRWCADSGDPDNFLEPIFNIENVSNISRYDNKLVNEKLKKAKNLINPEKRKKLYEEIQEIIVEDVPWIFLYHPKLA IAVQNNILGLNANPLGLFKYEDIIKN >Mature_826_residues MFNFKLKRSSIDKNINSLNEKNIEKPCNIAIYEDNLKVLTNNQRKIVDKLDKKIIETDSVTELLIKMTKDISNYVEMEMD SISKVTGEISNYSAIAEEVFSSTENSKQISESTMEVAKEGNEAALNSIEAMKEIEGSMLYSKTVVKDLSTKALDINNMLD VIKDIANNTNLLSLNASIEAARAGEAGKGFAVVAHEVKKLAERSMDSVDFIGNNIKEINISIDNAIKAINETMNKVKEGT EIANKTMETFNSIISSIKTSTSVSEEINDAITKQIGHLENVINSTEEMNTTSEKLMFIVELASLNTQYTKTSLKDLSEVS QNLKYISNNLLNEIEVDSKENNVILNTYINGRPLYLDPALSYELNSSLLLNNIHIGLLTINSYGEISPGIAKSWYLEKDN LTWVFNLKKGIKFHNGKEVTSEDVKFSLERLLDPKLDSPNGWLLEIIEGSEDFKKGAAKHVSGIKILDKHRISLTLSYSY SGFLLNLGLELCGIINKDSINQGDVVGCGPYKISEFNDEGCKLEAFKEYFNGAPYIDIININFKSESPIDDFLNKALDVL TINDKNEYTTLCSNKNINLIEQDLLATYYASFNMKSNSIFSRDKDVRYALNLAIDKNRIIKDILGGLGVEAKGPFPPSII PNNKLRGFSHNKSKAKEILSRSDFNRSRDKLNILIRKDEDSLFSKITEYILEDLKNIGIDCIVKEVNSSEYLNLDNILKC DMAISRWCADSGDPDNFLEPIFNIENVSNISRYDNKLVNEKLKKAKNLINPEKRKKLYEEIQEIIVEDVPWIFLYHPKLA IAVQNNILGLNANPLGLFKYEDIIKN
Specific function: Important role in heme acquisition or metabolism [H]
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cell inner membrane; Lipid-anchor [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the bacterial solute-binding protein 5 family [H]
Homologues:
- Organism=Escherichia coli, GI1789966, Length=509, Percent_Identity=24.3614931237721, Blast_Score=160, Evalue=3e-40
- Organism=Escherichia coli, GI1787052, Length=495, Percent_Identity=24.8484848484848, Blast_Score=132, Evalue=6e-32
- Organism=Escherichia coli, GI87081878, Length=478, Percent_Identity=25.3138075313808, Blast_Score=126, Evalue=7e-30
- Organism=Escherichia coli, GI1787762, Length=489, Percent_Identity=23.721881390593, Blast_Score=107, Evalue=3e-24
- Organism=Escherichia coli, GI1787551, Length=472, Percent_Identity=23.0932203389831, Blast_Score=102, Evalue=1e-22
- Organism=Escherichia coli, GI1788194, Length=306, Percent_Identity=27.7777777777778, Blast_Score=100, Evalue=7e-22
- Organism=Escherichia coli, GI1789453, Length=257, Percent_Identity=26.0700389105058, Blast_Score=96, Evalue=7e-21
- Organism=Escherichia coli, GI1787495, Length=466, Percent_Identity=23.6051502145923, Blast_Score=92, Evalue=1e-19
- Organism=Escherichia coli, GI2367378, Length=228, Percent_Identity=29.3859649122807, Blast_Score=92, Evalue=1e-19
- Organism=Escherichia coli, GI1788195, Length=309, Percent_Identity=24.2718446601942, Blast_Score=91, Evalue=2e-19
- Organism=Escherichia coli, GI1787690, Length=204, Percent_Identity=32.843137254902, Blast_Score=82, Evalue=1e-16
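Each homologue entry is a comma-separated run of `key=value` fields plus a bare `GI...` identifier, so it can be turned into a structured record for sorting or filtering. The sketch below is one possible parser for a single entry; `parse_homologue` is a hypothetical name, and the field handling is an assumption based only on the entries shown here.

```python
def parse_homologue(entry: str) -> dict:
    """Parse one homologue entry into a dict with typed numeric fields."""
    fields = {}
    # Tolerate a leading list marker ("- ") and a trailing comma.
    for part in entry.strip().lstrip("- ").split(","):
        part = part.strip()
        if not part:
            continue
        if "=" in part:
            key, value = part.split("=", 1)
            fields[key.strip()] = value.strip()
        elif part.startswith("GI"):
            fields["GI"] = part[2:]  # keep the identifier as a string
    casts = {"Length": int, "Percent_Identity": float,
             "Blast_Score": int, "Evalue": float}
    for key, cast in casts.items():
        if key in fields:
            fields[key] = cast(fields[key])
    return fields

hit = parse_homologue(
    "Organism=Escherichia coli, GI1789966, Length=509, "
    "Percent_Identity=24.3614931237721, Blast_Score=160, Evalue=3e-40"
)
print(hit["GI"], hit["Length"], round(hit["Percent_Identity"], 1), hit["Evalue"])
```

With the entries parsed this way, the percent identities can be rounded for display without changing the stored values.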
Paralogues:
None
Copy number:
- 660 Molecules/Cell In: Growth Phase, Minimal Media (Based on E. coli)
- 2980 Molecules/Cell In: Growth Phase, Minimal Media (Based on E. coli)
- 40 Molecules/Cell In: Stationary Phase, Rich Media (Based on E. coli) [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000914 [H]
Pfam domain/function: PF00496 SBP_bac_5 [H]
EC number: NA
Molecular weight: Translated: 92987; Mature: 92987
Theoretical pI: Translated: 4.91; Mature: 4.91
Prosite motif: PS01040 SBP_BACTERIAL_5 ; PS50111 CHEMOTAXIS_TRANSDUC_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
- Translated protein: 1.0 % Cys; 1.8 % Met; 2.8 % Cys+Met
- Mature protein: 1.0 % Cys; 1.8 % Met; 2.8 % Cys+Met
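The Cys/Met percentages are simple residue frequencies over the protein length. The sketch below assumes they are counts of C and M divided by the total number of residues and rounded to one decimal place; `residue_percent` is a hypothetical helper name.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = "".join(protein.split()).upper()  # remove spaces between chunks
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in residues.upper())
    return round(100.0 * count / len(protein), 1)

# Toy 10-residue input, not taken from the record: one Cys and one Met.
toy = "MCAAAAAAAA"
print(residue_percent(toy, "C"))   # 10.0
print(residue_percent(toy, "M"))   # 10.0
print(residue_percent(toy, "CM"))  # 20.0
```

Applied to the 826-residue protein sequence in this record, the same function should give values comparable to those listed above if the database uses the same definition.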
Secondary structure:
>Translated Secondary Structure MFNFKLKRSSIDKNINSLNEKNIEKPCNIAIYEDNLKVLTNNQRKIVDKLDKKIIETDSV CCCEEECHHHHHHHHHHHCCCCCCCCCCEEEEECCEEEEECCHHHHHHHHHHHHHCCCHH TELLIKMTKDISNYVEMEMDSISKVTGEISNYSAIAEEVFSSTENSKQISESTMEVAKEG HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCC NEAALNSIEAMKEIEGSMLYSKTVVKDLSTKALDINNMLDVIKDIANNTNLLSLNASIEA CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCCCCEEEECCCCCH ARAGEAGKGFAVVAHEVKKLAERSMDSVDFIGNNIKEINISIDNAIKAINETMNKVKEGT HCCCCCCCCHHHHHHHHHHHHHHCCCCHHHHCCCEEEEEEEHHHHHHHHHHHHHHHHHHH EIANKTMETFNSIISSIKTSTSVSEEINDAITKQIGHLENVINSTEEMNTTSEKLMFIVE HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHEEH LASLNTQYTKTSLKDLSEVSQNLKYISNNLLNEIEVDSKENNVILNTYINGRPLYLDPAL HHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHEECCCCCCCEEEEEEECCEEEEECCCC SYELNSSLLLNNIHIGLLTINSYGEISPGIAKSWYLEKDNLTWVFNLKKGIKFHNGKEVT CCCCCCCEEEEEEEEEEEEECCCCCCCCCCCCCEEEECCCEEEEEEECCCCEECCCCCCC SEDVKFSLERLLDPKLDSPNGWLLEIIEGSEDFKKGAAKHVSGIKILDKHRISLTLSYSY HHHHHHHHHHHCCCCCCCCCCCEEEEECCCHHHHHHHHHHHCCCEEEECCEEEEEEEECC SGFLLNLGLELCGIINKDSINQGDVVGCGPYKISEFNDEGCKLEAFKEYFNGAPYIDIIN CCHHHHCCHHHHCCCCCCCCCCCCEEECCCEEECCCCCCCCHHHHHHHHHCCCCEEEEEE INFKSESPIDDFLNKALDVLTINDKNEYTTLCSNKNINLIEQDLLATYYASFNMKSNSIF EECCCCCCHHHHHHHHEEEEEECCCCCEEEEECCCCCCHHHHHHHHHHHHHCCCCCCCCC SRDKDVRYALNLAIDKNRIIKDILGGLGVEAKGPFPPSIIPNNKLRGFSHNKSKAKEILS CCCCCEEEEEEEEECHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHH RSDFNRSRDKLNILIRKDEDSLFSKITEYILEDLKNIGIDCIVKEVNSSEYLNLDNILKC HCCCCCCCCEEEEEEECCCHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHH DMAISRWCADSGDPDNFLEPIFNIENVSNISRYDNKLVNEKLKKAKNLINPEKRKKLYEE HHHHHHHHCCCCCCHHHHHHHHCHHCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHH IQEIIVEDVPWIFLYHPKLAIAVQNNILGLNANPLGLFKYEDIIKN HHHHHHHCCCEEEEECCEEEEEEECCEEECCCCCCEEEEHHHHHCC >Mature Secondary Structure MFNFKLKRSSIDKNINSLNEKNIEKPCNIAIYEDNLKVLTNNQRKIVDKLDKKIIETDSV CCCEEECHHHHHHHHHHHCCCCCCCCCCEEEEECCEEEEECCHHHHHHHHHHHHHCCCHH TELLIKMTKDISNYVEMEMDSISKVTGEISNYSAIAEEVFSSTENSKQISESTMEVAKEG HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCC 
NEAALNSIEAMKEIEGSMLYSKTVVKDLSTKALDINNMLDVIKDIANNTNLLSLNASIEA CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCCCCEEEECCCCCH ARAGEAGKGFAVVAHEVKKLAERSMDSVDFIGNNIKEINISIDNAIKAINETMNKVKEGT HCCCCCCCCHHHHHHHHHHHHHHCCCCHHHHCCCEEEEEEEHHHHHHHHHHHHHHHHHHH EIANKTMETFNSIISSIKTSTSVSEEINDAITKQIGHLENVINSTEEMNTTSEKLMFIVE HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHEEH LASLNTQYTKTSLKDLSEVSQNLKYISNNLLNEIEVDSKENNVILNTYINGRPLYLDPAL HHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHEECCCCCCCEEEEEEECCEEEEECCCC SYELNSSLLLNNIHIGLLTINSYGEISPGIAKSWYLEKDNLTWVFNLKKGIKFHNGKEVT CCCCCCCEEEEEEEEEEEEECCCCCCCCCCCCCEEEECCCEEEEEEECCCCEECCCCCCC SEDVKFSLERLLDPKLDSPNGWLLEIIEGSEDFKKGAAKHVSGIKILDKHRISLTLSYSY HHHHHHHHHHHCCCCCCCCCCCEEEEECCCHHHHHHHHHHHCCCEEEECCEEEEEEEECC SGFLLNLGLELCGIINKDSINQGDVVGCGPYKISEFNDEGCKLEAFKEYFNGAPYIDIIN CCHHHHCCHHHHCCCCCCCCCCCCEEECCCEEECCCCCCCCHHHHHHHHHCCCCEEEEEE INFKSESPIDDFLNKALDVLTINDKNEYTTLCSNKNINLIEQDLLATYYASFNMKSNSIF EECCCCCCHHHHHHHHEEEEEECCCCCEEEEECCCCCCHHHHHHHHHHHHHCCCCCCCCC SRDKDVRYALNLAIDKNRIIKDILGGLGVEAKGPFPPSIIPNNKLRGFSHNKSKAKEILS CCCCCEEEEEEEEECHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHH RSDFNRSRDKLNILIRKDEDSLFSKITEYILEDLKNIGIDCIVKEVNSSEYLNLDNILKC HCCCCCCCCEEEEEEECCCHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHH DMAISRWCADSGDPDNFLEPIFNIENVSNISRYDNKLVNEKLKKAKNLINPEKRKKLYEE HHHHHHHHCCCCCCHHHHHHHHCHHCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHH IQEIIVEDVPWIFLYHPKLAIAVQNNILGLNANPLGLFKYEDIIKN HHHHHHHCCCEEEEECCEEEEEEECCEEECCCCCCEEEEHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: ATP; dipeptides [Periplasm]; H2O [C]
Specific reaction: ATP + dipeptides [Periplasm] + H2O = ADP + phosphate + dipeptides [Cytoplasm] [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 1339409; 7542800; 2041470 [H]