| Definition | Clostridium botulinum A str. Hall, complete genome. |
|---|---|
| Accession | NC_009698 |
| Length | 3,760,560 |
Click here to switch to the map view.
The map label for this gene is mcpB [H]
Identifier: 153937753
GI number: 153937753
Start: 1869277
End: 1871298
Strand: Direct
Name: mcpB [H]
Synonym: CLC_1760
Alternate gene names: 153937753
Gene position: 1869277-1871298 (Clockwise)
Preceding gene: 153936973
Following gene: 153937823
Centisome position: 49.71
GC content: 27.6
Gene sequence:
>2022_bases ATGTTTACTAAAGAAAATCTTAAGAAACTTAGTTTTAAAAAGAAGATAGTAATAACTTCTCTATTGATTTTTTTAATTTC AATGAGTTTATTAACGGGTTTTACTTTTAAAATGGTAAGTAGTAAATTCAAAAATCAAGTTAGAGAAGATGGGTTAAACT TAGCAAATCAAGTAAGCTATCAAATTACTTCATCTCAAAAAGCTACAAAGGAGATAGATAAAGTTTTAGCAGACAAAGTT TTATCCATTTCAAAGATGGTTATAGAAAACAAAAATATTTCTAATGAATATTTAACCAGTGTTGCAGTAAAATGTAATAT ACAAGAAATAAATATAACAGATAAAAATGGGAAAATAATATATTCAAATATGCCAGAAAATATAAACTATGTATATCCCT TGGATTATTCTGGACAAGATATATTAAAGGGTAAAAAAGATGAAATTATTGAAGAAATAAGACAAAATAAAGTAAATCAT AATTATTATAAGTATTGTGCATTAGCTATGCCTAATGGTGGTCTAATACAAACAGGTATTAATGCAAATGAAATCCACAA TATTAATAAATCTGTAGATCCACAAATTATATTAGAAAAATTAACAAAAGATAGTAATATAAAATTTGCTTTAGTTATGG ATAATAAACTTAAAGTTACCCACCATAGTGATAAAAAAAGGATAGGAAAAATTTTAACTGATACTGGAAGTAAAACTGCA ATAGATACTGGGAAGAATTATACATCTACATATGATTATGAAGGTGAAAAAGTTTATGATATTATAATGCCTTTAAAAGA TGAAAGTGGTAAACTATTAGGATCTATGGATATAGGAGTATCTTTAGCTACTCAAGAAACCGCTCTTAGAAATATATTAA TAACTTCTATATTGATAAGCTTAATAACTTTTGTTTTAGCTGGATTAGTAATACTTTATATAATAAAACTATCTTTAAAA CCACTAGATAATTTGTCTAGTATAGCTCAAAAGGTTTCAAAGGGTGATTTAACAGAAAAGGTTGAAATTGTTAATGAAGA TGAAATTGGAAAGCTAAGCAAAATATTTAATACTATGATAGATAGTTTAAGAGAAATTACACGAAATATAAATAATTTTT CAATACAGTTAGCTGGTTCCTCTCAGGAAATTTTATCCTCTGCAGAGCAAACTTCAGCAGTATCAGAGGAAATTTCTAGT GCTACTGAAGAAATTGCTTCGGGAGCTGAAAATCAAGTGAAGGCTAGTAATGAATCTTCCCTATTAATGAATGATGTTAT GGGAAATATGTATACTTTAAAAGAGGAGTTTGATGAAATAATATCTTTTTCTAATAATACGAATACATTAGCTTCAAAGG GACAAGAAAACATGTCTAATATGGTACAGCAAATGGCTACAATAAAAAATAGTGTGGTAAATTCATCTAATATAATGTAT GATTTACAAAAGAATTCAGAGGAAATAGGAAATATTGTAGAAATTATAAATACTATAGCAGATCAAACTAATTTATTAGC ATTAAATGCCTCCATTGAAGCTGCTAGAGCAGGAGAAGCTGGAAAAGGTTTTGCAGTAGTAGCAGATGAAGTTAGAAAAC TTGCGGAAGAATCTATAAATTCTGCTAATAATATTAAGAATTTAATAATGAATACTCAAGATAAAACTAAGACTGCTTTA AATTCTATAAAGGATGGTGCTTCACAATCTGAAAAAGGTGAAAGCATAGTTGCTGAGGTAAAAGAATCTTTAGGAGAAAT TTTAAATTCATTTTCTAATGTGAACCATAAGTTTGCAAGTGTAGATTCTATGATTACAGCTTCCAATGATAGTATTACAG CTATGGCATCAAAATTATATGATATAGAAACCATATCTAATACAGCCTCCGCTAATACAGAAGAGGTAGCTGCTTCTACA GAAGAACAGAGCGCTACCATAGAAGAAATTACTGAATCTATAGAGAAGTTAGTTAGTATGGTAGAAAATTTAAAGGAAAG TGTATCTATATTTAAACTTTAA
Upstream 100 bases:
>100_bases AGTTTAAATTTTGATTTTCGTTTGATTTAAATTTAAAAAATCATTATAATATACTTACATAATGCTAAATTATGTAATAA TATTTAGGGGGATTAGTATA
Downstream 100 bases:
>100_bases CTAATATAAAATGGAACCTAGATTAAAAAATCCAGGTTCCATTTTATATTCCTATATATATTAGATAAAGCCCCATAGCT AATATAATTATTCCTAATAT
Product: methyl-accepting chemotaxis protein
Products: NA
Alternate protein names: H3 [H]
Number of amino acids: Translated: 673; Mature: 673
Protein sequence:
>673_residues MFTKENLKKLSFKKKIVITSLLIFLISMSLLTGFTFKMVSSKFKNQVREDGLNLANQVSYQITSSQKATKEIDKVLADKV LSISKMVIENKNISNEYLTSVAVKCNIQEINITDKNGKIIYSNMPENINYVYPLDYSGQDILKGKKDEIIEEIRQNKVNH NYYKYCALAMPNGGLIQTGINANEIHNINKSVDPQIILEKLTKDSNIKFALVMDNKLKVTHHSDKKRIGKILTDTGSKTA IDTGKNYTSTYDYEGEKVYDIIMPLKDESGKLLGSMDIGVSLATQETALRNILITSILISLITFVLAGLVILYIIKLSLK PLDNLSSIAQKVSKGDLTEKVEIVNEDEIGKLSKIFNTMIDSLREITRNINNFSIQLAGSSQEILSSAEQTSAVSEEISS ATEEIASGAENQVKASNESSLLMNDVMGNMYTLKEEFDEIISFSNNTNTLASKGQENMSNMVQQMATIKNSVVNSSNIMY DLQKNSEEIGNIVEIINTIADQTNLLALNASIEAARAGEAGKGFAVVADEVRKLAEESINSANNIKNLIMNTQDKTKTAL NSIKDGASQSEKGESIVAEVKESLGEILNSFSNVNHKFASVDSMITASNDSITAMASKLYDIETISNTASANTEEVAAST EEQSATIEEITESIEKLVSMVENLKESVSIFKL
Sequences:
>Translated_673_residues MFTKENLKKLSFKKKIVITSLLIFLISMSLLTGFTFKMVSSKFKNQVREDGLNLANQVSYQITSSQKATKEIDKVLADKV LSISKMVIENKNISNEYLTSVAVKCNIQEINITDKNGKIIYSNMPENINYVYPLDYSGQDILKGKKDEIIEEIRQNKVNH NYYKYCALAMPNGGLIQTGINANEIHNINKSVDPQIILEKLTKDSNIKFALVMDNKLKVTHHSDKKRIGKILTDTGSKTA IDTGKNYTSTYDYEGEKVYDIIMPLKDESGKLLGSMDIGVSLATQETALRNILITSILISLITFVLAGLVILYIIKLSLK PLDNLSSIAQKVSKGDLTEKVEIVNEDEIGKLSKIFNTMIDSLREITRNINNFSIQLAGSSQEILSSAEQTSAVSEEISS ATEEIASGAENQVKASNESSLLMNDVMGNMYTLKEEFDEIISFSNNTNTLASKGQENMSNMVQQMATIKNSVVNSSNIMY DLQKNSEEIGNIVEIINTIADQTNLLALNASIEAARAGEAGKGFAVVADEVRKLAEESINSANNIKNLIMNTQDKTKTAL NSIKDGASQSEKGESIVAEVKESLGEILNSFSNVNHKFASVDSMITASNDSITAMASKLYDIETISNTASANTEEVAAST EEQSATIEEITESIEKLVSMVENLKESVSIFKL >Mature_673_residues MFTKENLKKLSFKKKIVITSLLIFLISMSLLTGFTFKMVSSKFKNQVREDGLNLANQVSYQITSSQKATKEIDKVLADKV LSISKMVIENKNISNEYLTSVAVKCNIQEINITDKNGKIIYSNMPENINYVYPLDYSGQDILKGKKDEIIEEIRQNKVNH NYYKYCALAMPNGGLIQTGINANEIHNINKSVDPQIILEKLTKDSNIKFALVMDNKLKVTHHSDKKRIGKILTDTGSKTA IDTGKNYTSTYDYEGEKVYDIIMPLKDESGKLLGSMDIGVSLATQETALRNILITSILISLITFVLAGLVILYIIKLSLK PLDNLSSIAQKVSKGDLTEKVEIVNEDEIGKLSKIFNTMIDSLREITRNINNFSIQLAGSSQEILSSAEQTSAVSEEISS ATEEIASGAENQVKASNESSLLMNDVMGNMYTLKEEFDEIISFSNNTNTLASKGQENMSNMVQQMATIKNSVVNSSNIMY DLQKNSEEIGNIVEIINTIADQTNLLALNASIEAARAGEAGKGFAVVADEVRKLAEESINSANNIKNLIMNTQDKTKTAL NSIKDGASQSEKGESIVAEVKESLGEILNSFSNVNHKFASVDSMITASNDSITAMASKLYDIETISNTASANTEEVAAST EEQSATIEEITESIEKLVSMVENLKESVSIFKL
Specific function: Chemotactic-signal transducers respond to changes in the concentration of attractants and repellents in the environment, transduce a signal from the outside to the inside of the cell, and facilitate sensory adaptation through the variation of the level of
COG id: COG0840
COG function: function code NT; Methyl-accepting chemotaxis protein
Gene ontology:
Cell location: Cell membrane; Multi-pass membrane protein [H]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 methyl-accepting transducer domain [H]
Homologues:
Organism=Escherichia coli, GI1787690, Length=399, Percent_Identity=28.0701754385965, Blast_Score=116, Evalue=5e-27, Organism=Escherichia coli, GI2367378, Length=300, Percent_Identity=28.6666666666667, Blast_Score=110, Evalue=2e-25, Organism=Escherichia coli, GI1789453, Length=306, Percent_Identity=30.3921568627451, Blast_Score=109, Evalue=7e-25, Organism=Escherichia coli, GI1788194, Length=389, Percent_Identity=26.4781491002571, Blast_Score=103, Evalue=5e-23, Organism=Escherichia coli, GI1788195, Length=281, Percent_Identity=29.5373665480427, Blast_Score=102, Evalue=6e-23,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR004010 - InterPro: IPR003122 - InterPro: IPR004089 - InterPro: IPR003660 - InterPro: IPR022094 [H]
Pfam domain/function: PF02743 Cache_1; PF00672 HAMP; PF12332 McpA_N; PF00015 MCPsignal; PF02203 TarH [H]
EC number: NA
Molecular weight: Translated: 74056; Mature: 74056
Theoretical pI: Translated: 4.80; Mature: 4.80
Prosite motif: PS50885 HAMP ; PS50111 CHEMOTAXIS_TRANSDUC_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.3 %Cys (Translated Protein) 3.1 %Met (Translated Protein) 3.4 %Cys+Met (Translated Protein) 0.3 %Cys (Mature Protein) 3.1 %Met (Mature Protein) 3.4 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MFTKENLKKLSFKKKIVITSLLIFLISMSLLTGFTFKMVSSKFKNQVREDGLNLANQVSY CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHE QITSSQKATKEIDKVLADKVLSISKMVIENKNISNEYLTSVAVKCNIQEINITDKNGKII EECCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHEEEEEEEEEEECCCCEEE YSNMPENINYVYPLDYSGQDILKGKKDEIIEEIRQNKVNHNYYKYCALAMPNGGLIQTGI ECCCCCCCCEEEECCCCCHHHHCCCHHHHHHHHHHCCCCCCCEEEEEEECCCCCEEEECC NANEIHNINKSVDPQIILEKLTKDSNIKFALVMDNKLKVTHHSDKKRIGKILTDTGSKTA CHHHHCCCCCCCCHHHHHHHHCCCCCCEEEEEECCEEEEEECCHHHHHHHHHHCCCCCCE IDTGKNYTSTYDYEGEKVYDIIMPLKDESGKLLGSMDIGVSLATQETALRNILITSILIS ECCCCCCCCCCCCCCCEEEEEEEECCCCCCCEEEEECCCEEEHHHHHHHHHHHHHHHHHH LITFVLAGLVILYIIKLSLKPLDNLSSIAQKVSKGDLTEKVEIVNEDEIGKLSKIFNTMI HHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCHHHHHHCCCCCHHHHHHHHHHHH DSLREITRNINNFSIQLAGSSQEILSSAEQTSAVSEEISSATEEIASGAENQVKASNESS HHHHHHHCCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHCCCCCCCH LLMNDVMGNMYTLKEEFDEIISFSNNTNTLASKGQENMSNMVQQMATIKNSVVNSSNIMY HHHHHHHCHHHHHHHHHHHHHCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHCCCCCEEE DLQKNSEEIGNIVEIINTIADQTNLLALNASIEAARAGEAGKGFAVVADEVRKLAEESIN EECCCHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHCCCCCCCHHHHHHHHHHHHHHHCC SANNIKNLIMNTQDKTKTALNSIKDGASQSEKGESIVAEVKESLGEILNSFSNVNHKFAS HHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCHHHH VDSMITASNDSITAMASKLYDIETISNTASANTEEVAASTEEQSATIEEITESIEKLVSM HHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHCCHHHHHHHHHHHHHHHHHHHH VENLKESVSIFKL HHHHHHHHHHHCC >Mature Secondary Structure MFTKENLKKLSFKKKIVITSLLIFLISMSLLTGFTFKMVSSKFKNQVREDGLNLANQVSY CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHE QITSSQKATKEIDKVLADKVLSISKMVIENKNISNEYLTSVAVKCNIQEINITDKNGKII EECCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHEEEEEEEEEEECCCCEEE YSNMPENINYVYPLDYSGQDILKGKKDEIIEEIRQNKVNHNYYKYCALAMPNGGLIQTGI ECCCCCCCCEEEECCCCCHHHHCCCHHHHHHHHHHCCCCCCCEEEEEEECCCCCEEEECC NANEIHNINKSVDPQIILEKLTKDSNIKFALVMDNKLKVTHHSDKKRIGKILTDTGSKTA CHHHHCCCCCCCCHHHHHHHHCCCCCCEEEEEECCEEEEEECCHHHHHHHHHHCCCCCCE IDTGKNYTSTYDYEGEKVYDIIMPLKDESGKLLGSMDIGVSLATQETALRNILITSILIS ECCCCCCCCCCCCCCCEEEEEEEECCCCCCCEEEEECCCEEEHHHHHHHHHHHHHHHHHH LITFVLAGLVILYIIKLSLKPLDNLSSIAQKVSKGDLTEKVEIVNEDEIGKLSKIFNTMI HHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCHHHHHHCCCCCHHHHHHHHHHHH DSLREITRNINNFSIQLAGSSQEILSSAEQTSAVSEEISSATEEIASGAENQVKASNESS HHHHHHHCCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHCCCCCCCH LLMNDVMGNMYTLKEEFDEIISFSNNTNTLASKGQENMSNMVQQMATIKNSVVNSSNIMY HHHHHHHCHHHHHHHHHHHHHCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHCCCCCEEE DLQKNSEEIGNIVEIINTIADQTNLLALNASIEAARAGEAGKGFAVVADEVRKLAEESIN EECCCHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHCCCCCCCHHHHHHHHHHHHHHHCC SANNIKNLIMNTQDKTKTALNSIKDGASQSEKGESIVAEVKESLGEILNSFSNVNHKFAS HHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCHHHH VDSMITASNDSITAMASKLYDIETISNTASANTEEVAASTEEQSATIEEITESIEKLVSM HHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHCCHHHHHHHHHHHHHHHHHHHH VENLKESVSIFKL HHHHHHHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 8188684; 9384377 [H]