Definition: Clostridium botulinum A str. Hall, complete genome.
Accession: NC_009698
Length: 3,760,560 bp

The map label for this gene is mcpB [H]

Identifier: 153936930

GI number: 153936930

Start: 1779738

End: 1781744

Strand: Direct

Name: mcpB [H]

Synonym: CLC_1690

Alternate gene names: 153936930

Gene position: 1779738-1781744 (Clockwise)

Preceding gene: 153934615

Following gene: 153935200

Centisome position: 47.33
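
The coordinate fields above are related by simple arithmetic. The sketch below assumes that the centisome position is the start coordinate expressed as a percentage of the genome length; that convention is an assumption, not something the record states, although it does reproduce the listed value. The function names are illustrative.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given inclusive 1-based coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the genome length (assumed convention)."""
    return 100.0 * start / genome_length


# Values taken from the record: Start, End, and genome Length.
length = gene_length(1779738, 1781744)
position = centisome(1779738, 3760560)

print(length)              # 2007
print(round(position, 2))  # 47.33
```
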

GC content: 28.9%

Gene sequence:

>2007_bases
ATGAGAAGTATAAAATCTAAAATAATAGCTATCATATCTATTGTTTGTATATTGAGTATAGGTTTATGCTCTTCCATAAG
TTATTATTTTTCATATAAAGCAATAATGAAAGAAACAACTAATAAAGTGAGTATGGCCTCTCAAAAATATTCAGAAATAA
TAGAGGGATGGCTATTAACAAAAACTAAATTTATAGATTCTATGATATTAGATATCCAATATAATGATAAATATGACAAA
AAATATTTAGAAGAATATTTTAGATTACAAGCTAAAGCTAATAAAGATATTATAAGTATATATTCAGGTTTTAATAATAA
GGAGTTTAGGTCTATTGAAGGAGTTCCACCTACCCAAAATTATGATTGCACACAAAGACCATGGTACAAAGATACAATTG
AAAAGAATGAAGTAATGTATTCTTCTCCTTATCTTGATCCAAATACAAAAAAAATGATAATAACTATAGCCAAGCCAATT
AAAAAGGATGGTAAATCTATAGGAGTTTTAGCAATAGATATAACTTTAGATTATATTAAAAATTCTGTTGAGCAGGCTAC
ACCTGTAGAAAAAAGTTATGGATTTTTATTAGATAAAGATAATAATTTCATAGTACATAGAAATAGAGAATTTCAGCCTA
AGGATGGAAAAAACTATAATGTAAAAGAAGTTATGGATAAGGGCTTAGAGAAATTAGCTTCCTTAGATTCTAAGAATAAT
AATGCTCTAATTTTAAAGGATTTTGATAATGAGAAAAAGGTCTTTACAAAAACAATAATACCTTCATCAAAATGGTCCAT
AGGATTCGTGGTACCATTATCAGAATTTAAAAAGCCATTAAATAATATAATAATTTCATTTATATCCATTGCTATTTTAT
GTTTGGTGGCAGGAACATTATTTGCCATATATTCAGCTAAAAGAATATCTGATCCTATTTTAAAAATAACTGAATTGGTA
AATGAAACTAAAAGTTTAAACTTAAAAGATGATTATAACTATGACTATATAAATTCATATAAAGATGAAGTAGGAATTAT
AGGAAAAGCAGTTATTCATCTAAGGGAAGAGTTAAGAAATATTATAGAAGAACTTAAAAACTCCTCTAATGATGTTTTAA
AATACTCAGAATCGATAAATGAAGCTACAGGGGAAACAGTACAATCTATAGATGCCATATCAAAAACTGTAGATGAATTA
GCACAGGGCTCTGTGGATCAGGCTAAGGATGCACAAAATGGTTCCGAAAGATTATTTACTTTAGCCGAAGAAATTAAAAT
AACAGATGAGAGTGCAGATTTAGTTAAAAAATATTCCTTAGAAACAAAGGAAAATAGTGAGAAGGGTATAGCAACTATGA
AAGAGACCATAGAAAAATTTAAGGAAAACAATAAAGTAAATAAAGAATTGGGAAATAATGTAGACATGTTAGCAAATAGG
TCTGGTTCAATAGGAGAAATAATAAATTCTATACAGTCTATTGCACAGCAAACAAATTTATTAGCCTTAAATGCAGCTAT
AGAAGCAGCCAGAGCAGGAGAGGTAGGGAAAGGTTTTGCAGTAGTAGCCGAAGAAATAAGAAAACTAGCAGAGCAAACCT
CTACTTCTACTAAAGAAATAGAGAATATAGTGGAAGAAATACAATTTGAAATTAATAAAACAAAGGATAATATGGATGTA
TCTCAGAGGGTAGTACAGGAGGTAAACGGAGCTATGAATATATCTAAAGAATCCTTTGATAACATTACAAACTCTATAGA
AATCATAGTAGAACAAATTGAGCTTCTGGTACATAATGTTAAAAAAGTAGATTCAGATAAAGATGAAGTTTTAGCATCTG
TACAAGGTATATCCGCTATAGCAGAGGAATCCGCAGCATCAACAGAAGAAGTATCCGCTACCGTTGAACAGCAAGCAGCT
TCTATGGAAAGTATGTCTCAAACTGCAGAAAACTTAAAAGAAATAGCAAGTACATTAGATACTGTAGTAAATAAATTTGA
AATTTAA
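
A GC content figure like the one listed for this gene can be recomputed from a FASTA-style block such as the one above. The following is a minimal sketch, assuming the header line starts with `>` and that only G and C are counted against the total number of bases; the function name and the toy input are hypothetical, and the record's own sequence is not reproduced here.

```python
def gc_percent(fasta_text: str) -> float:
    """Percentage of G and C bases in a FASTA-style text block."""
    # Drop header lines (starting with '>') and blank lines, then join the rest.
    seq = "".join(
        line.strip()
        for line in fasta_text.splitlines()
        if line.strip() and not line.startswith(">")
    ).upper()
    if not seq:
        raise ValueError("no sequence data found")
    gc = sum(seq.count(base) for base in "GC")
    return 100.0 * gc / len(seq)


# Toy input only: 2 of 8 bases are G or C.
example = """>toy_bases
ATGC
AATT
"""
print(round(gc_percent(example), 1))  # 25.0
```
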

Upstream 100 bases:

>100_bases
TTAAAAAAACAACAAAGTTTGTTCATTTTTTTAGATATTAGACGATATAAATTAATATAGTATACAAATTATATTTTAAT
AAAGATAGGAGTGTAGCAGT

Downstream 100 bases:

>100_bases
ATAGGTGTTATAAAGCCCCATGGTAATCCATGGGGCTTTTCAATAATATGAGTATATTTTTAAATCTTCTTATTTATTTT
GTTTTTTATGTAGCACAAAC

Product: methyl-accepting chemotaxis protein

Products: NA

Alternate protein names: H3 [H]

Number of amino acids: Translated: 668; Mature: 668
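
The residue count follows from the gene length: 2007 bases make 669 codons, and the final codon (TAA in the gene sequence) is a stop codon, leaving 668 residues. A small sketch of that check, with an illustrative function name:

```python
def expected_residues(n_bases: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by a coding sequence of n_bases bases."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = n_bases // 3
    # The terminal stop codon does not encode a residue.
    return codons - 1 if has_stop else codons


print(expected_residues(2007))  # 668
```
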

Protein sequence:

>668_residues
MRSIKSKIIAIISIVCILSIGLCSSISYYFSYKAIMKETTNKVSMASQKYSEIIEGWLLTKTKFIDSMILDIQYNDKYDK
KYLEEYFRLQAKANKDIISIYSGFNNKEFRSIEGVPPTQNYDCTQRPWYKDTIEKNEVMYSSPYLDPNTKKMIITIAKPI
KKDGKSIGVLAIDITLDYIKNSVEQATPVEKSYGFLLDKDNNFIVHRNREFQPKDGKNYNVKEVMDKGLEKLASLDSKNN
NALILKDFDNEKKVFTKTIIPSSKWSIGFVVPLSEFKKPLNNIIISFISIAILCLVAGTLFAIYSAKRISDPILKITELV
NETKSLNLKDDYNYDYINSYKDEVGIIGKAVIHLREELRNIIEELKNSSNDVLKYSESINEATGETVQSIDAISKTVDEL
AQGSVDQAKDAQNGSERLFTLAEEIKITDESADLVKKYSLETKENSEKGIATMKETIEKFKENNKVNKELGNNVDMLANR
SGSIGEIINSIQSIAQQTNLLALNAAIEAARAGEVGKGFAVVAEEIRKLAEQTSTSTKEIENIVEEIQFEINKTKDNMDV
SQRVVQEVNGAMNISKESFDNITNSIEIIVEQIELLVHNVKKVDSDKDEVLASVQGISAIAEESAASTEEVSATVEQQAA
SMESMSQTAENLKEIASTLDTVVNKFEI

Sequences:

>Translated_668_residues
MRSIKSKIIAIISIVCILSIGLCSSISYYFSYKAIMKETTNKVSMASQKYSEIIEGWLLTKTKFIDSMILDIQYNDKYDK
KYLEEYFRLQAKANKDIISIYSGFNNKEFRSIEGVPPTQNYDCTQRPWYKDTIEKNEVMYSSPYLDPNTKKMIITIAKPI
KKDGKSIGVLAIDITLDYIKNSVEQATPVEKSYGFLLDKDNNFIVHRNREFQPKDGKNYNVKEVMDKGLEKLASLDSKNN
NALILKDFDNEKKVFTKTIIPSSKWSIGFVVPLSEFKKPLNNIIISFISIAILCLVAGTLFAIYSAKRISDPILKITELV
NETKSLNLKDDYNYDYINSYKDEVGIIGKAVIHLREELRNIIEELKNSSNDVLKYSESINEATGETVQSIDAISKTVDEL
AQGSVDQAKDAQNGSERLFTLAEEIKITDESADLVKKYSLETKENSEKGIATMKETIEKFKENNKVNKELGNNVDMLANR
SGSIGEIINSIQSIAQQTNLLALNAAIEAARAGEVGKGFAVVAEEIRKLAEQTSTSTKEIENIVEEIQFEINKTKDNMDV
SQRVVQEVNGAMNISKESFDNITNSIEIIVEQIELLVHNVKKVDSDKDEVLASVQGISAIAEESAASTEEVSATVEQQAA
SMESMSQTAENLKEIASTLDTVVNKFEI
>Mature_668_residues
MRSIKSKIIAIISIVCILSIGLCSSISYYFSYKAIMKETTNKVSMASQKYSEIIEGWLLTKTKFIDSMILDIQYNDKYDK
KYLEEYFRLQAKANKDIISIYSGFNNKEFRSIEGVPPTQNYDCTQRPWYKDTIEKNEVMYSSPYLDPNTKKMIITIAKPI
KKDGKSIGVLAIDITLDYIKNSVEQATPVEKSYGFLLDKDNNFIVHRNREFQPKDGKNYNVKEVMDKGLEKLASLDSKNN
NALILKDFDNEKKVFTKTIIPSSKWSIGFVVPLSEFKKPLNNIIISFISIAILCLVAGTLFAIYSAKRISDPILKITELV
NETKSLNLKDDYNYDYINSYKDEVGIIGKAVIHLREELRNIIEELKNSSNDVLKYSESINEATGETVQSIDAISKTVDEL
AQGSVDQAKDAQNGSERLFTLAEEIKITDESADLVKKYSLETKENSEKGIATMKETIEKFKENNKVNKELGNNVDMLANR
SGSIGEIINSIQSIAQQTNLLALNAAIEAARAGEVGKGFAVVAEEIRKLAEQTSTSTKEIENIVEEIQFEINKTKDNMDV
SQRVVQEVNGAMNISKESFDNITNSIEIIVEQIELLVHNVKKVDSDKDEVLASVQGISAIAEESAASTEEVSATVEQQAA
SMESMSQTAENLKEIASTLDTVVNKFEI

Specific function: Chemotactic-signal transducers respond to changes in the concentration of attractants and repellents in the environment, transduce a signal from the outside to the inside of the cell, and facilitate sensory adaptation through the variation of the level of methylation.

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cell membrane; Multi-pass membrane protein [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 methyl-accepting transducer domain [H]

Homologues:

Organism=Escherichia coli, GI2367378, Length=301, Percent_Identity=25.2491694352159, Blast_Score=95, Evalue=1e-20,
Organism=Escherichia coli, GI1789453, Length=364, Percent_Identity=25.5494505494505, Blast_Score=92, Evalue=1e-19,
Organism=Escherichia coli, GI1788195, Length=235, Percent_Identity=28.5106382978723, Blast_Score=88, Evalue=2e-18,
Organism=Escherichia coli, GI1787690, Length=305, Percent_Identity=25.9016393442623, Blast_Score=85, Evalue=2e-17,
Organism=Escherichia coli, GI1788194, Length=189, Percent_Identity=31.7460317460317, Blast_Score=84, Evalue=4e-17,
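
Each homologue line above is a comma-separated list of `key=value` fields, plus a bare `GI…` identifier and a trailing comma. The sketch below shows one way such a line might be parsed into a dictionary; the function names and the choice to keep the GI number as a string are assumptions made for illustration.

```python
from typing import Dict, Union

Value = Union[str, int, float]


def _convert(raw: str) -> Value:
    """Convert a field value to int or float where possible, else keep the string."""
    for cast in (int, float):
        try:
            return cast(raw)
        except ValueError:
            continue
    return raw


def parse_homologue(line: str) -> Dict[str, Value]:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record: Dict[str, Value] = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, raw = field.split("=", 1)
            record[key.strip()] = _convert(raw.strip())
        elif field.startswith("GI"):
            # The GI identifier has no '=' sign; store the digits as a string.
            record["GI"] = field[2:]
    return record


sample = ("Organism=Escherichia coli, GI2367378, Length=301, "
          "Percent_Identity=25.2491694352159, Blast_Score=95, Evalue=1e-20,")
hit = parse_homologue(sample)
print(hit["Organism"], hit["GI"], hit["Length"], hit["Evalue"])
```

Once parsed, the hits can be sorted or filtered numerically, for example by `Evalue` or `Percent_Identity`.
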

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR004010
- InterPro:   IPR003122
- InterPro:   IPR004089
- InterPro:   IPR003660
- InterPro:   IPR022094 [H]

Pfam domain/function: PF02743 Cache_1; PF00672 HAMP; PF12332 McpA_N; PF00015 MCPsignal; PF02203 TarH [H]

EC number: NA

Molecular weight: Translated: 74906 Da; Mature: 74906 Da

Theoretical pI: Translated: 4.76; Mature: 4.76

Prosite motif: PS50111 CHEMOTAXIS_TRANSDUC_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
1.9 %Met     (Translated Protein)
2.5 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
1.9 %Met     (Mature Protein)
2.5 %Cys+Met (Mature Protein)
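
The percentages above are residue counts divided by the protein length. The listed 668-residue sequence appears to contain 4 Cys and 13 Met, which is consistent with the reported values. A minimal sketch, with an illustrative function name and a toy sequence:

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the given one-letter residues in a protein sequence."""
    seq = "".join(protein.split()).upper()  # tolerate line breaks and spaces
    if not seq:
        raise ValueError("empty sequence")
    hits = sum(seq.count(r) for r in residues.upper())
    return 100.0 * hits / len(seq)


# Toy sequence: 2 Met and 2 Cys in 8 residues.
toy = "MACDMKLC"
print(residue_percent(toy, "C"))   # 25.0
print(residue_percent(toy, "CM"))  # 50.0

# Arithmetic check against the record, assuming counts of 4 Cys and 13 Met.
print(round(100 * 4 / 668, 1), round(100 * 13 / 668, 1), round(100 * 17 / 668, 1))
# 0.6 1.9 2.5
```
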

Secondary structure:

>Translated Secondary Structure
MRSIKSKIIAIISIVCILSIGLCSSISYYFSYKAIMKETTNKVSMASQKYSEIIEGWLLT
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
KTKFIDSMILDIQYNDKYDKKYLEEYFRLQAKANKDIISIYSGFNNKEFRSIEGVPPTQN
HHHHHHHHHEEEEECCCHHHHHHHHHHHHHHHCCCCCEEHHCCCCCCCCCCCCCCCCCCC
YDCTQRPWYKDTIEKNEVMYSSPYLDPNTKKMIITIAKPIKKDGKSIGVLAIDITLDYIK
CCCCCCCCCHHHHCCCCEEEECCCCCCCCCEEEEEEEHHHHCCCCEEEEEEEEEEHHHHH
NSVEQATPVEKSYGFLLDKDNNFIVHRNREFQPKDGKNYNVKEVMDKGLEKLASLDSKNN
HHHHHCCCCHHHCCEEEECCCCEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCC
NALILKDFDNEKKVFTKTIIPSSKWSIGFVVPLSEFKKPLNNIIISFISIAILCLVAGTL
CEEEEECCCCCHHHHHHHCCCCCCCCEEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH
FAIYSAKRISDPILKITELVNETKSLNLKDDYNYDYINSYKDEVGIIGKAVIHLREELRN
HHHHHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH
IIEELKNSSNDVLKYSESINEATGETVQSIDAISKTVDELAQGSVDQAKDAQNGSERLFT
HHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHCCCCCHHHHHH
LAEEIKITDESADLVKKYSLETKENSEKGIATMKETIEKFKENNKVNKELGNNVDMLANR
HHHHHEECCCHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHCCCHHHHHCC
SGSIGEIINSIQSIAQQTNLLALNAAIEAARAGEVGKGFAVVAEEIRKLAEQTSTSTKEI
CCCHHHHHHHHHHHHHHHHHEEHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCHHHH
ENIVEEIQFEINKTKDNMDVSQRVVQEVNGAMNISKESFDNITNSIEIIVEQIELLVHNV
HHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHH
KKVDSDKDEVLASVQGISAIAEESAASTEEVSATVEQQAASMESMSQTAENLKEIASTLD
HHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
TVVNKFEI
HHHHHCCC
>Mature Secondary Structure
MRSIKSKIIAIISIVCILSIGLCSSISYYFSYKAIMKETTNKVSMASQKYSEIIEGWLLT
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
KTKFIDSMILDIQYNDKYDKKYLEEYFRLQAKANKDIISIYSGFNNKEFRSIEGVPPTQN
HHHHHHHHHEEEEECCCHHHHHHHHHHHHHHHCCCCCEEHHCCCCCCCCCCCCCCCCCCC
YDCTQRPWYKDTIEKNEVMYSSPYLDPNTKKMIITIAKPIKKDGKSIGVLAIDITLDYIK
CCCCCCCCCHHHHCCCCEEEECCCCCCCCCEEEEEEEHHHHCCCCEEEEEEEEEEHHHHH
NSVEQATPVEKSYGFLLDKDNNFIVHRNREFQPKDGKNYNVKEVMDKGLEKLASLDSKNN
HHHHHCCCCHHHCCEEEECCCCEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCC
NALILKDFDNEKKVFTKTIIPSSKWSIGFVVPLSEFKKPLNNIIISFISIAILCLVAGTL
CEEEEECCCCCHHHHHHHCCCCCCCCEEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH
FAIYSAKRISDPILKITELVNETKSLNLKDDYNYDYINSYKDEVGIIGKAVIHLREELRN
HHHHHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH
IIEELKNSSNDVLKYSESINEATGETVQSIDAISKTVDELAQGSVDQAKDAQNGSERLFT
HHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHCCCCCHHHHHH
LAEEIKITDESADLVKKYSLETKENSEKGIATMKETIEKFKENNKVNKELGNNVDMLANR
HHHHHEECCCHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHCCCHHHHHCC
SGSIGEIINSIQSIAQQTNLLALNAAIEAARAGEVGKGFAVVAEEIRKLAEQTSTSTKEI
CCCHHHHHHHHHHHHHHHHHEEHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCHHHH
ENIVEEIQFEINKTKDNMDVSQRVVQEVNGAMNISKESFDNITNSIEIIVEQIELLVHNV
HHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHH
KKVDSDKDEVLASVQGISAIAEESAASTEEVSATVEQQAASMESMSQTAENLKEIASTLD
HHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
TVVNKFEI
HHHHHCCC
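
In the secondary-structure blocks above, residue lines and state lines alternate, with one state letter under each residue. The sketch below assumes the usual reading of those letters (H for helix, E for strand, C for coil), which the record does not spell out, and shows how such a block might be split and summarized. The function names and the toy block are hypothetical.

```python
from collections import Counter
from typing import Dict, List, Tuple


def split_interleaved(block_lines: List[str]) -> Tuple[str, str]:
    """Split alternating residue/state lines into two aligned strings."""
    residues = "".join(line.strip() for line in block_lines[0::2])
    states = "".join(line.strip() for line in block_lines[1::2])
    if len(residues) != len(states):
        raise ValueError("residue and state strings differ in length")
    return residues, states


def ss_composition(states: str) -> Dict[str, float]:
    """Percentage of H, E and C states in a secondary-structure string."""
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states.upper())
    return {s: round(100.0 * counts.get(s, 0) / len(states), 1) for s in "HEC"}


# Toy block in the same interleaved layout (not taken from the record).
toy_block = ["MRSIKS", "CCHHHH", "KIIA", "HHEE"]
residues, states = split_interleaved(toy_block)
print(residues)                # MRSIKSKIIA
print(ss_composition(states))  # {'H': 60.0, 'E': 20.0, 'C': 20.0}
```
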

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 8188684; 9384377 [H]