Definition Rhizobium etli CFN 42 plasmid p42f, complete sequence.
Accession NC_007766
Length 642,517


The map label for this gene is mcpB

Identifier: 86361029

GI number: 86361029

Start: 335382

End: 337526

Strand: Reverse

Name: mcpB

Synonym: RHE_PF00299

Alternate gene names: 86361029

Gene position: 337526-335382 (Counterclockwise)

Preceding gene: 86361030

Following gene: 86361026

Centisome position: 52.53
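
The coordinate fields above can be cross-checked with a short sketch. It assumes inclusive 1-based coordinates, and it assumes that the centisome position is the gene's start on its own strand (337526 for this reverse-strand gene) expressed as a percentage of the plasmid length. The record does not state the formula, and the function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with inclusive 1-based coordinates."""
    return abs(end - start) + 1


def centisome(position: int, replicon_length: int) -> float:
    """Position as a percentage of the replicon length (assumed definition)."""
    return round(100.0 * position / replicon_length, 2)


# Values taken from the Start, End and Length fields of this record.
length = gene_length(335382, 337526)
centi = centisome(337526, 642517)
print(length, centi)  # 2145 52.53
```

Both results agree with the ">2145_bases" header and the reported centisome position, which supports the assumed definitions without confirming them.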

GC content: 64.76

Gene sequence:

>2145_bases
ATGTTGCATTTCTGGAATAAATTCGGCATTCGCGCGCAGATCACCGCAGGCTTCGTGCCGCTGATCCTGCTGATGAGCCT
GCTGACCGTCAGCGCGATCTCCGGCATGAACGGGCTCGCTTCGATCTTCGCCTCCTACCGCGCCACGGCCGGCCAGAGCC
TCGCCATCTCGGACTACAGTGACCAGCTGCACGAGATCCAAATGTCGGTCGAAGCCTTCCGCTCCACGCCGACCCAGGCC
GTGGTTGACAGTTTCCGCGCCGGCGTCAAAGCCTTCGAAGCGGACGATCCGCGCTTTGCCGGCAACAAGGACCTGCAGTC
TGGCCTGGCGACGATCCGCCAGGATATCGCCGCCTATGGCAAGGCTTTTGAGCAGATCGTCTCGCTCCAGGCCCGTCGCG
ACCTGCTGATCTCCAAGGTCACCGAATTCGGTCCCTGGACCAGCATCGCGCTCAACGACGTCATGCGCAGCGCCTGGCGC
CAGAACGACGTCGCCCTTCTGCACATGACGGCCGAGACGCTGGAAGCCCTCAACCGCAGCCTCTATTTCTCCGAACGCTT
CGTGCATTCCGACGATTTCGCCGCTTACGATACGGCGCAGACAGCACTTGCCGAAGCGGTGGCGCTCAATGATGCCGCCG
CCAAGGCTGCCAAGAACGAACTGCAGAAGAAGCGCCTGATGGGCGCCGGCCAGCTGATGCAGAACTACACTGCCCGCCTA
GGCGACATGAAGGAGTTGCTGCAGGCCTCCGGCAATATCCGGCAGACCCAGCTTAATGTGCTCGCACCGAAGATCGCTGG
GGAGTTCAGGGATCTGCAGGCGACGGTGACCGGCGCGCAGAAGACGCTCGACGGTTCGGTGGAAGCAACGGTTGCTTCTG
CGACCAGCACGACGCTCGTCATCAGCGGTCTGCTGATCGTCATCGGCCTGGTGCTCTCCTATTTCGTCGGCCGGCTGATT
TCCTCGGCGGTGCGCGGCATGGCGCATTCCATGGAGCAGCTTGCCCGCGGCGACGACGCCATCGTCATCACCGGCGTCGA
GCACCGGCACGAACTCGGCGCCATGGCGCGCTCGCTGAAGGTTTTCCAGGAAACCGGACGCGCCAAGCTGATCGCCGAAG
CCAATGCCGAGCGCGCCCGCCTTGCCGCCGAAGAAGAGCGGCTGCGCCAGGAAGCCGAGCGGCTTTCCGACGCGCAGGTG
ATGGAGCATGCCTTCCGCCAAATCTCGCTCGGTCTCGATGCGCTCTCGAAGGGTGATCTCTCCGTGCGCGTCGGCGAAGT
CGACCATCGTTACGTCAGGATCCGCGATCACTTCAACAGCTCGGTCGCAAGTCTCGAGGAAGCGATCGACTCCGTCATCC
GTGCGGTCACGACCATCCGCTCCGGCCTTGCGGAAATCTCGACCGCCTCCAACGATCTCGCCCGCCGTACCGAGCAGCAG
GCGGCTTCGCTTGAGGAAACTGTCGCGGCGCTGGGCGACGTCACCCGCGGCGTCAACGGAACGGCGGAGGGCGCCGGCCG
CGCCCAGGCCGTGGTGGCGACGGCCCGCACCAATGCGGAAAAGGGCGGCGAGATCGTCTCGCGAGCCATCGCCGCGATGA
CGGAAATCCAGAACTCCTCGTCGAAGATCGGCAACATTATCAGCGTCATCGACGAGATCGCCTTCCAGACCAACCTGCTG
GCGCTGAATGCCGGTGTCGAGGCGGCGCGTGCCGGCGAGGCCGGCAAGGGCTTCGCCGTCGTCGCCCAGGAAGTTCGCGA
GCTCGCTCAGCGCTCGGCCAATGCGGCACGGGAGATCAAGGAACTGATTTCCACCTCCTCGGCACAGGTCAAGACCGGCG
TTGAGCTGGTGGGTGAATCCGGTCTCTCGCTCGAACAGATCGTCGAGCAGGTCACGGCTATGAACGCGACCGTCGCCGAC
ATCGCCGTCGCCGCCCGCGAGCAGGCGGCCAGCCTGCGCGAGGTCTCGGCCGCCGGCGACCAGATGGACAAGGTAACGCA
GCAGAACGCCGCGATGGTCGAGGAAACCACGGCCGCCGCCCAGAGCCTGACGCAGGAAACCGAGAGCCTCGCCGAGCTGC
TGCGGCGCTTCAAGACAGGCAACGGCCGGGCATCGAACCACCGCCACTACGCGATGGCGTCGTAA
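
The GC content field can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full gene length; the function name is hypothetical and the record does not say how its value was derived.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return round(100.0 * gc / len(bases), 2)


# Toy input; paste the 2145-base sequence here to check the reported value.
print(gc_content("ATGC\nGGCC"))  # 75.0
```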

Upstream 100 bases:

>100_bases
GAGAAGGTGCCGCGCGAAAACAACGTTCCCTTCGAACTCGTCACCCCTCAGAACATTGCCCAATACCTGCCGAAGAGCCA
GTGAGCATAAGAGGCCTATG

Downstream 100 bases:

>100_bases
CGTCCTGCTAACGCAGCCAAGAGAACGCCGCCGCCTTCAAAGGGCGGCGGTCGCGTCAATGGCCTCCGGCCGCCGGACTG
TGCGGCGCGCCGGCGATGCT

Product: methyl-accepting chemotaxis protein

Products: NA

Alternate protein names: Methyl-accepting chemotaxis protein [H]

Number of amino acids: Translated: 714; Mature: 714
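
The residue count follows from the gene length if the 2145-base sequence includes the terminal stop codon (it ends in TAA). A small sketch of that arithmetic, with an illustrative function name:

```python
def protein_length(cds_bases: int, includes_stop: bool = True) -> int:
    """Number of residues encoded by a coding sequence of the given length."""
    if cds_bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = cds_bases // 3
    # The stop codon does not encode an amino acid.
    return codons - 1 if includes_stop else codons


print(protein_length(2145))  # 714
```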

Protein sequence:

>714_residues
MLHFWNKFGIRAQITAGFVPLILLMSLLTVSAISGMNGLASIFASYRATAGQSLAISDYSDQLHEIQMSVEAFRSTPTQA
VVDSFRAGVKAFEADDPRFAGNKDLQSGLATIRQDIAAYGKAFEQIVSLQARRDLLISKVTEFGPWTSIALNDVMRSAWR
QNDVALLHMTAETLEALNRSLYFSERFVHSDDFAAYDTAQTALAEAVALNDAAAKAAKNELQKKRLMGAGQLMQNYTARL
GDMKELLQASGNIRQTQLNVLAPKIAGEFRDLQATVTGAQKTLDGSVEATVASATSTTLVISGLLIVIGLVLSYFVGRLI
SSAVRGMAHSMEQLARGDDAIVITGVEHRHELGAMARSLKVFQETGRAKLIAEANAERARLAAEEERLRQEAERLSDAQV
MEHAFRQISLGLDALSKGDLSVRVGEVDHRYVRIRDHFNSSVASLEEAIDSVIRAVTTIRSGLAEISTASNDLARRTEQQ
AASLEETVAALGDVTRGVNGTAEGAGRAQAVVATARTNAEKGGEIVSRAIAAMTEIQNSSSKIGNIISVIDEIAFQTNLL
ALNAGVEAARAGEAGKGFAVVAQEVRELAQRSANAAREIKELISTSSAQVKTGVELVGESGLSLEQIVEQVTAMNATVAD
IAVAAREQAASLREVSAAGDQMDKVTQQNAAMVEETTAAAQSLTQETESLAELLRRFKTGNGRASNHRHYAMAS

Sequences:

>Translated_714_residues
MLHFWNKFGIRAQITAGFVPLILLMSLLTVSAISGMNGLASIFASYRATAGQSLAISDYSDQLHEIQMSVEAFRSTPTQA
VVDSFRAGVKAFEADDPRFAGNKDLQSGLATIRQDIAAYGKAFEQIVSLQARRDLLISKVTEFGPWTSIALNDVMRSAWR
QNDVALLHMTAETLEALNRSLYFSERFVHSDDFAAYDTAQTALAEAVALNDAAAKAAKNELQKKRLMGAGQLMQNYTARL
GDMKELLQASGNIRQTQLNVLAPKIAGEFRDLQATVTGAQKTLDGSVEATVASATSTTLVISGLLIVIGLVLSYFVGRLI
SSAVRGMAHSMEQLARGDDAIVITGVEHRHELGAMARSLKVFQETGRAKLIAEANAERARLAAEEERLRQEAERLSDAQV
MEHAFRQISLGLDALSKGDLSVRVGEVDHRYVRIRDHFNSSVASLEEAIDSVIRAVTTIRSGLAEISTASNDLARRTEQQ
AASLEETVAALGDVTRGVNGTAEGAGRAQAVVATARTNAEKGGEIVSRAIAAMTEIQNSSSKIGNIISVIDEIAFQTNLL
ALNAGVEAARAGEAGKGFAVVAQEVRELAQRSANAAREIKELISTSSAQVKTGVELVGESGLSLEQIVEQVTAMNATVAD
IAVAAREQAASLREVSAAGDQMDKVTQQNAAMVEETTAAAQSLTQETESLAELLRRFKTGNGRASNHRHYAMAS
>Mature_714_residues
MLHFWNKFGIRAQITAGFVPLILLMSLLTVSAISGMNGLASIFASYRATAGQSLAISDYSDQLHEIQMSVEAFRSTPTQA
VVDSFRAGVKAFEADDPRFAGNKDLQSGLATIRQDIAAYGKAFEQIVSLQARRDLLISKVTEFGPWTSIALNDVMRSAWR
QNDVALLHMTAETLEALNRSLYFSERFVHSDDFAAYDTAQTALAEAVALNDAAAKAAKNELQKKRLMGAGQLMQNYTARL
GDMKELLQASGNIRQTQLNVLAPKIAGEFRDLQATVTGAQKTLDGSVEATVASATSTTLVISGLLIVIGLVLSYFVGRLI
SSAVRGMAHSMEQLARGDDAIVITGVEHRHELGAMARSLKVFQETGRAKLIAEANAERARLAAEEERLRQEAERLSDAQV
MEHAFRQISLGLDALSKGDLSVRVGEVDHRYVRIRDHFNSSVASLEEAIDSVIRAVTTIRSGLAEISTASNDLARRTEQQ
AASLEETVAALGDVTRGVNGTAEGAGRAQAVVATARTNAEKGGEIVSRAIAAMTEIQNSSSKIGNIISVIDEIAFQTNLL
ALNAGVEAARAGEAGKGFAVVAQEVRELAQRSANAAREIKELISTSSAQVKTGVELVGESGLSLEQIVEQVTAMNATVAD
IAVAAREQAASLREVSAAGDQMDKVTQQNAAMVEETTAAAQSLTQETESLAELLRRFKTGNGRASNHRHYAMAS

Specific function: Chemotactic-signal transducers respond to changes in the concentration of attractants and repellents in the environment, transduce a signal from the outside to the inside of the cell, and facilitate sensory adaptation through the variation of the level of methylation.

COG id: COG0840

COG function: function code NT; Methyl-accepting chemotaxis protein

Gene ontology:

Cell location: Cell membrane; Multi-pass membrane protein. Note=Localized at the flagellum-bearing pole of the swarmer cell [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 methyl-accepting transducer domain [H]

Homologues:

Organism=Escherichia coli, GI1788194, Length=293, Percent_Identity=43.6860068259386, Blast_Score=194, Evalue=2e-50,
Organism=Escherichia coli, GI2367378, Length=211, Percent_Identity=53.0805687203791, Blast_Score=192, Evalue=5e-50,
Organism=Escherichia coli, GI1787690, Length=310, Percent_Identity=40.3225806451613, Blast_Score=188, Evalue=9e-49,
Organism=Escherichia coli, GI1788195, Length=230, Percent_Identity=48.2608695652174, Blast_Score=188, Evalue=1e-48,
Organism=Escherichia coli, GI1789453, Length=228, Percent_Identity=41.6666666666667, Blast_Score=166, Evalue=4e-42,
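
The homologue lines above share a comma-separated key=value layout, with the GI number given as a bare token. A minimal parsing sketch follows, assuming that layout holds for every line; the function name and the "GI" key are hypothetical choices, not part of the record.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dictionary of typed fields."""
    record = {}
    for part in line.strip().rstrip(",").split(","):
        part = part.strip()
        if not part:
            continue
        if "=" in part:
            key, value = part.split("=", 1)
            record[key] = value
        elif part.startswith("GI"):
            # Bare token such as GI1788194: keep only the number.
            record["GI"] = part[2:]
    # Convert the numeric fields that are present.
    for key, cast in (("Length", int), ("Percent_Identity", float),
                      ("Blast_Score", int), ("Evalue", float)):
        if key in record:
            record[key] = cast(record[key])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1788194, Length=293, "
    "Percent_Identity=43.6860068259386, Blast_Score=194, Evalue=2e-50,"
)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 2))
```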

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR004090
- InterPro:   IPR004089
- InterPro:   IPR003660 [H]

Pfam domain/function: PF00672 HAMP; PF00015 MCPsignal [H]

EC number: NA

Molecular weight: Translated: 76451; Mature: 76451

Theoretical pI: Translated: 5.59; Mature: 5.59

Prosite motif: PS50885 HAMP; PS50111 CHEMOTAXIS_TRANSDUC_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.0 %Cys     (Translated Protein)
2.5 %Met     (Translated Protein)
2.5 %Cys+Met (Translated Protein)
0.0 %Cys     (Mature Protein)
2.5 %Met     (Mature Protein)
2.5 %Cys+Met (Mature Protein)
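
The percentages above can be recomputed from the protein sequence. This sketch assumes each value is the count of the named residues divided by the total residue count, rounded to one decimal place; the function name is illustrative.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of any of the given residues."""
    sequence = "".join(protein.split()).upper()
    if not sequence:
        raise ValueError("empty sequence")
    count = sum(1 for aa in sequence if aa in residues.upper())
    return round(100.0 * count / len(sequence), 1)


# Toy peptide; paste the 714-residue sequence to check the reported values.
toy = "MACM"
print(residue_percent(toy, "C"), residue_percent(toy, "M"),
      residue_percent(toy, "CM"))  # 25.0 50.0 75.0
```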

Secondary structure:

>Translated Secondary Structure
MLHFWNKFGIRAQITAGFVPLILLMSLLTVSAISGMNGLASIFASYRATAGQSLAISDYS
CCCCCCCCCCEEEHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCEEECHHH
DQLHEIQMSVEAFRSTPTQAVVDSFRAGVKAFEADDPRFAGNKDLQSGLATIRQDIAAYG
HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHH
KAFEQIVSLQARRDLLISKVTEFGPWTSIALNDVMRSAWRQNDVALLHMTAETLEALNRS
HHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCEEEEEEHHHHHHHHHHH
LYFSERFVHSDDFAAYDTAQTALAEAVALNDAAAKAAKNELQKKRLMGAGQLMQNYTARL
HHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
GDMKELLQASGNIRQTQLNVLAPKIAGEFRDLQATVTGAQKTLDGSVEATVASATSTTLV
HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH
ISGLLIVIGLVLSYFVGRLISSAVRGMAHSMEQLARGDDAIVITGVEHRHELGAMARSLK
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHH
VFQETGRAKLIAEANAERARLAAEEERLRQEAERLSDAQVMEHAFRQISLGLDALSKGDL
HHHHCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
SVRVGEVDHRYVRIRDHFNSSVASLEEAIDSVIRAVTTIRSGLAEISTASNDLARRTEQQ
EEEEECCCCCEEEEHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
AASLEETVAALGDVTRGVNGTAEGAGRAQAVVATARTNAEKGGEIVSRAIAAMTEIQNSS
HHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCH
SKIGNIISVIDEIAFQTNLLALNAGVEAARAGEAGKGFAVVAQEVRELAQRSANAAREIK
HHHHHHHHHHHHHHHHHHHEEEECCCHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH
ELISTSSAQVKTGVELVGESGLSLEQIVEQVTAMNATVADIAVAAREQAASLREVSAAGD
HHHCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHH
QMDKVTQQNAAMVEETTAAAQSLTQETESLAELLRRFKTGNGRASNHRHYAMAS
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC
>Mature Secondary Structure
MLHFWNKFGIRAQITAGFVPLILLMSLLTVSAISGMNGLASIFASYRATAGQSLAISDYS
CCCCCCCCCCEEEHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCEEECHHH
DQLHEIQMSVEAFRSTPTQAVVDSFRAGVKAFEADDPRFAGNKDLQSGLATIRQDIAAYG
HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHH
KAFEQIVSLQARRDLLISKVTEFGPWTSIALNDVMRSAWRQNDVALLHMTAETLEALNRS
HHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCEEEEEEHHHHHHHHHHH
LYFSERFVHSDDFAAYDTAQTALAEAVALNDAAAKAAKNELQKKRLMGAGQLMQNYTARL
HHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
GDMKELLQASGNIRQTQLNVLAPKIAGEFRDLQATVTGAQKTLDGSVEATVASATSTTLV
HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH
ISGLLIVIGLVLSYFVGRLISSAVRGMAHSMEQLARGDDAIVITGVEHRHELGAMARSLK
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHH
VFQETGRAKLIAEANAERARLAAEEERLRQEAERLSDAQVMEHAFRQISLGLDALSKGDL
HHHHCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
SVRVGEVDHRYVRIRDHFNSSVASLEEAIDSVIRAVTTIRSGLAEISTASNDLARRTEQQ
EEEEECCCCCEEEEHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
AASLEETVAALGDVTRGVNGTAEGAGRAQAVVATARTNAEKGGEIVSRAIAAMTEIQNSS
HHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCH
SKIGNIISVIDEIAFQTNLLALNAGVEAARAGEAGKGFAVVAQEVRELAQRSANAAREIK
HHHHHHHHHHHHHHHHHHHEEEECCCHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH
ELISTSSAQVKTGVELVGESGLSLEQIVEQVTAMNATVADIAVAAREQAASLREVSAAGD
HHHCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHH
QMDKVTQQNAAMVEETTAAAQSLTQETESLAELLRRFKTGNGRASNHRHYAMAS
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha
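
The "Alpha" class appears consistent with the predominance of H (helix) states in the predicted secondary structure above. A hedged sketch for quantifying this, assuming the three-state alphabet H (helix), E (strand) and C (coil) and taking only the state lines as input, not the residue lines; the function name is hypothetical.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of helix (H), strand (E) and coil (C) states."""
    cleaned = "".join(states.split()).upper()
    if not cleaned:
        raise ValueError("empty state string")
    counts = Counter(cleaned)
    return {state: round(100.0 * counts.get(state, 0) / len(cleaned), 1)
            for state in "HEC"}


# Toy state string; concatenate the record's state lines for the real value.
print(ss_composition("HHHHEECC"))  # {'H': 50.0, 'E': 25.0, 'C': 25.0}
```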

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 1577276; 11259647 [H]