Definition: Agrobacterium tumefaciens str. C58 chromosome circular, complete sequence.
Accession: NC_003062
Length: 2,841,580 bp

The map label for this gene is mcpE [H]

Identifier: 159184351

GI number: 159184351

Start: 504030

End: 505736

Strand: Direct

Name: mcpE [H]

Synonym: Atu0514

Alternate gene names: 159184351

Gene position: 504030-505736 (Clockwise)
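
As a quick consistency check, the gene length can be derived from the coordinates above. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (an assumption about this record's convention, although it agrees with the 1707-base gene sequence in this record); the function name `gene_length` is illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given inclusive, 1-based coordinates (assumed)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


length = gene_length(504030, 505736)
codons = length // 3  # codon count, including the terminal stop codon
print(length, codons)  # 1707 569
```

The 569 codons correspond to the 568 translated residues plus one stop codon.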

Preceding gene: 159184347

Following gene: 15887863

Centisome position: 17.74
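
The centisome value can be reproduced from the start coordinate and the chromosome length. The sketch below assumes the position is defined as the gene start expressed as a percentage of the total replicon length; that definition is an assumption, not something the record states, and `centisome` is a hypothetical helper name.

```python
def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of replicon length (assumed definition)."""
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    return 100.0 * start / genome_length


# Start coordinate and chromosome length taken from this record.
print(round(centisome(504030, 2841580), 2))  # 17.74
```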

GC content (%): 62.92
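
For readers who want to recompute this figure from the gene sequence, a minimal sketch follows. It assumes GC content is the share of G and C bases over all bases in the sequence; `gc_percent` is an illustrative name and the short input is only the opening bases of the gene, not the full sequence.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = "".join(seq.split()).upper()  # drop line breaks from wrapped FASTA text
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Opening bases of the gene sequence, used here only as a small demo input.
print(round(gc_percent("ATGGGTATGTTCGTTCAAAAATCC"), 2))
```

Passing the complete 1707-base sequence instead of the demo fragment gives the value to compare against the field above.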

Gene sequence:

>1707_bases
ATGGGTATGTTCGTTCAAAAATCCCTGGAGGGTGCTGGCGTTTTGAACCTTGGCCAATTGGCCGGGTTTTACCGGCCATC
ATGTTGTTGGGGACGTCGGTTGGGGCACGATTTGCCATCAGATCAAGCGCGTAGATCGGCAGGCGGAAGCCTGGCAGGAC
GGTTGCGCTTTGCCGGCCTTGATGAGATGCAATCCGATTTCCTGCGCAATTATCGCGGCATGCTGGAGCCTTACGTCAAG
GCCGGCCTGCGCGACGTGATGACCCGTTTCCAGTCCATGCCGGACTGTTCGCCGTCCTTCGAAAGCGAAAACCAACTCGA
TCGCCTGCACGATCTGCAATCCTCGCACTGGAGCGTCCTGACGGATGCGCGTTTCGATGCGCTTTATGCCGAGCGGGTGA
AGGTTCTTTCCGATAATGCCGGTCGCATGGGTCTCGATCCGCGCTGGCAGATCGCTAGCCACGCCGTCGTTCTGGAGCAT
CTGCTCGGCGGGCTGGTGGCTGAACACGCGCCCCGCTCCATCCTGCCCGGCAACCGTAAAAAGAGCCGCGAACTGGCGGA
CGCCGTGAAGAACGTCGTCCGGCTGGTGATGGTCGATACCGAGATTGCCGTGTCGCTGCGCTTCAACGAACTGCGCCTGC
GCCATGGCCGCGAACTCCAGGAACAGCGCGAGAATGACCGCTCGGAAGCGGCGAATTTGCTGGGCACGGCCCTGACCGCC
TTTGCCGCGGGCAATTTGCAGGCCCGCATCGGGGACGATGTGCCCGACGCTTACAGGGACGTCGCAGCCACCTTCAACAC
GGCGCTCGAGACGATCGGCGCGTCGCTGATAGCCGCCCAGAACGGTGTAGGCGAGGCCGAGGCGCTGAGCGCCCGCTTTG
CCGATATCGGCCGCTCGATTGCGGAGCGTTCACGCCAGCAGGCCGAAGCGCTTACCGAGACCTCCCGTGCACTCCAGGTG
ATGATTGCGCATGTGGCTGAAAACGGCGCCCGTATATCGGCGACGGAAAAGGCGGTTTCCAGCGCCCGCGACGCGGCCGT
CGAAAGCGGGAGGGCGATCGGCGAGGCGATCGACGCCATGTCGGATATCGAACAATCAGCCGAACAGATCGGACGGATCA
TCGGCACCATCGACGAGATCGCCTTCCAGACCAACCTTCTCGCCCTCAATGCCGGCATCGAGGCGGCACGGGCCGGCGAC
AGCGGACGCGGTTTCGCCGTCGTTGCGCAGGAAGTCCGGGCATTGGCCCAGCGTTCCGCCGATGCCGCAAGAGAGATCAA
GAGCCTTGTCGGCTCCACCAAAACGCAGGTCGAGGGTGGGGTGCGCATGGTGAACCGCACGCAGGAGGCCATTGGCGGGG
TCGTCCGGCAGGTATCCGGGATCAACGACATGATCGCCGAGGTTTCGCGGCACACCGCCGACCACGCCGGAGAATTGCAA
TCGGTCGCCGGTGATATAGACGAACAGCAACGGCAGGCCGGGCGGAATGTCGCCGACATGGGCGCCAGCGCCTCCGAGGC
CGATGCGCTGCACACCGTCATTCTGGAACTCGGCCGCACCGTGCGCGCGTTCCGTATTGCCCGCCAGGAACATTTCGCTG
CCGCCGCCGGTGCGGACTGGCAGCACTCATCATTTATGGCCCGTGGGGATAACCGCACGGATAACAGCCAGAATTACCAA
CAATTCAGACGTCAGGGAGTCATGTAA

Upstream 100 bases:

>100_bases
CTTCCAGGCATGACGCCGCACTGCCGTACCGGGCTCGATCCGGTATTTTCGTTTCTTGGTTCAAGAAGGTCTGGCAAGCG
TCGATGCCTCATCGTCCACG

Downstream 100 bases:

>100_bases
TGGCAGCCAGAAAAGCAGCTGAGGCGACGCTGAAACTGTCACCGGTGCTGGATCTCAACGAAGCATCGGCTCTGCACGGC
AAGCTGATGACGCTGAGAGG

Product: methyl-accepting chemotaxis protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 568; Mature: 567

Protein sequence:

>568_residues
MGMFVQKSLEGAGVLNLGQLAGFYRPSCCWGRRLGHDLPSDQARRSAGGSLAGRLRFAGLDEMQSDFLRNYRGMLEPYVK
AGLRDVMTRFQSMPDCSPSFESENQLDRLHDLQSSHWSVLTDARFDALYAERVKVLSDNAGRMGLDPRWQIASHAVVLEH
LLGGLVAEHAPRSILPGNRKKSRELADAVKNVVRLVMVDTEIAVSLRFNELRLRHGRELQEQRENDRSEAANLLGTALTA
FAAGNLQARIGDDVPDAYRDVAATFNTALETIGASLIAAQNGVGEAEALSARFADIGRSIAERSRQQAEALTETSRALQV
MIAHVAENGARISATEKAVSSARDAAVESGRAIGEAIDAMSDIEQSAEQIGRIIGTIDEIAFQTNLLALNAGIEAARAGD
SGRGFAVVAQEVRALAQRSADAAREIKSLVGSTKTQVEGGVRMVNRTQEAIGGVVRQVSGINDMIAEVSRHTADHAGELQ
SVAGDIDEQQRQAGRNVADMGASASEADALHTVILELGRTVRAFRIARQEHFAAAAGADWQHSSFMARGDNRTDNSQNYQ
QFRRQGVM
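
The protein sequence above can be checked against the gene sequence by translating it codon by codon. The following is a sketch under stated assumptions: it uses the standard genetic code table built in the conventional TCAG order, reads frame 1 only, and stops at the first stop codon; `translate` and `CODON_TABLE` are illustrative names rather than part of this record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard genetic code).
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate frame 1 of a DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the gene sequence in this record.
print(translate("ATGGGTATGTTCGTTCAAAAATCC"))  # MGMFVQKS
```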

Sequences:

>Translated_568_residues
MGMFVQKSLEGAGVLNLGQLAGFYRPSCCWGRRLGHDLPSDQARRSAGGSLAGRLRFAGLDEMQSDFLRNYRGMLEPYVK
AGLRDVMTRFQSMPDCSPSFESENQLDRLHDLQSSHWSVLTDARFDALYAERVKVLSDNAGRMGLDPRWQIASHAVVLEH
LLGGLVAEHAPRSILPGNRKKSRELADAVKNVVRLVMVDTEIAVSLRFNELRLRHGRELQEQRENDRSEAANLLGTALTA
FAAGNLQARIGDDVPDAYRDVAATFNTALETIGASLIAAQNGVGEAEALSARFADIGRSIAERSRQQAEALTETSRALQV
MIAHVAENGARISATEKAVSSARDAAVESGRAIGEAIDAMSDIEQSAEQIGRIIGTIDEIAFQTNLLALNAGIEAARAGD
SGRGFAVVAQEVRALAQRSADAAREIKSLVGSTKTQVEGGVRMVNRTQEAIGGVVRQVSGINDMIAEVSRHTADHAGELQ
SVAGDIDEQQRQAGRNVADMGASASEADALHTVILELGRTVRAFRIARQEHFAAAAGADWQHSSFMARGDNRTDNSQNYQ
QFRRQGVM
>Mature_567_residues
GMFVQKSLEGAGVLNLGQLAGFYRPSCCWGRRLGHDLPSDQARRSAGGSLAGRLRFAGLDEMQSDFLRNYRGMLEPYVKA
GLRDVMTRFQSMPDCSPSFESENQLDRLHDLQSSHWSVLTDARFDALYAERVKVLSDNAGRMGLDPRWQIASHAVVLEHL
LGGLVAEHAPRSILPGNRKKSRELADAVKNVVRLVMVDTEIAVSLRFNELRLRHGRELQEQRENDRSEAANLLGTALTAF
AAGNLQARIGDDVPDAYRDVAATFNTALETIGASLIAAQNGVGEAEALSARFADIGRSIAERSRQQAEALTETSRALQVM
IAHVAENGARISATEKAVSSARDAAVESGRAIGEAIDAMSDIEQSAEQIGRIIGTIDEIAFQTNLLALNAGIEAARAGDS
GRGFAVVAQEVRALAQRSADAAREIKSLVGSTKTQVEGGVRMVNRTQEAIGGVVRQVSGINDMIAEVSRHTADHAGELQS
VAGDIDEQQRQAGRNVADMGASASEADALHTVILELGRTVRAFRIARQEHFAAAAGADWQHSSFMARGDNRTDNSQNYQQ
FRRQGVM
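
The two entries above differ only in the first residue. A minimal sketch of that relationship, assuming the mature form is simply the translated sequence with the initiator methionine removed (which matches the 568 and 567 residue counts here, but is an assumption about how the record was produced); `mature_sequence` is a hypothetical helper.

```python
def mature_sequence(translated: str) -> str:
    """Drop a leading initiator Met, if present (assumed definition of 'mature')."""
    return translated[1:] if translated.startswith("M") else translated


print(mature_sequence("MGMFVQKS"))  # GMFVQKS
```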

Specific function: Signal transducer for aerotaxis. The aerotactic response is the accumulation of cells around air bubbles. The nature of the sensory stimulus detected by this protein is the proton motive force or cellular redox state. It uses a FAD prosthetic group as a r

COG id: NA

COG function: NA

Gene ontology:

Cell location: Integral Membrane. Inner Membrane [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 methyl-accepting transducer domain [H]

Homologues:

Organism=Escherichia coli, GI1789453, Length=187, Percent_Identity=41.1764705882353, Blast_Score=137, Evalue=2e-33
Organism=Escherichia coli, GI1788195, Length=228, Percent_Identity=38.1578947368421, Blast_Score=136, Evalue=4e-33
Organism=Escherichia coli, GI2367378, Length=236, Percent_Identity=38.135593220339, Blast_Score=133, Evalue=4e-32
Organism=Escherichia coli, GI1788194, Length=276, Percent_Identity=31.8840579710145, Blast_Score=125, Evalue=9e-30
Organism=Escherichia coli, GI1787690, Length=204, Percent_Identity=35.2941176470588, Blast_Score=110, Evalue=3e-25
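
The homologue lines above follow a comma-separated `Key=value` layout with one bare GI identifier. A small parsing sketch is given below; the key name `GI` assigned to the bare identifier and the function name `parse_homologue` are assumptions made for illustration, and the parser tolerates an optional trailing comma.

```python
# Fields converted to numbers; all other fields stay as strings.
NUMERIC_FIELDS = {
    "Length": int,
    "Percent_Identity": float,
    "Blast_Score": float,
    "Evalue": float,
}


def parse_homologue(line: str) -> dict:
    """Parse one 'Key=value, ...' homologue line into a dictionary."""
    record = {}
    for part in line.split(","):
        part = part.strip()
        if not part:
            continue  # skip the empty piece left by a trailing comma
        if "=" in part:
            key, value = part.split("=", 1)
            record[key.strip()] = value.strip()
        else:
            record["GI"] = part  # bare identifier such as GI1789453
    for key, cast in NUMERIC_FIELDS.items():
        if key in record:
            record[key] = cast(record[key])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1789453, Length=187, "
    "Percent_Identity=41.1764705882353, Blast_Score=137, Evalue=2e-33,"
)
print(hit["Organism"], hit["GI"], hit["Evalue"])
```

Parsed records can then be sorted or filtered, for example by `Evalue`, without further string handling.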

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR004090
- InterPro:   IPR004089
- InterPro:   IPR012292
- InterPro:   IPR009050 [H]

Pfam domain/function: PF00015 MCPsignal [H]

EC number: NA

Molecular weight (Da): Translated: 61453; Mature: 61321

Theoretical pI: Translated: 6.22; Mature: 6.22

Prosite motif: PS50885 HAMP ; PS50111 CHEMOTAXIS_TRANSDUC_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.5 %Cys     (Translated Protein)
2.6 %Met     (Translated Protein)
3.2 %Cys+Met (Translated Protein)
0.5 %Cys     (Mature Protein)
2.5 %Met     (Mature Protein)
3.0 %Cys+Met (Mature Protein)
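
The percentages above can be recomputed from either protein sequence by counting residues. This is a minimal sketch, assuming each value is the count of the named residues divided by the sequence length; `residue_percent` is an illustrative name and the input shown is only the first residues of the translated protein.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the given one-letter residues in a protein sequence."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in residues.upper())
    return 100.0 * count / len(protein)


fragment = "MGMFVQKSLEGAGVLNLGQLAGFYRPSCCWGRRL"  # demo input, not the full protein
for label, residues in (("Cys", "C"), ("Met", "M"), ("Cys+Met", "CM")):
    print(label, round(residue_percent(fragment, residues), 1))
```

Running the same function on the full translated and mature sequences gives the values to compare against the list above.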

Secondary structure:

>Translated Secondary Structure
MGMFVQKSLEGAGVLNLGQLAGFYRPSCCWGRRLGHDLPSDQARRSAGGSLAGRLRFAGL
CCCCHHHHCCCCCCCCHHHHHCCCCCCHHHHHHHCCCCCCHHHHHHCCCCHHHHHHHCCH
DEMQSDFLRNYRGMLEPYVKAGLRDVMTRFQSMPDCSPSFESENQLDRLHDLQSSHWSVL
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHCCCCHHH
TDARFDALYAERVKVLSDNAGRMGLDPRWQIASHAVVLEHLLGGLVAEHAPRSILPGNRK
HHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCH
KSRELADAVKNVVRLVMVDTEIAVSLRFNELRLRHGRELQEQRENDRSEAANLLGTALTA
HHHHHHHHHHHHHHHHHHCCHHEEEEEHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHH
FAAGNLQARIGDDVPDAYRDVAATFNTALETIGASLIAAQNGVGEAEALSARFADIGRSI
HHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHH
AERSRQQAEALTETSRALQVMIAHVAENGARISATEKAVSSARDAAVESGRAIGEAIDAM
HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH
SDIEQSAEQIGRIIGTIDEIAFQTNLLALNAGIEAARAGDSGRGFAVVAQEVRALAQRSA
HHHHHHHHHHHHHHHHHHHHHHHHHHEEEECCCHHHCCCCCCCCHHHHHHHHHHHHHHHH
DAAREIKSLVGSTKTQVEGGVRMVNRTQEAIGGVVRQVSGINDMIAEVSRHTADHAGELQ
HHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHH
SVAGDIDEQQRQAGRNVADMGASASEADALHTVILELGRTVRAFRIARQEHFAAAAGADW
HHHCCHHHHHHHHCCCHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
QHSSFMARGDNRTDNSQNYQQFRRQGVM
CCCHHHHCCCCCCCCCHHHHHHHHCCCC
>Mature Secondary Structure
GMFVQKSLEGAGVLNLGQLAGFYRPSCCWGRRLGHDLPSDQARRSAGGSLAGRLRFAGL
CCCHHHHCCCCCCCCHHHHHCCCCCCHHHHHHHCCCCCCHHHHHHCCCCHHHHHHHCCH
DEMQSDFLRNYRGMLEPYVKAGLRDVMTRFQSMPDCSPSFESENQLDRLHDLQSSHWSVL
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHCCCCHHH
TDARFDALYAERVKVLSDNAGRMGLDPRWQIASHAVVLEHLLGGLVAEHAPRSILPGNRK
HHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCH
KSRELADAVKNVVRLVMVDTEIAVSLRFNELRLRHGRELQEQRENDRSEAANLLGTALTA
HHHHHHHHHHHHHHHHHHCCHHEEEEEHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHH
FAAGNLQARIGDDVPDAYRDVAATFNTALETIGASLIAAQNGVGEAEALSARFADIGRSI
HHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHH
AERSRQQAEALTETSRALQVMIAHVAENGARISATEKAVSSARDAAVESGRAIGEAIDAM
HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH
SDIEQSAEQIGRIIGTIDEIAFQTNLLALNAGIEAARAGDSGRGFAVVAQEVRALAQRSA
HHHHHHHHHHHHHHHHHHHHHHHHHHEEEECCCHHHCCCCCCCCHHHHHHHHHHHHHHHH
DAAREIKSLVGSTKTQVEGGVRMVNRTQEAIGGVVRQVSGINDMIAEVSRHTADHAGELQ
HHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHH
SVAGDIDEQQRQAGRNVADMGASASEADALHTVILELGRTVRAFRIARQEHFAAAAGADW
HHHCCHHHHHHHHCCCHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
QHSSFMARGDNRTDNSQNYQQFRRQGVM
CCCHHHHCCCCCCCCCHHHHHHHHCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 7623670; 11481430 [H]