Definition | Agrobacterium tumefaciens str. C58 chromosome circular, complete sequence. |
---|---|
Accession | NC_003062 |
Length | 2,841,580 |
The map label for this gene is mcpE [H]
Identifier: 159184351
GI number: 159184351
Start: 504030
End: 505736
Strand: Direct
Name: mcpE [H]
Synonym: Atu0514
Alternate gene names: 159184351
Gene position: 504030-505736 (Clockwise)
Preceding gene: 159184347
Following gene: 15887863
Centisome position: 17.74
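The coordinate fields above are mutually consistent, and the arithmetic can be sketched in a few lines. This is only an illustration: the function names are hypothetical, and the centisome formula (start coordinate as a percentage of chromosome length) is an assumption that happens to reproduce the value listed in this record, not a documented definition from the database.

```python
# Values copied from this record.
GENOME_LENGTH = 2_841_580   # chromosome length in bases
START, END = 504_030, 505_736


def gene_length(start: int, end: int) -> int:
    """Length of a gene given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the chromosome (assumed formula)."""
    return round(100 * start / genome_length, 2)


print(gene_length(START, END))          # 1707
print(centisome(START, GENOME_LENGTH))  # 17.74
```

The 1707-base length agrees with the header of the gene sequence, and 17.74 agrees with the centisome position given above.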
GC content: 62.92
Gene sequence:
>1707_bases ATGGGTATGTTCGTTCAAAAATCCCTGGAGGGTGCTGGCGTTTTGAACCTTGGCCAATTGGCCGGGTTTTACCGGCCATC ATGTTGTTGGGGACGTCGGTTGGGGCACGATTTGCCATCAGATCAAGCGCGTAGATCGGCAGGCGGAAGCCTGGCAGGAC GGTTGCGCTTTGCCGGCCTTGATGAGATGCAATCCGATTTCCTGCGCAATTATCGCGGCATGCTGGAGCCTTACGTCAAG GCCGGCCTGCGCGACGTGATGACCCGTTTCCAGTCCATGCCGGACTGTTCGCCGTCCTTCGAAAGCGAAAACCAACTCGA TCGCCTGCACGATCTGCAATCCTCGCACTGGAGCGTCCTGACGGATGCGCGTTTCGATGCGCTTTATGCCGAGCGGGTGA AGGTTCTTTCCGATAATGCCGGTCGCATGGGTCTCGATCCGCGCTGGCAGATCGCTAGCCACGCCGTCGTTCTGGAGCAT CTGCTCGGCGGGCTGGTGGCTGAACACGCGCCCCGCTCCATCCTGCCCGGCAACCGTAAAAAGAGCCGCGAACTGGCGGA CGCCGTGAAGAACGTCGTCCGGCTGGTGATGGTCGATACCGAGATTGCCGTGTCGCTGCGCTTCAACGAACTGCGCCTGC GCCATGGCCGCGAACTCCAGGAACAGCGCGAGAATGACCGCTCGGAAGCGGCGAATTTGCTGGGCACGGCCCTGACCGCC TTTGCCGCGGGCAATTTGCAGGCCCGCATCGGGGACGATGTGCCCGACGCTTACAGGGACGTCGCAGCCACCTTCAACAC GGCGCTCGAGACGATCGGCGCGTCGCTGATAGCCGCCCAGAACGGTGTAGGCGAGGCCGAGGCGCTGAGCGCCCGCTTTG CCGATATCGGCCGCTCGATTGCGGAGCGTTCACGCCAGCAGGCCGAAGCGCTTACCGAGACCTCCCGTGCACTCCAGGTG ATGATTGCGCATGTGGCTGAAAACGGCGCCCGTATATCGGCGACGGAAAAGGCGGTTTCCAGCGCCCGCGACGCGGCCGT CGAAAGCGGGAGGGCGATCGGCGAGGCGATCGACGCCATGTCGGATATCGAACAATCAGCCGAACAGATCGGACGGATCA TCGGCACCATCGACGAGATCGCCTTCCAGACCAACCTTCTCGCCCTCAATGCCGGCATCGAGGCGGCACGGGCCGGCGAC AGCGGACGCGGTTTCGCCGTCGTTGCGCAGGAAGTCCGGGCATTGGCCCAGCGTTCCGCCGATGCCGCAAGAGAGATCAA GAGCCTTGTCGGCTCCACCAAAACGCAGGTCGAGGGTGGGGTGCGCATGGTGAACCGCACGCAGGAGGCCATTGGCGGGG TCGTCCGGCAGGTATCCGGGATCAACGACATGATCGCCGAGGTTTCGCGGCACACCGCCGACCACGCCGGAGAATTGCAA TCGGTCGCCGGTGATATAGACGAACAGCAACGGCAGGCCGGGCGGAATGTCGCCGACATGGGCGCCAGCGCCTCCGAGGC CGATGCGCTGCACACCGTCATTCTGGAACTCGGCCGCACCGTGCGCGCGTTCCGTATTGCCCGCCAGGAACATTTCGCTG CCGCCGCCGGTGCGGACTGGCAGCACTCATCATTTATGGCCCGTGGGGATAACCGCACGGATAACAGCCAGAATTACCAA CAATTCAGACGTCAGGGAGTCATGTAA
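A GC content figure such as the one listed above can in principle be recomputed from the gene sequence. The sketch below assumes the sequence is supplied in the same layout as this record (a `>` header token followed by space-separated chunks); `strip_header` and `gc_content` are hypothetical helper names, and the toy input is not taken from the record.

```python
def strip_header(record: str) -> str:
    """Join the sequence chunks of a record, dropping any '>' header token."""
    return "".join(tok for tok in record.split() if not tok.startswith(">"))


def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T bases, rounded to 2 decimals."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(b in "GC" for b in bases)
    return round(100 * gc / len(bases), 2)


# Toy example: 4 of the 6 bases are G or C.
print(gc_content(strip_header(">6_bases GGCC AT")))  # 66.67
```

Passing the full 1707-base record through `strip_header` and then `gc_content` would give a value to compare against the listed GC content.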
Upstream 100 bases:
>100_bases CTTCCAGGCATGACGCCGCACTGCCGTACCGGGCTCGATCCGGTATTTTCGTTTCTTGGTTCAAGAAGGTCTGGCAAGCG TCGATGCCTCATCGTCCACG
Downstream 100 bases:
>100_bases TGGCAGCCAGAAAAGCAGCTGAGGCGACGCTGAAACTGTCACCGGTGCTGGATCTCAACGAAGCATCGGCTCTGCACGGC AAGCTGATGACGCTGAGAGG
Product: methyl-accepting chemotaxis protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 568; Mature: 567
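The two residue counts follow from the gene length under two assumptions that match this record: the 1707-base sequence ends in a stop codon (it ends in TAA), and the mature form differs from the translated form only by loss of the initiator methionine (the mature sequence begins GMFV rather than MGMFV). The function name below is hypothetical.

```python
def residue_counts(n_bases: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for an ORF of n_bases.

    Assumes the ORF includes its stop codon and that the mature protein
    lacks only the initiator methionine.
    """
    if n_bases % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    translated = n_bases // 3 - 1   # the stop codon encodes no residue
    mature = translated - 1         # initiator Met removed
    return translated, mature


print(residue_counts(1707))  # (568, 567)
```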
Protein sequence:
>568_residues MGMFVQKSLEGAGVLNLGQLAGFYRPSCCWGRRLGHDLPSDQARRSAGGSLAGRLRFAGLDEMQSDFLRNYRGMLEPYVK AGLRDVMTRFQSMPDCSPSFESENQLDRLHDLQSSHWSVLTDARFDALYAERVKVLSDNAGRMGLDPRWQIASHAVVLEH LLGGLVAEHAPRSILPGNRKKSRELADAVKNVVRLVMVDTEIAVSLRFNELRLRHGRELQEQRENDRSEAANLLGTALTA FAAGNLQARIGDDVPDAYRDVAATFNTALETIGASLIAAQNGVGEAEALSARFADIGRSIAERSRQQAEALTETSRALQV MIAHVAENGARISATEKAVSSARDAAVESGRAIGEAIDAMSDIEQSAEQIGRIIGTIDEIAFQTNLLALNAGIEAARAGD SGRGFAVVAQEVRALAQRSADAAREIKSLVGSTKTQVEGGVRMVNRTQEAIGGVVRQVSGINDMIAEVSRHTADHAGELQ SVAGDIDEQQRQAGRNVADMGASASEADALHTVILELGRTVRAFRIARQEHFAAAAGADWQHSSFMARGDNRTDNSQNYQ QFRRQGVM
Sequences:
>Translated_568_residues MGMFVQKSLEGAGVLNLGQLAGFYRPSCCWGRRLGHDLPSDQARRSAGGSLAGRLRFAGLDEMQSDFLRNYRGMLEPYVK AGLRDVMTRFQSMPDCSPSFESENQLDRLHDLQSSHWSVLTDARFDALYAERVKVLSDNAGRMGLDPRWQIASHAVVLEH LLGGLVAEHAPRSILPGNRKKSRELADAVKNVVRLVMVDTEIAVSLRFNELRLRHGRELQEQRENDRSEAANLLGTALTA FAAGNLQARIGDDVPDAYRDVAATFNTALETIGASLIAAQNGVGEAEALSARFADIGRSIAERSRQQAEALTETSRALQV MIAHVAENGARISATEKAVSSARDAAVESGRAIGEAIDAMSDIEQSAEQIGRIIGTIDEIAFQTNLLALNAGIEAARAGD SGRGFAVVAQEVRALAQRSADAAREIKSLVGSTKTQVEGGVRMVNRTQEAIGGVVRQVSGINDMIAEVSRHTADHAGELQ SVAGDIDEQQRQAGRNVADMGASASEADALHTVILELGRTVRAFRIARQEHFAAAAGADWQHSSFMARGDNRTDNSQNYQ QFRRQGVM >Mature_567_residues GMFVQKSLEGAGVLNLGQLAGFYRPSCCWGRRLGHDLPSDQARRSAGGSLAGRLRFAGLDEMQSDFLRNYRGMLEPYVKA GLRDVMTRFQSMPDCSPSFESENQLDRLHDLQSSHWSVLTDARFDALYAERVKVLSDNAGRMGLDPRWQIASHAVVLEHL LGGLVAEHAPRSILPGNRKKSRELADAVKNVVRLVMVDTEIAVSLRFNELRLRHGRELQEQRENDRSEAANLLGTALTAF AAGNLQARIGDDVPDAYRDVAATFNTALETIGASLIAAQNGVGEAEALSARFADIGRSIAERSRQQAEALTETSRALQVM IAHVAENGARISATEKAVSSARDAAVESGRAIGEAIDAMSDIEQSAEQIGRIIGTIDEIAFQTNLLALNAGIEAARAGDS GRGFAVVAQEVRALAQRSADAAREIKSLVGSTKTQVEGGVRMVNRTQEAIGGVVRQVSGINDMIAEVSRHTADHAGELQS VAGDIDEQQRQAGRNVADMGASASEADALHTVILELGRTVRAFRIARQEHFAAAAGADWQHSSFMARGDNRTDNSQNYQQ FRRQGVM
Specific function: Signal Transducer For Aerotaxis. The Aerotactic Response Is The Accumulation Of Cells Around Air Bubbles. The Nature Of The Sensory Stimulus Detected By This Protein Is The Proton Motive Force Or Cellular Redox State. It Uses A FAD Prosthetic Group As A R
COG id: NA
COG function: NA
Gene ontology:
Cell location: Integral Membrane. Inner Membrane [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 methyl-accepting transducer domain [H]
Homologues:
Organism=Escherichia coli, GI1789453, Length=187, Percent_Identity=41.1764705882353, Blast_Score=137, Evalue=2e-33
Organism=Escherichia coli, GI1788195, Length=228, Percent_Identity=38.1578947368421, Blast_Score=136, Evalue=4e-33
Organism=Escherichia coli, GI2367378, Length=236, Percent_Identity=38.135593220339, Blast_Score=133, Evalue=4e-32
Organism=Escherichia coli, GI1788194, Length=276, Percent_Identity=31.8840579710145, Blast_Score=125, Evalue=9e-30
Organism=Escherichia coli, GI1787690, Length=204, Percent_Identity=35.2941176470588, Blast_Score=110, Evalue=3e-25
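Homologue entries in the `Organism=..., GI..., Length=...` form can be turned into structured records for sorting or filtering. The following is a minimal sketch that assumes every entry carries the six fields in the order shown above; the `Homologue` type and `parse_homologues` function are hypothetical names introduced for illustration.

```python
import re
from typing import NamedTuple


class Homologue(NamedTuple):
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


# One match per homologue; works whether entries share a line or not.
_ENTRY = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),"
    r"\s*Percent_Identity=(?P<pid>[\d.]+),\s*Blast_Score=(?P<score>\d+),"
    r"\s*Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_homologues(text: str) -> list[Homologue]:
    """Extract every homologue entry found in text."""
    return [
        Homologue(
            organism=m["organism"].strip(),
            gi=m["gi"],
            length=int(m["length"]),
            percent_identity=float(m["pid"]),
            blast_score=int(m["score"]),
            evalue=float(m["evalue"]),
        )
        for m in _ENTRY.finditer(text)
    ]


sample = (
    "Organism=Escherichia coli, GI1789453, Length=187, "
    "Percent_Identity=41.1764705882353, Blast_Score=137, Evalue=2e-33, "
    "Organism=Escherichia coli, GI1788195, Length=228, "
    "Percent_Identity=38.1578947368421, Blast_Score=136, Evalue=4e-33,"
)
best = min(parse_homologues(sample), key=lambda h: h.evalue)
print(best.gi, best.blast_score)  # 1789453 137
```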
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR004090
- InterPro: IPR004089
- InterPro: IPR012292
- InterPro: IPR009050 [H]
Pfam domain/function: PF00015 MCPsignal [H]
EC number: NA
Molecular weight: Translated: 61453; Mature: 61321
Theoretical pI: Translated: 6.22; Mature: 6.22
Prosite motif: PS50885 HAMP ; PS50111 CHEMOTAXIS_TRANSDUC_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.5 %Cys (Translated Protein)
2.6 %Met (Translated Protein)
3.2 %Cys+Met (Translated Protein)
0.5 %Cys (Mature Protein)
2.5 %Met (Mature Protein)
3.0 %Cys+Met (Mature Protein)
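The Cys/Met percentages are simple residue frequencies rounded to one decimal place, which is why the combined figure need not equal the sum of the two rounded parts. The sketch below uses hypothetical helper names; the residue counts quoted in the comments (3 Cys and 15 Met in the 568-residue translated protein) are counts that reproduce the listed percentages, stated here as an assumption rather than as database output.

```python
def percent(count: int, total: int) -> float:
    """Frequency as a percentage, rounded to one decimal place."""
    return round(100 * count / total, 1)


def cys_met_percent(protein: str) -> tuple[float, float, float]:
    """Return (%Cys, %Met, %Cys+Met) for a one-letter protein sequence."""
    n = len(protein)
    cys = protein.count("C")
    met = protein.count("M")
    return percent(cys, n), percent(met, n), percent(cys + met, n)


# Toy sequence: 1 Cys and 1 Met in 10 residues.
print(cys_met_percent("MCAAAAAAAA"))  # (10.0, 10.0, 20.0)

# Assumed counts for the translated protein: 3 Cys, 15 Met, 568 residues.
print(percent(3, 568), percent(15, 568), percent(18, 568))  # 0.5 2.6 3.2
```

With the initiator methionine removed (14 Met in 567 residues), the same arithmetic gives 2.5 %Met and 3.0 %Cys+Met, in line with the mature-protein figures.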
Secondary structure:
>Translated Secondary Structure
MGMFVQKSLEGAGVLNLGQLAGFYRPSCCWGRRLGHDLPSDQARRSAGGSLAGRLRFAGL
CCCCHHHHCCCCCCCCHHHHHCCCCCCHHHHHHHCCCCCCHHHHHHCCCCHHHHHHHCCH
DEMQSDFLRNYRGMLEPYVKAGLRDVMTRFQSMPDCSPSFESENQLDRLHDLQSSHWSVL
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHCCCCHHH
TDARFDALYAERVKVLSDNAGRMGLDPRWQIASHAVVLEHLLGGLVAEHAPRSILPGNRK
HHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCH
KSRELADAVKNVVRLVMVDTEIAVSLRFNELRLRHGRELQEQRENDRSEAANLLGTALTA
HHHHHHHHHHHHHHHHHHCCHHEEEEEHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHH
FAAGNLQARIGDDVPDAYRDVAATFNTALETIGASLIAAQNGVGEAEALSARFADIGRSI
HHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHH
AERSRQQAEALTETSRALQVMIAHVAENGARISATEKAVSSARDAAVESGRAIGEAIDAM
HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH
SDIEQSAEQIGRIIGTIDEIAFQTNLLALNAGIEAARAGDSGRGFAVVAQEVRALAQRSA
HHHHHHHHHHHHHHHHHHHHHHHHHHEEEECCCHHHCCCCCCCCHHHHHHHHHHHHHHHH
DAAREIKSLVGSTKTQVEGGVRMVNRTQEAIGGVVRQVSGINDMIAEVSRHTADHAGELQ
HHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHH
SVAGDIDEQQRQAGRNVADMGASASEADALHTVILELGRTVRAFRIARQEHFAAAAGADW
HHHCCHHHHHHHHCCCHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
QHSSFMARGDNRTDNSQNYQQFRRQGVM
CCCHHHHCCCCCCCCCHHHHHHHHCCCC
>Mature Secondary Structure
GMFVQKSLEGAGVLNLGQLAGFYRPSCCWGRRLGHDLPSDQARRSAGGSLAGRLRFAGL
CCCHHHHCCCCCCCCHHHHHCCCCCCHHHHHHHCCCCCCHHHHHHCCCCHHHHHHHCCH
DEMQSDFLRNYRGMLEPYVKAGLRDVMTRFQSMPDCSPSFESENQLDRLHDLQSSHWSVL
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHCCCCHHH
TDARFDALYAERVKVLSDNAGRMGLDPRWQIASHAVVLEHLLGGLVAEHAPRSILPGNRK
HHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCH
KSRELADAVKNVVRLVMVDTEIAVSLRFNELRLRHGRELQEQRENDRSEAANLLGTALTA
HHHHHHHHHHHHHHHHHHCCHHEEEEEHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHH
FAAGNLQARIGDDVPDAYRDVAATFNTALETIGASLIAAQNGVGEAEALSARFADIGRSI
HHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHH
AERSRQQAEALTETSRALQVMIAHVAENGARISATEKAVSSARDAAVESGRAIGEAIDAM
HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH
SDIEQSAEQIGRIIGTIDEIAFQTNLLALNAGIEAARAGDSGRGFAVVAQEVRALAQRSA
HHHHHHHHHHHHHHHHHHHHHHHHHHEEEECCCHHHCCCCCCCCHHHHHHHHHHHHHHHH
DAAREIKSLVGSTKTQVEGGVRMVNRTQEAIGGVVRQVSGINDMIAEVSRHTADHAGELQ
HHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHH
SVAGDIDEQQRQAGRNVADMGASASEADALHTVILELGRTVRAFRIARQEHFAAAAGADW
HHHCCHHHHHHHHCCCHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
QHSSFMARGDNRTDNSQNYQQFRRQGVM
CCCHHHHCCCCCCCCCHHHHHHHHCCCC
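The secondary-structure listing alternates rows of residues with rows of per-residue states, where H, E and C are taken here to mean helix, strand and coil (an assumption based on the usual three-state convention; the record does not define them). A rough sketch of separating the two tracks and summarising the state composition follows, using hypothetical function names and toy rows rather than the record's data.

```python
from collections import Counter


def split_tracks(rows: list[str]) -> tuple[str, str]:
    """De-interleave alternating residue rows and state rows."""
    if len(rows) % 2 != 0:
        raise ValueError("expected an even number of rows")
    residues = "".join(rows[0::2])
    states = "".join(rows[1::2])
    if len(residues) != len(states):
        raise ValueError("residue and state tracks differ in length")
    return residues, states


def state_composition(states: str) -> dict[str, float]:
    """Percentage of each state symbol, rounded to one decimal place."""
    counts = Counter(states)
    return {s: round(100 * n / len(states), 1) for s, n in sorted(counts.items())}


# Toy rows in the same alternating layout (not from the record).
residues, states = split_tracks(["MGMF", "CCHH", "VQKS", "HHEE"])
print(residues)                   # MGMFVQKS
print(state_composition(states))  # {'C': 25.0, 'E': 25.0, 'H': 50.0}
```

Applied to the rows of one block above, with the `>` header line removed first, a predominantly H composition would be in keeping with the Alpha structure class assigned to this protein.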
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 7623670; 11481430 [H]