| Definition | Escherichia coli 55989, complete genome. |
|---|---|
| Accession | NC_011748 |
| Length | 5,154,862 |
The map label for this gene is tsr
Identifier: 218698193
GI number: 218698193
Start: 5104964
End: 5106628
Strand: Direct
Name: tsr
Synonym: EC55989_5017
Alternate gene names: 218698193
Gene position: 5104964-5106628 (Clockwise)
Preceding gene: 218698189
Following gene: 218698196
Centisome position: 99.03
GC content: 54.05
Gene sequence:
>1665_bases ATGTTAAAACGTATCAAAATTGTGACCAGCTTACTGCTGGTTTTGGCCGTTTTTGGCCTTTTACAACTGACATCAGGCGG TCTGTTCTTTAATGCCTTAAAGAATGACAAAGAAAATTTCACTGTTTTACAAACCATTCGCCAGCAGCAATCCACGCTGA ATGGCAGCTGGGTCGCGTTGTTGCAGACGCGTAACACCCTCAACCGCGCGGGTATCCGCTACATGATGGATCAGAATAAT ATTGGTAGCGGTTCAACCGTTGCTGAGCTGATGCAGAGTGCCAGTATTTCGCTGAAACAGGCGGAAAAAAACTGGGCGGA TTACGAAGCGTTGCCGCGTGACCCGCGTCAGAGCACCGCCGCAGCGGCAGAGATCAAACGTAATTACGATATTTATCACA ATGCGCTGGCGGAGCTGATCCAACTATTAGGTGCAGGCAAAATCAACGAGTTCTTTGATCAGCCGACCCAGGGATATCAG GACGGTTTCGAGAAGCAGTATGTAGCTTACATGGAGCAAGACGATCGGCTCTATGATATCGCCGTCAGCGATAACAATGC CTCCTACAGCCAGGCGATGTGGATTCTGGTGGGCGTGATGATCGTCGTACTGGCGGTCATCTTCGCCGTCTGGTTCGGTA TTAAAGCCTCGCTGGTAGCGCCAATGAATCGCCTGATTGACAGCATTCGTCATATTGCAGGCGGCGATCTGGTGAAACCG ATTGAGGTGGATGGCTCTAATGAGATGGGGCAACTGGCAGAGAGTTTGCGCCATATGCAGGGAGAGCTGATGCGTACCGT CGGTGATGTGCGCAACGGGGCCAATGCCATCTATAGCGGTGCCAGCGAAATCGCCACCGGCAATAACGATCTCTCTTCGC GCACCGAGCAACAGGCCGCTTCGCTGGAAGAGACGGCAGCCAGCATGGAGCAACTGACCGCAACGGTGAAACAGAACGCC GAGAATGCGCGTCAGGCCAGCCATCTGGCGTTAAGTGCTTCTGAAACGGCGCAACGCGGCGGCAAAGTGGTGGATAACGT AGTACAGACCATGCGCGATATCTCCACCAGTTCGCAGAAAATCGCCGATATTATCAGCGTAATTGACGGCATTGCCTTCC AGACCAATATTCTGGCTTTGAACGCGGCGGTTGAAGCAGCGCGCGCGGGTGAGCAAGGGCGCGGTTTTGCGGTGGTCGCG GGTGAAGTGCGTAATCTGGCTCAGCGCAGCGCTCAGGCGGCTCGTGAAATTAAAAGCCTGATTGAAGACTCGGTGGGGAA AGTAGATGTTGGCTCTACGCTGGTCGAAAGCGCAGGGGAAACAATGGCGGAGATTGTCAGTGCCGTGACCCGCGTGACGG ACATTATGGGCGAAATTGCTTCTGCTTCTGATGAGCAGAGCCGTGGTATCGATCAGGTTGGCTTAGCGGTTGCTGAGATG GACCGGGTAACTCAACAGAACGCTGCGCTGGTGGAAGAATCTGCCGCTGCCGCCGCCGCGCTGGAAGAGCAGGCCAGTCG CCTGACCGAAGCTGTGGCAGTGTTCCGGATTCAGCAACAGCAGCAACAGCAGCGTGAAACATCGGCTGTGGTAAAAAACG TGACGCCAGCTACGCCGCGTAAAATGGCAGTGGCAGATAGCGGGGAGAACTGGGAAACGTTTTAA
Upstream 100 bases:
>100_bases AGTGTGAATAAAATTACTCGGCGTAATCTCCGCGGGATATTCATAAAGTTTTTCCTTTCCAGGCCGAAAATCTTGCATCG GTCCACAGGAAAGAGAAACT
Downstream 100 bases:
>100_bases TCGCCATAAAAATGCCCGATAAGCAAAATGTTATCGGGCATAGGGAGTTTAATCTTTACGCGGGTCGTTGATCGGCTGAC GAACCAGGAAGATGTACGCC
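The coordinate-derived fields above (gene length, centisome position, GC content) can be re-derived from the record itself. The following is a minimal sketch, not the database's own method. It assumes 1-based inclusive coordinates and assumes the centisome position is the gene start expressed as a percentage of genome length, which is consistent with the values shown here. The function names are hypothetical.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace in the sequence is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


# Values taken from this record: Start, End, and genome Length.
start, end, genome_length = 5104964, 5106628, 5154862
print(gene_length(start, end))                    # 1665
print(round(centisome(start, genome_length), 2))  # 99.03
```

Passing the gene sequence string (with or without the embedded spaces) to `gc_content` gives a value that can be compared against the GC content field.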
Product: methyl-accepting chemotaxis protein I, serine sensor receptor
Products: NA
Alternate protein names: MCP-I; Serine chemoreceptor protein [H]
Number of amino acids: Translated: 554; Mature: 554
Protein sequence:
>554_residues MLKRIKIVTSLLLVLAVFGLLQLTSGGLFFNALKNDKENFTVLQTIRQQQSTLNGSWVALLQTRNTLNRAGIRYMMDQNN IGSGSTVAELMQSASISLKQAEKNWADYEALPRDPRQSTAAAAEIKRNYDIYHNALAELIQLLGAGKINEFFDQPTQGYQ DGFEKQYVAYMEQDDRLYDIAVSDNNASYSQAMWILVGVMIVVLAVIFAVWFGIKASLVAPMNRLIDSIRHIAGGDLVKP IEVDGSNEMGQLAESLRHMQGELMRTVGDVRNGANAIYSGASEIATGNNDLSSRTEQQAASLEETAASMEQLTATVKQNA ENARQASHLALSASETAQRGGKVVDNVVQTMRDISTSSQKIADIISVIDGIAFQTNILALNAAVEAARAGEQGRGFAVVA GEVRNLAQRSAQAAREIKSLIEDSVGKVDVGSTLVESAGETMAEIVSAVTRVTDIMGEIASASDEQSRGIDQVGLAVAEM DRVTQQNAALVEESAAAAAALEEQASRLTEAVAVFRIQQQQQQQRETSAVVKNVTPATPRKMAVADSGENWETF
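The 1665-base gene corresponds to 555 codons, that is, 554 residues plus a stop codon. A minimal sketch of how the protein sequence could be checked against the gene sequence follows. It assumes the standard genetic code (whose codon assignments are the same as those of the bacterial table) and uses a hypothetical `translate` helper written here for illustration only.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption:
# the record's translation uses these codon assignments).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace is ignored; a trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGTTAAAACGTATCAAAATTGTGACCAGC"))  # MLKRIKIVTS
```

Translating the full gene sequence and comparing the result with the 554-residue protein sequence is a quick consistency check on the record.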
Sequences:
>Translated_554_residues MLKRIKIVTSLLLVLAVFGLLQLTSGGLFFNALKNDKENFTVLQTIRQQQSTLNGSWVALLQTRNTLNRAGIRYMMDQNN IGSGSTVAELMQSASISLKQAEKNWADYEALPRDPRQSTAAAAEIKRNYDIYHNALAELIQLLGAGKINEFFDQPTQGYQ DGFEKQYVAYMEQDDRLYDIAVSDNNASYSQAMWILVGVMIVVLAVIFAVWFGIKASLVAPMNRLIDSIRHIAGGDLVKP IEVDGSNEMGQLAESLRHMQGELMRTVGDVRNGANAIYSGASEIATGNNDLSSRTEQQAASLEETAASMEQLTATVKQNA ENARQASHLALSASETAQRGGKVVDNVVQTMRDISTSSQKIADIISVIDGIAFQTNILALNAAVEAARAGEQGRGFAVVA GEVRNLAQRSAQAAREIKSLIEDSVGKVDVGSTLVESAGETMAEIVSAVTRVTDIMGEIASASDEQSRGIDQVGLAVAEM DRVTQQNAALVEESAAAAAALEEQASRLTEAVAVFRIQQQQQQQRETSAVVKNVTPATPRKMAVADSGENWETF
>Mature_554_residues MLKRIKIVTSLLLVLAVFGLLQLTSGGLFFNALKNDKENFTVLQTIRQQQSTLNGSWVALLQTRNTLNRAGIRYMMDQNN IGSGSTVAELMQSASISLKQAEKNWADYEALPRDPRQSTAAAAEIKRNYDIYHNALAELIQLLGAGKINEFFDQPTQGYQ DGFEKQYVAYMEQDDRLYDIAVSDNNASYSQAMWILVGVMIVVLAVIFAVWFGIKASLVAPMNRLIDSIRHIAGGDLVKP IEVDGSNEMGQLAESLRHMQGELMRTVGDVRNGANAIYSGASEIATGNNDLSSRTEQQAASLEETAASMEQLTATVKQNA ENARQASHLALSASETAQRGGKVVDNVVQTMRDISTSSQKIADIISVIDGIAFQTNILALNAAVEAARAGEQGRGFAVVA GEVRNLAQRSAQAAREIKSLIEDSVGKVDVGSTLVESAGETMAEIVSAVTRVTDIMGEIASASDEQSRGIDQVGLAVAEM DRVTQQNAALVEESAAAAAALEEQASRLTEAVAVFRIQQQQQQQRETSAVVKNVTPATPRKMAVADSGENWETF
Specific function: Chemotactic-signal transducers respond to changes in the concentration of attractants and repellents in the environment, transduce a signal from the outside to the inside of the cell, and facilitate sensory adaptation through the variation of the level of
COG id: COG0840
COG function: function code NT; Methyl-accepting chemotaxis protein
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 methyl-accepting transducer domain [H]
Homologues:
- Organism=Escherichia coli, GI2367378, Length=554, Percent_Identity=98.3754512635379, Blast_Score=1053, Evalue=0.0
- Organism=Escherichia coli, GI1788195, Length=556, Percent_Identity=58.273381294964, Blast_Score=573, Evalue=1e-164
- Organism=Escherichia coli, GI1788194, Length=531, Percent_Identity=46.7043314500942, Blast_Score=432, Evalue=1e-122
- Organism=Escherichia coli, GI1787690, Length=452, Percent_Identity=47.3451327433628, Blast_Score=362, Evalue=1e-101
- Organism=Escherichia coli, GI1789453, Length=324, Percent_Identity=40.7407407407407, Blast_Score=224, Evalue=1e-59
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR004090
- InterPro: IPR003122
- InterPro: IPR004091
- InterPro: IPR004089
- InterPro: IPR003660 [H]
Pfam domain/function: PF00672 HAMP; PF00015 MCPsignal; PF02203 TarH [H]
EC number: NA
Molecular weight: Translated: 59825; Mature: 59825
Theoretical pI: Translated: 4.60; Mature: 4.60
Prosite motif: PS50885 HAMP ; PS00538 CHEMOTAXIS_TRANSDUC_1 ; PS50111 CHEMOTAXIS_TRANSDUC_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.0 %Cys (Translated Protein)
3.1 %Met (Translated Protein)
3.1 %Cys+Met (Translated Protein)
0.0 %Cys (Mature Protein)
3.1 %Met (Mature Protein)
3.1 %Cys+Met (Mature Protein)
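The Cys/Met percentages above are simple residue frequencies. A minimal sketch of that calculation follows, assuming the percentage is the count of the named residues divided by the total number of residues. The `residue_percent` helper is hypothetical and is not part of the database.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`.

    Whitespace in the sequence is ignored; letters are compared case-insensitively.
    """
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    wanted = set(residues.upper())
    return 100.0 * sum(aa in wanted for aa in seq) / len(seq)


# Toy sequence for illustration (not from this record): 2 Met and 1 Cys in 4 residues.
toy = "MACM"
print(residue_percent(toy, "C"))   # 25.0
print(residue_percent(toy, "M"))   # 50.0
print(residue_percent(toy, "CM"))  # 75.0
```

Applying the same function to the 554-residue protein sequence and rounding to one decimal place gives figures that can be compared with the values listed in this record.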
Secondary structure:
>Translated Secondary Structure
MLKRIKIVTSLLLVLAVFGLLQLTSGGLFFNALKNDKENFTVLQTIRQQQSTLNGSWVAL
CCHHHHHHHHHHHHHHHHHHHHHHCCCCEEHHHHCCCCHHHHHHHHHHHHHHCCCCCEEE
LQTRNTLNRAGIRYMMDQNNIGSGSTVAELMQSASISLKQAEKNWADYEALPRDPRQSTA
HHHHHHHHHHCCEEEECCCCCCCCHHHHHHHHHHCCHHHHHHCCCCCCCCCCCCCCHHHH
AAAEIKRNYDIYHNALAELIQLLGAGKINEFFDQPTQGYQDGFEKQYVAYMEQDDRLYDI
HHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHCCCCCHHHHHHHHHHHHHHCCCCCEEEE
AVSDNNASYSQAMWILVGVMIVVLAVIFAVWFGIKASLVAPMNRLIDSIRHIAGGDLVKP
EEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC
IEVDGSNEMGQLAESLRHMQGELMRTVGDVRNGANAIYSGASEIATGNNDLSSRTEQQAA
EECCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHCCHHHHCCCCCCHHHHHHHHHH
SLEETAASMEQLTATVKQNAENARQASHLALSASETAQRGGKVVDNVVQTMRDISTSSQK
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHCCHHHHHHHHHHHHHCCHHHH
IADIISVIDGIAFQTNILALNAAVEAARAGEQGRGFAVVAGEVRNLAQRSAQAAREIKSL
HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHH
IEDSVGKVDVGSTLVESAGETMAEIVSAVTRVTDIMGEIASASDEQSRGIDQVGLAVAEM
HHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHCCHHHHHHHHHHH
DRVTQQNAALVEESAAAAAALEEQASRLTEAVAVFRIQQQQQQQRETSAVVKNVTPATPR
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC
KMAVADSGENWETF
CEEECCCCCCCCCC
>Mature Secondary Structure
MLKRIKIVTSLLLVLAVFGLLQLTSGGLFFNALKNDKENFTVLQTIRQQQSTLNGSWVAL
CCHHHHHHHHHHHHHHHHHHHHHHCCCCEEHHHHCCCCHHHHHHHHHHHHHHCCCCCEEE
LQTRNTLNRAGIRYMMDQNNIGSGSTVAELMQSASISLKQAEKNWADYEALPRDPRQSTA
HHHHHHHHHHCCEEEECCCCCCCCHHHHHHHHHHCCHHHHHHCCCCCCCCCCCCCCHHHH
AAAEIKRNYDIYHNALAELIQLLGAGKINEFFDQPTQGYQDGFEKQYVAYMEQDDRLYDI
HHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHCCCCCHHHHHHHHHHHHHHCCCCCEEEE
AVSDNNASYSQAMWILVGVMIVVLAVIFAVWFGIKASLVAPMNRLIDSIRHIAGGDLVKP
EEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC
IEVDGSNEMGQLAESLRHMQGELMRTVGDVRNGANAIYSGASEIATGNNDLSSRTEQQAA
EECCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHCCHHHHCCCCCCHHHHHHHHHH
SLEETAASMEQLTATVKQNAENARQASHLALSASETAQRGGKVVDNVVQTMRDISTSSQK
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHCCHHHHHHHHHHHHHCCHHHH
IADIISVIDGIAFQTNILALNAAVEAARAGEQGRGFAVVAGEVRNLAQRSAQAAREIKSL
HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHH
IEDSVGKVDVGSTLVESAGETMAEIVSAVTRVTDIMGEIASASDEQSRGIDQVGLAVAEM
HHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHCCHHHHHHHHHHH
DRVTQQNAALVEESAAAAAALEEQASRLTEAVAVFRIQQQQQQQRETSAVVKNVTPATPR
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC
KMAVADSGENWETF
CEEECCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 6402709; 7610040; 9278503; 8384293; 6213619; 2033064; 10466731 [H]