| Definition | Escherichia coli O157:H7 str. EC4115, complete genome. |
|---|---|
| Accession | NC_011353 |
| Length | 5,572,075 |
Click here to switch to the map view.
The map label for this gene is tsr [H]
Identifier: 209396680
GI number: 209396680
Start: 5523631
End: 5525295
Strand: Direct
Name: tsr [H]
Synonym: ECH74115_5868
Alternate gene names: 209396680
Gene position: 5523631-5525295 (Clockwise)
Preceding gene: 209397187
Following gene: 209397114
Centisome position: 99.13
GC content: 54.05
Gene sequence:
>1665_bases ATGTTAAAACGTATCAAGATCGTGACCAGCTTACTGCTGGTTTTGGCCGTTTTTGGCCTTTTACAACTGACATCAGGCGG TCTGTTCTTTAATGCCTTAAAGAATGACAAAGAAAATTTCACTGTTTTACAAACCATTCGCCAGCAGCAATCCACGCTGA ATGGCAGCTGGGTCGCGTTGTTGCAGACGCGTAACACCCTCAACCGCGCGGGTATCCGCTACATGATGGATCAGAATAAT ATTGGTAGCGGTTCAACCGTTGCTGAGCTGATGCAGAGTGCCAGTATTTCGTTGAAACAGGCGGAAAAAAACTGGGCAGA TTACGAAGCGTTGCCGCGTGACCCGCGTCAGAGCACCGCCGCAGCGGCAGAGATCAAACGTAATTACGATATTTATCACA ATGCGCTGGCGGAGCTGATCCAACTGTTAGGTGCAGGCAAAATCAACGAGTTCTTTGATCAGCCGACCCAGGGATATCAG GACGGTTTCGAGAAGCAGTATGTGGCTTACATGGAGCAAAACGATCGGCTCTATGATATCGCCGTCAGCGATAACAATGC CTCCTACAGCCAGGCGATGTGGATTCTGGTGGGCGTGATGATCGTCGTACTGGCGGTCATCTTCGCCGTCTGGTTCGGTA TTAAAGCCTCGCTGGTAGCGCCAATGAATCGCCTGATTGACAGCATTCGTCATATTGCAGGCGGCGATCTGGTGAAACCG ATTGAGGTGGATGGCTCTAATGAGATGGGGCAACTGGCAGAGAGTTTGCGCCATATGCAGGGAGAGCTGATGCGTACCGT CGGTGATGTGCGCAACGGGGCCAATGCCATCTATAGCGGTGCCAGCGAAATCGCTACCGGCAATAACGATCTCTCTTCGC GCACCGAGCAACAGGCCGCTTCGCTGGAAGAGACGGCAGCCAGCATGGAGCAACTCACCGCAACGGTGAAACAGAACGCC GAGAATGCGCGCCAGGCCAGCCACCTGGCGTTAAGTGCTTCTGAAACGGCGCAACGCGGCGGCAAAGTGGTAGATAACGT GGTGCAGACTATGCGTGATATCTCCACCAGTTCGCAGAAAATCGCCGATATTATCAGCGTAATTGACGGCATTGCCTTCC AGACCAATATTCTGGCTTTGAACGCGGCGGTTGAAGCAGCGCGTGCGGGTGAGCAAGGACGTGGTTTTGCGGTGGTTGCG GGAGAAGTGCGTAATCTGGCCCAGCGTAGCGCCCAGGCGGCTCGTGAAATTAAAAGCCTGATTGAAGACTCGGTGGGGAA AGTGGATGTTGGCTCTACGCTGGTCGAAAGCGCCGGGGAAACAATGGCGGAGATTGTCAGTGCTGTGACCCGCGTGACGG ACATTATGGGCGAAATAGCTTCTGCTTCTGATGAGCAGAGCCGTGGTATCGATCAGGTTGGCTTAGCGGTTGCTGAGATG GACCGGGTAACTCAACAGAACGCTGCGCTGGTGGAAGAGTCTGCCGCTGCCGCCGCCGCGTTGGAAGAGCAGGCCAGTCG CCTGACCGAAGCTGTGGCAGTGTTCCGGATTCAGCAACAGCAGCAACAGCAGCGTGAAACATCGGCTGTGGTAAAAACCG TGACGCCAGCTACGCCGCGTAAAATGGCAGTGGCAGATAGCGGGGAGAACTGGGAAACGTTTTAA
Upstream 100 bases:
>100_bases AAAAAAAGTGAGGTAAAATTAACCAGTTTCAAGTCGCTACCTATAAAGTTTTCTCATTTCAGGCCGAAAATTTTGCATCG GTCCACAGGAAAGAGAAACC
Downstream 100 bases:
>100_bases TCGCCATAAAAATGCCCGATAAGCAAAATGTTATCGGGCATAGGGAGTTTAATCTTTACGCGGGTCGTTGATCGGCTGAC GAACCAGGAAGATGTACGCC
Product: methyl-accepting chemotaxis protein I
Products: NA
Alternate protein names: MCP-I; Serine chemoreceptor protein [H]
Number of amino acids: Translated: 554; Mature: 554
Protein sequence:
>554_residues MLKRIKIVTSLLLVLAVFGLLQLTSGGLFFNALKNDKENFTVLQTIRQQQSTLNGSWVALLQTRNTLNRAGIRYMMDQNN IGSGSTVAELMQSASISLKQAEKNWADYEALPRDPRQSTAAAAEIKRNYDIYHNALAELIQLLGAGKINEFFDQPTQGYQ DGFEKQYVAYMEQNDRLYDIAVSDNNASYSQAMWILVGVMIVVLAVIFAVWFGIKASLVAPMNRLIDSIRHIAGGDLVKP IEVDGSNEMGQLAESLRHMQGELMRTVGDVRNGANAIYSGASEIATGNNDLSSRTEQQAASLEETAASMEQLTATVKQNA ENARQASHLALSASETAQRGGKVVDNVVQTMRDISTSSQKIADIISVIDGIAFQTNILALNAAVEAARAGEQGRGFAVVA GEVRNLAQRSAQAAREIKSLIEDSVGKVDVGSTLVESAGETMAEIVSAVTRVTDIMGEIASASDEQSRGIDQVGLAVAEM DRVTQQNAALVEESAAAAAALEEQASRLTEAVAVFRIQQQQQQQRETSAVVKTVTPATPRKMAVADSGENWETF
Sequences:
>Translated_554_residues MLKRIKIVTSLLLVLAVFGLLQLTSGGLFFNALKNDKENFTVLQTIRQQQSTLNGSWVALLQTRNTLNRAGIRYMMDQNN IGSGSTVAELMQSASISLKQAEKNWADYEALPRDPRQSTAAAAEIKRNYDIYHNALAELIQLLGAGKINEFFDQPTQGYQ DGFEKQYVAYMEQNDRLYDIAVSDNNASYSQAMWILVGVMIVVLAVIFAVWFGIKASLVAPMNRLIDSIRHIAGGDLVKP IEVDGSNEMGQLAESLRHMQGELMRTVGDVRNGANAIYSGASEIATGNNDLSSRTEQQAASLEETAASMEQLTATVKQNA ENARQASHLALSASETAQRGGKVVDNVVQTMRDISTSSQKIADIISVIDGIAFQTNILALNAAVEAARAGEQGRGFAVVA GEVRNLAQRSAQAAREIKSLIEDSVGKVDVGSTLVESAGETMAEIVSAVTRVTDIMGEIASASDEQSRGIDQVGLAVAEM DRVTQQNAALVEESAAAAAALEEQASRLTEAVAVFRIQQQQQQQRETSAVVKTVTPATPRKMAVADSGENWETF >Mature_554_residues MLKRIKIVTSLLLVLAVFGLLQLTSGGLFFNALKNDKENFTVLQTIRQQQSTLNGSWVALLQTRNTLNRAGIRYMMDQNN IGSGSTVAELMQSASISLKQAEKNWADYEALPRDPRQSTAAAAEIKRNYDIYHNALAELIQLLGAGKINEFFDQPTQGYQ DGFEKQYVAYMEQNDRLYDIAVSDNNASYSQAMWILVGVMIVVLAVIFAVWFGIKASLVAPMNRLIDSIRHIAGGDLVKP IEVDGSNEMGQLAESLRHMQGELMRTVGDVRNGANAIYSGASEIATGNNDLSSRTEQQAASLEETAASMEQLTATVKQNA ENARQASHLALSASETAQRGGKVVDNVVQTMRDISTSSQKIADIISVIDGIAFQTNILALNAAVEAARAGEQGRGFAVVA GEVRNLAQRSAQAAREIKSLIEDSVGKVDVGSTLVESAGETMAEIVSAVTRVTDIMGEIASASDEQSRGIDQVGLAVAEM DRVTQQNAALVEESAAAAAALEEQASRLTEAVAVFRIQQQQQQQRETSAVVKTVTPATPRKMAVADSGENWETF
Specific function: Chemotactic-signal transducers respond to changes in the concentration of attractants and repellents in the environment, transduce a signal from the outside to the inside of the cell, and facilitate sensory adaptation through the variation of the level of
COG id: COG0840
COG function: function code NT; Methyl-accepting chemotaxis protein
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein [H]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 methyl-accepting transducer domain [H]
Homologues:
Organism=Escherichia coli, GI2367378, Length=554, Percent_Identity=98.7364620938628, Blast_Score=1058, Evalue=0.0, Organism=Escherichia coli, GI1788195, Length=556, Percent_Identity=58.273381294964, Blast_Score=573, Evalue=1e-165, Organism=Escherichia coli, GI1788194, Length=531, Percent_Identity=46.7043314500942, Blast_Score=432, Evalue=1e-122, Organism=Escherichia coli, GI1787690, Length=452, Percent_Identity=47.3451327433628, Blast_Score=362, Evalue=1e-101, Organism=Escherichia coli, GI1789453, Length=324, Percent_Identity=40.7407407407407, Blast_Score=224, Evalue=1e-59,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR004090 - InterPro: IPR003122 - InterPro: IPR004091 - InterPro: IPR004089 - InterPro: IPR003660 [H]
Pfam domain/function: PF00672 HAMP; PF00015 MCPsignal; PF02203 TarH [H]
EC number: NA
Molecular weight: Translated: 59811; Mature: 59811
Theoretical pI: Translated: 4.63; Mature: 4.63
Prosite motif: PS50885 HAMP ; PS00538 CHEMOTAXIS_TRANSDUC_1 ; PS50111 CHEMOTAXIS_TRANSDUC_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.0 %Cys (Translated Protein) 3.1 %Met (Translated Protein) 3.1 %Cys+Met (Translated Protein) 0.0 %Cys (Mature Protein) 3.1 %Met (Mature Protein) 3.1 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MLKRIKIVTSLLLVLAVFGLLQLTSGGLFFNALKNDKENFTVLQTIRQQQSTLNGSWVAL CCHHHHHHHHHHHHHHHHHHHHHHCCCCEEHHHHCCCCHHHHHHHHHHHHHHCCCCCEEE LQTRNTLNRAGIRYMMDQNNIGSGSTVAELMQSASISLKQAEKNWADYEALPRDPRQSTA HHHHHHHHHHCCEEEECCCCCCCCHHHHHHHHHHCCHHHHHHCCCCCCCCCCCCCCHHHH AAAEIKRNYDIYHNALAELIQLLGAGKINEFFDQPTQGYQDGFEKQYVAYMEQNDRLYDI HHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHCCCCCHHHHHHHHHHHHHHHCCCCEEEE AVSDNNASYSQAMWILVGVMIVVLAVIFAVWFGIKASLVAPMNRLIDSIRHIAGGDLVKP EEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC IEVDGSNEMGQLAESLRHMQGELMRTVGDVRNGANAIYSGASEIATGNNDLSSRTEQQAA EECCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHCCHHHHCCCCCCHHHHHHHHHH SLEETAASMEQLTATVKQNAENARQASHLALSASETAQRGGKVVDNVVQTMRDISTSSQK HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHCCHHHHHHHHHHHHHCCHHHH IADIISVIDGIAFQTNILALNAAVEAARAGEQGRGFAVVAGEVRNLAQRSAQAAREIKSL HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHH IEDSVGKVDVGSTLVESAGETMAEIVSAVTRVTDIMGEIASASDEQSRGIDQVGLAVAEM HHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHCCHHHHHHHHHHH DRVTQQNAALVEESAAAAAALEEQASRLTEAVAVFRIQQQQQQQRETSAVVKTVTPATPR HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC KMAVADSGENWETF CEEECCCCCCCCCC >Mature Secondary Structure MLKRIKIVTSLLLVLAVFGLLQLTSGGLFFNALKNDKENFTVLQTIRQQQSTLNGSWVAL CCHHHHHHHHHHHHHHHHHHHHHHCCCCEEHHHHCCCCHHHHHHHHHHHHHHCCCCCEEE LQTRNTLNRAGIRYMMDQNNIGSGSTVAELMQSASISLKQAEKNWADYEALPRDPRQSTA HHHHHHHHHHCCEEEECCCCCCCCHHHHHHHHHHCCHHHHHHCCCCCCCCCCCCCCHHHH AAAEIKRNYDIYHNALAELIQLLGAGKINEFFDQPTQGYQDGFEKQYVAYMEQNDRLYDI HHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHCCCCCHHHHHHHHHHHHHHHCCCCEEEE AVSDNNASYSQAMWILVGVMIVVLAVIFAVWFGIKASLVAPMNRLIDSIRHIAGGDLVKP EEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC IEVDGSNEMGQLAESLRHMQGELMRTVGDVRNGANAIYSGASEIATGNNDLSSRTEQQAA EECCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHCCHHHHCCCCCCHHHHHHHHHH SLEETAASMEQLTATVKQNAENARQASHLALSASETAQRGGKVVDNVVQTMRDISTSSQK HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHCCHHHHHHHHHHHHHCCHHHH IADIISVIDGIAFQTNILALNAAVEAARAGEQGRGFAVVAGEVRNLAQRSAQAAREIKSL HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHH IEDSVGKVDVGSTLVESAGETMAEIVSAVTRVTDIMGEIASASDEQSRGIDQVGLAVAEM HHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHCCHHHHHHHHHHH DRVTQQNAALVEESAAAAAALEEQASRLTEAVAVFRIQQQQQQQRETSAVVKTVTPATPR HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC KMAVADSGENWETF CEEECCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 6402709; 7610040; 9278503; 8384293; 6213619; 2033064; 10466731 [H]