| Definition | Shigella flexneri 2a str. 2457T, complete genome. |
|---|---|
| Accession | NC_004741 |
| Length | 4,599,354 |
The map label for this gene is tsr
Identifier: 30065597
GI number: 30065597
Start: 4550098
End: 4551762
Strand: Direct
Name: tsr
Synonym: S4656
Alternate gene names: 30065597
Gene position: 4550098-4551762 (Clockwise)
Preceding gene: 30065596
Following gene: 30065605
Centisome position: 98.93
GC content: 53.99
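The coordinate-derived fields above can be checked with a short sketch. It assumes that Start and End are 1-based inclusive coordinates and that the centisome position is the gene start expressed as a percentage of the genome length; the record does not state either definition, but both reproduce the listed values. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of genome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Whitespace and any other characters are ignored, so the
    space-separated sequence chunks in this record can be passed as-is.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    gc = sum(1 for b in bases if b in "GC")
    return round(100.0 * gc / len(bases), 2)


if __name__ == "__main__":
    print(gene_length(4550098, 4551762))   # gene span in bases
    print(centisome(4550098, 4599354))     # position on a 0-100 scale
```

Passing the gene sequence string to `gc_percent` should give a figure comparable to the GC content field, provided the database computes it the same way.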
Gene sequence:
>1665_bases ATGTTAAAACGTATCAAGATCGTGACCAGCTTACTGCTGGTTTTGGCCGTTTTTGGCCTTTTACAACTGACATCAGGCGG TCTGTTTTTTAATGCCTTAAAGAATGACAAAGAAAATTTCACTGTTTTACAAACCATTCGCCAGCAGCAACCCACGCTGA ATGGCAGCTGGGTCGCGTTGTTGCAGACGCGTAACACCCTCAACCGCGCGGGTATCCGCTACATGATGGATCAGAATAAT ATTGGTAGCGGTTCAACCGTTGCTGAGCTGATGCAGAGTGCCAGTATTTCGCTGAAACAGGCGGAAAAAAACTGGGCGGA TTACGAAGCGTTGCCGCGTGACCCGCGTCAGAGCACCGCCGCAGCGGCAGAGATCAAACGTAATTACGATATTTATCACA ATGCGCTGGCGGAGCTGATCCAACTGTTAGGTGCAGGCAAAATCAACGAGTTCTTTGATCAGCCGACCCAGGGATATCAG GACGGTTTCGAGAAGCAGTATGTGGCTTACATGGAGCAAAACGATCGGCTCTATGATATCGCCGTCAGCGATAACAATGC CTCCTACAGCCAGGCGATGTGGATTCTGGTGGGCGTGATGATCGTCGTACTGGCGGTCATCTTCGCCGTCTGGTTCGGTA TTAAAGCCTCGCTGGTAGCGCCAATGAATCGCCTGATTGACAGCATTCGTCATATTGCAGGCGGCGATCTGGTGAAACCG ATTGAGGTGGATGGCTCTAATGAGATGGGGCAACTGGCAGAGAGTTTGCGCCATATGCAGGGAGAGCTGATGCGTACCGT CGGTGATGTGCGCAACGGGGCCAATGCCATCTATAGCGGTGCCAGCGAAATCGCTACCGGCAATAATGATCTCTCTTCGC GCACCGAGCAACAGGCCGCTTCGCTGGAAGAGACGGCAGCCAGCATGGAGCAACTGACCGCAACGGTGAAACAGAACGCC GAGAATGCGCGCCAGGCCAGCCATCTGGCATTAAGTGCTTCTGAAACGGCGCAACGCGGCGGCAAAGTGGTAGATAACGT GGTGCAGACTATGCGCGATATCTCCACCAGTTCGCAGAAAATCGCCGATATTATCAGCGTAATTGACGGCATTACCTTCC AGACCAATATTCTGGCTTTGAACGCGGCGGTTGAAGCAGCGCGTGCGGGTGAGCAAGGACGTGGTTTTGCGGTGGTTGCG GGAGAAGTGCGTAATCTGGCCCAGCGTAGCGCCCAGGCGGTTCGTGAAATTAAAAGCCTGATTGAAGACTCGGTGGGGAA AGTGGATGTTGGCTCTACGCTGGTCGAAAGCGCCGGGGAAACAATGGCGGAGATTGTCAGTGCTGTGACCCGCGTGACGG ACATTATGGGCGAAATAGCTTCTGCTTCTGATGAGCAGAGCCGTGGTATCGATCAGGTTGGCTTAGCGGTTGCTGAGATG GACCGGGTAACTCAACAGAACGCCGCGCTGGTGGAAGAATCTGCCGCTGCCGCCGCCGCGCTGGAAGAGCAGGCCAGTCG CCTGACCGAAGCTGTGGCAGTGTTCCGGATTCAGCAACAGCAGCAACAGCAGCGTGAAACATCGGCTGTGGTAAAAACCG TGACGCCAGCTACGCCGCGTAAAATGGCAGTGGCAGATAGCGGGGAGAACTGGGAAACGTTTTAA
Upstream 100 bases:
>100_bases CCAGCGCCTGGTTTTTCCATGGATGGCGGGTTACACCTTTTCATAAAGTTTTTGCTTTCCAGGCCGAAAATCTTGCATCG GTCCACAGGAAAGAGAAACC
Downstream 100 bases:
>100_bases TCGCCATAAAAATGCCCGATAAGCAAAATGTTATCGGGCATAGGGAGTTTAATCTTTACGCGGGTCGTTGATCGGCTGGC GAACCAGGAAGATGTACGCC
Product: methyl-accepting chemotaxis protein I, serine sensor receptor
Products: NA
Alternate protein names: MCP-I; Serine chemoreceptor protein [H]
Number of amino acids: Translated: 554; Mature: 554
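The residue count follows from the gene length: 1665 bases make 555 codons, and the final codon (TAA in the gene sequence above) is a stop. A minimal sketch of that arithmetic, with `protein_length` as a hypothetical helper name:

```python
def protein_length(cds_bases: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by a coding sequence.

    Assumes the CDS length is a whole number of codons and, by
    default, that the last codon is a stop codon that is not translated.
    """
    if cds_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_bases // 3
    return codons - 1 if has_stop else codons


if __name__ == "__main__":
    print(protein_length(1665))
```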
Protein sequence:
>554_residues MLKRIKIVTSLLLVLAVFGLLQLTSGGLFFNALKNDKENFTVLQTIRQQQPTLNGSWVALLQTRNTLNRAGIRYMMDQNN IGSGSTVAELMQSASISLKQAEKNWADYEALPRDPRQSTAAAAEIKRNYDIYHNALAELIQLLGAGKINEFFDQPTQGYQ DGFEKQYVAYMEQNDRLYDIAVSDNNASYSQAMWILVGVMIVVLAVIFAVWFGIKASLVAPMNRLIDSIRHIAGGDLVKP IEVDGSNEMGQLAESLRHMQGELMRTVGDVRNGANAIYSGASEIATGNNDLSSRTEQQAASLEETAASMEQLTATVKQNA ENARQASHLALSASETAQRGGKVVDNVVQTMRDISTSSQKIADIISVIDGITFQTNILALNAAVEAARAGEQGRGFAVVA GEVRNLAQRSAQAVREIKSLIEDSVGKVDVGSTLVESAGETMAEIVSAVTRVTDIMGEIASASDEQSRGIDQVGLAVAEM DRVTQQNAALVEESAAAAAALEEQASRLTEAVAVFRIQQQQQQQRETSAVVKTVTPATPRKMAVADSGENWETF
Sequences:
>Translated_554_residues MLKRIKIVTSLLLVLAVFGLLQLTSGGLFFNALKNDKENFTVLQTIRQQQPTLNGSWVALLQTRNTLNRAGIRYMMDQNN IGSGSTVAELMQSASISLKQAEKNWADYEALPRDPRQSTAAAAEIKRNYDIYHNALAELIQLLGAGKINEFFDQPTQGYQ DGFEKQYVAYMEQNDRLYDIAVSDNNASYSQAMWILVGVMIVVLAVIFAVWFGIKASLVAPMNRLIDSIRHIAGGDLVKP IEVDGSNEMGQLAESLRHMQGELMRTVGDVRNGANAIYSGASEIATGNNDLSSRTEQQAASLEETAASMEQLTATVKQNA ENARQASHLALSASETAQRGGKVVDNVVQTMRDISTSSQKIADIISVIDGITFQTNILALNAAVEAARAGEQGRGFAVVA GEVRNLAQRSAQAVREIKSLIEDSVGKVDVGSTLVESAGETMAEIVSAVTRVTDIMGEIASASDEQSRGIDQVGLAVAEM DRVTQQNAALVEESAAAAAALEEQASRLTEAVAVFRIQQQQQQQRETSAVVKTVTPATPRKMAVADSGENWETF
>Mature_554_residues MLKRIKIVTSLLLVLAVFGLLQLTSGGLFFNALKNDKENFTVLQTIRQQQPTLNGSWVALLQTRNTLNRAGIRYMMDQNN IGSGSTVAELMQSASISLKQAEKNWADYEALPRDPRQSTAAAAEIKRNYDIYHNALAELIQLLGAGKINEFFDQPTQGYQ DGFEKQYVAYMEQNDRLYDIAVSDNNASYSQAMWILVGVMIVVLAVIFAVWFGIKASLVAPMNRLIDSIRHIAGGDLVKP IEVDGSNEMGQLAESLRHMQGELMRTVGDVRNGANAIYSGASEIATGNNDLSSRTEQQAASLEETAASMEQLTATVKQNA ENARQASHLALSASETAQRGGKVVDNVVQTMRDISTSSQKIADIISVIDGITFQTNILALNAAVEAARAGEQGRGFAVVA GEVRNLAQRSAQAVREIKSLIEDSVGKVDVGSTLVESAGETMAEIVSAVTRVTDIMGEIASASDEQSRGIDQVGLAVAEM DRVTQQNAALVEESAAAAAALEEQASRLTEAVAVFRIQQQQQQQRETSAVVKTVTPATPRKMAVADSGENWETF
Specific function: Chemotactic-signal transducers respond to changes in the concentration of attractants and repellents in the environment, transduce a signal from the outside to the inside of the cell, and facilitate sensory adaptation through the variation of the level of
COG id: COG0840
COG function: function code NT; Methyl-accepting chemotaxis protein
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 methyl-accepting transducer domain [H]
Homologues:
Organism=Escherichia coli, GI2367378, Length=554, Percent_Identity=98.1949458483755, Blast_Score=1052, Evalue=0.0
Organism=Escherichia coli, GI1788195, Length=556, Percent_Identity=57.9136690647482, Blast_Score=570, Evalue=1e-163
Organism=Escherichia coli, GI1788194, Length=531, Percent_Identity=46.3276836158192, Blast_Score=429, Evalue=1e-121
Organism=Escherichia coli, GI1787690, Length=536, Percent_Identity=42.7238805970149, Blast_Score=359, Evalue=1e-100
Organism=Escherichia coli, GI1789453, Length=323, Percent_Identity=39.6284829721362, Blast_Score=221, Evalue=9e-59
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR004090
- InterPro: IPR003122
- InterPro: IPR004091
- InterPro: IPR004089
- InterPro: IPR003660 [H]
Pfam domain/function: PF00672 HAMP; PF00015 MCPsignal; PF02203 TarH [H]
EC number: NA
Molecular weight: Translated: 59879; Mature: 59879
Theoretical pI: Translated: 4.63; Mature: 4.63
Prosite motif: PS50885 HAMP ; PS00538 CHEMOTAXIS_TRANSDUC_1 ; PS50111 CHEMOTAXIS_TRANSDUC_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.0 %Cys (Translated Protein)
3.1 %Met (Translated Protein)
3.1 %Cys+Met (Translated Protein)
0.0 %Cys (Mature Protein)
3.1 %Met (Mature Protein)
3.1 %Cys+Met (Mature Protein)
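The percentages above are residue counts divided by protein length. A minimal sketch, assuming one-letter amino acid codes and rounding to one decimal place as in this record; `residue_percent` is a hypothetical helper name:

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`.

    Whitespace is ignored so that space-separated sequence chunks
    can be passed directly.
    """
    seq = "".join(protein.split()).upper()
    if not seq:
        return 0.0
    hits = sum(1 for aa in seq if aa in residues.upper())
    return round(100.0 * hits / len(seq), 1)


if __name__ == "__main__":
    toy = "MCMA AAAA"  # toy sequence, not the tsr protein
    print(residue_percent(toy, "C"))   # %Cys
    print(residue_percent(toy, "M"))   # %Met
    print(residue_percent(toy, "CM"))  # %Cys+Met
```

Applied to the 554-residue protein sequence, `residue_percent(seq, "M")` and `residue_percent(seq, "C")` can be compared against the values listed here.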
Secondary structure:
>Translated Secondary Structure
MLKRIKIVTSLLLVLAVFGLLQLTSGGLFFNALKNDKENFTVLQTIRQQQPTLNGSWVAL
CCHHHHHHHHHHHHHHHHHHHHHHCCCCEEHHHHCCCCHHHHHHHHHHHCCCCCCCEEEE
LQTRNTLNRAGIRYMMDQNNIGSGSTVAELMQSASISLKQAEKNWADYEALPRDPRQSTA
EHHHHHHHHHCCEEEECCCCCCCCHHHHHHHHHHCCHHHHHHCCCCCCCCCCCCCCHHHH
AAAEIKRNYDIYHNALAELIQLLGAGKINEFFDQPTQGYQDGFEKQYVAYMEQNDRLYDI
HHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHCCCCCHHHHHHHHHHHHHHHCCCCEEEE
AVSDNNASYSQAMWILVGVMIVVLAVIFAVWFGIKASLVAPMNRLIDSIRHIAGGDLVKP
EEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC
IEVDGSNEMGQLAESLRHMQGELMRTVGDVRNGANAIYSGASEIATGNNDLSSRTEQQAA
EECCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHCCHHHHCCCCCCHHHHHHHHHH
SLEETAASMEQLTATVKQNAENARQASHLALSASETAQRGGKVVDNVVQTMRDISTSSQK
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHCCHHHHHHHHHHHHHCCHHHH
IADIISVIDGITFQTNILALNAAVEAARAGEQGRGFAVVAGEVRNLAQRSAQAVREIKSL
HHHHHHHHCCCCEEHHHHHHHHHHHHHHCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHH
IEDSVGKVDVGSTLVESAGETMAEIVSAVTRVTDIMGEIASASDEQSRGIDQVGLAVAEM
HHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHCCHHHHHHHHHHH
DRVTQQNAALVEESAAAAAALEEQASRLTEAVAVFRIQQQQQQQRETSAVVKTVTPATPR
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC
KMAVADSGENWETF
CEEECCCCCCCCCC
>Mature Secondary Structure
MLKRIKIVTSLLLVLAVFGLLQLTSGGLFFNALKNDKENFTVLQTIRQQQPTLNGSWVAL
CCHHHHHHHHHHHHHHHHHHHHHHCCCCEEHHHHCCCCHHHHHHHHHHHCCCCCCCEEEE
LQTRNTLNRAGIRYMMDQNNIGSGSTVAELMQSASISLKQAEKNWADYEALPRDPRQSTA
EHHHHHHHHHCCEEEECCCCCCCCHHHHHHHHHHCCHHHHHHCCCCCCCCCCCCCCHHHH
AAAEIKRNYDIYHNALAELIQLLGAGKINEFFDQPTQGYQDGFEKQYVAYMEQNDRLYDI
HHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHCCCCCHHHHHHHHHHHHHHHCCCCEEEE
AVSDNNASYSQAMWILVGVMIVVLAVIFAVWFGIKASLVAPMNRLIDSIRHIAGGDLVKP
EEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC
IEVDGSNEMGQLAESLRHMQGELMRTVGDVRNGANAIYSGASEIATGNNDLSSRTEQQAA
EECCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHCCHHHHCCCCCCHHHHHHHHHH
SLEETAASMEQLTATVKQNAENARQASHLALSASETAQRGGKVVDNVVQTMRDISTSSQK
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHCCHHHHHHHHHHHHHCCHHHH
IADIISVIDGITFQTNILALNAAVEAARAGEQGRGFAVVAGEVRNLAQRSAQAVREIKSL
HHHHHHHHCCCCEEHHHHHHHHHHHHHHCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHH
IEDSVGKVDVGSTLVESAGETMAEIVSAVTRVTDIMGEIASASDEQSRGIDQVGLAVAEM
HHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHCCHHHHHHHHHHH
DRVTQQNAALVEESAAAAAALEEQASRLTEAVAVFRIQQQQQQQRETSAVVKTVTPATPR
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC
KMAVADSGENWETF
CEEECCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 6402709; 7610040; 9278503; 8384293; 6213619; 2033064; 10466731 [H]