Definition Escherichia coli 55989, complete genome.
Accession NC_011748
Length 5,154,862

The map label for this gene is lysR

Identifier: 218696435

GI number: 218696435

Start: 3196173

End: 3197108

Strand: Direct

Name: lysR

Synonym: EC55989_3116

Alternate gene names: 218696435

Gene position: 3196173-3197108 (Clockwise)

Preceding gene: 218696433

Following gene: 218696441

Centisome position: 62.0
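
The coordinate fields above are related by simple arithmetic, sketched below. The record does not say how the centisome position is derived; this sketch assumes it is the start coordinate expressed as a percentage of the genome length, which agrees with the reported 62.0. The function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 5_154_862
START, END = 3_196_173, 3_197_108


def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given inclusive, 1-based coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length (assumed definition)."""
    return 100.0 * position / genome_length


length_bp = gene_length(START, END)
# One codon is the stop codon, so it does not contribute an amino acid.
residues = length_bp // 3 - 1

print("gene length (bases):", length_bp)
print("encoded residues:", residues)
print("centisome position:", round(centisome(START, GENOME_LENGTH), 1))
```

The gene length is a multiple of three, as expected for a complete coding sequence that ends in a stop codon.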

GC content: 53.53

Gene sequence:

>936_bases
ATGGCCGCCGTTAACTTACGTCATATTGAAATTTTTCATGCGGTAATGACCGCCGGAAGCCTGACTGAGGCAGCACACCT
GCTACACACCTCACAGCCAACCGTCAGCCGCGAACTGGCGCGCTTTGAGAAGGTGATCGGGCTAAAATTGTTTGAGCGCG
TACGTGGACGATTACATCCTACCGTGCAAGGACTGCGTCTGTTTGAAGAAGTGCAACGATCCTGGTACGGACTGGATCGC
ATTGTCAGCGCCGCAGAAAGTCTGCGCGAGTTTCGCCAGGGAGAACTGTCTATTGCCTGCCTGCCGGTCTTTTCGCAATC
TTTTTTACCGCAGCTCCTGCAACCCTTTCTGGCACGTTATCCCGATGTCAGCTTAAATATCGTGCCCCAGGAATCACCGC
TACTTGAAGAGTGGCTCTCGGCCCAGCGTCATGATTTAGGACTCACTGAGACACTCCATACGCCTGCGGGAACAGAACGT
ACCGAATTACTCTCTTTAGATGAAGTGTGTGTGTTACCTCCGGGTCATCCGCTGGCGGTAAAAAAGGTATTAACGCCGGA
TGATTTTCAGGGTGAGAACTACATCAGCCTTTCCCGTACTGACAGCTATCGCCAGTTGCTGGATCAGCTATTTACTGAAC
ATCAGGTTAAACGACGCATGATCGTAGAAACCCACAGCGCCGCGTCAGTCTGCGCAATGGTACGGGCGGGGGTAGGTGTT
TCGGTGGTTAACCCGCTCACCGCACTGGATTATGCGGCAAGCGGTTTAGTGGTGCGGCGGTTCAGTATTGCGGTTCCGTT
CACCGTCAGCCTGATCCGCCCCCTGCACCGCCCGTCATCAGCGCTGGTGCAGGCGTTTAGTGAGCATTTACAAGCGGGGT
TACCGAAACTGGTCACTTCTCTTGACGCGATTTTGTCGTCAGCTACGACAGCATAA
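
A GC content figure such as the one reported above can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over the whole sequence; `gc_content` is a hypothetical helper, not part of any tool named in this record.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks (as in a wrapped FASTA body) are ignored,
    and the comparison is case-insensitive.
    """
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return 100.0 * gc / len(seq)


# Usage: paste the wrapped sequence lines between the triple quotes.
example = """
ATGGCCGCCGTTAACTTACGT
CATATTGAAATTTTTCATGCG
"""
print(round(gc_content(example), 2))
```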

Upstream 100 bases:

>100_bases
TTTTTATGATTACGCCACATCATAAAAAGAATAAAAAATATCGATTTATGTCGAGTCTATGCAAAAATGATATGGATTAC
CGGATTGCGAGAGAGCGCTA

Downstream 100 bases:

>100_bases
AAGCGACAGCATCCTCGGCATGGATCGCCGCGGTATCAAACACAGGCAAAACACTGCGCTCTTCTGGCACCAGTAAACCA
ATTTCTGTGCAGCCAAAAAT

Product: DNA-binding transcriptional regulator LysR

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 311; Mature: 310

Protein sequence:

>311_residues
MAAVNLRHIEIFHAVMTAGSLTEAAHLLHTSQPTVSRELARFEKVIGLKLFERVRGRLHPTVQGLRLFEEVQRSWYGLDR
IVSAAESLREFRQGELSIACLPVFSQSFLPQLLQPFLARYPDVSLNIVPQESPLLEEWLSAQRHDLGLTETLHTPAGTER
TELLSLDEVCVLPPGHPLAVKKVLTPDDFQGENYISLSRTDSYRQLLDQLFTEHQVKRRMIVETHSAASVCAMVRAGVGV
SVVNPLTALDYAASGLVVRRFSIAVPFTVSLIRPLHRPSSALVQAFSEHLQAGLPKLVTSLDAILSSATTA

Sequences:

>Translated_311_residues
MAAVNLRHIEIFHAVMTAGSLTEAAHLLHTSQPTVSRELARFEKVIGLKLFERVRGRLHPTVQGLRLFEEVQRSWYGLDR
IVSAAESLREFRQGELSIACLPVFSQSFLPQLLQPFLARYPDVSLNIVPQESPLLEEWLSAQRHDLGLTETLHTPAGTER
TELLSLDEVCVLPPGHPLAVKKVLTPDDFQGENYISLSRTDSYRQLLDQLFTEHQVKRRMIVETHSAASVCAMVRAGVGV
SVVNPLTALDYAASGLVVRRFSIAVPFTVSLIRPLHRPSSALVQAFSEHLQAGLPKLVTSLDAILSSATTA
>Mature_310_residues
AAVNLRHIEIFHAVMTAGSLTEAAHLLHTSQPTVSRELARFEKVIGLKLFERVRGRLHPTVQGLRLFEEVQRSWYGLDRI
VSAAESLREFRQGELSIACLPVFSQSFLPQLLQPFLARYPDVSLNIVPQESPLLEEWLSAQRHDLGLTETLHTPAGTERT
ELLSLDEVCVLPPGHPLAVKKVLTPDDFQGENYISLSRTDSYRQLLDQLFTEHQVKRRMIVETHSAASVCAMVRAGVGVS
VVNPLTALDYAASGLVVRRFSIAVPFTVSLIRPLHRPSSALVQAFSEHLQAGLPKLVTSLDAILSSATTA
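
In this record the mature sequence is the translated sequence without its first residue, the initiator methionine. A minimal sketch of that step follows, assuming that N-terminal Met removal is the only processing applied; `remove_initiator_met` is a hypothetical name.

```python
def remove_initiator_met(protein: str) -> str:
    """Drop a leading methionine, if present, from a protein sequence."""
    return protein[1:] if protein.startswith("M") else protein


# First 20 residues of the translated sequence, copied from the record.
translated_prefix = "MAAVNLRHIEIFHAVMTAGS"
mature_prefix = remove_initiator_met(translated_prefix)

print(translated_prefix)
print(mature_prefix)
```

Applied to the full 311-residue translation, the same step gives the 310-residue mature sequence listed above.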

Specific function: This protein activates the transcription of the lysA gene encoding diaminopimelate decarboxylase. LysR is also a negative regulator of its own expression [H]

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HTH lysR-type DNA-binding domain [H]

Homologues:

Organism=Escherichia coli, GI1789204, Length=311, Percent_Identity=99.3569131832797, Blast_Score=623, Evalue=1e-180,
Organism=Escherichia coli, GI157672245, Length=230, Percent_Identity=28.2608695652174, Blast_Score=92, Evalue=6e-20,
Organism=Escherichia coli, GI145693105, Length=268, Percent_Identity=23.8805970149254, Blast_Score=88, Evalue=6e-19,
Organism=Escherichia coli, GI1788887, Length=275, Percent_Identity=27.2727272727273, Blast_Score=77, Evalue=2e-15,
Organism=Escherichia coli, GI1787530, Length=262, Percent_Identity=26.3358778625954, Blast_Score=73, Evalue=2e-14,
Organism=Escherichia coli, GI1788297, Length=245, Percent_Identity=25.3061224489796, Blast_Score=73, Evalue=2e-14,
Organism=Escherichia coli, GI1788481, Length=304, Percent_Identity=25.9868421052632, Blast_Score=72, Evalue=5e-14,
Organism=Escherichia coli, GI1790262, Length=264, Percent_Identity=25.3787878787879, Blast_Score=69, Evalue=3e-13,
Organism=Escherichia coli, GI1787879, Length=251, Percent_Identity=26.6932270916335, Blast_Score=65, Evalue=7e-12,
Organism=Escherichia coli, GI87081904, Length=185, Percent_Identity=28.6486486486486, Blast_Score=64, Evalue=9e-12,
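
The homologue lines above mix `key=value` fields with a bare `GI…` identifier and end in a trailing comma. The sketch below shows one way to parse them into a dictionary; the function name and the choice to store the identifier under a `GI` key are assumptions made for illustration.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line from this record into a dictionary."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key.strip()] = value.strip()
        elif field.startswith("GI"):
            # Bare identifier such as "GI1789204": keep the digits only.
            record["GI"] = field[2:]
    # Convert the numeric fields; missing keys are simply skipped.
    casts = (("Length", int), ("Percent_Identity", float),
             ("Blast_Score", int), ("Evalue", float))
    for key, cast in casts:
        if key in record:
            record[key] = cast(record[key])
    return record


line = ("Organism=Escherichia coli, GI1789204, Length=311, "
        "Percent_Identity=99.3569131832797, Blast_Score=623, Evalue=1e-180,")
hit = parse_homologue(line)
print(hit["GI"], hit["Length"], round(hit["Percent_Identity"], 2))
```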

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000847
- InterPro:   IPR005119
- InterPro:   IPR011991 [H]

Pfam domain/function: PF00126 HTH_1; PF03466 LysR_substrate [H]

EC number: NA

Molecular weight: Translated: 34423; Mature: 34292

Theoretical pI: Translated: 7.16; Mature: 7.16

Prosite motif: PS50931 HTH_LYSR

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.0 %Cys     (Translated Protein)
1.3 %Met     (Translated Protein)
2.3 %Cys+Met (Translated Protein)
1.0 %Cys     (Mature Protein)
1.0 %Met     (Mature Protein)
1.9 %Cys+Met (Mature Protein)
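
The percentages above can be recomputed by counting residues in the protein sequence. The sketch below uses the translated sequence copied from this record and assumes the mature form is that sequence minus its first residue; `percent` is a hypothetical helper. With three Cys and four Met in 311 residues, it should reproduce the values listed above after rounding to one decimal place.

```python
# Translated protein sequence, copied from the record.
TRANSLATED = (
    "MAAVNLRHIEIFHAVMTAGSLTEAAHLLHTSQPTVSRELARFEKVIGLKLFERVRGRLHPTVQGLRLFEEVQRSWYGLDR"
    "IVSAAESLREFRQGELSIACLPVFSQSFLPQLLQPFLARYPDVSLNIVPQESPLLEEWLSAQRHDLGLTETLHTPAGTER"
    "TELLSLDEVCVLPPGHPLAVKKVLTPDDFQGENYISLSRTDSYRQLLDQLFTEHQVKRRMIVETHSAASVCAMVRAGVGV"
    "SVVNPLTALDYAASGLVVRRFSIAVPFTVSLIRPLHRPSSALVQAFSEHLQAGLPKLVTSLDAILSSATTA"
)
MATURE = TRANSLATED[1:]  # assumed: initiator Met removed


def percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    return 100.0 * sum(seq.count(r) for r in residues) / len(seq)


for label, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
    for name, residues in (("Cys", "C"), ("Met", "M"), ("Cys+Met", "CM")):
        print(f"{percent(seq, residues):.1f} %{name} ({label} Protein)")
```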

Secondary structure:

>Translated Secondary Structure
MAAVNLRHIEIFHAVMTAGSLTEAAHLLHTSQPTVSRELARFEKVIGLKLFERVRGRLHP
CCCCCHHHHHHHHHHHHHCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCH
TVQGLRLFEEVQRSWYGLDRIVSAAESLREFRQGELSIACLPVFSQSFLPQLLQPFLARY
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEHHHHHHHHHHHHHHHHHHC
PDVSLNIVPQESPLLEEWLSAQRHDLGLTETLHTPAGTERTELLSLDEVCVLPPGHPLAV
CCCEEEECCCCCHHHHHHHHHHHHCCCCHHHHCCCCCCCHHHHCCCCCEEECCCCCCHHH
KKVLTPDDFQGENYISLSRTDSYRQLLDQLFTEHQVKRRMIVETHSAASVCAMVRAGVGV
HCCCCCCCCCCCCEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCH
SVVNPLTALDYAASGLVVRRFSIAVPFTVSLIRPLHRPSSALVQAFSEHLQAGLPKLVTS
HHHHHHHHHHHHHCCHHHHHHHHHCHHHHHHHHHHCCCHHHHHHHHHHHHHCCHHHHHHH
LDAILSSATTA
HHHHHHHCCCC
>Mature Secondary Structure 
AAVNLRHIEIFHAVMTAGSLTEAAHLLHTSQPTVSRELARFEKVIGLKLFERVRGRLHP
CCCCHHHHHHHHHHHHHCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCH
TVQGLRLFEEVQRSWYGLDRIVSAAESLREFRQGELSIACLPVFSQSFLPQLLQPFLARY
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEHHHHHHHHHHHHHHHHHHC
PDVSLNIVPQESPLLEEWLSAQRHDLGLTETLHTPAGTERTELLSLDEVCVLPPGHPLAV
CCCEEEECCCCCHHHHHHHHHHHHCCCCHHHHCCCCCCCHHHHCCCCCEEECCCCCCHHH
KKVLTPDDFQGENYISLSRTDSYRQLLDQLFTEHQVKRRMIVETHSAASVCAMVRAGVGV
HCCCCCCCCCCCCEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCH
SVVNPLTALDYAASGLVVRRFSIAVPFTVSLIRPLHRPSSALVQAFSEHLQAGLPKLVTS
HHHHHHHHHHHHHCCHHHHHHHHHCHHHHHHHHHHCCCHHHHHHHHHHHHHCCHHHHHHH
LDAILSSATTA
HHHHHHHCCCC
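
The state strings above can be summarised as a percentage per state. The record does not define the letters; this sketch assumes the common three-state convention in which H is helix, E is strand and C is coil. `ss_composition` is a hypothetical helper.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Return the percentage of H, E and C states in a prediction string.

    Whitespace is ignored so that wrapped lines can be pasted directly.
    """
    states = "".join(states.split())
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {s: 100.0 * counts.get(s, 0) / len(states) for s in "HEC"}


# Usage with the last state line of the record.
print(ss_composition("HHHHHHHCCCC"))
```

To summarise the whole prediction, join only the state lines (not the interleaved residue lines) before calling the function.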

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 6350602; 9278503; 2836407 [H]