Definition Shewanella baltica OS195 chromosome, complete genome.
Accession NC_009997
Length 5,347,283


The map label for this gene is trpI [H]

Identifier: 160877066

GI number: 160877066

Start: 4666525

End: 4667391

Strand: Direct

Name: trpI [H]

Synonym: Sbal195_3962

Alternate gene names: 160877066

Gene position: 4666525-4667391 (Clockwise)

Preceding gene: 160877055

Following gene: 160877074

Centisome position: 87.27
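The centisome value above appears to be the gene start expressed as a percentage of the chromosome length. The following is a minimal sketch under that assumption; the function name `centisome` is illustrative and not part of the record.

```python
def centisome(start: int, genome_length: int) -> float:
    """Position of a gene start as a percentage of the chromosome length.

    Assumption: the record derives the value from the Start coordinate
    and the chromosome Length field; the source does not state the formula.
    """
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    return 100.0 * start / genome_length


# Values taken from the Start and Length fields of this record.
position = centisome(4666525, 5347283)
print(round(position, 2))  # 87.27
```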

GC content: 41.29%
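GC content is conventionally the percentage of G and C bases in the sequence. A small sketch of that calculation follows; the helper name `gc_content` is hypothetical, and whether the record was computed in exactly this way is an assumption.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Drop line breaks and spaces so multi-line FASTA bodies can be pasted in.
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Toy input: the first nine bases of the gene sequence in this record.
print(round(gc_content("ATGAGACAT"), 2))  # 33.33
```

Passing the full 867-base gene sequence from this record to the same function would be the way to check the reported figure.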

Gene sequence:

>867_bases
ATGAGACATTTAAAAGCATTCTATGTATTTCATGTTACGGCAGAGTCATCGAATTACAGTAAAGCAGCAGAGAAACTTCA
TATCACTCACGGAGCAGTAAGTAAGCAAATTAAATTGCTGGAGAGTCATCTATCGCAGATGCTGTTTTACAAGCAAGGCA
GAGGAATGTGTCTAACGCAGGAAGGTGAATTGCTCAAGCAGTACACTGACATTGCTTTTAATGCTTTAGATAATGGAATT
AAGAAGTTAACAAGGGAAAGTAACAAACATCTCGAAGTTTCCTGTGAACCAACATTAACGATGCGCTGGCTGATGCCCCG
ACTGGCCAGTTTTTTTCGGCTTACGGGTATTGATGTTCGGCTATCAACCGCTGGAGGACCTGTAGATTTAAATGAAACAG
GATTATCTTTAGCCATTCGGCGCGATGACTTCGATATTAGCCCTGAATATACGGCGCACAACTTAGTCAATGAGTGGGTT
GGTCCTGTTTTTTCCCCTGAATATTGGAGAAAATCTCAGAACAACTTAGGTGATATTGTTCTTCTTCATAGTGAAACGAG
ACCTTCTGCATGGTCAGATTGGTCATCACGCTCAACACATACCTTTTATACGAACAAGTCCAAATCCTTTGAGCATTTCT
ATTTCTGCTTACAGGCAGCAGTTGACGGTTTAGGGGCTGCGATTGGATCATATCCTCTTATAGTAGATGACTTGAAAAGT
GGGAGACTGGTCGCTCCATTTGGATTTACTCAATCCGGACATAAGTACTTACTGCTCAGTCATACAGAAAGACTGGGAAG
CAATGAATTAGAGTTCTTAACGTGGTTAGAAAAAACCATGTCTTTATGCCAACCAGAAACTCCTTAG

Upstream 100 bases:

>100_bases
TTTTATCTCCTCACTTTTCATAAGTTTGAGATAATTGTATAGATTTATTGATGAGTTATAATCGAAAAAAAGGCATCCTA
AATGTGAGAAAAAGTCACAG

Downstream 100 bases:

>100_bases
TTCAGATTTTAGGTATGGAACGCTAACTTAGAAATATCCCAAGCCGTTCAAATCTCACATCACCTTAATAACTGATTTTT
TACCCAGCGAGCAGACAGAC

Product: LysR family transcriptional regulator

Products: NA

Alternate protein names: TrpBA operon transcriptional activator [H]

Number of amino acids: Translated: 288; Mature: 288
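The residue count follows from the gene coordinates: the inclusive span from Start to End gives the gene length in bases, and dividing by three and discarding the stop codon gives the number of amino acids. A minimal sketch, with illustrative function names:

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given inclusive 1-based coordinates."""
    return end - start + 1


def residue_count(n_bases: int) -> int:
    """Number of amino acids encoded, assuming the final codon is a stop."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return n_bases // 3 - 1


# Coordinates taken from the Start and End fields of this record.
length = gene_length(4666525, 4667391)
print(length, residue_count(length))  # 867 288
```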

Protein sequence:

>288_residues
MRHLKAFYVFHVTAESSNYSKAAEKLHITHGAVSKQIKLLESHLSQMLFYKQGRGMCLTQEGELLKQYTDIAFNALDNGI
KKLTRESNKHLEVSCEPTLTMRWLMPRLASFFRLTGIDVRLSTAGGPVDLNETGLSLAIRRDDFDISPEYTAHNLVNEWV
GPVFSPEYWRKSQNNLGDIVLLHSETRPSAWSDWSSRSTHTFYTNKSKSFEHFYFCLQAAVDGLGAAIGSYPLIVDDLKS
GRLVAPFGFTQSGHKYLLLSHTERLGSNELEFLTWLEKTMSLCQPETP
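The protein sequence is the codon-by-codon translation of the gene sequence. The sketch below assumes the standard codon assignments (which the bacterial translation table shares) and uses a hypothetical helper named `translate`; it is an illustration, not the pipeline that produced this record.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First five codons of the gene sequence in this record, followed by its stop codon.
print(translate("ATGAGACATTTAAAATAG"))  # MRHLK
```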

Sequences:

>Translated_288_residues
MRHLKAFYVFHVTAESSNYSKAAEKLHITHGAVSKQIKLLESHLSQMLFYKQGRGMCLTQEGELLKQYTDIAFNALDNGI
KKLTRESNKHLEVSCEPTLTMRWLMPRLASFFRLTGIDVRLSTAGGPVDLNETGLSLAIRRDDFDISPEYTAHNLVNEWV
GPVFSPEYWRKSQNNLGDIVLLHSETRPSAWSDWSSRSTHTFYTNKSKSFEHFYFCLQAAVDGLGAAIGSYPLIVDDLKS
GRLVAPFGFTQSGHKYLLLSHTERLGSNELEFLTWLEKTMSLCQPETP
>Mature_288_residues
MRHLKAFYVFHVTAESSNYSKAAEKLHITHGAVSKQIKLLESHLSQMLFYKQGRGMCLTQEGELLKQYTDIAFNALDNGI
KKLTRESNKHLEVSCEPTLTMRWLMPRLASFFRLTGIDVRLSTAGGPVDLNETGLSLAIRRDDFDISPEYTAHNLVNEWV
GPVFSPEYWRKSQNNLGDIVLLHSETRPSAWSDWSSRSTHTFYTNKSKSFEHFYFCLQAAVDGLGAAIGSYPLIVDDLKS
GRLVAPFGFTQSGHKYLLLSHTERLGSNELEFLTWLEKTMSLCQPETP

Specific function: Activates expression of the trpBA genes, which encode the two tryptophan synthase subunits. In the absence of the inducer (indoleglycerol phosphate), TrpI binds upstream of the trpBA operon, overlapping its own promoter region. [H]

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HTH lysR-type DNA-binding domain [H]

Homologues:

Organism=Escherichia coli, GI1789173, Length=280, Percent_Identity=32.1428571428571, Blast_Score=112, Evalue=2e-26
Organism=Escherichia coli, GI1786448, Length=287, Percent_Identity=28.5714285714286, Blast_Score=112, Evalue=3e-26
Organism=Escherichia coli, GI1788706, Length=292, Percent_Identity=28.7671232876712, Blast_Score=104, Evalue=7e-24
Organism=Escherichia coli, GI157672245, Length=221, Percent_Identity=26.6968325791855, Blast_Score=69, Evalue=4e-13
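Each homologue line is a comma-separated list of key=value fields plus a bare GI identifier. A minimal parsing sketch follows; the function name `parse_homologue` and the choice to store the identifier under a "GI" key are assumptions made for illustration.

```python
def parse_homologue(line: str) -> dict:
    """Split one homologue line into a dictionary of string fields."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:
            continue  # tolerate a trailing comma
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare identifier such as GI1789173
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1789173, Length=280, "
    "Percent_Identity=32.1428571428571, Blast_Score=112, Evalue=2e-26"
)
print(hit["Organism"], hit["GI"], round(float(hit["Percent_Identity"]), 1))
# Escherichia coli 1789173 32.1
```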

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000847
- InterPro:   IPR005119
- InterPro:   IPR011991 [H]

Pfam domain/function: PF00126 HTH_1; PF03466 LysR_substrate [H]

EC number: NA

Molecular weight (Da): Translated: 32671; Mature: 32671

Theoretical pI: Translated: 7.39; Mature: 7.39

Prosite motif: PS50931 HTH_LYSR

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.4 %Cys     (Translated Protein)
2.1 %Met     (Translated Protein)
3.5 %Cys+Met (Translated Protein)
1.4 %Cys     (Mature Protein)
2.1 %Met     (Mature Protein)
3.5 %Cys+Met (Mature Protein)
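The percentages above can be reproduced by counting C and M residues in the 288-residue protein sequence from this record. A minimal sketch, with the hypothetical helper `residue_percent`:

```python
# Protein sequence copied from the "Protein sequence" field of this record.
PROTEIN = (
    "MRHLKAFYVFHVTAESSNYSKAAEKLHITHGAVSKQIKLLESHLSQMLFYKQGRGMCLTQEGELLKQYTDIAFNALDNGI"
    "KKLTRESNKHLEVSCEPTLTMRWLMPRLASFFRLTGIDVRLSTAGGPVDLNETGLSLAIRRDDFDISPEYTAHNLVNEWV"
    "GPVFSPEYWRKSQNNLGDIVLLHSETRPSAWSDWSSRSTHTFYTNKSKSFEHFYFCLQAAVDGLGAAIGSYPLIVDDLKS"
    "GRLVAPFGFTQSGHKYLLLSHTERLGSNELEFLTWLEKTMSLCQPETP"
)


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the sequence made up of any of the given residues."""
    count = sum(protein.count(r) for r in residues)
    return 100.0 * count / len(protein)


print(round(residue_percent(PROTEIN, "C"), 1))   # 1.4
print(round(residue_percent(PROTEIN, "M"), 1))   # 2.1
print(round(residue_percent(PROTEIN, "CM"), 1))  # 3.5
```

The translated and mature proteins are identical in this record, so both sets of figures come from the same counts (4 Cys and 6 Met in 288 residues).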

Secondary structure:

>Translated Secondary Structure
MRHLKAFYVFHVTAESSNYSKAAEKLHITHGAVSKQIKLLESHLSQMLFYKQGRGMCLTQ
CCCEEEEEEEEEEECCCCHHHHHHHEEHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEC
EGELLKQYTDIAFNALDNGIKKLTRESNKHLEVSCEPTLTMRWLMPRLASFFRLTGIDVR
CHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHHCCCEEE
LSTAGGPVDLNETGLSLAIRRDDFDISPEYTAHNLVNEWVGPVFSPEYWRKSQNNLGDIV
EECCCCCCCCCCCCCEEEEECCCCCCCCCHHHHHHHHHHHCCCCCHHHHHHCCCCCCCEE
LLHSETRPSAWSDWSSRSTHTFYTNKSKSFEHFYFCLQAAVDGLGAAIGSYPLIVDDLKS
EEECCCCCCCCCCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEECCCC
GRLVAPFGFTQSGHKYLLLSHTERLGSNELEFLTWLEKTMSLCQPETP
CCEEECCCCCCCCCEEEEEEEHHHCCCCHHHHHHHHHHHHHHCCCCCC
>Mature Secondary Structure
MRHLKAFYVFHVTAESSNYSKAAEKLHITHGAVSKQIKLLESHLSQMLFYKQGRGMCLTQ
CCCEEEEEEEEEEECCCCHHHHHHHEEHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEC
EGELLKQYTDIAFNALDNGIKKLTRESNKHLEVSCEPTLTMRWLMPRLASFFRLTGIDVR
CHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHHCCCEEE
LSTAGGPVDLNETGLSLAIRRDDFDISPEYTAHNLVNEWVGPVFSPEYWRKSQNNLGDIV
EECCCCCCCCCCCCCEEEEECCCCCCCCCHHHHHHHHHHHCCCCCHHHHHHCCCCCCCEE
LLHSETRPSAWSDWSSRSTHTFYTNKSKSFEHFYFCLQAAVDGLGAAIGSYPLIVDDLKS
EEECCCCCCCCCCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEECCCC
GRLVAPFGFTQSGHKYLLLSHTERLGSNELEFLTWLEKTMSLCQPETP
CCEEECCCCCCCCCEEEEEEEHHHCCCCHHHHHHHHHHHHHHCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 8423001 [H]