Definition: Salmonella enterica subsp. enterica serovar Typhimurium str. LT2 chromosome, complete genome.
Accession: NC_003197
Length: 4,857,432 bp

The map label for this gene is yqiR [H]

Identifier: 16764029

GI number: 16764029

Start: 714545

End: 716473

Strand: Direct

Name: yqiR [H]

Synonym: STM0652

Alternate gene names: 16764029

Gene position: 714545-716473 (Clockwise)

Preceding gene: 16764028

Following gene: 16764030

Centisome position: 14.71
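
The coordinate fields above can be cross-checked against each other. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive, and assuming the centisome position is the start coordinate expressed as a percentage of the chromosome length (the record does not define it). The function names are illustrative and not part of any database API.

```python
# Values copied from this record.
GENOME_LENGTH = 4_857_432      # bp, accession NC_003197
START, END = 714_545, 716_473  # assumed 1-based, inclusive


def gene_length(start: int, end: int) -> int:
    """Length in bases of a 1-based, inclusive interval."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the chromosome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(START, END))                    # compare with the 1929-base gene sequence
print(round(centisome(START, GENOME_LENGTH), 2))  # compare with the Centisome position field
```

Under these assumptions the two printed values agree with the sequence length and the centisome position reported in this record.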

GC content: 50.08%
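
A GC content figure like the one above can be recomputed from the gene sequence. The sketch below is one plausible way to do it, assuming the value is the percentage of G and C among the A/C/G/T bases; the record does not say how ambiguous bases were handled, and `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Case-insensitive. Characters other than A, C, G, T (for example line
    breaks from a FASTA block) are ignored, which is an assumption.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Toy inputs; for this record, pass the 1929-base gene sequence instead.
print(gc_content("ATGC"))
print(gc_content("gg\ncc"))
```

Because line breaks are skipped, the sequence block can be pasted in as-is without joining its lines first.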

Gene sequence:

>1929_bases
ATGAATAAGACGAAAGATATAGCCGCATCTCCTCTCTGTTTTGTCTCTCCTTACCCTCAACTGGCAAAGGCTGCCGAGGC
GCTGGTCGCGCAGTTGGATTACGCCGTCACTATTCATCAAACGACGCTTAATCGTATCCTGGATGAGCTACCTTTATTAG
AGTCCCGTGGGCACCAGGTACTGATTAGTCGTGGCGGATGTGCGGAAATATTAAAAAAGCACAGTAAATTACCGGTAGTC
GAAATTAAAATGTCCGGCTACGACATTCTTGATGCGCTTATCCCTTTTAAAGGACAAAAAGGCACTGTCGGTATTGTCGG
CTTTTCCAGTGTGATCAAAGGATGCGCGCGCGTAGCGGAACAGTTAAATATTAACTATAAAATTTTTACCTTACAGGGAA
ATGATAAAGAAACGATTTCTTGCCTGAAGCGGCAATTAGCGTCCACGCCATTAGATTGCATTGTTGGCGATACCGTTTGT
CAGGATTATTTTTCACCGCTGGGCTCGCAATTCCGTTTACTTGATTCCAGCCCCGCCTCAATAACCGAAGCTCTGGAAGA
AGCCCGCTCATTATATCTGGCTTTTCGCAGCCAATTACTCGAGCGTCATCATCTGCAGCTGATTCTCGATCAGTTTGATA
AAGCTGTTATCACGCTTGATGATACCGGCGCGTTACTGCATTACAATAAATATGCGAGCCAACTTTTTAAAATTAACGCC
TCCGGTGAGATTTATGACGCATCTTTCCTGAAACAGGTATTGCACCAGGAGCGGCATACATTACGTGAGGGAAAAACCGT
CAGCGCGAAAGTCGTCGATACGCCGCAAGGCGCGATGGTAGTTAATCTGTATCCGGTATTTGCGGCCAGACAGTTAAGCC
GGGTAGTGTTGACGATGCAAACCGTCTCCAGTTTACAGGGGGCGGAACATCATGTTCGCCGCCAGGAACTGTCTCGCCGC
GGCTTGAGCGCCCGCTATCATTTCGACGATCTGCTTACCGAAAACCCGGAAATGCTGCGTCGTCTGGCGATCATTAAAAA
TTATGCCGGTACGGACGCGACTATCTTAATTAATGGCGAAAGCGGGACGGGAAAAGAGGTGCTGGCGCAAAGTATTCATA
ATGCCAGCCAACGCGTCAACGGCCCGTTTGTCGCCATTAACTGCGGCGCGATGGCCCCTCAGATTCTGGAGAGCGAACTC
TTTGGCTATGTCGCCGGCGCATTTACCGGCGCGTCGCCGAAAGGCAAAATAGGCCTGTTTGAATTAGCGCACCACGGTAC
CATTTTTCTGGATGAGATTAGCGAGCTGGATAAACCGCTACAGACGCGCTTATTACGGGTATTGCAGGAGCGGCAGATTA
TGCGGCTGGGCTCAGACCAGATGATACCTGTTGATATTCGCGTGATTGCGGCGACCAATCAGACGTTAACGAAGCTCATT
GCGGACGGGACATTTCGCGAAGATCTTTACTATCGGCTTAACGTATTAAAAGTGACCACCATCCCGCTACGCAAACGTCC
GGAGGATATCAAAGCCATCGGCCTGTCGCTGCTTACTAGTTTCAGCCAGCATTATAAACGCCCGGCACTAACGTTAACCC
CGGCGCTGTGGCAAGAGCTTCAGCGCTTCGCCTGGCCCGGAAACGTCAGGCAGTTAAGCAATATTATCGAACGGCTGGTA
CTCTCTATTGATCACTCCCCGGCAACGCTGGATGAGGGTCGCCTCTTGCTGGACGATCTGGAAGAGGGGAGCCGACGCGA
GCCAACTACCTGCCACGACTGCCAGATGCTGGCTGGCGATTATAAAACGATTCGCCTGCGTATTTTGAGAAAATTATTAG
AGGCGGAAAGGGATAATAAATCGTTAGTGGCGAAACGGTTAAATGTCGATCGTACCTCGCTGACGCGCTGGATACGCGAG
TCGGCCTGA

Upstream 100 bases:

>100_bases
CGTAAACTGAACCCAGCCGCCGCAGGCGGTTAGCGTTTCGGCGGAAAATAAATCACGACGGGCGGTCTCGCCCGTTCATA
AACCAACAGGATGGTGAGTG

Downstream 100 bases:

>100_bases
CAAGAATTTACAGACTCTGTGGGCAGCCTTGCAAAGCGTAACGCAAATAACGTCTATTATTATAGGCAGTTAACGATCCA
AGAGGTGAAGTGATGAACAA

Product: sigma-54 dependent transcriptional regulator

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 642; Mature: 642

Protein sequence:

>642_residues
MNKTKDIAASPLCFVSPYPQLAKAAEALVAQLDYAVTIHQTTLNRILDELPLLESRGHQVLISRGGCAEILKKHSKLPVV
EIKMSGYDILDALIPFKGQKGTVGIVGFSSVIKGCARVAEQLNINYKIFTLQGNDKETISCLKRQLASTPLDCIVGDTVC
QDYFSPLGSQFRLLDSSPASITEALEEARSLYLAFRSQLLERHHLQLILDQFDKAVITLDDTGALLHYNKYASQLFKINA
SGEIYDASFLKQVLHQERHTLREGKTVSAKVVDTPQGAMVVNLYPVFAARQLSRVVLTMQTVSSLQGAEHHVRRQELSRR
GLSARYHFDDLLTENPEMLRRLAIIKNYAGTDATILINGESGTGKEVLAQSIHNASQRVNGPFVAINCGAMAPQILESEL
FGYVAGAFTGASPKGKIGLFELAHHGTIFLDEISELDKPLQTRLLRVLQERQIMRLGSDQMIPVDIRVIAATNQTLTKLI
ADGTFREDLYYRLNVLKVTTIPLRKRPEDIKAIGLSLLTSFSQHYKRPALTLTPALWQELQRFAWPGNVRQLSNIIERLV
LSIDHSPATLDEGRLLLDDLEEGSRREPTTCHDCQMLAGDYKTIRLRILRKLLEAERDNKSLVAKRLNVDRTSLTRWIRE
SA
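
The protein sequence follows from the gene sequence by translating codons until the stop codon: 1929 bases are 643 codons, and the last one is a stop, which leaves 642 residues. The sketch below illustrates this in Python, assuming the standard genetic code and a direct-strand sequence read from the first base; `translate` is a hypothetical helper, not a library function.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 24 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGAATAAGACGAAAGATATAGCC"))
print(translate("TCGGCCTGA"))
```

The two fragments correspond to the start (MNKTKDIA) and the end (SA) of the protein sequence listed above.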

Sequences:

>Translated_642_residues
MNKTKDIAASPLCFVSPYPQLAKAAEALVAQLDYAVTIHQTTLNRILDELPLLESRGHQVLISRGGCAEILKKHSKLPVV
EIKMSGYDILDALIPFKGQKGTVGIVGFSSVIKGCARVAEQLNINYKIFTLQGNDKETISCLKRQLASTPLDCIVGDTVC
QDYFSPLGSQFRLLDSSPASITEALEEARSLYLAFRSQLLERHHLQLILDQFDKAVITLDDTGALLHYNKYASQLFKINA
SGEIYDASFLKQVLHQERHTLREGKTVSAKVVDTPQGAMVVNLYPVFAARQLSRVVLTMQTVSSLQGAEHHVRRQELSRR
GLSARYHFDDLLTENPEMLRRLAIIKNYAGTDATILINGESGTGKEVLAQSIHNASQRVNGPFVAINCGAMAPQILESEL
FGYVAGAFTGASPKGKIGLFELAHHGTIFLDEISELDKPLQTRLLRVLQERQIMRLGSDQMIPVDIRVIAATNQTLTKLI
ADGTFREDLYYRLNVLKVTTIPLRKRPEDIKAIGLSLLTSFSQHYKRPALTLTPALWQELQRFAWPGNVRQLSNIIERLV
LSIDHSPATLDEGRLLLDDLEEGSRREPTTCHDCQMLAGDYKTIRLRILRKLLEAERDNKSLVAKRLNVDRTSLTRWIRE
SA
>Mature_642_residues
MNKTKDIAASPLCFVSPYPQLAKAAEALVAQLDYAVTIHQTTLNRILDELPLLESRGHQVLISRGGCAEILKKHSKLPVV
EIKMSGYDILDALIPFKGQKGTVGIVGFSSVIKGCARVAEQLNINYKIFTLQGNDKETISCLKRQLASTPLDCIVGDTVC
QDYFSPLGSQFRLLDSSPASITEALEEARSLYLAFRSQLLERHHLQLILDQFDKAVITLDDTGALLHYNKYASQLFKINA
SGEIYDASFLKQVLHQERHTLREGKTVSAKVVDTPQGAMVVNLYPVFAARQLSRVVLTMQTVSSLQGAEHHVRRQELSRR
GLSARYHFDDLLTENPEMLRRLAIIKNYAGTDATILINGESGTGKEVLAQSIHNASQRVNGPFVAINCGAMAPQILESEL
FGYVAGAFTGASPKGKIGLFELAHHGTIFLDEISELDKPLQTRLLRVLQERQIMRLGSDQMIPVDIRVIAATNQTLTKLI
ADGTFREDLYYRLNVLKVTTIPLRKRPEDIKAIGLSLLTSFSQHYKRPALTLTPALWQELQRFAWPGNVRQLSNIIERLV
LSIDHSPATLDEGRLLLDDLEEGSRREPTTCHDCQMLAGDYKTIRLRILRKLLEAERDNKSLVAKRLNVDRTSLTRWIRE
SA

Specific function: Member of the two-component regulatory system AtoS/AtoC involved in the transcriptional regulation of the ato genes for acetoacetate metabolism. Also an inhibitor of polyamine biosynthesis. [C]

COG id: COG3829

COG function: function code KT; Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains

Gene ontology:

Cell location: Cytoplasmic [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 sigma-54 factor interaction domain [H]

Homologues:

Organism=Escherichia coli, GI1788550, Length=358, Percent_Identity=37.9888268156425, Blast_Score=238, Evalue=1e-63
Organism=Escherichia coli, GI1788905, Length=308, Percent_Identity=42.5324675324675, Blast_Score=230, Evalue=2e-61
Organism=Escherichia coli, GI1786524, Length=381, Percent_Identity=38.8451443569554, Blast_Score=230, Evalue=3e-61
Organism=Escherichia coli, GI1789233, Length=320, Percent_Identity=40, Blast_Score=222, Evalue=6e-59
Organism=Escherichia coli, GI87082117, Length=258, Percent_Identity=46.1240310077519, Blast_Score=219, Evalue=4e-58
Organism=Escherichia coli, GI1789087, Length=235, Percent_Identity=45.9574468085106, Blast_Score=212, Evalue=6e-56
Organism=Escherichia coli, GI1790437, Length=231, Percent_Identity=44.5887445887446, Blast_Score=205, Evalue=8e-54
Organism=Escherichia coli, GI1790299, Length=232, Percent_Identity=44.8275862068966, Blast_Score=195, Evalue=8e-51
Organism=Escherichia coli, GI1787583, Length=375, Percent_Identity=33.8666666666667, Blast_Score=187, Evalue=3e-48
Organism=Escherichia coli, GI87082152, Length=227, Percent_Identity=42.7312775330396, Blast_Score=181, Evalue=9e-47
Organism=Escherichia coli, GI87081872, Length=245, Percent_Identity=39.5918367346939, Blast_Score=170, Evalue=2e-43
Organism=Escherichia coli, GI87081858, Length=385, Percent_Identity=30.1298701298701, Blast_Score=154, Evalue=1e-38
Organism=Escherichia coli, GI1789828, Length=311, Percent_Identity=33.1189710610932, Blast_Score=136, Evalue=4e-33
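
The homologue lines above are comma-separated `key=value` pairs plus a bare GI identifier, so they are easy to load into a structured form for sorting or filtering. The Python sketch below assumes exactly that layout; `parse_homologue` and the `"GI"` key are hypothetical names chosen for illustration.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line of this record into a dictionary.

    Assumed layout: comma-separated fields, each either key=value or a
    bare 'GI<digits>' token; a trailing comma is tolerated.
    """
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]
    # Convert the numeric fields when present.
    for key in ("Length", "Blast_Score"):
        if key in record:
            record[key] = int(record[key])
    for key in ("Percent_Identity", "Evalue"):
        if key in record:
            record[key] = float(record[key])
    return record


line = ("Organism=Escherichia coli, GI1788550, Length=358, "
        "Percent_Identity=37.9888268156425, Blast_Score=238, Evalue=1e-63,")
hit = parse_homologue(line)
print(hit["GI"], hit["Length"], round(hit["Percent_Identity"], 1), hit["Evalue"])
```

With the lines parsed this way, hits can be ranked by `Blast_Score` or filtered by an `Evalue` threshold with ordinary list operations.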

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003593
- InterPro:   IPR020441
- InterPro:   IPR009057
- InterPro:   IPR012287
- InterPro:   IPR002197
- InterPro:   IPR016040
- InterPro:   IPR001610
- InterPro:   IPR000014
- InterPro:   IPR013767
- InterPro:   IPR002078 [H]

Pfam domain/function: PF02954 HTH_8; PF00989 PAS; PF00158 Sigma54_activat [H]

EC number: NA

Molecular weight: Translated: 71753 Da; Mature: 71753 Da

Theoretical pI: Translated: 8.71; Mature: 8.71

Prosite motif: PS00675 SIGMA54_INTERACT_1; PS00688 SIGMA54_INTERACT_3; PS50045 SIGMA54_INTERACT_4

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.4 %Cys     (Translated Protein)
1.4 %Met     (Translated Protein)
2.8 %Cys+Met (Translated Protein)
1.4 %Cys     (Mature Protein)
1.4 %Met     (Mature Protein)
2.8 %Cys+Met (Mature Protein)
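
The Cys/Met percentages above are residue counts divided by the protein length. A minimal Python sketch of that calculation follows, assuming one-letter amino acid codes and rounding to one decimal place as in this record; `residue_percent` is a hypothetical helper name.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the given one-letter residues in a protein sequence.

    Whitespace (for example line breaks from a FASTA block) is ignored.
    The result is rounded to one decimal place to match this record.
    """
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty protein sequence")
    wanted = set(residues.upper())
    count = sum(1 for aa in seq if aa in wanted)
    return round(100.0 * count / len(seq), 1)


# Toy sequence of 10 residues; for this record, pass the 642-residue sequence.
toy = "MCAAAAAAAA"
print(residue_percent(toy, "C"))
print(residue_percent(toy, "M"))
print(residue_percent(toy, "CM"))
```

For scale, 9 residues out of 642 round to 1.4%, and 18 out of 642 round to 2.8%.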

Secondary structure:

>Translated Secondary Structure
MNKTKDIAASPLCFVSPYPQLAKAAEALVAQLDYAVTIHQTTLNRILDELPLLESRGHQV
CCCCCCCCCCCEEEECCCHHHHHHHHHHHHHHHHEEEEHHHHHHHHHHHCCHHHCCCCEE
LISRGGCAEILKKHSKLPVVEIKMSGYDILDALIPFKGQKGTVGIVGFSSVIKGCARVAE
EEECCCHHHHHHHCCCCCEEEEEECCHHHHHHHHCCCCCCCCEEEEEHHHHHHHHHHHHH
QLNINYKIFTLQGNDKETISCLKRQLASTPLDCIVGDTVCQDYFSPLGSQFRLLDSSPAS
HHCCCEEEEEEECCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCCEEEECCCCHH
ITEALEEARSLYLAFRSQLLERHHLQLILDQFDKAVITLDDTGALLHYNKYASQLFKINA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEEECCCCCEEEEHHHHHHHEEECC
SGEIYDASFLKQVLHQERHTLREGKTVSAKVVDTPQGAMVVNLYPVFAARQLSRVVLTMQ
CCCEECHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCEEEEEHHHHHHHHHHHHHHHHH
TVSSLQGAEHHVRRQELSRRGLSARYHFDDLLTENPEMLRRLAIIKNYAGTDATILINGE
HHHHHHHHHHHHHHHHHHHCCCCCEECHHHHHCCCHHHHHHHHHHHHCCCCCEEEEEECC
SGTGKEVLAQSIHNASQRVNGPFVAINCGAMAPQILESELFGYVAGAFTGASPKGKIGLF
CCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHHHCCCCCCCCEEHE
ELAHHGTIFLDEISELDKPLQTRLLRVLQERQIMRLGSDQMIPVDIRVIAATNQTLTKLI
EECCCCEEEHHHHHHHCHHHHHHHHHHHHHHHHHHCCCCCEEEEEEEEEEECCHHHHHHH
ADGTFREDLYYRLNVLKVTTIPLRKRPEDIKAIGLSLLTSFSQHYKRPALTLTPALWQEL
HCCCHHHHHHHEEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCEEECHHHHHHH
QRFAWPGNVRQLSNIIERLVLSIDHSPATLDEGRLLLDDLEEGSRREPTTCHDCQMLAGD
HHHCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHCCCCCCCCCCHHHHHHHHCC
YKTIRLRILRKLLEAERDNKSLVAKRLNVDRTSLTRWIRESA
HHHHHHHHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHHHHCC
>Mature Secondary Structure
MNKTKDIAASPLCFVSPYPQLAKAAEALVAQLDYAVTIHQTTLNRILDELPLLESRGHQV
CCCCCCCCCCCEEEECCCHHHHHHHHHHHHHHHHEEEEHHHHHHHHHHHCCHHHCCCCEE
LISRGGCAEILKKHSKLPVVEIKMSGYDILDALIPFKGQKGTVGIVGFSSVIKGCARVAE
EEECCCHHHHHHHCCCCCEEEEEECCHHHHHHHHCCCCCCCCEEEEEHHHHHHHHHHHHH
QLNINYKIFTLQGNDKETISCLKRQLASTPLDCIVGDTVCQDYFSPLGSQFRLLDSSPAS
HHCCCEEEEEEECCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCCEEEECCCCHH
ITEALEEARSLYLAFRSQLLERHHLQLILDQFDKAVITLDDTGALLHYNKYASQLFKINA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEEECCCCCEEEEHHHHHHHEEECC
SGEIYDASFLKQVLHQERHTLREGKTVSAKVVDTPQGAMVVNLYPVFAARQLSRVVLTMQ
CCCEECHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCEEEEEHHHHHHHHHHHHHHHHH
TVSSLQGAEHHVRRQELSRRGLSARYHFDDLLTENPEMLRRLAIIKNYAGTDATILINGE
HHHHHHHHHHHHHHHHHHHCCCCCEECHHHHHCCCHHHHHHHHHHHHCCCCCEEEEEECC
SGTGKEVLAQSIHNASQRVNGPFVAINCGAMAPQILESELFGYVAGAFTGASPKGKIGLF
CCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHHHCCCCCCCCEEHE
ELAHHGTIFLDEISELDKPLQTRLLRVLQERQIMRLGSDQMIPVDIRVIAATNQTLTKLI
EECCCCEEEHHHHHHHCHHHHHHHHHHHHHHHHHHCCCCCEEEEEEEEEEECCHHHHHHH
ADGTFREDLYYRLNVLKVTTIPLRKRPEDIKAIGLSLLTSFSQHYKRPALTLTPALWQEL
HCCCHHHHHHHEEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCEEECHHHHHHH
QRFAWPGNVRQLSNIIERLVLSIDHSPATLDEGRLLLDDLEEGSRREPTTCHDCQMLAGD
HHHCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHCCCCCCCCCCHHHHHHHHCC
YKTIRLRILRKLLEAERDNKSLVAKRLNVDRTSLTRWIRESA
HHHHHHHHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHHHHCC
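
The secondary structure blocks above alternate a line of residues with a line of per-residue states. The Python sketch below separates the two and summarises the state composition. It assumes the conventional reading H = helix, E = strand, C = coil, which the record does not state explicitly, and both function names are hypothetical.

```python
from collections import Counter


def split_interleaved(lines):
    """Split alternating residue/state lines into two concatenated strings.

    Expects only the data lines: drop the '>...' header line before calling.
    """
    rows = [ln.strip() for ln in lines if ln.strip()]
    residues = "".join(rows[0::2])
    states = "".join(rows[1::2])
    if len(residues) != len(states):
        raise ValueError("residue and state lines differ in length")
    return residues, states


def state_percentages(states: str) -> dict:
    """Percentage of each state (assumed H, E, C), rounded to one decimal."""
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {s: round(100.0 * counts.get(s, 0) / len(states), 1) for s in "HEC"}


# Toy input in the same alternating layout as the record.
residues, states = split_interleaved(["MNKT", "CCHH", "KD", "EE"])
print(residues, states)
print(state_percentages(states))
```

The length check catches the most likely copy-and-paste problem, a residue line that has lost its matching state line.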

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8969508; 9384377 [H]