The gene/protein map for NC_002754 is currently unavailable.
Definition Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 chromosome, complete genome.
Accession NC_011094
Length 4,709,075

Click here to switch to the map view.

The map label for this gene is yqiR [H]

Identifier: 194736230

GI number: 194736230

Start: 787858

End: 789786

Strand: Direct

Name: yqiR [H]

Synonym: SeSA_A0812

Alternate gene names: 194736230

Gene position: 787858-789786 (Clockwise)

Preceding gene: 194735971

Following gene: 194736800

Centisome position: 16.73

GC content: 50.18

Gene sequence:

>1929_bases
ATGAATAAGACGAAAGATATAGCCGCATCTCCTCTCTGTTTTGTCTCTCCTTACCCTCAACTGGCAAAGGCTGCCGAGGC
GCTGGTCGCGCAGTTGGATTACGCCGTCACTATTCATCAAACGACGCTTAATCGTATCCTGGATGAGCTACCTTTATTAG
AGTCCCGTGGGCACCAGGTACTGATTAGTCGTGGCGGCTGTGCGGAAATATTAAAAAAGCACAGTAAATTACCGGTAGTC
GAAATTAAAATGTCCGGCTACGACATTCTTGATGCGCTTATCCCTTTTAAAGGACAAAAAGGCACTGTCGGTATTGTCGG
CTTTTCCAGTGTGATCAAAGGATGCGCGCGCGTAGCGGAACAGTTAAATATTAACTATAAAATTTTTACCTTACAGGGAA
ATGATAAAGAAACGATTTCTTGCCTGAAGCGGCAATTAGCGTCCACGCCATTAGATTGCATTGTTGGCGATACCGTTTGT
CAGGATTATTTTTCACCGCTGGGCTCGCAATTCCGTTTACTTGATTCCAGCCCCGCCTCAATAACCGAAGCTCTGGAAGA
AGCCCGCTCATTATATCTGGCTTTTCGCAGCCAATTACTGGAGCGCCATCATCTGCAGCTGATTCTCGATCAGTTTGATA
AAGCTGTTATCACGCTTGATGATACCGGCGCGTTACTGCATTACAATAAATATGCGAGCCAACTTTTTAAAGTTAACGCC
TCCGGTGAAATTTATGACGCGTCTTTCCTGAAACAGGTATTGCACCAGGAGCGGCATACATTACGCGAGGGAAAAACCGT
CAGCGCGAAAGTCGTCGATACGCCGCAAGGCGCGATGGTGGTTAATCTGTATCCGGTATTTGCGGCCAGACAGTTAAGCC
GGGTAGTGTTGACGATGCAAACCGTCTCCAGTTTACAGGGGGCGGAACATCATGTTCGCCGCCAGGAACTGTCTCGCCGC
GGCTTGAGCGCCCGCTATCATTTCGACGATCTGCTTACCGAAAACCCGGAAATGCTGCGTCGTCTGGCGATCATTAAAAA
TTATGCCGGTACGGACGCGACTATCTTAATTAATGGCGAAAGCGGGACGGGGAAAGAGGTGCTGGCGCAAAGTATTCATA
ATGCCAGCCAACGCGTTAACGGCCCGTTTGTCGCCATTAACTGCGGCGCGATGGCCCCTCAGATTCTGGAGAGCGAACTC
TTTGGCTATGTCGCCGGCGCATTTACCGGCGCGTCGCCGAAAGGCAAAATAGGCCTGTTTGAATTAGCGCACCACGGTAC
CATTTTTTTGGATGAGATTAGCGAACTGGATAAACCGCTACAGACGCGCTTATTACGGGTATTGCAGGAGCGGCAGATTA
TGCGGCTGGGCTCAGACCAGATGATACCTGTTGATATTCGCGTGATTGCAGCGACCAATCAGACATTAACGAAGCTCATT
GCGGACGGGACATTTCGCGAAGATCTTTACTATCGGCTTAACGTATTAAAAGTGACCACCATCCCGCTACGCAAACGTCC
GGAGGATATCAAAGCCATCGGCCTGTCGCTGCTTACCAGTTTCAGCCAGCATTATAAACGCCCGGCACTAACGTTAACCC
CGGCGCTGTGGCAAGAGCTTCAGCGCTTCGCCTGGCCCGGAAACGTCAGGCAGTTAAGCAATATTATCGAACGGCTGGTG
CTCTCTATTGATCACTCCCCGGCAACGCTGGATGAGGGCCGCCTCTTGCTGGACGATCTGGAAGAGGGGAGCCGACGCGA
GCCAAGTACCTGCCACGACTGCCAGATGCTGGCTGGCGATTATAAAACGATTCGCCTGCGTATTTTGAGAAAATTATTAG
AGGCGGAAAGGGATAATAAATCGTTAGTGGCGAAACGATTAAATGTCGATCGTACCTCGCTGACGCGCTGGATACGTGAG
TCGGCCTGA

Upstream 100 bases:

>100_bases
CGTAAACTGAACCCAGCCGCCGCAGGCGGTTAGCGTTTCGGCGGAAAATAAATCACGACGGGCGGTCTCGCCCGTTCATA
AACCAACAGGATGGTGAGTG

Downstream 100 bases:

>100_bases
CAAGAATTTACAGACTCTGTGGGCAGCCTTGCAAAGCGTAACGCAAATAACGTCTATTATTATAGACAGTTAACGATCCA
AGAGGTGAAGTGATGAACAA

Product: putative sigma-54 dependent transcriptional regulator

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 642; Mature: 642

Protein sequence:

>642_residues
MNKTKDIAASPLCFVSPYPQLAKAAEALVAQLDYAVTIHQTTLNRILDELPLLESRGHQVLISRGGCAEILKKHSKLPVV
EIKMSGYDILDALIPFKGQKGTVGIVGFSSVIKGCARVAEQLNINYKIFTLQGNDKETISCLKRQLASTPLDCIVGDTVC
QDYFSPLGSQFRLLDSSPASITEALEEARSLYLAFRSQLLERHHLQLILDQFDKAVITLDDTGALLHYNKYASQLFKVNA
SGEIYDASFLKQVLHQERHTLREGKTVSAKVVDTPQGAMVVNLYPVFAARQLSRVVLTMQTVSSLQGAEHHVRRQELSRR
GLSARYHFDDLLTENPEMLRRLAIIKNYAGTDATILINGESGTGKEVLAQSIHNASQRVNGPFVAINCGAMAPQILESEL
FGYVAGAFTGASPKGKIGLFELAHHGTIFLDEISELDKPLQTRLLRVLQERQIMRLGSDQMIPVDIRVIAATNQTLTKLI
ADGTFREDLYYRLNVLKVTTIPLRKRPEDIKAIGLSLLTSFSQHYKRPALTLTPALWQELQRFAWPGNVRQLSNIIERLV
LSIDHSPATLDEGRLLLDDLEEGSRREPSTCHDCQMLAGDYKTIRLRILRKLLEAERDNKSLVAKRLNVDRTSLTRWIRE
SA

Sequences:

>Translated_642_residues
MNKTKDIAASPLCFVSPYPQLAKAAEALVAQLDYAVTIHQTTLNRILDELPLLESRGHQVLISRGGCAEILKKHSKLPVV
EIKMSGYDILDALIPFKGQKGTVGIVGFSSVIKGCARVAEQLNINYKIFTLQGNDKETISCLKRQLASTPLDCIVGDTVC
QDYFSPLGSQFRLLDSSPASITEALEEARSLYLAFRSQLLERHHLQLILDQFDKAVITLDDTGALLHYNKYASQLFKVNA
SGEIYDASFLKQVLHQERHTLREGKTVSAKVVDTPQGAMVVNLYPVFAARQLSRVVLTMQTVSSLQGAEHHVRRQELSRR
GLSARYHFDDLLTENPEMLRRLAIIKNYAGTDATILINGESGTGKEVLAQSIHNASQRVNGPFVAINCGAMAPQILESEL
FGYVAGAFTGASPKGKIGLFELAHHGTIFLDEISELDKPLQTRLLRVLQERQIMRLGSDQMIPVDIRVIAATNQTLTKLI
ADGTFREDLYYRLNVLKVTTIPLRKRPEDIKAIGLSLLTSFSQHYKRPALTLTPALWQELQRFAWPGNVRQLSNIIERLV
LSIDHSPATLDEGRLLLDDLEEGSRREPSTCHDCQMLAGDYKTIRLRILRKLLEAERDNKSLVAKRLNVDRTSLTRWIRE
SA
>Mature_642_residues
MNKTKDIAASPLCFVSPYPQLAKAAEALVAQLDYAVTIHQTTLNRILDELPLLESRGHQVLISRGGCAEILKKHSKLPVV
EIKMSGYDILDALIPFKGQKGTVGIVGFSSVIKGCARVAEQLNINYKIFTLQGNDKETISCLKRQLASTPLDCIVGDTVC
QDYFSPLGSQFRLLDSSPASITEALEEARSLYLAFRSQLLERHHLQLILDQFDKAVITLDDTGALLHYNKYASQLFKVNA
SGEIYDASFLKQVLHQERHTLREGKTVSAKVVDTPQGAMVVNLYPVFAARQLSRVVLTMQTVSSLQGAEHHVRRQELSRR
GLSARYHFDDLLTENPEMLRRLAIIKNYAGTDATILINGESGTGKEVLAQSIHNASQRVNGPFVAINCGAMAPQILESEL
FGYVAGAFTGASPKGKIGLFELAHHGTIFLDEISELDKPLQTRLLRVLQERQIMRLGSDQMIPVDIRVIAATNQTLTKLI
ADGTFREDLYYRLNVLKVTTIPLRKRPEDIKAIGLSLLTSFSQHYKRPALTLTPALWQELQRFAWPGNVRQLSNIIERLV
LSIDHSPATLDEGRLLLDDLEEGSRREPSTCHDCQMLAGDYKTIRLRILRKLLEAERDNKSLVAKRLNVDRTSLTRWIRE
SA

Specific function: Member Of The Two-Component Regulatory System Atos/Atoc Involved In The Transcriptional Regulation Of The Ato Genes For Acetoacetate Metabolism. Also An Inhibitor Of Polyamine Biosynthesis. [C]

COG id: COG3829

COG function: function code KT; Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains

Gene ontology:

Cell location: Cytoplasmic [H]

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 sigma-54 factor interaction domain [H]

Homologues:

Organism=Escherichia coli, GI1788550, Length=358, Percent_Identity=37.9888268156425, Blast_Score=237, Evalue=1e-63,
Organism=Escherichia coli, GI1786524, Length=381, Percent_Identity=38.8451443569554, Blast_Score=230, Evalue=2e-61,
Organism=Escherichia coli, GI1788905, Length=308, Percent_Identity=42.5324675324675, Blast_Score=230, Evalue=2e-61,
Organism=Escherichia coli, GI1789233, Length=234, Percent_Identity=47.4358974358974, Blast_Score=222, Evalue=7e-59,
Organism=Escherichia coli, GI87082117, Length=258, Percent_Identity=46.1240310077519, Blast_Score=219, Evalue=4e-58,
Organism=Escherichia coli, GI1789087, Length=312, Percent_Identity=40.7051282051282, Blast_Score=212, Evalue=5e-56,
Organism=Escherichia coli, GI1790437, Length=231, Percent_Identity=44.5887445887446, Blast_Score=205, Evalue=9e-54,
Organism=Escherichia coli, GI1790299, Length=232, Percent_Identity=44.8275862068966, Blast_Score=195, Evalue=8e-51,
Organism=Escherichia coli, GI1787583, Length=365, Percent_Identity=33.972602739726, Blast_Score=186, Evalue=5e-48,
Organism=Escherichia coli, GI87082152, Length=227, Percent_Identity=42.7312775330396, Blast_Score=182, Evalue=8e-47,
Organism=Escherichia coli, GI87081872, Length=245, Percent_Identity=39.5918367346939, Blast_Score=170, Evalue=2e-43,
Organism=Escherichia coli, GI87081858, Length=385, Percent_Identity=29.8701298701299, Blast_Score=154, Evalue=2e-38,
Organism=Escherichia coli, GI1789828, Length=311, Percent_Identity=33.1189710610932, Blast_Score=136, Evalue=4e-33,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003593
- InterPro:   IPR020441
- InterPro:   IPR009057
- InterPro:   IPR012287
- InterPro:   IPR002197
- InterPro:   IPR016040
- InterPro:   IPR001610
- InterPro:   IPR000014
- InterPro:   IPR013767
- InterPro:   IPR002078 [H]

Pfam domain/function: PF02954 HTH_8; PF00989 PAS; PF00158 Sigma54_activat [H]

EC number: NA

Molecular weight: Translated: 71725; Mature: 71725

Theoretical pI: Translated: 8.71; Mature: 8.71

Prosite motif: PS00675 SIGMA54_INTERACT_1 ; PS00688 SIGMA54_INTERACT_3 ; PS50045 SIGMA54_INTERACT_4

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.4 %Cys     (Translated Protein)
1.4 %Met     (Translated Protein)
2.8 %Cys+Met (Translated Protein)
1.4 %Cys     (Mature Protein)
1.4 %Met     (Mature Protein)
2.8 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MNKTKDIAASPLCFVSPYPQLAKAAEALVAQLDYAVTIHQTTLNRILDELPLLESRGHQV
CCCCCCCCCCCEEEECCCHHHHHHHHHHHHHHHHEEEEHHHHHHHHHHHCCHHHCCCCEE
LISRGGCAEILKKHSKLPVVEIKMSGYDILDALIPFKGQKGTVGIVGFSSVIKGCARVAE
EEECCCHHHHHHHCCCCCEEEEEECCHHHHHHHHCCCCCCCCEEEEEHHHHHHHHHHHHH
QLNINYKIFTLQGNDKETISCLKRQLASTPLDCIVGDTVCQDYFSPLGSQFRLLDSSPAS
HHCCCEEEEEEECCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCCEEEECCCCHH
ITEALEEARSLYLAFRSQLLERHHLQLILDQFDKAVITLDDTGALLHYNKYASQLFKVNA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEEECCCCCEEEHHHHHHHHEEECC
SGEIYDASFLKQVLHQERHTLREGKTVSAKVVDTPQGAMVVNLYPVFAARQLSRVVLTMQ
CCCEECHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCEEEEEHHHHHHHHHHHHHHHHH
TVSSLQGAEHHVRRQELSRRGLSARYHFDDLLTENPEMLRRLAIIKNYAGTDATILINGE
HHHHHHHHHHHHHHHHHHHCCCCCEECHHHHHCCCHHHHHHHHHHHHCCCCCEEEEEECC
SGTGKEVLAQSIHNASQRVNGPFVAINCGAMAPQILESELFGYVAGAFTGASPKGKIGLF
CCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHHHCCCCCCCCEEHE
ELAHHGTIFLDEISELDKPLQTRLLRVLQERQIMRLGSDQMIPVDIRVIAATNQTLTKLI
EECCCCEEEHHHHHHHCHHHHHHHHHHHHHHHHHHCCCCCEEEEEEEEEEECCHHHHHHH
ADGTFREDLYYRLNVLKVTTIPLRKRPEDIKAIGLSLLTSFSQHYKRPALTLTPALWQEL
HCCCHHHHHHHEEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCEEECHHHHHHH
QRFAWPGNVRQLSNIIERLVLSIDHSPATLDEGRLLLDDLEEGSRREPSTCHDCQMLAGD
HHHCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHCCCCCCCCCCHHHHHHHHCC
YKTIRLRILRKLLEAERDNKSLVAKRLNVDRTSLTRWIRESA
HHHHHHHHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHHHHCC
>Mature Secondary Structure
MNKTKDIAASPLCFVSPYPQLAKAAEALVAQLDYAVTIHQTTLNRILDELPLLESRGHQV
CCCCCCCCCCCEEEECCCHHHHHHHHHHHHHHHHEEEEHHHHHHHHHHHCCHHHCCCCEE
LISRGGCAEILKKHSKLPVVEIKMSGYDILDALIPFKGQKGTVGIVGFSSVIKGCARVAE
EEECCCHHHHHHHCCCCCEEEEEECCHHHHHHHHCCCCCCCCEEEEEHHHHHHHHHHHHH
QLNINYKIFTLQGNDKETISCLKRQLASTPLDCIVGDTVCQDYFSPLGSQFRLLDSSPAS
HHCCCEEEEEEECCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCCEEEECCCCHH
ITEALEEARSLYLAFRSQLLERHHLQLILDQFDKAVITLDDTGALLHYNKYASQLFKVNA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEEECCCCCEEEHHHHHHHHEEECC
SGEIYDASFLKQVLHQERHTLREGKTVSAKVVDTPQGAMVVNLYPVFAARQLSRVVLTMQ
CCCEECHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCEEEEEHHHHHHHHHHHHHHHHH
TVSSLQGAEHHVRRQELSRRGLSARYHFDDLLTENPEMLRRLAIIKNYAGTDATILINGE
HHHHHHHHHHHHHHHHHHHCCCCCEECHHHHHCCCHHHHHHHHHHHHCCCCCEEEEEECC
SGTGKEVLAQSIHNASQRVNGPFVAINCGAMAPQILESELFGYVAGAFTGASPKGKIGLF
CCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHHHCCCCCCCCEEHE
ELAHHGTIFLDEISELDKPLQTRLLRVLQERQIMRLGSDQMIPVDIRVIAATNQTLTKLI
EECCCCEEEHHHHHHHCHHHHHHHHHHHHHHHHHHCCCCCEEEEEEEEEEECCHHHHHHH
ADGTFREDLYYRLNVLKVTTIPLRKRPEDIKAIGLSLLTSFSQHYKRPALTLTPALWQEL
HCCCHHHHHHHEEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCEEECHHHHHHH
QRFAWPGNVRQLSNIIERLVLSIDHSPATLDEGRLLLDDLEEGSRREPSTCHDCQMLAGD
HHHCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHCCCCCCCCCCHHHHHHHHCC
YKTIRLRILRKLLEAERDNKSLVAKRLNVDRTSLTRWIRESA
HHHHHHHHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHHHHCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8969508; 9384377 [H]