Definition: Salmonella enterica subsp. enterica serovar Typhi str. Ty2 chromosome, complete genome.
Accession: NC_004631
Length: 4,791,961 bp

The map label for this gene is rocR [H]

Identifier: 29141018

GI number: 29141018

Start: 575392

End: 576819

Strand: Reverse

Name: rocR [H]

Synonym: t0503

Alternate gene names: 29141018

Gene position: 576819-575392 (Counterclockwise)

Preceding gene: 29141029

Following gene: 29141008

Centisome position: 12.04
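
Note: the coordinates in this record are enough to recompute the gene length and the centisome position. The sketch below assumes that the centisome position is the 5' coordinate of the gene expressed as a percentage of the chromosome length. This is an inference from the numbers in this record, not a documented definition. For a reverse-strand gene the 5' coordinate is the larger of the two. The function names are illustrative.

```python
GENOME_LENGTH = 4_791_961       # chromosome length from this record
START, END = 575_392, 576_819   # gene coordinates from this record
STRAND = "Reverse"


def gene_length(start: int, end: int) -> int:
    """Inclusive length of the gene in bases."""
    return end - start + 1


def centisome(start: int, end: int, strand: str, genome_length: int) -> float:
    """Position of the gene's 5' end as a percentage of the genome length.

    Assumption: the 5' end is `end` on the reverse strand, `start` otherwise.
    """
    five_prime = end if strand == "Reverse" else start
    return 100.0 * five_prime / genome_length


print(gene_length(START, END))                                   # 1428
print(round(centisome(START, END, STRAND, GENOME_LENGTH), 2))    # 12.04
```

Using the smaller coordinate (575392) instead would give 12.01, which does not match the listed value, so the 5' end reading seems the more likely one.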

GC content: 52.17

Gene sequence:

>1428_bases
ATGGCATCCACCAATCAGGAACTGGCCTCGGCGCTCAGGATGTTTTCCCGCTTTTTCGATTTGATTCATCAGCCGTTGGC
CGTCATTAATGAGCGTGGTGAATACGTTTACTACAATCAGGAAAGCGCGGATCTGGACGGCTACAGCATTGAACGGGCAA
TGGGAAAACATATGCTGGATGTCTATCCGGGCATGAAAGAAACCCAAAGTACGATGCTTTCATCGTTAAAAAAAGGCGTG
GAATACATTGGCCATTATCAAATTTATTACAATGCGCGAGGCCAGGCCGTTGACTATCAGCACACCACCGCGTCGCTCTA
TGCAAGCGATGGCGGCATGGTCGGCGTTATCGAAATCGGCAGGAATATGTCCGGCGTCAGGCGGCTCCAGGAGCAGGTGG
TAGAACTGAACCAACTGCTGTATGCCGATCACCATGAGAAGCACCATGCCATTATTACCGAAAATCCGGAAATGCTCAGT
AATATCGCCAAAGCCAAACGGCTGGCCGCCAGTAATATTCCGGTGACGATTGTCGGGGAGACGGGAACGGGTAAGGAGCT
ATTTTCCCGCCTGATACATCAATGCAGCAAGCGGGCGAATAAGCCATTTATCGCCCTGAACTGCGGCGCTCTACCGCCTA
CGCTTATCGAAAGTACGCTTTTCGGCACCGTGCGCGGCGCCTATACCGGCGCGGAAAATAGCCAGGGCTATCTGGAACTG
GCAAACGGGGGAACGCTCTTTCTTGATGAGCTGAACGCCATGCCGATAGAAATGCAAAGTAAGCTGCTGCGATTCTTGCA
GGATAAAACCTTCTGGCGGCTCGGCGGACAACAGCAACTCCACTCTGATGTCAGGATCGTTGCCGCCATGAACGAAGCGC
CCGTCAAATTAATTCAACAAGAACGCCTGCGGGCAGATCTTTTTTATCGGTTGAGCGTCGGAATGTTGACGTTACCGCCG
TTGCGCGCCCGCCCGGAAGATATCCCCTTACTGGCGAATTATTTTATTGATAAATACCGTAATGACGTGCCGCAGGACAT
TCACGGATTAAGCGAGACGGCGCGTGCTGATCTGCTCAATCACGCCTGGCCGGGTAATGTCAGGATGCTGGAAAACGCGA
TTGTACGCAGCATGATCATGCAGGAAAAAGACGGGCTGCTGAAACACATCATTTTTGAACAGGACGAGTTAAATTTAGGC
GTACCGGAAACCGCGCCGGAGAATCCCCTTCCCTCGTCACCCGATCCGCAGTATGAAGGGTCGCTGGAGGTACGGGTTGC
CAACTACGAAAGGCATTTAATAGAAACCGCGCTGGATACGCATCAGGGGAACATTGCCGCCGCGGCCCGTAGCCTTAACG
TATCCCGCACCACGTTACAGTACAAAGTACAAAAATACGCTATTCGCTTCGGCGTGGTGCGTAATTGA
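
Note: the GC content field can be recomputed from a FASTA-style block like the one above. The sketch below is a minimal illustration with hypothetical helper names. It also includes a reverse-complement helper, because the gene lies on the reverse strand and the sequence is shown in coding orientation (it starts with ATG), so the forward-strand genomic sequence would be its reverse complement. The toy record in the usage lines is made up for illustration and is not taken from this entry.

```python
def read_fasta_sequence(text: str) -> str:
    """Join the sequence lines of one FASTA-style record, skipping the '>' header."""
    lines = (ln.strip() for ln in text.splitlines())
    return "".join(ln for ln in lines if ln and not ln.startswith(">")).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Translation table mapping each base to its complement.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of an uppercase DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


toy_record = ">8_bases\nATGC\nGGCC\n"   # made-up example, not from this entry
toy_seq = read_fasta_sequence(toy_record)
print(toy_seq)                       # ATGCGGCC
print(gc_percent(toy_seq))           # 75.0
print(reverse_complement("ATGC"))    # GCAT
```

Pasting the 1428-base block into `read_fasta_sequence` and rounding `gc_percent` to two decimals should allow a comparison with the listed GC content.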

Upstream 100 bases:

>100_bases
TGTTTCTCATTTTCGTCATTAAGTTTGCTATTCATCATAACGACGAAAATTCGTCATTTTTATAATGGCAACATGAACAA
TAGATATGAGGAATGCATGT

Downstream 100 bases:

>100_bases
CATCGGCCAGCGGCATCGCGCCGCTGGCAAAGCCATTAGCCTTCGTTATGCATTTCCAAGTTTTCCATTTCGTTCTGGAA
CAGCACCGCTTTGGCGTCGT

Product: transcriptional regulator

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 475; Mature: 474
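
Note: the two residue counts follow from the gene length. The sketch below assumes that the coding sequence ends with a stop codon (the gene sequence in this record ends in TGA) and that "Mature" means the translated protein with its initiator methionine removed, which is consistent with the mature sequence in this record starting at the second residue. The function name is illustrative.

```python
from typing import Tuple


def residue_counts(cds_length: int) -> Tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence.

    Assumptions: the last codon is a stop codon, and the mature protein
    is the translated protein minus the initiator methionine.
    """
    if cds_length % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = cds_length // 3
    translated = codons - 1   # stop codon encodes no residue
    mature = translated - 1   # initiator Met removed
    return translated, mature


print(residue_counts(1428))   # (475, 474)
```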

Protein sequence:

>475_residues
MASTNQELASALRMFSRFFDLIHQPLAVINERGEYVYYNQESADLDGYSIERAMGKHMLDVYPGMKETQSTMLSSLKKGV
EYIGHYQIYYNARGQAVDYQHTTASLYASDGGMVGVIEIGRNMSGVRRLQEQVVELNQLLYADHHEKHHAIITENPEMLS
NIAKAKRLAASNIPVTIVGETGTGKELFSRLIHQCSKRANKPFIALNCGALPPTLIESTLFGTVRGAYTGAENSQGYLEL
ANGGTLFLDELNAMPIEMQSKLLRFLQDKTFWRLGGQQQLHSDVRIVAAMNEAPVKLIQQERLRADLFYRLSVGMLTLPP
LRARPEDIPLLANYFIDKYRNDVPQDIHGLSETARADLLNHAWPGNVRMLENAIVRSMIMQEKDGLLKHIIFEQDELNLG
VPETAPENPLPSSPDPQYEGSLEVRVANYERHLIETALDTHQGNIAAAARSLNVSRTTLQYKVQKYAIRFGVVRN

Sequences:

>Translated_475_residues
MASTNQELASALRMFSRFFDLIHQPLAVINERGEYVYYNQESADLDGYSIERAMGKHMLDVYPGMKETQSTMLSSLKKGV
EYIGHYQIYYNARGQAVDYQHTTASLYASDGGMVGVIEIGRNMSGVRRLQEQVVELNQLLYADHHEKHHAIITENPEMLS
NIAKAKRLAASNIPVTIVGETGTGKELFSRLIHQCSKRANKPFIALNCGALPPTLIESTLFGTVRGAYTGAENSQGYLEL
ANGGTLFLDELNAMPIEMQSKLLRFLQDKTFWRLGGQQQLHSDVRIVAAMNEAPVKLIQQERLRADLFYRLSVGMLTLPP
LRARPEDIPLLANYFIDKYRNDVPQDIHGLSETARADLLNHAWPGNVRMLENAIVRSMIMQEKDGLLKHIIFEQDELNLG
VPETAPENPLPSSPDPQYEGSLEVRVANYERHLIETALDTHQGNIAAAARSLNVSRTTLQYKVQKYAIRFGVVRN
>Mature_474_residues
ASTNQELASALRMFSRFFDLIHQPLAVINERGEYVYYNQESADLDGYSIERAMGKHMLDVYPGMKETQSTMLSSLKKGVE
YIGHYQIYYNARGQAVDYQHTTASLYASDGGMVGVIEIGRNMSGVRRLQEQVVELNQLLYADHHEKHHAIITENPEMLSN
IAKAKRLAASNIPVTIVGETGTGKELFSRLIHQCSKRANKPFIALNCGALPPTLIESTLFGTVRGAYTGAENSQGYLELA
NGGTLFLDELNAMPIEMQSKLLRFLQDKTFWRLGGQQQLHSDVRIVAAMNEAPVKLIQQERLRADLFYRLSVGMLTLPPL
RARPEDIPLLANYFIDKYRNDVPQDIHGLSETARADLLNHAWPGNVRMLENAIVRSMIMQEKDGLLKHIIFEQDELNLGV
PETAPENPLPSSPDPQYEGSLEVRVANYERHLIETALDTHQGNIAAAARSLNVSRTTLQYKVQKYAIRFGVVRN

Specific function: Positive regulator of arginine catabolism. Controls the transcription of the two operons rocABC and rocDEF and probably acts by binding to the corresponding upstream activating sequences [H]

COG id: COG3829

COG function: function code KT; Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 sigma-54 factor interaction domain [H]

Homologues:

Organism=Escherichia coli, GI1788550, Length=349, Percent_Identity=40.1146131805158, Blast_Score=243, Evalue=2e-65,
Organism=Escherichia coli, GI87082117, Length=331, Percent_Identity=39.5770392749245, Blast_Score=224, Evalue=9e-60,
Organism=Escherichia coli, GI1789087, Length=351, Percent_Identity=39.031339031339, Blast_Score=214, Evalue=1e-56,
Organism=Escherichia coli, GI1790299, Length=334, Percent_Identity=39.5209580838323, Blast_Score=202, Evalue=5e-53,
Organism=Escherichia coli, GI1787583, Length=320, Percent_Identity=36.875, Blast_Score=201, Evalue=9e-53,
Organism=Escherichia coli, GI1790437, Length=316, Percent_Identity=37.0253164556962, Blast_Score=199, Evalue=3e-52,
Organism=Escherichia coli, GI87082152, Length=327, Percent_Identity=38.2262996941896, Blast_Score=199, Evalue=3e-52,
Organism=Escherichia coli, GI1788905, Length=329, Percent_Identity=38.2978723404255, Blast_Score=199, Evalue=4e-52,
Organism=Escherichia coli, GI1789233, Length=307, Percent_Identity=37.4592833876222, Blast_Score=194, Evalue=9e-51,
Organism=Escherichia coli, GI87081872, Length=325, Percent_Identity=36.6153846153846, Blast_Score=179, Evalue=5e-46,
Organism=Escherichia coli, GI1786524, Length=321, Percent_Identity=36.7601246105919, Blast_Score=167, Evalue=1e-42,
Organism=Escherichia coli, GI1789828, Length=258, Percent_Identity=31.7829457364341, Blast_Score=130, Evalue=2e-31,
Organism=Escherichia coli, GI87081858, Length=325, Percent_Identity=28.3076923076923, Blast_Score=124, Evalue=2e-29,
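
Note: the homologue lines above follow a regular comma-separated layout, so they can be turned into records for sorting or filtering. The parser below is a minimal sketch written against the layout seen in this record (key=value fields, one bare `GI…` field, and a trailing comma). The function name and the output dictionary keys are illustrative choices.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dictionary with typed values."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:                 # skips the empty field after the trailing comma
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):  # bare GI number, e.g. "GI1788550"
            record["GI"] = field[2:]
    # Convert numeric fields; the organism name and GI number stay as strings.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


line = ("Organism=Escherichia coli, GI1788550, Length=349, "
        "Percent_Identity=40.1146131805158, Blast_Score=243, Evalue=2e-65,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], hit["Length"], round(hit["Percent_Identity"], 1))
# Escherichia coli 1788550 349 40.1
```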

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003593
- InterPro:   IPR020441
- InterPro:   IPR009057
- InterPro:   IPR002197
- InterPro:   IPR000014
- InterPro:   IPR013656
- InterPro:   IPR002078 [H]

Pfam domain/function: PF02954 HTH_8; PF08448 PAS_4; PF00158 Sigma54_activat [H]

EC number: NA

Molecular weight: Translated: 53387; Mature: 53256

Theoretical pI: Translated: 6.68; Mature: 6.68

Prosite motif: PS50112 PAS; PS00676 SIGMA54_INTERACT_2; PS50045 SIGMA54_INTERACT_4

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.4 %Cys     (Translated Protein)
3.4 %Met     (Translated Protein)
3.8 %Cys+Met (Translated Protein)
0.4 %Cys     (Mature Protein)
3.2 %Met     (Mature Protein)
3.6 %Cys+Met (Mature Protein)
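
Note: these percentages are residue counts divided by the sequence length, rounded to one decimal. The sketch below shows the calculation with an illustrative function name. Counting residues in the translated sequence in this record appears to give 2 Cys and 16 Met out of 475, which is consistent with the listed values.

```python
def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in `seq` whose residue is one of `residues`."""
    if not seq:
        raise ValueError("empty sequence")
    count = sum(1 for aa in seq if aa in residues)
    return round(100.0 * count / len(seq), 1)


toy = "MACM"                          # made-up example, not from this entry
print(residue_percent(toy, "C"))      # 25.0
print(residue_percent(toy, "M"))      # 50.0
print(residue_percent(toy, "CM"))     # 75.0

# Same arithmetic with the counts that appear to apply to the translated protein:
print(round(100.0 * 2 / 475, 1))      # 0.4
print(round(100.0 * 16 / 475, 1))     # 3.4
```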

Secondary structure:

>Translated Secondary Structure
MASTNQELASALRMFSRFFDLIHQPLAVINERGEYVYYNQESADLDGYSIERAMGKHMLD
CCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCCCCHHHHHHHHHHHHH
VYPGMKETQSTMLSSLKKGVEYIGHYQIYYNARGQAVDYQHTTASLYASDGGMVGVIEIG
HCCCCHHHHHHHHHHHHHHHHHHEEEEEEECCCCCEECCCCCEEEEEECCCCEEEEEEEC
RNMSGVRRLQEQVVELNQLLYADHHEKHHAIITENPEMLSNIAKAKRLAASNIPVTIVGE
CCHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEECCHHHHHHHHHHHHHHHCCCCEEEEEC
TGTGKELFSRLIHQCSKRANKPFIALNCGALPPTLIESTLFGTVRGAYTGAENSQGYLEL
CCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEE
ANGGTLFLDELNAMPIEMQSKLLRFLQDKTFWRLGGQQQLHSDVRIVAAMNEAPVKLIQQ
CCCCEEEEECCCCCCHHHHHHHHHHHHHCHHHHCCCHHHHHCCEEEEEECCHHHHHHHHH
ERLRADLFYRLSVGMLTLPPLRARPEDIPLLANYFIDKYRNDVPQDIHGLSETARADLLN
HHHHHHHHHHHHCCCEECCCCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHH
HAWPGNVRMLENAIVRSMIMQEKDGLLKHIIFEQDELNLGVPETAPENPLPSSPDPQYEG
CCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCC
SLEVRVANYERHLIETALDTHQGNIAAAARSLNVSRTTLQYKVQKYAIRFGVVRN
CEEEEEECHHHHHHHHHHHHCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCC
>Mature Secondary Structure 
ASTNQELASALRMFSRFFDLIHQPLAVINERGEYVYYNQESADLDGYSIERAMGKHMLD
CCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCCCCHHHHHHHHHHHHH
VYPGMKETQSTMLSSLKKGVEYIGHYQIYYNARGQAVDYQHTTASLYASDGGMVGVIEIG
HCCCCHHHHHHHHHHHHHHHHHHEEEEEEECCCCCEECCCCCEEEEEECCCCEEEEEEEC
RNMSGVRRLQEQVVELNQLLYADHHEKHHAIITENPEMLSNIAKAKRLAASNIPVTIVGE
CCHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEECCHHHHHHHHHHHHHHHCCCCEEEEEC
TGTGKELFSRLIHQCSKRANKPFIALNCGALPPTLIESTLFGTVRGAYTGAENSQGYLEL
CCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEE
ANGGTLFLDELNAMPIEMQSKLLRFLQDKTFWRLGGQQQLHSDVRIVAAMNEAPVKLIQQ
CCCCEEEEECCCCCCHHHHHHHHHHHHHCHHHHCCCHHHHHCCEEEEEECCHHHHHHHHH
ERLRADLFYRLSVGMLTLPPLRARPEDIPLLANYFIDKYRNDVPQDIHGLSETARADLLN
HHHHHHHHHHHHCCCEECCCCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHH
HAWPGNVRMLENAIVRSMIMQEKDGLLKHIIFEQDELNLGVPETAPENPLPSSPDPQYEG
CCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCC
SLEVRVANYERHLIETALDTHQGNIAAAARSLNVSRTTLQYKVQKYAIRFGVVRN
CEEEEEECHHHHHHHHHHHHCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCC
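
Note: the secondary structure blocks above interleave a residue line with a line of per-residue states. The sketch below separates the two and summarises the state composition. It assumes the common three-state convention (H = helix, E = strand, C = coil), which this record does not spell out, and the function names are illustrative.

```python
from collections import Counter
from typing import Dict, List, Tuple


def split_interleaved(lines: List[str]) -> Tuple[str, str]:
    """Split alternating residue/state lines into two joined strings."""
    residues = "".join(lines[0::2])
    states = "".join(lines[1::2])
    if len(residues) != len(states):
        raise ValueError("residue and state strings differ in length")
    return residues, states


def state_percentages(states: str) -> Dict[str, float]:
    """Percentage of positions in each of the states H, E and C."""
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {s: round(100.0 * counts.get(s, 0) / len(states), 1) for s in "HEC"}


# Made-up toy block, not taken from this entry.
residues, states = split_interleaved(["MAS", "CCH", "TN", "HH"])
print(residues, states)                 # MASTN CCHHH
print(state_percentages("CCHHHHEE"))    # {'H': 50.0, 'E': 25.0, 'C': 25.0}
```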

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8113162; 9384377 [H]