Definition Shigella boydii CDC 3083-94 chromosome, complete genome.
Accession NC_010658
Length 4,615,997


The map label for this gene is rpoN

Identifier: 187730690

GI number: 187730690

Start: 3322580

End: 3324013

Strand: Reverse

Name: rpoN

Synonym: SbBS512_E3568

Alternate gene names: 187730690

Gene position: 3324013-3322580 (Counterclockwise)

Preceding gene: 187730239

Following gene: 187732959

Centisome position: 72.01
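
The coordinate fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not the database's own method: it assumes the coordinates are 1-based and inclusive, and that the centisome position is the gene's 5' coordinate (the higher one, since the gene is on the reverse strand) expressed as a percentage of the chromosome length. The function names are illustrative only.

```python
# Values copied from the Length, Start and End fields of this record.
GENOME_LENGTH = 4_615_997
START, END = 3_322_580, 3_324_013


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the chromosome length."""
    return 100.0 * position / genome_length


# Assumption: on the reverse strand the 5' end is the higher coordinate.
five_prime = max(START, END)

print(gene_length(START, END))                           # 1434
print(round(centisome(five_prime, GENOME_LENGTH), 2))    # 72.01
```

Under these assumptions the results agree with the 1434-base gene sequence and the centisome position of 72.01 listed in this record.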

GC content: 52.93
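
A GC content figure like the one above is conventionally the percentage of G and C bases in the sequence. The sketch below shows one way to compute it from the gene sequence in this record. The helper name `gc_percent` is hypothetical, and whether the database counts ambiguous bases the same way is not stated in the source.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C among all bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# Usage: paste the FASTA body (without the '>' header line) as one string;
# line breaks are stripped before counting.
example = """ATGAAGCAAG
GTTTGCAACT"""
print(round(gc_percent(example), 2))
```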

Gene sequence:

>1434_bases
ATGAAGCAAGGTTTGCAACTCAGGCTTAGCCAACAACTGGCGATGACGCCACAGCTTCAACAGGCGATTCGTCTGTTGCA
GTTGTCGACGCTGGAACTTCAGCAGGAGCTACAGCAGGCGCTGGAGAGTAATCCGCTGCTTGAGCAAATCGACACTCATG
AAGAAATCGACACCCGCGAAACGCAAGACAGTGAAACGCTGGACACCGCCGACGCGCTCGAACAAAAAGAGATGCCGGAA
GAGCTGCCGCTCGATGCCAGTTGGGACACCATTTACACCGCTGGTACACCATCCGGCACCAGCGGTGACTACATTGACGA
TGAGCTGCCGGTCTATCAGGGCGAAACGACGCAGACCTTGCAGGATTACCTGATGTGGCAGGTCGAGCTGACACCGTTTT
CCGACACTGACCGCGCTATTGCTACCTCTATCGTCGATGCCGTTGATGACACCGGTTATCTGACTGTCCCGCTGGAAGAT
ATTCTCGAAAGTATGGGCGATGAAGAGATCGACATCGACGAGGTTGAAGCCGTCCTTAAGCGGATCCAACGGTTTGATCC
GGTCGGTGTAGCGGCAAAAGATCTGCGTGACTGCCTGCTAATCCAACTCTCCCAATTCGATAAAACCACGCCGTGGCTGG
AAGAGGCCAGACTGATCATTAGCGATCATCTCGATCTGTTAGCCAATCACGACTTCCGCACTTTAATGCGCGTCACGCGT
CTGAAAGAAGATGTGCTGAAAGAAGCCGTCAATCTGATCCAGTCGCTCGATCCGCGCCCCGGGCAGTCGATCCAGACTGG
CGAACCTGAGTATGTCATTCCAGATGTGCTGGTGCGTAAGCATAACGGTCACTGGACGGTAGAACTCAACAGTGACAGCA
TTCCGCGTCTGCAAATCAACCAGCACTACGCCTCGATGTGCAATAACGCGCGCAACGATGGTGACAGCCAGTTTATCCGC
AGCAATCTGCAGGATGCCAAATGGTTGATCAAGAGTCTGGAAAGCCGTAACGATACGCTACTGCGCGTGAGTCGCTGTAT
CGTTGAACAGCAGCAAGCCTTCTTTGAGCAAGGTGAAGAATATATGAAACCGATGGTACTGGCCGATATCGCCCAGGCTG
TCGAAATGCATGAATCGACGATATCTCGCGTGACCACGCAAAAATACCTGCATAGTCCACGAGGCATTTTTGAACTGAAG
TATTTCTTTTCCAGTCACGTCAATACCGAGGGCGGCGGCGAAGCTTCCTCCACGGCGATTCGTGCGCTGGTGAAGAAATT
AATCGCGGCGGAAAACCCAGCGAAACCGTTGAGCGACAGCAAGTTAACCTCTTTGCTGTCGGAACAAGGTATCATGGTGG
CACGCCGCACTGTTGCGAAGTACCGAGAGTCTTTATCCATTCCGCCGTCAAACCAGCGTAAACAGCTCGTTTGA

Upstream 100 bases:

>100_bases
TACAAGACGAACACGTTAAGCGTGTATACCTTGGGGAAGACTTCAGACTCTGATAGGGTAGAAGTTTGCGACGTTTTAGC
AGGAGAGTACGATTCTGAAC

Downstream 100 bases:

>100_bases
CCCAACCGATAAGGAAGACACTATGCAGCTCAACATTACCGGAAATCACGTCGAGATCACCGAGGCACTGCGCGAATTTG
TTAGAGCCAAATTTGCCAAA

Product: RNA polymerase factor sigma-54

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 477; Mature: 477
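
The residue count follows from the gene length: 1434 bases make 478 codons, and the last codon is a stop, leaving 477 amino acids. The sketch below illustrates this with a small translator built on the standard genetic code. It is an illustration under that assumption, not the pipeline used to produce this record, and the names `CODON_TABLE`, `translate` and `expected_residues` are made up for the example.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def expected_residues(n_bases: int) -> int:
    """Codon count minus one terminal stop codon."""
    return n_bases // 3 - 1


print(expected_residues(1434))                  # 477
print(translate("ATGAAGCAAGGTTTGCAACTCAGG"))    # MKQGLQLR
```

The second call translates the first 24 bases of the gene sequence and reproduces the start of the protein sequence listed in this record.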

Protein sequence:

>477_residues
MKQGLQLRLSQQLAMTPQLQQAIRLLQLSTLELQQELQQALESNPLLEQIDTHEEIDTRETQDSETLDTADALEQKEMPE
ELPLDASWDTIYTAGTPSGTSGDYIDDELPVYQGETTQTLQDYLMWQVELTPFSDTDRAIATSIVDAVDDTGYLTVPLED
ILESMGDEEIDIDEVEAVLKRIQRFDPVGVAAKDLRDCLLIQLSQFDKTTPWLEEARLIISDHLDLLANHDFRTLMRVTR
LKEDVLKEAVNLIQSLDPRPGQSIQTGEPEYVIPDVLVRKHNGHWTVELNSDSIPRLQINQHYASMCNNARNDGDSQFIR
SNLQDAKWLIKSLESRNDTLLRVSRCIVEQQQAFFEQGEEYMKPMVLADIAQAVEMHESTISRVTTQKYLHSPRGIFELK
YFFSSHVNTEGGGEASSTAIRALVKKLIAAENPAKPLSDSKLTSLLSEQGIMVARRTVAKYRESLSIPPSNQRKQLV

Sequences:

>Translated_477_residues
MKQGLQLRLSQQLAMTPQLQQAIRLLQLSTLELQQELQQALESNPLLEQIDTHEEIDTRETQDSETLDTADALEQKEMPE
ELPLDASWDTIYTAGTPSGTSGDYIDDELPVYQGETTQTLQDYLMWQVELTPFSDTDRAIATSIVDAVDDTGYLTVPLED
ILESMGDEEIDIDEVEAVLKRIQRFDPVGVAAKDLRDCLLIQLSQFDKTTPWLEEARLIISDHLDLLANHDFRTLMRVTR
LKEDVLKEAVNLIQSLDPRPGQSIQTGEPEYVIPDVLVRKHNGHWTVELNSDSIPRLQINQHYASMCNNARNDGDSQFIR
SNLQDAKWLIKSLESRNDTLLRVSRCIVEQQQAFFEQGEEYMKPMVLADIAQAVEMHESTISRVTTQKYLHSPRGIFELK
YFFSSHVNTEGGGEASSTAIRALVKKLIAAENPAKPLSDSKLTSLLSEQGIMVARRTVAKYRESLSIPPSNQRKQLV
>Mature_477_residues
MKQGLQLRLSQQLAMTPQLQQAIRLLQLSTLELQQELQQALESNPLLEQIDTHEEIDTRETQDSETLDTADALEQKEMPE
ELPLDASWDTIYTAGTPSGTSGDYIDDELPVYQGETTQTLQDYLMWQVELTPFSDTDRAIATSIVDAVDDTGYLTVPLED
ILESMGDEEIDIDEVEAVLKRIQRFDPVGVAAKDLRDCLLIQLSQFDKTTPWLEEARLIISDHLDLLANHDFRTLMRVTR
LKEDVLKEAVNLIQSLDPRPGQSIQTGEPEYVIPDVLVRKHNGHWTVELNSDSIPRLQINQHYASMCNNARNDGDSQFIR
SNLQDAKWLIKSLESRNDTLLRVSRCIVEQQQAFFEQGEEYMKPMVLADIAQAVEMHESTISRVTTQKYLHSPRGIFELK
YFFSSHVNTEGGGEASSTAIRALVKKLIAAENPAKPLSDSKLTSLLSEQGIMVARRTVAKYRESLSIPPSNQRKQLV

Specific function: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor is responsible for the expression of enzymes involved in arginine catabolism. The open complex (sigma-

COG id: COG1508

COG function: function code K; DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the sigma-54 factor family [H]

Homologues:

Organism=Escherichia coli, GI1789594, Length=477, Percent_Identity=99.58071278826, Blast_Score=973, Evalue=0.0

Paralogues:

None

Copy number: 70 (log & stationary phase) [C]

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000394
- InterPro:   IPR007046
- InterPro:   IPR007634 [H]

Pfam domain/function: PF00309 Sigma54_AID; PF04963 Sigma54_CBD; PF04552 Sigma54_DBD [H]

EC number: NA

Molecular weight: Translated: 53994; Mature: 53994

Theoretical pI: Translated: 4.36; Mature: 4.36

Prosite motif: PS00717 SIGMA54_1; PS00718 SIGMA54_2; PS50044 SIGMA54_3

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
2.3 %Met     (Translated Protein)
2.9 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
2.3 %Met     (Mature Protein)
2.9 %Cys+Met (Mature Protein)
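
The percentages above are consistent with simple residue counts over the 477-residue protein: 3 cysteines and 11 methionines give roughly 0.6 % and 2.3 %. A minimal sketch of that calculation follows. The function name `residue_percent` is hypothetical, and the source does not state the rounding convention it uses.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(protein.count(r) for r in residues.upper())
    return 100.0 * hits / len(protein)


# Usage with a short fragment taken from the protein sequence in this record.
fragment = "MKQGLQLRLSQQLAMTPQLQ"
print(round(residue_percent(fragment, "M"), 1))    # 10.0
print(round(residue_percent(fragment, "CM"), 1))   # 10.0
```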

Secondary structure:

>Translated Secondary Structure
MKQGLQLRLSQQLAMTPQLQQAIRLLQLSTLELQQELQQALESNPLLEQIDTHEEIDTRE
CCCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHCCHHHHCCCC
TQDSETLDTADALEQKEMPEELPLDASWDTIYTAGTPSGTSGDYIDDELPVYQGETTQTL
CCCCHHHHHHHHHHHHCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCHHHHH
QDYLMWQVELTPFSDTDRAIATSIVDAVDDTGYLTVPLEDILESMGDEEIDIDEVEAVLK
HHHHEEEEEECCCCCCHHHHHHHHHHHHCCCCEEEECHHHHHHHCCCCCCCHHHHHHHHH
RIQRFDPVGVAAKDLRDCLLIQLSQFDKTTPWLEEARLIISDHLDLLANHDFRTLMRVTR
HHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHH
LKEDVLKEAVNLIQSLDPRPGQSIQTGEPEYVIPDVLVRKHNGHWTVELNSDSIPRLQIN
HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEECCHHHHEECCCEEEEEECCCCCCEEEHH
QHYASMCNNARNDGDSQFIRSNLQDAKWLIKSLESRNDTLLRVSRCIVEQQQAFFEQGEE
HHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHH
YMKPMVLADIAQAVEMHESTISRVTTQKYLHSPRGIFELKYFFSSHVNTEGGGEASSTAI
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCCCCHHHHHH
RALVKKLIAAENPAKPLSDSKLTSLLSEQGIMVARRTVAKYRESLSIPPSNQRKQLV
HHHHHHHHHCCCCCCCCCHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCC
>Mature Secondary Structure
MKQGLQLRLSQQLAMTPQLQQAIRLLQLSTLELQQELQQALESNPLLEQIDTHEEIDTRE
CCCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHCCHHHHCCCC
TQDSETLDTADALEQKEMPEELPLDASWDTIYTAGTPSGTSGDYIDDELPVYQGETTQTL
CCCCHHHHHHHHHHHHCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCHHHHH
QDYLMWQVELTPFSDTDRAIATSIVDAVDDTGYLTVPLEDILESMGDEEIDIDEVEAVLK
HHHHEEEEEECCCCCCHHHHHHHHHHHHCCCCEEEECHHHHHHHCCCCCCCHHHHHHHHH
RIQRFDPVGVAAKDLRDCLLIQLSQFDKTTPWLEEARLIISDHLDLLANHDFRTLMRVTR
HHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHH
LKEDVLKEAVNLIQSLDPRPGQSIQTGEPEYVIPDVLVRKHNGHWTVELNSDSIPRLQIN
HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEECCHHHHEECCCEEEEEECCCCCCEEEHH
QHYASMCNNARNDGDSQFIRSNLQDAKWLIKSLESRNDTLLRVSRCIVEQQQAFFEQGEE
HHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHH
YMKPMVLADIAQAVEMHESTISRVTTQKYLHSPRGIFELKYFFSSHVNTEGGGEASSTAI
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCCCCHHHHHH
RALVKKLIAAENPAKPLSDSKLTSLLSEQGIMVARRTVAKYRESLSIPPSNQRKQLV
HHHHHHHHHCCCCCCCCCHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 2203540; 8444818; 8025669; 7876255; 9278503 [H]