| Definition | Escherichia coli UTI89 chromosome, complete genome. |
|---|---|
| Accession | NC_007946 |
| Length | 5,065,741 |
The map label for this gene is rpoN [H]
Identifier: 91212627
GI number: 91212627
Start: 3564395
End: 3565828
Strand: Direct
Name: rpoN [H]
Synonym: UTI89_C3638
Alternate gene names: 91212627
Gene position: 3564395-3565828 (Clockwise)
Preceding gene: 91212626
Following gene: 91212628
Centisome position: 70.36
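The coordinate fields above can be cross-checked against each other. The sketch below assumes that positions are 1-based and inclusive, and that the centisome position is the gene start expressed as a percentage of the chromosome length. Neither convention is stated in the record, but both are consistent with its values. The function names are illustrative only.

```python
# Values copied from this record.
GENOME_LENGTH = 5_065_741      # Length (bp) of NC_007946
START, END = 3_564_395, 3_565_828

def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1

def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the chromosome (assumed definition)."""
    return 100.0 * start / genome_length

length_bp = gene_length(START, END)          # 1434 bases
codons = length_bp // 3                      # 478 codons, including the stop codon
position = round(centisome(START, GENOME_LENGTH), 2)

print(length_bp, codons - 1, position)
```

Under these assumptions the gene spans 1434 bases, which gives 478 codons: 477 coding codons plus one stop codon, in agreement with the 477-residue protein listed further down.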
GC content: 52.44
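GC content is conventionally the percentage of G and C bases in a sequence. A minimal sketch of that calculation follows; the helper name `gc_percent` is hypothetical, and the record does not say which tool produced its 52.44 figure. Whitespace is stripped because the sequences in this record are wrapped with spaces.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)

# First 21 bases of the gene sequence in this record (5 G + 4 C out of 21).
print(round(gc_percent("ATGAAGCAAG GTTTGCAACTC"), 2))
```

Passing the full 1434-base gene sequence to the same function should give a value comparable to the figure reported above.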
Gene sequence:
>1434_bases ATGAAGCAAGGTTTGCAACTCAGGCTTAGCCAACAACTGGCGATGACGCCACAGCTCCAACAGGCAATTCGTCTGTTGCA GTTGTCGACGCTGGAACTTCAGCAGGAGCTACAGCAGGCGCTGGAGAGCAATCCGCTGCTTGAGCAAATCGACACTCATG AAGAAATCGACACCCGCGAAACGCAAGACAGTGAAACACTGGACACCGCCGACGCGCTCGAACAAAAAGAGATGCCGGAA GAGCTGCCGCTCGATGCCAGTTGGGACACCATTTACACCGCTGGTACACCATCCGGCACCAGCGGTGACTACATTGACGA CGAGCTGCCGGTCTATCAGGGCGAAACGACACAGACCTTACAGGATTACCTGATGTGGCAGGTCGAGTTGACACCATTTT CCGACACTGACCGCGCGATTGCTACCTCTATCGTCGATGCCGTTGATGACACCGGTTATCTGACTGTCCCGCTGGAAGAT ATTCTCGAAAGTATGGGCGATGAAGAGATTGACATCGACGAAGTTGAAGCCGTCCTTAAGCGGATCCAACGGTTTGATCC GGTCGGTGTGGCGGCAAAAGATCTGCGTGACTGCCTGCTGATCCAACTCTCCCAATTCGATAAGACCACGCCATGGCTGG AAGAGGCCAGACTGATCATTAGCGATCATCTCGATCTATTAGCCAATCACGACTTCCGCACTTTAATGCGCGTCACGCGT CTGAAAGAAGATGTGCTGAAAGAAGCCGTCAATCTGATCCAGTCGCTCGATCCGCGCCCCGGGCAGTCGATCCAGACTGG CGAACCTGAGTATGTCATTCCAGATGTGCTGGTGCGTAAGCATAACGGTCACTGGACGGTAGAACTCAACAGTGACAGCA TTCCGCGTTTGCAAATCAACCAGCACTACGCCTCGATGTGCAATAACGCGCGCAACGATGGTGATAGCCAGTTTATCCGC AGCAATCTGCAGGATGCCAAATGGTTGATCAAGAGTCTGGAAAGCCGTAACGATACGCTACTGCGCGTGAGTCGCTGTAT CGTTGAACAGCAGCAAGCCTTCTTTGAGCAAGGCGAAGAATATATGAAACCGATGGTACTGGCTGATATCGCCCAGGCCG TCGAGATGCATGAATCGACGATATCTCGCGTGACCACGCAAAAATACCTGCATAGTCCACGAGGCATTTTTGAACTGAAG TATTTCTTTTCCAGTCACGTCAATACTGAAGGCGGGGGTGAAGCCTCTTCCACAGCGATTCGCGCACTGGTGAAGAAATT AATCGCGGCGGAAAACCCAGCGAAACCGTTGAGCGACAGCAAGTTAACCTCTTTGCTGTCGGAACAAGGTATCATGGTGG CACGCCGCACTGTTGCGAAGTACCGAGAGTCTTTATCCATTCCGCCGTCAAACCAGCGTAAACAGCTCGTTTGA
Upstream 100 bases:
>100_bases TACAAGACGAACACGTTAAGCGTGTATACCTTGGGGAAGACTTCAGACTCTGATAGGGTAGAAGTTTGCGACGTTTTAGC AGGAGAGTACGATTCTGAAC
Downstream 100 bases:
>100_bases CCCAACCGATAAGGAAGACACTATGCAGCTCAACATTACCGGAAATAACGTCGAGATCACCGAGGCACTGCGCGAATTTG TTACAGCCAAATTTGCCAAA
Product: RNA polymerase factor sigma-54
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 477; Mature: 477
Protein sequence:
>477_residues MKQGLQLRLSQQLAMTPQLQQAIRLLQLSTLELQQELQQALESNPLLEQIDTHEEIDTRETQDSETLDTADALEQKEMPE ELPLDASWDTIYTAGTPSGTSGDYIDDELPVYQGETTQTLQDYLMWQVELTPFSDTDRAIATSIVDAVDDTGYLTVPLED ILESMGDEEIDIDEVEAVLKRIQRFDPVGVAAKDLRDCLLIQLSQFDKTTPWLEEARLIISDHLDLLANHDFRTLMRVTR LKEDVLKEAVNLIQSLDPRPGQSIQTGEPEYVIPDVLVRKHNGHWTVELNSDSIPRLQINQHYASMCNNARNDGDSQFIR SNLQDAKWLIKSLESRNDTLLRVSRCIVEQQQAFFEQGEEYMKPMVLADIAQAVEMHESTISRVTTQKYLHSPRGIFELK YFFSSHVNTEGGGEASSTAIRALVKKLIAAENPAKPLSDSKLTSLLSEQGIMVARRTVAKYRESLSIPPSNQRKQLV
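The protein sequence is the translation of the gene sequence listed earlier. The sketch below assumes the standard genetic code and uses a hand-built codon table rather than any particular library; `translate` is an illustrative helper, not the pipeline that produced this record. It is demonstrated on the first 21 and last 12 bases of the gene.

```python
from itertools import product

# Standard genetic code, codons ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":          # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGAAGCAAGGTTTGCAACTC"))  # start of the gene
print(translate("CAGCTCGTTTGA"))           # end of the gene, including the TGA stop
```

The two fragments translate to `MKQGLQL` and `QLV`, which match the first and last residues of the 477-residue sequence above.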
Sequences:
>Translated_477_residues MKQGLQLRLSQQLAMTPQLQQAIRLLQLSTLELQQELQQALESNPLLEQIDTHEEIDTRETQDSETLDTADALEQKEMPE ELPLDASWDTIYTAGTPSGTSGDYIDDELPVYQGETTQTLQDYLMWQVELTPFSDTDRAIATSIVDAVDDTGYLTVPLED ILESMGDEEIDIDEVEAVLKRIQRFDPVGVAAKDLRDCLLIQLSQFDKTTPWLEEARLIISDHLDLLANHDFRTLMRVTR LKEDVLKEAVNLIQSLDPRPGQSIQTGEPEYVIPDVLVRKHNGHWTVELNSDSIPRLQINQHYASMCNNARNDGDSQFIR SNLQDAKWLIKSLESRNDTLLRVSRCIVEQQQAFFEQGEEYMKPMVLADIAQAVEMHESTISRVTTQKYLHSPRGIFELK YFFSSHVNTEGGGEASSTAIRALVKKLIAAENPAKPLSDSKLTSLLSEQGIMVARRTVAKYRESLSIPPSNQRKQLV >Mature_477_residues MKQGLQLRLSQQLAMTPQLQQAIRLLQLSTLELQQELQQALESNPLLEQIDTHEEIDTRETQDSETLDTADALEQKEMPE ELPLDASWDTIYTAGTPSGTSGDYIDDELPVYQGETTQTLQDYLMWQVELTPFSDTDRAIATSIVDAVDDTGYLTVPLED ILESMGDEEIDIDEVEAVLKRIQRFDPVGVAAKDLRDCLLIQLSQFDKTTPWLEEARLIISDHLDLLANHDFRTLMRVTR LKEDVLKEAVNLIQSLDPRPGQSIQTGEPEYVIPDVLVRKHNGHWTVELNSDSIPRLQINQHYASMCNNARNDGDSQFIR SNLQDAKWLIKSLESRNDTLLRVSRCIVEQQQAFFEQGEEYMKPMVLADIAQAVEMHESTISRVTTQKYLHSPRGIFELK YFFSSHVNTEGGGEASSTAIRALVKKLIAAENPAKPLSDSKLTSLLSEQGIMVARRTVAKYRESLSIPPSNQRKQLV
Specific function: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor is responsible for the expression of enzymes involved in arginine catabolism. The open complex (sigma-
COG id: COG1508
COG function: function code K; DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the sigma-54 factor family [H]
Homologues:
Organism=Escherichia coli, GI1789594, Length=477, Percent_Identity=99.58071278826, Blast_Score=973, Evalue=0.0,
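Each homologue entry is a comma-separated line of `key=value` fields plus a bare GI identifier. A small sketch of how such a line could be parsed is shown below; the function name and the choice to store the identifier under a `"GI"` key are assumptions made for illustration.

```python
def parse_homologue(line: str) -> dict:
    """Split one homologue line into a dictionary of string fields."""
    fields = {}
    for token in line.strip().rstrip(",").split(", "):
        if "=" in token:
            key, value = token.split("=", 1)
            fields[key] = value
        elif token.startswith("GI"):
            fields["GI"] = token[2:]   # bare identifier such as GI1789594
    return fields

hit = parse_homologue(
    "Organism=Escherichia coli, GI1789594, Length=477, "
    "Percent_Identity=99.58071278826, Blast_Score=973, Evalue=0.0,"
)
# Identical positions implied by the percent identity over the aligned length.
identical = round(float(hit["Percent_Identity"]) * int(hit["Length"]) / 100)
print(hit["Organism"], hit["GI"], identical)
```

For the entry above, the reported identity corresponds to 475 identical positions out of 477, assuming the identity is computed over the full length.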
Paralogues:
None
Copy number: 70 (log & stationary phase) [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000394
- InterPro: IPR007046
- InterPro: IPR007634 [H]
Pfam domain/function: PF00309 Sigma54_AID; PF04963 Sigma54_CBD; PF04552 Sigma54_DBD [H]
EC number: NA
Molecular weight: Translated: 53994; Mature: 53994
Theoretical pI: Translated: 4.36; Mature: 4.36
Prosite motif: PS00717 SIGMA54_1 ; PS00718 SIGMA54_2 ; PS50044 SIGMA54_3
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.6 %Cys (Translated Protein)
2.3 %Met (Translated Protein)
2.9 %Cys+Met (Translated Protein)
0.6 %Cys (Mature Protein)
2.3 %Met (Mature Protein)
2.9 %Cys+Met (Mature Protein)
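The Cys/Met percentages are residue counts divided by the protein length. The sketch below shows the arithmetic; `residue_percent` is a hypothetical helper, and the counts of 3 Cys and 11 Met were obtained by counting the 477-residue sequence in this record by hand, so treat them as an assumption rather than a value taken from the record.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    hits = sum(seq.count(r) for r in residues.upper())
    return 100.0 * hits / len(seq)

# Check against the record, using hand-counted residues (assumption).
LENGTH, N_CYS, N_MET = 477, 3, 11
print(round(100 * N_CYS / LENGTH, 1))            # Cys
print(round(100 * N_MET / LENGTH, 1))            # Met
print(round(100 * (N_CYS + N_MET) / LENGTH, 1))  # Cys+Met
```

With those counts the three values round to 0.6, 2.3 and 2.9, matching the figures listed above. Calling `residue_percent(sequence, "CM")` on the full protein sequence would perform the count directly.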
Secondary structure:
>Translated Secondary Structure MKQGLQLRLSQQLAMTPQLQQAIRLLQLSTLELQQELQQALESNPLLEQIDTHEEIDTRE CCCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHCCHHHHCCCC TQDSETLDTADALEQKEMPEELPLDASWDTIYTAGTPSGTSGDYIDDELPVYQGETTQTL CCCCHHHHHHHHHHHHCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCHHHHH QDYLMWQVELTPFSDTDRAIATSIVDAVDDTGYLTVPLEDILESMGDEEIDIDEVEAVLK HHHHEEEEEECCCCCCHHHHHHHHHHHHCCCCEEEECHHHHHHHCCCCCCCHHHHHHHHH RIQRFDPVGVAAKDLRDCLLIQLSQFDKTTPWLEEARLIISDHLDLLANHDFRTLMRVTR HHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHH LKEDVLKEAVNLIQSLDPRPGQSIQTGEPEYVIPDVLVRKHNGHWTVELNSDSIPRLQIN HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEECCHHHHEECCCEEEEEECCCCCCEEEHH QHYASMCNNARNDGDSQFIRSNLQDAKWLIKSLESRNDTLLRVSRCIVEQQQAFFEQGEE HHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHH YMKPMVLADIAQAVEMHESTISRVTTQKYLHSPRGIFELKYFFSSHVNTEGGGEASSTAI HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCCCCHHHHHH RALVKKLIAAENPAKPLSDSKLTSLLSEQGIMVARRTVAKYRESLSIPPSNQRKQLV HHHHHHHHHCCCCCCCCCHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCC >Mature Secondary Structure MKQGLQLRLSQQLAMTPQLQQAIRLLQLSTLELQQELQQALESNPLLEQIDTHEEIDTRE CCCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHCCHHHHCCCC TQDSETLDTADALEQKEMPEELPLDASWDTIYTAGTPSGTSGDYIDDELPVYQGETTQTL CCCCHHHHHHHHHHHHCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCHHHHH QDYLMWQVELTPFSDTDRAIATSIVDAVDDTGYLTVPLEDILESMGDEEIDIDEVEAVLK HHHHEEEEEECCCCCCHHHHHHHHHHHHCCCCEEEECHHHHHHHCCCCCCCHHHHHHHHH RIQRFDPVGVAAKDLRDCLLIQLSQFDKTTPWLEEARLIISDHLDLLANHDFRTLMRVTR HHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHH LKEDVLKEAVNLIQSLDPRPGQSIQTGEPEYVIPDVLVRKHNGHWTVELNSDSIPRLQIN HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEECCHHHHEECCCEEEEEECCCCCCEEEHH QHYASMCNNARNDGDSQFIRSNLQDAKWLIKSLESRNDTLLRVSRCIVEQQQAFFEQGEE HHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHH YMKPMVLADIAQAVEMHESTISRVTTQKYLHSPRGIFELKYFFSSHVNTEGGGEASSTAI HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCCCCHHHHHH RALVKKLIAAENPAKPLSDSKLTSLLSEQGIMVARRTVAKYRESLSIPPSNQRKQLV HHHHHHHHHCCCCCCCCCHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 2203540; 8444818; 8025669; 7876255; 9278503 [H]