| Definition | Escherichia coli O157:H7 str. EC4115, complete genome. |
|---|---|
| Accession | NC_011353 |
| Length | 5,572,075 |
Click here to switch to the map view.
The map label for this gene is rpoS [H]
Identifier: 209395891
GI number: 209395891
Start: 3691527
End: 3692519
Strand: Reverse
Name: rpoS [H]
Synonym: ECH74115_3992
Alternate gene names: 209395891
Gene position: 3692519-3691527 (Counterclockwise)
Preceding gene: 209400494
Following gene: 209397606
Centisome position: 66.27
GC content: 51.86
Gene sequence:
>993_bases ATGAGTCAGAATACGCTGAAAGTTCATGATTTAAATGAAGATGCGGAATTTGATGAGAACGGAGTTGAGGTTTTTGACGA AAAGGCCTTAGTAGAAGAGGAACCCAGTGATAACGATTTGGCCGAAGAGGAACTGTTATCGCAGGGAGCCACACAGCGTG TGTTGGACGCGACTCAGCTTTACCTTGGTGAGATTGGTTATTCACCACTGTTAACGGCCGAAGAAGAAGTTTATTTTGCG CGTCGCGCACTGCGTGGAGATGTCGCCTCTCGCCGCCGGATGATCGAGAGTAACTTGCGTCTGGTGGTAAAAATTGCCCG CCGTTATGGCAATCGTGGTCTGGCGTTGCTGGACCTTATCGAAGAGGGCAACCTGGGGCTGATCCGTGCGGTAGAGAAGT TTGACCCGGAACGTGGTTTCCGCTTCTCAACATACGCAACCTGGTGGATTCGCCAGACGATTGAACGGGCGATTATGAAC CAAACCCGTACTATTCGTTTGCCGATTCACATCGTAAAGGAGCTGAACGTTTACCTGCGAACCGCACGTGAGTTGTCCCA TAAGCTGGACCATGAACCAAGTGCGGAAGAGATCGCAGAGCAACTGGATAAGCCAGTTGATGACGTCAGCCGTATGCTTC GTCTTAACGAGCGCATTACCTCGGTAGACACCCCGCTGGGTGGTGATTCCGAAAAAGCGTTGCTGGACATCCTGGCCGAT GAAAAAGAGAACGGTCCGGAAGATACCACGCAAGATGACGATATGAAGCAGAGCATCGTCAAATGGCTGTTCGAGCTGAA CGCCAAACAGCGTGAAGTACTGGCACGTCGATTCGGTTTGCTGGGGTACGAAGCGGCAACACTGGAAGATGTAGGTCGTG AAATTGGCCTCACCCGTGAACGTGTTCGCCAGATTCAGGTTGAAGGCCTGCGCCGTTTGCGCGAAATCCTGCAAACGCAG GGGCTGAATATCGAAGCGCTGTTCCGCGAGTAA
Upstream 100 bases:
>100_bases AATCCGTAAACCCGCTGCGTTATTTGCCGCAGCGATAAATCGGCGGAACCAGGCTTTTGCTTGAATGTTCCGTCAAGGGA TCACGGGTAGGAGCCACCTT
Downstream 100 bases:
>100_bases GTAAGCATCTGTCAGAAAGGCCAGTCTCAAGCGAGGCTGGCCTTTTTCTTTTGTTTGGGAAGCTTGTGTTTTTGTCGCCT GGATAAGACACGTCAGCGTC
Product: RNA polymerase sigma factor RpoS
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 330; Mature: 329
Protein sequence:
>330_residues MSQNTLKVHDLNEDAEFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGATQRVLDATQLYLGEIGYSPLLTAEEEVYFA RRALRGDVASRRRMIESNLRLVVKIARRYGNRGLALLDLIEEGNLGLIRAVEKFDPERGFRFSTYATWWIRQTIERAIMN QTRTIRLPIHIVKELNVYLRTARELSHKLDHEPSAEEIAEQLDKPVDDVSRMLRLNERITSVDTPLGGDSEKALLDILAD EKENGPEDTTQDDDMKQSIVKWLFELNAKQREVLARRFGLLGYEAATLEDVGREIGLTRERVRQIQVEGLRRLREILQTQ GLNIEALFRE
Sequences:
>Translated_330_residues MSQNTLKVHDLNEDAEFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGATQRVLDATQLYLGEIGYSPLLTAEEEVYFA RRALRGDVASRRRMIESNLRLVVKIARRYGNRGLALLDLIEEGNLGLIRAVEKFDPERGFRFSTYATWWIRQTIERAIMN QTRTIRLPIHIVKELNVYLRTARELSHKLDHEPSAEEIAEQLDKPVDDVSRMLRLNERITSVDTPLGGDSEKALLDILAD EKENGPEDTTQDDDMKQSIVKWLFELNAKQREVLARRFGLLGYEAATLEDVGREIGLTRERVRQIQVEGLRRLREILQTQ GLNIEALFRE >Mature_329_residues SQNTLKVHDLNEDAEFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGATQRVLDATQLYLGEIGYSPLLTAEEEVYFAR RALRGDVASRRRMIESNLRLVVKIARRYGNRGLALLDLIEEGNLGLIRAVEKFDPERGFRFSTYATWWIRQTIERAIMNQ TRTIRLPIHIVKELNVYLRTARELSHKLDHEPSAEEIAEQLDKPVDDVSRMLRLNERITSVDTPLGGDSEKALLDILADE KENGPEDTTQDDDMKQSIVKWLFELNAKQREVLARRFGLLGYEAATLEDVGREIGLTRERVRQIQVEGLRRLREILQTQG LNIEALFRE
Specific function: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor is required for the normal expression of the enterotoxin, yst, in the stationary phase [H]
COG id: COG0568
COG function: function code K; DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the sigma-70 factor family [H]
Homologues:
Organism=Escherichia coli, GI1789098, Length=330, Percent_Identity=99.6969696969697, Blast_Score=657, Evalue=0.0, Organism=Escherichia coli, GI1789448, Length=247, Percent_Identity=42.1052631578947, Blast_Score=209, Evalue=3e-55, Organism=Escherichia coli, GI1789871, Length=268, Percent_Identity=26.865671641791, Blast_Score=92, Evalue=6e-20,
Paralogues:
None
Copy number: <10 (log phase) 250 (stationary phase) [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR014284 - InterPro: IPR000943 - InterPro: IPR009042 - InterPro: IPR007627 - InterPro: IPR007624 - InterPro: IPR007630 - InterPro: IPR013325 - InterPro: IPR013324 - InterPro: IPR012761 - InterPro: IPR011991 [H]
Pfam domain/function: PF00140 Sigma70_r1_2; PF04542 Sigma70_r2; PF04539 Sigma70_r3; PF04545 Sigma70_r4 [H]
EC number: NA
Molecular weight: Translated: 37973; Mature: 37842
Theoretical pI: Translated: 4.56; Mature: 4.56
Prosite motif: PS00715 SIGMA70_1 ; PS00716 SIGMA70_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.0 %Cys (Translated Protein) 1.5 %Met (Translated Protein) 1.5 %Cys+Met (Translated Protein) 0.0 %Cys (Mature Protein) 1.2 %Met (Mature Protein) 1.2 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MSQNTLKVHDLNEDAEFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGATQRVLDATQL CCCCCEEEEECCCCCCCCCCCHHHHHHHHHHCCCCCCCCHHHHHHHHCCHHHHHHHHHHH YLGEIGYSPLLTAEEEVYFARRALRGDVASRRRMIESNLRLVVKIARRYGNRGLALLDLI HHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEHHHHH EEGNLGLIRAVEKFDPERGFRFSTYATWWIRQTIERAIMNQTRTIRLPIHIVKELNVYLR HCCCCHHHHHHHHCCCCCCCEEHHHHHHHHHHHHHHHHHCCCCEEEHHHHHHHHHHHHHH TARELSHKLDHEPSAEEIAEQLDKPVDDVSRMLRLNERITSVDTPLGGDSEKALLDILAD HHHHHHHHCCCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHC EKENGPEDTTQDDDMKQSIVKWLFELNAKQREVLARRFGLLGYEAATLEDVGREIGLTRE CCCCCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCHH RVRQIQVEGLRRLREILQTQGLNIEALFRE HHHHHHHHHHHHHHHHHHHCCCCEEEECCC >Mature Secondary Structure SQNTLKVHDLNEDAEFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGATQRVLDATQL CCCCEEEEECCCCCCCCCCCHHHHHHHHHHCCCCCCCCHHHHHHHHCCHHHHHHHHHHH YLGEIGYSPLLTAEEEVYFARRALRGDVASRRRMIESNLRLVVKIARRYGNRGLALLDLI HHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEHHHHH EEGNLGLIRAVEKFDPERGFRFSTYATWWIRQTIERAIMNQTRTIRLPIHIVKELNVYLR HCCCCHHHHHHHHCCCCCCCEEHHHHHHHHHHHHHHHHHCCCCEEEHHHHHHHHHHHHHH TARELSHKLDHEPSAEEIAEQLDKPVDDVSRMLRLNERITSVDTPLGGDSEKALLDILAD HHHHHHHHCCCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHC EKENGPEDTTQDDDMKQSIVKWLFELNAKQREVLARRFGLLGYEAATLEDVGREIGLTRE CCCCCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCHH RVRQIQVEGLRRLREILQTQGLNIEALFRE HHHHHHHHHHHHHHHHHHHCCCCEEEECCC
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 7729893 [H]