| Definition | Escherichia coli HS, complete genome. |
|---|---|
| Accession | NC_009800 |
| Length | 4,643,538 |
The map label for this gene is rpoH [H]
Identifier: 157162938
GI number: 157162938
Start: 3646196
End: 3647050
Strand: Reverse
Name: rpoH [H]
Synonym: EcHS_A3660
Alternate gene names: 157162938
Gene position: 3647050-3646196 (Counterclockwise)
Preceding gene: 157162939
Following gene: 157162937
Centisome position: 78.54
GC content: 54.27
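The positional fields above are related by simple arithmetic, which the following minimal sketch reproduces. The function names are hypothetical. The record does not say which coordinate the centisome position is measured from; the sketch assumes it is the first base of the gene on the reverse strand (the End coordinate), because that choice reproduces the listed value.

```python
# Values copied from the record above.
GENOME_LENGTH = 4_643_538            # Length of NC_009800
START, END = 3_646_196, 3_647_050    # Start / End of rpoH


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given inclusive 1-based coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the genome length."""
    return round(100 * position / genome_length, 2)


length = gene_length(START, END)     # number of bases in the gene
codons = length // 3                 # includes the stop codon
residues = codons - 1                # translated protein length

# Assumption: for a reverse-strand gene the reference point is END.
position_pct = centisome(END, GENOME_LENGTH)

print(length, residues, position_pct)
```

Under these assumptions the script reproduces the 855 bases, the 284 translated residues, and the centisome position of 78.54 listed in this record.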
Gene sequence:
>855_bases
ATGACTGACAAAATGCAAAGTTTAGCTTTAGCCCCAGTTGGCAACCTGGATTCCTACATCCGGGCAGCTAACGCGTGGCC
GATGTTGTCGGCTGACGAGGAGCGGGCGCTGGCTGAAAAGCTGCATTACCATGGCGATCTGGAAGCAGCTAAAACGCTGA
TCCTGTCTCACCTGCGGTTTGTTGTTCATATTGCTCGTAATTATGCGGGCTATGGCCTGCCACAGGCGGATTTGATTCAG
GAAGGTAACATCGGCCTGATGAAAGCAGTGCGCCGTTTTAACCCGGAAGTGGGTGTGCGCCTGGTCTCCTTCGCCGTTCA
CTGGATCAAAGCAGAGATCCACGAATACGTCCTGCGTAACTGGCGTATCGTCAAAGTTGCGACCACCAAAGCGCAGCGCA
AACTGTTCTTCAACCTGCGTAAAACCAAGCAGCGTCTGGGCTGGTTTAACCAGGATGAAGTCGAAATGGTGGCCCGTGAA
CTGGGCGTAACCAGCAAAGACGTACGTGAGATGGAATCACGTATGGCGGCACAGGACATGACCTTTGACCTGTCTTCCGA
CGACGATTCCGACAGCCAACCGATGGCACCGGTGCTCTATCTGCAGGATAAATCATCTAACTTTGCCGACGGCATCGAAG
ATGATAACTGGGAAGAGCAGGCGGCAAACCGTCTGACCGACGCGATGCAGGGTCTGGACGAACGCAGCCAGGACATCATC
CGCGCGCGCTGGCTGGACGAAGACAACAAGTCCACGTTGCAGGAACTGGCTGACCGTTACGGCGTTTCCGCTGAACGTGT
ACGCCAGCTGGAAAAGAACGCGATGAAAAAACTGCGCGCTGCTATAGAAGCGTAA
Upstream 100 bases:
>100_bases
GGTCTGATAAAACAGTGAATGATAACCTCGTTGCTCTTAAGCTCTGGCACAGTTGTTGCTACCACTGAAGCGCCAGAAGA
TATCGATTGAGAGGATTTGA
Downstream 100 bases:
>100_bases
TTTCCGCTATTAAGCAGAGAACCCTGGATGCGAGTCCGGGGTTTTTGTTTTTTGAGCCTCTACAATAATCAATTTCCCCT
CCGGCAAAACGCCAATCCCC
Product: RNA polymerase factor sigma-32
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 284; Mature: 283
Protein sequence:
>284_residues
MTDKMQSLALAPVGNLDSYIRAANAWPMLSADEERALAEKLHYHGDLEAAKTLILSHLRFVVHIARNYAGYGLPQADLIQ
EGNIGLMKAVRRFNPEVGVRLVSFAVHWIKAEIHEYVLRNWRIVKVATTKAQRKLFFNLRKTKQRLGWFNQDEVEMVARE
LGVTSKDVREMESRMAAQDMTFDLSSDDDSDSQPMAPVLYLQDKSSNFADGIEDDNWEEQAANRLTDAMQGLDERSQDII
RARWLDEDNKSTLQELADRYGVSAERVRQLEKNAMKKLRAAIEA
Sequences:
>Translated_284_residues
MTDKMQSLALAPVGNLDSYIRAANAWPMLSADEERALAEKLHYHGDLEAAKTLILSHLRFVVHIARNYAGYGLPQADLIQ
EGNIGLMKAVRRFNPEVGVRLVSFAVHWIKAEIHEYVLRNWRIVKVATTKAQRKLFFNLRKTKQRLGWFNQDEVEMVARE
LGVTSKDVREMESRMAAQDMTFDLSSDDDSDSQPMAPVLYLQDKSSNFADGIEDDNWEEQAANRLTDAMQGLDERSQDII
RARWLDEDNKSTLQELADRYGVSAERVRQLEKNAMKKLRAAIEA
>Mature_283_residues
TDKMQSLALAPVGNLDSYIRAANAWPMLSADEERALAEKLHYHGDLEAAKTLILSHLRFVVHIARNYAGYGLPQADLIQE
GNIGLMKAVRRFNPEVGVRLVSFAVHWIKAEIHEYVLRNWRIVKVATTKAQRKLFFNLRKTKQRLGWFNQDEVEMVAREL
GVTSKDVREMESRMAAQDMTFDLSSDDDSDSQPMAPVLYLQDKSSNFADGIEDDNWEEQAANRLTDAMQGLDERSQDIIR
ARWLDEDNKSTLQELADRYGVSAERVRQLEKNAMKKLRAAIEA
Specific function: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor is responsible for the expression of genes transcribed from heat shock promoters [H]
COG id: COG0568
COG function: function code K; DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the sigma-70 factor family. Sigma-32 subfamily [H]
Homologues:
Organism=Escherichia coli, GI1789871, Length=284, Percent_Identity=100, Blast_Score=582, Evalue=1e-168
Organism=Escherichia coli, GI1789098, Length=268, Percent_Identity=27.6119402985075, Blast_Score=97, Evalue=2e-21
Organism=Escherichia coli, GI1789448, Length=242, Percent_Identity=28.9256198347107, Blast_Score=93, Evalue=2e-20
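The homologue entries follow a regular `key=value` pattern, so they can be turned into structured records for filtering or sorting. The sketch below is one possible way to do that; `parse_homologues` is a hypothetical helper, and it assumes that organism names contain no commas and that every entry begins with `Organism=`.

```python
# Homologue entries copied from the record above.
HOMOLOGUES = (
    "Organism=Escherichia coli, GI1789871, Length=284, Percent_Identity=100, "
    "Blast_Score=582, Evalue=1e-168, "
    "Organism=Escherichia coli, GI1789098, Length=268, "
    "Percent_Identity=27.6119402985075, Blast_Score=97, Evalue=2e-21, "
    "Organism=Escherichia coli, GI1789448, Length=242, "
    "Percent_Identity=28.9256198347107, Blast_Score=93, Evalue=2e-20,"
)


def parse_homologues(text: str) -> list[dict[str, str]]:
    """Split the Homologues field into one dict of strings per hit."""
    hits = []
    # Every hit starts with "Organism="; the piece before the first one is empty.
    for chunk in text.split("Organism=")[1:]:
        parts = [p.strip() for p in chunk.split(",") if p.strip()]
        hit = {"Organism": parts[0]}
        for part in parts[1:]:
            if "=" in part:
                key, value = part.split("=", 1)
                hit[key] = value
            elif part.startswith("GI"):
                hit["GI"] = part[2:]   # bare GI number without the prefix
        hits.append(hit)
    return hits


hits = parse_homologues(HOMOLOGUES)
# Example use: keep only hits below 50 percent identity.
distant = [h["GI"] for h in hits if float(h["Percent_Identity"]) < 50]
print(distant)
```

Values are kept as strings and converted only where needed, so the long decimal identities are not altered by parsing.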
Paralogues:
None
Copy number: <10 [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR014284
- InterPro: IPR000943
- InterPro: IPR007627
- InterPro: IPR007630
- InterPro: IPR013325
- InterPro: IPR013324
- InterPro: IPR012759
- InterPro: IPR011991 [H]
Pfam domain/function: PF04542 Sigma70_r2; PF04545 Sigma70_r4 [H]
EC number: NA
Molecular weight: Translated: 32469; Mature: 32338
Theoretical pI: Translated: 5.72; Mature: 5.72
Prosite motif: PS00715 SIGMA70_1 ; PS00716 SIGMA70_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.0 %Cys (Translated Protein)
3.9 %Met (Translated Protein)
3.9 %Cys+Met (Translated Protein)
0.0 %Cys (Mature Protein)
3.5 %Met (Mature Protein)
3.5 %Cys+Met (Mature Protein)
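The Cys/Met percentages can be recomputed from the protein sequence given in this record. The sketch below is illustrative only: `mature_form` and `percent` are hypothetical helpers, and the mature form is assumed to be the translated sequence minus its N-terminal methionine, which is the only difference between the two sequences listed above.

```python
# Translated protein sequence copied from the record above.
TRANSLATED = (
    "MTDKMQSLALAPVGNLDSYIRAANAWPMLSADEERALAEKLHYHGDLEAAKTLILSHLRFVVHIARNYAGYGLPQADLIQ"
    "EGNIGLMKAVRRFNPEVGVRLVSFAVHWIKAEIHEYVLRNWRIVKVATTKAQRKLFFNLRKTKQRLGWFNQDEVEMVARE"
    "LGVTSKDVREMESRMAAQDMTFDLSSDDDSDSQPMAPVLYLQDKSSNFADGIEDDNWEEQAANRLTDAMQGLDERSQDII"
    "RARWLDEDNKSTLQELADRYGVSAERVRQLEKNAMKKLRAAIEA"
)


def mature_form(protein: str) -> str:
    """Assumed processing: drop the initiator methionine if present."""
    return protein[1:] if protein.startswith("M") else protein


def percent(protein: str, residues: str) -> float:
    """Share of the given one-letter residues in the protein, in percent."""
    count = sum(protein.count(r) for r in residues)
    return round(100 * count / len(protein), 1)


MATURE = mature_form(TRANSLATED)

for label, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(label, len(seq), percent(seq, "C"), percent(seq, "M"), percent(seq, "CM"))
```

With the sequence as listed, the script gives lengths of 284 and 283 and the same Cys, Met, and Cys+Met percentages as this record.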
Secondary structure:
>Translated Secondary Structure
MTDKMQSLALAPVGNLDSYIRAANAWPMLSADEERALAEKLHYHGDLEAAKTLILSHLRF
CCCHHHHHHCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHH
VVHIARNYAGYGLPQADLIQEGNIGLMKAVRRFNPEVGVRLVSFAVHWIKAEIHEYVLRN
HHHHHHHHCCCCCCHHHHHCCCCHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHC
WRIVKVATTKAQRKLFFNLRKTKQRLGWFNQDEVEMVARELGVTSKDVREMESRMAAQDM
CEEEEEEHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHC
TFDLSSDDDSDSQPMAPVLYLQDKSSNFADGIEDDNWEEQAANRLTDAMQGLDERSQDII
EEECCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH
RARWLDEDNKSTLQELADRYGVSAERVRQLEKNAMKKLRAAIEA
HHHHCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCC
>Mature Secondary Structure
TDKMQSLALAPVGNLDSYIRAANAWPMLSADEERALAEKLHYHGDLEAAKTLILSHLRF
CCHHHHHHCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHH
VVHIARNYAGYGLPQADLIQEGNIGLMKAVRRFNPEVGVRLVSFAVHWIKAEIHEYVLRN
HHHHHHHHCCCCCCHHHHHCCCCHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHC
WRIVKVATTKAQRKLFFNLRKTKQRLGWFNQDEVEMVARELGVTSKDVREMESRMAAQDM
CEEEEEEHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHC
TFDLSSDDDSDSQPMAPVLYLQDKSSNFADGIEDDNWEEQAANRLTDAMQGLDERSQDII
EEECCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH
RARWLDEDNKSTLQELADRYGVSAERVRQLEKNAMKKLRAAIEA
HHHHCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 7501460 [H]