The gene/protein map for NC_011748 is currently unavailable.
Definition Escherichia coli 55989, complete genome.
Accession NC_011748
Length 5,154,862

Click here to switch to the map view.

The map label for this gene is rpoH

Identifier: 218697144

GI number: 218697144

Start: 3937578

End: 3938432

Strand: Reverse

Name: rpoH

Synonym: EC55989_3869

Alternate gene names: 218697144

Gene position: 3938432-3937578 (Counterclockwise)

Preceding gene: 218697145

Following gene: 218697143

Centisome position: 76.4

GC content: 54.04

Gene sequence:

>855_bases
ATGACTGACAAAATGCAAAGTTTAGCTTTAGCCCCAGTTGGCAACCTGGATTCCTACATCCGGGCAGCTAACGCGTGGCC
GATGTTGTCGGCTGACGAGGAGCGGGCGCTGGCTGAAAAGCTGCATTACCATGGCGATCTGGAAGCAGCTAAAACGCTGA
TCCTGTCTCACCTGCGGTTTGTTGTTCATATTGCTCGTAATTATGCGGGCTATGGCCTGCCACAGGCGGATTTGATTCAG
GAAGGTAACATCGGCCTGATGAAAGCAGTGCGCCGTTTCAACCCGGAAGTGGGTGTGCGCCTGGTCTCCTTCGCCGTTCA
CTGGATCAAAGCAGAGATCCACGAATACGTTCTGCGTAACTGGCGTATCGTCAAAGTTGCGACCACCAAAGCGCAGCGCA
AACTGTTCTTCAACCTGCGTAAAACTAAGCAGCGTCTGGGCTGGTTTAACCAGGATGAAGTCGAAATGGTGGCCCGTGAA
CTGGGCGTAACCAGCAAAGACGTTCGTGAGATGGAATCACGTATGGCGGCACAGGACATGACCTTTGACCTGTCTTCCGA
CGACGATTCCGACAGCCAGCCGATGGCACCGGTGCTCTATCTGCAGGATAAATCATCTAACTTTGCCGACGGCATCGAAG
ATGATAACTGGGAAGAGCAGGCGGCAAACCGTCTGACCGACGCGATGCAAGGTCTGGACGAACGCAGCCAGGACATCATC
CGCGCGCGCTGGCTGGACGAAGACAACAAGTCCACGTTGCAGGAACTGGCTGACCGTTACGGTGTTTCCGCTGAACGTGT
GCGCCAACTGGAAAAGAACGCGATGAAAAAACTTCGCGCCGCTATTGAAGCGTAA

Upstream 100 bases:

>100_bases
GGTCTGATAAAACAGTGAATGATAACCTCGTTGCTCTTAAGCTCTGGCACAGTTGTTGCTACCACTGAAGCGCCAGAAGA
TATCGATTGAGAGGATTTGA

Downstream 100 bases:

>100_bases
TTTCCGCTATTAAGCAGAGAACCCTGGATGAGAGTCCGGGGTTTTTGTTTTTTGGGCCTCTACAATAATCAATTCCCCCT
CCGGCAAAACGTCAATCCCC

Product: RNA polymerase factor sigma-32

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 284; Mature: 283

Protein sequence:

>284_residues
MTDKMQSLALAPVGNLDSYIRAANAWPMLSADEERALAEKLHYHGDLEAAKTLILSHLRFVVHIARNYAGYGLPQADLIQ
EGNIGLMKAVRRFNPEVGVRLVSFAVHWIKAEIHEYVLRNWRIVKVATTKAQRKLFFNLRKTKQRLGWFNQDEVEMVARE
LGVTSKDVREMESRMAAQDMTFDLSSDDDSDSQPMAPVLYLQDKSSNFADGIEDDNWEEQAANRLTDAMQGLDERSQDII
RARWLDEDNKSTLQELADRYGVSAERVRQLEKNAMKKLRAAIEA

Sequences:

>Translated_284_residues
MTDKMQSLALAPVGNLDSYIRAANAWPMLSADEERALAEKLHYHGDLEAAKTLILSHLRFVVHIARNYAGYGLPQADLIQ
EGNIGLMKAVRRFNPEVGVRLVSFAVHWIKAEIHEYVLRNWRIVKVATTKAQRKLFFNLRKTKQRLGWFNQDEVEMVARE
LGVTSKDVREMESRMAAQDMTFDLSSDDDSDSQPMAPVLYLQDKSSNFADGIEDDNWEEQAANRLTDAMQGLDERSQDII
RARWLDEDNKSTLQELADRYGVSAERVRQLEKNAMKKLRAAIEA
>Mature_283_residues
TDKMQSLALAPVGNLDSYIRAANAWPMLSADEERALAEKLHYHGDLEAAKTLILSHLRFVVHIARNYAGYGLPQADLIQE
GNIGLMKAVRRFNPEVGVRLVSFAVHWIKAEIHEYVLRNWRIVKVATTKAQRKLFFNLRKTKQRLGWFNQDEVEMVAREL
GVTSKDVREMESRMAAQDMTFDLSSDDDSDSQPMAPVLYLQDKSSNFADGIEDDNWEEQAANRLTDAMQGLDERSQDIIR
ARWLDEDNKSTLQELADRYGVSAERVRQLEKNAMKKLRAAIEA

Specific function: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor is responsible for the expression of heat shock promoters [H]

COG id: COG0568

COG function: function code K; DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)

Gene ontology:

Cell location: Cytoplasm [C]

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the sigma-70 factor family. Sigma-32 subfamily [H]

Homologues:

Organism=Escherichia coli, GI1789871, Length=284, Percent_Identity=100, Blast_Score=582, Evalue=1e-168,
Organism=Escherichia coli, GI1789098, Length=268, Percent_Identity=27.6119402985075, Blast_Score=97, Evalue=2e-21,
Organism=Escherichia coli, GI1789448, Length=242, Percent_Identity=28.9256198347107, Blast_Score=93, Evalue=2e-20,

Paralogues:

None

Copy number: <10 [C]

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR014284
- InterPro:   IPR000943
- InterPro:   IPR007627
- InterPro:   IPR007630
- InterPro:   IPR013325
- InterPro:   IPR013324
- InterPro:   IPR012759
- InterPro:   IPR011991 [H]

Pfam domain/function: PF04542 Sigma70_r2; PF04545 Sigma70_r4 [H]

EC number: NA

Molecular weight: Translated: 32469; Mature: 32338

Theoretical pI: Translated: 5.72; Mature: 5.72

Prosite motif: PS00715 SIGMA70_1 ; PS00716 SIGMA70_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.0 %Cys     (Translated Protein)
3.9 %Met     (Translated Protein)
3.9 %Cys+Met (Translated Protein)
0.0 %Cys     (Mature Protein)
3.5 %Met     (Mature Protein)
3.5 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MTDKMQSLALAPVGNLDSYIRAANAWPMLSADEERALAEKLHYHGDLEAAKTLILSHLRF
CCCHHHHHHCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHH
VVHIARNYAGYGLPQADLIQEGNIGLMKAVRRFNPEVGVRLVSFAVHWIKAEIHEYVLRN
HHHHHHHHCCCCCCHHHHHCCCCHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHC
WRIVKVATTKAQRKLFFNLRKTKQRLGWFNQDEVEMVARELGVTSKDVREMESRMAAQDM
CEEEEEEHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHC
TFDLSSDDDSDSQPMAPVLYLQDKSSNFADGIEDDNWEEQAANRLTDAMQGLDERSQDII
EEECCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH
RARWLDEDNKSTLQELADRYGVSAERVRQLEKNAMKKLRAAIEA
HHHHCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCC
>Mature Secondary Structure 
TDKMQSLALAPVGNLDSYIRAANAWPMLSADEERALAEKLHYHGDLEAAKTLILSHLRF
CCHHHHHHCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHH
VVHIARNYAGYGLPQADLIQEGNIGLMKAVRRFNPEVGVRLVSFAVHWIKAEIHEYVLRN
HHHHHHHHCCCCCCHHHHHCCCCHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHC
WRIVKVATTKAQRKLFFNLRKTKQRLGWFNQDEVEMVARELGVTSKDVREMESRMAAQDM
CEEEEEEHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHC
TFDLSSDDDSDSQPMAPVLYLQDKSSNFADGIEDDNWEEQAANRLTDAMQGLDERSQDII
EEECCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH
RARWLDEDNKSTLQELADRYGVSAERVRQLEKNAMKKLRAAIEA
HHHHCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCC

PDB accession: NA

Resolution: NA

Structure class: Alpha

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 7501460 [H]