Definition Rhodopseudomonas palustris HaA2, complete genome.
Accession NC_007778
Length 5,331,656

The map label for this gene is rpoH [H]

Identifier: 86747323

GI number: 86747323

Start: 221351

End: 222340

Strand: Reverse

Name: rpoH [H]

Synonym: RPB_0197

Alternate gene names: 86747323

Gene position: 222340-221351 (Counterclockwise)

Preceding gene: 86747324

Following gene: 86747321

Centisome position: 4.17
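
The gene length and centisome position can be checked from the coordinates above. This is a minimal sketch, assuming the centisome value is the gene's 5' coordinate (222340 on the reverse strand) expressed as a percentage of the genome length; the record does not state the formula it uses, and the function names are illustrative only.

```python
GENOME_LENGTH = 5_331_656          # "Length" field for NC_007778
START, END = 221_351, 222_340      # "Start" and "End" fields

def gene_length(start: int, end: int) -> int:
    """Inclusive length in bases of a feature spanning start..end."""
    return end - start + 1

def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the genome length (assumed definition)."""
    return 100.0 * position / genome_length

print(gene_length(START, END))                    # 990
print(round(centisome(END, GENOME_LENGTH), 2))    # 4.17
```

Under the same assumption, the lower coordinate (221351) would give 4.15, so the recorded 4.17 is consistent with the 222340 end of the gene.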

GC content: 65.35
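
GC content is conventionally the percentage of G and C bases in a sequence. The sketch below assumes that definition (the record does not say how its value was computed) and uses a toy input; passing the 990-base gene sequence of this record would show whether it reproduces the recorded figure.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)

# Toy example: two of the four bases are G or C.
print(gc_content("ATGC"))  # 50.0
```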

Gene sequence:

>990_bases
ATGACCTCGGCATTTTCTTCGTCGTCCCTCGCCCCGGCACACTCCGACGCTGCGGTTTTCGATGAGAAGAGCTACATGAG
GGCGATCGGCCGCTATCCGGTCCTGGAGCCGGATGAAGAAGCTCGGTTGTGGCAGCGATGGCTGCAGCATCGCGACAAAG
CGGCGGCTGACGCGCTGATCACCAGCCACCTCCGGCTCGCCGCGAAACTGGCTCGCGACTTCCGACGCTATGGCTTTCCG
CTGGGGGATCTGATCGCCGAAGCGAATCTCGGACTGATGATGGCGCTCGACCGGTTCGACCCCGAACGCGGCGCGCGGTT
CTCGACCTGCGCGGTGTGGTGGATCCGGTCAGCGATCTACGATCACATCATCCGATCGTGGTCGCTGGTGCGGATCGGCC
GGACGCCTGCGCAGAAGAAGTTGTTCTTCCGGCTTCGCGGCGAGATCCGCCGGCTCCAGCCCGATCACCACGGCACGCTC
ACCAAGGAATTGGCCGAACAGATTTCAGCGACGCTCGACGTTCCGCTTCGCGAAGTCATCGAGATGGAGCAGCGCCTGTC
CGGCGACCGGTCCTTGAACACGCCGTTGTCTGATCTCGACGAGAGCGGCGAGTGGCAGGATCTGATTGCCGACGACGCGC
CGAACGCCGAGGCGGTCCTCGCCGGCCACGACGAACTCGACCATCAACGCCGCGCGTTGCAGGACGCGCTGGTTCAGCTC
GATGCCCGCGAACGCTACATCTTTTCGGCCAGACATTTGGGCGAGCGTCCCGCCAGCTTCGAGACGATCGGTCAGTCGCT
CTCGATCTCGGCGGAACGGGTGCGGCAGATCGAGGCCCGCGCATTCGCCAAGGTCGCGAACTCTGCCCGCCGAACGTGCG
GGACGGCGCGGCCGGCCGCACGTGTCACCAGCAACCGAAAGACGACCGCTCTGACCGCTCCGCCCAACTGGATCGGCCAC
AACGCAGCCGCGGTCCACGCCTCGGTCTGA

Upstream 100 bases:

>100_bases
TGTTCTCTGCCCTCGTCATAACACCTAGATTTAATGTTCAGCCGAAATTCATCTCTCTACATTTGTTCGCAGAGTTCAGC
ATCGGCATTGGAGCCCACCC

Downstream 100 bases:

>100_bases
GCGTCCAGGCCGGTGCGACCCGGCCGGGCGATGCAGGCGCCGCCGCACACGCTTCCCTCTATTTTGATGTTCGTAACCTT
CAGTATTCATCGCGGATGGT

Product: sigma 32 (RpoH)

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 329; Mature: 328

Protein sequence:

>329_residues
MTSAFSSSSLAPAHSDAAVFDEKSYMRAIGRYPVLEPDEEARLWQRWLQHRDKAAADALITSHLRLAAKLARDFRRYGFP
LGDLIAEANLGLMMALDRFDPERGARFSTCAVWWIRSAIYDHIIRSWSLVRIGRTPAQKKLFFRLRGEIRRLQPDHHGTL
TKELAEQISATLDVPLREVIEMEQRLSGDRSLNTPLSDLDESGEWQDLIADDAPNAEAVLAGHDELDHQRRALQDALVQL
DARERYIFSARHLGERPASFETIGQSLSISAERVRQIEARAFAKVANSARRTCGTARPAARVTSNRKTTALTAPPNWIGH
NAAAVHASV
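
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the standard bacterial codon table and that the "Mature" form is simply the translated protein without its initiator methionine, which matches the 329 versus 328 residue counts but is not stated explicitly in the record. The helper names are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def mature(protein: str) -> str:
    """Drop the initiator methionine (assumed to be the only processing step)."""
    return protein[1:] if protein.startswith("M") else protein

# First 78 bases (26 codons) of the gene sequence in this record.
FIRST_78 = ("ATGACCTCGG" "CATTTTCTTC" "GTCGTCCCTC" "GCCCCGGCAC"
            "ACTCCGACGC" "TGCGGTTTTC" "GATGAGAAGA" "GCTACATG")

print(translate(FIRST_78))          # MTSAFSSSSLAPAHSDAAVFDEKSYM
print(mature(translate(FIRST_78)))  # TSAFSSSSLAPAHSDAAVFDEKSYM
```

The final two codons of the gene, GTC TGA, translate to a valine followed by a stop, in agreement with the C-terminal "ASV" of the protein.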

Sequences:

>Translated_329_residues
MTSAFSSSSLAPAHSDAAVFDEKSYMRAIGRYPVLEPDEEARLWQRWLQHRDKAAADALITSHLRLAAKLARDFRRYGFP
LGDLIAEANLGLMMALDRFDPERGARFSTCAVWWIRSAIYDHIIRSWSLVRIGRTPAQKKLFFRLRGEIRRLQPDHHGTL
TKELAEQISATLDVPLREVIEMEQRLSGDRSLNTPLSDLDESGEWQDLIADDAPNAEAVLAGHDELDHQRRALQDALVQL
DARERYIFSARHLGERPASFETIGQSLSISAERVRQIEARAFAKVANSARRTCGTARPAARVTSNRKTTALTAPPNWIGH
NAAAVHASV
>Mature_328_residues
TSAFSSSSLAPAHSDAAVFDEKSYMRAIGRYPVLEPDEEARLWQRWLQHRDKAAADALITSHLRLAAKLARDFRRYGFPL
GDLIAEANLGLMMALDRFDPERGARFSTCAVWWIRSAIYDHIIRSWSLVRIGRTPAQKKLFFRLRGEIRRLQPDHHGTLT
KELAEQISATLDVPLREVIEMEQRLSGDRSLNTPLSDLDESGEWQDLIADDAPNAEAVLAGHDELDHQRRALQDALVQLD
ARERYIFSARHLGERPASFETIGQSLSISAERVRQIEARAFAKVANSARRTCGTARPAARVTSNRKTTALTAPPNWIGHN
AAAVHASV

Specific function: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor directs transcription from heat shock promoters [H]

COG id: COG0568

COG function: function code K; DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the sigma-70 factor family. Sigma-32 subfamily [H]

Homologues:

Organism=Escherichia coli, GI1789871, Length=300, Percent_Identity=35, Blast_Score=157, Evalue=8e-40,
Organism=Escherichia coli, GI1789098, Length=267, Percent_Identity=32.2097378277154, Blast_Score=121, Evalue=5e-29,
Organism=Escherichia coli, GI1789448, Length=240, Percent_Identity=31.6666666666667, Blast_Score=92, Evalue=4e-20,
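
The homologue lines above follow a comma-separated key=value layout, with the GI number given as a bare "GI..." token. A minimal parsing sketch, assuming that layout holds for every line (the function name and the returned field names are illustrative):

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]      # bare GI token, no '=' sign
    # Convert numeric fields when present.
    for key in ("Length", "Blast_Score"):
        if key in record:
            record[key] = int(record[key])
    for key in ("Percent_Identity", "Evalue"):
        if key in record:
            record[key] = float(record[key])
    return record

hit = parse_homologue(
    "Organism=Escherichia coli, GI1789871, Length=300, "
    "Percent_Identity=35, Blast_Score=157, Evalue=8e-40,"
)
print(hit["Organism"], hit["GI"], hit["Blast_Score"])  # Escherichia coli 1789871 157
```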

Paralogues:

None

Copy number: <10 [C]

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR014284
- InterPro:   IPR000943
- InterPro:   IPR009042
- InterPro:   IPR007627
- InterPro:   IPR007630
- InterPro:   IPR013325
- InterPro:   IPR013324
- InterPro:   IPR012759
- InterPro:   IPR011991 [H]

Pfam domain/function: PF00140 Sigma70_r1_2; PF04542 Sigma70_r2; PF04545 Sigma70_r4 [H]

EC number: NA

Molecular weight: Translated: 36892; Mature: 36761
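
The two molecular weights differ by about the mass of one methionine residue, which is what removal of the initiator Met would predict. The sketch below checks that arithmetic; the value 131.19 Da is the average mass of a Met residue (free methionine minus water), and the assumption that the database derived the mature weight this way is not confirmed by the record.

```python
MET_RESIDUE_MASS = 131.19  # average mass of a Met residue in Da (assumed value)

def mature_mass(translated_mass: float,
                removed_residue_mass: float = MET_RESIDUE_MASS) -> float:
    """Mass left after removing one N-terminal residue from the chain."""
    return translated_mass - removed_residue_mass

print(round(mature_mass(36892)))  # 36761
```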

Theoretical pI: Translated: 8.71; Mature: 8.71

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
1.5 %Met     (Translated Protein)
2.1 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
1.2 %Met     (Mature Protein)
1.8 %Cys+Met (Mature Protein)
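
The percentages above can be reproduced by counting residues in the sequences given in this record. This is a sketch that assumes each percentage is the residue count divided by the sequence length, rounded to one decimal place; the function name is illustrative.

```python
TRANSLATED = (
    "MTSAFSSSSLAPAHSDAAVFDEKSYMRAIGRYPVLEPDEEARLWQRWLQHRDKAAADALITSHLRLAAKLARDFRRYGFP"
    "LGDLIAEANLGLMMALDRFDPERGARFSTCAVWWIRSAIYDHIIRSWSLVRIGRTPAQKKLFFRLRGEIRRLQPDHHGTL"
    "TKELAEQISATLDVPLREVIEMEQRLSGDRSLNTPLSDLDESGEWQDLIADDAPNAEAVLAGHDELDHQRRALQDALVQL"
    "DARERYIFSARHLGERPASFETIGQSLSISAERVRQIEARAFAKVANSARRTCGTARPAARVTSNRKTTALTAPPNWIGH"
    "NAAAVHASV"
)
MATURE = TRANSLATED[1:]  # translated protein without the initiator Met

def percent(seq: str, residues: str) -> float:
    """Percentage of seq made up of the given one-letter residues."""
    count = sum(seq.count(r) for r in residues)
    return round(100.0 * count / len(seq), 1)

for label, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(label, percent(seq, "C"), percent(seq, "M"), percent(seq, "CM"))
# Translated 0.6 1.5 2.1
# Mature 0.6 1.2 1.8
```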

Secondary structure:

>Translated Secondary Structure
MTSAFSSSSLAPAHSDAAVFDEKSYMRAIGRYPVLEPDEEARLWQRWLQHRDKAAADALI
CCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH
TSHLRLAAKLARDFRRYGFPLGDLIAEANLGLMMALDRFDPERGARFSTCAVWWIRSAIY
HHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHH
DHIIRSWSLVRIGRTPAQKKLFFRLRGEIRRLQPDHHGTLTKELAEQISATLDVPLREVI
HHHHHHCHHEEECCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCHHHHHH
EMEQRLSGDRSLNTPLSDLDESGEWQDLIADDAPNAEAVLAGHDELDHQRRALQDALVQL
HHHHHHCCCCCCCCCHHHHCCCCCHHHHHCCCCCCCCEEEECCHHHHHHHHHHHHHHHHH
DARERYIFSARHLGERPASFETIGQSLSISAERVRQIEARAFAKVANSARRTCGTARPAA
HHHHHHHHHHHHHCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCHH
RVTSNRKTTALTAPPNWIGHNAAAVHASV
HHCCCCCEEEEECCCCCCCCCCCEEECCC
>Mature Secondary Structure 
TSAFSSSSLAPAHSDAAVFDEKSYMRAIGRYPVLEPDEEARLWQRWLQHRDKAAADALI
CCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH
TSHLRLAAKLARDFRRYGFPLGDLIAEANLGLMMALDRFDPERGARFSTCAVWWIRSAIY
HHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHH
DHIIRSWSLVRIGRTPAQKKLFFRLRGEIRRLQPDHHGTLTKELAEQISATLDVPLREVI
HHHHHHCHHEEECCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCHHHHHH
EMEQRLSGDRSLNTPLSDLDESGEWQDLIADDAPNAEAVLAGHDELDHQRRALQDALVQL
HHHHHHCCCCCCCCCHHHHCCCCCHHHHHCCCCCCCCEEEECCHHHHHHHHHHHHHHHHH
DARERYIFSARHLGERPASFETIGQSLSISAERVRQIEARAFAKVANSARRTCGTARPAA
HHHHHHHHHHHHHCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCHH
RVTSNRKTTALTAPPNWIGHNAAAVHASV
HHCCCCCEEEEECCCCCCCCCCCEEECCC
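
The prediction strings above use one letter per residue: H for helix, E for strand and C for coil. A small sketch for summarising such a string as percentages follows; it is shown on a toy input, and how the database turns these proportions into a structure class is not stated in the record.

```python
from collections import Counter

def state_fractions(states: str) -> dict:
    """Percentage of residues in each predicted state (H, E, C)."""
    if not states:
        raise ValueError("empty prediction string")
    counts = Counter(states)
    total = len(states)
    return {s: 100.0 * counts.get(s, 0) / total for s in "HEC"}

# Toy example: 4 helix, 2 coil and 2 strand positions.
print(state_fractions("HHHHCCEE"))  # {'H': 50.0, 'E': 25.0, 'C': 25.0}
```

Joining the prediction lines of this record into one string and passing it to `state_fractions` would give the overall helix, strand and coil proportions for the protein.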

PDB accession: NA

Resolution: NA

Structure class: Alpha

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 7501460 [H]