Definition | Rhodopseudomonas palustris HaA2, complete genome. |
---|---|
Accession | NC_007778 |
Length | 5,331,656 |
The map label for this gene is rpoH [H]
Identifier: 86747323
GI number: 86747323
Start: 221351
End: 222340
Strand: Reverse
Name: rpoH [H]
Synonym: RPB_0197
Alternate gene names: 86747323
Gene position: 222340-221351 (Counterclockwise)
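The Start and End fields can be checked against the reported sequence length. The sketch below assumes the coordinates are 1-based and inclusive, an assumption that reproduces the 990 bases given for this gene; the function name `gene_length` is illustrative and not part of the record.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based, inclusive coordinates.

    The order of the two coordinates does not matter, so a reverse-strand
    gene written as 222340-221351 gives the same answer.
    """
    low, high = sorted((start, end))
    return high - low + 1


print(gene_length(221351, 222340))  # 990
```

The same call works for the counterclockwise notation, for example `gene_length(222340, 221351)`.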
Preceding gene: 86747324
Following gene: 86747321
Centisome position: 4.17
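The centisome position expresses a coordinate as a percentage of the genome length (5,331,656 for NC_007778). The sketch below assumes the value is computed from the 5' end of the gene, which for this reverse-strand gene is position 222340; that choice reproduces the reported 4.17, whereas position 221351 would give 4.15. The function name `centisome` is illustrative.

```python
def centisome(position: int, genome_length: int) -> float:
    """Position on the chromosome as a percentage of genome length."""
    if not 1 <= position <= genome_length:
        raise ValueError("position must lie within the genome")
    return round(100.0 * position / genome_length, 2)


GENOME_LENGTH = 5_331_656  # Length field of this record

print(centisome(222340, GENOME_LENGTH))  # 4.17
print(centisome(221351, GENOME_LENGTH))  # 4.15
```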
GC content: 65.35
Gene sequence:
>990_bases
ATGACCTCGGCATTTTCTTCGTCGTCCCTCGCCCCGGCACACTCCGACGCTGCGGTTTTCGATGAGAAGAGCTACATGAG
GGCGATCGGCCGCTATCCGGTCCTGGAGCCGGATGAAGAAGCTCGGTTGTGGCAGCGATGGCTGCAGCATCGCGACAAAG
CGGCGGCTGACGCGCTGATCACCAGCCACCTCCGGCTCGCCGCGAAACTGGCTCGCGACTTCCGACGCTATGGCTTTCCG
CTGGGGGATCTGATCGCCGAAGCGAATCTCGGACTGATGATGGCGCTCGACCGGTTCGACCCCGAACGCGGCGCGCGGTT
CTCGACCTGCGCGGTGTGGTGGATCCGGTCAGCGATCTACGATCACATCATCCGATCGTGGTCGCTGGTGCGGATCGGCC
GGACGCCTGCGCAGAAGAAGTTGTTCTTCCGGCTTCGCGGCGAGATCCGCCGGCTCCAGCCCGATCACCACGGCACGCTC
ACCAAGGAATTGGCCGAACAGATTTCAGCGACGCTCGACGTTCCGCTTCGCGAAGTCATCGAGATGGAGCAGCGCCTGTC
CGGCGACCGGTCCTTGAACACGCCGTTGTCTGATCTCGACGAGAGCGGCGAGTGGCAGGATCTGATTGCCGACGACGCGC
CGAACGCCGAGGCGGTCCTCGCCGGCCACGACGAACTCGACCATCAACGCCGCGCGTTGCAGGACGCGCTGGTTCAGCTC
GATGCCCGCGAACGCTACATCTTTTCGGCCAGACATTTGGGCGAGCGTCCCGCCAGCTTCGAGACGATCGGTCAGTCGCT
CTCGATCTCGGCGGAACGGGTGCGGCAGATCGAGGCCCGCGCATTCGCCAAGGTCGCGAACTCTGCCCGCCGAACGTGCG
GGACGGCGCGGCCGGCCGCACGTGTCACCAGCAACCGAAAGACGACCGCTCTGACCGCTCCGCCCAACTGGATCGGCCAC
AACGCAGCCGCGGTCCACGCCTCGGTCTGA
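The GC content field is the percentage of G and C bases in the gene sequence above. A minimal sketch of that calculation follows; the function name `gc_percent` is illustrative. The figure of 647 G or C bases is inferred from the reported values (it is the only whole-number count out of 990 that rounds to 65.35) rather than counted here.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return round(100.0 * gc / len(seq), 2)


print(gc_percent("ATGC"))            # 50.0
print(round(100.0 * 647 / 990, 2))   # 65.35, matching the GC content field
```

Passing the full gene sequence, with or without the line breaks, to `gc_percent` gives the value for this gene.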
Upstream 100 bases:
>100_bases
TGTTCTCTGCCCTCGTCATAACACCTAGATTTAATGTTCAGCCGAAATTCATCTCTCTACATTTGTTCGCAGAGTTCAGC
ATCGGCATTGGAGCCCACCC
Downstream 100 bases:
>100_bases
GCGTCCAGGCCGGTGCGACCCGGCCGGGCGATGCAGGCGCCGCCGCACACGCTTCCCTCTATTTTGATGTTCGTAACCTT
CAGTATTCATCGCGGATGGT
Product: sigma 32 (RpoH)
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 329; Mature: 328
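Both residue counts follow from the 990-base gene length. The sketch below assumes the final codon is a stop codon (the gene sequence in this record ends in TGA) and that the mature form differs only by removal of the initiator methionine, which matches the Mature sequence listed in this record. The function name `protein_lengths` is illustrative.

```python
def protein_lengths(cds_bases: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes one terminal stop codon and removal of the initiator Met
    in the mature protein.
    """
    if cds_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    translated = cds_bases // 3 - 1   # drop the stop codon
    mature = translated - 1           # drop the initiator methionine
    return translated, mature


print(protein_lengths(990))  # (329, 328)
```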
Protein sequence:
>329_residues
MTSAFSSSSLAPAHSDAAVFDEKSYMRAIGRYPVLEPDEEARLWQRWLQHRDKAAADALITSHLRLAAKLARDFRRYGFP
LGDLIAEANLGLMMALDRFDPERGARFSTCAVWWIRSAIYDHIIRSWSLVRIGRTPAQKKLFFRLRGEIRRLQPDHHGTL
TKELAEQISATLDVPLREVIEMEQRLSGDRSLNTPLSDLDESGEWQDLIADDAPNAEAVLAGHDELDHQRRALQDALVQL
DARERYIFSARHLGERPASFETIGQSLSISAERVRQIEARAFAKVANSARRTCGTARPAARVTSNRKTTALTAPPNWIGH
NAAAVHASV
Sequences:
>Translated_329_residues
MTSAFSSSSLAPAHSDAAVFDEKSYMRAIGRYPVLEPDEEARLWQRWLQHRDKAAADALITSHLRLAAKLARDFRRYGFP
LGDLIAEANLGLMMALDRFDPERGARFSTCAVWWIRSAIYDHIIRSWSLVRIGRTPAQKKLFFRLRGEIRRLQPDHHGTL
TKELAEQISATLDVPLREVIEMEQRLSGDRSLNTPLSDLDESGEWQDLIADDAPNAEAVLAGHDELDHQRRALQDALVQL
DARERYIFSARHLGERPASFETIGQSLSISAERVRQIEARAFAKVANSARRTCGTARPAARVTSNRKTTALTAPPNWIGH
NAAAVHASV
>Mature_328_residues
TSAFSSSSLAPAHSDAAVFDEKSYMRAIGRYPVLEPDEEARLWQRWLQHRDKAAADALITSHLRLAAKLARDFRRYGFPL
GDLIAEANLGLMMALDRFDPERGARFSTCAVWWIRSAIYDHIIRSWSLVRIGRTPAQKKLFFRLRGEIRRLQPDHHGTLT
KELAEQISATLDVPLREVIEMEQRLSGDRSLNTPLSDLDESGEWQDLIADDAPNAEAVLAGHDELDHQRRALQDALVQLD
ARERYIFSARHLGERPASFETIGQSLSISAERVRQIEARAFAKVANSARRTCGTARPAARVTSNRKTTALTAPPNWIGHN
AAAVHASV
Specific function: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor is responsible for transcription from heat shock promoters [H]
COG id: COG0568
COG function: function code K; DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the sigma-70 factor family. Sigma-32 subfamily [H]
Homologues:
Organism=Escherichia coli, GI1789871, Length=300, Percent_Identity=35, Blast_Score=157, Evalue=8e-40
Organism=Escherichia coli, GI1789098, Length=267, Percent_Identity=32.2097378277154, Blast_Score=121, Evalue=5e-29
Organism=Escherichia coli, GI1789448, Length=240, Percent_Identity=31.6666666666667, Blast_Score=92, Evalue=4e-20
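Each homologue entry above uses the same key=value layout, so the hits can be read into structured records. The sketch below is one possible parser, not a tool that belongs to this database; the names `HIT_PATTERN` and `parse_homologues` are illustrative, and the pattern assumes the field order shown in this record.

```python
import re

# Field order assumed from this record: Organism, GI, Length,
# Percent_Identity, Blast_Score, Evalue.
HIT_PATTERN = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*"
    r"Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<identity>[\d.]+),\s*"
    r"Blast_Score=(?P<score>[\d.]+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_homologues(text: str) -> list[dict]:
    """Extract every homologue hit found in the text, in order."""
    hits = []
    for match in HIT_PATTERN.finditer(text):
        hits.append({
            "organism": match["organism"].strip(),
            "gi": match["gi"],
            "length": int(match["length"]),
            "identity": float(match["identity"]),
            "score": float(match["score"]),
            "evalue": float(match["evalue"]),
        })
    return hits


record = (
    "Organism=Escherichia coli, GI1789871, Length=300, Percent_Identity=35, "
    "Blast_Score=157, Evalue=8e-40, "
    "Organism=Escherichia coli, GI1789098, Length=267, "
    "Percent_Identity=32.2097378277154, Blast_Score=121, Evalue=5e-29,"
)
for hit in parse_homologues(record):
    print(hit["gi"], hit["identity"], hit["evalue"])
```

Because the parser uses `finditer`, it accepts the hits on a single line or one per line.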
Paralogues:
None
Copy number: <10 [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR014284
- InterPro: IPR000943
- InterPro: IPR009042
- InterPro: IPR007627
- InterPro: IPR007630
- InterPro: IPR013325
- InterPro: IPR013324
- InterPro: IPR012759
- InterPro: IPR011991 [H]
Pfam domain/function: PF00140 Sigma70_r1_2; PF04542 Sigma70_r2; PF04545 Sigma70_r4 [H]
EC number: NA
Molecular weight: Translated: 36892; Mature: 36761
Theoretical pI: Translated: 8.71; Mature: 8.71
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.6 %Cys (Translated Protein)
1.5 %Met (Translated Protein)
2.1 %Cys+Met (Translated Protein)
0.6 %Cys (Mature Protein)
1.2 %Met (Mature Protein)
1.8 %Cys+Met (Mature Protein)
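The percentages above are residue counts divided by protein length. The sketch below shows the calculation; the function names are illustrative. By our count, the translated sequence in this record contains 2 Cys and 5 Met in 329 residues, and the mature form has one Met fewer in 328 residues, which reproduces the listed values.

```python
def percent(count: int, length: int) -> float:
    """Share of residues as a percentage, rounded to one decimal place."""
    if length <= 0:
        raise ValueError("length must be positive")
    return round(100.0 * count / length, 1)


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the given one-letter residues in a protein sequence."""
    seq = "".join(protein.split()).upper()
    hits = sum(seq.count(r) for r in set(residues.upper()))
    return percent(hits, len(seq))


# Counts taken from the translated (329 aa) and mature (328 aa) forms.
print(percent(2, 329), percent(5, 329), percent(7, 329))  # 0.6 1.5 2.1
print(percent(2, 328), percent(4, 328), percent(6, 328))  # 0.6 1.2 1.8

# Toy sequence to show the string-based helper.
print(residue_percent("MCAAAAAAAA", "CM"))  # 20.0
```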
Secondary structure:
>Translated Secondary Structure
MTSAFSSSSLAPAHSDAAVFDEKSYMRAIGRYPVLEPDEEARLWQRWLQHRDKAAADALI
CCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH
TSHLRLAAKLARDFRRYGFPLGDLIAEANLGLMMALDRFDPERGARFSTCAVWWIRSAIY
HHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHH
DHIIRSWSLVRIGRTPAQKKLFFRLRGEIRRLQPDHHGTLTKELAEQISATLDVPLREVI
HHHHHHCHHEEECCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCHHHHHH
EMEQRLSGDRSLNTPLSDLDESGEWQDLIADDAPNAEAVLAGHDELDHQRRALQDALVQL
HHHHHHCCCCCCCCCHHHHCCCCCHHHHHCCCCCCCCEEEECCHHHHHHHHHHHHHHHHH
DARERYIFSARHLGERPASFETIGQSLSISAERVRQIEARAFAKVANSARRTCGTARPAA
HHHHHHHHHHHHHCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCHH
RVTSNRKTTALTAPPNWIGHNAAAVHASV
HHCCCCCEEEEECCCCCCCCCCCEEECCC
>Mature Secondary Structure
TSAFSSSSLAPAHSDAAVFDEKSYMRAIGRYPVLEPDEEARLWQRWLQHRDKAAADALI
CCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH
TSHLRLAAKLARDFRRYGFPLGDLIAEANLGLMMALDRFDPERGARFSTCAVWWIRSAIY
HHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHH
DHIIRSWSLVRIGRTPAQKKLFFRLRGEIRRLQPDHHGTLTKELAEQISATLDVPLREVI
HHHHHHCHHEEECCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCHHHHHH
EMEQRLSGDRSLNTPLSDLDESGEWQDLIADDAPNAEAVLAGHDELDHQRRALQDALVQL
HHHHHHCCCCCCCCCHHHHCCCCCHHHHHCCCCCCCCEEEECCHHHHHHHHHHHHHHHHH
DARERYIFSARHLGERPASFETIGQSLSISAERVRQIEARAFAKVANSARRTCGTARPAA
HHHHHHHHHHHHHCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCHH
RVTSNRKTTALTAPPNWIGHNAAAVHASV
HHCCCCCEEEEECCCCCCCCCCCEEECCC
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 7501460 [H]