Definition Rhizobium leguminosarum bv. trifolii WSM2304 chromosome, complete genome.
Accession NC_011369
Length 4,537,948

The map label for this gene is rpoH [H]

Identifier: 209550663

GI number: 209550663

Start: 3147817

End: 3148725

Strand: Direct

Name: rpoH [H]

Synonym: Rleg2_3087

Alternate gene names: 209550663

Gene position: 3147817-3148725 (Clockwise)

Preceding gene: 209550662

Following gene: 209550666

Centisome position: 69.37
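
The position fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes (the record does not state the formula) that the centisome position is the gene start expressed as a percentage of the replicon length, and that coordinates are 1-based and inclusive. The function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, replicon_length: int) -> float:
    """Gene start as a percentage of the replicon length (assumed definition)."""
    return round(100.0 * start / replicon_length, 2)


# Values taken from this record.
START, END, CHROMOSOME_LENGTH = 3147817, 3148725, 4537948

print(gene_length(START, END))              # 909
print(centisome(START, CHROMOSOME_LENGTH))  # 69.37
```

Under these assumptions the computed length matches the 909-base gene sequence and the computed percentage matches the centisome value given in the record.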

GC content: 62.05
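
A GC content figure of this kind is normally the percentage of G and C bases in the gene sequence. The sketch below is one plausible way to compute it. How the database handles rounding or ambiguous bases is not stated in the record, so counting only A/C/G/T and rounding to two decimals are assumptions, as is the function name.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Whitespace and line breaks (as in a wrapped FASTA body) are ignored.
    """
    seq = "".join(sequence.split()).upper()
    counted = [base for base in seq if base in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for base in counted if base in "GC")
    return round(100.0 * gc / len(counted), 2)


# Example call on the first 30 bases of the gene sequence in this record.
print(gc_content("ATGGCCCGAAATACCTTGCCGTCCATTACC"))
```

Passing the full 909-base sequence (with or without its line breaks) to the same function gives the whole-gene value.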

Gene sequence:

>909_bases
ATGGCCCGAAATACCTTGCCGTCCATTACCGCCGGCGAAGCCGGTCTCAATCGTTACCTCGACGAAATCCGCAAGTTTCC
GATGCTCGAGCCGCAGCAGGAATACATGCTCGCCAAGCGTTATGCCGAGCATGGCGATCGCGACGCCGCCCACAAGCTCG
TCACCAGCCATCTTCGCCTCGTCGCAAAGATCGCGATGGGTTATCGCGGCTATGGCCTGCCGATCGGCGAAGTCGTTTCC
GAGGGCAATGTCGGCCTGATGCAGGCCGTCAAGAAGTTCGATGCCGAGCGCGGCTTCCGCCTTGCCACCTACGCCATGTG
GTGGATCAAGGCCTCGATCCAGGAATATATTCTGCGTTCTTGGTCGCTGGTGAAGATGGGCACGACCGCCAACCAGAAGC
GCCTGTTCTTCAACCTGCGCCGGCTGAAAGGCCGCATCCAGGCGATCGATGACGGCGACCTGAAGCCGGAGCACGTCTCC
GAAATCGCCACCAAGCTGAAGGTCTCGGAGGAGGAGGTCATTTCGATGAACCGCCGCCTGTCCGGCGACGCCTCGCTGAA
CGCGCCGATCAAGGCGGCCGAAGGCGACAGCGGCCAATGGCAGGATTGGCTGGTGGACGACCATGACAGCCAGGAAGACG
TGCTGATCGAACAGGACGAGCTCGATACCCGCCGGCGCATGCTGGCGAAAGCGATGAGCGTGCTGAACGAGCGCGAACGC
CGCATCTTCGAGGCTCGCCGCCTCGCCGAGGATCCGGTGACGCTGGAAGACCTGTCGACAGAATTCGACATCAGCCGCGA
ACGCGTCCGTCAGATCGAGGTCCGCGCTTTCGAGAAGGTGCAGGACGCTGTCCGCAAGGAAGCCCAGGAACGCGCCAAGG
CCGTCCGCGTCGTCGAAGCAACTGCATAA

Upstream 100 bases:

>100_bases
TAGTGCGGGCTCGCGGTAGAGCCGCACGTCCCGGTTTGCCATTTTTAACGCGAGGACGCGTTTCCAACGGGAACCGGAAT
TCACTTTAGGAGGGTGCACT

Downstream 100 bases:

>100_bases
ACCTGCCTCGTCGGAAAATTGAAAAGGCGGGTCCTTGCGGCCCGCCTTTGTGATTCTCTGCTTTTCGCAACCTCGGCACC
CCGTCAAAGGTCGAGAGTGC

Product: RNA polymerase factor sigma-32

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 302; Mature: 301

Protein sequence:

>302_residues
MARNTLPSITAGEAGLNRYLDEIRKFPMLEPQQEYMLAKRYAEHGDRDAAHKLVTSHLRLVAKIAMGYRGYGLPIGEVVS
EGNVGLMQAVKKFDAERGFRLATYAMWWIKASIQEYILRSWSLVKMGTTANQKRLFFNLRRLKGRIQAIDDGDLKPEHVS
EIATKLKVSEEEVISMNRRLSGDASLNAPIKAAEGDSGQWQDWLVDDHDSQEDVLIEQDELDTRRRMLAKAMSVLNERER
RIFEARRLAEDPVTLEDLSTEFDISRERVRQIEVRAFEKVQDAVRKEAQERAKAVRVVEATA

Sequences:

>Translated_302_residues
MARNTLPSITAGEAGLNRYLDEIRKFPMLEPQQEYMLAKRYAEHGDRDAAHKLVTSHLRLVAKIAMGYRGYGLPIGEVVS
EGNVGLMQAVKKFDAERGFRLATYAMWWIKASIQEYILRSWSLVKMGTTANQKRLFFNLRRLKGRIQAIDDGDLKPEHVS
EIATKLKVSEEEVISMNRRLSGDASLNAPIKAAEGDSGQWQDWLVDDHDSQEDVLIEQDELDTRRRMLAKAMSVLNERER
RIFEARRLAEDPVTLEDLSTEFDISRERVRQIEVRAFEKVQDAVRKEAQERAKAVRVVEATA
>Mature_301_residues
ARNTLPSITAGEAGLNRYLDEIRKFPMLEPQQEYMLAKRYAEHGDRDAAHKLVTSHLRLVAKIAMGYRGYGLPIGEVVSE
GNVGLMQAVKKFDAERGFRLATYAMWWIKASIQEYILRSWSLVKMGTTANQKRLFFNLRRLKGRIQAIDDGDLKPEHVSE
IATKLKVSEEEVISMNRRLSGDASLNAPIKAAEGDSGQWQDWLVDDHDSQEDVLIEQDELDTRRRMLAKAMSVLNERERR
IFEARRLAEDPVTLEDLSTEFDISRERVRQIEVRAFEKVQDAVRKEAQERAKAVRVVEATA
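
The relationship between the gene sequence and the two protein sequences can be sketched as follows: 909 bases are 303 codons, the last of which is a stop codon, leaving 302 translated residues. The mature form listed above lacks the initiator methionine, leaving 301. The code is a minimal illustration using the standard genetic code. The function names are hypothetical, and treating the mature form as "translated minus the leading Met" is inferred from the two sequences above, not from any stated method.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in T, C, A, G order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def mature(protein: str) -> str:
    """Drop a leading initiator Met, if present (assumed definition of 'mature')."""
    return protein[1:] if protein.startswith("M") else protein


# First 30 bases of the gene sequence in this record.
n_term = translate("ATGGCCCGAAATACCTTGCCGTCCATTACC")
print(n_term)          # MARNTLPSIT
print(mature(n_term))  # ARNTLPSIT
```

Both outputs agree with the first residues of the translated and mature sequences listed above.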

Specific function: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor is responsible for the expression of genes transcribed from heat shock promoters [H]

COG id: COG0568

COG function: function code K; DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the sigma-70 factor family. Sigma-32 subfamily [H]

Homologues:

Organism=Escherichia coli, GI1789871, Length=285, Percent_Identity=36.49, Blast_Score=174, Evalue=7e-45
Organism=Escherichia coli, GI1789098, Length=274, Percent_Identity=32.48, Blast_Score=119, Evalue=3e-28
Organism=Escherichia coli, GI1789448, Length=250, Percent_Identity=28.8, Blast_Score=92, Evalue=3e-20
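
Each homologue line is a comma-separated list of key=value fields, plus a bare GI token. A minimal parsing sketch is shown below. The function name and the choice of output types are assumptions made for illustration and are not part of the database.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dictionary.

    Expected shape (a trailing comma is tolerated):
    Organism=<name>, GI<number>, Length=<int>, Percent_Identity=<float>,
    Blast_Score=<float>, Evalue=<float>
    """
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key.strip()] = value.strip()
        elif field.startswith("GI"):
            # The GI number appears without an '=' sign.
            record["GI"] = field[2:]
    if "Length" in record:
        record["Length"] = int(record["Length"])
    for key in ("Percent_Identity", "Blast_Score", "Evalue"):
        if key in record:
            record[key] = float(record[key])
    return record
```

Sorting the parsed records by `Evalue` puts the closest hit first.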

Paralogues:

None

Copy number: <10 [C]

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR014284
- InterPro:   IPR000943
- InterPro:   IPR009042
- InterPro:   IPR007627
- InterPro:   IPR007630
- InterPro:   IPR013325
- InterPro:   IPR013324
- InterPro:   IPR012759
- InterPro:   IPR011991 [H]

Pfam domain/function: PF00140 Sigma70_r1_2; PF04542 Sigma70_r2; PF04545 Sigma70_r4 [H]

EC number: NA

Molecular weight: Translated: 34603; Mature: 34472

Theoretical pI: Translated: 7.10; Mature: 7.10

Prosite motif: PS00715 SIGMA70_1; PS00716 SIGMA70_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.0 %Cys     (Translated Protein)
3.3 %Met     (Translated Protein)
3.3 %Cys+Met (Translated Protein)
0.0 %Cys     (Mature Protein)
3.0 %Met     (Mature Protein)
3.0 %Cys+Met (Mature Protein)
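
These percentages are simple residue frequencies. The protein contains no cysteine, and the drop from 3.3 to 3.0 %Met between the translated and mature forms is what removing the single initiator methionine would produce (10 of 302 residues versus 9 of 301). A minimal sketch of the calculation, with a hypothetical function name and an assumed rounding to one decimal place:

```python
def percent_residue(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any letter in `residues`."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty protein sequence")
    hits = sum(1 for aa in seq if aa in residues.upper())
    return round(100.0 * hits / len(seq), 1)


# Example calls on the first 30 residues of the translated protein.
fragment = "MARNTLPSITAGEAGLNRYLDEIRKFPMLE"
print(percent_residue(fragment, "C"))   # Cys only
print(percent_residue(fragment, "M"))   # Met only
print(percent_residue(fragment, "CM"))  # Cys + Met
```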

Secondary structure:

>Translated Secondary Structure
MARNTLPSITAGEAGLNRYLDEIRKFPMLEPQQEYMLAKRYAEHGDRDAAHKLVTSHLRL
CCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHH
VAKIAMGYRGYGLPIGEVVSEGNVGLMQAVKKFDAERGFRLATYAMWWIKASIQEYILRS
HHHHHHCCCCCCCCHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHC
WSLVKMGTTANQKRLFFNLRRLKGRIQAIDDGDLKPEHVSEIATKLKVSEEEVISMNRRL
CHHEEECCCCCHHHHHHHHHHHHHHHEECCCCCCCHHHHHHHHHHHHCCHHHHHHHHHCC
SGDASLNAPIKAAEGDSGQWQDWLVDDHDSQEDVLIEQDELDTRRRMLAKAMSVLNERER
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECHHHHHHHHHHHHHHHHHHHHHHH
RIFEARRLAEDPVTLEDLSTEFDISRERVRQIEVRAFEKVQDAVRKEAQERAKAVRVVEA
HHHHHHHHHCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC
TA
CC
>Mature Secondary Structure 
ARNTLPSITAGEAGLNRYLDEIRKFPMLEPQQEYMLAKRYAEHGDRDAAHKLVTSHLRL
CCCCCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHH
VAKIAMGYRGYGLPIGEVVSEGNVGLMQAVKKFDAERGFRLATYAMWWIKASIQEYILRS
HHHHHHCCCCCCCCHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHC
WSLVKMGTTANQKRLFFNLRRLKGRIQAIDDGDLKPEHVSEIATKLKVSEEEVISMNRRL
CHHEEECCCCCHHHHHHHHHHHHHHHEECCCCCCCHHHHHHHHHHHHCCHHHHHHHHHCC
SGDASLNAPIKAAEGDSGQWQDWLVDDHDSQEDVLIEQDELDTRRRMLAKAMSVLNERER
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECHHHHHHHHHHHHHHHHHHHHHHH
RIFEARRLAEDPVTLEDLSTEFDISRERVRQIEVRAFEKVQDAVRKEAQERAKAVRVVEA
HHHHHHHHHCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC
TA
CC
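
The state strings above can be summarised numerically. The sketch below assumes the usual three-state convention (H = helix, E = strand, C = coil), which the record does not spell out, and reports the percentage of residues in each state. The function name is illustrative.

```python
from collections import Counter


def ss_fractions(states: str) -> dict:
    """Percentage of residues in each of the states H, E and C."""
    s = "".join(states.split()).upper()
    if not s:
        raise ValueError("empty state string")
    counts = Counter(s)
    return {state: round(100.0 * counts.get(state, 0) / len(s), 1) for state in "HEC"}


# Example call on the first state line of the translated prediction.
print(ss_fractions("CCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHH"))
```

Concatenating all state lines of one prediction before calling the function gives whole-protein percentages, which can be compared with the "Alpha" structure class reported in this record.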

PDB accession: NA

Resolution: NA

Structure class: Alpha

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 7501460 [H]