Definition: Acidovorax citrulli AAC00-1 chromosome, complete genome.
Accession: NC_008752
Length: 5,352,772 bp

The map label for this gene is rpoH [H]

Identifier: 120612533

GI number: 120612533

Start: 4318313

End: 4319254

Strand: Direct

Name: rpoH [H]

Synonym: Aave_3894

Alternate gene names: 120612533

Gene position: 4318313-4319254 (Clockwise)

Preceding gene: 120612532

Following gene: 120612534

Centisome position: 80.67
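
The coordinate fields above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming that the centisome position is the start coordinate expressed as a percentage of the chromosome length. Neither convention is stated in this record, and the function names are illustrative only.

```python
GENOME_LENGTH = 5_352_772            # chromosome length from the record header
START, END = 4_318_313, 4_319_254    # gene coordinates from the record


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the chromosome (assumed definition)."""
    return round(100 * start / genome_length, 2)


print(gene_length(START, END))          # 942
print(centisome(START, GENOME_LENGTH))  # 80.67
```

Under these assumptions both values agree with the 942-base gene sequence and the centisome position reported in this record.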

GC content (%): 65.71
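
A GC content figure of this kind is usually the share of G and C bases in the gene sequence. The sketch below shows that calculation, assuming this definition (the record does not say how the value was derived). The helper name `gc_percent` is hypothetical, and the demonstration uses only the first 80 bases of the gene sequence listed in this record, so its output is not expected to equal the whole-gene value.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100 * gc / len(seq), 2)


# First 80 of the 942 bases in this record's gene sequence.
fragment = (
    "ATGACGATTCAGTCTGGAACCGCCGCCACTGCGCTTGCAC"
    "CGGTCAACGCCTGGGCGCTGATCCCCCCGCTGGGCAATCT"
)
print(gc_percent(fragment))
```

Running the same function over the full 942-base sequence would be the way to compare against the reported value.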

Gene sequence:

>942_bases
ATGACGATTCAGTCTGGAACCGCCGCCACTGCGCTTGCACCGGTCAACGCCTGGGCGCTGATCCCCCCGCTGGGCAATCT
GGATGCGTACATCTCCGCGGTGAACCGCCTGCCGATGCTCACGGCCGAGGAAGAGCGCACCTACGCCCGCCGGCTGAAGG
AGCACAACGACGTGGAGGCCGCGGGCCGGATGGTGATGTCGCATCTGCGGCTGGTGGTTTCGATCGCCCGGCAGTACCTC
GGCTACGGCCTGCCGCATGGCGACCTGATCCAGGAGGGCAACGTGGGCCTGATGAAAGCCGTGAAACGCTTTGATCCGGA
CCAGAACGTGCGCCTGGTGAGCTACGCCATGCACTGGATCAAGGCCGAGATCCACGAGTACATCCTGAAGAACTGGCGCA
TGGTGAAGGTGGCCACCACCAAGAGCCAGCGCAAGCTGTTCTTCAACCTGCGCTCGATGAAGCAGGGCTTCAAGGCCGAC
GCCGCCGCCGGGGACGCGGGCACGCACCGCGAGACGCTCTCCGAGCAGGAGATCGACGTCGTGGCCCAGCAGCTCAACGT
CAAGCGCGAGGAAGTCATCGAGATGGAAACGCGCCTGTCGGGAGGCGACGTGATGCTGGACCCGGCCCCGTCCGATGACG
GCGAACAGGCCTACGGACCCATCGCCTACCTGGCCGACGGCATGCACGAGCCGACCGCCATGATCGAATCGCGCCAGCGC
GACGTGCTGGCCACGGACGGCATCGCCAATGCCCTCGCTACGCTGGACGACCGGAGCCGCCGCATCGTCGAGGAGCGCTG
GCTCAAGGTCAACGACGACGGTTCGGGCGGCATGACGCTGCATGAACTGGCTGCGGTGTACGGCGTGAGCGCCGAGCGCA
TCCGGCAGATCGAGGTCGCCGCGATGAAGAAGATGAAGAAGGCGCTGGCGGAATACGCCTGA

Upstream 100 bases:

>100_bases
TCGCAGACATCCCCAGTCCTACGCAGGGAAAGGAACTTCGGCCTTAGCACTCCCTCCAAGAGACTGCTAAAGTGAAAGAA
TGGAAGAAAGGATTTCTGGA

Downstream 100 bases:

>100_bases
CACCCTCGCACCGTTTCGCGCTGCGGAATCCTCGTGGCGCCAGCGATAATCGGGCGTGCTACCTGAACGGGTAGCGCGCC
TTTTTTTCATCTGCCGCACA

Product: RNA polymerase sigma-32 subunit RpoH

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 313; Mature: 312
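
The residue counts follow from the gene length: 942 bases are 314 codons, and the last codon is a stop, leaving 313 residues. The mature sequence listed in this record lacks the leading M, which suggests the mature count assumes removal of the initiator methionine. The sketch below illustrates this with the first 78 bases of the gene. It assumes the standard codon assignments (shared by the bacterial translation table), and the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code in the usual T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 78 bases (26 codons) of the gene sequence in this record.
gene_start = (
    "ATGACGATTCAGTCTGGAACCGCCGCCACTGCGCTTGCA"
    "CCGGTCAACGCCTGGGCGCTGATCCCCCCGCTGGGCAAT"
)
print(translate(gene_start))   # MTIQSGTAATALAPVNAWALIPPLGN

translated_len = 942 // 3 - 1  # 314 codons minus the stop codon
mature_len = translated_len - 1  # assumes the initiator Met is removed
print(translated_len, mature_len)  # 313 312
```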

Protein sequence:

>313_residues
MTIQSGTAATALAPVNAWALIPPLGNLDAYISAVNRLPMLTAEEERTYARRLKEHNDVEAAGRMVMSHLRLVVSIARQYL
GYGLPHGDLIQEGNVGLMKAVKRFDPDQNVRLVSYAMHWIKAEIHEYILKNWRMVKVATTKSQRKLFFNLRSMKQGFKAD
AAAGDAGTHRETLSEQEIDVVAQQLNVKREEVIEMETRLSGGDVMLDPAPSDDGEQAYGPIAYLADGMHEPTAMIESRQR
DVLATDGIANALATLDDRSRRIVEERWLKVNDDGSGGMTLHELAAVYGVSAERIRQIEVAAMKKMKKALAEYA

Sequences:

>Translated_313_residues
MTIQSGTAATALAPVNAWALIPPLGNLDAYISAVNRLPMLTAEEERTYARRLKEHNDVEAAGRMVMSHLRLVVSIARQYL
GYGLPHGDLIQEGNVGLMKAVKRFDPDQNVRLVSYAMHWIKAEIHEYILKNWRMVKVATTKSQRKLFFNLRSMKQGFKAD
AAAGDAGTHRETLSEQEIDVVAQQLNVKREEVIEMETRLSGGDVMLDPAPSDDGEQAYGPIAYLADGMHEPTAMIESRQR
DVLATDGIANALATLDDRSRRIVEERWLKVNDDGSGGMTLHELAAVYGVSAERIRQIEVAAMKKMKKALAEYA
>Mature_312_residues
TIQSGTAATALAPVNAWALIPPLGNLDAYISAVNRLPMLTAEEERTYARRLKEHNDVEAAGRMVMSHLRLVVSIARQYLG
YGLPHGDLIQEGNVGLMKAVKRFDPDQNVRLVSYAMHWIKAEIHEYILKNWRMVKVATTKSQRKLFFNLRSMKQGFKADA
AAGDAGTHRETLSEQEIDVVAQQLNVKREEVIEMETRLSGGDVMLDPAPSDDGEQAYGPIAYLADGMHEPTAMIESRQRD
VLATDGIANALATLDDRSRRIVEERWLKVNDDGSGGMTLHELAAVYGVSAERIRQIEVAAMKKMKKALAEYA

Specific function: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor directs transcription from heat shock promoters [H]

COG id: COG0568

COG function: function code K; DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the sigma-70 factor family. Sigma-32 subfamily [H]

Homologues:

Organism=Escherichia coli, GI1789871, Length=290, Percent_Identity=49.31, Blast_Score=273, Evalue=1e-74
Organism=Escherichia coli, GI1789098, Length=287, Percent_Identity=32.06, Blast_Score=106, Evalue=2e-24
Organism=Escherichia coli, GI1789448, Length=279, Percent_Identity=30.11, Blast_Score=82, Evalue=5e-17

Paralogues:

None

Copy number: <10 [C]

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR014284
- InterPro:   IPR000943
- InterPro:   IPR009042
- InterPro:   IPR007627
- InterPro:   IPR007630
- InterPro:   IPR013325
- InterPro:   IPR013324
- InterPro:   IPR012759
- InterPro:   IPR011991 [H]

Pfam domain/function: PF00140 Sigma70_r1_2; PF04542 Sigma70_r2; PF04545 Sigma70_r4 [H]

EC number: NA

Molecular weight (Da): Translated: 34789; Mature: 34658

Theoretical pI: Translated: 6.53; Mature: 6.53

Prosite motif: PS00715 SIGMA70_1; PS00716 SIGMA70_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.0 %Cys     (Translated Protein)
4.8 %Met     (Translated Protein)
4.8 %Cys+Met (Translated Protein)
0.0 %Cys     (Mature Protein)
4.5 %Met     (Mature Protein)
4.5 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MTIQSGTAATALAPVNAWALIPPLGNLDAYISAVNRLPMLTAEEERTYARRLKEHNDVEA
CCCCCCCCHHHCCCCCCEEECCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCHHH
AGRMVMSHLRLVVSIARQYLGYGLPHGDLIQEGNVGLMKAVKRFDPDQNVRLVSYAMHWI
HHHHHHHHHHHHHHHHHHHHHCCCCCCCHHCCCCHHHHHHHHHCCCCCCHHHHHHHHHHH
KAEIHEYILKNWRMVKVATTKSQRKLFFNLRSMKQGFKADAAAGDAGTHRETLSEQEIDV
HHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHH
VAQQLNVKREEVIEMETRLSGGDVMLDPAPSDDGEQAYGPIAYLADGMHEPTAMIESRQR
HHHHHCCHHHHHHHHHHHCCCCCEEECCCCCCCCCHHHCCHHHHHCCCCCHHHHHHHHCC
DVLATDGIANALATLDDRSRRIVEERWLKVNDDGSGGMTLHELAAVYGVSAERIRQIEVA
CHHHHHHHHHHHHHHHHHHHHHHHHHCEEECCCCCCCHHHHHHHHHHCCCHHHHHHHHHH
AMKKMKKALAEYA
HHHHHHHHHHHCC
>Mature Secondary Structure 
TIQSGTAATALAPVNAWALIPPLGNLDAYISAVNRLPMLTAEEERTYARRLKEHNDVEA
CCCCCCCHHHCCCCCCEEECCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCHHH
AGRMVMSHLRLVVSIARQYLGYGLPHGDLIQEGNVGLMKAVKRFDPDQNVRLVSYAMHWI
HHHHHHHHHHHHHHHHHHHHHCCCCCCCHHCCCCHHHHHHHHHCCCCCCHHHHHHHHHHH
KAEIHEYILKNWRMVKVATTKSQRKLFFNLRSMKQGFKADAAAGDAGTHRETLSEQEIDV
HHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHH
VAQQLNVKREEVIEMETRLSGGDVMLDPAPSDDGEQAYGPIAYLADGMHEPTAMIESRQR
HHHHHCCHHHHHHHHHHHCCCCCEEECCCCCCCCCHHHCCHHHHHCCCCCHHHHHHHHCC
DVLATDGIANALATLDDRSRRIVEERWLKVNDDGSGGMTLHELAAVYGVSAERIRQIEVA
CHHHHHHHHHHHHHHHHHHHHHHHHHCEEECCCCCCCHHHHHHHHHHCCCHHHHHHHHHH
AMKKMKKALAEYA
HHHHHHHHHHHCC

PDB accession: NA

Resolution: NA

Structure class: Alpha

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 7542800 [H]