| Definition | Ehrlichia chaffeensis str. Arkansas, complete genome. |
|---|---|
| Accession | NC_007799 |
| Length | 1,176,248 |
The map label for this gene is rpoH [H]
Identifier: 88658518
GI number: 88658518
Start: 670231
End: 671124
Strand: Direct
Name: rpoH [H]
Synonym: ECH_0655
Alternate gene names: 88658518
Gene position: 670231-671124 (Clockwise)
Preceding gene: 88658336
Following gene: 88658477
Centisome position: 56.98
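The length and centisome values in this record can be reproduced from the coordinates alone. The sketch below assumes that the centisome position is the gene start expressed as a percentage of the genome length, and that coordinates are 1-based and inclusive. Both are assumptions that happen to match the listed figures; the function names are illustrative and do not come from the database.

```python
# Coordinates taken from this record.
GENOME_LENGTH = 1_176_248   # Length of NC_007799
GENE_START = 670_231
GENE_END = 671_124


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


if __name__ == "__main__":
    print(gene_length(GENE_START, GENE_END))                # 894
    print(round(centisome(GENE_START, GENOME_LENGTH), 2))   # 56.98
```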
GC content: 31.43
Gene sequence:
>894_bases ATGTTAACAAATTCTATATTTTCCCTAACTCAAGACAATTTAATGTCCTATATCAATGAAGTGCATGCATTTCCGATTTT GTCTCCTGAAGAAGAAGACAGGTTAGCAAGAAATTGGTATGAAAATGGGATCGTTGCTGACGCACATAGGTTAGTTACTA GTCATCTAAGGCTAGTAGTCAAAGTTGCATTAAGCTTTAAAAATTATGAATTGCCTCTTATAGAGCTAATAATGGAAGGA AATATAGGGCTGATGCAGGCTGTAAAAAAGTTCAATCCCACTCTTGGCTTTAGGTTATCCACTTATGCTATTTGGTGGAT CAAAGCTTTTATTAAGGACTATATTCTTAAATCTTGGTCGTGCATTAAAATTGGTACAACACAAGCACAAAGGAAGTTAT TCTTTAGCTTAAGGAAAATTAAGAAAAAACTTTTTAAATATAACCACAATATTACAAAAGAAGATATAAAGCTAATTGCA AATAAATGTTCAACTTCTGAACAAGAAGTAGAACAGATGAACAGGTATTTTCTCTACAGAGATAGATCCCTGAATGAACT AGTATTCTCTAATGATAATCAAAATGGAGTCGAATTACAAGAGATTATAAAGTGTGATACCCCAAACCAAGAGGATACAT ATTTACTAAATGAAGAGTTAAATATAAAAAAGGCTTTAATTTCACAAGCTTTATCAACACTAAATGAAAGATACCGCGAC ATATTCATCAGGCGGCGACTCATCGAAGAACCAGATACTTTAGACAAATTAAGTCAAGAGTATAATATATCAAAAGAGAG AGTTAGACAAATAGAAATGCATGCTTTTACTAAAGTAAAGAATTTTATTATATCTGAAAGAGAAAAACTAGGTCATTGTA ATATCAATAGTTAA
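A GC content figure like the one listed above is normally the percentage of G and C bases in the gene sequence. The following is a minimal sketch of that calculation, assuming the sequence is pasted in with the same space-separated blocks used in this record; `gc_content` is a hypothetical helper name, not part of any database tooling.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Paste the full 894-base gene sequence here to compare with the record.
    first_block = "ATGTTAACAAATTCTATATTTTCCCTAACTCAAGACAATTTAATGTCCTATATCAATGAAGTGCATGCATTTCCGATTTT"
    print(round(gc_content(first_block), 2))
```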
Upstream 100 bases:
>100_bases TTATATTAACAATTAATACACTAAAACCCTAAAAATTTACTATACATCATCATATCACTTTGATATAATACAGCGTTGTA ATCAAGTTTCATATGTCAAT
Downstream 100 bases:
>100_bases ATAAGAATAGCATATCGATCTAACTATACGCTTAACATAAAAAGAAACTAATATACATTAATCTTAGAAAAATAAAAGAA GATGTTAAGAGAAATAAGAC
Product: RNA polymerase sigma-32 factor
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 297; Mature: 297
Protein sequence:
>297_residues MLTNSIFSLTQDNLMSYINEVHAFPILSPEEEDRLARNWYENGIVADAHRLVTSHLRLVVKVALSFKNYELPLIELIMEG NIGLMQAVKKFNPTLGFRLSTYAIWWIKAFIKDYILKSWSCIKIGTTQAQRKLFFSLRKIKKKLFKYNHNITKEDIKLIA NKCSTSEQEVEQMNRYFLYRDRSLNELVFSNDNQNGVELQEIIKCDTPNQEDTYLLNEELNIKKALISQALSTLNERYRD IFIRRRLIEEPDTLDKLSQEYNISKERVRQIEMHAFTKVKNFIISEREKLGHCNINS
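The protein sequence corresponds to the gene sequence read codon by codon: 894 bases give 298 codons, and dropping the terminal stop codon leaves 297 residues. The sketch below illustrates this with the standard codon assignments, built from the conventional TCAG ordering. It is an illustration rather than the pipeline used to produce this record, and `translate` is a hypothetical helper name.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard genetic code).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 24 bases and last 15 bases of the gene sequence in this record.
    print(translate("ATGTTAACAAATTCTATATTTTCC"))  # MLTNSIFS
    print(translate("AATATCAATAGTTAA"))           # NINS
```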
Sequences:
>Translated_297_residues MLTNSIFSLTQDNLMSYINEVHAFPILSPEEEDRLARNWYENGIVADAHRLVTSHLRLVVKVALSFKNYELPLIELIMEG NIGLMQAVKKFNPTLGFRLSTYAIWWIKAFIKDYILKSWSCIKIGTTQAQRKLFFSLRKIKKKLFKYNHNITKEDIKLIA NKCSTSEQEVEQMNRYFLYRDRSLNELVFSNDNQNGVELQEIIKCDTPNQEDTYLLNEELNIKKALISQALSTLNERYRD IFIRRRLIEEPDTLDKLSQEYNISKERVRQIEMHAFTKVKNFIISEREKLGHCNINS
>Mature_297_residues MLTNSIFSLTQDNLMSYINEVHAFPILSPEEEDRLARNWYENGIVADAHRLVTSHLRLVVKVALSFKNYELPLIELIMEG NIGLMQAVKKFNPTLGFRLSTYAIWWIKAFIKDYILKSWSCIKIGTTQAQRKLFFSLRKIKKKLFKYNHNITKEDIKLIA NKCSTSEQEVEQMNRYFLYRDRSLNELVFSNDNQNGVELQEIIKCDTPNQEDTYLLNEELNIKKALISQALSTLNERYRD IFIRRRLIEEPDTLDKLSQEYNISKERVRQIEMHAFTKVKNFIISEREKLGHCNINS
Specific function: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor is responsible for the expression of heat shock promoters [H]
COG id: COG0568
COG function: function code K; DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the sigma-70 factor family. Sigma-32 subfamily [H]
Homologues:
Organism=Escherichia coli, GI1789871, Length=273, Percent_Identity=35.8974358974359, Blast_Score=175, Evalue=3e-45
Organism=Escherichia coli, GI1789098, Length=290, Percent_Identity=31.7241379310345, Blast_Score=124, Evalue=1e-29
Organism=Escherichia coli, GI1789448, Length=253, Percent_Identity=26.4822134387352, Blast_Score=79, Evalue=4e-16
Paralogues:
None
Copy number: <10 [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR014284, IPR000943, IPR009042, IPR007627, IPR007630, IPR013325, IPR013324, IPR012759, IPR011991 [H]
Pfam domain/function: PF00140 Sigma70_r1_2; PF04542 Sigma70_r2; PF04545 Sigma70_r4 [H]
EC number: NA
Molecular weight: Translated: 35053; Mature: 35053
Theoretical pI: Translated: 9.01; Mature: 9.01
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.3 %Cys (Translated Protein)
2.0 %Met (Translated Protein)
3.4 %Cys+Met (Translated Protein)
1.3 %Cys (Mature Protein)
2.0 %Met (Mature Protein)
3.4 %Cys+Met (Mature Protein)
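The Cys/Met percentages can be checked by counting residues in the 297-residue protein sequence given in this record. The sketch below assumes each percentage is the residue count divided by the sequence length, rounded to one decimal place. `residue_percent` is a hypothetical helper name.

```python
# Protein sequence copied from this record (translated and mature are identical).
PROTEIN = (
    "MLTNSIFSLTQDNLMSYINEVHAFPILSPEEEDRLARNWYENGIVADAHRLVTSHLRLVVKVALSFKNYELPLIELIMEG"
    "NIGLMQAVKKFNPTLGFRLSTYAIWWIKAFIKDYILKSWSCIKIGTTQAQRKLFFSLRKIKKKLFKYNHNITKEDIKLIA"
    "NKCSTSEQEVEQMNRYFLYRDRSLNELVFSNDNQNGVELQEIIKCDTPNQEDTYLLNEELNIKKALISQALSTLNERYRD"
    "IFIRRRLIEEPDTLDKLSQEYNISKERVRQIEMHAFTKVKNFIISEREKLGHCNINS"
)


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given one-letter residues."""
    seq = "".join(protein.split()).upper()
    hits = sum(1 for aa in seq if aa in residues)
    return 100.0 * hits / len(seq)


if __name__ == "__main__":
    print(round(residue_percent(PROTEIN, "C"), 1))    # 1.3
    print(round(residue_percent(PROTEIN, "M"), 1))    # 2.0
    print(round(residue_percent(PROTEIN, "CM"), 1))   # 3.4
```

The combined figure is computed from the combined count (10 of 297 residues), which is why it reads 3.4 rather than the sum of the two rounded values.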
Secondary structure:
>Translated Secondary Structure
MLTNSIFSLTQDNLMSYINEVHAFPILSPEEEDRLARNWYENGIVADAHRLVTSHLRLVV
CCCCHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH
KVALSFKNYELPLIELIMEGNIGLMQAVKKFNPTLGFRLSTYAIWWIKAFIKDYILKSWS
HHHHHHCCCCCHHHHHHHCCCCHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHCCCC
CIKIGTTQAQRKLFFSLRKIKKKLFKYNHNITKEDIKLIANKCSTSEQEVEQMNRYFLYR
EEEECCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHC
DRSLNELVFSNDNQNGVELQEIIKCDTPNQEDTYLLNEELNIKKALISQALSTLNERYRD
CCCCCHHHCCCCCCCCHHHHHHHHCCCCCCCCCEEECCCCHHHHHHHHHHHHHHHHHHHH
IFIRRRLIEEPDTLDKLSQEYNISKERVRQIEMHAFTKVKNFIISEREKLGHCNINS
HHHHHHHHCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC
>Mature Secondary Structure
MLTNSIFSLTQDNLMSYINEVHAFPILSPEEEDRLARNWYENGIVADAHRLVTSHLRLVV
CCCCHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH
KVALSFKNYELPLIELIMEGNIGLMQAVKKFNPTLGFRLSTYAIWWIKAFIKDYILKSWS
HHHHHHCCCCCHHHHHHHCCCCHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHCCCC
CIKIGTTQAQRKLFFSLRKIKKKLFKYNHNITKEDIKLIANKCSTSEQEVEQMNRYFLYR
EEEECCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHC
DRSLNELVFSNDNQNGVELQEIIKCDTPNQEDTYLLNEELNIKKALISQALSTLNERYRD
CCCCCHHHCCCCCCCCHHHHHHHHCCCCCCCCCEEECCCCHHHHHHHHHHHHHHHHHHHH
IFIRRRLIEEPDTLDKLSQEYNISKERVRQIEMHAFTKVKNFIISEREKLGHCNINS
HHHHHHHHCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 7501460 [H]