Definition Streptococcus pneumoniae D39, complete genome.
Accession NC_008533
Length 2,046,115

The map label for this gene is rpoD

Identifier: 116516638

GI number: 116516638

Start: 972812

End: 973921

Strand: Direct

Name: rpoD

Synonym: SPD_0958

Alternate gene names: 116516638

Gene position: 972812-973921 (Clockwise)

Preceding gene: 116515597

Following gene: 116516796

Centisome position: 47.54
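
The gene length and centisome position follow arithmetically from the Start, End, and genome Length fields above. The sketch below assumes 1-based inclusive coordinates and assumes the centisome is the start coordinate expressed as a percentage of genome length; the record does not state its exact convention, and the function names are illustrative.

```python
# Values copied from the record; the helper names are hypothetical.
GENOME_LENGTH = 2_046_115      # "Length" of NC_008533
START, END = 972_812, 973_921  # "Start" and "End" (assumed 1-based, inclusive)


def gene_length(start: int, end: int) -> int:
    """Number of bases spanned by an inclusive coordinate pair."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(START, END))                     # bases in the gene
print(round(centisome(START, GENOME_LENGTH), 2))   # percent of the genome
```

Under these assumptions the results agree with the 1110-base gene sequence and the centisome value listed in this record.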

GC content: 43.15
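
GC content is conventionally the percentage of G and C bases in the sequence. A minimal sketch of that calculation follows, assuming the record uses this plain definition (its method is not stated); `gc_percent` is an illustrative name.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Toy inputs only; pass the full gene sequence to compare with the record.
print(gc_percent("ATGC"))
print(gc_percent("GGCC"))
```

Applying the function to the 1110-base gene sequence below gives a figure that can be compared against the value reported here.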

Gene sequence:

>1110_bases
ATGGCAACAAAACAAAAAGAAGTAACAACATTTGACGTACAGGTAGCAGAATTTATCCGTAATCATAAGCAAAAAGGGAC
AGCAACAGATGATGAAATCAATGCAAGTCTGGTTATTCCTTTTACCTTGGACGCTGATGGGATTGAAGATCTCTTGCAAC
GGATTCAGGATGCAGGGATTTCTATCACAGATAACGAAGGAAATCCAAGTGCGCGTGTTCTTAGCAATGAAGAAGAACCA
GAACTCAGCGATGAGGACTTGATTGGGTCAACTTCTGCTAAGGTCAATGACCCTGTCCGTATGTACTTGAAAGAAATAGG
GGTCGTTCCTCTCTTGACCAATGAAGAGGAGAAAGAGTTGGCACTGGCTGTTGAAGCTGGTGATATCGAAGCCAAACAAC
GTCTTGCGGAAGCCAATCTTCGTTTGGTTGTTTCCATTGCCAAACGCTATGTCGGTCGTGGTATGCAGTTCCTTGACTTG
ATTCAAGAAGGAAATATGGGCTTGATGAAGGCGGTTGACAAGTTTGACTATTCTAAAGGGTTCAAGTTTTCAACTTATGC
AACTTGGTGGATTCGTCAGGCTATCACTCGTGCTATTGCGGACCAAGCTCGTACCATCCGTATCCCAGTTCACATGGTTG
AAACTATCAATAAATTGGTTCGTGAACAGCGGAATCTCCTTCAAGAATTGGGGCAAGATCCGACACCAGAACAGATTGCT
GAACGAATGGATATGACACCTGATAAGGTTCGTGAAATCTTGAAGATTGCCCAAGAACCAGTATCTCTTGAAACTCCTAT
CGGTGAAGAGGACGATAGCCACCTTGGAGACTTTATCGAAGATGAAGTGATTGAAAATCCAGTGGATTATACGACTCGTA
TCGTCTTGCGTGAGCAATTGGATGAAATCTTAGATACTCTTACAGACCGTGAAGAAAATGTTCTGCGTCTACGTTTTGGA
CTAGATGATGGAAAAATGCGCACACTTGAAGATGTGGGGAAAGTCTTTAACGTAACTCGTGAGCGTATCCGTCAGATTGA
AGCAAAGGCTTTGAGAAAACTACGCCAACCAAGTCGTAGCAAACCGCTTCGTGATTTTATTGAAGACTAA

Upstream 100 bases:

>100_bases
TCAAAAAGAAGGTGCAGGAAGCTAGCCATGTAGGAGATACAGATACAGCCCTAGAAGAATTGGAACGTTTAATTTCCCAA
AAGAGAAGAATGGAGTAATA

Downstream 100 bases:

>100_bases
GAGTGAGGAAAATATGGCTTATACAGAAGAGCAAATTGAAAACATCAAAACACGGATTTTAACAGCCTTGGAAGAAGTCA
TCGACCCTGAGTTGGGAATC

Product: RNA polymerase sigma factor RpoD

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 369; Mature: 368
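
The residue counts can be tied back to the gene length: 1110 bases make 370 codons, one of which is the stop codon, leaving 369 translated residues; the mature count of 368 corresponds to the sequence without the initiator methionine. The sketch below translates a DNA string with the standard genetic code; the table construction and the `translate` helper are illustrative, not the record's own tooling.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate from the first base until a stop codon or the end of the sequence."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end translation
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCAACAAAACAAAAAGAA"))  # first seven codons of the gene
print(translate("ATTGAAGACTAA"))           # last four codons, including the stop
```

The two fragments are taken from the start and end of the gene sequence above and correspond to the first and last residues of the protein sequence.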

Protein sequence:

>369_residues
MATKQKEVTTFDVQVAEFIRNHKQKGTATDDEINASLVIPFTLDADGIEDLLQRIQDAGISITDNEGNPSARVLSNEEEP
ELSDEDLIGSTSAKVNDPVRMYLKEIGVVPLLTNEEEKELALAVEAGDIEAKQRLAEANLRLVVSIAKRYVGRGMQFLDL
IQEGNMGLMKAVDKFDYSKGFKFSTYATWWIRQAITRAIADQARTIRIPVHMVETINKLVREQRNLLQELGQDPTPEQIA
ERMDMTPDKVREILKIAQEPVSLETPIGEEDDSHLGDFIEDEVIENPVDYTTRIVLREQLDEILDTLTDREENVLRLRFG
LDDGKMRTLEDVGKVFNVTRERIRQIEAKALRKLRQPSRSKPLRDFIED

Sequences:

>Translated_369_residues
MATKQKEVTTFDVQVAEFIRNHKQKGTATDDEINASLVIPFTLDADGIEDLLQRIQDAGISITDNEGNPSARVLSNEEEP
ELSDEDLIGSTSAKVNDPVRMYLKEIGVVPLLTNEEEKELALAVEAGDIEAKQRLAEANLRLVVSIAKRYVGRGMQFLDL
IQEGNMGLMKAVDKFDYSKGFKFSTYATWWIRQAITRAIADQARTIRIPVHMVETINKLVREQRNLLQELGQDPTPEQIA
ERMDMTPDKVREILKIAQEPVSLETPIGEEDDSHLGDFIEDEVIENPVDYTTRIVLREQLDEILDTLTDREENVLRLRFG
LDDGKMRTLEDVGKVFNVTRERIRQIEAKALRKLRQPSRSKPLRDFIED
>Mature_368_residues
ATKQKEVTTFDVQVAEFIRNHKQKGTATDDEINASLVIPFTLDADGIEDLLQRIQDAGISITDNEGNPSARVLSNEEEPE
LSDEDLIGSTSAKVNDPVRMYLKEIGVVPLLTNEEEKELALAVEAGDIEAKQRLAEANLRLVVSIAKRYVGRGMQFLDLI
QEGNMGLMKAVDKFDYSKGFKFSTYATWWIRQAITRAIADQARTIRIPVHMVETINKLVREQRNLLQELGQDPTPEQIAE
RMDMTPDKVREILKIAQEPVSLETPIGEEDDSHLGDFIEDEVIENPVDYTTRIVLREQLDEILDTLTDREENVLRLRFGL
DDGKMRTLEDVGKVFNVTRERIRQIEAKALRKLRQPSRSKPLRDFIED

Specific function: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This is the primary sigma factor of this bacterium.

COG id: COG0568

COG function: function code K; DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the sigma-70 factor family

Homologues:

Organism=Escherichia coli, GI1789448, Length=245, Percent_Identity=67.7551020408163, Blast_Score=346, Evalue=1e-96,
Organism=Escherichia coli, GI1789098, Length=291, Percent_Identity=43.6426116838488, Blast_Score=239, Evalue=2e-64,
Organism=Escherichia coli, GI1789871, Length=273, Percent_Identity=30.03663003663, Blast_Score=111, Evalue=8e-26,
Organism=Escherichia coli, GI1788231, Length=202, Percent_Identity=30.1980198019802, Blast_Score=72, Evalue=7e-14,
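
Each homologue line is a comma-separated list of `key=value` fields plus a bare GI identifier. A minimal parsing sketch follows, assuming every line keeps this layout; `parse_homologue` and the `"GI"` key are illustrative choices, not part of the record.

```python
def parse_homologue(line: str) -> dict:
    """Split one homologue line into a dictionary of typed fields."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare identifier such as GI1789448
    # Convert numeric fields where present.
    for key in ("Length", "Blast_Score"):
        if key in record:
            record[key] = int(record[key])
    for key in ("Percent_Identity", "Evalue"):
        if key in record:
            record[key] = float(record[key])
    return record


line = ("Organism=Escherichia coli, GI1789448, Length=245, "
        "Percent_Identity=67.7551020408163, Blast_Score=346, Evalue=1e-96,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 1))
```

Parsing the fields this way makes it straightforward to sort hits by score or to round the percent identity for display.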

Paralogues:

None

Copy number: 700 (log & stationary phase) [C]

Swissprot (AC and ID): RPOD_STRPN (P0A4I9)

Other databases:

- EMBL:   AE005672
- PIR:   A95124
- RefSeq:   NP_345546.1
- ProteinModelPortal:   P0A4I9
- SMR:   P0A4I9
- EnsemblBacteria:   EBSTRT00000026782
- GeneID:   931587
- GenomeReviews:   AE005672_GR
- KEGG:   spn:SP_1073
- TIGR:   SP_1073
- GeneTree:   EBGT00050000027574
- HOGENOM:   HBG745096
- OMA:   GDEEAKQ
- ProtClustDB:   PRK09210
- BioCyc:   SPNE170187-1:SP_1073-MONOMER
- InterPro:   IPR014284
- InterPro:   IPR000943
- InterPro:   IPR009042
- InterPro:   IPR007627
- InterPro:   IPR007624
- InterPro:   IPR007630
- InterPro:   IPR007127
- InterPro:   IPR013325
- InterPro:   IPR013324
- InterPro:   IPR012760
- InterPro:   IPR011991
- Gene3D:   G3DSA:1.10.10.10
- PRINTS:   PR00046
- TIGRFAMs:   TIGR02393
- TIGRFAMs:   TIGR02937

Pfam domain/function: PF03979 Sigma70_r1_1; PF00140 Sigma70_r1_2; PF04542 Sigma70_r2; PF04539 Sigma70_r3; PF04545 Sigma70_r4; SSF88946 Sigma_r2; SSF88659 Sigma_r3_r4

EC number: NA

Molecular weight: Translated: 42031; Mature: 41900
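
The translated and mature weights differ by 131, which is consistent with the loss of one methionine residue (average residue mass about 131.19 Da). The sketch below estimates a protein's average molecular weight by summing standard average residue masses and adding one water; this is a common approach, and it is an assumption that the record's values were derived similarly. The `molecular_weight` name is illustrative.

```python
# Average residue masses in Da (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water is added back for the free N- and C-termini


def molecular_weight(protein: str) -> float:
    """Average molecular weight of an unmodified linear peptide, in Da."""
    return sum(RESIDUE_MASS[aa] for aa in protein.upper()) + WATER


# Removing an N-terminal Met lowers the weight by one Met residue mass.
print(round(molecular_weight("MATKQKE") - molecular_weight("ATKQKE"), 2))
```

Passing the translated and mature protein sequences from this record to `molecular_weight` allows a comparison with the values listed here; small differences can arise from the mass table or rounding used.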

Theoretical pI: Translated: 4.47; Mature: 4.47

Prosite motif: PS00715 SIGMA70_1; PS00716 SIGMA70_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.0 %Cys     (Translated Protein)
2.4 %Met     (Translated Protein)
2.4 %Cys+Met (Translated Protein)
0.0 %Cys     (Mature Protein)
2.2 %Met     (Mature Protein)
2.2 %Cys+Met (Mature Protein)
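
These percentages can be reproduced by counting residues in the protein sequence given earlier in this record. The sketch below assumes the mature protein is the translated sequence without its initiator methionine, as the residue counts suggest; `percent` is an illustrative helper.

```python
# Translated protein sequence copied from this record (369 residues).
TRANSLATED = (
    "MATKQKEVTTFDVQVAEFIRNHKQKGTATDDEINASLVIPFTLDADGIEDLLQRIQDAGISITDNEGNPSARVLSNEEEP"
    "ELSDEDLIGSTSAKVNDPVRMYLKEIGVVPLLTNEEEKELALAVEAGDIEAKQRLAEANLRLVVSIAKRYVGRGMQFLDL"
    "IQEGNMGLMKAVDKFDYSKGFKFSTYATWWIRQAITRAIADQARTIRIPVHMVETINKLVREQRNLLQELGQDPTPEQIA"
    "ERMDMTPDKVREILKIAQEPVSLETPIGEEDDSHLGDFIEDEVIENPVDYTTRIVLREQLDEILDTLTDREENVLRLRFG"
    "LDDGKMRTLEDVGKVFNVTRERIRQIEAKALRKLRQPSRSKPLRDFIED"
)
MATURE = TRANSLATED[1:]  # assumption: only the initiator Met is removed


def percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    return 100.0 * sum(seq.count(r) for r in residues) / len(seq)


for name, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(name, round(percent(seq, "C"), 1), round(percent(seq, "M"), 1),
          round(percent(seq, "CM"), 1))
```

The translated sequence contains 9 methionines in 369 residues and no cysteines, and the mature sequence contains 8 methionines in 368 residues, which matches the figures listed above.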

Secondary structure:

>Translated Secondary Structure
MATKQKEVTTFDVQVAEFIRNHKQKGTATDDEINASLVIPFTLDADGIEDLLQRIQDAGI
CCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEEECCCHHHHHHHHHHHHCCC
SITDNEGNPSARVLSNEEEPELSDEDLIGSTSAKVNDPVRMYLKEIGVVPLLTNEEEKEL
EEECCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCEEECCCCCHHE
ALAVEAGDIEAKQRLAEANLRLVVSIAKRYVGRGMQFLDLIQEGNMGLMKAVDKFDYSKG
EEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCCHHHHHHHHHHCCCC
FKFSTYATWWIRQAITRAIADQARTIRIPVHMVETINKLVREQRNLLQELGQDPTPEQIA
CCCHHHHHHHHHHHHHHHHHHCCCEEEEHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHH
ERMDMTPDKVREILKIAQEPVSLETPIGEEDDSHLGDFIEDEVIENPVDYTTRIVLREQL
HHHCCCHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHH
DEILDTLTDREENVLRLRFGLDDGKMRTLEDVGKVFNVTRERIRQIEAKALRKLRQPSRS
HHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
KPLRDFIED
CCHHHHHCC
>Mature Secondary Structure
ATKQKEVTTFDVQVAEFIRNHKQKGTATDDEINASLVIPFTLDADGIEDLLQRIQDAGI
CCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEEECCCHHHHHHHHHHHHCCC
SITDNEGNPSARVLSNEEEPELSDEDLIGSTSAKVNDPVRMYLKEIGVVPLLTNEEEKEL
EEECCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCEEECCCCCHHE
ALAVEAGDIEAKQRLAEANLRLVVSIAKRYVGRGMQFLDLIQEGNMGLMKAVDKFDYSKG
EEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCCHHHHHHHHHHCCCC
FKFSTYATWWIRQAITRAIADQARTIRIPVHMVETINKLVREQRNLLQELGQDPTPEQIA
CCCHHHHHHHHHHHHHHHHHHCCCEEEEHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHH
ERMDMTPDKVREILKIAQEPVSLETPIGEEDDSHLGDFIEDEVIENPVDYTTRIVLREQL
HHHCCCHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHH
DEILDTLTDREENVLRLRFGLDDGKMRTLEDVGKVFNVTRERIRQIEAKALRKLRQPSRS
HHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
KPLRDFIED
CCHHHHHCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 11463916