Definition: Prochlorococcus marinus str. MIT 9313 chromosome, complete genome.
Accession: NC_005071
Length: 2,410,873

The map label for this gene is rpoD2 [H]

Identifier: 33863339

GI number: 33863339

Start: 1153457

End: 1154419

Strand: Reverse

Name: rpoD2 [H]

Synonym: PMT1068

Alternate gene names: 33863339

Gene position: 1154419-1153457 (Counterclockwise)

Preceding gene: 33863342

Following gene: 33863337

Centisome position: 47.88
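
The coordinate fields above are related by simple arithmetic, sketched below. This is an illustration rather than the database's own code. The function names are hypothetical, and it assumes 1-based inclusive coordinates and that the centisome position is taken at the gene's first base, which for a reverse-strand gene is the larger coordinate.

```python
# Values copied from the record above.
GENOME_LENGTH = 2_410_873
START, END = 1_153_457, 1_154_419


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the chromosome length."""
    return 100.0 * position / genome_length


length = gene_length(START, END)
# Reverse-strand gene: its first base sits at END (assumption, see lead-in).
position = centisome(END, GENOME_LENGTH)

# 963 bases = 321 codons = 320 residues plus one stop codon.
print(length, length // 3 - 1, round(position, 2))  # 963 320 47.88
```

Under these assumptions the result matches the 963-base gene sequence, the 320-residue translation, and the centisome value listed in the record.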

GC content: 54.62
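
As a rough sketch of how a GC percentage like this is usually obtained, the hypothetical helper below counts G and C bases and divides by the sequence length. If the reported figure was computed over the 963-base gene sequence (an assumption), 54.62% corresponds to 526 G or C bases.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First six codons of the gene sequence, used only as a small demonstration.
first_codons = "ATGGCTCCCTTGGCGGTG"
print(round(gc_percent(first_codons), 2))

# Arithmetic check on the reported value: 526 G/C bases out of 963.
print(round(100.0 * 526 / 963, 2))  # 54.62
```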

Gene sequence:

>963_bases
ATGGCTCCCTTGGCGGTGCTTTCAGATGTCGACCTGGTGCGTTCGTACCTGCGCGATATCGGTCGAGTGCCGCTGCTGAG
CCATGAGCAGGAGATCACGCTGGGTCGTCAGGTGCAGGAGTTGATGTCTTTAGAGCAGCTTGAGTCTGAACTGGAAGGTC
AAACAGGTGAGCCAGCGAGTCGTAAAGAGCTAGCGAAGGCAGCTGGCTTGAGTGAGTTGCAGCTCAAGAAGAAGTTGCAG
AGCGGACGACGTGCGAAGGAGCGGATGGTGTCTGCGAACCTGCGCTTAGTGGTGAGTGTTGCCAAGAAGTACACCAAACG
GAATATGGAGCTACTTGATTTGATCCAAGAGGGAACGATCGGCTTGGTGAGGGGAGTAGAGAAGTTCGACCCAACCCGTG
GCTACAAGTTTTCGACCTATGCGTATTGGTGGATTCGTCAGGGGATCACGCGTGCGATTGCGGAGAAGAGCCGGACGATC
CGTCTGCCGATCCATATCACGGAGATGCTGAACAAGCTCAAGAAAGGCCAGCGAGAATTAAGTCAGGAGATGGGGCGCAC
GCCAACAGTGAGCGAACTTGCAGAGTTTGTGGAGTTGCCCGAGGAGGAGGTGAAGGATCTGATGTGCCGTGCCCGTCAGC
CGATGAGTTTGGAGATGAAGGTGGGAGATGGGGATGAAACGGAGTTGCTTGAGTTGCTTGCCGGGGAAGAGGAGTTACCG
AGTGAGAAGGTGGAAGTGGATTGCATGAAAGGCGATTTACGTACCTTGCTAGAAAAGTTGCCCGAGCTGCAGGGTCGTGT
GCTGCGGATGCGTTATGGAATCGACGGAGGGGAGCCGATGAACCTCACCGGGATTGCTAAGACTTTAGGAATGAGTCGCG
ATCGAACACGCCGTCTGGAGAGGGAAGGCTTGGCGTTGATGCGAACCTCTTCGTTTGAACTTGAGGCTTATATGGCGGTT
TGA

Upstream 100 bases:

>100_bases
TTCTTCATTTTGGCAATGGAGTGGAATTCGTCCGCAAGGCATTGATTTGTTCTTGGCAAACGATGGTAAATTAGGAAACC
TGGTTGTTTCGTTATTCAGT

Downstream 100 bases:

>100_bases
AACTTGCTTTTTAGTTTTCTGAGAGTCGACAACCAAAGCTAAGGTTTAGGGATTATTCCTGGATTAATTTATCCCCACTC
TCGTCCGTATCAGTTGCATA

Product: type II alternative sigma-70 family RNA polymerase sigma factor

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 320; Mature: 319

Protein sequence:

>320_residues
MAPLAVLSDVDLVRSYLRDIGRVPLLSHEQEITLGRQVQELMSLEQLESELEGQTGEPASRKELAKAAGLSELQLKKKLQ
SGRRAKERMVSANLRLVVSVAKKYTKRNMELLDLIQEGTIGLVRGVEKFDPTRGYKFSTYAYWWIRQGITRAIAEKSRTI
RLPIHITEMLNKLKKGQRELSQEMGRTPTVSELAEFVELPEEEVKDLMCRARQPMSLEMKVGDGDETELLELLAGEEELP
SEKVEVDCMKGDLRTLLEKLPELQGRVLRMRYGIDGGEPMNLTGIAKTLGMSRDRTRRLEREGLALMRTSSFELEAYMAV
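
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is a minimal illustration that assumes the standard codon assignments (which the bacterial table shares) and uses hypothetical names. It is not the pipeline that produced this record.

```python
from itertools import product

# Standard codon table in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 36 bases and last 9 bases of the gene sequence listed above.
print(translate("ATGGCTCCCTTGGCGGTGCTTTCAGATGTCGACCTG"))  # MAPLAVLSDVDL
print(translate("GCGGTTTGA"))                             # AV
```

Both fragments agree with the start (MAPLAVLSDVDL) and end (AV) of the 320-residue sequence shown above.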

Sequences:

>Translated_320_residues
MAPLAVLSDVDLVRSYLRDIGRVPLLSHEQEITLGRQVQELMSLEQLESELEGQTGEPASRKELAKAAGLSELQLKKKLQ
SGRRAKERMVSANLRLVVSVAKKYTKRNMELLDLIQEGTIGLVRGVEKFDPTRGYKFSTYAYWWIRQGITRAIAEKSRTI
RLPIHITEMLNKLKKGQRELSQEMGRTPTVSELAEFVELPEEEVKDLMCRARQPMSLEMKVGDGDETELLELLAGEEELP
SEKVEVDCMKGDLRTLLEKLPELQGRVLRMRYGIDGGEPMNLTGIAKTLGMSRDRTRRLEREGLALMRTSSFELEAYMAV
>Mature_319_residues
APLAVLSDVDLVRSYLRDIGRVPLLSHEQEITLGRQVQELMSLEQLESELEGQTGEPASRKELAKAAGLSELQLKKKLQS
GRRAKERMVSANLRLVVSVAKKYTKRNMELLDLIQEGTIGLVRGVEKFDPTRGYKFSTYAYWWIRQGITRAIAEKSRTIR
LPIHITEMLNKLKKGQRELSQEMGRTPTVSELAEFVELPEEEVKDLMCRARQPMSLEMKVGDGDETELLELLAGEEELPS
EKVEVDCMKGDLRTLLEKLPELQGRVLRMRYGIDGGEPMNLTGIAKTLGMSRDRTRRLEREGLALMRTSSFELEAYMAV

Specific function: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor is a component of the biological clock pathway that affects the circadian expression of a subset of ge

COG id: COG0568

COG function: function code K; DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the sigma-70 factor family [H]

Homologues:

Organism=Escherichia coli, GI1789448, Length=231, Percent_Identity=45.021645021645, Blast_Score=218, Evalue=4e-58,
Organism=Escherichia coli, GI1789098, Length=312, Percent_Identity=35.8974358974359, Blast_Score=186, Evalue=1e-48,
Organism=Escherichia coli, GI1789871, Length=236, Percent_Identity=24.1525423728814, Blast_Score=66, Evalue=3e-12,
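
For downstream use, the homologue lines above can be read into structured records. The parser below is a sketch that assumes the comma-separated `key=value` layout seen in these three lines, with the GI number as the only bare field. The function name and the dictionary keys are illustrative choices, not part of the database.

```python
def parse_hit(line: str) -> dict:
    """Parse one homologue line into a dictionary with typed numeric fields."""
    hit = {}
    # Split on commas and ignore the empty field left by the trailing comma.
    for field in (f.strip() for f in line.split(",")):
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            hit[key] = value
        elif field.startswith("GI"):
            hit["GI"] = field[2:]
    hit["Length"] = int(hit["Length"])
    hit["Percent_Identity"] = float(hit["Percent_Identity"])
    hit["Blast_Score"] = int(hit["Blast_Score"])
    hit["Evalue"] = float(hit["Evalue"])
    return hit


line = ("Organism=Escherichia coli, GI1789448, Length=231, "
        "Percent_Identity=45.021645021645, Blast_Score=218, Evalue=4e-58,")
hit = parse_hit(line)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 1), hit["Evalue"])
```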

Paralogues:

None

Copy number: 700 (log & stationary phase) [C]

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR014284
- InterPro:   IPR000943
- InterPro:   IPR009042
- InterPro:   IPR007627
- InterPro:   IPR007624
- InterPro:   IPR007630
- InterPro:   IPR013325
- InterPro:   IPR013324
- InterPro:   IPR017848
- InterPro:   IPR011991 [H]

Pfam domain/function: PF00140 Sigma70_r1_2; PF04542 Sigma70_r2; PF04539 Sigma70_r3; PF04545 Sigma70_r4 [H]

EC number: NA

Molecular weight: Translated: 36337; Mature: 36206

Theoretical pI: Translated: 6.60; Mature: 6.60

Prosite motif: PS00715 SIGMA70_1

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
4.7 %Met     (Translated Protein)
5.3 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
4.4 %Met     (Mature Protein)
5.0 %Cys+Met (Mature Protein)
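
These percentages can be reproduced by counting residues in the sequences listed earlier. The sketch below assumes the mature protein is the translated sequence without its initial Met, as the Mature_319_residues entry indicates. The helper name is hypothetical.

```python
# Translated sequence copied from the record (320 residues).
PROTEIN = (
    "MAPLAVLSDVDLVRSYLRDIGRVPLLSHEQEITLGRQVQELMSLEQLESELEGQTGEPASRKELAKAAGLSELQLKKKLQ"
    "SGRRAKERMVSANLRLVVSVAKKYTKRNMELLDLIQEGTIGLVRGVEKFDPTRGYKFSTYAYWWIRQGITRAIAEKSRTI"
    "RLPIHITEMLNKLKKGQRELSQEMGRTPTVSELAEFVELPEEEVKDLMCRARQPMSLEMKVGDGDETELLELLAGEEELP"
    "SEKVEVDCMKGDLRTLLEKLPELQGRVLRMRYGIDGGEPMNLTGIAKTLGMSRDRTRRLEREGLALMRTSSFELEAYMAV"
)


def percent(seq: str, residues: str) -> float:
    """Percentage of the sequence made up of the given one-letter residues."""
    count = sum(seq.count(r) for r in residues)
    return round(100.0 * count / len(seq), 1)


translated = PROTEIN
mature = PROTEIN[1:]  # initial Met removed (assumption, see lead-in)

for label, seq in (("Translated", translated), ("Mature", mature)):
    print(label, percent(seq, "C"), percent(seq, "M"), percent(seq, "CM"))
# Translated 0.6 4.7 5.3
# Mature 0.6 4.4 5.0
```

The counts behind these values are 2 Cys and 15 Met in the translated protein, and 2 Cys and 14 Met in the mature one.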

Secondary structure:

>Translated Secondary Structure
MAPLAVLSDVDLVRSYLRDIGRVPLLSHEQEITLGRQVQELMSLEQLESELEGQTGEPAS
CCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHH
RKELAKAAGLSELQLKKKLQSGRRAKERMVSANLRLVVSVAKKYTKRNMELLDLIQEGTI
HHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCH
GLVRGVEKFDPTRGYKFSTYAYWWIRQGITRAIAEKSRTIRLPIHITEMLNKLKKGQREL
HHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEHHHHHHHHHHHHHHHH
SQEMGRTPTVSELAEFVELPEEEVKDLMCRARQPMSLEMKVGDGDETELLELLAGEEELP
HHHHCCCCCHHHHHHHHHCCHHHHHHHHHHCCCCCEEEEEECCCCHHHHHHHHHCCCCCC
SEKVEVDCMKGDLRTLLEKLPELQGRVLRMRYGIDGGEPMNLTGIAKTLGMSRDRTRRLE
CCCEEHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCHHHHHHHH
REGLALMRTSSFELEAYMAV
HHCHHEEECCCCCEEEEECC
>Mature Secondary Structure 
APLAVLSDVDLVRSYLRDIGRVPLLSHEQEITLGRQVQELMSLEQLESELEGQTGEPAS
CCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHH
RKELAKAAGLSELQLKKKLQSGRRAKERMVSANLRLVVSVAKKYTKRNMELLDLIQEGTI
HHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCH
GLVRGVEKFDPTRGYKFSTYAYWWIRQGITRAIAEKSRTIRLPIHITEMLNKLKKGQREL
HHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEHHHHHHHHHHHHHHHH
SQEMGRTPTVSELAEFVELPEEEVKDLMCRARQPMSLEMKVGDGDETELLELLAGEEELP
HHHHCCCCCHHHHHHHHHCCHHHHHHHHHHCCCCCEEEEEECCCCHHHHHHHHHCCCCCC
SEKVEVDCMKGDLRTLLEKLPELQGRVLRMRYGIDGGEPMNLTGIAKTLGMSRDRTRRLE
CCCEEHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCHHHHHHHH
REGLALMRTSSFELEAYMAV
HHCHHEEECCCCCEEEEECC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 8665856; 1368828 [H]