Definition: Bacillus anthracis str. Ames, complete genome.
Accession: NC_003997
Length: 5,227,293 bp

The map label for this gene is sigG [H]

Identifier: 30263905

GI number: 30263905

Start: 3720596

End: 3721375

Strand: Reverse

Name: sigG [H]

Synonym: BA_4042

Alternate gene names: 30263905

Gene position: 3721375-3720596 (Counterclockwise)

Preceding gene: 30263906

Following gene: 30263904

Centisome position: 71.19
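
The coordinate fields above can be cross-checked with a little arithmetic. The sketch below is an illustration, not the database's own method: it assumes inclusive 1-based coordinates and assumes that the centisome position is the first base of the gene (the higher coordinate for a reverse-strand gene) expressed as a percentage of the genome length. The function names are hypothetical.

```python
# Values copied from the record above.
GENOME_LENGTH = 5_227_293          # length of NC_003997
START, END = 3_720_596, 3_721_375  # Start and End fields
STRAND = "Reverse"


def gene_length(start, end):
    """Number of bases spanned by inclusive 1-based coordinates."""
    return abs(end - start) + 1


def centisome(start, end, strand, genome_length):
    """Position of the gene's first base as a percentage of the genome.

    Assumption: on the reverse strand the gene begins at the higher coordinate.
    """
    first_base = max(start, end) if strand == "Reverse" else min(start, end)
    return 100.0 * first_base / genome_length


print(gene_length(START, END))                                    # 780
print(round(centisome(START, END, STRAND, GENOME_LENGTH), 2))     # 71.19
```

Under these assumptions both results agree with the record: 780 bases and a centisome position of 71.19.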

GC content: 35.64

Gene sequence:

>780_bases
TTGACGAGAAACAAAGTAGAAATTTGCGGTGTTGATACAGCTAAACTTCCCGTACTAAAAAATGATGAAATGCGTAAATT
ATTTCGTGAAATGCAAAGTGGAGAGATAAGCGCAAGAGAGAAATTAGTGAATGGAAACTTACGTCTTGTACTGAGCGTCA
TCCAAAGATTTAATAACAGAGGAGAATATGTTGACGATTTATTTCAAGTTGGTTGTATCGGACTTATGAAATCCATTGAT
AATTTTGATTTAGGCCAAAATGTAAAATTTTCAACGTATGCTGTGCCGATGATTATTGGGGAAATACGCAGATATTTGCG
TGATAACAATCCGATTCGCGTATCTCGCTCATTACGAGATATTGCGTATAAAGCGTTACAAGTGAGAGAAAAGTTGATTG
CAGAAAATTCAAAAGAACCAACAGCAATGGATATTGCAAAAGTGCTTGAAGTGACTCATGAAGAAATCGTTTTTGCTTTA
GATGCAATTCAAGATCCAGTTTCATTATTTGAACCGATTTATAACGATGGGGGAGATCCTATCTTTGTTATGGATCAGTT
AAGTGATGAAAAACAAAAGGACGAGCAGTGGGTTGAAGAGCTAGCACTAAAAGAAGGAATGAAGCGTTTAAATGATCGTG
AGAAAATGATTATTCGCAAACGTTTCTTCCAAGGGAAAACACAAATGGAAGTTGCAGAAGAAATTGGGATTTCTCAAGCA
CAAGTGTCACGTTTAGAGAAATCAGCTATTAAACAAATGAATAAGACAATTCAAGGATAA
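
A GC content value such as the one reported above is the share of G and C bases in the sequence. The following is a minimal sketch of that calculation; `gc_content` is a hypothetical helper, and the example call uses only the first line of the gene sequence, so its result is not expected to equal the whole-gene figure.

```python
def gc_content(seq):
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks are ignored so FASTA-style blocks can be pasted in.
    """
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above (not the full 780 bases).
first_line = (
    "TTGACGAGAAACAAAGTAGAAATTTGCGGTGTTGATACAGCTAAACTTCCCGTACTAAAAAATGATGAAATGCGTAAATT"
)
print(round(gc_content(first_line), 2))
```

Pasting the full 780-base block into `gc_content` should reproduce the reported 35.64, which would correspond to 278 G or C bases out of 780 (an inference from the percentage, not a number stated in the record).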

Upstream 100 bases:

>100_bases
GACAACGAAAATGAAGAATGAAGAGAAAACCGCATGTATAAAATCCCCTTCAAAGGAAATACTTTACACTGTACAGCAAC
TCCCGATAGGAGGGAACACT

Downstream 100 bases:

>100_bases
GGTTTCACCATTTATGGTGAAACCTTTTTATATGTCATATAGTTTGTTTACGCTCATAGTAGAGTAGCTGTGTTCATATA
ATGAAAACAATATGGGATTG

Product: sporulation sigma factor SigG

Products: NA

Alternate protein names: Stage III sporulation protein G [H]

Number of amino acids: Translated: 259; Mature: 258

Protein sequence:

>259_residues
MTRNKVEICGVDTAKLPVLKNDEMRKLFREMQSGEISAREKLVNGNLRLVLSVIQRFNNRGEYVDDLFQVGCIGLMKSID
NFDLGQNVKFSTYAVPMIIGEIRRYLRDNNPIRVSRSLRDIAYKALQVREKLIAENSKEPTAMDIAKVLEVTHEEIVFAL
DAIQDPVSLFEPIYNDGGDPIFVMDQLSDEKQKDEQWVEELALKEGMKRLNDREKMIIRKRFFQGKTQMEVAEEIGISQA
QVSRLEKSAIKQMNKTIQG
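
The protein sequence follows from the gene sequence by codon-by-codon translation. The sketch below uses the standard codon assignments and assumes, as is usual for bacterial genes, that an ATG, GTG or TTG start codon is read as methionine; this matters here because the gene begins with TTG. The `translate` function and its `first_codon_is_start` parameter are hypothetical names for illustration.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: these start codons are translated as Met at the first position.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna, first_codon_is_start=True):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and first_codon_is_start and codon in START_CODONS:
            residue = "M"
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("TTGACGAGAAACAAAGTAGAAATTTGCGGT"))  # MTRNKVEICG
```

The output matches the first ten residues of the protein sequence. The full gene is 780 bases, i.e. 260 codons: 259 residues plus the terminal TAA stop codon.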

Sequences:

>Translated_259_residues
MTRNKVEICGVDTAKLPVLKNDEMRKLFREMQSGEISAREKLVNGNLRLVLSVIQRFNNRGEYVDDLFQVGCIGLMKSID
NFDLGQNVKFSTYAVPMIIGEIRRYLRDNNPIRVSRSLRDIAYKALQVREKLIAENSKEPTAMDIAKVLEVTHEEIVFAL
DAIQDPVSLFEPIYNDGGDPIFVMDQLSDEKQKDEQWVEELALKEGMKRLNDREKMIIRKRFFQGKTQMEVAEEIGISQA
QVSRLEKSAIKQMNKTIQG
>Mature_258_residues
TRNKVEICGVDTAKLPVLKNDEMRKLFREMQSGEISAREKLVNGNLRLVLSVIQRFNNRGEYVDDLFQVGCIGLMKSIDN
FDLGQNVKFSTYAVPMIIGEIRRYLRDNNPIRVSRSLRDIAYKALQVREKLIAENSKEPTAMDIAKVLEVTHEEIVFALD
AIQDPVSLFEPIYNDGGDPIFVMDQLSDEKQKDEQWVEELALKEGMKRLNDREKMIIRKRFFQGKTQMEVAEEIGISQAQ
VSRLEKSAIKQMNKTIQG

Specific function: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor is responsible for the expression of sporulation-specific genes in the forespore. [H]

COG id: COG1191

COG function: function code K; DNA-directed RNA polymerase specialized sigma subunit

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the sigma-70 factor family [H]

Homologues:

Organism=Escherichia coli, GI1789448, Length=247, Percent_Identity=28.7449392712551, Blast_Score=91, Evalue=6e-20,
Organism=Escherichia coli, GI1789098, Length=257, Percent_Identity=25.6809338521401, Blast_Score=87, Evalue=1e-18,
Organism=Escherichia coli, GI1789871, Length=260, Percent_Identity=27.3076923076923, Blast_Score=78, Evalue=6e-16,
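
Each homologue line is a comma-separated list of `key=value` pairs plus a bare GI identifier, so it can be parsed mechanically. The sketch below is one possible parser; `parse_homologue` is a hypothetical helper, and reading `Length` times `Percent_Identity` as a count of identical residues is an assumption about how the percentage was derived.

```python
def parse_homologue(line):
    """Parse one homologue line from the record into a dictionary."""
    record = {}
    for part in line.strip().rstrip(",").split(","):
        part = part.strip()
        if "=" in part:
            key, value = part.split("=", 1)
            record[key] = value
        elif part.startswith("GI"):
            record["GI"] = part[2:]  # bare identifier such as GI1789448
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


line = (
    "Organism=Escherichia coli, GI1789448, Length=247, "
    "Percent_Identity=28.7449392712551, Blast_Score=91, Evalue=6e-20,"
)
hit = parse_homologue(line)
# Assumed interpretation: identical residues = Length * Percent_Identity / 100.
identical = round(hit["Length"] * hit["Percent_Identity"] / 100)
print(hit["Organism"], hit["GI"], identical)  # Escherichia coli 1789448 71
```

Under that interpretation, the long decimal in the first hit is simply 71/247 expressed as a percentage.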

Paralogues:

None

Copy number: 700 (log & stationary phase) [C]

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR014284
- InterPro:   IPR014322
- InterPro:   IPR014212
- InterPro:   IPR000943
- InterPro:   IPR007627
- InterPro:   IPR007624
- InterPro:   IPR007630
- InterPro:   IPR013325
- InterPro:   IPR013324
- InterPro:   IPR011991 [H]

Pfam domain/function: PF04542 Sigma70_r2; PF04539 Sigma70_r3; PF04545 Sigma70_r4 [H]

EC number: NA

Molecular weight: Translated: 29788; Mature: 29657

Theoretical pI: Translated: 6.14; Mature: 6.14

Prosite motif: PS00715 SIGMA70_1; PS00716 SIGMA70_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.8 %Cys     (Translated Protein)
4.2 %Met     (Translated Protein)
5.0 %Cys+Met (Translated Protein)
0.8 %Cys     (Mature Protein)
3.9 %Met     (Mature Protein)
4.7 %Cys+Met (Mature Protein)
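
The percentages above can be recomputed directly from the protein sequence. The sketch below copies the translated sequence from this record and assumes that the mature form is the translated form without its initiator methionine, which is what the Mature_258_residues entry shows. The `percent` helper is a hypothetical name.

```python
# Translated protein sequence copied from the record (259 residues).
TRANSLATED = (
    "MTRNKVEICGVDTAKLPVLKNDEMRKLFREMQSGEISAREKLVNGNLRLVLSVIQRFNNRGEYVDDLFQVGCIGLMKSID"
    "NFDLGQNVKFSTYAVPMIIGEIRRYLRDNNPIRVSRSLRDIAYKALQVREKLIAENSKEPTAMDIAKVLEVTHEEIVFAL"
    "DAIQDPVSLFEPIYNDGGDPIFVMDQLSDEKQKDEQWVEELALKEGMKRLNDREKMIIRKRFFQGKTQMEVAEEIGISQA"
    "QVSRLEKSAIKQMNKTIQG"
)
# Assumption: the mature protein lacks only the N-terminal Met.
MATURE = TRANSLATED[1:]


def percent(seq, residues):
    """Percentage of positions in seq occupied by any of the given residues."""
    hits = sum(seq.count(r) for r in residues)
    return round(100.0 * hits / len(seq), 1)


for name, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(name, percent(seq, "C"), percent(seq, "M"), percent(seq, "CM"))
# Translated 0.8 4.2 5.0
# Mature 0.8 3.9 4.7
```

The sequence contains 2 Cys and 11 Met residues, so removing the initiator Met lowers the Met share from 11/259 to 10/258 while leaving the rounded Cys share unchanged.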

Secondary structure:

>Translated Secondary Structure
MTRNKVEICGVDTAKLPVLKNDEMRKLFREMQSGEISAREKLVNGNLRLVLSVIQRFNNR
CCCCCEEEECCCHHHCCCCCCHHHHHHHHHHHCCCHHHHHHHHCCCHHHHHHHHHHHCCC
GEYVDDLFQVGCIGLMKSIDNFDLGQNVKFSTYAVPMIIGEIRRYLRDNNPIRVSRSLRD
CHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEHHHHHHH
IAYKALQVREKLIAENSKEPTAMDIAKVLEVTHEEIVFALDAIQDPVSLFEPIYNDGGDP
HHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHCCCCCE
IFVMDQLSDEKQKDEQWVEELALKEGMKRLNDREKMIIRKRFFQGKTQMEVAEEIGISQA
EEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCHH
QVSRLEKSAIKQMNKTIQG
HHHHHHHHHHHHHHHHCCC
>Mature Secondary Structure 
TRNKVEICGVDTAKLPVLKNDEMRKLFREMQSGEISAREKLVNGNLRLVLSVIQRFNNR
CCCCEEEECCCHHHCCCCCCHHHHHHHHHHHCCCHHHHHHHHCCCHHHHHHHHHHHCCC
GEYVDDLFQVGCIGLMKSIDNFDLGQNVKFSTYAVPMIIGEIRRYLRDNNPIRVSRSLRD
CHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEHHHHHHH
IAYKALQVREKLIAENSKEPTAMDIAKVLEVTHEEIVFALDAIQDPVSLFEPIYNDGGDP
HHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHCCCCCE
IFVMDQLSDEKQKDEQWVEELALKEGMKRLNDREKMIIRKRFFQGKTQMEVAEEIGISQA
EEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCHH
QVSRLEKSAIKQMNKTIQG
HHHHHHHHHHHHHHHHCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 2497052; 2459711; 9384377 [H]