| Definition | Bacillus anthracis str. Ames, complete genome. |
|---|---|
| Accession | NC_003997 |
| Length | 5,227,293 |
Click here to switch to the map view.
The map label for this gene is sigG [H]
Identifier: 30263905
GI number: 30263905
Start: 3720596
End: 3721375
Strand: Reverse
Name: sigG [H]
Synonym: BA_4042
Alternate gene names: 30263905
Gene position: 3721375-3720596 (Counterclockwise)
Preceding gene: 30263906
Following gene: 30263904
Centisome position: 71.19
GC content: 35.64
Gene sequence:
>780_bases TTGACGAGAAACAAAGTAGAAATTTGCGGTGTTGATACAGCTAAACTTCCCGTACTAAAAAATGATGAAATGCGTAAATT ATTTCGTGAAATGCAAAGTGGAGAGATAAGCGCAAGAGAGAAATTAGTGAATGGAAACTTACGTCTTGTACTGAGCGTCA TCCAAAGATTTAATAACAGAGGAGAATATGTTGACGATTTATTTCAAGTTGGTTGTATCGGACTTATGAAATCCATTGAT AATTTTGATTTAGGCCAAAATGTAAAATTTTCAACGTATGCTGTGCCGATGATTATTGGGGAAATACGCAGATATTTGCG TGATAACAATCCGATTCGCGTATCTCGCTCATTACGAGATATTGCGTATAAAGCGTTACAAGTGAGAGAAAAGTTGATTG CAGAAAATTCAAAAGAACCAACAGCAATGGATATTGCAAAAGTGCTTGAAGTGACTCATGAAGAAATCGTTTTTGCTTTA GATGCAATTCAAGATCCAGTTTCATTATTTGAACCGATTTATAACGATGGGGGAGATCCTATCTTTGTTATGGATCAGTT AAGTGATGAAAAACAAAAGGACGAGCAGTGGGTTGAAGAGCTAGCACTAAAAGAAGGAATGAAGCGTTTAAATGATCGTG AGAAAATGATTATTCGCAAACGTTTCTTCCAAGGGAAAACACAAATGGAAGTTGCAGAAGAAATTGGGATTTCTCAAGCA CAAGTGTCACGTTTAGAGAAATCAGCTATTAAACAAATGAATAAGACAATTCAAGGATAA
Upstream 100 bases:
>100_bases GACAACGAAAATGAAGAATGAAGAGAAAACCGCATGTATAAAATCCCCTTCAAAGGAAATACTTTACACTGTACAGCAAC TCCCGATAGGAGGGAACACT
Downstream 100 bases:
>100_bases GGTTTCACCATTTATGGTGAAACCTTTTTATATGTCATATAGTTTGTTTACGCTCATAGTAGAGTAGCTGTGTTCATATA ATGAAAACAATATGGGATTG
Product: sporulation sigma factor SigG
Products: NA
Alternate protein names: Stage III sporulation protein G [H]
Number of amino acids: Translated: 259; Mature: 258
Protein sequence:
>259_residues MTRNKVEICGVDTAKLPVLKNDEMRKLFREMQSGEISAREKLVNGNLRLVLSVIQRFNNRGEYVDDLFQVGCIGLMKSID NFDLGQNVKFSTYAVPMIIGEIRRYLRDNNPIRVSRSLRDIAYKALQVREKLIAENSKEPTAMDIAKVLEVTHEEIVFAL DAIQDPVSLFEPIYNDGGDPIFVMDQLSDEKQKDEQWVEELALKEGMKRLNDREKMIIRKRFFQGKTQMEVAEEIGISQA QVSRLEKSAIKQMNKTIQG
Sequences:
>Translated_259_residues MTRNKVEICGVDTAKLPVLKNDEMRKLFREMQSGEISAREKLVNGNLRLVLSVIQRFNNRGEYVDDLFQVGCIGLMKSID NFDLGQNVKFSTYAVPMIIGEIRRYLRDNNPIRVSRSLRDIAYKALQVREKLIAENSKEPTAMDIAKVLEVTHEEIVFAL DAIQDPVSLFEPIYNDGGDPIFVMDQLSDEKQKDEQWVEELALKEGMKRLNDREKMIIRKRFFQGKTQMEVAEEIGISQA QVSRLEKSAIKQMNKTIQG >Mature_258_residues TRNKVEICGVDTAKLPVLKNDEMRKLFREMQSGEISAREKLVNGNLRLVLSVIQRFNNRGEYVDDLFQVGCIGLMKSIDN FDLGQNVKFSTYAVPMIIGEIRRYLRDNNPIRVSRSLRDIAYKALQVREKLIAENSKEPTAMDIAKVLEVTHEEIVFALD AIQDPVSLFEPIYNDGGDPIFVMDQLSDEKQKDEQWVEELALKEGMKRLNDREKMIIRKRFFQGKTQMEVAEEIGISQAQ VSRLEKSAIKQMNKTIQG
Specific function: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor is responsible for the expression of sporulation specific genes in the forespore [H]
COG id: COG1191
COG function: function code K; DNA-directed RNA polymerase specialized sigma subunit
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the sigma-70 factor family [H]
Homologues:
Organism=Escherichia coli, GI1789448, Length=247, Percent_Identity=28.7449392712551, Blast_Score=91, Evalue=6e-20, Organism=Escherichia coli, GI1789098, Length=257, Percent_Identity=25.6809338521401, Blast_Score=87, Evalue=1e-18, Organism=Escherichia coli, GI1789871, Length=260, Percent_Identity=27.3076923076923, Blast_Score=78, Evalue=6e-16,
Paralogues:
None
Copy number: 700 (log & stationary phase) [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR014284 - InterPro: IPR014322 - InterPro: IPR014212 - InterPro: IPR000943 - InterPro: IPR007627 - InterPro: IPR007624 - InterPro: IPR007630 - InterPro: IPR013325 - InterPro: IPR013324 - InterPro: IPR011991 [H]
Pfam domain/function: PF04542 Sigma70_r2; PF04539 Sigma70_r3; PF04545 Sigma70_r4 [H]
EC number: NA
Molecular weight: Translated: 29788; Mature: 29657
Theoretical pI: Translated: 6.14; Mature: 6.14
Prosite motif: PS00715 SIGMA70_1 ; PS00716 SIGMA70_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.8 %Cys (Translated Protein) 4.2 %Met (Translated Protein) 5.0 %Cys+Met (Translated Protein) 0.8 %Cys (Mature Protein) 3.9 %Met (Mature Protein) 4.7 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MTRNKVEICGVDTAKLPVLKNDEMRKLFREMQSGEISAREKLVNGNLRLVLSVIQRFNNR CCCCCEEEECCCHHHCCCCCCHHHHHHHHHHHCCCHHHHHHHHCCCHHHHHHHHHHHCCC GEYVDDLFQVGCIGLMKSIDNFDLGQNVKFSTYAVPMIIGEIRRYLRDNNPIRVSRSLRD CHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEHHHHHHH IAYKALQVREKLIAENSKEPTAMDIAKVLEVTHEEIVFALDAIQDPVSLFEPIYNDGGDP HHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHCCCCCE IFVMDQLSDEKQKDEQWVEELALKEGMKRLNDREKMIIRKRFFQGKTQMEVAEEIGISQA EEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCHH QVSRLEKSAIKQMNKTIQG HHHHHHHHHHHHHHHHCCC >Mature Secondary Structure TRNKVEICGVDTAKLPVLKNDEMRKLFREMQSGEISAREKLVNGNLRLVLSVIQRFNNR CCCCEEEECCCHHHCCCCCCHHHHHHHHHHHCCCHHHHHHHHCCCHHHHHHHHHHHCCC GEYVDDLFQVGCIGLMKSIDNFDLGQNVKFSTYAVPMIIGEIRRYLRDNNPIRVSRSLRD CHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEHHHHHHH IAYKALQVREKLIAENSKEPTAMDIAKVLEVTHEEIVFALDAIQDPVSLFEPIYNDGGDP HHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHCCCCCE IFVMDQLSDEKQKDEQWVEELALKEGMKRLNDREKMIIRKRFFQGKTQMEVAEEIGISQA EEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCHH QVSRLEKSAIKQMNKTIQG HHHHHHHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 2497052; 2459711; 9384377 [H]