| Definition | Clostridium botulinum A str. ATCC 19397, complete genome. |
|---|---|
| Accession | NC_009697 |
| Length | 3,863,450 |
The map label for this gene is sigG [H]
Identifier: 153933176
GI number: 153933176
Start: 2542472
End: 2543245
Strand: Reverse
Name: sigG [H]
Synonym: CLB_2408
Alternate gene names: 153933176
Gene position: 2543245-2542472 (Counterclockwise)
Preceding gene: 153932419
Following gene: 153933553
Centisome position: 65.83
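The coordinate fields above can be cross-checked with a few lines of arithmetic. This is a minimal sketch, assuming 1-based inclusive coordinates and assuming the centisome position is the gene's first base (the End coordinate, since the gene is on the reverse strand) expressed as a percentage of the genome length. The record states neither convention, and the variable names are illustrative only.

```python
# Values copied from the record.
GENOME_LENGTH = 3_863_450
START, END = 2_542_472, 2_543_245  # reverse strand, so the gene begins at END

# Assumption: coordinates are 1-based and inclusive.
gene_length = END - START + 1

# One codon is the stop signal, so the protein is one residue shorter
# than the number of codons.
protein_length = gene_length // 3 - 1

# Assumption: centisome = position of the gene's first base as a
# percentage of the genome length.
centisome = round(100 * END / GENOME_LENGTH, 2)

print(gene_length, protein_length, centisome)  # 774 257 65.83
```

Under these assumptions the results agree with the 774-base gene sequence, the 257-residue protein, and the centisome position listed in the record.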
GC content: 29.59
Gene sequence:
>774_bases ATGGTTATTAATAAAGTTGAAATTTGTGGAGTAAATACATCAAAATTACCAGTATTAAAGGATAAAGAGATGAAAAAATT ACTGGTAAGGATTAGGAACGGTGAAACGGAATGTAGAGAAGAATTTATACAAGGAAATTTAAGGTTAGTATTAAGTGTAA TAAAAAGATTTAACAATCGAGGGGAAAATGTGGATGATCTTTTTCAAGTTGGATGTATAGGCCTTATAAAAGCCATAGAC AATTTTGATTTAAGTCAAAATGTAAAATTCTCTACTTATGCTGTTCCTATGATAATAGGAGAAATAAGAAGATACCTAAG AGATAATAATTCCATAAGAGTTAGTAGATCTTTAAGAGATATAGCCTATAAAGCTCTACAGGCTAGAGACAAATTAATAA AAAATAATAATAAAGAACCAACAGTGTCTCAAATAGCTAAAGAATTAGAACTACCAAGAGAAGAGGTAGTGTTTGCTTTA GACGCTATACAGGATCCAGTGTCTTTGTTTGAACCTATATATCATGATGGTGGTGACGCCATATTTGTTATGGATCAAAT AAGTGACACTAAAAATATAGATGAGAATTGGATAGAAAATATATCTATAAAAGAAGCTATGAAAAAATTAAATGACAGAG AAAAACTAATACTTAATTTAAGGTTTTTTGATGGAAGAACTCAAATGGAAGTAGCCGATGAAATAGGCATATCTCAAGCA CAGGTATCTAGGTTAGAAAAAACTGCTTTAAAACATATGAGAAAGTATGTTTAA
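The GC content field can in principle be recomputed from the gene sequence. The helper below is a sketch, not the method used to build this record: `gc_content` is a hypothetical name, and it removes whitespace first because the sequence above is split into space-separated chunks. It is demonstrated on the first four codons only; the result for the full sequence has not been checked here against the reported value.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100 * gc / len(bases)


# First four codons of the gene sequence above (ATG GTT ATT AAT):
# 2 of the 12 bases are G or C.
print(round(gc_content("ATGGTT ATTAAT"), 2))  # 16.67
```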
Upstream 100 bases:
>100_bases AAAAAGAAATAAATAAAATGGTTTAGTTTGTCCTTAGTATAAAATTAGATTCCTTTGGAAACAATTTAAATGCAATCATT CTGAAGGGACTGATGACTTT
Downstream 100 bases:
>100_bases TCAAAAAATGTTCTTAAATTATTAGAGAACACTTTAAGCTACTGATAAATATTTTTTATCAGTAGCTTTTTATTTTTTAA TGTTTTTCATAGAATAGGAT
Product: sporulation sigma factor SigG
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 257; Mature: 257
Protein sequence:
>257_residues MVINKVEICGVNTSKLPVLKDKEMKKLLVRIRNGETECREEFIQGNLRLVLSVIKRFNNRGENVDDLFQVGCIGLIKAID NFDLSQNVKFSTYAVPMIIGEIRRYLRDNNSIRVSRSLRDIAYKALQARDKLIKNNNKEPTVSQIAKELELPREEVVFAL DAIQDPVSLFEPIYHDGGDAIFVMDQISDTKNIDENWIENISIKEAMKKLNDREKLILNLRFFDGRTQMEVADEIGISQA QVSRLEKTALKHMRKYV
Sequences:
>Translated_257_residues MVINKVEICGVNTSKLPVLKDKEMKKLLVRIRNGETECREEFIQGNLRLVLSVIKRFNNRGENVDDLFQVGCIGLIKAID NFDLSQNVKFSTYAVPMIIGEIRRYLRDNNSIRVSRSLRDIAYKALQARDKLIKNNNKEPTVSQIAKELELPREEVVFAL DAIQDPVSLFEPIYHDGGDAIFVMDQISDTKNIDENWIENISIKEAMKKLNDREKLILNLRFFDGRTQMEVADEIGISQA QVSRLEKTALKHMRKYV
>Mature_257_residues MVINKVEICGVNTSKLPVLKDKEMKKLLVRIRNGETECREEFIQGNLRLVLSVIKRFNNRGENVDDLFQVGCIGLIKAID NFDLSQNVKFSTYAVPMIIGEIRRYLRDNNSIRVSRSLRDIAYKALQARDKLIKNNNKEPTVSQIAKELELPREEVVFAL DAIQDPVSLFEPIYHDGGDAIFVMDQISDTKNIDENWIENISIKEAMKKLNDREKLILNLRFFDGRTQMEVADEIGISQA QVSRLEKTALKHMRKYV
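The relationship between the gene sequence and the protein sequences can be illustrated with a small translator. This is a sketch assuming the standard codon assignments and assuming the gene sequence in this record is already given on the coding strand (it begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical, and unknown codons are reported as `X`.

```python
from itertools import product

# Standard codon assignments, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate coding-strand DNA, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First seven codons of the gene, then its last three (including the TAA stop).
print(translate("ATGGTTATTAATAAAGTTGAA"))  # MVINKVE
print(translate("TATGTTTAA"))              # YV
```

Both fragments match the start (MVINKVE) and end (YV) of the 257-residue protein listed above.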
Specific function: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor is responsible for the expression of sporulation-specific genes [H]
COG id: COG1191
COG function: function code K; DNA-directed RNA polymerase specialized sigma subunit
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the sigma-70 factor family [H]
Homologues:
Organism=Escherichia coli, GI1789448, Length=238, Percent_Identity=29.4117647058824, Blast_Score=96, Evalue=2e-21
Organism=Escherichia coli, GI1789098, Length=249, Percent_Identity=27.710843373494, Blast_Score=87, Evalue=9e-19
Organism=Escherichia coli, GI1789871, Length=259, Percent_Identity=27.027027027027, Blast_Score=78, Evalue=5e-16
Organism=Escherichia coli, GI1788231, Length=223, Percent_Identity=27.3542600896861, Blast_Score=68, Evalue=6e-13
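Each homologue entry is a comma-separated list of `key=value` fields plus a bare GI identifier, so it can be turned into a structured record for sorting or filtering. The parser below is a sketch: `parse_homologue` is a hypothetical helper, and the `GI` key is supplied by the code because the record gives that field no key of its own.

```python
def parse_homologue(entry: str) -> dict:
    """Parse one comma-separated homologue entry into a dictionary."""
    record = {}
    for field in entry.split(","):
        field = field.strip()
        if not field:
            continue  # skips the empty piece left by a trailing comma
        if "=" in field:
            key, value = field.split("=", 1)
            record[key.strip()] = value.strip()
        elif field.startswith("GI"):
            # The GI identifier has no key in the record, so one is supplied here.
            record["GI"] = field[2:]
    # Convert numeric fields; E-values such as "2e-21" parse as floats.
    for key, cast in (("Length", int), ("Percent_Identity", float),
                      ("Blast_Score", float), ("Evalue", float)):
        if key in record:
            record[key] = cast(record[key])
    return record


entry = ("Organism=Escherichia coli, GI1789448, Length=238, "
         "Percent_Identity=29.4117647058824, Blast_Score=96, Evalue=2e-21,")
hit = parse_homologue(entry)
print(hit["Organism"], hit["GI"], hit["Length"], round(hit["Percent_Identity"], 1))
# Escherichia coli 1789448 238 29.4
```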
Paralogues:
None
Copy number: 700 (log & stationary phase) [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR014284
- InterPro: IPR014322
- InterPro: IPR014212
- InterPro: IPR000943
- InterPro: IPR007627
- InterPro: IPR007624
- InterPro: IPR007630
- InterPro: IPR013325
- InterPro: IPR013324
- InterPro: IPR011991 [H]
Pfam domain/function: PF04542 Sigma70_r2; PF04539 Sigma70_r3; PF04545 Sigma70_r4 [H]
EC number: NA
Molecular weight: Translated: 29629; Mature: 29629
Theoretical pI: Translated: 8.44; Mature: 8.44
Prosite motif: PS00715 SIGMA70_1; PS00716 SIGMA70_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.2 %Cys (Translated Protein)
2.7 %Met (Translated Protein)
3.9 %Cys+Met (Translated Protein)
1.2 %Cys (Mature Protein)
2.7 %Met (Mature Protein)
3.9 %Cys+Met (Mature Protein)
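The Cys/Met percentages follow from residue counts in the protein sequence. The sketch below copies the 257-residue sequence from this record and assumes the percentages are residue counts divided by sequence length, rounded to one decimal place. The names `PROTEIN` and `percent` are illustrative.

```python
# Protein sequence copied from the record; whitespace between chunks is removed.
PROTEIN = "".join("""
MVINKVEICGVNTSKLPVLKDKEMKKLLVRIRNGETECREEFIQGNLRLVLSVIKRFNNRGENVDDLFQVGCIGLIKAID
NFDLSQNVKFSTYAVPMIIGEIRRYLRDNNSIRVSRSLRDIAYKALQARDKLIKNNNKEPTVSQIAKELELPREEVVFAL
DAIQDPVSLFEPIYHDGGDAIFVMDQISDTKNIDENWIENISIKEAMKKLNDREKLILNLRFFDGRTQMEVADEIGISQA
QVSRLEKTALKHMRKYV
""".split())


def percent(count: int, total: int) -> float:
    """Return count as a percentage of total, rounded to one decimal place."""
    return round(100 * count / total, 1)


n_cys = PROTEIN.count("C")
n_met = PROTEIN.count("M")
total = len(PROTEIN)

print(total, n_cys, n_met)
print(percent(n_cys, total), percent(n_met, total), percent(n_cys + n_met, total))
```

With 3 cysteines and 7 methionines in 257 residues, this should reproduce the 1.2, 2.7, and 3.9 percent figures listed above.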
Secondary structure:
>Translated Secondary Structure
MVINKVEICGVNTSKLPVLKDKEMKKLLVRIRNGETECREEFIQGNLRLVLSVIKRFNNR
CCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHCCHHHHHHHHHHHHCCC
GENVDDLFQVGCIGLIKAIDNFDLSQNVKFSTYAVPMIIGEIRRYLRDNNSIRVSRSLRD
CCCHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEHHHHHH
IAYKALQARDKLIKNNNKEPTVSQIAKELELPREEVVFALDAIQDPVSLFEPIYHDGGDA
HHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHCCCCE
IFVMDQISDTKNIDENWIENISIKEAMKKLNDREKLILNLRFFDGRTQMEVADEIGISQA
EEEEECCCCCCCCCHHHHHCCCHHHHHHHHCCCHHEEEEEEEECCCCHHHHHHHHCCCHH
QVSRLEKTALKHMRKYV
HHHHHHHHHHHHHHHCC
>Mature Secondary Structure
MVINKVEICGVNTSKLPVLKDKEMKKLLVRIRNGETECREEFIQGNLRLVLSVIKRFNNR
CCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHCCHHHHHHHHHHHHCCC
GENVDDLFQVGCIGLIKAIDNFDLSQNVKFSTYAVPMIIGEIRRYLRDNNSIRVSRSLRD
CCCHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEHHHHHH
IAYKALQARDKLIKNNNKEPTVSQIAKELELPREEVVFALDAIQDPVSLFEPIYHDGGDA
HHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCHHHHHHHHHHHHCHHHHHHHHHHCCCCE
IFVMDQISDTKNIDENWIENISIKEAMKKLNDREKLILNLRFFDGRTQMEVADEIGISQA
EEEEECCCCCCCCCHHHHHCCCHHHHHHHHCCCHHEEEEEEEECCCCHHHHHHHHCCCHH
QVSRLEKTALKHMRKYV
HHHHHHHHHHHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 7961408; 7883192; 11466286 [H]