| Definition | Clostridium botulinum A str. ATCC 19397, complete genome. |
|---|---|
| Accession | NC_009697 |
| Length | 3,863,450 |
The map label for this gene is sigE [H]
Identifier: 153932419
GI number: 153932419
Start: 2543320
End: 2544027
Strand: Reverse
Name: sigE [H]
Synonym: CLB_2409
Alternate gene names: 153932419
Gene position: 2544027-2543320 (Counterclockwise)
Preceding gene: 153931338
Following gene: 153933176
Centisome position: 65.85
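The coordinate fields above are linked by simple arithmetic. The sketch below is a hedged illustration, not the database's documented method: it assumes coordinates are 1-based and inclusive, and that the centisome position is the gene's first transcribed base (the End coordinate for a reverse-strand gene) expressed as a percentage of the genome length. The function names are hypothetical.

```python
GENOME_LENGTH = 3_863_450          # Length field of NC_009697
START, END = 2_543_320, 2_544_027  # Start / End fields of this record
STRAND = "Reverse"                 # Strand field of this record


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, end: int, strand: str, genome_length: int) -> float:
    """Position of the gene's first base as a percentage of the genome.

    Assumption: a reverse-strand gene begins at its End coordinate.
    """
    first_base = end if strand == "Reverse" else start
    return round(100 * first_base / genome_length, 2)


if __name__ == "__main__":
    print(gene_length(START, END))
    print(centisome(START, END, STRAND, GENOME_LENGTH))
```

Under these assumptions the computed length agrees with the 708-base gene sequence, and the centisome value agrees with the field above.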
GC content: 26.84
Gene sequence:
>708_bases
ATGATAAATTTAAAGGTATTATTAAACAGAATATTAATAAAATTTAAATTGTTTTTTAAAAGCGTTTATTACATAGGTGG
GAATGATGCACTTCCACCGCCACTTTCAAAAGAAGAAGAAGAATATTTTGTACAAAGGTTAATAAATGGAGATGAAAAAG
TTCGATCCGTTTTAATAGAAAGAAATTTAAGATTAGTGGTTTATATAGCTAGAAAATTTGAAAATACAGGCATATGTATA
GAAGATCTAGTATCCGTAGGAACTATAGGACTAATTAAGGCAGTAAATACTTTTAAACCAGACAAAAAAATTAAATTAGC
AACCTATGCTTCAAGATGTATAGAAAATGAGATATTAATGTATTTAAGAAGAAATAGTAAAGTAAAAGCAGAAATATCCT
TTTATGAACCTTTAAATATTGATTGGGATGGAAATGAGCTATTACTTTCAGATATATTGGGAACAGACAATGATGAGGTC
TATAATTTAATAGAAGATGAAGTAGATAAACAATTACTATTATTAGCTATGAAAAAATTGAACGAAAGAGAAAAGGAAAT
AGTAAGATTAAGATTTGGTCTTAATGGGAAGAGAGAAAAAACTCAAAAAGAAGTAGCAGATATGCTGGGTATATCTCAGT
CCTATATATCAAGGCTAGAAAAAAGAATAATTAAGACTCTAAAAAAAGAAATAAATAAAATGGTTTAG
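The GC content field is a percentage of G and C bases over the gene sequence. A minimal sketch of that calculation follows; the function name is hypothetical, and it assumes the input contains only base letters plus whitespace (so the FASTA header line must be removed first).

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, rounded to two decimals.

    Whitespace (spaces, line breaks) is ignored so that a sequence
    pasted in wrapped form can be passed in directly.
    """
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return round(100 * gc_count / len(seq), 2)


if __name__ == "__main__":
    # Toy input for illustration only; paste the gene sequence here
    # (without its '>708_bases' header) to check the GC content field.
    print(gc_percent("ATGC ATAT"))
```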
Upstream 100 bases:
>100_bases
AAAAAAAAGAAGTTATAATAGCACTTTCAGAAGGTAAACTTAGTGGTATAAAAGACTATAGAGCACTTTTATCCCGAGGA
ATTATATAGGAGGGACTTTT
Downstream 100 bases:
>100_bases
TTTGTCCTTAGTATAAAATTAGATTCCTTTGGAAACAATTTAAATGCAATCATTCTGAAGGGACTGATGACTTTATGGTT
ATTAATAAAGTTGAAATTTG
Product: sporulation sigma factor SigE
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 235; Mature: 235
Protein sequence:
>235_residues
MINLKVLLNRILIKFKLFFKSVYYIGGNDALPPPLSKEEEEYFVQRLINGDEKVRSVLIERNLRLVVYIARKFENTGICI
EDLVSVGTIGLIKAVNTFKPDKKIKLATYASRCIENEILMYLRRNSKVKAEISFYEPLNIDWDGNELLLSDILGTDNDEV
YNLIEDEVDKQLLLLAMKKLNEREKEIVRLRFGLNGKREKTQKEVADMLGISQSYISRLEKRIIKTLKKEINKMV
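The residue count follows from the gene length: 708 bases give 236 codons, and the final codon (TAG) is a stop, leaving 235 residues. The sketch below illustrates this with a compact standard codon table; it is an illustration under stated assumptions rather than the database's pipeline. It assumes the listed gene sequence is already the coding strand (it starts with ATG and ends with TAG) and that the standard codon assignments apply. All names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard genetic code).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def residues_from_bases(n_bases: int) -> int:
    """Residue count for an ORF of n_bases that ends with one stop codon."""
    return n_bases // 3 - 1


if __name__ == "__main__":
    print(translate("ATGATAAATTTAAAGGTATTATTA"))  # first 8 codons of the gene
    print(residues_from_bases(708))
```

The first eight codons of the gene sequence translate to the first eight residues of the protein sequence above, which supports the coding-strand assumption.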
Sequences:
>Translated_235_residues
MINLKVLLNRILIKFKLFFKSVYYIGGNDALPPPLSKEEEEYFVQRLINGDEKVRSVLIERNLRLVVYIARKFENTGICI
EDLVSVGTIGLIKAVNTFKPDKKIKLATYASRCIENEILMYLRRNSKVKAEISFYEPLNIDWDGNELLLSDILGTDNDEV
YNLIEDEVDKQLLLLAMKKLNEREKEIVRLRFGLNGKREKTQKEVADMLGISQSYISRLEKRIIKTLKKEINKMV
>Mature_235_residues
MINLKVLLNRILIKFKLFFKSVYYIGGNDALPPPLSKEEEEYFVQRLINGDEKVRSVLIERNLRLVVYIARKFENTGICI
EDLVSVGTIGLIKAVNTFKPDKKIKLATYASRCIENEILMYLRRNSKVKAEISFYEPLNIDWDGNELLLSDILGTDNDEV
YNLIEDEVDKQLLLLAMKKLNEREKEIVRLRFGLNGKREKTQKEVADMLGISQSYISRLEKRIIKTLKKEINKMV
Specific function: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor is responsible for the expression of sporulation specific genes [H]
COG id: COG1191
COG function: function code K; DNA-directed RNA polymerase specialized sigma subunit
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the sigma-70 factor family [H]
Homologues:
Organism=Escherichia coli, GI1789098, Length=277, Percent_Identity=28.5198555956679, Blast_Score=88, Evalue=6e-19
Organism=Escherichia coli, GI1789871, Length=106, Percent_Identity=37.7358490566038, Blast_Score=71, Evalue=7e-14
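Each homologue entry is a comma-separated run of fields that begins with `Organism=`. The following sketch shows one way to split such text into records; the function name and the `GI` dictionary key are hypothetical, and the parser assumes organism names contain no commas.

```python
def parse_homologues(text: str) -> list[dict[str, str]]:
    """Split homologue text into one dict per 'Organism=' entry."""
    records = []
    for chunk in text.split("Organism=")[1:]:
        fields = [f.strip() for f in chunk.strip().rstrip(",").split(",")]
        # The first two fields carry no 'key=' prefix: organism name, GI id.
        record = {"Organism": fields[0], "GI": fields[1]}
        for field in fields[2:]:
            key, value = field.split("=", 1)
            record[key] = value
        records.append(record)
    return records


SAMPLE = (
    "Organism=Escherichia coli, GI1789098, Length=277, "
    "Percent_Identity=28.5198555956679, Blast_Score=88, Evalue=6e-19, "
    "Organism=Escherichia coli, GI1789871, Length=106, "
    "Percent_Identity=37.7358490566038, Blast_Score=71, Evalue=7e-14,"
)

if __name__ == "__main__":
    for hit in parse_homologues(SAMPLE):
        print(hit["GI"], round(float(hit["Percent_Identity"]), 1), hit["Evalue"])
```

Values are kept as strings so nothing is lost; convert with `float()` or `int()` where numeric comparison is needed.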
Paralogues:
None
Copy number: <10 (log phase); 250 (stationary phase) [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR014284
- InterPro: IPR014200
- InterPro: IPR016263
- InterPro: IPR000943
- InterPro: IPR007627
- InterPro: IPR007630
- InterPro: IPR013325
- InterPro: IPR013324
- InterPro: IPR011991 [H]
Pfam domain/function: PF04542 Sigma70_r2; PF04545 Sigma70_r4 [H]
EC number: NA
Molecular weight: Translated: 27348; Mature: 27348
Theoretical pI: Translated: 9.53; Mature: 9.53
Prosite motif: PS00715 SIGMA70_1 ; PS00716 SIGMA70_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.9 %Cys (Translated Protein)
2.1 %Met (Translated Protein)
3.0 %Cys+Met (Translated Protein)
0.9 %Cys (Mature Protein)
2.1 %Met (Mature Protein)
3.0 %Cys+Met (Mature Protein)
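The Cys/Met percentages can be reproduced by counting residues in the 235-residue protein sequence. A minimal sketch, assuming the percentages are residue counts divided by sequence length and rounded to one decimal; the function name is hypothetical.

```python
# Protein sequence copied from this record (translated and mature are identical).
PROTEIN = (
    "MINLKVLLNRILIKFKLFFKSVYYIGGNDALPPPLSKEEEEYFVQRLINGDEKVRSVLIERNLRLVVYIARKFENTGICI"
    "EDLVSVGTIGLIKAVNTFKPDKKIKLATYASRCIENEILMYLRRNSKVKAEISFYEPLNIDWDGNELLLSDILGTDNDEV"
    "YNLIEDEVDKQLLLLAMKKLNEREKEIVRLRFGLNGKREKTQKEVADMLGISQSYISRLEKRIIKTLKKEINKMV"
)


def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given one-letter codes."""
    count = sum(sequence.count(r) for r in residues)
    return round(100 * count / len(sequence), 1)


if __name__ == "__main__":
    print(residue_percent(PROTEIN, "C"))   # %Cys
    print(residue_percent(PROTEIN, "M"))   # %Met
    print(residue_percent(PROTEIN, "CM"))  # %Cys+Met
```

With 2 Cys and 5 Met in 235 residues, this matches the listed values of 0.9, 2.1 and 3.0.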
Secondary structure:
>Translated Secondary Structure
MINLKVLLNRILIKFKLFFKSVYYIGGNDALPPPLSKEEEEYFVQRLINGDEKVRSVLIE
CCHHHHHHHHHHHHHHHHHHHHHEECCCCCCCCCCCCCHHHHHHHHHHCCHHHHHHHHHH
RNLRLVVYIARKFENTGICIEDLVSVGTIGLIKAVNTFKPDKKIKLATYASRCIENEILM
CCCEEEEEEEHHHCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHH
YLRRNSKVKAEISFYEPLNIDWDGNELLLSDILGTDNDEVYNLIEDEVDKQLLLLAMKKL
HHHCCCCEEEEEEEECCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHH
NEREKEIVRLRFGLNGKREKTQKEVADMLGISQSYISRLEKRIIKTLKKEINKMV
CHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCC
>Mature Secondary Structure
MINLKVLLNRILIKFKLFFKSVYYIGGNDALPPPLSKEEEEYFVQRLINGDEKVRSVLIE
CCHHHHHHHHHHHHHHHHHHHHHEECCCCCCCCCCCCCHHHHHHHHHHCCHHHHHHHHHH
RNLRLVVYIARKFENTGICIEDLVSVGTIGLIKAVNTFKPDKKIKLATYASRCIENEILM
CCCEEEEEEEHHHCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHH
YLRRNSKVKAEISFYEPLNIDWDGNELLLSDILGTDNDEVYNLIEDEVDKQLLLLAMKKL
HHHCCCCEEEEEEEECCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHH
NEREKEIVRLRFGLNGKREKTQKEVADMLGISQSYISRLEKRIIKTLKKEINKMV
CHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 7883192; 11466286; 7961408 [H]