Definition: Clostridium botulinum A2 str. Kyoto chromosome, complete genome.
Accession: NC_012563
Length: 4,155,278 bp

The map label for this gene is sigF [H]

Identifier: 226950516

GI number: 226950516

Start: 3512655

End: 3513410

Strand: Reverse

Name: sigF [H]

Synonym: CLM_3492

Alternate gene names: 226950516

Gene position: 3513410-3512655 (Counterclockwise)

Preceding gene: 226950517

Following gene: 226950515

Centisome position: 84.55
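
Note: the coordinate fields above are related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming the centisome position is taken from the gene's 5' end (the higher coordinate, since the gene lies on the reverse strand). The function names are illustrative and not part of the record.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the chromosome length."""
    return 100.0 * position / genome_length


GENOME_LENGTH = 4_155_278            # "Length" field of the record
START, END = 3_512_655, 3_513_410    # "Start" and "End" fields

# Assumption: on the reverse strand the gene begins at the higher coordinate.
gene_start = END

print(gene_length(START, END))                            # 756
print(round(centisome(gene_start, GENOME_LENGTH), 2))     # 84.55
```

Under these assumptions the sketch reproduces the 756-base gene length and the 84.55 centisome value; using the lower coordinate instead gives a slightly smaller figure.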

GC content: 27.38
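
Note: a minimal sketch of how a GC figure like this can be computed, assuming the value is the percentage of G and C bases over the 756-base gene sequence (a value of 27.38 would correspond to 207 G/C bases out of 756). The function name is illustrative.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First ten bases of the gene sequence: three G, no C.
print(gc_content("ATGAATAGTG"))  # 30.0
```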

Gene sequence:

>756_bases
ATGAATAGTGAAAAGGTAATCATGGAAACTAAAAGTTTTGAAGACAACATGGAACTTATGGAAAAGGCAAGATCTGGGAA
TAAAGAAGCTTTAGATAAATTAGTAGAAGTAAATTTACCTTTAGTTTCAGCAATAAGTAAAAAATTTTTAAATAGAGGAT
ATGAATATGACGATATATTTCAGATAGGATGCATCGGTTTAGTAAAGGCTATAAATAATTTTGAAACAAAATATAATGTA
AAGTTCTCAACCTATGCGGTTCCTATGATAATGGGAGAAATAAAGAGATTCTTAAGGGATGATGGAATAATAAAAGTAAG
TAGAAGTATAAAAACAGCGGCAAAAAAATTACATTATGATAAAGAAAAGCTTTGTAAAGAATTAAATAGAGAACCTACAA
TAGAAGAATTATCACAGTTTTCAGGATATACTGTAGATGAAATATTAATGGCTACAGAATCATCGAGTTCCCCCCAATAT
TTATATGATGTTATACATCAAGATGATGGAGCACCAGTTTTGCTTATTGACAAAATAAGTGAAAATACAGAAGAAGATAA
TAAAATTGTAGATAATATAGCATTAAAAGAGGCTCTTAAAAATTTAGATATTAAGTCAAGGCAAATAATAATACTAAGAT
ATTTTAAGGATAAAACTCAAATAGAAGTAGCTAAACAATTAGGAATAAGTCAAGTACAGGTGTCAAGAATAGAGAAAAAA
GTATTAAAATTGATGAAAGAAAAACTTACTGTATAA

Upstream 100 bases:

>100_bases
GGAAACATTCATGGATAGTCTTCAAGTTCATTCAGAAAAGGGGAAGGGAACTAAAATAATTATGAAAAAGGTTTTTAAAT
AATTAAGTTAGGTGAATAAT

Downstream 100 bases:

>100_bases
AAACCATATGAATATTTTCATATGGTTTTTTGTTTTTTGTAAAATTCTATTCCCCTAAATCTGTTAATGTCATCAATAAA
TGATTTTATTCTTGAGCCAA

Product: sporulation sigma factor SigF

Products: NA

Alternate protein names: Sporulation sigma factor; Stage II sporulation protein AC [H]

Number of amino acids: Translated: 251; Mature: 251

Protein sequence:

>251_residues
MNSEKVIMETKSFEDNMELMEKARSGNKEALDKLVEVNLPLVSAISKKFLNRGYEYDDIFQIGCIGLVKAINNFETKYNV
KFSTYAVPMIMGEIKRFLRDDGIIKVSRSIKTAAKKLHYDKEKLCKELNREPTIEELSQFSGYTVDEILMATESSSSPQY
LYDVIHQDDGAPVLLIDKISENTEEDNKIVDNIALKEALKNLDIKSRQIIILRYFKDKTQIEVAKQLGISQVQVSRIEKK
VLKLMKEKLTV
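
Note: the protein sequence follows from the gene sequence by codon-wise translation (756 bases = 251 codons plus a stop codon). The sketch below assumes the standard codon assignments, which the bacterial translation table shares apart from alternative start codons. The table construction and function name are illustrative.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGAATAGTGAAAAGGTAATCATGGAAACT"))  # MNSEKVIMET
```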

Sequences:

>Translated_251_residues
MNSEKVIMETKSFEDNMELMEKARSGNKEALDKLVEVNLPLVSAISKKFLNRGYEYDDIFQIGCIGLVKAINNFETKYNV
KFSTYAVPMIMGEIKRFLRDDGIIKVSRSIKTAAKKLHYDKEKLCKELNREPTIEELSQFSGYTVDEILMATESSSSPQY
LYDVIHQDDGAPVLLIDKISENTEEDNKIVDNIALKEALKNLDIKSRQIIILRYFKDKTQIEVAKQLGISQVQVSRIEKK
VLKLMKEKLTV
>Mature_251_residues
MNSEKVIMETKSFEDNMELMEKARSGNKEALDKLVEVNLPLVSAISKKFLNRGYEYDDIFQIGCIGLVKAINNFETKYNV
KFSTYAVPMIMGEIKRFLRDDGIIKVSRSIKTAAKKLHYDKEKLCKELNREPTIEELSQFSGYTVDEILMATESSSSPQY
LYDVIHQDDGAPVLLIDKISENTEEDNKIVDNIALKEALKNLDIKSRQIIILRYFKDKTQIEVAKQLGISQVQVSRIEKK
VLKLMKEKLTV

Specific function: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor is responsible for the expression of sporulation-specific genes. [H]

COG id: COG1191

COG function: function code K; DNA-directed RNA polymerase specialized sigma subunit

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the sigma-70 factor family [H]

Homologues:

Organism=Escherichia coli, GI1789448, Length=235, Percent_Identity=31.063829787234, Blast_Score=93, Evalue=2e-20,
Organism=Escherichia coli, GI1789098, Length=258, Percent_Identity=25.1937984496124, Blast_Score=71, Evalue=7e-14,
Organism=Escherichia coli, GI1789871, Length=237, Percent_Identity=27.4261603375527, Blast_Score=68, Evalue=6e-13,
Organism=Escherichia coli, GI1788231, Length=199, Percent_Identity=25.1256281407035, Blast_Score=62, Evalue=4e-11,
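
Note: the homologue lines above are comma-separated `key=value` fields with a bare `GI...` token. A minimal parsing sketch follows, assuming that layout holds for every line. The function name and the `GI` dictionary key are illustrative.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dictionary of typed fields."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            # Bare token such as "GI1789448": store the number under "GI".
            record["GI"] = field[2:]
    # Convert the numeric fields; the others stay as strings.
    casts = {"Length": int, "Percent_Identity": float,
             "Blast_Score": float, "Evalue": float}
    for key, cast in casts.items():
        if key in record:
            record[key] = cast(record[key])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1789448, Length=235, "
    "Percent_Identity=31.063829787234, Blast_Score=93, Evalue=2e-20,"
)
print(hit["Organism"], hit["GI"], hit["Length"], hit["Evalue"])
```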

Paralogues:

None

Copy number: 700 (log & stationary phase) [C]

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR014284
- InterPro:   IPR014322
- InterPro:   IPR014236
- InterPro:   IPR000943
- InterPro:   IPR007627
- InterPro:   IPR007624
- InterPro:   IPR007630
- InterPro:   IPR013325
- InterPro:   IPR013324
- InterPro:   IPR011991 [H]

Pfam domain/function: PF04542 Sigma70_r2; PF04539 Sigma70_r3; PF04545 Sigma70_r4 [H]

EC number: NA

Molecular weight (Da): Translated: 28811; Mature: 28811

Theoretical pI: Translated: 7.41; Mature: 7.41

Prosite motif: PS00715 SIGMA70_1; PS00716 SIGMA70_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.8 %Cys     (Translated Protein)
3.2 %Met     (Translated Protein)
4.0 %Cys+Met (Translated Protein)
0.8 %Cys     (Mature Protein)
3.2 %Met     (Mature Protein)
4.0 %Cys+Met (Mature Protein)
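
Note: the Cys/Met percentages can be checked against the 251-residue protein sequence. This is a minimal sketch, assuming each value is the residue count divided by the sequence length, rounded to one decimal place. The function name is illustrative.

```python
PROTEIN = (
    "MNSEKVIMETKSFEDNMELMEKARSGNKEALDKLVEVNLPLVSAISKKFLNRGYEYDDIFQIGCIGLVKAINNFETKYNV"
    "KFSTYAVPMIMGEIKRFLRDDGIIKVSRSIKTAAKKLHYDKEKLCKELNREPTIEELSQFSGYTVDEILMATESSSSPQY"
    "LYDVIHQDDGAPVLLIDKISENTEEDNKIVDNIALKEALKNLDIKSRQIIILRYFKDKTQIEVAKQLGISQVQVSRIEKK"
    "VLKLMKEKLTV"
)


def residue_percent(seq: str, residues: str) -> float:
    """Percentage of the sequence made up of the given one-letter residues."""
    hits = sum(seq.count(residue) for residue in residues)
    return 100.0 * hits / len(seq)


print(round(residue_percent(PROTEIN, "C"), 1))   # 0.8 (2 Cys)
print(round(residue_percent(PROTEIN, "M"), 1))   # 3.2 (8 Met)
print(round(residue_percent(PROTEIN, "CM"), 1))  # 4.0
```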

Secondary structure:

>Translated Secondary Structure
MNSEKVIMETKSFEDNMELMEKARSGNKEALDKLVEVNLPLVSAISKKFLNRGYEYDDIF
CCCCEEEEECCCCCHHHHHHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHH
QIGCIGLVKAINNFETKYNVKFSTYAVPMIMGEIKRFLRDDGIIKVSRSIKTAAKKLHYD
HHHHHHHHHHHHCCCCEEEEEEHHHHHHHHHHHHHHHHCCCCEEEEHHHHHHHHHHHCCC
KEKLCKELNREPTIEELSQFSGYTVDEILMATESSSSPQYLYDVIHQDDGAPVLLIDKIS
HHHHHHHCCCCCCHHHHHHHCCCCHHHHHHHHCCCCCHHHHHHHHHCCCCCCEEEEECCC
ENTEEDNKIVDNIALKEALKNLDIKSRQIIILRYFKDKTQIEVAKQLGISQVQVSRIEKK
CCCCHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCHHHHHHHHHCCHHHHHHHHHHHH
VLKLMKEKLTV
HHHHHHHHCCC
>Mature Secondary Structure
MNSEKVIMETKSFEDNMELMEKARSGNKEALDKLVEVNLPLVSAISKKFLNRGYEYDDIF
CCCCEEEEECCCCCHHHHHHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHH
QIGCIGLVKAINNFETKYNVKFSTYAVPMIMGEIKRFLRDDGIIKVSRSIKTAAKKLHYD
HHHHHHHHHHHHCCCCEEEEEEHHHHHHHHHHHHHHHHCCCCEEEEHHHHHHHHHHHCCC
KEKLCKELNREPTIEELSQFSGYTVDEILMATESSSSPQYLYDVIHQDDGAPVLLIDKIS
HHHHHHHCCCCCCHHHHHHHCCCCHHHHHHHHCCCCCHHHHHHHHHCCCCCCEEEEECCC
ENTEEDNKIVDNIALKEALKNLDIKSRQIIILRYFKDKTQIEVAKQLGISQVQVSRIEKK
CCCCHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCHHHHHHHHHCCHHHHHHHHHHHH
VLKLMKEKLTV
HHHHHHHHCCC
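
Note: the prediction strings can be summarised as per-state percentages. A minimal sketch follows, assuming the common three-state convention (H = helix, E = strand, C = coil), which the record itself does not state. The function name is illustrative.

```python
from collections import Counter


def state_percentages(states: str) -> dict:
    """Percentage of positions assigned to each secondary-structure state."""
    if not states:
        raise ValueError("empty prediction string")
    counts = Counter(states)
    return {state: 100.0 * n / len(states) for state, n in sorted(counts.items())}


# First 60 positions of the prediction above.
example = "CCCCEEEEECCCCCHHHHHHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHH"
print(state_percentages(example))
```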

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 2513372 [H]