Definition Moorella thermoacetica ATCC 39073, complete genome.
Accession NC_007644
Length 2,628,784

Click here to switch to the map view.

The map label for this gene is sigF [H]

Identifier: 83590339

GI number: 83590339

Start: 1542563

End: 1543375

Strand: Reverse

Name: sigF [H]

Synonym: Moth_1496

Alternate gene names: 83590339

Gene position: 1543375-1542563 (Counterclockwise)

Preceding gene: 83590340

Following gene: 83590338

Centisome position: 58.71

GC content: 58.43

Gene sequence:

>813_bases
ATGCGCCTGTCGGAAATGAATCTCCCCCGCTTTCCCCTCCTGTCCGAAGCCGAGACGGATGAGTTGTTGCGCCGGGCCAA
GGCAGGGGATAAGGAGGCCCGGGAGCGGCTGATTAACTGTAACCTGAAGCTCGTTTTTAATCTGGTGCAACGCTTCGAGA
AACGTAACTACGAGCTGGAAGATCTCTTCCAGATCGGAACCATCGGGCTCATCAAGGCCATTGATAAGTTTGACTTAAGC
TATAAGGTGCGGTTTTCCACTTATGCCGTACCCATGATCCTGGGGGAAATTCGGCGCTTTTTGCGGGATGACAGCGCCGT
CAAGGTCAGTCGCTCCTTAAAGGAAACGGCCTTTAAAGTCAACCGCACCCGGGAGGAACTGGCCAAGAAATTCGGCCGGG
AACCGGCCATCGGCGAGATAGCCGAGGCCCTGGACCTCTCCCGGGAGGAGATTATAGCCGCCCTGGAAGCCGTGCAGATG
CCTAGTTCCATCCACGACACCGTCTACCAGGACGACGGCGATCCCATCTACGTTCTCGACCAGCTGGCCTCCGAAGACGG
GGAGGAACCGGAGTGGCTGGATAAGATTGCCCTGAAGGAGGTCCTGCGCCAGTTGCCGGAAAAACACCGACGGGTGCTGG
TCCTGCGTTTCTTCCAGGATAAAACCCAGGCGGAAGTGGCGGCCCGAATGGGGCTCTCCCAGGTACAGATCTCCCGCATT
GAGCGCCAGGCCCTGCAAAGAATTAGGGAACTGCTCCAGGCGGAAGGGAGCTGGGATGGGGACAGCGTACCGCTGCCCGG
GTGCCGGGAGTAA

Upstream 100 bases:

>100_bases
CCGGGGAACCACGGTCAGGATGTTGAAGGTCCTCAAGAAAGCGGGAGCAGAGTGATGGAGGGCTGCCCGGTAACCGGGCG
GTTGAGGTGGCGGTGAAGAA

Downstream 100 bases:

>100_bases
ACAGAGTTGACGATTAGTTAACCAGCAGGGAAAAATATAGTTACGGCGAATAATTAACCCTGGAACCTGTGAATAAAGAT
GAGGTTTTGGAAGGGATTGA

Product: sigma 28 (flagella/sporulation)

Products: NA

Alternate protein names: Sporulation sigma factor; Stage II sporulation protein AC [H]

Number of amino acids: Translated: 270; Mature: 270

Protein sequence:

>270_residues
MRLSEMNLPRFPLLSEAETDELLRRAKAGDKEARERLINCNLKLVFNLVQRFEKRNYELEDLFQIGTIGLIKAIDKFDLS
YKVRFSTYAVPMILGEIRRFLRDDSAVKVSRSLKETAFKVNRTREELAKKFGREPAIGEIAEALDLSREEIIAALEAVQM
PSSIHDTVYQDDGDPIYVLDQLASEDGEEPEWLDKIALKEVLRQLPEKHRRVLVLRFFQDKTQAEVAARMGLSQVQISRI
ERQALQRIRELLQAEGSWDGDSVPLPGCRE

Sequences:

>Translated_270_residues
MRLSEMNLPRFPLLSEAETDELLRRAKAGDKEARERLINCNLKLVFNLVQRFEKRNYELEDLFQIGTIGLIKAIDKFDLS
YKVRFSTYAVPMILGEIRRFLRDDSAVKVSRSLKETAFKVNRTREELAKKFGREPAIGEIAEALDLSREEIIAALEAVQM
PSSIHDTVYQDDGDPIYVLDQLASEDGEEPEWLDKIALKEVLRQLPEKHRRVLVLRFFQDKTQAEVAARMGLSQVQISRI
ERQALQRIRELLQAEGSWDGDSVPLPGCRE
>Mature_270_residues
MRLSEMNLPRFPLLSEAETDELLRRAKAGDKEARERLINCNLKLVFNLVQRFEKRNYELEDLFQIGTIGLIKAIDKFDLS
YKVRFSTYAVPMILGEIRRFLRDDSAVKVSRSLKETAFKVNRTREELAKKFGREPAIGEIAEALDLSREEIIAALEAVQM
PSSIHDTVYQDDGDPIYVLDQLASEDGEEPEWLDKIALKEVLRQLPEKHRRVLVLRFFQDKTQAEVAARMGLSQVQISRI
ERQALQRIRELLQAEGSWDGDSVPLPGCRE

Specific function: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor is responsible for the expression of sporulation specific genes [H]

COG id: COG1191

COG function: function code K; DNA-directed RNA polymerase specialized sigma subunit

Gene ontology:

Cell location: Cytoplasm [C]

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the sigma-70 factor family [H]

Homologues:

Organism=Escherichia coli, GI1789098, Length=255, Percent_Identity=32.9411764705882, Blast_Score=105, Evalue=4e-24,
Organism=Escherichia coli, GI1789448, Length=244, Percent_Identity=29.9180327868852, Blast_Score=94, Evalue=1e-20,
Organism=Escherichia coli, GI1788231, Length=202, Percent_Identity=27.7227722772277, Blast_Score=67, Evalue=8e-13,
Organism=Escherichia coli, GI1789871, Length=261, Percent_Identity=25.2873563218391, Blast_Score=65, Evalue=6e-12,

Paralogues:

None

Copy number: <10 (log phase) 250 (stationary phase) [C]

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR014284
- InterPro:   IPR014322
- InterPro:   IPR014236
- InterPro:   IPR000943
- InterPro:   IPR007627
- InterPro:   IPR007624
- InterPro:   IPR007630
- InterPro:   IPR013325
- InterPro:   IPR013324
- InterPro:   IPR011991 [H]

Pfam domain/function: PF04542 Sigma70_r2; PF04539 Sigma70_r3; PF04545 Sigma70_r4 [H]

EC number: NA

Molecular weight: Translated: 31116; Mature: 31116

Theoretical pI: Translated: 5.20; Mature: 5.20

Prosite motif: PS00715 SIGMA70_1 ; PS00716 SIGMA70_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.7 %Cys     (Translated Protein)
1.9 %Met     (Translated Protein)
2.6 %Cys+Met (Translated Protein)
0.7 %Cys     (Mature Protein)
1.9 %Met     (Mature Protein)
2.6 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MRLSEMNLPRFPLLSEAETDELLRRAKAGDKEARERLINCNLKLVFNLVQRFEKRNYELE
CCCCCCCCCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHHHHHHHCCCCHH
DLFQIGTIGLIKAIDKFDLSYKVRFSTYAVPMILGEIRRFLRDDSAVKVSRSLKETAFKV
HHHHHHHHHHHHHHHHHCCEEEEEHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH
NRTREELAKKFGREPAIGEIAEALDLSREEIIAALEAVQMPSSIHDTVYQDDGDPIYVLD
HHHHHHHHHHHCCCCCHHHHHHHHHCCHHHHHHHHHHHHCCHHHHHHHHCCCCCCCHHHH
QLASEDGEEPEWLDKIALKEVLRQLPEKHRRVLVLRFFQDKTQAEVAARMGLSQVQISRI
HHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHH
ERQALQRIRELLQAEGSWDGDSVPLPGCRE
HHHHHHHHHHHHHHCCCCCCCCCCCCCCCC
>Mature Secondary Structure
MRLSEMNLPRFPLLSEAETDELLRRAKAGDKEARERLINCNLKLVFNLVQRFEKRNYELE
CCCCCCCCCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHHHHHHHCCCCHH
DLFQIGTIGLIKAIDKFDLSYKVRFSTYAVPMILGEIRRFLRDDSAVKVSRSLKETAFKV
HHHHHHHHHHHHHHHHHCCEEEEEHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH
NRTREELAKKFGREPAIGEIAEALDLSREEIIAALEAVQMPSSIHDTVYQDDGDPIYVLD
HHHHHHHHHHHCCCCCHHHHHHHHHCCHHHHHHHHHHHHCCHHHHHHHHCCCCCCCHHHH
QLASEDGEEPEWLDKIALKEVLRQLPEKHRRVLVLRFFQDKTQAEVAARMGLSQVQISRI
HHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHH
ERQALQRIRELLQAEGSWDGDSVPLPGCRE
HHHHHHHHHHHHHHCCCCCCCCCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 2513372 [H]