Definition: Thermoanaerobacter sp. X514 chromosome, complete genome.
Accession: NC_010320
Length: 2,457,259

The map label for this gene is sigG [H]

Identifier: 167040623

GI number: 167040623

Start: 2008953

End: 2009723

Strand: Reverse

Name: sigG [H]

Synonym: Teth514_2000

Alternate gene names: 167040623

Gene position: 2009723-2008953 (Counterclockwise)

Preceding gene: 167040624

Following gene: 167040622

Centisome position: 81.79
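
Note: the coordinate fields above can be cross-checked with a few lines of Python. This is a minimal sketch, not the database's own method. It assumes that coordinates are 1-based and inclusive, and that the centisome position is the first base of the gene (the higher coordinate on the reverse strand) expressed as a percentage of the genome length. Both assumptions are inferred from the numbers in this record. The function names are illustrative.

```python
# Values copied from this record.
GENOME_LENGTH = 2_457_259            # Length
START, END = 2_008_953, 2_009_723    # Start / End
STRAND = "Reverse"


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, to two decimals."""
    return round(100.0 * position / genome_length, 2)


# Assumption: a reverse-strand gene begins at its higher coordinate,
# matching "Gene position: 2009723-2008953".
first_base = END if STRAND == "Reverse" else START

print(gene_length(START, END))               # 771
print(centisome(first_base, GENOME_LENGTH))  # 81.79
```

Both results agree with the ">771_bases" header and the centisome value listed in this record.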

GC content: 35.41

Gene sequence:

>771_bases
ATGAATAACAAAGTTGAAATTTGCGGTGTCAATACCTCTAAGTTACCTGTTTTAAAACCTTCCAAGCAAAAAGAGCTACT
GCAGCGGATGAAAAATGGAGACAAAAAAGCAAGAGAAGAATTTATAAATGGCAATTTGCGATTGGTACTAAGTGTTATTC
AAAGATTTAATAATCGCGGGGAATACGTAGATGATTTGTTTCAAGTAGGATGTATAGGGCTTATAAAAGCTATTGACAAT
TTTGACTTAAATCAAAATGTAAAATTTTCTACTTATGCTGTTCCAATGATAATTGGAGAAATAAGAAGATACTTAAGAGA
TAACACTCCTATAAGGGTGAGCCGCTCTTTAAGAGATATAGCCTATAAAGCATTACAGGTAAGGGACAAATTAGTATCGG
AAAATTCCAAAGAGCCGACAGTAGGCGAAATAGCTAAAGAGCTGGACCTTCCAAGAGAGGAAGTGGTAATGGCTCTTGAT
GCTATTCAAGAACCGGTTTCTTTGTTTGAACCTATATATCATGATGGTGGAGATGCAATTTATGTGATGGACCAAGTCAG
CGATGACAAAAATATGGATGAAGTGTGGCTAGAGAAAATTGCATTAAAAGAAGCAATACAAAAACTAAGCGAAAGGGAAA
AAATGATATTGACGATGAGATTTTTTGAAGGGAAAACCCAGATGGAGGTAGCAAAAGAGATAGGAATATCTCAAGCACAA
GTTTCAAGATTAGAAAAAGCGGCTTTAAATCACATGAGGAAATACATCTAG
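
Note: the GC content field can be recomputed from the gene sequence. The sketch below assumes the reported value is the percentage of G and C bases in the 771-base gene sequence (not the whole genome). The function name gc_content is illustrative.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# Arithmetic check on the reported value: 35.41 % of 771 bases
# corresponds to 273 G or C bases.
print(round(100.0 * 273 / 771, 2))  # 35.41

# Usage: join the FASTA lines of the gene sequence and pass them in.
# Only the first 30 bases are used here, for brevity.
print(gc_content("ATGAATAACAAAGTTGAAATTTGCGGTGTC"))
```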

Upstream 100 bases:

>100_bases
ATTAAAAAAAGAAATGAATAGACTAGTTTAAGCGTTTCTGTATATTTTTTTATCCTGCCGGCAATACTGTTAATAAGCTA
AACATGTGGGGGCAGGATTT

Downstream 100 bases:

>100_bases
CTTATCAGGAATGATAAGCTAGATTTTTTCATATAATTTTATGAAAAGGACAAAAGGGGGAAATGAGGATGATTAAAGCT
TCAGAGTTAAGAGATAAAGA

Product: sporulation sigma factor SigG

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 256; Mature: 256

Protein sequence:

>256_residues
MNNKVEICGVNTSKLPVLKPSKQKELLQRMKNGDKKAREEFINGNLRLVLSVIQRFNNRGEYVDDLFQVGCIGLIKAIDN
FDLNQNVKFSTYAVPMIIGEIRRYLRDNTPIRVSRSLRDIAYKALQVRDKLVSENSKEPTVGEIAKELDLPREEVVMALD
AIQEPVSLFEPIYHDGGDAIYVMDQVSDDKNMDEVWLEKIALKEAIQKLSEREKMILTMRFFEGKTQMEVAKEIGISQAQ
VSRLEKAALNHMRKYI
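
Note: the protein sequence follows from the gene sequence by codon translation. The sketch below builds the standard codon table (the bacterial table uses the same codon-to-residue assignments) and translates the first and last codons of the gene as a spot check. The names CODON_TABLE and translate are illustrative and not part of this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGAATAACAAAGTTGAAATTTGCGGTGTC"))  # MNNKVEICGV
print(translate("AAATACATCTAG"))                    # KYI
# 771 bases = 257 codons; dropping the stop codon leaves 256 residues.
print((771 - 3) // 3)                               # 256
```

The two translated fragments match the start and end of the 256-residue sequence listed above.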

Sequences:

>Translated_256_residues
MNNKVEICGVNTSKLPVLKPSKQKELLQRMKNGDKKAREEFINGNLRLVLSVIQRFNNRGEYVDDLFQVGCIGLIKAIDN
FDLNQNVKFSTYAVPMIIGEIRRYLRDNTPIRVSRSLRDIAYKALQVRDKLVSENSKEPTVGEIAKELDLPREEVVMALD
AIQEPVSLFEPIYHDGGDAIYVMDQVSDDKNMDEVWLEKIALKEAIQKLSEREKMILTMRFFEGKTQMEVAKEIGISQAQ
VSRLEKAALNHMRKYI
>Mature_256_residues
MNNKVEICGVNTSKLPVLKPSKQKELLQRMKNGDKKAREEFINGNLRLVLSVIQRFNNRGEYVDDLFQVGCIGLIKAIDN
FDLNQNVKFSTYAVPMIIGEIRRYLRDNTPIRVSRSLRDIAYKALQVRDKLVSENSKEPTVGEIAKELDLPREEVVMALD
AIQEPVSLFEPIYHDGGDAIYVMDQVSDDKNMDEVWLEKIALKEAIQKLSEREKMILTMRFFEGKTQMEVAKEIGISQAQ
VSRLEKAALNHMRKYI

Specific function: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor is responsible for the expression of sporulation-specific genes. [H]

COG id: COG1191

COG function: function code K; DNA-directed RNA polymerase specialized sigma subunit

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the sigma-70 factor family [H]

Homologues:

Organism=Escherichia coli, GI1789448, Length=244, Percent_Identity=31.1475409836066, Blast_Score=104, Evalue=5e-24,
Organism=Escherichia coli, GI1789098, Length=256, Percent_Identity=29.6875, Blast_Score=98, Evalue=5e-22,
Organism=Escherichia coli, GI1789871, Length=266, Percent_Identity=26.3157894736842, Blast_Score=75, Evalue=4e-15,
Organism=Escherichia coli, GI1788231, Length=202, Percent_Identity=29.7029702970297, Blast_Score=72, Evalue=2e-14,
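
Note: the homologue lines are comma-separated key=value pairs, with the GI number given as a bare "GI..." token. A minimal parsing sketch follows. The function name parse_homologue and the output key "GI" are illustrative. The sketch assumes organism names contain no commas.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line of this record into a dictionary."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare token such as GI1789448
    # Convert the numeric fields where present.
    casts = (("Length", int), ("Percent_Identity", float),
             ("Blast_Score", float), ("Evalue", float))
    for key, cast in casts:
        if key in record:
            record[key] = cast(record[key])
    return record


line = ("Organism=Escherichia coli, GI1789448, Length=244, "
        "Percent_Identity=31.1475409836066, Blast_Score=104, Evalue=5e-24,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], hit["Length"], hit["Evalue"])
```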

Paralogues:

None

Copy number: 700 (log & stationary phase) [C]

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR014284
- InterPro:   IPR014322
- InterPro:   IPR014212
- InterPro:   IPR000943
- InterPro:   IPR007627
- InterPro:   IPR007624
- InterPro:   IPR007630
- InterPro:   IPR013325
- InterPro:   IPR013324
- InterPro:   IPR011991 [H]

Pfam domain/function: PF04542 Sigma70_r2; PF04539 Sigma70_r3; PF04545 Sigma70_r4 [H]

EC number: NA

Molecular weight: Translated: 29421; Mature: 29421

Theoretical pI: Translated: 8.17; Mature: 8.17

Prosite motif: PS00715 SIGMA70_1; PS00716 SIGMA70_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.8 %Cys     (Translated Protein)
3.9 %Met     (Translated Protein)
4.7 %Cys+Met (Translated Protein)
0.8 %Cys     (Mature Protein)
3.9 %Met     (Mature Protein)
4.7 %Cys+Met (Mature Protein)
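
Note: the Cys/Met percentages can be reproduced by counting residues in the 256-residue protein sequence of this record. A minimal sketch follows. The function name percent is illustrative.

```python
# Protein sequence copied from this record (256 residues).
PROTEIN = (
    "MNNKVEICGVNTSKLPVLKPSKQKELLQRMKNGDKKAREEFINGNLRLVLSVIQRFNNRGEYVDDLFQVGCIGLIKAIDN"
    "FDLNQNVKFSTYAVPMIIGEIRRYLRDNTPIRVSRSLRDIAYKALQVRDKLVSENSKEPTVGEIAKELDLPREEVVMALD"
    "AIQEPVSLFEPIYHDGGDAIYVMDQVSDDKNMDEVWLEKIALKEAIQKLSEREKMILTMRFFEGKTQMEVAKEIGISQAQ"
    "VSRLEKAALNHMRKYI"
)


def percent(residues: str, seq: str) -> float:
    """Percentage of the sequence made up of the given residue letters."""
    hits = sum(seq.count(r) for r in residues)
    return round(100.0 * hits / len(seq), 1)


print(PROTEIN.count("C"), PROTEIN.count("M"))  # 2 10
print(percent("C", PROTEIN))    # 0.8
print(percent("M", PROTEIN))    # 3.9
print(percent("CM", PROTEIN))   # 4.7
```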

Secondary structure:

>Translated Secondary Structure
MNNKVEICGVNTSKLPVLKPSKQKELLQRMKNGDKKAREEFINGNLRLVLSVIQRFNNRG
CCCCEEEEECCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHCCCHHHHHHHHHHHCCCC
EYVDDLFQVGCIGLIKAIDNFDLNQNVKFSTYAVPMIIGEIRRYLRDNTPIRVSRSLRDI
HHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEHHHHHHHH
AYKALQVRDKLVSENSKEPTVGEIAKELDLPREEVVMALDAIQEPVSLFEPIYHDGGDAI
HHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCEE
YVMDQVSDDKNMDEVWLEKIALKEAIQKLSEREKMILTMRFFEGKTQMEVAKEIGISQAQ
EEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCHHH
VSRLEKAALNHMRKYI
HHHHHHHHHHHHHHCC
>Mature Secondary Structure
MNNKVEICGVNTSKLPVLKPSKQKELLQRMKNGDKKAREEFINGNLRLVLSVIQRFNNRG
CCCCEEEEECCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHCCCHHHHHHHHHHHCCCC
EYVDDLFQVGCIGLIKAIDNFDLNQNVKFSTYAVPMIIGEIRRYLRDNTPIRVSRSLRDI
HHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEHHHHHHHH
AYKALQVRDKLVSENSKEPTVGEIAKELDLPREEVVMALDAIQEPVSLFEPIYHDGGDAI
HHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCEE
YVMDQVSDDKNMDEVWLEKIALKEAIQKLSEREKMILTMRFFEGKTQMEVAKEIGISQAQ
EEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCHHH
VSRLEKAALNHMRKYI
HHHHHHHHHHHHHHCC
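
Note: the prediction rows above can be summarised as percentages per state. The sketch below assumes the usual three-state convention (H = helix, E = strand, C = coil), which this record does not define explicitly. The function name ss_composition is illustrative.

```python
from collections import Counter


def ss_composition(prediction: str) -> dict:
    """Percentage of H, E and C states in a secondary structure string."""
    states = "".join(prediction.split()).upper()
    if not states:
        raise ValueError("empty prediction")
    counts = Counter(states)
    unknown = set(counts) - set("HEC")
    if unknown:
        raise ValueError(f"unexpected state letters: {sorted(unknown)}")
    return {s: round(100.0 * counts[s] / len(states), 1) for s in "HEC"}


# Toy input: 4 helix, 2 strand and 4 coil positions.
print(ss_composition("HHHHEECCCC"))  # {'H': 40.0, 'E': 20.0, 'C': 40.0}
```

To summarise this record, concatenate only the state rows (the lines made up of H, E and C that sit under each residue row) and pass the result to ss_composition.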

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 7961408; 7883192; 11466286 [H]