Definition Thermoanaerobacter sp. X514 chromosome, complete genome.
Accession NC_010320
Length 2,457,259

The map label for this gene is stc [H]

Identifier: 167040658

GI number: 167040658

Start: 2044147

End: 2046114

Strand: Reverse

Name: stc [H]

Synonym: Teth514_2035

Alternate gene names: 167040658

Gene position: 2046114-2044147 (Counterclockwise)

Preceding gene: 167040659

Following gene: 167040657

Centisome position: 83.27
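
The coordinate fields above are enough to reproduce the derived ones. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and assuming that the centisome position is the gene's 5' end (the larger coordinate for a reverse-strand gene) expressed as a percentage of the chromosome length. That convention is inferred from the reported numbers rather than documented in this record, and the function names are illustrative only.

```python
GENOME_LENGTH = 2_457_259          # "Length" field of NC_010320
START, END = 2_044_147, 2_046_114  # "Start" and "End" fields
STRAND = "Reverse"                 # "Strand" field


def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, end: int, strand: str, genome_length: int) -> float:
    """Position of the gene's 5' end as a percentage of the genome.

    Assumption: for a reverse-strand gene the 5' end is the larger
    coordinate; this is inferred, not stated in the record.
    """
    five_prime = end if strand == "Reverse" else start
    return 100.0 * five_prime / genome_length


print(gene_length(START, END))                                   # → 1968
print(round(centisome(START, END, STRAND, GENOME_LENGTH), 2))    # → 83.27
```

Using the Start coordinate instead would give about 83.19, which is why the sketch takes the larger coordinate for this reverse-strand gene.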

GC content: 33.59
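
The GC content value is presumably the percentage of G and C bases in the gene sequence; the record does not say so explicitly, so treat that as an assumption. A minimal Python sketch of that calculation (the function name `gc_content` is hypothetical) could look like this:

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks (as in a wrapped FASTA body) are ignored.
    """
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input: the first 12 bases of the gene (2 G, 0 C out of 12).
print(round(gc_content("ATGAAAGATTTT"), 2))  # → 16.67
```

Passing the full 1968-base sequence, with its line breaks left in, should give a value comparable to the one reported above.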

Gene sequence:

>1968_bases
ATGAAAGATTTTATATCTTTTATGGAGAAAGCTTGGTGGAGATACGTTAATGAGGGGGTAATTACTGAAGGTGTTAAAGA
AGAGATTAGAGAGTCATGGGAACTTTGTAAATTCTATGGTGTAGACCCTTTTGGAGGCGTTGGGGAAATATTGGATGAAG
ATAATATGAAAATAAGGTTGAGGCAAAATGAGGATTTAATATCTGTTGCACATCCTATTATGGAAAATATATATAAACAA
GTAACTGGTTCGGGCTTTTTGTTTGTTCTTGTAGACAGGGACGGTTATCTTATCGATAGAATTGGCGATGATAGTATAAT
GGGAGAGACGAGAAAATTAAACTTTGTTGAGGGGGCTTTATGGACAGAAGAAGCGGTAGGGACTAATGCGATTGCTTTGG
CTTTAAAACTTGATAAACCTATACAACTGGTGGGAGCCGAACATTATTGTATTACCCATCACATAGCTACATGTTCTGCA
GCACCTATTCATGATGAAAACGGTAATATCATAGGATGCCTTGATATGACTGGATTAAAAGAGGATGCACACCCCCACAA
TTTAGGTATTGTACTTGCTGGAGCTTATTCTATTGAAAAGCAGCTTGCGCTTATCAAGTCTCATAAACTTATAGACGCTA
CTTTTGATTCTATTCACGAAGGCATGCTTATAATGGACCATAATTTTGTTATTCAAAGGGTTAACGAAATAGCATTAAAG
ATATTAAATATTTCAAAGAAGGAAATTATAGGCATGCATATACAGTCCATACTTGAAGATGTGGATATAATTGAAGATAT
ACTAAGTCAGACAGAACCCTATTACGATGTTGAATGTATTTTTTATGGTCGAAACAAAAAAATACCATGTCGTATAAATG
CTGTTCCTATAGTAGCTAACAGAAAAATTATAGGTACGGTTGTAACTTTTAGAGAAGAGAAATATGTTCATAATGTTGTT
AATAAATTAGCAGGATTTAAAGCGAGTTATACATTTGAAGATATAATCACAAATGATGAAAATATGAAAAAAATCATAGA
AACTGCTAAAAAAGCTGCGAAGAGTGATTGCAATATCCTTATAGAAGGTGAAAGTGGGACAGGTAAAGAACTTTTTGCGC
AGGCGATACATAACTATAGTAAACGTTCAAAAGGGCCATTTATTGCAGTAAACTGTGCTTCTATACCCAGAGAATTAGTA
GAAAGTGAACTTTTTGGTTACGAAAGAGGAGCGTTTACTGGGGCAAATAAAGAAGGTAAACCTGGTAAATTTGAACTAGC
GGACAATGGAACGATATTTTTAGATGAAATTGGAGAGTTGCCCTATGAGGTGCAATCTAAACTTTTAAGAGTGCTGGATA
ACCATAAAATTGTAAGGGTGGGTGGCACTGAGGAAAAGAAATTGAATGTAAGGGTTATTGCGGCAACCAACAGAAATTTA
AGTGAAGAAGTGAATAAAAAGAATTTTAGAAATGATTTATATTATAGGATAAATGTCATAAAAATAAACATCCCTCCTTT
AAGAGAGAGAAAAGGAGACATTGAACTTCTTGCTAAAGTGTTTGTAGAACGACTTAATAGATATAATGCTTTAAGCGTTT
CAGATTATAAGGTTTTAAGCGAATCGTTTATAGAACGATTGATGAATTACAACTGGCCAGGAAATGTTCGAGAGTTACAA
AATGTTATTGACAGGGCATATTACCTTACGGAAGGCAAAGTAATTATCGAAGAGTATTTCCCTGAAAATATAAGTGAAAA
TAGTAATGCACAACAGGAAAATAATACTACAATCTTTCCAATTGAAGTTATTGAAGAAAAAAACATAAGAGAAGCACTAA
AAATAGCCAAGGGTAATATATTGAAGGCAGCTGAAATGTTAAATTTGAGTAGAGCTACTATTTATAGAAAGATAAAAAAA
TACAATATAAGTGTAAAGGAATTGCTGTCTCAAATTGAGACTAAATAA

Upstream 100 bases:

>100_bases
TCTTATTATGCTGATGAAGGATTGAAGTTGACAAAGGCTTAGTTTTTAAGCCTTTGTTTTTGCATGAGGAGAATTTTAAA
ACGATTTAAGGGGGAGTGTC

Downstream 100 bases:

>100_bases
AAGGAATCTCAAAATGAGACAAATGATTTTGTATAAATATGAGATTGATAATTTATTAACCCTTAATTTAAGGGGTTTTT
TATTTTGGCATGAAATTTGA

Product: Fis family GAF modulated sigma54 specific transcriptional regulator

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 655; Mature: 655
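
The residue count follows from the gene length: 1968 bases form 656 codons, and the last one (TAA) is a stop codon, which leaves 655 residues. The sketch below illustrates this in Python using the standard codon assignments, which bacterial translation table 11 shares apart from its start-codon rules. The names `CODON_TABLE` and `translate` are illustrative, and this is not necessarily how the record's own translation was produced.

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence from its first base up to the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 13 codons of the gene sequence listed above.
print(translate("ATGAAAGATTTTATATCTTTTATGGAGAAAGCTTGGTGG"))  # → MKDFISFMEKAWW
print(1968 // 3 - 1)                                          # → 655
```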

Protein sequence:

>655_residues
MKDFISFMEKAWWRYVNEGVITEGVKEEIRESWELCKFYGVDPFGGVGEILDEDNMKIRLRQNEDLISVAHPIMENIYKQ
VTGSGFLFVLVDRDGYLIDRIGDDSIMGETRKLNFVEGALWTEEAVGTNAIALALKLDKPIQLVGAEHYCITHHIATCSA
APIHDENGNIIGCLDMTGLKEDAHPHNLGIVLAGAYSIEKQLALIKSHKLIDATFDSIHEGMLIMDHNFVIQRVNEIALK
ILNISKKEIIGMHIQSILEDVDIIEDILSQTEPYYDVECIFYGRNKKIPCRINAVPIVANRKIIGTVVTFREEKYVHNVV
NKLAGFKASYTFEDIITNDENMKKIIETAKKAAKSDCNILIEGESGTGKELFAQAIHNYSKRSKGPFIAVNCASIPRELV
ESELFGYERGAFTGANKEGKPGKFELADNGTIFLDEIGELPYEVQSKLLRVLDNHKIVRVGGTEEKKLNVRVIAATNRNL
SEEVNKKNFRNDLYYRINVIKINIPPLRERKGDIELLAKVFVERLNRYNALSVSDYKVLSESFIERLMNYNWPGNVRELQ
NVIDRAYYLTEGKVIIEEYFPENISENSNAQQENNTTIFPIEVIEEKNIREALKIAKGNILKAAEMLNLSRATIYRKIKK
YNISVKELLSQIETK

Sequences:

>Translated_655_residues
MKDFISFMEKAWWRYVNEGVITEGVKEEIRESWELCKFYGVDPFGGVGEILDEDNMKIRLRQNEDLISVAHPIMENIYKQ
VTGSGFLFVLVDRDGYLIDRIGDDSIMGETRKLNFVEGALWTEEAVGTNAIALALKLDKPIQLVGAEHYCITHHIATCSA
APIHDENGNIIGCLDMTGLKEDAHPHNLGIVLAGAYSIEKQLALIKSHKLIDATFDSIHEGMLIMDHNFVIQRVNEIALK
ILNISKKEIIGMHIQSILEDVDIIEDILSQTEPYYDVECIFYGRNKKIPCRINAVPIVANRKIIGTVVTFREEKYVHNVV
NKLAGFKASYTFEDIITNDENMKKIIETAKKAAKSDCNILIEGESGTGKELFAQAIHNYSKRSKGPFIAVNCASIPRELV
ESELFGYERGAFTGANKEGKPGKFELADNGTIFLDEIGELPYEVQSKLLRVLDNHKIVRVGGTEEKKLNVRVIAATNRNL
SEEVNKKNFRNDLYYRINVIKINIPPLRERKGDIELLAKVFVERLNRYNALSVSDYKVLSESFIERLMNYNWPGNVRELQ
NVIDRAYYLTEGKVIIEEYFPENISENSNAQQENNTTIFPIEVIEEKNIREALKIAKGNILKAAEMLNLSRATIYRKIKK
YNISVKELLSQIETK
>Mature_655_residues
MKDFISFMEKAWWRYVNEGVITEGVKEEIRESWELCKFYGVDPFGGVGEILDEDNMKIRLRQNEDLISVAHPIMENIYKQ
VTGSGFLFVLVDRDGYLIDRIGDDSIMGETRKLNFVEGALWTEEAVGTNAIALALKLDKPIQLVGAEHYCITHHIATCSA
APIHDENGNIIGCLDMTGLKEDAHPHNLGIVLAGAYSIEKQLALIKSHKLIDATFDSIHEGMLIMDHNFVIQRVNEIALK
ILNISKKEIIGMHIQSILEDVDIIEDILSQTEPYYDVECIFYGRNKKIPCRINAVPIVANRKIIGTVVTFREEKYVHNVV
NKLAGFKASYTFEDIITNDENMKKIIETAKKAAKSDCNILIEGESGTGKELFAQAIHNYSKRSKGPFIAVNCASIPRELV
ESELFGYERGAFTGANKEGKPGKFELADNGTIFLDEIGELPYEVQSKLLRVLDNHKIVRVGGTEEKKLNVRVIAATNRNL
SEEVNKKNFRNDLYYRINVIKINIPPLRERKGDIELLAKVFVERLNRYNALSVSDYKVLSESFIERLMNYNWPGNVRELQ
NVIDRAYYLTEGKVIIEEYFPENISENSNAQQENNTTIFPIEVIEEKNIREALKIAKGNILKAAEMLNLSRATIYRKIKK
YNISVKELLSQIETK

Specific function: Member of the two-component regulatory system AtoS/AtoC involved in the transcriptional regulation of the ato genes for acetoacetate metabolism. Also an inhibitor of polyamine biosynthesis. [C]

COG id: COG3284

COG function: function code QK; Transcriptional activator of acetoin/glycerol metabolism

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 sigma-54 factor interaction domain [H]

Homologues:

Organism=Escherichia coli, GI1788550, Length=340, Percent_Identity=41.7647058823529, Blast_Score=267, Evalue=1e-72
Organism=Escherichia coli, GI87082117, Length=329, Percent_Identity=41.6413373860182, Blast_Score=262, Evalue=6e-71
Organism=Escherichia coli, GI1789233, Length=451, Percent_Identity=34.1463414634146, Blast_Score=253, Evalue=2e-68
Organism=Escherichia coli, GI1788905, Length=324, Percent_Identity=41.0493827160494, Blast_Score=249, Evalue=3e-67
Organism=Escherichia coli, GI1790299, Length=334, Percent_Identity=38.9221556886228, Blast_Score=244, Evalue=2e-65
Organism=Escherichia coli, GI1790437, Length=306, Percent_Identity=40.5228758169935, Blast_Score=240, Evalue=2e-64
Organism=Escherichia coli, GI1789087, Length=322, Percent_Identity=41.9254658385093, Blast_Score=240, Evalue=2e-64
Organism=Escherichia coli, GI1786524, Length=333, Percent_Identity=38.4384384384384, Blast_Score=232, Evalue=5e-62
Organism=Escherichia coli, GI87081858, Length=638, Percent_Identity=25.705329153605, Blast_Score=226, Evalue=3e-60
Organism=Escherichia coli, GI87082152, Length=314, Percent_Identity=43.312101910828, Blast_Score=225, Evalue=7e-60
Organism=Escherichia coli, GI87081872, Length=313, Percent_Identity=40.5750798722045, Blast_Score=215, Evalue=6e-57
Organism=Escherichia coli, GI1787583, Length=351, Percent_Identity=36.4672364672365, Blast_Score=209, Evalue=4e-55
Organism=Escherichia coli, GI1789828, Length=259, Percent_Identity=33.976833976834, Blast_Score=132, Evalue=8e-32
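
Each homologue line above is a comma-separated list of `key=value` fields plus a bare GI identifier. As a rough sketch, assuming that layout holds for every line, the entries can be read into dictionaries in Python as follows. The function name `parse_homologue` and the output key `"GI"` are hypothetical choices, and a trailing comma is tolerated if one is present.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict with typed numeric fields."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # keep the identifier as a string
    # Convert the numeric fields; the E-value is written in scientific notation.
    casts = (("Length", int), ("Percent_Identity", float),
             ("Blast_Score", int), ("Evalue", float))
    for key, cast in casts:
        if key in record:
            record[key] = cast(record[key])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1788550, Length=340, "
    "Percent_Identity=41.7647058823529, Blast_Score=267, Evalue=1e-72,"
)
print(hit["Organism"], hit["GI"], hit["Length"], round(hit["Percent_Identity"], 1))
# → Escherichia coli 1788550 340 41.8
```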

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003593
- InterPro:   IPR020441
- InterPro:   IPR009057
- InterPro:   IPR002197
- InterPro:   IPR001610
- InterPro:   IPR000014
- InterPro:   IPR013767
- InterPro:   IPR002078 [H]

Pfam domain/function: PF02954 HTH_8; PF00989 PAS; PF00158 Sigma54_activat [H]

EC number: NA

Molecular weight: Translated: 74448; Mature: 74448

Theoretical pI: Translated: 5.84; Mature: 5.84

Prosite motif: PS50112 PAS; PS00675 SIGMA54_INTERACT_1; PS00676 SIGMA54_INTERACT_2; PS00688 SIGMA54_INTERACT_3; PS50045 SIGMA54_INTERACT_4

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.2 %Cys     (Translated Protein)
1.8 %Met     (Translated Protein)
3.1 %Cys+Met (Translated Protein)
1.2 %Cys     (Mature Protein)
1.8 %Met     (Mature Protein)
3.1 %Cys+Met (Mature Protein)
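
These percentages are presumably residue counts divided by the protein length (655); for example, 1.2 % of 655 residues is about 8 cysteines. A minimal Python sketch of that calculation, with a hypothetical function name, is:

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(protein.count(r) for r in residues.upper())
    return 100.0 * hits / len(protein)


# Toy input: the first 10 residues of the protein (2 Met, 0 Cys).
fragment = "MKDFISFMEK"
print(residue_percent(fragment, "M"))   # → 20.0
print(residue_percent(fragment, "C"))   # → 0.0
print(residue_percent(fragment, "CM"))  # → 20.0
```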

Secondary structure:

>Translated Secondary Structure
MKDFISFMEKAWWRYVNEGVITEGVKEEIRESWELCKFYGVDPFGGVGEILDEDNMKIRL
CHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHCCCCCCEEEE
RQNEDLISVAHPIMENIYKQVTGSGFLFVLVDRDGYLIDRIGDDSIMGETRKLNFVEGAL
ECCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCEEEEECCCCCCCCCCCEEEEHHCCC
WTEEAVGTNAIALALKLDKPIQLVGAEHYCITHHIATCSAAPIHDENGNIIGCLDMTGLK
CCHHHCCCCEEEEEEECCCCEEEECCCCEEEEEHHHHCCCCCEECCCCCEEEEEECCCCC
EDAHPHNLGIVLAGAYSIEKQLALIKSHKLIDATFDSIHEGMLIMDHNFVIQRVNEIALK
CCCCCCCCEEEEEECHHHHHHHHHHHHCCHHHHHHHHHHCCEEEEECHHHHHHHHHHHHH
ILNISKKEIIGMHIQSILEDVDIIEDILSQTEPYYDVECIFYGRNKKIPCRINAVPIVAN
HHCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCEEEEEEEEEEC
RKIIGTVVTFREEKYVHNVVNKLAGFKASYTFEDIITNDENMKKIIETAKKAAKSDCNIL
CEEEEEEEHHHHHHHHHHHHHHHHCCCCCCCHHHHHCCCHHHHHHHHHHHHHCCCCCEEE
IEGESGTGKELFAQAIHNYSKRSKGPFIAVNCASIPRELVESELFGYERGAFTGANKEGK
EECCCCCCHHHHHHHHHHHHHCCCCCEEEEEHHHHHHHHHHHHHCCCCCCCCCCCCCCCC
PGKFELADNGTIFLDEIGELPYEVQSKLLRVLDNHKIVRVGGTEEKKLNVRVIAATNRNL
CCEEEECCCCEEEEECCCCCCHHHHHHHHHHHCCCCEEEECCCCCCEEEEEEEEECCCCH
SEEVNKKNFRNDLYYRINVIKINIPPLRERKGDIELLAKVFVERLNRYNALSVSDYKVLS
HHHHHHHHCCCCEEEEEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHH
ESFIERLMNYNWPGNVRELQNVIDRAYYLTEGKVIIEEYFPENISENSNAQQENNTTIFP
HHHHHHHHCCCCCCCHHHHHHHHHHHEEEECCCEEEEECCCCCCCCCCCCCCCCCCEEEE
IEVIEEKNIREALKIAKGNILKAAEMLNLSRATIYRKIKKYNISVKELLSQIETK
EEECCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCC
>Mature Secondary Structure
MKDFISFMEKAWWRYVNEGVITEGVKEEIRESWELCKFYGVDPFGGVGEILDEDNMKIRL
CHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHCCCCCCEEEE
RQNEDLISVAHPIMENIYKQVTGSGFLFVLVDRDGYLIDRIGDDSIMGETRKLNFVEGAL
ECCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCEEEEECCCCCCCCCCCEEEEHHCCC
WTEEAVGTNAIALALKLDKPIQLVGAEHYCITHHIATCSAAPIHDENGNIIGCLDMTGLK
CCHHHCCCCEEEEEEECCCCEEEECCCCEEEEEHHHHCCCCCEECCCCCEEEEEECCCCC
EDAHPHNLGIVLAGAYSIEKQLALIKSHKLIDATFDSIHEGMLIMDHNFVIQRVNEIALK
CCCCCCCCEEEEEECHHHHHHHHHHHHCCHHHHHHHHHHCCEEEEECHHHHHHHHHHHHH
ILNISKKEIIGMHIQSILEDVDIIEDILSQTEPYYDVECIFYGRNKKIPCRINAVPIVAN
HHCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCEEEEEEEEEEC
RKIIGTVVTFREEKYVHNVVNKLAGFKASYTFEDIITNDENMKKIIETAKKAAKSDCNIL
CEEEEEEEHHHHHHHHHHHHHHHHCCCCCCCHHHHHCCCHHHHHHHHHHHHHCCCCCEEE
IEGESGTGKELFAQAIHNYSKRSKGPFIAVNCASIPRELVESELFGYERGAFTGANKEGK
EECCCCCCHHHHHHHHHHHHHCCCCCEEEEEHHHHHHHHHHHHHCCCCCCCCCCCCCCCC
PGKFELADNGTIFLDEIGELPYEVQSKLLRVLDNHKIVRVGGTEEKKLNVRVIAATNRNL
CCEEEECCCCEEEEECCCCCCHHHHHHHHHHHCCCCEEEECCCCCCEEEEEEEEECCCCH
SEEVNKKNFRNDLYYRINVIKINIPPLRERKGDIELLAKVFVERLNRYNALSVSDYKVLS
HHHHHHHHCCCCEEEEEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHH
ESFIERLMNYNWPGNVRELQNVIDRAYYLTEGKVIIEEYFPENISENSNAQQENNTTIFP
HHHHHHHHCCCCCCCHHHHHHHHHHHEEEECCCEEEEECCCCCCCCCCCCCCCCCCEEEE
IEVIEEKNIREALKIAKGNILKAAEMLNLSRATIYRKIKKYNISVKELLSQIETK
EEECCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCC
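
In the blocks above, each sequence line is followed by a line of per-residue state letters. The record does not define the letters; the sketch below assumes the common convention of H for helix, E for strand, and C for coil, and summarises a state string as percentages. The function name `ss_composition` is illustrative.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of H, E and C states in a secondary-structure string.

    Assumption: H = helix, E = strand, C = coil (not defined in the record).
    """
    states = "".join(states.split()).upper()
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {s: 100.0 * counts.get(s, 0) / len(states) for s in "HEC"}


print(ss_composition("CHHHHEEC"))  # → {'H': 50.0, 'E': 25.0, 'C': 25.0}
```

To apply this to the record, concatenate only the state lines (every second line of a block) before calling the function.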

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA