Definition Thermoanaerobacter tengcongensis MB4, complete genome.
Accession NC_003869
Length 2,689,445

The map label for this gene is PurR4 [H]

Identifier: 20806985

GI number: 20806985

Start: 490882

End: 491895

Strand: Direct

Name: PurR4 [H]

Synonym: TTE0480

Alternate gene names: 20806985

Gene position: 490882-491895 (Clockwise)

Preceding gene: 20806984

Following gene: 20806986

Centisome position: 18.25
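
The position fields above are related by simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive, and that the centisome position is the gene start expressed as a percentage of the genome length. The record does not state either definition, so both are assumptions, and the function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


# Start, End and Length values taken from this record.
print(gene_length(490882, 491895))           # 1014
print(round(centisome(490882, 2689445), 2))  # 18.25
```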

GC content: 40.63

Gene sequence:

>1014_bases
ATGGGTGCAACTATAAAAGATGTAGCGAGAGAGGCGAAAGTTTCCATTGCTACAGTTTCAAGAGTTTTAAACAACAGCGC
TGTTGTGACAGAAGAGACAAGACAGAGGGTTTTAGAGGCAATAAAAAAGACGGGTTACAAACCCAATGCTCTTGCAAGAA
GCTTAAAGATTCAAAAAACTCACACTATTGGTCTTATCATACCCGACATTTCAAGCACCTTTTACCCTGAGGTGGTAAGA
GGTATAGAGGACATTGCTGCAATGTATAATTATAATATCTTCTTGTGCAACACCGACCAGAAAGAAGATAAAGAAATAAA
ATATATAGAAATTCTGGGAGAAAAGCAGGTAGACGGAATTATATTCATGGGGGATGTAGTAAGAGACAGCGTAATTCAAG
CTTTTAACGAGTTTAAAGTGCCGGTAGTGCTTGCAGGCACACAGGACAAAGAGAAAAGGTATCCCAGTGTAATGATTGAC
AATGAAAAAGCCGCCTATGATGCGGTGAAATACCTCATTTCCCTTGGGCACAAGAAGATAGGAATGATTGCAGGTTCAAT
GCAAGACCCAATAGCAGGTCTTCAAAGAATAGAAGGTTATAAAAGGGCTTTGGAAGAACACGGCATAAAATATGACCCTG
AGCTCGTAGTAGAAGGCGAATTTAAGACGAGGAAGGCATATCTTGCAATGCTTAAACTTCTCGAGCACAAAGTTACAGCT
GTTTTTGCGGCTTCAGATGACATGGCTGCAGCAGCTATAAACGCTATATTTGACTCAAACTTAAGGGTTCCTGATGACAT
ACATGTGGTGGGCTTTGATAATACCTATATTTCCACGATTTTCAGGCCTACTATAACTACAATACTGCAGCCTGCCTATG
ACATAGGAGCTGTTGCTATGAGACTTTTGACAAAGCTTCTGGGCAAAGAGCCAATAGAAGAGATGCATGTCATTTTGCCT
CATCAGCTGATTGTAAGAGAATCTACGGGATTTAAAGAAGGAAGTAGCCGATGA
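
A GC content figure like the one reported for this gene can be recomputed from the sequence. This is a minimal sketch, assuming GC content means the percentage of G and C bases over the full gene sequence; `gc_content` is a hypothetical helper, not part of the database.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = "".join(seq.split()).upper()  # drop line breaks and whitespace
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 12 bases of the gene sequence above: 4 G + 2 C out of 12.
print(gc_content("ATGGGTGCAACT"))  # 50.0
```

Passing the complete 1014-base sequence (with or without its line breaks) gives a value that can be compared against the GC content field.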

Upstream 100 bases:

>100_bases
ATGAAAATGAATGAAAAAAATTGAAAAATCTATTGACAAAATCTCTAAGAACAAGTAAAATAACATAAAGCGAAAACGAT
TAAGCCTGGGAGTGATAGGA

Downstream 100 bases:

>100_bases
GGCTATTTTTTTTATTCGCCTTTATGATATACTAGATTTTGGGAGGTGAAATTGGTGGAGAGATTGCTTATCATAGATGA
CGAAGAAATGTTTGTTAAGG

Product: transcriptional regulator

Products: NA

Alternate protein names: Glucose-resistance amylase regulator [H]

Number of amino acids: Translated: 337; Mature: 336

Protein sequence:

>337_residues
MGATIKDVAREAKVSIATVSRVLNNSAVVTEETRQRVLEAIKKTGYKPNALARSLKIQKTHTIGLIIPDISSTFYPEVVR
GIEDIAAMYNYNIFLCNTDQKEDKEIKYIEILGEKQVDGIIFMGDVVRDSVIQAFNEFKVPVVLAGTQDKEKRYPSVMID
NEKAAYDAVKYLISLGHKKIGMIAGSMQDPIAGLQRIEGYKRALEEHGIKYDPELVVEGEFKTRKAYLAMLKLLEHKVTA
VFAASDDMAAAAINAIFDSNLRVPDDIHVVGFDNTYISTIFRPTITTILQPAYDIGAVAMRLLTKLLGKEPIEEMHVILP
HQLIVRESTGFKEGSSR

Sequences:

>Translated_337_residues
MGATIKDVAREAKVSIATVSRVLNNSAVVTEETRQRVLEAIKKTGYKPNALARSLKIQKTHTIGLIIPDISSTFYPEVVR
GIEDIAAMYNYNIFLCNTDQKEDKEIKYIEILGEKQVDGIIFMGDVVRDSVIQAFNEFKVPVVLAGTQDKEKRYPSVMID
NEKAAYDAVKYLISLGHKKIGMIAGSMQDPIAGLQRIEGYKRALEEHGIKYDPELVVEGEFKTRKAYLAMLKLLEHKVTA
VFAASDDMAAAAINAIFDSNLRVPDDIHVVGFDNTYISTIFRPTITTILQPAYDIGAVAMRLLTKLLGKEPIEEMHVILP
HQLIVRESTGFKEGSSR
>Mature_336_residues
GATIKDVAREAKVSIATVSRVLNNSAVVTEETRQRVLEAIKKTGYKPNALARSLKIQKTHTIGLIIPDISSTFYPEVVRG
IEDIAAMYNYNIFLCNTDQKEDKEIKYIEILGEKQVDGIIFMGDVVRDSVIQAFNEFKVPVVLAGTQDKEKRYPSVMIDN
EKAAYDAVKYLISLGHKKIGMIAGSMQDPIAGLQRIEGYKRALEEHGIKYDPELVVEGEFKTRKAYLAMLKLLEHKVTAV
FAASDDMAAAAINAIFDSNLRVPDDIHVVGFDNTYISTIFRPTITTILQPAYDIGAVAMRLLTKLLGKEPIEEMHVILPH
QLIVRESTGFKEGSSR
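
The translated and mature lengths follow from the gene sequence: 1014 bases are 338 codons, the last of which is a stop codon, leaving 337 residues, and the mature form listed above lacks the initial methionine. The sketch below translates a DNA string using the standard codon assignments. `translate` and `mature_form` are illustrative names, and treating "mature" as "initiator Met removed" is an assumption based on the two sequences shown.

```python
from itertools import product

# Standard codon assignments, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def mature_form(protein: str) -> str:
    """Drop a leading Met (assumed meaning of 'mature' in this record)."""
    return protein[1:] if protein.startswith("M") else protein


# First eight codons of the gene sequence in this record.
print(translate("ATGGGTGCAACTATAAAAGATGTA"))               # MGATIKDV
print(mature_form(translate("ATGGGTGCAACTATAAAAGATGTA")))  # GATIKDV
```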

Specific function: Transcriptional regulator involved both in the repression of carbohydrate utilization genes, such as the alpha-amylase (AmyE) and the acetyl-coenzyme A synthetase (AcsA), and in the positive regulation of genes involved in excretion of excess carbon such

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HTH lacI-type DNA-binding domain [H]

Homologues:

Organism=Escherichia coli, GI1790369, Length=331, Percent_Identity=38.368580060423, Blast_Score=223, Evalue=1e-59,
Organism=Escherichia coli, GI1787948, Length=332, Percent_Identity=35.5421686746988, Blast_Score=198, Evalue=3e-52,
Organism=Escherichia coli, GI1790194, Length=330, Percent_Identity=34.5454545454545, Blast_Score=186, Evalue=1e-48,
Organism=Escherichia coli, GI1788474, Length=338, Percent_Identity=30.4733727810651, Blast_Score=159, Evalue=2e-40,
Organism=Escherichia coli, GI1789202, Length=336, Percent_Identity=32.4404761904762, Blast_Score=158, Evalue=5e-40,
Organism=Escherichia coli, GI1789068, Length=303, Percent_Identity=30.03300330033, Blast_Score=157, Evalue=6e-40,
Organism=Escherichia coli, GI1787580, Length=319, Percent_Identity=29.153605015674, Blast_Score=124, Evalue=7e-30,
Organism=Escherichia coli, GI1786540, Length=332, Percent_Identity=27.710843373494, Blast_Score=114, Evalue=7e-27,
Organism=Escherichia coli, GI1790715, Length=309, Percent_Identity=25.5663430420712, Blast_Score=109, Evalue=2e-25,
Organism=Escherichia coli, GI1787906, Length=334, Percent_Identity=25.4491017964072, Blast_Score=108, Evalue=4e-25,
Organism=Escherichia coli, GI48994940, Length=318, Percent_Identity=25.1572327044025, Blast_Score=107, Evalue=1e-24,
Organism=Escherichia coli, GI1790689, Length=324, Percent_Identity=25, Blast_Score=89, Evalue=3e-19,
Organism=Escherichia coli, GI1786268, Length=301, Percent_Identity=22.2591362126246, Blast_Score=79, Evalue=4e-16,
Organism=Escherichia coli, GI1789456, Length=343, Percent_Identity=25.6559766763848, Blast_Score=79, Evalue=6e-16,
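
The homologue lines above share a comma-separated `key=value` layout, with the GI number given as a bare `GI…` token. A minimal parsing sketch follows, assuming every line follows that layout and tolerating an optional trailing comma; `parse_homologue` is a hypothetical helper.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict with typed numeric fields."""
    hit = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            hit[key] = value
        elif field.startswith("GI"):
            hit["GI"] = field[2:]  # bare token such as GI1790369
    hit["Length"] = int(hit["Length"])
    hit["Percent_Identity"] = float(hit["Percent_Identity"])
    hit["Blast_Score"] = int(hit["Blast_Score"])
    hit["Evalue"] = float(hit["Evalue"])
    return hit


line = ("Organism=Escherichia coli, GI1790369, Length=331, "
        "Percent_Identity=38.368580060423, Blast_Score=223, Evalue=1e-59,")
hit = parse_homologue(line)
print(hit["GI"], hit["Length"], round(hit["Percent_Identity"], 1))
```

Parsed records can then be sorted or filtered, for example by `Evalue` or `Blast_Score`.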

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR006377
- InterPro:   IPR000843
- InterPro:   IPR010982
- InterPro:   IPR001761 [H]

Pfam domain/function: PF00356 LacI; PF00532 Peripla_BP_1 [H]

EC number: NA

Molecular weight: Translated: 37549; Mature: 37418

Theoretical pI: Translated: 6.90; Mature: 6.90

Prosite motif: PS00356 HTH_LACI_1; PS50932 HTH_LACI_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.3 %Cys     (Translated Protein)
3.0 %Met     (Translated Protein)
3.3 %Cys+Met (Translated Protein)
0.3 %Cys     (Mature Protein)
2.7 %Met     (Mature Protein)
3.0 %Cys+Met (Mature Protein)
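
Percentages like these can be derived by counting residues in the protein sequence. The sketch below assumes each value is the residue count divided by the sequence length; the one-decimal rounding is inferred from the displayed figures, and `residue_percent` is an illustrative name.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the sequence made up of the given one-letter residues."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in residues)
    return 100.0 * count / len(protein)


# Toy sequence: 1 Cys and 1 Met out of 8 residues.
toy = "MGATCKDV"
print(round(residue_percent(toy, "C"), 1))   # 12.5
print(round(residue_percent(toy, "M"), 1))   # 12.5
print(round(residue_percent(toy, "CM"), 1))  # 25.0
```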

Secondary structure:

>Translated Secondary Structure
MGATIKDVAREAKVSIATVSRVLNNSAVVTEETRQRVLEAIKKTGYKPNALARSLKIQKT
CCCCHHHHHHHHHHHHHHHHHHHCCCCEEEHHHHHHHHHHHHHCCCCCHHHHHHHEEEEE
HTIGLIIPDISSTFYPEVVRGIEDIAAMYNYNIFLCNTDQKEDKEIKYIEILGEKQVDGI
EEEEEEECCCCCHHHHHHHHHHHHHHHHHCCEEEEECCCCCCCCCEEEEEEECCCCCCCE
IFMGDVVRDSVIQAFNEFKVPVVLAGTQDKEKRYPSVMIDNEKAAYDAVKYLISLGHKKI
EEEHHHHHHHHHHHHHHCCCCEEEECCCCHHHCCCCEEECCCHHHHHHHHHHHHHCCHHH
GMIAGSMQDPIAGLQRIEGYKRALEEHGIKYDPELVVEGEFKTRKAYLAMLKLLEHKVTA
HHHHCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCEEEECCCHHHHHHHHHHHHHHHHHHE
VFAASDDMAAAAINAIFDSNLRVPDDIHVVGFDNTYISTIFRPTITTILQPAYDIGAVAM
EEECCCCHHHHHHHHHHCCCCCCCCCEEEEECCCHHHHHHHHHHHHHHHCCHHHHHHHHH
RLLTKLLGKEPIEEMHVILPHQLIVRESTGFKEGSSR
HHHHHHHCCCCHHHHHHHCCCHHEEECCCCCCCCCCC
>Mature Secondary Structure 
GATIKDVAREAKVSIATVSRVLNNSAVVTEETRQRVLEAIKKTGYKPNALARSLKIQKT
CCCHHHHHHHHHHHHHHHHHHHCCCCEEEHHHHHHHHHHHHHCCCCCHHHHHHHEEEEE
HTIGLIIPDISSTFYPEVVRGIEDIAAMYNYNIFLCNTDQKEDKEIKYIEILGEKQVDGI
EEEEEEECCCCCHHHHHHHHHHHHHHHHHCCEEEEECCCCCCCCCEEEEEEECCCCCCCE
IFMGDVVRDSVIQAFNEFKVPVVLAGTQDKEKRYPSVMIDNEKAAYDAVKYLISLGHKKI
EEEHHHHHHHHHHHHHHCCCCEEEECCCCHHHCCCCEEECCCHHHHHHHHHHHHHCCHHH
GMIAGSMQDPIAGLQRIEGYKRALEEHGIKYDPELVVEGEFKTRKAYLAMLKLLEHKVTA
HHHHCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCEEEECCCHHHHHHHHHHHHHHHHHHE
VFAASDDMAAAAINAIFDSNLRVPDDIHVVGFDNTYISTIFRPTITTILQPAYDIGAVAM
EEECCCCHHHHHHHHHHCCCCCCCCCEEEEECCCHHHHHHHHHHHHHHHCCHHHHHHHHH
RLLTKLLGKEPIEEMHVILPHQLIVRESTGFKEGSSR
HHHHHHHCCCCHHHHHHHCCCHHEEECCCCCCCCCCC
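
The state strings above can be summarized as a per-state percentage. This sketch assumes the conventional reading of the letters (H for helix, E for strand, C for coil), which the record itself does not define; `state_percentages` is a hypothetical helper.

```python
from collections import Counter


def state_percentages(states: str) -> dict:
    """Percentage of positions assigned to each secondary-structure state."""
    states = "".join(states.split()).upper()
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {state: 100.0 * n / len(states) for state, n in counts.items()}


# Last state line of the prediction in this record.
summary = state_percentages("HHHHHHHCCCCHHHHHHHCCCHHEEECCCCCCCCCCC")
for state, pct in sorted(summary.items()):
    print(state, round(pct, 1))
```

Concatenating all state lines of one prediction before calling the function gives the composition for the whole protein.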

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 1904524; 9387221; 9384377 [H]