Definition Herpetosiphon aurantiacus ATCC 23779 chromosome, complete genome.
Accession NC_009972
Length 6,346,587

The map label for this gene is purR [H]

Identifier: 159898293

GI number: 159898293

Start: 2058314

End: 2059333

Strand: Reverse

Name: purR [H]

Synonym: Haur_1769

Alternate gene names: 159898293

Gene position: 2059333-2058314 (Counterclockwise)

Preceding gene: 159898294

Following gene: 159898291

Centisome position: 32.45
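
The coordinates above can be cross-checked with a short sketch. It assumes the gene length is end - start + 1 and that the centisome position is the gene's first base expressed as a percentage of the chromosome length. For this reverse-strand gene that first base is the higher coordinate (2059333), which appears consistent with the 32.45 listed. The function names are illustrative and do not come from the source database.

```python
# Values copied from the record above.
GENOME_LENGTH = 6_346_587
START, END = 2_058_314, 2_059_333


def gene_length(start, end):
    """Length in bases of an inclusive coordinate range."""
    return end - start + 1


def centisome(position, genome_length):
    """Position as a percentage of the chromosome length, to 2 decimals."""
    return round(100 * position / genome_length, 2)


length = gene_length(START, END)
# Assumption: for a reverse-strand gene the first base is the higher coordinate.
position = centisome(END, GENOME_LENGTH)
print(length, position)
```

A length of 1020 bases corresponds to 340 codons, that is, 339 residues plus the stop codon, which agrees with the amino acid count given further down.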

GC content: 50.39
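
The GC content is presumably the percentage of G and C bases in the gene sequence that follows. A minimal sketch of that calculation, with a hypothetical helper name, is:

```python
def gc_content(sequence):
    """Percentage of G and C bases in a nucleotide sequence, to 2 decimals."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100 * gc / len(seq), 2)


# First six bases of the gene sequence: G, T, G, C, G, T.
print(gc_content("GTGCGT"))
```

Applying the function to the full 1020-base sequence with line breaks removed should give a value comparable to the one listed, assuming the database uses the same definition.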

Gene sequence:

>1020_bases
GTGCGTTCAAAACAATCGACAGTCACGATTCACGATGTAGCCAAAGCGGCAGGCGTTTCGGTAAGCACAGCCTCGCGAGT
GCTTAATAATAAAGATGATGTATCGCCTGAAACCTATAAAAAAGTTCGGCAGGTGATCGACGACTTGAATTACACTGCTA
GCTTGGCCGCCAAAAGTATGCGCAGCCGTACTACCAACGTCATTGGCTTATTGGTTCCTGAATTGATCGAAGTCTATTAT
CATGAAGTGATCAAAGGTGTTGGTGCAGCAATTGAAAATTCGGGCTACGATTTATTGATTTATACCAGCGGCAGCCCGAC
CCGCAACAAACGAGCCTCGTGGGAGCGTGAGCATGTGGCCTTATTGAGCAGTGGCTTAACCGATGGCTGTATTATCGTCT
CGCCCTCAGCCCCAACCTTCCAAGAAAACGCCAAAATCGTGGTGATCGACCCGCATGGAGCAGGAGCCGAAGTGCCTTCC
GTCGTGGCCACCAACCATGAAGGGGCCATTCAAGCCGTCGATTATTTGGTGAGCCTTGGCCATCGCCGAATTGGCTTTGT
GCAAGGCCATCCCTCGGCGTGGAGTGCGCTGCAACGCTTCGAAGGCTACAAAGATGGTTTGGCAAAGGCCGGCATTAGCT
TCGAAGCAGCCTTGGTGTGTGAAGGTGATTTTACCTCGGCCTGCGGCAAAAACGCTACCTATCGCCTGATGAACCAGCCC
AATCCACCAACCGCAATCTTTGCCGCCAACGATCGCACAGCGATTGGCGTGCTTGAAGCAGCCAAGGAACTCAGCCTCAA
TGTGCCGCAAGATCTCTCGGTGATTGGCTTCGATAATATTCCTGAAACCATGCAAACCACGCCCCGTCTAACCACGATCG
ATCAATCGATTCGTGAAATGGGATCATTGGGAGTCCAGCTATTAATTGATATGTTACAAAATCGTGAATCAAGCCAATTA
TTGCACACCGTGCCAACCCGCTTGGTCATTCGCGATTCCTGTTGGATGCTTAAACAATAA

Upstream 100 bases:

>100_bases
CCCTCAGGCCAATTAATTGACATTTAGCACCAACCTGTTATCATGATATCGTTTCCGGTATCGTTTCCAGAGATTTTCCT
TTAAATATTGAGGTATTGCT

Downstream 100 bases:

>100_bases
CAAGCGAGGCCGAATAGGCTTCGATCTCGCTTAAATGCGCTGCGATACAAACCTCCGCTTTACAGCATCTAGGCCAAATT
TAGCCCACGACGGCGCAGTT

Product: LacI family transcription regulator

Products: NA

Alternate protein names: Pur regulon repressor; Purine nucleotide synthesis repressor [H]

Number of amino acids: Translated: 339; Mature: 339

Protein sequence:

>339_residues
MRSKQSTVTIHDVAKAAGVSVSTASRVLNNKDDVSPETYKKVRQVIDDLNYTASLAAKSMRSRTTNVIGLLVPELIEVYY
HEVIKGVGAAIENSGYDLLIYTSGSPTRNKRASWEREHVALLSSGLTDGCIIVSPSAPTFQENAKIVVIDPHGAGAEVPS
VVATNHEGAIQAVDYLVSLGHRRIGFVQGHPSAWSALQRFEGYKDGLAKAGISFEAALVCEGDFTSACGKNATYRLMNQP
NPPTAIFAANDRTAIGVLEAAKELSLNVPQDLSVIGFDNIPETMQTTPRLTTIDQSIREMGSLGVQLLIDMLQNRESSQL
LHTVPTRLVIRDSCWMLKQ
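
The protein sequence can be reproduced from the gene sequence with the standard genetic code. The gene starts with GTG, which encodes valine internally but is read as methionine when it serves as the initiation codon. The sketch below assumes the standard bacterial start codons (ATG, GTG, TTG). The function name is illustrative.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
START_CODONS = {"ATG", "GTG", "TTG"}  # assumption: usual bacterial initiators


def translate(cds):
    """Translate a coding sequence up to the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        codon = cds[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGCGTTCAAAACAATCGACAGTCACGATT"))  # MRSKQSTVTI
```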

Sequences:

>Translated_339_residues
MRSKQSTVTIHDVAKAAGVSVSTASRVLNNKDDVSPETYKKVRQVIDDLNYTASLAAKSMRSRTTNVIGLLVPELIEVYY
HEVIKGVGAAIENSGYDLLIYTSGSPTRNKRASWEREHVALLSSGLTDGCIIVSPSAPTFQENAKIVVIDPHGAGAEVPS
VVATNHEGAIQAVDYLVSLGHRRIGFVQGHPSAWSALQRFEGYKDGLAKAGISFEAALVCEGDFTSACGKNATYRLMNQP
NPPTAIFAANDRTAIGVLEAAKELSLNVPQDLSVIGFDNIPETMQTTPRLTTIDQSIREMGSLGVQLLIDMLQNRESSQL
LHTVPTRLVIRDSCWMLKQ
>Mature_339_residues
MRSKQSTVTIHDVAKAAGVSVSTASRVLNNKDDVSPETYKKVRQVIDDLNYTASLAAKSMRSRTTNVIGLLVPELIEVYY
HEVIKGVGAAIENSGYDLLIYTSGSPTRNKRASWEREHVALLSSGLTDGCIIVSPSAPTFQENAKIVVIDPHGAGAEVPS
VVATNHEGAIQAVDYLVSLGHRRIGFVQGHPSAWSALQRFEGYKDGLAKAGISFEAALVCEGDFTSACGKNATYRLMNQP
NPPTAIFAANDRTAIGVLEAAKELSLNVPQDLSVIGFDNIPETMQTTPRLTTIDQSIREMGSLGVQLLIDMLQNRESSQL
LHTVPTRLVIRDSCWMLKQ

Specific function: Is the main repressor of the genes involved in the de novo synthesis of purine nucleotides, regulating purB, purC, purEK, purF, purHD, purL, purMN and guaBA expression. PurR is allosterically activated to bind its cognate DNA by binding the purine corepre

COG id: COG1609

COG function: function code K; Transcriptional regulators

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HTH lacI-type DNA-binding domain [H]

Homologues:

Organism=Escherichia coli, GI1790369, Length=342, Percent_Identity=30.9941520467836, Blast_Score=174, Evalue=7e-45,
Organism=Escherichia coli, GI1787948, Length=334, Percent_Identity=33.2335329341317, Blast_Score=166, Evalue=2e-42,
Organism=Escherichia coli, GI1790194, Length=309, Percent_Identity=33.0097087378641, Blast_Score=152, Evalue=4e-38,
Organism=Escherichia coli, GI1789068, Length=338, Percent_Identity=30.4733727810651, Blast_Score=134, Evalue=1e-32,
Organism=Escherichia coli, GI1787580, Length=335, Percent_Identity=27.7611940298507, Blast_Score=133, Evalue=2e-32,
Organism=Escherichia coli, GI1789202, Length=331, Percent_Identity=28.3987915407855, Blast_Score=123, Evalue=1e-29,
Organism=Escherichia coli, GI1788474, Length=306, Percent_Identity=29.7385620915033, Blast_Score=119, Evalue=2e-28,
Organism=Escherichia coli, GI1786540, Length=343, Percent_Identity=28.8629737609329, Blast_Score=118, Evalue=5e-28,
Organism=Escherichia coli, GI1789456, Length=338, Percent_Identity=28.4023668639053, Blast_Score=112, Evalue=3e-26,
Organism=Escherichia coli, GI1787906, Length=346, Percent_Identity=26.878612716763, Blast_Score=102, Evalue=3e-23,
Organism=Escherichia coli, GI48994940, Length=314, Percent_Identity=24.8407643312102, Blast_Score=96, Evalue=3e-21,
Organism=Escherichia coli, GI1790715, Length=328, Percent_Identity=22.5609756097561, Blast_Score=77, Evalue=1e-15,
Organism=Escherichia coli, GI1790689, Length=326, Percent_Identity=25.1533742331288, Blast_Score=74, Evalue=1e-14,
Organism=Escherichia coli, GI1786268, Length=283, Percent_Identity=23.6749116607774, Blast_Score=61, Evalue=9e-11,
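
Each homologue line is a comma-separated list of key=value fields plus a bare GI token. A minimal parsing sketch, assuming that layout holds for every line (the function name and the "GI" key are illustrative), is:

```python
def parse_homologue(line):
    """Parse one homologue line into a dict with typed values."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:
            continue  # skip the empty field left by the trailing comma
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]
    record["Length"] = int(record["Length"])
    # Rounded to one decimal for readability.
    record["Percent_Identity"] = round(float(record["Percent_Identity"]), 1)
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1790369, Length=342, "
    "Percent_Identity=30.9941520467836, Blast_Score=174, Evalue=7e-45,"
)
print(hit["GI"], hit["Percent_Identity"], hit["Evalue"])
```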

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000843
- InterPro:   IPR010982
- InterPro:   IPR001761 [H]

Pfam domain/function: PF00356 LacI; PF00532 Peripla_BP_1 [H]

EC number: NA

Molecular weight: Translated: 36735; Mature: 36735

Theoretical pI: Translated: 6.72; Mature: 6.72

Prosite motif: PS00356 HTH_LACI_1; PS50932 HTH_LACI_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.2 %Cys     (Translated Protein)
2.1 %Met     (Translated Protein)
3.2 %Cys+Met (Translated Protein)
1.2 %Cys     (Mature Protein)
2.1 %Met     (Mature Protein)
3.2 %Cys+Met (Mature Protein)
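
These percentages follow from residue counts. The 339-residue sequence above contains 4 Cys and 7 Met, giving 1.2 %, 2.1 % and 3.2 % when rounded to one decimal. A minimal sketch of the calculation, with a hypothetical function name, is:

```python
def cys_met_content(protein):
    """Percent Cys, Met and Cys+Met in a protein sequence, to 1 decimal."""
    n = len(protein)
    if n == 0:
        raise ValueError("empty sequence")
    cys = protein.count("C")
    met = protein.count("M")
    return {
        "Cys": round(100 * cys / n, 1),
        "Met": round(100 * met / n, 1),
        "Cys+Met": round(100 * (cys + met) / n, 1),
    }


# Last 19 residues of the protein sequence: one Cys and one Met.
print(cys_met_content("LHTVPTRLVIRDSCWMLKQ"))
```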

Secondary structure:

>Translated Secondary Structure
MRSKQSTVTIHDVAKAAGVSVSTASRVLNNKDDVSPETYKKVRQVIDDLNYTASLAAKSM
CCCCCCEEEHHHHHHHHCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHH
RSRTTNVIGLLVPELIEVYYHEVIKGVGAAIENSGYDLLIYTSGSPTRNKRASWEREHVA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCHHHHHHH
LLSSGLTDGCIIVSPSAPTFQENAKIVVIDPHGAGAEVPSVVATNHEGAIQAVDYLVSLG
HHHCCCCCCEEEECCCCCCHHCCCEEEEECCCCCCCCCCCEEECCCCCHHHHHHHHHHCC
HRRIGFVQGHPSAWSALQRFEGYKDGLAKAGISFEAALVCEGDFTSACGKNATYRLMNQP
CCCEEEEECCCHHHHHHHHHCCHHHHHHHCCCCEEEEEEECCCCHHHCCCCCEEEECCCC
NPPTAIFAANDRTAIGVLEAAKELSLNVPQDLSVIGFDNIPETMQTTPRLTTIDQSIREM
CCCEEEEECCCCCHHHHHHHHHHHCCCCCCCCEEEECCCCCHHHHCCCCHHHHHHHHHHH
GSLGVQLLIDMLQNRESSQLLHTVPTRLVIRDSCWMLKQ
HHHHHHHHHHHHHCCCHHHHHHHHCHHEEEECCHHHCCC
>Mature Secondary Structure
MRSKQSTVTIHDVAKAAGVSVSTASRVLNNKDDVSPETYKKVRQVIDDLNYTASLAAKSM
CCCCCCEEEHHHHHHHHCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHH
RSRTTNVIGLLVPELIEVYYHEVIKGVGAAIENSGYDLLIYTSGSPTRNKRASWEREHVA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCHHHHHHH
LLSSGLTDGCIIVSPSAPTFQENAKIVVIDPHGAGAEVPSVVATNHEGAIQAVDYLVSLG
HHHCCCCCCEEEECCCCCCHHCCCEEEEECCCCCCCCCCCEEECCCCCHHHHHHHHHHCC
HRRIGFVQGHPSAWSALQRFEGYKDGLAKAGISFEAALVCEGDFTSACGKNATYRLMNQP
CCCEEEEECCCHHHHHHHHHCCHHHHHHHCCCCEEEEEEECCCCHHHCCCCCEEEECCCC
NPPTAIFAANDRTAIGVLEAAKELSLNVPQDLSVIGFDNIPETMQTTPRLTTIDQSIREM
CCCEEEEECCCCCHHHHHHHHHHHCCCCCCCCEEEECCCCCHHHHCCCCHHHHHHHHHHH
GSLGVQLLIDMLQNRESSQLLHTVPTRLVIRDSCWMLKQ
HHHHHHHHHHHHHCCCHHHHHHHHCHHEEEECCHHHCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: NA