Definition | Xanthomonas oryzae pv. oryzae MAFF 311018, complete genome. |
---|---|
Accession | NC_007705 |
Length | 4,940,217 |
The map label for this gene is purR [H]
Identifier: 84625354
GI number: 84625354
Start: 4176176
End: 4177252
Strand: Reverse
Name: purR [H]
Synonym: XOO_3697
Alternate gene names: 84625354
Gene position: 4177252-4176176 (Counterclockwise)
Preceding gene: 84625355
Following gene: 84625353
Centisome position: 84.56
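The coordinate fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates. It also assumes that the centisome position is the gene's first transcribed base (the higher coordinate for a reverse-strand gene) expressed as a percentage of the genome length. The record does not state the formula the database uses, and the function names are hypothetical.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based, inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length (assumed definition)."""
    return 100.0 * position / genome_length


GENOME_LENGTH = 4_940_217          # Length field of NC_007705
START, END = 4_176_176, 4_177_252  # Start / End fields of this gene

length = gene_length(START, END)
# Reverse strand: the gene begins at the higher coordinate (End).
position = centisome(END, GENOME_LENGTH)

print(length)              # 1077
print(round(position, 2))  # 84.56
```

Under these assumptions both values agree with the record: 1077 bases and a centisome position of 84.56.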
GC content: 64.72
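GC content is conventionally the percentage of G and C bases in a sequence. A minimal sketch of that calculation follows, assuming the value above was computed over the gene sequence alone (the record does not say). The function name is hypothetical.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 20 bases of the gene sequence in this record (11 of 20 are G or C).
print(gc_content("ATGCGCAGGC CTACCATCAA"))  # 55.0
```

Because whitespace is stripped first, the function can be given the full gene sequence with its block spacing intact.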
Gene sequence:
>1077_bases ATGCGCAGGCCTACCATCAAAGATGTCGCCGAACGCGCCAAGGTCTCGTTGAAGACCGTGTCGCGGGTGATCAACAACGA GCCGTCAGTGATGCAGGCCACGCGTGCGCGGGTGCTACGCGCCATTGCCGATCTCGACTACGAACCCGACCCGTCCGCGC GCAATCTGCGCAGCGGCACGCCGTTCGTGATCGGGGTGGTCTACGACAACCCCAACCCGTACCACATCATCGGCATCCAG AACGGCGTGCTGGCTGCGTGCCGTGAAACCGGTTTTGGCTTGCAGATCCACCCCTGCGATTCGACCTCGCCGTTGCTGGC CGAAGAACTGGCCGAATGGGTGCAGCGCTCGCGTCTGGCCGGTGTGGTGCTGACCGCACCGATCTCCGAACGCCCGGAGT TGCTGGCCGGTCTGGCGGCGCGCGGTATCAAGAGCGTACGCATCATCGCCGCCACCGATGACCCGGGCAATGGCCCGTGC GTGTATATCGACGACCGCGATGCCGCGTATGAAATCACCGAGCATCTGATCCAGCTCGGCCATCAACGCATCGGCTTCTT GTGGGGTGGTCCGCAGCATCGTTCCAGCGGCGAGCGCTATGCCGGCTACGAAGCGGCGTTGAAGGACTATGGCATTGCGC TGGACAAGCACCTGGTCATTCCCGGCGATTACACCTTCGACGATGGCTTCCGTGGTGCACGTCGCTTGCTGTCGCTGCGC GAGCCACCCACCGCCATCTTCGGCAGCAACGACGAAATCGCCGCCGGCGTGTTGGCCGCAGCCAGATCCACCAGCATGAA CGTCCCGTACGACTTGTCGATTGCCGGGTTCGAAGACAGCCCGTTTTCGCGCCAGTCGTGGCCGGCATTGACCACAGCCA AGCAGGCCACCGACGACATCGCGCGGCATGCCGCACGCCTGTTGATCAGCCAGCTGCGCAGCGATGCCTACGACGACCAG CCCGCGCAACTGCAGAACCGTGGCTTCGTGCCGCAACTGGTGGTGCGCGGCTCCACCGCGCCGGCGCCTGCGTCCACCGG CAAACCCCTTCCCCCGAATCCGCCTGAGCCACGATGA
Upstream 100 bases:
>100_bases TACATCCTGTTCTATTCGCTGCGCGGTCACCGCGTGGGCCTGCCGGCCCAGGCCAAGTGATCGCGGCGGCATGATGCGTT CCGACGGACAAGGTACATCC
Downstream 100 bases:
>100_bases GCCTTCCCACCGAAACCGATACCCTGATGTTCCGCGAAGCGGCGCAGACCGCCGATGTGGTTGCTGCACAGTTCGCACGT AACGCCGACACCATCGCTGC
Product: LacI family transcription regulator
Products: NA
Alternate protein names: Pur regulon repressor; Purine nucleotide synthesis repressor [H]
Number of amino acids: Translated: 358; Mature: 358
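The residue count follows from the gene length: 1077 bases give 359 codons, and the final codon (TGA) is a stop, leaving 358 residues. The sketch below illustrates this with a compact standard codon table. For these codons the table has the same assignments as the bacterial table, which differs only in permitted start codons. The names `CODON_TABLE` and `translate` are hypothetical, and the mature protein is assumed to be unprocessed, as the identical Translated and Mature counts suggest.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence up to (not including) the first stop codon.

    Raises KeyError on codons with non-ACGT characters.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGCGCAGGCCTACCATCAAAGATGTCGCC"))  # MRRPTIKDVA
# Codons in 1077 bases, minus the terminal stop codon.
print(1077 // 3 - 1)  # 358
```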
Protein sequence:
>358_residues MRRPTIKDVAERAKVSLKTVSRVINNEPSVMQATRARVLRAIADLDYEPDPSARNLRSGTPFVIGVVYDNPNPYHIIGIQ NGVLAACRETGFGLQIHPCDSTSPLLAEELAEWVQRSRLAGVVLTAPISERPELLAGLAARGIKSVRIIAATDDPGNGPC VYIDDRDAAYEITEHLIQLGHQRIGFLWGGPQHRSSGERYAGYEAALKDYGIALDKHLVIPGDYTFDDGFRGARRLLSLR EPPTAIFGSNDEIAAGVLAAARSTSMNVPYDLSIAGFEDSPFSRQSWPALTTAKQATDDIARHAARLLISQLRSDAYDDQ PAQLQNRGFVPQLVVRGSTAPAPASTGKPLPPNPPEPR
Sequences:
>Translated_358_residues MRRPTIKDVAERAKVSLKTVSRVINNEPSVMQATRARVLRAIADLDYEPDPSARNLRSGTPFVIGVVYDNPNPYHIIGIQ NGVLAACRETGFGLQIHPCDSTSPLLAEELAEWVQRSRLAGVVLTAPISERPELLAGLAARGIKSVRIIAATDDPGNGPC VYIDDRDAAYEITEHLIQLGHQRIGFLWGGPQHRSSGERYAGYEAALKDYGIALDKHLVIPGDYTFDDGFRGARRLLSLR EPPTAIFGSNDEIAAGVLAAARSTSMNVPYDLSIAGFEDSPFSRQSWPALTTAKQATDDIARHAARLLISQLRSDAYDDQ PAQLQNRGFVPQLVVRGSTAPAPASTGKPLPPNPPEPR >Mature_358_residues MRRPTIKDVAERAKVSLKTVSRVINNEPSVMQATRARVLRAIADLDYEPDPSARNLRSGTPFVIGVVYDNPNPYHIIGIQ NGVLAACRETGFGLQIHPCDSTSPLLAEELAEWVQRSRLAGVVLTAPISERPELLAGLAARGIKSVRIIAATDDPGNGPC VYIDDRDAAYEITEHLIQLGHQRIGFLWGGPQHRSSGERYAGYEAALKDYGIALDKHLVIPGDYTFDDGFRGARRLLSLR EPPTAIFGSNDEIAAGVLAAARSTSMNVPYDLSIAGFEDSPFSRQSWPALTTAKQATDDIARHAARLLISQLRSDAYDDQ PAQLQNRGFVPQLVVRGSTAPAPASTGKPLPPNPPEPR
Specific function: Is the main repressor of the genes involved in the de novo synthesis of purine nucleotides, regulating purB, purC, purEK, purF, purHD, purL, purMN and guaBA expression. PurR is allosterically activated to bind its cognate DNA by binding the purine corepre
COG id: COG1609
COG function: function code K; Transcriptional regulators
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 HTH lacI-type DNA-binding domain [H]
Homologues:
Organism | GI | Length | Percent identity | BLAST score | E-value |
---|---|---|---|---|---|
Escherichia coli | 1790369 | 341 | 31.6715542521994 | 147 | 9e-37 |
Escherichia coli | 1787948 | 342 | 30.7017543859649 | 130 | 1e-31 |
Escherichia coli | 1790194 | 344 | 30.8139534883721 | 130 | 1e-31 |
Escherichia coli | 1789202 | 293 | 33.4470989761092 | 122 | 3e-29 |
Escherichia coli | 1789068 | 292 | 29.7945205479452 | 113 | 1e-26 |
Escherichia coli | 1788474 | 343 | 30.0291545189504 | 113 | 2e-26 |
Escherichia coli | 1787580 | 316 | 27.8481012658228 | 104 | 1e-23 |
Escherichia coli | 1786540 | 300 | 26.3333333333333 | 103 | 2e-23 |
Escherichia coli | 1789456 | 350 | 28 | 97 | 1e-21 |
Escherichia coli | 48994940 | 321 | 24.2990654205607 | 87 | 2e-18 |
Escherichia coli | 1790689 | 315 | 26.031746031746 | 76 | 4e-15 |
Escherichia coli | 1790715 | 330 | 23.030303030303 | 74 | 1e-14 |
Escherichia coli | 1787906 | 297 | 27.6094276094276 | 71 | 1e-13 |
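
Homologue hits serialized as a flat string of `Organism=..., GI..., Length=..., Percent_Identity=..., Blast_Score=..., Evalue=...` fields can be turned into records for sorting or filtering. The following is a minimal sketch under that assumption. The `Hit` type, `HIT_PATTERN`, and `parse_hits` are hypothetical names, not part of any database tooling.

```python
import re
from typing import List, NamedTuple


class Hit(NamedTuple):
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


# One hit = six comma-separated fields in a fixed order.
HIT_PATTERN = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*"
    r"GI(?P<gi>\d+),\s*"
    r"Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<identity>[\d.]+),\s*"
    r"Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_hits(text: str) -> List[Hit]:
    """Extract every hit found in a flat homologue string."""
    return [
        Hit(
            organism=m["organism"].strip(),
            gi=m["gi"],
            length=int(m["length"]),
            percent_identity=float(m["identity"]),
            blast_score=int(m["score"]),
            evalue=float(m["evalue"]),
        )
        for m in HIT_PATTERN.finditer(text)
    ]


# Two of the E. coli hits listed in this record.
sample = (
    "Organism=Escherichia coli, GI1790369, Length=341, "
    "Percent_Identity=31.6715542521994, Blast_Score=147, Evalue=9e-37, "
    "Organism=Escherichia coli, GI1789456, Length=350, "
    "Percent_Identity=28, Blast_Score=97, Evalue=1e-21,"
)
hits = parse_hits(sample)
best = max(hits, key=lambda hit: hit.blast_score)
print(best.gi, best.evalue)  # 1790369 9e-37
```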
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000843
- InterPro: IPR010982
- InterPro: IPR001761 [H]
Pfam domain/function: PF00356 LacI; PF00532 Peripla_BP_1 [H]
EC number: NA
Molecular weight: Translated: 38857; Mature: 38857
Theoretical pI: Translated: 6.88; Mature: 6.88
Prosite motif: PS00356 HTH_LACI_1; PS50932 HTH_LACI_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.8 %Cys (Translated Protein)
0.8 %Met (Translated Protein)
1.7 %Cys+Met (Translated Protein)
0.8 %Cys (Mature Protein)
0.8 %Met (Mature Protein)
1.7 %Cys+Met (Mature Protein)
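
These percentages are residue counts divided by the protein length. The 358-residue sequence in this record contains 3 Cys and 3 Met, which gives the values above after rounding to one decimal place. A minimal sketch of the calculation, with a hypothetical function name:

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions whose residue is one of `residues`."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    count = sum(1 for aa in seq if aa in residues)
    return 100.0 * count / len(seq)


# Toy input: one Cys and one Met in four residues.
print(residue_percent("MCAA", "C"))   # 25.0
print(residue_percent("MCAA", "CM"))  # 50.0

# Arithmetic for this record: 3 Cys and 3 Met in 358 residues.
print(round(100 * 3 / 358, 1))  # 0.8
print(round(100 * 6 / 358, 1))  # 1.7
```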
Secondary structure:
>Translated Secondary Structure MRRPTIKDVAERAKVSLKTVSRVINNEPSVMQATRARVLRAIADLDYEPDPSARNLRSGT CCCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC PFVIGVVYDNPNPYHIIGIQNGVLAACRETGFGLQIHPCDSTSPLLAEELAEWVQRSRLA CEEEEEEECCCCCEEEEEECCCHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHHHC GVVLTAPISERPELLAGLAARGIKSVRIIAATDDPGNGPCVYIDDRDAAYEITEHLIQLG CEEEECCCCCCHHHHHHHHHCCCCEEEEEEEECCCCCCCEEEECCCCHHHHHHHHHHHHC HQRIGFLWGGPQHRSSGERYAGYEAALKDYGIALDKHLVIPGDYTFDDGFRGARRLLSLR HHHEEEEECCCCCCCCCCCCCCHHHHHHHHCEEECCEEECCCCCCCCCCHHHHHHHHHCC EPPTAIFGSNDEIAAGVLAAARSTSMNVPYDLSIAGFEDSPFSRQSWPALTTAKQATDDI CCCCEEECCCCCHHHHHHHHHHCCCCCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHH ARHAARLLISQLRSDAYDDQPAQLQNRGFVPQLVVRGSTAPAPASTGKPLPPNPPEPR HHHHHHHHHHHHHHCCCCCCCHHHHHCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCC >Mature Secondary Structure MRRPTIKDVAERAKVSLKTVSRVINNEPSVMQATRARVLRAIADLDYEPDPSARNLRSGT CCCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC PFVIGVVYDNPNPYHIIGIQNGVLAACRETGFGLQIHPCDSTSPLLAEELAEWVQRSRLA CEEEEEEECCCCCEEEEEECCCHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHHHC GVVLTAPISERPELLAGLAARGIKSVRIIAATDDPGNGPCVYIDDRDAAYEITEHLIQLG CEEEECCCCCCHHHHHHHHHCCCCEEEEEEEECCCCCCCEEEECCCCHHHHHHHHHHHHC HQRIGFLWGGPQHRSSGERYAGYEAALKDYGIALDKHLVIPGDYTFDDGFRGARRLLSLR HHHEEEEECCCCCCCCCCCCCCHHHHHHHHCEEECCEEECCCCCCCCCCHHHHHHHHHCC EPPTAIFGSNDEIAAGVLAAARSTSMNVPYDLSIAGFEDSPFSRQSWPALTTAKQATDDI CCCCEEECCCCCHHHHHHHHHHCCCCCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHH ARHAARLLISQLRSDAYDDQPAQLQNRGFVPQLVVRGSTAPAPASTGKPLPPNPPEPR HHHHHHHHHHHHHHCCCCCCCHHHHHCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: NA