Definition Escherichia coli HS, complete genome.
Accession NC_009800
Length 4,643,538

Click here to switch to the map view.

The map label for this gene is deoR

Identifier: 157160320

GI number: 157160320

Start: 910812

End: 911570

Strand: Reverse

Name: deoR

Synonym: EcHS_A0901

Alternate gene names: 157160320

Gene position: 911570-910812 (Counterclockwise)

Preceding gene: 157160321

Following gene: 157160318

Centisome position: 19.63

GC content: 50.72

Gene sequence:

>759_bases
ATGGAAACACGTCGCGAAGAGCGTATCGGGCAGCTGCTGCAAGAATTAAAACGCAGCGATAAGTTACATCTTAAAGACGC
CGCCGCCCTGCTTGGGGTTTCGGAGATGACGATTCGTCGCGATCTGAACAACCACAGTGCGCCCGTCGTTTTGCTCGGCG
GCTATATTGTTCTGGAACCGCGCAGTGCCAGCCATTACCTGTTAAGCGATCAAAAATCCCGCCTGGTGGAAGAAAAACGC
CGGGCGGCAAAACTGGCTGCGACGCTGGTAGAACCCGATCAGACCCTCTTTTTTGACTGTGGCACCACCACGCCGTGGAT
TATTGAAGCGATTGATAATGAAATCCCTTTTACCGCCGTTTGTTATTCGCTAAATACCTTTCTGGCGCTGAAAGAGAAAC
CCCATTGCCGCGCGTTTCTTTGCGGTGGTGAATTTCACGCCAGCAACGCCATTTTCAAACCCATCGATTTTCAGCAAACG
CTGAATAATTTTTGCCCGGATATCGCTTTTTATTCTGCGGCGGGCGTGCATGTCAGTAAAGGCGCTACCTGTTTTAATCT
TGAAGAGTTGCCGGTAAAACACTGGGCCATGTCGATGGCGCAAAAGCATGTGCTGGTTGTCGACCACAGTAAATTTGGCA
AGGTGCGTCCGGCGCGCATGGGTGACCTGAAACGCTTTGATATTGTGGTGAGCGATTGTTGCCCGGAAGATGAGTATGTG
AAGTACGCGCAGACGCAGCGCATTAAGTTGATGTATTAA

Upstream 100 bases:

>100_bases
CTGTTTTGCATTACCGATCCGCAAAGGCTGGGTGCGTGACTGATCTCTGCTAAAAAGTGTAGTATTGAGCGGCTCGCTTC
AATAACTATTCAGAGGGATT

Downstream 100 bases:

>100_bases
TGACGTATAACCGGATGACGTTTCGCGCCATCCGGTTATCAGAAGATTAAGAGAACCAGCTGCCGAACCACTGATGGAAT
TTCATCATCACGAAATCCCA

Product: DNA-binding transcriptional repressor DeoR

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 252; Mature: 252

Protein sequence:

>252_residues
METRREERIGQLLQELKRSDKLHLKDAAALLGVSEMTIRRDLNNHSAPVVLLGGYIVLEPRSASHYLLSDQKSRLVEEKR
RAAKLAATLVEPDQTLFFDCGTTTPWIIEAIDNEIPFTAVCYSLNTFLALKEKPHCRAFLCGGEFHASNAIFKPIDFQQT
LNNFCPDIAFYSAAGVHVSKGATCFNLEELPVKHWAMSMAQKHVLVVDHSKFGKVRPARMGDLKRFDIVVSDCCPEDEYV
KYAQTQRIKLMY

Sequences:

>Translated_252_residues
METRREERIGQLLQELKRSDKLHLKDAAALLGVSEMTIRRDLNNHSAPVVLLGGYIVLEPRSASHYLLSDQKSRLVEEKR
RAAKLAATLVEPDQTLFFDCGTTTPWIIEAIDNEIPFTAVCYSLNTFLALKEKPHCRAFLCGGEFHASNAIFKPIDFQQT
LNNFCPDIAFYSAAGVHVSKGATCFNLEELPVKHWAMSMAQKHVLVVDHSKFGKVRPARMGDLKRFDIVVSDCCPEDEYV
KYAQTQRIKLMY
>Mature_252_residues
METRREERIGQLLQELKRSDKLHLKDAAALLGVSEMTIRRDLNNHSAPVVLLGGYIVLEPRSASHYLLSDQKSRLVEEKR
RAAKLAATLVEPDQTLFFDCGTTTPWIIEAIDNEIPFTAVCYSLNTFLALKEKPHCRAFLCGGEFHASNAIFKPIDFQQT
LNNFCPDIAFYSAAGVHVSKGATCFNLEELPVKHWAMSMAQKHVLVVDHSKFGKVRPARMGDLKRFDIVVSDCCPEDEYV
KYAQTQRIKLMY

Specific function: This protein is one of the repressors that regulate the expression of deoCABD genes, which encode nucleotide and deoxy ribonucleotide catabolizing enzymes. It also negatively regulates the expression of nupG (a transport protein) and tsx (a pore- forming

COG id: COG1349

COG function: function code KG; Transcriptional regulators of sugar metabolism

Gene ontology:

Cell location: Cytoplasm [C]

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HTH deoR-type DNA-binding domain

Homologues:

Organism=Escherichia coli, GI1787063, Length=252, Percent_Identity=100, Blast_Score=528, Evalue=1e-151,
Organism=Escherichia coli, GI1789519, Length=240, Percent_Identity=27.5, Blast_Score=94, Evalue=7e-21,
Organism=Escherichia coli, GI1789829, Length=253, Percent_Identity=26.8774703557312, Blast_Score=86, Evalue=2e-18,
Organism=Escherichia coli, GI1788069, Length=250, Percent_Identity=21.2, Blast_Score=72, Evalue=5e-14,
Organism=Escherichia coli, GI1790753, Length=243, Percent_Identity=23.8683127572016, Blast_Score=72, Evalue=5e-14,
Organism=Escherichia coli, GI1787540, Length=233, Percent_Identity=26.6094420600858, Blast_Score=71, Evalue=6e-14,
Organism=Escherichia coli, GI226510968, Length=210, Percent_Identity=30.4761904761905, Blast_Score=69, Evalue=3e-13,
Organism=Escherichia coli, GI1789059, Length=231, Percent_Identity=24.2424242424242, Blast_Score=65, Evalue=3e-12,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): DEOR_ECO57 (P0ACK7)

Other databases:

- EMBL:   AE005174
- EMBL:   BA000007
- PIR:   D85594
- PIR:   H90743
- RefSeq:   NP_286606.1
- RefSeq:   NP_308947.1
- ProteinModelPortal:   P0ACK7
- SMR:   P0ACK7
- EnsemblBacteria:   EBESCT00000028062
- EnsemblBacteria:   EBESCT00000058401
- GeneID:   917661
- GeneID:   958158
- GenomeReviews:   AE005174_GR
- GenomeReviews:   BA000007_GR
- KEGG:   ece:Z1067
- KEGG:   ecs:ECs0920
- GeneTree:   EBGT00050000009213
- HOGENOM:   HBG300846
- OMA:   GICYSLN
- ProtClustDB:   PRK10681
- BioCyc:   ECOL83334:ECS0920-MONOMER
- GO:   GO:0005622
- InterPro:   IPR014036
- InterPro:   IPR001034
- InterPro:   IPR018356
- InterPro:   IPR011991
- Gene3D:   G3DSA:1.10.10.10
- SMART:   SM00420

Pfam domain/function: PF00455 DeoR; PF08220 HTH_DeoR

EC number: NA

Molecular weight: Translated: 28548; Mature: 28548

Theoretical pI: Translated: 7.78; Mature: 7.78

Prosite motif: PS00894 HTH_DEOR_1; PS51000 HTH_DEOR_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

3.2 %Cys     (Translated Protein)
2.4 %Met     (Translated Protein)
5.6 %Cys+Met (Translated Protein)
3.2 %Cys     (Mature Protein)
2.4 %Met     (Mature Protein)
5.6 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
METRREERIGQLLQELKRSDKLHLKDAAALLGVSEMTIRRDLNNHSAPVVLLGGYIVLEP
CCCHHHHHHHHHHHHHHHCCCEEHHHHHHHHHHHHHHHHHHCCCCCCCEEEECCEEEECC
RSASHYLLSDQKSRLVEEKRRAAKLAATLVEPDQTLFFDCGTTTPWIIEAIDNEIPFTAV
CCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCHHHEECCCCCCHHHH
CYSLNTFLALKEKPHCRAFLCGGEFHASNAIFKPIDFQQTLNNFCPDIAFYSAAGVHVSK
HHHHHHHEEECCCCCCEEEEECCEEECCCCEECCCCHHHHHHHHCCCHHHHHCCCEEEEC
GATCFNLEELPVKHWAMSMAQKHVLVVDHSKFGKVRPARMGDLKRFDIVVSDCCPEDEYV
CCEEECHHHCCHHHHHHHHHCCEEEEEECCCCCCCCCCHHCCCHHEEEEEECCCCCHHHH
KYAQTQRIKLMY
HHHCCCEEEEEC
>Mature Secondary Structure
METRREERIGQLLQELKRSDKLHLKDAAALLGVSEMTIRRDLNNHSAPVVLLGGYIVLEP
CCCHHHHHHHHHHHHHHHCCCEEHHHHHHHHHHHHHHHHHHCCCCCCCEEEECCEEEECC
RSASHYLLSDQKSRLVEEKRRAAKLAATLVEPDQTLFFDCGTTTPWIIEAIDNEIPFTAV
CCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCHHHEECCCCCCHHHH
CYSLNTFLALKEKPHCRAFLCGGEFHASNAIFKPIDFQQTLNNFCPDIAFYSAAGVHVSK
HHHHHHHEEECCCCCCEEEEECCEEECCCCEECCCCHHHHHHHHCCCHHHHHCCCEEEEC
GATCFNLEELPVKHWAMSMAQKHVLVVDHSKFGKVRPARMGDLKRFDIVVSDCCPEDEYV
CCEEECHHHCCHHHHHHHHHCCEEEEEECCCCCCCCCCHHCCCHHEEEEEECCCCCHHHH
KYAQTQRIKLMY
HHHCCCEEEEEC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796