Definition | Escherichia coli HS, complete genome. |
---|---|
Accession | NC_009800 |
Length | 4,643,538 |
The map label for this gene is deoR
Identifier: 157160320
GI number: 157160320
Start: 910812
End: 911570
Strand: Reverse
Name: deoR
Synonym: EcHS_A0901
Alternate gene names: 157160320
Gene position: 911570-910812 (Counterclockwise)
Preceding gene: 157160321
Following gene: 157160318
Centisome position: 19.63
GC content: 50.72
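The coordinate fields above are related by simple arithmetic, sketched below. The helper names `gene_length` and `centisome` are illustrative and not part of the record. The centisome convention is an assumption: dividing the 5' end of this reverse-strand gene (911570) by the genome length happens to reproduce the listed value, but the record does not state how it was derived.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given inclusive 1-based coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the genome length (assumed convention)."""
    return round(100.0 * position / genome_length, 2)


GENOME_LENGTH = 4_643_538      # "Length" field of NC_009800
START, END = 910_812, 911_570  # "Start" and "End" fields

length = gene_length(START, END)
print(length)                         # 759 bases
print(length // 3 - 1)                # 252 residues once the stop codon is excluded
print(centisome(END, GENOME_LENGTH))  # 19.63
```

Using the Start coordinate (910812) instead gives 19.61, which is why the 5' end is assumed here.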
Gene sequence:
>759_bases ATGGAAACACGTCGCGAAGAGCGTATCGGGCAGCTGCTGCAAGAATTAAAACGCAGCGATAAGTTACATCTTAAAGACGC CGCCGCCCTGCTTGGGGTTTCGGAGATGACGATTCGTCGCGATCTGAACAACCACAGTGCGCCCGTCGTTTTGCTCGGCG GCTATATTGTTCTGGAACCGCGCAGTGCCAGCCATTACCTGTTAAGCGATCAAAAATCCCGCCTGGTGGAAGAAAAACGC CGGGCGGCAAAACTGGCTGCGACGCTGGTAGAACCCGATCAGACCCTCTTTTTTGACTGTGGCACCACCACGCCGTGGAT TATTGAAGCGATTGATAATGAAATCCCTTTTACCGCCGTTTGTTATTCGCTAAATACCTTTCTGGCGCTGAAAGAGAAAC CCCATTGCCGCGCGTTTCTTTGCGGTGGTGAATTTCACGCCAGCAACGCCATTTTCAAACCCATCGATTTTCAGCAAACG CTGAATAATTTTTGCCCGGATATCGCTTTTTATTCTGCGGCGGGCGTGCATGTCAGTAAAGGCGCTACCTGTTTTAATCT TGAAGAGTTGCCGGTAAAACACTGGGCCATGTCGATGGCGCAAAAGCATGTGCTGGTTGTCGACCACAGTAAATTTGGCA AGGTGCGTCCGGCGCGCATGGGTGACCTGAAACGCTTTGATATTGTGGTGAGCGATTGTTGCCCGGAAGATGAGTATGTG AAGTACGCGCAGACGCAGCGCATTAAGTTGATGTATTAA
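The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch; `gc_content` is a hypothetical helper, and it assumes the value is simply the percentage of G and C bases over the full 759-base sequence, with the spaces in the listing treated as layout only.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# Toy input; paste the 759-base gene sequence here to check the record.
print(gc_content("ATGC"))  # 50.0
```

Under that assumption, a listed value of 50.72 corresponds to 385 G or C bases out of 759.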
Upstream 100 bases:
>100_bases CTGTTTTGCATTACCGATCCGCAAAGGCTGGGTGCGTGACTGATCTCTGCTAAAAAGTGTAGTATTGAGCGGCTCGCTTC AATAACTATTCAGAGGGATT
Downstream 100 bases:
>100_bases TGACGTATAACCGGATGACGTTTCGCGCCATCCGGTTATCAGAAGATTAAGAGAACCAGCTGCCGAACCACTGATGGAAT TTCATCATCACGAAATCCCA
Product: DNA-binding transcriptional repressor DeoR
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 252; Mature: 252
Protein sequence:
>252_residues METRREERIGQLLQELKRSDKLHLKDAAALLGVSEMTIRRDLNNHSAPVVLLGGYIVLEPRSASHYLLSDQKSRLVEEKR RAAKLAATLVEPDQTLFFDCGTTTPWIIEAIDNEIPFTAVCYSLNTFLALKEKPHCRAFLCGGEFHASNAIFKPIDFQQT LNNFCPDIAFYSAAGVHVSKGATCFNLEELPVKHWAMSMAQKHVLVVDHSKFGKVRPARMGDLKRFDIVVSDCCPEDEYV KYAQTQRIKLMY
Sequences:
>Translated_252_residues METRREERIGQLLQELKRSDKLHLKDAAALLGVSEMTIRRDLNNHSAPVVLLGGYIVLEPRSASHYLLSDQKSRLVEEKR RAAKLAATLVEPDQTLFFDCGTTTPWIIEAIDNEIPFTAVCYSLNTFLALKEKPHCRAFLCGGEFHASNAIFKPIDFQQT LNNFCPDIAFYSAAGVHVSKGATCFNLEELPVKHWAMSMAQKHVLVVDHSKFGKVRPARMGDLKRFDIVVSDCCPEDEYV KYAQTQRIKLMY
>Mature_252_residues METRREERIGQLLQELKRSDKLHLKDAAALLGVSEMTIRRDLNNHSAPVVLLGGYIVLEPRSASHYLLSDQKSRLVEEKR RAAKLAATLVEPDQTLFFDCGTTTPWIIEAIDNEIPFTAVCYSLNTFLALKEKPHCRAFLCGGEFHASNAIFKPIDFQQT LNNFCPDIAFYSAAGVHVSKGATCFNLEELPVKHWAMSMAQKHVLVVDHSKFGKVRPARMGDLKRFDIVVSDCCPEDEYV KYAQTQRIKLMY
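The protein sequence follows from the gene sequence by codon translation. The sketch below assumes the standard genetic code (NCBI translation table 1) and that the listed gene sequence is already the coding strand, which is consistent with it starting with ATG and ending with TAA. `translate` is a hypothetical helper written for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence, followed by its stop codon.
print(translate("ATGGAAACACGTCGCGAAGAG" + "TAA"))  # METRREE
# Last five codons of the gene sequence plus the stop codon.
print(translate("ATTAAGTTGATGTATTAA"))             # IKLMY
```

Both fragments agree with the first and last residues of the 252-residue protein listed above.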
Specific function: This protein is one of the repressors that regulate the expression of the deoCABD genes, which encode nucleotide- and deoxyribonucleotide-catabolizing enzymes. It also negatively regulates the expression of nupG (a transport protein) and tsx (a pore-forming
COG id: COG1349
COG function: function code KG; Transcriptional regulators of sugar metabolism
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 HTH deoR-type DNA-binding domain
Homologues:
Organism=Escherichia coli, GI1787063, Length=252, Percent_Identity=100, Blast_Score=528, Evalue=1e-151
Organism=Escherichia coli, GI1789519, Length=240, Percent_Identity=27.5, Blast_Score=94, Evalue=7e-21
Organism=Escherichia coli, GI1789829, Length=253, Percent_Identity=26.8774703557312, Blast_Score=86, Evalue=2e-18
Organism=Escherichia coli, GI1788069, Length=250, Percent_Identity=21.2, Blast_Score=72, Evalue=5e-14
Organism=Escherichia coli, GI1790753, Length=243, Percent_Identity=23.8683127572016, Blast_Score=72, Evalue=5e-14
Organism=Escherichia coli, GI1787540, Length=233, Percent_Identity=26.6094420600858, Blast_Score=71, Evalue=6e-14
Organism=Escherichia coli, GI226510968, Length=210, Percent_Identity=30.4761904761905, Blast_Score=69, Evalue=3e-13
Organism=Escherichia coli, GI1789059, Length=231, Percent_Identity=24.2424242424242, Blast_Score=65, Evalue=3e-12
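Each homologue entry is a comma-separated run of `key=value` fields plus a bare GI identifier, so it can be read into a dictionary with a few lines of code. This is a minimal sketch: `parse_hit` is a hypothetical helper, and it assumes one hit per string and no commas inside field values.

```python
def parse_hit(record: str) -> dict:
    """Split one homologue entry into its fields; the bare GI token becomes 'GI'."""
    fields = {}
    for part in record.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate a trailing comma
        if "=" in part:
            key, value = part.split("=", 1)
            fields[key.strip()] = value.strip()
        elif part.startswith("GI"):
            fields["GI"] = part[2:]
    return fields


hit = parse_hit(
    "Organism=Escherichia coli, GI1789519, Length=240, "
    "Percent_Identity=27.5, Blast_Score=94, Evalue=7e-21,"
)
print(hit["GI"], float(hit["Percent_Identity"]), float(hit["Evalue"]))
```

Values are kept as strings and converted on use, which avoids guessing a type for every field.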
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): DEOR_ECO57 (P0ACK7)
Other databases:
- EMBL: AE005174
- EMBL: BA000007
- PIR: D85594
- PIR: H90743
- RefSeq: NP_286606.1
- RefSeq: NP_308947.1
- ProteinModelPortal: P0ACK7
- SMR: P0ACK7
- EnsemblBacteria: EBESCT00000028062
- EnsemblBacteria: EBESCT00000058401
- GeneID: 917661
- GeneID: 958158
- GenomeReviews: AE005174_GR
- GenomeReviews: BA000007_GR
- KEGG: ece:Z1067
- KEGG: ecs:ECs0920
- GeneTree: EBGT00050000009213
- HOGENOM: HBG300846
- OMA: GICYSLN
- ProtClustDB: PRK10681
- BioCyc: ECOL83334:ECS0920-MONOMER
- GO: GO:0005622
- InterPro: IPR014036
- InterPro: IPR001034
- InterPro: IPR018356
- InterPro: IPR011991
- Gene3D: G3DSA:1.10.10.10
- SMART: SM00420
Pfam domain/function: PF00455 DeoR; PF08220 HTH_DeoR
EC number: NA
Molecular weight: Translated: 28548; Mature: 28548
Theoretical pI: Translated: 7.78; Mature: 7.78
Prosite motif: PS00894 HTH_DEOR_1; PS51000 HTH_DEOR_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
3.2 %Cys (Translated Protein)
2.4 %Met (Translated Protein)
5.6 %Cys+Met (Translated Protein)
3.2 %Cys (Mature Protein)
2.4 %Met (Mature Protein)
5.6 %Cys+Met (Mature Protein)
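These percentages are residue counts divided by the protein length. The sketch below shows the calculation; `residue_percent` is a hypothetical helper, and the counts of 8 Cys and 6 Met come from a manual count of the 252-residue sequence listed above, so they are worth rechecking against the full sequence.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of any of the given one-letter residues."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in residues.upper())
    return round(100.0 * count / len(protein), 1)


# Toy input; pass the full protein sequence to check the record.
print(residue_percent("MCCA", "C"))   # 50.0
print(residue_percent("MCCA", "CM"))  # 75.0

# Arithmetic behind the listed values, using the manually counted residues.
print(round(100.0 * 8 / 252, 1))   # 3.2 (%Cys)
print(round(100.0 * 6 / 252, 1))   # 2.4 (%Met)
print(round(100.0 * 14 / 252, 1))  # 5.6 (%Cys+Met)
```

The translated and mature values coincide because both forms have the same 252 residues.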
Secondary structure:
>Translated Secondary Structure
METRREERIGQLLQELKRSDKLHLKDAAALLGVSEMTIRRDLNNHSAPVVLLGGYIVLEP
CCCHHHHHHHHHHHHHHHCCCEEHHHHHHHHHHHHHHHHHHCCCCCCCEEEECCEEEECC
RSASHYLLSDQKSRLVEEKRRAAKLAATLVEPDQTLFFDCGTTTPWIIEAIDNEIPFTAV
CCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCHHHEECCCCCCHHHH
CYSLNTFLALKEKPHCRAFLCGGEFHASNAIFKPIDFQQTLNNFCPDIAFYSAAGVHVSK
HHHHHHHEEECCCCCCEEEEECCEEECCCCEECCCCHHHHHHHHCCCHHHHHCCCEEEEC
GATCFNLEELPVKHWAMSMAQKHVLVVDHSKFGKVRPARMGDLKRFDIVVSDCCPEDEYV
CCEEECHHHCCHHHHHHHHHCCEEEEEECCCCCCCCCCHHCCCHHEEEEEECCCCCHHHH
KYAQTQRIKLMY
HHHCCCEEEEEC
>Mature Secondary Structure
METRREERIGQLLQELKRSDKLHLKDAAALLGVSEMTIRRDLNNHSAPVVLLGGYIVLEP
CCCHHHHHHHHHHHHHHHCCCEEHHHHHHHHHHHHHHHHHHCCCCCCCEEEECCEEEECC
RSASHYLLSDQKSRLVEEKRRAAKLAATLVEPDQTLFFDCGTTTPWIIEAIDNEIPFTAV
CCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCHHHEECCCCCCHHHH
CYSLNTFLALKEKPHCRAFLCGGEFHASNAIFKPIDFQQTLNNFCPDIAFYSAAGVHVSK
HHHHHHHEEECCCCCCEEEEECCEEECCCCEECCCCHHHHHHHHCCCHHHHHCCCEEEEC
GATCFNLEELPVKHWAMSMAQKHVLVVDHSKFGKVRPARMGDLKRFDIVVSDCCPEDEYV
CCEEECHHHCCHHHHHHHHHCCEEEEEECCCCCCCCCCHHCCCHHEEEEEECCCCCHHHH
KYAQTQRIKLMY
HHHCCCEEEEEC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796