Definition Salmonella enterica subsp. enterica serovar Typhi str. Ty2 chromosome, complete genome.
Accession NC_004631
Length 4,791,961

The map label for this gene is deoR [H]

Identifier: 29142453

GI number: 29142453

Start: 2089162

End: 2089920

Strand: Direct

Name: deoR [H]

Synonym: t2032

Alternate gene names: 29142453

Gene position: 2089162-2089920 (Clockwise)

Preceding gene: 29142452

Following gene: 29142455

Centisome position: 43.6
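
The coordinates above can be cross-checked with a little arithmetic. The sketch below assumes that Start and End are 1-based inclusive positions and that the centisome position is the start coordinate expressed as a percentage of the chromosome length; the record does not state either definition, but both assumptions reproduce the reported values. The function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the chromosome length (assumed definition)."""
    return 100.0 * start / genome_length


# Values taken from this record.
start, end, genome_length = 2089162, 2089920, 4791961

print(gene_length(start, end))                    # 759
print(gene_length(start, end) // 3 - 1)           # 252 codons before the stop codon
print(round(centisome(start, genome_length), 1))  # 43.6
```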

GC content: 51.78

Gene sequence:

>759_bases
ATGGAAACGCGACGCGACGAGCGTATTGGTCAATTGCTGCAGGCCTTAAAACGCAGCGATAAACTTCATCTTAAAGAAGC
CGCGACCCTGTTGGGCGTCTCTGAAATGACCATTCGTCGCGACCTGAACCATAAAAGCGCGCCTGTCGTGTTGCTGGGCG
GCTATATCGTACTGGAGCCCCGCAGCGCCAGTCATTATCTGTTAAGCGATCAAAAATCTCGTCTGGTGGAAGAAAAACGC
CGTGCCGCTCAGTTGGCTGCGGGCTTGGTACAAGCGCATCAGACGGTATTTATTGACTGCGGCACCACCACGCCGTGGAT
CATTGAAGCTATTGATAATGATCTTCCCTTTACGGCGGTGTGCTATTCACTTAATACCTTTCTGGCGCTGCAGGACAAAC
CCCATTGCCGCGCCATCCTCAGCGGCGGGGAGTTTCACGCCAGCAACGCCATTTTCAAACCGCTTGATTTCCATGAAACG
TTAAACAATATTTGTCCGGATATCGCGTTTTATTCCGCCGCCGGCGTTCATACCAGTAAAGGCGCTACCTGCTTTAATCT
GGAAGAGCTGCCGGTAAAACATTGGGCCATGACAATGGCCCAGAGCCATGTACTGGTGGTGGATCACAGTAAATTCGGCA
AGGTACGTCCGGCGCGGATGGGGGAATTATCACGCTTTGATACGATTATCAGCGACCGCCGTCCCGATGAGGCCTTTGTG
GCCTACGCTAAAGCGCAACAAATTACTCTGATGTATTAA
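
The GC content field can be recomputed from the gene sequence. This is a minimal sketch, assuming GC content means the percentage of G and C bases over the full sequence length; `gc_percent` is a hypothetical helper, not part of the database. The short input in the example is only the first 24 bases of the gene, so its value differs from the whole-gene figure reported in the record.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence (whitespace is ignored)."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 24 bases of the gene sequence above; paste all 759 bases to
# compare against the GC content reported in the record.
print(gc_percent("ATGGAAACGCGACGCGACGAGCGT"))
```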

Upstream 100 bases:

>100_bases
CCATTCGAAAAGGCTGGGTACGTGACTAAATAGCGAAAAAAAGAGTAATATTAAGCGCTCTGTATGGTGCGGAGCGCTTA
TTTAACTCTCCAGGGGAACT

Downstream 100 bases:

>100_bases
CCGCAGGTTTGCCGGATGCGACGCGCACCCGGCAACGGGGTTATTTTACGAGAACCAACTGCCAAACCACTGATGGAGTT
TCATCAGCACAAAATCCCAC
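
Flanking regions like the ones above can be sliced out of a chromosome sequence once the gene coordinates are known. The sketch below assumes 1-based inclusive coordinates, a gene on the direct strand, and a linear string with no wrap-around at the origin of a circular chromosome; `flanks` and the toy genome are hypothetical.

```python
def flanks(genome: str, start: int, end: int, width: int = 100) -> tuple[str, str]:
    """Return (upstream, downstream) flanks of a direct-strand feature.

    start and end are 1-based inclusive. Flanks are truncated at the
    ends of the string rather than wrapped around.
    """
    upstream = genome[max(0, start - 1 - width):start - 1]
    downstream = genome[end:end + width]
    return upstream, downstream


# Toy example: the feature CCCGGG occupies positions 4..9.
print(flanks("AAACCCGGGTTT", 4, 9, width=3))  # ('AAA', 'TTT')
```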

Product: DNA-binding transcriptional repressor DeoR

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 252; Mature: 252

Protein sequence:

>252_residues
METRRDERIGQLLQALKRSDKLHLKEAATLLGVSEMTIRRDLNHKSAPVVLLGGYIVLEPRSASHYLLSDQKSRLVEEKR
RAAQLAAGLVQAHQTVFIDCGTTTPWIIEAIDNDLPFTAVCYSLNTFLALQDKPHCRAILSGGEFHASNAIFKPLDFHET
LNNICPDIAFYSAAGVHTSKGATCFNLEELPVKHWAMTMAQSHVLVVDHSKFGKVRPARMGELSRFDTIISDRRPDEAFV
AYAKAQQITLMY
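
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below uses the standard genetic code, built from the conventional TCAG ordering, and stops at the first stop codon; it is an illustration and not necessarily the procedure the database used. `translate` and `CODON_TABLE` are hypothetical names.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence in frame 1; unknown codons become 'X'."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGAAACGCGACGCGACGAGCGT"))  # METRRDER (first 8 codons of the gene)
print(translate("CTGATGTATTAA"))              # LMY (last 3 codons plus the stop codon)
```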

Sequences:

>Translated_252_residues
METRRDERIGQLLQALKRSDKLHLKEAATLLGVSEMTIRRDLNHKSAPVVLLGGYIVLEPRSASHYLLSDQKSRLVEEKR
RAAQLAAGLVQAHQTVFIDCGTTTPWIIEAIDNDLPFTAVCYSLNTFLALQDKPHCRAILSGGEFHASNAIFKPLDFHET
LNNICPDIAFYSAAGVHTSKGATCFNLEELPVKHWAMTMAQSHVLVVDHSKFGKVRPARMGELSRFDTIISDRRPDEAFV
AYAKAQQITLMY
>Mature_252_residues
METRRDERIGQLLQALKRSDKLHLKEAATLLGVSEMTIRRDLNHKSAPVVLLGGYIVLEPRSASHYLLSDQKSRLVEEKR
RAAQLAAGLVQAHQTVFIDCGTTTPWIIEAIDNDLPFTAVCYSLNTFLALQDKPHCRAILSGGEFHASNAIFKPLDFHET
LNNICPDIAFYSAAGVHTSKGATCFNLEELPVKHWAMTMAQSHVLVVDHSKFGKVRPARMGELSRFDTIISDRRPDEAFV
AYAKAQQITLMY

Specific function: This protein is one of the repressors that regulate the expression of the deoCABD genes, which encode nucleotide- and deoxyribonucleotide-catabolizing enzymes. It also negatively regulates the expression of nupG (a transport protein) and tsx (a pore-forming protein).

COG id: COG1349

COG function: function code KG; Transcriptional regulators of sugar metabolism

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HTH deoR-type DNA-binding domain [H]

Homologues:

Organism=Escherichia coli, GI1787063, Length=252, Percent_Identity=83.33, Blast_Score=450, Evalue=1e-128
Organism=Escherichia coli, GI1789829, Length=238, Percent_Identity=28.57, Blast_Score=87, Evalue=1e-18
Organism=Escherichia coli, GI1789519, Length=230, Percent_Identity=25.65, Blast_Score=75, Evalue=4e-15
Organism=Escherichia coli, GI1790753, Length=243, Percent_Identity=26.75, Blast_Score=71, Evalue=6e-14
Organism=Escherichia coli, GI1787540, Length=240, Percent_Identity=26.25, Blast_Score=71, Evalue=6e-14
Organism=Escherichia coli, GI226510968, Length=249, Percent_Identity=29.32, Blast_Score=70, Evalue=1e-13
Organism=Escherichia coli, GI1788069, Length=249, Percent_Identity=22.89, Blast_Score=68, Evalue=5e-13
Organism=Escherichia coli, GI1789059, Length=241, Percent_Identity=24.48, Blast_Score=64, Evalue=7e-12
Organism=Escherichia coli, GI87082344, Length=247, Percent_Identity=22.67, Blast_Score=62, Evalue=3e-11
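
Homologue lines in this layout are easy to parse for filtering or sorting. The sketch below assumes comma-separated fields, `key=value` pairs except for the bare `GI...` identifier, and no commas inside organism names; it tolerates an optional trailing comma. `parse_hit` is a hypothetical helper.

```python
def parse_hit(line: str) -> dict:
    """Parse one homologue line into a dict with typed numeric fields."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare identifier without a key
    for key in ("Length", "Blast_Score"):
        record[key] = int(record[key])
    for key in ("Percent_Identity", "Evalue"):
        record[key] = float(record[key])
    return record


hit = parse_hit(
    "Organism=Escherichia coli, GI1787063, Length=252, "
    "Percent_Identity=83.3333333333333, Blast_Score=450, Evalue=1e-128,"
)
print(hit["GI"], hit["Length"], round(hit["Percent_Identity"], 1))  # 1787063 252 83.3
```

Once parsed, the hits can be filtered with ordinary comparisons, for example keeping only those with `Percent_Identity` above a chosen cutoff.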

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR014036
- InterPro:   IPR001034
- InterPro:   IPR018356
- InterPro:   IPR011991 [H]

Pfam domain/function: PF00455 DeoR; PF08220 HTH_DeoR [H]

EC number: NA

Molecular weight: Translated: 28189; Mature: 28189

Theoretical pI: Translated: 7.96; Mature: 7.96

Prosite motif: PS00894 HTH_DEOR_1; PS51000 HTH_DEOR_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

2.0 %Cys     (Translated Protein)
2.4 %Met     (Translated Protein)
4.4 %Cys+Met (Translated Protein)
2.0 %Cys     (Mature Protein)
2.4 %Met     (Mature Protein)
4.4 %Cys+Met (Mature Protein)
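
These percentages follow directly from residue counts in the 252-residue protein sequence. The sketch below assumes each figure is the count of the named residue(s) divided by the sequence length, rounded to one decimal place; `residue_percent` is a hypothetical helper.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    protein = "".join(protein.split()).upper()
    count = sum(protein.count(r) for r in residues)
    return round(100.0 * count / len(protein), 1)


# Protein sequence copied from this record.
PROTEIN = (
    "METRRDERIGQLLQALKRSDKLHLKEAATLLGVSEMTIRRDLNHKSAPVVLLGGYIVLEPRSASHYLLSDQKSRLVEEKR"
    "RAAQLAAGLVQAHQTVFIDCGTTTPWIIEAIDNDLPFTAVCYSLNTFLALQDKPHCRAILSGGEFHASNAIFKPLDFHET"
    "LNNICPDIAFYSAAGVHTSKGATCFNLEELPVKHWAMTMAQSHVLVVDHSKFGKVRPARMGELSRFDTIISDRRPDEAFV"
    "AYAKAQQITLMY"
)

print(residue_percent(PROTEIN, "C"))   # record reports 2.0
print(residue_percent(PROTEIN, "M"))   # record reports 2.4
print(residue_percent(PROTEIN, "CM"))  # record reports 4.4
```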

Secondary structure:

>Translated Secondary Structure
METRRDERIGQLLQALKRSDKLHLKEAATLLGVSEMTIRRDLNHKSAPVVLLGGYIVLEP
CCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEECCEEEECC
RSASHYLLSDQKSRLVEEKRRAAQLAAGLVQAHQTVFIDCGTTTPWIIEAIDNDLPFTAV
CCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHCCEEEEECCCCCCHHHEECCCCCCHHHH
CYSLNTFLALQDKPHCRAILSGGEFHASNAIFKPLDFHETLNNICPDIAFYSAAGVHTSK
HHHHHHEEEECCCCCCCEEECCCCEECCCCCCCCCCHHHHHHHHCCHHHHHHCCCCCCCC
GATCFNLEELPVKHWAMTMAQSHVLVVDHSKFGKVRPARMGELSRFDTIISDRRPDEAFV
CCEEECHHHCCHHHHHHHHHCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCCCHHHH
AYAKAQQITLMY
EEHHHCEEEEEC
>Mature Secondary Structure
METRRDERIGQLLQALKRSDKLHLKEAATLLGVSEMTIRRDLNHKSAPVVLLGGYIVLEP
CCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEECCEEEECC
RSASHYLLSDQKSRLVEEKRRAAQLAAGLVQAHQTVFIDCGTTTPWIIEAIDNDLPFTAV
CCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHCCEEEEECCCCCCHHHEECCCCCCHHHH
CYSLNTFLALQDKPHCRAILSGGEFHASNAIFKPLDFHETLNNICPDIAFYSAAGVHTSK
HHHHHHEEEECCCCCCCEEECCCCEECCCCCCCCCCHHHHHHHHCCHHHHHHCCCCCCCC
GATCFNLEELPVKHWAMTMAQSHVLVVDHSKFGKVRPARMGELSRFDTIISDRRPDEAFV
CCEEECHHHCCHHHHHHHHHCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCCCHHHH
AYAKAQQITLMY
EEHHHCEEEEEC
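
The per-residue prediction strings can be summarised as a composition. The sketch below assumes the common convention that H marks helix, E marks strand, and C marks coil; the record does not define the letters, and `ss_composition` is a hypothetical helper.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of H, E and C states in a secondary-structure string."""
    states = "".join(states.split()).upper()
    if not states:
        raise ValueError("empty prediction string")
    counts = Counter(states)
    return {s: round(100.0 * counts.get(s, 0) / len(states), 1) for s in "HEC"}


# First 60 predicted states from this record; concatenate all state lines
# to summarise the whole protein.
print(ss_composition("CCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEECCEEEECC"))
```

A mix of helix and strand states in the full prediction would be consistent with the "Alpha Beta" structure class listed in the record.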

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796 [H]