Definition | Salmonella enterica subsp. enterica serovar Typhi str. Ty2 chromosome, complete genome. |
---|---|
Accession | NC_004631 |
Length | 4,791,961 |
Click here to switch to the map view.
The map label for this gene is deoR [H]
Identifier: 29142453
GI number: 29142453
Start: 2089162
End: 2089920
Strand: Direct
Name: deoR [H]
Synonym: t2032
Alternate gene names: 29142453
Gene position: 2089162-2089920 (Clockwise)
Preceding gene: 29142452
Following gene: 29142455
Centisome position: 43.6
GC content: 51.78
Gene sequence:
>759_bases ATGGAAACGCGACGCGACGAGCGTATTGGTCAATTGCTGCAGGCCTTAAAACGCAGCGATAAACTTCATCTTAAAGAAGC CGCGACCCTGTTGGGCGTCTCTGAAATGACCATTCGTCGCGACCTGAACCATAAAAGCGCGCCTGTCGTGTTGCTGGGCG GCTATATCGTACTGGAGCCCCGCAGCGCCAGTCATTATCTGTTAAGCGATCAAAAATCTCGTCTGGTGGAAGAAAAACGC CGTGCCGCTCAGTTGGCTGCGGGCTTGGTACAAGCGCATCAGACGGTATTTATTGACTGCGGCACCACCACGCCGTGGAT CATTGAAGCTATTGATAATGATCTTCCCTTTACGGCGGTGTGCTATTCACTTAATACCTTTCTGGCGCTGCAGGACAAAC CCCATTGCCGCGCCATCCTCAGCGGCGGGGAGTTTCACGCCAGCAACGCCATTTTCAAACCGCTTGATTTCCATGAAACG TTAAACAATATTTGTCCGGATATCGCGTTTTATTCCGCCGCCGGCGTTCATACCAGTAAAGGCGCTACCTGCTTTAATCT GGAAGAGCTGCCGGTAAAACATTGGGCCATGACAATGGCCCAGAGCCATGTACTGGTGGTGGATCACAGTAAATTCGGCA AGGTACGTCCGGCGCGGATGGGGGAATTATCACGCTTTGATACGATTATCAGCGACCGCCGTCCCGATGAGGCCTTTGTG GCCTACGCTAAAGCGCAACAAATTACTCTGATGTATTAA
Upstream 100 bases:
>100_bases CCATTCGAAAAGGCTGGGTACGTGACTAAATAGCGAAAAAAAGAGTAATATTAAGCGCTCTGTATGGTGCGGAGCGCTTA TTTAACTCTCCAGGGGAACT
Downstream 100 bases:
>100_bases CCGCAGGTTTGCCGGATGCGACGCGCACCCGGCAACGGGGTTATTTTACGAGAACCAACTGCCAAACCACTGATGGAGTT TCATCAGCACAAAATCCCAC
Product: DNA-binding transcriptional repressor DeoR
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 252; Mature: 252
Protein sequence:
>252_residues METRRDERIGQLLQALKRSDKLHLKEAATLLGVSEMTIRRDLNHKSAPVVLLGGYIVLEPRSASHYLLSDQKSRLVEEKR RAAQLAAGLVQAHQTVFIDCGTTTPWIIEAIDNDLPFTAVCYSLNTFLALQDKPHCRAILSGGEFHASNAIFKPLDFHET LNNICPDIAFYSAAGVHTSKGATCFNLEELPVKHWAMTMAQSHVLVVDHSKFGKVRPARMGELSRFDTIISDRRPDEAFV AYAKAQQITLMY
Sequences:
>Translated_252_residues METRRDERIGQLLQALKRSDKLHLKEAATLLGVSEMTIRRDLNHKSAPVVLLGGYIVLEPRSASHYLLSDQKSRLVEEKR RAAQLAAGLVQAHQTVFIDCGTTTPWIIEAIDNDLPFTAVCYSLNTFLALQDKPHCRAILSGGEFHASNAIFKPLDFHET LNNICPDIAFYSAAGVHTSKGATCFNLEELPVKHWAMTMAQSHVLVVDHSKFGKVRPARMGELSRFDTIISDRRPDEAFV AYAKAQQITLMY >Mature_252_residues METRRDERIGQLLQALKRSDKLHLKEAATLLGVSEMTIRRDLNHKSAPVVLLGGYIVLEPRSASHYLLSDQKSRLVEEKR RAAQLAAGLVQAHQTVFIDCGTTTPWIIEAIDNDLPFTAVCYSLNTFLALQDKPHCRAILSGGEFHASNAIFKPLDFHET LNNICPDIAFYSAAGVHTSKGATCFNLEELPVKHWAMTMAQSHVLVVDHSKFGKVRPARMGELSRFDTIISDRRPDEAFV AYAKAQQITLMY
Specific function: This protein is one of the repressors that regulate the expression of deoCABD genes, which encode nucleotide and deoxy ribonucleotide catabolizing enzymes. It also negatively regulates the expression of nupG (a transport protein) and tsx (a pore- forming
COG id: COG1349
COG function: function code KG; Transcriptional regulators of sugar metabolism
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 HTH deoR-type DNA-binding domain [H]
Homologues:
Organism=Escherichia coli, GI1787063, Length=252, Percent_Identity=83.3333333333333, Blast_Score=450, Evalue=1e-128, Organism=Escherichia coli, GI1789829, Length=238, Percent_Identity=28.5714285714286, Blast_Score=87, Evalue=1e-18, Organism=Escherichia coli, GI1789519, Length=230, Percent_Identity=25.6521739130435, Blast_Score=75, Evalue=4e-15, Organism=Escherichia coli, GI1790753, Length=243, Percent_Identity=26.7489711934156, Blast_Score=71, Evalue=6e-14, Organism=Escherichia coli, GI1787540, Length=240, Percent_Identity=26.25, Blast_Score=71, Evalue=6e-14, Organism=Escherichia coli, GI226510968, Length=249, Percent_Identity=29.3172690763052, Blast_Score=70, Evalue=1e-13, Organism=Escherichia coli, GI1788069, Length=249, Percent_Identity=22.8915662650602, Blast_Score=68, Evalue=5e-13, Organism=Escherichia coli, GI1789059, Length=241, Percent_Identity=24.4813278008299, Blast_Score=64, Evalue=7e-12, Organism=Escherichia coli, GI87082344, Length=247, Percent_Identity=22.6720647773279, Blast_Score=62, Evalue=3e-11,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR014036 - InterPro: IPR001034 - InterPro: IPR018356 - InterPro: IPR011991 [H]
Pfam domain/function: PF00455 DeoR; PF08220 HTH_DeoR [H]
EC number: NA
Molecular weight: Translated: 28189; Mature: 28189
Theoretical pI: Translated: 7.96; Mature: 7.96
Prosite motif: PS00894 HTH_DEOR_1 ; PS51000 HTH_DEOR_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
2.0 %Cys (Translated Protein) 2.4 %Met (Translated Protein) 4.4 %Cys+Met (Translated Protein) 2.0 %Cys (Mature Protein) 2.4 %Met (Mature Protein) 4.4 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure METRRDERIGQLLQALKRSDKLHLKEAATLLGVSEMTIRRDLNHKSAPVVLLGGYIVLEP CCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEECCEEEECC RSASHYLLSDQKSRLVEEKRRAAQLAAGLVQAHQTVFIDCGTTTPWIIEAIDNDLPFTAV CCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHCCEEEEECCCCCCHHHEECCCCCCHHHH CYSLNTFLALQDKPHCRAILSGGEFHASNAIFKPLDFHETLNNICPDIAFYSAAGVHTSK HHHHHHEEEECCCCCCCEEECCCCEECCCCCCCCCCHHHHHHHHCCHHHHHHCCCCCCCC GATCFNLEELPVKHWAMTMAQSHVLVVDHSKFGKVRPARMGELSRFDTIISDRRPDEAFV CCEEECHHHCCHHHHHHHHHCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCCCHHHH AYAKAQQITLMY EEHHHCEEEEEC >Mature Secondary Structure METRRDERIGQLLQALKRSDKLHLKEAATLLGVSEMTIRRDLNHKSAPVVLLGGYIVLEP CCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEECCEEEECC RSASHYLLSDQKSRLVEEKRRAAQLAAGLVQAHQTVFIDCGTTTPWIIEAIDNDLPFTAV CCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHCCEEEEECCCCCCHHHEECCCCCCHHHH CYSLNTFLALQDKPHCRAILSGGEFHASNAIFKPLDFHETLNNICPDIAFYSAAGVHTSK HHHHHHEEEECCCCCCCEEECCCCEECCCCCCCCCCHHHHHHHHCCHHHHHHCCCCCCCC GATCFNLEELPVKHWAMTMAQSHVLVVDHSKFGKVRPARMGELSRFDTIISDRRPDEAFV CCEEECHHHCCHHHHHHHHHCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCCCHHHH AYAKAQQITLMY EEHHHCEEEEEC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]