Definition Escherichia coli 55989, complete genome.
Accession NC_011748
Length 5,154,862

The map label for this gene is deoQ

Identifier: 218698123

GI number: 218698123

Start: 5028910

End: 5029695

Strand: Direct

Name: deoQ

Synonym: EC55989_4935

Alternate gene names: 218698123

Gene position: 5028910-5029695 (Clockwise)

Preceding gene: 218698118

Following gene: 218698127

Centisome position: 97.56
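
The coordinate fields above are related by simple arithmetic, sketched below. The function names are illustrative, and the centisome formula (gene start as a percentage of genome length) is an assumption that happens to reproduce the values listed in this record; the database does not state how it derives the figure.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the genome length, to two decimals.

    Assumption: the position is measured from the gene's start coordinate.
    """
    return round(100.0 * start / genome_length, 2)
```

With Start 5028910, End 5029695 and genome Length 5,154,862, gene_length gives 786 (the size of the gene sequence block) and centisome gives 97.56.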

GC content: 45.93

Gene sequence:

>786_bases
TTGATGGAAACGAAGCAAAAAGAGCGTATCCGACGTTTGATGGAACTGCTTAAGAAAACCGACAGAATCCATTTGAAAGA
CGCAGCGCGAATGCTGGAAGTTTCTGTAATGACTATTCGTCGCGATCTCCATCAGGAAGATGAACCTCTGCCACTGACCC
TACTGGGTGGCTATATTGTAATGGTGAATAAACCCGCGCCATCCATGCCAGTAATCCATGACGTTCCAAAAAATCATCGT
GATGACTTACCTATTGCAATTCTGGCTGCCGGAATGGTTAATGAAAATGATCTGATCTTCTTTGATAATGGCCAGGAGAT
ACCACTCGTTATAAGCATGATCCCGGATGCAATCACCTTCACCGGCATCTGTTACTCACATCGCGTCTTTGTTGCGTTGA
ATGAAAAGCCTAATGTAACAGCAATACTTTGTGGTGGTACGTATCGTGCCAGAAGTGATGCTTTTTACGATGCCAGTAAC
TCTTCGCCATTAGACTCTCTCAATCCGCGAAAAATATTTATTTCCGCCAGCGGTGTGCATAATCACTTTGGCGTCAGCTG
GTTTAACCCTGAAGATCTTGCCACTAAGCGTAAAGCGATGAACCGTGGACTACGGAAAATTTTGCTCGCCCGCCACGCGT
TGTTCGATGAAGTGGCCTCTGCCAGCCTCGCACCGATCTCTGCATTTGACGTTCTGATTAGCGATCGTCCGTTACCGGCA
GATTATGTTACGCACTGCCAGAATGGTTCTGTAAAGATCATTACACCTGATTCAGAAGACGAATGA
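
A GC content figure such as the one listed for this gene can be computed from a sequence block like the one above. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full sequence length; the function name is illustrative.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence, to two decimals.

    Whitespace and line breaks (as in the wrapped block above) are ignored.
    """
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return round(100.0 * gc / len(seq), 2)
```

If the record's value is computed this way, 45.93 over 786 bases would correspond to 361 G or C bases; that count is inferred from the percentage and is not verified here.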

Upstream 100 bases:

>100_bases
ACCCTTTTCATTTCAAAGGGGCGGTCGTATAGTATGGTAATGAAAACAATGTTTACTAACGCCAAAATGTTATTTTTATA
ACATTCTTACGGAGAGAGAG

Downstream 100 bases:

>100_bases
CTTACTGAAAAAACACCACAATCTTGTTAAACATCGTCGGGTTGGACTGATTACGTTGCACTTTCACCACATATTCCAGC
TTATCTATTTGGCTTATCAC

Product: putative transcriptional regulator

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 261; Mature: 261

Protein sequence:

>261_residues
MMETKQKERIRRLMELLKKTDRIHLKDAARMLEVSVMTIRRDLHQEDEPLPLTLLGGYIVMVNKPAPSMPVIHDVPKNHR
DDLPIAILAAGMVNENDLIFFDNGQEIPLVISMIPDAITFTGICYSHRVFVALNEKPNVTAILCGGTYRARSDAFYDASN
SSPLDSLNPRKIFISASGVHNHFGVSWFNPEDLATKRKAMNRGLRKILLARHALFDEVASASLAPISAFDVLISDRPLPA
DYVTHCQNGSVKIITPDSEDE
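
The protein sequence above follows from the gene sequence by codon translation. Note that the gene begins with TTG, which is read as methionine when it serves as the initiation codon, so the protein starts MMETK. The sketch below uses the standard genetic code; treating ATG, GTG and TTG as possible start codons is an assumption based on common bacterial usage, and the names are illustrative.

```python
BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order for each position.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: common bacterial initiation codons, all read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is translated as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)
```

For example, translate_cds("TTGATGGAAACGAAGTGA") returns "MMETK", matching the first five residues listed. The full 786-base gene is 262 codons, that is, 261 residues plus the terminal TGA stop codon.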

Sequences:

>Translated_261_residues
MMETKQKERIRRLMELLKKTDRIHLKDAARMLEVSVMTIRRDLHQEDEPLPLTLLGGYIVMVNKPAPSMPVIHDVPKNHR
DDLPIAILAAGMVNENDLIFFDNGQEIPLVISMIPDAITFTGICYSHRVFVALNEKPNVTAILCGGTYRARSDAFYDASN
SSPLDSLNPRKIFISASGVHNHFGVSWFNPEDLATKRKAMNRGLRKILLARHALFDEVASASLAPISAFDVLISDRPLPA
DYVTHCQNGSVKIITPDSEDE
>Mature_261_residues
MMETKQKERIRRLMELLKKTDRIHLKDAARMLEVSVMTIRRDLHQEDEPLPLTLLGGYIVMVNKPAPSMPVIHDVPKNHR
DDLPIAILAAGMVNENDLIFFDNGQEIPLVISMIPDAITFTGICYSHRVFVALNEKPNVTAILCGGTYRARSDAFYDASN
SSPLDSLNPRKIFISASGVHNHFGVSWFNPEDLATKRKAMNRGLRKILLARHALFDEVASASLAPISAFDVLISDRPLPA
DYVTHCQNGSVKIITPDSEDE

Specific function: This protein is one of the repressors that regulate the expression of the deoCABD genes, which encode nucleotide- and deoxyribonucleotide-catabolizing enzymes. It also negatively regulates the expression of nupG (a transport protein) and tsx (a pore-forming

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HTH deoR-type DNA-binding domain [H]

Homologues:

Organism=Escherichia coli, GI1787063, Length=253, Percent_Identity=37.5494071146245, Blast_Score=173, Evalue=1e-44,
Organism=Escherichia coli, GI1789059, Length=261, Percent_Identity=26.8199233716475, Blast_Score=73, Evalue=2e-14,
Organism=Escherichia coli, GI1790753, Length=262, Percent_Identity=22.1374045801527, Blast_Score=71, Evalue=8e-14,
Organism=Escherichia coli, GI1789519, Length=257, Percent_Identity=22.568093385214, Blast_Score=61, Evalue=6e-11,
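
The homologue lines above use a comma-separated key=value layout with a bare GI field. A minimal parsing sketch follows, assuming every line has this layout; the function name and the "GI" dictionary key are illustrative and not part of the record format.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dictionary of typed fields."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key.strip()] = value.strip()
        elif field.startswith("GI"):
            # Bare identifier such as "GI1787063": keep only the number.
            record["GI"] = field[2:]
    # Convert numeric fields where present.
    for key in ("Length", "Blast_Score"):
        if key in record:
            record[key] = int(record[key])
    for key in ("Percent_Identity", "Evalue"):
        if key in record:
            record[key] = float(record[key])
    return record
```

Applied to the first line in the list, this yields Organism "Escherichia coli", GI "1787063", Length 253 and Blast_Score 173, with Percent_Identity and Evalue as floats.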

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR014036
- InterPro:   IPR001034
- InterPro:   IPR018356
- InterPro:   IPR011991 [H]

Pfam domain/function: PF00455 DeoR; PF08220 HTH_DeoR [H]

EC number: NA

Molecular weight: Translated: 29171; Mature: 29171

Theoretical pI: Translated: 6.80; Mature: 6.80

Prosite motif: PS00894 HTH_DEOR_1; PS51000 HTH_DEOR_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.1 %Cys     (Translated Protein)
3.8 %Met     (Translated Protein)
5.0 %Cys+Met (Translated Protein)
1.1 %Cys     (Mature Protein)
3.8 %Met     (Mature Protein)
5.0 %Cys+Met (Mature Protein)
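
The percentages above are residue counts divided by the protein length. The sketch below assumes rounding to one decimal place; the function name is illustrative.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the given residue letters in a protein, to one decimal."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(1 for aa in protein if aa in residues)
    return round(100.0 * count / len(protein), 1)
```

The 261-residue sequence in this record appears to contain 3 Cys and 10 Met, which gives 1.1, 3.8 and 5.0 percent, consistent with the listed values. Usage would be residue_percent(seq, "C"), residue_percent(seq, "M") and residue_percent(seq, "CM").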

Secondary structure:

>Translated Secondary Structure
MMETKQKERIRRLMELLKKTDRIHLKDAARMLEVSVMTIRRDLHQEDEPLPLTLLGGYIV
CCCCHHHHHHHHHHHHHHHCCCEEHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEECCEEE
MVNKPAPSMPVIHDVPKNHRDDLPIAILAAGMVNENDLIFFDNGQEIPLVISMIPDAITF
EECCCCCCCCEEECCCCCCCCCCCEEEEEECCCCCCCEEEEECCCCCCEEEEECCCHHHH
TGICYSHRVFVALNEKPNVTAILCGGTYRARSDAFYDASNSSPLDSLNPRKIFISASGVH
HEEEEEEEEEEEECCCCCEEEEEECCCCCCCCCCEEECCCCCCCCCCCCCEEEEEECCCC
NHFGVSWFNPEDLATKRKAMNRGLRKILLARHALFDEVASASLAPISAFDVLISDRPLPA
CCCCCEEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHEECCCCCCH
DYVTHCQNGSVKIITPDSEDE
HHHHHCCCCCEEEECCCCCCC
>Mature Secondary Structure
MMETKQKERIRRLMELLKKTDRIHLKDAARMLEVSVMTIRRDLHQEDEPLPLTLLGGYIV
CCCCHHHHHHHHHHHHHHHCCCEEHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEECCEEE
MVNKPAPSMPVIHDVPKNHRDDLPIAILAAGMVNENDLIFFDNGQEIPLVISMIPDAITF
EECCCCCCCCEEECCCCCCCCCCCEEEEEECCCCCCCEEEEECCCCCCEEEEECCCHHHH
TGICYSHRVFVALNEKPNVTAILCGGTYRARSDAFYDASNSSPLDSLNPRKIFISASGVH
HEEEEEEEEEEEECCCCCEEEEEECCCCCCCCCCEEECCCCCCCCCCCCCEEEEEECCCC
NHFGVSWFNPEDLATKRKAMNRGLRKILLARHALFDEVASASLAPISAFDVLISDRPLPA
CCCCCEEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHEECCCCCCH
DYVTHCQNGSVKIITPDSEDE
HHHHHCCCCCEEEECCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796 [H]