Definition Escherichia coli ED1a chromosome, complete genome.
Accession NC_011745
Length 5,209,548

The map label for this gene is deoQ [H]

Identifier: 218688173

GI number: 218688173

Start: 342954

End: 343739

Strand: Direct

Name: deoQ [H]

Synonym: ECED1_0327

Alternate gene names: 218688173

Gene position: 342954-343739 (Clockwise)

Preceding gene: 218688169

Following gene: 218688174

Centisome position: 6.58
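The coordinate-derived fields above can be reproduced with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the centisome position is the gene start expressed as a percentage of the chromosome length; the function names are illustrative and not part of the source database.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Feature start as a percentage of the chromosome length (assumed definition)."""
    return 100.0 * start / genome_length


# Values taken from this record: Start, End, and chromosome Length.
print(gene_length(342954, 343739))            # 786
print(round(centisome(342954, 5209548), 2))   # 6.58
```

Both results agree with the 786-base gene sequence header and the reported centisome position.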

GC content: 46.18

Gene sequence:

>786_bases
TTGATGGAAACGAAGCAAAAAGAGCGTATCCGACGTTTGATTGAAATACTTAAGAAAACCGACAGAATCCATTTGAAAGA
CGCGGCACGAATGCTGGAAGTTTCTGTAATGACTATTCGTCGCGATCTCCATCAGGAAGATGAACCTCTGCCACTGACCC
TACTGGGTGGCTATATTGTAATGGTGCATAAACCCGCACCATCCATGCCAGTAATCCAGGACGTTCCGAGAAATCATCGT
GATGACTTACCTATTGCAATTCTGGCCGCCGGAATGGTTAATGAAAATGATCTGATCTTCTTTGATAATGGCCAGGAGAT
ACCGCTCGTTATAAGCATGATCCCGGATGCAATCACCTTCACTGGCATCTGTTACTCACATCGTGTCTTTGTTGCGTTGA
ATGAAAAACCTAATGTGACAGCAATACTTTGTGGTGGTACGTATCGTGCCAGAAGTGATGCTTTTTACGATGCCAGTAAC
TCTTCGCCATTAGACTCTCTCAATCCGCGAAAAATATTTATTTCCGCCAGCGGTGTACATGATCACTTTGGCGTCAGCTG
GTTTAATCCCGAAGATCTTGCCACTAAGCGTAAAGCGATGGCCCGTGGACTAAGGAAAATTTTGCTCGCCCGCCACGCCT
TGTTCGATGAAGTAGCCTCTGCCAGCCTCGCACCGCTCTCTGCATTTGATGTTCTGATTAGCGAGCGTCCGTTACCGGCA
GATTATGTTACGCACTGCCGGAATGCTTCTGTAAAGATCATTACACCTGATTCAGAAGACGAATGA
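The GC content field reported for this gene is the percentage of G and C bases in the sequence above. A small sketch of that calculation follows; `gc_content` is a hypothetical helper name, and the toy input is only there to show the behaviour. Pasting the full 786-base sequence into the function is how the reported value could be checked.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence (whitespace is ignored)."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Toy example: two of the four bases are G or C.
print(gc_content("ATGC"))  # 50.0
```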

Upstream 100 bases:

>100_bases
TATACCCTTTTCATTTCAAAGGGTCGGTCGTATAGTATGGTAACTAAAACAATGTTTACTAATGCCATAATGTTATTTTT
ATAACATTTTACGGAGAGAG

Downstream 100 bases:

>100_bases
CTTACTGAAAAAACACCGTACTCTTGTTAAACATCGTCGGATTGGACTGATTACGTTGCACTTTCATCACATATTCCAGT
TTATCAATTTGGCTTATCAT
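Flanking regions like the upstream and downstream blocks above can be cut from a chromosome string by slicing around the gene coordinates. This is a sketch under stated assumptions: coordinates are 1-based and inclusive, the gene is on the direct strand, and the chromosome is treated as linear (no wrap-around at the origin). The `flanks` helper and the toy genome are illustrative only.

```python
from typing import Tuple


def flanks(genome: str, start: int, end: int, width: int = 100) -> Tuple[str, str]:
    """Return (upstream, downstream) flanks of a direct-strand feature.

    start and end are 1-based inclusive; flanks are truncated at the sequence ends.
    """
    upstream = genome[max(0, start - 1 - width):start - 1]
    downstream = genome[end:end + width]
    return upstream, downstream


# Toy genome: the "gene" is CCCC at positions 5-8.
toy = "AAAACCCCGGGGTTTT"
print(flanks(toy, 5, 8, width=4))  # ('AAAA', 'GGGG')
```

For a gene on the complementary strand, the two flanks would need to be swapped and reverse-complemented, which this sketch does not do.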

Product: putative transcriptional regulator

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 261; Mature: 261

Protein sequence:

>261_residues
MMETKQKERIRRLIEILKKTDRIHLKDAARMLEVSVMTIRRDLHQEDEPLPLTLLGGYIVMVHKPAPSMPVIQDVPRNHR
DDLPIAILAAGMVNENDLIFFDNGQEIPLVISMIPDAITFTGICYSHRVFVALNEKPNVTAILCGGTYRARSDAFYDASN
SSPLDSLNPRKIFISASGVHDHFGVSWFNPEDLATKRKAMARGLRKILLARHALFDEVASASLAPLSAFDVLISERPLPA
DYVTHCRNASVKIITPDSEDE
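The protein above follows from the gene sequence by codon translation. Note that the gene starts with TTG, which reads as Met when used as an initiation codon in bacteria. The sketch below assumes the standard genetic code and treats ATG, GTG, and TTG as start codons (a subset of the bacterial alternatives); the names are illustrative.

```python
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, codons enumerated in TCAG order.
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))
START_CODONS = {"ATG", "GTG", "TTG"}  # assumed subset of bacterial start codons


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        codon = cds[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as Met
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("TTGATGGAAACGAAGCAAAAAGAGCGTATC"))  # MMETKQKERI
```

The 786 bases correspond to 262 codons, that is, 261 residues plus the terminal TGA stop codon.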

Sequences:

>Translated_261_residues
MMETKQKERIRRLIEILKKTDRIHLKDAARMLEVSVMTIRRDLHQEDEPLPLTLLGGYIVMVHKPAPSMPVIQDVPRNHR
DDLPIAILAAGMVNENDLIFFDNGQEIPLVISMIPDAITFTGICYSHRVFVALNEKPNVTAILCGGTYRARSDAFYDASN
SSPLDSLNPRKIFISASGVHDHFGVSWFNPEDLATKRKAMARGLRKILLARHALFDEVASASLAPLSAFDVLISERPLPA
DYVTHCRNASVKIITPDSEDE
>Mature_261_residues
MMETKQKERIRRLIEILKKTDRIHLKDAARMLEVSVMTIRRDLHQEDEPLPLTLLGGYIVMVHKPAPSMPVIQDVPRNHR
DDLPIAILAAGMVNENDLIFFDNGQEIPLVISMIPDAITFTGICYSHRVFVALNEKPNVTAILCGGTYRARSDAFYDASN
SSPLDSLNPRKIFISASGVHDHFGVSWFNPEDLATKRKAMARGLRKILLARHALFDEVASASLAPLSAFDVLISERPLPA
DYVTHCRNASVKIITPDSEDE

Specific function: This protein is one of the repressors that regulate the expression of the deoCABD genes, which encode nucleotide- and deoxyribonucleotide-catabolizing enzymes. It also negatively regulates the expression of nupG (a transport protein) and tsx (a pore-forming

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HTH deoR-type DNA-binding domain [H]

Homologues:

Organism=Escherichia coli, GI1787063, Length=253, Percent_Identity=37.5494071146245, Blast_Score=171, Evalue=3e-44
Organism=Escherichia coli, GI1790753, Length=262, Percent_Identity=22.5190839694656, Blast_Score=68, Evalue=6e-13
Organism=Escherichia coli, GI1789059, Length=261, Percent_Identity=24.1379310344828, Blast_Score=67, Evalue=9e-13
Organism=Escherichia coli, GI1789519, Length=256, Percent_Identity=22.65625, Blast_Score=63, Evalue=2e-11
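The homologue lines above use a comma-separated `key=value` layout with a bare `GI…` token, so they are straightforward to parse for filtering or sorting. This is a minimal sketch, assuming every line follows the layout shown here; `parse_hit` is a hypothetical helper, and an optional trailing comma is tolerated.

```python
def parse_hit(line: str) -> dict:
    """Parse one homologue line into a dict with typed numeric fields."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare token such as GI1787063
    casts = {"Length": int, "Percent_Identity": float,
             "Blast_Score": float, "Evalue": float}
    for key, cast in casts.items():
        if key in record:
            record[key] = cast(record[key])
    return record


hit = parse_hit("Organism=Escherichia coli, GI1787063, Length=253, "
                "Percent_Identity=37.5494071146245, Blast_Score=171, Evalue=3e-44")
print(hit["GI"], hit["Length"], round(hit["Percent_Identity"], 1))  # 1787063 253 37.5
```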

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR014036
- InterPro:   IPR001034
- InterPro:   IPR018356
- InterPro:   IPR011991 [H]

Pfam domain/function: PF00455 DeoR; PF08220 HTH_DeoR [H]

EC number: NA

Molecular weight: Translated: 29209; Mature: 29209

Theoretical pI: Translated: 6.80; Mature: 6.80

Prosite motif: PS00894 HTH_DEOR_1; PS51000 HTH_DEOR_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.1 %Cys     (Translated Protein)
3.4 %Met     (Translated Protein)
4.6 %Cys+Met (Translated Protein)
1.1 %Cys     (Mature Protein)
3.4 %Met     (Mature Protein)
4.6 %Cys+Met (Mature Protein)
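The Cys/Met percentages above are residue counts divided by the protein length. The sketch below recomputes them from the 261-residue sequence given in this record; the function name and the rounding to one decimal place are assumptions made for illustration.

```python
PROTEIN = (
    "MMETKQKERIRRLIEILKKTDRIHLKDAARMLEVSVMTIRRDLHQEDEPLPLTLLGGYIVMVHKPAPSMPVIQDVPRNHR"
    "DDLPIAILAAGMVNENDLIFFDNGQEIPLVISMIPDAITFTGICYSHRVFVALNEKPNVTAILCGGTYRARSDAFYDASN"
    "SSPLDSLNPRKIFISASGVHDHFGVSWFNPEDLATKRKAMARGLRKILLARHALFDEVASASLAPLSAFDVLISERPLPA"
    "DYVTHCRNASVKIITPDSEDE"
)


def cys_met_content(protein: str) -> dict:
    """Percent Cys, Met, and Cys+Met in a protein, rounded to one decimal."""
    n = len(protein)
    cys = protein.count("C")
    met = protein.count("M")
    return {
        "Cys": round(100.0 * cys / n, 1),
        "Met": round(100.0 * met / n, 1),
        "Cys+Met": round(100.0 * (cys + met) / n, 1),
    }


print(PROTEIN.count("C"), PROTEIN.count("M"))  # 3 9
print(cys_met_content(PROTEIN))  # {'Cys': 1.1, 'Met': 3.4, 'Cys+Met': 4.6}
```

Because the translated and mature sequences are identical in this record, both sets of percentages are the same.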

Secondary structure:

>Translated Secondary Structure
MMETKQKERIRRLIEILKKTDRIHLKDAARMLEVSVMTIRRDLHQEDEPLPLTLLGGYIV
CCCCHHHHHHHHHHHHHHHCCCEEHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEECCEEE
MVHKPAPSMPVIQDVPRNHRDDLPIAILAAGMVNENDLIFFDNGQEIPLVISMIPDAITF
EEECCCCCCCHHHHCCCCCCCCCCEEEEEECCCCCCCEEEEECCCCCCEEEEECCCHHHH
TGICYSHRVFVALNEKPNVTAILCGGTYRARSDAFYDASNSSPLDSLNPRKIFISASGVH
HEEEEEEEEEEEECCCCCEEEEEECCCCCCCCCCEEECCCCCCCCCCCCCEEEEEECCCC
DHFGVSWFNPEDLATKRKAMARGLRKILLARHALFDEVASASLAPLSAFDVLISERPLPA
HHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCCCCCH
DYVTHCRNASVKIITPDSEDE
HHHHHCCCCCEEEECCCCCCC
>Mature Secondary Structure
MMETKQKERIRRLIEILKKTDRIHLKDAARMLEVSVMTIRRDLHQEDEPLPLTLLGGYIV
CCCCHHHHHHHHHHHHHHHCCCEEHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEECCEEE
MVHKPAPSMPVIQDVPRNHRDDLPIAILAAGMVNENDLIFFDNGQEIPLVISMIPDAITF
EEECCCCCCCHHHHCCCCCCCCCCEEEEEECCCCCCCEEEEECCCCCCEEEEECCCHHHH
TGICYSHRVFVALNEKPNVTAILCGGTYRARSDAFYDASNSSPLDSLNPRKIFISASGVH
HEEEEEEEEEEEECCCCCEEEEEECCCCCCCCCCEEECCCCCCCCCCCCCEEEEEECCCC
DHFGVSWFNPEDLATKRKAMARGLRKILLARHALFDEVASASLAPLSAFDVLISERPLPA
HHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCCCCCH
DYVTHCRNASVKIITPDSEDE
HHHHHCCCCCEEEECCCCCCC
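The prediction strings above can be summarized as the share of residues in each state. This is a sketch assuming the usual three-state convention (H for helix, E for strand, C for coil), which the record itself does not define; `ss_composition` is a hypothetical helper, shown on a toy string.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of residues in each predicted state (H, E, C)."""
    states = "".join(states.split()).upper()
    if not states:
        raise ValueError("empty prediction string")
    counts = Counter(states)
    return {s: round(100.0 * counts.get(s, 0) / len(states), 1) for s in "HEC"}


# Toy example: 6 H, 2 E, and 4 C out of 12 positions.
print(ss_composition("CCCCHHHHHHEE"))  # {'H': 50.0, 'E': 16.7, 'C': 33.3}
```

Concatenating the prediction lines of this record (the lines made up only of H, E, and C) and passing them to the function would give the composition behind the "Alpha Beta" structure class.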

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796 [H]