| Field | Value |
|---|---|
| Definition | Escherichia coli O157:H7 str. EC4115, complete genome. |
| Accession | NC_011353 |
| Length | 5,572,075 bp |
The map label for this gene is gadW [H]
Identifier: 209399020
GI number: 209399020
Start: 1182144
End: 1182731
Strand: Direct
Name: gadW [H]
Synonym: ECH74115_1164
Alternate gene names: 209399020
Gene position: 1182144-1182731 (Clockwise)
Preceding gene: 209399037
Following gene: 209399945
Centisome position: 21.22
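The coordinates above determine both the gene length and the centisome position. A minimal sketch in Python, assuming the centisome value is the start coordinate expressed as a percentage of the 5,572,075 bp genome length (an assumption, though it reproduces the listed 21.22); the function names are illustrative and not part of the record:

```python
GENOME_LENGTH = 5_572_075  # bp, from the Length field of NC_011353


def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int = GENOME_LENGTH) -> float:
    """Start position as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


if __name__ == "__main__":
    print(gene_length(1182144, 1182731))   # 588
    print(round(centisome(1182144), 2))    # 21.22
```

The 588 bases correspond to 196 codons, that is 195 residues plus the stop codon.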
GC content: 42.01
Gene sequence:
>588_bases
ATGCATTATGGCAAAGTTAAAATTTTCGATATAAACCATTCCATAGTAAGTCAATATCTGGAAATTCAGCATAAGCTGAC
AAGAACTCATCTGACTGACGTTCCGCTTTATCTGTCACTGGAACCCAACAACCCTGCGTTGGCTGAGGCTTTAATTACCA
GCCAGAGATTTTCCGGAGATACCACGGATATGTTTCTTATGATGGCATGCCTGTCGCTGTTTGAATCAGATGAACGGATA
TTATTATTTTTAAGTGGATGTTTATCCAGTATAAGTGCCAAAGTCAGGGCGATAATTCAGACAGATATATCAGCAAGCTG
GACGCTTGGTGCGATTGCGTTACGCCTGCATATGAGTGAGAGTTTGTTAAAGATAAAACTGAAAAATGAAGGGCACATGT
TCAGTCGCTTGTTGCTGGAAGAGCGGATGCGTGTTGCTGTCAATATGTTATGTTCCCGGCATGGATATGGACAGGCTGTA
GCAGAAAAATGCGGTTATTCAAGCTGGTCCTACTTTATTTCTGTATTTCACCGCTATTATGGCTTCCCGCCAGACAGATA
TGTATCCAGGCAAGGGCTTGATTATTGA
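The GC content field can be recomputed from the gene sequence. The following is a sketch only, assuming GC content is defined as the percentage of G and C bases over all bases in the sequence; `gc_content` is a hypothetical helper name, and whitespace inside the pasted sequence is ignored:

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Toy input; paste the full gene sequence here to check the record.
    print(round(gc_content("ATGC ATTA"), 2))  # 25.0
```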
Upstream 100 bases:
>100_bases
GGTTTGCTCCCCAATTAATATTTTTCTTGAAAAGGATACGTTGTCACTTAAGCCCGGCTCAGTCGTTCTGGCCACCAAAT
GCATCAGGGCGCTTTTCCTT
Downstream 100 bases:
>100_bases
TTTTCATCTGATTATTATTTTTTGACCCAGCCCTTTAGCTCAGTGGTGAGAGCGAGCGACTCATAATCGCCAGGTCGCTG
GTTCAAATCCAGCAAGGGCC
Product: putative envelope protein encoded within prophage CP-933N
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 195; Mature: 195
Protein sequence:
>195_residues
MHYGKVKIFDINHSIVSQYLEIQHKLTRTHLTDVPLYLSLEPNNPALAEALITSQRFSGDTTDMFLMMACLSLFESDERI
LLFLSGCLSSISAKVRAIIQTDISASWTLGAIALRLHMSESLLKIKLKNEGHMFSRLLLEERMRVAVNMLCSRHGYGQAV
AEKCGYSSWSYFISVFHRYYGFPPDRYVSRQGLDY
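The protein sequence follows from the gene sequence by codon-wise translation: 588 bases give 196 codons, the last of which (TGA) is a stop codon, leaving 195 residues. A minimal sketch, assuming the standard codon assignments; the compact table and the `translate` helper are illustrative and not taken from the record:

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base up to the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 18 bases of the gene sequence listed above.
    print(translate("ATGCATTATGGCAAAGTT"))  # MHYGKV
```

The same function applied to the last nine bases of the gene (GATTATTGA) yields DY, matching the C-terminal residues of the listed protein.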
Sequences:
>Translated_195_residues
MHYGKVKIFDINHSIVSQYLEIQHKLTRTHLTDVPLYLSLEPNNPALAEALITSQRFSGDTTDMFLMMACLSLFESDERI
LLFLSGCLSSISAKVRAIIQTDISASWTLGAIALRLHMSESLLKIKLKNEGHMFSRLLLEERMRVAVNMLCSRHGYGQAV
AEKCGYSSWSYFISVFHRYYGFPPDRYVSRQGLDY
>Mature_195_residues
MHYGKVKIFDINHSIVSQYLEIQHKLTRTHLTDVPLYLSLEPNNPALAEALITSQRFSGDTTDMFLMMACLSLFESDERI
LLFLSGCLSSISAKVRAIIQTDISASWTLGAIALRLHMSESLLKIKLKNEGHMFSRLLLEERMRVAVNMLCSRHGYGQAV
AEKCGYSSWSYFISVFHRYYGFPPDRYVSRQGLDY
Specific function: Depending on the conditions (growth phase and medium), acts as a positive or negative regulator of gadA and gadBC. Repression occurs directly or via the repression of the expression of gadX. Activation occurs directly by the binding of gadW to the gadA an
COG id: COG2207
COG function: function code K; AraC-type DNA-binding domain-containing proteins
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 HTH araC/xylS-type DNA-binding domain [H]
Homologues:
Organism=Escherichia coli, GI1789932, Length=186, Percent_Identity=37.0967741935484, Blast_Score=107, Evalue=7e-25
Organism=Escherichia coli, GI1787776, Length=126, Percent_Identity=40.4761904761905, Blast_Score=92, Evalue=2e-20
Organism=Escherichia coli, GI1786776, Length=192, Percent_Identity=29.6875, Blast_Score=92, Evalue=3e-20
Organism=Escherichia coli, GI1789933, Length=154, Percent_Identity=38.3116883116883, Blast_Score=87, Evalue=1e-18
Organism=Escherichia coli, GI1790557, Length=193, Percent_Identity=33.160621761658, Blast_Score=86, Evalue=2e-18
Organism=Escherichia coli, GI1786778, Length=127, Percent_Identity=44.8818897637795, Blast_Score=84, Evalue=4e-18
Paralogues:
None
Copy number: 10-20 Molecules/Cell [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR009057
- InterPro: IPR012287
- InterPro: IPR018062
- InterPro: IPR020449
- InterPro: IPR018060 [H]
Pfam domain/function: PF00165 HTH_AraC [H]
EC number: NA
Molecular weight: Translated: 22296; Mature: 22296
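A molecular weight of this kind is typically estimated by summing average residue masses and adding one water molecule for the free termini. The sketch below assumes that method and uses commonly tabulated average residue masses in daltons; the record does not state how its value was computed, so small differences from the listed figure are possible. `molecular_weight` is a hypothetical helper name:

```python
# Average residue masses in daltons (amino acid mass minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528  # added once for the N- and C-terminal groups


def molecular_weight(protein: str) -> float:
    """Estimated average molecular weight (Da) of an unmodified peptide chain."""
    residues = "".join(protein.split()).upper()
    if not residues:
        raise ValueError("empty sequence")
    return sum(RESIDUE_MASS[residue] for residue in residues) + WATER_MASS


if __name__ == "__main__":
    # Toy input; paste the 195-residue sequence to compare with the record.
    print(round(molecular_weight("GA"), 2))
```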
Theoretical pI: Translated: 8.32; Mature: 8.32
Prosite motif: PS01124 HTH_ARAC_FAMILY_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
2.1 %Cys (Translated Protein)
4.1 %Met (Translated Protein)
6.2 %Cys+Met (Translated Protein)
2.1 %Cys (Mature Protein)
4.1 %Met (Mature Protein)
6.2 %Cys+Met (Mature Protein)
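The Cys/Met percentages are simple residue frequencies over the protein length. A small sketch of that arithmetic, assuming each percentage is rounded to one decimal place as in the listing; `cys_met_content` is an illustrative name:

```python
def cys_met_content(protein: str) -> tuple[float, float, float]:
    """Return (%Cys, %Met, %Cys+Met) of a protein, each rounded to one decimal."""
    residues = "".join(protein.split()).upper()
    if not residues:
        raise ValueError("empty sequence")
    n_cys = residues.count("C")
    n_met = residues.count("M")
    total = len(residues)
    return (
        round(100.0 * n_cys / total, 1),
        round(100.0 * n_met / total, 1),
        round(100.0 * (n_cys + n_met) / total, 1),
    )


if __name__ == "__main__":
    # Toy input: 1 Cys and 1 Met in 4 residues.
    print(cys_met_content("MCAA"))  # (25.0, 25.0, 50.0)
```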
Secondary structure:
>Translated Secondary Structure
MHYGKVKIFDINHSIVSQYLEIQHKLTRTHLTDVPLYLSLEPNNPALAEALITSQRFSGD
CCCCCEEEEECCHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHCCCC
TTDMFLMMACLSLFESDERILLFLSGCLSSISAKVRAIIQTDISASWTLGAIALRLHMSE
HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCC
SLLKIKLKNEGHMFSRLLLEERMRVAVNMLCSRHGYGQAVAEKCGYSSWSYFISVFHRYY
CEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCCCHHHHHHHHHHHHH
GFPPDRYVSRQGLDY
CCCCHHHHCCCCCCC
>Mature Secondary Structure
MHYGKVKIFDINHSIVSQYLEIQHKLTRTHLTDVPLYLSLEPNNPALAEALITSQRFSGD
CCCCCEEEEECCHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHCCCC
TTDMFLMMACLSLFESDERILLFLSGCLSSISAKVRAIIQTDISASWTLGAIALRLHMSE
HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCC
SLLKIKLKNEGHMFSRLLLEERMRVAVNMLCSRHGYGQAVAEKCGYSSWSYFISVFHRYY
CEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCCCHHHHHHHHHHHHH
GFPPDRYVSRQGLDY
CCCCHHHHCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]