| Definition | Escherichia coli ED1a chromosome, complete genome. |
|---|---|
| Accession | NC_011745 |
| Length | 5,209,548 |
The map label for this gene is gadW [H]
Identifier: 218691799
GI number: 218691799
Start: 4100374
End: 4101102
Strand: Reverse
Name: gadW [H]
Synonym: ECED1_4193
Alternate gene names: 218691799
Gene position: 4101102-4100374 (Counterclockwise)
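The Start and End fields fix the gene's length and its reading-frame arithmetic. A minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not say so explicitly) and using a hypothetical helper name `gene_span`:

```python
def gene_span(start: int, end: int) -> dict:
    """Length, codon count and residue count for 1-based inclusive coordinates."""
    length = end - start + 1          # inclusive span in bases
    if length % 3 != 0:
        raise ValueError("span is not a whole number of codons")
    codons = length // 3              # includes the terminal stop codon
    return {"length": length, "codons": codons, "residues": codons - 1}


span = gene_span(4100374, 4101102)
print(span)  # {'length': 729, 'codons': 243, 'residues': 242}
```

Under that assumption the span agrees with the 729-base gene sequence and the 242-residue translated protein listed in this record.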
Preceding gene: 218691800
Following gene: 218691794
Centisome position: 78.72
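The record does not state how the centisome position is derived. The reported value is consistent with expressing the gene's 5' end (the higher coordinate, since the gene lies on the reverse strand) as a percentage of the chromosome length. A minimal sketch under that assumption, with a hypothetical helper name `centisome`:

```python
GENOME_LENGTH = 5_209_548  # chromosome length from the record header


def centisome(position: int, genome_length: int = GENOME_LENGTH) -> float:
    """Map position expressed as a percentage of the chromosome length."""
    return round(100.0 * position / genome_length, 2)


print(centisome(4_101_102))  # 78.72 (5' end of the reverse-strand gene)
print(centisome(4_100_374))  # 78.71 (lower coordinate, for comparison)
```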
GC content: 41.02
Gene sequence:
>729_bases ATGGCTCATGTCTGCTCGGTGATCCTCGTTCGTCGTTCATTCGATATTCATCATGAACAGCAAAAAATATCGTTGCATAA CGAGAGTATCCTGCTGCTGGATAAAAATTTGGCAGACGATTTTGCGTTTTGTTCACTGGATACGCGACAGCTGGATATCG AAGAGCTGACAGTTTGCCATTACTTACAAAATATTCGTCAGTTGCCACGCAATTTAGGATTGCATAGCAAAGACCGTTTG TTAATTAACCAGTCACCCCCCATACAGCTGGTGACGGCGATTTTTGATAGTTTCAATGACCCCCGGGTCAATTCGCCGAT ACTGAGCAAAATGCTCTATCTTTCCTGTTTATCAATGTTTTCTCATAAGAAAGAACTGATCCCCTTACTTTTCAATAGTA TCAGTACTGTTTCAGGAAAAGTTGAACGCCTTATTAGCTTTGATATCGCTAAACGTTGGTATCTACGCGATATCGCAGAA AGAATGTACACCAGCGAGAGTCTCATCAAAAAAAAGTTGCAGGATGAAAATACCTGTTTCAGTAAAATATTACTCGCCTC CAGGATGTCGATGGCCAGACGATTACTCGAGTTACGTCAAATACCTCTGCATACTATTGCGGAAAAATGTGGCTATAGCA GTACGTCATACTTTATAAATACATTTAGACAATATTATGGTGTAACGCCACATCAGTTTTCGCAACATTCGCCGGGTACC TTTTCCTGA
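The GC content field can be rechecked from the gene sequence above. The sketch below assumes the field is simply (G + C) / length × 100 over the coding sequence, which the record does not confirm. The helper name `gc_percent` is hypothetical, and whitespace is stripped so the space-separated 80-base chunks can be pasted as they appear.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# Toy input; paste the 729-base gene sequence to compare with the record.
print(gc_percent("ATGG CTCA"))  # 50.0
```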
Upstream 100 bases:
>100_bases TACCAAATATGGCAGTTTTTGCACACAGAAACAGTCTGGCGTCATTTCATTAGTATACTGACATTGAAATAATCGCAGTA ATGAAATATAAGGGATAGTC
Downstream 100 bases:
>100_bases CATATTTCGCATTTGAATATTGGTCAGGATCTCACGTCTGCTTCATGTGAAACTCTTTCCTGATGATTTCTGCCGGGCTA CCGGCTAGTTCTCTTTCGCA
Product: DNA-binding transcriptional activator
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 242; Mature: 241
Protein sequence:
>242_residues MAHVCSVILVRRSFDIHHEQQKISLHNESILLLDKNLADDFAFCSLDTRQLDIEELTVCHYLQNIRQLPRNLGLHSKDRL LINQSPPIQLVTAIFDSFNDPRVNSPILSKMLYLSCLSMFSHKKELIPLLFNSISTVSGKVERLISFDIAKRWYLRDIAE RMYTSESLIKKKLQDENTCFSKILLASRMSMARRLLELRQIPLHTIAEKCGYSSTSYFINTFRQYYGVTPHQFSQHSPGT FS
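The protein sequence follows from the gene sequence by codon-wise translation. A minimal sketch using the standard codon assignments, with hypothetical names `CODON_TABLE` and `translate`; the two calls use the first five and the last three codons of the gene sequence in this record:

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order; '*' marks a stop.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence 5'->3', stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCTCATGTCTGC"))  # MAHVC
print(translate("TTTTCCTGA"))        # FS
```

The one-residue difference between the translated (242) and mature (241) forms is consistent with removal of the initiator methionine, although the record does not state this.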
Sequences:
>Translated_242_residues MAHVCSVILVRRSFDIHHEQQKISLHNESILLLDKNLADDFAFCSLDTRQLDIEELTVCHYLQNIRQLPRNLGLHSKDRL LINQSPPIQLVTAIFDSFNDPRVNSPILSKMLYLSCLSMFSHKKELIPLLFNSISTVSGKVERLISFDIAKRWYLRDIAE RMYTSESLIKKKLQDENTCFSKILLASRMSMARRLLELRQIPLHTIAEKCGYSSTSYFINTFRQYYGVTPHQFSQHSPGT FS
>Mature_241_residues AHVCSVILVRRSFDIHHEQQKISLHNESILLLDKNLADDFAFCSLDTRQLDIEELTVCHYLQNIRQLPRNLGLHSKDRLL INQSPPIQLVTAIFDSFNDPRVNSPILSKMLYLSCLSMFSHKKELIPLLFNSISTVSGKVERLISFDIAKRWYLRDIAER MYTSESLIKKKLQDENTCFSKILLASRMSMARRLLELRQIPLHTIAEKCGYSSTSYFINTFRQYYGVTPHQFSQHSPGTF S
Specific function: Depending on the conditions (growth phase and medium), acts as a positive or negative regulator of gadA and gadBC. Repression occurs directly or via the repression of the expression of gadX. Activation occurs directly by the binding of gadW to the gadA an
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 HTH araC/xylS-type DNA-binding domain [H]
Homologues:
Organism=Escherichia coli, GI1789932, Length=242, Percent_Identity=94.2148760330578, Blast_Score=471, Evalue=1e-134
Organism=Escherichia coli, GI1786776, Length=241, Percent_Identity=34.8547717842324, Blast_Score=162, Evalue=2e-41
Organism=Escherichia coli, GI1787776, Length=237, Percent_Identity=34.5991561181435, Blast_Score=135, Evalue=2e-33
Organism=Escherichia coli, GI1789933, Length=209, Percent_Identity=31.5789473684211, Blast_Score=96, Evalue=3e-21
Organism=Escherichia coli, GI1790557, Length=136, Percent_Identity=37.5, Blast_Score=88, Evalue=4e-19
Organism=Escherichia coli, GI1786778, Length=243, Percent_Identity=30.8641975308642, Blast_Score=85, Evalue=3e-18
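The homologue entries share a fixed `key=value` layout, so they can be read into structured records. A minimal sketch, assuming every entry has the six fields seen in this record in the same order. The names `HOMOLOGUE_RE` and `parse_homologues` are hypothetical, and the pattern tolerates entries separated by either commas or line breaks.

```python
import re

# One homologue entry: organism, GI number, length, identity, score, E-value.
HOMOLOGUE_RE = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<identity>[\d.]+),\s*Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[-+\de.]+)"
)


def parse_homologues(text: str) -> list:
    """Return one dict per homologue entry found in the text."""
    hits = []
    for m in HOMOLOGUE_RE.finditer(text):
        hits.append({
            "organism": m.group("organism").strip(),
            "gi": int(m.group("gi")),
            "length": int(m.group("length")),
            "identity": float(m.group("identity")),
            "score": int(m.group("score")),
            "evalue": float(m.group("evalue")),
        })
    return hits


sample = (
    "Organism=Escherichia coli, GI1789932, Length=242, "
    "Percent_Identity=94.2148760330578, Blast_Score=471, Evalue=1e-134, "
    "Organism=Escherichia coli, GI1786776, Length=241, "
    "Percent_Identity=34.8547717842324, Blast_Score=162, Evalue=2e-41,"
)
hits = parse_homologues(sample)
print(len(hits), hits[0]["gi"], hits[0]["score"])  # 2 1789932 471
```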
Paralogues:
None
Copy number: 10-20 Molecules/Cell [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR009057
- InterPro: IPR012287
- InterPro: IPR018062
- InterPro: IPR020449
- InterPro: IPR018060 [H]
Pfam domain/function: PF00165 HTH_AraC [H]
EC number: NA
Molecular weight: Translated: 28085; Mature: 27954
Theoretical pI: Translated: 8.99; Mature: 8.99
Prosite motif: PS00041 HTH_ARAC_FAMILY_1 ; PS01124 HTH_ARAC_FAMILY_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
2.5 %Cys (Translated Protein)
2.5 %Met (Translated Protein)
5.0 %Cys+Met (Translated Protein)
2.5 %Cys (Mature Protein)
2.1 %Met (Mature Protein)
4.6 %Cys+Met (Mature Protein)
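The Cys/Met percentages can be reproduced from the protein sequence in this record. A minimal sketch, assuming each value is the residue count divided by the sequence length and rounded to one decimal, and that the mature form is the translated form without its first residue. The names `TRANSLATED`, `MATURE`, and `cys_met_percent` are hypothetical.

```python
# Translated protein sequence as listed in this record (242 residues).
TRANSLATED = (
    "MAHVCSVILVRRSFDIHHEQQKISLHNESILLLDKNLADDFAFCSLDTRQLDIEELTVCHYLQNIRQLPRNLGLHSKDRL"
    "LINQSPPIQLVTAIFDSFNDPRVNSPILSKMLYLSCLSMFSHKKELIPLLFNSISTVSGKVERLISFDIAKRWYLRDIAE"
    "RMYTSESLIKKKLQDENTCFSKILLASRMSMARRLLELRQIPLHTIAEKCGYSSTSYFINTFRQYYGVTPHQFSQHSPGT"
    "FS"
)
MATURE = TRANSLATED[1:]  # assumption: mature form lacks the initiator Met


def cys_met_percent(protein: str) -> dict:
    """Percent Cys, Met and Cys+Met, each rounded to one decimal place."""
    n = len(protein)
    cys = 100.0 * protein.count("C") / n
    met = 100.0 * protein.count("M") / n
    return {"Cys": round(cys, 1), "Met": round(met, 1), "Cys+Met": round(cys + met, 1)}


print(cys_met_percent(TRANSLATED))  # {'Cys': 2.5, 'Met': 2.5, 'Cys+Met': 5.0}
print(cys_met_percent(MATURE))      # {'Cys': 2.5, 'Met': 2.1, 'Cys+Met': 4.6}
```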
Secondary structure:
>Translated Secondary Structure
MAHVCSVILVRRSFDIHHEQQKISLHNESILLLDKNLADDFAFCSLDTRQLDIEELTVCH
CHHHHHHHHHHHHHCHHHHHHHHHHCCCEEEEEECCCCCHHHHHCCCCCCCCHHHHHHHH
YLQNIRQLPRNLGLHSKDRLLINQSPPIQLVTAIFDSFNDPRVNSPILSKMLYLSCLSMF
HHHHHHHHHHHCCCCCCCCEEECCCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHH
SHKKELIPLLFNSISTVSGKVERLISFDIAKRWYLRDIAERMYTSESLIKKKLQDENTCF
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHH
SKILLASRMSMARRLLELRQIPLHTIAEKCGYSSTSYFINTFRQYYGVTPHQFSQHSPGT
HHHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHCCCCCCCC
FS
CC
>Mature Secondary Structure
AHVCSVILVRRSFDIHHEQQKISLHNESILLLDKNLADDFAFCSLDTRQLDIEELTVCH
HHHHHHHHHHHHHCHHHHHHHHHHCCCEEEEEECCCCCHHHHHCCCCCCCCHHHHHHHH
YLQNIRQLPRNLGLHSKDRLLINQSPPIQLVTAIFDSFNDPRVNSPILSKMLYLSCLSMF
HHHHHHHHHHHCCCCCCCCEEECCCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHH
SHKKELIPLLFNSISTVSGKVERLISFDIAKRWYLRDIAERMYTSESLIKKKLQDENTCF
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHH
SKILLASRMSMARRLLELRQIPLHTIAEKCGYSSTSYFINTFRQYYGVTPHQFSQHSPGT
HHHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHCCCCCCCC
FS
CC
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]