Definition | Escherichia coli 55989, complete genome. |
---|---|
Accession | NC_011748 |
Length | 5,154,862 |
Click here to switch to the map view.
The map label for this gene is gadW
Identifier: 218697224
GI number: 218697224
Start: 4028998
End: 4029726
Strand: Reverse
Name: gadW
Synonym: EC55989_3960
Alternate gene names: 218697224
Gene position: 4029726-4028998 (Counterclockwise)
Preceding gene: 218697225
Following gene: 218697219
Centisome position: 78.17
GC content: 41.98
Gene sequence:
>729_bases ATGACTCATGTCTGCTCGGTGATCCTCATTCGTCGTTCATTCGATATTTATCATGAACAGCATAAAATATCGCTGCATAA CGAGAGTATCGTGCTGCTGGAGAAAAATTTGGCAGACGATTTTGCGTTTTGTTCACCGGATACGCGACGACTGGATATCG ATGAGCTGACAGTTTGCCATTACTTACAAAATATTCGTCAGCTACCACGCAATTTAGGGTTACATAGCAAAGACCGTCTA TTAATTAACCAGTCCCCCCCCATGCCGCTGGTGACGGCGATTTTTGATAGCTTCAATGAATCCGGGGTAAATTCACCGAT ACTGAGCAATATGCTCTACCTTTCCTGTTTATCGATGTTTTCTCATAAGAAAGAACTGATCCCCTTACTTTTCAATAGCA TCAGCACTGTTTCAGGAAAAGTTGAACGCCTTATTAGCTTTGATATCGCCAAACGTTGGTATCTGCGCGATATCGCGGAA AGAATGTATACCAGCGAGAGTCTCATCAAAAAAAAGTTGCAGGATGAAAATACCTGTTTCAGTAAAATATTACTCGCTTC CAGGATGTCGATGGCCAGACGATTACTCGAGTTACGTCAAATTCCTCTGCATACTATTGCAGAAAAATGTGGCTATAGCA GTACGTCGTACTTTATAAACACATTTCGACAATATTATGGTGTAACGCCACATCAGTTTGCGCAACATTCGCCAGGTACC TTTTCCTGA
Upstream 100 bases:
>100_bases CTACCAAATCTGGCAGTTTTTTCGCTAAGAAACAGTCTGGCAACATTTCATTAGTATACTGAAATTGAAATAATCGCAGT ATGAAATATAAGGGATAATC
Downstream 100 bases:
>100_bases CATATTTTGCATTTGAATATTGGTCAGGATCTCACACCTGCTTCATGTGAAACTCTTCCCTGATGATTTCTGCCGGGCTA CCGGCTAGTTCTCTTTCGCA
Product: DNA-binding transcriptional activator
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 242; Mature: 241
Protein sequence:
>242_residues MTHVCSVILIRRSFDIYHEQHKISLHNESIVLLEKNLADDFAFCSPDTRRLDIDELTVCHYLQNIRQLPRNLGLHSKDRL LINQSPPMPLVTAIFDSFNESGVNSPILSNMLYLSCLSMFSHKKELIPLLFNSISTVSGKVERLISFDIAKRWYLRDIAE RMYTSESLIKKKLQDENTCFSKILLASRMSMARRLLELRQIPLHTIAEKCGYSSTSYFINTFRQYYGVTPHQFAQHSPGT FS
Sequences:
>Translated_242_residues MTHVCSVILIRRSFDIYHEQHKISLHNESIVLLEKNLADDFAFCSPDTRRLDIDELTVCHYLQNIRQLPRNLGLHSKDRL LINQSPPMPLVTAIFDSFNESGVNSPILSNMLYLSCLSMFSHKKELIPLLFNSISTVSGKVERLISFDIAKRWYLRDIAE RMYTSESLIKKKLQDENTCFSKILLASRMSMARRLLELRQIPLHTIAEKCGYSSTSYFINTFRQYYGVTPHQFAQHSPGT FS >Mature_241_residues THVCSVILIRRSFDIYHEQHKISLHNESIVLLEKNLADDFAFCSPDTRRLDIDELTVCHYLQNIRQLPRNLGLHSKDRLL INQSPPMPLVTAIFDSFNESGVNSPILSNMLYLSCLSMFSHKKELIPLLFNSISTVSGKVERLISFDIAKRWYLRDIAER MYTSESLIKKKLQDENTCFSKILLASRMSMARRLLELRQIPLHTIAEKCGYSSTSYFINTFRQYYGVTPHQFAQHSPGTF S
Specific function: Depending on the conditions (growth phase and medium), acts as a positive or negative regulator of gadA and gadBC. Repression occurs directly or via the repression of the expression of gadX. Activation occurs directly by the binding of gadW to the gadA an
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic
Metaboloic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 HTH araC/xylS-type DNA-binding domain [H]
Homologues:
Organism=Escherichia coli, GI1789932, Length=242, Percent_Identity=99.1735537190083, Blast_Score=496, Evalue=1e-142, Organism=Escherichia coli, GI1786776, Length=241, Percent_Identity=35.6846473029046, Blast_Score=166, Evalue=2e-42, Organism=Escherichia coli, GI1787776, Length=237, Percent_Identity=33.7552742616034, Blast_Score=128, Evalue=3e-31, Organism=Escherichia coli, GI1789933, Length=209, Percent_Identity=32.5358851674641, Blast_Score=92, Evalue=2e-20, Organism=Escherichia coli, GI1790557, Length=133, Percent_Identity=37.593984962406, Blast_Score=86, Evalue=2e-18, Organism=Escherichia coli, GI1786778, Length=124, Percent_Identity=41.1290322580645, Blast_Score=84, Evalue=9e-18,
Paralogues:
None
Copy number: 10-20 Molecules/Cell [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR009057 - InterPro: IPR012287 - InterPro: IPR018062 - InterPro: IPR020449 - InterPro: IPR018060 [H]
Pfam domain/function: PF00165 HTH_AraC [H]
EC number: NA
Molecular weight: Translated: 28024; Mature: 27893
Theoretical pI: Translated: 8.78; Mature: 8.78
Prosite motif: PS00041 HTH_ARAC_FAMILY_1 ; PS01124 HTH_ARAC_FAMILY_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
2.5 %Cys (Translated Protein) 2.9 %Met (Translated Protein) 5.4 %Cys+Met (Translated Protein) 2.5 %Cys (Mature Protein) 2.5 %Met (Mature Protein) 5.0 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MTHVCSVILIRRSFDIYHEQHKISLHNESIVLLEKNLADDFAFCSPDTRRLDIDELTVCH CCHHHHHHHHHHHHHHHHHHHHEEECCCEEEEEECCCCCHHHCCCCCCCCCCHHHHHHHH YLQNIRQLPRNLGLHSKDRLLINQSPPMPLVTAIFDSFNESGVNSPILSNMLYLSCLSMF HHHHHHHHHHHCCCCCCCCEEEECCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHH SHKKELIPLLFNSISTVSGKVERLISFDIAKRWYLRDIAERMYTSESLIKKKLQDENTCF HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHH SKILLASRMSMARRLLELRQIPLHTIAEKCGYSSTSYFINTFRQYYGVTPHQFAQHSPGT HHHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCCHHHHHHHHHHHHCCCHHHHHCCCCCC FS CC >Mature Secondary Structure THVCSVILIRRSFDIYHEQHKISLHNESIVLLEKNLADDFAFCSPDTRRLDIDELTVCH CHHHHHHHHHHHHHHHHHHHHEEECCCEEEEEECCCCCHHHCCCCCCCCCCHHHHHHHH YLQNIRQLPRNLGLHSKDRLLINQSPPMPLVTAIFDSFNESGVNSPILSNMLYLSCLSMF HHHHHHHHHHHCCCCCCCCEEEECCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHH SHKKELIPLLFNSISTVSGKVERLISFDIAKRWYLRDIAERMYTSESLIKKKLQDENTCF HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHH SKILLASRMSMARRLLELRQIPLHTIAEKCGYSSTSYFINTFRQYYGVTPHQFAQHSPGT HHHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCCHHHHHHHHHHHHCCCHHHHHCCCCCC FS CC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]