Definition | Shigella flexneri 2a str. 2457T, complete genome. |
---|---|
Accession | NC_004741 |
Length | 4,599,354 |
The map label for this gene is yhiW
Identifier: 30065158
GI number: 30065158
Start: 4076000
End: 4076728
Strand: Reverse
Name: yhiW
Synonym: S4171
Alternate gene names: 30065158
Gene position: 4076728-4076000 (Counterclockwise)
Preceding gene: 30065159
Following gene: 30065156
Centisome position: 88.64
GC content: 42.11
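The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates. It also assumes that the centisome value is the position of the gene's first transcribed base (the End coordinate for this reverse-strand gene) expressed as a percentage of the genome length. The function names are illustrative and are not part of the record.

```python
GENOME_LENGTH = 4_599_354  # "Length" field of NC_004741


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position on the chromosome as a percentage of the genome length."""
    return 100.0 * position / genome_length


# Values copied from the Start/End fields of this record.
start, end = 4_076_000, 4_076_728

print(gene_length(start, end))                       # 729
print(f"{centisome(end, GENOME_LENGTH):.2f}")        # 88.64
```

Under these assumptions the results agree with the 729-base gene sequence and the centisome position of 88.64 reported in the record.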
Gene sequence:
>729_bases
ATGACTCATGTCTGCTCGGTGATCCTCATTCGTCGTTCATTCGATATTTATCATGAACAGCAAAAAATATCGCTGCATAA
CGAGAGTATCCTGCTGCTGGAGAAAAATTTGGCAGACGATTTTGCGTTTTGTTCACCGGATACGCGACGACTGGATATCG
ATGAGCTGACAGTTTGCCATTACTTACAAAATATTCGTCAGCTACCACGCAATTTAGGGTTACATAGCAAAGACCGTCTG
TTAATTAACCAGTCACCCCCCATGCCGCTGGTGACGGCGATTTTTGATAGCTTCAATGAATCCGGGGTAAATTCACCGAT
ACTGAGCAATATGCTCTACCTTTCCTGTTTATCGATGTTTTCTCATAAGAAAGAACTGATCCCCTTACTTTTCAATAGCA
TCAGCACTGTTTCAGGAAAAGTTGAACGCCTTATTAGCTTTGATATCGCCAAACGTTGGTATCTGCGCGATATCGCGGAA
AGAATGTATACCAGCGAGAGTCTCATCAAAAAAAAGTTGCAGGATGAAAATACCTGTTTCAGTAAAATATTACTCGCTTC
CAGGATGTCGATGGCCAGACGATTACTCGAGTTACGTCAAATTCCTCTGCATACTATTGCGGAAAAATGTGGCTATAGCA
GTACGTCGTACTTTATAAACACATTTCGACAATATTATGGTGTAACGCCACATCAGTTTGCGCAACATTCGCCAGGTACC
TTTTCCTGA
Upstream 100 bases:
>100_bases
CTACCAAATCTGGCAGTTTTTGCGCTAAGAAACAGTCTGTCATCATTTCATTAGTATACTGAAATTGAAATAATCGCAGT
ATGAAATATAAGGGATAATC
Downstream 100 bases:
>100_bases
CATATTTTGCATTTGAATATTGGTCAGGATCTCACACCTGCTTCATGTGAAACTCTTCCCTGATGATTTCTGCCGGGCTA
CCGGCTAGTTCTCTTTCGCA
Product: putative ARAC-type regulatory protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 242; Mature: 241
Protein sequence:
>242_residues
MTHVCSVILIRRSFDIYHEQQKISLHNESILLLEKNLADDFAFCSPDTRRLDIDELTVCHYLQNIRQLPRNLGLHSKDRL
LINQSPPMPLVTAIFDSFNESGVNSPILSNMLYLSCLSMFSHKKELIPLLFNSISTVSGKVERLISFDIAKRWYLRDIAE
RMYTSESLIKKKLQDENTCFSKILLASRMSMARRLLELRQIPLHTIAEKCGYSSTSYFINTFRQYYGVTPHQFAQHSPGT
FS
Sequences:
>Translated_242_residues
MTHVCSVILIRRSFDIYHEQQKISLHNESILLLEKNLADDFAFCSPDTRRLDIDELTVCHYLQNIRQLPRNLGLHSKDRL
LINQSPPMPLVTAIFDSFNESGVNSPILSNMLYLSCLSMFSHKKELIPLLFNSISTVSGKVERLISFDIAKRWYLRDIAE
RMYTSESLIKKKLQDENTCFSKILLASRMSMARRLLELRQIPLHTIAEKCGYSSTSYFINTFRQYYGVTPHQFAQHSPGT
FS
>Mature_241_residues
THVCSVILIRRSFDIYHEQQKISLHNESILLLEKNLADDFAFCSPDTRRLDIDELTVCHYLQNIRQLPRNLGLHSKDRLL
INQSPPMPLVTAIFDSFNESGVNSPILSNMLYLSCLSMFSHKKELIPLLFNSISTVSGKVERLISFDIAKRWYLRDIAER
MYTSESLIKKKLQDENTCFSKILLASRMSMARRLLELRQIPLHTIAEKCGYSSTSYFINTFRQYYGVTPHQFAQHSPGTF
S
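The gene sequence and the translated protein in this record are related by the standard genetic code, and the GC content field is a simple base count over the gene sequence. The following is a minimal sketch of both calculations, assuming the standard (NCBI table 1) code and a sequence given 5' to 3' on the coding strand. The helper names `translate` and `gc_percent` are illustrative and do not come from the record or from any library.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered TTT, TTC, TTA, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = "".join(seq.split()).upper()
    if not seq:
        return 0.0
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First and last 18 bases of the gene sequence in this record.
print(translate("ATGACTCATGTCTGCTCG"))  # MTHVCS
print(translate("CCAGGTACCTTTTCCTGA"))  # PGTFS
```

Both fragments reproduce the corresponding ends of the 242-residue protein. Passing the full 729-base sequence to `gc_percent` should give a figure comparable to the GC content field, provided the record uses the same definition.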
Specific function: Depending on the conditions (growth phase and medium), acts as a positive or negative regulator of gadA and gadBC. Repression occurs directly or via the repression of the expression of gadX. Activation occurs directly by the binding of gadW to the gadA an
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 HTH araC/xylS-type DNA-binding domain
Homologues:
- Organism=Escherichia coli, GI1789932, Length=242, Percent_Identity=100, Blast_Score=500, Evalue=1e-143
- Organism=Escherichia coli, GI1786776, Length=241, Percent_Identity=35.6846473029046, Blast_Score=167, Evalue=8e-43
- Organism=Escherichia coli, GI1787776, Length=237, Percent_Identity=34.1772151898734, Blast_Score=131, Evalue=5e-32
- Organism=Escherichia coli, GI1789933, Length=209, Percent_Identity=32.5358851674641, Blast_Score=92, Evalue=2e-20
- Organism=Escherichia coli, GI1790557, Length=133, Percent_Identity=37.593984962406, Blast_Score=87, Evalue=1e-18
- Organism=Escherichia coli, GI1786778, Length=124, Percent_Identity=41.1290322580645, Blast_Score=84, Evalue=1e-17
Paralogues:
None
Copy number: 10-20 Molecules/Cell [C]
Swissprot (AC and ID): GADW_ECO57 (P63202)
Other databases:
- EMBL: AE005174
- EMBL: BA000007
- PIR: C91178
- PIR: D86024
- RefSeq: NP_290095.1
- RefSeq: NP_312422.1
- ProteinModelPortal: P63202
- SMR: P63202
- EnsemblBacteria: EBESCT00000025964
- EnsemblBacteria: EBESCT00000059915
- GeneID: 915748
- GeneID: 961148
- GenomeReviews: AE005174_GR
- GenomeReviews: BA000007_GR
- KEGG: ece:Z4928
- KEGG: ecs:ECs4395
- GeneTree: EBGT00050000008641
- HOGENOM: HBG467631
- OMA: INQLPPM
- ProtClustDB: CLSK880699
- BioCyc: ECOL83334:ECS4395-MONOMER
- GO: GO:0005622
- InterPro: IPR009057
- InterPro: IPR012287
- InterPro: IPR018062
- InterPro: IPR020449
- InterPro: IPR018060
- Gene3D: G3DSA:1.10.10.60
- PRINTS: PR00032
- SMART: SM00342
Pfam domain/function: PF00165 HTH_AraC; SSF46689 Homeodomain_like
EC number: NA
Molecular weight: Translated: 28029; Mature: 27898
Theoretical pI: Translated: 8.78; Mature: 8.78
Prosite motif: PS00041 HTH_ARAC_FAMILY_1; PS01124 HTH_ARAC_FAMILY_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
- 2.5 %Cys (Translated Protein)
- 2.9 %Met (Translated Protein)
- 5.4 %Cys+Met (Translated Protein)
- 2.5 %Cys (Mature Protein)
- 2.5 %Met (Mature Protein)
- 5.0 %Cys+Met (Mature Protein)
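The Cys/Met percentages can be reproduced by counting residues in the protein sequence given in this record. The sketch below assumes that the mature protein is the translated protein with the initiator methionine removed, which matches the 241-residue mature sequence listed under Sequences. The function name `residue_percent` is illustrative.

```python
# Translated protein sequence copied from this record (242 residues).
PROTEIN = (
    "MTHVCSVILIRRSFDIYHEQQKISLHNESILLLEKNLADDFAFCSPDTRRLDIDELTVCHYLQNIRQLPRNLGLHSKDRL"
    "LINQSPPMPLVTAIFDSFNESGVNSPILSNMLYLSCLSMFSHKKELIPLLFNSISTVSGKVERLISFDIAKRWYLRDIAE"
    "RMYTSESLIKKKLQDENTCFSKILLASRMSMARRLLELRQIPLHTIAEKCGYSSTSYFINTFRQYYGVTPHQFAQHSPGT"
    "FS"
)

# Assumption: the mature form only lacks the N-terminal Met.
MATURE = PROTEIN[1:]


def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    return 100.0 * sum(seq.count(r) for r in residues) / len(seq)


for label, seq in (("Translated", PROTEIN), ("Mature", MATURE)):
    cys = residue_percent(seq, "C")
    met = residue_percent(seq, "M")
    both = residue_percent(seq, "CM")
    print(f"{label}: {cys:.1f} %Cys, {met:.1f} %Met, {both:.1f} %Cys+Met")
```

The translated sequence contains 6 Cys and 7 Met residues, so the computed values round to the figures listed above for both forms.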
Secondary structure:
>Translated Secondary Structure
MTHVCSVILIRRSFDIYHEQQKISLHNESILLLEKNLADDFAFCSPDTRRLDIDELTVCH
CCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHCCCCCCCCCCHHHHHHHH
YLQNIRQLPRNLGLHSKDRLLINQSPPMPLVTAIFDSFNESGVNSPILSNMLYLSCLSMF
HHHHHHHHHHHCCCCCCCCEEEECCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHH
SHKKELIPLLFNSISTVSGKVERLISFDIAKRWYLRDIAERMYTSESLIKKKLQDENTCF
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHH
SKILLASRMSMARRLLELRQIPLHTIAEKCGYSSTSYFINTFRQYYGVTPHQFAQHSPGT
HHHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCCHHHHHHHHHHHHCCCHHHHHCCCCCC
FS
CC
>Mature Secondary Structure
THVCSVILIRRSFDIYHEQQKISLHNESILLLEKNLADDFAFCSPDTRRLDIDELTVCH
CHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHCCCCCCCCCCHHHHHHHH
YLQNIRQLPRNLGLHSKDRLLINQSPPMPLVTAIFDSFNESGVNSPILSNMLYLSCLSMF
HHHHHHHHHHHCCCCCCCCEEEECCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHH
SHKKELIPLLFNSISTVSGKVERLISFDIAKRWYLRDIAERMYTSESLIKKKLQDENTCF
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHH
SKILLASRMSMARRLLELRQIPLHTIAEKCGYSSTSYFINTFRQYYGVTPHQFAQHSPGT
HHHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCCHHHHHHHHHHHHCCCHHHHHCCCCCC
FS
CC
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796