Definition: Shigella boydii Sb227, complete genome.
Accession: NC_007613
Length: 4,519,823 bp

The map label for this gene is yhiW [H]

Identifier: 82545879

GI number: 82545879

Start: 3509456

End: 3510184

Strand: Reverse

Name: yhiW [H]

Synonym: SBO_3514

Alternate gene names: 82545879

Gene position: 3510184-3509456 (Counterclockwise)

Preceding gene: 82545880

Following gene: 82545874

Centisome position: 77.66
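
The coordinate fields above are related by simple arithmetic, sketched here in Python. The function names are illustrative and not part of the record. The centisome formula (gene start as a percentage of genome length, taking the first coordinate of the Gene position field as the start) is an assumption about how the value was derived, although it does reproduce the figure listed.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given inclusive 1-based coordinates."""
    return abs(end - start) + 1


def centisome(gene_start: int, genome_length: int) -> float:
    """Gene start as a percentage of genome length (assumed definition)."""
    return 100.0 * gene_start / genome_length


# Values taken from this record.
length = gene_length(3509456, 3510184)            # 729 bases
position = round(centisome(3510184, 4519823), 2)  # 77.66
```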

GC content: 41.7%

Gene sequence:

>729_bases
ATGACTCATGTCTGCTCGGTGATCCTCATTCGTCGTTCATTCGATATTTATCATGAACAGCATAAAATATCGCTGCATAA
CGAGAGTATCGTGCTGCTGGAGAAAAATTTGGCAGACGATTTTGCGTTTTGTTCACCGGATACGCGACGACTGGATATCG
ATGAGCTGACAGTTTGCCATTACTTACAAAATATTCGTCAGCTACCACGCAATTTAGGGTTACATAGCAAAGACCGTCTA
TTAATTAACCAGTCACCCCCCATGCCGCTGGTGACGGCGATTTTTGATAGCTTCAATGAATCCGGGGTAAATTCACCGAT
ACTGAGCAATATGCTCTACCTTTCCTGTTTATCGATGTTTTCTCATAAGAAAGAACTGATCCCCTTACTTTTCAATAGCA
TCAGCACTGTTTCAGGAAAAGTTGAACGCCTTATTAGCTTTGATATCGCCAAACGTTGGTATCTGCGCGATATCGCGGAA
AGAATGTATACCAGCGAGAGTCTCATCAAAAAAAAGTTGCAGGATGAAAATACCTGTTTCAGTAAAATATTACTCGCTTC
CAGGATGTCGATGGCCAGACGATTACTCGAGTTACGTCAAATTCCTCTGCATACTATTGCAGAAAAATGTGGCTATAGCA
GTACGTCGTACTTTATAAACACATTTCGACAATATTATGGTGTAACGCCACATCAGTTTGCGCAACATTCGCTAGGTACC
TTTTCCTGA
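
A GC content figure such as the one listed can be recomputed from a FASTA-style block like the one above. This is a minimal sketch that assumes GC content means the percentage of G and C bases over the full sequence; both helper names are hypothetical.

```python
def fasta_body(text: str) -> str:
    """Join the sequence lines of a FASTA-style block, skipping '>' headers."""
    lines = (line.strip() for line in text.splitlines())
    return "".join(line for line in lines if line and not line.startswith(">"))


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# Toy example, not the gene itself: two of the four bases are G or C.
toy = fasta_body(">4_bases\nAT\nGC\n")
toy_gc = gc_percent(toy)  # 50.0
```

To check the record, paste the 729-base block into `fasta_body` and compare the rounded result with the GC content field.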

Upstream 100 bases:

>100_bases
CTACCAAATCTGGCAGTTTTTGCGCTAAGAAACAGTCTGGCAACATTTCATTAGTATACTGAAATTGAAATAATCGCAGT
ATGAAATATAAGGGATAATC

Downstream 100 bases:

>100_bases
CATATTTTGCATTTGAATATTGGTCAGGATCTCACACCTGCTTCATGTGAAACTCTTCCCTGATGATTTCTGCCGGGCTA
CCGGCTAGTTCTCTTTCGCA

Product: ARAC-type regulatory protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 242; Mature: 241
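
The two residue counts follow from the 729-base gene length. The sketch below assumes the coding sequence includes one stop codon and that the mature form differs from the translated form only by removal of the initiator methionine, which is consistent with the sequences listed in this record. The function name is illustrative.

```python
def residue_counts(cds_bases: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes the CDS ends with a single stop codon and that the mature
    protein lacks only the initiator methionine.
    """
    if cds_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    translated = cds_bases // 3 - 1  # the stop codon encodes no residue
    mature = translated - 1          # initiator Met removed
    return translated, mature


translated, mature = residue_counts(729)  # (242, 241)
```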

Protein sequence:

>242_residues
MTHVCSVILIRRSFDIYHEQHKISLHNESIVLLEKNLADDFAFCSPDTRRLDIDELTVCHYLQNIRQLPRNLGLHSKDRL
LINQSPPMPLVTAIFDSFNESGVNSPILSNMLYLSCLSMFSHKKELIPLLFNSISTVSGKVERLISFDIAKRWYLRDIAE
RMYTSESLIKKKLQDENTCFSKILLASRMSMARRLLELRQIPLHTIAEKCGYSSTSYFINTFRQYYGVTPHQFAQHSLGT
FS

Sequences:

>Translated_242_residues
MTHVCSVILIRRSFDIYHEQHKISLHNESIVLLEKNLADDFAFCSPDTRRLDIDELTVCHYLQNIRQLPRNLGLHSKDRL
LINQSPPMPLVTAIFDSFNESGVNSPILSNMLYLSCLSMFSHKKELIPLLFNSISTVSGKVERLISFDIAKRWYLRDIAE
RMYTSESLIKKKLQDENTCFSKILLASRMSMARRLLELRQIPLHTIAEKCGYSSTSYFINTFRQYYGVTPHQFAQHSLGT
FS
>Mature_241_residues
THVCSVILIRRSFDIYHEQHKISLHNESIVLLEKNLADDFAFCSPDTRRLDIDELTVCHYLQNIRQLPRNLGLHSKDRLL
INQSPPMPLVTAIFDSFNESGVNSPILSNMLYLSCLSMFSHKKELIPLLFNSISTVSGKVERLISFDIAKRWYLRDIAER
MYTSESLIKKKLQDENTCFSKILLASRMSMARRLLELRQIPLHTIAEKCGYSSTSYFINTFRQYYGVTPHQFAQHSLGTF
S

Specific function: Depending on the conditions (growth phase and medium), acts as a positive or negative regulator of gadA and gadBC. Repression occurs directly or via the repression of the expression of gadX. Activation occurs directly by the binding of gadW to the gadA an

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HTH araC/xylS-type DNA-binding domain [H]

Homologues:

Organism=Escherichia coli, GI1789932, Length=242, Percent_Identity=98.7603305785124, Blast_Score=493, Evalue=1e-141,
Organism=Escherichia coli, GI1786776, Length=241, Percent_Identity=35.6846473029046, Blast_Score=167, Evalue=6e-43,
Organism=Escherichia coli, GI1787776, Length=237, Percent_Identity=33.7552742616034, Blast_Score=128, Evalue=3e-31,
Organism=Escherichia coli, GI1789933, Length=204, Percent_Identity=32.843137254902, Blast_Score=92, Evalue=4e-20,
Organism=Escherichia coli, GI1790557, Length=127, Percent_Identity=38.5826771653543, Blast_Score=86, Evalue=2e-18,
Organism=Escherichia coli, GI1786778, Length=124, Percent_Identity=41.1290322580645, Blast_Score=84, Evalue=1e-17,
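
The homologue lines above use a comma-separated `key=value` layout with a bare `GI…` token. A small parser, sketched below, turns one line into a dictionary so that hits can be filtered or sorted. The function name and the choice of dictionary keys are assumptions made for illustration.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # keep the identifier as text
    # Convert the numeric fields where present.
    for key in ("Length", "Blast_Score"):
        if key in record:
            record[key] = int(record[key])
    for key in ("Percent_Identity", "Evalue"):
        if key in record:
            record[key] = float(record[key])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1789932, Length=242, "
    "Percent_Identity=98.7603305785124, Blast_Score=493, Evalue=1e-141,"
)
```

With records in this form, the best hit can be picked with, for example, `min(hits, key=lambda h: h["Evalue"])`.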

Paralogues:

None

Copy number: 10-20 Molecules/Cell [C]

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR009057
- InterPro:   IPR012287
- InterPro:   IPR018062
- InterPro:   IPR020449
- InterPro:   IPR018060 [H]

Pfam domain/function: PF00165 HTH_AraC [H]

EC number: NA

Molecular weight (Da): Translated: 28040; Mature: 27909

Theoretical pI: Translated: 8.78; Mature: 8.78

Prosite motif: PS00041 HTH_ARAC_FAMILY_1 ; PS01124 HTH_ARAC_FAMILY_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

2.5 %Cys     (Translated Protein)
2.9 %Met     (Translated Protein)
5.4 %Cys+Met (Translated Protein)
2.5 %Cys     (Mature Protein)
2.5 %Met     (Mature Protein)
5.0 %Cys+Met (Mature Protein)
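
The percentages above are residue counts divided by protein length. A minimal sketch of that calculation follows; the function name is hypothetical, and the example uses a toy peptide rather than the protein in this record.

```python
def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    count = sum(seq.count(r) for r in set(residues.upper()))
    return 100.0 * count / len(seq)


# Toy 10-residue peptide with one Cys and one Met.
toy = "MCAAAAAAAA"
cys = residue_percent(toy, "C")       # 10.0
cys_met = residue_percent(toy, "CM")  # 20.0
```

For this record, the translated sequence contains 6 Cys and 7 Met in 242 residues, which rounds to the 2.5, 2.9 and 5.4 percent shown; removing the initiator Met leaves 6 and 6 in 241 residues for the mature values.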

Secondary structure:

>Translated Secondary Structure
MTHVCSVILIRRSFDIYHEQHKISLHNESIVLLEKNLADDFAFCSPDTRRLDIDELTVCH
CCHHHHHHHHHHHHHHHHHHHHEEECCCEEEEEECCCCCHHHCCCCCCCCCCHHHHHHHH
YLQNIRQLPRNLGLHSKDRLLINQSPPMPLVTAIFDSFNESGVNSPILSNMLYLSCLSMF
HHHHHHHHHHHCCCCCCCCEEEECCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHH
SHKKELIPLLFNSISTVSGKVERLISFDIAKRWYLRDIAERMYTSESLIKKKLQDENTCF
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHH
SKILLASRMSMARRLLELRQIPLHTIAEKCGYSSTSYFINTFRQYYGVTPHQFAQHSLGT
HHHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCCHHHHHHHHHHHHCCCHHHHHHHCCCC
FS
CC
>Mature Secondary Structure
THVCSVILIRRSFDIYHEQHKISLHNESIVLLEKNLADDFAFCSPDTRRLDIDELTVCH
CHHHHHHHHHHHHHHHHHHHHEEECCCEEEEEECCCCCHHHCCCCCCCCCCHHHHHHHH
YLQNIRQLPRNLGLHSKDRLLINQSPPMPLVTAIFDSFNESGVNSPILSNMLYLSCLSMF
HHHHHHHHHHHCCCCCCCCEEEECCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHH
SHKKELIPLLFNSISTVSGKVERLISFDIAKRWYLRDIAERMYTSESLIKKKLQDENTCF
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHH
SKILLASRMSMARRLLELRQIPLHTIAEKCGYSSTSYFINTFRQYYGVTPHQFAQHSLGT
HHHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCCHHHHHHHHHHHHCCCHHHHHHHCCCC
FS
CC
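
The state strings above can be summarised as the fraction of residues in each state. The sketch below assumes the conventional reading of the letters (H for helix, E for strand, C for coil), which the record itself does not define, and the function name is illustrative.

```python
from collections import Counter


def state_fractions(states: str) -> dict:
    """Fraction of residues in each predicted state (H, E, C)."""
    counts = Counter(states.strip().upper())
    total = sum(counts.values())
    if total == 0:
        raise ValueError("empty state string")
    return {state: counts.get(state, 0) / total for state in "HEC"}


# Toy example: two helix, one strand, one coil position.
toy = state_fractions("HHEC")
```

To summarise the prediction in this record, concatenate the state lines (the rows made up only of H, E and C) before calling the function.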

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796 [H]