Definition Shigella boydii Sb227, complete genome.
Accession NC_007613
Length 4,519,823

The map label for this gene is yihW [H]

Identifier: 82546217

GI number: 82546217

Start: 3915982

End: 3916770

Strand: Reverse

Name: yihW [H]

Synonym: SBO_3885

Alternate gene names: 82546217

Gene position: 3916770-3915982 (Counterclockwise)

Preceding gene: 82546218

Following gene: 82546216

Centisome position: 86.66
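
The coordinate fields above are related by simple arithmetic. The following is a minimal Python sketch, not the database's own method: it assumes coordinates are 1-based and inclusive, and that the centisome position is the gene's 5' coordinate expressed as a percentage of genome length (the record does not state either). The function names are illustrative.

```python
GENOME_LENGTH = 4_519_823          # "Length" field of NC_007613
START, END = 3_915_982, 3_916_770  # "Start" and "End" fields


def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene spanning start..end (1-based, inclusive)."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, to two decimals."""
    return round(100 * position / genome_length, 2)


# Assumption: on the reverse strand the gene begins at the higher coordinate.
print(gene_length(START, END))        # 789
print(centisome(END, GENOME_LENGTH))  # 86.66
```

Under these assumptions the sketch reproduces the 789-base gene length and the listed centisome value of 86.66; using the Start coordinate instead gives 86.64.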

GC content: 49.94
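
GC content is conventionally the percentage of G and C bases in a sequence. The sketch below assumes that definition (the record does not say how its value was derived); `gc_content` is an illustrative name.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a DNA sequence, to two decimals."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return round(100 * gc / len(seq), 2)


# First 12 bases of the gene sequence: 2 G and 2 C out of 12.
print(gc_content("ATGAGTATCATC"))  # 33.33
```

If the listed value of 49.94 was computed this way over the 789-base gene, it corresponds to 394 G or C bases (394 / 789 ≈ 49.94%).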

Gene sequence:

>789_bases
ATGAGTATCATCGAAGTGACAGGTAATCCACGCCATGACCAACTCGTTCATCTAATCGCAGAACGCGGTTATATGAATAT
TGAAGAACTGGCACAACTGCTGGATGTTTCAACGCAAACCGTGCGCCGGGATATCCGAAAACTCAGTGAGCAGGGCTTGA
TCACCCGTCATCATGGAGGGGCCGGGCGCGTCTCCAGCGTCATGAATACCGCTTTTGAGCAACGGGAACTTTCGCTCACC
GCCGAGAAACGGGCGATCGCTGAGGCAGTCGCCGATTACCTTCCCGAACGCTGTACCGTCTTTATCACCATCGGTACAAC
CGTAGAAGCCGTTGCCAGGGCATTACTCAACCGGCGTGATTTACGCATTATCACTAACAGCCTACGTGTGGCACAGATTC
TTTATAAGAATCAGGATATTGAAGTGATGGTGCCGGGAGGCACTTTACGCGCTCATAACGGCGGGATTATCGGCCCAGGA
GCCGTGGATTTTATTGAAGGCTTCCGCGCAGATTATTTAATCACCAGTATAGGTGCCATTGAACATGACGGCACCCTACT
GGAATTTGATCTAAATGAAGCGTTAGTCGCGAGAACGATGATTAAACATGCGCGGAATACATTGTTAGTGGCCGATCATA
CGAAGTTTGCCGCGTCTGCTGCCGTTTCAATTGGCAATGCACGGAATGTCAGGGCTTTTTTTACTGATGCCCCGCCTCCC
AATTCTTTCTGCCAGTTGTTAAGTGAAGAGAATGTTGAACTGGTGGTTGCCGAGCAAGAAGTATCCTGA

Upstream 100 bases:

>100_bases
CGGTCGGGCCGGCATTCCTAATCGTGAGCAAACCGAATCATTTTTGTCACTTTATGCGTAAAATGGTAAACATCTACCGA
GGAGGGGCAAGGGAGTATTC

Downstream 100 bases:

>100_bases
AATTGAAATTTCGGAAAATCCGCTGGGAATTTCAGCGGATTTTGTTTTCCATAAGCAAAAAAAACCTGCCAGCGATGGCT
AATGCCGATCAGTTAAGGAT

Product: DEOR-type transcriptional regulator

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 262; Mature: 261

Protein sequence:

>262_residues
MSIIEVTGNPRHDQLVHLIAERGYMNIEELAQLLDVSTQTVRRDIRKLSEQGLITRHHGGAGRVSSVMNTAFEQRELSLT
AEKRAIAEAVADYLPERCTVFITIGTTVEAVARALLNRRDLRIITNSLRVAQILYKNQDIEVMVPGGTLRAHNGGIIGPG
AVDFIEGFRADYLITSIGAIEHDGTLLEFDLNEALVARTMIKHARNTLLVADHTKFAASAAVSIGNARNVRAFFTDAPPP
NSFCQLLSEENVELVVAEQEVS
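
The protein sequence can be checked against the gene sequence by translating it codon by codon. The gene sequence as listed starts with ATG and ends with TGA, so it appears to be given 5' to 3' in coding orientation and is translated here without reverse-complementing. This is a minimal sketch using the standard codon assignments (which NCBI translation table 11 for bacteria shares); the names are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


print(translate("ATGAGTATCATCGAAGTGACAGGTAATCCA"))  # MSIIEVTGNP (first 30 bases)
print(translate("GAGCAAGAAGTATCCTGA"))              # EQEVS* (last 18 bases)
```

Both fragments agree with the start and end of the 262-residue protein, and 789 bases correspond to 262 codons plus one stop codon.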

Sequences:

>Translated_262_residues
MSIIEVTGNPRHDQLVHLIAERGYMNIEELAQLLDVSTQTVRRDIRKLSEQGLITRHHGGAGRVSSVMNTAFEQRELSLT
AEKRAIAEAVADYLPERCTVFITIGTTVEAVARALLNRRDLRIITNSLRVAQILYKNQDIEVMVPGGTLRAHNGGIIGPG
AVDFIEGFRADYLITSIGAIEHDGTLLEFDLNEALVARTMIKHARNTLLVADHTKFAASAAVSIGNARNVRAFFTDAPPP
NSFCQLLSEENVELVVAEQEVS
>Mature_261_residues
SIIEVTGNPRHDQLVHLIAERGYMNIEELAQLLDVSTQTVRRDIRKLSEQGLITRHHGGAGRVSSVMNTAFEQRELSLTA
EKRAIAEAVADYLPERCTVFITIGTTVEAVARALLNRRDLRIITNSLRVAQILYKNQDIEVMVPGGTLRAHNGGIIGPGA
VDFIEGFRADYLITSIGAIEHDGTLLEFDLNEALVARTMIKHARNTLLVADHTKFAASAAVSIGNARNVRAFFTDAPPPN
SFCQLLSEENVELVVAEQEVS

Specific function: Unknown

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HTH deoR-type DNA-binding domain [H]

Homologues:

Organism=Escherichia coli, GI87082344, Length=260, Percent_Identity=71.15, Blast_Score=381, Evalue=1e-107
Organism=Escherichia coli, GI1789829, Length=244, Percent_Identity=46.72, Blast_Score=234, Evalue=5e-63
Organism=Escherichia coli, GI1788069, Length=246, Percent_Identity=26.42, Blast_Score=102, Evalue=2e-23
Organism=Escherichia coli, GI1789059, Length=251, Percent_Identity=26.29, Blast_Score=96, Evalue=3e-21
Organism=Escherichia coli, GI1789519, Length=255, Percent_Identity=26.27, Blast_Score=95, Evalue=5e-21
Organism=Escherichia coli, GI226510968, Length=251, Percent_Identity=25.50, Blast_Score=89, Evalue=4e-19
Organism=Escherichia coli, GI1789170, Length=229, Percent_Identity=27.95, Blast_Score=86, Evalue=3e-18
Organism=Escherichia coli, GI1787540, Length=230, Percent_Identity=24.78, Blast_Score=70, Evalue=2e-13
Organism=Escherichia coli, GI1790753, Length=250, Percent_Identity=22, Blast_Score=67, Evalue=1e-12
Organism=Escherichia coli, GI1790635, Length=252, Percent_Identity=23.41, Blast_Score=64, Evalue=1e-11

Paralogues:

None

Copy number: 10-20 Molecules/Cell [C]

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR014036
- InterPro:   IPR001034
- InterPro:   IPR018356
- InterPro:   IPR011991 [H]

Pfam domain/function: PF00455 DeoR; PF08220 HTH_DeoR [H]

EC number: NA

Molecular weight: Translated: 28767; Mature: 28636

Theoretical pI: Translated: 5.86; Mature: 5.86

Prosite motif: PS00894 HTH_DEOR_1 ; PS51000 HTH_DEOR_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.8 %Cys     (Translated Protein)
1.9 %Met     (Translated Protein)
2.7 %Cys+Met (Translated Protein)
0.8 %Cys     (Mature Protein)
1.5 %Met     (Mature Protein)
2.3 %Cys+Met (Mature Protein)
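
The percentages above can be reproduced by counting residues. This is a minimal sketch, assuming (as the Translated and Mature sequences in this record suggest) that the mature protein is the translated protein without its initiator methionine; `percent` is an illustrative helper.

```python
# Translated protein sequence (262 residues) as listed in this record.
PROTEIN = (
    "MSIIEVTGNPRHDQLVHLIAERGYMNIEELAQLLDVSTQTVRRDIRKLSEQGLITRHHGGAGRVSSVMNTAFEQRELSLT"
    "AEKRAIAEAVADYLPERCTVFITIGTTVEAVARALLNRRDLRIITNSLRVAQILYKNQDIEVMVPGGTLRAHNGGIIGPG"
    "AVDFIEGFRADYLITSIGAIEHDGTLLEFDLNEALVARTMIKHARNTLLVADHTKFAASAAVSIGNARNVRAFFTDAPPP"
    "NSFCQLLSEENVELVVAEQEVS"
)


def percent(sequence: str, residues: str) -> float:
    """Percentage of the sequence made up of the given residue letters."""
    count = sum(sequence.count(r) for r in residues)
    return round(100 * count / len(sequence), 1)


mature = PROTEIN[1:]  # assumption: initiator Met removed
for label, seq in (("Translated", PROTEIN), ("Mature", mature)):
    print(label, percent(seq, "C"), percent(seq, "M"), percent(seq, "CM"))
# Translated 0.8 1.9 2.7
# Mature 0.8 1.5 2.3
```

The translated sequence contains 2 Cys and 5 Met residues, so removing the initiator Met lowers only the Met and Cys+Met figures.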

Secondary structure:

>Translated Secondary Structure
MSIIEVTGNPRHDQLVHLIAERGYMNIEELAQLLDVSTQTVRRDIRKLSEQGLITRHHGG
CEEEEECCCCCHHHHHHHHHHCCCCCHHHHHHHHCCHHHHHHHHHHHHHHCCCEEEECCC
AGRVSSVMNTAFEQRELSLTAEKRAIAEAVADYLPERCTVFITIGTTVEAVARALLNRRD
CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCHHHHHHHHHHCCCC
LRIITNSLRVAQILYKNQDIEVMVPGGTLRAHNGGIIGPGAVDFIEGFRADYLITSIGAI
HHHHHHHHHHHHHHHCCCCEEEEECCCEEEECCCCEECCCHHHHHHHHHHHHHHHHHCCE
EHDGTLLEFDLNEALVARTMIKHARNTLLVADHTKFAASAAVSIGNARNVRAFFTDAPPP
ECCCEEEEEECHHHHHHHHHHHHHCCEEEEECCCHHHHHHHHCCCCCCCEEEEECCCCCC
NSFCQLLSEENVELVVAEQEVS
HHHHHHHCCCCEEEEEEECCCC
>Mature Secondary Structure 
SIIEVTGNPRHDQLVHLIAERGYMNIEELAQLLDVSTQTVRRDIRKLSEQGLITRHHGG
EEEEECCCCCHHHHHHHHHHCCCCCHHHHHHHHCCHHHHHHHHHHHHHHCCCEEEECCC
AGRVSSVMNTAFEQRELSLTAEKRAIAEAVADYLPERCTVFITIGTTVEAVARALLNRRD
CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCHHHHHHHHHHCCCC
LRIITNSLRVAQILYKNQDIEVMVPGGTLRAHNGGIIGPGAVDFIEGFRADYLITSIGAI
HHHHHHHHHHHHHHHCCCCEEEEECCCEEEECCCCEECCCHHHHHHHHHHHHHHHHHCCE
EHDGTLLEFDLNEALVARTMIKHARNTLLVADHTKFAASAAVSIGNARNVRAFFTDAPPP
ECCCEEEEEECHHHHHHHHHHHHHCCEEEEECCCHHHHHHHHCCCCCCCEEEEECCCCCC
NSFCQLLSEENVELVVAEQEVS
HHHHHHHCCCCEEEEEEECCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 8346018; 9278503 [H]