Definition | Shigella boydii Sb227, complete genome. |
---|---|
Accession | NC_007613 |
Length | 4,519,823 |
The map label for this gene is yhiW [H]
Identifier: 82545879
GI number: 82545879
Start: 3509456
End: 3510184
Strand: Reverse
Name: yhiW [H]
Synonym: SBO_3514
Alternate gene names: 82545879
Gene position: 3510184-3509456 (Counterclockwise)
Preceding gene: 82545880
Following gene: 82545874
Centisome position: 77.66
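The coordinate fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates. It also assumes that the centisome position is the gene's first base (the higher coordinate for a reverse-strand gene) expressed as a percentage of the genome length. The record states neither convention, and the function names are illustrative only.

```python
GENOME_LENGTH = 4_519_823  # "Length" field of NC_007613


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length (assumed definition)."""
    return 100.0 * position / genome_length


length = gene_length(3509456, 3510184)
cs = centisome(3510184, GENOME_LENGTH)
print(length)        # 729
print(round(cs, 2))  # 77.66
```

Under these assumptions the results agree with the 729-base gene sequence and the listed centisome position.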
GC content: 41.7
Gene sequence:
>729_bases ATGACTCATGTCTGCTCGGTGATCCTCATTCGTCGTTCATTCGATATTTATCATGAACAGCATAAAATATCGCTGCATAA CGAGAGTATCGTGCTGCTGGAGAAAAATTTGGCAGACGATTTTGCGTTTTGTTCACCGGATACGCGACGACTGGATATCG ATGAGCTGACAGTTTGCCATTACTTACAAAATATTCGTCAGCTACCACGCAATTTAGGGTTACATAGCAAAGACCGTCTA TTAATTAACCAGTCACCCCCCATGCCGCTGGTGACGGCGATTTTTGATAGCTTCAATGAATCCGGGGTAAATTCACCGAT ACTGAGCAATATGCTCTACCTTTCCTGTTTATCGATGTTTTCTCATAAGAAAGAACTGATCCCCTTACTTTTCAATAGCA TCAGCACTGTTTCAGGAAAAGTTGAACGCCTTATTAGCTTTGATATCGCCAAACGTTGGTATCTGCGCGATATCGCGGAA AGAATGTATACCAGCGAGAGTCTCATCAAAAAAAAGTTGCAGGATGAAAATACCTGTTTCAGTAAAATATTACTCGCTTC CAGGATGTCGATGGCCAGACGATTACTCGAGTTACGTCAAATTCCTCTGCATACTATTGCAGAAAAATGTGGCTATAGCA GTACGTCGTACTTTATAAACACATTTCGACAATATTATGGTGTAACGCCACATCAGTTTGCGCAACATTCGCTAGGTACC TTTTCCTGA
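The GC content field can be recomputed from the gene sequence. The helper below is a minimal sketch, not the method the database used: it assumes GC content means the percentage of G and C among the A/C/G/T bases, and it ignores the spaces that separate the sequence chunks in this record. The name `gc_content` is illustrative.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T bases of seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]  # drops spaces and newlines
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


# First 20 bases of the gene sequence, with a chunk separator left in.
print(gc_content("ATGACTCATG TCTGCTCGGT"))  # 50.0
```

Passing the full 729-base sequence to the same function gives a value to compare against the reported GC content.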
Upstream 100 bases:
>100_bases CTACCAAATCTGGCAGTTTTTGCGCTAAGAAACAGTCTGGCAACATTTCATTAGTATACTGAAATTGAAATAATCGCAGT ATGAAATATAAGGGATAATC
Downstream 100 bases:
>100_bases CATATTTTGCATTTGAATATTGGTCAGGATCTCACACCTGCTTCATGTGAAACTCTTCCCTGATGATTTCTGCCGGGCTA CCGGCTAGTTCTCTTTCGCA
Product: ARAC-type regulatory protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 242; Mature: 241
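The two residue counts follow from the 729-base gene length. The sketch below assumes the coding sequence ends in a single stop codon and that the mature form differs only by removal of the initiator methionine, which matches the mature sequence in this record starting at THVC. The function name is illustrative.

```python
def residue_counts(cds_bases: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence."""
    if cds_bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    translated = cds_bases // 3 - 1  # the final codon is the stop codon
    mature = translated - 1          # assumes the initiator Met is removed
    return translated, mature


print(residue_counts(729))  # (242, 241)
```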
Protein sequence:
>242_residues MTHVCSVILIRRSFDIYHEQHKISLHNESIVLLEKNLADDFAFCSPDTRRLDIDELTVCHYLQNIRQLPRNLGLHSKDRL LINQSPPMPLVTAIFDSFNESGVNSPILSNMLYLSCLSMFSHKKELIPLLFNSISTVSGKVERLISFDIAKRWYLRDIAE RMYTSESLIKKKLQDENTCFSKILLASRMSMARRLLELRQIPLHTIAEKCGYSSTSYFINTFRQYYGVTPHQFAQHSLGT FS
Sequences:
>Translated_242_residues MTHVCSVILIRRSFDIYHEQHKISLHNESIVLLEKNLADDFAFCSPDTRRLDIDELTVCHYLQNIRQLPRNLGLHSKDRL LINQSPPMPLVTAIFDSFNESGVNSPILSNMLYLSCLSMFSHKKELIPLLFNSISTVSGKVERLISFDIAKRWYLRDIAE RMYTSESLIKKKLQDENTCFSKILLASRMSMARRLLELRQIPLHTIAEKCGYSSTSYFINTFRQYYGVTPHQFAQHSLGT FS
>Mature_241_residues THVCSVILIRRSFDIYHEQHKISLHNESIVLLEKNLADDFAFCSPDTRRLDIDELTVCHYLQNIRQLPRNLGLHSKDRLL INQSPPMPLVTAIFDSFNESGVNSPILSNMLYLSCLSMFSHKKELIPLLFNSISTVSGKVERLISFDIAKRWYLRDIAER MYTSESLIKKKLQDENTCFSKILLASRMSMARRLLELRQIPLHTIAEKCGYSSTSYFINTFRQYYGVTPHQFAQHSLGTF S
Specific function: Depending on the conditions (growth phase and medium), acts as a positive or negative regulator of gadA and gadBC. Repression occurs directly or via the repression of the expression of gadX. Activation occurs directly by the binding of gadW to the gadA an
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 HTH araC/xylS-type DNA-binding domain [H]
Homologues:
Organism=Escherichia coli, GI1789932, Length=242, Percent_Identity=98.7603305785124, Blast_Score=493, Evalue=1e-141
Organism=Escherichia coli, GI1786776, Length=241, Percent_Identity=35.6846473029046, Blast_Score=167, Evalue=6e-43
Organism=Escherichia coli, GI1787776, Length=237, Percent_Identity=33.7552742616034, Blast_Score=128, Evalue=3e-31
Organism=Escherichia coli, GI1789933, Length=204, Percent_Identity=32.843137254902, Blast_Score=92, Evalue=4e-20
Organism=Escherichia coli, GI1790557, Length=127, Percent_Identity=38.5826771653543, Blast_Score=86, Evalue=2e-18
Organism=Escherichia coli, GI1786778, Length=124, Percent_Identity=41.1290322580645, Blast_Score=84, Evalue=1e-17
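The homologue entries above follow a regular `key=value` pattern, with the GI identifier given as a bare token. The parser below is a hypothetical sketch for turning such text into dictionaries. It is not part of any database tooling; the `GI` key and the rule that each `Organism=` field starts a new hit are assumptions drawn from the layout of this record.

```python
import re


def parse_homologues(text: str) -> list[dict[str, str]]:
    """Split a homologue listing into one dict per hit."""
    hits: list[dict[str, str]] = []
    for field in re.split(r"[,\n]", text):
        field = field.strip()
        if not field:
            continue  # tolerate trailing commas and blank lines
        key, sep, value = field.partition("=")
        if sep and key == "Organism":
            hits.append({"Organism": value})  # each Organism field opens a hit
        elif not hits:
            raise ValueError(f"field before first Organism entry: {field!r}")
        elif sep:
            hits[-1][key] = value
        else:
            hits[-1]["GI"] = field  # bare token, e.g. GI1789932
    return hits


example = (
    "Organism=Escherichia coli, GI1789932, Length=242, "
    "Percent_Identity=98.7603305785124, Blast_Score=493, Evalue=1e-141,"
)
best = parse_homologues(example)[0]
print(best["GI"], float(best["Percent_Identity"]))
```

Values are kept as strings so the caller decides how to convert them, for example `float(best["Evalue"])`.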
Paralogues:
None
Copy number: 10-20 Molecules/Cell [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR009057
- InterPro: IPR012287
- InterPro: IPR018062
- InterPro: IPR020449
- InterPro: IPR018060 [H]
Pfam domain/function: PF00165 HTH_AraC [H]
EC number: NA
Molecular weight: Translated: 28040; Mature: 27909
Theoretical pI: Translated: 8.78; Mature: 8.78
Prosite motif: PS00041 HTH_ARAC_FAMILY_1 ; PS01124 HTH_ARAC_FAMILY_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
2.5 %Cys (Translated Protein)
2.9 %Met (Translated Protein)
5.4 %Cys+Met (Translated Protein)
2.5 %Cys (Mature Protein)
2.5 %Met (Mature Protein)
5.0 %Cys+Met (Mature Protein)
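The Cys/Met percentages can be recomputed from a protein sequence. The sketch below assumes each percentage is the residue count divided by the sequence length, rounded to one decimal place. The rounding rule and the function name are assumptions rather than documented behaviour of the source database.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the given one-letter residues in a protein sequence."""
    seq = "".join(protein.split()).upper()  # drop chunk-separating whitespace
    if not seq:
        raise ValueError("empty protein sequence")
    count = sum(seq.count(r) for r in residues.upper())
    return round(100.0 * count / len(seq), 1)


# First ten residues of the translated protein.
print(residue_percent("MTHVCSVILI", "C"))   # 10.0
print(residue_percent("MTHVCSVILI", "CM"))  # 20.0
```

The listed values are consistent with 6 Cys and 7 Met in the 242-residue translated protein (6/242 ≈ 2.5 %, 7/242 ≈ 2.9 %), and with one Met fewer in the 241-residue mature form.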
Secondary structure:
>Translated Secondary Structure
MTHVCSVILIRRSFDIYHEQHKISLHNESIVLLEKNLADDFAFCSPDTRRLDIDELTVCH
CCHHHHHHHHHHHHHHHHHHHHEEECCCEEEEEECCCCCHHHCCCCCCCCCCHHHHHHHH
YLQNIRQLPRNLGLHSKDRLLINQSPPMPLVTAIFDSFNESGVNSPILSNMLYLSCLSMF
HHHHHHHHHHHCCCCCCCCEEEECCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHH
SHKKELIPLLFNSISTVSGKVERLISFDIAKRWYLRDIAERMYTSESLIKKKLQDENTCF
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHH
SKILLASRMSMARRLLELRQIPLHTIAEKCGYSSTSYFINTFRQYYGVTPHQFAQHSLGT
HHHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCCHHHHHHHHHHHHCCCHHHHHHHCCCC
FS
CC
>Mature Secondary Structure
THVCSVILIRRSFDIYHEQHKISLHNESIVLLEKNLADDFAFCSPDTRRLDIDELTVCH
CHHHHHHHHHHHHHHHHHHHHEEECCCEEEEEECCCCCHHHCCCCCCCCCCHHHHHHHH
YLQNIRQLPRNLGLHSKDRLLINQSPPMPLVTAIFDSFNESGVNSPILSNMLYLSCLSMF
HHHHHHHHHHHCCCCCCCCEEEECCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHH
SHKKELIPLLFNSISTVSGKVERLISFDIAKRWYLRDIAERMYTSESLIKKKLQDENTCF
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHH
SKILLASRMSMARRLLELRQIPLHTIAEKCGYSSTSYFINTFRQYYGVTPHQFAQHSLGT
HHHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCCHHHHHHHHHHHHCCCHHHHHHHCCCC
FS
CC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]