Definition Shigella flexneri 2a str. 2457T, complete genome.
Accession NC_004741
Length 4,599,354

The map label for this gene is yidW

Identifier: 30065003

GI number: 30065003

Start: 3896308

End: 3896997

Strand: Reverse

Name: yidW

Synonym: S4002

Alternate gene names: 30065003

Gene position: 3896997-3896308 (Counterclockwise)

Preceding gene: 30065005

Following gene: 30065002

Centisome position: 84.73
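
The length and centisome fields can be recomputed from the coordinates above. This is a minimal sketch, assuming the centisome position is the first base of the gene (the higher coordinate, since the gene is on the reverse strand) expressed as a percentage of the genome length. The function names are illustrative and do not come from the source database.

```python
GENOME_LENGTH = 4_599_354            # NC_004741, from the record header
START, END = 3_896_308, 3_896_997    # inclusive, 1-based coordinates


def gene_length(start: int, end: int) -> int:
    """Number of bases spanned by a pair of inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length (assumed definition)."""
    return round(100.0 * position / genome_length, 2)


print(gene_length(START, END))         # 690
print(centisome(END, GENOME_LENGTH))   # 84.73
```

Under this assumption both values agree with the record: 690 bases and a centisome position of 84.73.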

GC content (%): 55.36

Gene sequence:

>690_bases
ATGACTCTCAATAAAACCGATCGCATTGTCATTACGCTGGGTAAACAGATCGTTCACGGCAAATACGTACCTGGCTCGCC
ACTTCCGGCTGAGGCGGAGCTCTGTGAAGAGTTTGCAACCTCGCGCAACATCATCCGTGAGGTGTTCCGTTCGCTGATGG
CGAAGCGGCTGATTGAAATGAAACGTTATCGCGGGGCGTTTGTGGCACCGCGTAACCAGTGGAATTACCTCGACACTGAC
GTACTGCAATGGGTGCTGGAAAACGACTACGACCCACGGCTTATCAGTGCCATGAGCGAAGTGCGAAATCTGGTGGAACC
GGCGATTGCCCGTTGGGCAGCAGAGCGCGCGACCTCCAGCGATCTGGCGCAGATTGAATCGGCGCTGAACGAGATGATTG
CCAACAATCAGGACCGCGAAGCGTTTAACGAAGCGGATATTCGCTACCACGAGGCGGTGCTGCAGTCGGTGCATAACCCG
GTGTTACAGCAACTTAGCATTGCGATCAGTTCGCTGCAGCGGGCGGTTTTTGAACGAACCTGGATGGGCGATGAGGCCAA
CATGCCGCAAACGCTCCAGGAACATAAGGCGCTGTTCGATGCGATACGGCATCAGGACGGCGATGCGGCAGAGCAGGCGG
CGCTTACCATGATTGCCAGCTCGACACGAAGGTTAAAGGAAATCACATGA
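
The GC content field can be checked against the gene sequence above. This is a minimal sketch, assuming GC content means the percentage of G and C bases over the full sequence length. `gc_content` is an illustrative helper, not a function of the source database.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in seq; whitespace and case are ignored."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# Example call on the first 24 bases of the gene; paste the full
# 690-base sequence to compare with the value reported in the record.
print(gc_content("ATGACTCTCAATAAAACCGATCGC"))
```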

Upstream 100 bases:

>100_bases
CGGCAAACGCGAACGTCATCACGCTGGTACTACAAAGTTGCCGCGTTATGCATCGATCGGGGTAAAGTAGAGAAGAACAT
ACAGAGCACAAGGACTCTCC

Downstream 100 bases:

>100_bases
CAGCTCGCTACATCGCAATTGACTGGGGATCGACCAATCTGCGCGCCTGGCTTTATCAGGGCGACCACTGCCTGGAGAGC
AGGCAATCAGAAGCAGGCGT

Product: regulator protein for dgo operon

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 229; Mature: 228

Protein sequence:

>229_residues
MTLNKTDRIVITLGKQIVHGKYVPGSPLPAEAELCEEFATSRNIIREVFRSLMAKRLIEMKRYRGAFVAPRNQWNYLDTD
VLQWVLENDYDPRLISAMSEVRNLVEPAIARWAAERATSSDLAQIESALNEMIANNQDREAFNEADIRYHEAVLQSVHNP
VLQQLSIAISSLQRAVFERTWMGDEANMPQTLQEHKALFDAIRHQDGDAAEQAALTMIASSTRRLKEIT

Sequences:

>Translated_229_residues
MTLNKTDRIVITLGKQIVHGKYVPGSPLPAEAELCEEFATSRNIIREVFRSLMAKRLIEMKRYRGAFVAPRNQWNYLDTD
VLQWVLENDYDPRLISAMSEVRNLVEPAIARWAAERATSSDLAQIESALNEMIANNQDREAFNEADIRYHEAVLQSVHNP
VLQQLSIAISSLQRAVFERTWMGDEANMPQTLQEHKALFDAIRHQDGDAAEQAALTMIASSTRRLKEIT
>Mature_228_residues
TLNKTDRIVITLGKQIVHGKYVPGSPLPAEAELCEEFATSRNIIREVFRSLMAKRLIEMKRYRGAFVAPRNQWNYLDTDV
LQWVLENDYDPRLISAMSEVRNLVEPAIARWAAERATSSDLAQIESALNEMIANNQDREAFNEADIRYHEAVLQSVHNPV
LQQLSIAISSLQRAVFERTWMGDEANMPQTLQEHKALFDAIRHQDGDAAEQAALTMIASSTRRLKEIT
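
The relation between the gene sequence, the translated protein and the mature protein can be sketched as follows. The sketch assumes the standard amino-acid assignments of the genetic code and that the mature form differs from the translated form only by removal of the initiator methionine, which is consistent with the 229 and 228 residue counts in this record. `translate` and `mature` are illustrative names.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna: str) -> str:
    """Translate a coding sequence up to (not including) the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def mature(protein: str) -> str:
    """Drop the initiator methionine (assumed to be the only processing step)."""
    return protein[1:] if protein.startswith("M") else protein


translated = translate("ATGACTCTCAATAAAACCGATCGC")  # first 8 codons of the gene
print(translated)          # MTLNKTDR
print(mature(translated))  # TLNKTDR
```

The 690-base gene holds 230 codons, the last of which is the TGA stop codon, which gives the 229 translated residues.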

Specific function: Repressor for the dgoRKAT operon. Binds D-galactonate as an inducer.

COG id: COG2186

COG function: function code K; Transcriptional regulators

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HTH gntR-type DNA-binding domain

Homologues:

Organism=Escherichia coli, GI48994955, Length=229, Percent_Identity=100, Blast_Score=473, Evalue=1e-135
Organism=Escherichia coli, GI48994961, Length=213, Percent_Identity=30.99, Blast_Score=108, Evalue=4e-25
Organism=Escherichia coli, GI1789353, Length=228, Percent_Identity=27.19, Blast_Score=69, Evalue=3e-13
Organism=Escherichia coli, GI1790032, Length=212, Percent_Identity=28.30, Blast_Score=67, Evalue=1e-12
Organism=Escherichia coli, GI1790780, Length=234, Percent_Identity=26.50, Blast_Score=65, Evalue=3e-12
Organism=Escherichia coli, GI1787821, Length=222, Percent_Identity=27.93, Blast_Score=64, Evalue=6e-12
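
Homologue lines in this comma-separated `key=value` layout can be loaded into a dictionary for filtering or sorting. This is a minimal sketch. `parse_hit` is an illustrative helper, and the handling of the GI field (which carries no `=` in the record) is an assumption about the format.

```python
def parse_hit(line: str) -> dict:
    """Parse one 'key=value, ...' homologue line into a dict."""
    hit = {}
    for field in line.split(","):
        field = field.strip()
        if not field:
            continue                     # tolerate a trailing comma
        if "=" in field:
            key, value = field.split("=", 1)
            hit[key] = value
        elif field.startswith("GI"):
            hit["GI"] = field[2:]        # GI field has no '=' in the record
    # Convert the numeric fields; other fields stay as strings.
    for key, cast in (("Length", int), ("Percent_Identity", float),
                      ("Blast_Score", int), ("Evalue", float)):
        if key in hit:
            hit[key] = cast(hit[key])
    return hit


line = ("Organism=Escherichia coli, GI48994955, Length=229, "
        "Percent_Identity=100, Blast_Score=473, Evalue=1e-135")
hit = parse_hit(line)
print(hit["GI"], hit["Length"], hit["Percent_Identity"], hit["Evalue"])
```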

Paralogues:

None

Copy number: 10-20 Molecules/Cell [C]

Swissprot (AC and ID): DGOR_ECOLI (P31460)

Other databases:

- EMBL:   L10328
- EMBL:   U00096
- EMBL:   AP009048
- RefSeq:   AP_004098.1
- RefSeq:   YP_026239.1
- ProteinModelPortal:   P31460
- SMR:   P31460
- DIP:   DIP-9435N
- STRING:   P31460
- EnsemblBacteria:   EBESCT00000000471
- EnsemblBacteria:   EBESCT00000015055
- GeneID:   2847767
- GenomeReviews:   AP009048_GR
- GenomeReviews:   U00096_GR
- KEGG:   ecj:JW5627
- KEGG:   eco:b4479
- EchoBASE:   EB1669
- EcoGene:   EG11718
- eggNOG:   COG2186
- GeneTree:   EBGT00050000009601
- HOGENOM:   HBG469528
- OMA:   RRTIEPA
- ProtClustDB:   CLSK894832
- BioCyc:   EcoCyc:G7790-MONOMER
- Genevestigator:   P31460
- GO:   GO:0005622
- InterPro:   IPR011711
- InterPro:   IPR000524
- InterPro:   IPR011991
- Gene3D:   G3DSA:1.10.10.10
- PRINTS:   PR00035
- SMART:   SM00895
- SMART:   SM00345

Pfam domain/function: PF07729 FCD; PF00392 GntR

EC number: NA

Molecular weight (Da): Translated: 26080; Mature: 25949

Theoretical pI: Translated: 5.39; Mature: 5.39

Prosite motif: PS50949 HTH_GNTR

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.4 %Cys     (Translated Protein)
3.5 %Met     (Translated Protein)
3.9 %Cys+Met (Translated Protein)
0.4 %Cys     (Mature Protein)
3.1 %Met     (Mature Protein)
3.5 %Cys+Met (Mature Protein)
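
The Cys/Met percentages above can be reproduced by counting residues in the protein sequence of this record. This is a minimal sketch, assuming each percentage is the residue count divided by the sequence length, rounded to one decimal place, and that the mature protein is the translated protein without its first residue. `percent` is an illustrative helper.

```python
# Translated protein sequence, copied from this record (229 residues).
PROTEIN = (
    "MTLNKTDRIVITLGKQIVHGKYVPGSPLPAEAELCEEFATSRNIIREVFRSLMAKRLIEMKRYRGAFVAPRNQWNYLDTD"
    "VLQWVLENDYDPRLISAMSEVRNLVEPAIARWAAERATSSDLAQIESALNEMIANNQDREAFNEADIRYHEAVLQSVHNP"
    "VLQQLSIAISSLQRAVFERTWMGDEANMPQTLQEHKALFDAIRHQDGDAAEQAALTMIASSTRRLKEIT"
)


def percent(seq: str, residues: str) -> float:
    """Percentage of seq made up of any of the given one-letter residues."""
    count = sum(seq.count(r) for r in residues)
    return round(100.0 * count / len(seq), 1)


for label, seq in (("Translated", PROTEIN), ("Mature", PROTEIN[1:])):
    print(label, percent(seq, "C"), percent(seq, "M"), percent(seq, "CM"))
# Translated 0.4 3.5 3.9
# Mature 0.4 3.1 3.5
```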

Secondary structure:

>Translated Secondary Structure
MTLNKTDRIVITLGKQIVHGKYVPGSPLPAEAELCEEFATSRNIIREVFRSLMAKRLIEM
CCCCCCCEEEEEECHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
KRYRGAFVAPRNQWNYLDTDVLQWVLENDYDPRLISAMSEVRNLVEPAIARWAAERATSS
HHHCCCCCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCHH
DLAQIESALNEMIANNQDREAFNEADIRYHEAVLQSVHNPVLQQLSIAISSLQRAVFERT
HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
WMGDEANMPQTLQEHKALFDAIRHQDGDAAEQAALTMIASSTRRLKEIT
HCCCCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCC
>Mature Secondary Structure 
TLNKTDRIVITLGKQIVHGKYVPGSPLPAEAELCEEFATSRNIIREVFRSLMAKRLIEM
CCCCCCEEEEEECHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
KRYRGAFVAPRNQWNYLDTDVLQWVLENDYDPRLISAMSEVRNLVEPAIARWAAERATSS
HHHCCCCCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCHH
DLAQIESALNEMIANNQDREAFNEADIRYHEAVLQSVHNPVLQQLSIAISSLQRAVFERT
HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
WMGDEANMPQTLQEHKALFDAIRHQDGDAAEQAALTMIASSTRRLKEIT
HCCCCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCC
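
The predicted states above can be summarised as a composition. This is a minimal sketch, assuming the usual three-state reading of the letters (H = helix, E = strand, C = coil), which the record itself does not spell out. `composition` is an illustrative helper.

```python
from collections import Counter


def composition(states: str) -> dict:
    """Percentage of each predicted state (H, E, C) in a prediction string."""
    states = "".join(states.split()).upper()
    if not states:
        raise ValueError("empty prediction string")
    counts = Counter(states)
    return {s: round(100.0 * counts.get(s, 0) / len(states), 1) for s in "HEC"}


# Example call on the first prediction line of the translated protein;
# concatenate all four prediction lines to summarise the whole protein.
first_line = "CCCCCCCEEEEEECHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH"
print(composition(first_line))
```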

PDB accession: NA

Resolution: NA

Structure class: Alpha

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References (PubMed IDs): 7686882; 9278503