Definition Shigella boydii Sb227, complete genome.
Accession NC_007613
Length 4,519,823

The map label for this gene is yidP

Identifier: 82546046

GI number: 82546046

Start: 3702417

End: 3703133

Strand: Reverse

Name: yidP

Synonym: SBO_3687

Alternate gene names: 82546046

Gene position: 3703133-3702417 (Counterclockwise)
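The Start, End, and Strand fields can be reconciled with the 717-base gene sequence by a short calculation. The sketch below assumes 1-based, inclusive coordinates (an assumption, though it agrees with the sequence length listed in this record); the function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


def oriented_position(start: int, end: int, strand: str) -> tuple:
    """Return (5' coordinate, 3' coordinate) of the gene for its strand."""
    low, high = sorted((start, end))
    # On the reverse strand the gene is read from the higher coordinate down.
    return (high, low) if strand.lower() == "reverse" else (low, high)


print(gene_length(3702417, 3703133))                   # 717
print(oriented_position(3702417, 3703133, "Reverse"))  # (3703133, 3702417)
```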

Preceding gene: 82546053

Following gene: 161984828

Centisome position: 81.93
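The centisome value appears to express the gene's position as a percentage of the genome length. The sketch below assumes it is computed from the 5' coordinate of the gene (3703133 for this reverse-strand gene) divided by the genome length of 4,519,823. This formula is an assumption that reproduces the listed value, not a documented definition.

```python
def centisome(position: int, genome_length: int) -> float:
    """Position on the chromosome as a percentage of its total length."""
    return 100.0 * position / genome_length


# Assumed inputs: 5' coordinate of the gene and the genome length from this record.
value = centisome(3703133, 4519823)
print(f"{value:.2f}")  # 81.93
```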

GC content: 51.32
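GC content is conventionally the percentage of G and C bases in a sequence. A minimal sketch of that calculation follows; it could be applied to the gene sequence in this record, but whether the database used exactly this definition (for example, how it treats ambiguous bases) is an assumption.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T bases of a nucleotide sequence."""
    # Keep only unambiguous bases, so line breaks and other symbols are ignored.
    bases = [base for base in sequence.upper() if base in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


print(gc_content("ATGC"))  # 50.0
```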

Gene sequence:

>717_bases
ATGATCTACAAAAGCATTGCGGAGCGGTTAAGAATTCGACTTAATTCCGCGGATTTCACGCTAAACAGCCTTCTTCCCGG
TGAAAAAAAGTTGGCGGAAGAATTTGCAGTATCCCGGATGACCATCCGTAAAGCGATTGACCTGCTGGTAGCGTGGGGGC
TGGTGGTCCGCCGCCACGGCAGCGGCACTTACCTGGTGCGCAAAGATGTGCTGCATCAAACCGCCAGCCTGACCGGACTG
GTGGAGGTGTTAAAACGGCAGGGAAAAACGGTCACCAGCCAGGTGCTGATTTTTGAAATCATGCCTGCGCCTCCGGCCAT
TGCCAGCCAGTTACGGATTCAAATCAACGAGCAGATCTACTTCTCCCGTCGCGTTCGTTTTGTGGAAGGGAAACCGCTGA
TGCTGGAAGACAGCTATATGCCGGTTAAACTGTTCCGTAATCTTTCGCTGCAACATCTGGAAGGGTCGAAGTTTGAATAT
ATTGAACAAGAGTGTGGGATTTTGATTGGCGGTAATTATGAAAGCCTGACGCCGGTGCTCGCCGATAGACTGCTGGCGCG
GCAAATGAAGGTAGCGGAACACACGCCACTGCTGCGGATCACCTCGTTGTCATATAGCGAGAGCGGGGAGTTTTTGAATT
ATTCAGTGATGTTCAGAAATGCCAGCGAATACCAGGTGGAGTACCATTTACGGCGACTCCACCCGGAAAAGAGTTAA

Upstream 100 bases:

>100_bases
CAGCATCGAACACATCTTTAAAAAAAGATGTTTTTTCAATCGATTAAGCAGAACTTGTGGGCGCATTACCCGGGCTTGCA
GGCAAAAAAGAGATCTAGAG

Downstream 100 bases:

>100_bases
CCGATACTCCAGAAGAGCACCGCCAGTAATTGGGGGGTGATAATTCGCAGAAACATCACTAACGGATAGACAGTGGCGTA
AGAGAGCGCCGGCGCACCAC

Product: transcriptional regulator

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 238; Mature: 238
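The residue count follows from the gene length: 717 bases form 239 codons, and the final codon (TAA in the gene sequence of this record) is a stop codon that is not translated. The sketch below assumes a single terminal stop codon and no post-translational cleavage, which is consistent with the translated and mature lengths being equal here.

```python
def protein_length(cds_bases: int) -> int:
    """Number of amino acids encoded by a coding sequence with one stop codon."""
    if cds_bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return cds_bases // 3 - 1  # subtract the untranslated stop codon


print(protein_length(717))  # 238
```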

Protein sequence:

>238_residues
MIYKSIAERLRIRLNSADFTLNSLLPGEKKLAEEFAVSRMTIRKAIDLLVAWGLVVRRHGSGTYLVRKDVLHQTASLTGL
VEVLKRQGKTVTSQVLIFEIMPAPPAIASQLRIQINEQIYFSRRVRFVEGKPLMLEDSYMPVKLFRNLSLQHLEGSKFEY
IEQECGILIGGNYESLTPVLADRLLARQMKVAEHTPLLRITSLSYSESGEFLNYSVMFRNASEYQVEYHLRRLHPEKS

Sequences:

>Translated_238_residues
MIYKSIAERLRIRLNSADFTLNSLLPGEKKLAEEFAVSRMTIRKAIDLLVAWGLVVRRHGSGTYLVRKDVLHQTASLTGL
VEVLKRQGKTVTSQVLIFEIMPAPPAIASQLRIQINEQIYFSRRVRFVEGKPLMLEDSYMPVKLFRNLSLQHLEGSKFEY
IEQECGILIGGNYESLTPVLADRLLARQMKVAEHTPLLRITSLSYSESGEFLNYSVMFRNASEYQVEYHLRRLHPEKS
>Mature_238_residues
MIYKSIAERLRIRLNSADFTLNSLLPGEKKLAEEFAVSRMTIRKAIDLLVAWGLVVRRHGSGTYLVRKDVLHQTASLTGL
VEVLKRQGKTVTSQVLIFEIMPAPPAIASQLRIQINEQIYFSRRVRFVEGKPLMLEDSYMPVKLFRNLSLQHLEGSKFEY
IEQECGILIGGNYESLTPVLADRLLARQMKVAEHTPLLRITSLSYSESGEFLNYSVMFRNASEYQVEYHLRRLHPEKS

Specific function: Unknown

COG id: COG2188

COG function: function code K; Transcriptional regulators

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 HTH gntR-type DNA-binding domain

Homologues:

Organism=Escherichia coli, GI1790118, Length=238, Percent_Identity=100, Blast_Score=486, Evalue=1e-139
Organism=Escherichia coli, GI1786950, Length=224, Percent_Identity=31.25, Blast_Score=130, Evalue=7e-32
Organism=Escherichia coli, GI87082252, Length=223, Percent_Identity=24.66, Blast_Score=94, Evalue=6e-21
Organism=Escherichia coli, GI1790540, Length=231, Percent_Identity=28.57, Blast_Score=74, Evalue=1e-14
Organism=Escherichia coli, GI1788418, Length=148, Percent_Identity=25, Blast_Score=65, Evalue=5e-12
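Each homologue line is a comma-separated list of key=value fields plus a bare GI token. A minimal parsing sketch is shown below; the function name and the choice of a dictionary are illustrative, and the approach assumes organism names contain no commas.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line from this record into a dictionary."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare token such as GI1790118
    # Convert the numeric fields from text.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1790118, Length=238, "
    "Percent_Identity=100, Blast_Score=486, Evalue=1e-139"
)
print(hit["GI"], hit["Length"], hit["Evalue"])
```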

Paralogues:

None

Copy number: 10-20 Molecules/Cell [C]

Swissprot (AC and ID): YIDP_ECOLI (P31453)

Other databases:

- EMBL:   L10328
- EMBL:   U00096
- EMBL:   AP009048
- PIR:   E65170
- RefSeq:   AP_004109.1
- RefSeq:   NP_418139.1
- ProteinModelPortal:   P31453
- SMR:   P31453
- IntAct:   P31453
- STRING:   P31453
- EnsemblBacteria:   EBESCT00000004122
- EnsemblBacteria:   EBESCT00000014715
- GeneID:   948194
- GenomeReviews:   AP009048_GR
- GenomeReviews:   U00096_GR
- KEGG:   ecj:JW3661
- KEGG:   eco:b3684
- EchoBASE:   EB1662
- EcoGene:   EG11711
- eggNOG:   COG2188
- GeneTree:   EBGT00050000008760
- HOGENOM:   HBG297848
- OMA:   SRRVRYV
- ProtClustDB:   CLSK880769
- BioCyc:   EcoCyc:EG11711-MONOMER
- Genevestigator:   P31453
- GO:   GO:0005622
- InterPro:   IPR000524
- InterPro:   IPR011663
- InterPro:   IPR011991
- Gene3D:   G3DSA:1.10.10.10
- PRINTS:   PR00035
- SMART:   SM00345
- SMART:   SM00866

Pfam domain/function: PF00392 GntR; PF07702 UTRA

EC number: NA

Molecular weight: Translated: 27328; Mature: 27328

Theoretical pI: Translated: 9.88; Mature: 9.88

Prosite motif: PS50949 HTH_GNTR

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.4 %Cys     (Translated Protein)
2.9 %Met     (Translated Protein)
3.4 %Cys+Met (Translated Protein)
0.4 %Cys     (Mature Protein)
2.9 %Met     (Mature Protein)
3.4 %Cys+Met (Mature Protein)
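The percentages above can be reproduced by counting residues in the 238-residue protein sequence given in this record. The sketch below is one way to do so; the helper name is illustrative, and rounding to one decimal place is assumed to match the record's presentation.

```python
# Protein sequence copied from this record (238 residues).
PROTEIN = (
    "MIYKSIAERLRIRLNSADFTLNSLLPGEKKLAEEFAVSRMTIRKAIDLLVAWGLVVRRHGSGTYLVRKDVLHQTASLTGL"
    "VEVLKRQGKTVTSQVLIFEIMPAPPAIASQLRIQINEQIYFSRRVRFVEGKPLMLEDSYMPVKLFRNLSLQHLEGSKFEY"
    "IEQECGILIGGNYESLTPVLADRLLARQMKVAEHTPLLRITSLSYSESGEFLNYSVMFRNASEYQVEYHLRRLHPEKS"
)


def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of the sequence made up of any of the given one-letter residues."""
    count = sum(sequence.count(residue) for residue in residues)
    return 100.0 * count / len(sequence)


print(f"{residue_percent(PROTEIN, 'C'):.1f} %Cys")       # 0.4 %Cys
print(f"{residue_percent(PROTEIN, 'M'):.1f} %Met")       # 2.9 %Met
print(f"{residue_percent(PROTEIN, 'CM'):.1f} %Cys+Met")  # 3.4 %Cys+Met
```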

Secondary structure:

>Translated Secondary Structure
MIYKSIAERLRIRLNSADFTLNSLLPGEKKLAEEFAVSRMTIRKAIDLLVAWGLVVRRHG
CCHHHHHHHHHEEECCCCEEHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEECC
SGTYLVRKDVLHQTASLTGLVEVLKRQGKTVTSQVLIFEIMPAPPAIASQLRIQINEQIY
CCEEEEHHHHHHHHHHHHHHHHHHHHCCCCHHHEEEEEEECCCCHHHHHHEEEEECCEEE
FSRRVRFVEGKPLMLEDSYMPVKLFRNLSLQHLEGSKFEYIEQECGILIGGNYESLTPVL
EEHEEEEECCCEEEEECCCCHHHHHHCCCHHHCCCCHHHHHHHHCCEEECCCHHHHHHHH
ADRLLARQMKVAEHTPLLRITSLSYSESGEFLNYSVMFRNASEYQVEYHLRRLHPEKS
HHHHHHHHHHHHHCCCEEEEEECCCCCCCCEEEEEEEEECCCHHHHHHHHHHCCCCCC
>Mature Secondary Structure
MIYKSIAERLRIRLNSADFTLNSLLPGEKKLAEEFAVSRMTIRKAIDLLVAWGLVVRRHG
CCHHHHHHHHHEEECCCCEEHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEECC
SGTYLVRKDVLHQTASLTGLVEVLKRQGKTVTSQVLIFEIMPAPPAIASQLRIQINEQIY
CCEEEEHHHHHHHHHHHHHHHHHHHHCCCCHHHEEEEEEECCCCHHHHHHEEEEECCEEE
FSRRVRFVEGKPLMLEDSYMPVKLFRNLSLQHLEGSKFEYIEQECGILIGGNYESLTPVL
EEHEEEEECCCEEEEECCCCHHHHHHCCCHHHCCCCHHHHHHHHCCEEECCCHHHHHHHH
ADRLLARQMKVAEHTPLLRITSLSYSESGEFLNYSVMFRNASEYQVEYHLRRLHPEKS
HHHHHHHHHHHHHCCCEEEEEECCCCCCCCEEEEEEEEECCCHHHHHHHHHHCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: DNA [C]

Specific reaction: Protein + DNA = Protein-DNA [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 7686882; 9278503