Definition: Escherichia coli O157:H7 str. EC4115, complete genome.
Accession: NC_011353
Length: 5,572,075 bp


The map label for this gene is flgJ

Identifier: 209396103

GI number: 209396103

Start: 1440064

End: 1441005

Strand: Direct

Name: flgJ

Synonym: ECH74115_1460

Alternate gene names: 209396103

Gene position: 1440064-1441005 (Clockwise)

Preceding gene: 209398849

Following gene: 209396607

Centisome position: 25.84
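
The gene length and centisome position follow from the coordinates above. The sketch below is a minimal illustration, assuming that coordinates are 1-based and inclusive and that the centisome position is the start coordinate expressed as a percentage of the genome length. The record does not state its formula, so that definition is an assumption; the constant names are illustrative.

```python
# Values copied from the record above.
GENOME_LENGTH = 5_572_075              # bp, chromosome NC_011353
START, END = 1_440_064, 1_441_005      # gene coordinates, direct strand

# Inclusive coordinates: both the first and the last base are counted.
gene_length = END - START + 1

# Assumed definition: start position as a percentage of the genome length.
centisome = 100 * START / GENOME_LENGTH

print(gene_length)           # length of the gene in bases
print(round(centisome, 2))   # centisome position, two decimals
```

Under these assumptions the results agree with the 942-base gene sequence and the centisome value reported in this record.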

GC content: 54.78

Gene sequence:

>942_bases
ATGATCAGCGACAGCAAACTACTGGCAAGTGCGGCCTGGGATGCGCAATCACTCAACGAACTAAAGGCGAAAGCGAGCGA
AGATCCGGCGGCAAATATCCGTCCGGTGGCCCGTCAGGTGGAAGGGATGTTCGTGCAGATGATGTTGAAAAGCATGCGCG
ACGCGTTACCAAAAGATGGCCTGTTCAGCAGCGAGCACACTCGCCTGTATACCAGTATGTATGACCAGCAGATTGCCCAA
CAGATGACGACGGGCAAAGGTCTGGGGCTTGCAGAGATGATGGTTAAGCAGATGACGCCAGAACAACCATTGCCAGAGGA
GTCCACGCCAGCAGCACCGATGAAATTCCCGCTCGAAACCGTGGTGCGTTATCAAAATCAGGCGCTTTCGCAGCTGGTGC
AAAAGGCCGTGCCACGTAACTACGATGATTCGCTGCCAGGTGACAGTAAAGCATTCCTCGCGCAACTCTCGCTGCCCGCC
CAACTGGCAAGCCAGCAAAGCGGTGTGCCACATCATTTGATCCTCGCTCAGGCGGCGCTGGAATCTGGCTGGGGGCAACG
GCAAATCCGCCGCGAAAACGGCGAGCCGAGCTATAACCTGTTTGGTGTCAAAGCCTCTGGCAACTGGAAAGGGCCAGTCA
CTGAAATCACCACGACTGAATATGAAAATGGCGAAGCGAAGAAAGTAAAAGCGAAGTTTCGGGTCTACAGCTCGTATCTG
GAAGCATTGTCGGATTACGTTGGGCTGTTAACACGTAACCCGCGCTACGCCGCCGTGACGACCGCCGCGAGTGCGGAGCA
GGGGGCGCAGGCCCTACAGGACGCGGGCTATGCCACCGATCCTCACTATGCCCGTAAACTCACCAACATGATTCAGCAGA
TGAAATCGATAAGCGACAAGGTGAGCAAAACCTACAGCATGAACATTGATAATCTGTTCTGA
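
The GC content field can be checked against the gene sequence with a short function. This is a sketch only, assuming GC content means the percentage of G and C bases over all bases in the sequence; the function name `gc_percent` is illustrative and not part of the database.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence."""
    # Drop line breaks and spaces so a wrapped FASTA body can be pasted in.
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100 * gc / len(seq)


# Toy input; paste the 942-base gene sequence here to check the record.
print(gc_percent("ATGC"))
```

If the database uses the same definition, passing the full gene sequence should give a value close to the reported 54.78.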

Upstream 100 bases:

>100_bases
GCGCGCTCAATGCGCTGGGCGCTACGCCGATGGATCTGATGTCTATTTTGCAATCAATGCAAAGTGCGGGATGTCTGCGG
GCAAAACTGGAAATCATCTG

Downstream 100 bases:

>100_bases
ATAACTCAAGTCCGGCGGGTCGCTGCCGATAATACTCTGTAATTGAAGGCTTATAAGGAACCTCCATGTCCAGCTTGATC
AATAACGCCATGAGCGGACT

Product: flagellar rod assembly protein/muramidase FlgJ

Products: NA

Alternate protein names: Muramidase flgJ

Number of amino acids: Translated: 313; Mature: 313

Protein sequence:

>313_residues
MISDSKLLASAAWDAQSLNELKAKASEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDGLFSSEHTRLYTSMYDQQIAQ
QMTTGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLETVVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPA
QLASQQSGVPHHLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL
EALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDKVSKTYSMNIDNLF
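
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is one way to do that, assuming the standard codon assignments (which the bacterial genetic code shares for elongation) and translation in frame 1 up to the first stop codon. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":          # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGATCAGCGACAGCAAACTACTGGCAAGT"))  # MISDSKLLAS
```

The 942 bases form 314 codons; the last one (TGA) is a stop codon, which accounts for the 313 residues reported.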

Sequences:

>Translated_313_residues
MISDSKLLASAAWDAQSLNELKAKASEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDGLFSSEHTRLYTSMYDQQIAQ
QMTTGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLETVVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPA
QLASQQSGVPHHLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL
EALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDKVSKTYSMNIDNLF
>Mature_313_residues
MISDSKLLASAAWDAQSLNELKAKASEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDGLFSSEHTRLYTSMYDQQIAQ
QMTTGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLETVVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPA
QLASQQSGVPHHLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL
EALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDKVSKTYSMNIDNLF

Specific function: Flagellum-specific muramidase that hydrolyzes the peptidoglycan layer so that the rod structure can assemble in the periplasmic space.

COG id: COG3951

COG function: function code MNO; Rod binding protein

Gene ontology:

Cell location: Periplasm

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: In the C-terminal section; belongs to the glycosyl hydrolase 73 family

Homologues:

Organism=Escherichia coli, GI1787321, Length=313, Percent_Identity=99.3610223642172, Blast_Score=642, Evalue=0.0,
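
The homologue entry is a comma-separated list of `key=value` fields, so it can be split into a dictionary. The sketch below is illustrative; it also converts the percent identity into a residue count, assuming the identity was computed over the full 313-residue length (the record does not state the alignment length).

```python
line = ("Organism=Escherichia coli, GI1787321, Length=313, "
        "Percent_Identity=99.3610223642172, Blast_Score=642, Evalue=0.0,")

fields = {}
for part in line.split(","):
    part = part.strip()
    if "=" in part:                       # skip bare tokens such as the GI
        key, value = part.split("=", 1)
        fields[key] = value

# Assumption: identity is measured over the full reported length.
identical = round(float(fields["Percent_Identity"]) / 100 * int(fields["Length"]))

print(fields["Organism"], identical, "of", fields["Length"], "residues identical")
```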

Paralogues:

None

Copy number: 10-20 (rich media) [C]

Swissprot (AC and ID): FLGJ_ECO57 (P58231)

Other databases:

- EMBL:   AE005174
- EMBL:   BA000007
- PIR:   C90811
- PIR:   G85670
- RefSeq:   NP_287215.1
- RefSeq:   NP_309486.1
- ProteinModelPortal:   P58231
- SMR:   P58231
- EnsemblBacteria:   EBESCT00000028491
- EnsemblBacteria:   EBESCT00000055275
- GeneID:   912388
- GeneID:   959424
- GenomeReviews:   AE005174_GR
- GenomeReviews:   BA000007_GR
- KEGG:   ece:Z1719
- KEGG:   ecs:ECs1459
- GeneTree:   EBGT00050000011202
- HOGENOM:   HBG336388
- OMA:   SMRDANA
- ProtClustDB:   PRK05684
- BioCyc:   ECOL83334:ECS1459-MONOMER
- InterPro:   IPR013377
- InterPro:   IPR000423
- InterPro:   IPR019301
- InterPro:   IPR013338
- InterPro:   IPR002901
- PRINTS:   PR01002
- SMART:   SM00047
- TIGRFAMs:   TIGR02541

Pfam domain/function: PF01832 Glucosaminidase; PF10135 Rod-binding

EC number: 3.2.1.-

Molecular weight: Translated: 34536; Mature: 34536

Theoretical pI: Translated: 8.60; Mature: 8.60

Prosite motif: NA

Important sites: ACT_SITE 220-220 ACT_SITE 245-245
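
The residues at the annotated active sites can be looked up directly in the protein sequence. This is a minimal sketch, assuming the site numbers are 1-based positions in the 313-residue sequence given in this record; the variable names are illustrative.

```python
# Protein sequence copied from this record (313 residues).
PROTEIN = (
    "MISDSKLLASAAWDAQSLNELKAKASEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDGLFSSEHTRLYTSMYDQQIAQ"
    "QMTTGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLETVVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPA"
    "QLASQQSGVPHHLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL"
    "EALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDKVSKTYSMNIDNLF"
)

ACTIVE_SITES = [220, 245]  # ACT_SITE positions from the record

for pos in ACTIVE_SITES:
    # Convert the 1-based site number to a 0-based string index.
    print(pos, PROTEIN[pos - 1])
```

Under the 1-based assumption, both positions fall in the C-terminal section, which is the part of the protein the record assigns to the glycosyl hydrolase 73 family.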

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.0 %Cys     (Translated Protein)
4.5 %Met     (Translated Protein)
4.5 %Cys+Met (Translated Protein)
0.0 %Cys     (Mature Protein)
4.5 %Met     (Mature Protein)
4.5 %Cys+Met (Mature Protein)
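
The Cys/Met figures can be recomputed from the protein sequence. The sketch below assumes each percentage is the count of the residue divided by the total number of residues; the helper name `percent` is illustrative.

```python
# Protein sequence copied from this record (translated and mature are identical).
PROTEIN = (
    "MISDSKLLASAAWDAQSLNELKAKASEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDGLFSSEHTRLYTSMYDQQIAQ"
    "QMTTGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLETVVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPA"
    "QLASQQSGVPHHLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL"
    "EALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDKVSKTYSMNIDNLF"
)


def percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    return 100 * sum(seq.count(r) for r in residues) / len(seq)


print(f"{percent(PROTEIN, 'C'):.1f} %Cys")
print(f"{percent(PROTEIN, 'M'):.1f} %Met")
print(f"{percent(PROTEIN, 'CM'):.1f} %Cys+Met")
```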

Secondary structure:

>Translated Secondary Structure
MISDSKLLASAAWDAQSLNELKAKASEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG
CCCCHHHHHHHHCCHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCC
LFSSEHTRLYTSMYDQQIAQQMTTGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLET
CCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHH
VVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQAAL
HHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHCCHHHHHHHCCCCCHHHHHHHHHH
ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL
HCCCCHHHHHHCCCCCCCEEEEEEECCCCCCCCCEEECCCCCCCCHHHHHHHHHHHHHHH
EALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK
HHHHHHHHHHCCCCCEEEEEEHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHH
VSKTYSMNIDNLF
HHHHHCCCCCCCC
>Mature Secondary Structure
MISDSKLLASAAWDAQSLNELKAKASEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG
CCCCHHHHHHHHCCHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCC
LFSSEHTRLYTSMYDQQIAQQMTTGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLET
CCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHH
VVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQAAL
HHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHCCHHHHHHHCCCCCHHHHHHHHHH
ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL
HCCCCHHHHHHCCCCCCCEEEEEEECCCCCCCCCEEECCCCCCCCHHHHHHHHHHHHHHH
EALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK
HHHHHHHHHHCCCCCEEEEEEHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHH
VSKTYSMNIDNLF
HHHHHCCCCCCCC
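
The predicted secondary structure can be summarized as the share of each state. The sketch below assumes the conventional three-state code (H = helix, E = strand, C = coil); the record gives no legend, so that mapping is an assumption, and the function name `ss_composition` is illustrative.

```python
from collections import Counter


def ss_composition(ss: str) -> dict:
    """Return the percentage of H, E and C states in a prediction string."""
    ss = "".join(ss.split()).upper()   # tolerate wrapped or spaced input
    if not ss:
        raise ValueError("empty prediction string")
    counts = Counter(ss)
    return {state: 100 * counts.get(state, 0) / len(ss) for state in "HEC"}


# First 12 states of the prediction above; pass the concatenated state
# lines (without the residue lines) to summarize the whole protein.
print(ss_composition("CCCCHHHHHHHH"))
```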

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796