Definition Escherichia coli O157:H7 str. EC4115, complete genome.
Accession NC_011353
Length 5,572,075

The map label for this gene is ytfM

Identifier: 209398307

GI number: 209398307

Start: 5373045

End: 5374778

Strand: Direct

Name: ytfM

Synonym: ECH74115_5738

Alternate gene names: 209398307

Gene position: 5373045-5374778 (Clockwise)

Preceding gene: 209398724

Following gene: 209399752

Centisome position: 96.43
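
The coordinate fields above are related by simple arithmetic. The following is a minimal Python sketch, assuming inclusive 1-based coordinates and assuming that the centisome position is the start coordinate expressed as a percentage of the genome length. The record does not state its formula, and the function names are illustrative only.

```python
GENOME_LENGTH = 5_572_075          # Length field of NC_011353
START, END = 5_373_045, 5_374_778  # Start and End fields for ytfM


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming inclusive 1-based coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the genome (assumed definition)."""
    return round(100 * start / genome_length, 2)


print(gene_length(START, END))          # 1734
print(centisome(START, GENOME_LENGTH))  # 96.43
```

Under these assumptions the results agree with the 1734-base gene sequence and the listed centisome position of 96.43.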

GC content: 53.4

Gene sequence:

>1734_bases
GTGCGCTATATCCGACAGTTATGCTGTGTAAGCTTACTCTGCTTAAGCGGATCTGCCGTCGCCGCGAACGTCCGTCTACA
GGTCGAGGGGTTATCGGGACAGCTGGAAAAGAACGTTCGTGCGCAGCTTTCTACGATTGAAAGTGATGAAGTGACGCCAG
ACCGTCGCTTTCGCGCACGCGTCGATGATGCTATCCGCGAAGGTCTGAAAGCGCTGGGTTATTACCAGCCGACCATTGAA
TTTGATCTCCGTCCACCGCCAAAGAAAGGGCGGCAGGTATTGATCGCCAAAGTCACGCCAGGCGTGCCGGTGTTAATTGG
CGGCACCGATGTGGTATTGCGCGGGGGCGCGCGGACCGATAAAGACTATTTGAAATTGCTCGATACTCGCCCGGCTATTG
GCACGGTACTGAACCAGGGCGATTATGAAAATTTCAAAAAGTCCTTAACCAGCATTGCGTTGCGTAAAGGTTATTTCGAT
AGCGAATTTACCAAAGCGCAGCTGGGCATTGCGCTCGGCCTGCATAAAGCCTTCTGGGATATTGATTATAACAGTGGCGA
ACGTTACCGCTTTGGGCATGTGACCTTTGAAGGATCACAAATCCGCGATGAATACCTGCAAAATCTGGTGCCGTTTAAAG
AGGGCGATGAGTACGAATCGAAAGATCTGGCAGAACTGAACCGCCGACTTTCTGCTACCGGCTGGTTTAACTCGGTGGTG
GTGGCTCCACAATTTGATAAAGCGCGCGAAACGAAAGTATTACCATTGACGGGCGTGGTTTCGCCGCGAACAGAAAACAC
TATCGAAACCGGGGTCGGTTACTCTACGGACGTGGGACCGCGCGTGAAAGCGACGTGGAAAAAACCGTGGATGAACTCAT
ACGGTCACAGTCTGACCACCAGTACCAGTATTTCCGCGCCGGAACAGACCCTCGACTTCAGCTATAAAATGCCGCTGCTG
AAGAATCCACTGGAACAATATTATTTGGTGCAGGGCGGTTTTAAGCGCACTGACCTGAACGATACCGAGTCTGACTCCAC
TACGCTGGTGGCTTCTCGCTACTGGGATCTCTCCAGCGGCTGGCAGCGTGCCATTAACCTGCGCTGGAGTCTCGACCACT
TTACTCAGGGTGAAATTACCAACACCACGATGCTGTTTTATCCTGGGGTGATGATTAGCCGCACGCGTTCTCGTGGTGGC
CTGATGCCAACCTGGGGCGACTCGCAACGCTACTCTATCGACTACTCCAACACGGCCTGGGGTTCAGATGTCGATTTCTC
CGTTTTCCAGGCGCAGAACGTCTGGATCCGCACACTGTACGATCGCCATCGTTTTGTGACACGCGGCACGCTGGGCTGGA
TTGAAACCGGTGATTTCGACAAAGTACCGCCGGATCTGCGTTTCTTCGCCGGGGGCGATCGCAGTATTCGCGGCTACAAA
TACAAATCTATCGCCCCGAAATACGCTAACGGTGACCTGAAAGGGGCCTCGAAGTTGATAACCGGATCGCTGGAATACCA
GTACAACGTGACCGGAAAATGGTGGGGCGCGGTGTTTGTCGATAGTGGCGAAGCGGTAAGCGATATTCGCCGCAGCGACT
TTAAAACCGGTACCGGGGTCGGCGTACGCTGGGAATCGCCGGTCGGGCCAATCAAACTCGATTTTGCCGTACCGGTCGCG
GATAAAGACGAACACGGGTTACAGTTTTACATCGGTCTGGGGCCAGAATTATGA
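
The GC content field can be recomputed from a sequence such as the one above. This is a small Python sketch, assuming GC content means the percentage of G and C bases over the full sequence length, rounded to one decimal place; `gc_percent` is a hypothetical helper name, not part of the record.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, rounded to one decimal place."""
    seq = "".join(seq.split()).upper()  # drop FASTA line breaks and spaces
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return round(100 * gc / len(seq), 1)


# First 20 bases of the gene sequence above, as a small check.
print(gc_percent("GTGCGCTATATCCGACAGTT"))  # 50.0
```

Passing the complete 1734-base sequence to the same function gives a value to compare with the GC content field.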

Upstream 100 bases:

>100_bases
ATTTGGGGTTTAGTCTGCTTTTTAATCCATATTACTGGATTTTTGTTAAGCCGTTTAACGGCGTTCCAGGGGCAGGAAAA
AAGGATATTCAGGAGAAAAT

Downstream 100 bases:

>100_bases
GTTTATGGAAAAAAATCAGCCTCGGCGTGGTTATCGTTATCTTACTGTTGCTGGGATCGGTGGCGTTTCTGGTGGGCACC
ACCAGCGGCCTGCATCTGGT

Product: outer membrane protein, OMP85 family

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 577; Mature: 577

Protein sequence:

>577_residues
MRYIRQLCCVSLLCLSGSAVAANVRLQVEGLSGQLEKNVRAQLSTIESDEVTPDRRFRARVDDAIREGLKALGYYQPTIE
FDLRPPPKKGRQVLIAKVTPGVPVLIGGTDVVLRGGARTDKDYLKLLDTRPAIGTVLNQGDYENFKKSLTSIALRKGYFD
SEFTKAQLGIALGLHKAFWDIDYNSGERYRFGHVTFEGSQIRDEYLQNLVPFKEGDEYESKDLAELNRRLSATGWFNSVV
VAPQFDKARETKVLPLTGVVSPRTENTIETGVGYSTDVGPRVKATWKKPWMNSYGHSLTTSTSISAPEQTLDFSYKMPLL
KNPLEQYYLVQGGFKRTDLNDTESDSTTLVASRYWDLSSGWQRAINLRWSLDHFTQGEITNTTMLFYPGVMISRTRSRGG
LMPTWGDSQRYSIDYSNTAWGSDVDFSVFQAQNVWIRTLYDRHRFVTRGTLGWIETGDFDKVPPDLRFFAGGDRSIRGYK
YKSIAPKYANGDLKGASKLITGSLEYQYNVTGKWWGAVFVDSGEAVSDIRRSDFKTGTGVGVRWESPVGPIKLDFAVPVA
DKDEHGLQFYIGLGPEL
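
The 1734-base gene sequence and the 577-residue protein are consistent: 1734 / 3 = 578 codons, of which the last is a stop codon. The sketch below illustrates the translation in Python, assuming the standard genetic code and assuming that the initial GTG codon is read as Met, as is usual for alternative start codons in bacteria. `translate_cds` is an illustrative name.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, one letter per codon in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(cds: str) -> str:
    """Translate a coding sequence and drop a trailing stop codon."""
    cds = "".join(cds.split()).upper()
    if not cds or len(cds) % 3:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    residues = [CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)]
    residues[0] = "M"  # assumption: the start codon (here GTG) codes for Met
    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein


# First four and last three codons of the gene sequence, joined for brevity.
print(translate_cds("GTGCGCTATATC" + "GAATTATGA"))  # MRYIEL
```

The example output matches the first residues (MRYI) and the last two residues (EL) of the protein sequence above.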

Sequences:

>Translated_577_residues
MRYIRQLCCVSLLCLSGSAVAANVRLQVEGLSGQLEKNVRAQLSTIESDEVTPDRRFRARVDDAIREGLKALGYYQPTIE
FDLRPPPKKGRQVLIAKVTPGVPVLIGGTDVVLRGGARTDKDYLKLLDTRPAIGTVLNQGDYENFKKSLTSIALRKGYFD
SEFTKAQLGIALGLHKAFWDIDYNSGERYRFGHVTFEGSQIRDEYLQNLVPFKEGDEYESKDLAELNRRLSATGWFNSVV
VAPQFDKARETKVLPLTGVVSPRTENTIETGVGYSTDVGPRVKATWKKPWMNSYGHSLTTSTSISAPEQTLDFSYKMPLL
KNPLEQYYLVQGGFKRTDLNDTESDSTTLVASRYWDLSSGWQRAINLRWSLDHFTQGEITNTTMLFYPGVMISRTRSRGG
LMPTWGDSQRYSIDYSNTAWGSDVDFSVFQAQNVWIRTLYDRHRFVTRGTLGWIETGDFDKVPPDLRFFAGGDRSIRGYK
YKSIAPKYANGDLKGASKLITGSLEYQYNVTGKWWGAVFVDSGEAVSDIRRSDFKTGTGVGVRWESPVGPIKLDFAVPVA
DKDEHGLQFYIGLGPEL
>Mature_577_residues
MRYIRQLCCVSLLCLSGSAVAANVRLQVEGLSGQLEKNVRAQLSTIESDEVTPDRRFRARVDDAIREGLKALGYYQPTIE
FDLRPPPKKGRQVLIAKVTPGVPVLIGGTDVVLRGGARTDKDYLKLLDTRPAIGTVLNQGDYENFKKSLTSIALRKGYFD
SEFTKAQLGIALGLHKAFWDIDYNSGERYRFGHVTFEGSQIRDEYLQNLVPFKEGDEYESKDLAELNRRLSATGWFNSVV
VAPQFDKARETKVLPLTGVVSPRTENTIETGVGYSTDVGPRVKATWKKPWMNSYGHSLTTSTSISAPEQTLDFSYKMPLL
KNPLEQYYLVQGGFKRTDLNDTESDSTTLVASRYWDLSSGWQRAINLRWSLDHFTQGEITNTTMLFYPGVMISRTRSRGG
LMPTWGDSQRYSIDYSNTAWGSDVDFSVFQAQNVWIRTLYDRHRFVTRGTLGWIETGDFDKVPPDLRFFAGGDRSIRGYK
YKSIAPKYANGDLKGASKLITGSLEYQYNVTGKWWGAVFVDSGEAVSDIRRSDFKTGTGVGVRWESPVGPIKLDFAVPVA
DKDEHGLQFYIGLGPEL

Specific function: Unknown

COG id: COG0729

COG function: function code M; Outer membrane protein

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: To H.influenzae HI_0698

Homologues:

Organism=Escherichia coli, GI1790666, Length=577, Percent_Identity=100, Blast_Score=1182, Evalue=0.0

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): YTFM_ECO57 (P0ADE5)

Other databases:

- EMBL:   AE005174
- EMBL:   BA000007
- PIR:   F91278
- RefSeq:   NP_290852.1
- RefSeq:   NP_313225.1
- ProteinModelPortal:   P0ADE5
- SMR:   P0ADE5
- EnsemblBacteria:   EBESCT00000024159
- EnsemblBacteria:   EBESCT00000058261
- GeneID:   913933
- GeneID:   959881
- GenomeReviews:   AE005174_GR
- GenomeReviews:   BA000007_GR
- KEGG:   ece:Z5831
- KEGG:   ecs:ECs5198
- GeneTree:   EBGT00050000011675
- HOGENOM:   HBG401978
- OMA:   GNLGWIE
- ProtClustDB:   CLSK880872
- BioCyc:   ECOL83334:ECS5198-MONOMER
- InterPro:   IPR000184
- InterPro:   IPR010827
- PANTHER:   PTHR12815

Pfam domain/function: PF01103 Bac_surface_Ag; PF07244 Surf_Ag_VNR

EC number: NA

Molecular weight: Translated: 64797; Mature: 64797

Theoretical pI: Translated: 8.94; Mature: 8.94

Prosite motif: PS00213 LIPOCALIN

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.5 %Cys     (Translated Protein)
1.0 %Met     (Translated Protein)
1.6 %Cys+Met (Translated Protein)
0.5 %Cys     (Mature Protein)
1.0 %Met     (Mature Protein)
1.6 %Cys+Met (Mature Protein)
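
The three percentages are each rounded separately, which is why 0.5 and 1.0 can appear next to a combined value of 1.6. The Python sketch below assumes the percentages are residue counts divided by sequence length, rounded to one decimal place. The counts of 3 Cys and 6 Met used in the example were read off the 577-residue sequence by inspection and should be rechecked. The helper names are illustrative.

```python
def percent(count: int, total: int) -> float:
    """Share of `count` in `total`, as a percentage with one decimal place."""
    return round(100 * count / total, 1)


def cys_met_content(protein: str) -> dict:
    """Cys, Met and combined Cys+Met percentages for a protein sequence."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    cys, met = protein.count("C"), protein.count("M")
    total = len(protein)
    return {
        "Cys": percent(cys, total),
        "Met": percent(met, total),
        "Cys+Met": percent(cys + met, total),
    }


# Assumed counts for the 577-residue protein: 3 Cys, 6 Met.
print(percent(3, 577), percent(6, 577), percent(9, 577))  # 0.5 1.0 1.6
```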

Secondary structure:

>Translated Secondary Structure
MRYIRQLCCVSLLCLSGSAVAANVRLQVEGLSGQLEKNVRAQLSTIESDEVTPDRRFRAR
CHHHHHHHHHHHHHHCCCEEEEEEEEEEECCCCCHHHHHHHHHHHCCCCCCCCCHHHHHH
VDDAIREGLKALGYYQPTIEFDLRPPPKKGRQVLIAKVTPGVPVLIGGTDVVLRGGARTD
HHHHHHHHHHHHCCCCCEEEECCCCCCCCCCEEEEEEECCCCEEEECCCEEEEECCCCCC
KDYLKLLDTRPAIGTVLNQGDYENFKKSLTSIALRKGYFDSEFTKAQLGIALGLHKAFWD
HHHHHHHHCCCHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHEEEEHHEEE
IDYNSGERYRFGHVTFEGSQIRDEYLQNLVPFKEGDEYESKDLAELNRRLSATGWFNSVV
EECCCCCEEEEEEEEECCHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHCCCCCCCEEE
VAPQFDKARETKVLPLTGVVSPRTENTIETGVGYSTDVGPRVKATWKKPWMNSYGHSLTT
EECCCCCCCCCEEEEEECCCCCCCCCCHHHCCCCCCCCCCCEEEEECCCCHHHCCCCEEC
STSISAPEQTLDFSYKMPLLKNPLEQYYLVQGGFKRTDLNDTESDSTTLVASRYWDLSSG
CCCCCCCHHHCCCEECCCHHHCHHHHEEEECCCCEECCCCCCCCCCEEEEEEECCCCCCC
WQRAINLRWSLDHFTQGEITNTTMLFYPGVMISRTRSRGGLMPTWGDSQRYSIDYSNTAW
CHHEEEEEEECCCCCCCCCCCEEEEEECCEEEEECCCCCCCCCCCCCCCEEEEECCCCCC
GSDVDFSVFQAQNVWIRTLYDRHRFVTRGTLGWIETGDFDKVPPDLRFFAGGDRSIRGYK
CCCCCEEEEEECCEEEEEHHHHHHEEECCCCCCEECCCCCCCCCCCEEEECCCCCCCCEE
YKSIAPKYANGDLKGASKLITGSLEYQYNVTGKWWGAVFVDSGEAVSDIRRSDFKTGTGV
ECCCCCCCCCCCCCCCHHEEEEEEEEEEECCEEEEEEEEEECCCHHHHHHHHCCCCCCCC
GVRWESPVGPIKLDFAVPVADKDEHGLQFYIGLGPEL
CEEECCCCCCEEEEEEECCCCCCCCCEEEEEECCCCC
>Mature Secondary Structure
MRYIRQLCCVSLLCLSGSAVAANVRLQVEGLSGQLEKNVRAQLSTIESDEVTPDRRFRAR
CHHHHHHHHHHHHHHCCCEEEEEEEEEEECCCCCHHHHHHHHHHHCCCCCCCCCHHHHHH
VDDAIREGLKALGYYQPTIEFDLRPPPKKGRQVLIAKVTPGVPVLIGGTDVVLRGGARTD
HHHHHHHHHHHHCCCCCEEEECCCCCCCCCCEEEEEEECCCCEEEECCCEEEEECCCCCC
KDYLKLLDTRPAIGTVLNQGDYENFKKSLTSIALRKGYFDSEFTKAQLGIALGLHKAFWD
HHHHHHHHCCCHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHEEEEHHEEE
IDYNSGERYRFGHVTFEGSQIRDEYLQNLVPFKEGDEYESKDLAELNRRLSATGWFNSVV
EECCCCCEEEEEEEEECCHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHCCCCCCCEEE
VAPQFDKARETKVLPLTGVVSPRTENTIETGVGYSTDVGPRVKATWKKPWMNSYGHSLTT
EECCCCCCCCCEEEEEECCCCCCCCCCHHHCCCCCCCCCCCEEEEECCCCHHHCCCCEEC
STSISAPEQTLDFSYKMPLLKNPLEQYYLVQGGFKRTDLNDTESDSTTLVASRYWDLSSG
CCCCCCCHHHCCCEECCCHHHCHHHHEEEECCCCEECCCCCCCCCCEEEEEEECCCCCCC
WQRAINLRWSLDHFTQGEITNTTMLFYPGVMISRTRSRGGLMPTWGDSQRYSIDYSNTAW
CHHEEEEEEECCCCCCCCCCCEEEEEECCEEEEECCCCCCCCCCCCCCCEEEEECCCCCC
GSDVDFSVFQAQNVWIRTLYDRHRFVTRGTLGWIETGDFDKVPPDLRFFAGGDRSIRGYK
CCCCCEEEEEECCEEEEEHHHHHHEEECCCCCCEECCCCCCCCCCCEEEECCCCCCCCEE
YKSIAPKYANGDLKGASKLITGSLEYQYNVTGKWWGAVFVDSGEAVSDIRRSDFKTGTGV
ECCCCCCCCCCCCCCCHHEEEEEEEEEEECCEEEEEEEEEECCCHHHHHHHHCCCCCCCC
GVRWESPVGPIKLDFAVPVADKDEHGLQFYIGLGPEL
CEEECCCCCCEEEEEEECCCCCCCCCEEEEEECCCCC
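
Each residue line in the listing above is followed by a line of per-residue states. The sketch below summarizes such a state string in Python, assuming the common convention that H is helix, E is strand and C is coil. The record does not define the letters, and `ss_composition` is a hypothetical helper.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of H, E and C states, rounded to one decimal place."""
    states = "".join(states.split()).upper()
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    unknown = set(counts) - set("HEC")
    if unknown:
        raise ValueError(f"unexpected state letters: {sorted(unknown)}")
    return {s: round(100 * counts[s] / len(states), 1) for s in "HEC"}


# Toy state string, not taken from the record.
print(ss_composition("HHEECCCC"))  # {'H': 25.0, 'E': 25.0, 'C': 50.0}
```

Concatenating the state lines of the listing and passing them to this function gives a helix and strand breakdown to set against the Structure class field.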

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796