Definition: Escherichia coli O157:H7 str. EC4115, complete genome.
Accession: NC_011353
Length: 5,572,075 bp

The map label for this gene is stfR [H]

Identifier: 209400674

GI number: 209400674

Start: 1212392

End: 1213747

Strand: Direct

Name: stfR [H]

Synonym: ECH74115_1203

Alternate gene names: 209400674

Gene position: 1212392-1213747 (Clockwise)

Preceding gene: 209398994

Following gene: 209397139

Centisome position: 21.76
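
The coordinate fields above can be cross-checked with a little arithmetic. The Python sketch below is illustrative only: the function names are hypothetical, and it assumes (the record does not say so) that the centisome position is the gene start expressed as a percentage of the genome length.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given inclusive 1-based coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of genome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


# Values taken from this record.
print(gene_length(1212392, 1213747))   # 1356, matching the ">1356_bases" header
print(centisome(1212392, 5572075))     # 21.76, matching the Centisome position field
```

Under that assumption the Start, End, genome Length, and Centisome position fields agree with one another.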

GC content: 60.03

Gene sequence:

>1356_bases
ATGACAGCCCGCCGGTTCAGGCGGCTTTTTTGTGGGGTGAATATGGCAGTAAAGATTTCAGGTGTACTGAAAGACGGCAC
AGGAAAACCGGTAGAGAACTGCACCATTCAACTGAAAGCCAGACGGACCAGCAGCACGGTGGTGGTGAACACGGTGGCCT
CTGAAAATCCGGATGAAGCCGGTCGTTACAGCATGGACGTTGAGTACGGTCAGTACAGCGTCATTCTGTTGGTGGAAGGA
TTCCCGCCGTCACATGCCGGGACCATCACCGTGTATGAAGATTCTCAACCGGGGACGCTGAATGATTTTCTCGGTGCCAT
GTCGGAGGATGACGTCCGGCCGGAGGCACTGCGTCGTTTTGAACTGATGGTGGAAGAAGCGGCGCGTCACGCTGAGGAGG
CGAAGAAGAATGCCGGAGAGGCGGAGACGTCCGCGAGGAATGCCGGCATATCAGCCAGTCAGGCAGAAGAGAGCGCGGCA
AATGCTGACACTTCAGCAGGGGATGCATCGGAGTCAGCCCGGCAGGCGGCAGAAAGTGCAGCCGCTGCAAAGCAGTCAGA
GGAGGCGTCCTCGTCCTCGGCTTCTGCGGCCGCTCAAAAAGCCAGTGAGTCATCACAAAGTGCAGCAGAAGCTGAATTGT
CAAGAAAGACGGCAGAAAGTGCAGCCGGTAATGCAGCCAGGGATGCAACGACCGCAACAGAAAAAGCCCGGGAGTCAGCA
GAAAGCGCACAGTCAGCGGAACAAAGCAGGATAGCGGCGGAAGAAGCCGTAAACCGAATCCCCACCGTGGTGGGACCTCC
CGGGCCAAAGGGGGAACCGGGTCCCGCGGGTCCTCAGGGGCCGAAGGGAGATAAAGGAGAGCGTGGCGACACCGGCCCGG
CAGGGGCAACCGGCGAACGGGGACCGGCAGGTGATGCTGGTCCGGCAGGCCCGCAGGGGCCGAAAGGTGACAGGGGAGAG
CGGGGAGAGACCGGTCTGACGGGAAATGCAGGTCCACAGGGTCCAAAGGGAGACACCGGGGCAGCAGGCCCGGCAGGCCC
ACAGGGACCGAAAGGAGAAACAGGTGCGGCTGGCCCGGTGGGGGCAACCGGACCTCAGGGACCGAAGGGCGACCCGGGGG
AGACACAAATCCGTTTTCGTCTGGGGCCGGCGAGCATTATTGAGACAAACAGCCATGGCTGGTTCCCGGGTACAGATGGT
GCGCTCATCACCGGACTGACCTTTCTTGCCCCCAAAGATGCCACACGGGTTCAGGTTTTTTTTCAGCATTTGCAGGTCAG
GTTTGGTGACGGGCCGTGGCAGGATGTTAAGGGGCTGGATGAAGTGGGCAGTGATACAGGCAGAACAGGAGAATGA
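
The GC content field can in principle be recomputed from the gene sequence above. The following Python sketch assumes the reported percentage is the fraction of G and C bases in this 1356-base sequence, which the record does not state explicitly; the function name is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence, to 2 decimals."""
    # Drop line breaks and other whitespace, then normalise case.
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# Toy input; paste the gene sequence lines in place of this string
# to compare against the GC content field.
print(gc_content("ATGC"))
```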

Upstream 100 bases:

>100_bases
TGATATTGCTTATGAAGGCTCCGGCAGTGGCGACTGGCGCACTGACGGCTTCATCGTGGGTGTCGGCTATAAATTCTGAT
TAGCCAGGTAACACAGTGTT

Downstream 100 bases:

>100_bases
CATGAACATATTAAAAAAAATTATGCAGCGTCTGTGCGGTTGCGGAAAGCATGATGACTGTGAACACGGGCAGTCGCTTA
CAGTACAACTGCGACTGGGG

Product: tail fiber protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 451; Mature: 450
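
The two residue counts follow from the gene length. The Python sketch below assumes the 1356 bases include the terminal stop codon (the gene sequence ends in TGA) and that the mature form differs only by removal of the initiator methionine, as the Mature sequence listing suggests; the function name is hypothetical.

```python
from typing import Tuple


def residue_counts(gene_bases: int) -> Tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence."""
    if gene_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    translated = gene_bases // 3 - 1   # the stop codon encodes no residue
    mature = translated - 1            # initiator Met removed (assumption)
    return translated, mature


print(residue_counts(1356))  # (451, 450)
```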

Protein sequence:

>451_residues
MTARRFRRLFCGVNMAVKISGVLKDGTGKPVENCTIQLKARRTSSTVVVNTVASENPDEAGRYSMDVEYGQYSVILLVEG
FPPSHAGTITVYEDSQPGTLNDFLGAMSEDDVRPEALRRFELMVEEAARHAEEAKKNAGEAETSARNAGISASQAEESAA
NADTSAGDASESARQAAESAAAAKQSEEASSSSASAAAQKASESSQSAAEAELSRKTAESAAGNAARDATTATEKARESA
ESAQSAEQSRIAAEEAVNRIPTVVGPPGPKGEPGPAGPQGPKGDKGERGDTGPAGATGERGPAGDAGPAGPQGPKGDRGE
RGETGLTGNAGPQGPKGDTGAAGPAGPQGPKGETGAAGPVGATGPQGPKGDPGETQIRFRLGPASIIETNSHGWFPGTDG
ALITGLTFLAPKDATRVQVFFQHLQVRFGDGPWQDVKGLDEVGSDTGRTGE

Sequences:

>Translated_451_residues
MTARRFRRLFCGVNMAVKISGVLKDGTGKPVENCTIQLKARRTSSTVVVNTVASENPDEAGRYSMDVEYGQYSVILLVEG
FPPSHAGTITVYEDSQPGTLNDFLGAMSEDDVRPEALRRFELMVEEAARHAEEAKKNAGEAETSARNAGISASQAEESAA
NADTSAGDASESARQAAESAAAAKQSEEASSSSASAAAQKASESSQSAAEAELSRKTAESAAGNAARDATTATEKARESA
ESAQSAEQSRIAAEEAVNRIPTVVGPPGPKGEPGPAGPQGPKGDKGERGDTGPAGATGERGPAGDAGPAGPQGPKGDRGE
RGETGLTGNAGPQGPKGDTGAAGPAGPQGPKGETGAAGPVGATGPQGPKGDPGETQIRFRLGPASIIETNSHGWFPGTDG
ALITGLTFLAPKDATRVQVFFQHLQVRFGDGPWQDVKGLDEVGSDTGRTGE
>Mature_450_residues
TARRFRRLFCGVNMAVKISGVLKDGTGKPVENCTIQLKARRTSSTVVVNTVASENPDEAGRYSMDVEYGQYSVILLVEGF
PPSHAGTITVYEDSQPGTLNDFLGAMSEDDVRPEALRRFELMVEEAARHAEEAKKNAGEAETSARNAGISASQAEESAAN
ADTSAGDASESARQAAESAAAAKQSEEASSSSASAAAQKASESSQSAAEAELSRKTAESAAGNAARDATTATEKARESAE
SAQSAEQSRIAAEEAVNRIPTVVGPPGPKGEPGPAGPQGPKGDKGERGDTGPAGATGERGPAGDAGPAGPQGPKGDRGER
GETGLTGNAGPQGPKGDTGAAGPAGPQGPKGETGAAGPVGATGPQGPKGDPGETQIRFRLGPASIIETNSHGWFPGTDGA
LITGLTFLAPKDATRVQVFFQHLQVRFGDGPWQDVKGLDEVGSDTGRTGE

Specific function: Unknown

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the tail fiber family [H]

Homologues:

Organism=Homo sapiens, GI65301115, Length=103, Percent_Identity=51.4563106796116, Blast_Score=84, Evalue=3e-16,
Organism=Homo sapiens, GI56847616, Length=103, Percent_Identity=51.4563106796116, Blast_Score=84, Evalue=4e-16,
Organism=Homo sapiens, GI5803080, Length=110, Percent_Identity=48.1818181818182, Blast_Score=79, Evalue=7e-15,
Organism=Homo sapiens, GI48762934, Length=136, Percent_Identity=47.7941176470588, Blast_Score=79, Evalue=1e-14,
Organism=Homo sapiens, GI55743098, Length=112, Percent_Identity=46.4285714285714, Blast_Score=79, Evalue=1e-14,
Organism=Homo sapiens, GI55743106, Length=112, Percent_Identity=46.4285714285714, Blast_Score=78, Evalue=2e-14,
Organism=Homo sapiens, GI240255535, Length=112, Percent_Identity=46.4285714285714, Blast_Score=78, Evalue=2e-14,
Organism=Homo sapiens, GI115392133, Length=126, Percent_Identity=49.2063492063492, Blast_Score=77, Evalue=3e-14,
Organism=Homo sapiens, GI115527062, Length=110, Percent_Identity=46.3636363636364, Blast_Score=74, Evalue=3e-13,
Organism=Homo sapiens, GI183583553, Length=119, Percent_Identity=39.4957983193277, Blast_Score=73, Evalue=5e-13,
Organism=Homo sapiens, GI115527066, Length=110, Percent_Identity=46.3636363636364, Blast_Score=72, Evalue=1e-12,
Organism=Homo sapiens, GI115527070, Length=88, Percent_Identity=51.1363636363636, Blast_Score=71, Evalue=2e-12,
Organism=Homo sapiens, GI98985806, Length=118, Percent_Identity=47.4576271186441, Blast_Score=69, Evalue=8e-12,
Organism=Homo sapiens, GI98985810, Length=118, Percent_Identity=47.4576271186441, Blast_Score=69, Evalue=8e-12,
Organism=Homo sapiens, GI299523257, Length=118, Percent_Identity=47.4576271186441, Blast_Score=69, Evalue=1e-11,
Organism=Homo sapiens, GI156616290, Length=100, Percent_Identity=43, Blast_Score=68, Evalue=1e-11,
Organism=Homo sapiens, GI299523253, Length=118, Percent_Identity=47.4576271186441, Blast_Score=68, Evalue=2e-11,
Organism=Homo sapiens, GI87196339, Length=164, Percent_Identity=35.9756097560976, Blast_Score=65, Evalue=1e-10,
Organism=Escherichia coli, GI87081892, Length=123, Percent_Identity=88.6178861788618, Blast_Score=240, Evalue=1e-64,
Organism=Caenorhabditis elegans, GI17569903, Length=132, Percent_Identity=46.969696969697, Blast_Score=77, Evalue=2e-14,
Organism=Caenorhabditis elegans, GI17535735, Length=118, Percent_Identity=47.4576271186441, Blast_Score=67, Evalue=2e-11,
Organism=Caenorhabditis elegans, GI17551704, Length=190, Percent_Identity=34.2105263157895, Blast_Score=67, Evalue=2e-11,
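
Each homologue line above is a comma-separated list of key=value fields, apart from the GI number, which has no "=" sign, and each line ends with a trailing comma. The Python sketch below is one possible way to read such a line into a dictionary; the function name and the choice of output keys are illustrative, not part of the record.

```python
from typing import Any, Dict


def parse_homologue(line: str) -> Dict[str, Any]:
    """Parse one 'Organism=..., GI..., Length=..., ...' homologue line."""
    record: Dict[str, Any] = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            # The GI number is written without '=' in this record.
            record["GI"] = field[2:]
    # Convert the numeric fields; names are taken from the lines above.
    casts = {"Length": int, "Percent_Identity": float,
             "Blast_Score": int, "Evalue": float}
    for key, cast in casts.items():
        if key in record:
            record[key] = cast(record[key])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI87081892, Length=123, "
    "Percent_Identity=88.6178861788618, Blast_Score=240, Evalue=1e-64,"
)
print(hit["Organism"], hit["GI"], hit["Blast_Score"])
```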

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR008969
- InterPro:   IPR014766
- InterPro:   IPR011083
- InterPro:   IPR005003
- InterPro:   IPR013609 [H]

Pfam domain/function: PF07484 Collar; PF03335 Phage_fiber; PF08400 phage_tail_N [H]

EC number: NA

Molecular weight: Translated: 45821; Mature: 45690

Theoretical pI: Translated: 4.62; Mature: 4.62

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.4 %Cys     (Translated Protein)
1.1 %Met     (Translated Protein)
1.6 %Cys+Met (Translated Protein)
0.4 %Cys     (Mature Protein)
0.9 %Met     (Mature Protein)
1.3 %Cys+Met (Mature Protein)
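
The percentages above are consistent with 2 Cys and 5 Met in the 451-residue translated sequence (2 Cys and 4 Met in the 450-residue mature form). The combined value appears to be computed from the combined count and then rounded, which is why 1.6 is not simply 0.4 + 1.1. The Python sketch below illustrates the calculation; the function name is hypothetical, and rounding to one decimal place is an assumption based on the values shown.

```python
from typing import Dict


def cys_met_content(protein: str) -> Dict[str, float]:
    """Percent Cys, Met and Cys+Met in a protein sequence (1 decimal)."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    n = len(protein)
    cys = protein.count("C")
    met = protein.count("M")
    return {
        "Cys": round(100.0 * cys / n, 1),
        "Met": round(100.0 * met / n, 1),
        # Combined count is rounded once, not summed from rounded parts.
        "Cys+Met": round(100.0 * (cys + met) / n, 1),
    }


# Toy 10-residue input; substitute the translated or mature sequence.
print(cys_met_content("MCAAAAAAAA"))
```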

Secondary structure:

>Translated Secondary Structure
MTARRFRRLFCGVNMAVKISGVLKDGTGKPVENCTIQLKARRTSSTVVVNTVASENPDEA
CCHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEECCCCCCCCC
GRYSMDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMSEDDVRPEALRRF
CCEEEEEECCCEEEEEEEECCCCCCCCEEEEEECCCCCHHHHHHHCCCCCCCCHHHHHHH
ELMVEEAARHAEEAKKNAGEAETSARNAGISASQAEESAANADTSAGDASESARQAAESA
HHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCHHHHHHHHCCCCCCCCCHHHHHHHHHHHH
AAAKQSEEASSSSASAAAQKASESSQSAAEAELSRKTAESAAGNAARDATTATEKARESA
HHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH
ESAQSAEQSRIAAEEAVNRIPTVVGPPGPKGEPGPAGPQGPKGDKGERGDTGPAGATGER
HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
GPAGDAGPAGPQGPKGDRGERGETGLTGNAGPQGPKGDTGAAGPAGPQGPKGETGAAGPV
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
GATGPQGPKGDPGETQIRFRLGPASIIETNSHGWFPGTDGALITGLTFLAPKDATRVQVF
CCCCCCCCCCCCCCEEEEEEECCCCEEECCCCCCCCCCCCHHHHCCHHCCCCCCHHHHHH
FQHLQVRFGDGPWQDVKGLDEVGSDTGRTGE
HHHHHHHCCCCCCHHHHHHHHHCCCCCCCCC
>Mature Secondary Structure
TARRFRRLFCGVNMAVKISGVLKDGTGKPVENCTIQLKARRTSSTVVVNTVASENPDEA
CHHHHHHHHHCCCEEEEEEEEEECCCCCCCCCCEEEEEEECCCCEEEEEECCCCCCCCC
GRYSMDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMSEDDVRPEALRRF
CCEEEEEECCCEEEEEEEECCCCCCCCEEEEEECCCCCHHHHHHHCCCCCCCCHHHHHHH
ELMVEEAARHAEEAKKNAGEAETSARNAGISASQAEESAANADTSAGDASESARQAAESA
HHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCHHHHHHHHCCCCCCCCCHHHHHHHHHHHH
AAAKQSEEASSSSASAAAQKASESSQSAAEAELSRKTAESAAGNAARDATTATEKARESA
HHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH
ESAQSAEQSRIAAEEAVNRIPTVVGPPGPKGEPGPAGPQGPKGDKGERGDTGPAGATGER
HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
GPAGDAGPAGPQGPKGDRGERGETGLTGNAGPQGPKGDTGAAGPAGPQGPKGETGAAGPV
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
GATGPQGPKGDPGETQIRFRLGPASIIETNSHGWFPGTDGALITGLTFLAPKDATRVQVF
CCCCCCCCCCCCCCEEEEEEECCCCEEECCCCCCCCCCCCHHHHCCHHCCCCCCHHHHHH
FQHLQVRFGDGPWQDVKGLDEVGSDTGRTGE
HHHHHHHCCCCCCHHHHHHHHHCCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9097039; 9278503 [H]