| Definition | Escherichia coli O157:H7 str. EC4115, complete genome. |
|---|---|
| Accession | NC_011353 |
| Length | 5,572,075 |
The map label for this gene is stfR [H]
Identifier: 209395988
GI number: 209395988
Start: 2892536
End: 2893849
Strand: Reverse
Name: stfR [H]
Synonym: ECH74115_3118
Alternate gene names: 209395988
Gene position: 2893849-2892536 (Counterclockwise)
Preceding gene: 209400205
Following gene: 209396050
Centisome position: 51.93
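The coordinate fields above are related by simple arithmetic, sketched here in Python. The helper names (`gene_length`, `centisome`, `reverse_complement`) are illustrative and not part of the database. Taking the centisome position from the higher coordinate (the 5' end of this reverse-strand gene) is an assumption; it happens to reproduce the listed value of 51.93, but the database's own formula is not stated in the record.

```python
GENOME_LENGTH = 5_572_075            # "Length" from the record header
START, END = 2_892_536, 2_893_849    # forward-strand coordinates from the record

def gene_length(start, end):
    """Number of bases in an inclusive, 1-based coordinate range."""
    return end - start + 1

def centisome(position, genome_length):
    """Position expressed as a percentage of the genome length."""
    return 100.0 * position / genome_length

_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

def reverse_complement(seq):
    """Reverse complement of a DNA string; a reverse-strand gene is read this way."""
    return seq.translate(_COMPLEMENT)[::-1]

print(gene_length(START, END))                    # 1314
print(round(centisome(END, GENOME_LENGTH), 2))    # 51.93
```

The computed length of 1314 agrees with the `>1314_bases` label on the gene sequence.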
GC content: 60.05
Gene sequence:
>1314_bases ATGGCAGTAAAGATTTCAGGTGTACTGAAAGACGGCACAGGAAAACCGGTAGAGAACTGCACCATTCAACTGAAAGCCAG ACGTAACAGCGCCACGGTGGTGGTGAACACGGTGGCCTCTGAAAATCCGGATGAAGCCGGTCGTTACAGCATGGACGTTG AGTACGGTCAGTACAGCGTCATTCTGTTGGTGGAGGGCTTCCCGCCGTCACATGCCGGGACCATCACCGTGTATGAAGAT TCTCAACCGGGGACGCTGAATGATTTTCTCGGTGCCATGTCGGAGGATGACGTCCGGCCGGAGGCACTGCGTCGTTTTGA ACTGATGGTGGAAGAAGCGGCGCGTCACGCTGAGGAGGCGAAGAAGAATGCCGGAGAGGCGGAGACATCAGCGAGGAATG CCGGCATATCATCCAGTAAGGCGGAAGCGAGCGCGGCAAATGCTGACACTTCAGCAGGGGATGCATTGGAGTCAGCCCGG CAGGCGGCAGAAAGTGCAGCCGCTGCAAAGCAGTCAGAGGATGCGTCCTCGTCCTCGGCTTCTGCGGCCGCTCAAAAAGC CAGTGAGTCATCACAAAGTGCAGCAGAAGCTGAATTGTCAAGAAAGACGGCAGAAAGTGCAGCCGGTAATGCAGCCAGGG ATGCAACGACCGCAACAGAAAAAGCCCGGGAGTCAGCAGAAAGCGCACAGTCAGCGGAACAAAGCAGGATAGCGGCGGAA GAGGCCGTAAACCGAATCCCCACCGTGGTGGGACCTCCCGGGCCAAAGGGGGAACAGGGGCCCGCGGGTCCTCAGGGGCC GAAGGGTGATAAGGGAGAGCGCGGTGACACCGGCCCTGTCGGGGCAACCGGCGAACGGGGACCGGCAGGTGATGCTGGTC CGGCAGGCCCGCAGGGGCCGAAAGGTGACAGGGGAGAGCGGGGAGAGACCGGTCTGACGGGAAATGCAGGTCCACAGGGT CCAAAGGGAGATACCGGTGCGGCAGGCCCGGCAGGCCCACAGGGACCGAAAGGAGAAACAGGTGCGGCTGGCCCGGTGGG GGCAACCGGACCTCAGGGACCGAAGGGCGACCCGGGGGAGGCACAAATCCGTTTTCGTCTGGGGCCGGCGAGCATTATTG AGACAAACAGCAATGGCTGGTTCCCGGATACAGATGGCGCACTCATCACCGGACTGACCTTTCTTGACCCCAAAGATGCC ACACAGGTTCAGGGGCTGTTTCGGCATTTGCAGGTCAGGTTTGGTGACGGGCCGTGGCAGGATGTTAAGGGGCTGGATGA AGTGGGCAGTGATACAGGCAGAACAGGAGAATGA
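As a rough illustration of how a GC content figure like the one listed above can be derived, the following Python sketch counts G and C bases as a percentage of all bases. The function name `gc_content` is hypothetical, and the sketch assumes a plain A/C/G/T sequence in which embedded spaces (as in the record's 80-base blocks) should be ignored. The example uses only the first 20 bases of the gene sequence, so its result is not the whole-gene value.

```python
def gc_content(sequence):
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)

# First 20 bases of the gene sequence above, with a space to mimic the
# record's block formatting: 8 of the 20 bases are G or C.
prefix = "ATGGCAGTAA AGATTTCAGG"
print(gc_content(prefix))  # 40.0
```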
Upstream 100 bases:
>100_bases GACGGTTTCATCGTGGGTGTCGGTTATAAATTCTGATTAGCCAGGTAACACAGTGTTATGACAGCCCGCCGGTTCAGGCG GGCTTTTTTGTGGGGGGAAT
Downstream 100 bases:
>100_bases CATGAATATACTAAAAAAACTTATGCAGTGTCTGTGTGGTTGCGGAAAGCATGATGGCCGTGAACACGTGCAGTCGCCTA CAGCACAGCTGCGACTGGGA
Product: tail fiber protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 437; Mature: 436
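The two residue counts follow from the gene length. A minimal sketch, assuming the 1314 bases include the terminal stop codon (the gene sequence ends in TGA) and that the mature form differs only by removal of the initiator methionine (the mature sequence in this record starts at the second residue). The function name `protein_lengths` is illustrative.

```python
GENE_BASES = 1314  # from the ">1314_bases" label in the record

def protein_lengths(n_bases):
    """Return (translated, mature) residue counts for a coding sequence."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = n_bases // 3
    translated = codons - 1   # the stop codon encodes no residue
    mature = translated - 1   # initiator Met assumed to be removed
    return translated, mature

print(protein_lengths(GENE_BASES))  # (437, 436)
```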
Protein sequence:
>437_residues MAVKISGVLKDGTGKPVENCTIQLKARRNSATVVVNTVASENPDEAGRYSMDVEYGQYSVILLVEGFPPSHAGTITVYED SQPGTLNDFLGAMSEDDVRPEALRRFELMVEEAARHAEEAKKNAGEAETSARNAGISSSKAEASAANADTSAGDALESAR QAAESAAAAKQSEDASSSSASAAAQKASESSQSAAEAELSRKTAESAAGNAARDATTATEKARESAESAQSAEQSRIAAE EAVNRIPTVVGPPGPKGEQGPAGPQGPKGDKGERGDTGPVGATGERGPAGDAGPAGPQGPKGDRGERGETGLTGNAGPQG PKGDTGAAGPAGPQGPKGETGAAGPVGATGPQGPKGDPGEAQIRFRLGPASIIETNSNGWFPDTDGALITGLTFLDPKDA TQVQGLFRHLQVRFGDGPWQDVKGLDEVGSDTGRTGE
Sequences:
>Translated_437_residues MAVKISGVLKDGTGKPVENCTIQLKARRNSATVVVNTVASENPDEAGRYSMDVEYGQYSVILLVEGFPPSHAGTITVYED SQPGTLNDFLGAMSEDDVRPEALRRFELMVEEAARHAEEAKKNAGEAETSARNAGISSSKAEASAANADTSAGDALESAR QAAESAAAAKQSEDASSSSASAAAQKASESSQSAAEAELSRKTAESAAGNAARDATTATEKARESAESAQSAEQSRIAAE EAVNRIPTVVGPPGPKGEQGPAGPQGPKGDKGERGDTGPVGATGERGPAGDAGPAGPQGPKGDRGERGETGLTGNAGPQG PKGDTGAAGPAGPQGPKGETGAAGPVGATGPQGPKGDPGEAQIRFRLGPASIIETNSNGWFPDTDGALITGLTFLDPKDA TQVQGLFRHLQVRFGDGPWQDVKGLDEVGSDTGRTGE >Mature_436_residues AVKISGVLKDGTGKPVENCTIQLKARRNSATVVVNTVASENPDEAGRYSMDVEYGQYSVILLVEGFPPSHAGTITVYEDS QPGTLNDFLGAMSEDDVRPEALRRFELMVEEAARHAEEAKKNAGEAETSARNAGISSSKAEASAANADTSAGDALESARQ AAESAAAAKQSEDASSSSASAAAQKASESSQSAAEAELSRKTAESAAGNAARDATTATEKARESAESAQSAEQSRIAAEE AVNRIPTVVGPPGPKGEQGPAGPQGPKGDKGERGDTGPVGATGERGPAGDAGPAGPQGPKGDRGERGETGLTGNAGPQGP KGDTGAAGPAGPQGPKGETGAAGPVGATGPQGPKGDPGEAQIRFRLGPASIIETNSNGWFPDTDGALITGLTFLDPKDAT QVQGLFRHLQVRFGDGPWQDVKGLDEVGSDTGRTGE
Specific function: Unknown
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the tail fiber family [H]
Homologues:
Organism=Homo sapiens, GI56847616, Length=153, Percent_Identity=41.8300653594771, Blast_Score=84, Evalue=4e-16
Organism=Homo sapiens, GI65301115, Length=153, Percent_Identity=41.8300653594771, Blast_Score=83, Evalue=6e-16
Organism=Homo sapiens, GI5803080, Length=111, Percent_Identity=47.7477477477478, Blast_Score=79, Evalue=1e-14
Organism=Homo sapiens, GI55743098, Length=112, Percent_Identity=46.4285714285714, Blast_Score=76, Evalue=6e-14
Organism=Homo sapiens, GI55743106, Length=112, Percent_Identity=46.4285714285714, Blast_Score=76, Evalue=8e-14
Organism=Homo sapiens, GI240255535, Length=112, Percent_Identity=46.4285714285714, Blast_Score=75, Evalue=1e-13
Organism=Homo sapiens, GI115392133, Length=112, Percent_Identity=50.8928571428571, Blast_Score=74, Evalue=3e-13
Organism=Homo sapiens, GI115527062, Length=125, Percent_Identity=44, Blast_Score=74, Evalue=3e-13
Organism=Homo sapiens, GI183583553, Length=111, Percent_Identity=42.3423423423423, Blast_Score=72, Evalue=7e-13
Organism=Homo sapiens, GI115527066, Length=125, Percent_Identity=44, Blast_Score=72, Evalue=1e-12
Organism=Homo sapiens, GI48762934, Length=130, Percent_Identity=47.6923076923077, Blast_Score=72, Evalue=1e-12
Organism=Homo sapiens, GI115527070, Length=125, Percent_Identity=44, Blast_Score=71, Evalue=2e-12
Organism=Homo sapiens, GI156616290, Length=101, Percent_Identity=43.5643564356436, Blast_Score=70, Evalue=5e-12
Organism=Homo sapiens, GI98985806, Length=112, Percent_Identity=47.3214285714286, Blast_Score=67, Evalue=4e-11
Organism=Homo sapiens, GI98985810, Length=112, Percent_Identity=47.3214285714286, Blast_Score=66, Evalue=6e-11
Organism=Homo sapiens, GI299523257, Length=112, Percent_Identity=47.3214285714286, Blast_Score=66, Evalue=6e-11
Organism=Homo sapiens, GI4502961, Length=113, Percent_Identity=46.9026548672566, Blast_Score=65, Evalue=1e-10
Organism=Escherichia coli, GI87081892, Length=123, Percent_Identity=89.4308943089431, Blast_Score=243, Evalue=2e-65
Organism=Caenorhabditis elegans, GI17569903, Length=132, Percent_Identity=46.969696969697, Blast_Score=78, Evalue=9e-15
Organism=Caenorhabditis elegans, GI17551704, Length=202, Percent_Identity=33.6633663366337, Blast_Score=68, Evalue=1e-11
Organism=Caenorhabditis elegans, GI17535735, Length=136, Percent_Identity=41.9117647058824, Blast_Score=67, Evalue=1e-11
Organism=Drosophila melanogaster, GI221379525, Length=133, Percent_Identity=39.8496240601504, Blast_Score=67, Evalue=3e-11
Organism=Drosophila melanogaster, GI221379533, Length=133, Percent_Identity=39.8496240601504, Blast_Score=66, Evalue=4e-11
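The homologue entries share one `Organism=…, GI…, Length=…, Percent_Identity=…, Blast_Score=…, Evalue=…` pattern, so they can be pulled into structured records with a regular expression. The following is a minimal sketch that assumes every entry carries all six fields in that order; the names `HIT_PATTERN` and `parse_hits` are illustrative. The sample string contains two entries copied from the list above.

```python
import re

# One homologue entry; the fields are assumed to appear in this fixed order.
HIT_PATTERN = re.compile(
    r"Organism=(?P<organism>[^,]+), GI(?P<gi>\d+), Length=(?P<length>\d+), "
    r"Percent_Identity=(?P<identity>[\d.]+), Blast_Score=(?P<score>\d+), "
    r"Evalue=(?P<evalue>[\deE.+-]+)"
)

def parse_hits(text):
    """Return a list of dicts, one per homologue entry found in text."""
    hits = []
    for match in HIT_PATTERN.finditer(text):
        hits.append({
            "organism": match.group("organism"),
            "gi": match.group("gi"),
            "length": int(match.group("length")),
            "identity": float(match.group("identity")),
            "score": int(match.group("score")),
            "evalue": float(match.group("evalue")),
        })
    return hits

sample = (
    "Organism=Homo sapiens, GI56847616, Length=153, "
    "Percent_Identity=41.8300653594771, Blast_Score=84, Evalue=4e-16, "
    "Organism=Escherichia coli, GI87081892, Length=123, "
    "Percent_Identity=89.4308943089431, Blast_Score=243, Evalue=2e-65, "
)
hits = parse_hits(sample)
best = max(hits, key=lambda hit: hit["score"])
print(best["organism"], best["gi"], best["score"])
```

Sorting or filtering the parsed records by `score` or `evalue` then separates the strong Escherichia coli match from the weaker eukaryotic matches.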
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR008969
- InterPro: IPR014766
- InterPro: IPR011083
- InterPro: IPR005003
- InterPro: IPR013609 [H]
Pfam domain/function: PF07484 Collar; PF03335 Phage_fiber; PF08400 phage_tail_N [H]
EC number: NA
Molecular weight: Translated: 44111; Mature: 43980
Theoretical pI: Translated: 4.44; Mature: 4.44
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.2 %Cys (Translated Protein)
0.9 %Met (Translated Protein)
1.1 %Cys+Met (Translated Protein)
0.2 %Cys (Mature Protein)
0.7 %Met (Mature Protein)
0.9 %Cys+Met (Mature Protein)
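The percentages above are residue counts divided by protein length. A minimal Python sketch of that calculation follows; the function name `residue_percent` is hypothetical, and rounding to one decimal place is an assumption made to match the precision shown in the record. The example uses only the first 20 residues of the translated sequence, so its output differs from the whole-protein values.

```python
def residue_percent(protein, residues):
    """Percentage of positions in protein occupied by any of the given residues."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    count = sum(1 for aa in seq if aa in residues)
    return round(100.0 * count / len(seq), 1)

# First 20 residues of the translated protein: one Met and one Cys.
prefix = "MAVKISGVLKDGTGKPVENC"
print(residue_percent(prefix, "C"))    # 5.0
print(residue_percent(prefix, "M"))    # 5.0
print(residue_percent(prefix, "CM"))   # 10.0
```

Counting the full translated sequence by hand gives 1 Cys and 4 Met in 437 residues (0.2 % and 0.9 %), and 1 Cys and 3 Met in the 436-residue mature form (0.2 % and 0.7 %), which is consistent with the listed values.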
Secondary structure:
>Translated Secondary Structure MAVKISGVLKDGTGKPVENCTIQLKARRNSATVVVNTVASENPDEAGRYSMDVEYGQYSV CEEEEEEEEECCCCCCCCCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEECCCEEE ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMSEDDVRPEALRRFELMVEEAARHAEEA EEEEECCCCCCCCEEEEEECCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHH KKNAGEAETSARNAGISSSKAEASAANADTSAGDALESARQAAESAAAAKQSEDASSSSA HHHCCCHHHHHHHCCCCCCHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHH SAAAQKASESSQSAAEAELSRKTAESAAGNAARDATTATEKARESAESAQSAEQSRIAAE HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH EAVNRIPTVVGPPGPKGEQGPAGPQGPKGDKGERGDTGPVGATGERGPAGDAGPAGPQGP HHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC KGDRGERGETGLTGNAGPQGPKGDTGAAGPAGPQGPKGETGAAGPVGATGPQGPKGDPGE CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC AQIRFRLGPASIIETNSNGWFPDTDGALITGLTFLDPKDATQVQGLFRHLQVRFGDGPWQ EEEEEEECCCEEEEECCCCCCCCCCCCCEECEEEECCCHHHHHHHHHHHHEEEECCCCCH DVKGLDEVGSDTGRTGE HHHHHHHHCCCCCCCCC >Mature Secondary Structure AVKISGVLKDGTGKPVENCTIQLKARRNSATVVVNTVASENPDEAGRYSMDVEYGQYSV EEEEEEEEECCCCCCCCCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEECCCEEE ILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMSEDDVRPEALRRFELMVEEAARHAEEA EEEEECCCCCCCCEEEEEECCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHH KKNAGEAETSARNAGISSSKAEASAANADTSAGDALESARQAAESAAAAKQSEDASSSSA HHHCCCHHHHHHHCCCCCCHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHH SAAAQKASESSQSAAEAELSRKTAESAAGNAARDATTATEKARESAESAQSAEQSRIAAE HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH EAVNRIPTVVGPPGPKGEQGPAGPQGPKGDKGERGDTGPVGATGERGPAGDAGPAGPQGP HHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC KGDRGERGETGLTGNAGPQGPKGDTGAAGPAGPQGPKGETGAAGPVGATGPQGPKGDPGE CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC AQIRFRLGPASIIETNSNGWFPDTDGALITGLTFLDPKDATQVQGLFRHLQVRFGDGPWQ EEEEEEECCCEEEEECCCCCCCCCCCCCEECEEEECCCHHHHHHHHHHHHEEEECCCCCH DVKGLDEVGSDTGRTGE HHHHHHHHCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9097039; 9278503 [H]