Definition | Escherichia coli O157:H7 str. EC4115, complete genome. |
---|---|
Accession | NC_011353 |
Length | 5,572,075 |
Click here to switch to the map view.
The map label for this gene is yacH [H]
Identifier: 209398273
GI number: 209398273
Start: 133760
End: 135613
Strand: Reverse
Name: yacH [H]
Synonym: ECH74115_0124
Alternate gene names: 209398273
Gene position: 135613-133760 (Counterclockwise)
Preceding gene: 209397177
Following gene: 209399999
Centisome position: 2.43
GC content: 53.99
Gene sequence:
>1854_bases ATGAAAATGACTTTGCCGTTTAAACCCCATGTGCTGGCGCTAATTTGCAGTGCCGGGCTTTGTGCCGCCTCTGCCGGGCT ATATATAAAAAGCCGCACAGTGGAAGCGCCTGTGGAAACGCAATCGACACAACTGGCTGTGTCTGACGCTGCCGCAGTTA CGCTTCCTGCAACGGTTTCCGCACCTCCCGTAACACCCGCCGTCGTCAAATCCGCATTCAGCACTGCACAAATAGATCAA TGGGTCGCGCCCGTCGCGCTGTATCCCGACGCCCTACTTTCGCAGGTGCTGATGGCATCAACCTATCCGACAAACGTTGC TCAAGCAGTGCAATGGTCGCACGATAATCCACTTAAACAAGGCGATGCTGCTATTCAGGCGGTATCTGACCAGCCGTGGG ACGCCAGCGTTAAATCACTGGTGGCCTTTCCACAATTGATGGCATTGATGGGCGAAAACCCGCAATGGGTGCAAAACCTG GGCGATGCTTTTCTGGCCCAGCCGCAGGACGTGATGGACTCGGTACAACGATTGCGGCAACTGGCACAACAAACCGGCTC GCTGAAGTCATCAACCGAACAGAAAGTTATTACCACAACGAAGAAAACTGTACCGGTAACACAGACAGTCACGGCTCCCG TCATACCATCCAATACCGTTTCAACTGCCAACCCTGTCATTACAGAGCCTGCAACAACCGTCATTTCCATTGAGCCCGGC AATCCTGATGTGGTCTATATTCCCAACTACAACCCAACCGTGGTTTACGGGAACTGGGCCAATACTGCGTATCCGCCGGT TTATCTGCCACCACCAGCCGGAGAACCGTTTGTTGACAGCTTTGTACGCGGATTCGGCTATAGCATGGGCGTTGCTACCA CGTACGCACTATTCAGCAGCATCGACTGGGATGACGACGATCATGACCATCATCATCATGACGATGATAATTATCATCAC CACGATGGCGGTCATCGTGACGGTAATGGCTGGCAACATAACGGCGACAACATCAATATCGACGTCAACAATTTCAACCG TATCACCGGTGAGCATCTTACTGATAAGAATATGGCATGGCGGCACAATCCAAACTACCGTAATGGTGTGCCCTATCATG ATCAGGATATGGCAAAGCGGTTTCATCAAACTGATGTCAACGGCGGAATGAGTGCCACGCAGCTACCTGCTCCAACACGC GACAGCCAGCGTCAGGCGGCAGCAAGTCAGTTTCAGCAACGAACACACGCCGCCCCCGTCATTACACGAGATACCCAACG TCAGGCAGCGGCACAGCGGTTTAATGAAGCTGAACACTATGGGAGCTATGACGACTTCCGCGACTTCAGCCGTCGCCAAC CACTGACCCAGCAACAAAAGGACGCCGCTCGTCAGCGTTATCAGTCAGCTTCTCCTGAGCAGCGCCAGGCAGTTCACGAG AAAATGCAGACTAACCCGCAGAACCAGCAGCGAAGAGAGGCAGCGCGTGAGCGCATTCAGCCCGCCTCGCCTGAGCAGCG CCAGGCAGTCCGCGAGAAAATGCAGACTAACCCACAGATCCAGCAGCGAAGAGACGCAGCGCGTGAGCGTATTCAGTCAG CCTCGCCTGAGCAGCGCCAGGTGTTTAAGGAAAAAGTACAGCAGCGCCCACTGAACCAACAGCAACGTGATAACGCCCGC CAGCGTGTTCAATCAGCATCACCTGAACAACGTCAGGTTTTTCGGGAGAAAGCTCAGGAGAGCCGCCCACAACGTCTAAA CGACAGTAACCATACTGCCAGGCTGAATAACGAGCAACGGTCAGCAGTACGCGAACGTCTCTCTGAGCGCGGAGCAAGGC GACTGGAAAGGTAA
Upstream 100 bases:
>100_bases AGATTAGTCAAAATTTAAACTACCGCCTCTTTATACTCGGATTCACAGCACCTGCGGGTGGCAGTTCGCCGACCATTGCG ATTTCCTTGAGATCCGAATT
Downstream 100 bases:
>100_bases ATTACAGGCGTAAAAAAAGCGGCGTGGTTAGCCGCTTTTTTAATTGCCGGATGTTCCGGCAAACGAAAAATTACTTCTTC TTCGCTTTCGGGTTCGGCAG
Product: hypothetical protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 617; Mature: 617
Protein sequence:
>617_residues MKMTLPFKPHVLALICSAGLCAASAGLYIKSRTVEAPVETQSTQLAVSDAAAVTLPATVSAPPVTPAVVKSAFSTAQIDQ WVAPVALYPDALLSQVLMASTYPTNVAQAVQWSHDNPLKQGDAAIQAVSDQPWDASVKSLVAFPQLMALMGENPQWVQNL GDAFLAQPQDVMDSVQRLRQLAQQTGSLKSSTEQKVITTTKKTVPVTQTVTAPVIPSNTVSTANPVITEPATTVISIEPG NPDVVYIPNYNPTVVYGNWANTAYPPVYLPPPAGEPFVDSFVRGFGYSMGVATTYALFSSIDWDDDDHDHHHHDDDNYHH HDGGHRDGNGWQHNGDNINIDVNNFNRITGEHLTDKNMAWRHNPNYRNGVPYHDQDMAKRFHQTDVNGGMSATQLPAPTR DSQRQAAASQFQQRTHAAPVITRDTQRQAAAQRFNEAEHYGSYDDFRDFSRRQPLTQQQKDAARQRYQSASPEQRQAVHE KMQTNPQNQQRREAARERIQPASPEQRQAVREKMQTNPQIQQRRDAARERIQSASPEQRQVFKEKVQQRPLNQQQRDNAR QRVQSASPEQRQVFREKAQESRPQRLNDSNHTARLNNEQRSAVRERLSERGARRLER
Sequences:
>Translated_617_residues MKMTLPFKPHVLALICSAGLCAASAGLYIKSRTVEAPVETQSTQLAVSDAAAVTLPATVSAPPVTPAVVKSAFSTAQIDQ WVAPVALYPDALLSQVLMASTYPTNVAQAVQWSHDNPLKQGDAAIQAVSDQPWDASVKSLVAFPQLMALMGENPQWVQNL GDAFLAQPQDVMDSVQRLRQLAQQTGSLKSSTEQKVITTTKKTVPVTQTVTAPVIPSNTVSTANPVITEPATTVISIEPG NPDVVYIPNYNPTVVYGNWANTAYPPVYLPPPAGEPFVDSFVRGFGYSMGVATTYALFSSIDWDDDDHDHHHHDDDNYHH HDGGHRDGNGWQHNGDNINIDVNNFNRITGEHLTDKNMAWRHNPNYRNGVPYHDQDMAKRFHQTDVNGGMSATQLPAPTR DSQRQAAASQFQQRTHAAPVITRDTQRQAAAQRFNEAEHYGSYDDFRDFSRRQPLTQQQKDAARQRYQSASPEQRQAVHE KMQTNPQNQQRREAARERIQPASPEQRQAVREKMQTNPQIQQRRDAARERIQSASPEQRQVFKEKVQQRPLNQQQRDNAR QRVQSASPEQRQVFREKAQESRPQRLNDSNHTARLNNEQRSAVRERLSERGARRLER >Mature_617_residues MKMTLPFKPHVLALICSAGLCAASAGLYIKSRTVEAPVETQSTQLAVSDAAAVTLPATVSAPPVTPAVVKSAFSTAQIDQ WVAPVALYPDALLSQVLMASTYPTNVAQAVQWSHDNPLKQGDAAIQAVSDQPWDASVKSLVAFPQLMALMGENPQWVQNL GDAFLAQPQDVMDSVQRLRQLAQQTGSLKSSTEQKVITTTKKTVPVTQTVTAPVIPSNTVSTANPVITEPATTVISIEPG NPDVVYIPNYNPTVVYGNWANTAYPPVYLPPPAGEPFVDSFVRGFGYSMGVATTYALFSSIDWDDDDHDHHHHDDDNYHH HDGGHRDGNGWQHNGDNINIDVNNFNRITGEHLTDKNMAWRHNPNYRNGVPYHDQDMAKRFHQTDVNGGMSATQLPAPTR DSQRQAAASQFQQRTHAAPVITRDTQRQAAAQRFNEAEHYGSYDDFRDFSRRQPLTQQQKDAARQRYQSASPEQRQAVHE KMQTNPQNQQRREAARERIQPASPEQRQAVREKMQTNPQIQQRRDAARERIQSASPEQRQVFKEKVQQRPLNQQQRDNAR QRVQSASPEQRQVFREKAQESRPQRLNDSNHTARLNNEQRSAVRERLSERGARRLER
Specific function: Unknown
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
Organism=Escherichia coli, GI1786308, Length=617, Percent_Identity=96.7585089141005, Blast_Score=1171, Evalue=0.0,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR021728 [H]
Pfam domain/function: PF11737 DUF3300 [H]
EC number: NA
Molecular weight: Translated: 69217; Mature: 69217
Theoretical pI: Translated: 8.58; Mature: 8.58
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.3 %Cys (Translated Protein) 1.9 %Met (Translated Protein) 2.3 %Cys+Met (Translated Protein) 0.3 %Cys (Mature Protein) 1.9 %Met (Mature Protein) 2.3 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MKMTLPFKPHVLALICSAGLCAASAGLYIKSRTVEAPVETQSTQLAVSDAAAVTLPATVS CCCCCCCCHHHHHHHHHCCHHHHHCCCEEEECCCCCCCCCCCCEEEECCCCEEEECCCCC APPVTPAVVKSAFSTAQIDQWVAPVALYPDALLSQVLMASTYPTNVAQAVQWSHDNPLKQ CCCCCHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHCCCCHHHHHHHHCCCCCCCHH GDAAIQAVSDQPWDASVKSLVAFPQLMALMGENPQWVQNLGDAFLAQPQDVMDSVQRLRQ HHHHHHHCCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCHHHHHHHHHHHH LAQQTGSLKSSTEQKVITTTKKTVPVTQTVTAPVIPSNTVSTANPVITEPATTVISIEPG HHHHHCCHHCCCCCHHEEECCCCCCCHHCCCCCCCCCCCCCCCCCCEECCCCEEEEEECC NPDVVYIPNYNPTVVYGNWANTAYPPVYLPPPAGEPFVDSFVRGFGYSMGVATTYALFSS CCCEEEECCCCCEEEECCCCCCCCCCCEECCCCCCHHHHHHHHHCCCHHHHHHHHHHHHH IDWDDDDHDHHHHDDDNYHHHDGGHRDGNGWQHNGDNINIDVNNFNRITGEHLTDKNMAW CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCC RHNPNYRNGVPYHDQDMAKRFHQTDVNGGMSATQLPAPTRDSQRQAAASQFQQRTHAAPV CCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCC ITRDTQRQAAAQRFNEAEHYGSYDDFRDFSRRQPLTQQQKDAARQRYQSASPEQRQAVHE EECCHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHH KMQTNPQNQQRREAARERIQPASPEQRQAVREKMQTNPQIQQRRDAARERIQSASPEQRQ HHHCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHH VFKEKVQQRPLNQQQRDNARQRVQSASPEQRQVFREKAQESRPQRLNDSNHTARLNNEQR HHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHH SAVRERLSERGARRLER HHHHHHHHHHHHHHCCC >Mature Secondary Structure MKMTLPFKPHVLALICSAGLCAASAGLYIKSRTVEAPVETQSTQLAVSDAAAVTLPATVS CCCCCCCCHHHHHHHHHCCHHHHHCCCEEEECCCCCCCCCCCCEEEECCCCEEEECCCCC APPVTPAVVKSAFSTAQIDQWVAPVALYPDALLSQVLMASTYPTNVAQAVQWSHDNPLKQ CCCCCHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHCCCCHHHHHHHHCCCCCCCHH GDAAIQAVSDQPWDASVKSLVAFPQLMALMGENPQWVQNLGDAFLAQPQDVMDSVQRLRQ HHHHHHHCCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCHHHHHHHHHHHH LAQQTGSLKSSTEQKVITTTKKTVPVTQTVTAPVIPSNTVSTANPVITEPATTVISIEPG HHHHHCCHHCCCCCHHEEECCCCCCCHHCCCCCCCCCCCCCCCCCCEECCCCEEEEEECC NPDVVYIPNYNPTVVYGNWANTAYPPVYLPPPAGEPFVDSFVRGFGYSMGVATTYALFSS CCCEEEECCCCCEEEECCCCCCCCCCCEECCCCCCHHHHHHHHHCCCHHHHHHHHHHHHH IDWDDDDHDHHHHDDDNYHHHDGGHRDGNGWQHNGDNINIDVNNFNRITGEHLTDKNMAW CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCC RHNPNYRNGVPYHDQDMAKRFHQTDVNGGMSATQLPAPTRDSQRQAAASQFQQRTHAAPV CCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCC ITRDTQRQAAAQRFNEAEHYGSYDDFRDFSRRQPLTQQQKDAARQRYQSASPEQRQAVHE EECCHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHH KMQTNPQNQQRREAARERIQPASPEQRQAVREKMQTNPQIQQRRDAARERIQSASPEQRQ HHHCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHH VFKEKVQQRPLNQQQRDNARQRVQSASPEQRQVFREKAQESRPQRLNDSNHTARLNNEQR HHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHH SAVRERLSERGARRLER HHHHHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 8202364; 9278503 [H]