| Definition | Escherichia coli O157:H7 str. EC4115, complete genome. |
|---|---|
| Accession | NC_011353 |
| Length | 5,572,075 |
Click here to switch to the map view.
The map label for this gene is yejH [H]
Identifier: 209397941
GI number: 209397941
Start: 1322397
End: 1324757
Strand: Direct
Name: yejH [H]
Synonym: ECH74115_1306
Alternate gene names: 209397941
Gene position: 1322397-1324757 (Clockwise)
Preceding gene: 209400222
Following gene: 209396712
Centisome position: 23.73
GC content: 48.28
Gene sequence:
>2361_bases ATGGTTCATAAATCTGACAGTGATGAATTAGCTGCATTGAGGGCAGAAAATGCCCGTCTGGTCTCATTACTTGAAGCTCA TGGGATTGAATGGCGACGTAAACCGCAGACTCCTGTGCAGCGCGTTTCTGTATTATCCACCGACGAGAAGGTTGCATTAT TTCGTCGGTTGTTTCGTGGGCGTGATGATGTATGGGCACTTAGATGGGAAAGTAAAACCAGCGGCAAATCAGGGTACTCT CCTGCCTGCGCTAACGAATGGCAGGCGAGAATATGCGGTAAACCCCGGATAAAATGTGGGGACTGCGCTCACCGTCAGTT GATTCCTGTATCCGATCTCGTCATCTACCACCATCTGGCCGGTACCCATACTGTCGGGATGTATCCGTTGCTGGAAGATG ATTCCTGTTATTTTCTGGCAGTTGATTTTGATGAAGCTGAATGGCAGAAGGATGCATCCGCATTCATGCGATCCTGTGAT GAGCTGGGTGTACCTGCTGCGCTGGAAATATCCCGTTCACGTCAGGGGGCGCATGTCTGGATATTTTTTGCCTCACGAGT TTCGGCCCGCGAAGCTCGCCGTCTGGGGACTGCTATTATTAGCTATACGTGTAGTCGGACCCGACAGCTGCGATTGGGGT CTTATGACCGATTATTTCCTAATCAGGATACTATGCCAAAAGGGGGATTTGGTAATCTCATAGCGTTACCTCTGCAAAAA AGACCGCGTGAATTGGGGGGAAGCGTTTTTGTTGATATGAATCTCCAGCCTTATCCTGATCAGTGGGCTTTTCTTGTATC GGTGACCCCGATGAATGTGCAGGATATTGAACCGACGATATTACGGGCTACAGGGAGTATCCATCCTCTGGATGTGAATT TTATCAACGAAGAAGACCTGGGTACGCCGTGGGAAGAGAAAAAATCATCAGGAAACAGACTGAATATTTCTATTGCAGAA CCGCTGAAAATCACGCTGGCAAACCAGATCTATTTCGAAAAAGCGCAATTACCTCAGGTGCTGATTAACCGACTTATTCG GCTGGCAGCATTTCCGAACCCTGAGTTTTATAAGGCTCAGGCAATGCGTATGTCAGTCTGGAATAAGCCCCGTGTTATAG GTTGTGCGGAGAATTACCCGCAACACATTGCGTTGCCCCGGGGATGTCTGGACAGCGTATTATCTTTCCTTAGGGACAAC AATATTGCTGCAGAATTAATCGATAAACGATTTGCCGGGACGGAATGTAATGCCGTTTTTATGGGAAACCTCAGAGCGGA GCAGGAAGAGGCCGTTTCGGCATTACTCCGTTATGACACTGGTGTGCTTTGTGCGCCAACGGCTTTTGGTAAGACAGTTA CCGCAGCGGCAGTGATTGCCAAGCGGAAAGTGAATACACTGATACTGGTACACCGGACTGAATTGCTGAAGCAGTGGCAG GAGCGTCTCGCGGTGTTTCTGCAGGCCGGTGACAGTATTGGTATTATCGGGGGAGGTAAACATAAACCCTGTGGCAATAT TGATATTGCGGTGGTGCAGTCCATATCCAGACAAGGAGAAGTTGAACCTCTGGTCAGGAATTATGGGCAAATCATTGTGG ATGAGTGCCATCATATTGGCGCGGTTTCATTTTCTGCGATTCTGAAGGAAACGAATGCCAGATATCTGCTTGGCCTGACG GCAACACCAATCCGACGGGATGGTCTGCATCCCATTATTTTTATGTACTGTGGTGCCATTCGCCATACAGCGGTCCGCCC GAAGGAAAGCCCACATAATCTGGAGGTACTGATCCGTTCCCGTTTTACATCTGGTCATTTACCATCGGATGCGAGAATCC AGGATATTTTCAGAGAAATTGCTCTGGATCATGACAGAACGGTGGCGATAGCTGAAGAAGCCATGAAAGCTTTCGGGCAG GGGCGAAAAGTTCTGGTACTGACTGAACGTACAGATCATCTGGATGAGATAGCATCAGTGATGAATTCACTGAAATTGTC TCCCTTTATTCTCCATGGTCGACTATCGAAAAAAAAGCGTGCGATGCTGATATCCGGGCTGAATGCTCTTCCTCCCGATT CTCCTCGAATTTTGTTGTCAACAGGCAGACTTATTGGTGAGGGATTTGACCACCCTCCGCTGGATACGCTGATTCTTGCC ATGCCTGTGTCATGGAAAGGGACATTACAGCAGTATGCAGGGCGTCTTCACAGAGAGCATACCGGTAAAAGCGATGTCAG GATCATTGATTTTGTGGATACCGCGTATCCTGTGCTGCTCAGAATGTGGGATAAACGTCAGCGGGGTTATAAAGCGATGG GGTACAGGATTATAGCTGACGGCGATGAATCAGTGATTTAA
Upstream 100 bases:
>100_bases TTGAGTTAATAGGCAGAAGAGGGCGGGGCGCTTTGTCATCAGTGTAGGGGACCCGAATTTCTGCCGTGGTGCCTCAGTGG ATCGGATACAGGTAACGAAT
Downstream 100 bases:
>100_bases TTTTAAGGAGTACTTTACCCTTTTTAAATAATTTGAGCTCCCCGGGTTGTTCGGGCAACAAAGAGTCCCAGTATACAGGC TTCAGGAATATGTTGTAGTA
Product: type III restriction enzyme, res subunit family
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 786; Mature: 786
Protein sequence:
>786_residues MVHKSDSDELAALRAENARLVSLLEAHGIEWRRKPQTPVQRVSVLSTDEKVALFRRLFRGRDDVWALRWESKTSGKSGYS PACANEWQARICGKPRIKCGDCAHRQLIPVSDLVIYHHLAGTHTVGMYPLLEDDSCYFLAVDFDEAEWQKDASAFMRSCD ELGVPAALEISRSRQGAHVWIFFASRVSAREARRLGTAIISYTCSRTRQLRLGSYDRLFPNQDTMPKGGFGNLIALPLQK RPRELGGSVFVDMNLQPYPDQWAFLVSVTPMNVQDIEPTILRATGSIHPLDVNFINEEDLGTPWEEKKSSGNRLNISIAE PLKITLANQIYFEKAQLPQVLINRLIRLAAFPNPEFYKAQAMRMSVWNKPRVIGCAENYPQHIALPRGCLDSVLSFLRDN NIAAELIDKRFAGTECNAVFMGNLRAEQEEAVSALLRYDTGVLCAPTAFGKTVTAAAVIAKRKVNTLILVHRTELLKQWQ ERLAVFLQAGDSIGIIGGGKHKPCGNIDIAVVQSISRQGEVEPLVRNYGQIIVDECHHIGAVSFSAILKETNARYLLGLT ATPIRRDGLHPIIFMYCGAIRHTAVRPKESPHNLEVLIRSRFTSGHLPSDARIQDIFREIALDHDRTVAIAEEAMKAFGQ GRKVLVLTERTDHLDEIASVMNSLKLSPFILHGRLSKKKRAMLISGLNALPPDSPRILLSTGRLIGEGFDHPPLDTLILA MPVSWKGTLQQYAGRLHREHTGKSDVRIIDFVDTAYPVLLRMWDKRQRGYKAMGYRIIADGDESVI
Sequences:
>Translated_786_residues MVHKSDSDELAALRAENARLVSLLEAHGIEWRRKPQTPVQRVSVLSTDEKVALFRRLFRGRDDVWALRWESKTSGKSGYS PACANEWQARICGKPRIKCGDCAHRQLIPVSDLVIYHHLAGTHTVGMYPLLEDDSCYFLAVDFDEAEWQKDASAFMRSCD ELGVPAALEISRSRQGAHVWIFFASRVSAREARRLGTAIISYTCSRTRQLRLGSYDRLFPNQDTMPKGGFGNLIALPLQK RPRELGGSVFVDMNLQPYPDQWAFLVSVTPMNVQDIEPTILRATGSIHPLDVNFINEEDLGTPWEEKKSSGNRLNISIAE PLKITLANQIYFEKAQLPQVLINRLIRLAAFPNPEFYKAQAMRMSVWNKPRVIGCAENYPQHIALPRGCLDSVLSFLRDN NIAAELIDKRFAGTECNAVFMGNLRAEQEEAVSALLRYDTGVLCAPTAFGKTVTAAAVIAKRKVNTLILVHRTELLKQWQ ERLAVFLQAGDSIGIIGGGKHKPCGNIDIAVVQSISRQGEVEPLVRNYGQIIVDECHHIGAVSFSAILKETNARYLLGLT ATPIRRDGLHPIIFMYCGAIRHTAVRPKESPHNLEVLIRSRFTSGHLPSDARIQDIFREIALDHDRTVAIAEEAMKAFGQ GRKVLVLTERTDHLDEIASVMNSLKLSPFILHGRLSKKKRAMLISGLNALPPDSPRILLSTGRLIGEGFDHPPLDTLILA MPVSWKGTLQQYAGRLHREHTGKSDVRIIDFVDTAYPVLLRMWDKRQRGYKAMGYRIIADGDESVI >Mature_786_residues MVHKSDSDELAALRAENARLVSLLEAHGIEWRRKPQTPVQRVSVLSTDEKVALFRRLFRGRDDVWALRWESKTSGKSGYS PACANEWQARICGKPRIKCGDCAHRQLIPVSDLVIYHHLAGTHTVGMYPLLEDDSCYFLAVDFDEAEWQKDASAFMRSCD ELGVPAALEISRSRQGAHVWIFFASRVSAREARRLGTAIISYTCSRTRQLRLGSYDRLFPNQDTMPKGGFGNLIALPLQK RPRELGGSVFVDMNLQPYPDQWAFLVSVTPMNVQDIEPTILRATGSIHPLDVNFINEEDLGTPWEEKKSSGNRLNISIAE PLKITLANQIYFEKAQLPQVLINRLIRLAAFPNPEFYKAQAMRMSVWNKPRVIGCAENYPQHIALPRGCLDSVLSFLRDN NIAAELIDKRFAGTECNAVFMGNLRAEQEEAVSALLRYDTGVLCAPTAFGKTVTAAAVIAKRKVNTLILVHRTELLKQWQ ERLAVFLQAGDSIGIIGGGKHKPCGNIDIAVVQSISRQGEVEPLVRNYGQIIVDECHHIGAVSFSAILKETNARYLLGLT ATPIRRDGLHPIIFMYCGAIRHTAVRPKESPHNLEVLIRSRFTSGHLPSDARIQDIFREIALDHDRTVAIAEEAMKAFGQ GRKVLVLTERTDHLDEIASVMNSLKLSPFILHGRLSKKKRAMLISGLNALPPDSPRILLSTGRLIGEGFDHPPLDTLILA MPVSWKGTLQQYAGRLHREHTGKSDVRIIDFVDTAYPVLLRMWDKRQRGYKAMGYRIIADGDESVI
Specific function: Unknown
COG id: COG1061
COG function: function code KL; DNA or RNA helicases of superfamily II
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 helicase C-terminal domain [H]
Homologues:
Organism=Escherichia coli, GI1788511, Length=371, Percent_Identity=26.4150943396226, Blast_Score=77, Evalue=4e-15,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR014001 - InterPro: IPR001650 - InterPro: IPR014021 - InterPro: IPR006935 [H]
Pfam domain/function: PF00271 Helicase_C; PF04851 ResIII [H]
EC number: NA
Molecular weight: Translated: 88036; Mature: 88036
Theoretical pI: Translated: 8.98; Mature: 8.98
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.8 %Cys (Translated Protein) 2.0 %Met (Translated Protein) 3.8 %Cys+Met (Translated Protein) 1.8 %Cys (Mature Protein) 2.0 %Met (Mature Protein) 3.8 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MVHKSDSDELAALRAENARLVSLLEAHGIEWRRKPQTPVQRVSVLSTDEKVALFRRLFRG CCCCCCCCHHHHHHCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCHHHHHHHHHHCC RDDVWALRWESKTSGKSGYSPACANEWQARICGKPRIKCGDCAHRQLIPVSDLVIYHHLA CCCEEEEEECCCCCCCCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHC GTHTVGMYPLLEDDSCYFLAVDFDEAEWQKDASAFMRSCDELGVPAALEISRSRQGAHVW CCCEEEEEEEEECCCEEEEEEECCHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCEEE IFFASRVSAREARRLGTAIISYTCSRTRQLRLGSYDRLFPNQDTMPKGGFGNLIALPLQK EEEECHHHHHHHHHHHHHHHHHHHHHCCEECCCCCCCCCCCCCCCCCCCCCCEEEECCCC RPRELGGSVFVDMNLQPYPDQWAFLVSVTPMNVQDIEPTILRATGSIHPLDVNFINEEDL CHHHCCCEEEEEECCCCCCCCEEEEEEECCCCCCCCCCEEEEECCCCCEEEEEEECCCCC GTPWEEKKSSGNRLNISIAEPLKITLANQIYFEKAQLPQVLINRLIRLAAFPNPEFYKAQ CCCHHHHCCCCCEEEEEECCCEEEEEEHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHH AMRMSVWNKPRVIGCAENYPQHIALPRGCLDSVLSFLRDNNIAAELIDKRFAGTECNAVF HHHHHHCCCCCEEECCCCCCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCCEEE MGNLRAEQEEAVSALLRYDTGVLCAPTAFGKTVTAAAVIAKRKVNTLILVHRTELLKQWQ ECCCCCHHHHHHHHHHHHCCCCEECCCCCCCHHHHHHHHHHHCCCEEEEEEHHHHHHHHH ERLAVFLQAGDSIGIIGGGKHKPCGNIDIAVVQSISRQGEVEPLVRNYGQIIVDECHHIG HHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHHCCCCCHHHHHCCHHHHHHHHHCC AVSFSAILKETNARYLLGLTATPIRRDGLHPIIFMYCGAIRHTAVRPKESPHNLEVLIRS HHHHHHHHHHCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCEEEEEE RFTSGHLPSDARIQDIFREIALDHDRTVAIAEEAMKAFGQGRKVLVLTERTDHLDEIASV CCCCCCCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHH MNSLKLSPFILHGRLSKKKRAMLISGLNALPPDSPRILLSTGRLIGEGFDHPPLDTLILA HHHHCCCEEEEECCCCHHHHHHHHCCCCCCCCCCCEEEEECCHHHHCCCCCCCCCCEEEE MPVSWKGTLQQYAGRLHREHTGKSDVRIIDFVDTAYPVLLRMWDKRQRGYKAMGYRIIAD ECCCCHHHHHHHHHHHHHHCCCCCCCEEEEEHHHHHHHHHHHHHHHHCCCHHHCEEEEEC GDESVI CCCCCC >Mature Secondary Structure MVHKSDSDELAALRAENARLVSLLEAHGIEWRRKPQTPVQRVSVLSTDEKVALFRRLFRG CCCCCCCCHHHHHHCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCHHHHHHHHHHCC RDDVWALRWESKTSGKSGYSPACANEWQARICGKPRIKCGDCAHRQLIPVSDLVIYHHLA CCCEEEEEECCCCCCCCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHC GTHTVGMYPLLEDDSCYFLAVDFDEAEWQKDASAFMRSCDELGVPAALEISRSRQGAHVW CCCEEEEEEEEECCCEEEEEEECCHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCEEE IFFASRVSAREARRLGTAIISYTCSRTRQLRLGSYDRLFPNQDTMPKGGFGNLIALPLQK EEEECHHHHHHHHHHHHHHHHHHHHHCCEECCCCCCCCCCCCCCCCCCCCCCEEEECCCC RPRELGGSVFVDMNLQPYPDQWAFLVSVTPMNVQDIEPTILRATGSIHPLDVNFINEEDL CHHHCCCEEEEEECCCCCCCCEEEEEEECCCCCCCCCCEEEEECCCCCEEEEEEECCCCC GTPWEEKKSSGNRLNISIAEPLKITLANQIYFEKAQLPQVLINRLIRLAAFPNPEFYKAQ CCCHHHHCCCCCEEEEEECCCEEEEEEHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHH AMRMSVWNKPRVIGCAENYPQHIALPRGCLDSVLSFLRDNNIAAELIDKRFAGTECNAVF HHHHHHCCCCCEEECCCCCCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCCEEE MGNLRAEQEEAVSALLRYDTGVLCAPTAFGKTVTAAAVIAKRKVNTLILVHRTELLKQWQ ECCCCCHHHHHHHHHHHHCCCCEECCCCCCCHHHHHHHHHHHCCCEEEEEEHHHHHHHHH ERLAVFLQAGDSIGIIGGGKHKPCGNIDIAVVQSISRQGEVEPLVRNYGQIIVDECHHIG HHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHHCCCCCHHHHHCCHHHHHHHHHCC AVSFSAILKETNARYLLGLTATPIRRDGLHPIIFMYCGAIRHTAVRPKESPHNLEVLIRS HHHHHHHHHHCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCEEEEEE RFTSGHLPSDARIQDIFREIALDHDRTVAIAEEAMKAFGQGRKVLVLTERTDHLDEIASV CCCCCCCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHH MNSLKLSPFILHGRLSKKKRAMLISGLNALPPDSPRILLSTGRLIGEGFDHPPLDTLILA HHHHCCCEEEEECCCCHHHHHHHHCCCCCCCCCCCEEEEECCHHHHCCCCCCCCCCEEEE MPVSWKGTLQQYAGRLHREHTGKSDVRIIDFVDTAYPVLLRMWDKRQRGYKAMGYRIIAD ECCCCHHHHHHHHHHHHHHCCCCCCCEEEEEHHHHHHHHHHHHHHHHCCCHHHCEEEEEC GDESVI CCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9278503 [H]