| Definition | Escherichia coli O157:H7 str. EC4115, complete genome. |
|---|---|
| Accession | NC_011353 |
| Length | 5,572,075 |
Click here to switch to the map view.
The map label for this gene is oxaA [H]
Identifier: 209400160
GI number: 209400160
Start: 4774413
End: 4776059
Strand: Direct
Name: oxaA [H]
Synonym: ECH74115_5135
Alternate gene names: 209400160
Gene position: 4774413-4776059 (Clockwise)
Preceding gene: 209396527
Following gene: 209395938
Centisome position: 85.68
GC content: 53.67
Gene sequence:
>1647_bases ATGGATTCGCAACGCAATCTTTTAGTCATCGCTTTGCTGTTCGTGTCTTTCATGATCTGGCAAGCCTGGGAGCAGGATAA AAACCCGCAACCTCAGGCCCAACAGACCACGCAGACAACGACCACCGCAGCGGGTAGCGCCGCCGACCAGGGCGTACCGG CCAGTGGCCAGGGGAAACTGATCTCGGTTAAGACCGACGTGCTTGATCTGACCATCAACACCCGTGGTGGTGATGTTGAG CAAGCTCTGCTGCCTGCTTACCCGAAAGAGCTGAACTCTACCCAGCCGTTCCAGCTGCTGGAAACTTCACCGCAGTTTAT TTATCAGGCACAGAGCGGTCTGACCGGTCGTGATGGCCCAGATAACCCGGCTAACGGCCCGCGTCCGCTGTATAACGTTG AAAAAGACGCTTATGTGCTGGCTGAAGGTCAAAACGAACTGCAGGTGCCGATGACGTATACCGACGCGGCAGGCAACACG TTTACCAAAACGTTTGTCCTGAAACGGGGTGATTACGCTGTCAACGTCAACTACAACGTGCAGAACGCTGGCGAGAAACC GCTGGAAATCTCCACCTTTGGTCAGTTGAAGCAATCCATCACTCTGCCACCGCATCTCGATACCGGAAGCAGCAACTTCG CACTGCACACCTTCCGTGGCGCGGCGTACTCCACGCCTGACGAGAAGTACGAGAAATACAAGTTCGATACCATTGCCGAT AACGAAAACCTGAACATCTCTTCGAAAGGTGGTTGGGTGGCAATGCTGCAACAGTATTTCGCGACGGCGTGGATCCCGCA TAACGACGGTACCAACAACTTCTATACTGCTAATCTGGGTAACGGCATCGCCGCTATCGGCTATAAATCTCAGCCGGTAC TGGTTCAGCCTGGTCAGACTGGCGCGATGAACAGCACCCTGTGGGTTGGCCCGGAAATCCAGGACAAAATGGCAGCTGTT GCTCCGCACCTGGATCTGACCGTTGATTACGGTTGGTTGTGGTTCATCTCTCAGCCGCTGTTCAAACTGCTGAAATGGAT CCATAGCTTTGTGGGTAACTGGGGCTTCTCCATTATCATCATCACCTTTATCGTTCGTGGCATCATGTACCCGCTGACCA AAGCGCAGTACACCTCCATGGCGAAGATGCGTATGCTGCAGCCGAAGATTCAGGCAATGCGTGAGCGTCTGGGCGATGAC AAACAGCGTATCAGCCAGGAAATGATGGCGCTGTACAAAGCTGAGAAGGTTAACCCGCTGGGCGGCTGCTTCCCGCTGCT GATCCAGATGCCAATCTTCCTGGCGTTGTACTACATGCTGATGGGTTCCGTTGAACTGCGTCAGGCACCGTTTGCACTGT GGATCCACGACCTGTCTGCACAGGACCCGTACTACATCCTGCCGATCCTGATGGGCGTAACGATGTTCTTCATTCAGAAG ATGTCGCCGACCACTGTGACCGACCCGATGCAGCAGAAGATCATGACCTTTATGCCGGTCATCTTCACCGTGTTCTTCCT GTGGTTCCCGTCAGGTCTGGTGCTGTACTATATCGTCAGCAACCTGGTAACCATTATTCAGCAGCAGCTGATTTACCGTG GTCTGGAAAAACGTGGCCTGCATAGCCGCGAGAAGAAAAAATCCTGA
Upstream 100 bases:
>100_bases GTTGGTTGACGGTGAAACGCGTATTAAAATGCCACCCTTTACACCCTGGTGGTGACGATCCCGTCCCGCCCGGACCATTT AATACCAGAGAACACTAACG
Downstream 100 bases:
>100_bases TTCGGTGAGTTTTCGCTAAAATAAGGGCGGTCAGTTGACCGCCTTTTTTCTTTTCGTAGGGCGGATAAGCACCGCGTATC CGCCACACAAAGCAACAGGA
Product: putative inner membrane protein translocase component YidC
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 548; Mature: 548
Protein sequence:
>548_residues MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTTTAAGSAADQGVPASGQGKLISVKTDVLDLTINTRGGDVE QALLPAYPKELNSTQPFQLLETSPQFIYQAQSGLTGRDGPDNPANGPRPLYNVEKDAYVLAEGQNELQVPMTYTDAAGNT FTKTFVLKRGDYAVNVNYNVQNAGEKPLEISTFGQLKQSITLPPHLDTGSSNFALHTFRGAAYSTPDEKYEKYKFDTIAD NENLNISSKGGWVAMLQQYFATAWIPHNDGTNNFYTANLGNGIAAIGYKSQPVLVQPGQTGAMNSTLWVGPEIQDKMAAV APHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDD KQRISQEMMALYKAEKVNPLGGCFPLLIQMPIFLALYYMLMGSVELRQAPFALWIHDLSAQDPYYILPILMGVTMFFIQK MSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKRGLHSREKKKS
Sequences:
>Translated_548_residues MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTTTAAGSAADQGVPASGQGKLISVKTDVLDLTINTRGGDVE QALLPAYPKELNSTQPFQLLETSPQFIYQAQSGLTGRDGPDNPANGPRPLYNVEKDAYVLAEGQNELQVPMTYTDAAGNT FTKTFVLKRGDYAVNVNYNVQNAGEKPLEISTFGQLKQSITLPPHLDTGSSNFALHTFRGAAYSTPDEKYEKYKFDTIAD NENLNISSKGGWVAMLQQYFATAWIPHNDGTNNFYTANLGNGIAAIGYKSQPVLVQPGQTGAMNSTLWVGPEIQDKMAAV APHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDD KQRISQEMMALYKAEKVNPLGGCFPLLIQMPIFLALYYMLMGSVELRQAPFALWIHDLSAQDPYYILPILMGVTMFFIQK MSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKRGLHSREKKKS >Mature_548_residues MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTTTAAGSAADQGVPASGQGKLISVKTDVLDLTINTRGGDVE QALLPAYPKELNSTQPFQLLETSPQFIYQAQSGLTGRDGPDNPANGPRPLYNVEKDAYVLAEGQNELQVPMTYTDAAGNT FTKTFVLKRGDYAVNVNYNVQNAGEKPLEISTFGQLKQSITLPPHLDTGSSNFALHTFRGAAYSTPDEKYEKYKFDTIAD NENLNISSKGGWVAMLQQYFATAWIPHNDGTNNFYTANLGNGIAAIGYKSQPVLVQPGQTGAMNSTLWVGPEIQDKMAAV APHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDD KQRISQEMMALYKAEKVNPLGGCFPLLIQMPIFLALYYMLMGSVELRQAPFALWIHDLSAQDPYYILPILMGVTMFFIQK MSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKRGLHSREKKKS
Specific function: Required for the insertion of integral membrane proteins into the membrane. Probably plays an essential role in the integration of proteins of the respiratory chain complexes. Involved in integration of membrane proteins that insert dependently and indepe
COG id: COG0706
COG function: function code U; Preprotein translocase subunit YidC
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein [H]
Metaboloic importance: Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the OXA1/oxaA family. Type 1 subfamily [H]
Homologues:
Organism=Escherichia coli, GI1790140, Length=548, Percent_Identity=99.8175182481752, Blast_Score=1129, Evalue=0.0, Organism=Drosophila melanogaster, GI24662345, Length=196, Percent_Identity=26.0204081632653, Blast_Score=77, Evalue=2e-14, Organism=Drosophila melanogaster, GI21356443, Length=196, Percent_Identity=26.0204081632653, Blast_Score=77, Evalue=2e-14,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR019998 - InterPro: IPR001708 [H]
Pfam domain/function: PF02096 60KD_IMP [H]
EC number: NA
Molecular weight: Translated: 61541; Mature: 61541
Theoretical pI: Translated: 8.27; Mature: 8.27
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.2 %Cys (Translated Protein) 4.0 %Met (Translated Protein) 4.2 %Cys+Met (Translated Protein) 0.2 %Cys (Mature Protein) 4.0 %Met (Mature Protein) 4.2 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTTTAAGSAADQGVPASGQGKL CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCE ISVKTDVLDLTINTRGGDVEQALLPAYPKELNSTQPFQLLETSPQFIYQAQSGLTGRDGP EEEEEEEEEEEEECCCCCHHHHHHCCCHHHCCCCCCHHHHHCCHHHHHHHHCCCCCCCCC DNPANGPRPLYNVEKDAYVLAEGQNELQVPMTYTDAAGNTFTKTFVLKRGDYAVNVNYNV CCCCCCCCCCCCCCCCEEEEECCCCCEEEEEEEECCCCCCEEEEEEEECCCEEEEEEEEE QNAGEKPLEISTFGQLKQSITLPPHLDTGSSNFALHTFRGAAYSTPDEKYEKYKFDTIAD CCCCCCCEEEHHHHHHHHHCCCCCCCCCCCCCEEEEEECCCCCCCCHHHHHHEECCEECC NENLNISSKGGWVAMLQQYFATAWIPHNDGTNNFYTANLGNGIAAIGYKSQPVLVQPGQT CCCCEECCCCCHHHHHHHHHHHEECCCCCCCCCEEEEECCCCEEEECCCCCCEEECCCCC GAMNSTLWVGPEIQDKMAAVAPHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSIII CCCCCEEEECCCHHHHHHHHCCCCEEEECCHHHHHHHHHHHHHHHHHHHHHCCCCHHHHH ITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDDKQRISQEMMALYKAEKVNPL HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCC GGCFPLLIQMPIFLALYYMLMGSVELRQAPFALWIHDLSAQDPYYILPILMGVTMFFIQK CHHHHHHHHHHHHHHHHHHHHCCHHHHHCCEEEEEEECCCCCCEEHHHHHHHHHHHHHHH MSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKRGL CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC HSREKKKS CCHHHCCC >Mature Secondary Structure MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTTTAAGSAADQGVPASGQGKL CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCE ISVKTDVLDLTINTRGGDVEQALLPAYPKELNSTQPFQLLETSPQFIYQAQSGLTGRDGP EEEEEEEEEEEEECCCCCHHHHHHCCCHHHCCCCCCHHHHHCCHHHHHHHHCCCCCCCCC DNPANGPRPLYNVEKDAYVLAEGQNELQVPMTYTDAAGNTFTKTFVLKRGDYAVNVNYNV CCCCCCCCCCCCCCCCEEEEECCCCCEEEEEEEECCCCCCEEEEEEEECCCEEEEEEEEE QNAGEKPLEISTFGQLKQSITLPPHLDTGSSNFALHTFRGAAYSTPDEKYEKYKFDTIAD CCCCCCCEEEHHHHHHHHHCCCCCCCCCCCCCEEEEEECCCCCCCCHHHHHHEECCEECC NENLNISSKGGWVAMLQQYFATAWIPHNDGTNNFYTANLGNGIAAIGYKSQPVLVQPGQT CCCCEECCCCCHHHHHHHHHHHEECCCCCCCCCEEEEECCCCEEEECCCCCCEEECCCCC GAMNSTLWVGPEIQDKMAAVAPHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSIII CCCCCEEEECCCHHHHHHHHCCCCEEEECCHHHHHHHHHHHHHHHHHHHHHCCCCHHHHH ITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDDKQRISQEMMALYKAEKVNPL HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCC GGCFPLLIQMPIFLALYYMLMGSVELRQAPFALWIHDLSAQDPYYILPILMGVTMFFIQK CHHHHHHHHHHHHHHHHHHHHCCHHHHHCCEEEEEEECCCCCCEEHHHHHHHHHHHHHHH MSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKRGL CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC HSREKKKS CCHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: NA