Definition Escherichia coli O157:H7 str. EC4115, complete genome.
Accession NC_011353
Length 5,572,075

Click here to switch to the map view.

The map label for this gene is oxaA [H]

Identifier: 209400160

GI number: 209400160

Start: 4774413

End: 4776059

Strand: Direct

Name: oxaA [H]

Synonym: ECH74115_5135

Alternate gene names: 209400160

Gene position: 4774413-4776059 (Clockwise)

Preceding gene: 209396527

Following gene: 209395938

Centisome position: 85.68

GC content: 53.67

Gene sequence:

>1647_bases
ATGGATTCGCAACGCAATCTTTTAGTCATCGCTTTGCTGTTCGTGTCTTTCATGATCTGGCAAGCCTGGGAGCAGGATAA
AAACCCGCAACCTCAGGCCCAACAGACCACGCAGACAACGACCACCGCAGCGGGTAGCGCCGCCGACCAGGGCGTACCGG
CCAGTGGCCAGGGGAAACTGATCTCGGTTAAGACCGACGTGCTTGATCTGACCATCAACACCCGTGGTGGTGATGTTGAG
CAAGCTCTGCTGCCTGCTTACCCGAAAGAGCTGAACTCTACCCAGCCGTTCCAGCTGCTGGAAACTTCACCGCAGTTTAT
TTATCAGGCACAGAGCGGTCTGACCGGTCGTGATGGCCCAGATAACCCGGCTAACGGCCCGCGTCCGCTGTATAACGTTG
AAAAAGACGCTTATGTGCTGGCTGAAGGTCAAAACGAACTGCAGGTGCCGATGACGTATACCGACGCGGCAGGCAACACG
TTTACCAAAACGTTTGTCCTGAAACGGGGTGATTACGCTGTCAACGTCAACTACAACGTGCAGAACGCTGGCGAGAAACC
GCTGGAAATCTCCACCTTTGGTCAGTTGAAGCAATCCATCACTCTGCCACCGCATCTCGATACCGGAAGCAGCAACTTCG
CACTGCACACCTTCCGTGGCGCGGCGTACTCCACGCCTGACGAGAAGTACGAGAAATACAAGTTCGATACCATTGCCGAT
AACGAAAACCTGAACATCTCTTCGAAAGGTGGTTGGGTGGCAATGCTGCAACAGTATTTCGCGACGGCGTGGATCCCGCA
TAACGACGGTACCAACAACTTCTATACTGCTAATCTGGGTAACGGCATCGCCGCTATCGGCTATAAATCTCAGCCGGTAC
TGGTTCAGCCTGGTCAGACTGGCGCGATGAACAGCACCCTGTGGGTTGGCCCGGAAATCCAGGACAAAATGGCAGCTGTT
GCTCCGCACCTGGATCTGACCGTTGATTACGGTTGGTTGTGGTTCATCTCTCAGCCGCTGTTCAAACTGCTGAAATGGAT
CCATAGCTTTGTGGGTAACTGGGGCTTCTCCATTATCATCATCACCTTTATCGTTCGTGGCATCATGTACCCGCTGACCA
AAGCGCAGTACACCTCCATGGCGAAGATGCGTATGCTGCAGCCGAAGATTCAGGCAATGCGTGAGCGTCTGGGCGATGAC
AAACAGCGTATCAGCCAGGAAATGATGGCGCTGTACAAAGCTGAGAAGGTTAACCCGCTGGGCGGCTGCTTCCCGCTGCT
GATCCAGATGCCAATCTTCCTGGCGTTGTACTACATGCTGATGGGTTCCGTTGAACTGCGTCAGGCACCGTTTGCACTGT
GGATCCACGACCTGTCTGCACAGGACCCGTACTACATCCTGCCGATCCTGATGGGCGTAACGATGTTCTTCATTCAGAAG
ATGTCGCCGACCACTGTGACCGACCCGATGCAGCAGAAGATCATGACCTTTATGCCGGTCATCTTCACCGTGTTCTTCCT
GTGGTTCCCGTCAGGTCTGGTGCTGTACTATATCGTCAGCAACCTGGTAACCATTATTCAGCAGCAGCTGATTTACCGTG
GTCTGGAAAAACGTGGCCTGCATAGCCGCGAGAAGAAAAAATCCTGA

Upstream 100 bases:

>100_bases
GTTGGTTGACGGTGAAACGCGTATTAAAATGCCACCCTTTACACCCTGGTGGTGACGATCCCGTCCCGCCCGGACCATTT
AATACCAGAGAACACTAACG

Downstream 100 bases:

>100_bases
TTCGGTGAGTTTTCGCTAAAATAAGGGCGGTCAGTTGACCGCCTTTTTTCTTTTCGTAGGGCGGATAAGCACCGCGTATC
CGCCACACAAAGCAACAGGA

Product: putative inner membrane protein translocase component YidC

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 548; Mature: 548

Protein sequence:

>548_residues
MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTTTAAGSAADQGVPASGQGKLISVKTDVLDLTINTRGGDVE
QALLPAYPKELNSTQPFQLLETSPQFIYQAQSGLTGRDGPDNPANGPRPLYNVEKDAYVLAEGQNELQVPMTYTDAAGNT
FTKTFVLKRGDYAVNVNYNVQNAGEKPLEISTFGQLKQSITLPPHLDTGSSNFALHTFRGAAYSTPDEKYEKYKFDTIAD
NENLNISSKGGWVAMLQQYFATAWIPHNDGTNNFYTANLGNGIAAIGYKSQPVLVQPGQTGAMNSTLWVGPEIQDKMAAV
APHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDD
KQRISQEMMALYKAEKVNPLGGCFPLLIQMPIFLALYYMLMGSVELRQAPFALWIHDLSAQDPYYILPILMGVTMFFIQK
MSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKRGLHSREKKKS

Sequences:

>Translated_548_residues
MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTTTAAGSAADQGVPASGQGKLISVKTDVLDLTINTRGGDVE
QALLPAYPKELNSTQPFQLLETSPQFIYQAQSGLTGRDGPDNPANGPRPLYNVEKDAYVLAEGQNELQVPMTYTDAAGNT
FTKTFVLKRGDYAVNVNYNVQNAGEKPLEISTFGQLKQSITLPPHLDTGSSNFALHTFRGAAYSTPDEKYEKYKFDTIAD
NENLNISSKGGWVAMLQQYFATAWIPHNDGTNNFYTANLGNGIAAIGYKSQPVLVQPGQTGAMNSTLWVGPEIQDKMAAV
APHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDD
KQRISQEMMALYKAEKVNPLGGCFPLLIQMPIFLALYYMLMGSVELRQAPFALWIHDLSAQDPYYILPILMGVTMFFIQK
MSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKRGLHSREKKKS
>Mature_548_residues
MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTTTAAGSAADQGVPASGQGKLISVKTDVLDLTINTRGGDVE
QALLPAYPKELNSTQPFQLLETSPQFIYQAQSGLTGRDGPDNPANGPRPLYNVEKDAYVLAEGQNELQVPMTYTDAAGNT
FTKTFVLKRGDYAVNVNYNVQNAGEKPLEISTFGQLKQSITLPPHLDTGSSNFALHTFRGAAYSTPDEKYEKYKFDTIAD
NENLNISSKGGWVAMLQQYFATAWIPHNDGTNNFYTANLGNGIAAIGYKSQPVLVQPGQTGAMNSTLWVGPEIQDKMAAV
APHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDD
KQRISQEMMALYKAEKVNPLGGCFPLLIQMPIFLALYYMLMGSVELRQAPFALWIHDLSAQDPYYILPILMGVTMFFIQK
MSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKRGLHSREKKKS

Specific function: Required for the insertion of integral membrane proteins into the membrane. Probably plays an essential role in the integration of proteins of the respiratory chain complexes. Involved in integration of membrane proteins that insert dependently and indepe

COG id: COG0706

COG function: function code U; Preprotein translocase subunit YidC

Gene ontology:

Cell location: Cell inner membrane; Multi-pass membrane protein [H]

Metaboloic importance: Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the OXA1/oxaA family. Type 1 subfamily [H]

Homologues:

Organism=Escherichia coli, GI1790140, Length=548, Percent_Identity=99.8175182481752, Blast_Score=1129, Evalue=0.0,
Organism=Drosophila melanogaster, GI24662345, Length=196, Percent_Identity=26.0204081632653, Blast_Score=77, Evalue=2e-14,
Organism=Drosophila melanogaster, GI21356443, Length=196, Percent_Identity=26.0204081632653, Blast_Score=77, Evalue=2e-14,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR019998
- InterPro:   IPR001708 [H]

Pfam domain/function: PF02096 60KD_IMP [H]

EC number: NA

Molecular weight: Translated: 61541; Mature: 61541

Theoretical pI: Translated: 8.27; Mature: 8.27

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.2 %Cys     (Translated Protein)
4.0 %Met     (Translated Protein)
4.2 %Cys+Met (Translated Protein)
0.2 %Cys     (Mature Protein)
4.0 %Met     (Mature Protein)
4.2 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTTTAAGSAADQGVPASGQGKL
CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCE
ISVKTDVLDLTINTRGGDVEQALLPAYPKELNSTQPFQLLETSPQFIYQAQSGLTGRDGP
EEEEEEEEEEEEECCCCCHHHHHHCCCHHHCCCCCCHHHHHCCHHHHHHHHCCCCCCCCC
DNPANGPRPLYNVEKDAYVLAEGQNELQVPMTYTDAAGNTFTKTFVLKRGDYAVNVNYNV
CCCCCCCCCCCCCCCCEEEEECCCCCEEEEEEEECCCCCCEEEEEEEECCCEEEEEEEEE
QNAGEKPLEISTFGQLKQSITLPPHLDTGSSNFALHTFRGAAYSTPDEKYEKYKFDTIAD
CCCCCCCEEEHHHHHHHHHCCCCCCCCCCCCCEEEEEECCCCCCCCHHHHHHEECCEECC
NENLNISSKGGWVAMLQQYFATAWIPHNDGTNNFYTANLGNGIAAIGYKSQPVLVQPGQT
CCCCEECCCCCHHHHHHHHHHHEECCCCCCCCCEEEEECCCCEEEECCCCCCEEECCCCC
GAMNSTLWVGPEIQDKMAAVAPHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSIII
CCCCCEEEECCCHHHHHHHHCCCCEEEECCHHHHHHHHHHHHHHHHHHHHHCCCCHHHHH
ITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDDKQRISQEMMALYKAEKVNPL
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCC
GGCFPLLIQMPIFLALYYMLMGSVELRQAPFALWIHDLSAQDPYYILPILMGVTMFFIQK
CHHHHHHHHHHHHHHHHHHHHCCHHHHHCCEEEEEEECCCCCCEEHHHHHHHHHHHHHHH
MSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKRGL
CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
HSREKKKS
CCHHHCCC
>Mature Secondary Structure
MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTTTAAGSAADQGVPASGQGKL
CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCE
ISVKTDVLDLTINTRGGDVEQALLPAYPKELNSTQPFQLLETSPQFIYQAQSGLTGRDGP
EEEEEEEEEEEEECCCCCHHHHHHCCCHHHCCCCCCHHHHHCCHHHHHHHHCCCCCCCCC
DNPANGPRPLYNVEKDAYVLAEGQNELQVPMTYTDAAGNTFTKTFVLKRGDYAVNVNYNV
CCCCCCCCCCCCCCCCEEEEECCCCCEEEEEEEECCCCCCEEEEEEEECCCEEEEEEEEE
QNAGEKPLEISTFGQLKQSITLPPHLDTGSSNFALHTFRGAAYSTPDEKYEKYKFDTIAD
CCCCCCCEEEHHHHHHHHHCCCCCCCCCCCCCEEEEEECCCCCCCCHHHHHHEECCEECC
NENLNISSKGGWVAMLQQYFATAWIPHNDGTNNFYTANLGNGIAAIGYKSQPVLVQPGQT
CCCCEECCCCCHHHHHHHHHHHEECCCCCCCCCEEEEECCCCEEEECCCCCCEEECCCCC
GAMNSTLWVGPEIQDKMAAVAPHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSIII
CCCCCEEEECCCHHHHHHHHCCCCEEEECCHHHHHHHHHHHHHHHHHHHHHCCCCHHHHH
ITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDDKQRISQEMMALYKAEKVNPL
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCC
GGCFPLLIQMPIFLALYYMLMGSVELRQAPFALWIHDLSAQDPYYILPILMGVTMFFIQK
CHHHHHHHHHHHHHHHHHHHHCCHHHHHCCEEEEEEECCCCCCEEHHHHHHHHHHHHHHH
MSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKRGL
CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
HSREKKKS
CCHHHCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: NA