Definition | Escherichia coli O157:H7 str. EC4115, complete genome. |
---|---|
Accession | NC_011353 |
Length | 5,572,075 |
The map label for this gene is torA
Identifier: 209399846
GI number: 209399846
Start: 1242642
End: 1245188
Strand: Direct
Name: torA
Synonym: ECH74115_1233
Alternate gene names: 209399846
Gene position: 1242642-1245188 (Clockwise)
Preceding gene: 209400451
Following gene: 209399417
Centisome position: 22.3
GC content: 55.2
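The positional fields above can be cross-checked against one another. The following is a minimal sketch, not the database's own method: it assumes 1-based inclusive coordinates, and it assumes the centisome position is the start coordinate expressed as a percentage of the genome length, which agrees with the values in this record. The function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string; whitespace is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        return 0.0
    gc = cleaned.count("G") + cleaned.count("C")
    return 100.0 * gc / len(cleaned)


if __name__ == "__main__":
    # Values copied from this record.
    length = gene_length(1242642, 1245188)
    print("Gene length:", length)
    print("Codons excluding stop:", length // 3 - 1)
    print("Centisome position:", round(centisome(1242642, 5572075), 1))
```

Passing the gene sequence listed under "Gene sequence" to `gc_content` should give a figure comparable to the GC content field, provided the record uses the same definition.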
Gene sequence:
>2547_bases ATGAACAATAACGATCTCTTTCAGGCATCACGTCGGCGTTTTCTGGCACAACTCGGCGGCTTAACCGTCGCCGGGATGCT GGGGCCGTCATTGTTAACGTCGCGCCGTGCGACTGCGGCGCAAGCGGCGACTGAGGCTGTCATCTCGAAAGAGGGCATTC TTACCGGGTCGCACTGGGGGGCTATCCGCGCGACGGTGAAGGATGGTCGCTTTGTGGCGGCAAAACCGTTCGAACTGGAT AAATATCCGTCGAAAATGATTGCCGGATTGCCGGATCATGTACACAACGCGGCGCGTATTCGTTATCCGATGGTACGCGT GGACTGGCTGCGTAAGCGCCATCTGAGCGACACCTCCCAGCGCGGTGATAACCGTTTTGTGCGTGTGAGCTGGGATGAAG CCCTCGACATGTTCTATGAAGAACTGGAACGCGTACAGAAAACTCACGGGCCGAGTGCCTTGCTGACCGCCAGTGGTTGG CAATCGACGGGGATGTTCCATAACGCTTCGGGGATGCTGGCGAAAGCTATTGCCTTGCATGGTAATAGCGTTGGTACGGG CGGAGATTACTCTACTGGTGCTGCGCAGGTGATCCTGCCGCGCGTAGTCGGTTCGATGGAAGTGTATGAACAGCAAACCT CCTGGCCGCTGGTATTGCAGAACAGCAAAACCATTGTGCTGTGGGGCTCCGATTTGCTGAAAAACCAGCAAGCGAACTGG TGGTGCCCGGATCACGATGTTTATGAATATTACGCGCAGTTGAAAGCGAAAGTCGCCGCCGGTGAAATTGAGGTCATTAG CATCGATCCGGTTGTTACATCCACCCATGAGTATCTGGGGCGCGAGCATGTGAAGCACATTGCGGTTAACCCGCAAACTG ACGTGCCGCTGCAACTGGCGCTGGCGCATACGTTGTACAGTGAAAACCTGTACGACAAAAACTTCCTCGCTAACTACTGT GTGGGTTTTGAGCAGTTCCTGCCGTATCTGCTGGGTGAGAAAGACGGTCAGCCGAAAGATGCCGCATGGGCTGAAAAACT GACCGGCATTGATGCCGAAACCATTCGTGGACTGGCGCGGCAGATGGCGGCGAACAGAACGCAGATTATTGCTGGCTGGT GCGTACAGCGTATGCAGCACGGTGAACAGTGGGCGTGGATGATTGTGGTTCTGGCGGCGATGCTGGGGCAAATTGGCCTG CCAGGTGGTGGCTTTGGTTTTGGCTGGCACTATAACGGCGCAGGCACGCCGGGGCGTAAAGGCGTTATTCTGAGTGGTTT CTCCGGCTCTACGTCGATTCCGCCTGTTCACGACAACAGTGATTACAAAGGCTACAGCAGCACCATTCCGATTGCCCGTT TTATCGATGCGATCCTCGAACCGGGGAAAGTAATCAACTGGAACGGTAAATCGGTAAAACTGCCGCCGCTGAAAATGTGT ATTTTTGCCGGAACTAACCCCTTCCATCGCCATCAGCAGATCAACCGCATTATTGAAGGCTGGCGCAAACTGGAAACGGT TATCGCCATAGATAACCAGTGGACCTCAACCTGCCGCTTTGCCGATATCGTGCTGCCTGCGACCACGCAGTTTGAGCGTA ACGATCTCGACCAGTACGGCAACCACTCCAACCGTGGCATTATCGCCATGAAACAGGTCGTGCCGCCGCAGTTCGAGGCG CGCAACGACTTTGATATTTTCCGCGAGCTGTGCCGCCGCTTTAATCGCGAAGAAGCCTTTACCGAAGGGCTGGACGAAAT GGGCTGGCTGAAACGCATCTGGCAGGAAGGTGTTCAGCAAGGCAAAGGACGCGGCGTTCATTTGCCAGCGTTTGATGACT TCTGGAATAACAAAGAGTACGTCGAGTTTGACCATCCGCAGATGTTTGTTCGCCACCAGGCATTCCGCGAAGATCCAGAT 
CTCGAACCGCTGGGCACGCCGAGTGGCCTGATTGAGATCTATTCGAAAACCATCGCCGATATGAACTACGACGATTGTCA GGGGCATCCGATGTGGTTTGAGAAAATCGAACGCTCCCACGGTGGGCCCGGCTCGCAGACGTATCCGTTGCATCTGCAAT CTGTGCATCCGGATTTCCGACTTCACTCGCAGTTATGTGAGTCGGAAACTCTGCGTCAGCAATATACGGTAGCGGGTAAA GAGCCAGTGTTCATTAACCCGCAGGATGCCAGCGCGCGCGGTATTCGTAACGGTGATGTGGTACGCGTCTTTAACGCTCG CGGTCAGGTGTTGGCAGGGGCAGTAGTTTCTGACCGCTATGCACCCGGCGTGGCGCGAATTCACGAAGGGGCATGGCACG ATCCAGATAAAGGCGGCGAGCCTGGTGCGCTGTGCAAATACGGTAACCCCAACGTGTTGACCATTGACATCGGTACTTCG CAGCTGGCGCAGGCGACCAGTGCGCACACTACGCTGGTGGAAATTGAGAAGTGCAACGGAACAGTGGAGCAGGTAACGGC GTTTAACGGCCCCGTGGAGATGGTGGCGCAGTGCGAATATGTTCCCGCGTCGCAGGTGAAATTATGA
Upstream 100 bases:
>100_bases GCCTCGATAAACGTGAAGAACGCACCTTGTTGAAATATCTGCAAATGAATGCGTCTGACACCGCAGGTAAGGCTCACGGC GATAAGAAGGAAGAAAAATA
Downstream 100 bases:
>100_bases CCACGCTGACAGCACAACAGATAGCCTGTGTTTACGCCTGGCTGGCGCAGTTGTTCTCCCGTGAACTGGACGATGAACAA CTGACGCAAATCGCCAGTGC
Product: trimethylamine-N-oxide reductase
Products: NA
Alternate protein names: TMAO reductase 1; Trimethylamine oxidase 1
Number of amino acids: Translated: 848; Mature: 848
Protein sequence:
>848_residues MNNNDLFQASRRRFLAQLGGLTVAGMLGPSLLTSRRATAAQAATEAVISKEGILTGSHWGAIRATVKDGRFVAAKPFELD KYPSKMIAGLPDHVHNAARIRYPMVRVDWLRKRHLSDTSQRGDNRFVRVSWDEALDMFYEELERVQKTHGPSALLTASGW QSTGMFHNASGMLAKAIALHGNSVGTGGDYSTGAAQVILPRVVGSMEVYEQQTSWPLVLQNSKTIVLWGSDLLKNQQANW WCPDHDVYEYYAQLKAKVAAGEIEVISIDPVVTSTHEYLGREHVKHIAVNPQTDVPLQLALAHTLYSENLYDKNFLANYC VGFEQFLPYLLGEKDGQPKDAAWAEKLTGIDAETIRGLARQMAANRTQIIAGWCVQRMQHGEQWAWMIVVLAAMLGQIGL PGGGFGFGWHYNGAGTPGRKGVILSGFSGSTSIPPVHDNSDYKGYSSTIPIARFIDAILEPGKVINWNGKSVKLPPLKMC IFAGTNPFHRHQQINRIIEGWRKLETVIAIDNQWTSTCRFADIVLPATTQFERNDLDQYGNHSNRGIIAMKQVVPPQFEA RNDFDIFRELCRRFNREEAFTEGLDEMGWLKRIWQEGVQQGKGRGVHLPAFDDFWNNKEYVEFDHPQMFVRHQAFREDPD LEPLGTPSGLIEIYSKTIADMNYDDCQGHPMWFEKIERSHGGPGSQTYPLHLQSVHPDFRLHSQLCESETLRQQYTVAGK EPVFINPQDASARGIRNGDVVRVFNARGQVLAGAVVSDRYAPGVARIHEGAWHDPDKGGEPGALCKYGNPNVLTIDIGTS QLAQATSAHTTLVEIEKCNGTVEQVTAFNGPVEMVAQCEYVPASQVKL
Sequences:
>Translated_848_residues MNNNDLFQASRRRFLAQLGGLTVAGMLGPSLLTSRRATAAQAATEAVISKEGILTGSHWGAIRATVKDGRFVAAKPFELD KYPSKMIAGLPDHVHNAARIRYPMVRVDWLRKRHLSDTSQRGDNRFVRVSWDEALDMFYEELERVQKTHGPSALLTASGW QSTGMFHNASGMLAKAIALHGNSVGTGGDYSTGAAQVILPRVVGSMEVYEQQTSWPLVLQNSKTIVLWGSDLLKNQQANW WCPDHDVYEYYAQLKAKVAAGEIEVISIDPVVTSTHEYLGREHVKHIAVNPQTDVPLQLALAHTLYSENLYDKNFLANYC VGFEQFLPYLLGEKDGQPKDAAWAEKLTGIDAETIRGLARQMAANRTQIIAGWCVQRMQHGEQWAWMIVVLAAMLGQIGL PGGGFGFGWHYNGAGTPGRKGVILSGFSGSTSIPPVHDNSDYKGYSSTIPIARFIDAILEPGKVINWNGKSVKLPPLKMC IFAGTNPFHRHQQINRIIEGWRKLETVIAIDNQWTSTCRFADIVLPATTQFERNDLDQYGNHSNRGIIAMKQVVPPQFEA RNDFDIFRELCRRFNREEAFTEGLDEMGWLKRIWQEGVQQGKGRGVHLPAFDDFWNNKEYVEFDHPQMFVRHQAFREDPD LEPLGTPSGLIEIYSKTIADMNYDDCQGHPMWFEKIERSHGGPGSQTYPLHLQSVHPDFRLHSQLCESETLRQQYTVAGK EPVFINPQDASARGIRNGDVVRVFNARGQVLAGAVVSDRYAPGVARIHEGAWHDPDKGGEPGALCKYGNPNVLTIDIGTS QLAQATSAHTTLVEIEKCNGTVEQVTAFNGPVEMVAQCEYVPASQVKL >Mature_848_residues MNNNDLFQASRRRFLAQLGGLTVAGMLGPSLLTSRRATAAQAATEAVISKEGILTGSHWGAIRATVKDGRFVAAKPFELD KYPSKMIAGLPDHVHNAARIRYPMVRVDWLRKRHLSDTSQRGDNRFVRVSWDEALDMFYEELERVQKTHGPSALLTASGW QSTGMFHNASGMLAKAIALHGNSVGTGGDYSTGAAQVILPRVVGSMEVYEQQTSWPLVLQNSKTIVLWGSDLLKNQQANW WCPDHDVYEYYAQLKAKVAAGEIEVISIDPVVTSTHEYLGREHVKHIAVNPQTDVPLQLALAHTLYSENLYDKNFLANYC VGFEQFLPYLLGEKDGQPKDAAWAEKLTGIDAETIRGLARQMAANRTQIIAGWCVQRMQHGEQWAWMIVVLAAMLGQIGL PGGGFGFGWHYNGAGTPGRKGVILSGFSGSTSIPPVHDNSDYKGYSSTIPIARFIDAILEPGKVINWNGKSVKLPPLKMC IFAGTNPFHRHQQINRIIEGWRKLETVIAIDNQWTSTCRFADIVLPATTQFERNDLDQYGNHSNRGIIAMKQVVPPQFEA RNDFDIFRELCRRFNREEAFTEGLDEMGWLKRIWQEGVQQGKGRGVHLPAFDDFWNNKEYVEFDHPQMFVRHQAFREDPD LEPLGTPSGLIEIYSKTIADMNYDDCQGHPMWFEKIERSHGGPGSQTYPLHLQSVHPDFRLHSQLCESETLRQQYTVAGK EPVFINPQDASARGIRNGDVVRVFNARGQVLAGAVVSDRYAPGVARIHEGAWHDPDKGGEPGALCKYGNPNVLTIDIGTS QLAQATSAHTTLVEIEKCNGTVEQVTAFNGPVEMVAQCEYVPASQVKL
Specific function: Reduces trimethylamine-N-oxide (TMAO) into trimethylamine; an anaerobic reaction coupled to energy-yielding reactions
COG id: COG0243
COG function: function code C; Anaerobic dehydrogenases, typically selenocysteine-containing
Gene ontology:
Cell location: Periplasm
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the prokaryotic molybdopterin-containing oxidoreductase family
Homologues:
Organism=Escherichia coli, GI1787231, Length=847, Percent_Identity=99.2916174734357, Blast_Score=1755, Evalue=0.0
Organism=Escherichia coli, GI87081994, Length=785, Percent_Identity=45.2229299363057, Blast_Score=666, Evalue=0.0
Organism=Escherichia coli, GI145693196, Length=790, Percent_Identity=42.6582278481013, Blast_Score=614, Evalue=1e-177
Organism=Escherichia coli, GI87081797, Length=761, Percent_Identity=31.9316688567674, Blast_Score=300, Evalue=4e-82
Organism=Escherichia coli, GI1787870, Length=864, Percent_Identity=29.2824074074074, Blast_Score=277, Evalue=2e-75
Organism=Escherichia coli, GI171474008, Length=854, Percent_Identity=29.3911007025761, Blast_Score=259, Evalue=6e-70
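The homologue entries follow a regular `key=value` pattern, so they can be read into structured records. The sketch below is one possible way to do that with a regular expression; the `Homologue` type and `parse_homologues` function are hypothetical names, and the pattern is inferred only from the entries shown in this record.

```python
import re
from typing import List, NamedTuple


class Homologue(NamedTuple):
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


# Pattern inferred from the entries in this record (an assumption, not a spec).
_HOMOLOGUE_RE = re.compile(
    r"Organism=([^,]+),\s*(GI\d+),\s*Length=(\d+),\s*"
    r"Percent_Identity=([\d.]+),\s*Blast_Score=(\d+),\s*Evalue=([^,\s]+)"
)


def parse_homologues(text: str) -> List[Homologue]:
    """Extract every homologue entry found in the given text."""
    return [
        Homologue(org.strip(), gi, int(length), float(ident), int(score), float(ev))
        for org, gi, length, ident, score, ev in _HOMOLOGUE_RE.findall(text)
    ]


if __name__ == "__main__":
    sample = (
        "Organism=Escherichia coli, GI1787231, Length=847, "
        "Percent_Identity=99.2916174734357, Blast_Score=1755, Evalue=0.0, "
        "Organism=Escherichia coli, GI145693196, Length=790, "
        "Percent_Identity=42.6582278481013, Blast_Score=614, Evalue=1e-177,"
    )
    for hit in parse_homologues(sample):
        print(hit.gi, hit.length, round(hit.percent_identity, 1), hit.evalue)
```

Because the pattern scans the whole text, it accepts entries written on a single line or one per line.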
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): TORA_ECO57 (P58360)
Other databases:
- EMBL: AE005174
- EMBL: BA000007
- PIR: D85635
- PIR: H90772
- RefSeq: NP_286933.1
- RefSeq: NP_309179.1
- ProteinModelPortal: P58360
- SMR: P58360
- EnsemblBacteria: EBESCT00000028320
- EnsemblBacteria: EBESCT00000055968
- GeneID: 913030
- GeneID: 959060
- GenomeReviews: AE005174_GR
- GenomeReviews: BA000007_GR
- KEGG: ece:Z1415
- KEGG: ecs:ECs1152
- GeneTree: EBGT00050000008999
- HOGENOM: HBG304064
- OMA: QQYAVGG
- ProtClustDB: PRK15102
- BioCyc: ECOL83334:ECS1152-MONOMER
- InterPro: IPR009010
- InterPro: IPR006658
- InterPro: IPR006657
- InterPro: IPR006656
- InterPro: IPR006655
- InterPro: IPR006311
- InterPro: IPR011887
- Gene3D: G3DSA:2.40.40.20
- TIGRFAMs: TIGR00509
- TIGRFAMs: TIGR01409
- TIGRFAMs: TIGR02164
Pfam domain/function: PF00384 Molybdopterin; PF01568 Molydop_binding; SSF50692 Asp_decarb_fold
EC number: 1.7.2.3
Molecular weight: Translated: 94447; Mature: 94447
Theoretical pI: Translated: 6.84; Mature: 6.84
Prosite motif: PS00551 MOLYBDOPTERIN_PROK_1; PS00490 MOLYBDOPTERIN_PROK_2; PS00932 MOLYBDOPTERIN_PROK_3; PS51318 TAT
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.3 %Cys (Translated Protein)
2.2 %Met (Translated Protein)
3.5 %Cys+Met (Translated Protein)
1.3 %Cys (Mature Protein)
2.2 %Met (Mature Protein)
3.5 %Cys+Met (Mature Protein)
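Percentages of this kind can be derived by counting residues in the protein sequence. The following is a minimal sketch under the assumption that each figure is the residue count divided by the total sequence length; `residue_percent` is an illustrative helper, not part of any database tooling.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any one-letter code in `residues`."""
    cleaned = "".join(protein.split()).upper()
    if not cleaned:
        return 0.0
    wanted = set(residues.upper())
    hits = sum(1 for aa in cleaned if aa in wanted)
    return 100.0 * hits / len(cleaned)


if __name__ == "__main__":
    # Toy sequence for illustration; substitute the 848-residue sequence from this record.
    toy = "MCAAMAAAAA"
    print("%Cys:", residue_percent(toy, "C"))
    print("%Met:", residue_percent(toy, "M"))
    print("%Cys+Met:", residue_percent(toy, "CM"))
```

Whitespace is stripped first, so the sequence can be pasted in with the spaces that separate its 80-residue blocks.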
Secondary structure:
>Translated Secondary Structure MNNNDLFQASRRRFLAQLGGLTVAGMLGPSLLTSRRATAAQAATEAVISKEGILTGSHWG CCCCHHHHHHHHHHHHHHCCCEEEHHHCHHHHHHHHHHHHHHHHHHHHHHCCEEECCCCC AIRATVKDGRFVAAKPFELDKYPSKMIAGLPDHVHNAARIRYPMVRVDWLRKRHLSDTSQ EEEEEECCCEEEEECCCCCCCCHHHHHHCCCHHHCCHHHHCCCHHHHHHHHHHHCCCHHH RGDNRFVRVSWDEALDMFYEELERVQKTHGPSALLTASGWQSTGMFHNASGMLAKAIALH CCCCCEEEEEHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCEECCCCHHHHHHHHC GNSVGTGGDYSTGAAQVILPRVVGSMEVYEQQTSWPLVLQNSKTIVLWGSDLLKNQQANW CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEECCCEEEEECHHHHCCCCCCC WCPDHDVYEYYAQLKAKVAAGEIEVISIDPVVTSTHEYLGREHVKHIAVNPQTDVPLQLA CCCCCHHHHHHHHHHHHHCCCEEEEEEECCCHHHHHHHHHHHHHHEEECCCCCCCCHHHH LAHTLYSENLYDKNFLANYCVGFEQFLPYLLGEKDGQPKDAAWAEKLTGIDAETIRGLAR HHHHHHHCCCCCHHHHHHHHCCHHHHHHHHHCCCCCCCCHHHHHHHHCCCCHHHHHHHHH QMAANRTQIIAGWCVQRMQHGEQWAWMIVVLAAMLGQIGLPGGGFGFGWHYNGAGTPGRK HHHCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCCCCCCCCEEECCCCCCCCC GVILSGFSGSTSIPPVHDNSDYKGYSSTIPIARFIDAILEPGKVINWNGKSVKLPPLKMC CEEEECCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCEEECCCCEEECCCEEEE IFAGTNPFHRHQQINRIIEGWRKLETVIAIDNQWTSTCRFADIVLPATTQFERNDLDQYG EEECCCHHHHHHHHHHHHHHHHHHEEEEEECCCCCCCCEEEEEEECCCCCCCHHHHHHHC NHSNRGIIAMKQVVPPQFEARNDFDIFRELCRRFNREEAFTEGLDEMGWLKRIWQEGVQQ CCCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHCHHHHHHHCHHHHHHHHHHHHHHHHH GKGRGVHLPAFDDFWNNKEYVEFDHPQMFVRHQAFREDPDLEPLGTPSGLIEIYSKTIAD CCCCEEECCCHHHHCCCCCEEEECCHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHC MNYDDCQGHPMWFEKIERSHGGPGSQTYPLHLQSVHPDFRLHSQLCESETLRQQYTVAGK CCCCCCCCCCHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHCCCC EPVFINPQDASARGIRNGDVVRVFNARGQVLAGAVVSDRYAPGVARIHEGAWHDPDKGGE CCEEECCCCCCCCCCCCCCEEEEECCCCCEEECHHHHCCCCCCHHHHCCCCCCCCCCCCC PGALCKYGNPNVLTIDIGTSQLAQATSAHTTLVEIEKCNGTVEQVTAFNGPVEMVAQCEY CCCEEECCCCCEEEEECCHHHHHHHHCCCEEEEEEECCCCCHHHHHHCCCCHHHHHHCCC VPASQVKL CCHHCCCC >Mature Secondary Structure MNNNDLFQASRRRFLAQLGGLTVAGMLGPSLLTSRRATAAQAATEAVISKEGILTGSHWG CCCCHHHHHHHHHHHHHHCCCEEEHHHCHHHHHHHHHHHHHHHHHHHHHHCCEEECCCCC AIRATVKDGRFVAAKPFELDKYPSKMIAGLPDHVHNAARIRYPMVRVDWLRKRHLSDTSQ 
EEEEEECCCEEEEECCCCCCCCHHHHHHCCCHHHCCHHHHCCCHHHHHHHHHHHCCCHHH RGDNRFVRVSWDEALDMFYEELERVQKTHGPSALLTASGWQSTGMFHNASGMLAKAIALH CCCCCEEEEEHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCEECCCCHHHHHHHHC GNSVGTGGDYSTGAAQVILPRVVGSMEVYEQQTSWPLVLQNSKTIVLWGSDLLKNQQANW CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEECCCEEEEECHHHHCCCCCCC WCPDHDVYEYYAQLKAKVAAGEIEVISIDPVVTSTHEYLGREHVKHIAVNPQTDVPLQLA CCCCCHHHHHHHHHHHHHCCCEEEEEEECCCHHHHHHHHHHHHHHEEECCCCCCCCHHHH LAHTLYSENLYDKNFLANYCVGFEQFLPYLLGEKDGQPKDAAWAEKLTGIDAETIRGLAR HHHHHHHCCCCCHHHHHHHHCCHHHHHHHHHCCCCCCCCHHHHHHHHCCCCHHHHHHHHH QMAANRTQIIAGWCVQRMQHGEQWAWMIVVLAAMLGQIGLPGGGFGFGWHYNGAGTPGRK HHHCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCCCCCCCCEEECCCCCCCCC GVILSGFSGSTSIPPVHDNSDYKGYSSTIPIARFIDAILEPGKVINWNGKSVKLPPLKMC CEEEECCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCEEECCCCEEECCCEEEE IFAGTNPFHRHQQINRIIEGWRKLETVIAIDNQWTSTCRFADIVLPATTQFERNDLDQYG EEECCCHHHHHHHHHHHHHHHHHHEEEEEECCCCCCCCEEEEEEECCCCCCCHHHHHHHC NHSNRGIIAMKQVVPPQFEARNDFDIFRELCRRFNREEAFTEGLDEMGWLKRIWQEGVQQ CCCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHCHHHHHHHCHHHHHHHHHHHHHHHHH GKGRGVHLPAFDDFWNNKEYVEFDHPQMFVRHQAFREDPDLEPLGTPSGLIEIYSKTIAD CCCCEEECCCHHHHCCCCCEEEECCHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHC MNYDDCQGHPMWFEKIERSHGGPGSQTYPLHLQSVHPDFRLHSQLCESETLRQQYTVAGK CCCCCCCCCCHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHCCCC EPVFINPQDASARGIRNGDVVRVFNARGQVLAGAVVSDRYAPGVARIHEGAWHDPDKGGE CCEEECCCCCCCCCCCCCCEEEEECCCCCEEECHHHHCCCCCCHHHHCCCCCCCCCCCCC PGALCKYGNPNVLTIDIGTSQLAQATSAHTTLVEIEKCNGTVEQVTAFNGPVEMVAQCEY CCCEEECCCCCEEEEECCHHHHHHHHCCCEEEEEEECCCCCHHHHHHCCCCHHHHHHCCC VPASQVKL CCHHCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796