Definition | Escherichia coli O157:H7 str. EC4115, complete genome. |
---|---|
Accession | NC_011353 |
Length | 5,572,075 |
The map label for this gene is mdtO
Identifier: 209399505
GI number: 209399505
Start: 5228913
End: 5230964
Strand: Reverse
Name: mdtO
Synonym: ECH74115_5589
Alternate gene names: 209399505
Gene position: 5230964-5228913 (Counterclockwise)
Preceding gene: 209400246
Following gene: 209399819
Centisome position: 93.88
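The coordinate fields above can be cross-checked with a short calculation. The sketch below assumes 1-based inclusive coordinates, and assumes the centisome position is the gene's 5' coordinate (the End value for a reverse-strand gene) as a percentage of the genome length. Both assumptions are consistent with the listed values, but the record does not state how they were computed. The function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 5_572_075
START, END = 5_228_913, 5_230_964
STRAND = "reverse"


def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the genome length."""
    return 100.0 * position / genome_length


# Assumption: a reverse-strand gene begins at its higher coordinate.
five_prime = END if STRAND == "reverse" else START

print(gene_length(START, END))                          # 2052
print(round(centisome(five_prime, GENOME_LENGTH), 2))   # 93.88
```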
GC content: 54.48
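GC content is conventionally the percentage of G and C bases in the sequence. A minimal sketch of that calculation follows. The helper name `gc_content` is hypothetical, and the example input is only the first ten bases of the gene sequence, not the full gene.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace and letter case are ignored."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First ten bases of the gene sequence in this record: 7 of 10 are G or C.
print(gc_content("ATGAGCGCGC"))  # 70.0
```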
Gene sequence:
>2052_bases ATGAGCGCGCTCAACTCCCTGCCATTACCGGTGGTCAGGCTGCTGGCGTTCTTTCATGAAGAGTTAAGCGAGCGGCGACC AGGTCGCGTGCCGCAGACCGTGCAACTCTGGGTAGGCTGCCTGCTGGTGATTCTGATCTCGATGACTTTTGAGATCCCTT TTGTGGCGTTATCGCTGGCAGTGCTGTTTTACGGTATTCAGTCGAACGCGTTTTACACCAAATTTGTCGCGATCTTGTTT GTGGTTGCCACGGTGCTGGAGATCGGCAGCCTGTTTTTGATCTACAAATGGTCATACGGCGAACCGTTGATCCGCTTGAT CATCGCCGGGCCGATCCTGATGGGCTGCATGTTTTTGATGCGCACCCATCGGTTGGGACTGGTCTTTTTCGCTGTCGCCA TTGTCGCCATTTATGGGCAAACCTTCCCCGCCATGCTCGACTATCCGGAAGTGGTCGTGCGCTTAACGCTGTGGTGTATC GTTGTTGGCCTCTACCCGACGTTATTAATGACGTTAATCGGCGTACTGTGGTTTCCCAGTCGTGCCATTTCGCAAATGCA TCAAGCGCTTAATGATCGGCTTGATGATGCCATTAGCCACCTGACGGACAGCCTCGCACCGCTACCCGAAACGCGGATTG AAAGAGAGGCGCTGGCGCTGCAAAAACTCAATGTCTTTTGCCTCGCGGACGATGCCAACTGGCGAACTCAAAGCGCATGG TGGCAAAGCTGCGTGGCAACGGTAACCTACATTTACTCGACGCTGAATCGCTACGATCCCACCTCTTTTGCTGATTCTCA GGCAATTATTGAATTCCGACAAAAATTAGCTTCAGAAATCAACAAGCTGCAGCATGCCATTACCGAAGGTCAGTGCTGGC AAAGCGACTGGCGGATCAGTGAAAGTGAAGCGATGGCGGCACGGGAATGTAACCTGGAGAATATCTGCCAGACGTTGTTA CAACTGGGTCAGATGGACCCGAATACGCCGCCAACGCCCGCCGCCAAACCGCCATCAATGGTCGCTGATGCTTTTACCAA TCCAGACTATATGCGCTACGCGGTAAAAACGCTGCTCGCCTGTTTGATCTGTTACACCTTCTACAGCGGCGTGGACTGGG AAGGCATTCACACCTGTATGCTGACCTGCGTGATCGTCGCTAATCCGAATGTCGGTTCATCGTACCAGAAGATGGTGCTG CGTTTTGGCGGGGCCTTTTGCGGCGCGATTCTGGCGCTGTTATTCACGCTACTGGTCATGCCCTGGCTGGACAATATTGT CGAATTGCTGTTTGTGCTGGCACCGATTTTCCTGTTGGGCGCATGGATTGCCACCAGCTCTGAACGCTCTTCTTATATCG GCACACAGATGGTGGTCACCTTCGCGCTCGCCACGCTCGAAAACGTTTTTGGTCCGGTGTACGACCTGGTGGAAATTCGC GATCGCGCCCTGGGTATCATCATTGGTACCGTGGTGTCTGCGGTGATTTACACCTTTGTCTGGCCTGAAAGTGAAGCGCG CACGCTGCCGCAAAAACTGGCTGGCGCGCTGGGTATGCTAAGTAAAGTAATGCGGATCCCACGCCAGCAGGAAGTCACGG CTCTGCGCACTTATCTGCAAATTCGTATAGGTCTGCATGCGGCGTTTAATGCCTGTGAAGAGATGTGCCAACGCGTGGCG CTGGAGCGTCAACTGGACAGCGAAGAACGCGCCTTACTGATTGAACGTTCGCAAACGGTTATTCATCAGGGCCGCGATCT TCTTCACGCCTGGGATGCCACCTGGAACTCGGCGCAGGCGCTGGATAACGCACTACAGCCGGACAAAGCAGGTCAGTTTG CCGACGCCCTGGAAAAATACGCTGCCGGTCTGGCAACCGCACTCAGCCGTTCTCCTCAAATAACGCTTGAAGAGACACCC 
GCCTCGCAGGCCATCCTGCCCACCTTATTAAAACAGGAGCAACACGTCTGCCAGCTTTTCGCCCGCTTGCCAGACTGGAC AGCCCCGGCATTAACGCCCGCCACGGAACAGGCACAAGGAGCCACGCAATGA
Upstream 100 bases:
>100_bases GTGTTGCCCAGCGTTTTCCGGTCAAAATCATGGTTGATAAACCTGACCCGGAAATGTTCCGCATCGGCGCTTCGGCAGTC GCTAATCTTGAGCCGCAATA
Downstream 100 bases:
>100_bases TCAATCGTCAACTTTCACGTCTGCTGTTGTGCAGCATTCTCGGCAGCACGACGCTGATTTCCGGCTGTGCCCTGGTACGC AAGGATTCTGCGCCTCATCA
Product: multidrug efflux system protein MdtO
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 683; Mature: 682
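The two residue counts follow from the gene length. 2052 bases give 684 codons, the last of which (TGA) is a stop codon, leaving 683 translated residues. The mature form in this record is one residue shorter because it lacks the initiator methionine. A sketch of the arithmetic, with a hypothetical function name:

```python
def residue_counts(n_bases: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes the sequence ends with a stop codon and that the mature
    protein differs only by removal of the initiator methionine.
    """
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = n_bases // 3
    translated = codons - 1   # the stop codon encodes no residue
    mature = translated - 1   # initiator Met removed
    return translated, mature


print(residue_counts(2052))  # (683, 682)
```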
Protein sequence:
>683_residues MSALNSLPLPVVRLLAFFHEELSERRPGRVPQTVQLWVGCLLVILISMTFEIPFVALSLAVLFYGIQSNAFYTKFVAILF VVATVLEIGSLFLIYKWSYGEPLIRLIIAGPILMGCMFLMRTHRLGLVFFAVAIVAIYGQTFPAMLDYPEVVVRLTLWCI VVGLYPTLLMTLIGVLWFPSRAISQMHQALNDRLDDAISHLTDSLAPLPETRIEREALALQKLNVFCLADDANWRTQSAW WQSCVATVTYIYSTLNRYDPTSFADSQAIIEFRQKLASEINKLQHAITEGQCWQSDWRISESEAMAARECNLENICQTLL QLGQMDPNTPPTPAAKPPSMVADAFTNPDYMRYAVKTLLACLICYTFYSGVDWEGIHTCMLTCVIVANPNVGSSYQKMVL RFGGAFCGAILALLFTLLVMPWLDNIVELLFVLAPIFLLGAWIATSSERSSYIGTQMVVTFALATLENVFGPVYDLVEIR DRALGIIIGTVVSAVIYTFVWPESEARTLPQKLAGALGMLSKVMRIPRQQEVTALRTYLQIRIGLHAAFNACEEMCQRVA LERQLDSEERALLIERSQTVIHQGRDLLHAWDATWNSAQALDNALQPDKAGQFADALEKYAAGLATALSRSPQITLEETP ASQAILPTLLKQEQHVCQLFARLPDWTAPALTPATEQAQGATQ
Sequences:
>Translated_683_residues MSALNSLPLPVVRLLAFFHEELSERRPGRVPQTVQLWVGCLLVILISMTFEIPFVALSLAVLFYGIQSNAFYTKFVAILF VVATVLEIGSLFLIYKWSYGEPLIRLIIAGPILMGCMFLMRTHRLGLVFFAVAIVAIYGQTFPAMLDYPEVVVRLTLWCI VVGLYPTLLMTLIGVLWFPSRAISQMHQALNDRLDDAISHLTDSLAPLPETRIEREALALQKLNVFCLADDANWRTQSAW WQSCVATVTYIYSTLNRYDPTSFADSQAIIEFRQKLASEINKLQHAITEGQCWQSDWRISESEAMAARECNLENICQTLL QLGQMDPNTPPTPAAKPPSMVADAFTNPDYMRYAVKTLLACLICYTFYSGVDWEGIHTCMLTCVIVANPNVGSSYQKMVL RFGGAFCGAILALLFTLLVMPWLDNIVELLFVLAPIFLLGAWIATSSERSSYIGTQMVVTFALATLENVFGPVYDLVEIR DRALGIIIGTVVSAVIYTFVWPESEARTLPQKLAGALGMLSKVMRIPRQQEVTALRTYLQIRIGLHAAFNACEEMCQRVA LERQLDSEERALLIERSQTVIHQGRDLLHAWDATWNSAQALDNALQPDKAGQFADALEKYAAGLATALSRSPQITLEETP ASQAILPTLLKQEQHVCQLFARLPDWTAPALTPATEQAQGATQ >Mature_682_residues SALNSLPLPVVRLLAFFHEELSERRPGRVPQTVQLWVGCLLVILISMTFEIPFVALSLAVLFYGIQSNAFYTKFVAILFV VATVLEIGSLFLIYKWSYGEPLIRLIIAGPILMGCMFLMRTHRLGLVFFAVAIVAIYGQTFPAMLDYPEVVVRLTLWCIV VGLYPTLLMTLIGVLWFPSRAISQMHQALNDRLDDAISHLTDSLAPLPETRIEREALALQKLNVFCLADDANWRTQSAWW QSCVATVTYIYSTLNRYDPTSFADSQAIIEFRQKLASEINKLQHAITEGQCWQSDWRISESEAMAARECNLENICQTLLQ LGQMDPNTPPTPAAKPPSMVADAFTNPDYMRYAVKTLLACLICYTFYSGVDWEGIHTCMLTCVIVANPNVGSSYQKMVLR FGGAFCGAILALLFTLLVMPWLDNIVELLFVLAPIFLLGAWIATSSERSSYIGTQMVVTFALATLENVFGPVYDLVEIRD RALGIIIGTVVSAVIYTFVWPESEARTLPQKLAGALGMLSKVMRIPRQQEVTALRTYLQIRIGLHAAFNACEEMCQRVAL ERQLDSEERALLIERSQTVIHQGRDLLHAWDATWNSAQALDNALQPDKAGQFADALEKYAAGLATALSRSPQITLEETPA SQAILPTLLKQEQHVCQLFARLPDWTAPALTPATEQAQGATQ
Specific function: Could be involved in resistance to puromycin, acriflavine and tetraphenylarsonium chloride
COG id: COG1289
COG function: function code S; Predicted membrane protein
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the MdtO family
Homologues:
Organism=Escherichia coli, GI87082367, Length=683, Percent_Identity=98.9751098096633, Blast_Score=1387, Evalue=0.0,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): MDTO_ECO57 (Q8X5R8)
Other databases:
- EMBL: AE005174
- EMBL: BA000007
- PIR: C86102
- PIR: G91261
- RefSeq: NP_290714.2
- RefSeq: NP_313090.2
- EnsemblBacteria: EBESCT00000025265
- EnsemblBacteria: EBESCT00000059985
- GeneID: 914278
- GeneID: 960043
- GenomeReviews: AE005174_GR
- GenomeReviews: BA000007_GR
- KEGG: ece:Z5681
- KEGG: ecs:ECs5063
- GeneTree: EBGT00050000011297
- HOGENOM: HBG467485
- OMA: DWRITES
- ProtClustDB: PRK11427
- BioCyc: ECOL83334:ECS5063-MONOMER
- GO: GO:0006810
- InterPro: IPR006726
Pfam domain/function: PF04632 FUSC
EC number: NA
Molecular weight: Translated: 76120; Mature: 75988
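The translated and mature weights differ by 132, which is roughly the mass of one methionine residue and fits the removal of the initiator Met. A protein's average molecular weight can be estimated by summing residue masses and adding one water. The sketch below uses commonly tabulated average residue masses. That table is an assumption, since the record does not say which mass table produced its values, so results may differ slightly from the listed figures.

```python
# Average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "A": 71.0788, "R": 156.1875, "N": 114.1038, "D": 115.0886,
    "C": 103.1388, "E": 129.1155, "Q": 128.1307, "G": 57.0519,
    "H": 137.1411, "I": 113.1594, "L": 113.1594, "K": 128.1741,
    "M": 131.1926, "F": 147.1766, "P": 97.1167, "S": 87.0782,
    "T": 101.1051, "W": 186.2132, "Y": 163.1760, "V": 99.1326,
}
WATER = 18.0153


def molecular_weight(protein: str) -> float:
    """Average molecular weight of an unmodified linear peptide."""
    protein = "".join(protein.split()).upper()
    return sum(RESIDUE_MASS[aa] for aa in protein) + WATER


# Dropping a leading Met lowers the weight by one Met residue mass.
full = molecular_weight("MSALNSLPLP")    # first ten translated residues
mature = molecular_weight("SALNSLPLP")   # same peptide without the Met
print(round(full - mature, 2))           # 131.19
```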
Theoretical pI: Translated: 5.24; Mature: 5.24
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
9 predicted regions (coordinates not available)
Cys/Met content:
2.3 %Cys (Translated Protein)
2.8 %Met (Translated Protein)
5.1 %Cys+Met (Translated Protein)
2.3 %Cys (Mature Protein)
2.6 %Met (Mature Protein)
5.0 %Cys+Met (Mature Protein)
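Percentages of this kind are residue counts divided by sequence length. A minimal sketch follows. The function name `residue_percent` is hypothetical, and the example peptide is a toy input rather than the MdtO sequence.

```python
from collections import Counter


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the sequence made up of the given one-letter residues."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    counts = Counter(protein)
    return 100.0 * sum(counts[r] for r in residues.upper()) / len(protein)


toy = "MCAAMAAAAA"  # 10 residues: 2 Met, 1 Cys
print(residue_percent(toy, "C"))    # 10.0
print(residue_percent(toy, "M"))    # 20.0
print(residue_percent(toy, "CM"))   # 30.0
```

Removing an initiator Met lowers the Met count by one while the Cys count stays the same, which matches the pattern in the listed values: the Met percentage drops for the mature protein and the Cys percentage does not.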
Secondary structure:
>Translated Secondary Structure
MSALNSLPLPVVRLLAFFHEELSERRPGRVPQTVQLWVGCLLVILISMTFEIPFVALSLA
CCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
VLFYGIQSNAFYTKFVAILFVVATVLEIGSLFLIYKWSYGEPLIRLIIAGPILMGCMFLM
HHHHHHCCCHHHHHHHHHHHHHHHHHHHCCEEEEEEECCCCHHHHHHHHHHHHHHHHHHH
RTHRLGLVFFAVAIVAIYGQTFPAMLDYPEVVVRLTLWCIVVGLYPTLLMTLIGVLWFPS
HHHHHHHHHHHHHHHHHHHCCCHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCH
RAISQMHQALNDRLDDAISHLTDSLAPLPETRIEREALALQKLNVFCLADDANWRTQSAW
HHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCEEEEECCCCCCHHHHH
WQSCVATVTYIYSTLNRYDPTSFADSQAIIEFRQKLASEINKLQHAITEGQCWQSDWRIS
HHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC
ESEAMAARECNLENICQTLLQLGQMDPNTPPTPAAKPPSMVADAFTNPDYMRYAVKTLLA
HHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHCCCHHHHHHHHHHHHH
CLICYTFYSGVDWEGIHTCMLTCVIVANPNVGSSYQKMVLRFGGAFCGAILALLFTLLVM
HHHHHHHHCCCCCHHHHHHHHHHHHEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH
PWLDNIVELLFVLAPIFLLGAWIATSSERSSYIGTQMVVTFALATLENVFGPVYDLVEIR
HHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
DRALGIIIGTVVSAVIYTFVWPESEARTLPQKLAGALGMLSKVMRIPRQQEVTALRTYLQ
HHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH
IRIGLHAAFNACEEMCQRVALERQLDSEERALLIERSQTVIHQGRDLLHAWDATWNSAQA
HHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCHHHHHHHHCCCCHHHH
LDNALQPDKAGQFADALEKYAAGLATALSRSPQITLEETPASQAILPTLLKQEQHVCQLF
HHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHHHHHH
ARLPDWTAPALTPATEQAQGATQ
HHCCCCCCCCCCCCCCCCCCCCH
>Mature Secondary Structure
SALNSLPLPVVRLLAFFHEELSERRPGRVPQTVQLWVGCLLVILISMTFEIPFVALSLA
CCCCCCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
VLFYGIQSNAFYTKFVAILFVVATVLEIGSLFLIYKWSYGEPLIRLIIAGPILMGCMFLM
HHHHHHCCCHHHHHHHHHHHHHHHHHHHCCEEEEEEECCCCHHHHHHHHHHHHHHHHHHH
RTHRLGLVFFAVAIVAIYGQTFPAMLDYPEVVVRLTLWCIVVGLYPTLLMTLIGVLWFPS
HHHHHHHHHHHHHHHHHHHCCCHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCH
RAISQMHQALNDRLDDAISHLTDSLAPLPETRIEREALALQKLNVFCLADDANWRTQSAW
HHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCEEEEECCCCCCHHHHH
WQSCVATVTYIYSTLNRYDPTSFADSQAIIEFRQKLASEINKLQHAITEGQCWQSDWRIS
HHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC
ESEAMAARECNLENICQTLLQLGQMDPNTPPTPAAKPPSMVADAFTNPDYMRYAVKTLLA
HHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHCCCHHHHHHHHHHHHH
CLICYTFYSGVDWEGIHTCMLTCVIVANPNVGSSYQKMVLRFGGAFCGAILALLFTLLVM
HHHHHHHHCCCCCHHHHHHHHHHHHEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH
PWLDNIVELLFVLAPIFLLGAWIATSSERSSYIGTQMVVTFALATLENVFGPVYDLVEIR
HHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
DRALGIIIGTVVSAVIYTFVWPESEARTLPQKLAGALGMLSKVMRIPRQQEVTALRTYLQ
HHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH
IRIGLHAAFNACEEMCQRVALERQLDSEERALLIERSQTVIHQGRDLLHAWDATWNSAQA
HHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCHHHHHHHHCCCCHHHH
LDNALQPDKAGQFADALEKYAAGLATALSRSPQITLEETPASQAILPTLLKQEQHVCQLF
HHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHHHHHH
ARLPDWTAPALTPATEQAQGATQ
HHCCCCCCCCCCCCCCCCCCCCH
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796