| Definition | Escherichia coli HS, complete genome. |
|---|---|
| Accession | NC_009800 |
| Length | 4,643,538 |
The map label for this gene is mhpA
Identifier: 157159863
GI number: 157159863
Start: 433117
End: 434781
Strand: Direct
Name: mhpA
Synonym: EcHS_A0411
Alternate gene names: 157159863
Gene position: 433117-434781 (Clockwise)
Preceding gene: 157159858
Following gene: 157159864
Centisome position: 9.33
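The centisome value appears consistent with the gene start expressed as a percentage of the genome length given above. A minimal sketch of that arithmetic, assuming this definition (the record does not state the formula, and the function name `centisome_position` is hypothetical):

```python
def centisome_position(start: int, genome_length: int) -> float:
    """Return the gene start as a percentage of the chromosome length.

    Assumption: centisome = 100 * start / genome_length. The record does
    not state its formula; this definition reproduces the listed value.
    """
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    return 100.0 * start / genome_length


# Values taken from this record: Start = 433117, Length = 4,643,538.
print(round(centisome_position(433117, 4643538), 2))  # 9.33
```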
GC content: 56.82
Gene sequence:
>1665_bases ATGGCAATACAACACCCTGACATCCAGCCTGCTGTTAACCATAGCGTTCAGGTGGCGATCGCTGGTGCCGGTCCGGTTGG GCTGATGATGGCGAACTATCTCGGTCAGATGGGCATTGACGTGCTGGTGGTGGAGAAACTCGATAAGTTGATCGACTACC CGCGTGCGATTGGTATTGATGACGAGGCGCTGCGCACCATGCAGTCGGTCGGCCTGGTCGAGAATGTTCTGCCGCACACT ACACCGTGGCACGCGATGCGTTTTCTCACCCCAAAAGGCCGCTGTTTTGCTGATATTCAGCCAATGACCGATGAATTTGG CTGGCCGCGCCGTAACGCCTTTATTCAGCCACAGGTCGATGCGGTGATGCTGGAAGGGTTGTCGCGTTTTCCGAATGTGC GCTGCTTGTTTGCCCGCGAGCTGGAGGCCTTCAGCCAGCAAAATGACGAAGTGACCTTGCACCTGAAAACGGCAGAAGGG CAGCGGGAAACGGTCAAAGCCCAGTGGCTGGTAGCCTGTGACGGTGGAGCAAGTTTTGTCCGTCGCACTCTGAATGTGCC GTTTGAAGGTAAAACTGCGCCAAATCAGTGGATTGTGGTAGATATCGCCAACGATCCGTTAAGTACGCCGCATATCTATT TGTGTTGCGATCCGGTGCGCCCGTATGTTTCTGCCGCGCTGCCTCATGCGGTACGTCGCTTTGAATTTATGGTGATGCCG GGAGAAACCGAAGAGCAGCTGCGTGAGCCGCAAAATATGCGCAAGCTGTTAAGCAAAGTGCTGCCTAATCCGGACAATGT TGAATTGATTCGCCAGCGTGTCTACACCCACAACGCGCGACTGGCGCAACGTTTCCGTATTGATCGCGTACTGCTGGCGG GCGATGCCGCGCACATCATGCCGGTATGGCAGGGGCAGGGCTATAACAGTGGTATGCGCGACGCCTTTAACCTCGCATGG AAACTGGCGTTGGTTATCCAGGGGAAAGCCCGCGATGCGCTGCTCGATACCTATCAACAAGAACGTCGCGATCACGCCAA AGCGATGATTGACCTGTCCGTGACGGCGGGCAACGTGCTGGCTCCGCCGAAACGCTGGCAGGGTACGTTACGTGACGGCG TTTCCTGGCTGTTGAATTATCTGCCGCCAGTAAAACGCTACTTCCTCGAAATGCGCTTCAAGCCGATGCCGCAATATTAC GGCGGTGCGCTGATGCGTGAGGGCGAAGCGAAGCACTCTCCGGTCGGCAAGATGTTTATTCAGCCGAAAGTCACGCTGGA AAACGGCGACGTGACGCTGCTCGATAACGCGATCGGCGCGAACTTCGCGGTAATTGGCTGGGGATGCAATCCACTGTGGG GGATGAGCGACGAGCAAATCCAGCAGTGGCGCGCGTTGGGCACACGCTTCATTCAGGTGGTGCCGGAAGTGCAAATTCAT ACCGCACAGGATAACCACGACGGCGTACTACGCGTGGGCGATACGCAAGGTCGCCTGCGTAGCTGGTTCGCGCAACACAA TGCTTCGCTGGTGGTGATGCGCCCGGATCGCTTTGTTGCCGCCACCGCCATTCCGCAAACCCTGGGCAAGACCCTGAATA AACTGGCGTCGGTGATGACGCTGACCCGCCCTGATGCCGACGTTTCTGTCGAAAAGGTAGCCTGA
Upstream 100 bases:
>100_bases ACTGAGCGCACAATAAAAAATCATTTACATGTTTTTAACAAAATAAGTTGCGCTGTACTGTGCGCGCAACGGCATTTTGT CCGAGTCGTGAGGTACTGAA
Downstream 100 bases:
>100_bases TATGCACGCTTATCTTCACTGTCTTTCCCACTCGCCGCTGGTGGGGTATGTCGACCCGGCGCAAGAGGTGCTCGATGAGG TCAATGGCGTGATTGCCAGC
Product: 3-(3-hydroxyphenyl)propionate hydroxylase
Products: NA
Alternate protein names: 3-HCI hydroxylase; 3-HPP hydroxylase
Number of amino acids: Translated: 554; Mature: 553
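The residue counts can be cross-checked against the gene coordinates. The sketch below assumes inclusive 1-based coordinates, a terminal stop codon (the listed gene sequence ends in TGA), and removal of the initiator methionine in the mature form (the mature sequence in this record lacks the leading M). The function name `gene_summary` and its parameters are hypothetical.

```python
def gene_summary(start: int, end: int,
                 has_stop_codon: bool = True,
                 initiator_met_removed: bool = True) -> tuple[int, int, int]:
    """Return (bases, translated residues, mature residues) for a gene.

    Assumes inclusive coordinates, so the span is end - start + 1.
    """
    n_bases = end - start + 1
    if n_bases <= 0 or n_bases % 3 != 0:
        raise ValueError("span must be a positive multiple of 3")
    codons = n_bases // 3
    # The stop codon is not translated into a residue.
    translated = codons - 1 if has_stop_codon else codons
    # The mature form is taken to lack the initiator methionine.
    mature = translated - 1 if initiator_met_removed else translated
    return n_bases, translated, mature


# Start and End from this record.
print(gene_summary(433117, 434781))  # (1665, 554, 553)
```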
Protein sequence:
>554_residues MAIQHPDIQPAVNHSVQVAIAGAGPVGLMMANYLGQMGIDVLVVEKLDKLIDYPRAIGIDDEALRTMQSVGLVENVLPHT TPWHAMRFLTPKGRCFADIQPMTDEFGWPRRNAFIQPQVDAVMLEGLSRFPNVRCLFARELEAFSQQNDEVTLHLKTAEG QRETVKAQWLVACDGGASFVRRTLNVPFEGKTAPNQWIVVDIANDPLSTPHIYLCCDPVRPYVSAALPHAVRRFEFMVMP GETEEQLREPQNMRKLLSKVLPNPDNVELIRQRVYTHNARLAQRFRIDRVLLAGDAAHIMPVWQGQGYNSGMRDAFNLAW KLALVIQGKARDALLDTYQQERRDHAKAMIDLSVTAGNVLAPPKRWQGTLRDGVSWLLNYLPPVKRYFLEMRFKPMPQYY GGALMREGEAKHSPVGKMFIQPKVTLENGDVTLLDNAIGANFAVIGWGCNPLWGMSDEQIQQWRALGTRFIQVVPEVQIH TAQDNHDGVLRVGDTQGRLRSWFAQHNASLVVMRPDRFVAATAIPQTLGKTLNKLASVMTLTRPDADVSVEKVA
Sequences:
>Translated_554_residues MAIQHPDIQPAVNHSVQVAIAGAGPVGLMMANYLGQMGIDVLVVEKLDKLIDYPRAIGIDDEALRTMQSVGLVENVLPHT TPWHAMRFLTPKGRCFADIQPMTDEFGWPRRNAFIQPQVDAVMLEGLSRFPNVRCLFARELEAFSQQNDEVTLHLKTAEG QRETVKAQWLVACDGGASFVRRTLNVPFEGKTAPNQWIVVDIANDPLSTPHIYLCCDPVRPYVSAALPHAVRRFEFMVMP GETEEQLREPQNMRKLLSKVLPNPDNVELIRQRVYTHNARLAQRFRIDRVLLAGDAAHIMPVWQGQGYNSGMRDAFNLAW KLALVIQGKARDALLDTYQQERRDHAKAMIDLSVTAGNVLAPPKRWQGTLRDGVSWLLNYLPPVKRYFLEMRFKPMPQYY GGALMREGEAKHSPVGKMFIQPKVTLENGDVTLLDNAIGANFAVIGWGCNPLWGMSDEQIQQWRALGTRFIQVVPEVQIH TAQDNHDGVLRVGDTQGRLRSWFAQHNASLVVMRPDRFVAATAIPQTLGKTLNKLASVMTLTRPDADVSVEKVA
>Mature_553_residues AIQHPDIQPAVNHSVQVAIAGAGPVGLMMANYLGQMGIDVLVVEKLDKLIDYPRAIGIDDEALRTMQSVGLVENVLPHTT PWHAMRFLTPKGRCFADIQPMTDEFGWPRRNAFIQPQVDAVMLEGLSRFPNVRCLFARELEAFSQQNDEVTLHLKTAEGQ RETVKAQWLVACDGGASFVRRTLNVPFEGKTAPNQWIVVDIANDPLSTPHIYLCCDPVRPYVSAALPHAVRRFEFMVMPG ETEEQLREPQNMRKLLSKVLPNPDNVELIRQRVYTHNARLAQRFRIDRVLLAGDAAHIMPVWQGQGYNSGMRDAFNLAWK LALVIQGKARDALLDTYQQERRDHAKAMIDLSVTAGNVLAPPKRWQGTLRDGVSWLLNYLPPVKRYFLEMRFKPMPQYYG GALMREGEAKHSPVGKMFIQPKVTLENGDVTLLDNAIGANFAVIGWGCNPLWGMSDEQIQQWRALGTRFIQVVPEVQIHT AQDNHDGVLRVGDTQGRLRSWFAQHNASLVVMRPDRFVAATAIPQTLGKTLNKLASVMTLTRPDADVSVEKVA
Specific function: Catalyzes the insertion of one atom of molecular oxygen into position 2 of the phenyl ring of 3-(3-hydroxyphenyl)propionate (3-HPP) and 3-hydroxycinnamic acid (3-HCI)
COG id: COG0654
COG function: function code HC; 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the pheA/tfdB FAD monooxygenase family
Homologues:
Organism=Escherichia coli, GI1786543, Length=554, Percent_Identity=98.9169675090253, Blast_Score=1137, Evalue=0.0,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): MHPA_ECO24 (A7ZI94)
Other databases:
- EMBL: CP000800
- RefSeq: YP_001461523.1
- ProteinModelPortal: A7ZI94
- SMR: A7ZI94
- STRING: A7ZI94
- EnsemblBacteria: EBESCT00000018928
- GeneID: 5589374
- GenomeReviews: CP000800_GR
- KEGG: ecw:EcE24377A_0371
- eggNOG: COG0654
- GeneTree: EBGT00050000008990
- HOGENOM: HBG565573
- OMA: TTSSTRW
- ProtClustDB: PRK06183
- BioCyc: ECOL331111:ECE24377A_0371-MONOMER
- HAMAP: MF_01652
- InterPro: IPR002938
- InterPro: IPR003042
- PRINTS: PR00420
Pfam domain/function: PF01494 FAD_binding_3
EC number: 1.14.13.-
Molecular weight: Translated: 62185; Mature: 62053
Theoretical pI: Translated: 8.37; Mature: 8.37
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.1 %Cys (Translated Protein)
3.8 %Met (Translated Protein)
4.9 %Cys+Met (Translated Protein)
1.1 %Cys (Mature Protein)
3.6 %Met (Mature Protein)
4.7 %Cys+Met (Mature Protein)
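Percentages of this kind are presumably simple residue frequencies over the protein length. A minimal sketch under that assumption, using a hypothetical helper `residue_percent` and a toy sequence rather than the full protein:

```python
def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues.

    Assumption: content = 100 * count / sequence length, with one-letter
    amino acid codes compared case-insensitively.
    """
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    wanted = set(residues.upper())
    count = sum(1 for aa in seq if aa in wanted)
    return 100.0 * count / len(seq)


toy = "MACMKLVC"  # illustrative sequence only, not from this record
print(residue_percent(toy, "C"))   # 25.0
print(residue_percent(toy, "M"))   # 25.0
print(residue_percent(toy, "CM"))  # 50.0
```

Applied to the translated and mature sequences listed in this record, the same function would be expected to give values close to those above after rounding to one decimal place.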
Secondary structure:
>Translated Secondary Structure
MAIQHPDIQPAVNHSVQVAIAGAGPVGLMMANYLGQMGIDVLVVEKLDKLIDYPRAIGID
CCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCC
DEALRTMQSVGLVENVLPHTTPWHAMRFLTPKGRCFADIQPMTDEFGWPRRNAFIQPQVD
HHHHHHHHHHHHHHHHCCCCCCHHHHEEECCCCCEEEECCCCHHHCCCCCCCCCCCCCHH
AVMLEGLSRFPNVRCLFARELEAFSQQNDEVTLHLKTAEGQRETVKAQWLVACDGGASFV
HHHHHHHHHCCCEEEEHHHHHHHHHCCCCEEEEEEEECCCCCCEEEEEEEEEECCCHHHH
RRTLNVPFEGKTAPNQWIVVDIANDPLSTPHIYLCCDPVRPYVSAALPHAVRRFEFMVMP
HHHHCCCCCCCCCCCCEEEEEECCCCCCCCEEEEEECCCCHHHHHHHHHHHHHEEEEECC
GETEEQLREPQNMRKLLSKVLPNPDNVELIRQRVYTHNARLAQRFRIDRVLLAGDAAHIM
CCCHHHHCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCEEEEECCCHHEE
PVWQGQGYNSGMRDAFNLAWKLALVIQGKARDALLDTYQQERRDHAKAMIDLSVTAGNVL
EEECCCCCCCCHHHHHHHHEEEEEEEECCCHHHHHHHHHHHHHHHHHEEEEEEEECCCCC
APPKRWQGTLRDGVSWLLNYLPPVKRYFLEMRFKPMPQYYGGALMREGEAKHSPVGKMFI
CCCHHHCCHHHHHHHHHHHHCCHHHHHHHHHCCCCCCHHHCCHHEECCCCCCCCCCEEEE
QPKVTLENGDVTLLDNAIGANFAVIGWGCNPLWGMSDEQIQQWRALGTRFIQVVPEVQIH
CCEEEEECCCEEEEECCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHCCCEEEE
TAQDNHDGVLRVGDTQGRLRSWFAQHNASLVVMRPDRFVAATAIPQTLGKTLNKLASVMT
ECCCCCCCEEEECCCHHHHHHHHHHCCCEEEEECCCCEEEHHHHHHHHHHHHHHHHHHHH
LTRPDADVSVEKVA
CCCCCCCCCHHCCC
>Mature Secondary Structure
AIQHPDIQPAVNHSVQVAIAGAGPVGLMMANYLGQMGIDVLVVEKLDKLIDYPRAIGID
CCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCC
DEALRTMQSVGLVENVLPHTTPWHAMRFLTPKGRCFADIQPMTDEFGWPRRNAFIQPQVD
HHHHHHHHHHHHHHHHCCCCCCHHHHEEECCCCCEEEECCCCHHHCCCCCCCCCCCCCHH
AVMLEGLSRFPNVRCLFARELEAFSQQNDEVTLHLKTAEGQRETVKAQWLVACDGGASFV
HHHHHHHHHCCCEEEEHHHHHHHHHCCCCEEEEEEEECCCCCCEEEEEEEEEECCCHHHH
RRTLNVPFEGKTAPNQWIVVDIANDPLSTPHIYLCCDPVRPYVSAALPHAVRRFEFMVMP
HHHHCCCCCCCCCCCCEEEEEECCCCCCCCEEEEEECCCCHHHHHHHHHHHHHEEEEECC
GETEEQLREPQNMRKLLSKVLPNPDNVELIRQRVYTHNARLAQRFRIDRVLLAGDAAHIM
CCCHHHHCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCEEEEECCCHHEE
PVWQGQGYNSGMRDAFNLAWKLALVIQGKARDALLDTYQQERRDHAKAMIDLSVTAGNVL
EEECCCCCCCCHHHHHHHHEEEEEEEECCCHHHHHHHHHHHHHHHHHEEEEEEEECCCCC
APPKRWQGTLRDGVSWLLNYLPPVKRYFLEMRFKPMPQYYGGALMREGEAKHSPVGKMFI
CCCHHHCCHHHHHHHHHHHHCCHHHHHHHHHCCCCCCHHHCCHHEECCCCCCCCCCEEEE
QPKVTLENGDVTLLDNAIGANFAVIGWGCNPLWGMSDEQIQQWRALGTRFIQVVPEVQIH
CCEEEEECCCEEEEECCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHCCCEEEE
TAQDNHDGVLRVGDTQGRLRSWFAQHNASLVVMRPDRFVAATAIPQTLGKTLNKLASVMT
ECCCCCCCEEEECCCHHHHHHHHHHCCCEEEEECCCCEEEHHHHHHHHHHHHHHHHHHHH
LTRPDADVSVEKVA
CCCCCCCCCHHCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA