Definition | Escherichia coli HS, complete genome. |
---|---|
Accession | NC_009800 |
Length | 4,643,538 |
The map label for this gene is eutH
Identifier: 157161910
GI number: 157161910
Start: 2588788
End: 2590014
Strand: Reverse
Name: eutH
Synonym: EcHS_A2580
Alternate gene names: 157161910
Gene position: 2590014-2588788 (Counterclockwise)
Preceding gene: 157161911
Following gene: 157161909
Centisome position: 55.78
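The centisome value appears to be the gene's position expressed as a percentage of the genome length. The sketch below assumes that definition and assumes the position used is the first base of the gene on the reverse strand (2590014); the function name `centisome` is illustrative and not part of the record.

```python
def centisome(position: int, genome_length: int) -> float:
    """Return a genome position as a percentage of the total genome length."""
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    return 100.0 * position / genome_length


# Values taken from this record: gene start on the reverse strand and genome length.
print(round(centisome(2590014, 4643538), 2))  # 55.78
```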
GC content: 55.18
Gene sequence:
>1227_bases ATGGGAATTAACGAAATCATCATGTACATCATGATGTTCTTTATGCTGATAGCTGCCGTAGACAGGATCCTGTCGCAGTT CGGCGGTTCTGCTCGTTTCCTCGGTAAGTTCGGTAAAAGTATCGAAGGATCAGGCGGTCAGTTCGAAGAAGGCTTTATGG CAATGGGCGCACTGGGCCTGGCGATGGTCGGTATGACCGCGCTGGCACCGGTACTGGCTCACGTTCTCGGGCCGGTAATT ATTCCGGTTTACGAAATGCTCGGCGCTAACCCATCGATGTTCGCCGGAACACTGCTGGCGTGCGATATGGGCGGCTTCTT CCTCGCCAAAGAGCTGGCGGGCGGCGACGTAGCCGCGTGGCTATACTCTGGGTTAATTCTCGGGTCGATGATGGGGCCAA CGATTGTGTTTTCCATTCCGGTGGCGCTCGGCATTATCGAACCTTCTGACCGTCGTTATCTGGCGCTCGGCGTGCTGGCG GGCATTGTGACCATTCCGATTGGTTGTATCGCTGGTGGTCTGGTTGCTATGTACTCCGGTGTGCAGATCAACGGCCAGCC GGTGGAATTCACTTTCGCCCTGATCCTGATGAACATGATCCCGGTGATCATTGTTGCGATTCTGGTGGCGCTGGGGCTGA AATTCATCCCGGAAAAAATGATCAACGGCTTCCAGATCTTCGCCAAATTCCTCGTTGCATTGATCACCCTCGGTCTTGCC GCTGCGGTAGTGAAATTCCTGCTTGGCTGGGAACTGATCCCCGGTCTGGATCCTATCTTTATGGCCCCTGGCGATAAACC CGGTGAGGTGATGCGCGCCATTGAAGTTATCGGTTCTATCTCCTGCGTTCTGTTAGGGGCGTATCCGATGGTGCTGCTGC TGACTCGCTGGTTTGAAAAACCGCTGATGAGCGTCGGTAAAGTACTGAATATGAACAACATCGCGGCAGCCGGCATGGTG GCAACGCTTGCCAACAACATCCCGATGTTCGGCATGATGAAGCAGATGGATACCCGCGGCAAAGTCATCAACTGCGCCTT CGCCGTTTCCGCTGCTTTCGCCCTGGGCGACCACTTAGGCTTCGCCGCTGCCAACATGAACGCCATGATCTTCCCGATGA TTGTCGGCAAGTTGATCGGCGGCGTAACGGCGATTGGCGTGGCGATGATGCTGGTGCCAAAAGAAGACGCGACCGCGACT AAAACCGAAGCGGAGGCACAATCGTGA
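A GC content figure like the one above can be recomputed from the gene sequence. This is a minimal sketch that assumes GC content means the percentage of G and C bases over all bases; the helper name `gc_content` is hypothetical. Whitespace is stripped first because the sequence in this record is broken into space-separated blocks.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    cleaned = "".join(seq.split()).upper()  # drop spaces and line breaks
    if not cleaned:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# Small illustrative input, not the full gene sequence.
print(gc_content("ATGG GAAT"))  # 37.5
```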
Upstream 100 bases:
>100_bases TTGCCGCGTCTTATCCAGCCTACGGGATTGCACATGTAGGGCGGATAAGGCGTTTACGCCGCATCCGCCAATAAATAACA ACTAACAGGGAGTAAAGGCG
Downstream 100 bases:
>100_bases ACACTCGCCAGCTATTGAGCGTCGGTATCGATATCGGCACCACCACCACCCAGGTGATTTTCTCCCACCTGGAGCTGGTT AACCGTGCGGCGGTGTCGCA
Product: ethanolamine utilization protein EutH
Products: NA
Alternate protein names: Putative ethanolamine transporter
Number of amino acids: Translated: 408; Mature: 407
Protein sequence:
>408_residues MGINEIIMYIMMFFMLIAAVDRILSQFGGSARFLGKFGKSIEGSGGQFEEGFMAMGALGLAMVGMTALAPVLAHVLGPVI IPVYEMLGANPSMFAGTLLACDMGGFFLAKELAGGDVAAWLYSGLILGSMMGPTIVFSIPVALGIIEPSDRRYLALGVLA GIVTIPIGCIAGGLVAMYSGVQINGQPVEFTFALILMNMIPVIIVAILVALGLKFIPEKMINGFQIFAKFLVALITLGLA AAVVKFLLGWELIPGLDPIFMAPGDKPGEVMRAIEVIGSISCVLLGAYPMVLLLTRWFEKPLMSVGKVLNMNNIAAAGMV ATLANNIPMFGMMKQMDTRGKVINCAFAVSAAFALGDHLGFAAANMNAMIFPMIVGKLIGGVTAIGVAMMLVPKEDATAT KTEAEAQS
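The protein sequence corresponds to the gene sequence read codon by codon: 1227 bases give 409 codons, that is, 408 residues plus the stop codon. The sketch below translates a coding-strand sequence with the standard codon assignments; the name `translate` and the table construction are illustrative, and the record does not state which tool produced its translation.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    cleaned = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]  # raises KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons and last four codons of the gene sequence in this record.
print(translate("ATGGGAATTAACGAAATC"))  # MGINEI
print(translate("GCACAATCGTGA"))        # AQS
```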
Sequences:
>Translated_408_residues MGINEIIMYIMMFFMLIAAVDRILSQFGGSARFLGKFGKSIEGSGGQFEEGFMAMGALGLAMVGMTALAPVLAHVLGPVI IPVYEMLGANPSMFAGTLLACDMGGFFLAKELAGGDVAAWLYSGLILGSMMGPTIVFSIPVALGIIEPSDRRYLALGVLA GIVTIPIGCIAGGLVAMYSGVQINGQPVEFTFALILMNMIPVIIVAILVALGLKFIPEKMINGFQIFAKFLVALITLGLA AAVVKFLLGWELIPGLDPIFMAPGDKPGEVMRAIEVIGSISCVLLGAYPMVLLLTRWFEKPLMSVGKVLNMNNIAAAGMV ATLANNIPMFGMMKQMDTRGKVINCAFAVSAAFALGDHLGFAAANMNAMIFPMIVGKLIGGVTAIGVAMMLVPKEDATAT KTEAEAQS
>Mature_407_residues GINEIIMYIMMFFMLIAAVDRILSQFGGSARFLGKFGKSIEGSGGQFEEGFMAMGALGLAMVGMTALAPVLAHVLGPVII PVYEMLGANPSMFAGTLLACDMGGFFLAKELAGGDVAAWLYSGLILGSMMGPTIVFSIPVALGIIEPSDRRYLALGVLAG IVTIPIGCIAGGLVAMYSGVQINGQPVEFTFALILMNMIPVIIVAILVALGLKFIPEKMINGFQIFAKFLVALITLGLAA AVVKFLLGWELIPGLDPIFMAPGDKPGEVMRAIEVIGSISCVLLGAYPMVLLLTRWFEKPLMSVGKVLNMNNIAAAGMVA TLANNIPMFGMMKQMDTRGKVINCAFAVSAAFALGDHLGFAAANMNAMIFPMIVGKLIGGVTAIGVAMMLVPKEDATATK TEAEAQS
Specific function: Possibly involved in the transport of ethanolamine from the periplasm to the cytoplasm
COG id: COG3192
COG function: function code E; Ethanolamine utilization protein
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the eutH family
Homologues:
Organism=Escherichia coli, GI1788794, Length=408, Percent_Identity=100, Blast_Score=796, Evalue=0.0
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): EUTH_ECOLI (P76552)
Other databases:
- EMBL: U00096
- EMBL: AP009048
- PIR: C65020
- RefSeq: AP_003037.1
- RefSeq: NP_416947.1
- ProteinModelPortal: P76552
- DIP: DIP-9533N
- MINT: MINT-1296863
- STRING: P76552
- EnsemblBacteria: EBESCT00000002312
- EnsemblBacteria: EBESCT00000018423
- GeneID: 944979
- GenomeReviews: AP009048_GR
- GenomeReviews: U00096_GR
- KEGG: ecj:JW2436
- KEGG: eco:b2452
- EchoBASE: EB3934
- EcoGene: EG14182
- eggNOG: COG3192
- GeneTree: EBGT00050000010858
- HOGENOM: HBG304744
- OMA: FGDHLGF
- ProtClustDB: PRK15086
- BioCyc: EcoCyc:G7282-MONOMER
- Genevestigator: P76552
- GO: GO:0006810
- InterPro: IPR007441
Pfam domain/function: PF04346 EutH
EC number: NA
Molecular weight: Translated: 42808; Mature: 42676
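A molecular weight of this kind is commonly estimated by summing average residue masses and adding one water molecule for the free termini. The sketch below assumes that method and uses commonly tabulated average masses; the record does not say how its own values were derived, so small differences from the figures above are possible. The names `RESIDUE_MASS` and `molecular_weight` are illustrative.

```python
# Average residue masses in daltons (amino acid minus one water); commonly tabulated values.
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water added for the N- and C-terminal groups


def molecular_weight(protein: str) -> float:
    """Estimate the average molecular weight of an unmodified peptide chain."""
    cleaned = "".join(protein.split()).upper()
    if not cleaned:
        raise ValueError("sequence is empty")
    return sum(RESIDUE_MASS[aa] for aa in cleaned) + WATER


# Removing an N-terminal Met lowers the estimate by one Met residue mass (about 131 Da),
# which is consistent with the gap between the translated and mature values above.
print(round(molecular_weight("MGINEI") - molecular_weight("GINEI"), 2))
```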
Theoretical pI: Translated: 5.37; Mature: 5.37
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
11 predicted regions (coordinates not available)
Cys/Met content:
Translated protein: 1.0% Cys; 8.1% Met; 9.1% Cys+Met
Mature protein: 1.0% Cys; 7.9% Met; 8.8% Cys+Met
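Percentages like these can be recomputed by counting residues in the protein sequence. A minimal sketch, assuming each percentage is the residue count divided by the sequence length; `residue_percent` is a hypothetical helper name.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Return the percentage of positions in `protein` occupied by any of `residues`."""
    cleaned = "".join(protein.split()).upper()
    if not cleaned:
        raise ValueError("sequence is empty")
    wanted = set(residues.upper())
    count = sum(1 for aa in cleaned if aa in wanted)
    return 100.0 * count / len(cleaned)


# Illustrative input: the first ten residues of the translated protein in this record.
fragment = "MGINEIIMYI"
print(residue_percent(fragment, "C"))   # 0.0
print(residue_percent(fragment, "M"))   # 20.0
print(residue_percent(fragment, "CM"))  # 20.0
```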
Secondary structure:
>Translated Secondary Structure
MGINEIIMYIMMFFMLIAAVDRILSQFGGSARFLGKFGKSIEGSGGQFEEGFMAMGALGL
CCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCCCCHHHHHHHHHHHH
AMVGMTALAPVLAHVLGPVIIPVYEMLGANPSMFAGTLLACDMGGFFLAKELAGGDVAAW
HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCHHHHHHHCCCHHHHH
LYSGLILGSMMGPTIVFSIPVALGIIEPSDRRYLALGVLAGIVTIPIGCIAGGLVAMYSG
HHHHHHHHHHCCCHHHEEHHHHHEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCC
VQINGQPVEFTFALILMNMIPVIIVAILVALGLKFIPEKMINGFQIFAKFLVALITLGLA
CEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
AAVVKFLLGWELIPGLDPIFMAPGDKPGEVMRAIEVIGSISCVLLGAYPMVLLLTRWFEK
HHHHHHHHHHHHCCCCCCEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
PLMSVGKVLNMNNIAAAGMVATLANNIPMFGMMKQMDTRGKVINCAFAVSAAFALGDHLG
HHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHHCCCCCEEHHHHHHHHHHHHHHHHH
FAAANMNAMIFPMIVGKLIGGVTAIGVAMMLVPKEDATATKTEAEAQS
HHHHCCCHHHHHHHHHHHHHHHHHHHHHHHEECCCCCCCCCCHHCCCH
>Mature Secondary Structure
GINEIIMYIMMFFMLIAAVDRILSQFGGSARFLGKFGKSIEGSGGQFEEGFMAMGALGL
CHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCCCCHHHHHHHHHHHH
AMVGMTALAPVLAHVLGPVIIPVYEMLGANPSMFAGTLLACDMGGFFLAKELAGGDVAAW
HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCHHHHHHHCCCHHHHH
LYSGLILGSMMGPTIVFSIPVALGIIEPSDRRYLALGVLAGIVTIPIGCIAGGLVAMYSG
HHHHHHHHHHCCCHHHEEHHHHHEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCC
VQINGQPVEFTFALILMNMIPVIIVAILVALGLKFIPEKMINGFQIFAKFLVALITLGLA
CEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
AAVVKFLLGWELIPGLDPIFMAPGDKPGEVMRAIEVIGSISCVLLGAYPMVLLLTRWFEK
HHHHHHHHHHHHCCCCCCEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
PLMSVGKVLNMNNIAAAGMVATLANNIPMFGMMKQMDTRGKVINCAFAVSAAFALGDHLG
HHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHHCCCCCEEHHHHHHHHHHHHHHHHH
FAAANMNAMIFPMIVGKLIGGVTAIGVAMMLVPKEDATATKTEAEAQS
HHHHCCCHHHHHHHHHHHHHHHHHHHHHHHEECCCCCCCCCCHHCCCH
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 9205837; 9278503