Definition | Escherichia coli E24377A, complete genome. |
---|---|
Accession | NC_009801 |
Length | 4,979,619 |
The map label for this gene is pduF [H]
Identifier: 157155813
GI number: 157155813
Start: 2245761
End: 2246570
Strand: Reverse
Name: pduF [H]
Synonym: EcE24377A_2278
Alternate gene names: 157155813
Gene position: 2246570-2245761 (Counterclockwise)
Preceding gene: 157156935
Following gene: 157155157
Centisome position: 45.12
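The coordinate fields above are tied together by simple arithmetic. The sketch below recomputes the gene length and centisome position from the listed Start, End and genome Length. The function names are illustrative. Using the End coordinate for the centisome is an assumption inferred from the numbers: it reproduces the listed 45.12, whereas Start gives 45.10.

```python
GENOME_LENGTH = 4_979_619            # "Length" field of the record
START, END = 2_245_761, 2_246_570    # "Start" and "End" fields

def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1

def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the genome length."""
    return round(100.0 * position / genome_length, 2)

print(gene_length(START, END))          # 810, matching the 810-base gene sequence
print(centisome(END, GENOME_LENGTH))    # 45.12 (assumes End is the reference coordinate)
```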
GC content: 47.53
Gene sequence:
>810_bases
ATGAATGATTCACTCAAGGCGCAATGCGTTGCCGAGTTTTTAGGCACCGGACTGTTCCTCTTTTTTGGCATTGGTTGTTT
GTGCGCCCTGAAACTTGCTGGGGCAAGTCTGGGACTGTGGGAAATTTGTATCATTTGGGGGTTAGGGATTTCGCTTGCTG
TTTACCTTACGGCAGGTATTTCCGGCGCTCATCTCAACCCGGCTGTTACCATTGCTCTGTGGCTGTTTGCCTCTTTCCCA
GCCCGTAAAGTGCTGCCATTTTGTGTGGCGCAATTAGCGGGTGCCTTTGGTGGTGCGGTTCTGGCATATTCTCTATACAG
TAGTCTGTTTACCGATTTTGAATCTGCTCACAATATGGTGCGTGGTAGCGCAGAAAGTTTACAACTCGTCAGTATCTTCA
GTACTTATCCGGCAGCGGCGATTAATGTATGGCAAGCTGCGTTGGTCGAAGTGGTCATAACGTCCATGTTAATGGGGGTG
ATTATGGCGTTGACCGATGACGGTAATGGCGTACCCAAAGGACCGCTTGCACCTTTACTTATTGGTATTCTGGTTGCTGT
TATTGGTGCTTCTACCGGACCATTAACCGGTTTTGCCATGAATCCAGCGCGTGATTTTGGACCTAAGATTTTCACCTGGC
TTGCGGGGTGGGGAGAAATTGCGATGACTGGTGGACGCAATATTCCTTATTTCATTGTACCCATTATTGCACCGATCATT
GGGGCATGTGTTGGTGCGGCGATTTATCGCTATCTCATTGCAAAGAATTTACCTGTCAATAGTGTTACACCTAAAGGAAT
TAGCGAATAA
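The GC content field can be recomputed from the gene sequence as the percentage of G and C bases. A value of 47.53 over 810 bases corresponds to 385 G or C bases (385 / 810 = 47.53%). The helper below is a minimal sketch with an illustrative name. It strips whitespace first, so a sequence pasted with spaces or line breaks works unchanged.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = "".join(seq.split()).upper()   # drop spaces and line breaks
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)

# Arithmetic check against the record: 385 G+C bases out of 810.
print(round(100.0 * 385 / 810, 2))   # 47.53
```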
Upstream 100 bases:
>100_bases
ATATTTTCTGCAGAGCAATTTTTTATAGTTGCGCTCGGTATACACAACAATCAGAGCATAGTTGGCTGATAAAGAAAAAC
TCCGTAAAGAAGGTGTCACT
Downstream 100 bases:
>100_bases
CGAAATACTCTGGCAGTTATAAACATTTATTTTTTTACTGTTTGTGATCTGCTTTCCGAACTTATTAAAAAGTCTCCCGC
TTCACGGAGGCTTTTGTTAT
Product: propanediol diffusion facilitator
Products: glycerol [Cytoplasm] [C]
Alternate protein names: NA
Number of amino acids: Translated: 269; Mature: 269
Protein sequence:
>269_residues
MNDSLKAQCVAEFLGTGLFLFFGIGCLCALKLAGASLGLWEICIIWGLGISLAVYLTAGISGAHLNPAVTIALWLFASFP
ARKVLPFCVAQLAGAFGGAVLAYSLYSSLFTDFESAHNMVRGSAESLQLVSIFSTYPAAAINVWQAALVEVVITSMLMGV
IMALTDDGNGVPKGPLAPLLIGILVAVIGASTGPLTGFAMNPARDFGPKIFTWLAGWGEIAMTGGRNIPYFIVPIIAPII
GACVGAAIYRYLIAKNLPVNSVTPKGISE
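The amino acid count follows from the gene length: 810 bases make 270 codons, and the final codon (TAA) is a stop, leaving 269 residues. The sketch below translates a coding sequence with the standard codon assignments, which bacterial translation table 11 shares for sense and stop codons. The table layout and the function name are illustrative, not taken from the record.

```python
BASES = "TCAG"
# Standard genetic code, ordered by first, second, then third base in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First seven codons of the gene sequence in this record.
print(translate("ATGAATGATTCACTCAAGGCG"))   # MNDSLKA
```

Because the gene lies on the reverse strand, the gene sequence listed in the record is already given in coding orientation, so no reverse complement is needed before translating it.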
Sequences:
>Translated_269_residues
MNDSLKAQCVAEFLGTGLFLFFGIGCLCALKLAGASLGLWEICIIWGLGISLAVYLTAGISGAHLNPAVTIALWLFASFP
ARKVLPFCVAQLAGAFGGAVLAYSLYSSLFTDFESAHNMVRGSAESLQLVSIFSTYPAAAINVWQAALVEVVITSMLMGV
IMALTDDGNGVPKGPLAPLLIGILVAVIGASTGPLTGFAMNPARDFGPKIFTWLAGWGEIAMTGGRNIPYFIVPIIAPII
GACVGAAIYRYLIAKNLPVNSVTPKGISE
>Mature_269_residues
MNDSLKAQCVAEFLGTGLFLFFGIGCLCALKLAGASLGLWEICIIWGLGISLAVYLTAGISGAHLNPAVTIALWLFASFP
ARKVLPFCVAQLAGAFGGAVLAYSLYSSLFTDFESAHNMVRGSAESLQLVSIFSTYPAAAINVWQAALVEVVITSMLMGV
IMALTDDGNGVPKGPLAPLLIGILVAVIGASTGPLTGFAMNPARDFGPKIFTWLAGWGEIAMTGGRNIPYFIVPIIAPII
GACVGAAIYRYLIAKNLPVNSVTPKGISE
Specific function: May facilitate the diffusion of propanediol [H]
COG id: COG0580
COG function: function code G; Glycerol uptake facilitator and related permeases (Major Intrinsic Protein Family)
Gene ontology:
Cell location: Cell membrane; Multi-pass membrane protein [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the MIP/aquaporin (TC 1.A.8) family [H]
Homologues:
Organism=Homo sapiens, GI157266307, Length=261, Percent_Identity=36.0153256704981, Blast_Score=148, Evalue=6e-36
Organism=Homo sapiens, GI22538420, Length=256, Percent_Identity=35.15625, Blast_Score=146, Evalue=1e-35
Organism=Homo sapiens, GI4502187, Length=261, Percent_Identity=34.4827586206897, Blast_Score=145, Evalue=3e-35
Organism=Homo sapiens, GI4826645, Length=256, Percent_Identity=35.546875, Blast_Score=141, Evalue=5e-34
Organism=Homo sapiens, GI310133356, Length=217, Percent_Identity=33.1797235023041, Blast_Score=100, Evalue=2e-21
Organism=Homo sapiens, GI310114181, Length=217, Percent_Identity=33.1797235023041, Blast_Score=100, Evalue=2e-21
Organism=Homo sapiens, GI45446752, Length=203, Percent_Identity=33.9901477832512, Blast_Score=84, Evalue=1e-16
Organism=Homo sapiens, GI4502179, Length=262, Percent_Identity=32.0610687022901, Blast_Score=80, Evalue=2e-15
Organism=Homo sapiens, GI6912506, Length=259, Percent_Identity=30.1158301158301, Blast_Score=80, Evalue=3e-15
Organism=Homo sapiens, GI4502183, Length=246, Percent_Identity=32.1138211382114, Blast_Score=76, Evalue=3e-14
Organism=Homo sapiens, GI86792455, Length=248, Percent_Identity=31.4516129032258, Blast_Score=74, Evalue=2e-13
Organism=Homo sapiens, GI37694062, Length=250, Percent_Identity=28, Blast_Score=70, Evalue=2e-12
Organism=Escherichia coli, GI1790362, Length=260, Percent_Identity=68.4615384615385, Blast_Score=374, Evalue=1e-105
Organism=Escherichia coli, GI1787101, Length=220, Percent_Identity=30.9090909090909, Blast_Score=73, Evalue=2e-14
Organism=Caenorhabditis elegans, GI71992966, Length=254, Percent_Identity=34.6456692913386, Blast_Score=142, Evalue=2e-34
Organism=Caenorhabditis elegans, GI71994009, Length=265, Percent_Identity=33.5849056603774, Blast_Score=142, Evalue=2e-34
Organism=Caenorhabditis elegans, GI32564052, Length=258, Percent_Identity=32.5581395348837, Blast_Score=126, Evalue=1e-29
Organism=Caenorhabditis elegans, GI17533613, Length=258, Percent_Identity=32.5581395348837, Blast_Score=126, Evalue=1e-29
Organism=Caenorhabditis elegans, GI17544068, Length=246, Percent_Identity=37.3983739837398, Blast_Score=124, Evalue=6e-29
Organism=Caenorhabditis elegans, GI17531429, Length=262, Percent_Identity=29.7709923664122, Blast_Score=110, Evalue=7e-25
Organism=Caenorhabditis elegans, GI17531431, Length=262, Percent_Identity=29.7709923664122, Blast_Score=110, Evalue=7e-25
Organism=Caenorhabditis elegans, GI71992961, Length=254, Percent_Identity=31.496062992126, Blast_Score=107, Evalue=4e-24
Organism=Caenorhabditis elegans, GI17558372, Length=266, Percent_Identity=28.1954887218045, Blast_Score=80, Evalue=1e-15
Organism=Caenorhabditis elegans, GI71993722, Length=241, Percent_Identity=28.6307053941909, Blast_Score=77, Evalue=9e-15
Organism=Caenorhabditis elegans, GI212646268, Length=282, Percent_Identity=25.531914893617, Blast_Score=65, Evalue=3e-11
Organism=Saccharomyces cerevisiae, GI6321054, Length=209, Percent_Identity=40.6698564593301, Blast_Score=139, Evalue=4e-34
Organism=Saccharomyces cerevisiae, GI6322985, Length=294, Percent_Identity=26.530612244898, Blast_Score=89, Evalue=5e-19
Organism=Drosophila melanogaster, GI45550503, Length=261, Percent_Identity=26.8199233716475, Blast_Score=80, Evalue=2e-15
Organism=Drosophila melanogaster, GI24762344, Length=245, Percent_Identity=27.3469387755102, Blast_Score=70, Evalue=2e-12
Organism=Drosophila melanogaster, GI24762346, Length=245, Percent_Identity=27.3469387755102, Blast_Score=70, Evalue=2e-12
Organism=Drosophila melanogaster, GI24762348, Length=245, Percent_Identity=27.3469387755102, Blast_Score=70, Evalue=2e-12
Organism=Drosophila melanogaster, GI20130305, Length=245, Percent_Identity=27.3469387755102, Blast_Score=70, Evalue=2e-12
Organism=Drosophila melanogaster, GI24652747, Length=253, Percent_Identity=26.8774703557312, Blast_Score=66, Evalue=2e-11
Organism=Drosophila melanogaster, GI45551084, Length=253, Percent_Identity=26.8774703557312, Blast_Score=66, Evalue=2e-11
Organism=Drosophila melanogaster, GI24762342, Length=243, Percent_Identity=25.9259259259259, Blast_Score=64, Evalue=1e-10
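The homologue entries follow a fixed key=value pattern, so they can be parsed into records for sorting or filtering. The sketch below assumes exactly the field order shown in this record (Organism, GI, Length, Percent_Identity, Blast_Score, Evalue). The regular expression, dictionary keys and function name are illustrative. Whitespace is normalised first, so the entries parse whether they arrive one per line or as a single run of text.

```python
import re

HIT = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),"
    r"\s*Percent_Identity=(?P<identity>[\d.]+),\s*Blast_Score=(?P<score>\d+),"
    r"\s*Evalue=(?P<evalue>[\deE.+-]+)"
)

def parse_hits(text: str) -> list[dict]:
    """Parse flattened homologue entries into a list of dictionaries."""
    text = " ".join(text.split())   # collapse line breaks and repeated spaces
    return [
        {
            "organism": m["organism"].strip(),
            "gi": m["gi"],
            "length": int(m["length"]),
            "identity": float(m["identity"]),
            "score": int(m["score"]),
            "evalue": float(m["evalue"]),
        }
        for m in HIT.finditer(text)
    ]

sample = (
    "Organism=Homo sapiens, GI157266307, Length=261, "
    "Percent_Identity=36.0153256704981, Blast_Score=148, Evalue=6e-36, "
    "Organism=Escherichia coli, GI1790362, Length=260, "
    "Percent_Identity=68.4615384615385, Blast_Score=374, Evalue=1e-105,"
)
best = max(parse_hits(sample), key=lambda hit: hit["identity"])
print(best["organism"], best["gi"], round(best["identity"], 2))
```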
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR012269
- InterPro: IPR000425
- InterPro: IPR022357 [H]
Pfam domain/function: PF00230 MIP [H]
EC number: NA
Molecular weight: Translated: 28031; Mature: 28031
Theoretical pI: Translated: 7.20; Mature: 7.20
Prosite motif: PS00013 PROKAR_LIPOPROTEIN; PS00221 MIP
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
2.2 %Cys (Translated Protein)
2.6 %Met (Translated Protein)
4.8 %Cys+Met (Translated Protein)
2.2 %Cys (Mature Protein)
2.6 %Met (Mature Protein)
4.8 %Cys+Met (Mature Protein)
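The Cys/Met percentages are residue counts divided by the protein length. The listed values appear consistent with 6 Cys and 7 Met among the 269 residues (6 / 269 = 2.2%, 7 / 269 = 2.6%, 13 / 269 = 4.8%). The helper below is a minimal sketch with an illustrative name and a toy sequence. The same call applies to the protein sequence in this record.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of a protein made up of the given one-letter residues."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in residues.upper())
    return 100.0 * count / len(protein)

print(residue_percent("MCAA", "C"))    # 25.0
print(residue_percent("MCAA", "CM"))   # 50.0
```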
Secondary structure:
>Translated Secondary Structure
MNDSLKAQCVAEFLGTGLFLFFGIGCLCALKLAGASLGLWEICIIWGLGISLAVYLTAGI
CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCC
SGAHLNPAVTIALWLFASFPARKVLPFCVAQLAGAFGGAVLAYSLYSSLFTDFESAHNMV
CCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
RGSAESLQLVSIFSTYPAAAINVWQAALVEVVITSMLMGVIMALTDDGNGVPKGPLAPLL
HCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHH
IGILVAVIGASTGPLTGFAMNPARDFGPKIFTWLAGWGEIAMTGGRNIPYFIVPIIAPII
HHHHHHHHCCCCCCCCCCCCCCHHHHCHHHHHHHHCCCCEEEECCCCCCEEHHHHHHHHH
GACVGAAIYRYLIAKNLPVNSVTPKGISE
HHHHHHHHHHHHHHCCCCCCCCCCCCCCC
>Mature Secondary Structure
MNDSLKAQCVAEFLGTGLFLFFGIGCLCALKLAGASLGLWEICIIWGLGISLAVYLTAGI
CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCC
SGAHLNPAVTIALWLFASFPARKVLPFCVAQLAGAFGGAVLAYSLYSSLFTDFESAHNMV
CCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
RGSAESLQLVSIFSTYPAAAINVWQAALVEVVITSMLMGVIMALTDDGNGVPKGPLAPLL
HCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHH
IGILVAVIGASTGPLTGFAMNPARDFGPKIFTWLAGWGEIAMTGGRNIPYFIVPIIAPII
HHHHHHHHCCCCCCCCCCCCCCHHHHCHHHHHHHHCCCCEEEECCCCCCEEHHHHHHHHH
GACVGAAIYRYLIAKNLPVNSVTPKGISE
HHHHHHHHHHHHHHCCCCCCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: glycerol [Periplasm] [C]
Specific reaction: glycerol [Periplasm] = glycerol [Cytoplasm] [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 7.0
TargetDB status: NA
Availability: NA
References: 8071226; 9352910; 11677609 [H]