| Definition | Escherichia coli ED1a chromosome, complete genome. |
|---|---|
| Accession | NC_011745 |
| Length | 5,209,548 |
The map label for this gene is yihP [H]
Identifier: 218692161
GI number: 218692161
Start: 4519133
End: 4520551
Strand: Reverse
Name: yihP [H]
Synonym: ECED1_4576
Alternate gene names: 218692161
Gene position: 4520551-4519133 (Counterclockwise)
Preceding gene: 218692162
Following gene: 218692160
Centisome position: 86.77
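The gene length and centisome value can be recomputed from the coordinates listed above. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming that the centisome is the End coordinate expressed as a percentage of the chromosome length. That convention is an assumption; it reproduces the listed 86.77, but the database's exact definition is not stated in this record. The function names are hypothetical.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based, inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the chromosome length, to 2 decimals."""
    return round(100.0 * position / genome_length, 2)


GENOME_LENGTH = 5_209_548            # Length field of NC_011745
START, END = 4_519_133, 4_520_551    # Start / End fields of this record

print(gene_length(START, END))        # matches the 1419-base gene sequence
print(centisome(END, GENOME_LENGTH))  # assumed convention: End coordinate
```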
GC content: 51.44
Gene sequence:
>1419_bases GTGGGGAAACTCACGGGCAAAGGGAGAACAACGATGAGTCACATCACAACGGAAGATCCAGCAACTCTACGCCTGCCCTT TAAAGAGAAACTCTCTTACGGTATCGGCGATCTGGCCTCTAACATCCTACTGGATATCGGCACGCTTTATCTTTTGAAGT TTTATACCGATGTTCTGGGGCTGCCTGGCACCTATGGCGGCATTATCTTTTTGATTTCAAAATTCTTTACCGCCTTTACC GATATGGGTACCGGCATTATGTTGGATTCCCGACGCAAGATCGGTCCAAAAGGTAAGTTCCGTCCTTTTATTCTGTATGC GTCATTTCCGGTCACCCTGCTGGAGATCGCCAACTTTGTCGGCACACCGTTTGATGTCACCGGTAAAACGGTGATGGCCA CCATCCTGTTTATGCTTTACGGACTGTTTTTCAGCATGATGAACTGCTCGTATGGCGCGATGGTCCCCGCTATTACTAAA AACCCCAACGAACGTGCATCGCTGGCAGCATGGCGTCAGGGTGGCGCTACATTGGGCCTGCTGCTGTGTACGGTGGGATT CGTGCCGGTTATGAATCTTATCGAGGGTAATCAGCAACTTGGCTATATCTTCGCCGCCACGCTGTTTTCACTGTTCGGCC TGCTGTTTATGTGGATCTGCTATTCGGGCGTGAAAGAGCGTTATGTCAAAACCCAACCTACCAATCCGGCGCAAAAGCCT GGCTTGTTGCAATCTTTCCGCGCAATTGCGGGGAACCGCCCACTGTTCATTTTGTGCATTGCCAACCTCTGTACCTTAGG GGCGTTTAACGTCAAGCTCGCCATTCAGGTCTATTACACCCAGTACGTGCTTAACGATCCCATCCTGTTGTCATATATGG GATTTTTCAGCATGGGCTGTATTTTCATCGGCGTGTTCCTGATGCCTGGTGCAGTCAGACGTTTTGGTAAGAAGAAGGTC TATATCGGCGGTCTGCTGATCTGGGTGCTGGGCGATCTGCTCAACTATTTCTTCGGCGGCGGCTCGGTCAGCTTTGTGGC TTTCTCCTGCCTGGCGTTCTTTGGTTCGGCGTTTGTTAACAGCCTGAACTGGGCGCTGGTTTCTGACACCGTCGAGTATG GCGAGTGGCGCACCGGTGTGCGTTCGGAAGGTACGGTCTACACCGGTTTCACCTTCTTTCGCAAAGTATCTCAGGCGCTG GCAGGTTTCTTCCCAGGCTGGATGCTGACGCAAATTGGCTATGTGCCAAACGTCGCCCAGGCTGACCACACTATCGAAGG GTTGCGCCAACTGATCTTCATCTACCCAAGCGCACTGGCAGTAGTCACCATTGTGGCAATGGGTTGCTTCTACAGTCTGA ACGAGAAGATGTACGTCCGCATTGTTGAAGAGATAGAAGCCCGTAAACGCACGGCGTAA
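A GC content figure such as the one listed above is the share of G and C bases in the sequence. The sketch below is a hypothetical helper, not the database's own code; it is shown on the first 20 bases of the gene sequence only, to keep the example short.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence, to 2 decimals."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return round(100.0 * gc / len(seq), 2)


# First 20 bases of the gene sequence in this record (illustration only).
prefix = "GTGGGGAAACTCACGGGCAA"
print(gc_content(prefix))
```

Running the same function on the full 1419-base sequence would be the way to check the listed value of 51.44.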
Upstream 100 bases:
>100_bases CGTTAATGCGCCCATCGGTAAACCGCCGGTCTTTTATCGCGCCGATAGCGAATGGGCGGCACTGTTCGCGTCGTTAAAAA GCATCTAATCACATCCGCCC
Downstream 100 bases:
>100_bases TTATAATAAACAACGCCCTGCGGGGCGTTATAAGGAGTGATTATGTCTGACCATAATCCACTGACATTAAAACTGAATCT GCGGGAAAAAATCGCCTATG
Product: putative transporter
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 472; Mature: 471
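The two residue counts are consistent with the 1419-base gene sequence: 1419 bases are 473 codons, the last of which is the TAA stop codon, leaving 472 translated residues. The mature sequence in this record is the translated sequence without its first residue, which gives 471. A minimal sketch of that arithmetic, with hypothetical function names:

```python
def translated_length(n_bases: int) -> int:
    """Residue count for a coding sequence that ends in one stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return n_bases // 3 - 1  # the stop codon encodes no residue


def mature_length(n_translated: int, initiator_removed: bool = True) -> int:
    """Residue count after optional removal of the initiator residue."""
    return n_translated - 1 if initiator_removed else n_translated


n_translated = translated_length(1419)
print(n_translated)
print(mature_length(n_translated))
```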
Protein sequence:
>472_residues MGKLTGKGRTTMSHITTEDPATLRLPFKEKLSYGIGDLASNILLDIGTLYLLKFYTDVLGLPGTYGGIIFLISKFFTAFT DMGTGIMLDSRRKIGPKGKFRPFILYASFPVTLLEIANFVGTPFDVTGKTVMATILFMLYGLFFSMMNCSYGAMVPAITK NPNERASLAAWRQGGATLGLLLCTVGFVPVMNLIEGNQQLGYIFAATLFSLFGLLFMWICYSGVKERYVKTQPTNPAQKP GLLQSFRAIAGNRPLFILCIANLCTLGAFNVKLAIQVYYTQYVLNDPILLSYMGFFSMGCIFIGVFLMPGAVRRFGKKKV YIGGLLIWVLGDLLNYFFGGGSVSFVAFSCLAFFGSAFVNSLNWALVSDTVEYGEWRTGVRSEGTVYTGFTFFRKVSQAL AGFFPGWMLTQIGYVPNVAQADHTIEGLRQLIFIYPSALAVVTIVAMGCFYSLNEKMYVRIVEEIEARKRTA
Sequences:
>Translated_472_residues MGKLTGKGRTTMSHITTEDPATLRLPFKEKLSYGIGDLASNILLDIGTLYLLKFYTDVLGLPGTYGGIIFLISKFFTAFT DMGTGIMLDSRRKIGPKGKFRPFILYASFPVTLLEIANFVGTPFDVTGKTVMATILFMLYGLFFSMMNCSYGAMVPAITK NPNERASLAAWRQGGATLGLLLCTVGFVPVMNLIEGNQQLGYIFAATLFSLFGLLFMWICYSGVKERYVKTQPTNPAQKP GLLQSFRAIAGNRPLFILCIANLCTLGAFNVKLAIQVYYTQYVLNDPILLSYMGFFSMGCIFIGVFLMPGAVRRFGKKKV YIGGLLIWVLGDLLNYFFGGGSVSFVAFSCLAFFGSAFVNSLNWALVSDTVEYGEWRTGVRSEGTVYTGFTFFRKVSQAL AGFFPGWMLTQIGYVPNVAQADHTIEGLRQLIFIYPSALAVVTIVAMGCFYSLNEKMYVRIVEEIEARKRTA >Mature_471_residues GKLTGKGRTTMSHITTEDPATLRLPFKEKLSYGIGDLASNILLDIGTLYLLKFYTDVLGLPGTYGGIIFLISKFFTAFTD MGTGIMLDSRRKIGPKGKFRPFILYASFPVTLLEIANFVGTPFDVTGKTVMATILFMLYGLFFSMMNCSYGAMVPAITKN PNERASLAAWRQGGATLGLLLCTVGFVPVMNLIEGNQQLGYIFAATLFSLFGLLFMWICYSGVKERYVKTQPTNPAQKPG LLQSFRAIAGNRPLFILCIANLCTLGAFNVKLAIQVYYTQYVLNDPILLSYMGFFSMGCIFIGVFLMPGAVRRFGKKKVY IGGLLIWVLGDLLNYFFGGGSVSFVAFSCLAFFGSAFVNSLNWALVSDTVEYGEWRTGVRSEGTVYTGFTFFRKVSQALA GFFPGWMLTQIGYVPNVAQADHTIEGLRQLIFIYPSALAVVTIVAMGCFYSLNEKMYVRIVEEIEARKRTA
Specific function: Unknown
COG id: COG2211
COG function: function code G; Na+/melibiose symporter and related transporters
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein [H]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the sodium:galactoside symporter (TC 2.A.2) family [H]
Homologues:
Organism=Homo sapiens, GI122937339, Length=448, Percent_Identity=24.1071428571429, Blast_Score=69, Evalue=1e-11
Organism=Escherichia coli, GI145693206, Length=461, Percent_Identity=98.9154013015184, Blast_Score=926, Evalue=0.0
Organism=Escherichia coli, GI48994989, Length=456, Percent_Identity=64.6929824561403, Blast_Score=620, Evalue=1e-179
Organism=Escherichia coli, GI1787902, Length=453, Percent_Identity=33.3333333333333, Blast_Score=215, Evalue=4e-57
Organism=Escherichia coli, GI87082306, Length=451, Percent_Identity=29.0465631929047, Blast_Score=177, Evalue=1e-45
Organism=Escherichia coli, GI1786466, Length=420, Percent_Identity=27.3809523809524, Blast_Score=170, Evalue=2e-43
Organism=Escherichia coli, GI1790561, Length=450, Percent_Identity=26.2222222222222, Blast_Score=139, Evalue=5e-34
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR011701
- InterPro: IPR016196
- InterPro: IPR001927
- InterPro: IPR018043 [H]
Pfam domain/function: PF07690 MFS_1 [H]
EC number: NA
Molecular weight: Translated: 52171; Mature: 52040
Theoretical pI: Translated: 9.61; Mature: 9.61
Prosite motif: PS00872 NA_GALACTOSIDE_SYMP
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.7 %Cys (Translated Protein)
3.6 %Met (Translated Protein)
5.3 %Cys+Met (Translated Protein)
1.7 %Cys (Mature Protein)
3.4 %Met (Mature Protein)
5.1 %Cys+Met (Mature Protein)
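Percentages of this kind are residue counts divided by sequence length. The sketch below uses a hypothetical helper and, for brevity, only the first 12 residues of the translated protein and the corresponding 11 residues of the mature form; it is not the database's own calculation.

```python
from typing import Dict


def cys_met_percent(protein: str) -> Dict[str, float]:
    """Percent Cys, Met and Cys+Met in a one-letter protein sequence."""
    protein = protein.upper()
    n = len(protein)
    if n == 0:
        raise ValueError("empty sequence")
    cys = protein.count("C")
    met = protein.count("M")
    return {
        "Cys": round(100.0 * cys / n, 1),
        "Met": round(100.0 * met / n, 1),
        "Cys+Met": round(100.0 * (cys + met) / n, 1),
    }


# Short prefixes of the record's sequences (illustration only).
translated_prefix = "MGKLTGKGRTTM"
mature_prefix = translated_prefix[1:]  # initiator residue removed

print(cys_met_percent(translated_prefix))
print(cys_met_percent(mature_prefix))
```

Removing the initiator methionine lowers the Met share but leaves the Cys count unchanged, which is the pattern seen in the translated and mature figures above.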
Secondary structure:
>Translated Secondary Structure
MGKLTGKGRTTMSHITTEDPATLRLPFKEKLSYGIGDLASNILLDIGTLYLLKFYTDVLG
CCCCCCCCCCHHHHCCCCCCCEEECCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHC
LPGTYGGIIFLISKFFTAFTDMGTGIMLDSRRKIGPKGKFRPFILYASFPVTLLEIANFV
CCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCEEEEEECCHHHHHHHHHHH
GTPFDVTGKTVMATILFMLYGLFFSMMNCSYGAMVPAITKNPNERASLAAWRQGGATLGL
CCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHCCCCHHHH
LLCTVGFVPVMNLIEGNQQLGYIFAATLFSLFGLLFMWICYSGVKERYVKTQPTNPAQKP
HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC
GLLQSFRAIAGNRPLFILCIANLCTLGAFNVKLAIQVYYTQYVLNDPILLSYMGFFSMGC
CHHHHHHHHHCCCCEEEHHHHHHHHCCCCCEEEEEEEEEEHHHHCCCHHHHHHHHHHHHH
IFIGVFLMPGAVRRFGKKKVYIGGLLIWVLGDLLNYFFGGGSVSFVAFSCLAFFGSAFVN
HHHHHHHCCHHHHHHCCCEEEHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHH
SLNWALVSDTVEYGEWRTGVRSEGTVYTGFTFFRKVSQALAGFFPGWMLTQIGYVPNVAQ
HCCHHHHHHHHHHHHHHCCCCCCCCEEEHHHHHHHHHHHHHHHCCHHHHHHHCCCCCHHH
ADHTIEGLRQLIFIYPSALAVVTIVAMGCFYSLNEKMYVRIVEEIEARKRTA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCC
>Mature Secondary Structure
GKLTGKGRTTMSHITTEDPATLRLPFKEKLSYGIGDLASNILLDIGTLYLLKFYTDVLG
CCCCCCCCCHHHHCCCCCCCEEECCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHC
LPGTYGGIIFLISKFFTAFTDMGTGIMLDSRRKIGPKGKFRPFILYASFPVTLLEIANFV
CCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCEEEEEECCHHHHHHHHHHH
GTPFDVTGKTVMATILFMLYGLFFSMMNCSYGAMVPAITKNPNERASLAAWRQGGATLGL
CCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHCCCCHHHH
LLCTVGFVPVMNLIEGNQQLGYIFAATLFSLFGLLFMWICYSGVKERYVKTQPTNPAQKP
HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC
GLLQSFRAIAGNRPLFILCIANLCTLGAFNVKLAIQVYYTQYVLNDPILLSYMGFFSMGC
CHHHHHHHHHCCCCEEEHHHHHHHHCCCCCEEEEEEEEEEHHHHCCCHHHHHHHHHHHHH
IFIGVFLMPGAVRRFGKKKVYIGGLLIWVLGDLLNYFFGGGSVSFVAFSCLAFFGSAFVN
HHHHHHHCCHHHHHHCCCEEEHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHH
SLNWALVSDTVEYGEWRTGVRSEGTVYTGFTFFRKVSQALAGFFPGWMLTQIGYVPNVAQ
HCCHHHHHHHHHHHHHHCCCCCCCCEEEHHHHHHHHHHHHHHHCCHHHHHHHCCCCCHHH
ADHTIEGLRQLIFIYPSALAVVTIVAMGCFYSLNEKMYVRIVEEIEARKRTA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 8346018; 9278503 [H]