| Field | Value |
|---|---|
| Definition | Escherichia coli ED1a chromosome, complete genome. |
| Accession | NC_011745 |
| Length | 5,209,548 |
The map label for this gene is ybhI [C]
Identifier: 218692342
GI number: 218692342
Start: 4734705
End: 4736210
Strand: Direct
Name: ybhI [C]
Synonym: ECED1_4775
Alternate gene names: 218692342
Gene position: 4734705-4736210 (Clockwise)
Preceding gene: 218692341
Following gene: 218692345
Centisome position: 90.89
GC content: 51.13
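The positional fields above are related by simple arithmetic. The following is a minimal sketch, assuming the centisome position is the start coordinate expressed as a percentage of the chromosome length (an assumption that reproduces the listed value, not a documented definition) and that the coordinates are 1-based and inclusive. The function names are hypothetical.

```python
GENOME_LENGTH = 5_209_548  # chromosome length from the record header
GENE_START = 4_734_705     # "Start" field
GENE_END = 4_736_210       # "End" field


def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of genome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


def residues_encoded(n_bases: int) -> int:
    """Amino acids encoded by an ORF of n_bases that ends in one stop codon."""
    return n_bases // 3 - 1


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    return round(100.0 * (seq.count("G") + seq.count("C")) / len(seq), 2)


n = gene_length(GENE_START, GENE_END)
print(n)                                     # 1506
print(residues_encoded(n))                   # 501
print(centisome(GENE_START, GENOME_LENGTH))  # 90.89
```

The 1506-base length and the 501-residue count agree with the sequence headers in this record. `gc_content` can be applied to the gene sequence to check the listed GC value.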
Gene sequence:
>1506_bases GTGAATATGTCATCAATATCCCATGGCGCACCACAAAAGCGTCGAATTATCCCTAATCCCGGCCTGTGGTTGGCGATCAT TGCCGGAATTATTATTACCCTTCTGCCACTGGGCGATACCTTGCCCGTCGCTGGACAAAATATGATCGCCATTCTGGTGT TTGCCATTATTGTCTGGATCAGCGAAGCGATGGATTACACCGCCAGTGCAATTGTTATTTCCGCACTGATTATCTTTATG GTTGGTTTTGCCCCGGATATGAATCACCCGGACACGATCCTTGGCACCGCGAAAGCGCTGAAAATGACTCTGTCGGGTTT TTCTAATTCTGCTCTGGCGCTGGTGGCAGCAGCAATGTTTATCGCGGCGGCAATGACCATTACCGGGCTGGATAAACGCA TTGCGCTGTTTACTATGTCGAAGATAGGTGCCAGCAGCCGCAGCATTATTATCGGTGCCATTGTGGTGACCATCGTCCTG AGCCTGGTTGTACCCAGTGCCACTGCCCGTACGGCTTGCGTGGTGCCGATTATGATGGGGGTCATTGCGGCATTTAAGGT GGATAAACATTCACGTTTAGCGGCGTCAATGATGATTGTTATCGCCCAGGCGACCAGTATCTGGAACGTTGGTATTCAAA CTTCGGCGGCGCAAAACCTGCTTTCTATCGGTTTTATCAATAAAACTTTCGGCGCCGGGCATTCTGTCAGCTGGCTTGAC TGGCTGTTAGCGGGCGCGCCCTGGAGCCTCACCATGTCGGCGATCCTCTATTTCCTCGCACGTAAATTGTTGCCGCCAGA AACGGAAGCTGTTGAAGGGGGGAGCGAGGCCATTAAAAAAGCACTTGCAGAGTTAGGGCCAACCACTGGTAAAGAAAAAC GTTTGATCGGTATTTCGCTATTGTTACTGCTCTTTTGGTCTACTGGCGGAAAATTACACAGTATTGATACCACCTCTGTC ACCCTTGCCGGACTGGCGATTATGTTGCTGCCAGGCATCGGTGTAATGAGCTGGAAAGAGGTCGAAAAACGTGTGCAGTG GGGCACGTTGCTTATGTTTGGTATTGGTATCAGCCTGGGGTCCACGCTGCTTGATACGCAGGCAGCTTCATGGATGGCGA ACTATGTGGTGAAAGGTTTTGGTCTCGATGGCTTGCCATCATTAGCCATCTTCGCGATTCTCGCCGCTTTCTTGATAATT ATTCACCTGGGGTTTGCCAGTGCCACTGCGCTAACTGCCGCATTGCTGCCCATCCTGATTAGCCTGTTAAGCAGTCTGCC GCCAGAACTCGGCGTTAACCCGGTCGGCATGACCATTCTGCTCGCCTTCAGTGTCAGTTTTGGCTTCATCCTGCCGATTA ACGCGCCACAAAATATGGTGTGTATGGGTACGGATACCTTCACGCCACGCCAGTTTACCCGTGTGGGACTGTACCTGACG GTGATCGGCTACCTGTTGTTGCTACTGTTTGCCGCCACCTGGTGGAAAATTTTAGGTCTAATGTGA
Upstream 100 bases:
>100_bases AATAAATAATAAAATCATTAAATAACAAAGCGTTATTAAAAAATTAAAGATGGCATAGCTCCTGCAATAGCCAGGCAGGT TTTTTATTTTTCACAATGAG
Downstream 100 bases:
>100_bases GGAAACCAACGATGTCATTACAAATTGCTGTTGAAAAAGTGCGTTGGTTAGCTGCCGGGCTGCTGGAACTTAATGGCTGT GATGCCGATATCGCTCAGGA
Product: putative transporter
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 501; Mature: 501
Protein sequence:
>501_residues MNMSSISHGAPQKRRIIPNPGLWLAIIAGIIITLLPLGDTLPVAGQNMIAILVFAIIVWISEAMDYTASAIVISALIIFM VGFAPDMNHPDTILGTAKALKMTLSGFSNSALALVAAAMFIAAAMTITGLDKRIALFTMSKIGASSRSIIIGAIVVTIVL SLVVPSATARTACVVPIMMGVIAAFKVDKHSRLAASMMIVIAQATSIWNVGIQTSAAQNLLSIGFINKTFGAGHSVSWLD WLLAGAPWSLTMSAILYFLARKLLPPETEAVEGGSEAIKKALAELGPTTGKEKRLIGISLLLLLFWSTGGKLHSIDTTSV TLAGLAIMLLPGIGVMSWKEVEKRVQWGTLLMFGIGISLGSTLLDTQAASWMANYVVKGFGLDGLPSLAIFAILAAFLII IHLGFASATALTAALLPILISLLSSLPPELGVNPVGMTILLAFSVSFGFILPINAPQNMVCMGTDTFTPRQFTRVGLYLT VIGYLLLLLFAATWWKILGLM
Sequences:
>Translated_501_residues MNMSSISHGAPQKRRIIPNPGLWLAIIAGIIITLLPLGDTLPVAGQNMIAILVFAIIVWISEAMDYTASAIVISALIIFM VGFAPDMNHPDTILGTAKALKMTLSGFSNSALALVAAAMFIAAAMTITGLDKRIALFTMSKIGASSRSIIIGAIVVTIVL SLVVPSATARTACVVPIMMGVIAAFKVDKHSRLAASMMIVIAQATSIWNVGIQTSAAQNLLSIGFINKTFGAGHSVSWLD WLLAGAPWSLTMSAILYFLARKLLPPETEAVEGGSEAIKKALAELGPTTGKEKRLIGISLLLLLFWSTGGKLHSIDTTSV TLAGLAIMLLPGIGVMSWKEVEKRVQWGTLLMFGIGISLGSTLLDTQAASWMANYVVKGFGLDGLPSLAIFAILAAFLII IHLGFASATALTAALLPILISLLSSLPPELGVNPVGMTILLAFSVSFGFILPINAPQNMVCMGTDTFTPRQFTRVGLYLT VIGYLLLLLFAATWWKILGLM
>Mature_501_residues MNMSSISHGAPQKRRIIPNPGLWLAIIAGIIITLLPLGDTLPVAGQNMIAILVFAIIVWISEAMDYTASAIVISALIIFM VGFAPDMNHPDTILGTAKALKMTLSGFSNSALALVAAAMFIAAAMTITGLDKRIALFTMSKIGASSRSIIIGAIVVTIVL SLVVPSATARTACVVPIMMGVIAAFKVDKHSRLAASMMIVIAQATSIWNVGIQTSAAQNLLSIGFINKTFGAGHSVSWLD WLLAGAPWSLTMSAILYFLARKLLPPETEAVEGGSEAIKKALAELGPTTGKEKRLIGISLLLLLFWSTGGKLHSIDTTSV TLAGLAIMLLPGIGVMSWKEVEKRVQWGTLLMFGIGISLGSTLLDTQAASWMANYVVKGFGLDGLPSLAIFAILAAFLII IHLGFASATALTAALLPILISLLSSLPPELGVNPVGMTILLAFSVSFGFILPINAPQNMVCMGTDTFTPRQFTRVGLYLT VIGYLLLLLFAATWWKILGLM
Specific function: Unknown
COG id: COG0471
COG function: function code P; Di- and tricarboxylate transporters
Gene ontology:
Cell location: Cell membrane; Multi-pass membrane protein (Potential) [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the SLC13A transporter (TC 2.A.47) family. SODIT1 subfamily [H]
Homologues:
Organism=Homo sapiens, GI29171306, Length=461, Percent_Identity=22.7765726681128, Blast_Score=74, Evalue=3e-13
Organism=Homo sapiens, GI4506979, Length=547, Percent_Identity=22.3034734917733, Blast_Score=68, Evalue=2e-11
Organism=Homo sapiens, GI301069349, Length=520, Percent_Identity=21.3461538461538, Blast_Score=67, Evalue=3e-11
Organism=Homo sapiens, GI31795546, Length=290, Percent_Identity=22.7586206896552, Blast_Score=66, Evalue=9e-11
Organism=Escherichia coli, GI1786986, Length=411, Percent_Identity=30.9002433090024, Blast_Score=130, Evalue=3e-31
Organism=Escherichia coli, GI1786829, Length=481, Percent_Identity=27.4428274428274, Blast_Score=129, Evalue=4e-31
Organism=Escherichia coli, GI1789444, Length=423, Percent_Identity=28.6052009456265, Blast_Score=114, Evalue=2e-26
Organism=Caenorhabditis elegans, GI32564681, Length=458, Percent_Identity=22.2707423580786, Blast_Score=74, Evalue=2e-13
Organism=Caenorhabditis elegans, GI71988385, Length=536, Percent_Identity=22.9477611940299, Blast_Score=74, Evalue=2e-13
Organism=Caenorhabditis elegans, GI32565804, Length=458, Percent_Identity=22.707423580786, Blast_Score=72, Evalue=6e-13
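The homologue entries follow a regular pattern of comma-separated `Key=value` fields, with each hit starting at `Organism=` and the GI number given as a bare `GI…` token. A minimal sketch of turning them into structured records is shown here; the function name `parse_homologues` and the output keys are hypothetical, and the sample text is two hits copied from this record.

```python
import re


def parse_homologues(text: str) -> list[dict]:
    """Split a homologue listing into one dict per BLAST hit."""
    records = []
    # Each hit begins with "Organism="; hits may be separated by commas,
    # spaces, or newlines, so split on any run of those before the key.
    for chunk in re.split(r"[,\s]+(?=Organism=)", text.strip(", \n")):
        raw = {}
        for item in chunk.split(","):
            item = item.strip()
            if not item:
                continue
            if "=" in item:
                key, value = item.split("=", 1)
                raw[key] = value
            elif item.startswith("GI"):
                raw["GI"] = item[2:]  # bare "GI12345" token
        records.append({
            "organism": raw["Organism"],
            "gi": raw["GI"],
            "length": int(raw["Length"]),
            "percent_identity": float(raw["Percent_Identity"]),
            "blast_score": int(raw["Blast_Score"]),
            "evalue": float(raw["Evalue"]),
        })
    return records


sample = (
    "Organism=Homo sapiens, GI29171306, Length=461, "
    "Percent_Identity=22.7765726681128, Blast_Score=74, Evalue=3e-13, "
    "Organism=Escherichia coli, GI1786986, Length=411, "
    "Percent_Identity=30.9002433090024, Blast_Score=130, Evalue=3e-31,"
)
hits = parse_homologues(sample)
best = max(hits, key=lambda h: h["blast_score"])
print(best["organism"], best["gi"], best["blast_score"])
# Escherichia coli 1786986 130
```

Once parsed, hits can be sorted by score or filtered by E-value, which is easier than reading the long-precision identity values by eye.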
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001898 [H]
Pfam domain/function: PF00939 Na_sulph_symp [H]
EC number: NA
Molecular weight: Translated: 52977; Mature: 52977
Theoretical pI: Translated: 10.05; Mature: 10.05
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.4 %Cys (Translated Protein)
4.6 %Met (Translated Protein)
5.0 %Cys+Met (Translated Protein)
0.4 %Cys (Mature Protein)
4.6 %Met (Mature Protein)
5.0 %Cys+Met (Mature Protein)
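The Cys/Met figures are residue counts expressed as a percentage of chain length. A minimal sketch of that calculation follows, assuming one-decimal rounding as in the listing; the function name `residue_percent` is hypothetical and the input is a toy sequence, not the protein from this record.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the chain made up of any of the given one-letter residues."""
    protein = "".join(protein.split()).upper()  # tolerate spaces in pasted sequences
    count = sum(protein.count(r) for r in residues.upper())
    return round(100.0 * count / len(protein), 1)


toy = "MACMKKKKKK"  # 10 residues: 2 Met, 1 Cys
print(residue_percent(toy, "C"))   # 10.0
print(residue_percent(toy, "M"))   # 20.0
print(residue_percent(toy, "CM"))  # 30.0
```

By the same arithmetic, two cysteines in a 501-residue chain round to 0.4 %.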
Secondary structure:
>Translated Secondary Structure
MNMSSISHGAPQKRRIIPNPGLWLAIIAGIIITLLPLGDTLPVAGQNMIAILVFAIIVWI
CCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHH
SEAMDYTASAIVISALIIFMVGFAPDMNHPDTILGTAKALKMTLSGFSNSALALVAAAMF
HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH
IAAAMTITGLDKRIALFTMSKIGASSRSIIIGAIVVTIVLSLVVPSATARTACVVPIMMG
HHHHHHHHCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH
VIAAFKVDKHSRLAASMMIVIAQATSIWNVGIQTSAAQNLLSIGFINKTFGAGHSVSWLD
HHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCHHHHCCCCCCHHHHH
WLLAGAPWSLTMSAILYFLARKLLPPETEAVEGGSEAIKKALAELGPTTGKEKRLIGISL
HHHHCCCHHHHHHHHHHHHHHHHCCCCCHHCCCCHHHHHHHHHHCCCCCCCCCHHHHHHH
LLLLFWSTGGKLHSIDTTSVTLAGLAIMLLPGIGVMSWKEVEKRVQWGTLLMFGIGISLG
HHHHHHCCCCEEEECCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHC
STLLDTQAASWMANYVVKGFGLDGLPSLAIFAILAAFLIIIHLGFASATALTAALLPILI
HHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
SLLSSLPPELGVNPVGMTILLAFSVSFGFILPINAPQNMVCMGTDTFTPRQFTRVGLYLT
HHHHHCCCCCCCCCHHHHHHHHHHHCCCEEEECCCCCCEEEECCCCCCCHHHHHHHHHHH
VIGYLLLLLFAATWWKILGLM
HHHHHHHHHHHHHHHHHHCCC
>Mature Secondary Structure
MNMSSISHGAPQKRRIIPNPGLWLAIIAGIIITLLPLGDTLPVAGQNMIAILVFAIIVWI
CCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHH
SEAMDYTASAIVISALIIFMVGFAPDMNHPDTILGTAKALKMTLSGFSNSALALVAAAMF
HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH
IAAAMTITGLDKRIALFTMSKIGASSRSIIIGAIVVTIVLSLVVPSATARTACVVPIMMG
HHHHHHHHCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH
VIAAFKVDKHSRLAASMMIVIAQATSIWNVGIQTSAAQNLLSIGFINKTFGAGHSVSWLD
HHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCHHHHCCCCCCHHHHH
WLLAGAPWSLTMSAILYFLARKLLPPETEAVEGGSEAIKKALAELGPTTGKEKRLIGISL
HHHHCCCHHHHHHHHHHHHHHHHCCCCCHHCCCCHHHHHHHHHHCCCCCCCCCHHHHHHH
LLLLFWSTGGKLHSIDTTSVTLAGLAIMLLPGIGVMSWKEVEKRVQWGTLLMFGIGISLG
HHHHHHCCCCEEEECCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHC
STLLDTQAASWMANYVVKGFGLDGLPSLAIFAILAAFLIIIHLGFASATALTAALLPILI
HHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
SLLSSLPPELGVNPVGMTILLAFSVSFGFILPINAPQNMVCMGTDTFTPRQFTRVGLYLT
HHHHHCCCCCCCCCHHHHHHHHHHHCCCEEEECCCCCCEEEECCCCCCCHHHHHHHHHHH
VIGYLLLLLFAATWWKILGLM
HHHHHHHHHHHHHHHHHHCCC
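The prediction strings above use one letter per residue. Assuming the usual three-state convention (H for helix, E for strand, C for coil), a minimal sketch of summarising such a string is given here. The function name `structure_composition` is hypothetical, the input is a toy string, and nothing here is claimed to be how the record's "Structure class" field was derived.

```python
from collections import Counter


def structure_composition(ss: str) -> dict[str, float]:
    """Percentage of H, E and C states in a secondary-structure string."""
    ss = "".join(ss.split()).upper()  # drop spaces/newlines between chunks
    counts = Counter(ss)
    return {state: round(100.0 * counts.get(state, 0) / len(ss), 1)
            for state in "HEC"}


print(structure_composition("CCHHHHHHEE"))
# {'H': 60.0, 'E': 20.0, 'C': 20.0}
```

Applied to the structure lines only (not the interleaved sequence lines), this gives a quick view of how helix-dominated a prediction is.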
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 8405966 [H]