Definition Escherichia coli 55989, complete genome.
Accession NC_011748
Length 5,154,862

The map label for this gene is yicJ

Identifier: 218697378

GI number: 218697378

Start: 4209569

End: 4210951

Strand: Reverse

Name: yicJ

Synonym: EC55989_4124

Alternate gene names: 218697378

Gene position: 4210951-4209569 (Counterclockwise)

Preceding gene: 218697382

Following gene: 218697377

Centisome position: 81.69
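
The coordinate fields above are related by simple arithmetic. The sketch below shows one way to reproduce the gene length and the centisome value. It assumes that the centisome position is the gene's first base (the higher coordinate for a reverse-strand gene) expressed as a percentage of the genome length. That definition is an assumption that happens to match the listed 81.69; it is not a documented rule of this database. The function names are illustrative.

```python
# Values copied from the record above.
GENOME_LENGTH = 5_154_862
START, END = 4_209_569, 4_210_951
STRAND = "Reverse"


def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive, 1-based coordinate range."""
    return end - start + 1


def centisome(start: int, end: int, strand: str, genome_length: int) -> float:
    """Position of the gene's first base as a percentage of the genome.

    Assumption: on the reverse strand the first base is the higher coordinate.
    """
    first_base = end if strand == "Reverse" else start
    return round(100 * first_base / genome_length, 2)


print(gene_length(START, END))                      # 1383
print(centisome(START, END, STRAND, GENOME_LENGTH)) # 81.69
```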

GC content: 49.39
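
GC content is conventionally the percentage of G and C bases in a sequence. A minimal sketch of that calculation follows. The function name is hypothetical, and whether the database counts ambiguous bases the same way is not stated in the record.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C among the A/C/G/T bases of a sequence."""
    # Drop whitespace and line breaks so a multi-line FASTA body can be passed in.
    seq = "".join(dna.split()).upper()
    counted = [base for base in seq if base in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for base in counted if base in "GC")
    return 100 * gc / len(counted)


# Usage: paste the gene sequence from this record as the argument and
# round the result to two decimals to compare it with the listed value.
print(round(gc_percent("ATGAAGAGTG"), 2))  # 40.0
```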

Gene sequence:

>1383_bases
ATGAAGAGTGAAGTGTTGTCCGTTAAAGAGAAAATTGGTTATGGCATGGGAGACGCCGCCAGCCACATTATTTTCGATAA
CGTAATGTTATATATGATGTTCTTTTATACCGATATTTTTGGCATTCCTGCCGGATTTGTCGGAACCATGTTTTTGGTCG
CTCGTGCACTGGATGCGATTTCCGATCCTTGCATGGGGTTGTTGGCCGATCGAACGCGCTCTCGCTGGGGTAAATTTCGT
CCGTGGGTACTGTTTGGCGCACTGCCATTCGGGATCGTCTGTGTACTGGCCTATAGCACGCCAGATCTCAGTATGAACGG
CAAAATGATCTATGCAGCAATTACTTACACCCTACTTACCTTACTTTATACCGTCGTCAATATCCCTTACTGCGCATTGG
GTGGTGTAATCACCAATGACCCGACTCAGCGTATCTCGCTGCAATCCTGGCGTTTTGTGCTGGCGACCGCGGGAGGCATG
CTTTCTACTGTTCTGATGATGCCACTGGTTAATTTAATTGGCGGTGATAATAAACCACTCGGTTTCCAGGGCGGTATCGC
GGTCCTTTCCGTGGTGGCATTCATGATGCTGGCATTTTGTTTCTTCACCACTAAAGAACGCGTTGAAGCACCACCTACAA
CAACGTCTATGCGGGAAGATTTACGTGATATCTGGCAAAACGACCAGTGGCGGATTGTCGGTTTACTAACCATTTTCAAT
ATCCTGGCGGTGTGCGTACGCGGTGGGGCGATGATGTATTACGTCACATGGATTTTGGGCACGCCGGAAGTGTTTGTCGC
TTTTCTCACCACTTATTGCGTGGGTAACCTGATTGGTTCCGCACTGGCAAAACCTCTGACCGACTGGAAATGTAAAGTCA
CTATCTTCTGGTGGACGAACGCCCTGCTGGCAGTGATTAGCCTCGCGATGTTCTTTGTTCCCATGCAGGCCAGCATCACT
ATGTTTGTCTTCATCTTCGTGATTGGTGTGTTGCATCAACTGGTGACACCTATCCAGTGGGTAATGATGTCCGATACCGT
CGACTACGGCGAGTGGTGCAATGGTAAACGCCTGACCGGGATCAGTTTTGCTGGCACGCTGTTTGTGCTCAAACTGGGGT
TGGCCTTCGGCGGCGCTCTTATCGGCTGGATGCTGGCTTATGGCGGATATGATGCGGCAGAAAAAGCGCAGAACAGCGCC
ACGATTAGCATCATTATTGCGCTATTCACGATTGTTCCGGCGATCTGTTATTTGCTGAGCGCGATTATCGCTAAACGCTA
CTACTCACTCACGACGCACAATCTGAAAACCGTTATGGAACAGCTGGCTCAGGGTAAACGCCGTTGCCAGCAACAATTCA
CCTCTCAAGAAGTGCAGAACTAA

Upstream 100 bases:

>100_bases
CATAAATTAATAACCAGATATCGGAATATTCGCTCTCACAGGGATGGTTTACAAAATGCGTTATCGGTCTCAGCACCCCT
ATAGCATCAAGGAAAAGCAG

Downstream 100 bases:

>100_bases
GGAACGGCAATGAAAATTAGCGATGGAAACTGGTTGATTCAACCTGGCCTCAATTTGATTCACCCGCTTCAGGTGTTCGA
GGTTGAACAGCAGGATAATG

Product: putative transporter

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 460; Mature: 460
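
The residue count follows from the gene length: 1383 bases are 461 codons, and the last codon (TAA) is a stop, leaving 460 amino acids. The sketch below translates a coding sequence with the standard genetic code to illustrate this. It assumes that the gene sequence shown in this record is already written on the coding strand (it starts with ATG), so no reverse complement is applied. The table construction and function name are illustrative.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAAGAGTGAAGTG"))  # first five codons of the gene: MKSEV
print(translate("GAAGTGCAGAACTAA"))  # last five codons, including the stop: EVQN
print(1383 // 3 - 1)                 # 460
```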

Protein sequence:

>460_residues
MKSEVLSVKEKIGYGMGDAASHIIFDNVMLYMMFFYTDIFGIPAGFVGTMFLVARALDAISDPCMGLLADRTRSRWGKFR
PWVLFGALPFGIVCVLAYSTPDLSMNGKMIYAAITYTLLTLLYTVVNIPYCALGGVITNDPTQRISLQSWRFVLATAGGM
LSTVLMMPLVNLIGGDNKPLGFQGGIAVLSVVAFMMLAFCFFTTKERVEAPPTTTSMREDLRDIWQNDQWRIVGLLTIFN
ILAVCVRGGAMMYYVTWILGTPEVFVAFLTTYCVGNLIGSALAKPLTDWKCKVTIFWWTNALLAVISLAMFFVPMQASIT
MFVFIFVIGVLHQLVTPIQWVMMSDTVDYGEWCNGKRLTGISFAGTLFVLKLGLAFGGALIGWMLAYGGYDAAEKAQNSA
TISIIIALFTIVPAICYLLSAIIAKRYYSLTTHNLKTVMEQLAQGKRRCQQQFTSQEVQN

Sequences:

>Translated_460_residues
MKSEVLSVKEKIGYGMGDAASHIIFDNVMLYMMFFYTDIFGIPAGFVGTMFLVARALDAISDPCMGLLADRTRSRWGKFR
PWVLFGALPFGIVCVLAYSTPDLSMNGKMIYAAITYTLLTLLYTVVNIPYCALGGVITNDPTQRISLQSWRFVLATAGGM
LSTVLMMPLVNLIGGDNKPLGFQGGIAVLSVVAFMMLAFCFFTTKERVEAPPTTTSMREDLRDIWQNDQWRIVGLLTIFN
ILAVCVRGGAMMYYVTWILGTPEVFVAFLTTYCVGNLIGSALAKPLTDWKCKVTIFWWTNALLAVISLAMFFVPMQASIT
MFVFIFVIGVLHQLVTPIQWVMMSDTVDYGEWCNGKRLTGISFAGTLFVLKLGLAFGGALIGWMLAYGGYDAAEKAQNSA
TISIIIALFTIVPAICYLLSAIIAKRYYSLTTHNLKTVMEQLAQGKRRCQQQFTSQEVQN
>Mature_460_residues
MKSEVLSVKEKIGYGMGDAASHIIFDNVMLYMMFFYTDIFGIPAGFVGTMFLVARALDAISDPCMGLLADRTRSRWGKFR
PWVLFGALPFGIVCVLAYSTPDLSMNGKMIYAAITYTLLTLLYTVVNIPYCALGGVITNDPTQRISLQSWRFVLATAGGM
LSTVLMMPLVNLIGGDNKPLGFQGGIAVLSVVAFMMLAFCFFTTKERVEAPPTTTSMREDLRDIWQNDQWRIVGLLTIFN
ILAVCVRGGAMMYYVTWILGTPEVFVAFLTTYCVGNLIGSALAKPLTDWKCKVTIFWWTNALLAVISLAMFFVPMQASIT
MFVFIFVIGVLHQLVTPIQWVMMSDTVDYGEWCNGKRLTGISFAGTLFVLKLGLAFGGALIGWMLAYGGYDAAEKAQNSA
TISIIIALFTIVPAICYLLSAIIAKRYYSLTTHNLKTVMEQLAQGKRRCQQQFTSQEVQN

Specific function: Unknown

COG id: COG2211

COG function: function code G; Na+/melibiose symporter and related transporters

Gene ontology:

Cell location: Cell inner membrane; Multi-pass membrane protein

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the sodium:galactoside symporter (TC 2.A.2) family

Homologues:

Organism=Homo sapiens, GI122937339, Length=453, Percent_Identity=23.1788079470199, Blast_Score=69, Evalue=6e-12,
Organism=Escherichia coli, GI87082306, Length=460, Percent_Identity=100, Blast_Score=940, Evalue=0.0,
Organism=Escherichia coli, GI1786466, Length=451, Percent_Identity=43.9024390243902, Blast_Score=375, Evalue=1e-105,
Organism=Escherichia coli, GI1790561, Length=464, Percent_Identity=28.0172413793103, Blast_Score=194, Evalue=9e-51,
Organism=Escherichia coli, GI145693206, Length=451, Percent_Identity=29.0465631929047, Blast_Score=187, Evalue=1e-48,
Organism=Escherichia coli, GI48994989, Length=452, Percent_Identity=29.2035398230088, Blast_Score=169, Evalue=4e-43,
Organism=Escherichia coli, GI1787902, Length=460, Percent_Identity=28.0434782608696, Blast_Score=164, Evalue=1e-41,
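
Each homologue line above is a comma-separated list of `key=value` fields plus a bare `GI…` identifier. The sketch below shows one way to parse such a line into a dictionary. The function name and the choice of numeric conversions are assumptions made for illustration, not part of the database.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            # The GI number is the only field written without '='.
            record["GI"] = field[2:]
    # Convert the numeric fields when they are present.
    for key in ("Length", "Blast_Score"):
        if key in record:
            record[key] = int(record[key])
    for key in ("Percent_Identity", "Evalue"):
        if key in record:
            record[key] = float(record[key])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1786466, Length=451, "
    "Percent_Identity=43.9024390243902, Blast_Score=375, Evalue=1e-105,"
)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 1))
```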

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): YICJ_ECOLI (P31435)

Other databases:

- EMBL:   L10328
- EMBL:   U00096
- EMBL:   AP009048
- RefSeq:   AP_004135.1
- RefSeq:   NP_418114.4
- ProteinModelPortal:   P31435
- STRING:   P31435
- EnsemblBacteria:   EBESCT00000002589
- EnsemblBacteria:   EBESCT00000018302
- GeneID:   948168
- GenomeReviews:   AP009048_GR
- GenomeReviews:   U00096_GR
- KEGG:   ecj:JW5939
- KEGG:   eco:b3657
- EchoBASE:   EB1637
- EcoGene:   EG11686
- eggNOG:   COG2211
- GeneTree:   EBGT00050000009910
- HOGENOM:   HBG518114
- OMA:   KERVSVQ
- ProtClustDB:   PRK11462
- BioCyc:   EcoCyc:YICJ-MONOMER
- Genevestigator:   P31435
- InterPro:   IPR016196
- InterPro:   IPR001927
- InterPro:   IPR018043
- TIGRFAMs:   TIGR00792

Pfam domain/function: SSF103473 MFS_gen_substrate_transporter

EC number: NA

Molecular weight: Translated: 50949; Mature: 50949

Theoretical pI: Translated: 8.51; Mature: 8.51

Prosite motif: PS00872 NA_GALACTOSIDE_SYMP

Important sites: NA

Signals:

None

Transmembrane regions:

11 predicted regions; coordinates not available

Cys/Met content:

2.2 %Cys     (Translated Protein)
5.2 %Met     (Translated Protein)
7.4 %Cys+Met (Translated Protein)
2.2 %Cys     (Mature Protein)
5.2 %Met     (Mature Protein)
7.4 %Cys+Met (Mature Protein)
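
The percentages above are residue counts divided by the protein length. A minimal sketch of that calculation is given here, shown on the first 20 residues of the protein rather than the full sequence. The function name is hypothetical.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty protein sequence")
    hits = sum(seq.count(r) for r in residues.upper())
    return round(100 * hits / len(seq), 1)


fragment = "MKSEVLSVKEKIGYGMGDAA"  # first 20 residues of the protein above
print(residue_percent(fragment, "C"))   # 0.0
print(residue_percent(fragment, "M"))   # 10.0
print(residue_percent(fragment, "CM"))  # 10.0
```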

Secondary structure:

>Translated Secondary Structure
MKSEVLSVKEKIGYGMGDAASHIIFDNVMLYMMFFYTDIFGIPAGFVGTMFLVARALDAI
CCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHH
SDPCMGLLADRTRSRWGKFRPWVLFGALPFGIVCVLAYSTPDLSMNGKMIYAAITYTLLT
CCHHHHHHHHHHHHHCCCCCCHHEEHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHH
LLYTVVNIPYCALGGVITNDPTQRISLQSWRFVLATAGGMLSTVLMMPLVNLIGGDNKPL
HHHHHHCCCHHHHCCCCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCC
GFQGGIAVLSVVAFMMLAFCFFTTKERVEAPPTTTSMREDLRDIWQNDQWRIVGLLTIFN
CCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCEEEHHHHHHHH
ILAVCVRGGAMMYYVTWILGTPEVFVAFLTTYCVGNLIGSALAKPLTDWKCKVTIFWWTN
HHHHHHHCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEHHH
ALLAVISLAMFFVPMQASITMFVFIFVIGVLHQLVTPIQWVMMSDTVDYGEWCNGKRLTG
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHCCCCCEECC
ISFAGTLFVLKLGLAFGGALIGWMLAYGGYDAAEKAQNSATISIIIALFTIVPAICYLLS
HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHCCCHHHHHHHHHHHHHHHHHHHHH
AIIAKRYYSLTTHNLKTVMEQLAQGKRRCQQQFTSQEVQN
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCH
>Mature Secondary Structure
MKSEVLSVKEKIGYGMGDAASHIIFDNVMLYMMFFYTDIFGIPAGFVGTMFLVARALDAI
CCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHH
SDPCMGLLADRTRSRWGKFRPWVLFGALPFGIVCVLAYSTPDLSMNGKMIYAAITYTLLT
CCHHHHHHHHHHHHHCCCCCCHHEEHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHH
LLYTVVNIPYCALGGVITNDPTQRISLQSWRFVLATAGGMLSTVLMMPLVNLIGGDNKPL
HHHHHHCCCHHHHCCCCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCC
GFQGGIAVLSVVAFMMLAFCFFTTKERVEAPPTTTSMREDLRDIWQNDQWRIVGLLTIFN
CCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCEEEHHHHHHHH
ILAVCVRGGAMMYYVTWILGTPEVFVAFLTTYCVGNLIGSALAKPLTDWKCKVTIFWWTN
HHHHHHHCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEHHH
ALLAVISLAMFFVPMQASITMFVFIFVIGVLHQLVTPIQWVMMSDTVDYGEWCNGKRLTG
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHCCCCCEECC
ISFAGTLFVLKLGLAFGGALIGWMLAYGGYDAAEKAQNSATISIIIALFTIVPAICYLLS
HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHCCCHHHHHHHHHHHHHHHHHHHHH
AIIAKRYYSLTTHNLKTVMEQLAQGKRRCQQQFTSQEVQN
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCH

PDB accession: NA

Resolution: NA

Structure class: Alpha
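
One way to relate the structure class to the per-residue prediction above is to tally the three states used there. The sketch assumes that H, E and C stand for helix, strand and coil, which is the usual convention but is not spelled out in this record. The function name is illustrative.

```python
from collections import Counter


def state_percentages(prediction: str) -> dict:
    """Percentage of H, E and C states in a secondary-structure string."""
    counts = Counter("".join(prediction.split()).upper())
    total = sum(counts[state] for state in "HEC")
    if total == 0:
        raise ValueError("no H/E/C states found")
    return {state: round(100 * counts[state] / total, 1) for state in "HEC"}


# Usage: pass the concatenated prediction lines (the H/E/C rows only).
print(state_percentages("CCCHHHHHHE"))  # {'H': 60.0, 'E': 10.0, 'C': 30.0}
```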

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References (PubMed IDs): 7686882; 9278503