Definition Escherichia coli UTI89 chromosome, complete genome.
Accession NC_007946
Length 5,065,741

Click here to switch to the map view.

The map label for this gene is ygjU [H]

Identifier: 91212518

GI number: 91212518

Start: 3466475

End: 3467719

Strand: Direct

Name: ygjU [H]

Synonym: UTI89_C3527

Alternate gene names: 91212518

Gene position: 3466475-3467719 (Clockwise)

Preceding gene: 91212517

Following gene: 91212522

Centisome position: 68.43

GC content: 54.78

Gene sequence:

>1245_bases
ATGACTACGCAACATTCACCGGGGCTATTCCGGCGTCTGGCTCATGGCAGCCTGGTAAAACAAATCCTGGCCGGCCTTAT
TCTGGGGATTCTTCTGGCATGGATCTCAAAACCCGCGGCGGAAGCTGTTGGTCTGTTAGGTACTTTGTTCGTCGGCGCAC
TGAAAGCCGTTGCCCCCATCCTGGTGTTGATGCTGGTAATGGCATCTATTGCTAACCACCAGCACGGGCAGAAAACCAAT
ATCCGCCCTATTTTGTTCCTCTATCTGCTGGGCACCTTCTCTGCTGCTCTGGCCGCAGTAATCTTCAGTTTTGCCTTCCC
TTCTACCCTGCACTTGTCCAGTAGCGCGGGTGATATTTCGCCGCCGTCAGGCATTGTAGAAGTGATGCGCGGGCTGGTAA
TGAGCATGGTTTCCAACCCCATTGACGCGCTGCTGAAAGGTAACTATATCGGGATCCTGGTGTGGGCAATTGGCCTCGGT
TTCGCACTGCGTCACGGTAACGAAACCACCAAAAACCTGGTCAACGATATGTCGAATGCCGTTACCTTTATGGTGAAACT
GGTGATTCACTTCGCACCGATCGGTATTTTTGGTCTGGTTTCTTCTACCCTGGCAACCACCGGTTTCTCCACGCTGTGGG
GCTACGCGCAACTGCTGGTGGTACTGGTTGGCTGTATGCTGCTGGTGGCGCTGGTGGTTAACCCACTGCTGGTATGGTGG
AAAATTCGTCGTAACCCGTTCCCGCTGGTGCTGCTGTGCCTGCGCGAAAGCGGCGTGTATGCCTTCTTCACCCGCAGCTC
TGCGGCGAACATTCCGGTGAATATGGCGCTGTGTGAAAAGCTGAATCTGGATCGCGATACCTATTCCGTTTCTATTCCGC
TGGGAGCCACCATCAATATGGCGGGCGCAGCAATCACCATTACCGTGTTGACGCTGGCTGCGGTTAATACGCTGGGTATT
CCGGTTGATCTGCCCACAGCGCTGCTGTTGAGCGTAGTGGCTTCTCTGTGTGCCTGTGGCGCATCCGGCGTGGCGGGGGG
GTCTCTGTTGCTGATCCCACTGGCCTGTAATATGTTCGGTATTTCGAACGATATCGCCATGCAGGTGGTTGCTGTCGGCT
TTATTATCGGCGTATTGCAGGACTCCTGTGAAACCGCGCTGAACTCGTCAACTGACGTGCTGTTCACTGCGGCAGCTTGC
CAGGCGGAAGACGATCGTCTGGCAAATAGCGCCCTGCGTAACTAA

Upstream 100 bases:

>100_bases
TTTCCTTATACTCGACCTTGCAAACACTTTGTTACATCCTGAAAGATGCGTCGACAGAACGCACCAGGGATGTGCGACAA
CACAATGAAAGGATCGAAAA

Downstream 100 bases:

>100_bases
TACTTAGCCCCTTTCGTCTACGGCGGAAGGGGTTTTCTCCACTTTAAACGGATCAATTCCCCTTCTCTGCATACGCCAGA
AACGAATGATATTCAGGCCA

Product: serine/threonine transporter SstT

Products: NA

Alternate protein names: Na(+)/serine-threonine symporter [H]

Number of amino acids: Translated: 414; Mature: 413

Protein sequence:

>414_residues
MTTQHSPGLFRRLAHGSLVKQILAGLILGILLAWISKPAAEAVGLLGTLFVGALKAVAPILVLMLVMASIANHQHGQKTN
IRPILFLYLLGTFSAALAAVIFSFAFPSTLHLSSSAGDISPPSGIVEVMRGLVMSMVSNPIDALLKGNYIGILVWAIGLG
FALRHGNETTKNLVNDMSNAVTFMVKLVIHFAPIGIFGLVSSTLATTGFSTLWGYAQLLVVLVGCMLLVALVVNPLLVWW
KIRRNPFPLVLLCLRESGVYAFFTRSSAANIPVNMALCEKLNLDRDTYSVSIPLGATINMAGAAITITVLTLAAVNTLGI
PVDLPTALLLSVVASLCACGASGVAGGSLLLIPLACNMFGISNDIAMQVVAVGFIIGVLQDSCETALNSSTDVLFTAAAC
QAEDDRLANSALRN

Sequences:

>Translated_414_residues
MTTQHSPGLFRRLAHGSLVKQILAGLILGILLAWISKPAAEAVGLLGTLFVGALKAVAPILVLMLVMASIANHQHGQKTN
IRPILFLYLLGTFSAALAAVIFSFAFPSTLHLSSSAGDISPPSGIVEVMRGLVMSMVSNPIDALLKGNYIGILVWAIGLG
FALRHGNETTKNLVNDMSNAVTFMVKLVIHFAPIGIFGLVSSTLATTGFSTLWGYAQLLVVLVGCMLLVALVVNPLLVWW
KIRRNPFPLVLLCLRESGVYAFFTRSSAANIPVNMALCEKLNLDRDTYSVSIPLGATINMAGAAITITVLTLAAVNTLGI
PVDLPTALLLSVVASLCACGASGVAGGSLLLIPLACNMFGISNDIAMQVVAVGFIIGVLQDSCETALNSSTDVLFTAAAC
QAEDDRLANSALRN
>Mature_413_residues
TTQHSPGLFRRLAHGSLVKQILAGLILGILLAWISKPAAEAVGLLGTLFVGALKAVAPILVLMLVMASIANHQHGQKTNI
RPILFLYLLGTFSAALAAVIFSFAFPSTLHLSSSAGDISPPSGIVEVMRGLVMSMVSNPIDALLKGNYIGILVWAIGLGF
ALRHGNETTKNLVNDMSNAVTFMVKLVIHFAPIGIFGLVSSTLATTGFSTLWGYAQLLVVLVGCMLLVALVVNPLLVWWK
IRRNPFPLVLLCLRESGVYAFFTRSSAANIPVNMALCEKLNLDRDTYSVSIPLGATINMAGAAITITVLTLAAVNTLGIP
VDLPTALLLSVVASLCACGASGVAGGSLLLIPLACNMFGISNDIAMQVVAVGFIIGVLQDSCETALNSSTDVLFTAAACQ
AEDDRLANSALRN

Specific function: Involved in the import of serine and threonine into the cell, with the concomitant import of sodium (symport system) [H]

COG id: COG3633

COG function: function code E; Na+/serine symporter

Gene ontology:

Cell location: Cell inner membrane; Multi-pass membrane protein [H]

Metaboloic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the sodium:dicarboxylate (SDF) symporter (TC 2.A.23) family [H]

Homologues:

Organism=Homo sapiens, GI4827012, Length=242, Percent_Identity=26.0330578512397, Blast_Score=79, Evalue=9e-15,
Organism=Homo sapiens, GI21314632, Length=265, Percent_Identity=27.5471698113208, Blast_Score=72, Evalue=7e-13,
Organism=Homo sapiens, GI223468566, Length=285, Percent_Identity=27.3684210526316, Blast_Score=71, Evalue=2e-12,
Organism=Homo sapiens, GI169790839, Length=244, Percent_Identity=23.3606557377049, Blast_Score=71, Evalue=2e-12,
Organism=Homo sapiens, GI5032093, Length=273, Percent_Identity=27.1062271062271, Blast_Score=70, Evalue=5e-12,
Organism=Homo sapiens, GI223468564, Length=266, Percent_Identity=27.8195488721804, Blast_Score=69, Evalue=1e-11,
Organism=Escherichia coli, GI1789473, Length=414, Percent_Identity=98.792270531401, Blast_Score=812, Evalue=0.0,
Organism=Escherichia coli, GI1789947, Length=353, Percent_Identity=24.0793201133144, Blast_Score=82, Evalue=9e-17,
Organism=Escherichia coli, GI1790514, Length=426, Percent_Identity=23.943661971831, Blast_Score=75, Evalue=9e-15,
Organism=Caenorhabditis elegans, GI17537407, Length=424, Percent_Identity=23.8207547169811, Blast_Score=82, Evalue=3e-16,
Organism=Caenorhabditis elegans, GI71996953, Length=271, Percent_Identity=26.1992619926199, Blast_Score=77, Evalue=2e-14,
Organism=Caenorhabditis elegans, GI71983099, Length=275, Percent_Identity=25.0909090909091, Blast_Score=69, Evalue=6e-12,
Organism=Caenorhabditis elegans, GI71983106, Length=275, Percent_Identity=25.0909090909091, Blast_Score=68, Evalue=7e-12,
Organism=Caenorhabditis elegans, GI193206505, Length=267, Percent_Identity=27.7153558052434, Blast_Score=67, Evalue=2e-11,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001991
- InterPro:   IPR023025 [H]

Pfam domain/function: PF00375 SDF [H]

EC number: NA

Molecular weight: Translated: 43440; Mature: 43309

Theoretical pI: Translated: 8.22; Mature: 8.22

Prosite motif: PS00713 NA_DICARBOXYL_SYMP_1

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.9 %Cys     (Translated Protein)
3.1 %Met     (Translated Protein)
5.1 %Cys+Met (Translated Protein)
1.9 %Cys     (Mature Protein)
2.9 %Met     (Mature Protein)
4.8 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MTTQHSPGLFRRLAHGSLVKQILAGLILGILLAWISKPAAEAVGLLGTLFVGALKAVAPI
CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHH
LVLMLVMASIANHQHGQKTNIRPILFLYLLGTFSAALAAVIFSFAFPSTLHLSSSAGDIS
HHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCC
PPSGIVEVMRGLVMSMVSNPIDALLKGNYIGILVWAIGLGFALRHGNETTKNLVNDMSNA
CCHHHHHHHHHHHHHHHCCHHHHHHCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH
VTFMVKLVIHFAPIGIFGLVSSTLATTGFSTLWGYAQLLVVLVGCMLLVALVVNPLLVWW
HHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
KIRRNPFPLVLLCLRESGVYAFFTRSSAANIPVNMALCEKLNLDRDTYSVSIPLGATINM
HHCCCCCCEEEEEECCCCEEEEEECCCCCCCCCHHHHHHHCCCCCCCEEEEECCCCEEEC
AGAAITITVLTLAAVNTLGIPVDLPTALLLSVVASLCACGASGVAGGSLLLIPLACNMFG
CCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEHHHHHHHHC
ISNDIAMQVVAVGFIIGVLQDSCETALNSSTDVLFTAAACQAEDDRLANSALRN
CCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEHHHHCCCHHHHHHHHHCC
>Mature Secondary Structure 
TTQHSPGLFRRLAHGSLVKQILAGLILGILLAWISKPAAEAVGLLGTLFVGALKAVAPI
CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHH
LVLMLVMASIANHQHGQKTNIRPILFLYLLGTFSAALAAVIFSFAFPSTLHLSSSAGDIS
HHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCC
PPSGIVEVMRGLVMSMVSNPIDALLKGNYIGILVWAIGLGFALRHGNETTKNLVNDMSNA
CCHHHHHHHHHHHHHHHCCHHHHHHCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH
VTFMVKLVIHFAPIGIFGLVSSTLATTGFSTLWGYAQLLVVLVGCMLLVALVVNPLLVWW
HHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
KIRRNPFPLVLLCLRESGVYAFFTRSSAANIPVNMALCEKLNLDRDTYSVSIPLGATINM
HHCCCCCCEEEEEECCCCEEEEEECCCCCCCCCHHHHHHHCCCCCCCEEEEECCCCEEEC
AGAAITITVLTLAAVNTLGIPVDLPTALLLSVVASLCACGASGVAGGSLLLIPLACNMFG
CCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEEHHHHHHHHC
ISNDIAMQVVAVGFIIGVLQDSCETALNSSTDVLFTAAACQAEDDRLANSALRN
CCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEHHHHCCCHHHHHHHHHCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796 [H]