Definition Corynebacterium diphtheriae NCTC 13129 chromosome, complete genome.
Accession NC_002935
Length 2,488,635

Click here to switch to the map view.

The map label for this gene is ygfU [H]

Identifier: 38234510

GI number: 38234510

Start: 1995598

End: 1997511

Strand: Direct

Name: ygfU [H]

Synonym: DIP1943

Alternate gene names: 38234510

Gene position: 1995598-1997511 (Clockwise)

Preceding gene: 38234509

Following gene: 38234511

Centisome position: 80.19

GC content: 54.18

Gene sequence:

>1914_bases
TTGTCCGCAGCTCCCGTCCATCCCGTGGACGCAGTTCCCGCAGCGCCCAAGCTCGTAGCTTTGGGATTGCAGCATGTTCT
AGCGTTCTACGCTGGCGCTGTCATCGTCCCTCTTCTCATCGCAGCATCGCTTAATCTCGATACTGCAACTACTATCCACC
TGATCAACGCAGATTTGCTGACATGTGGCATTGCAACATTGATCCAATCGGTAGGCATTGGCAAGCACGTCGGCGTGCGC
TTGCCGATCGTTCAAGGCGTGACCACCACAGCAGTGGCACCGATCATCGCAATTGGTTTGGGTGTTACTGATGGCCAAGG
TGGCGTCGAATCGCTTCCTACCGTCTACGGTGCCGTCATCGTCGCCGGTTTGTTTACTTTCTTTGCCACCCCTATCTTTG
CTAAGTTCCTTCGGTTCTTCCCACCAGTAGTTACCGGATCAGTGCTGTTGGTCATGGGTACGTCCCTTTTGGCGGTGTCA
GCAAATGACTTCATTAACTACGCAGAAGCTCAGCCAGAAACCCGCGATCTTTTCTATGCTTTTGGCACCCTCGCAGTTAT
CATCTTGGCGCAGCGATTCTTCCGAGGATTCTTGGGCACGCTCGCAGTCCTCATCGGCCTTGTTAGCGGCACGGTTGTTG
CGCTGCTTTTGGGCCACGCCAACCTAGATGAAGTAGGCAATGCAGCTGCATTCGGCATTACAACGCCGTTCTACTTTGGC
ATGCCGCAGTTCAACATCACCGCGTGTTTCTCCATGATCATCGTCATGATCATCACCATGGTGGAAACCACCGGCGACGT
ATTTGCTACCGGCGAGATCGTGAAAAAGCGTATCCGCAAGTCTGATGTGCAGCGAGCCCTGCGCGCCGATGGCCTTTCCA
CCTTCTTAGGTGGTGTGATGAACTCCTTCCCGTACACGTGCTTTGCTCAAAACGTCGGCCTCGTTCGCATCACTGGTGTG
AAGTCCCGCTGGGTTGCAGCCTCTGCAGCTGGCTTCATGATTATCCTCGGCCTATTGCCTAAAGCTGGTGCCGTTGTTGC
GTCGATTCCTTCCCCAGTATTGGGAGCTGCGTCCCTCGCACTGTTCGCCAATGTTGCTTGGGTGGGTTTGCAGACCATCG
CCAAAACCGACCTAGCAGATAGCCGTAACGCAGCGATCGTTACCACTGCTTTGGGATTAGCAATGCTGGTTACTTTCAAG
CCATCGGTTGCAGAAGCATTCCCTGAATGGGCACGTATCTTCGTTTCTTCTGGCATGTCCATCGGTGCTATCACCGCTAT
CTTGCTGAACTTGTTGTTCTTCCACGTAGGAAAGCAATCGGGTTCTGCGGTTGCGCGCAACGTTTCCGGCGACGGTATAA
CGCTCGATGAGATCAACGCATTGGACCGCGATGAGTTCGTAGCAACTTTGCGCCCACTGTTCAACAAAGAGACGTGGCCA
CTGGAGCAAGCATGGGAATCCCGTCCATTCACAGACGTCCATGAGTTGCGCGAAGCTATCCAAGTGGCAGTACTTACCGC
ACCTAGCGAACAACGCGAAGCACTCATCCACGACTACCCAGACACCTCAGCAGTTCTACTGGCAACCGATGCCGAATCTC
GAGCAATCAGCGCTGACCGTGGATCGTTGGGAATCAATGAACTCGACGATGTGGAAACGCAGCAGGTTCTGGAGCTCTCC
AAGGAATACCGCGAGCGCTTCGGCATGCCGTTCGTGTACTTCCTCGACACCAACGACACCGTCGCCTCCATCGTCAACGC
TGGTCTACGTCGTCTAGCAAACTCCGATGTGCAAGAGCACCGAGTAGCTCTCACCGAAATCGTCGAGATCGCCAACGACC
GCTTCGATATTCTTCTCGCCGATGCCAACCCAGTGCGATCAGCATGGGATCGCAAATTTACTGAAGTCGAGTAG

Upstream 100 bases:

>100_bases
TTCCGCAGCACTTCAGATTCTCTTTCTCCTTCAGGGATTGAGTAGAAGATTGCTAAGGTAATCGAGTCTGCAACTGGTTA
CACATTCAAGGAGTCCCCTC

Downstream 100 bases:

>100_bases
AAACAACAACCCCCACCTTGCTTTCGCAACGGCTGAAACCAGCACGAGCAGAAAGCAAGGTGGGGGTGTTTGCTGGCTCT
TCCGGTTTTAGAGTGCATTG

Product: xanthine/uracil permeases family protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 637; Mature: 636

Protein sequence:

>637_residues
MSAAPVHPVDAVPAAPKLVALGLQHVLAFYAGAVIVPLLIAASLNLDTATTIHLINADLLTCGIATLIQSVGIGKHVGVR
LPIVQGVTTTAVAPIIAIGLGVTDGQGGVESLPTVYGAVIVAGLFTFFATPIFAKFLRFFPPVVTGSVLLVMGTSLLAVS
ANDFINYAEAQPETRDLFYAFGTLAVIILAQRFFRGFLGTLAVLIGLVSGTVVALLLGHANLDEVGNAAAFGITTPFYFG
MPQFNITACFSMIIVMIITMVETTGDVFATGEIVKKRIRKSDVQRALRADGLSTFLGGVMNSFPYTCFAQNVGLVRITGV
KSRWVAASAAGFMIILGLLPKAGAVVASIPSPVLGAASLALFANVAWVGLQTIAKTDLADSRNAAIVTTALGLAMLVTFK
PSVAEAFPEWARIFVSSGMSIGAITAILLNLLFFHVGKQSGSAVARNVSGDGITLDEINALDRDEFVATLRPLFNKETWP
LEQAWESRPFTDVHELREAIQVAVLTAPSEQREALIHDYPDTSAVLLATDAESRAISADRGSLGINELDDVETQQVLELS
KEYRERFGMPFVYFLDTNDTVASIVNAGLRRLANSDVQEHRVALTEIVEIANDRFDILLADANPVRSAWDRKFTEVE

Sequences:

>Translated_637_residues
MSAAPVHPVDAVPAAPKLVALGLQHVLAFYAGAVIVPLLIAASLNLDTATTIHLINADLLTCGIATLIQSVGIGKHVGVR
LPIVQGVTTTAVAPIIAIGLGVTDGQGGVESLPTVYGAVIVAGLFTFFATPIFAKFLRFFPPVVTGSVLLVMGTSLLAVS
ANDFINYAEAQPETRDLFYAFGTLAVIILAQRFFRGFLGTLAVLIGLVSGTVVALLLGHANLDEVGNAAAFGITTPFYFG
MPQFNITACFSMIIVMIITMVETTGDVFATGEIVKKRIRKSDVQRALRADGLSTFLGGVMNSFPYTCFAQNVGLVRITGV
KSRWVAASAAGFMIILGLLPKAGAVVASIPSPVLGAASLALFANVAWVGLQTIAKTDLADSRNAAIVTTALGLAMLVTFK
PSVAEAFPEWARIFVSSGMSIGAITAILLNLLFFHVGKQSGSAVARNVSGDGITLDEINALDRDEFVATLRPLFNKETWP
LEQAWESRPFTDVHELREAIQVAVLTAPSEQREALIHDYPDTSAVLLATDAESRAISADRGSLGINELDDVETQQVLELS
KEYRERFGMPFVYFLDTNDTVASIVNAGLRRLANSDVQEHRVALTEIVEIANDRFDILLADANPVRSAWDRKFTEVE
>Mature_636_residues
SAAPVHPVDAVPAAPKLVALGLQHVLAFYAGAVIVPLLIAASLNLDTATTIHLINADLLTCGIATLIQSVGIGKHVGVRL
PIVQGVTTTAVAPIIAIGLGVTDGQGGVESLPTVYGAVIVAGLFTFFATPIFAKFLRFFPPVVTGSVLLVMGTSLLAVSA
NDFINYAEAQPETRDLFYAFGTLAVIILAQRFFRGFLGTLAVLIGLVSGTVVALLLGHANLDEVGNAAAFGITTPFYFGM
PQFNITACFSMIIVMIITMVETTGDVFATGEIVKKRIRKSDVQRALRADGLSTFLGGVMNSFPYTCFAQNVGLVRITGVK
SRWVAASAAGFMIILGLLPKAGAVVASIPSPVLGAASLALFANVAWVGLQTIAKTDLADSRNAAIVTTALGLAMLVTFKP
SVAEAFPEWARIFVSSGMSIGAITAILLNLLFFHVGKQSGSAVARNVSGDGITLDEINALDRDEFVATLRPLFNKETWPL
EQAWESRPFTDVHELREAIQVAVLTAPSEQREALIHDYPDTSAVLLATDAESRAISADRGSLGINELDDVETQQVLELSK
EYRERFGMPFVYFLDTNDTVASIVNAGLRRLANSDVQEHRVALTEIVEIANDRFDILLADANPVRSAWDRKFTEVE

Specific function: Unknown

COG id: COG2233

COG function: function code F; Xanthine/uracil permeases

Gene ontology:

Cell location: Cell inner membrane; Multi-pass membrane protein [H]

Metaboloic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the xanthine/uracil permease family. Nucleobase:cation symporter-2 (NCS2) (TC 2.A.40) subfamily [H]

Homologues:

Organism=Homo sapiens, GI44680148, Length=158, Percent_Identity=31.6455696202532, Blast_Score=77, Evalue=6e-14,
Organism=Homo sapiens, GI40316845, Length=158, Percent_Identity=31.6455696202532, Blast_Score=77, Evalue=6e-14,
Organism=Homo sapiens, GI44680143, Length=179, Percent_Identity=27.3743016759777, Blast_Score=67, Evalue=6e-11,
Organism=Homo sapiens, GI44680145, Length=179, Percent_Identity=27.3743016759777, Blast_Score=67, Evalue=6e-11,
Organism=Escherichia coli, GI87082181, Length=471, Percent_Identity=37.3673036093418, Blast_Score=301, Evalue=7e-83,
Organism=Escherichia coli, GI1790087, Length=439, Percent_Identity=30.0683371298405, Blast_Score=183, Evalue=3e-47,
Organism=Escherichia coli, GI87082178, Length=425, Percent_Identity=29.4117647058824, Blast_Score=144, Evalue=1e-35,
Organism=Escherichia coli, GI1788843, Length=393, Percent_Identity=28.498727735369, Blast_Score=126, Evalue=3e-30,
Organism=Escherichia coli, GI87081818, Length=403, Percent_Identity=28.287841191067, Blast_Score=104, Evalue=2e-23,
Organism=Caenorhabditis elegans, GI17558856, Length=465, Percent_Identity=25.5913978494624, Blast_Score=100, Evalue=2e-21,
Organism=Caenorhabditis elegans, GI17541904, Length=447, Percent_Identity=23.2662192393736, Blast_Score=87, Evalue=2e-17,
Organism=Caenorhabditis elegans, GI17542262, Length=451, Percent_Identity=24.390243902439, Blast_Score=80, Evalue=2e-15,
Organism=Drosophila melanogaster, GI21356175, Length=427, Percent_Identity=23.4192037470726, Blast_Score=69, Evalue=7e-12,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR017588
- InterPro:   IPR006042
- InterPro:   IPR006043 [H]

Pfam domain/function: PF00860 Xan_ur_permease [H]

EC number: NA

Molecular weight: Translated: 67849; Mature: 67718

Theoretical pI: Translated: 4.93; Mature: 4.93

Prosite motif: PS01116 XANTH_URACIL_PERMASE

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.5 %Cys     (Translated Protein)
1.7 %Met     (Translated Protein)
2.2 %Cys+Met (Translated Protein)
0.5 %Cys     (Mature Protein)
1.6 %Met     (Mature Protein)
2.0 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MSAAPVHPVDAVPAAPKLVALGLQHVLAFYAGAVIVPLLIAASLNLDTATTIHLINADLL
CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECHHHH
TCGIATLIQSVGIGKHVGVRLPIVQGVTTTAVAPIIAIGLGVTDGQGGVESLPTVYGAVI
HHHHHHHHHHCCCCCCCCEEEEHHCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHH
VAGLFTFFATPIFAKFLRFFPPVVTGSVLLVMGTSLLAVSANDFINYAEAQPETRDLFYA
HHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCHHEEEEHHHHHHHHHCCCCHHHHHHH
FGTLAVIILAQRFFRGFLGTLAVLIGLVSGTVVALLLGHANLDEVGNAAAFGITTPFYFG
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHCCCEEECCCCCHHCC
MPQFNITACFSMIIVMIITMVETTGDVFATGEIVKKRIRKSDVQRALRADGLSTFLGGVM
CCCCHHHHHHHHHHHHHHHHHHCCCCCEECHHHHHHHHHHHHHHHHHHHCCHHHHHHHHH
NSFPYTCFAQNVGLVRITGVKSRWVAASAAGFMIILGLLPKAGAVVASIPSPVLGAASLA
HCCCCEEHHCCCCEEEEECCCCHHHHHHHHHHHHHHHHCCCCCCEEECCCCCHHHHHHHH
LFANVAWVGLQTIAKTDLADSRNAAIVTTALGLAMLVTFKPSVAEAFPEWARIFVSSGMS
HHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCC
IGAITAILLNLLFFHVGKQSGSAVARNVSGDGITLDEINALDRDEFVATLRPLFNKETWP
HHHHHHHHHHHHHHHHCCCCCCHHEECCCCCCEEHHHHCCCCHHHHHHHHHHHHCCCCCC
LEQAWESRPFTDVHELREAIQVAVLTAPSEQREALIHDYPDTSAVLLATDAESRAISADR
HHHHHCCCCCHHHHHHHHHHHHHHEECCCHHHHHHHCCCCCCCEEEEEECCCCCCCCCCC
GSLGINELDDVETQQVLELSKEYRERFGMPFVYFLDTNDTVASIVNAGLRRLANSDVQEH
CCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHCCCHHHH
RVALTEIVEIANDRFDILLADANPVRSAWDRKFTEVE
HHHHHHHHHHHCCCEEEEEECCCHHHHHHCCCCCCCC
>Mature Secondary Structure 
SAAPVHPVDAVPAAPKLVALGLQHVLAFYAGAVIVPLLIAASLNLDTATTIHLINADLL
CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECHHHH
TCGIATLIQSVGIGKHVGVRLPIVQGVTTTAVAPIIAIGLGVTDGQGGVESLPTVYGAVI
HHHHHHHHHHCCCCCCCCEEEEHHCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHH
VAGLFTFFATPIFAKFLRFFPPVVTGSVLLVMGTSLLAVSANDFINYAEAQPETRDLFYA
HHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCHHEEEEHHHHHHHHHCCCCHHHHHHH
FGTLAVIILAQRFFRGFLGTLAVLIGLVSGTVVALLLGHANLDEVGNAAAFGITTPFYFG
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHCCCEEECCCCCHHCC
MPQFNITACFSMIIVMIITMVETTGDVFATGEIVKKRIRKSDVQRALRADGLSTFLGGVM
CCCCHHHHHHHHHHHHHHHHHHCCCCCEECHHHHHHHHHHHHHHHHHHHCCHHHHHHHHH
NSFPYTCFAQNVGLVRITGVKSRWVAASAAGFMIILGLLPKAGAVVASIPSPVLGAASLA
HCCCCEEHHCCCCEEEEECCCCHHHHHHHHHHHHHHHHCCCCCCEEECCCCCHHHHHHHH
LFANVAWVGLQTIAKTDLADSRNAAIVTTALGLAMLVTFKPSVAEAFPEWARIFVSSGMS
HHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCC
IGAITAILLNLLFFHVGKQSGSAVARNVSGDGITLDEINALDRDEFVATLRPLFNKETWP
HHHHHHHHHHHHHHHHCCCCCCHHEECCCCCCEEHHHHCCCCHHHHHHHHHHHHCCCCCC
LEQAWESRPFTDVHELREAIQVAVLTAPSEQREALIHDYPDTSAVLLATDAESRAISADR
HHHHHCCCCCHHHHHHHHHHHHHHEECCCHHHHHHHCCCCCCCEEEEEECCCCCCCCCCC
GSLGINELDDVETQQVLELSKEYRERFGMPFVYFLDTNDTVASIVNAGLRRLANSDVQEH
CCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHCCCHHHH
RVALTEIVEIANDRFDILLADANPVRSAWDRKFTEVE
HHHHHHHHHHHCCCEEEEEECCCHHHHHHCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 9278503 [H]