| Definition | Corynebacterium diphtheriae NCTC 13129 chromosome, complete genome. |
|---|---|
| Accession | NC_002935 |
| Length | 2,488,635 |
The map label for this gene is ygfU [H]
Identifier: 38234510
GI number: 38234510
Start: 1995598
End: 1997511
Strand: Direct
Name: ygfU [H]
Synonym: DIP1943
Alternate gene names: 38234510
Gene position: 1995598-1997511 (Clockwise)
Preceding gene: 38234509
Following gene: 38234511
Centisome position: 80.19
GC content: 54.18
Gene sequence:
>1914_bases TTGTCCGCAGCTCCCGTCCATCCCGTGGACGCAGTTCCCGCAGCGCCCAAGCTCGTAGCTTTGGGATTGCAGCATGTTCT AGCGTTCTACGCTGGCGCTGTCATCGTCCCTCTTCTCATCGCAGCATCGCTTAATCTCGATACTGCAACTACTATCCACC TGATCAACGCAGATTTGCTGACATGTGGCATTGCAACATTGATCCAATCGGTAGGCATTGGCAAGCACGTCGGCGTGCGC TTGCCGATCGTTCAAGGCGTGACCACCACAGCAGTGGCACCGATCATCGCAATTGGTTTGGGTGTTACTGATGGCCAAGG TGGCGTCGAATCGCTTCCTACCGTCTACGGTGCCGTCATCGTCGCCGGTTTGTTTACTTTCTTTGCCACCCCTATCTTTG CTAAGTTCCTTCGGTTCTTCCCACCAGTAGTTACCGGATCAGTGCTGTTGGTCATGGGTACGTCCCTTTTGGCGGTGTCA GCAAATGACTTCATTAACTACGCAGAAGCTCAGCCAGAAACCCGCGATCTTTTCTATGCTTTTGGCACCCTCGCAGTTAT CATCTTGGCGCAGCGATTCTTCCGAGGATTCTTGGGCACGCTCGCAGTCCTCATCGGCCTTGTTAGCGGCACGGTTGTTG CGCTGCTTTTGGGCCACGCCAACCTAGATGAAGTAGGCAATGCAGCTGCATTCGGCATTACAACGCCGTTCTACTTTGGC ATGCCGCAGTTCAACATCACCGCGTGTTTCTCCATGATCATCGTCATGATCATCACCATGGTGGAAACCACCGGCGACGT ATTTGCTACCGGCGAGATCGTGAAAAAGCGTATCCGCAAGTCTGATGTGCAGCGAGCCCTGCGCGCCGATGGCCTTTCCA CCTTCTTAGGTGGTGTGATGAACTCCTTCCCGTACACGTGCTTTGCTCAAAACGTCGGCCTCGTTCGCATCACTGGTGTG AAGTCCCGCTGGGTTGCAGCCTCTGCAGCTGGCTTCATGATTATCCTCGGCCTATTGCCTAAAGCTGGTGCCGTTGTTGC GTCGATTCCTTCCCCAGTATTGGGAGCTGCGTCCCTCGCACTGTTCGCCAATGTTGCTTGGGTGGGTTTGCAGACCATCG CCAAAACCGACCTAGCAGATAGCCGTAACGCAGCGATCGTTACCACTGCTTTGGGATTAGCAATGCTGGTTACTTTCAAG CCATCGGTTGCAGAAGCATTCCCTGAATGGGCACGTATCTTCGTTTCTTCTGGCATGTCCATCGGTGCTATCACCGCTAT CTTGCTGAACTTGTTGTTCTTCCACGTAGGAAAGCAATCGGGTTCTGCGGTTGCGCGCAACGTTTCCGGCGACGGTATAA CGCTCGATGAGATCAACGCATTGGACCGCGATGAGTTCGTAGCAACTTTGCGCCCACTGTTCAACAAAGAGACGTGGCCA CTGGAGCAAGCATGGGAATCCCGTCCATTCACAGACGTCCATGAGTTGCGCGAAGCTATCCAAGTGGCAGTACTTACCGC ACCTAGCGAACAACGCGAAGCACTCATCCACGACTACCCAGACACCTCAGCAGTTCTACTGGCAACCGATGCCGAATCTC GAGCAATCAGCGCTGACCGTGGATCGTTGGGAATCAATGAACTCGACGATGTGGAAACGCAGCAGGTTCTGGAGCTCTCC AAGGAATACCGCGAGCGCTTCGGCATGCCGTTCGTGTACTTCCTCGACACCAACGACACCGTCGCCTCCATCGTCAACGC TGGTCTACGTCGTCTAGCAAACTCCGATGTGCAAGAGCACCGAGTAGCTCTCACCGAAATCGTCGAGATCGCCAACGACC GCTTCGATATTCTTCTCGCCGATGCCAACCCAGTGCGATCAGCATGGGATCGCAAATTTACTGAAGTCGAGTAG
Upstream 100 bases:
>100_bases TTCCGCAGCACTTCAGATTCTCTTTCTCCTTCAGGGATTGAGTAGAAGATTGCTAAGGTAATCGAGTCTGCAACTGGTTA CACATTCAAGGAGTCCCCTC
Downstream 100 bases:
>100_bases AAACAACAACCCCCACCTTGCTTTCGCAACGGCTGAAACCAGCACGAGCAGAAAGCAAGGTGGGGGTGTTTGCTGGCTCT TCCGGTTTTAGAGTGCATTG
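The coordinate-derived fields above (gene length, translated length, centisome position, GC content) can be cross-checked with a short script. This is a minimal sketch, not the database's own method. The function names are hypothetical. It assumes 1-based inclusive coordinates and that the centisome position is the gene start as a percentage of the chromosome length, which is consistent with the values in this record.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bases: int) -> int:
    """Residues encoded by a coding sequence that includes its stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return n_bases // 3 - 1  # the final codon is the stop codon


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the chromosome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace in the sequence is ignored."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return round(100.0 * gc / len(seq), 2)


if __name__ == "__main__":
    # Values taken from this record.
    n = gene_length(1995598, 1997511)
    print(n)                             # 1914
    print(protein_length(n))             # 637
    print(centisome(1995598, 2488635))   # 80.19
```

Passing the full gene sequence from the record to `gc_content` should give a value comparable to the listed GC content, provided the FASTA header token is removed first.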
Product: xanthine/uracil permeases family protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 637; Mature: 636
Protein sequence:
>637_residues MSAAPVHPVDAVPAAPKLVALGLQHVLAFYAGAVIVPLLIAASLNLDTATTIHLINADLLTCGIATLIQSVGIGKHVGVR LPIVQGVTTTAVAPIIAIGLGVTDGQGGVESLPTVYGAVIVAGLFTFFATPIFAKFLRFFPPVVTGSVLLVMGTSLLAVS ANDFINYAEAQPETRDLFYAFGTLAVIILAQRFFRGFLGTLAVLIGLVSGTVVALLLGHANLDEVGNAAAFGITTPFYFG MPQFNITACFSMIIVMIITMVETTGDVFATGEIVKKRIRKSDVQRALRADGLSTFLGGVMNSFPYTCFAQNVGLVRITGV KSRWVAASAAGFMIILGLLPKAGAVVASIPSPVLGAASLALFANVAWVGLQTIAKTDLADSRNAAIVTTALGLAMLVTFK PSVAEAFPEWARIFVSSGMSIGAITAILLNLLFFHVGKQSGSAVARNVSGDGITLDEINALDRDEFVATLRPLFNKETWP LEQAWESRPFTDVHELREAIQVAVLTAPSEQREALIHDYPDTSAVLLATDAESRAISADRGSLGINELDDVETQQVLELS KEYRERFGMPFVYFLDTNDTVASIVNAGLRRLANSDVQEHRVALTEIVEIANDRFDILLADANPVRSAWDRKFTEVE
Sequences:
>Translated_637_residues MSAAPVHPVDAVPAAPKLVALGLQHVLAFYAGAVIVPLLIAASLNLDTATTIHLINADLLTCGIATLIQSVGIGKHVGVR LPIVQGVTTTAVAPIIAIGLGVTDGQGGVESLPTVYGAVIVAGLFTFFATPIFAKFLRFFPPVVTGSVLLVMGTSLLAVS ANDFINYAEAQPETRDLFYAFGTLAVIILAQRFFRGFLGTLAVLIGLVSGTVVALLLGHANLDEVGNAAAFGITTPFYFG MPQFNITACFSMIIVMIITMVETTGDVFATGEIVKKRIRKSDVQRALRADGLSTFLGGVMNSFPYTCFAQNVGLVRITGV KSRWVAASAAGFMIILGLLPKAGAVVASIPSPVLGAASLALFANVAWVGLQTIAKTDLADSRNAAIVTTALGLAMLVTFK PSVAEAFPEWARIFVSSGMSIGAITAILLNLLFFHVGKQSGSAVARNVSGDGITLDEINALDRDEFVATLRPLFNKETWP LEQAWESRPFTDVHELREAIQVAVLTAPSEQREALIHDYPDTSAVLLATDAESRAISADRGSLGINELDDVETQQVLELS KEYRERFGMPFVYFLDTNDTVASIVNAGLRRLANSDVQEHRVALTEIVEIANDRFDILLADANPVRSAWDRKFTEVE >Mature_636_residues SAAPVHPVDAVPAAPKLVALGLQHVLAFYAGAVIVPLLIAASLNLDTATTIHLINADLLTCGIATLIQSVGIGKHVGVRL PIVQGVTTTAVAPIIAIGLGVTDGQGGVESLPTVYGAVIVAGLFTFFATPIFAKFLRFFPPVVTGSVLLVMGTSLLAVSA NDFINYAEAQPETRDLFYAFGTLAVIILAQRFFRGFLGTLAVLIGLVSGTVVALLLGHANLDEVGNAAAFGITTPFYFGM PQFNITACFSMIIVMIITMVETTGDVFATGEIVKKRIRKSDVQRALRADGLSTFLGGVMNSFPYTCFAQNVGLVRITGVK SRWVAASAAGFMIILGLLPKAGAVVASIPSPVLGAASLALFANVAWVGLQTIAKTDLADSRNAAIVTTALGLAMLVTFKP SVAEAFPEWARIFVSSGMSIGAITAILLNLLFFHVGKQSGSAVARNVSGDGITLDEINALDRDEFVATLRPLFNKETWPL EQAWESRPFTDVHELREAIQVAVLTAPSEQREALIHDYPDTSAVLLATDAESRAISADRGSLGINELDDVETQQVLELSK EYRERFGMPFVYFLDTNDTVASIVNAGLRRLANSDVQEHRVALTEIVEIANDRFDILLADANPVRSAWDRKFTEVE
Specific function: Unknown
COG id: COG2233
COG function: function code F; Xanthine/uracil permeases
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein [H]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the xanthine/uracil permease family. Nucleobase:cation symporter-2 (NCS2) (TC 2.A.40) subfamily [H]
Homologues:
Organism=Homo sapiens, GI44680148, Length=158, Percent_Identity=31.6455696202532, Blast_Score=77, Evalue=6e-14
Organism=Homo sapiens, GI40316845, Length=158, Percent_Identity=31.6455696202532, Blast_Score=77, Evalue=6e-14
Organism=Homo sapiens, GI44680143, Length=179, Percent_Identity=27.3743016759777, Blast_Score=67, Evalue=6e-11
Organism=Homo sapiens, GI44680145, Length=179, Percent_Identity=27.3743016759777, Blast_Score=67, Evalue=6e-11
Organism=Escherichia coli, GI87082181, Length=471, Percent_Identity=37.3673036093418, Blast_Score=301, Evalue=7e-83
Organism=Escherichia coli, GI1790087, Length=439, Percent_Identity=30.0683371298405, Blast_Score=183, Evalue=3e-47
Organism=Escherichia coli, GI87082178, Length=425, Percent_Identity=29.4117647058824, Blast_Score=144, Evalue=1e-35
Organism=Escherichia coli, GI1788843, Length=393, Percent_Identity=28.498727735369, Blast_Score=126, Evalue=3e-30
Organism=Escherichia coli, GI87081818, Length=403, Percent_Identity=28.287841191067, Blast_Score=104, Evalue=2e-23
Organism=Caenorhabditis elegans, GI17558856, Length=465, Percent_Identity=25.5913978494624, Blast_Score=100, Evalue=2e-21
Organism=Caenorhabditis elegans, GI17541904, Length=447, Percent_Identity=23.2662192393736, Blast_Score=87, Evalue=2e-17
Organism=Caenorhabditis elegans, GI17542262, Length=451, Percent_Identity=24.390243902439, Blast_Score=80, Evalue=2e-15
Organism=Drosophila melanogaster, GI21356175, Length=427, Percent_Identity=23.4192037470726, Blast_Score=69, Evalue=7e-12
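The homologue entries above share a fixed field order (Organism, GI, Length, Percent_Identity, Blast_Score, Evalue), so they can be parsed into records for sorting or filtering. The sketch below is illustrative only. The `Hit` type and `parse_hits` function are hypothetical names, and the pattern assumes the field order seen in this record.

```python
import re
from typing import List, NamedTuple


class Hit(NamedTuple):
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


# Field order is assumed from the entries in this record.
HIT_RE = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<pid>[\d.]+),\s*Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_hits(text: str) -> List[Hit]:
    """Extract every homologue entry found in the text, in order of appearance."""
    return [
        Hit(
            organism=m["organism"].strip(),
            gi=m["gi"],
            length=int(m["length"]),
            percent_identity=float(m["pid"]),
            blast_score=int(m["score"]),
            evalue=float(m["evalue"]),
        )
        for m in HIT_RE.finditer(text)
    ]


if __name__ == "__main__":
    sample = (
        "Organism=Homo sapiens, GI44680148, Length=158, "
        "Percent_Identity=31.6455696202532, Blast_Score=77, Evalue=6e-14, "
        "Organism=Escherichia coli, GI87082181, Length=471, "
        "Percent_Identity=37.3673036093418, Blast_Score=301, Evalue=7e-83,"
    )
    best = max(parse_hits(sample), key=lambda h: h.blast_score)
    print(best.organism, best.gi)  # Escherichia coli 87082181
```

The pattern works whether the entries are on one line or one per line, since it matches each entry independently.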
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR017588
- InterPro: IPR006042
- InterPro: IPR006043 [H]
Pfam domain/function: PF00860 Xan_ur_permease [H]
EC number: NA
Molecular weight: Translated: 67849; Mature: 67718
Theoretical pI: Translated: 4.93; Mature: 4.93
Prosite motif: PS01116 XANTH_URACIL_PERMASE
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.5 %Cys (Translated Protein)
1.7 %Met (Translated Protein)
2.2 %Cys+Met (Translated Protein)
0.5 %Cys (Mature Protein)
1.6 %Met (Mature Protein)
2.0 %Cys+Met (Mature Protein)
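The Cys/Met percentages above are simple residue frequencies, and the mature protein in this record differs from the translated one only by the missing initiator Met. A minimal sketch of that calculation follows. The function names are hypothetical, and rounding to one decimal place is an assumption based on the values shown.

```python
from typing import Dict


def mature(seq: str) -> str:
    """Drop the initiator Met, as the mature sequence in this record does."""
    return seq[1:] if seq.startswith("M") else seq


def cys_met_percent(seq: str) -> Dict[str, float]:
    """Percentage of Cys, Met, and Cys+Met residues, rounded to one decimal."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    n_cys = seq.count("C")
    n_met = seq.count("M")
    total = len(seq)
    return {
        "Cys": round(100.0 * n_cys / total, 1),
        "Met": round(100.0 * n_met / total, 1),
        # Computed from raw counts, not by adding the rounded values.
        "Cys+Met": round(100.0 * (n_cys + n_met) / total, 1),
    }


if __name__ == "__main__":
    toy = "MACM"  # toy sequence, not from the record
    print(cys_met_percent(toy))
    print(cys_met_percent(mature(toy)))
```

Because the combined figure is computed from raw counts, it need not equal the sum of the two rounded percentages. This may be why the mature protein lists 2.0 rather than 2.1.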
Secondary structure:
>Translated Secondary Structure
MSAAPVHPVDAVPAAPKLVALGLQHVLAFYAGAVIVPLLIAASLNLDTATTIHLINADLL
CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECHHHH
TCGIATLIQSVGIGKHVGVRLPIVQGVTTTAVAPIIAIGLGVTDGQGGVESLPTVYGAVI
HHHHHHHHHHCCCCCCCCEEEEHHCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHH
VAGLFTFFATPIFAKFLRFFPPVVTGSVLLVMGTSLLAVSANDFINYAEAQPETRDLFYA
HHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCHHEEEEHHHHHHHHHCCCCHHHHHHH
FGTLAVIILAQRFFRGFLGTLAVLIGLVSGTVVALLLGHANLDEVGNAAAFGITTPFYFG
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHCCCEEECCCCCHHCC
MPQFNITACFSMIIVMIITMVETTGDVFATGEIVKKRIRKSDVQRALRADGLSTFLGGVM
CCCCHHHHHHHHHHHHHHHHHHCCCCCEECHHHHHHHHHHHHHHHHHHHCCHHHHHHHHH
NSFPYTCFAQNVGLVRITGVKSRWVAASAAGFMIILGLLPKAGAVVASIPSPVLGAASLA
HCCCCEEHHCCCCEEEEECCCCHHHHHHHHHHHHHHHHCCCCCCEEECCCCCHHHHHHHH
LFANVAWVGLQTIAKTDLADSRNAAIVTTALGLAMLVTFKPSVAEAFPEWARIFVSSGMS
HHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCC
IGAITAILLNLLFFHVGKQSGSAVARNVSGDGITLDEINALDRDEFVATLRPLFNKETWP
HHHHHHHHHHHHHHHHCCCCCCHHEECCCCCCEEHHHHCCCCHHHHHHHHHHHHCCCCCC
LEQAWESRPFTDVHELREAIQVAVLTAPSEQREALIHDYPDTSAVLLATDAESRAISADR
HHHHHCCCCCHHHHHHHHHHHHHHEECCCHHHHHHHCCCCCCCEEEEEECCCCCCCCCCC
GSLGINELDDVETQQVLELSKEYRERFGMPFVYFLDTNDTVASIVNAGLRRLANSDVQEH
CCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHCCCHHHH
RVALTEIVEIANDRFDILLADANPVRSAWDRKFTEVE
HHHHHHHHHHHCCCEEEEEECCCHHHHHHCCCCCCCC
>Mature Secondary Structure
SAAPVHPVDAVPAAPKLVALGLQHVLAFYAGAVIVPLLIAASLNLDTATTIHLINADLL
CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECHHHH
TCGIATLIQSVGIGKHVGVRLPIVQGVTTTAVAPIIAIGLGVTDGQGGVESLPTVYGAVI
HHHHHHHHHHCCCCCCCCEEEEHHCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHH
VAGLFTFFATPIFAKFLRFFPPVVTGSVLLVMGTSLLAVSANDFINYAEAQPETRDLFYA
HHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCHHEEEEHHHHHHHHHCCCCHHHHHHH
FGTLAVIILAQRFFRGFLGTLAVLIGLVSGTVVALLLGHANLDEVGNAAAFGITTPFYFG
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHCCCEEECCCCCHHCC
MPQFNITACFSMIIVMIITMVETTGDVFATGEIVKKRIRKSDVQRALRADGLSTFLGGVM
CCCCHHHHHHHHHHHHHHHHHHCCCCCEECHHHHHHHHHHHHHHHHHHHCCHHHHHHHHH
NSFPYTCFAQNVGLVRITGVKSRWVAASAAGFMIILGLLPKAGAVVASIPSPVLGAASLA
HCCCCEEHHCCCCEEEEECCCCHHHHHHHHHHHHHHHHCCCCCCEEECCCCCHHHHHHHH
LFANVAWVGLQTIAKTDLADSRNAAIVTTALGLAMLVTFKPSVAEAFPEWARIFVSSGMS
HHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCC
IGAITAILLNLLFFHVGKQSGSAVARNVSGDGITLDEINALDRDEFVATLRPLFNKETWP
HHHHHHHHHHHHHHHHCCCCCCHHEECCCCCCEEHHHHCCCCHHHHHHHHHHHHCCCCCC
LEQAWESRPFTDVHELREAIQVAVLTAPSEQREALIHDYPDTSAVLLATDAESRAISADR
HHHHHCCCCCHHHHHHHHHHHHHHEECCCHHHHHHHCCCCCCCEEEEEECCCCCCCCCCC
GSLGINELDDVETQQVLELSKEYRERFGMPFVYFLDTNDTVASIVNAGLRRLANSDVQEH
CCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHCCCHHHH
RVALTEIVEIANDRFDILLADANPVRSAWDRKFTEVE
HHHHHHHHHHHCCCEEEEEECCCHHHHHHCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 9278503 [H]