Definition | Bacillus anthracis str. CDC 684, complete genome. |
---|---|
Accession | NC_012581 |
Length | 5,230,115 |
The map label for this gene is pyn1 [H]
Identifier: 227815287
GI number: 227815287
Start: 2458932
End: 2460227
Strand: Reverse
Name: pyn1 [H]
Synonym: BAMEG_2698
Alternate gene names: 227815287
Gene position: 2460227-2458932 (Counterclockwise)
Preceding gene: 227815288
Following gene: 227815286
Centisome position: 47.04
GC content: 36.81
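The positional fields above are related by simple arithmetic, sketched here in Python. The function names are illustrative and not part of the database. The centisome rule is an assumption: taking the gene's 5' coordinate (the higher coordinate for a reverse-strand gene) as a percentage of genome length reproduces the listed 47.04, but the record does not state how the value is defined. `gc_content` is the usual (G+C)/length percentage, shown here on a toy input only.

```python
GENOME_LENGTH = 5_230_115           # "Length" field of NC_012581
START, END = 2_458_932, 2_460_227   # "Start" and "End" fields
STRAND = "Reverse"                  # "Strand" field


def gene_length(start: int, end: int) -> int:
    """Inclusive length of a feature given 1-based coordinates."""
    return end - start + 1


def centisome(start: int, end: int, strand: str, genome_length: int) -> float:
    """Position of the gene's 5' end as a percentage of the genome length.

    Assumption: for a reverse-strand gene the 5' end is the higher coordinate.
    """
    five_prime = end if strand == "Reverse" else start
    return round(100 * five_prime / genome_length, 2)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    return round(100 * gc / len(seq), 2)


length = gene_length(START, END)   # number of bases in the gene
codons = length // 3               # includes the terminal stop codon
residues = codons - 1              # translated amino acids

print(length, codons, residues)
print(centisome(START, END, STRAND, GENOME_LENGTH))
print(gc_content("ATGC"))          # toy input, not the gene sequence
```

The computed length and residue count agree with the 1296-base gene sequence and the 431-residue protein listed in this record.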
Gene sequence:
>1296_bases
ATGGTAGATATTATTGCAAAAAAACGTGACGGTAAAGAATTAACAACTGAAGAAATCAAATTCTTTATTAATGGTTATAC
AGACGGAAGTATTCCTGATTATCAAGTAAGTGCACTTGCAATGGCAATCTTCTTTAAAGATATGACAGATCGTGAACGTG
CAGATTTAACGATGGCAATGGTGGAGTCTGGAGAAACAATCGACTTATCTGCAATTGAAGGAATTAAAGTAGACAAACAT
TCAACTGGTGGTGTTGGTGATACAACAACATTAGTATTAGGACCATTAGTAGCTGCTTTAGATGTACCAGTAGCAAAAAT
GTCTGGTCGTGGTTTAGGACATACAGGCGGAACAATTGATAAATTAGAAGCAGTAGAAGGATTCCACGTTGAAATTACGA
AAGAGCAGTTCATTGATATTGTAAACCGTGACAAAGTAGCTGTTATTGGACAAACAGGAAACTTAACACCTGCAGATAAA
AAGATTTATGCATTACGCGATGTAACAGGAACAGTAAACTCAATTCCTTTAATCGCAAGTTCAATTATGAGTAAAAAAAT
TGCAGCTGGTGCAGATGCAATCGTACTTGATGTAAAAACAGGTGCTGGCGCATTTATGAAAACAGAAGAAGATGCAAAAG
AATTAGCACATGCGATGGTACGTATCGGAAATAATGTAGGACGTCAAACTATGGCTGTTATTTCAGACATGTCACAACCG
CTTGGATTTGCGATTGGTAACGCACTAGAAGTGAAAGAAGCAATTGATACGTTAAAAGGTGAAGGTCCAGAAGATTTAAC
AGAATTAGTACTCGTATTAGGAAGTCAGATGGTTGTACTTGCGAAAAAAGCTAATACATTAGAAGAAGCGCGTGAAATGT
TAATTGAAGTGATGAAGAACGGAAAAGCAACTGAGAAGTTTAAAGAATTCTTAAACAATCAAGGCGGAGATAGCTCAATT
GTAGACAATCCAGAAAAAATGCCACAGGCGAAGTATGTAATTGATGTACCTGCTAAAACTTCAGGTGTTATTTCTAACAT
TGTTGCAGATGAAATCGGTATCGCAGCTATGCTACTTGGTGCTGGTCGTGCAACAAAAGAAGATGAAATTGATTTAGCAG
TAGGATTAATGTTACGTAAAAAAGTGGGCGATGCAGTAAAAGAAGGCGAGCCATTCGTAACGATTTATGCAAATCGCGAA
AATGTAGAAGATGTAAAAGCTAAAATTTATGAGAACATTTCTATCGCTGAAACAGCAGTGGCTCCTAAATTAGTTCATAC
AGTTATTACTGACTAA
Upstream 100 bases:
>100_bases
TTTCGGTGCAACATTAGTAAGTTTCTTATCAGCAACAATCGTAGGCTTATTATTTTAATAGATTCATATCATATAAAAAG
GAATGGTGATTGTAATGAGA
Downstream 100 bases:
>100_bases
TTATAATTACACTTACTTTGAGGGGGAAATAATATGGATAAGAAAAAATATATTGAAGAAGCAAATAAGATGTTGTCGAA
AGCATATATTCCGTATTCAA
Product: pyrimidine-nucleoside phosphorylase
Products: NA
Alternate protein names: PYNP [H]
Number of amino acids: Translated: 431; Mature: 431
Protein sequence:
>431_residues
MVDIIAKKRDGKELTTEEIKFFINGYTDGSIPDYQVSALAMAIFFKDMTDRERADLTMAMVESGETIDLSAIEGIKVDKH
STGGVGDTTTLVLGPLVAALDVPVAKMSGRGLGHTGGTIDKLEAVEGFHVEITKEQFIDIVNRDKVAVIGQTGNLTPADK
KIYALRDVTGTVNSIPLIASSIMSKKIAAGADAIVLDVKTGAGAFMKTEEDAKELAHAMVRIGNNVGRQTMAVISDMSQP
LGFAIGNALEVKEAIDTLKGEGPEDLTELVLVLGSQMVVLAKKANTLEEAREMLIEVMKNGKATEKFKEFLNNQGGDSSI
VDNPEKMPQAKYVIDVPAKTSGVISNIVADEIGIAAMLLGAGRATKEDEIDLAVGLMLRKKVGDAVKEGEPFVTIYANRE
NVEDVKAKIYENISIAETAVAPKLVHTVITD
Sequences:
>Translated_431_residues
MVDIIAKKRDGKELTTEEIKFFINGYTDGSIPDYQVSALAMAIFFKDMTDRERADLTMAMVESGETIDLSAIEGIKVDKH
STGGVGDTTTLVLGPLVAALDVPVAKMSGRGLGHTGGTIDKLEAVEGFHVEITKEQFIDIVNRDKVAVIGQTGNLTPADK
KIYALRDVTGTVNSIPLIASSIMSKKIAAGADAIVLDVKTGAGAFMKTEEDAKELAHAMVRIGNNVGRQTMAVISDMSQP
LGFAIGNALEVKEAIDTLKGEGPEDLTELVLVLGSQMVVLAKKANTLEEAREMLIEVMKNGKATEKFKEFLNNQGGDSSI
VDNPEKMPQAKYVIDVPAKTSGVISNIVADEIGIAAMLLGAGRATKEDEIDLAVGLMLRKKVGDAVKEGEPFVTIYANRE
NVEDVKAKIYENISIAETAVAPKLVHTVITD
>Mature_431_residues
MVDIIAKKRDGKELTTEEIKFFINGYTDGSIPDYQVSALAMAIFFKDMTDRERADLTMAMVESGETIDLSAIEGIKVDKH
STGGVGDTTTLVLGPLVAALDVPVAKMSGRGLGHTGGTIDKLEAVEGFHVEITKEQFIDIVNRDKVAVIGQTGNLTPADK
KIYALRDVTGTVNSIPLIASSIMSKKIAAGADAIVLDVKTGAGAFMKTEEDAKELAHAMVRIGNNVGRQTMAVISDMSQP
LGFAIGNALEVKEAIDTLKGEGPEDLTELVLVLGSQMVVLAKKANTLEEAREMLIEVMKNGKATEKFKEFLNNQGGDSSI
VDNPEKMPQAKYVIDVPAKTSGVISNIVADEIGIAAMLLGAGRATKEDEIDLAVGLMLRKKVGDAVKEGEPFVTIYANRE
NVEDVKAKIYENISIAETAVAPKLVHTVITD
Specific function: The enzymes that catalyze the reversible phosphorolysis of pyrimidine nucleosides are involved in the degradation of these compounds and in their utilization as carbon and energy sources, or in the rescue of pyrimidine bases for nucleotide synthesis. [C]
COG id: COG0213
COG function: function code F; Thymidine phosphorylase
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the thymidine/pyrimidine-nucleoside phosphorylase family [H]
Homologues:
Organism=Homo sapiens, GI166158925, Length=422, Percent_Identity=35.5450236966825, Blast_Score=267, Evalue=1e-71
Organism=Homo sapiens, GI4503445, Length=422, Percent_Identity=35.5450236966825, Blast_Score=267, Evalue=1e-71
Organism=Homo sapiens, GI166158922, Length=422, Percent_Identity=35.5450236966825, Blast_Score=267, Evalue=1e-71
Organism=Escherichia coli, GI1790842, Length=404, Percent_Identity=46.7821782178218, Blast_Score=338, Evalue=4e-94
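The homologue entries follow a fixed `key=value` layout, so they can be read into structured records. The sketch below is one possible parser, not a tool supplied by the database. The regular expression and the `parse_homologues` name are assumptions based only on the field order seen in this record, and the sample string copies two of the entries above.

```python
import re

# Field order assumed from the entries in this record.
RECORD = re.compile(
    r"Organism=(?P<organism>[^,]+), GI(?P<gi>\d+), Length=(?P<length>\d+), "
    r"Percent_Identity=(?P<identity>[\d.]+), Blast_Score=(?P<score>\d+), "
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_homologues(text: str) -> list[dict]:
    """Return one dict per homologue entry found in `text`."""
    hits = []
    for m in RECORD.finditer(text):
        hits.append({
            "organism": m["organism"],
            "gi": m["gi"],                       # kept as a string identifier
            "length": int(m["length"]),
            "identity": float(m["identity"]),    # percent identity
            "score": int(m["score"]),            # BLAST score
            "evalue": float(m["evalue"]),
        })
    return hits


sample = (
    "Organism=Escherichia coli, GI1790842, Length=404, "
    "Percent_Identity=46.7821782178218, Blast_Score=338, Evalue=4e-94, "
    "Organism=Homo sapiens, GI4503445, Length=422, "
    "Percent_Identity=35.5450236966825, Blast_Score=267, Evalue=1e-71,"
)

hits = parse_homologues(sample)
best = max(hits, key=lambda h: h["score"])
print(best["organism"], best["score"])
```

Because the parser scans for entries with `finditer`, it works whether the entries are separated by commas or by line breaks.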
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000312
- InterPro: IPR017459
- InterPro: IPR020072
- InterPro: IPR013102
- InterPro: IPR018090
- InterPro: IPR000053
- InterPro: IPR017872 [H]
Pfam domain/function: PF02885 Glycos_trans_3N; PF00591 Glycos_transf_3; PF07831 PYNP_C [H]
EC number: 2.4.2.2 [H]
Molecular weight: Translated: 46035; Mature: 46035
Theoretical pI: Translated: 4.62; Mature: 4.62
Prosite motif: PS00647 THYMID_PHOSPHORYLASE
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.0 %Cys (Translated Protein)
3.9 %Met (Translated Protein)
3.9 %Cys+Met (Translated Protein)
0.0 %Cys (Mature Protein)
3.9 %Met (Mature Protein)
3.9 %Cys+Met (Mature Protein)
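The Cys/Met figures are residue frequencies expressed as a percentage of protein length. A minimal Python sketch of that calculation follows. The `residue_percent` helper is hypothetical, rounding to one decimal is assumed from the precision shown above, and the demo input is only the first 20 residues of the protein, not the full sequence.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = protein.upper()
    hits = sum(protein.count(r) for r in residues.upper())
    return round(100 * hits / len(protein), 1)


# First 20 residues of the translated protein, used only as a short demo input.
fragment = "MVDIIAKKRDGKELTTEEIK"

cys = residue_percent(fragment, "C")        # cysteine only
met = residue_percent(fragment, "M")        # methionine only
cys_met = residue_percent(fragment, "CM")   # combined

print(f"{cys} %Cys  {met} %Met  {cys_met} %Cys+Met")
```

Passing the full 431-residue sequence instead of the fragment gives the whole-protein percentages.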
Secondary structure:
>Translated Secondary Structure
MVDIIAKKRDGKELTTEEIKFFINGYTDGSIPDYQVSALAMAIFFKDMTDRERADLTMAM
CCCCEECCCCCCCCCHHHHEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCCHHHHEEHEE
VESGETIDLSAIEGIKVDKHSTGGVGDTTTLVLGPLVAALDVPVAKMSGRGLGHTGGTID
CCCCCEEEEEECCCEEECCCCCCCCCCHHHHHHHHHHHHHCCCHHHHCCCCCCCCCCCHH
KLEAVEGFHVEITKEQFIDIVNRDKVAVIGQTGNLTPADKKIYALRDVTGTVNSIPLIAS
HHHHCCCCEEEECHHHHHHHHCCCCEEEEECCCCCCCCCCEEEEEECCCCCCHHHHHHHH
SIMSKKIAAGADAIVLDVKTGAGAFMKTEEDAKELAHAMVRIGNNVGRQTMAVISDMSQP
HHHHHHHHCCCCEEEEEECCCCCCCEECHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCC
LGFAIGNALEVKEAIDTLKGEGPEDLTELVLVLGSQMVVLAKKANTLEEAREMLIEVMKN
HHHHHCCHHHHHHHHHHHCCCCHHHHHHHHHHHCCCEEEEEHHHHHHHHHHHHHHHHHHC
GKATEKFKEFLNNQGGDSSIVDNPEKMPQAKYVIDVPAKTSGVISNIVADEIGIAAMLLG
CCHHHHHHHHHCCCCCCCCCCCCHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHH
AGRATKEDEIDLAVGLMLRKKVGDAVKEGEPFVTIYANRENVEDVKAKIYENISIAETAV
CCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHCCCCHHHHHH
APKLVHTVITD
HHHHHHHHHCC
>Mature Secondary Structure
MVDIIAKKRDGKELTTEEIKFFINGYTDGSIPDYQVSALAMAIFFKDMTDRERADLTMAM
CCCCEECCCCCCCCCHHHHEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCCHHHHEEHEE
VESGETIDLSAIEGIKVDKHSTGGVGDTTTLVLGPLVAALDVPVAKMSGRGLGHTGGTID
CCCCCEEEEEECCCEEECCCCCCCCCCHHHHHHHHHHHHHCCCHHHHCCCCCCCCCCCHH
KLEAVEGFHVEITKEQFIDIVNRDKVAVIGQTGNLTPADKKIYALRDVTGTVNSIPLIAS
HHHHCCCCEEEECHHHHHHHHCCCCEEEEECCCCCCCCCCEEEEEECCCCCCHHHHHHHH
SIMSKKIAAGADAIVLDVKTGAGAFMKTEEDAKELAHAMVRIGNNVGRQTMAVISDMSQP
HHHHHHHHCCCCEEEEEECCCCCCCEECHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCC
LGFAIGNALEVKEAIDTLKGEGPEDLTELVLVLGSQMVVLAKKANTLEEAREMLIEVMKN
HHHHHCCHHHHHHHHHHHCCCCHHHHHHHHHHHCCCEEEEEHHHHHHHHHHHHHHHHHHC
GKATEKFKEFLNNQGGDSSIVDNPEKMPQAKYVIDVPAKTSGVISNIVADEIGIAAMLLG
CCHHHHHHHHHCCCCCCCCCCCCHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHH
AGRATKEDEIDLAVGLMLRKKVGDAVKEGEPFVTIYANRENVEDVKAKIYENISIAETAV
CCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHCCCCHHHHHH
APKLVHTVITD
HHHHHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 8550462; 8867804; 9384377; 10568751 [H]