Definition Agrobacterium tumefaciens str. C58 chromosome circular, complete sequence.
Accession NC_003062
Length 2,841,580

The map label for this gene is purL

Identifier: 159184945

GI number: 159184945

Start: 1829966

End: 1832200

Strand: Direct

Name: purL

Synonym: Atu1850

Alternate gene names: 159184945

Gene position: 1829966-1832200 (Clockwise)

Preceding gene: 15889150

Following gene: 159184946

Centisome position: 64.4
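
The coordinate fields above are related by simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the centisome position is the gene start expressed as a percentage of the chromosome length. The record does not state that formula, so treat it as an assumption; the function names are hypothetical.

```python
GENOME_LENGTH = 2_841_580  # "Length" field of NC_003062
START = 1_829_966          # "Start" field
END = 1_832_200            # "End" field


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of chromosome length (assumed definition)."""
    return round(100.0 * start / genome_length, 1)


print(gene_length(START, END))          # 2235
print(centisome(START, GENOME_LENGTH))  # 64.4
```

Under these assumptions the results agree with the 2235-base gene sequence and the centisome value reported in this record.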

GC content: 61.61
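
The GC content is presumably the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming a plain nucleotide string with line breaks allowed; the helper name gc_percent is hypothetical and the input shown is a toy sequence, not this gene.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (for example FASTA line breaks) is ignored and the
    comparison is case-insensitive.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return round(100.0 * gc_count / len(bases), 2)


# Toy input only: 2 of the 4 bases are G or C.
print(gc_percent("ATGC"))  # 50.0
```

To check the value in this record, the gene sequence lines would be passed in without the FASTA header line.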

Gene sequence:

>2235_bases
ATGTCGATTTCAAACTCCATCAAGATCACCCCGGAACTCGTTGCATCCCACGGGCTGAAGCCCGACGAATACCAGCGTAT
CCTCGACCTGATCGGACGGGAGCCCAGCTTTACCGAGCTTGGCATTTTCTCGGCCATGTGGAACGAGCACTGCTCCTACA
AGTCTTCGAAGAAGTGGCTGAAGACCCTGCCGACAACGGGTCCGCGCGTCATTCAGGGCCCCGGTGAAAATGCCGGCGTG
GTGGATATTGACGATGGCGACTGCGTCGTCTTCAAGATGGAGAGCCACAACCACCCGTCCTATATCGAGCCCTATCAGGG
TGCGGCAACCGGCGTCGGCGGCATCCTGCGTGACGTCTTCACCATGGGCGCACGCCCGATCGCCGCCATGAACGCCCTGC
GTTTCGGTGCGCCGGATCATCCGAAGACCCGCCACCTCGTCGCCGGCGTCGTCGCCGGTGTCGGCGGTTACGGCAACTCC
TTCGGTGTGCCGACAGTTGGCGGCGAAGTGGAATTCGACCCGCGTTATAACGGCAATATCCTCGTCAACGCCTTTGCCGC
CGGTCTTGCCAAGTCCAATGCGATCTTCCTCTCGGAAGCCAAGGGCGTCGGCCTGCCGGTCGTTTATCTCGGTGCAAAGA
CCGGCCGCGACGGCGTTGGCGGCGCGACCATGGCCTCTGCCGAATTCGACGAGTCCATCGAAGAAAAGCGCCCGACGGTT
CAAGTCGGCGACCCCTTCACCGAAAAGTGCCTGCTGGAAGCCTGCCTTGAGCTGATGAAGACCGGTGCGGTCATCGCCAT
TCAGGACATGGGTGCCGCCGGCCTCACCTGCTCCGCCGTCGAAATGGGCGCCAAGGGTGATCTCGGCATCGAACTGGACC
TGAACGCCGTGCCGGTGCGCGAAGAGCGCATGACGGCTTACGAAATGATGCTGTCGGAAAGCCAGGAGCGCATGCTCATG
GTTCTCGAGCCCTCCAAGGAAGAAGTCGCCAAGGCGATCTTCGTCAAATGGGGTCTGGACTTCGCCATCGTCGGCAAGAC
CACCGACGACCTGCGCTTCCGCGTGCTGCACAATGGCGAAGAAGTCGCCAACCTGCCGATCAAGGAACTCGGCGACGAGG
CCCCCGAATACGACCGCCCGTGGACGCCAGCCAAGGTGCCCGCAGCGCTTTCGGAAACCGACATCCCCGAAGCCGACATT
GCCGATGCGCTGGTGTCGCTCGTTGGTTCGGCCAACAACTCCTCGCGCCGCTGGGTTTACGAACAGTATGACACGCTGAT
CCAGGGCAATTCCCTGCAGCTGCCGGGCGGTGATGCCGGCGTGGTGCGCGTCGAAGGCCATGACAAGAAGGCGCTCGCTT
TCTCCTCCGACGTGACCCCGCGTTATGTCGAGGCCGATGCCTTCGAAGGCGGCAAGCAGGCGGTCGCCGAATGCTGGCGC
AACATTACCGCAACTGGTGCTCTGCCGCTGGCGGCCACCGACAACCTCAATTTCGGCAATCCGGAAAAGCCCGAAATCAT
GAGCCAGCTGGTCCATGCCATCAAGGGCATCGGCGAGGCCTGCCGCGTGCTGGAATTCCCGATCGTTTCCGGCAACGTCT
CGCTCTACAACGAGACCAATGGTCAGGCGATCCTGCCCACCCCCACCATCGGCGGCGTTGGCCTGCTGAAGGACTGGGGC
CGCATGGCGCGCATCCGTTTCGCGGCTGCCGACGAGGTGGTACTCTTGGTTGGCGCACCTGCTGGCCTCGGCACCCACAT
TGCCCAGTCGGTCTATATGCGTGATGTTCACGGCCGCACCGATGGCCCGGCGCCGCATGTCGATCTCATCGCCGAGAAGA
AGAACGGCGATTTCGTCCGTGGTCTCATCACTGAAGGTCTCACCACCGCCGTTCACGATTGCTCCTCTGGCGGTCTGGCA
CTCGCTGTTGCCGAAATGGCGATTTCATCCGGCATCGGCGCAACGATCGATGCGGTCGAAGGCCATAATCCGATCCTCAC
CTTCTATGGCGAAGATCAGGCCCGTTACGTCCTGACCGTCAAGAAGTCCGATCTCGACAAGGTGCGGGCGGCGGCCAAGG
CGGCCGGTGTTTCCTGCCCTCTTATTGGCACTACCGGTGGTTCCACCGTAAAGCTGGGCACGGCGCGCGCTGTCGAGATT
AAAGAATTGCACTTGGCCTATGAATCGTGGTTCCCTCAGTTCATGGACGGCGAAACTTTGATTGCCGCAGAATGA

Upstream 100 bases:

>100_bases
CAGCGTTTTGCCGGTAAAACTGCGCATTTTCCTGTCGCACGGAGAGGCCTCTTGCGCTAAAGAACCGGAAACTTTCCTCG
GCTCGAACCGCGTGAGACCT

Downstream 100 bases:

>100_bases
ATTAAGGAGAAACACCATGCCCATGAAACCCGGCGACATTGAAGACATGATTAAGGCGGGAATTCCCGGGGCAAAGGTCA
CGATCCGCGATCTGGCCGGT

Product: phosphoribosylformylglycinamidine synthase II

Products: NA

Alternate protein names: Phosphoribosylformylglycinamidine synthase II; FGAM synthase II

Number of amino acids: Translated: 744; Mature: 743
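
The two residue counts follow from the gene length. As a rough sketch: 2235 bases give 745 codons, the final codon (TGA) is a stop, and the mature form is one residue shorter, assuming the only processing is removal of the initiator methionine (the mature sequence listed in this record starts at the second residue). The function names are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard stop codons


def translated_length(n_bases: int, last_codon: str) -> int:
    """Residue count of the translated product of a complete reading frame."""
    if n_bases % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    codons = n_bases // 3
    # A terminal stop codon does not encode a residue.
    return codons - 1 if last_codon.upper() in STOP_CODONS else codons


def mature_length(translated: int, initiator_met_removed: bool = True) -> int:
    """Residue count after optional removal of the N-terminal methionine."""
    return translated - 1 if initiator_met_removed else translated


n_translated = translated_length(2235, "TGA")
print(n_translated)                 # 744
print(mature_length(n_translated))  # 743
```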

Protein sequence:

>744_residues
MSISNSIKITPELVASHGLKPDEYQRILDLIGREPSFTELGIFSAMWNEHCSYKSSKKWLKTLPTTGPRVIQGPGENAGV
VDIDDGDCVVFKMESHNHPSYIEPYQGAATGVGGILRDVFTMGARPIAAMNALRFGAPDHPKTRHLVAGVVAGVGGYGNS
FGVPTVGGEVEFDPRYNGNILVNAFAAGLAKSNAIFLSEAKGVGLPVVYLGAKTGRDGVGGATMASAEFDESIEEKRPTV
QVGDPFTEKCLLEACLELMKTGAVIAIQDMGAAGLTCSAVEMGAKGDLGIELDLNAVPVREERMTAYEMMLSESQERMLM
VLEPSKEEVAKAIFVKWGLDFAIVGKTTDDLRFRVLHNGEEVANLPIKELGDEAPEYDRPWTPAKVPAALSETDIPEADI
ADALVSLVGSANNSSRRWVYEQYDTLIQGNSLQLPGGDAGVVRVEGHDKKALAFSSDVTPRYVEADAFEGGKQAVAECWR
NITATGALPLAATDNLNFGNPEKPEIMSQLVHAIKGIGEACRVLEFPIVSGNVSLYNETNGQAILPTPTIGGVGLLKDWG
RMARIRFAAADEVVLLVGAPAGLGTHIAQSVYMRDVHGRTDGPAPHVDLIAEKKNGDFVRGLITEGLTTAVHDCSSGGLA
LAVAEMAISSGIGATIDAVEGHNPILTFYGEDQARYVLTVKKSDLDKVRAAAKAAGVSCPLIGTTGGSTVKLGTARAVEI
KELHLAYESWFPQFMDGETLIAAE

Sequences:

>Translated_744_residues
MSISNSIKITPELVASHGLKPDEYQRILDLIGREPSFTELGIFSAMWNEHCSYKSSKKWLKTLPTTGPRVIQGPGENAGV
VDIDDGDCVVFKMESHNHPSYIEPYQGAATGVGGILRDVFTMGARPIAAMNALRFGAPDHPKTRHLVAGVVAGVGGYGNS
FGVPTVGGEVEFDPRYNGNILVNAFAAGLAKSNAIFLSEAKGVGLPVVYLGAKTGRDGVGGATMASAEFDESIEEKRPTV
QVGDPFTEKCLLEACLELMKTGAVIAIQDMGAAGLTCSAVEMGAKGDLGIELDLNAVPVREERMTAYEMMLSESQERMLM
VLEPSKEEVAKAIFVKWGLDFAIVGKTTDDLRFRVLHNGEEVANLPIKELGDEAPEYDRPWTPAKVPAALSETDIPEADI
ADALVSLVGSANNSSRRWVYEQYDTLIQGNSLQLPGGDAGVVRVEGHDKKALAFSSDVTPRYVEADAFEGGKQAVAECWR
NITATGALPLAATDNLNFGNPEKPEIMSQLVHAIKGIGEACRVLEFPIVSGNVSLYNETNGQAILPTPTIGGVGLLKDWG
RMARIRFAAADEVVLLVGAPAGLGTHIAQSVYMRDVHGRTDGPAPHVDLIAEKKNGDFVRGLITEGLTTAVHDCSSGGLA
LAVAEMAISSGIGATIDAVEGHNPILTFYGEDQARYVLTVKKSDLDKVRAAAKAAGVSCPLIGTTGGSTVKLGTARAVEI
KELHLAYESWFPQFMDGETLIAAE
>Mature_743_residues
SISNSIKITPELVASHGLKPDEYQRILDLIGREPSFTELGIFSAMWNEHCSYKSSKKWLKTLPTTGPRVIQGPGENAGVV
DIDDGDCVVFKMESHNHPSYIEPYQGAATGVGGILRDVFTMGARPIAAMNALRFGAPDHPKTRHLVAGVVAGVGGYGNSF
GVPTVGGEVEFDPRYNGNILVNAFAAGLAKSNAIFLSEAKGVGLPVVYLGAKTGRDGVGGATMASAEFDESIEEKRPTVQ
VGDPFTEKCLLEACLELMKTGAVIAIQDMGAAGLTCSAVEMGAKGDLGIELDLNAVPVREERMTAYEMMLSESQERMLMV
LEPSKEEVAKAIFVKWGLDFAIVGKTTDDLRFRVLHNGEEVANLPIKELGDEAPEYDRPWTPAKVPAALSETDIPEADIA
DALVSLVGSANNSSRRWVYEQYDTLIQGNSLQLPGGDAGVVRVEGHDKKALAFSSDVTPRYVEADAFEGGKQAVAECWRN
ITATGALPLAATDNLNFGNPEKPEIMSQLVHAIKGIGEACRVLEFPIVSGNVSLYNETNGQAILPTPTIGGVGLLKDWGR
MARIRFAAADEVVLLVGAPAGLGTHIAQSVYMRDVHGRTDGPAPHVDLIAEKKNGDFVRGLITEGLTTAVHDCSSGGLAL
AVAEMAISSGIGATIDAVEGHNPILTFYGEDQARYVLTVKKSDLDKVRAAAKAAGVSCPLIGTTGGSTVKLGTARAVEIK
ELHLAYESWFPQFMDGETLIAAE

Specific function: Unknown

COG id: COG0046

COG function: function code F; Phosphoribosylformylglycinamidine (FGAM) synthase, synthetase domain

Gene ontology:

Cell location: Cytoplasm

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the FGAMS family

Homologues:

Organism=Homo sapiens, GI31657129, Length=334, Percent_Identity=26.6467065868263, Blast_Score=75, Evalue=2e-13
Organism=Escherichia coli, GI48994899, Length=778, Percent_Identity=24.1645244215938, Blast_Score=123, Evalue=5e-29
Organism=Caenorhabditis elegans, GI17553022, Length=440, Percent_Identity=23.6363636363636, Blast_Score=79, Evalue=9e-15
Organism=Saccharomyces cerevisiae, GI6321498, Length=798, Percent_Identity=22.1804511278195, Blast_Score=102, Evalue=3e-22
Organism=Drosophila melanogaster, GI24582111, Length=743, Percent_Identity=24.4952893674293, Blast_Score=100, Evalue=4e-21
Organism=Drosophila melanogaster, GI24582109, Length=743, Percent_Identity=24.4952893674293, Blast_Score=100, Evalue=4e-21
Organism=Drosophila melanogaster, GI17137292, Length=743, Percent_Identity=24.4952893674293, Blast_Score=100, Evalue=4e-21

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): PURL_AGRT5 (Q8UEB0)

Other databases:

- EMBL:   AE007869
- PIR:   AH2803
- PIR:   H97582
- RefSeq:   NP_354832.2
- ProteinModelPortal:   Q8UEB0
- SMR:   Q8UEB0
- STRING:   Q8UEB0
- GeneID:   1133888
- GenomeReviews:   AE007869_GR
- KEGG:   atu:Atu1850
- eggNOG:   COG0046
- HOGENOM:   HBG311214
- OMA:   YGNSFGV
- PhylomeDB:   Q8UEB0
- ProtClustDB:   PRK01213
- BioCyc:   ATUM176299-1:ATU1850-MONOMER
- GO:   GO:0005737
- HAMAP:   MF_00420
- InterPro:   IPR000728
- InterPro:   IPR010918
- InterPro:   IPR010074
- InterPro:   IPR016188
- TIGRFAMs:   TIGR01736

Pfam domain/function: PF00586 AIRS; PF02769 AIRS_C; SSF56042 AIR_synth_C; SSF55326 PurM_N-like

EC number: 6.3.5.3

Molecular weight: Translated: 79065; Mature: 78934

Theoretical pI: Translated: 4.72; Mature: 4.72

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.2 %Cys     (Translated Protein)
2.6 %Met     (Translated Protein)
3.8 %Cys+Met (Translated Protein)
1.2 %Cys     (Mature Protein)
2.4 %Met     (Mature Protein)
3.6 %Cys+Met (Mature Protein)
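
The percentages above can be derived by counting C and M residues in a protein sequence. A minimal sketch follows, assuming one-letter amino acid codes and rounding to one decimal place as in this record; the helper name cys_met_percent is hypothetical and the input is a toy peptide, not this protein.

```python
def cys_met_percent(protein: str) -> dict:
    """Cys, Met and combined content as percentages of sequence length."""
    residues = "".join(protein.split()).upper()
    if not residues:
        raise ValueError("empty sequence")
    n = len(residues)
    cys = residues.count("C")
    met = residues.count("M")
    return {
        "Cys": round(100.0 * cys / n, 1),
        "Met": round(100.0 * met / n, 1),
        # Computed from the raw counts, not by adding the rounded values.
        "Cys+Met": round(100.0 * (cys + met) / n, 1),
    }


toy = "MCAAAAAAAA"                # toy peptide: 1 Met, 1 Cys, 10 residues
print(cys_met_percent(toy))       # translated form
print(cys_met_percent(toy[1:]))   # mature form, initiator Met removed
```

Dropping the initiator methionine lowers the Met count by one and shortens the sequence by one residue, which is why the Met figure can differ between the translated and mature forms while the Cys figure stays almost the same.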

Secondary structure:

>Translated Secondary Structure
MSISNSIKITPELVASHGLKPDEYQRILDLIGREPSFTELGIFSAMWNEHCSYKSSKKWL
CCCCCCEEECHHHHHHCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCHHHHHH
KTLPTTGPRVIQGPGENAGVVDIDDGDCVVFKMESHNHPSYIEPYQGAATGVGGILRDVF
HHCCCCCCEEEECCCCCCCEEEECCCCEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHH
TMGARPIAAMNALRFGAPDHPKTRHLVAGVVAGVGGYGNSFGVPTVGGEVEFDPRYNGNI
HHCCCHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCEEEECCCCCCCE
LVNAFAAGLAKSNAIFLSEAKGVGLPVVYLGAKTGRDGVGGATMASAEFDESIEEKRPTV
EEEEHHHHHCCCCEEEEECCCCCCCEEEEEECCCCCCCCCCCEEECHHHHHHHHHCCCEE
QVGDPFTEKCLLEACLELMKTGAVIAIQDMGAAGLTCSAVEMGAKGDLGIELDLNAVPVR
EECCCHHHHHHHHHHHHHHHCCCEEEEECCCCCCCEEHHEECCCCCCCEEEEECCCCCCH
EERMTAYEMMLSESQERMLMVLEPSKEEVAKAIFVKWGLDFAIVGKTTDDLRFRVLHNGE
HHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHEECCCEEEEECCCCCEEEEEEECCC
EVANLPIKELGDEAPEYDRPWTPAKVPAALSETDIPEADIADALVSLVGSANNSSRRWVY
HHHCCCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCEEH
EQYDTLIQGNSLQLPGGDAGVVRVEGHDKKALAFSSDVTPRYVEADAFEGGKQAVAECWR
HHHHHHCCCCEEECCCCCCCEEEEECCCCEEEEECCCCCCCEEECCCCCCHHHHHHHHHH
NITATGALPLAATDNLNFGNPEKPEIMSQLVHAIKGIGEACRVLEFPIVSGNVSLYNETN
CCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHEECEEEECCEEEEECCC
GQAILPTPTIGGVGLLKDWGRMARIRFAAADEVVLLVGAPAGLGTHIAQSVYMRDVHGRT
CCEEECCCCCCCCHHHHHHHHHHEEEEEECCCEEEEEECCCCHHHHHHHHHHHHHHCCCC
DGPAPHVDLIAEKKNGDFVRGLITEGLTTAVHDCSSGGLALAVAEMAISSGIGATIDAVE
CCCCCCEEEEEECCCCCHHHHHHHHCHHHHHHHCCCCCHHHHHHHHHHHCCCCCEEEEEC
GHNPILTFYGEDQARYVLTVKKSDLDKVRAAAKAAGVSCPLIGTTGGSTVKLGTARAVEI
CCCCEEEEEECCCEEEEEEEECCCHHHHHHHHHHCCCCCCEEECCCCCEEEECCCCEEEH
KELHLAYESWFPQFMDGETLIAAE
HHHHHHHHHHCCHHCCCCEEEEEC
>Mature Secondary Structure 
SISNSIKITPELVASHGLKPDEYQRILDLIGREPSFTELGIFSAMWNEHCSYKSSKKWL
CCCCCEEECHHHHHHCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCHHHHHH
KTLPTTGPRVIQGPGENAGVVDIDDGDCVVFKMESHNHPSYIEPYQGAATGVGGILRDVF
HHCCCCCCEEEECCCCCCCEEEECCCCEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHH
TMGARPIAAMNALRFGAPDHPKTRHLVAGVVAGVGGYGNSFGVPTVGGEVEFDPRYNGNI
HHCCCHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCEEEECCCCCCCE
LVNAFAAGLAKSNAIFLSEAKGVGLPVVYLGAKTGRDGVGGATMASAEFDESIEEKRPTV
EEEEHHHHHCCCCEEEEECCCCCCCEEEEEECCCCCCCCCCCEEECHHHHHHHHHCCCEE
QVGDPFTEKCLLEACLELMKTGAVIAIQDMGAAGLTCSAVEMGAKGDLGIELDLNAVPVR
EECCCHHHHHHHHHHHHHHHCCCEEEEECCCCCCCEEHHEECCCCCCCEEEEECCCCCCH
EERMTAYEMMLSESQERMLMVLEPSKEEVAKAIFVKWGLDFAIVGKTTDDLRFRVLHNGE
HHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHEECCCEEEEECCCCCEEEEEEECCC
EVANLPIKELGDEAPEYDRPWTPAKVPAALSETDIPEADIADALVSLVGSANNSSRRWVY
HHHCCCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCEEH
EQYDTLIQGNSLQLPGGDAGVVRVEGHDKKALAFSSDVTPRYVEADAFEGGKQAVAECWR
HHHHHHCCCCEEECCCCCCCEEEEECCCCEEEEECCCCCCCEEECCCCCCHHHHHHHHHH
NITATGALPLAATDNLNFGNPEKPEIMSQLVHAIKGIGEACRVLEFPIVSGNVSLYNETN
CCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHEECEEEECCEEEEECCC
GQAILPTPTIGGVGLLKDWGRMARIRFAAADEVVLLVGAPAGLGTHIAQSVYMRDVHGRT
CCEEECCCCCCCCHHHHHHHHHHEEEEEECCCEEEEEECCCCHHHHHHHHHHHHHHCCCC
DGPAPHVDLIAEKKNGDFVRGLITEGLTTAVHDCSSGGLALAVAEMAISSGIGATIDAVE
CCCCCCEEEEEECCCCCHHHHHHHHCHHHHHHHCCCCCHHHHHHHHHHHCCCCCEEEEEC
GHNPILTFYGEDQARYVLTVKKSDLDKVRAAAKAAGVSCPLIGTTGGSTVKLGTARAVEI
CCCCEEEEEECCCEEEEEEEECCCHHHHHHHHHHCCCCCCEEECCCCCEEEECCCCEEEH
KELHLAYESWFPQFMDGETLIAAE
HHHHHHHHHHCCHHCCCCEEEEEC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 11743193; 11743194