Definition | Agrobacterium tumefaciens str. C58 chromosome circular, complete sequence. |
---|---|
Accession | NC_003062 |
Length | 2,841,580 |
The map label for this gene is purC
Identifier: 15889145
GI number: 15889145
Start: 1824862
End: 1825626
Strand: Direct
Name: purC
Synonym: Atu1843
Alternate gene names: 15889145
Gene position: 1824862-1825626 (Clockwise)
Preceding gene: 159184944
Following gene: 17935734
Centisome position: 64.22
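The Start, End and Centisome position fields above are consistent with simple arithmetic on the record's own numbers. A minimal sketch, assuming (the record does not state this) that the centisome position is the gene start expressed as a percentage of the chromosome length; the function names are illustrative only:

```python
GENOME_LENGTH = 2_841_580            # Length field of NC_003062
START, END = 1_824_862, 1_825_626    # Start and End fields of this record


def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive, 1-based coordinate range."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of chromosome length (assumed definition)."""
    return round(100 * start / genome_length, 2)


print(gene_length(START, END))          # 765
print(centisome(START, GENOME_LENGTH))  # 64.22
```

Both values agree with the 765-base gene sequence and the centisome position of 64.22 listed in this record.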
GC content: 58.69
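GC content is conventionally the percentage of G and C bases in the sequence. A minimal sketch of that calculation, assuming the value in this record was obtained the same way; `gc_content` is a hypothetical helper, and the example input is only the first 18 bases of the gene sequence listed below:

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a DNA sequence, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(bases.count(base) for base in "GC")
    return round(100 * gc / len(bases), 2)


# First 18 bases of the purC gene sequence, used here as a short example.
print(gc_content("ATGAACCGTCGCCGCCGT"))  # 66.67
```

Passing the full 765-base gene sequence to the same function would be the way to check the 58.69 figure.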
Gene sequence:
>765_bases
ATGAACCGTCGCCGCCGTATTTACGAAGGCAAGGCCAAGATCCTCTACGAAGGTCCGGAGCCAGGCACGCTTATCCAGTT
CTTCAAGGACGATGCGACCGCCTTCAACAAGAAGAAGCACGAAGTCATCGACGGCAAGGGTGTGCTGAACAACCGCATCT
GCGAATATGTCTTCACGCATCTGAACAAGATCGGTATTCCGACCCACTTCATCCGCCGCCTCAACATGCGCGAGCAGTTG
ATCAAGGAAGTGGAGATGATCCCGCTCGAGATCGTCGTGCGCAATGTCGCCGCCGGCTCGCTGTCCAAGCGCCTCGGCAT
CGAGGAAGGCGTGGTGCTGCCGCGCTCCATCATCGAGTTCTATTACAAGTCGGATGAGCTGGAAGACCCGATGGTCTCCG
AAGAGCACATCACGGCTTTCGGCTGGGCCAACCCCGCCGAGCTTGACGACATCATGGCGCTTGCCATCCGCGTCAACGAC
TTCCTGTCCGGCCTCTTCCTCGGCGTCGGCATCCAGCTCGTCGATTTCAAGATCGAATGCGGCCGGCTGTTCGAAGGCGA
CATGATGCGCATCATCCTCGCCGACGAGATTTCGCCCGATAGCTGCCGCCTGTGGGACATCGAGACCCAGAAAAAGATGG
ACAAGGACCTGTTCCGCCGCGATCTGGGCGGGCTTGTGGAAGCCTATTCCGAAGTGGCGCGCCGTCTCGGCATCATCAAT
GAAAACGAACCGATCCGCGGCACCGGCCCAGTCCTCGTAAAGTGA
Upstream 100 bases:
>100_bases
GCGCGCCTCATTGTAGAATGCAGTCTTTTCCTTTAAGGGACCGCTTCAAAGACCAAACTTAACGCCACCCGAATCGGCGT
CAACCAAGAGACCTGCCATT
Downstream 100 bases:
>100_bases
TATCGGCCTCGTAAAGTAATATTGGCTTCGTAGGGTAAATCAGGAGAAGATCAGGTGATCAAGGCACGGGTGACTGTTAC
GCTGAAGAACGGCGTTCTCG
Product: phosphoribosylaminoimidazole-succinocarboxamide synthase
Products: NA
Alternate protein names: SAICAR synthetase 1
Number of amino acids: Translated: 254; Mature: 254
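The residue count follows from the gene length: the 765-base gene sequence listed in this record ends in the stop codon TGA, so it holds 255 codons, of which 254 encode amino acids. A minimal sketch of that check, with `expected_residues` as a hypothetical helper name:

```python
def expected_residues(cds_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a coding sequence of the given length."""
    if cds_length % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = cds_length // 3
    # The terminal stop codon does not encode a residue.
    return codons - 1 if has_stop_codon else codons


print(expected_residues(765))  # 254
```

The translated and mature counts are identical here, which is consistent with the record listing no signal sequence.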
Protein sequence:
>254_residues
MNRRRRIYEGKAKILYEGPEPGTLIQFFKDDATAFNKKKHEVIDGKGVLNNRICEYVFTHLNKIGIPTHFIRRLNMREQL
IKEVEMIPLEIVVRNVAAGSLSKRLGIEEGVVLPRSIIEFYYKSDELEDPMVSEEHITAFGWANPAELDDIMALAIRVND
FLSGLFLGVGIQLVDFKIECGRLFEGDMMRIILADEISPDSCRLWDIETQKKMDKDLFRRDLGGLVEAYSEVARRLGIIN
ENEPIRGTGPVLVK
Sequences:
>Translated_254_residues
MNRRRRIYEGKAKILYEGPEPGTLIQFFKDDATAFNKKKHEVIDGKGVLNNRICEYVFTHLNKIGIPTHFIRRLNMREQL
IKEVEMIPLEIVVRNVAAGSLSKRLGIEEGVVLPRSIIEFYYKSDELEDPMVSEEHITAFGWANPAELDDIMALAIRVND
FLSGLFLGVGIQLVDFKIECGRLFEGDMMRIILADEISPDSCRLWDIETQKKMDKDLFRRDLGGLVEAYSEVARRLGIIN
ENEPIRGTGPVLVK
>Mature_254_residues
MNRRRRIYEGKAKILYEGPEPGTLIQFFKDDATAFNKKKHEVIDGKGVLNNRICEYVFTHLNKIGIPTHFIRRLNMREQL
IKEVEMIPLEIVVRNVAAGSLSKRLGIEEGVVLPRSIIEFYYKSDELEDPMVSEEHITAFGWANPAELDDIMALAIRVND
FLSGLFLGVGIQLVDFKIECGRLFEGDMMRIILADEISPDSCRLWDIETQKKMDKDLFRRDLGGLVEAYSEVARRLGIIN
ENEPIRGTGPVLVK
Specific function: De novo purine biosynthesis; seventh step. [C]
COG id: COG0152
COG function: function code F; Phosphoribosylaminoimidazolesuccinocarboxamide (SAICAR) synthase
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the SAICAR synthetase family
Homologues:
Organism=Homo sapiens, GI119220557, Length=228, Percent_Identity=33.7719298245614, Blast_Score=106, Evalue=2e-23
Organism=Homo sapiens, GI5453539, Length=228, Percent_Identity=33.7719298245614, Blast_Score=106, Evalue=2e-23
Organism=Homo sapiens, GI119220559, Length=235, Percent_Identity=32.3404255319149, Blast_Score=97, Evalue=2e-20
Organism=Escherichia coli, GI1788820, Length=238, Percent_Identity=45.3781512605042, Blast_Score=211, Evalue=5e-56
Organism=Caenorhabditis elegans, GI17531275, Length=223, Percent_Identity=31.390134529148, Blast_Score=92, Evalue=3e-19
Organism=Drosophila melanogaster, GI18860083, Length=222, Percent_Identity=32.4324324324324, Blast_Score=97, Evalue=1e-20
Organism=Drosophila melanogaster, GI24583917, Length=250, Percent_Identity=25.6, Blast_Score=74, Evalue=1e-13
Paralogues:
None
Copy number: 1680 Molecules/Cell In: Growth Phase, Minimal Media (Based on E. coli). 13,000 Molecules/Cell In: Glucose minimal media [C]
Swissprot (AC and ID): PUR71_AGRT5 (Q8UEB7)
Other databases:
- EMBL: AE007869
- PIR: AI2802
- PIR: B97582
- RefSeq: NP_354826.1
- ProteinModelPortal: Q8UEB7
- SMR: Q8UEB7
- STRING: Q8UEB7
- GeneID: 1133881
- GenomeReviews: AE007869_GR
- KEGG: atu:Atu1843
- eggNOG: COG0152
- HOGENOM: HBG306070
- OMA: YKDDALG
- PhylomeDB: Q8UEB7
- ProtClustDB: PRK09362
- BioCyc: ATUM176299-1:ATU1843-MONOMER
- HAMAP: MF_00137
- InterPro: IPR013816
- InterPro: IPR001636
- InterPro: IPR018236
- Gene3D: G3DSA:3.30.470.20
- PANTHER: PTHR11609
- TIGRFAMs: TIGR00081
Pfam domain/function: PF01259 SAICAR_synt
EC number: 6.3.2.6
Molecular weight: Translated: 29039; Mature: 29039
Theoretical pI: Translated: 5.50; Mature: 5.50
Prosite motif: PS01057 SAICAR_SYNTHETASE_1; PS01058 SAICAR_SYNTHETASE_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.2 %Cys (Translated Protein)
3.1 %Met (Translated Protein)
4.3 %Cys+Met (Translated Protein)
1.2 %Cys (Mature Protein)
3.1 %Met (Mature Protein)
4.3 %Cys+Met (Mature Protein)
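The Cys/Met percentages can be reproduced by counting residues in the 254-residue protein sequence given in this record. A minimal sketch, assuming the percentages are residue counts divided by sequence length and rounded to one decimal place; `residue_percent` is a hypothetical helper name:

```python
# Protein sequence copied from the "Protein sequence" field of this record.
PROTEIN = (
    "MNRRRRIYEGKAKILYEGPEPGTLIQFFKDDATAFNKKKHEVIDGKGVLNNRICEYVFTHLNKIGIPTHFIRRLNMREQL"
    "IKEVEMIPLEIVVRNVAAGSLSKRLGIEEGVVLPRSIIEFYYKSDELEDPMVSEEHITAFGWANPAELDDIMALAIRVND"
    "FLSGLFLGVGIQLVDFKIECGRLFEGDMMRIILADEISPDSCRLWDIETQKKMDKDLFRRDLGGLVEAYSEVARRLGIIN"
    "ENEPIRGTGPVLVK"
)


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    count = sum(protein.count(residue) for residue in residues)
    return round(100 * count / len(protein), 1)


print(len(PROTEIN))                    # 254
print(residue_percent(PROTEIN, "C"))   # 1.2
print(residue_percent(PROTEIN, "M"))   # 3.1
print(residue_percent(PROTEIN, "CM"))  # 4.3
```

The sequence contains 3 cysteines and 8 methionines, which gives the values listed above for both the translated and mature forms.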
Secondary structure:
>Translated Secondary Structure
MNRRRRIYEGKAKILYEGPEPGTLIQFFKDDATAFNKKKHEVIDGKGVLNNRICEYVFTH
CCCCCCEECCCEEEEEECCCCCHHHHHHHHHHHHHCCHHHHHCCCCCCHHHHHHHHHHHH
LNKIGIPTHFIRRLNMREQLIKEVEMIPLEIVVRNVAAGSLSKRLGIEEGVVLPRSIIEF
HHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHH
YYKSDELEDPMVSEEHITAFGWANPAELDDIMALAIRVNDFLSGLFLGVGIQLVDFKIEC
HHCCCCCCCCCCCCCCEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHCCEEEEEEEEEC
GRLFEGDMMRIILADEISPDSCRLWDIETQKKMDKDLFRRDLGGLVEAYSEVARRLGIIN
CCCCCCCEEEEEEECCCCCCCEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
ENEPIRGTGPVLVK
CCCCCCCCCCEEEC
>Mature Secondary Structure
MNRRRRIYEGKAKILYEGPEPGTLIQFFKDDATAFNKKKHEVIDGKGVLNNRICEYVFTH
CCCCCCEECCCEEEEEECCCCCHHHHHHHHHHHHHCCHHHHHCCCCCCHHHHHHHHHHHH
LNKIGIPTHFIRRLNMREQLIKEVEMIPLEIVVRNVAAGSLSKRLGIEEGVVLPRSIIEF
HHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHH
YYKSDELEDPMVSEEHITAFGWANPAELDDIMALAIRVNDFLSGLFLGVGIQLVDFKIEC
HHCCCCCCCCCCCCCCEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHCCEEEEEEEEEC
GRLFEGDMMRIILADEISPDSCRLWDIETQKKMDKDLFRRDLGGLVEAYSEVARRLGIIN
CCCCCCCEEEEEEECCCCCCCEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
ENEPIRGTGPVLVK
CCCCCCCCCCEEEC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11743193; 11743194