Definition Prochlorococcus marinus str. NATL1A, complete genome.
Accession NC_008819
Length 1,864,731

The map label for this gene is purA

Identifier: 124025275

GI number: 124025275

Start: 510371

End: 511684

Strand: Reverse

Name: purA

Synonym: NATL1_05641

Alternate gene names: 124025275

Gene position: 511684-510371 (Counterclockwise)

Preceding gene: 124025276

Following gene: 124025274

Centisome position: 27.44
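
The coordinate fields above are related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming that the centisome position is the gene's first base (the higher coordinate for a reverse-strand gene) expressed as a percentage of the genome length. The function names are illustrative and not part of this database.

```python
GENOME_LENGTH = 1_864_731  # "Length" field of NC_008819


def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based inclusive coordinates."""
    return abs(end - start) + 1


def centisome(first_base: int, genome_length: int) -> float:
    """Position of the gene's first base as a percentage of the genome."""
    return round(100 * first_base / genome_length, 2)


length = gene_length(510371, 511684)
# Assumption: on the reverse strand the gene begins at the higher coordinate.
position = centisome(511684, GENOME_LENGTH)
print(length, position)
```

Under these assumptions the values agree with the 1314-base gene sequence and the centisome position listed in this record.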

GC content: 39.5

Gene sequence:

>1314_bases
TTGGCAAATGTTGTAGTTATAGGCGCTCAATGGGGTGATGAAGGTAAAGGCAAAATCACCGATTTGCTCAGTCGCTCCGC
AGATGTAGTTGTTCGCTATCAAGGTGGGGTTAATGCCGGTCATACAATTGTTGTAGAGGACAAAGTTCTCAAATTACATT
TAATCCCCTCAGGGATTTTGTATCCAGATACGATTTGTCTTATTGGGTCAGGAACAGTTGTTGATCCAAAAGTAATGATT
AAAGAAATAAAAATGCTTGAAGATAACGATATTGATATTTCAGGCCTAAAGCTTGCTTCAACTGCTCATGTAACGATGCC
ATATCACAGGCTTCTTGATTTGGCTATGGAGCAAAAGCGAGGTGACCAAAAAATCGGTACTACTGGTAGGGGAATTGGTC
CAACATATGCTGACAAATCTCAAAGAAATGGGATTCGAATAATTGATCTGATGAGCAGGGAAAAGCTTCAAGAAAGATTA
CAGGTCCCTTTATCAGAAAAAAACGGACTACTCCAAAAGATTTATGGAATCGAGCCATTAATTATTGATGAAATAATAGA
AGAATATCTTGATTACGGAAAACAGCTAAAAAAACATATTGTTGACTGCAACCGAACAATTCATCAAGCAGCCAGAAAAA
AGAAGAACATATTATTTGAGGGAGCTCAAGGGACTCTTTTAGATCTTGATCACGGAACCTACCCTTATGTAACCTCCTCT
AATCCAGTGTCAGGTGGGGCCTGTATTGGCGCAGGAGTAGGTCCTACCCTTATAGACAGGGTTATCGGGGTTGCTAAAGC
CTATACGACTAGAGTTGGAGAAGGACCCTTCCCGACTGAATTGCACGGGAGTATTAATGATCAACTTTGTGATAGAGGTG
GTGAATTTGGAACAACTACTGGACGAAGAAGGAGATGTGGCTGGTTTGATGGAGTAATTGGAAAATACGCAGTTGAGGTA
AATGGATTGGATTGCCTTGCTATTACTAAATTAGACGTTTTGGATGAGCTAGAAGAAATTGATATATGTGTGGCTTATGA
ATTAAATGGTAAAAGAATTGATTATTTCCCAACTAGTGTTGAAGATTTTGAAAAATGTAATCCGATTTTCAAAAAGCTAC
CTGGCTGGAGATGCTCTACAGAAAATTGTCGTCGTCTAGAAGATCTTCCACCTGCAGCAATGAGTTACTTGAGATTCCTA
GCTGAGCTCATGGAAGTACCCATAGCAATAGTTTCCCTCGGAGCTAATAGAGATCAAACAATTGTTATTGAAGATCCTAT
TCACGGACCCAAAAGAGCTCTTTTGAATTCTTAA
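
The GC content field is presumably the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows; the function name and the rounding to one decimal place are assumptions, and the example uses only the first 20 bases of the sequence above.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases; whitespace and line breaks are ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return round(100 * gc / len(bases), 1)


# First 20 bases of the gene sequence, split across two lines as in a FASTA block.
example = gc_content("TTGGCAAATG\nTTGTAGTTAT")
print(example)
```

Passing the full 1314-base block to `gc_content` would give the value for the whole gene.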

Upstream 100 bases:

>100_bases
GTTAAAGATAATCTTTAATTCATTTCTTTGACTCAGGGTAGGGCTATCTGCAAATCTTATTTCTTCTAAAGTAAATCGGT
TTCGAGCCGAATTTTCTTCC

Downstream 100 bases:

>100_bases
AAAACAAACAATTGTTCTTTTATGATTTTTAAACAAACTTGATGACTAATGAAACAAACGCCGCTTCGCTAGACATAGTA
GGTATTGGGAACGCAATTGT

Product: adenylosuccinate synthetase

Products: NA

Alternate protein names: AMPSase; AdSS; IMP--aspartate ligase

Number of amino acids: Translated: 437; Mature: 436

Protein sequence:

>437_residues
MANVVVIGAQWGDEGKGKITDLLSRSADVVVRYQGGVNAGHTIVVEDKVLKLHLIPSGILYPDTICLIGSGTVVDPKVMI
KEIKMLEDNDIDISGLKLASTAHVTMPYHRLLDLAMEQKRGDQKIGTTGRGIGPTYADKSQRNGIRIIDLMSREKLQERL
QVPLSEKNGLLQKIYGIEPLIIDEIIEEYLDYGKQLKKHIVDCNRTIHQAARKKKNILFEGAQGTLLDLDHGTYPYVTSS
NPVSGGACIGAGVGPTLIDRVIGVAKAYTTRVGEGPFPTELHGSINDQLCDRGGEFGTTTGRRRRCGWFDGVIGKYAVEV
NGLDCLAITKLDVLDELEEIDICVAYELNGKRIDYFPTSVEDFEKCNPIFKKLPGWRCSTENCRRLEDLPPAAMSYLRFL
AELMEVPIAIVSLGANRDQTIVIEDPIHGPKRALLNS

Sequences:

>Translated_437_residues
MANVVVIGAQWGDEGKGKITDLLSRSADVVVRYQGGVNAGHTIVVEDKVLKLHLIPSGILYPDTICLIGSGTVVDPKVMI
KEIKMLEDNDIDISGLKLASTAHVTMPYHRLLDLAMEQKRGDQKIGTTGRGIGPTYADKSQRNGIRIIDLMSREKLQERL
QVPLSEKNGLLQKIYGIEPLIIDEIIEEYLDYGKQLKKHIVDCNRTIHQAARKKKNILFEGAQGTLLDLDHGTYPYVTSS
NPVSGGACIGAGVGPTLIDRVIGVAKAYTTRVGEGPFPTELHGSINDQLCDRGGEFGTTTGRRRRCGWFDGVIGKYAVEV
NGLDCLAITKLDVLDELEEIDICVAYELNGKRIDYFPTSVEDFEKCNPIFKKLPGWRCSTENCRRLEDLPPAAMSYLRFL
AELMEVPIAIVSLGANRDQTIVIEDPIHGPKRALLNS
>Mature_436_residues
ANVVVIGAQWGDEGKGKITDLLSRSADVVVRYQGGVNAGHTIVVEDKVLKLHLIPSGILYPDTICLIGSGTVVDPKVMIK
EIKMLEDNDIDISGLKLASTAHVTMPYHRLLDLAMEQKRGDQKIGTTGRGIGPTYADKSQRNGIRIIDLMSREKLQERLQ
VPLSEKNGLLQKIYGIEPLIIDEIIEEYLDYGKQLKKHIVDCNRTIHQAARKKKNILFEGAQGTLLDLDHGTYPYVTSSN
PVSGGACIGAGVGPTLIDRVIGVAKAYTTRVGEGPFPTELHGSINDQLCDRGGEFGTTTGRRRRCGWFDGVIGKYAVEVN
GLDCLAITKLDVLDELEEIDICVAYELNGKRIDYFPTSVEDFEKCNPIFKKLPGWRCSTENCRRLEDLPPAAMSYLRFLA
ELMEVPIAIVSLGANRDQTIVIEDPIHGPKRALLNS
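
The translated and mature records above differ only in the N-terminal methionine, and the residue count follows from the gene length. The gene begins with TTG rather than ATG; bacterial initiator codons are read as methionine regardless. A minimal sketch of both relationships, assuming the open reading frame includes exactly one stop codon; the function names are illustrative.

```python
def expected_residues(n_bases: int) -> int:
    """Residues encoded by an ORF of n_bases that includes one stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    return n_bases // 3 - 1


def mature_sequence(translated: str) -> str:
    """Drop the N-terminal initiator Met, if present."""
    return translated[1:] if translated.startswith("M") else translated


n_translated = expected_residues(1314)
n_mature = n_translated - 1
start_of_mature = mature_sequence("MANVVVIGAQ")  # first 10 translated residues
print(n_translated, n_mature, start_of_mature)
```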

Specific function: Plays an important role in the de novo pathway of purine nucleotide biosynthesis. Catalyzes the first committed step in the biosynthesis of AMP from IMP.

COG id: COG0104

COG function: function code F; Adenylosuccinate synthase

Gene ontology:

Cell location: Cytoplasm

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the adenylosuccinate synthetase family

Homologues:

Organism=Homo sapiens, GI22748717, Length=425, Percent_Identity=42.8235294117647, Blast_Score=331, Evalue=7e-91,
Organism=Homo sapiens, GI34577063, Length=426, Percent_Identity=41.5492957746479, Blast_Score=328, Evalue=5e-90,
Organism=Homo sapiens, GI40316944, Length=395, Percent_Identity=41.0126582278481, Blast_Score=288, Evalue=7e-78,
Organism=Escherichia coli, GI1790620, Length=427, Percent_Identity=46.6042154566745, Blast_Score=400, Evalue=1e-113,
Organism=Caenorhabditis elegans, GI25146555, Length=427, Percent_Identity=40.7494145199063, Blast_Score=312, Evalue=2e-85,
Organism=Caenorhabditis elegans, GI25146553, Length=427, Percent_Identity=40.7494145199063, Blast_Score=312, Evalue=2e-85,
Organism=Saccharomyces cerevisiae, GI6324109, Length=434, Percent_Identity=42.1658986175115, Blast_Score=326, Evalue=6e-90,
Organism=Drosophila melanogaster, GI21356033, Length=424, Percent_Identity=41.2735849056604, Blast_Score=331, Evalue=7e-91,
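
The homologue lines above share a comma-separated `key=value` layout, with the GI number given as a bare `GI…` token. A minimal sketch of a parser for that layout follows; the function name and the output dictionary keys are assumptions made for illustration, not an interface of this database.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dictionary with typed values."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key.strip()] = value.strip()
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare token such as GI1790620
    for key in ("Length", "Blast_Score"):
        if key in record:
            record[key] = int(record[key])
    for key in ("Percent_Identity", "Evalue"):
        if key in record:
            record[key] = float(record[key])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1790620, Length=427, "
    "Percent_Identity=46.6042154566745, Blast_Score=400, Evalue=1e-113,"
)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 1))
```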

Paralogues:

None

Copy number: 940 Molecules/Cell In: Growth Phase, Minimal Media (Based on E. coli). [C]

Swissprot (AC and ID): PURA_PROM1 (A2C0W6)

Other databases:

- EMBL:   CP000553
- RefSeq:   YP_001014391.1
- ProteinModelPortal:   A2C0W6
- SMR:   A2C0W6
- STRING:   A2C0W6
- GeneID:   4780184
- GenomeReviews:   CP000553_GR
- KEGG:   pme:NATL1_05641
- eggNOG:   COG0104
- HOGENOM:   HBG658237
- OMA:   DYVVRYQ
- ProtClustDB:   PRK01117
- BioCyc:   PMAR167555:NATL1_05641-MONOMER
- GO:   GO:0005737
- HAMAP:   MF_00011
- InterPro:   IPR018220
- InterPro:   IPR001114
- PANTHER:   PTHR11846
- SMART:   SM00788
- TIGRFAMs:   TIGR00184

Pfam domain/function: PF00709 Adenylsucc_synt

EC number: 6.3.4.4

Molecular weight: Translated: 48139; Mature: 48008

Theoretical pI: Translated: 6.26; Mature: 6.26

Prosite motif: PS01266 ADENYLOSUCCIN_SYN_1; PS00513 ADENYLOSUCCIN_SYN_2

Important sites: ACT_SITE 13-13 ACT_SITE 41-41 BINDING 128-128 BINDING 142-142 BINDING 223-223 BINDING 238-238 BINDING 302-302 BINDING 304-304
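
The site list above can be split into (type, start, end) entries and checked against the protein sequence. This is a minimal sketch, assuming the positions are 1-based and refer to the translated (437-residue) sequence; the helper name is illustrative. Only the first 60 residues are included, so only the two active sites are looked up.

```python
import re

SITES = (
    "ACT_SITE 13-13 ACT_SITE 41-41 BINDING 128-128 BINDING 142-142 "
    "BINDING 223-223 BINDING 238-238 BINDING 302-302 BINDING 304-304"
)
# First 60 residues of the translated sequence in this record.
N_TERMINUS = "MANVVVIGAQWGDEGKGKITDLLSRSADVVVRYQGGVNAGHTIVVEDKVLKLHLIPSGIL"


def parse_sites(text: str) -> list:
    """Return (type, start, end) tuples with integer, 1-based positions."""
    pattern = r"([A-Z_]+)\s+(\d+)-(\d+)"
    return [(kind, int(a), int(b)) for kind, a, b in re.findall(pattern, text)]


sites = parse_sites(SITES)
active = {
    start: N_TERMINUS[start - 1:end]
    for kind, start, end in sites
    if kind == "ACT_SITE" and end <= len(N_TERMINUS)
}
print(active)
```

Under the numbering assumption, the two active-site positions fall on an aspartate (13) and a histidine (41).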

Signals:

None

Transmembrane regions:

None

Cys/Met content:

2.3 %Cys     (Translated Protein)
1.8 %Met     (Translated Protein)
4.1 %Cys+Met (Translated Protein)
2.3 %Cys     (Mature Protein)
1.6 %Met     (Mature Protein)
3.9 %Cys+Met (Mature Protein)
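
The percentages above are residue counts divided by sequence length. A minimal sketch of the calculation, with an illustrative function name and rounding to one decimal place assumed; the example uses a short toy sequence rather than the protein in this record.

```python
def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions in sequence occupied by any of the given residues."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    hits = sum(seq.count(r) for r in residues.upper())
    return round(100 * hits / len(seq), 1)


toy = "MCAAAAAAAA"  # toy 10-residue sequence with one Met and one Cys
cys = residue_percent(toy, "C")
met = residue_percent(toy, "M")
both = residue_percent(toy, "CM")
print(cys, met, both)
```

Applying `residue_percent` to the translated and mature sequences in this record should reproduce the tabulated values if the table uses the same rounding. Removing the initiator Met lowers the Met percentage of the mature protein while leaving the Cys count unchanged.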

Secondary structure:

>Translated Secondary Structure
MANVVVIGAQWGDEGKGKITDLLSRSADVVVRYQGGVNAGHTIVVEDKVLKLHLIPSGIL
CCCEEEEEECCCCCCCCHHHHHHCCCCCEEEEECCCCCCCCEEEEECCEEEEEECCCCCC
YPDTICLIGSGTVVDPKVMIKEIKMLEDNDIDISGLKLASTAHVTMPYHRLLDLAMEQKR
CCCCEEEEECCCEECHHHHHHHHHHHCCCCCCCCCEEEECEEEEECCHHHHHHHHHHHHC
GDQKIGTTGRGIGPTYADKSQRNGIRIIDLMSREKLQERLQVPLSEKNGLLQKIYGIEPL
CCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHHCCCCCCCCCHHHHHHCCCHH
IIDEIIEEYLDYGKQLKKHIVDCNRTIHQAARKKKNILFEGAQGTLLDLDHGTYPYVTSS
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEECCCCCEEECCCCCCCEEECC
NPVSGGACIGAGVGPTLIDRVIGVAKAYTTRVGEGPFPTELHGSINDQLCDRGGEFGTTT
CCCCCCCEEECCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHCCCCCCCCC
GRRRRCGWFDGVIGKYAVEVNGLDCLAITKLDVLDELEEIDICVAYELNGKRIDYFPTSV
CCCCCCCHHHCCCCEEEEEECCCEEEEEHHHHHHHHHHHCCEEEEEEECCCEEEECCCCH
EDFEKCNPIFKKLPGWRCSTENCRRLEDLPPAAMSYLRFLAELMEVPIAIVSLGANRDQT
HHHHHCCHHHHHCCCCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCCE
IVIEDPIHGPKRALLNS
EEEECCCCCHHHHHCCC
>Mature Secondary Structure 
ANVVVIGAQWGDEGKGKITDLLSRSADVVVRYQGGVNAGHTIVVEDKVLKLHLIPSGIL
CCEEEEEECCCCCCCCHHHHHHCCCCCEEEEECCCCCCCCEEEEECCEEEEEECCCCCC
YPDTICLIGSGTVVDPKVMIKEIKMLEDNDIDISGLKLASTAHVTMPYHRLLDLAMEQKR
CCCCEEEEECCCEECHHHHHHHHHHHCCCCCCCCCEEEECEEEEECCHHHHHHHHHHHHC
GDQKIGTTGRGIGPTYADKSQRNGIRIIDLMSREKLQERLQVPLSEKNGLLQKIYGIEPL
CCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHHCCCCCCCCCHHHHHHCCCHH
IIDEIIEEYLDYGKQLKKHIVDCNRTIHQAARKKKNILFEGAQGTLLDLDHGTYPYVTSS
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEECCCCCEEECCCCCCCEEECC
NPVSGGACIGAGVGPTLIDRVIGVAKAYTTRVGEGPFPTELHGSINDQLCDRGGEFGTTT
CCCCCCCEEECCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHCCCCCCCCC
GRRRRCGWFDGVIGKYAVEVNGLDCLAITKLDVLDELEEIDICVAYELNGKRIDYFPTSV
CCCCCCCHHHCCCCEEEEEECCCEEEEEHHHHHHHHHHHCCEEEEEEECCCEEEECCCCH
EDFEKCNPIFKKLPGWRCSTENCRRLEDLPPAAMSYLRFLAELMEVPIAIVSLGANRDQT
HHHHHCCHHHHHCCCCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCCE
IVIEDPIHGPKRALLNS
EEEECCCCCHHHHHCCC
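
The prediction strings above can be summarised as the fraction of residues in each state. A minimal sketch follows, assuming the common convention that H is helix, E is strand and C is coil (this record does not define the letters); the function name is illustrative. The example uses only the final 17-residue row of the prediction.

```python
from collections import Counter


def state_percentages(states: str) -> dict:
    """Percentage of positions in each predicted state (H, E, C)."""
    states = "".join(states.split()).upper()
    if not states:
        raise ValueError("empty prediction string")
    counts = Counter(states)
    return {s: round(100 * counts.get(s, 0) / len(states), 1) for s in "HEC"}


last_row = state_percentages("EEEECCCCCHHHHHCCC")
print(last_row)
```

Concatenating all prediction rows before calling `state_percentages` would give the composition for the whole protein.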

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA