Definition Tropheryma whipplei TW08/27, complete genome.
Accession NC_004551
Length 925,938

The map label for this gene is purA

Identifier: 28572942

GI number: 28572942

Start: 908787

End: 910076

Strand: Direct

Name: purA

Synonym: TW801

Alternate gene names: 28572942

Gene position: 908787-910076 (Clockwise)
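
The reported sequence length follows from the coordinates if they are read as inclusive, 1-based positions (an assumption that matches the 1290 bases listed below). A minimal sketch, with `gene_length` as a hypothetical helper name:

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with inclusive, 1-based coordinates (assumed)."""
    return end - start + 1


length = gene_length(908787, 910076)
codons = length // 3  # this count includes the terminal stop codon
print(length, codons)  # 1290 430
```

430 codons correspond to the 429 translated residues plus one stop codon.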

Preceding gene: 28572941

Following gene: 28572943

Centisome position: 98.15
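
The centisome value appears to be the gene start expressed as a percentage of the genome length. The sketch below assumes that definition (it is not stated in the record); `centisome` is a hypothetical helper name.

```python
def centisome(start: int, genome_length: int) -> float:
    """Position of a gene start as a percentage of the genome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


# Start and genome length are taken from this record.
print(centisome(908787, 925938))  # 98.15
```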

GC content: 46.67
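
GC content is conventionally the percentage of G and C bases in the sequence. A minimal sketch of that calculation, assuming plain A/C/G/T input; `gc_content` is a hypothetical helper name, and the toy sequence is illustrative only.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


print(gc_content("GGCCAATT"))  # 50.0
```

For a 1290-base gene, the reported 46.67 is what 602 G or C bases would give (602 / 1290 = 46.67 %); the count itself was not recomputed here.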

Gene sequence:

>1290_bases
TTGCCAGCAACAATTCTTATTGGTGCCCAGTGGGGGGATGAAGGCAAGGGAAAGGCCACAGACCTTCTTGCAAAAGATAT
AGATTATGTCGTCAAATTCAACGGCGGTAATAATGCGGGTCACACCGTAGTTATAGGCGGTGATAAGTACGTTCTGCACC
TTCTGCCTTCCGGGATACTCAATGAGAATGTCGTGCCGGTCATTGCAAATGGTGTTGTAATCAATCCGGAGGTGCTTTTT
GATGAAATTGCTACTCTAAATTCACGAGGCGTAAACACGGACAAGCTTGTTATAAGCGCCAACGCTCATATTATTGCGCC
CTTTCACAGAACTATAGATCTTGTGACGGAACGTTTCTTGGGCAAAAGACAGTTGGGAACAACAGGAAGAGGAATAGGCC
CCACATATGCAGACAAAATAAACCGCATTGGTATACGGGTACAAGATTTATTTGATAAAAGCGTACTGCGACAAAAAATT
GAGGGGTCTTTGAGTAATAAAAATCATATGCTCGTAAAAGTTTTTAACCGCAGATCTGTGTCTGTAACGGAAATGCTGGA
TTATTTACTTAGTTTTGCTGAGCGTATGAGGCCGATGATCGCGGACACATCACTTTTGCTAAATAATGCTTTGGATTGCG
GAAAACATGTCCTATTCGAGGGGGGTCAGGCGACTATGCTTGATGTCGATCATGGAAGTTACCCATTTGTCACTTCGTCC
AATGCAACGGTCGGAGGGGCAATAACAGGGGCAGGGATTGGCCCAACTCGTGTGAACAAAGTTATCGGAGTGGCCAAATC
GTACACAACTAGGGTAGGCGCAGGGCCATTTCCAACTGAATTGCATGACGAATATGGGGAGTGGCTACAAAAAAGAGGTT
ATGAGGTAGGAGCAACTACAGGACGCAAGAGGCGGTGTGGGTGGTTTGACGGTGTGGTCGCCCGCTATGCGACTCGTATA
AACGGGATAACGGACTACGTTCTGACAAAACTCGATGTTTTGACGGGCCTTGACAGAATACCTATCTGCGTCGGGTATAA
AGTTGGAGACTCTGTCTTTCGGGAAATGCCTGTCTCGCAGAGTGATTTTCACCACGCAGTTCCCATATACGAGGATCTGC
CCGGATGGCAGTGCAATATTTCCGAATGCGAAAGTTTTGACAGTCTTCCTCCGGAAGCAAGGGGCTATGTCTTGGCCCTG
GAGGATCTTATAAAGGCACGAATATCAGTCATTGGAACAGGTCCAGAGCGGGAAAATATAATCATTAGGCACCCGCTGGG
CATCTTCTAG
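
The gene starts with TTG rather than ATG, yet the protein below begins with M: in bacteria an alternative start codon is read as the initiator methionine. The sketch below illustrates this with the standard codon table; it is not the pipeline that produced this record. `translate_cds` and `START_CODONS` are hypothetical names, and the set of start codons is a simplifying assumption.

```python
BASES = "TCAG"
# Standard genetic code in TCAG order, one letter per codon ('*' = stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: only these common bacterial start codons are treated as initiators.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading a start codon as Met and stopping at the first stop."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"  # initiator codon is translated as methionine
    return "".join(residues).split("*", 1)[0]


# First 33 bases of the gene sequence above.
print(translate_cds("TTGCCAGCAACAATTCTTATTGGTGCCCAGTGG"))  # MPATILIGAQW
```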

Upstream 100 bases:

>100_bases
GGCACTGGGGGCACCAAAAATAGCTCAGGTGCTGGGTGAAAATGCTTGCCTGCAAAACTATGGCAAGCTTTCGGTTGCTC
AAGCCTGAGGAGGAATAGAT

Downstream 100 bases:

>100_bases
ATAAGCGCATCGAGAGTGTTAGGATCTATCCAGGGCAATACCGCCGATGTTCTGCCCAGGATAGTGTGCGTGGGACAGGT
GGCCTTAGCTGCTCGGCGGG

Product: adenylosuccinate synthetase

Products: NA

Alternate protein names: AMPSase; AdSS; IMP--aspartate ligase

Number of amino acids: Translated: 429; Mature: 428

Protein sequence:

>429_residues
MPATILIGAQWGDEGKGKATDLLAKDIDYVVKFNGGNNAGHTVVIGGDKYVLHLLPSGILNENVVPVIANGVVINPEVLF
DEIATLNSRGVNTDKLVISANAHIIAPFHRTIDLVTERFLGKRQLGTTGRGIGPTYADKINRIGIRVQDLFDKSVLRQKI
EGSLSNKNHMLVKVFNRRSVSVTEMLDYLLSFAERMRPMIADTSLLLNNALDCGKHVLFEGGQATMLDVDHGSYPFVTSS
NATVGGAITGAGIGPTRVNKVIGVAKSYTTRVGAGPFPTELHDEYGEWLQKRGYEVGATTGRKRRCGWFDGVVARYATRI
NGITDYVLTKLDVLTGLDRIPICVGYKVGDSVFREMPVSQSDFHHAVPIYEDLPGWQCNISECESFDSLPPEARGYVLAL
EDLIKARISVIGTGPERENIIIRHPLGIF

Sequences:

>Mature_428_residues
PATILIGAQWGDEGKGKATDLLAKDIDYVVKFNGGNNAGHTVVIGGDKYVLHLLPSGILNENVVPVIANGVVINPEVLFD
EIATLNSRGVNTDKLVISANAHIIAPFHRTIDLVTERFLGKRQLGTTGRGIGPTYADKINRIGIRVQDLFDKSVLRQKIE
GSLSNKNHMLVKVFNRRSVSVTEMLDYLLSFAERMRPMIADTSLLLNNALDCGKHVLFEGGQATMLDVDHGSYPFVTSSN
ATVGGAITGAGIGPTRVNKVIGVAKSYTTRVGAGPFPTELHDEYGEWLQKRGYEVGATTGRKRRCGWFDGVVARYATRIN
GITDYVLTKLDVLTGLDRIPICVGYKVGDSVFREMPVSQSDFHHAVPIYEDLPGWQCNISECESFDSLPPEARGYVLALE
DLIKARISVIGTGPERENIIIRHPLGIF

Specific function: Plays an important role in the de novo pathway of purine nucleotide biosynthesis. Catalyzes the first committed step in the biosynthesis of AMP from IMP.

COG id: COG0104

COG function: function code F; Adenylosuccinate synthase

Gene ontology:

Cell location: Cytoplasm

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the adenylosuccinate synthetase family

Homologues:

Organism=Homo sapiens, GI22748717, Length=427, Percent_Identity=43.7939110070258, Blast_Score=345, Evalue=4e-95,
Organism=Homo sapiens, GI34577063, Length=429, Percent_Identity=44.5221445221445, Blast_Score=341, Evalue=9e-94,
Organism=Homo sapiens, GI40316944, Length=396, Percent_Identity=42.6767676767677, Blast_Score=305, Evalue=6e-83,
Organism=Escherichia coli, GI1790620, Length=427, Percent_Identity=46.6042154566745, Blast_Score=400, Evalue=1e-112,
Organism=Caenorhabditis elegans, GI25146553, Length=434, Percent_Identity=44.0092165898618, Blast_Score=328, Evalue=4e-90,
Organism=Caenorhabditis elegans, GI25146555, Length=434, Percent_Identity=44.0092165898618, Blast_Score=328, Evalue=5e-90,
Organism=Saccharomyces cerevisiae, GI6324109, Length=425, Percent_Identity=40.2352941176471, Blast_Score=330, Evalue=2e-91,
Organism=Drosophila melanogaster, GI21356033, Length=424, Percent_Identity=44.1037735849057, Blast_Score=353, Evalue=1e-97,
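
Each homologue line is a comma-separated list of `key=value` fields, with the GI number given as a bare `GI…` token. A minimal sketch of a parser for that layout, assuming organism names contain no commas; `parse_homologue` is a hypothetical helper name.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare GI token, kept as a string
    # Convert the numeric fields where present.
    casts = (("Length", int), ("Percent_Identity", float),
             ("Blast_Score", int), ("Evalue", float))
    for key, cast in casts:
        if key in record:
            record[key] = cast(record[key])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1790620, Length=427, "
    "Percent_Identity=46.6042154566745, Blast_Score=400, Evalue=1e-112,"
)
print(hit["Organism"], hit["GI"], hit["Blast_Score"])  # Escherichia coli 1790620 400
```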

Paralogues:

None

Copy number: 940 molecules/cell in growth phase, minimal media (based on E. coli). [C]

Swissprot (AC and ID): PURA_TROW8 (Q83H67)

Other databases:

- EMBL:   BX251412
- RefSeq:   NP_789722.1
- ProteinModelPortal:   Q83H67
- SMR:   Q83H67
- STRING:   Q83H67
- GeneID:   1065001
- GenomeReviews:   BX072543_GR
- KEGG:   tws:TW801
- eggNOG:   COG0104
- HOGENOM:   HBG658237
- OMA:   DYVVRYQ
- PhylomeDB:   Q83H67
- ProtClustDB:   PRK01117
- BioCyc:   TWHI218496:TW0767-MONOMER
- GO:   GO:0005737
- HAMAP:   MF_00011
- InterPro:   IPR018220
- InterPro:   IPR001114
- PANTHER:   PTHR11846
- SMART:   SM00788
- TIGRFAMs:   TIGR00184

Pfam domain/function: PF00709 Adenylsucc_synt

EC number: 6.3.4.4

Molecular weight: Translated: 46844; Mature: 46713
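
The translated and mature forms differ only by the initiator methionine, so their molecular weights should differ by roughly the mass of one Met residue. A quick check, assuming an approximate average Met residue mass of 131.19 Da (a textbook value, not taken from this record):

```python
MET_RESIDUE_MASS = 131.19  # approximate average mass of a Met residue, in Da (assumed)

translated_mw = 46844  # from this record
mature_mw = 46713      # from this record

difference = translated_mw - mature_mw
print(difference)  # 131
```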

Theoretical pI: Translated: 7.34; Mature: 7.34

Prosite motif: PS01266 ADENYLOSUCCIN_SYN_1; PS00513 ADENYLOSUCCIN_SYN_2

Important sites: ACT_SITE 13-13 ACT_SITE 41-41 BINDING 128-128 BINDING 142-142 BINDING 223-223 BINDING 238-238 BINDING 302-302 BINDING 304-304
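
The site list is a flat run of `TYPE start-end` tokens. The sketch below splits it into tuples and looks up the two active-site residues, assuming the positions are 1-based and refer to the translated (429-residue) sequence; `parse_sites` is a hypothetical helper name.

```python
import re


def parse_sites(text: str) -> list:
    """Split 'ACT_SITE 13-13 BINDING 128-128 ...' into (type, start, end) tuples."""
    return [
        (kind, int(start), int(end))
        for kind, start, end in re.findall(r"([A-Z_]+)\s+(\d+)-(\d+)", text)
    ]


sites = parse_sites(
    "ACT_SITE 13-13 ACT_SITE 41-41 BINDING 128-128 BINDING 142-142 "
    "BINDING 223-223 BINDING 238-238 BINDING 302-302 BINDING 304-304"
)

# First 41 residues of the translated protein sequence in this record.
prefix = "MPATILIGAQWGDEGKGKATDLLAKDIDYVVKFNGGNNAGH"
for kind, start, end in sites[:2]:
    print(kind, start, prefix[start - 1])
# ACT_SITE 13 D
# ACT_SITE 41 H
```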

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.2 %Cys     (Translated Protein)
1.6 %Met     (Translated Protein)
2.8 %Cys+Met (Translated Protein)
1.2 %Cys     (Mature Protein)
1.4 %Met     (Mature Protein)
2.6 %Cys+Met (Mature Protein)
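
These percentages are residue counts divided by sequence length. A minimal sketch, with `residue_percent` as a hypothetical helper name and a toy sequence for illustration; the counts used in the arithmetic check (7 Met and 5 Cys in the translated protein, which is what the listed sequence appears to contain) are consistent with the values above.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of the given residues."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(1 for aa in protein if aa in residues)
    return 100.0 * hits / len(protein)


print(residue_percent("MCMA", "M"))   # 50.0
print(residue_percent("MCMA", "CM"))  # 75.0

# Arithmetic check against the reported values.
print(round(100.0 * 7 / 429, 1))  # 1.6  (Met, translated)
print(round(100.0 * 6 / 428, 1))  # 1.4  (Met, mature: initiator Met removed)
```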

Secondary structure:

>Translated Secondary Structure
MPATILIGAQWGDEGKGKATDLLAKDIDYVVKFNGGNNAGHTVVIGGDKYVLHLLPSGIL
CCCEEEEECCCCCCCCCHHHHHHHHCCEEEEEECCCCCCCCEEEECCCEEEEEECCCCCC
NENVVPVIANGVVINPEVLFDEIATLNSRGVNTDKLVISANAHIIAPFHRTIDLVTERFL
CCCCCEEEECCEEECHHHHHHHHHHHCCCCCCCCEEEEECCCEEEEHHHHHHHHHHHHHH
GKRQLGTTGRGIGPTYADKINRIGIRVQDLFDKSVLRQKIEGSLSNKNHMLVKVFNRRSV
CCHHCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCC
SVTEMLDYLLSFAERMRPMIADTSLLLNNALDCGKHVLFEGGQATMLDVDHGSYPFVTSS
CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHEEECCCEEEEECCCCCCCEEECC
NATVGGAITGAGIGPTRVNKVIGVAKSYTTRVGAGPFPTELHDEYGEWLQKRGYEVGATT
CCEECCCEECCCCCHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCEECCCC
GRKRRCGWFDGVVARYATRINGITDYVLTKLDVLTGLDRIPICVGYKVGDSVFREMPVSQ
CCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCEEEEEEHHHHHHHHCCCCC
SDFHHAVPIYEDLPGWQCNISECESFDSLPPEARGYVLALEDLIKARISVIGTGPERENI
CCCCCCCCHHHCCCCCEECHHHHHHCCCCCCCCCCEEEEHHHHHHHHHEEEECCCCCCCE
IIRHPLGIF
EEECCCCCC
>Mature Secondary Structure 
PATILIGAQWGDEGKGKATDLLAKDIDYVVKFNGGNNAGHTVVIGGDKYVLHLLPSGIL
CCEEEEECCCCCCCCCHHHHHHHHCCEEEEEECCCCCCCCEEEECCCEEEEEECCCCCC
NENVVPVIANGVVINPEVLFDEIATLNSRGVNTDKLVISANAHIIAPFHRTIDLVTERFL
CCCCCEEEECCEEECHHHHHHHHHHHCCCCCCCCEEEEECCCEEEEHHHHHHHHHHHHHH
GKRQLGTTGRGIGPTYADKINRIGIRVQDLFDKSVLRQKIEGSLSNKNHMLVKVFNRRSV
CCHHCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCC
SVTEMLDYLLSFAERMRPMIADTSLLLNNALDCGKHVLFEGGQATMLDVDHGSYPFVTSS
CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHEEECCCEEEEECCCCCCCEEECC
NATVGGAITGAGIGPTRVNKVIGVAKSYTTRVGAGPFPTELHDEYGEWLQKRGYEVGATT
CCEECCCEECCCCCHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCEECCCC
GRKRRCGWFDGVVARYATRINGITDYVLTKLDVLTGLDRIPICVGYKVGDSVFREMPVSQ
CCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCEEEEEEHHHHHHHHCCCCC
SDFHHAVPIYEDLPGWQCNISECESFDSLPPEARGYVLALEDLIKARISVIGTGPERENI
CCCCCCCCHHHCCCCCEECHHHHHHCCCCCCCCCCEEEEHHHHHHHHHEEEECCCCCCCE
IIRHPLGIF
EEECCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 12606174