Definition Helicobacter pylori Shi470, complete genome.
Accession NC_010698
Length 1,608,548

The map label for this gene is purA [H]

Identifier: 188527062

GI number: 188527062

Start: 249430

End: 250665

Strand: Direct

Name: purA [H]

Synonym: HPSH_01325

Alternate gene names: 188527062

Gene position: 249430-250665 (Clockwise)

Preceding gene: 188527061

Following gene: 188527063

Centisome position: 15.51
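
The coordinate fields above are consistent with each other. As a minimal sketch, assuming the gene length is End - Start + 1 (inclusive, 1-based coordinates) and the centisome position is the start coordinate expressed as a percentage of the genome length (the record does not state either formula), the arithmetic can be checked as follows. The function names are illustrative only.

```python
# Values copied from the record; the formulas are assumptions, not stated in it.
GENOME_LENGTH = 1_608_548
START, END = 249_430, 250_665


def gene_length(start: int, end: int) -> int:
    """Length of a gene given inclusive, 1-based start and end coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the genome length, to 2 decimals."""
    return round(100.0 * start / genome_length, 2)


length = gene_length(START, END)
print(length)                            # 1236, matching ">1236_bases"
print(length // 3 - 1)                   # 411 codons once the stop codon is excluded
print(centisome(START, GENOME_LENGTH))   # 15.51
```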

GC content: 44.5

Gene sequence:

>1236_bases
ATGGCAGATGTCGTTGTGGGGATCCAGTGGGGAGATGAGGGGAAAGGAAAAATTGTTGATAGGATCGCTAAAGATTATGA
CTTTGTGGTGCGCTATCAAGGCGGGCATAATGCCGGGCATACCATTGTGCATAAGGGGGTTAAGCATTCTTTGCATTTAA
TGCCCTCAGGGGTTTTATACCCACAATGCAAGAACATCATTTCTAGCGCGGTGGTCGTGAGCATTAAGGATTTGTGCGAA
GAAATCAGCGCGTTTGAGGATTTAGAAAATCGTTTGTTCATCAGCGACAGAGCCCATGTGATCTTGCCTTATCATGCCAA
AAAAGACGCTTTTAAAGAAAAATCTCAAAACATCGGCACGACTAAAAAAGGCATAGGCCCTTGCTATGAGGATAAAATGG
CAAGGAGCGGGATAAGAATGGGGGATTTATTAGACGATAAGATCTTAGAAGAAAGGCTAAACGCTCATTTCAAAGCCATT
GAGCCTTTTAAGGAAGCGTATGATTTGGGCGAGAATTACGAAAAAGATTTGAGAGAGTATTTTAAAACTCATGCTCCAAA
AATCTGCCCCTTTATCAAAGACACGACAAGCATGCTGATAGAAGCGAACCAAAAGGGTGAAAAAATCCTACTAGAAGGGG
CGCAAGGCACGCTTTTAGACATTGATTTAGGGACTTACCCTTTTGTAACAAGCTCTAACACCACGAGCGCTAGCGCATGC
GTGAGCACCGGCTTAAACCCTAAAGCGATCAATGAAGTGATAGGCATCACGAAAGCCTACTCCACTCGTGTGGGTAATGG
GCCTTTCCCTAGCGAAGACACTACACCCATGGGCGATCATTTAAGGACTAAGGGCGCGGAGTTTGGCACGACAACCAAGC
GCCCACGGCGTTGCGGGTGGCTGGATTTGGTGGCTTTAAAATACGCTTGCACTTTGAATGGTTGCACGCAATTAGCTTTA
ATGAAATTAGATGTTTTAGACGGGATTGATGCGATTAAGGTGTGCGTGGCTTATGAAAGAAAGGGCGAAAGATTAGAGGC
TTTCCCTAGCGATCTGAAAGATTGCGCACCGATCTATCAAACTTTTAAAGGCTGGGAAAAAAGCGCGGGCGTGAGAAAAT
TAGACGATTTAGAGCCAAACGCTAGAGAGTATATCCGCTTTATTGAAAAAGAAGTGGGGGTAAAAATCCGCCTTATTTCT
ACAAGCCCTGAAAGAGAAGACACGATTTTTCTATGA
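
The GC content field can in principle be recomputed from the sequence block above. The following is a minimal sketch, assuming GC content means the percentage of G and C bases among all bases, rounded to one decimal; the helper names fasta_body and gc_content are hypothetical and not part of the record.

```python
def fasta_body(text: str) -> str:
    """Join the sequence lines of a FASTA-style block, skipping '>' header lines."""
    lines = (line.strip() for line in text.splitlines())
    return "".join(line for line in lines if line and not line.startswith(">")).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in seq, rounded to one decimal place."""
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 1)


# Short demonstration on the first 12 bases of the gene only.
block = ">demo\nATGGCA\nGATGTC\n"
print(gc_content(fasta_body(block)))  # 50.0
```

Passing the whole ">1236_bases" block through fasta_body and gc_content should give a value comparable to the GC content field, provided the record uses the same definition.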

Upstream 100 bases:

>100_bases
AAGCTCTTTTTAGGGCTTATAAAGAGGCTTTTTACTTTTTTTTGGTATTCTAACAAGCTTTTAAACAATCCAATCTACTT
TGTTTTAAGGATAATATTTT

Downstream 100 bases:

>100_bases
AAAAATTCGCTTCTGTATTGGTGCAATTAAAAACCCTTGCGTTAGAAAAAATAGAACAAAAGCTTGAAAGCAAGCGTTTA
GAATGGCAGCAAAATGAGCG

Product: adenylosuccinate synthetase

Products: NA

Alternate protein names: AMPSase; AdSS; IMP--aspartate ligase [H]

Number of amino acids: Translated: 411; Mature: 410

Protein sequence:

>411_residues
MADVVVGIQWGDEGKGKIVDRIAKDYDFVVRYQGGHNAGHTIVHKGVKHSLHLMPSGVLYPQCKNIISSAVVVSIKDLCE
EISAFEDLENRLFISDRAHVILPYHAKKDAFKEKSQNIGTTKKGIGPCYEDKMARSGIRMGDLLDDKILEERLNAHFKAI
EPFKEAYDLGENYEKDLREYFKTHAPKICPFIKDTTSMLIEANQKGEKILLEGAQGTLLDIDLGTYPFVTSSNTTSASAC
VSTGLNPKAINEVIGITKAYSTRVGNGPFPSEDTTPMGDHLRTKGAEFGTTTKRPRRCGWLDLVALKYACTLNGCTQLAL
MKLDVLDGIDAIKVCVAYERKGERLEAFPSDLKDCAPIYQTFKGWEKSAGVRKLDDLEPNAREYIRFIEKEVGVKIRLIS
TSPEREDTIFL

Sequences:

>Translated_411_residues
MADVVVGIQWGDEGKGKIVDRIAKDYDFVVRYQGGHNAGHTIVHKGVKHSLHLMPSGVLYPQCKNIISSAVVVSIKDLCE
EISAFEDLENRLFISDRAHVILPYHAKKDAFKEKSQNIGTTKKGIGPCYEDKMARSGIRMGDLLDDKILEERLNAHFKAI
EPFKEAYDLGENYEKDLREYFKTHAPKICPFIKDTTSMLIEANQKGEKILLEGAQGTLLDIDLGTYPFVTSSNTTSASAC
VSTGLNPKAINEVIGITKAYSTRVGNGPFPSEDTTPMGDHLRTKGAEFGTTTKRPRRCGWLDLVALKYACTLNGCTQLAL
MKLDVLDGIDAIKVCVAYERKGERLEAFPSDLKDCAPIYQTFKGWEKSAGVRKLDDLEPNAREYIRFIEKEVGVKIRLIS
TSPEREDTIFL
>Mature_410_residues
ADVVVGIQWGDEGKGKIVDRIAKDYDFVVRYQGGHNAGHTIVHKGVKHSLHLMPSGVLYPQCKNIISSAVVVSIKDLCEE
ISAFEDLENRLFISDRAHVILPYHAKKDAFKEKSQNIGTTKKGIGPCYEDKMARSGIRMGDLLDDKILEERLNAHFKAIE
PFKEAYDLGENYEKDLREYFKTHAPKICPFIKDTTSMLIEANQKGEKILLEGAQGTLLDIDLGTYPFVTSSNTTSASACV
STGLNPKAINEVIGITKAYSTRVGNGPFPSEDTTPMGDHLRTKGAEFGTTTKRPRRCGWLDLVALKYACTLNGCTQLALM
KLDVLDGIDAIKVCVAYERKGERLEAFPSDLKDCAPIYQTFKGWEKSAGVRKLDDLEPNAREYIRFIEKEVGVKIRLIST
SPEREDTIFL
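
The translated and mature sequences above can be related to the gene sequence with a short sketch. It assumes the standard codon assignments (which bacterial translation table 11 shares with the standard table) and assumes that the mature form differs from the translated form only by removal of the initiator methionine, as the 411 versus 410 residue counts suggest. The function names are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def mature(protein: str) -> str:
    """Drop a leading Met (assumed to be the only processing step)."""
    return protein[1:] if protein.startswith("M") else protein


print(translate("ATGGCAGATGTCGTTGTG"))          # MADVVV (first 18 bases of the gene)
print(translate("ACGATTTTTCTATGA"))             # TIFL   (last 15 bases, ending in TGA)
print(mature(translate("ATGGCAGATGTCGTTGTG")))  # ADVVV
```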

Specific function: Plays an important role in the de novo pathway of purine nucleotide biosynthesis. Catalyzes the first committed step in the biosynthesis of AMP from IMP [H]

COG id: COG0104

COG function: function code F; Adenylosuccinate synthase

Gene ontology:

Cell location: Cytoplasm [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the adenylosuccinate synthetase family [H]

Homologues:

Organism=Homo sapiens, GI22748717, Length=426, Percent_Identity=43.6619718309859, Blast_Score=347, Evalue=9e-96
Organism=Homo sapiens, GI34577063, Length=430, Percent_Identity=45.1162790697674, Blast_Score=347, Evalue=1e-95
Organism=Homo sapiens, GI40316944, Length=397, Percent_Identity=42.0654911838791, Blast_Score=306, Evalue=3e-83
Organism=Escherichia coli, GI1790620, Length=426, Percent_Identity=46.0093896713615, Blast_Score=350, Evalue=8e-98
Organism=Caenorhabditis elegans, GI25146553, Length=424, Percent_Identity=42.2169811320755, Blast_Score=330, Evalue=6e-91
Organism=Caenorhabditis elegans, GI25146555, Length=424, Percent_Identity=42.2169811320755, Blast_Score=330, Evalue=7e-91
Organism=Saccharomyces cerevisiae, GI6324109, Length=429, Percent_Identity=42.1911421911422, Blast_Score=355, Evalue=7e-99
Organism=Drosophila melanogaster, GI21356033, Length=423, Percent_Identity=45.1536643026005, Blast_Score=360, Evalue=1e-100
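
Each homologue line above is a comma-separated list of key=value fields plus a bare GI identifier. A minimal parsing sketch follows; parse_homologue is a hypothetical helper. The derived "Identical" count rests on the assumption that Percent_Identity is the number of identical positions divided by Length, which the values appear to fit (for example 196/426) but which the record does not state.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a typed dict."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]          # keep the identifier as a string
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    # Assumption: Percent_Identity = 100 * identical positions / Length.
    record["Identical"] = round(record["Percent_Identity"] * record["Length"] / 100)
    return record


line = ("Organism=Escherichia coli, GI1790620, Length=426, "
        "Percent_Identity=46.0093896713615, Blast_Score=350, Evalue=8e-98,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], hit["Length"], hit["Identical"])
```

The parser tolerates an optional trailing comma, so it works on the lines with or without one.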

Paralogues:

None

Copy number: 940 Molecules/Cell In: Growth Phase, Minimal Media (Based on E. coli). [C]

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR018220
- InterPro:   IPR001114 [H]

Pfam domain/function: PF00709 Adenylsucc_synt [H]

EC number: 6.3.4.4 [H]

Molecular weight: Translated: 45775; Mature: 45644

Theoretical pI: Translated: 7.02; Mature: 7.02

Prosite motif: PS01266 ADENYLOSUCCIN_SYN_1 ; PS00513 ADENYLOSUCCIN_SYN_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

2.4 %Cys     (Translated Protein)
1.7 %Met     (Translated Protein)
4.1 %Cys+Met (Translated Protein)
2.4 %Cys     (Mature Protein)
1.5 %Met     (Mature Protein)
3.9 %Cys+Met (Mature Protein)
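
The percentages above agree with a residue count of 10 Cys and 7 Met in the 411-residue translated sequence, with one Met fewer in the 410-residue mature form. This is a minimal sketch of that check, assuming each value is a residue count divided by the sequence length and rounded to one decimal; the function names are illustrative.

```python
def percent(count: int, total: int) -> float:
    """Share of `count` in `total` as a percentage, one decimal place."""
    return round(100.0 * count / total, 1)


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    hits = sum(protein.count(r) for r in residues)
    return percent(hits, len(protein))


# Counts taken from the translated sequence (10 Cys, 7 Met in 411 residues).
print(percent(10, 411), percent(7, 411), percent(17, 411))   # 2.4 1.7 4.1
# Mature form: initiator Met removed (10 Cys, 6 Met in 410 residues).
print(percent(10, 410), percent(6, 410), percent(16, 410))   # 2.4 1.5 3.9

# The same calculation applied directly to a short toy sequence.
print(residue_percent("MACM", "CM"))                          # 75.0
```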

Secondary structure:

>Translated Secondary Structure
MADVVVGIQWGDEGKGKIVDRIAKDYDFVVRYQGGHNAGHTIVHKGVKHSLHLMPSGVLY
CCCEEEEEEECCCCCCHHHHHHHCCCEEEEEECCCCCCCHHHHHHHHHHHHHCCCCCCCC
PQCKNIISSAVVVSIKDLCEEISAFEDLENRLFISDRAHVILPYHAKKDAFKEKSQNIGT
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCEEEECCCEEEEEECCCHHHHHHHHHCCCC
TKKGIGPCYEDKMARSGIRMGDLLDDKILEERLNAHFKAIEPFKEAYDLGENYEKDLREY
CCCCCCCCHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHH
FKTHAPKICPFIKDTTSMLIEANQKGEKILLEGAQGTLLDIDLGTYPFVTSSNTTSASAC
HHHCCCCCCCHHHHHHHHHEECCCCCCEEEEECCCCCEEEEECCCCCEEECCCCCCHHHH
VSTGLNPKAINEVIGITKAYSTRVGNGPFPSEDTTPMGDHLRTKGAEFGTTTKRPRRCGW
HHCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCH
LDLVALKYACTLNGCTQLALMKLDVLDGIDAIKVCVAYERKGERLEAFPSDLKDCAPIYQ
HHHHHHHHHHHHCCHHHHHHHHHHHHCCHHHHHHHHHHHHCCCHHHHCHHHHHHHHHHHH
TFKGWEKSAGVRKLDDLEPNAREYIRFIEKEVGVKIRLISTSPEREDTIFL
HHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHCEEEEEEECCCCCCCCCCC
>Mature Secondary Structure 
ADVVVGIQWGDEGKGKIVDRIAKDYDFVVRYQGGHNAGHTIVHKGVKHSLHLMPSGVLY
CCEEEEEEECCCCCCHHHHHHHCCCEEEEEECCCCCCCHHHHHHHHHHHHHCCCCCCCC
PQCKNIISSAVVVSIKDLCEEISAFEDLENRLFISDRAHVILPYHAKKDAFKEKSQNIGT
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCEEEECCCEEEEEECCCHHHHHHHHHCCCC
TKKGIGPCYEDKMARSGIRMGDLLDDKILEERLNAHFKAIEPFKEAYDLGENYEKDLREY
CCCCCCCCHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHH
FKTHAPKICPFIKDTTSMLIEANQKGEKILLEGAQGTLLDIDLGTYPFVTSSNTTSASAC
HHHCCCCCCCHHHHHHHHHEECCCCCCEEEEECCCCCEEEEECCCCCEEECCCCCCHHHH
VSTGLNPKAINEVIGITKAYSTRVGNGPFPSEDTTPMGDHLRTKGAEFGTTTKRPRRCGW
HHCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCH
LDLVALKYACTLNGCTQLALMKLDVLDGIDAIKVCVAYERKGERLEAFPSDLKDCAPIYQ
HHHHHHHHHHHHCCHHHHHHHHHHHHCCHHHHHHHHHHHHCCCHHHHCHHHHHHHHHHHH
TFKGWEKSAGVRKLDDLEPNAREYIRFIEKEVGVKIRLISTSPEREDTIFL
HHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHCEEEEEEECCCCCCCCCCC
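
The prediction strings above can be summarized per state. The sketch below assumes the usual three-state convention (H = helix, E = strand, C = coil), which the record does not spell out, and uses hypothetical helper names. It reports the percentage of each state and the 1-based boundaries of each run.

```python
from collections import Counter
from itertools import groupby


def ss_composition(states: str) -> dict:
    """Percentage of H, E and C characters in a prediction string."""
    states = "".join(states.split()).upper()
    if not states:
        raise ValueError("empty prediction string")
    counts = Counter(states)
    return {s: round(100.0 * counts.get(s, 0) / len(states), 1) for s in "HEC"}


def ss_segments(states: str) -> list:
    """Consecutive runs as (state, first_position, last_position), 1-based."""
    segments, pos = [], 1
    for state, run in groupby(states):
        n = len(list(run))
        segments.append((state, pos, pos + n - 1))
        pos += n
    return segments


demo = "CCHHHHEEEC"  # toy string, not taken from the record
print(ss_composition(demo))  # {'H': 40.0, 'E': 30.0, 'C': 30.0}
print(ss_segments(demo))     # [('C', 1, 2), ('H', 3, 6), ('E', 7, 9), ('C', 10, 10)]
```

To apply this to the record, concatenate only the state lines (every second line of each block), not the interleaved residue lines.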

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA