Definition Corynebacterium diphtheriae NCTC 13129 chromosome, complete genome.
Accession NC_002935
Length 2,488,635

Click here to switch to the map view.

The map label for this gene is purB [H]

Identifier: 38234495

GI number: 38234495

Start: 1982334

End: 1983773

Strand: Reverse

Name: purB [H]

Synonym: DIP1928

Alternate gene names: 38234495

Gene position: 1983773-1982334 (Counterclockwise)

Preceding gene: 38234496

Following gene: 38234494

Centisome position: 79.71

GC content: 54.86

Gene sequence:

>1440_bases
ATGAGCTACGTGGCTGAGAAGAAGAAGATCGCAAACGTCCTGTCCAACCGCTACGCCTCTGCTGAGCTCACCGAACTGTG
GAGCCCAGAGGCAAAGATCCTCCTAGAACGCCAGTTGTGGATCGCAGTAATGAAGGCCCAGAAAAATCTGGGTGTTGATA
TTCCAACAGAAGCAATTTCTGCTTATGAAGCAGTGATCGACCGCATCGATCTTGACAGCATCGCACAGCGTGAGAAGATC
ACCCGCCATGATGTGAAAGCTCGTATCGAGGAGTTCAATGCTCTCGCAGGCTACGAGCATATTCATAAGGGCATGACCTC
CCGCGACCTCACCGAGAACGTGGAACAGCTGCAGATCTACCGTTCGCTTGAGATGATGCGTAACAAGTCGGTGTCTGTTG
CCGCTGCTATCGCACAGCACGCAGCACAGTACCAGTCCTTGGTCATGGCTGGCCGCTCCCACAACGTGGCTGCTCAGGCT
ACCACTTTGGGTAAGCGTTTTGCCTCTGCTGCCGAGGAAATTTTGGTTGCTATTGACCGCGTTGAGGATCTTCTGAGCCG
CTATCCACTGCGCGGTATTAAGGGCCCTATGGGCACCGCCCAAGACATGCTTGATCTCGTTGGTGGCGATGAGTCCAAGC
TTGCCGCATTGGAAACTCAAATCGCAGATCACCTAGGTATTGCCCGCGTGTTCGATTCAGTGGGACAGGTTTACCCCCGC
TCCTTGGACTTCGATGCGGTGTCTGCTCTGGTTCAGTTGGGTGCTGGTCCATCCTCGTTGGCTCACACCATCCGTTTGAT
GGCTGGCAATGAGACGGTGACTGAGGGCTTTAAGGAAGGCCAGGTCGGTTCCTCGGCTATGCCGCACAAGATGAATGCGC
GTTCGTGTGAGCGTGTTGGCGGCCTGCAGGTGATCTTGCGTGGTTACCTCACCATGGTGGCGGATCTTTCTGGTCAGCAG
TGGAATGAGGGCGACGTGTTCTGCTCGGTTATTCGTCGTGTGGCTCTTCCGGATGCGTTCTTCGCGCTGGATGGCATGTA
CGAGACATTCCTCACAGTGTTGGCTGAGTTCGGTGCTTTCCCAGCGATGATTGATCGTGAGCTTGAGCGCTACCTGCCAT
TCCTTGCTACCACTCGTATTCTGATGGCTGCTGTTCGCGTGGGTGTTGGCCGTGAAACTGCCCATGAGGTGATTAAGGAG
AACGCTGTGGCGGTTGCCTTGAACATGCGTGAAAACGGTGGTGAGCAAGACCTCATTGATCGCCTTGCGGCGGATGAGCG
GTTGCCGATGACTCGTGAACAGTTGGATGAGGCGTTGGCTGATCGACATGCCTTTATTGGTGCTGCAGAATCGCAGGTTG
CTCGCGTTGTTGACCGCGTGAATGATCTGGTTCGTCGTTATCCAGCGGCTGCCCAGTACACCCCCGGTGACATTCTCTAA

Upstream 100 bases:

>100_bases
TCGACCCCGTTGACGGACATCGTTTTGTTCGGCTGAGTTTCTGCGGAGAATATGAGGAAATTCTTCAAGCATGTGAGCGT
TTGCGCACGTTCATGCCATA

Downstream 100 bases:

>100_bases
CAACAGTTTTTAAGTACAAGTCATATCGCATCTGTGGGTGTTGCGATATGACTTGTTCTTGTTTCAAGGTCTAGACTGCT
TGATTATGCGTCCTGAACTC

Product: adenylosuccinate lyase

Products: NA

Alternate protein names: ASL; Adenylosuccinase; ASase [H]

Number of amino acids: Translated: 479; Mature: 478

Protein sequence:

>479_residues
MSYVAEKKKIANVLSNRYASAELTELWSPEAKILLERQLWIAVMKAQKNLGVDIPTEAISAYEAVIDRIDLDSIAQREKI
TRHDVKARIEEFNALAGYEHIHKGMTSRDLTENVEQLQIYRSLEMMRNKSVSVAAAIAQHAAQYQSLVMAGRSHNVAAQA
TTLGKRFASAAEEILVAIDRVEDLLSRYPLRGIKGPMGTAQDMLDLVGGDESKLAALETQIADHLGIARVFDSVGQVYPR
SLDFDAVSALVQLGAGPSSLAHTIRLMAGNETVTEGFKEGQVGSSAMPHKMNARSCERVGGLQVILRGYLTMVADLSGQQ
WNEGDVFCSVIRRVALPDAFFALDGMYETFLTVLAEFGAFPAMIDRELERYLPFLATTRILMAAVRVGVGRETAHEVIKE
NAVAVALNMRENGGEQDLIDRLAADERLPMTREQLDEALADRHAFIGAAESQVARVVDRVNDLVRRYPAAAQYTPGDIL

Sequences:

>Translated_479_residues
MSYVAEKKKIANVLSNRYASAELTELWSPEAKILLERQLWIAVMKAQKNLGVDIPTEAISAYEAVIDRIDLDSIAQREKI
TRHDVKARIEEFNALAGYEHIHKGMTSRDLTENVEQLQIYRSLEMMRNKSVSVAAAIAQHAAQYQSLVMAGRSHNVAAQA
TTLGKRFASAAEEILVAIDRVEDLLSRYPLRGIKGPMGTAQDMLDLVGGDESKLAALETQIADHLGIARVFDSVGQVYPR
SLDFDAVSALVQLGAGPSSLAHTIRLMAGNETVTEGFKEGQVGSSAMPHKMNARSCERVGGLQVILRGYLTMVADLSGQQ
WNEGDVFCSVIRRVALPDAFFALDGMYETFLTVLAEFGAFPAMIDRELERYLPFLATTRILMAAVRVGVGRETAHEVIKE
NAVAVALNMRENGGEQDLIDRLAADERLPMTREQLDEALADRHAFIGAAESQVARVVDRVNDLVRRYPAAAQYTPGDIL
>Mature_478_residues
SYVAEKKKIANVLSNRYASAELTELWSPEAKILLERQLWIAVMKAQKNLGVDIPTEAISAYEAVIDRIDLDSIAQREKIT
RHDVKARIEEFNALAGYEHIHKGMTSRDLTENVEQLQIYRSLEMMRNKSVSVAAAIAQHAAQYQSLVMAGRSHNVAAQAT
TLGKRFASAAEEILVAIDRVEDLLSRYPLRGIKGPMGTAQDMLDLVGGDESKLAALETQIADHLGIARVFDSVGQVYPRS
LDFDAVSALVQLGAGPSSLAHTIRLMAGNETVTEGFKEGQVGSSAMPHKMNARSCERVGGLQVILRGYLTMVADLSGQQW
NEGDVFCSVIRRVALPDAFFALDGMYETFLTVLAEFGAFPAMIDRELERYLPFLATTRILMAAVRVGVGRETAHEVIKEN
AVAVALNMRENGGEQDLIDRLAADERLPMTREQLDEALADRHAFIGAAESQVARVVDRVNDLVRRYPAAAQYTPGDIL

Specific function: De novo purine biosynthesis; eighth step. [C]

COG id: COG0015

COG function: function code F; Adenylosuccinate lyase

Gene ontology:

Cell location: Cytoplasm [C]

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the lyase 1 family. Adenylosuccinate lyase subfamily [H]

Homologues:

Organism=Homo sapiens, GI4557269, Length=462, Percent_Identity=32.4675324675325, Blast_Score=233, Evalue=3e-61,
Organism=Homo sapiens, GI183227688, Length=386, Percent_Identity=32.3834196891192, Blast_Score=199, Evalue=4e-51,
Organism=Caenorhabditis elegans, GI17508577, Length=410, Percent_Identity=30.2439024390244, Blast_Score=164, Evalue=9e-41,
Organism=Caenorhabditis elegans, GI32564234, Length=357, Percent_Identity=31.3725490196078, Blast_Score=147, Evalue=1e-35,
Organism=Saccharomyces cerevisiae, GI6323391, Length=463, Percent_Identity=34.7732181425486, Blast_Score=243, Evalue=7e-65,
Organism=Drosophila melanogaster, GI24647570, Length=460, Percent_Identity=31.7391304347826, Blast_Score=215, Evalue=5e-56,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR019468
- InterPro:   IPR003031
- InterPro:   IPR000362
- InterPro:   IPR020557
- InterPro:   IPR008948
- InterPro:   IPR022761
- InterPro:   IPR004769 [H]

Pfam domain/function: PF10397 ADSL_C; PF00206 Lyase_1 [H]

EC number: =4.3.2.2 [H]

Molecular weight: Translated: 52672; Mature: 52541

Theoretical pI: Translated: 5.26; Mature: 5.26

Prosite motif: PS00163 FUMARATE_LYASES

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.4 %Cys     (Translated Protein)
3.5 %Met     (Translated Protein)
4.0 %Cys+Met (Translated Protein)
0.4 %Cys     (Mature Protein)
3.3 %Met     (Mature Protein)
3.8 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MSYVAEKKKIANVLSNRYASAELTELWSPEAKILLERQLWIAVMKAQKNLGVDIPTEAIS
CCCHHHHHHHHHHHHHHCCHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHH
AYEAVIDRIDLDSIAQREKITRHDVKARIEEFNALAGYEHIHKGMTSRDLTENVEQLQIY
HHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH
RSLEMMRNKSVSVAAAIAQHAAQYQSLVMAGRSHNVAAQATTLGKRFASAAEEILVAIDR
HHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH
VEDLLSRYPLRGIKGPMGTAQDMLDLVGGDESKLAALETQIADHLGIARVFDSVGQVYPR
HHHHHHHCCCCCCCCCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
SLDFDAVSALVQLGAGPSSLAHTIRLMAGNETVTEGFKEGQVGSSAMPHKMNARSCERVG
CCCHHHHHHHHHHCCCHHHHHHHHHHHCCCHHHHHCHHCCCCCCCCCCCCCCHHHHHHHH
GLQVILRGYLTMVADLSGQQWNEGDVFCSVIRRVALPDAFFALDGMYETFLTVLAEFGAF
HHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCC
PAMIDRELERYLPFLATTRILMAAVRVGVGRETAHEVIKENAVAVALNMRENGGEQDLID
HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCEEEEEECCCCCCHHHHHH
RLAADERLPMTREQLDEALADRHAFIGAAESQVARVVDRVNDLVRRYPAAAQYTPGDIL
HHHCCCCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC
>Mature Secondary Structure 
SYVAEKKKIANVLSNRYASAELTELWSPEAKILLERQLWIAVMKAQKNLGVDIPTEAIS
CCHHHHHHHHHHHHHHCCHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHH
AYEAVIDRIDLDSIAQREKITRHDVKARIEEFNALAGYEHIHKGMTSRDLTENVEQLQIY
HHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH
RSLEMMRNKSVSVAAAIAQHAAQYQSLVMAGRSHNVAAQATTLGKRFASAAEEILVAIDR
HHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH
VEDLLSRYPLRGIKGPMGTAQDMLDLVGGDESKLAALETQIADHLGIARVFDSVGQVYPR
HHHHHHHCCCCCCCCCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
SLDFDAVSALVQLGAGPSSLAHTIRLMAGNETVTEGFKEGQVGSSAMPHKMNARSCERVG
CCCHHHHHHHHHHCCCHHHHHHHHHHHCCCHHHHHCHHCCCCCCCCCCCCCCHHHHHHHH
GLQVILRGYLTMVADLSGQQWNEGDVFCSVIRRVALPDAFFALDGMYETFLTVLAEFGAF
HHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCC
PAMIDRELERYLPFLATTRILMAAVRVGVGRETAHEVIKENAVAVALNMRENGGEQDLID
HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCEEEEEECCCCCCHHHHHH
RLAADERLPMTREQLDEALADRHAFIGAAESQVARVVDRVNDLVRRYPAAAQYTPGDIL
HHHCCCCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9371463 [H]