| Definition | Mycobacterium tuberculosis H37Ra, complete genome. |
|---|---|
| Accession | NC_009525 |
| Length | 4,419,977 |
The map label for this gene is purB [H]
Identifier: 148660554
GI number: 148660554
Start: 871314
End: 872732
Strand: Direct
Name: purB [H]
Synonym: MRA_0786
Alternate gene names: 148660554
Gene position: 871314-872732 (Clockwise)
Preceding gene: 148660552
Following gene: 148660555
Centisome position: 19.71
GC content: 65.61
Gene sequence:
>1419_bases GTGAGCATTCCCAACGTGCTGGCCACCCGATACGCCAGCGCCGAGATGGTCGCGATCTGGTCGCCGGAGGCCAAGGTGGT CTCGGAGCGGCGGTTATGGCTGGCCGTATTGCGGGCACAGGCAGAGCTGGGGGTAGCGGTTGCCGATTCGGTGCTCGCCG ACTACGAACGTGTGGTCGACGATGTGGACTTGGCCTCGATCTCAGCCCGGGAGCGGGTGCTGCGCCACGATGTCAAGGCC CGCATCGAGGAATTCAACGCATTGGCCGGTCATGAGCACGTGCACAAGGGGATGACCAGCCGCGACCTGACCGAGAACGT GGAGCAACTGCAGATTCGGCGGTCGCTGGAAGTGATTTTCGCCCATGGGGTGGCGGCGGTGGCGCGGCTGGCCGAGCGGG CGGTGAGCTACCGTGACCTGATCATGGCCGGGCGCAGCCACAACGTGGCCGCTCAGGCCACCACCTTGGGCAAGCGGTTC GCCTCGGCGGCCCAAGAGATGATGATCGCGTTGAGGCGGTTGAGGGAGTTGATCGACCGCTACCCCCTGCGTGGCATCAA GGGCCCGATGGGCACCGGTCAGGACATGCTCGATCTGCTGGGCGGTGACCGTGCGGCGCTGGCCGATCTCGAGCGGCGCG TCGCCGACTTCTTGGGCTTTGCAACTGTTTTCAACAGCGTGGGGCAGGTGTATCCGCGTTCATTGGACCACGACGTGGTT TCGGCTCTGGTGCAGCTCGGCGCGGGGCCGTCATCACTGGCACACACGATTCGATTGATGGCCGGCCACGAGCTCGCCAC CGAGGGTTTCGCGCCGGGTCAGGTCGGTTCGTCGGCGATGCCGCACAAGATGAACACCCGCAGCTGCGAACGGGTCAACG GGCTGCAGGTTGTGCTACGCGGCTATGCATCCATGGTGGCCGAGTTAGCCGGTGCACAGTGGAACGAGGGTGATGTGTTT TGCTCCGTGGTGCGCCGGGTTGCGTTGCCGGACAGCTTCTTTGCCGTCGACGGGCAGATCGAGACGTTTTTGACGGTGCT GGACGAGTTCGGCGCCTACCCGGCGGTGATCGGCCGCGAGTTGGATCGTTATCTGCCGTTCCTGGCCACCACTAAGGTGC TAATGGCGGCCGTGCGCGCGGGGATGGGTCGCGAGTCCGCGCACCGGTTGATCTCCGAGCACGCGGTGGCGACGGCGCTG GCCATGCGAGAACACGGCGCGGAGCCCGACCTGCTGGACCGGTTGGCCGCCGATCCGCGGCTGACGCTGGGACGAGACGC TTTGGAGGCCGCGCTGGCCGACAAGAAGGCATTTGCCGGTGCCGCGGGTGACCAGGTCGATGATGTGGTCGCGATGGTGG ACGCGCTGGTGAGCCGTTACCCGGACGCGGCTAAATACACGCCGGGTGCAATTCTTTAG
Upstream 100 bases:
>100_bases CGATGAGTCGGGCACATCGGGTGCTCCCTGGCGCCGGGACTCGTGTGACAACTGCGACTACTAGGCCCGCGACCGTAAGC TGTGTCTTTGTGAGGGCCAA
Downstream 100 bases:
>100_bases TGTCATGACTACCGCCGCCGGGCTTTCGGGCATCGATCTGACCGATCTGGACAACTTCGCCGACGGCTTCCCCCATCACC TCTTCGCCATCCACCGTCGT
Product: adenylosuccinate lyase
Products: NA
Alternate protein names: ASL; Adenylosuccinase; ASase [H]
Number of amino acids: Translated: 472; Mature: 471
Protein sequence:
>472_residues MSIPNVLATRYASAEMVAIWSPEAKVVSERRLWLAVLRAQAELGVAVADSVLADYERVVDDVDLASISARERVLRHDVKA RIEEFNALAGHEHVHKGMTSRDLTENVEQLQIRRSLEVIFAHGVAAVARLAERAVSYRDLIMAGRSHNVAAQATTLGKRF ASAAQEMMIALRRLRELIDRYPLRGIKGPMGTGQDMLDLLGGDRAALADLERRVADFLGFATVFNSVGQVYPRSLDHDVV SALVQLGAGPSSLAHTIRLMAGHELATEGFAPGQVGSSAMPHKMNTRSCERVNGLQVVLRGYASMVAELAGAQWNEGDVF CSVVRRVALPDSFFAVDGQIETFLTVLDEFGAYPAVIGRELDRYLPFLATTKVLMAAVRAGMGRESAHRLISEHAVATAL AMREHGAEPDLLDRLAADPRLTLGRDALEAALADKKAFAGAAGDQVDDVVAMVDALVSRYPDAAKYTPGAIL
Sequences:
>Translated_472_residues MSIPNVLATRYASAEMVAIWSPEAKVVSERRLWLAVLRAQAELGVAVADSVLADYERVVDDVDLASISARERVLRHDVKA RIEEFNALAGHEHVHKGMTSRDLTENVEQLQIRRSLEVIFAHGVAAVARLAERAVSYRDLIMAGRSHNVAAQATTLGKRF ASAAQEMMIALRRLRELIDRYPLRGIKGPMGTGQDMLDLLGGDRAALADLERRVADFLGFATVFNSVGQVYPRSLDHDVV SALVQLGAGPSSLAHTIRLMAGHELATEGFAPGQVGSSAMPHKMNTRSCERVNGLQVVLRGYASMVAELAGAQWNEGDVF CSVVRRVALPDSFFAVDGQIETFLTVLDEFGAYPAVIGRELDRYLPFLATTKVLMAAVRAGMGRESAHRLISEHAVATAL AMREHGAEPDLLDRLAADPRLTLGRDALEAALADKKAFAGAAGDQVDDVVAMVDALVSRYPDAAKYTPGAIL >Mature_471_residues SIPNVLATRYASAEMVAIWSPEAKVVSERRLWLAVLRAQAELGVAVADSVLADYERVVDDVDLASISARERVLRHDVKAR IEEFNALAGHEHVHKGMTSRDLTENVEQLQIRRSLEVIFAHGVAAVARLAERAVSYRDLIMAGRSHNVAAQATTLGKRFA SAAQEMMIALRRLRELIDRYPLRGIKGPMGTGQDMLDLLGGDRAALADLERRVADFLGFATVFNSVGQVYPRSLDHDVVS ALVQLGAGPSSLAHTIRLMAGHELATEGFAPGQVGSSAMPHKMNTRSCERVNGLQVVLRGYASMVAELAGAQWNEGDVFC SVVRRVALPDSFFAVDGQIETFLTVLDEFGAYPAVIGRELDRYLPFLATTKVLMAAVRAGMGRESAHRLISEHAVATALA MREHGAEPDLLDRLAADPRLTLGRDALEAALADKKAFAGAAGDQVDDVVAMVDALVSRYPDAAKYTPGAIL
Specific function: De novo purine biosynthesis; eighth step. [C]
COG id: COG0015
COG function: function code F; Adenylosuccinate lyase
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the lyase 1 family. Adenylosuccinate lyase subfamily [H]
Homologues:
Organism=Homo sapiens, GI4557269, Length=442, Percent_Identity=31.6742081447964, Blast_Score=226, Evalue=5e-59
Organism=Homo sapiens, GI183227688, Length=389, Percent_Identity=32.3907455012853, Blast_Score=206, Evalue=4e-53
Organism=Caenorhabditis elegans, GI17508577, Length=466, Percent_Identity=27.6824034334764, Blast_Score=169, Evalue=4e-42
Organism=Caenorhabditis elegans, GI32564234, Length=343, Percent_Identity=30.6122448979592, Blast_Score=149, Evalue=4e-36
Organism=Saccharomyces cerevisiae, GI6323391, Length=449, Percent_Identity=34.7438752783964, Blast_Score=242, Evalue=8e-65
Organism=Drosophila melanogaster, GI24647570, Length=465, Percent_Identity=33.1182795698925, Blast_Score=238, Evalue=6e-63
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR019468
- InterPro: IPR003031
- InterPro: IPR000362
- InterPro: IPR020557
- InterPro: IPR008948
- InterPro: IPR022761
- InterPro: IPR004769 [H]
Pfam domain/function: PF10397 ADSL_C; PF00206 Lyase_1 [H]
EC number: 4.3.2.2 [H]
Molecular weight: Translated: 51041; Mature: 50910
Theoretical pI: Translated: 6.33; Mature: 6.33
Prosite motif: PS00163 FUMARATE_LYASES
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.4 %Cys (Translated Protein)
3.4 %Met (Translated Protein)
3.8 %Cys+Met (Translated Protein)
0.4 %Cys (Mature Protein)
3.2 %Met (Mature Protein)
3.6 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure
MSIPNVLATRYASAEMVAIWSPEAKVVSERRLWLAVLRAQAELGVAVADSVLADYERVVD
CCCCHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
DVDLASISARERVLRHDVKARIEEFNALAGHEHVHKGMTSRDLTENVEQLQIRRSLEVIF
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHCCCCHHHHHHHHHHHHHHHHHHHHH
AHGVAAVARLAERAVSYRDLIMAGRSHNVAAQATTLGKRFASAAQEMMIALRRLRELIDR
HHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
YPLRGIKGPMGTGQDMLDLLGGDRAALADLERRVADFLGFATVFNSVGQVYPRSLDHDVV
CCCCCCCCCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHH
SALVQLGAGPSSLAHTIRLMAGHELATEGFAPGQVGSSAMPHKMNTRSCERVNGLQVVLR
HHHHHHCCCHHHHHHHHHHHHCCHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH
GYASMVAELAGAQWNEGDVFCSVVRRVALPDSFFAVDGQIETFLTVLDEFGAYPAVIGRE
HHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCEEECCHHHHHHHHHHHHCCCHHHHHHH
LDRYLPFLATTKVLMAAVRAGMGRESAHRLISEHAVATALAMREHGAEPDLLDRLAADPR
HHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCCCC
LTLGRDALEAALADKKAFAGAAGDQVDDVVAMVDALVSRYPDAAKYTPGAIL
CCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCC
>Mature Secondary Structure
SIPNVLATRYASAEMVAIWSPEAKVVSERRLWLAVLRAQAELGVAVADSVLADYERVVD
CCCHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
DVDLASISARERVLRHDVKARIEEFNALAGHEHVHKGMTSRDLTENVEQLQIRRSLEVIF
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHCCCCHHHHHHHHHHHHHHHHHHHHH
AHGVAAVARLAERAVSYRDLIMAGRSHNVAAQATTLGKRFASAAQEMMIALRRLRELIDR
HHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
YPLRGIKGPMGTGQDMLDLLGGDRAALADLERRVADFLGFATVFNSVGQVYPRSLDHDVV
CCCCCCCCCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHH
SALVQLGAGPSSLAHTIRLMAGHELATEGFAPGQVGSSAMPHKMNTRSCERVNGLQVVLR
HHHHHHCCCHHHHHHHHHHHHCCHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH
GYASMVAELAGAQWNEGDVFCSVVRRVALPDSFFAVDGQIETFLTVLDEFGAYPAVIGRE
HHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCEEECCHHHHHHHHHHHHCCCHHHHHHH
LDRYLPFLATTKVLMAAVRAGMGRESAHRLISEHAVATALAMREHGAEPDLLDRLAADPR
HHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCCCC
LTLGRDALEAALADKKAFAGAAGDQVDDVVAMVDALVSRYPDAAKYTPGAIL
CCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9389475 [H]
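
Several numeric fields in this record can be cross-checked against one another. The sketch below is illustrative only and is not part of the source database. It assumes that Start and End are 1-based inclusive coordinates, that the final codon is a stop codon, and that the centisome position is the gene start expressed as a percentage of the genome length. The function names (`gene_length`, `translated_residues`, `centisome`, `gc_percent`) are hypothetical helpers introduced for this example.

```python
# Values copied from the record above.
GENOME_LENGTH = 4_419_977  # Length
GENE_START = 871_314       # Start (assumed 1-based, inclusive)
GENE_END = 872_732         # End (assumed 1-based, inclusive)


def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based inclusive coordinates."""
    return end - start + 1


def translated_residues(n_bases: int) -> int:
    """Amino acids encoded, assuming the last codon is a stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return n_bases // 3 - 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


n_bases = gene_length(GENE_START, GENE_END)
print(n_bases)                                          # 1419
print(translated_residues(n_bases))                     # 472
print(round(centisome(GENE_START, GENOME_LENGTH), 2))   # 19.71
```

Under these assumptions the coordinates reproduce the 1419-base gene length, the 472-residue translated protein, and the centisome position of 19.71 listed in the record. The mature length of 471 is one residue shorter than the translated length, which matches removal of the initiator methionine. Passing the full gene sequence to `gc_percent` would give a value to compare with the GC content field.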