Definition Methanosarcina mazei Go1 chromosome, complete genome.
Accession NC_003901
Length 4,096,345

The map label for this gene is purP [H]

Identifier: 21226695

GI number: 21226695

Start: 724369

End: 725532

Strand: Direct

Name: purP [H]

Synonym: MM_0593

Alternate gene names: 21226695

Gene position: 724369-725532 (Clockwise)

Preceding gene: 21226694

Following gene: 21226696

Centisome position: 17.68
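
Note: the coordinates above can be cross-checked with a little arithmetic. The sketch below assumes that Start and End are 1-based inclusive positions and that the centisome position is the gene start expressed as a percentage of the chromosome length; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the chromosome length (assumed definition)."""
    return 100.0 * start / genome_length


# Values taken from this record: Start, End, and genome Length.
print(gene_length(724369, 725532))           # 1164
print(round(centisome(724369, 4096345), 2))  # 17.68
```

Under these assumptions the results agree with the 1164-base gene sequence and the centisome position of 17.68 listed in this record.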

GC content: 46.31

Gene sequence:

>1164_bases
ATGATTGACAGGAAAGAAATAAAGGAAATTGTTGAAGGCTATTACGCGCACGCTGATAAGATAAAAATAGGGGCAATTGC
CTCACACTCAGGGCTCGACATCTGCGACGGAGCAGTTGAAGAAGACTTCAGGACCCTTGCAGTCTGCCAGGCAGGAAGGG
AGAAGACTTACACTGAATACTTCAGGGCTCAGAGAGACCATTACGGAAGAATAAAAAGAGGGATTGTCGATGAAACAGTT
GTATATAAGAAGTATAATGAAATCCTTCTGCCCCAGAACCAGCAGAAACTGGTTGACGAAAAAGTGCTCTTTATCCCTAA
CCGTTCTTTTACTTCCTACTGCAGCATAGATGAGATTGAAGATAATTTCAGGGTCCCCATGGTAGGAAGCAGGAACCTTT
TGAGAAGTGAGGAACGCAGCGAACAGCAGAGCTACTACTGGATCCTCGAAAAAGCAGGGCTTCCTTTCCCTGAGAAAATA
GAATCTCCAGAGGACATCAACGAACTCGTAATGGTAAAACTTCCACACGCAGTAAAGAAACTCGAGAGAGGCTTTTTTAC
GGCTTCAAGCTATAAGGAATACCAGGAGAAATCCGAAGCTCTTATAAAGCAGGGAGTAATTACGCGCGACGCCCTTGAAA
ATGCCAGGATAGAGCGTTATATCATAGGACCAGTGTTCAATCTTGACATGTTCTACTCTCCTATCGAGCCGAAAATGAGC
AAACTGGAGTTACTTGGCGTTGACTGGCGCTTTGAGACTAGCCTTGACGGGCATGTAAGGCTTCCTGCCCCGCAGCAGAT
GGCTCTTGCACCTCACCAGCTCACTCCTGAATATACGGTCTGCGGACATAACTCTGCGACACTTCGTGAATCTCTTCTTG
ACAAAGTCTTTGAGATGGGAGAAAAATACGTAAAAGCCACTCAGGAGCACTATGCGCCTGGAATTATAGGATCGTTCTGC
CTCCAGACCTGTGTGGACAAGGACCTTAATTTCTATATTTATGATGTGGCCCCGAGAGTAGGCGGCGGGACAAATGTGCA
TATGTCAGTGGGTCATTCCTATGGCAATTCACTCTGGAGAAAACCGATGAGTACGGGCAGAAGGCTTGCCTTTGAGATAA
GACGCGCTCTGGAGCTCGAGAAGCTTGATATGATCGTCACATAA
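
Note: the GC content field is presumably the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows; the function name is hypothetical and the exact method used to produce the reported value is not given in this record.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks are ignored, so a wrapped FASTA body
    can be passed in directly (without the '>' header line).
    """
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input, not the full gene: two of the four bases are G or C.
print(gc_content("ATGC"))  # 50.0
```

Passing the full 1164-base sequence in place of the toy string gives a value to compare against the reported GC content.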

Upstream 100 bases:

>100_bases
GAAATACTGCTTCAATACAGATCAAAAATCACACAGGCCAGTGCCTGATCACCATCTCACGGAAGTATCACTACCTCGAA
AAAGCTGAGAGATATCACAT

Downstream 100 bases:

>100_bases
ATTTATTAAGGCTTTCCGGCATTAGTAGATATAAATTGTGCAGAAACCGTAAACTTTAAAAAGATCTCCTGAATCTGCGG
GATTCAAGCAGTAATGAGAT

Product: 5-formaminoimidazole-4-carboxamide-1-(beta)-D-ribofuranosyl 5'-monophosphate synthetase-like protein

Products: NA

Alternate protein names: 5-aminoimidazole-4-carboxamide-1-beta-D-ribofuranosyl 5'-monophosphate--formate ligase [H]

Number of amino acids: Translated: 387; Mature: 387

Protein sequence:

>387_residues
MIDRKEIKEIVEGYYAHADKIKIGAIASHSGLDICDGAVEEDFRTLAVCQAGREKTYTEYFRAQRDHYGRIKRGIVDETV
VYKKYNEILLPQNQQKLVDEKVLFIPNRSFTSYCSIDEIEDNFRVPMVGSRNLLRSEERSEQQSYYWILEKAGLPFPEKI
ESPEDINELVMVKLPHAVKKLERGFFTASSYKEYQEKSEALIKQGVITRDALENARIERYIIGPVFNLDMFYSPIEPKMS
KLELLGVDWRFETSLDGHVRLPAPQQMALAPHQLTPEYTVCGHNSATLRESLLDKVFEMGEKYVKATQEHYAPGIIGSFC
LQTCVDKDLNFYIYDVAPRVGGGTNVHMSVGHSYGNSLWRKPMSTGRRLAFEIRRALELEKLDMIVT
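
Note: the protein sequence corresponds to a codon-by-codon translation of the gene sequence (1164 bases = 387 codons plus one stop codon). The sketch below assumes the standard genetic code and a reading frame starting at the first base; the table layout and function name are illustrative, not part of this record.

```python
# Standard genetic code, with codons ordered by base in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace is ignored; any character other than A, C, G, T
    raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 24 and last 15 bases of the gene sequence in this record.
print(translate("ATGATTGACAGGAAAGAAATAAAG"))  # MIDRKEIK
print(translate("ATGATCGTCACATAA"))           # MIVT
```

The two fragments reproduce the first eight and last four residues of the protein sequence listed above.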

Sequences:

>Translated_387_residues
MIDRKEIKEIVEGYYAHADKIKIGAIASHSGLDICDGAVEEDFRTLAVCQAGREKTYTEYFRAQRDHYGRIKRGIVDETV
VYKKYNEILLPQNQQKLVDEKVLFIPNRSFTSYCSIDEIEDNFRVPMVGSRNLLRSEERSEQQSYYWILEKAGLPFPEKI
ESPEDINELVMVKLPHAVKKLERGFFTASSYKEYQEKSEALIKQGVITRDALENARIERYIIGPVFNLDMFYSPIEPKMS
KLELLGVDWRFETSLDGHVRLPAPQQMALAPHQLTPEYTVCGHNSATLRESLLDKVFEMGEKYVKATQEHYAPGIIGSFC
LQTCVDKDLNFYIYDVAPRVGGGTNVHMSVGHSYGNSLWRKPMSTGRRLAFEIRRALELEKLDMIVT
>Mature_387_residues
MIDRKEIKEIVEGYYAHADKIKIGAIASHSGLDICDGAVEEDFRTLAVCQAGREKTYTEYFRAQRDHYGRIKRGIVDETV
VYKKYNEILLPQNQQKLVDEKVLFIPNRSFTSYCSIDEIEDNFRVPMVGSRNLLRSEERSEQQSYYWILEKAGLPFPEKI
ESPEDINELVMVKLPHAVKKLERGFFTASSYKEYQEKSEALIKQGVITRDALENARIERYIIGPVFNLDMFYSPIEPKMS
KLELLGVDWRFETSLDGHVRLPAPQQMALAPHQLTPEYTVCGHNSATLRESLLDKVFEMGEKYVKATQEHYAPGIIGSFC
LQTCVDKDLNFYIYDVAPRVGGGTNVHMSVGHSYGNSLWRKPMSTGRRLAFEIRRALELEKLDMIVT

Specific function: Catalyzes the ATP- and formate-dependent formylation of 5-aminoimidazole-4-carboxamide-1-beta-D-ribofuranosyl 5'-monophosphate (AICAR) to 5-formaminoimidazole-4-carboxamide-1-beta-D-ribofuranosyl 5'-monophosphate (FAICAR) in the absence of folates [H]

COG id: COG1759

COG function: function code R; ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases)

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Contains 1 ATP-grasp domain [H]

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR011761
- InterPro:   IPR013815
- InterPro:   IPR013816
- InterPro:   IPR009720
- InterPro:   IPR010672
- InterPro:   IPR013817 [H]

Pfam domain/function: PF06849 DUF1246; PF06973 DUF1297 [H]

EC number: NA

Molecular weight: Translated: 44472; Mature: 44472

Theoretical pI: Translated: 6.36; Mature: 6.36

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.6 %Cys     (Translated Protein)
2.6 %Met     (Translated Protein)
4.1 %Cys+Met (Translated Protein)
1.6 %Cys     (Mature Protein)
2.6 %Met     (Mature Protein)
4.1 %Cys+Met (Mature Protein)
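
Note: the Cys/Met figures are residue percentages over the protein length. A minimal sketch of how such values could be derived is given below; the function name is hypothetical, and rounding to one decimal place is an assumption based on how the values are displayed here.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`.

    `residues` is a string of one-letter codes, e.g. "C", "M" or "CM".
    """
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    count = sum(1 for amino_acid in seq if amino_acid in residues)
    return 100.0 * count / len(seq)


# First 20 residues of the protein in this record (one Met, no Cys).
fragment = "MIDRKEIKEIVEGYYAHADK"
print(residue_percent(fragment, "M"))   # 5.0
print(residue_percent(fragment, "C"))   # 0.0
print(residue_percent(fragment, "CM"))  # 5.0
```

Applying the same function to the full 387-residue sequence and rounding to one decimal place gives values to compare against the table above.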

Secondary structure:

>Translated Secondary Structure
MIDRKEIKEIVEGYYAHADKIKIGAIASHSGLDICDGAVEEDFRTLAVCQAGREKTYTEY
CCCHHHHHHHHHHHHHCCCCEEEEEEECCCCCCHHCCHHHHHHHHHHHHHCCCCHHHHHH
FRAQRDHYGRIKRGIVDETVVYKKYNEILLPQNQQKLVDEKVLFIPNRSFTSYCSIDEIE
HHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCCCHHHHHCCCEEEECCCCCCCCCCHHHCC
DNFRVPMVGSRNLLRSEERSEQQSYYWILEKAGLPFPEKIESPEDINELVMVKLPHAVKK
CCCCCCCCCCCHHHHHHHHHHHHHEEEEEECCCCCCHHHCCCCCCHHHHEEEECCHHHHH
LERGFFTASSYKEYQEKSEALIKQGVITRDALENARIERYIIGPVFNLDMFYSPIEPKMS
HHHCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHCCEEEEEECCEEEHHHHCCCCCCCCH
KLELLGVDWRFETSLDGHVRLPAPQQMALAPHQLTPEYTVCGHNSATLRESLLDKVFEMG
HEEEEECCEEEECCCCCCEECCCCHHHHCCCCCCCCCEEEECCCHHHHHHHHHHHHHHHH
EKYVKATQEHYAPGIIGSFCLQTCVDKDLNFYIYDVAPRVGGGTNVHMSVGHSYGNSLWR
HHHHHHHHHHCCCCHHHHHHHHHHHCCCCCEEEEEECCCCCCCCEEEEECCCHHHHHHHH
KPMSTGRRLAFEIRRALELEKLDMIVT
CHHHHHHHHHHHHHHHHHHHHHHHCCC
>Mature Secondary Structure
MIDRKEIKEIVEGYYAHADKIKIGAIASHSGLDICDGAVEEDFRTLAVCQAGREKTYTEY
CCCHHHHHHHHHHHHHCCCCEEEEEEECCCCCCHHCCHHHHHHHHHHHHHCCCCHHHHHH
FRAQRDHYGRIKRGIVDETVVYKKYNEILLPQNQQKLVDEKVLFIPNRSFTSYCSIDEIE
HHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCCCHHHHHCCCEEEECCCCCCCCCCHHHCC
DNFRVPMVGSRNLLRSEERSEQQSYYWILEKAGLPFPEKIESPEDINELVMVKLPHAVKK
CCCCCCCCCCCHHHHHHHHHHHHHEEEEEECCCCCCHHHCCCCCCHHHHEEEECCHHHHH
LERGFFTASSYKEYQEKSEALIKQGVITRDALENARIERYIIGPVFNLDMFYSPIEPKMS
HHHCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHCCEEEEEECCEEEHHHHCCCCCCCCH
KLELLGVDWRFETSLDGHVRLPAPQQMALAPHQLTPEYTVCGHNSATLRESLLDKVFEMG
HEEEEECCEEEECCCCCCEECCCCHHHHCCCCCCCCCEEEECCCHHHHHHHHHHHHHHHH
EKYVKATQEHYAPGIIGSFCLQTCVDKDLNFYIYDVAPRVGGGTNVHMSVGHSYGNSLWR
HHHHHHHHHHCCCCHHHHHHHHHHHCCCCCEEEEEECCCCCCCCEEEEECCCHHHHHHHH
KPMSTGRRLAFEIRRALELEKLDMIVT
CHHHHHHHHHHHHHHHHHHHHHHHCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA