Definition | Bacillus anthracis str. Sterne chromosome, complete genome. |
---|---|
Accession | NC_005945 |
Length | 5,228,663 |
Click here to switch to the map view.
The map label for this gene is purL
Identifier: 49183309
GI number: 49183309
Start: 300211
End: 302430
Strand: Direct
Name: purL
Synonym: BAS0281
Alternate gene names: 49183309
Gene position: 300211-302430 (Clockwise)
Preceding gene: 49183308
Following gene: 49183310
Centisome position: 5.74
GC content: 40.77
Gene sequence:
>2220_bases ATGTCGTTAATGCTTGAACCAAATCCAACACAAATTAAAGAAGAGCGCATATATGCGGAAATGGGGCTAACAGACGAAGA GTTTGCCATGGTTGAAAAGATTTTAGGCCGTCTGCCAAATTATACAGAAACAGGTTTATTCTCTGTAATGTGGTCAGAAC ATTGTAGTTATAAAAATTCAAAACCAGTGCTTCGAAAATTCCCAACAACAGGTGAGCGCGTTCTGCAAGGACCTGGGGAA GGTGCTGGAATTGTAGATATCGGTGATAATCAAGCGGTTGTATTTAAAATGGAAAGTCATAACCATCCTTCTGCTATTGA ACCATATCAAGGAGCAGCAACAGGCGTTGGTGGTATTATTCGTGACGTATTCTCTATGGGGGCACGTCCAGTAGCTCTAT TAAACTCACTTCGCTTTGGGGAATTACAATCACCACGTGTGAAATACTTATTCGAAGAAGTAGTAGCAGGAATTGCGGGA TACGGTAACTGTATCGGTATTCCGACTGTTGGCGGCGAAGTACAGTTTGATCCATGTTATGAAGGAAACCCACTTGTAAA TGCAATGTGCGTAGGCTTAATTAACCACGAAGACATTAAAAAAGGGCAAGCGCACGGTGCTGGAAATACAGTAATGTACG TAGGCGCATCAACTGGTCGTGATGGTATCCACGGTGCAACATTCGCATCTGAAGAGTTATCTGAAAGCTCAGAAGCGAAG CGTCCAGCAGTTCAAGTAGGAGATCCATTTATGGAGAAACTTCTTATTGAAGCTTGCTTAGAACTTATTCAATCGGATGC ACTTGTTGGAATTCAAGATATGGGTGCAGCTGGTTTAACATCATCTTCTGCAGAAATGGCAAGTAAAGCAGGAATGGGTA TTGAAATGTACTTAGATGATGTACCGCAGCGAGAAACAGGAATGACACCATATGAAATGATGTTATCTGAATCACAAGAG CGTATGTTAATTGTTGTGAAAAAAGGTAGAGAGCAAGAAATAGTAGATTTATTTGAGAAGTATGGTCTTGCAGCCGTTAC GATGGGGAAAGTAACGGAAGACAAAATGCTTCGTTTATTCCATAAAGGTGAAATGGTAGCTGAAGTGCCAGCAGATGCTC TTGCAGAAGAAGCACCAATTTATCATAAACCATCAAAAGAAGCAGCATATTTTGCTGAATTCCAGCAGATGAAAATGGAA ACACCAAAAGTAGAAAATTATAAAGAAACGTTATTCGCTCTATTACAACAACCGACGATTGCAAGTAAAGAGTGGGTTTA TGATCAATATGATTATCAAGTTCGCACAAGCACAGTTGTTACACCAGGTTCAGATGCAGCAGTTGTACGTGTACGCGGTA CAGAAAAAGGATTAGCAATGACGACAGATTGTAACTCTCGCTACATTTATTTAGATCCAGAAGTGGGCGGTAAAATTGCA GTAGCAGAGGCAGCACGTAATATCGTATGTTCTGGGGGAGAGCCTCTTGCAATTACAGATTGCTTAAACTTCGGTAATCC AGAAAAACCAGAAATCTTCTGGCAAATTGAGAAATCAGTAGATGGTATGAGTGAAGCTTGTCGTACGTTACAAACACCAG TTATCGGCGGAAACGTATCAATGTATAACGAGCGTAGTGGTGAAGCGGTATACCCAACACCGACAGTTGGGATGGTTGGA CTTGTACATGATTTAAAACATGTAACAACACAAGAGTTTAAGCAAGCCGGCGATTTAGTTTATGTCATTGGAGAGACGAA AGCTGAGTTTGGTGGAAGTGAATTACAGAAAATGCTTCACGGAAAAATTTTCGGCCAATCACCAAGTATCGATTTAGACG TAGAATTAAAACGTCAAAAACAAGTATTAGCAGCCATTCAAGCTGGCCTTGTTCAATCTGCACATGACGTTGCTGAAGGT GGTTTAGCGGTTGCGATTTCAGAAAGTGCAATTGGTGCTAACGGCTTAGGTGCTACTGTGAAATTAGACGGGGAAGCAAC AGCTGCATTATTTGCGGAATCACAATCTCGCTTCGTAATAACGGTAAAACGTGAAAATAAAGAAGCGTTCGAGAAAGCGG TAGAAGCCATTCAAGTTGGTGAAGTAACAAATACAAATGAAGTAACAATTCATAATGAAGAAAATGAAGTATTACTTACA GCAAATGTAGATGAAATGAGAAAGGCTTGGAAAGGGGCAATCCCATGCTTGCTGAAATAA
Upstream 100 bases:
>100_bases GGTATGATGCCACACCCAGAGCGTGCTGTAGATGAATTACTTGGCGGTGCTGAAGGGTTAAAAGTCTTTCAATCTATCTT AAAACAGTGGAGGGAAACAT
Downstream 100 bases:
>100_bases AGGGGTTAAACGAAGAATGTGGCGTATTCGGAATTTGGGGGCATGAAAATGCGGCACAAGTTTCATACTACGGATTGCAC AGTTTACAGCACCGTGGGCA
Product: phosphoribosylformylglycinamidine synthase II
Products: NA
Alternate protein names: Phosphoribosylformylglycinamidine synthase II; FGAM synthase II
Number of amino acids: Translated: 739; Mature: 738
Protein sequence:
>739_residues MSLMLEPNPTQIKEERIYAEMGLTDEEFAMVEKILGRLPNYTETGLFSVMWSEHCSYKNSKPVLRKFPTTGERVLQGPGE GAGIVDIGDNQAVVFKMESHNHPSAIEPYQGAATGVGGIIRDVFSMGARPVALLNSLRFGELQSPRVKYLFEEVVAGIAG YGNCIGIPTVGGEVQFDPCYEGNPLVNAMCVGLINHEDIKKGQAHGAGNTVMYVGASTGRDGIHGATFASEELSESSEAK RPAVQVGDPFMEKLLIEACLELIQSDALVGIQDMGAAGLTSSSAEMASKAGMGIEMYLDDVPQRETGMTPYEMMLSESQE RMLIVVKKGREQEIVDLFEKYGLAAVTMGKVTEDKMLRLFHKGEMVAEVPADALAEEAPIYHKPSKEAAYFAEFQQMKME TPKVENYKETLFALLQQPTIASKEWVYDQYDYQVRTSTVVTPGSDAAVVRVRGTEKGLAMTTDCNSRYIYLDPEVGGKIA VAEAARNIVCSGGEPLAITDCLNFGNPEKPEIFWQIEKSVDGMSEACRTLQTPVIGGNVSMYNERSGEAVYPTPTVGMVG LVHDLKHVTTQEFKQAGDLVYVIGETKAEFGGSELQKMLHGKIFGQSPSIDLDVELKRQKQVLAAIQAGLVQSAHDVAEG GLAVAISESAIGANGLGATVKLDGEATAALFAESQSRFVITVKRENKEAFEKAVEAIQVGEVTNTNEVTIHNEENEVLLT ANVDEMRKAWKGAIPCLLK
Sequences:
>Translated_739_residues MSLMLEPNPTQIKEERIYAEMGLTDEEFAMVEKILGRLPNYTETGLFSVMWSEHCSYKNSKPVLRKFPTTGERVLQGPGE GAGIVDIGDNQAVVFKMESHNHPSAIEPYQGAATGVGGIIRDVFSMGARPVALLNSLRFGELQSPRVKYLFEEVVAGIAG YGNCIGIPTVGGEVQFDPCYEGNPLVNAMCVGLINHEDIKKGQAHGAGNTVMYVGASTGRDGIHGATFASEELSESSEAK RPAVQVGDPFMEKLLIEACLELIQSDALVGIQDMGAAGLTSSSAEMASKAGMGIEMYLDDVPQRETGMTPYEMMLSESQE RMLIVVKKGREQEIVDLFEKYGLAAVTMGKVTEDKMLRLFHKGEMVAEVPADALAEEAPIYHKPSKEAAYFAEFQQMKME TPKVENYKETLFALLQQPTIASKEWVYDQYDYQVRTSTVVTPGSDAAVVRVRGTEKGLAMTTDCNSRYIYLDPEVGGKIA VAEAARNIVCSGGEPLAITDCLNFGNPEKPEIFWQIEKSVDGMSEACRTLQTPVIGGNVSMYNERSGEAVYPTPTVGMVG LVHDLKHVTTQEFKQAGDLVYVIGETKAEFGGSELQKMLHGKIFGQSPSIDLDVELKRQKQVLAAIQAGLVQSAHDVAEG GLAVAISESAIGANGLGATVKLDGEATAALFAESQSRFVITVKRENKEAFEKAVEAIQVGEVTNTNEVTIHNEENEVLLT ANVDEMRKAWKGAIPCLLK >Mature_738_residues SLMLEPNPTQIKEERIYAEMGLTDEEFAMVEKILGRLPNYTETGLFSVMWSEHCSYKNSKPVLRKFPTTGERVLQGPGEG AGIVDIGDNQAVVFKMESHNHPSAIEPYQGAATGVGGIIRDVFSMGARPVALLNSLRFGELQSPRVKYLFEEVVAGIAGY GNCIGIPTVGGEVQFDPCYEGNPLVNAMCVGLINHEDIKKGQAHGAGNTVMYVGASTGRDGIHGATFASEELSESSEAKR PAVQVGDPFMEKLLIEACLELIQSDALVGIQDMGAAGLTSSSAEMASKAGMGIEMYLDDVPQRETGMTPYEMMLSESQER MLIVVKKGREQEIVDLFEKYGLAAVTMGKVTEDKMLRLFHKGEMVAEVPADALAEEAPIYHKPSKEAAYFAEFQQMKMET PKVENYKETLFALLQQPTIASKEWVYDQYDYQVRTSTVVTPGSDAAVVRVRGTEKGLAMTTDCNSRYIYLDPEVGGKIAV AEAARNIVCSGGEPLAITDCLNFGNPEKPEIFWQIEKSVDGMSEACRTLQTPVIGGNVSMYNERSGEAVYPTPTVGMVGL VHDLKHVTTQEFKQAGDLVYVIGETKAEFGGSELQKMLHGKIFGQSPSIDLDVELKRQKQVLAAIQAGLVQSAHDVAEGG LAVAISESAIGANGLGATVKLDGEATAALFAESQSRFVITVKRENKEAFEKAVEAIQVGEVTNTNEVTIHNEENEVLLTA NVDEMRKAWKGAIPCLLK
Specific function: Unknown
COG id: COG0046
COG function: function code F; Phosphoribosylformylglycinamidine (FGAM) synthase, synthetase domain
Gene ontology:
Cell location: Cytoplasm
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the FGAMS family
Homologues:
Organism=Homo sapiens, GI31657129, Length=722, Percent_Identity=24.2382271468144, Blast_Score=114, Evalue=3e-25, Organism=Escherichia coli, GI48994899, Length=767, Percent_Identity=25.2933507170795, Blast_Score=137, Evalue=2e-33, Organism=Caenorhabditis elegans, GI17553022, Length=683, Percent_Identity=21.9619326500732, Blast_Score=104, Evalue=2e-22, Organism=Saccharomyces cerevisiae, GI6321498, Length=722, Percent_Identity=22.5761772853186, Blast_Score=92, Evalue=3e-19, Organism=Drosophila melanogaster, GI24582111, Length=637, Percent_Identity=24.1758241758242, Blast_Score=104, Evalue=3e-22, Organism=Drosophila melanogaster, GI24582109, Length=637, Percent_Identity=24.1758241758242, Blast_Score=104, Evalue=3e-22, Organism=Drosophila melanogaster, GI17137292, Length=637, Percent_Identity=24.1758241758242, Blast_Score=104, Evalue=3e-22,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): PURL_BACAA (C3PBN0)
Other databases:
- EMBL: CP001598 - RefSeq: YP_002864912.1 - ProteinModelPortal: C3PBN0 - SMR: C3PBN0 - EnsemblBacteria: EBBACT00000130097 - GeneID: 7851020 - GenomeReviews: CP001598_GR - KEGG: bai:BAA_0349 - GeneTree: EBGT00050000002932 - ProtClustDB: PRK01213 - GO: GO:0005737 - HAMAP: MF_00420 - InterPro: IPR000728 - InterPro: IPR010918 - InterPro: IPR010074 - InterPro: IPR016188 - TIGRFAMs: TIGR01736
Pfam domain/function: PF00586 AIRS; PF02769 AIRS_C; SSF56042 AIR_synth_C; SSF55326 PurM_N-like
EC number: =6.3.5.3
Molecular weight: Translated: 80176; Mature: 80045
Theoretical pI: Translated: 4.57; Mature: 4.57
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.4 %Cys (Translated Protein) 3.9 %Met (Translated Protein) 5.3 %Cys+Met (Translated Protein) 1.4 %Cys (Mature Protein) 3.8 %Met (Mature Protein) 5.1 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MSLMLEPNPTQIKEERIYAEMGLTDEEFAMVEKILGRLPNYTETGLFSVMWSEHCSYKNS CCEEECCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCCCCC KPVLRKFPTTGERVLQGPGEGAGIVDIGDNQAVVFKMESHNHPSAIEPYQGAATGVGGII CCHHHHCCCCCHHHHCCCCCCCCEEEECCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHH RDVFSMGARPVALLNSLRFGELQSPRVKYLFEEVVAGIAGYGNCIGIPTVGGEVQFDPCY HHHHHCCCCHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEECCCCCCEEEECCCC EGNPLVNAMCVGLINHEDIKKGQAHGAGNTVMYVGASTGRDGIHGATFASEELSESSEAK CCCCHHHHHHHHHCCHHHHHCCCCCCCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHC RPAVQVGDPFMEKLLIEACLELIQSDALVGIQDMGAAGLTSSSAEMASKAGMGIEMYLDD CCCEECCCHHHHHHHHHHHHHHHHCCCEEEHHHCCCCCCCCCHHHHHHHCCCCEEEEECC VPQRETGMTPYEMMLSESQERMLIVVKKGREQEIVDLFEKYGLAAVTMGKVTEDKMLRLF CCCCCCCCCHHHHHHCCCCCEEEEEEECCCCHHHHHHHHHCCCEEEEECCCCHHHHHHHH HKGEMVAEVPADALAEEAPIYHKPSKEAAYFAEFQQMKMETPKVENYKETLFALLQQPTI HCCCEEECCCHHHHHHCCCCCCCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCC ASKEWVYDQYDYQVRTSTVVTPGSDAAVVRVRGTEKGLAMTTDCNSRYIYLDPEVGGKIA CCCCCCCCCCCEEEEEEEEECCCCCEEEEEEECCCCCEEEEECCCCEEEEECCCCCCEEE VAEAARNIVCSGGEPLAITDCLNFGNPEKPEIFWQIEKSVDGMSEACRTLQTPVIGGNVS EEHHHCCEEECCCCCEEEEEHHCCCCCCCCEEEEEEHHHHCHHHHHHHHHCCCEECCCEE MYNERSGEAVYPTPTVGMVGLVHDLKHVTTQEFKQAGDLVYVIGETKAEFGGSELQKMLH EECCCCCCEECCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCHHCCHHHHHHHHH GKIFGQSPSIDLDVELKRQKQVLAAIQAGLVQSAHDVAEGGLAVAISESAIGANGLGATV HHHCCCCCCEEEEEEHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCEEE KLDGEATAALFAESQSRFVITVKRENKEAFEKAVEAIQVGEVTNTNEVTIHNEENEVLLT EECCCCEEEEEECCCCCEEEEEECCCHHHHHHHHHHHHHCCCCCCCEEEEECCCCCEEEE ANVDEMRKAWKGAIPCLLK ECHHHHHHHHCCCCCCCCC >Mature Secondary Structure SLMLEPNPTQIKEERIYAEMGLTDEEFAMVEKILGRLPNYTETGLFSVMWSEHCSYKNS CEEECCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCCCCC KPVLRKFPTTGERVLQGPGEGAGIVDIGDNQAVVFKMESHNHPSAIEPYQGAATGVGGII CCHHHHCCCCCHHHHCCCCCCCCEEEECCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHH RDVFSMGARPVALLNSLRFGELQSPRVKYLFEEVVAGIAGYGNCIGIPTVGGEVQFDPCY HHHHHCCCCHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEECCCCCCEEEECCCC EGNPLVNAMCVGLINHEDIKKGQAHGAGNTVMYVGASTGRDGIHGATFASEELSESSEAK CCCCHHHHHHHHHCCHHHHHCCCCCCCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHC RPAVQVGDPFMEKLLIEACLELIQSDALVGIQDMGAAGLTSSSAEMASKAGMGIEMYLDD CCCEECCCHHHHHHHHHHHHHHHHCCCEEEHHHCCCCCCCCCHHHHHHHCCCCEEEEECC VPQRETGMTPYEMMLSESQERMLIVVKKGREQEIVDLFEKYGLAAVTMGKVTEDKMLRLF CCCCCCCCCHHHHHHCCCCCEEEEEEECCCCHHHHHHHHHCCCEEEEECCCCHHHHHHHH HKGEMVAEVPADALAEEAPIYHKPSKEAAYFAEFQQMKMETPKVENYKETLFALLQQPTI HCCCEEECCCHHHHHHCCCCCCCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCC ASKEWVYDQYDYQVRTSTVVTPGSDAAVVRVRGTEKGLAMTTDCNSRYIYLDPEVGGKIA CCCCCCCCCCCEEEEEEEEECCCCCEEEEEEECCCCCEEEEECCCCEEEEECCCCCCEEE VAEAARNIVCSGGEPLAITDCLNFGNPEKPEIFWQIEKSVDGMSEACRTLQTPVIGGNVS EEHHHCCEEECCCCCEEEEEHHCCCCCCCCEEEEEEHHHHCHHHHHHHHHCCCEECCCEE MYNERSGEAVYPTPTVGMVGLVHDLKHVTTQEFKQAGDLVYVIGETKAEFGGSELQKMLH EECCCCCCEECCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCHHCCHHHHHHHHH GKIFGQSPSIDLDVELKRQKQVLAAIQAGLVQSAHDVAEGGLAVAISESAIGANGLGATV HHHCCCCCCEEEEEEHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCEEE KLDGEATAALFAESQSRFVITVKRENKEAFEKAVEAIQVGEVTNTNEVTIHNEENEVLLT EECCCCEEEEEECCCCCEEEEEECCCHHHHHHHHHHHHHCCCCCCCEEEEECCCCCEEEE ANVDEMRKAWKGAIPCLLK ECHHHHHHHHCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA