Definition | Bacillus anthracis str. CDC 684, complete genome. |
---|---|
Accession | NC_012581 |
Length | 5,230,115 |
The map label for this gene is asnO3 [H]
Identifier: 227814934
GI number: 227814934
Start: 2137746
End: 2139647
Strand: Reverse
Name: asnO3 [H]
Synonym: BAMEG_2345
Alternate gene names: 227814934
Gene position: 2139647-2137746 (Counterclockwise)
Preceding gene: 227814936
Following gene: 227814933
Centisome position: 40.91
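The coordinate fields above are related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming that the centisome position is the gene's first transcribed base (the End coordinate for a reverse-strand gene) expressed as a percentage of the genome length. The record does not state this formula; it is an assumption that happens to reproduce the listed values. The function names are hypothetical.

```python
GENOME_LENGTH = 5_230_115  # Length field of NC_012581


def gene_length(start, end):
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start, end, strand, genome_length=GENOME_LENGTH):
    """Position of the gene's first base as a percentage of the genome.

    Assumption: for a reverse-strand gene the first base is the End coordinate.
    """
    first_base = end if strand == "Reverse" else start
    return round(100 * first_base / genome_length, 2)


print(gene_length(2137746, 2139647))           # 1902
print(centisome(2137746, 2139647, "Reverse"))  # 40.91
```

Under these assumptions, 1902 bases correspond to 634 codons, that is, 633 residues plus one stop codon.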
GC content: 36.59
Gene sequence:
>1902_bases ATGTGTGGTTTTGTAGGATGTTTATGTGAAAACCCTAGAGAGTTTTCAGAAACAGAAAAACATCAATTTGAAAATATGAA CACGATGATTTTCCACCGTGGTCCAGATGACGAAGGATATTTTCGTGATGAACATGTACAATTTGGCTTCCGCCGTTTAA GTATCATTGACTTAGAGGCAGGACATCAGCCGCTAACTTATGAAAATGATCGATATGTAATTATTTTTAATGGTGAAATT TACAACTATGTAGAATTACGTGAAATGTTACTTGAAAAAGGTGCAACGTTTGCAACGCAATCTGATACAGAAGTTATCAT TGCATTGTATGCACATATGAAAGAAAAATGTGTAGACTACCTTCGTGGTATGTTTGCATTTATGATTTGGGATCGTGAAG AGAAGAAACTTTTCGGAGCACGTGATCACTTCGGTATTAAACCTTTATACATCGCACAACAAGGTGATACTACATTCTTC GCATCTGAGAAGAAAAGTATTATGCATGTGATGGAAGATAAAGGCGTTAATCCAACGTCACTACAACATTACTTTACGTA TCAATATGGTCCAGAGCCAGAAACATTAACAATTGATGTTAATAAAATCGAGCCTGGTCATTATTTCGTAAAAGAAATCG GTAAAGAGATGGAAATCCATCGCTACTGGAAACCTTATTTCAATGCTTCAAGTGCAACGAAAGAGGAACATATCCAAGCG ATTCGTGATGTGTTATATGATTCAGTAAAAGTGCATATGCGCAGTGATGTACCAGTAGGTTCATTCTTATCTGGTGGTAT CGATTCATCTATTATCGCTTCTATTGCAAGAGAAATGAATCCAAATCTTTTAACATTCTCTGTTGGTTTTGAGCAACGTG GTTTTAGTGAAGTTGATGTTGCGAAAGAAACTGCTGAGAAATTAGGCGTTAAAAACCATAACGTATTCATTTCAGCGAAA GAGTTTATGGATGAGTTCCCAAAAATCATTTGGCATATGGATGATCCTTTAGCAGATCCGGCAGCTGTACCATTGTACTT CGTTGCAAAAGAAGCACGTAAACATGTAACAGTTGTTCTTTCAGGTGAAGGTGCAGACGAGCTATTTGGTGGTTATAACA TTTACCGTGAGCCAAACTCACTGAAAATGTTCTCTTACATTCCTAGCCCAGGTAAGAGCGTTCTAAAAGCATTAAGTGGT GCTCTTAAAGAAGGCTTTAAAGGAAAGAGCTTCCTAGAGCGTGGATGTACGCCAATTGAAGAGCGTTACTATGGAAACGC GAAAATCTTCCGTGAAGAAGAGAAAGCTGAATTAATGAAGTATTACAATGAAAGTGTTAACTATATGGATATCACGAAAC CATTGTATAACGAAATTAAAGATTATGATGATGTAAGTAAAATGCAGTACATTGACATGTTCACATGGTTACGCGGTGAC ATTTTATTAAAAGCTGATAAAATGACAATGGCGAATTCATTAGAACTTCGTGTACCGTTCTTAGATAAAGAAGTATTCGA TGTTGCATCTAAAATTCCAACTGAATTTAAGATTGCTAACGGAACTACGAAAGCTATTTTACGTGAAGCAGCACGCGGAA TCGTTCCAGATCACGTATTAGATCGTAAAAAACTTGGATTCCCAGTACCAATTCGTCACTGGCTAAAAGACGAAATGCAT GATTGGGCTATTAATATTATAAACGAAAGTAAGACAGAGCATTTAATCGACAAACAGTATGTATTAAACTTACTGGAAGC ACATTGTGCAGATAAAGGCGATTATAGCCGTAAAATTTGGACTGTACTTGCATTTATGGTATGGCATCAAATTTATGTTG AGCATAAATACGATACGAATAAGTTCCACGAAGAAACGAAACGTGCGTATAGCTTAGTTTAA
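The GC content field can be recomputed from the gene sequence. This is a minimal sketch, assuming GC content is the percentage of G and C bases over the full gene length; `gc_content` is a hypothetical helper, and whitespace is stripped because the sequence above is wrapped with spaces.

```python
def gc_content(seq):
    """Percentage of G and C bases, rounded to two decimals."""
    seq = "".join(seq.split()).upper()  # drop line-wrap whitespace
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return round(100 * gc / len(seq), 2)


# First three codons of the gene sequence above, written with wrap spaces
print(gc_content("ATG TGT GGT"))
```

Passing the full 1902-base sequence to the same function should give a value to compare against the reported 36.59.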
Upstream 100 bases:
>100_bases TTATAGTTTTTGGTCGCCATATTCAGTTGATTATGCTAGTATAAATAGGGTTGATATAGTAGTATAGAGACAGAATGAAA ACTGAAAGAGGGTAAGAAGT
Downstream 100 bases:
>100_bases TTAGAAAAACCTTAAGATCGCATACGGTCTTAAGGTTTTTTTCATAAATAGGACGTATTATTGATACAATAATAGTAAAA GGATGGACAACATATGATTA
Product: asparagine synthetase, glutamine-hydrolyzing
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 633; Mature: 633
Protein sequence:
>633_residues MCGFVGCLCENPREFSETEKHQFENMNTMIFHRGPDDEGYFRDEHVQFGFRRLSIIDLEAGHQPLTYENDRYVIIFNGEI YNYVELREMLLEKGATFATQSDTEVIIALYAHMKEKCVDYLRGMFAFMIWDREEKKLFGARDHFGIKPLYIAQQGDTTFF ASEKKSIMHVMEDKGVNPTSLQHYFTYQYGPEPETLTIDVNKIEPGHYFVKEIGKEMEIHRYWKPYFNASSATKEEHIQA IRDVLYDSVKVHMRSDVPVGSFLSGGIDSSIIASIAREMNPNLLTFSVGFEQRGFSEVDVAKETAEKLGVKNHNVFISAK EFMDEFPKIIWHMDDPLADPAAVPLYFVAKEARKHVTVVLSGEGADELFGGYNIYREPNSLKMFSYIPSPGKSVLKALSG ALKEGFKGKSFLERGCTPIEERYYGNAKIFREEEKAELMKYYNESVNYMDITKPLYNEIKDYDDVSKMQYIDMFTWLRGD ILLKADKMTMANSLELRVPFLDKEVFDVASKIPTEFKIANGTTKAILREAARGIVPDHVLDRKKLGFPVPIRHWLKDEMH DWAINIINESKTEHLIDKQYVLNLLEAHCADKGDYSRKIWTVLAFMVWHQIYVEHKYDTNKFHEETKRAYSLV
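The protein sequence follows from the gene sequence by codon translation. This is a minimal sketch, assuming the standard codon assignments (the bacterial table differs from the standard one only in permitted start codons, and this gene starts with ATG) and assuming translation stops at the first in-frame stop codon. `CODON_TABLE` and `translate` are hypothetical names.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...)
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a coding sequence up to the first in-frame stop codon."""
    dna = "".join(dna.split()).upper()  # drop line-wrap whitespace
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTGTGGTTTTGTAGGATGT"))  # MCGFVGC (first 7 codons of the gene)
print(translate("AGCTTAGTTTAA"))           # SLV (last 4 codons, ending in the TAA stop)
```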
Sequences:
>Translated_633_residues MCGFVGCLCENPREFSETEKHQFENMNTMIFHRGPDDEGYFRDEHVQFGFRRLSIIDLEAGHQPLTYENDRYVIIFNGEI YNYVELREMLLEKGATFATQSDTEVIIALYAHMKEKCVDYLRGMFAFMIWDREEKKLFGARDHFGIKPLYIAQQGDTTFF ASEKKSIMHVMEDKGVNPTSLQHYFTYQYGPEPETLTIDVNKIEPGHYFVKEIGKEMEIHRYWKPYFNASSATKEEHIQA IRDVLYDSVKVHMRSDVPVGSFLSGGIDSSIIASIAREMNPNLLTFSVGFEQRGFSEVDVAKETAEKLGVKNHNVFISAK EFMDEFPKIIWHMDDPLADPAAVPLYFVAKEARKHVTVVLSGEGADELFGGYNIYREPNSLKMFSYIPSPGKSVLKALSG ALKEGFKGKSFLERGCTPIEERYYGNAKIFREEEKAELMKYYNESVNYMDITKPLYNEIKDYDDVSKMQYIDMFTWLRGD ILLKADKMTMANSLELRVPFLDKEVFDVASKIPTEFKIANGTTKAILREAARGIVPDHVLDRKKLGFPVPIRHWLKDEMH DWAINIINESKTEHLIDKQYVLNLLEAHCADKGDYSRKIWTVLAFMVWHQIYVEHKYDTNKFHEETKRAYSLV >Mature_633_residues MCGFVGCLCENPREFSETEKHQFENMNTMIFHRGPDDEGYFRDEHVQFGFRRLSIIDLEAGHQPLTYENDRYVIIFNGEI YNYVELREMLLEKGATFATQSDTEVIIALYAHMKEKCVDYLRGMFAFMIWDREEKKLFGARDHFGIKPLYIAQQGDTTFF ASEKKSIMHVMEDKGVNPTSLQHYFTYQYGPEPETLTIDVNKIEPGHYFVKEIGKEMEIHRYWKPYFNASSATKEEHIQA IRDVLYDSVKVHMRSDVPVGSFLSGGIDSSIIASIAREMNPNLLTFSVGFEQRGFSEVDVAKETAEKLGVKNHNVFISAK EFMDEFPKIIWHMDDPLADPAAVPLYFVAKEARKHVTVVLSGEGADELFGGYNIYREPNSLKMFSYIPSPGKSVLKALSG ALKEGFKGKSFLERGCTPIEERYYGNAKIFREEEKAELMKYYNESVNYMDITKPLYNEIKDYDDVSKMQYIDMFTWLRGD ILLKADKMTMANSLELRVPFLDKEVFDVASKIPTEFKIANGTTKAILREAARGIVPDHVLDRKKLGFPVPIRHWLKDEMH DWAINIINESKTEHLIDKQYVLNLLEAHCADKGDYSRKIWTVLAFMVWHQIYVEHKYDTNKFHEETKRAYSLV
Specific function: Main asparagine synthetase in vegetative cells [H]
COG id: COG0367
COG function: function code E; Asparagine synthase (glutamine-hydrolyzing)
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 glutamine amidotransferase type-2 domain [H]
Homologues:
Organism=Homo sapiens, GI168229248, Length=392, Percent_Identity=28.3163265306122, Blast_Score=127, Evalue=2e-29
Organism=Homo sapiens, GI168229252, Length=392, Percent_Identity=28.3163265306122, Blast_Score=127, Evalue=2e-29
Organism=Homo sapiens, GI168229250, Length=392, Percent_Identity=28.3163265306122, Blast_Score=127, Evalue=2e-29
Organism=Homo sapiens, GI296010848, Length=392, Percent_Identity=28.3163265306122, Blast_Score=127, Evalue=3e-29
Organism=Homo sapiens, GI296010852, Length=319, Percent_Identity=27.2727272727273, Blast_Score=96, Evalue=1e-19
Organism=Homo sapiens, GI296010850, Length=319, Percent_Identity=27.2727272727273, Blast_Score=96, Evalue=1e-19
Organism=Escherichia coli, GI1786889, Length=403, Percent_Identity=31.2655086848635, Blast_Score=182, Evalue=8e-47
Organism=Caenorhabditis elegans, GI71993933, Length=389, Percent_Identity=33.1619537275064, Blast_Score=165, Evalue=5e-41
Organism=Caenorhabditis elegans, GI25147557, Length=389, Percent_Identity=33.1619537275064, Blast_Score=165, Evalue=6e-41
Organism=Caenorhabditis elegans, GI17560178, Length=377, Percent_Identity=30.2387267904509, Blast_Score=120, Evalue=3e-27
Organism=Caenorhabditis elegans, GI25147560, Length=332, Percent_Identity=30.7228915662651, Blast_Score=117, Evalue=2e-26
Organism=Saccharomyces cerevisiae, GI6321563, Length=428, Percent_Identity=28.7383177570093, Blast_Score=158, Evalue=2e-39
Organism=Saccharomyces cerevisiae, GI6325403, Length=432, Percent_Identity=28.2407407407407, Blast_Score=153, Evalue=6e-38
Organism=Drosophila melanogaster, GI45553209, Length=404, Percent_Identity=30.1980198019802, Blast_Score=169, Evalue=4e-42
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR006426
- InterPro: IPR001962
- InterPro: IPR017932
- InterPro: IPR014729 [H]
Pfam domain/function: PF00733 Asn_synthase [H]
EC number: 6.3.5.4 [H]
Molecular weight: Translated: 73413; Mature: 73413
Theoretical pI: Translated: 5.96; Mature: 5.96
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.9 %Cys (Translated Protein)
3.6 %Met (Translated Protein)
4.6 %Cys+Met (Translated Protein)
0.9 %Cys (Mature Protein)
3.6 %Met (Mature Protein)
4.6 %Cys+Met (Mature Protein)
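The Cys/Met percentages are residue counts divided by protein length. This is a minimal sketch, assuming one-letter residue codes and rounding to one decimal as in the fields above; `residue_percent` is a hypothetical helper.

```python
def residue_percent(protein, residues):
    """Percentage of the given one-letter residues in a protein sequence."""
    protein = "".join(protein.split()).upper()  # drop line-wrap whitespace
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in residues)
    return round(100 * count / len(protein), 1)


# First ten residues of the protein sequence, used only as a small example
fragment = "MCGFVGCLCE"
print(residue_percent(fragment, "C"))   # 30.0
print(residue_percent(fragment, "M"))   # 10.0
print(residue_percent(fragment, "CM"))  # 40.0
```

Applied to the full 633-residue sequence, the same function gives values to compare against the reported 0.9, 3.6, and 4.6.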
Secondary structure:
>Translated Secondary Structure MCGFVGCLCENPREFSETEKHQFENMNTMIFHRGPDDEGYFRDEHVQFGFRRLSIIDLEA CCCCHHHHCCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCHHHHHHCCEEEEEEEECC GHQPLTYENDRYVIIFNGEIYNYVELREMLLEKGATFATQSDTEVIIALYAHMKEKCVDY CCCCCEECCCEEEEEECCCCHHHHHHHHHHHHCCCCEECCCCCHHHHHHHHHHHHHHHHH LRGMFAFMIWDREEKKLFGARDHFGIKPLYIAQQGDTTFFASEKKSIMHVMEDKGVNPTS HHHHHHHHHCCCHHHHHCCCCCCCCCCEEEEEECCCCEEECCHHHHHHHHHHHCCCCCCC LQHYFTYQYGPEPETLTIDVNKIEPGHYFVKEIGKEMEIHRYWKPYFNASSATKEEHIQA EEEEEEEEECCCCCEEEEEEEECCCCHHHHHHHCCHHHHHHHHHHHCCCCCCCHHHHHHH IRDVLYDSVKVHMRSDVPVGSFLSGGIDSSIIASIAREMNPNLLTFSVGFEQRGFSEVDV HHHHHHHHHHEEEECCCCHHHHHHCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHH AKETAEKLGVKNHNVFISAKEFMDEFPKIIWHMDDPLADPAAVPLYFVAKEARKHVTVVL HHHHHHHCCCCCCEEEEEHHHHHHHHHHHEEECCCCCCCCCCCHHHHHHHHCCCEEEEEE SGEGADELFGGYNIYREPNSLKMFSYIPSPGKSVLKALSGALKEGFKGKSFLERGCTPIE ECCCHHHHHCCCEEEECCCCEEEEEECCCCHHHHHHHHHHHHHCCCCCHHHHHHCCCCHH ERYYGNAKIFREEEKAELMKYYNESVNYMDITKPLYNEIKDYDDVSKMQYIDMFTWLRGD HHHCCCHHEECCHHHHHHHHHHHCCCCEEECCHHHHHHHHCCHHHHHHHHHHHHHHHCCC ILLKADKMTMANSLELRVPFLDKEVFDVASKIPTEFKIANGTTKAILREAARGIVPDHVL EEEEECCEECCCCCEEEECCCCHHHHHHHHHCCCCEEECCCHHHHHHHHHHHCCCCHHHH DRKKLGFPVPIRHWLKDEMHDWAINIINESKTEHLIDKQYVLNLLEAHCADKGDYSRKIW HHHHCCCCCCHHHHHHHHHHHHEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCCCHHHHH TVLAFMVWHQIYVEHKYDTNKFHEETKRAYSLV HHHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCC >Mature Secondary Structure MCGFVGCLCENPREFSETEKHQFENMNTMIFHRGPDDEGYFRDEHVQFGFRRLSIIDLEA CCCCHHHHCCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCHHHHHHCCEEEEEEEECC GHQPLTYENDRYVIIFNGEIYNYVELREMLLEKGATFATQSDTEVIIALYAHMKEKCVDY CCCCCEECCCEEEEEECCCCHHHHHHHHHHHHCCCCEECCCCCHHHHHHHHHHHHHHHHH LRGMFAFMIWDREEKKLFGARDHFGIKPLYIAQQGDTTFFASEKKSIMHVMEDKGVNPTS HHHHHHHHHCCCHHHHHCCCCCCCCCCEEEEEECCCCEEECCHHHHHHHHHHHCCCCCCC LQHYFTYQYGPEPETLTIDVNKIEPGHYFVKEIGKEMEIHRYWKPYFNASSATKEEHIQA EEEEEEEEECCCCCEEEEEEEECCCCHHHHHHHCCHHHHHHHHHHHCCCCCCCHHHHHHH IRDVLYDSVKVHMRSDVPVGSFLSGGIDSSIIASIAREMNPNLLTFSVGFEQRGFSEVDV HHHHHHHHHHEEEECCCCHHHHHHCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHH 
AKETAEKLGVKNHNVFISAKEFMDEFPKIIWHMDDPLADPAAVPLYFVAKEARKHVTVVL HHHHHHHCCCCCCEEEEEHHHHHHHHHHHEEECCCCCCCCCCCHHHHHHHHCCCEEEEEE SGEGADELFGGYNIYREPNSLKMFSYIPSPGKSVLKALSGALKEGFKGKSFLERGCTPIE ECCCHHHHHCCCEEEECCCCEEEEEECCCCHHHHHHHHHHHHHCCCCCHHHHHHCCCCHH ERYYGNAKIFREEEKAELMKYYNESVNYMDITKPLYNEIKDYDDVSKMQYIDMFTWLRGD HHHCCCHHEECCHHHHHHHHHHHCCCCEEECCHHHHHHHHCCHHHHHHHHHHHHHHHCCC ILLKADKMTMANSLELRVPFLDKEVFDVASKIPTEFKIANGTTKAILREAARGIVPDHVL EEEEECCEECCCCCEEEECCCCHHHHHHHHHCCCCEEECCCHHHHHHHHHHHCCCCHHHH DRKKLGFPVPIRHWLKDEMHDWAINIINESKTEHLIDKQYVLNLLEAHCADKGDYSRKIW HHHHCCCCCCHHHHHHHHHHHHEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCCCHHHHH TVLAFMVWHQIYVEHKYDTNKFHEETKRAYSLV HHHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9387221; 9384377; 8755891; 10498721 [H]