Definition Bacillus cereus E33L, complete genome.
Accession NC_006274
Length 5,300,915

The map label for this gene is argS [H]

Identifier: 52140201

GI number: 52140201

Start: 5172459

End: 5174129

Strand: Reverse

Name: argS [H]

Synonym: BCZK5061

Alternate gene names: 52140201

Gene position: 5174129-5172459 (Counterclockwise)

Preceding gene: 52140198

Following gene: 52140200

Centisome position: 97.61
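The coordinate fields above are related by simple arithmetic. The sketch below reproduces the listed gene length and centisome position under two assumptions that the record does not state explicitly: coordinates are 1-based and inclusive, and the centisome is computed from the first coding base of the gene (the higher coordinate, since the gene is on the reverse strand) as a percentage of genome length. The function names are illustrative only.

```python
GENOME_LENGTH = 5_300_915            # "Length" field for NC_006274
START, END = 5_172_459, 5_174_129    # "Start" and "End" fields

def gene_length(start: int, end: int) -> int:
    """Bases spanned by a pair of inclusive coordinates (assumed 1-based)."""
    return abs(end - start) + 1

def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of genome length, to 2 decimals."""
    return round(100 * position / genome_length, 2)

length = gene_length(START, END)
print(length)                           # 1671 bases
print(length // 3 - 1)                  # 556 residues (stop codon excluded)
print(centisome(END, GENOME_LENGTH))    # 97.61
```

Using the lower coordinate instead would give 97.58, which suggests the listed value is derived from the gene's first coding base.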

GC content: 38.18
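The GC content field is presumably the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows; the helper name `gc_content` and the two-decimal rounding are assumptions, not details taken from the record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (including the line breaks of a FASTA block) is ignored.
    """
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100 * gc / len(seq), 2)

# Usage: pass the full gene sequence, with or without line breaks.
print(gc_content("ATGC"))   # 50.0
```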

Gene sequence:

>1671_bases
ATGAATTCTTTAGAACAAGTAAAAGGATTAATTAAAGAAGAAATTCAAGCTGCTGTATTAAAGGCAGAACTAGCGACAGA
AGAACAAATTCCAAATGTTGTATTAGAATCTCCAAAAGATAAAACAAATGGTGACTTCTCTACAAATATGGCAATGCAAC
TTGCACGCGTTGCGAAAAAAGCACCTCGTATGATTGCAGAAGAATTAGTTGCAAACTTCGATAAAGCAAAAGCTTCTATT
GAAAAAATTGAAATCGCAGGTCCTGGTTTCATTAACTTCTACATGGATAATAGCTACTTAACAGACTTAATCCCAACAAT
CGTTAATGCTGGTGAAGCTTATGGTGAAACGAATACTGGTAAAGGTGAAAAAGTACAAGTTGAGTTCGTATCTGCGAACC
CAACAGGTGACCTTCACTTAGGACATGCACGTGGTGCTGCAGTAGGTGACACTTTATGTAATCTATTAGCAAAAGCAGGA
TACGATGTATCTCGTGAGTACTATATTAATGACGCTGGTAACCAAATTCATAACTTAGCTCTTTCTGTTGAAGCTCGTTA
CATGCAAGCTTTAGGCTTAGAGAAAGAAATGCCAGAAGACGGATACCATGGTGCGGACATCATTGGAATCGGTAAACGTT
TAGCAGAAGAGTTTGGCGATCGTTATGCGAAAGCTGATGAAAAAGAAAGCTATGAATTCTACCGTGAGTACGGTTTAAAA
TATGAGTTAGCAAAACTTCAAAAAGACTTAGAAAGCTTCCGTGTTAAATTTGATGTATGGTTCTCAGAAACATCATTATA
CAAAAACGGAAAAATCGATCAAGCTCTTGCGGTATTAAAAGAGCGTGACGAGATCTTTGAAGAAGACGGTGCAACTTGGT
TCCGTTCAATGACTTACGGCGATGATAAAAACCGTGTATTAATTAAAAACGATGGTTCTTACACGTACTTAACGCCAGAT
ATTGCATATCACCGTGATAAATTAGAGCGTGGTTTCGATAAGTTAATCAATATTTGGGGTGCTGACCACCACGGTTACAT
TCCACGTATGAAAGCTGCTATTCAAGCGCTAGGTTACGATAAAGAAACACTTGAAGTAGAAATCATCCAAATGGTACAAC
TATACCAAAACGGTGAAAAAATGAAGATGAGTAAACGTACAGGTAAAGCAGTTACACTTCGTGAGCTTATGGAAGAAGTA
GGCGTGGACGCAATGCGTTACTTCTTCGCAATGCGTAGTGGCGATTCTCATTTAGATTTCGATATGGACTTAGCTGTATC
AAAATCTAATGAAAACCCAGTATACTATGCACAATACGCTCATGCTCGTGTATGCAGTATCCTTCGTCAAGGTGAAGAGT
TAGGATTAGCTACAGGCGGAGACGTTAACTACAAACTTGTTACTTCTGAGAAAGAAGTAGAGTTACTGAAAAAACTTGGT
GAATTCCCAGCAGTAGTTGCAGATGCGGCACAAAAACGTTTACCACACCGTATTACAAACTATGCATTTGAATTAGCTGC
AACATTACACAGCTTCTACAATGCAGAAAAAGTATTAAACCAAGATAACTTAGAATTAAGTAAAGCTCGTTACGAGTTAA
TGAAAGCAGTACGCACTACACTTCAAAACGCATTAGCAATCGTAGGAGTATCTGCACCAGAAAAAATGTAA

Upstream 100 bases:

>100_bases
AAAAGAAAAAAGGACAACTCTTCTTAACGTACGCATTGCTTCTAAGTGAACAAGAAGCTGGCAGATATACAATTACAATT
AATTTGAAGGAGGCAAAATA

Downstream 100 bases:

>100_bases
TGAAGAAAGGCGGAAGCGGCTCGTTCAGAATAGGAGGGGGATGGAGCTCCTGACAGAGAGGCGTACTTTGCCTCGGAGGA
AGGCGCGAAGCCACCGAGTA

Product: arginyl-tRNA synthetase

Products: NA

Alternate protein names: Arginine--tRNA ligase 1; ArgRS 1 [H]

Number of amino acids: Translated: 556; Mature: 556

Protein sequence:

>556_residues
MNSLEQVKGLIKEEIQAAVLKAELATEEQIPNVVLESPKDKTNGDFSTNMAMQLARVAKKAPRMIAEELVANFDKAKASI
EKIEIAGPGFINFYMDNSYLTDLIPTIVNAGEAYGETNTGKGEKVQVEFVSANPTGDLHLGHARGAAVGDTLCNLLAKAG
YDVSREYYINDAGNQIHNLALSVEARYMQALGLEKEMPEDGYHGADIIGIGKRLAEEFGDRYAKADEKESYEFYREYGLK
YELAKLQKDLESFRVKFDVWFSETSLYKNGKIDQALAVLKERDEIFEEDGATWFRSMTYGDDKNRVLIKNDGSYTYLTPD
IAYHRDKLERGFDKLINIWGADHHGYIPRMKAAIQALGYDKETLEVEIIQMVQLYQNGEKMKMSKRTGKAVTLRELMEEV
GVDAMRYFFAMRSGDSHLDFDMDLAVSKSNENPVYYAQYAHARVCSILRQGEELGLATGGDVNYKLVTSEKEVELLKKLG
EFPAVVADAAQKRLPHRITNYAFELAATLHSFYNAEKVLNQDNLELSKARYELMKAVRTTLQNALAIVGVSAPEKM
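
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. The sketch below shows that mapping using the standard codon assignments. The record does not state which translation table was used, so this is an assumption, and `translate` is an illustrative helper rather than the database's own tool.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]   # raises KeyError on non-ACGT codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First nine codons and last five codons of the gene sequence in this record.
print(translate("ATGAATTCTTTAGAACAAGTAAAAGGA"))   # MNSLEQVKG
print(translate("CCAGAAAAAATGTAA"))               # PEKM
```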

Sequences:

>Translated_556_residues
MNSLEQVKGLIKEEIQAAVLKAELATEEQIPNVVLESPKDKTNGDFSTNMAMQLARVAKKAPRMIAEELVANFDKAKASI
EKIEIAGPGFINFYMDNSYLTDLIPTIVNAGEAYGETNTGKGEKVQVEFVSANPTGDLHLGHARGAAVGDTLCNLLAKAG
YDVSREYYINDAGNQIHNLALSVEARYMQALGLEKEMPEDGYHGADIIGIGKRLAEEFGDRYAKADEKESYEFYREYGLK
YELAKLQKDLESFRVKFDVWFSETSLYKNGKIDQALAVLKERDEIFEEDGATWFRSMTYGDDKNRVLIKNDGSYTYLTPD
IAYHRDKLERGFDKLINIWGADHHGYIPRMKAAIQALGYDKETLEVEIIQMVQLYQNGEKMKMSKRTGKAVTLRELMEEV
GVDAMRYFFAMRSGDSHLDFDMDLAVSKSNENPVYYAQYAHARVCSILRQGEELGLATGGDVNYKLVTSEKEVELLKKLG
EFPAVVADAAQKRLPHRITNYAFELAATLHSFYNAEKVLNQDNLELSKARYELMKAVRTTLQNALAIVGVSAPEKM
>Mature_556_residues
MNSLEQVKGLIKEEIQAAVLKAELATEEQIPNVVLESPKDKTNGDFSTNMAMQLARVAKKAPRMIAEELVANFDKAKASI
EKIEIAGPGFINFYMDNSYLTDLIPTIVNAGEAYGETNTGKGEKVQVEFVSANPTGDLHLGHARGAAVGDTLCNLLAKAG
YDVSREYYINDAGNQIHNLALSVEARYMQALGLEKEMPEDGYHGADIIGIGKRLAEEFGDRYAKADEKESYEFYREYGLK
YELAKLQKDLESFRVKFDVWFSETSLYKNGKIDQALAVLKERDEIFEEDGATWFRSMTYGDDKNRVLIKNDGSYTYLTPD
IAYHRDKLERGFDKLINIWGADHHGYIPRMKAAIQALGYDKETLEVEIIQMVQLYQNGEKMKMSKRTGKAVTLRELMEEV
GVDAMRYFFAMRSGDSHLDFDMDLAVSKSNENPVYYAQYAHARVCSILRQGEELGLATGGDVNYKLVTSEKEVELLKKLG
EFPAVVADAAQKRLPHRITNYAFELAATLHSFYNAEKVLNQDNLELSKARYELMKAVRTTLQNALAIVGVSAPEKM

Specific function: Unknown

COG id: COG0018

COG function: function code J; Arginyl-tRNA synthetase

Gene ontology:

Cell location: Cytoplasm [H]

Metabolic importance: Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the class-I aminoacyl-tRNA synthetase family [H]

Homologues:

Organism=Homo sapiens, GI15149476, Length=607, Percent_Identity=25.2059308072488, Blast_Score=122, Evalue=1e-27,
Organism=Escherichia coli, GI1788184, Length=600, Percent_Identity=25.3333333333333, Blast_Score=139, Evalue=4e-34,
Organism=Caenorhabditis elegans, GI71985061, Length=582, Percent_Identity=24.0549828178694, Blast_Score=110, Evalue=1e-24,
Organism=Caenorhabditis elegans, GI71985068, Length=537, Percent_Identity=24.3947858472998, Blast_Score=105, Evalue=5e-23,
Organism=Saccharomyces cerevisiae, GI6320548, Length=551, Percent_Identity=25.0453720508167, Blast_Score=138, Evalue=2e-33,
Organism=Saccharomyces cerevisiae, GI6321883, Length=572, Percent_Identity=22.3776223776224, Blast_Score=114, Evalue=4e-26,
Organism=Drosophila melanogaster, GI18859963, Length=588, Percent_Identity=24.4897959183673, Blast_Score=115, Evalue=1e-25,
Organism=Drosophila melanogaster, GI28571570, Length=493, Percent_Identity=23.9350912778905, Blast_Score=75, Evalue=2e-13,
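
Each homologue line above is a comma-separated list of `key=value` fields, with the GI number given as a bare `GI…` token. A small parser for that layout might look like the sketch below. The function name and the choice to store the GI under a `"GI"` key are assumptions made for illustration.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict of string fields."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:                 # skip the empty field after a trailing comma
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key.strip()] = value.strip()
        elif field.startswith("GI"):  # bare GI token, e.g. GI15149476
            record["GI"] = field[2:]
    return record

hit = parse_homologue(
    "Organism=Homo sapiens, GI15149476, Length=607, "
    "Percent_Identity=25.2059308072488, Blast_Score=122, Evalue=1e-27,"
)
print(hit["Organism"], hit["GI"], float(hit["Evalue"]))
```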

Paralogues:

None

Copy number: 789 molecules/cell (growth phase, glucose-minimal MOPS medium); 800 molecules/cell (glucose minimal medium) [C]

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001412
- InterPro:   IPR001278
- InterPro:   IPR015945
- InterPro:   IPR005148
- InterPro:   IPR008909
- InterPro:   IPR014729
- InterPro:   IPR009080 [H]

Pfam domain/function: PF03485 Arg_tRNA_synt_N; PF05746 DALR_1; PF00750 tRNA-synt_1d [H]

EC number: 6.1.1.19 [H]

Molecular weight: Translated: 62503; Mature: 62503

Theoretical pI: Translated: 5.07; Mature: 5.07

Prosite motif: PS00178 AA_TRNA_LIGASE_I

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.4 %Cys     (Translated Protein)
3.2 %Met     (Translated Protein)
3.6 %Cys+Met (Translated Protein)
0.4 %Cys     (Mature Protein)
3.2 %Met     (Mature Protein)
3.6 %Cys+Met (Mature Protein)
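
The percentages above appear to be residue counts divided by protein length. The sketch below illustrates the calculation; `residue_percent` is a hypothetical helper and the one-decimal rounding is an assumption. For the 556-residue sequence in this record, the listed values are consistent with 2 Cys and 18 Met residues.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(protein.count(r) for r in residues.upper())
    return round(100 * hits / len(protein), 1)

# Usage on a short illustrative peptide (not the full record sequence).
peptide = "MCAAAAAAAA"
print(residue_percent(peptide, "C"))    # 10.0
print(residue_percent(peptide, "CM"))   # 20.0

# The same ratio from counts: 2 Cys and 18 Met in 556 residues.
print(round(100 * 2 / 556, 1), round(100 * 18 / 556, 1), round(100 * 20 / 556, 1))
```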

Secondary structure:

>Translated Secondary Structure
MNSLEQVKGLIKEEIQAAVLKAELATEEQIPNVVLESPKDKTNGDFSTNMAMQLARVAKK
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHCCCCCCCCCCCHHHHHHHHHHHHHH
APRMIAEELVANFDKAKASIEKIEIAGPGFINFYMDNSYLTDLIPTIVNAGEAYGETNTG
HHHHHHHHHHHHHHHHHHHHHEEEECCCCEEEEEECCCHHHHHHHHHHHCCHHCCCCCCC
KGEKVQVEFVSANPTGDLHLGHARGAAVGDTLCNLLAKAGYDVSREYYINDAGNQIHNLA
CCCEEEEEEEECCCCCCEEECCCCCCHHHHHHHHHHHHCCCCCCCEEEECCCCCHHHHHH
LSVEARYMQALGLEKEMPEDGYHGADIIGIGKRLAEEFGDRYAKADEKESYEFYREYGLK
HHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCC
YELAKLQKDLESFRVKFDVWFSETSLYKNGKIDQALAVLKERDEIFEEDGATWFRSMTYG
HHHHHHHHHHHHHEEEEEEEECCHHHCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHCCCC
DDKNRVLIKNDGSYTYLTPDIAYHRDKLERGFDKLINIWGADHHGYIPRMKAAIQALGYD
CCCCEEEEEECCCEEEECCCHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHCCC
KETLEVEIIQMVQLYQNGEKMKMSKRTGKAVTLRELMEEVGVDAMRYFFAMRSGDSHLDF
CHHHHHHHHHHHHHHHCCCCEEHHHHCCCHHHHHHHHHHHCHHHHHHHHHHCCCCCCCCC
DMDLAVSKSNENPVYYAQYAHARVCSILRQGEELGLATGGDVNYKLVTSEKEVELLKKLG
CEEEEEECCCCCCEEEEHHHHHHHHHHHHCCHHCCCCCCCCCEEEEECCHHHHHHHHHHC
EFPAVVADAAQKRLPHRITNYAFELAATLHSFYNAEKVLNQDNLELSKARYELMKAVRTT
CCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHH
LQNALAIVGVSAPEKM
HHHHHEEEECCCCCCC
>Mature Secondary Structure
MNSLEQVKGLIKEEIQAAVLKAELATEEQIPNVVLESPKDKTNGDFSTNMAMQLARVAKK
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHCCCCCCCCCCCHHHHHHHHHHHHHH
APRMIAEELVANFDKAKASIEKIEIAGPGFINFYMDNSYLTDLIPTIVNAGEAYGETNTG
HHHHHHHHHHHHHHHHHHHHHEEEECCCCEEEEEECCCHHHHHHHHHHHCCHHCCCCCCC
KGEKVQVEFVSANPTGDLHLGHARGAAVGDTLCNLLAKAGYDVSREYYINDAGNQIHNLA
CCCEEEEEEEECCCCCCEEECCCCCCHHHHHHHHHHHHCCCCCCCEEEECCCCCHHHHHH
LSVEARYMQALGLEKEMPEDGYHGADIIGIGKRLAEEFGDRYAKADEKESYEFYREYGLK
HHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCC
YELAKLQKDLESFRVKFDVWFSETSLYKNGKIDQALAVLKERDEIFEEDGATWFRSMTYG
HHHHHHHHHHHHHEEEEEEEECCHHHCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHCCCC
DDKNRVLIKNDGSYTYLTPDIAYHRDKLERGFDKLINIWGADHHGYIPRMKAAIQALGYD
CCCCEEEEEECCCEEEECCCHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHCCC
KETLEVEIIQMVQLYQNGEKMKMSKRTGKAVTLRELMEEVGVDAMRYFFAMRSGDSHLDF
CHHHHHHHHHHHHHHHCCCCEEHHHHCCCHHHHHHHHHHHCHHHHHHHHHHCCCCCCCCC
DMDLAVSKSNENPVYYAQYAHARVCSILRQGEELGLATGGDVNYKLVTSEKEVELLKKLG
CEEEEEECCCCCCEEEEHHHHHHHHHHHHCCHHCCCCCCCCCEEEEECCHHHHHHHHHHC
EFPAVVADAAQKRLPHRITNYAFELAATLHSFYNAEKVLNQDNLELSKARYELMKAVRTT
CCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHH
LQNALAIVGVSAPEKM
HHHHHEEEECCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 12721629 [H]