Definition Clostridium botulinum A str. ATCC 3502, complete genome.
Accession NC_009495
Length 3,886,916

The map label for this gene is argH

Identifier: 148380627

GI number: 148380627

Start: 2817786

End: 2819108

Strand: Reverse

Name: argH

Synonym: CBO2669

Alternate gene names: 148380627

Gene position: 2819108-2817786 (Counterclockwise)

Preceding gene: 148380628

Following gene: 148380626

Centisome position: 72.53
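
The coordinate fields above agree with one another, which a short calculation can confirm. This is a minimal sketch, assuming 1-based inclusive coordinates and assuming that the centisome position is the gene's first base on its coding strand (the End coordinate here, since the gene is on the reverse strand) expressed as a percentage of the genome length. The function names are illustrative and do not belong to any database API.

```python
GENOME_LENGTH = 3_886_916          # Length field of NC_009495
START, END = 2_817_786, 2_819_108  # Start and End fields for argH


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, rounded to 2 places."""
    return round(100 * position / genome_length, 2)


length_bp = gene_length(START, END)
# Assumption: for a reverse-strand gene the reference position is End.
centisome_pos = centisome(END, GENOME_LENGTH)
print(length_bp, centisome_pos)  # 1323 72.53
```

The 1323 bases correspond to 441 codons: 440 amino acids plus the stop codon.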

GC content: 28.87

Gene sequence:

>1323_bases
ATGAAACTTTGGGGAGGACGTTTTAAGGAAGAAGAAAGCAAACTTATGGAGGACTTTAATAGTTCTCTAAGTTTTGATAA
AAAACTTTATTATGAAGATATAAAAGGAAGCATAGCTCATGTTAAAATGCTTACGAATCAAAATATAATAAAGGAAGAAG
AAAAAGAAAAAATATTGCTTGGGTTAGAGGAAATATTAAAAGAAATAGATGAAGGGATTTTAAAAATAGAGGGAGACTAT
GAGGATATTCATAGCTTTGTGGAAATAAATTTAATAAACAAAATAGGAAATGTGGGAAAAAAGCTTCATACGGGAAGAAG
TAGAAATGACCAAGTAGCCTTAGATATGAAATTATATGCTAAAAAATCCACGGAAGAAGTAATAGAATGCTTAAAGGAAC
TTATGGATTCTTTAATTAAAGTTGGAAATGAAAATAATTATATTATGCCAGGATATACTCATCTTCAAAGAGCTCAAGTG
GTAACTTTTAGGTATCATTTGTTAGCTTATTTTGAAATGTTTAAAAGAGATGAGAAAAGATTAGAAAATGCCTTAGAGAT
TTTAAATGAAAGTCCTTTAGGATCAGGAGCCTTAGCGGGAAGTACTTATAACATAGATAAAGAATATACTGCTAAGTTAT
TGGGTTTTAGAAAACCGGTAGATAATTTTTTAGATGGAGTTAGTGATAGGGATTATATAATAGAACTTATAAGTAAGTTT
TCTATAATAATGATGCATTTAAGTAGATTATCTGAAGAACTTATACTTTGGAGTAGTAGTGAATTTAGGTTTATACAAAT
AGGAGATGCTTATTCCACAGGCAGTAGTATAATGCCTCAAAAGAAAAACCCAGATGGGGCGGAACTTATACGCGGGAAAA
TTGGAAGAGTATATGGGGACTTAATAAGTATATTAACAGTTATGAAATCATTACCATTAGCTTATAATAAAGATATGCAA
GAGGATAAAGAACCTTTCTTTGATGCAAAAGATACTGTAATAAGCTGTTTAAAAGTAATGGAAGGTATAATATCTACTCT
AAAAGTAAATAAAGAAAATTTAATGAAATCTGTGAAGAAAGGATTTTTAAATGCCACAGAAGCAGCAGATTATTTAGTAA
ATAAAGGAATGGCTTTTAGAGATGCACATAAAGTTATAGGTGAAGTTGTAATATACTGTGAGGATAAAAATTCAGCTATA
GAGGATTTATCCTTAGAAGAATTAAAACAATTTTCAGATCTATTTTGTGAGGATATTTATGAATTTATAGATTATAAGAA
TTCTATAAACAAAGGGATAAAAAAAGAAATGGGATACTTTTAA
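
A GC content value such as the one listed for this gene can be recomputed from the sequence itself. The sketch below assumes GC content is defined as the percentage of G and C bases over the full gene sequence, rounded to two decimals; `gc_percent` is a hypothetical helper name, and the short strings are toy inputs rather than the gene.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    # Drop line breaks and other whitespace from FASTA-style wrapped text.
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100 * gc / len(seq), 2)


print(gc_percent("ATGC"))       # toy input, half of the bases are G or C
print(gc_percent("AT\nGC\n"))   # wrapped input gives the same result
```

Pasting the wrapped 1323-base sequence into `gc_percent` should give a value close to the GC content field, provided the record uses the same definition.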

Upstream 100 bases:

>100_bases
TTACAGTCATAAAGATGCAGAAGGCTTTATAAATTTATTTGGATTACCATCCAAAATAAAAGCGTTAAAAAATTTCTAGA
CTAAATTGGAGGAACATACT

Downstream 100 bases:

>100_bases
GAGGATTATAATTTATAGTCTTTAAAGGTTGAAAACTAAAGCAGAAATACTTTTAGTCAATAGATAGCCATGAAATAAAG
AATACTAATAATATTAGTTC

Product: argininosuccinate lyase

Products: NA

Alternate protein names: ASAL; Arginosuccinase

Number of amino acids: Translated: 440; Mature: 440

Protein sequence:

>440_residues
MKLWGGRFKEEESKLMEDFNSSLSFDKKLYYEDIKGSIAHVKMLTNQNIIKEEEKEKILLGLEEILKEIDEGILKIEGDY
EDIHSFVEINLINKIGNVGKKLHTGRSRNDQVALDMKLYAKKSTEEVIECLKELMDSLIKVGNENNYIMPGYTHLQRAQV
VTFRYHLLAYFEMFKRDEKRLENALEILNESPLGSGALAGSTYNIDKEYTAKLLGFRKPVDNFLDGVSDRDYIIELISKF
SIIMMHLSRLSEELILWSSSEFRFIQIGDAYSTGSSIMPQKKNPDGAELIRGKIGRVYGDLISILTVMKSLPLAYNKDMQ
EDKEPFFDAKDTVISCLKVMEGIISTLKVNKENLMKSVKKGFLNATEAADYLVNKGMAFRDAHKVIGEVVIYCEDKNSAI
EDLSLEELKQFSDLFCEDIYEFIDYKNSINKGIKKEMGYF
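
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming the standard codon assignments (which the bacterial translation table shares) and a sequence already given in the coding direction; `CODON_TABLE` and `translate` are illustrative names, not part of any library.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # tolerate wrapped FASTA lines
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 and last 12 bases of the gene sequence in this record.
first = translate("ATGAAACTTTGGGGAGGACGTTTTAAG")
last = translate("GGATACTTTTAA")
print(first, last)  # MKLWGGRFK GYF
```

Both fragments match the start and end of the 440-residue protein listed above.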

Specific function: Arginine biosynthesis; eighth (last) step. [C]

COG id: COG0165

COG function: function code E; Argininosuccinate lyase

Gene ontology:

Cell location: Cytoplasm

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the lyase 1 family. Argininosuccinate lyase subfamily

Homologues:

Organism=Homo sapiens, GI31541964, Length=431, Percent_Identity=42.2273781902552, Blast_Score=367, Evalue=1e-101
Organism=Homo sapiens, GI68303542, Length=431, Percent_Identity=42.2273781902552, Blast_Score=367, Evalue=1e-101
Organism=Homo sapiens, GI68303549, Length=431, Percent_Identity=40.3712296983759, Blast_Score=340, Evalue=1e-93
Organism=Homo sapiens, GI68303547, Length=431, Percent_Identity=40.1392111368909, Blast_Score=336, Evalue=2e-92
Organism=Escherichia coli, GI1790398, Length=437, Percent_Identity=47.1395881006865, Blast_Score=429, Evalue=1e-121
Organism=Saccharomyces cerevisiae, GI6321806, Length=434, Percent_Identity=45.1612903225806, Blast_Score=364, Evalue=1e-101
Organism=Drosophila melanogaster, GI221473854, Length=431, Percent_Identity=40.3712296983759, Blast_Score=340, Evalue=1e-93
Organism=Drosophila melanogaster, GI78706858, Length=431, Percent_Identity=40.3712296983759, Blast_Score=340, Evalue=1e-93
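
Each homologue line is a comma-separated list of `key=value` fields plus a bare GI identifier, so it can be turned into a record with a few lines of code. This is a sketch under that assumption about the layout; `parse_homologue` and the `"GI"` key are hypothetical names chosen for illustration.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict with typed numeric fields."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:
            continue  # tolerate a trailing comma
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare identifier, e.g. GI31541964
    # Convert the numeric fields; the identifier and organism stay strings.
    converters = {
        "Length": int,
        "Percent_Identity": float,
        "Blast_Score": int,
        "Evalue": float,
    }
    for key, convert in converters.items():
        if key in record:
            record[key] = convert(record[key])
    return record


hit = parse_homologue(
    "Organism=Homo sapiens, GI31541964, Length=431, "
    "Percent_Identity=42.2273781902552, Blast_Score=367, Evalue=1e-101,"
)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 2))
```

With the hits parsed this way, sorting by `Blast_Score` or filtering by `Evalue` becomes a one-line operation.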

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): ARLY_CLOB1 (A7FWU5)

Other databases:

- EMBL:   CP000726
- RefSeq:   YP_001384912.1
- ProteinModelPortal:   A7FWU5
- SMR:   A7FWU5
- STRING:   A7FWU5
- GeneID:   5398113
- GenomeReviews:   CP000726_GR
- KEGG:   cba:CLB_2611
- eggNOG:   COG0165
- HOGENOM:   HBG539632
- OMA:   MAEDLIF
- ProtClustDB:   PRK00855
- BioCyc:   CBOT441770:CLB_2611-MONOMER
- GO:   GO:0005737
- HAMAP:   MF_00006
- InterPro:   IPR009049
- InterPro:   IPR003031
- InterPro:   IPR000362
- InterPro:   IPR020557
- InterPro:   IPR008948
- InterPro:   IPR022761
- PANTHER:   PTHR11444:SF3
- PRINTS:   PR00145
- PRINTS:   PR00149
- TIGRFAMs:   TIGR00838

Pfam domain/function: PF00206 Lyase_1; SSF48557 L-Aspartase-like

EC number: 4.3.2.1

Molecular weight: Translated: 50483; Mature: 50483

Theoretical pI: Translated: 5.07; Mature: 5.07

Prosite motif: PS00163 FUMARATE_LYASES

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.9 %Cys     (Translated Protein)
3.6 %Met     (Translated Protein)
4.5 %Cys+Met (Translated Protein)
0.9 %Cys     (Mature Protein)
3.6 %Met     (Mature Protein)
4.5 %Cys+Met (Mature Protein)
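
The percentages above are what one would obtain from 4 cysteine and 16 methionine residues in a 440-residue protein. The sketch below shows the calculation, assuming each value is the residue count divided by the sequence length and rounded to one decimal; `cys_met_content` is an illustrative name, and the input is a synthetic sequence with the same composition rather than the protein itself.

```python
def cys_met_content(protein: str) -> dict:
    """Percent Cys, Met and Cys+Met in a one-letter protein sequence."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    n = len(protein)
    cys = protein.count("C")
    met = protein.count("M")
    return {
        "Cys": round(100 * cys / n, 1),
        "Met": round(100 * met / n, 1),
        "Cys+Met": round(100 * (cys + met) / n, 1),
    }


# Synthetic 440-residue sequence: 16 Met, 4 Cys, the rest Ala.
synthetic = "M" * 16 + "C" * 4 + "A" * 420
content = cys_met_content(synthetic)
print(content)  # {'Cys': 0.9, 'Met': 3.6, 'Cys+Met': 4.5}
```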

Secondary structure:

>Translated Secondary Structure
MKLWGGRFKEEESKLMEDFNSSLSFDKKLYYEDIKGSIAHVKMLTNQNIIKEEEKEKILL
CCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHCCCHHHHHHHHCCCHHHHHHHHHHHH
GLEEILKEIDEGILKIEGDYEDIHSFVEINLINKIGNVGKKLHTGRSRNDQVALDMKLYA
HHHHHHHHHCCCCEEECCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEHHHHH
KKSTEEVIECLKELMDSLIKVGNENNYIMPGYTHLQRAQVVTFRYHLLAYFEMFKRDEKR
CCCHHHHHHHHHHHHHHHHHHCCCCCEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
LENALEILNESPLGSGALAGSTYNIDKEYTAKLLGFRKPVDNFLDGVSDRDYIIELISKF
HHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCHHHHHCCCCCHHHHHHHHHHH
SIIMMHLSRLSEELILWSSSEFRFIQIGDAYSTGSSIMPQKKNPDGAELIRGKIGRVYGD
HHHHHHHHHHHHHHEEECCCCEEEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHH
LISILTVMKSLPLAYNKDMQEDKEPFFDAKDTVISCLKVMEGIISTLKVNKENLMKSVKK
HHHHHHHHHHCCCCCCCCCHHHCCCCCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH
GFLNATEAADYLVNKGMAFRDAHKVIGEVVIYCEDKNSAIEDLSLEELKQFSDLFCEDIY
HCCCHHHHHHHHHHCCCHHHHHHHHHHHEEEEECCCCCCHHCCCHHHHHHHHHHHHHHHH
EFIDYKNSINKGIKKEMGYF
HHHHHHHHHHHHHHHHCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA