Definition: Clostridium botulinum A str. ATCC 3502, complete genome.
Accession: NC_009495
Length: 3,886,916 bp

The map label for this gene is glsA

Identifier: 148380768

GI number: 148380768

Start: 2973057

End: 2973974

Strand: Reverse

Name: glsA

Synonym: CBO2808

Alternate gene names: 148380768

Gene position: 2973974-2973057 (Counterclockwise)

Preceding gene: 148380769

Following gene: 148380766

Centisome position: 76.51
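
The coordinate fields above are related by simple arithmetic, which the sketch below works through. It assumes 1-based inclusive coordinates, and it assumes that the centisome position is the gene's first base (the End coordinate for a reverse-strand gene) expressed as a percentage of the genome length. The record states neither convention, and the function names are illustrative only.

```python
GENOME_LENGTH = 3_886_916           # Length field of NC_009495
START, END = 2_973_057, 2_973_974   # Start and End fields of this record
STRAND = "Reverse"                  # Strand field of this record


def gene_length(start: int, end: int) -> int:
    """Gene length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, end: int, strand: str, genome_length: int) -> float:
    """Position of the gene's first base as a percentage of the genome.

    Assumption: for a reverse-strand gene the first base is the End coordinate.
    """
    first_base = end if strand == "Reverse" else start
    return 100.0 * first_base / genome_length


length = gene_length(START, END)
position = centisome(START, END, STRAND, GENOME_LENGTH)
print(length, round(position, 2))  # 918 76.51
```

Under these assumptions the result agrees with the 918-base gene sequence and the centisome value listed in this record.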

GC content: 33.66

Gene sequence:

>918_bases
ATGAACAGACTGCTTAAGACGATTATAGAAAATAATAGAAAGTGGATAAGTGAAGGAAAGGTCGCCTCATATATTCCTGA
ACTTTCTAAAATGGATAAAAATTTACTGGGTATTTCTGTATGTACCCTGGGAGGGGAAGAATATTGGGAAGGCGATGCTG
AAGTTAAGTTTACGATTCAAAGTATATCAAAAATAGTAACTTTAATGCTAGCTATAATAGACAATGGAGAGGATTATGTT
TTTTCAAAGGTAGGAATGGAACCTACGGAAACTGCTTTTAATTCTATAGTGAATTTAGAAGCAAAAGAATCTCATAAGCC
TATAAATCCAATGATAAATGCTGGTGCCATAGTGGTGGCTTCTATGGTAGCTGGAAAGGATTCAGATGAAAAGTTTGATA
GAATCTTAAAGTTTACTAGAAAAATAAGTGGTAATAATGATATTGATATAAATCTAAATGTATATGAATCCGAAAAAGAA
ACAGGACATAGAAACAGAGCCCTTGCTTATTTTATGAAAAGCACAGGAGCTCTCAAAGGAAATGTAGAAGAGATTTTAGA
TGTGTATTTTAAACAATGTTCTATAGAAATTACTTGTAAAGATTTAGCTAGAATAGGAGTTATGTTAGCTAATGATGGGG
TATCTCCTTATACTGGTGATAGAATAGTTCCAAGGCATGTGGCTAGAATTGTAAAAACTATAATGGTAACCTGTGGTATG
TATGATGCGTCAGGAAATTTTGCAGTACATATCGGAATACCTGCTAAAAGTGGAGTTGGAGGAGGTATAATTGCCTGTGC
TCCGAGAAGAATGGGTATAGGGGTTTTAGGTACAGCTTTAGATGAAAAGGGTAATAGTATAGCTGGAACTAAGATATTAG
AAGAGCTTTCAAAGCAATTAGATTTAAGTATTTTTTAA
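
A GC content figure such as the one reported for this gene can be checked directly from the sequence. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over the full gene length; `gc_content` is a hypothetical helper, not part of any tool that produced this record. For reference, a value of 33.66 over 918 bases would correspond to 309 G or C bases.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    # Join wrapped FASTA lines and normalise case before counting.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Toy input only; pass the 918-base sequence above to check the record's value.
example = gc_content("ATGC\nGGAT")
print(example)  # 50.0
```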

Upstream 100 bases:

>100_bases
TATATAATTTTTATATAATAGAAATTAGCTTAAAAAAATTATATAAAAGATGTATAATTATTGTATAGTATAATATTATA
AAGATTATGGAGAAACATAT

Downstream 100 bases:

>100_bases
AAGATAAAAATATGATAAAATATCTTCTTTAATTGCTTAGTTTAATAATAAGGCCCCAGCAGATTGTTATAAAGATAAAA
CCATAAGAGAATTTATCATT

Product: glutaminase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 305; Mature: 305

Protein sequence:

>305_residues
MNRLLKTIIENNRKWISEGKVASYIPELSKMDKNLLGISVCTLGGEEYWEGDAEVKFTIQSISKIVTLMLAIIDNGEDYV
FSKVGMEPTETAFNSIVNLEAKESHKPINPMINAGAIVVASMVAGKDSDEKFDRILKFTRKISGNNDIDINLNVYESEKE
TGHRNRALAYFMKSTGALKGNVEEILDVYFKQCSIEITCKDLARIGVMLANDGVSPYTGDRIVPRHVARIVKTIMVTCGM
YDASGNFAVHIGIPAKSGVGGGIIACAPRRMGIGVLGTALDEKGNSIAGTKILEELSKQLDLSIF
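
The relationship between the 918-base gene sequence and the 305-residue protein can be illustrated with a small translation sketch. It assumes the standard codon assignments (which the bacterial translation table shares) and stops at the first stop codon; the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order (third base fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # join wrapped FASTA lines
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First six and last four codons of the gene sequence in this record.
print(translate("ATGAACAGACTGCTTAAG"))  # MNRLLK
print(translate("AGTATTTTTTAA"))        # SIF
# 918 bases are 306 codons: 305 residues plus one stop codon.
print(918 // 3 - 1)                     # 305
```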

Sequences:

>Translated_305_residues
MNRLLKTIIENNRKWISEGKVASYIPELSKMDKNLLGISVCTLGGEEYWEGDAEVKFTIQSISKIVTLMLAIIDNGEDYV
FSKVGMEPTETAFNSIVNLEAKESHKPINPMINAGAIVVASMVAGKDSDEKFDRILKFTRKISGNNDIDINLNVYESEKE
TGHRNRALAYFMKSTGALKGNVEEILDVYFKQCSIEITCKDLARIGVMLANDGVSPYTGDRIVPRHVARIVKTIMVTCGM
YDASGNFAVHIGIPAKSGVGGGIIACAPRRMGIGVLGTALDEKGNSIAGTKILEELSKQLDLSIF
>Mature_305_residues
MNRLLKTIIENNRKWISEGKVASYIPELSKMDKNLLGISVCTLGGEEYWEGDAEVKFTIQSISKIVTLMLAIIDNGEDYV
FSKVGMEPTETAFNSIVNLEAKESHKPINPMINAGAIVVASMVAGKDSDEKFDRILKFTRKISGNNDIDINLNVYESEKE
TGHRNRALAYFMKSTGALKGNVEEILDVYFKQCSIEITCKDLARIGVMLANDGVSPYTGDRIVPRHVARIVKTIMVTCGM
YDASGNFAVHIGIPAKSGVGGGIIACAPRRMGIGVLGTALDEKGNSIAGTKILEELSKQLDLSIF

Specific function: Unknown

COG id: COG2066

COG function: function code E; Glutaminase

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the glutaminase family

Homologues:

Organism=Homo sapiens, GI156104878, Length=283, Percent_Identity=41.696113074205, Blast_Score=235, Evalue=5e-62,
Organism=Homo sapiens, GI20336214, Length=286, Percent_Identity=39.8601398601399, Blast_Score=222, Evalue=4e-58,
Organism=Escherichia coli, GI1787804, Length=304, Percent_Identity=42.7631578947368, Blast_Score=254, Evalue=4e-69,
Organism=Escherichia coli, GI1786693, Length=306, Percent_Identity=35.6209150326797, Blast_Score=207, Evalue=6e-55,
Organism=Caenorhabditis elegans, GI17532727, Length=290, Percent_Identity=41.7241379310345, Blast_Score=229, Evalue=2e-60,
Organism=Caenorhabditis elegans, GI17507019, Length=292, Percent_Identity=40.7534246575342, Blast_Score=223, Evalue=7e-59,
Organism=Caenorhabditis elegans, GI193204073, Length=311, Percent_Identity=36.3344051446945, Blast_Score=208, Evalue=3e-54,
Organism=Caenorhabditis elegans, GI193204075, Length=314, Percent_Identity=36.9426751592357, Blast_Score=207, Evalue=5e-54,
Organism=Drosophila melanogaster, GI281363241, Length=304, Percent_Identity=37.828947368421, Blast_Score=215, Evalue=3e-56,
Organism=Drosophila melanogaster, GI24653164, Length=304, Percent_Identity=37.828947368421, Blast_Score=215, Evalue=4e-56,
Organism=Drosophila melanogaster, GI281363239, Length=295, Percent_Identity=38.9830508474576, Blast_Score=214, Evalue=4e-56,
Organism=Drosophila melanogaster, GI24653162, Length=304, Percent_Identity=37.828947368421, Blast_Score=214, Evalue=4e-56,
Organism=Drosophila melanogaster, GI24653158, Length=295, Percent_Identity=38.9830508474576, Blast_Score=214, Evalue=4e-56,
Organism=Drosophila melanogaster, GI24653156, Length=295, Percent_Identity=38.9830508474576, Blast_Score=214, Evalue=4e-56,
Organism=Drosophila melanogaster, GI116008307, Length=304, Percent_Identity=37.828947368421, Blast_Score=214, Evalue=5e-56,
Organism=Drosophila melanogaster, GI24653166, Length=295, Percent_Identity=38.9830508474576, Blast_Score=214, Evalue=7e-56,
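
Each homologue line above is a comma-separated list of `key=value` fields plus a bare GI identifier. The sketch below shows one way such a line could be parsed into a typed record; `parse_homologue` is a hypothetical helper, and it assumes every line carries exactly the fields seen in this record.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dictionary with typed values."""
    record = {}
    # Drop the trailing comma, then split into individual fields.
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare GI number, stored without prefix
    # Convert numeric fields; key names are taken verbatim from the record.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1787804, Length=304, "
    "Percent_Identity=42.7631578947368, Blast_Score=254, Evalue=4e-69,"
)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 1))
# Escherichia coli 1787804 42.8
```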

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): GLSA_CLOB1 (A7FX54)

Other databases:

- EMBL:   CP000726
- RefSeq:   YP_001385052.1
- ProteinModelPortal:   A7FX54
- SMR:   A7FX54
- STRING:   A7FX54
- GeneID:   5396651
- GenomeReviews:   CP000726_GR
- KEGG:   cba:CLB_2751
- eggNOG:   COG2066
- HOGENOM:   HBG512335
- OMA:   RNASIAY
- ProtClustDB:   PRK00971
- BioCyc:   CBOT441770:CLB_2751-MONOMER
- HAMAP:   MF_00313
- InterPro:   IPR012338
- InterPro:   IPR015868
- Gene3D:   G3DSA:3.40.710.20
- PANTHER:   PTHR12544
- TIGRFAMs:   TIGR03814

Pfam domain/function: PF04960 Glutaminase; SSF56601 PBP_transp_fold

EC number: 3.5.1.2

Molecular weight: Translated: 33287; Mature: 33287

Theoretical pI: Translated: 6.52; Mature: 6.52

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.6 %Cys     (Translated Protein)
3.6 %Met     (Translated Protein)
5.2 %Cys+Met (Translated Protein)
1.6 %Cys     (Mature Protein)
3.6 %Met     (Mature Protein)
5.2 %Cys+Met (Mature Protein)
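
The Cys/Met percentages above can be reproduced by counting residues in the protein sequence. The following is a minimal sketch, assuming each percentage is the residue count divided by the total sequence length and rounded to one decimal place; `residue_percent` is an illustrative name.

```python
# 305-residue protein sequence copied from this record.
PROTEIN = (
    "MNRLLKTIIENNRKWISEGKVASYIPELSKMDKNLLGISVCTLGGEEYWEGDAEVKFTIQSISKIVTLMLAIIDNGEDYV"
    "FSKVGMEPTETAFNSIVNLEAKESHKPINPMINAGAIVVASMVAGKDSDEKFDRILKFTRKISGNNDIDINLNVYESEKE"
    "TGHRNRALAYFMKSTGALKGNVEEILDVYFKQCSIEITCKDLARIGVMLANDGVSPYTGDRIVPRHVARIVKTIMVTCGM"
    "YDASGNFAVHIGIPAKSGVGGGIIACAPRRMGIGVLGTALDEKGNSIAGTKILEELSKQLDLSIF"
)


def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given residues."""
    count = sum(1 for aa in sequence if aa in residues)
    return 100.0 * count / len(sequence)


cys = residue_percent(PROTEIN, "C")
met = residue_percent(PROTEIN, "M")
cys_met = residue_percent(PROTEIN, "CM")
print(round(cys, 1), round(met, 1), round(cys_met, 1))  # 1.6 3.6 5.2
```

These figures correspond to 5 cysteines and 11 methionines in 305 residues.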

Secondary structure:

>Translated Secondary Structure
MNRLLKTIIENNRKWISEGKVASYIPELSKMDKNLLGISVCTLGGEEYWEGDAEVKFTIQ
CHHHHHHHHHCCCHHHHCCCHHHHHHHHHHHCCHHHEEEEEECCCHHHCCCCCEEEEEHH
SISKIVTLMLAIIDNGEDYVFSKVGMEPTETAFNSIVNLEAKESHKPINPMINAGAIVVA
HHHHHHHHHHHHHCCCCCEEEEECCCCHHHHHHHHHCCCCCCCCCCCCCCHHCCCHHHHH
SMVAGKDSDEKFDRILKFTRKISGNNDIDINLNVYESEKETGHRNRALAYFMKSTGALKG
HHHHCCCCHHHHHHHHHHHHHCCCCCCEEEEEEEEECCHHCCCHHHHHHHHHHHCCCCCC
NVEEILDVYFKQCSIEITCKDLARIGVMLANDGVSPYTGDRIVPRHVARIVKTIMVTCGM
CHHHHHHHHHHHCCEEEEHHHHHHHCEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHCC
YDASGNFAVHIGIPAKSGVGGGIIACAPRRMGIGVLGTALDEKGNSIAGTKILEELSKQL
EECCCCEEEEEECCCCCCCCCCEEEECCCCCCCCHHHHHHHCCCCCCHHHHHHHHHHHHC
DLSIF
CCCCC
>Mature Secondary Structure
MNRLLKTIIENNRKWISEGKVASYIPELSKMDKNLLGISVCTLGGEEYWEGDAEVKFTIQ
CHHHHHHHHHCCCHHHHCCCHHHHHHHHHHHCCHHHEEEEEECCCHHHCCCCCEEEEEHH
SISKIVTLMLAIIDNGEDYVFSKVGMEPTETAFNSIVNLEAKESHKPINPMINAGAIVVA
HHHHHHHHHHHHHCCCCCEEEEECCCCHHHHHHHHHCCCCCCCCCCCCCCHHCCCHHHHH
SMVAGKDSDEKFDRILKFTRKISGNNDIDINLNVYESEKETGHRNRALAYFMKSTGALKG
HHHHCCCCHHHHHHHHHHHHHCCCCCCEEEEEEEEECCHHCCCHHHHHHHHHHHCCCCCC
NVEEILDVYFKQCSIEITCKDLARIGVMLANDGVSPYTGDRIVPRHVARIVKTIMVTCGM
CHHHHHHHHHHHCCEEEEHHHHHHHCEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHCC
YDASGNFAVHIGIPAKSGVGGGIIACAPRRMGIGVLGTALDEKGNSIAGTKILEELSKQL
EECCCCEEEEEECCCCCCCCCCEEEECCCCCCCCHHHHHHHCCCCCCHHHHHHHHHHHHC
DLSIF
CCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: NA