Definition Aeromonas salmonicida subsp. salmonicida A449, complete genome.
Accession NC_009348
Length 4,702,402

The map label for this gene is hex [H]

Identifier: 145299761

GI number: 145299761

Start: 3057104

End: 3059449

Strand: Direct

Name: hex [H]

Synonym: ASA_2841

Alternate gene names: 145299761

Gene position: 3057104-3059449 (Clockwise)

Preceding gene: 145299756

Following gene: 145299765

Centisome position: 65.01
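
The centisome value appears to be the gene start expressed as a percentage of the genome length. That definition is an assumption, since the record does not state how the field is derived. A minimal Python sketch under that assumption, using the Start and Length values from this record (the function name is hypothetical):

```python
def centisome_position(start: int, genome_length: int) -> float:
    """Gene start as a percentage of genome length (assumed definition)."""
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    return round(100.0 * start / genome_length, 2)


# Values from this record: Start 3057104, genome Length 4,702,402.
position = centisome_position(3057104, 4702402)
print(position)
```

With these inputs the result agrees with the listed value of 65.01.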

GC content: 60.06
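
GC content is conventionally the percentage of G and C bases in a nucleotide sequence. The sketch below shows one way such a value could be computed in Python. The function name is hypothetical, the input is a toy sequence rather than the gene itself, and the record does not say which tool produced its figure.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("sequence is empty")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# Toy input for illustration only: 4 of 8 bases are G or C.
example = gc_content("GGCCAATT")
print(example)
```

To check the record's figure, the gene sequence lines could be joined and passed to the same function.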

Gene sequence:

>2346_bases
ATGCGTGTAAGACGTTCTCTGGTCAGCATGGCGCTCGCCATGGCCCTGACAGCTCCGGCCGCCATGGCCATGACCCAATC
CCAGTTGGACGGGATGGGTCAGGGAGTCGACCTCAAGTATCAAGTCATCGATAACACCCTGACAGATGGCAACAGCTTCA
GGGCGAGTATCACCCTGCGAAATAACACCGCCCAGCCTCTGCCGGCAACGGGCTGGAGCCTCTGGTTCAGCCATATTCGT
GACGTGAGCAAGCTCTACACCGACCAATTCAAGGTCACCCATGTCAACGGCGACATCTTCAAACTGGAGCCAACGGCTCA
ATTCCGGGGGGTGCCCGCCGGCCAGTCGTTCGAGATCGCCTTCGATGGTGGAAACTGGCAGGTTGCCAAGAGTGACGTGA
TGCCCAACTGGTACTTCGCCGCACAAGATGCCAAGGGCAACCCCATCACAGCGCTGCTTGCCAGCACCAGTAATACCCAT
AACGGCGTGGTCCCGACGGAGCCCTCGGCCGAGCTGCCGTTTGTGGGTGATTTCAGCACGCCCAAACAGTGGAAACGCTA
TAACGGCCCGAGCAATATCGACCGCTGGAATCCGTTCACAGCCGCCGAGCGCTATGCCCGTAACGCCGATCTGTCCGTAC
AGGCTCATCCGGTCGGCGTCGTGCCAACCCCCGCCGAGCTGCAGCTGGCCACCGGCAATGTCACCCTAGGTGCGGACTGG
GTTGTCGTGTTTGATAACGGCTATGAGGAGCAGGCTCGCTGGTTGGCGGCCAACCTTGGTCTGCCAGCGCAGAGTTGGTC
ACCCAACACCAGCAAGGTGATTCGCATGGGTTGGGGACAGGTGCAGATAGACGGGCAGGCCAAGTGGGAAGAGGCCTATC
GCCTCAAGGTTGACAGCAAGAGCCAAACCATCCAGATCACGGCCGCAGATGCAGCCGGCGCCCTCTATGCCGCACAGTCG
CTGCTGCAACTGGAGCAACAGCGCGTCGTGCCTGCGTTGACCATCGCCGACGCCCCGCGCTTTGCCTATCGTGGCCTCTC
TCTCGACGCTTCGCGTAACTTCCGTAGCAAACAGGCCGTGCTCGCCTTGCCGGAGCTGACCGACGTTGGTGCTCGTCGTT
GCCACGATCCGGCCGAGCGCACCTGTATTCTGCCCTTCCTGGGTGCCGGTCCGAACGGCACCGCACAGTCTGACGGTTTC
TACAGTGCCGACGATTACCGCGAGATCCTGAGTCACGCCAAGGCGCTCAATATCGAGGTTATCCCCGAGATCGACATGCC
TGGCCACGCTCACGCCGCCATCAAGGCGATGGATGCGCGCAGTGCCCGTTTGAACGAAGCCGGTCAGCCACAGCAAGCCG
CCGAGTATCGTCTCTCCGATCCGGATGACCGTACCGACTACACCTCGGTACAGATGTTCAAGGACAACGCCATGAACGTC
TGTATGGAGTCTACCTACCGCTTCATCGACACCGTCGTGGGTGAGTTGGTGGCCCTGTATCAGGGTATCCAGCCGCTCAA
GACCTTCCACTTCGGTGGGGATGAAGTCGCTGGCGCCTGGAAGCAGTCGCCGGCCTGCCAGGCGTTCTTTGCCAACAACA
GCCAAGGGATCAAGGATCCGTCCCAGCTCAGTCAGTATTTTGTCGAGCGAGTCTCAGGTATTACCTCTGCCCACGGCCTC
AATATGGGAGGCTGGGAAGATGGCTTGATGCATGACAACAAGGTCTATCCGCGCAGCAACCTAGCCAACGCACTGGTAAG
CGGCAATGCCTGGCAGAATATCTGGGAATGGGGTGTGGCCGATCGCGCCTATAAGCTGGCTAACGCCGGTTATGGTGTCA
TCTACAACCAGGCTAGTCACCTCTACTTCGACCATCCGAACGAGCCGGATCCGGCCGAGCGTGGCTACTATTGGGCCCCG
CGCTTTACCGATACCCGAAAGACCTTCGGTTTCATGCCGGACGATCTGTTTGCCAACGCCGATTACACCCGTGCCGGCAA
GCCGATCACCAAGGCTGAGGTGGTCGATGGCGCCACCACCAAGACTCTGGAGCAACCGGCCAACGTGCTGGGGATGCAGG
GTTCGCTCTGGGCCGAAACCGTGCGCACCGACAACCAGTTCGAAGAGATGCTCTTCCCACGGGTCTTTGCCCTCGCCGAG
CGGGCCTGGCATAAAGCAGGCTGGGAGGCCAACAGCCCGTTCCCAGGTCTGACCATCCAGTACTCCATTGATGGCGGCAG
CTGGCAAGCCTATGACGCAGCGAATGCGCCGAGCGTCTCAGGAAAGGTAGTTGTGCGTACTGCCTCCGGCCTGCGTGCAG
GTCGAAACGTGGTTATTAACAACTAA

Upstream 100 bases:

>100_bases
CAAAACGCTCCGTGCGACACCCTTTTTGTGTATAAAAAACACCCTCACAGCGCTACCCTTGCTTCGCCAAGAAAATTAAC
AACCAAAAAGGACGGTAACA

Downstream 100 bases:

>100_bases
ATGAAATTTTGAACTATCAGTTGTGCAACGATAGATAATCCATCTGTCGCATATTTATAACTCTGAGTTAAAAAGCAAAC
TGATTTACTGATATGAAAAA

Product: beta-N-acetylhexosaminidase

Products: NA

Alternate protein names: Beta-N-acetylhexosaminidase; Chitobiase; N-acetyl-beta-glucosaminidase [H]

Number of amino acids: Translated: 781; Mature: 781
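
The residue count is consistent with the gene coordinates if the coordinates are inclusive and the final codon (TAA) is a stop codon that is not translated. A short Python sketch of that arithmetic, with hypothetical function names and those two assumptions made explicit:

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming inclusive 1-based coordinates."""
    if end < start:
        raise ValueError("end must not precede start")
    return end - start + 1


def residue_count(n_bases: int) -> int:
    """Residues encoded, assuming one untranslated terminal stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("length is not a whole number of codons")
    return n_bases // 3 - 1


# Coordinates from this record: Start 3057104, End 3059449.
n_bases = gene_length(3057104, 3059449)
n_residues = residue_count(n_bases)
print(n_bases, n_residues)
```

This gives 2346 bases and 781 residues, matching the sequence headers in this record.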

Protein sequence:

>781_residues
MRVRRSLVSMALAMALTAPAAMAMTQSQLDGMGQGVDLKYQVIDNTLTDGNSFRASITLRNNTAQPLPATGWSLWFSHIR
DVSKLYTDQFKVTHVNGDIFKLEPTAQFRGVPAGQSFEIAFDGGNWQVAKSDVMPNWYFAAQDAKGNPITALLASTSNTH
NGVVPTEPSAELPFVGDFSTPKQWKRYNGPSNIDRWNPFTAAERYARNADLSVQAHPVGVVPTPAELQLATGNVTLGADW
VVVFDNGYEEQARWLAANLGLPAQSWSPNTSKVIRMGWGQVQIDGQAKWEEAYRLKVDSKSQTIQITAADAAGALYAAQS
LLQLEQQRVVPALTIADAPRFAYRGLSLDASRNFRSKQAVLALPELTDVGARRCHDPAERTCILPFLGAGPNGTAQSDGF
YSADDYREILSHAKALNIEVIPEIDMPGHAHAAIKAMDARSARLNEAGQPQQAAEYRLSDPDDRTDYTSVQMFKDNAMNV
CMESTYRFIDTVVGELVALYQGIQPLKTFHFGGDEVAGAWKQSPACQAFFANNSQGIKDPSQLSQYFVERVSGITSAHGL
NMGGWEDGLMHDNKVYPRSNLANALVSGNAWQNIWEWGVADRAYKLANAGYGVIYNQASHLYFDHPNEPDPAERGYYWAP
RFTDTRKTFGFMPDDLFANADYTRAGKPITKAEVVDGATTKTLEQPANVLGMQGSLWAETVRTDNQFEEMLFPRVFALAE
RAWHKAGWEANSPFPGLTIQYSIDGGSWQAYDAANAPSVSGKVVVRTASGLRAGRNVVINN

Sequences:

>Translated_781_residues
MRVRRSLVSMALAMALTAPAAMAMTQSQLDGMGQGVDLKYQVIDNTLTDGNSFRASITLRNNTAQPLPATGWSLWFSHIR
DVSKLYTDQFKVTHVNGDIFKLEPTAQFRGVPAGQSFEIAFDGGNWQVAKSDVMPNWYFAAQDAKGNPITALLASTSNTH
NGVVPTEPSAELPFVGDFSTPKQWKRYNGPSNIDRWNPFTAAERYARNADLSVQAHPVGVVPTPAELQLATGNVTLGADW
VVVFDNGYEEQARWLAANLGLPAQSWSPNTSKVIRMGWGQVQIDGQAKWEEAYRLKVDSKSQTIQITAADAAGALYAAQS
LLQLEQQRVVPALTIADAPRFAYRGLSLDASRNFRSKQAVLALPELTDVGARRCHDPAERTCILPFLGAGPNGTAQSDGF
YSADDYREILSHAKALNIEVIPEIDMPGHAHAAIKAMDARSARLNEAGQPQQAAEYRLSDPDDRTDYTSVQMFKDNAMNV
CMESTYRFIDTVVGELVALYQGIQPLKTFHFGGDEVAGAWKQSPACQAFFANNSQGIKDPSQLSQYFVERVSGITSAHGL
NMGGWEDGLMHDNKVYPRSNLANALVSGNAWQNIWEWGVADRAYKLANAGYGVIYNQASHLYFDHPNEPDPAERGYYWAP
RFTDTRKTFGFMPDDLFANADYTRAGKPITKAEVVDGATTKTLEQPANVLGMQGSLWAETVRTDNQFEEMLFPRVFALAE
RAWHKAGWEANSPFPGLTIQYSIDGGSWQAYDAANAPSVSGKVVVRTASGLRAGRNVVINN
>Mature_781_residues
MRVRRSLVSMALAMALTAPAAMAMTQSQLDGMGQGVDLKYQVIDNTLTDGNSFRASITLRNNTAQPLPATGWSLWFSHIR
DVSKLYTDQFKVTHVNGDIFKLEPTAQFRGVPAGQSFEIAFDGGNWQVAKSDVMPNWYFAAQDAKGNPITALLASTSNTH
NGVVPTEPSAELPFVGDFSTPKQWKRYNGPSNIDRWNPFTAAERYARNADLSVQAHPVGVVPTPAELQLATGNVTLGADW
VVVFDNGYEEQARWLAANLGLPAQSWSPNTSKVIRMGWGQVQIDGQAKWEEAYRLKVDSKSQTIQITAADAAGALYAAQS
LLQLEQQRVVPALTIADAPRFAYRGLSLDASRNFRSKQAVLALPELTDVGARRCHDPAERTCILPFLGAGPNGTAQSDGF
YSADDYREILSHAKALNIEVIPEIDMPGHAHAAIKAMDARSARLNEAGQPQQAAEYRLSDPDDRTDYTSVQMFKDNAMNV
CMESTYRFIDTVVGELVALYQGIQPLKTFHFGGDEVAGAWKQSPACQAFFANNSQGIKDPSQLSQYFVERVSGITSAHGL
NMGGWEDGLMHDNKVYPRSNLANALVSGNAWQNIWEWGVADRAYKLANAGYGVIYNQASHLYFDHPNEPDPAERGYYWAP
RFTDTRKTFGFMPDDLFANADYTRAGKPITKAEVVDGATTKTLEQPANVLGMQGSLWAETVRTDNQFEEMLFPRVFALAE
RAWHKAGWEANSPFPGLTIQYSIDGGSWQAYDAANAPSVSGKVVVRTASGLRAGRNVVINN

Specific function: Hydrolysis of terminal, non-reducing N-acetyl-beta-D-glucosamine residues in chitobiose and higher analogs, and in glycoproteins [H]

COG id: COG3525

COG function: function code G; N-acetyl-beta-hexosaminidase

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the glycosyl hydrolase 20 family [H]

Homologues:

Organism=Caenorhabditis elegans, GI17569815, Length=454, Percent_Identity=23.1277533039648, Blast_Score=81, Evalue=2e-15

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR015882
- InterPro:   IPR008965
- InterPro:   IPR004866
- InterPro:   IPR012291
- InterPro:   IPR013812
- InterPro:   IPR001540
- InterPro:   IPR004867
- InterPro:   IPR015883
- InterPro:   IPR017853
- InterPro:   IPR013781
- InterPro:   IPR014756 [H]

Pfam domain/function: PF03173 CHB_HEX; PF03174 CHB_HEX_C; PF00728 Glyco_hydro_20; PF02838 Glyco_hydro_20b [H]

EC number: 3.2.1.52 [H]

Molecular weight: Translated: 85708; Mature: 85708

Theoretical pI: Translated: 5.79; Mature: 5.79

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.5% Cys     (Translated Protein)
2.3% Met     (Translated Protein)
2.8% Cys+Met (Translated Protein)
0.5% Cys     (Mature Protein)
2.3% Met     (Mature Protein)
2.8% Cys+Met (Mature Protein)
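
These percentages are presumably the counts of cysteine (C) and methionine (M) residues divided by the protein length. The Python sketch below illustrates that calculation under that assumption. The function name is hypothetical and the input is a toy peptide, not the protein from this record.

```python
def cys_met_content(protein: str) -> dict:
    """Percent Cys, Met and Cys+Met, rounded to one decimal place."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("protein sequence is empty")
    n = len(seq)
    cys = 100.0 * seq.count("C") / n
    met = 100.0 * seq.count("M") / n
    return {
        "Cys": round(cys, 1),
        "Met": round(met, 1),
        "Cys+Met": round(cys + met, 1),
    }


# Toy 10-residue peptide with one Cys and one Met, for illustration only.
toy = cys_met_content("MACDEFGHIK")
print(toy)
```

Rounding the combined value from the unrounded parts avoids compounding the rounding error of the two individual percentages.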

Secondary structure:

>Translated Secondary Structure
MRVRRSLVSMALAMALTAPAAMAMTQSQLDGMGQGVDLKYQVIDNTLTDGNSFRASITLR
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCCCCCEEEEEEEE
NNTAQPLPATGWSLWFSHIRDVSKLYTDQFKVTHVNGDIFKLEPTAQFRGVPAGQSFEIA
CCCCCCCCCCHHHHHHHHHHHHHHHHHCCEEEEEECCCEEEECCCHHHCCCCCCCEEEEE
FDGGNWQVAKSDVMPNWYFAAQDAKGNPITALLASTSNTHNGVVPTEPSAELPFVGDFST
EECCCEEEECCCCCCCEEEEECCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCC
PKQWKRYNGPSNIDRWNPFTAAERYARNADLSVQAHPVGVVPTPAELQLATGNVTLGADW
CHHHHHCCCCCCCCCCCCHHHHHHHHCCCCCEEEECCCCCCCCCCCEEEEECCEEECCCE
VVVFDNGYEEQARWLAANLGLPAQSWSPNTSKVIRMGWGQVQIDGQAKWEEAYRLKVDSK
EEEECCCCHHHHHHHHHHCCCCCCCCCCCCCEEEEECCCEEEECCCCCCCEEEEEEECCC
SQTIQITAADAAGALYAAQSLLQLEQQRVVPALTIADAPRFAYRGLSLDASRNFRSKQAV
CCEEEEEECCCHHHHHHHHHHHHHHHHHCCEEEEECCCCCHHHCCCCCCCCCCCCCCCEE
LALPELTDVGARRCHDPAERTCILPFLGAGPNGTAQSDGFYSADDYREILSHAKALNIEV
EECCCHHHHHHHHCCCHHHCEEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHCCCEEE
IPEIDMPGHAHAAIKAMDARSARLNEAGQPQQAAEYRLSDPDDRTDYTSVQMFKDNAMNV
EECCCCCCCCHHHHEEHHHHHHHCCCCCCCHHHHHCCCCCCCCCCCCCEEEEECCCHHHH
CMESTYRFIDTVVGELVALYQGIQPLKTFHFGGDEVAGAWKQSPACQAFFANNSQGIKDP
HHHHHHHHHHHHHHHHHHHHHCCHHHHHEECCCHHHCCHHCCCCCEEEEECCCCCCCCCH
SQLSQYFVERVSGITSAHGLNMGGWEDGLMHDNKVYPRSNLANALVSGNAWQNIWEWGVA
HHHHHHHHHHHHCCHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCHHHHHHHHCCH
DRAYKLANAGYGVIYNQASHLYFDHPNEPDPAERGYYWAPRFTDTRKTFGFMPDDLFANA
HHHHHHHCCCCEEEEECCCEEEECCCCCCCHHHCCEEECCCCCCCHHHCCCCCHHHHCCC
DYTRAGKPITKAEVVDGATTKTLEQPANVLGMQGSLWAETVRTDNQFEEMLFPRVFALAE
CCCCCCCCCCHHHHCCCCHHHHHHHHHHHHCCCCCHHHHHHHCCHHHHHHHHHHHHHHHH
RAWHKAGWEANSPFPGLTIQYSIDGGSWQAYDAANAPSVSGKVVVRTASGLRAGRNVVIN
HHHHHCCCCCCCCCCCCEEEEEECCCCEEEECCCCCCCCCCEEEEEECCCCCCCCEEEEC
N
C
>Mature Secondary Structure
MRVRRSLVSMALAMALTAPAAMAMTQSQLDGMGQGVDLKYQVIDNTLTDGNSFRASITLR
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCCCCCEEEEEEEE
NNTAQPLPATGWSLWFSHIRDVSKLYTDQFKVTHVNGDIFKLEPTAQFRGVPAGQSFEIA
CCCCCCCCCCHHHHHHHHHHHHHHHHHCCEEEEEECCCEEEECCCHHHCCCCCCCEEEEE
FDGGNWQVAKSDVMPNWYFAAQDAKGNPITALLASTSNTHNGVVPTEPSAELPFVGDFST
EECCCEEEECCCCCCCEEEEECCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCC
PKQWKRYNGPSNIDRWNPFTAAERYARNADLSVQAHPVGVVPTPAELQLATGNVTLGADW
CHHHHHCCCCCCCCCCCCHHHHHHHHCCCCCEEEECCCCCCCCCCCEEEEECCEEECCCE
VVVFDNGYEEQARWLAANLGLPAQSWSPNTSKVIRMGWGQVQIDGQAKWEEAYRLKVDSK
EEEECCCCHHHHHHHHHHCCCCCCCCCCCCCEEEEECCCEEEECCCCCCCEEEEEEECCC
SQTIQITAADAAGALYAAQSLLQLEQQRVVPALTIADAPRFAYRGLSLDASRNFRSKQAV
CCEEEEEECCCHHHHHHHHHHHHHHHHHCCEEEEECCCCCHHHCCCCCCCCCCCCCCCEE
LALPELTDVGARRCHDPAERTCILPFLGAGPNGTAQSDGFYSADDYREILSHAKALNIEV
EECCCHHHHHHHHCCCHHHCEEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHCCCEEE
IPEIDMPGHAHAAIKAMDARSARLNEAGQPQQAAEYRLSDPDDRTDYTSVQMFKDNAMNV
EECCCCCCCCHHHHEEHHHHHHHCCCCCCCHHHHHCCCCCCCCCCCCCEEEEECCCHHHH
CMESTYRFIDTVVGELVALYQGIQPLKTFHFGGDEVAGAWKQSPACQAFFANNSQGIKDP
HHHHHHHHHHHHHHHHHHHHHCCHHHHHEECCCHHHCCHHCCCCCEEEEECCCCCCCCCH
SQLSQYFVERVSGITSAHGLNMGGWEDGLMHDNKVYPRSNLANALVSGNAWQNIWEWGVA
HHHHHHHHHHHHCCHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCHHHHHHHHCCH
DRAYKLANAGYGVIYNQASHLYFDHPNEPDPAERGYYWAPRFTDTRKTFGFMPDDLFANA
HHHHHHHCCCCEEEEECCCEEEECCCCCCCHHHCCEEECCCCCCCHHHCCCCCHHHHCCC
DYTRAGKPITKAEVVDGATTKTLEQPANVLGMQGSLWAETVRTDNQFEEMLFPRVFALAE
CCCCCCCCCCHHHHCCCCHHHHHHHHHHHHCCCCCHHHHHHHCCHHHHHHHHHHHHHHHH
RAWHKAGWEANSPFPGLTIQYSIDGGSWQAYDAANAPSVSGKVVVRTASGLRAGRNVVIN
HHHHHCCCCCCCCCCCCEEEEEECCCCEEEECCCCCCCCCCEEEEEECCCCCCCCEEEEC
N
C

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8341694 [H]