Definition: Herpetosiphon aurantiacus ATCC 23779 chromosome, complete genome.
Accession: NC_009972
Length: 6,346,587

The map label for this gene is csxA [H]

Identifier: 159897413

GI number: 159897413

Start: 1008017

End: 1010488

Strand: Reverse

Name: csxA [H]

Synonym: Haur_0884

Alternate gene names: 159897413

Gene position: 1010488-1008017 (Counterclockwise)

Preceding gene: 159897414

Following gene: 159897411

Centisome position: 15.92
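
The coordinate fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates, and it assumes that the centisome position is the gene's first base on the reverse strand (1010488) expressed as a percentage of the chromosome length. That second assumption is inferred from the numbers, not stated in the record, and the function names are illustrative.

```python
# Values copied from the record above.
GENOME_LENGTH = 6_346_587
START, END = 1_008_017, 1_010_488  # assumed 1-based, inclusive


def gene_length(start: int, end: int) -> int:
    """Length of a 1-based inclusive interval."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the chromosome length, to 2 decimals."""
    return round(100 * position / genome_length, 2)


print(gene_length(START, END))         # 2472
print(centisome(END, GENOME_LENGTH))   # 15.92
```

Using the Start coordinate (1008017) instead gives 15.88, so the reported 15.92 matches the End coordinate, which is where a reverse-strand gene begins.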

GC content: 52.1
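
GC content is the percentage of G and C bases in a sequence. A minimal sketch of that calculation follows; the function name is illustrative, and the rounding to one decimal is an assumption chosen to match the precision of the value reported above. The toy input is the first 20 bases of the gene sequence in this record, not the full gene.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence, to 1 decimal."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100 * gc / len(seq), 1)


# First 20 bases of the gene sequence (4 G and 5 C out of 20).
print(gc_content("ATGCAAAAGCTCGCATTGCA"))  # 45.0
```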

Gene sequence:

>2472_bases
ATGCAAAAGCTCGCATTGCAGCAGGCATGGCAGGCCAAACAACGTGATCCACAACGCACTGTGCTCGCTGATAGTACGAG
TAGCGAAGGCTGGATTGCCGCGCCTGTGCCTGGCACAATTTACGAAGCCTTGATCGCCGCCGAGCGCATTCCCGATCCCT
TCGATGGCTTGAATGAGCTGGCGGTGCAATGGGTAGCTGAGGTCGATTGGCTCTATCGTTGCGATTTTGAATTAACCGCT
GAGCAAGCCAACCAACCAGCCGCCTTGCACTTCGCAGGCTTGGATACGATCGCCACGGTTTGGATCAATGGCCAAGAAAT
ATTAAACAGCGACAATATGTTCGTGCCGCAACGGGTGGTGGTCAGTAACCAAATCCACGTTGGCGCAAATCAATTGCTGA
TCGAATTTCGCTCGGCGTTGAAGCATGGCCATGCCTTGCAAGCCGAAATGGGACAACTTGGTGTGTGGAACGGCGATCCC
AGCCGCTTGTATCTGCGCAAAGCCCAATATCATTATGGCTGGGATTGGGGTCCAGCCCTGTTGACGGCTGGCCCGTGGCT
GCCCGTCACCCTCGAATTGGGCGCAACTCGTTTGAGCGATTTGGCTTGTCCAATCAGTGTAGCTGATGATTGTAGCACTG
CGATTTTTGCGGTAACTGCAACCGTTGCGGATGTACAGGCCGATACTGCGGTATTAATTCAATTATGGAACCCAGCAGGC
GAGTTAATCGCTGAACACCAACAATTGGTTGTTGCTGGCAATATCCAGCATTCAATCACGGTTGATCAGCCCAGCTTATG
GTGGCCGCATGGCTATGGTCAGCAACATCGCTATCGCTTGGCAGTCAAGGTCTTTGCTAATCAAACGGTGCTCGATCAAC
AAGAATTACAGCTTGGCGTGCGGCGCGTGCGTTTGGTGCAAGAGCCATTGCTCGATGAAGCAGGCGAAACATTTTTGTTT
GAAATCAACAATGTGCCAATGTTTAGCGGCGGAGCCAATTGGATTCCCGCTGATCTACTGACCAATCGTGTAAGCAACGA
GCACTATCGCCGCTTGTTGCAGGCAGCGGTTGATAGCCATATGCTGATGATTCGGATTTGGGGCGGCGGAATCTACGAAG
TTGACCATTTTTATGATCTTTGCGATCAGCTAGGCTTGTTGGTTTGGCAAGATTTTATGTTTGCCTGCGGCATGTATCCA
GCTCATCCTGCATTTTTGGCCAGCGTCGAGGCCGAAGCAATCGCCCAAGTTCAGCGCTTGCGCCATCATCCCTCGATTGT
GCTGTGGTGTGGCAACAACGAAGATTACCAAATTGCCCAAACCTTCAACGCCTATGATCACAGTTTCCAAGGCGATTTTA
CCAAAACCAGCTTTCCGGCCCGCGAAATTTACGAACGCTTGTTGCCCAAGGTCTGTGCCAGTTACGATCCGACCACAATT
TATTGGCCTGGCAGCCCGTATGGCGGAGCCGATGTCTATGATAAAACCCGTGGCGACCGCCATACCTGGGATGTTTGGCA
TAGCGCGATGGCTCCCTACCAAGATTACCCCAAGTACGAAGGGCGTTTTGTCAGCGAGTTTGGCATGGAATCGTGCGCAG
CGTTGCCAACCTTGCTGAGCGTCATTCCTGAGCACGAGCGCTATCCCCAAAGCCGCACGGTCGAGCACCACAACAAATCG
GAGGGTGGCGCACGCCGTTTGGCGGTGTATCTCAACGATACCTTACGATTTGAAAACACCCTCGAATCGTATGTGTATGC
CACCCAATTGATGCAGGCCGAGGCTTTGGCGGCAGCCTATCGTGGCTGGAGGCGACGTTGGGGTGGCGCAGGCCGTTATG
CAGTGGCAGGCGCTTTAGTGTGGCAACTTAACGATTGTTGGCCAGTGATTAGCTGGGCCATTATCGATTCAGCTTTGCGC
AAAAAACCCGCCATCTATAGCATTGGCCGCGAGCTAGCCCCCATCAGCGCCGGATTACAGCGCAATGGCGCAACAATCGA
GGCTTGGGTGGTCAATGGCACAATTGAATCAAAATCAGCAACGATTCAATTGACTGGCTACGATTTGCATGGCAGATTAC
TTTTTGAGCAAAACATTGAATATGAATTAGCCGCCAATCAAGCCAACCCAATTCCCAGCCCAAACCTAAACTTGCCCGAA
CAAAGCGTAGTTGGCATGCAAGTGCTGGTTGATGGTGTGGTTGTGGCAAGAGCCAGCGCATGGCCCGAGCCATTCAAATA
TCTCCCAGCCTATGATCCCCAAATTAGCGTCACCCGCATGGCCGACGATTGGCTAGAAATTAGCAGCCAACACCCAGCCA
AAGGGGTTTGGCTACAAACGGAGGCTGAAATCAATTGGAGCGATAATCTGCTTGATCTCTTGCCCAACCAACCACAACGG
ATTCAAGCATGTGGCTTAGGTCAACAGCCGATTGACATTAAGTGGCTGCACTGGGATCAAGCAAGAACATAG

Upstream 100 bases:

>100_bases
AAATTCTTGATTTAAGCCCGATCCCTAGTCCCCAACAACCAACCCCAACGAATCTAGCTTGCCACAACCAAATTGGTGGC
AGAGAAAGCATCAATAATTC

Downstream 100 bases:

>100_bases
ATCCTAGAACATAAATTTTGATCCACGAAGGACACGAAGCGCACGAAGGATATATAGGCTATCGACTATCGGAATTAATC
AGCCACCTCAACATATTCCG

Product: beta-mannosidase

Products: NA

Alternate protein names: GlcNase; Exochitinase [H]

Number of amino acids: Translated: 823; Mature: 823

Protein sequence:

>823_residues
MQKLALQQAWQAKQRDPQRTVLADSTSSEGWIAAPVPGTIYEALIAAERIPDPFDGLNELAVQWVAEVDWLYRCDFELTA
EQANQPAALHFAGLDTIATVWINGQEILNSDNMFVPQRVVVSNQIHVGANQLLIEFRSALKHGHALQAEMGQLGVWNGDP
SRLYLRKAQYHYGWDWGPALLTAGPWLPVTLELGATRLSDLACPISVADDCSTAIFAVTATVADVQADTAVLIQLWNPAG
ELIAEHQQLVVAGNIQHSITVDQPSLWWPHGYGQQHRYRLAVKVFANQTVLDQQELQLGVRRVRLVQEPLLDEAGETFLF
EINNVPMFSGGANWIPADLLTNRVSNEHYRRLLQAAVDSHMLMIRIWGGGIYEVDHFYDLCDQLGLLVWQDFMFACGMYP
AHPAFLASVEAEAIAQVQRLRHHPSIVLWCGNNEDYQIAQTFNAYDHSFQGDFTKTSFPAREIYERLLPKVCASYDPTTI
YWPGSPYGGADVYDKTRGDRHTWDVWHSAMAPYQDYPKYEGRFVSEFGMESCAALPTLLSVIPEHERYPQSRTVEHHNKS
EGGARRLAVYLNDTLRFENTLESYVYATQLMQAEALAAAYRGWRRRWGGAGRYAVAGALVWQLNDCWPVISWAIIDSALR
KKPAIYSIGRELAPISAGLQRNGATIEAWVVNGTIESKSATIQLTGYDLHGRLLFEQNIEYELAANQANPIPSPNLNLPE
QSVVGMQVLVDGVVVARASAWPEPFKYLPAYDPQISVTRMADDWLEISSQHPAKGVWLQTEAEINWSDNLLDLLPNQPQR
IQACGLGQQPIDIKWLHWDQART
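
The gene and protein lengths agree: 2,472 bases are 824 codons, that is 823 residues plus the terminal TAG stop codon. The sketch below translates short stretches of the gene sequence with the standard genetic code to illustrate the correspondence. It assumes the gene sequence shown in this record is already the coding strand (it begins with ATG even though the gene lies on the reverse strand); the helper names are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCAAAAGCTCGCATTGCAGCAG"))  # MQKLALQQ (first 8 codons)
print(translate("GCAAGAACATAG"))              # ART (last 3 codons, then stop)
```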

Sequences:

>Translated_823_residues
MQKLALQQAWQAKQRDPQRTVLADSTSSEGWIAAPVPGTIYEALIAAERIPDPFDGLNELAVQWVAEVDWLYRCDFELTA
EQANQPAALHFAGLDTIATVWINGQEILNSDNMFVPQRVVVSNQIHVGANQLLIEFRSALKHGHALQAEMGQLGVWNGDP
SRLYLRKAQYHYGWDWGPALLTAGPWLPVTLELGATRLSDLACPISVADDCSTAIFAVTATVADVQADTAVLIQLWNPAG
ELIAEHQQLVVAGNIQHSITVDQPSLWWPHGYGQQHRYRLAVKVFANQTVLDQQELQLGVRRVRLVQEPLLDEAGETFLF
EINNVPMFSGGANWIPADLLTNRVSNEHYRRLLQAAVDSHMLMIRIWGGGIYEVDHFYDLCDQLGLLVWQDFMFACGMYP
AHPAFLASVEAEAIAQVQRLRHHPSIVLWCGNNEDYQIAQTFNAYDHSFQGDFTKTSFPAREIYERLLPKVCASYDPTTI
YWPGSPYGGADVYDKTRGDRHTWDVWHSAMAPYQDYPKYEGRFVSEFGMESCAALPTLLSVIPEHERYPQSRTVEHHNKS
EGGARRLAVYLNDTLRFENTLESYVYATQLMQAEALAAAYRGWRRRWGGAGRYAVAGALVWQLNDCWPVISWAIIDSALR
KKPAIYSIGRELAPISAGLQRNGATIEAWVVNGTIESKSATIQLTGYDLHGRLLFEQNIEYELAANQANPIPSPNLNLPE
QSVVGMQVLVDGVVVARASAWPEPFKYLPAYDPQISVTRMADDWLEISSQHPAKGVWLQTEAEINWSDNLLDLLPNQPQR
IQACGLGQQPIDIKWLHWDQART
>Mature_823_residues
MQKLALQQAWQAKQRDPQRTVLADSTSSEGWIAAPVPGTIYEALIAAERIPDPFDGLNELAVQWVAEVDWLYRCDFELTA
EQANQPAALHFAGLDTIATVWINGQEILNSDNMFVPQRVVVSNQIHVGANQLLIEFRSALKHGHALQAEMGQLGVWNGDP
SRLYLRKAQYHYGWDWGPALLTAGPWLPVTLELGATRLSDLACPISVADDCSTAIFAVTATVADVQADTAVLIQLWNPAG
ELIAEHQQLVVAGNIQHSITVDQPSLWWPHGYGQQHRYRLAVKVFANQTVLDQQELQLGVRRVRLVQEPLLDEAGETFLF
EINNVPMFSGGANWIPADLLTNRVSNEHYRRLLQAAVDSHMLMIRIWGGGIYEVDHFYDLCDQLGLLVWQDFMFACGMYP
AHPAFLASVEAEAIAQVQRLRHHPSIVLWCGNNEDYQIAQTFNAYDHSFQGDFTKTSFPAREIYERLLPKVCASYDPTTI
YWPGSPYGGADVYDKTRGDRHTWDVWHSAMAPYQDYPKYEGRFVSEFGMESCAALPTLLSVIPEHERYPQSRTVEHHNKS
EGGARRLAVYLNDTLRFENTLESYVYATQLMQAEALAAAYRGWRRRWGGAGRYAVAGALVWQLNDCWPVISWAIIDSALR
KKPAIYSIGRELAPISAGLQRNGATIEAWVVNGTIESKSATIQLTGYDLHGRLLFEQNIEYELAANQANPIPSPNLNLPE
QSVVGMQVLVDGVVVARASAWPEPFKYLPAYDPQISVTRMADDWLEISSQHPAKGVWLQTEAEINWSDNLLDLLPNQPQR
IQACGLGQQPIDIKWLHWDQART

Specific function: Hydrolyzes chitosan and chitooligosaccharides with retention of anomeric configuration. Has maximum activity on chitotetraose, chitopentaose and their corresponding alcohols, with a slight decrease in the rate of hydrolysis on longer chains. Has no activi

COG id: COG3250

COG function: function code G; Beta-galactosidase/beta-glucuronidase

Gene ontology:

Cell location: Secreted, extracellular space [H]

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Contains 1 CBM6 (carbohydrate binding type-6) domain [H]

Homologues:

Organism=Homo sapiens, GI84798622, Length=673, Percent_Identity=31.0549777117385, Blast_Score=290, Evalue=5e-78
Organism=Caenorhabditis elegans, GI17550784, Length=691, Percent_Identity=28.3646888567294, Blast_Score=260, Evalue=3e-69
Organism=Drosophila melanogaster, GI24643838, Length=689, Percent_Identity=30.4789550072569, Blast_Score=242, Evalue=1e-63
Organism=Drosophila melanogaster, GI24643840, Length=558, Percent_Identity=31.8996415770609, Blast_Score=216, Evalue=6e-56
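
Each homologue line is a comma-separated list of key=value fields, with the GI number given as a bare "GI..." token. A minimal parsing sketch follows; the function name and the choice to store the GI number under a "GI" key are assumptions made for illustration.

```python
def parse_hit(line: str) -> dict:
    """Parse one homologue line into a dict of string fields."""
    record = {}
    for part in line.strip().rstrip(",").split(","):
        part = part.strip()
        if "=" in part:
            key, value = part.split("=", 1)
            record[key] = value
        elif part.startswith("GI"):
            # The GI number has no "=" in this record's layout.
            record["GI"] = part[2:]
    return record


line = ("Organism=Homo sapiens, GI84798622, Length=673, "
        "Percent_Identity=31.0549777117385, Blast_Score=290, Evalue=5e-78,")
hit = parse_hit(line)
print(hit["Organism"], hit["GI"], round(float(hit["Percent_Identity"]), 1))
```

Values are kept as strings so that nothing is lost in conversion; cast with int() or float() where a number is needed.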

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR005084
- InterPro:   IPR008979
- InterPro:   IPR013812
- InterPro:   IPR006104
- InterPro:   IPR006102
- InterPro:   IPR017853
- InterPro:   IPR013781 [H]

Pfam domain/function: PF03422 CBM_6; PF00703 Glyco_hydro_2; PF02837 Glyco_hydro_2_N [H]

EC number: 3.2.1.165 [H]

Molecular weight: Translated: 92458; Mature: 92458
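
A protein's average molecular weight can be estimated as the sum of its residue masses plus one water molecule. The sketch below uses commonly tabulated average residue masses in daltons. These values and the function name are assumptions introduced here, and the record does not state which method produced its figure of 92458, so small differences are to be expected.

```python
# Commonly tabulated average residue masses (Da); an assumption, not from the record.
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water is added for the free N- and C-termini


def molecular_weight(protein: str) -> float:
    """Approximate average molecular weight of an unmodified peptide, in Da."""
    return sum(AVERAGE_RESIDUE_MASS[aa] for aa in protein.upper()) + WATER


print(round(molecular_weight("GG"), 2))  # 132.12 (glycylglycine)
```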

Theoretical pI: Translated: 4.99; Mature: 4.99

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.2 %Cys     (Translated Protein)
1.6 %Met     (Translated Protein)
2.8 %Cys+Met (Translated Protein)
1.2 %Cys     (Mature Protein)
1.6 %Met     (Mature Protein)
2.8 %Cys+Met (Mature Protein)
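
These percentages are residue counts divided by the protein length. For 823 residues, 1.2 % and 1.6 % correspond to roughly 10 Cys and 13 Met. A minimal sketch of the calculation, with an illustrative function name and toy inputs rather than the full protein:

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    protein = protein.upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in residues.upper())
    return round(100 * count / len(protein), 1)


# First 10 residues of the protein in this record: one Met, no Cys.
print(residue_percent("MQKLALQQAW", "M"))   # 10.0
print(residue_percent("MQKLALQQAW", "C"))   # 0.0
print(residue_percent("MQKLALQQAW", "CM"))  # 10.0
```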

Secondary structure:

>Translated Secondary Structure
MQKLALQQAWQAKQRDPQRTVLADSTSSEGWIAAPVPGTIYEALIAAERIPDPFDGLNEL
CCHHHHHHHHHHHHCCCCCEEEECCCCCCCEEEECCCHHHHHHHHHHHHCCCCCCHHHHH
AVQWVAEVDWLYRCDFELTAEQANQPAALHFAGLDTIATVWINGQEILNSDNMFVPQRVV
HHHHHHHCCEEEECCCEEEHHHCCCCCEEEEECCCEEEEEEECCHHHHCCCCCCCCCEEE
VSNQIHVGANQLLIEFRSALKHGHALQAEMGQLGVWNGDPSRLYLRKAQYHYGWDWGPAL
ECCCEECCHHHHHHHHHHHHHCCCCHHHHHCCCEECCCCCCEEEEEECCCCCCCCCCHHH
LTAGPWLPVTLELGATRLSDLACPISVADDCSTAIFAVTATVADVQADTAVLIQLWNPAG
HCCCCCCEEEEECCCHHHHHCCCCCEECCCCCHHHHHHHHHHHHCCCCCEEEEEEECCHH
ELIAEHQQLVVAGNIQHSITVDQPSLWWPHGYGQQHRYRLAVKVFANQTVLDQQELQLGV
HHHHCCCEEEEEECCEEEEEECCCCCCCCCCCCCCCEEEEEEEEECCCCCCCHHHHHHHH
RRVRLVQEPLLDEAGETFLFEINNVPMFSGGANWIPADLLTNRVSNEHYRRLLQAAVDSH
HHHHHHHHHHHHHCCCEEEEEECCCEEECCCCCCCCHHHHHHHCCHHHHHHHHHHHHCCC
MLMIRIWGGGIYEVDHFYDLCDQLGLLVWQDFMFACGMYPAHPAFLASVEAEAIAQVQRL
EEEEEEECCCEEEHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHH
RHHPSIVLWCGNNEDYQIAQTFNAYDHSFQGDFTKTSFPAREIYERLLPKVCASYDPTTI
HCCCCEEEEECCCCCEEEEEHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCEE
YWPGSPYGGADVYDKTRGDRHTWDVWHSAMAPYQDYPKYEGRFVSEFGMESCAALPTLLS
EECCCCCCCCCCCCCCCCCCCHHHHHHHHCCCHHCCCCCCCEEHHHHCHHHHHHHHHHHH
VIPEHERYPQSRTVEHHNKSEGGARRLAVYLNDTLRFENTLESYVYATQLMQAEALAAAY
HCCCHHHCCCCCCCCCCCCCCCCCEEEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHH
RGWRRRWGGAGRYAVAGALVWQLNDCWPVISWAIIDSALRKKPAIYSIGRELAPISAGLQ
HHHHHHCCCCCCHHHHEEEEEEECCCHHHHHHHHHHHHHHCCCCHHHHCCHHHHHHHCCC
RNGATIEAWVVNGTIESKSATIQLTGYDLHGRLLFEQNIEYELAANQANPIPSPNLNLPE
CCCCEEEEEEEECEEECCCEEEEEEEEEECEEEEEECCCCEEEECCCCCCCCCCCCCCCH
QSVVGMQVLVDGVVVARASAWPEPFKYLPAYDPQISVTRMADDWLEISSQHPAKGVWLQT
HHHHHHHHHHHHHHEEECCCCCCCHHCCCCCCCCEEEEEHHHHHHHHCCCCCCCCEEEEE
EAEINWSDNLLDLLPNQPQRIQACGLGQQPIDIKWLHWDQART
ECCCCCCCCHHHHCCCCCCHHHHCCCCCCCCEEEEEECCCCCC
>Mature Secondary Structure
MQKLALQQAWQAKQRDPQRTVLADSTSSEGWIAAPVPGTIYEALIAAERIPDPFDGLNEL
CCHHHHHHHHHHHHCCCCCEEEECCCCCCCEEEECCCHHHHHHHHHHHHCCCCCCHHHHH
AVQWVAEVDWLYRCDFELTAEQANQPAALHFAGLDTIATVWINGQEILNSDNMFVPQRVV
HHHHHHHCCEEEECCCEEEHHHCCCCCEEEEECCCEEEEEEECCHHHHCCCCCCCCCEEE
VSNQIHVGANQLLIEFRSALKHGHALQAEMGQLGVWNGDPSRLYLRKAQYHYGWDWGPAL
ECCCEECCHHHHHHHHHHHHHCCCCHHHHHCCCEECCCCCCEEEEEECCCCCCCCCCHHH
LTAGPWLPVTLELGATRLSDLACPISVADDCSTAIFAVTATVADVQADTAVLIQLWNPAG
HCCCCCCEEEEECCCHHHHHCCCCCEECCCCCHHHHHHHHHHHHCCCCCEEEEEEECCHH
ELIAEHQQLVVAGNIQHSITVDQPSLWWPHGYGQQHRYRLAVKVFANQTVLDQQELQLGV
HHHHCCCEEEEEECCEEEEEECCCCCCCCCCCCCCCEEEEEEEEECCCCCCCHHHHHHHH
RRVRLVQEPLLDEAGETFLFEINNVPMFSGGANWIPADLLTNRVSNEHYRRLLQAAVDSH
HHHHHHHHHHHHHCCCEEEEEECCCEEECCCCCCCCHHHHHHHCCHHHHHHHHHHHHCCC
MLMIRIWGGGIYEVDHFYDLCDQLGLLVWQDFMFACGMYPAHPAFLASVEAEAIAQVQRL
EEEEEEECCCEEEHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHH
RHHPSIVLWCGNNEDYQIAQTFNAYDHSFQGDFTKTSFPAREIYERLLPKVCASYDPTTI
HCCCCEEEEECCCCCEEEEEHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCEE
YWPGSPYGGADVYDKTRGDRHTWDVWHSAMAPYQDYPKYEGRFVSEFGMESCAALPTLLS
EECCCCCCCCCCCCCCCCCCCHHHHHHHHCCCHHCCCCCCCEEHHHHCHHHHHHHHHHHH
VIPEHERYPQSRTVEHHNKSEGGARRLAVYLNDTLRFENTLESYVYATQLMQAEALAAAY
HCCCHHHCCCCCCCCCCCCCCCCCEEEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHH
RGWRRRWGGAGRYAVAGALVWQLNDCWPVISWAIIDSALRKKPAIYSIGRELAPISAGLQ
HHHHHHCCCCCCHHHHEEEEEEECCCHHHHHHHHHHHHHHCCCCHHHHCCHHHHHHHCCC
RNGATIEAWVVNGTIESKSATIQLTGYDLHGRLLFEQNIEYELAANQANPIPSPNLNLPE
CCCCEEEEEEEECEEECCCEEEEEEEEEECEEEEEECCCCEEEECCCCCCCCCCCCCCCH
QSVVGMQVLVDGVVVARASAWPEPFKYLPAYDPQISVTRMADDWLEISSQHPAKGVWLQT
HHHHHHHHHHHHHHEEECCCCCCCHHCCCCCCCCEEEEEHHHHHHHHCCCCCCCCEEEEE
EAEINWSDNLLDLLPNQPQRIQACGLGQQPIDIKWLHWDQART
ECCCCCCCCHHHHCCCCCCHHHHCCCCCCCCEEEEEECCCCCC
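
The prediction strings above pair each residue with a one-letter state. Assuming the conventional meaning of the letters (H = helix, E = strand, C = coil), which the record does not spell out, the overall composition can be summarized as below. The function name and the toy input are illustrative.

```python
from collections import Counter


def ss_composition(ss: str) -> dict:
    """Percentage of H, E and C states in a secondary-structure string."""
    if not ss:
        raise ValueError("empty string")
    counts = Counter(ss.upper())
    return {state: round(100 * counts[state] / len(ss), 1) for state in "HEC"}


# Toy string: 4 helix, 2 strand, 4 coil positions.
print(ss_composition("HHHHEECCCC"))  # {'H': 40.0, 'E': 20.0, 'C': 40.0}
```

To apply this to the record, concatenate only the state lines (every second line of each block) and skip the residue lines.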

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA