| Definition | Vibrio cholerae M66-2 chromosome I, complete genome. |
|---|---|
| Accession | NC_012578 |
| Length | 2,892,523 |
The map label for this gene is chb-1 [H]
Identifier: 227080794
GI number: 227080794
Start: 604322
End: 606235
Strand: Reverse
Name: chb-1 [H]
Synonym: VCM66_0571
Alternate gene names: 227080794
Gene position: 606235-604322 (Counterclockwise)
Preceding gene: 227080795
Following gene: 227080793
Centisome position: 20.96
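The coordinate fields above are linked by simple arithmetic. The following is a minimal sketch with illustrative function names. It assumes the centisome is taken from the higher coordinate (the 5' end of a reverse-strand gene); that convention is inferred from the reported numbers and is not stated by the record.

```python
# Values copied from the record above.
GENOME_LENGTH = 2_892_523       # chromosome I length in bp
START, END = 604_322, 606_235   # gene coordinates (reverse strand)

def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive, 1-based coordinate interval."""
    return end - start + 1

def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the genome length."""
    return round(100.0 * position / genome_length, 2)

print(gene_length(START, END))          # 1914
print(centisome(END, GENOME_LENGTH))    # 20.96
print(centisome(START, GENOME_LENGTH))  # 20.89
```

Only the higher coordinate reproduces the reported centisome of 20.96, which is why the sketch uses it.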
GC content: 50.21
Gene sequence:
>1914_bases ATGAGTTATCGAATTGAATTTGCGGTGCTCTCGGAACAAAAACCGGATTGCCGTTTTGGTTTAACCCTGCACAATTTGAG CGATCAAGATCTGCATGATTGGTCGCTGTATTTTGTGATTGATCGTTACATCCAACCCATGAGTGTGACCAACGGTCAAC TGACCCAAGTCGGCAGCTTATGTTCGATTGTTCCAACGGAAAAAGTGTTGCAAGCTAATGGCCACTTTTATTGTGAGTTC ATCATCAAAACCGCGCCTTACCATTTCTACACCGATGGGGTAAAGCACGCGTTTGTCCAACTTAATGATAAACAGCCTGT TGAACGTATTAACGTCGCGGTTAACCCTATCGTTCTCGCGTCACCATTTCGCGAGCGTAGCCAGATCCCTGAAGTGACTG CGGCTGAGCTCTGTCTCATCCCCAAACCTAACTCACTGCAACGTTTCCAAGGTGAGTTTGTGGTCAACCACTCCAGCCAG ATCTCGCTGCAATCGGACTCGGCCGCGCGTGCTGCACGCTGGTTAGAGCAAGAATTGCATGCACTGCATGAGTTCAAACT GAATACGGTTGGCCATAGCGATATCGTCTACCGCAGTAATCCCACGCTCGATGAGGGCCATTACCAACTCAATATCGAAG CGCAAGGGATCAAGATTGAAGCAGGCAGCCACAGTGGCTTTATGCATGCCAGCGCGACTTTGCTGCAACTGGCGCAAGCG CATCAAGGCTCATTGCGCTTTCCTCTGGTCAACATTGTCGATGCACCGCGCTTTAAGTATCGCGGTATGATGCTCGATTG CGCCCGCCATTTTCACTCGCTTGAGCAGGTCAAACGAGTGATCAATCAACTGGCACACTACAAATTTAACGTGTTCCACT GGCATCTGACTGATGATGAAGGTTGGCGTATTGAGATTAAACGCCTGCCGCAACTGACAGACATTGGTGCATGGCGTGGC ATGGATGAAGTGCTGGAACCTCAGTACAGCTTACTCACGGAGCGTCATGGCGGTTTCTATACCCAAGATGAGATCCGTGC AGTGATTGAGTACGCCAGCGATCGTGGCATTACTGTCATCCCTGAAATTGACGTACCAGGGCACAGCCGCGCCGCGATTA AAGCGCTGCCGGCATGGCTGGTCGATGAGGAAGATTGCTCGCAATATCGCAGTATTCAGTACTACAACGACAACGTGCTC TCCCCTGCACTGCCGGGCACTTATCAATTCCTCGACATCGTATTGGAAGAAGTGGCTGCGCTGTTTCCAAGCCAATTTAT TCATATCGGTGCCGATGAAGTCCCACACGGTGTGTGGGTAGATAGCCCGAAATGCCAAGCCTTAATGCAAGAGCAAGGCT ATACCGACCCGAAAGAGCTGCAAGGCCACTTACTGCGCTACGCCGAGAAAAAACTCAAGAGCTTGGGTAAGCGTATGGTC GGCTGGGAAGAAGCTCATCACGGCGACAAAGTGAGTAAAGATACGGTGATTTACTCTTGGCTATCGGAAAAAGCCGCCTT GGATTGCGCCAAACAAGGCTTTGACGTGATTTTGCAGCCGGGACAATTTACCTATCTCGATATCGTTCAAGACTACGCTC CTGAAGAACCGGGCGTGGATTGGGCGGGTGTTACTCCGCTAGAGCGTGCTTACGGTTATGAACCGTTAGCCGACGTTCCG GCCAATGACCCACTGCGTAAACGCATTTTAGGTATTCAATGCGCCTTGTGGTGTGAATTGATCAATAACTCAGAACGCAT GGAATACATGCTCTATCCACGTCTCACGGCATTAGCAGAAGGCGGTTGGACAGAGAAATCCCAGCGTGACTGGTTGGATT ATCTAGCGCGTTTGAAAGGCCATTTACCACTGCTGGATAAGCAGAAAATACCTTATCGCGCGCCTTGGAAGTAA
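The GC content field is the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows; the function name is hypothetical and the input is a toy string, not the gene above.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return round(100.0 * gc / len(bases), 2)

# Toy input: 4 of the 8 bases are G or C.
print(gc_content("ATGC GCTA"))  # 50.0
```

Whitespace is stripped first because the sequence in this record is broken into space-separated blocks.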
Upstream 100 bases:
>100_bases CACCGGTACAGCAGTGGATTGTCAAACCGCAATCGGATGCTATCGAAGGTGCATTAATGTTTGCCGGTAAGCCTGAGCAC AACCTGTATAAGGATGGTTT
Downstream 100 bases:
>100_bases TTCTGCCTTGAAAGTAATTCTTCCGGAAATCGTAACGACGGCGTGACGGAACTCACGCTGGTTTTCACTAGCAAAATTTT TAGCTGCCATAGGGCAGCTG
Product: beta-N-acetylhexosaminidase
Products: NA
Alternate protein names: Beta-N-acetylhexosaminidase; N-acetyl-beta-glucosaminidase [H]
Number of amino acids: Translated: 637; Mature: 636
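The two residue counts follow from the gene length: 1914 bases are 638 codons, the last of which (TAA) is a stop codon, and the mature form lacks the initiator methionine, as the Mature_636_residues entry shows. A small sketch, with a hypothetical helper name:

```python
GENE_BASES = 1914  # from the ">1914_bases" header in the record

def translated_length(n_bases: int) -> int:
    """Residues encoded by an ORF whose final codon is a stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    return n_bases // 3 - 1  # drop the terminal stop codon

translated = translated_length(GENE_BASES)
mature = translated - 1  # initiator Met removed in the mature form
print(translated, mature)  # 637 636
```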
Protein sequence:
>637_residues MSYRIEFAVLSEQKPDCRFGLTLHNLSDQDLHDWSLYFVIDRYIQPMSVTNGQLTQVGSLCSIVPTEKVLQANGHFYCEF IIKTAPYHFYTDGVKHAFVQLNDKQPVERINVAVNPIVLASPFRERSQIPEVTAAELCLIPKPNSLQRFQGEFVVNHSSQ ISLQSDSAARAARWLEQELHALHEFKLNTVGHSDIVYRSNPTLDEGHYQLNIEAQGIKIEAGSHSGFMHASATLLQLAQA HQGSLRFPLVNIVDAPRFKYRGMMLDCARHFHSLEQVKRVINQLAHYKFNVFHWHLTDDEGWRIEIKRLPQLTDIGAWRG MDEVLEPQYSLLTERHGGFYTQDEIRAVIEYASDRGITVIPEIDVPGHSRAAIKALPAWLVDEEDCSQYRSIQYYNDNVL SPALPGTYQFLDIVLEEVAALFPSQFIHIGADEVPHGVWVDSPKCQALMQEQGYTDPKELQGHLLRYAEKKLKSLGKRMV GWEEAHHGDKVSKDTVIYSWLSEKAALDCAKQGFDVILQPGQFTYLDIVQDYAPEEPGVDWAGVTPLERAYGYEPLADVP ANDPLRKRILGIQCALWCELINNSERMEYMLYPRLTALAEGGWTEKSQRDWLDYLARLKGHLPLLDKQKIPYRAPWK
Sequences:
>Translated_637_residues MSYRIEFAVLSEQKPDCRFGLTLHNLSDQDLHDWSLYFVIDRYIQPMSVTNGQLTQVGSLCSIVPTEKVLQANGHFYCEF IIKTAPYHFYTDGVKHAFVQLNDKQPVERINVAVNPIVLASPFRERSQIPEVTAAELCLIPKPNSLQRFQGEFVVNHSSQ ISLQSDSAARAARWLEQELHALHEFKLNTVGHSDIVYRSNPTLDEGHYQLNIEAQGIKIEAGSHSGFMHASATLLQLAQA HQGSLRFPLVNIVDAPRFKYRGMMLDCARHFHSLEQVKRVINQLAHYKFNVFHWHLTDDEGWRIEIKRLPQLTDIGAWRG MDEVLEPQYSLLTERHGGFYTQDEIRAVIEYASDRGITVIPEIDVPGHSRAAIKALPAWLVDEEDCSQYRSIQYYNDNVL SPALPGTYQFLDIVLEEVAALFPSQFIHIGADEVPHGVWVDSPKCQALMQEQGYTDPKELQGHLLRYAEKKLKSLGKRMV GWEEAHHGDKVSKDTVIYSWLSEKAALDCAKQGFDVILQPGQFTYLDIVQDYAPEEPGVDWAGVTPLERAYGYEPLADVP ANDPLRKRILGIQCALWCELINNSERMEYMLYPRLTALAEGGWTEKSQRDWLDYLARLKGHLPLLDKQKIPYRAPWK >Mature_636_residues SYRIEFAVLSEQKPDCRFGLTLHNLSDQDLHDWSLYFVIDRYIQPMSVTNGQLTQVGSLCSIVPTEKVLQANGHFYCEFI IKTAPYHFYTDGVKHAFVQLNDKQPVERINVAVNPIVLASPFRERSQIPEVTAAELCLIPKPNSLQRFQGEFVVNHSSQI SLQSDSAARAARWLEQELHALHEFKLNTVGHSDIVYRSNPTLDEGHYQLNIEAQGIKIEAGSHSGFMHASATLLQLAQAH QGSLRFPLVNIVDAPRFKYRGMMLDCARHFHSLEQVKRVINQLAHYKFNVFHWHLTDDEGWRIEIKRLPQLTDIGAWRGM DEVLEPQYSLLTERHGGFYTQDEIRAVIEYASDRGITVIPEIDVPGHSRAAIKALPAWLVDEEDCSQYRSIQYYNDNVLS PALPGTYQFLDIVLEEVAALFPSQFIHIGADEVPHGVWVDSPKCQALMQEQGYTDPKELQGHLLRYAEKKLKSLGKRMVG WEEAHHGDKVSKDTVIYSWLSEKAALDCAKQGFDVILQPGQFTYLDIVQDYAPEEPGVDWAGVTPLERAYGYEPLADVPA NDPLRKRILGIQCALWCELINNSERMEYMLYPRLTALAEGGWTEKSQRDWLDYLARLKGHLPLLDKQKIPYRAPWK
Specific function: Rapidly hydrolyzes p-nitrophenyl-N-acetyl-beta-D-glucosaminide (PNP-beta-GlcNAc) and 4-methylumbelliferyl-beta-GlcNAc, and is slightly active on p-nitrophenyl-beta-GalNAc. Hydrolyzes aryl-N-acetyl-beta-D-glucosaminide (aryl-beta-GlcNAc), aryl-beta-GalNAc a
COG id: COG3525
COG function: function code G; N-acetyl-beta-hexosaminidase
Gene ontology:
Cell location: Periplasm [H]
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: Belongs to the glycosyl hydrolase 20 family [H]
Homologues:
- Organism=Homo sapiens, GI4504373, Length=428, Percent_Identity=29.4392523364486, Blast_Score=177, Evalue=3e-44
- Organism=Homo sapiens, GI189181666, Length=420, Percent_Identity=28.3333333333333, Blast_Score=161, Evalue=2e-39
- Organism=Caenorhabditis elegans, GI17569815, Length=422, Percent_Identity=28.1990521327014, Blast_Score=159, Evalue=3e-39
- Organism=Drosophila melanogaster, GI45551090, Length=431, Percent_Identity=28.0742459396752, Blast_Score=130, Evalue=3e-30
- Organism=Drosophila melanogaster, GI24653074, Length=431, Percent_Identity=28.0742459396752, Blast_Score=130, Evalue=3e-30
- Organism=Drosophila melanogaster, GI281365639, Length=471, Percent_Identity=25.6900212314225, Blast_Score=116, Evalue=4e-26
- Organism=Drosophila melanogaster, GI24657474, Length=471, Percent_Identity=25.6900212314225, Blast_Score=116, Evalue=4e-26
- Organism=Drosophila melanogaster, GI24657468, Length=471, Percent_Identity=25.6900212314225, Blast_Score=116, Evalue=5e-26
- Organism=Drosophila melanogaster, GI17647501, Length=471, Percent_Identity=25.6900212314225, Blast_Score=116, Evalue=5e-26
- Organism=Drosophila melanogaster, GI17933586, Length=451, Percent_Identity=24.390243902439, Blast_Score=105, Evalue=7e-23
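Each homologue entry is a run of `key=value` fields, so the list can be read into structured records. The following is a minimal sketch, assuming the field order shown in this record; the pattern and function names are hypothetical, and the sample holds two entries copied from the list.

```python
import re

# One homologue entry; field order is assumed to match this record.
HOMOLOGUE = re.compile(
    r"Organism=(?P<organism>[^,]+), GI(?P<gi>\d+), Length=(?P<length>\d+), "
    r"Percent_Identity=(?P<identity>[\d.]+), Blast_Score=(?P<score>\d+), "
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)

def parse_homologues(text: str) -> list:
    """Return one dict per homologue entry found in the text."""
    hits = []
    for m in HOMOLOGUE.finditer(text):
        hits.append({
            "organism": m["organism"],
            "gi": m["gi"],
            "length": int(m["length"]),
            "identity": float(m["identity"]),
            "score": int(m["score"]),
            "evalue": float(m["evalue"]),
        })
    return hits

sample = (
    "Organism=Homo sapiens, GI4504373, Length=428, "
    "Percent_Identity=29.4392523364486, Blast_Score=177, Evalue=3e-44, "
    "Organism=Caenorhabditis elegans, GI17569815, Length=422, "
    "Percent_Identity=28.1990521327014, Blast_Score=159, Evalue=3e-39,"
)
hits = parse_homologues(sample)
print([(h["organism"], round(h["identity"], 1)) for h in hits])
# [('Homo sapiens', 29.4), ('Caenorhabditis elegans', 28.2)]
```

Parsed this way, the entries can be sorted or filtered by E-value or percent identity without manual reading.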
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR015882
- InterPro: IPR001540
- InterPro: IPR015883
- InterPro: IPR017853
- InterPro: IPR013781 [H]
Pfam domain/function: PF00728 Glyco_hydro_20; PF02838 Glyco_hydro_20b [H]
EC number: 3.2.1.52 [H]
Molecular weight: Translated: 72764; Mature: 72633
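The translated and mature weights differ by 131 Da, which matches the average mass of one methionine residue and is consistent with removal of the initiator Met. A rough sketch of an average-mass calculation follows. The residue mass table holds commonly tabulated average values and is an assumption here, not something taken from this record, so results for the full protein may differ slightly from the figures above.

```python
# Average residue masses in Da (amino acid minus one water); assumed table.
RESIDUE_MASS = {
    "A": 71.0788, "R": 156.1875, "N": 114.1038, "D": 115.0886,
    "C": 103.1388, "E": 129.1155, "Q": 128.1307, "G": 57.0519,
    "H": 137.1411, "I": 113.1594, "L": 113.1594, "K": 128.1741,
    "M": 131.1926, "F": 147.1766, "P": 97.1167, "S": 87.0782,
    "T": 101.1051, "W": 186.2132, "Y": 163.1760, "V": 99.1326,
}
WATER = 18.01528  # one water is added back for the free chain termini

def molecular_weight(protein: str) -> float:
    """Average molecular weight of an unmodified peptide chain."""
    seq = "".join(protein.split()).upper()
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER

# First three residues of the translated protein, with and without Met.
delta = molecular_weight("MSY") - molecular_weight("SY")
print(round(delta))  # 131
```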
Theoretical pI: Translated: 6.04; Mature: 6.04
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
- 1.6 %Cys (Translated Protein)
- 1.6 %Met (Translated Protein)
- 3.1 %Cys+Met (Translated Protein)
- 1.6 %Cys (Mature Protein)
- 1.4 %Met (Mature Protein)
- 3.0 %Cys+Met (Mature Protein)
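These percentages are residue counts divided by sequence length. The Met value drops in the mature protein while the Cys value does not, which is what removing the initiator Met would produce. A minimal sketch with a hypothetical function name and a toy sequence rather than the protein above:

```python
def percent_content(protein: str, residues: str) -> float:
    """Percentage of the sequence made up of the given residue letters."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    count = sum(seq.count(r) for r in set(residues.upper()))
    return round(100.0 * count / len(seq), 1)

toy = "MCAAAAAAAA"  # toy translated sequence: 1 Met, 1 Cys, 8 Ala
print(percent_content(toy, "C"))      # 10.0
print(percent_content(toy, "CM"))     # 20.0
print(percent_content(toy[1:], "M"))  # 0.0 (initiator Met removed)
```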
Secondary structure:
>Translated Secondary Structure MSYRIEFAVLSEQKPDCRFGLTLHNLSDQDLHDWSLYFVIDRYIQPMSVTNGQLTQVGSL CCEEEEEEEECCCCCCCEEEEEEECCCCCCCCCHHHEEEEHHHCCCCCCCCCCHHHHHHH CSIVPTEKVLQANGHFYCEFIIKTAPYHFYTDGVKHAFVQLNDKQPVERINVAVNPIVLA HHCCCCHHHHHCCCCEEEEEEEECCCCEEECCCCEEEEEEECCCCCHHHHHEEECCEEEE SPFRERSQIPEVTAAELCLIPKPNSLQRFQGEFVVNHSSQISLQSDSAARAARWLEQELH CCCHHHHCCCCCCHHCEEECCCCCHHHHHCCCEEEECCCEEEECCCHHHHHHHHHHHHHH ALHEFKLNTVGHSDIVYRSNPTLDEGHYQLNIEAQGIKIEAGSHSGFMHASATLLQLAQA HHHHHHCCCCCCCCEEECCCCCCCCCEEEEEEEECCEEEECCCCCCCHHHHHHHHHHHHH HQGSLRFPLVNIVDAPRFKYRGMMLDCARHFHSLEQVKRVINQLAHYKFNVFHWHLTDDE CCCCCCCCCCEECCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCEEEEEEEEEECCC GWRIEIKRLPQLTDIGAWRGMDEVLEPQYSLLTERHGGFYTQDEIRAVIEYASDRGITVI CCEEEEEECCCHHCCHHHCCCHHHHCCHHHHHHHHCCCCCCHHHHHHHHHHHCCCCEEEE PEIDVPGHSRAAIKALPAWLVDEEDCSQYRSIQYYNDNVLSPALPGTYQFLDIVLEEVAA ECCCCCCCCCHHHHHHHHHHCCHHHHHHHHCEEEECCCCCCCCCCCHHHHHHHHHHHHHH LFPSQFIHIGADEVPHGVWVDSPKCQALMQEQGYTDPKELQGHLLRYAEKKLKSLGKRMV HCCHHEEECCCCCCCCCEECCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHC GWEEAHHGDKVSKDTVIYSWLSEKAALDCAKQGFDVILQPGQFTYLDIVQDYAPEEPGVD CCHHHCCCCCCCCCCEEHHHHHHHHHHHHHHCCCEEEECCCCEEHHHHHHHHCCCCCCCC WAGVTPLERAYGYEPLADVPANDPLRKRILGIQCALWCELINNSERMEYMLYPRLTALAE CCCCCHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEECCHHHHHHC GGWTEKSQRDWLDYLARLKGHLPLLDKQKIPYRAPWK CCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCC >Mature Secondary Structure SYRIEFAVLSEQKPDCRFGLTLHNLSDQDLHDWSLYFVIDRYIQPMSVTNGQLTQVGSL CEEEEEEEECCCCCCCEEEEEEECCCCCCCCCHHHEEEEHHHCCCCCCCCCCHHHHHHH CSIVPTEKVLQANGHFYCEFIIKTAPYHFYTDGVKHAFVQLNDKQPVERINVAVNPIVLA HHCCCCHHHHHCCCCEEEEEEEECCCCEEECCCCEEEEEEECCCCCHHHHHEEECCEEEE SPFRERSQIPEVTAAELCLIPKPNSLQRFQGEFVVNHSSQISLQSDSAARAARWLEQELH CCCHHHHCCCCCCHHCEEECCCCCHHHHHCCCEEEECCCEEEECCCHHHHHHHHHHHHHH ALHEFKLNTVGHSDIVYRSNPTLDEGHYQLNIEAQGIKIEAGSHSGFMHASATLLQLAQA HHHHHHCCCCCCCCEEECCCCCCCCCEEEEEEEECCEEEECCCCCCCHHHHHHHHHHHHH HQGSLRFPLVNIVDAPRFKYRGMMLDCARHFHSLEQVKRVINQLAHYKFNVFHWHLTDDE CCCCCCCCCCEECCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCEEEEEEEEEECCC 
GWRIEIKRLPQLTDIGAWRGMDEVLEPQYSLLTERHGGFYTQDEIRAVIEYASDRGITVI CCEEEEEECCCHHCCHHHCCCHHHHCCHHHHHHHHCCCCCCHHHHHHHHHHHCCCCEEEE PEIDVPGHSRAAIKALPAWLVDEEDCSQYRSIQYYNDNVLSPALPGTYQFLDIVLEEVAA ECCCCCCCCCHHHHHHHHHHCCHHHHHHHHCEEEECCCCCCCCCCCHHHHHHHHHHHHHH LFPSQFIHIGADEVPHGVWVDSPKCQALMQEQGYTDPKELQGHLLRYAEKKLKSLGKRMV HCCHHEEECCCCCCCCCEECCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHC GWEEAHHGDKVSKDTVIYSWLSEKAALDCAKQGFDVILQPGQFTYLDIVQDYAPEEPGVD CCHHHCCCCCCCCCCEEHHHHHHHHHHHHHHCCCEEEECCCCEEHHHHHHHHCCCCCCCC WAGVTPLERAYGYEPLADVPANDPLRKRILGIQCALWCELINNSERMEYMLYPRLTALAE CCCCCHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEECCHHHHHHC GGWTEKSQRDWLDYLARLKGHLPLLDKQKIPYRAPWK CCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 8969205 [H]