Definition | Halothermothrix orenii H 168 chromosome, complete genome. |
---|---|
Accession | NC_011899 |
Length | 2,578,146 |
The map label for this gene is yaaH [H]
Identifier: 220933014
GI number: 220933014
Start: 2371686
End: 2372810
Strand: Direct
Name: yaaH [H]
Synonym: Hore_21810
Alternate gene names: 220933014
Gene position: 2371686-2372810 (Clockwise)
Preceding gene: 220933009
Following gene: 220933028
Centisome position: 91.99
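The derived fields above can be recomputed from the coordinates. The sketch below is illustrative only. It assumes that the centisome position is the gene start expressed as a percentage of the chromosome length (2,578,146), and that the gene span includes a terminal stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


def expected_residues(length_bases: int) -> int:
    """Number of codons minus one, assuming the gene ends in a stop codon."""
    return length_bases // 3 - 1


if __name__ == "__main__":
    length = gene_length(2371686, 2372810)
    print(length)                                 # 1125
    print(round(centisome(2371686, 2578146), 2))  # 91.99
    print(expected_residues(length))              # 374
```

Under these assumptions the results agree with the gene length, centisome position and amino acid count given in this record.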
GC content: 41.16%
Gene sequence:
>1125_bases
ATGTTAGTATCTCCGGTTATATACTGTGCTGCAACATCCAGTGAAGAAGGCCCCGGTACTTTTGACTGGTTAAAAGGTAT
TCTGTTATTAATTATTTCTTTTTTTATTAATAACTTTGTTGATAAAAATGAAGAAGATCAGGAAGCAAGACCCGGTGAAT
CGCCTCTGGATGAAGATATTATCTCAAATCGGGAAATACTGGGCTTTTATGTCAACTGGCTAACCCCATATGCTAATTCA
TATGATGCCATGGTTTCTAACCACAGGTATGTTGACATGGTAGCACCCTTCTGGTTTACAGCCAACCCTGATGGTACAAT
CAAGAGTAGATACGGGGGGCACCAGTATGAGGTAGATTCCTTTTCCAAAAGACAGGGTCTTGAATTACTACCTCTGATTA
ATAACAACCAGAAAAATAACATGATCCTGGTTGATTCAGATGTCAGGAGTAAGACGATAAAAAATATAGTTAAGCTGGTG
GAAAAATATAATTATAATGGAGTAAATATTGACTTTGAATTTATTCCACCCTGGACCCGTAATGGTTATACCCAGTTTAT
TAAAGAGCTTTCCAGTGAGTTAAACAAGAAAAACAAAAAACTTACAATCTCCGTTTTTCCTAAAATAGATGTCCCGATGG
AGTTACAGGGAGCCTATGATTATGCAGCCCTGGGAAAACTGGTTGACAGGGTAGTTATCATGACCTATGACCACCACTGG
CCCTCCGGTGACCCCGGACCGATTGCCCCCATAAACTGGGTCGAAAAGAATATTAAATATGCACTGGAATATATACCAAA
TGAGAAACTTCTAATAGGAGTAGCTAACTACGGCTATGACTGGCCTGAGGGGGGACCCGGTAGGCCCATCAGTGCTAAAG
AAGCAATGAACCTGGCCCGGGAAAAGGGCGTTAAAGTTCAATGGGATACACCTTCCCAGAGCCCCTATTTCTATTACCAG
GATAACAGTGGCATTAAACACGAAGTCTGGTTTGAATCAAGTAGTAGCCTTGCCTTCAAACTGGAGCTGGTTAAGAAATA
TAATCTGAAAGGTATAGCCATCTGGCGGCTGGGAAATGGTACTGACCGGTTCTGGGAGATTATAGACAATAAATTAGGTC
AGTGA
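The GC content reported for this gene could be checked against the sequence with a few lines of code. This is a minimal sketch, assuming GC content means the percentage of G and C bases over all bases in the gene. The function name `gc_content` is hypothetical. For a 1125-base gene, a value of 41.16 would correspond to 463 G or C bases.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Short made-up fragment; paste the full gene sequence to check the record.
    print(round(gc_content("ATGTTAGTAT CTCCGG"), 2))  # 43.75
```

Whitespace is stripped first, so the sequence can be pasted with or without line breaks.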
Upstream 100 bases:
>100_bases
TATATTGAAAGGATGTTACTTAAATATGAAACCATCTTATCAACGTAAAAAGTTCATTACTATGCTCCTTCTTGTGTCTA
TTATAACTCCTTTCTTACTT
Downstream 100 bases:
>100_bases
TACCAGACCTCTTTTAAGAGCAACTATAACAGCCTGGGTTCTGTCATTAACTGATAACTTCCTTAAAATATTGCTGACAT
GATTCTTAACAGTTTTTTCA
Product: chitinase
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 374; Mature: 374
Protein sequence:
>374_residues
MLVSPVIYCAATSSEEGPGTFDWLKGILLLIISFFINNFVDKNEEDQEARPGESPLDEDIISNREILGFYVNWLTPYANS
YDAMVSNHRYVDMVAPFWFTANPDGTIKSRYGGHQYEVDSFSKRQGLELLPLINNNQKNNMILVDSDVRSKTIKNIVKLV
EKYNYNGVNIDFEFIPPWTRNGYTQFIKELSSELNKKNKKLTISVFPKIDVPMELQGAYDYAALGKLVDRVVIMTYDHHW
PSGDPGPIAPINWVEKNIKYALEYIPNEKLLIGVANYGYDWPEGGPGRPISAKEAMNLAREKGVKVQWDTPSQSPYFYYQ
DNSGIKHEVWFESSSSLAFKLELVKKYNLKGIAIWRLGNGTDRFWEIIDNKLGQ
Sequences:
>Translated_374_residues
MLVSPVIYCAATSSEEGPGTFDWLKGILLLIISFFINNFVDKNEEDQEARPGESPLDEDIISNREILGFYVNWLTPYANS
YDAMVSNHRYVDMVAPFWFTANPDGTIKSRYGGHQYEVDSFSKRQGLELLPLINNNQKNNMILVDSDVRSKTIKNIVKLV
EKYNYNGVNIDFEFIPPWTRNGYTQFIKELSSELNKKNKKLTISVFPKIDVPMELQGAYDYAALGKLVDRVVIMTYDHHW
PSGDPGPIAPINWVEKNIKYALEYIPNEKLLIGVANYGYDWPEGGPGRPISAKEAMNLAREKGVKVQWDTPSQSPYFYYQ
DNSGIKHEVWFESSSSLAFKLELVKKYNLKGIAIWRLGNGTDRFWEIIDNKLGQ
>Mature_374_residues
MLVSPVIYCAATSSEEGPGTFDWLKGILLLIISFFINNFVDKNEEDQEARPGESPLDEDIISNREILGFYVNWLTPYANS
YDAMVSNHRYVDMVAPFWFTANPDGTIKSRYGGHQYEVDSFSKRQGLELLPLINNNQKNNMILVDSDVRSKTIKNIVKLV
EKYNYNGVNIDFEFIPPWTRNGYTQFIKELSSELNKKNKKLTISVFPKIDVPMELQGAYDYAALGKLVDRVVIMTYDHHW
PSGDPGPIAPINWVEKNIKYALEYIPNEKLLIGVANYGYDWPEGGPGRPISAKEAMNLAREKGVKVQWDTPSQSPYFYYQ
DNSGIKHEVWFESSSSLAFKLELVKKYNLKGIAIWRLGNGTDRFWEIIDNKLGQ
Specific function: May be required for the L-alanine-stimulated germination pathway [H]
COG id: COG3858
COG function: function code R; Predicted glycosyl hydrolase
Gene ontology:
Cell location: Spore wall (Probable). Note: Probably localized on the surface of the outer spore membrane and/or in the inner spore coat [H]
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: Contains 2 LysM repeats [H]
Homologues:
Organism | GI | Length | Percent identity | BLAST score | E-value |
---|---|---|---|---|---|
Homo sapiens | 218083215 | 325 | 21.2307692307692 | 82 | 6e-16 |
Homo sapiens | 218083182 | 325 | 21.2307692307692 | 82 | 6e-16 |
Homo sapiens | 218083142 | 325 | 21.2307692307692 | 82 | 6e-16 |
Homo sapiens | 218083233 | 325 | 21.2307692307692 | 82 | 7e-16 |
Homo sapiens | 4758092 | 315 | 23.1746031746032 | 75 | 1e-13 |
Homo sapiens | 218083269 | 151 | 26.4900662251656 | 74 | 2e-13 |
Caenorhabditis elegans | 25150970 | 331 | 23.8670694864048 | 74 | 1e-13 |
Caenorhabditis elegans | 71995504 | 317 | 24.2902208201893 | 69 | 3e-12 |
Drosophila melanogaster | 24582726 | 361 | 22.4376731301939 | 72 | 6e-13 |
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR011583
- InterPro: IPR001223
- InterPro: IPR017853
- InterPro: IPR013781
- InterPro: IPR018392
- InterPro: IPR002482 [H]
Pfam domain/function: PF00704 Glyco_hydro_18; PF01476 LysM [H]
EC number: NA
Molecular weight (Da): Translated: 42827; Mature: 42827
Theoretical pI: Translated: 5.31; Mature: 5.31
Prosite motif: PS01095 CHITINASE_18
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.3 %Cys (Translated Protein)
1.9 %Met (Translated Protein)
2.1 %Cys+Met (Translated Protein)
0.3 %Cys (Mature Protein)
1.9 %Met (Mature Protein)
2.1 %Cys+Met (Mature Protein)
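As a cross-check on the composition figures above, the percentages could be recomputed from the protein sequence. This is a minimal sketch, assuming each value is the residue count divided by the total number of residues, rounded to one decimal place. The function name `residue_percent` is hypothetical. For a 374-residue protein, 1 Cys and 7 Met would give the 0.3, 1.9 and 2.1 shown.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of any of the given one-letter residues."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    wanted = set(residues.upper())
    count = sum(1 for aa in seq if aa in wanted)
    return round(100.0 * count / len(seq), 1)


if __name__ == "__main__":
    # Short made-up peptide; paste the full protein sequence to check the record.
    peptide = "MLVSPVIYCA"
    print(residue_percent(peptide, "C"))   # 10.0
    print(residue_percent(peptide, "M"))   # 10.0
    print(residue_percent(peptide, "CM"))  # 20.0
```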
Secondary structure:
>Translated Secondary Structure
MLVSPVIYCAATSSEEGPGTFDWLKGILLLIISFFINNFVDKNEEDQEARPGESPLDEDI
CCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCHHCCCCCCCCCHHH
ISNREILGFYVNWLTPYANSYDAMVSNHRYVDMVAPFWFTANPDGTIKSRYGGHQYEVDS
HCCCCEEEEEEEHHCCCCCCHHHHHCCCEEEEEECCEEEECCCCCCEECCCCCCEEECCC
FSKRQGLELLPLINNNQKNNMILVDSDVRSKTIKNIVKLVEKYNYNGVNIDFEFIPPWTR
CHHHCCCEEEEEECCCCCCCEEEEECCHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCC
NGYTQFIKELSSELNKKNKKLTISVFPKIDVPMELQGAYDYAALGKLVDRVVIMTYDHHW
CCHHHHHHHHHHHHCCCCCEEEEEEECCCCCCEEECCCCCHHHHHHHHHEEEEEEEECCC
PSGDPGPIAPINWVEKNIKYALEYIPNEKLLIGVANYGYDWPEGGPGRPISAKEAMNLAR
CCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHH
EKGVKVQWDTPSQSPYFYYQDNSGIKHEVWFESSSSLAFKLELVKKYNLKGIAIWRLGNG
HCCCEEEECCCCCCCEEEEECCCCCEEEEEECCCCCEEEEEEEEHHCCCCEEEEEEECCC
TDRFWEIIDNKLGQ
HHHHHHHHHHHCCC
>Mature Secondary Structure
MLVSPVIYCAATSSEEGPGTFDWLKGILLLIISFFINNFVDKNEEDQEARPGESPLDEDI
CCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCHHCCCCCCCCCHHH
ISNREILGFYVNWLTPYANSYDAMVSNHRYVDMVAPFWFTANPDGTIKSRYGGHQYEVDS
HCCCCEEEEEEEHHCCCCCCHHHHHCCCEEEEEECCEEEECCCCCCEECCCCCCEEECCC
FSKRQGLELLPLINNNQKNNMILVDSDVRSKTIKNIVKLVEKYNYNGVNIDFEFIPPWTR
CHHHCCCEEEEEECCCCCCCEEEEECCHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCC
NGYTQFIKELSSELNKKNKKLTISVFPKIDVPMELQGAYDYAALGKLVDRVVIMTYDHHW
CCHHHHHHHHHHHHCCCCCEEEEEEECCCCCCEEECCCCCHHHHHHHHHEEEEEEEECCC
PSGDPGPIAPINWVEKNIKYALEYIPNEKLLIGVANYGYDWPEGGPGRPISAKEAMNLAR
CCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHH
EKGVKVQWDTPSQSPYFYYQDNSGIKHEVWFESSSSLAFKLELVKKYNLKGIAIWRLGNG
HCCCEEEECCCCCCCEEEEECCCCCEEEEEECCCCCEEEEEEEEHHCCCCEEEEEEECCC
TDRFWEIIDNKLGQ
HHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 7584024; 9384377; 10419957 [H]