| Definition | Geobacillus thermodenitrificans NG80-2 chromosome, complete genome. |
|---|---|
| Accession | NC_009328 |
| Length | 3,550,319 |
The map label for this gene is malZ [C]
Identifier: 138894288
GI number: 138894288
Start: 681680
End: 683215
Strand: Direct
Name: malZ [C]
Synonym: GTNG_0614
Alternate gene names: 138894288
Gene position: 681680-683215 (Clockwise)
Preceding gene: 138894287
Following gene: 138894289
Centisome position: 19.2
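The coordinate fields above are related by simple arithmetic. The sketch below assumes inclusive 1-based coordinates and assumes that the centisome position is the start coordinate expressed as a percentage of the genome length; the record itself does not state either convention, and the function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 3_550_319   # Length
START, END = 681_680, 683_215  # Start / End


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming inclusive 1-based coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the genome (assumed definition)."""
    return round(100.0 * start / genome_length, 1)


if __name__ == "__main__":
    n = gene_length(START, END)
    print(n)                                 # number of bases in the gene
    print(n // 3)                            # number of codons, stop codon included
    print(centisome(START, GENOME_LENGTH))   # centisome position
```

Under these assumptions the length agrees with the 1536-base gene sequence header, and the 512 codons correspond to 511 residues plus one stop codon.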
GC content: 46.61
Gene sequence:
>1536_bases ATGGGGAACCGGCTTTTTATGCTGTTCATCCTTCCGTTCCTTCTTTTTTATGCCATGCCGGTTGCGGCGGCGGAAAAAGA AGAACGGACGTGGCAAGACGAAGCGATTTATTTTATTATGGTCGATCGTTTTAACAACATGGATTCAACGAACGACCAAG ACGTCAATGTCAATGATCCAAAAGGGTATTTTGGCGGTGACTTAAAAGGGGTGACAGCGAAGCTCGATTATATTAAAGAA ATGGGCTTCACTGCCATTTGGCTGACTCCCATTTTTAAAAACAGGCCGGGCGGCTATCATGGCTATTGGATCGAGGACTT TTACGAAGTCGACCCGCATTTTGGCACGCTTGATGACCTCAAGACGCTCGTCAAAGAGGCGCATAAGCGCGATATGAAAG TGATTTTGGATTTCGTGGCTAACCATGTCGGCTACGACCATCCGTGGCTTCATGACCCGGCGAAAAAAGATTGGTTCCAT CCGAAGAAAGAGATTTTCGACTGGAACAGCCAAGAGCAGGTGGAAAACGGTTGGGTGTATGGACTTCCCGATTTGGCGCA GGAAAATCCGGAAGTAAAAAACTATTTAATTGACGCAGCCAAATGGTGGATTAAAGAAACCGACATTGACGGTTACCGAC TCGACATGGTACGCCATGTGCCGAAATCGTTCTGGCAAGAGTTTGCGAAAGAAGTGAAAGCGGTAAAAAAAGACTTTTTT CTCCTCGGCGAAGTGTGGAGTGATGATCCGCGCTATATCGCTGATTACGGAAAGTATGGCATCGACGGGTTTGTCGATTA CCCGCTGTACGGTGCGGTGAAGCAGTCTCTTGCCAAACGTGATGCATCGCTTCGGCCGCTCTATGATGTATGGGAGTATA ACAAGACATTTTACGACCGCCCGTATTTGCTTGGATCATTTTTGGACAACCATGATAACGTCCGATTTACGAAACTCGTC ATTGATCATCGGAACAATCCGATTTCACGCATGAAAGTAGCGATGACGTATTTGTTCACTGCGCCGGGCATTCCGATTAT GTACTACGGAACAGAAATTGCGATGACTGGAGGTCCTGATCCGGACAATCGCCGCCTGATGGACTTCCGTGCCGATCCAG AAATCATCGATTATTTAAAGAAAGTGGGCCCGCTCCGCCAACAGTTGCCATCATTGCGGCGCGGCGATTTTACGTTGTTG TATGAACAAGACGGCATGGCGGTGTTTAAACGCCAATACAAAGACGAAACGACGGTCATTGCCATTAACAACACGAGTGA AACGAAGCACGTTCATCTGACGAACGAACAGCTGCCGAAAAACAAGGAATTGCGTGGCTTTTTGCTCGATGACCTCGTCC GTGGCGATGAGGATGGCTATGATATTGTCTTGGATCGTGAAACAGCAGAAGTGTACAAATTACGGAACAAAACGGGAGTT AACGTGCCGTTTATCGTTGCAATGGTTGCGGTTTACGCACTATTTATCTTGTTTTTATACATGGTGAAAAAACGAACAAA ACGGACAAATGAATAA
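A GC content figure like the one reported above can be recomputed from the gene sequence. This is a minimal sketch, assuming the value is the percentage of G and C bases over the whole sequence; `clean_sequence` and `gc_content` are hypothetical helper names, and the record's own rounding rules are not documented.

```python
def clean_sequence(record: str) -> str:
    """Drop '>' header tokens and whitespace from a flattened FASTA-like record."""
    tokens = record.split()
    return "".join(t for t in tokens if not t.startswith(">")).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in seq, rounded to two decimals."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return round(100.0 * gc / len(seq), 2)


if __name__ == "__main__":
    # Toy input; paste the full gene sequence line to check the reported value.
    demo = ">8_bases ATGG GGAA"
    print(gc_content(clean_sequence(demo)))
```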
Upstream 100 bases:
>100_bases TTAACAGCGGGTGGCACAAAAGGGTAAGAATCGCTATCATAAATAGTGGGCTTACCCGTTTCCAAACAGAAAAAATAGAC AGTAAAGGAGGGAAACGGAA
Downstream 100 bases:
>100_bases AGAAAGGAGGTAGTATATGACCGTTACGATTAAAGACGTCGCCAAACGAGCCAATGTCGCGCCGTCGACCGTCTCGCGCG TGATTGCCGATAGCCCGCGC
Product: alpha-amylase family protein
Products: NA
Alternate protein names: Beta-amylase; Alpha-amylase [H]
Number of amino acids: Translated: 511; Mature: 510
Protein sequence:
>511_residues MGNRLFMLFILPFLLFYAMPVAAAEKEERTWQDEAIYFIMVDRFNNMDSTNDQDVNVNDPKGYFGGDLKGVTAKLDYIKE MGFTAIWLTPIFKNRPGGYHGYWIEDFYEVDPHFGTLDDLKTLVKEAHKRDMKVILDFVANHVGYDHPWLHDPAKKDWFH PKKEIFDWNSQEQVENGWVYGLPDLAQENPEVKNYLIDAAKWWIKETDIDGYRLDMVRHVPKSFWQEFAKEVKAVKKDFF LLGEVWSDDPRYIADYGKYGIDGFVDYPLYGAVKQSLAKRDASLRPLYDVWEYNKTFYDRPYLLGSFLDNHDNVRFTKLV IDHRNNPISRMKVAMTYLFTAPGIPIMYYGTEIAMTGGPDPDNRRLMDFRADPEIIDYLKKVGPLRQQLPSLRRGDFTLL YEQDGMAVFKRQYKDETTVIAINNTSETKHVHLTNEQLPKNKELRGFLLDDLVRGDEDGYDIVLDRETAEVYKLRNKTGV NVPFIVAMVAVYALFILFLYMVKKRTKRTNE
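The protein sequence above can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the standard genetic code from its compact 64-letter form; for sense codons and stop codons, the bacterial table gives the same amino acids. The function name `translate` and the choice to stop at the first stop codon are assumptions made for illustration.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate dna in frame 1, stopping at the first stop codon."""
    dna = dna.upper().replace(" ", "")
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First five codons and last five codons of the gene sequence above.
    print(translate("ATGGGGAACCGGCTT"))
    print(translate("CGGACAAATGAATAA"))
```

The two fragments correspond to the first and last residues of the translated protein (`MGNRL` and `RTNE`).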
Sequences:
>Translated_511_residues MGNRLFMLFILPFLLFYAMPVAAAEKEERTWQDEAIYFIMVDRFNNMDSTNDQDVNVNDPKGYFGGDLKGVTAKLDYIKE MGFTAIWLTPIFKNRPGGYHGYWIEDFYEVDPHFGTLDDLKTLVKEAHKRDMKVILDFVANHVGYDHPWLHDPAKKDWFH PKKEIFDWNSQEQVENGWVYGLPDLAQENPEVKNYLIDAAKWWIKETDIDGYRLDMVRHVPKSFWQEFAKEVKAVKKDFF LLGEVWSDDPRYIADYGKYGIDGFVDYPLYGAVKQSLAKRDASLRPLYDVWEYNKTFYDRPYLLGSFLDNHDNVRFTKLV IDHRNNPISRMKVAMTYLFTAPGIPIMYYGTEIAMTGGPDPDNRRLMDFRADPEIIDYLKKVGPLRQQLPSLRRGDFTLL YEQDGMAVFKRQYKDETTVIAINNTSETKHVHLTNEQLPKNKELRGFLLDDLVRGDEDGYDIVLDRETAEVYKLRNKTGV NVPFIVAMVAVYALFILFLYMVKKRTKRTNE >Mature_510_residues GNRLFMLFILPFLLFYAMPVAAAEKEERTWQDEAIYFIMVDRFNNMDSTNDQDVNVNDPKGYFGGDLKGVTAKLDYIKEM GFTAIWLTPIFKNRPGGYHGYWIEDFYEVDPHFGTLDDLKTLVKEAHKRDMKVILDFVANHVGYDHPWLHDPAKKDWFHP KKEIFDWNSQEQVENGWVYGLPDLAQENPEVKNYLIDAAKWWIKETDIDGYRLDMVRHVPKSFWQEFAKEVKAVKKDFFL LGEVWSDDPRYIADYGKYGIDGFVDYPLYGAVKQSLAKRDASLRPLYDVWEYNKTFYDRPYLLGSFLDNHDNVRFTKLVI DHRNNPISRMKVAMTYLFTAPGIPIMYYGTEIAMTGGPDPDNRRLMDFRADPEIIDYLKKVGPLRQQLPSLRRGDFTLLY EQDGMAVFKRQYKDETTVIAINNTSETKHVHLTNEQLPKNKELRGFLLDDLVRGDEDGYDIVLDRETAEVYKLRNKTGVN VPFIVAMVAVYALFILFLYMVKKRTKRTNE
Specific function: The precursor protein is proteolytically cleaved to produce multiform beta-amylases and a 48 kDa alpha-amylase after secretion [H]
COG id: COG0366
COG function: function code G; Glycosidases
Gene ontology:
Cell location: Secreted [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: In the C-terminal section; belongs to the glycosyl hydrolase 13 family [H]
Homologues:
Organism=Homo sapiens, GI187423904, Length=215, Percent_Identity=29.3023255813954, Blast_Score=87, Evalue=5e-17
Organism=Escherichia coli, GI1786604, Length=404, Percent_Identity=28.7128712871287, Blast_Score=136, Evalue=3e-33
Organism=Escherichia coli, GI1790687, Length=399, Percent_Identity=27.8195488721804, Blast_Score=110, Evalue=2e-25
Organism=Escherichia coli, GI1789995, Length=133, Percent_Identity=35.3383458646617, Blast_Score=100, Evalue=2e-22
Organism=Caenorhabditis elegans, GI32565753, Length=155, Percent_Identity=29.6774193548387, Blast_Score=71, Evalue=1e-12
Organism=Saccharomyces cerevisiae, GI6322245, Length=218, Percent_Identity=32.1100917431193, Blast_Score=96, Evalue=1e-20
Organism=Saccharomyces cerevisiae, GI6322241, Length=229, Percent_Identity=30.5676855895196, Blast_Score=92, Evalue=2e-19
Organism=Saccharomyces cerevisiae, GI6322021, Length=229, Percent_Identity=30.5676855895196, Blast_Score=92, Evalue=2e-19
Organism=Saccharomyces cerevisiae, GI6324416, Length=229, Percent_Identity=30.5676855895196, Blast_Score=92, Evalue=2e-19
Organism=Saccharomyces cerevisiae, GI6321726, Length=229, Percent_Identity=29.6943231441048, Blast_Score=89, Evalue=2e-18
Organism=Saccharomyces cerevisiae, GI6319776, Length=144, Percent_Identity=37.5, Blast_Score=84, Evalue=4e-17
Organism=Saccharomyces cerevisiae, GI6321731, Length=144, Percent_Identity=37.5, Blast_Score=84, Evalue=4e-17
Organism=Drosophila melanogaster, GI24586587, Length=184, Percent_Identity=35.8695652173913, Blast_Score=103, Evalue=2e-22
Organism=Drosophila melanogaster, GI24586599, Length=225, Percent_Identity=29.7777777777778, Blast_Score=100, Evalue=3e-21
Organism=Drosophila melanogaster, GI221330053, Length=181, Percent_Identity=32.0441988950276, Blast_Score=99, Evalue=5e-21
Organism=Drosophila melanogaster, GI24586593, Length=217, Percent_Identity=32.7188940092166, Blast_Score=99, Evalue=8e-21
Organism=Drosophila melanogaster, GI24583745, Length=218, Percent_Identity=29.8165137614679, Blast_Score=98, Evalue=1e-20
Organism=Drosophila melanogaster, GI45549022, Length=184, Percent_Identity=32.0652173913043, Blast_Score=96, Evalue=6e-20
Organism=Drosophila melanogaster, GI24586597, Length=188, Percent_Identity=31.9148936170213, Blast_Score=95, Evalue=1e-19
Organism=Drosophila melanogaster, GI24586591, Length=190, Percent_Identity=33.6842105263158, Blast_Score=94, Evalue=2e-19
Organism=Drosophila melanogaster, GI24586589, Length=189, Percent_Identity=33.8624338624339, Blast_Score=94, Evalue=3e-19
Organism=Drosophila melanogaster, GI24583747, Length=190, Percent_Identity=32.1052631578947, Blast_Score=91, Evalue=2e-18
Organism=Drosophila melanogaster, GI24583749, Length=190, Percent_Identity=32.1052631578947, Blast_Score=90, Evalue=3e-18
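The homologue entries above follow a regular `key=value` pattern, so they can be loaded into structured records for sorting or filtering. This is a minimal sketch, assuming every hit has exactly the six fields seen in this record; the `Hit` type and `parse_hits` function are hypothetical names, not part of any database tooling.

```python
import re
from typing import List, NamedTuple


class Hit(NamedTuple):
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


# Whitespace (including line breaks) between fields is tolerated.
_HIT_RE = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<pid>[\d.]+),\s*Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_hits(text: str) -> List[Hit]:
    """Extract every homologue hit found in text."""
    return [
        Hit(
            organism=m["organism"].strip(),
            gi=m["gi"],
            length=int(m["length"]),
            percent_identity=float(m["pid"]),
            blast_score=int(m["score"]),
            evalue=float(m["evalue"]),
        )
        for m in _HIT_RE.finditer(text)
    ]


if __name__ == "__main__":
    sample = (
        "Organism=Homo sapiens, GI187423904, Length=215, "
        "Percent_Identity=29.3023255813954, Blast_Score=87, Evalue=5e-17, "
        "Organism=Escherichia coli, GI1786604, Length=404, "
        "Percent_Identity=28.7128712871287, Blast_Score=136, Evalue=3e-33,"
    )
    best = max(parse_hits(sample), key=lambda h: h.blast_score)
    print(best.organism, best.gi, best.blast_score)
```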
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR006048
- InterPro: IPR005085
- InterPro: IPR008985
- InterPro: IPR013780
- InterPro: IPR006047
- InterPro: IPR006589
- InterPro: IPR001554
- InterPro: IPR018238
- InterPro: IPR000125
- InterPro: IPR017853
- InterPro: IPR013781 [H]
Pfam domain/function: PF00128 Alpha-amylase; PF03423 CBM_25; PF01373 Glyco_hydro_14 [H]
EC number: 3.2.1.2; 3.2.1.1 [H]
Molecular weight: Translated: 59894; Mature: 59763
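The gap between the translated and mature molecular weights can be checked with a short calculation. The sketch below assumes that the mature form differs from the translated form only by removal of the initiator methionine, which is consistent with the 511 versus 510 residue counts; the methionine residue mass used (about 131.19 Da, free methionine minus one water) is a standard average value, not a figure taken from this record.

```python
# Values copied from the record above (Da).
TRANSLATED_MW = 59894
MATURE_MW = 59763

# Average mass of a methionine residue within a peptide chain (Da).
MET_RESIDUE_MASS = 131.19


def mass_difference(translated: float, mature: float) -> float:
    """Mass lost when going from the translated to the mature protein."""
    return translated - mature


if __name__ == "__main__":
    diff = mass_difference(TRANSLATED_MW, MATURE_MW)
    print(diff)                                # difference in Da
    print(abs(diff - MET_RESIDUE_MASS) < 0.5)  # consistent with one Met residue?
```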
Theoretical pI: Translated: 5.83; Mature: 5.83
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.0 %Cys (Translated Protein)
3.1 %Met (Translated Protein)
3.1 %Cys+Met (Translated Protein)
0.0 %Cys (Mature Protein)
2.9 %Met (Mature Protein)
2.9 %Cys+Met (Mature Protein)
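Residue percentages like those above can be recomputed from a protein sequence. This is a minimal sketch, assuming each percentage is the count of the named residues over the total sequence length, rounded to one decimal, and assuming the mature form is the translated form without its initiator methionine; `residue_percent` and `mature_form` are hypothetical helper names.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in protein occupied by any of residues."""
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(1 for aa in protein if aa in residues)
    return round(100.0 * hits / len(protein), 1)


def mature_form(protein: str) -> str:
    """Drop a leading methionine, if present (assumed maturation rule)."""
    return protein[1:] if protein.startswith("M") else protein


if __name__ == "__main__":
    # Toy peptide; paste the full protein sequence to check the reported values.
    toy = "MGNRLFMLFC"
    for label, seq in (("Translated", toy), ("Mature", mature_form(toy))):
        print(label,
              residue_percent(seq, "C"),
              residue_percent(seq, "M"),
              residue_percent(seq, "CM"))
```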
Secondary structure:
>Translated Secondary Structure
MGNRLFMLFILPFLLFYAMPVAAAEKEERTWQDEAIYFIMVDRFNNMDSTNDQDVNVNDP
CCCCEEHHHHHHHHHHHHCCHHHCCHHHHCCCCCEEEEEEEECCCCCCCCCCCCCCCCCC
KGYFGGDLKGVTAKLDYIKEMGFTAIWLTPIFKNRPGGYHGYWIEDFYEVDPHFGTLDDL
CCCCCCCCCHHHHHHHHHHHCCCEEEEEEHHHCCCCCCCCCCCHHHHEECCCCCCCHHHH
KTLVKEAHKRDMKVILDFVANHVGYDHPWLHDPAKKDWFHPKKEIFDWNSQEQVENGWVY
HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHCCCCCHHHHHCCCEE
GLPDLAQENPEVKNYLIDAAKWWIKETDIDGYRLDMVRHVPKSFWQEFAKEVKAVKKDFF
CCCHHHHCCCHHHHHHHHHHHHHHEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
LLGEVWSDDPRYIADYGKYGIDGFVDYPLYGAVKQSLAKRDASLRPLYDVWEYNKTFYDR
EEHHHCCCCCCHHHHHCCCCCCCEECCCHHHHHHHHHHHCCCCCCHHHHHHHCCCHHCCC
PYLLGSFLDNHDNVRFTKLVIDHRNNPISRMKVAMTYLFTAPGIPIMYYGTEIAMTGGPD
CHHHHHHHCCCCCEEEEEEEEECCCCCHHHHHHHHHHHHHCCCCEEEEECCEEEECCCCC
PDNRRLMDFRADPEIIDYLKKVGPLRQQLPSLRRGDFTLLYEQDGMAVFKRQYKDETTVI
CCCCEEEECCCCHHHHHHHHHCCHHHHHHHHHHCCCEEEEEECCCHHHHHHHCCCCEEEE
AINNTSETKHVHLTNEQLPKNKELRGFLLDDLVRGDEDGYDIVLDRETAEVYKLRNKTGV
EEECCCCCEEEEECCCCCCCCCCHHHHHHHHHHCCCCCCCEEEEECCHHHHHHHHCCCCC
NVPFIVAMVAVYALFILFLYMVKKRTKRTNE
CHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC
>Mature Secondary Structure
GNRLFMLFILPFLLFYAMPVAAAEKEERTWQDEAIYFIMVDRFNNMDSTNDQDVNVNDP
CCCEEHHHHHHHHHHHHCCHHHCCHHHHCCCCCEEEEEEEECCCCCCCCCCCCCCCCCC
KGYFGGDLKGVTAKLDYIKEMGFTAIWLTPIFKNRPGGYHGYWIEDFYEVDPHFGTLDDL
CCCCCCCCCHHHHHHHHHHHCCCEEEEEEHHHCCCCCCCCCCCHHHHEECCCCCCCHHHH
KTLVKEAHKRDMKVILDFVANHVGYDHPWLHDPAKKDWFHPKKEIFDWNSQEQVENGWVY
HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHCCCCCHHHHHCCCEE
GLPDLAQENPEVKNYLIDAAKWWIKETDIDGYRLDMVRHVPKSFWQEFAKEVKAVKKDFF
CCCHHHHCCCHHHHHHHHHHHHHHEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
LLGEVWSDDPRYIADYGKYGIDGFVDYPLYGAVKQSLAKRDASLRPLYDVWEYNKTFYDR
EEHHHCCCCCCHHHHHCCCCCCCEECCCHHHHHHHHHHHCCCCCCHHHHHHHCCCHHCCC
PYLLGSFLDNHDNVRFTKLVIDHRNNPISRMKVAMTYLFTAPGIPIMYYGTEIAMTGGPD
CHHHHHHHCCCCCEEEEEEEEECCCCCHHHHHHHHHHHHHCCCCEEEEECCEEEECCCCC
PDNRRLMDFRADPEIIDYLKKVGPLRQQLPSLRRGDFTLLYEQDGMAVFKRQYKDETTVI
CCCCEEEECCCCHHHHHHHHHCCHHHHHHHHHHCCCEEEEEECCCHHHHHHHCCCCEEEE
AINNTSETKHVHLTNEQLPKNKELRGFLLDDLVRGDEDGYDIVLDRETAEVYKLRNKTGV
EEECCCCCEEEEECCCCCCCCCCHHHHHHHHHHCCCCCCCEEEEECCHHHHHHHHCCCCC
NVPFIVAMVAVYALFILFLYMVKKRTKRTNE
CHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC
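The secondary structure strings above use one letter per residue. A short sketch can summarise how much of the protein falls into each state, assuming the common convention that H is helix, E is strand and C is coil; the record does not define the letters, and `structure_composition` is a hypothetical helper name.

```python
from collections import Counter
from typing import Dict


def structure_composition(states: str) -> Dict[str, float]:
    """Percentage of residues in each secondary structure state."""
    states = "".join(states.split())  # tolerate spaces and line breaks
    if not states:
        raise ValueError("empty structure string")
    counts = Counter(states)
    return {s: round(100.0 * n / len(states), 1) for s, n in sorted(counts.items())}


if __name__ == "__main__":
    # Toy input; concatenate the H/E/C lines of the record for the full protein.
    print(structure_composition("CCCCEEHHHH"))
```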
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 2435707; 2464578; 2438660; 1827035 [H]