Definition Kosmotoga olearia TBF 19.5.1, complete genome.
Accession NC_012785
Length 2,302,126

The map label for this gene is malS [C]

Identifier: 239616866

GI number: 239616866

Start: 489398

End: 491890

Strand: Reverse

Name: malS [C]

Synonym: Kole_0461

Alternate gene names: 239616866

Gene position: 491890-489398 (Counterclockwise)

Preceding gene: 239616867

Following gene: 239616856

Centisome position: 21.37
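
Note: the coordinate fields above can be cross-checked with simple arithmetic. The sketch below assumes inclusive 1-based coordinates and assumes the centisome position is the gene's 5' end (491890 on the reverse strand) expressed as a percentage of the genome length. The record does not state that definition; it is inferred because it reproduces the reported 21.37. The function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 2_302_126
START, END = 489_398, 491_890


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length (assumed definition)."""
    return round(100.0 * position / genome_length, 2)


length = gene_length(START, END)
print(length)                         # 2493 bases
print(length // 3 - 1)                # 830 residues (831 codons minus the stop codon)
print(centisome(END, GENOME_LENGTH))  # 21.37
```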

GC content: 42.84

Gene sequence:

>2493_bases
TTGTTATTTCTTTTGCTGCTGGTATGTGTTTTAGCTCTGGCAGTTCCTTCTCCGAAGTGGGAAGACCAGATAATTTATTT
CGTTATGATTGATCGTTTTGCAAATGGAGACCCAACCAACGACAATTTTGGCATGGGGGAATATGGAAAGGACAATGCGC
GTTACAACGGGGGAGATCTTGCCGGGTTGATCGAAAAGCTCGATTACATTAAAGAGCTTGGTGCAACTGCCATATGGATC
ACACCTCCCATTGCCAACCAGTGGTGGAATCCGTGGGTAAATTATGGTGGATATCACGGTTACTGGGCAAGGGACTTCAA
GAAGATTGATGAGCATTTCGGAGACCTGGAACTTTATAAGAAGTTTGTTGAAGAGGCTCATAAGAGAGGTTTGTACGTGA
TTCAAGATATTGTTGCCAATCATGTTGGAGATTATTTCAAGTTCGTTGGTGGAAAGTTTTACAAAAACACTGAAAGTGTT
CCCACTTCTGCCCCAGAACAGTATCCGTTTTCGTTAAACGATTATCTGAAAGATAAGGATAAGAACATTTATCACTGGAC
ACCGGATATTACGAATTACAGTGATCCCAACCAGAAGCTAAATTATCAGATGGCAGGACTGGATGATCTCAACACGGAAA
ATCCGGTGGTTATTGAAGCCTTAAAGGATAGCTATACTTACTGGATAAAAGAAGCGGGAGTAGATGGTTTTAGAATTGAC
ACCGTCATTTACGTTCCTCACGAGTTCTGGAAGGAATTTTTGAATGGTAAAAACGGTGTTTATGAGACTGCGAAAGCTCT
CGGAAAGGAAGATTTTATAACTTTTGGCGAGGCCTGGATAAGGTCTGATCCATACGATGATTCAGGTGAAAAGATTCTGG
AGAGCTATTTCGAAGACGGTATGAGTGCCATGCTCGACTTCCCGTTGAACATAGAGATAAGGCGCGTTTTTAAGGAAGGA
AAGGCTACGGCGAACCTCCGTTACCGTTTAGAGCGCCGCGAACAATTCGAACATCCTGAGAGGCTGGTAACGTTCATAGA
CAATCACGATATGGAACGATTCTTAAAGGGTGCAGGATTGGCTACCGTAAAACAGGCTCTTGCATTTCTCTTTACCGTTC
CGGGTATACCGGTGATTTATTACGGAACGGAACAGGGATTTGTAGAAACGAGAGCGGCTATGTTCAAGGAAGGGTATGCC
TCAGGTGGTGTGGATCATTACGATACTTCCTTTGAAATGTACCAGTTCATTCGTGAATTGACGGAGTTTAGAAAGAGTAA
TCCTGTATTTCGTCATGGTCACATAGAGGTTCTCAAAGATGATCCAAATGGCCCCGGCATCTTCGCTTTTAAGATAAGCG
ATGAAGAGGTTACAGCATTTGTCATGCTGAATACTTCGAACGAAAGAAGGATAATAACCAACCTGAAAACGGGGTTAGAA
CCTGGAACGGTTATAGAACCGCTTTACACCCTCAATACGTTGAAGAAAAAATATAGTGCCGACAAAGATGGAGAGTTGAA
TCTCGTATTAAATCCGCGCTCCGTTTACGTTGGTATAGCGACAACACAAAAGAAAAAAGTGAAGATGCCAACAGTAGAAA
TTTATCCAAATCTCAAACAGGGACAGAAAATAACCGGTAACTTTGTAATCACCGGTACTGCAAAGAATGCAAAGAGCGTC
AAGATAATCTTTGATACCAGAATAGACGCAAGCGTTTCTGTTGATGTAATCGACGGAAAATGGTCGTATGAATGGGACAT
CTCTAAATTCGACCCGGGGATTCACACCATAGTTTTCAAGGTTTACGGTCAAAAACGGACAGATACTACTTACTCTGAAG
ATTATCAGGTAATCCTAGATATACCGGAAAAACTCCTGGCACAGGTGGAAGATCCTGAAGGCGATGACCATGGACCCTTT
GGAAAATATCTTTACCCCACCGATGTTACGTTCAAGAGACAGATGGACCTGCTGGGAGTTACGATAAAGCAGATCGGCGC
AAGCCTTGTTATAACCATGAAGATAAAAGATTTGACAACGTCCTGGAGCCCGCAAAATGGCTTTGACCATGTAACATTCC
AGATCTACATAGATGACCCAAGTAAAACGGGTGCAAAAGATTTGCCGTTCCAGAATGCGAAAATGCCAGATGGGCTTGAT
TGGGATTATTTCATATTTGCCAACGGTTGGTCGATTGTCGCTTATTCTTCAGAAGGAAGCGGATCACAAGCGTTCGGTAC
TCCTATATCTCCAACACCTCAGGTTAAAACAAACAAGATGACCAGAGAGGTAACGCTGATAATCCCGAGTGAGGTTCTTG
GAAGGCCGGATAGCTTTGATGGATTCAAATTTTATATCACCACATGGGATTTTGATGGGATTGAGGCACGTTACCGTGAT
CTGTATCCTAAACCAAAAGCCTTCCATTTTGGTGGCGGCGAGAAGACAGATCCATATATCATGGATGATGTGCTGATAGA
TCTTGGAAAATAG
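
Note: the GC content field is presumably the percentage of G and C bases in the gene sequence above. A minimal sketch of that calculation follows; `gc_percent` is a hypothetical helper, not part of any database tooling, and rounding to two decimals is an assumption based on the reported value of 42.84.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and line breaks."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# First ten bases of the gene sequence: one G and one C out of ten.
print(gc_percent("TTGTTATTTC"))  # 20.0
```

Passing the full 2493-base sequence, with the FASTA header line removed, should give a figure comparable to the one in the record.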

Upstream 100 bases:

>100_bases
TAGTCTCGGCTCTCGGTTTCTCGGATTCTCGATGTTTTTCGTGCAACCGACAACCAACAACGATTTCTTAAAGAGGGGTG
ATAGTGTGAGAAGGCTAATT

Downstream 100 bases:

>100_bases
CAAAGACAAAGGTGGAGCGTTCGCTCCACCTCTTTTATTTGTTGTATCGTAAAATAGTTTTCTTATGCTTCTTTCATTTC
TATTCCGAGCGTACGGACGT

Product: alpha amylase catalytic region

Products: NA

Alternate protein names: Beta-amylase; Alpha-amylase [H]

Number of amino acids: Translated: 830; Mature: 830

Protein sequence:

>830_residues
MLFLLLLVCVLALAVPSPKWEDQIIYFVMIDRFANGDPTNDNFGMGEYGKDNARYNGGDLAGLIEKLDYIKELGATAIWI
TPPIANQWWNPWVNYGGYHGYWARDFKKIDEHFGDLELYKKFVEEAHKRGLYVIQDIVANHVGDYFKFVGGKFYKNTESV
PTSAPEQYPFSLNDYLKDKDKNIYHWTPDITNYSDPNQKLNYQMAGLDDLNTENPVVIEALKDSYTYWIKEAGVDGFRID
TVIYVPHEFWKEFLNGKNGVYETAKALGKEDFITFGEAWIRSDPYDDSGEKILESYFEDGMSAMLDFPLNIEIRRVFKEG
KATANLRYRLERREQFEHPERLVTFIDNHDMERFLKGAGLATVKQALAFLFTVPGIPVIYYGTEQGFVETRAAMFKEGYA
SGGVDHYDTSFEMYQFIRELTEFRKSNPVFRHGHIEVLKDDPNGPGIFAFKISDEEVTAFVMLNTSNERRIITNLKTGLE
PGTVIEPLYTLNTLKKKYSADKDGELNLVLNPRSVYVGIATTQKKKVKMPTVEIYPNLKQGQKITGNFVITGTAKNAKSV
KIIFDTRIDASVSVDVIDGKWSYEWDISKFDPGIHTIVFKVYGQKRTDTTYSEDYQVILDIPEKLLAQVEDPEGDDHGPF
GKYLYPTDVTFKRQMDLLGVTIKQIGASLVITMKIKDLTTSWSPQNGFDHVTFQIYIDDPSKTGAKDLPFQNAKMPDGLD
WDYFIFANGWSIVAYSSEGSGSQAFGTPISPTPQVKTNKMTREVTLIIPSEVLGRPDSFDGFKFYITTWDFDGIEARYRD
LYPKPKAFHFGGGEKTDPYIMDDVLIDLGK

Sequences:

>Translated_830_residues
MLFLLLLVCVLALAVPSPKWEDQIIYFVMIDRFANGDPTNDNFGMGEYGKDNARYNGGDLAGLIEKLDYIKELGATAIWI
TPPIANQWWNPWVNYGGYHGYWARDFKKIDEHFGDLELYKKFVEEAHKRGLYVIQDIVANHVGDYFKFVGGKFYKNTESV
PTSAPEQYPFSLNDYLKDKDKNIYHWTPDITNYSDPNQKLNYQMAGLDDLNTENPVVIEALKDSYTYWIKEAGVDGFRID
TVIYVPHEFWKEFLNGKNGVYETAKALGKEDFITFGEAWIRSDPYDDSGEKILESYFEDGMSAMLDFPLNIEIRRVFKEG
KATANLRYRLERREQFEHPERLVTFIDNHDMERFLKGAGLATVKQALAFLFTVPGIPVIYYGTEQGFVETRAAMFKEGYA
SGGVDHYDTSFEMYQFIRELTEFRKSNPVFRHGHIEVLKDDPNGPGIFAFKISDEEVTAFVMLNTSNERRIITNLKTGLE
PGTVIEPLYTLNTLKKKYSADKDGELNLVLNPRSVYVGIATTQKKKVKMPTVEIYPNLKQGQKITGNFVITGTAKNAKSV
KIIFDTRIDASVSVDVIDGKWSYEWDISKFDPGIHTIVFKVYGQKRTDTTYSEDYQVILDIPEKLLAQVEDPEGDDHGPF
GKYLYPTDVTFKRQMDLLGVTIKQIGASLVITMKIKDLTTSWSPQNGFDHVTFQIYIDDPSKTGAKDLPFQNAKMPDGLD
WDYFIFANGWSIVAYSSEGSGSQAFGTPISPTPQVKTNKMTREVTLIIPSEVLGRPDSFDGFKFYITTWDFDGIEARYRD
LYPKPKAFHFGGGEKTDPYIMDDVLIDLGK
>Mature_830_residues
MLFLLLLVCVLALAVPSPKWEDQIIYFVMIDRFANGDPTNDNFGMGEYGKDNARYNGGDLAGLIEKLDYIKELGATAIWI
TPPIANQWWNPWVNYGGYHGYWARDFKKIDEHFGDLELYKKFVEEAHKRGLYVIQDIVANHVGDYFKFVGGKFYKNTESV
PTSAPEQYPFSLNDYLKDKDKNIYHWTPDITNYSDPNQKLNYQMAGLDDLNTENPVVIEALKDSYTYWIKEAGVDGFRID
TVIYVPHEFWKEFLNGKNGVYETAKALGKEDFITFGEAWIRSDPYDDSGEKILESYFEDGMSAMLDFPLNIEIRRVFKEG
KATANLRYRLERREQFEHPERLVTFIDNHDMERFLKGAGLATVKQALAFLFTVPGIPVIYYGTEQGFVETRAAMFKEGYA
SGGVDHYDTSFEMYQFIRELTEFRKSNPVFRHGHIEVLKDDPNGPGIFAFKISDEEVTAFVMLNTSNERRIITNLKTGLE
PGTVIEPLYTLNTLKKKYSADKDGELNLVLNPRSVYVGIATTQKKKVKMPTVEIYPNLKQGQKITGNFVITGTAKNAKSV
KIIFDTRIDASVSVDVIDGKWSYEWDISKFDPGIHTIVFKVYGQKRTDTTYSEDYQVILDIPEKLLAQVEDPEGDDHGPF
GKYLYPTDVTFKRQMDLLGVTIKQIGASLVITMKIKDLTTSWSPQNGFDHVTFQIYIDDPSKTGAKDLPFQNAKMPDGLD
WDYFIFANGWSIVAYSSEGSGSQAFGTPISPTPQVKTNKMTREVTLIIPSEVLGRPDSFDGFKFYITTWDFDGIEARYRD
LYPKPKAFHFGGGEKTDPYIMDDVLIDLGK

Specific function: The precursor protein is proteolytically cleaved to produce multiform beta-amylases and a 48 kDa alpha-amylase after secretion [H]

COG id: COG0366

COG function: function code G; Glycosidases

Gene ontology:

Cell location: Secreted [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: In the C-terminal section; belongs to the glycosyl hydrolase 13 family [H]

Homologues:

Organism=Homo sapiens, GI187423904, Length=396, Percent_Identity=23.2323232323232, Blast_Score=77, Evalue=5e-14,
Organism=Escherichia coli, GI1789995, Length=499, Percent_Identity=25.8517034068136, Blast_Score=151, Evalue=2e-37,
Organism=Escherichia coli, GI1786604, Length=554, Percent_Identity=25.812274368231, Blast_Score=102, Evalue=1e-22,
Organism=Escherichia coli, GI1790687, Length=202, Percent_Identity=29.7029702970297, Blast_Score=73, Evalue=6e-14,
Organism=Saccharomyces cerevisiae, GI6322245, Length=199, Percent_Identity=30.1507537688442, Blast_Score=69, Evalue=2e-12,
Organism=Drosophila melanogaster, GI24583745, Length=411, Percent_Identity=21.8978102189781, Blast_Score=85, Evalue=2e-16,
Organism=Drosophila melanogaster, GI24583747, Length=384, Percent_Identity=23.1770833333333, Blast_Score=80, Evalue=6e-15,
Organism=Drosophila melanogaster, GI24583749, Length=384, Percent_Identity=23.1770833333333, Blast_Score=80, Evalue=7e-15,
Organism=Drosophila melanogaster, GI24586589, Length=381, Percent_Identity=23.8845144356955, Blast_Score=80, Evalue=7e-15,
Organism=Drosophila melanogaster, GI221330053, Length=205, Percent_Identity=29.2682926829268, Blast_Score=77, Evalue=5e-14,
Organism=Drosophila melanogaster, GI24586591, Length=400, Percent_Identity=23.5, Blast_Score=75, Evalue=2e-13,
Organism=Drosophila melanogaster, GI24586599, Length=384, Percent_Identity=23.4375, Blast_Score=72, Evalue=2e-12,
Organism=Drosophila melanogaster, GI45549022, Length=389, Percent_Identity=21.8508997429306, Blast_Score=70, Evalue=5e-12,
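
Note: each homologue line is a comma-separated list of `key=value` pairs, except the GI number, which has no `=`. The sketch below shows one way to turn a line into a typed record. `parse_homologue` is a hypothetical helper written for this listing and assumes organism names contain no commas.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for part in line.strip().rstrip(",").split(","):
        part = part.strip()
        if "=" in part:
            key, value = part.split("=", 1)
            record[key] = value
        elif part.startswith("GI"):
            # The GI number is written without '=' in this listing.
            record["GI"] = part[2:]
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


hit = parse_homologue(
    "Organism=Homo sapiens, GI187423904, Length=396, "
    "Percent_Identity=23.2323232323232, Blast_Score=77, Evalue=5e-14,"
)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 1))
# Homo sapiens 187423904 23.2
```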

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR006048
- InterPro:   IPR005085
- InterPro:   IPR008985
- InterPro:   IPR013780
- InterPro:   IPR006047
- InterPro:   IPR006589
- InterPro:   IPR001554
- InterPro:   IPR018238
- InterPro:   IPR000125
- InterPro:   IPR017853
- InterPro:   IPR013781 [H]

Pfam domain/function: PF00128 Alpha-amylase; PF03423 CBM_25; PF01373 Glyco_hydro_14 [H]

EC number: 3.2.1.2; 3.2.1.1 [H]

Molecular weight: Translated: 94717; Mature: 94717
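
Note: a molecular weight like the one above is commonly estimated by summing average residue masses and adding one water molecule for the free termini. The sketch below uses that approach. The mass table holds standard average residue masses in daltons, and the record does not say which method or table produced its value, so small differences are to be expected.

```python
# Average residue masses (Da) for the 20 standard amino acids.
AVG_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one H2O added for the N- and C-terminal groups


def molecular_weight(protein: str) -> float:
    """Estimated average molecular weight in daltons."""
    seq = "".join(protein.split()).upper()
    return sum(AVG_RESIDUE_MASS[aa] for aa in seq) + WATER


# Toy input: free glycine.
print(round(molecular_weight("G"), 2))  # 75.07
```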

Theoretical pI: Translated: 4.94; Mature: 4.94

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.1 %Cys     (Translated Protein)
1.9 %Met     (Translated Protein)
2.0 %Cys+Met (Translated Protein)
0.1 %Cys     (Mature Protein)
1.9 %Met     (Mature Protein)
2.0 %Cys+Met (Mature Protein)
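
Note: the Cys/Met figures are residue counts expressed as a percentage of protein length. A minimal sketch, assuming one-decimal rounding as in the table above; `residue_percent` is a hypothetical helper.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given residues."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    count = sum(seq.count(r) for r in residues.upper())
    return round(100.0 * count / len(seq), 1)


# First 20 residues of the protein: one Met and one Cys.
n_term = "MLFLLLLVCVLALAVPSPKW"
print(residue_percent(n_term, "C"))   # 5.0
print(residue_percent(n_term, "M"))   # 5.0
print(residue_percent(n_term, "CM"))  # 10.0
```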

Secondary structure:

>Translated Secondary Structure
MLFLLLLVCVLALAVPSPKWEDQIIYFVMIDRFANGDPTNDNFGMGEYGKDNARYNGGDL
CHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEHHCCCCCCCCCCCCCCCCCCCCCCCCCHH
AGLIEKLDYIKELGATAIWITPPIANQWWNPWVNYGGYHGYWARDFKKIDEHFGDLELYK
HHHHHHHHHHHHCCCEEEEEECCCCCCCCCCHHCCCCCCCHHHHHHHHHHHHCCCHHHHH
KFVEEAHKRGLYVIQDIVANHVGDYFKFVGGKFYKNTESVPTSAPEQYPFSLNDYLKDKD
HHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCEEECCCCCCCCCCCCCCCCCHHHHHCCCC
KNIYHWTPDITNYSDPNQKLNYQMAGLDDLNTENPVVIEALKDSYTYWIKEAGVDGFRID
CCEEEECCCCCCCCCCCCCCEEEEECCCCCCCCCCEEEEEECCCCEEEEECCCCCCEEEE
TVIYVPHEFWKEFLNGKNGVYETAKALGKEDFITFGEAWIRSDPYDDSGEKILESYFEDG
EEEECCHHHHHHHHCCCCCHHHHHHHCCCHHHEEHHHHHHCCCCCCCHHHHHHHHHHHCC
MSAMLDFPLNIEIRRVFKEGKATANLRYRLERREQFEHPERLVTFIDNHDMERFLKGAGL
HHHHHCCCCCHHHHHHHHHCCCCEEHHEEEHHHHHCCCHHHHHHHHCCCCHHHHHHCCCH
ATVKQALAFLFTVPGIPVIYYGTEQGFVETRAAMFKEGYASGGVDHYDTSFEMYQFIREL
HHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHH
TEFRKSNPVFRHGHIEVLKDDPNGPGIFAFKISDEEVTAFVMLNTSNERRIITNLKTGLE
HHHHHCCCCEECCCEEEEEECCCCCEEEEEEECCCCEEEEEEEECCCCCEEEEHHHCCCC
PGTVIEPLYTLNTLKKKYSADKDGELNLVLNPRSVYVGIATTQKKKVKMPTVEIYPNLKQ
CCCHHHHHHHHHHHHHHHCCCCCCCEEEEECCCEEEEEEEECCCCEEECCEEEECCCCCC
GQKITGNFVITGTAKNAKSVKIIFDTRIDASVSVDVIDGKWSYEWDISKFDPGIHTIVFK
CCEEECCEEEEECCCCCCEEEEEEECCCCCEEEEEEEECCEEEEECHHHCCCCCEEEEEE
VYGQKRTDTTYSEDYQVILDIPEKLLAQVEDPEGDDHGPFGKYLYPTDVTFKRQMDLLGV
EECCCCCCCCCCCCCEEEEECCHHHHHHCCCCCCCCCCCCCCEECCCCCCHHHHHHHHHH
TIKQIGASLVITMKIKDLTTSWSPQNGFDHVTFQIYIDDPSKTGAKDLPFQNAKMPDGLD
HHHHCCCEEEEEEEEEECCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCC
WDYFIFANGWSIVAYSSEGSGSQAFGTPISPTPQVKTNKMTREVTLIIPSEVLGRPDSFD
CCEEEEECCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECHHHCCCCCCCC
GFKFYITTWDFDGIEARYRDLYPKPKAFHFGGGEKTDPYIMDDVLIDLGK
CEEEEEEEECCCCCHHHHHHCCCCCCEEECCCCCCCCCEEHHHHHHCCCC
>Mature Secondary Structure
MLFLLLLVCVLALAVPSPKWEDQIIYFVMIDRFANGDPTNDNFGMGEYGKDNARYNGGDL
CHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEHHCCCCCCCCCCCCCCCCCCCCCCCCCHH
AGLIEKLDYIKELGATAIWITPPIANQWWNPWVNYGGYHGYWARDFKKIDEHFGDLELYK
HHHHHHHHHHHHCCCEEEEEECCCCCCCCCCHHCCCCCCCHHHHHHHHHHHHCCCHHHHH
KFVEEAHKRGLYVIQDIVANHVGDYFKFVGGKFYKNTESVPTSAPEQYPFSLNDYLKDKD
HHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCEEECCCCCCCCCCCCCCCCCHHHHHCCCC
KNIYHWTPDITNYSDPNQKLNYQMAGLDDLNTENPVVIEALKDSYTYWIKEAGVDGFRID
CCEEEECCCCCCCCCCCCCCEEEEECCCCCCCCCCEEEEEECCCCEEEEECCCCCCEEEE
TVIYVPHEFWKEFLNGKNGVYETAKALGKEDFITFGEAWIRSDPYDDSGEKILESYFEDG
EEEECCHHHHHHHHCCCCCHHHHHHHCCCHHHEEHHHHHHCCCCCCCHHHHHHHHHHHCC
MSAMLDFPLNIEIRRVFKEGKATANLRYRLERREQFEHPERLVTFIDNHDMERFLKGAGL
HHHHHCCCCCHHHHHHHHHCCCCEEHHEEEHHHHHCCCHHHHHHHHCCCCHHHHHHCCCH
ATVKQALAFLFTVPGIPVIYYGTEQGFVETRAAMFKEGYASGGVDHYDTSFEMYQFIREL
HHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHH
TEFRKSNPVFRHGHIEVLKDDPNGPGIFAFKISDEEVTAFVMLNTSNERRIITNLKTGLE
HHHHHCCCCEECCCEEEEEECCCCCEEEEEEECCCCEEEEEEEECCCCCEEEEHHHCCCC
PGTVIEPLYTLNTLKKKYSADKDGELNLVLNPRSVYVGIATTQKKKVKMPTVEIYPNLKQ
CCCHHHHHHHHHHHHHHHCCCCCCCEEEEECCCEEEEEEEECCCCEEECCEEEECCCCCC
GQKITGNFVITGTAKNAKSVKIIFDTRIDASVSVDVIDGKWSYEWDISKFDPGIHTIVFK
CCEEECCEEEEECCCCCCEEEEEEECCCCCEEEEEEEECCEEEEECHHHCCCCCEEEEEE
VYGQKRTDTTYSEDYQVILDIPEKLLAQVEDPEGDDHGPFGKYLYPTDVTFKRQMDLLGV
EECCCCCCCCCCCCCEEEEECCHHHHHHCCCCCCCCCCCCCCEECCCCCCHHHHHHHHHH
TIKQIGASLVITMKIKDLTTSWSPQNGFDHVTFQIYIDDPSKTGAKDLPFQNAKMPDGLD
HHHHCCCEEEEEEEEEECCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCC
WDYFIFANGWSIVAYSSEGSGSQAFGTPISPTPQVKTNKMTREVTLIIPSEVLGRPDSFD
CCEEEEECCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECHHHCCCCCCCC
GFKFYITTWDFDGIEARYRDLYPKPKAFHFGGGEKTDPYIMDDVLIDLGK
CEEEEEEEECCCCCHHHHHHCCCCCCEEECCCCCCCCCEEHHHHHHCCCC
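
Note: the prediction strings above alternate with the residue lines and use a three-state alphabet. Reading H as helix, E as strand and C as coil is the usual convention but is an assumption here, since the record does not define the letters. The sketch below summarises a prediction string as percentages per state; `ss_composition` is a hypothetical helper.

```python
from collections import Counter


def ss_composition(prediction: str) -> dict:
    """Percentage of H, E and C states in a secondary-structure string."""
    states = "".join(prediction.split()).upper()
    if not states:
        raise ValueError("empty prediction")
    counts = Counter(states)
    return {s: round(100.0 * counts[s] / len(states), 1) for s in "HEC"}


# Toy input: 4 helix, 2 strand and 4 coil positions.
print(ss_composition("CHHHHCCEEC"))  # {'H': 40.0, 'E': 20.0, 'C': 40.0}
```

Only the prediction lines should be passed in; the interleaved residue lines share the letters H, E and C and would distort the counts.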

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 2435707; 2464578; 2438660; 1827035 [H]