| Definition | Herpetosiphon aurantiacus ATCC 23779 chromosome, complete genome. |
|---|---|
| Accession | NC_009972 |
| Length | 6,346,587 |
The map label for this gene is ams [H]
Identifier: 159897682
GI number: 159897682
Start: 1323927
End: 1325900
Strand: Direct
Name: ams [H]
Synonym: Haur_1153
Alternate gene names: 159897682
Gene position: 1323927-1325900 (Clockwise)
Preceding gene: 159897681
Following gene: 159897683
Centisome position: 20.86
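The record does not say how the centisome position is derived. A minimal sketch, assuming it is the gene start expressed as a percentage of the chromosome length (both values taken from this record; the function name `centisome` is only illustrative):

```python
def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the way around the chromosome.

    Assumption: centisome = 100 * start / genome length.
    """
    return 100.0 * start / genome_length


# Start coordinate and chromosome length as listed in this record.
position = centisome(1323927, 6346587)
print(f"{position:.2f}")  # 20.86
```

Under that assumption the computed value agrees with the listed 20.86.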
GC content: 50.51
Gene sequence:
>1974_bases ATGCAACCACTCACAGTTCAAACTATGCATTTGCCCAATCCAACCACGTTCGATGATCTGCTCGATCAACAGATTGCCAA TTCGCGTGATCGCGATATTTTTCGCTTGCGCATGCAACGCCATTTTGGCGATTGTTTAGAAGCGCTAGGAGCGTTGTATG CCCAGCATCCAGCTTGGCCACAGTTGTTGGAGCAATTGCCCGAACGCTTGATTACTGCCTATGCCCAGCGCCGCGATGCC CTGAAAATTCACGATTTAGCCCGCGAAATCCAGCCCGATTGGTTTGCTGAGGCCACCATGGTTGGCGGCATTTACTATGT TGATCGCTTGGCAGGCACATTGCGCGGGGTGATTGAGCATATTGATTATTTGCAAGAATTGGGTTTGACCTATGTGCATC TGATGCCGCTATTACAGCCACGCCATGGCCCCAACGATGGCGGCTATGCGGTGCTCGATTATCGCTCGATTGATCAACGG CTTGGCAATGTGGCCGATTTTATCGAATTAAGCGATTTGCTCCGTACCAACGGCATCAGCTTATGCATTGATGTGGTGGT GAATCACACGGCCAAAGAGCATGAATGGGCAGTCAAGGCCCGTGCTGGTGATGCCCAATATTTGGATTACTATCTGAGTT TTGCCGATCGCAGTTTGCCTGATGCCTATGAGCAACATTTACCCGAAGTGTTTCCCGATTTTGCGCCTGGTAATTTTACT TGGTATGCCGAGTTGAGCGAGCATGGCCGTTGGGTTTGGACGACCTTCAACGAATTTCAATGGGATTTGAACTATACCAA CCCCATGGTTTGGCTGGAGATGCTGGATATTTTGCTGTATCTCGCCAATCTAGGCGTTGATGTGCTGCGTTTGGATGCCG TGCCGTTTATGTGGAAACGCCTCGGCACGAATTGCCAAAATCAGCCCGAAGTGCTCGATTTGTTACAAGCTTGGCGAGCA GCCATGCGGATCGTCTGTCCGGCGACAATTTTCAAGGCCGAGGCGATTGTTGCCCCCGACGATTTGGTGCAATATTTGGG TTTGGGACGGCGCACAGGCAAGCTCTGTGAAATTGCCTACCATAATTCGCTGATGGTGTTGTTGTGGAGTGCCTTGGCCT CGCAACGCGCCGATCTGTTTACGCAATCGCTGTTGAACATGCCTGCAACGCCCAGCAATGCCGCTTGGATTACCTATGTG CGCTGCCACGATGATATTGGCTGGGCTGTGACCGACCACAATGCAGCTTTGGTTGGCGAAGATGGGCCATTGCATCGCCA ATTTTTAAGCGCTTGGTATAGTGGCGAATTTGCTGGTAGTTTTGCGCGGGGCGAGGTGTTTCAATATAATCCACTCACCA ACGATCGCCGAATTAGCGGCATGACTGCCTCGTTGGCTGGGCTAGAGCAAGCCTTGGAAACCACCGATCCAGCAGCGATT GAATTGACAATTCGCCGGATTGCGTTGCTGTATGCCGTGATTTTTAGCTTTGGTGGCATTCCGTTGATCTATATGGGCGA TGAATTGGGCATGCTCAATGATCACAGCTACTTGCATGACCCTACCAAAGCCAACGATAACCGCTGGTTGCATCGCCCAG CCATGGATTGGTGCTTAGCGGCCCAACGCCATGATCCAACTACGCTTGCTGGGCGCTTATGGCAGGTATTGCGCCATTTG ATTCAGGTGCGCCAACATACTCCAGCCTTGCATAGCGCAGGCCAAACCTTGCCAATCTGGACACAGCAACGCCATGTTTT AGGGGTGGTTCGAGTTCACCCATTGGGGCGAATTTTAATTCTTGGAAACCTTTCCGCCACCCCACAGCGGGTCAGTTTAG CGGTTATTCAACAAGCAGGGCTGGTTGGTCGCTTATATAATTTGTTGGATAACGATTCACTTAATATCGATACACAAAGC 
CATGAAATTATACTCGATGCATATCAATGTTGTTGGCTCAGCATTCAAGCCTAA
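The GC content field is presumably the percentage of G and C bases in the gene sequence above. The sketch below shows that calculation under this assumption; `gc_percent` is a hypothetical helper, and the demonstration uses only the first 12 bases of the gene rather than the full 1974-base sequence.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First 12 bases of the gene sequence (1 G and 5 C out of 12).
first_bases = "ATGCAACCACTC"
print(f"{gc_percent(first_bases):.2f}")  # 50.00
```

Because whitespace is stripped first, the function can be fed the sequence exactly as it appears in the record, spaces and line breaks included.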
Upstream 100 bases:
>100_bases GGTTAAAACCCTTGGAAGGCCGAGGGCACGGCGATTTGATTTCATTTAGTTAATCCAACATTACAGCTTAAAAACCATCG GCAGTATGAGGAGTGTGGTT
Downstream 100 bases:
>100_bases ACTTTTTCAAATCAAAAAATAAGCTTCCGATAACGATTGCAGTCCTGAATTGGATCAAGTATAGTTGCGCTATCCCTGTT CTTTTTTCAGAAGGAGCGCA
Product: alpha amylase catalytic subunit
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 657; Mature: 657
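The coordinates, the gene length, and the residue count can be cross-checked against each other. A short sketch, assuming the Start and End coordinates are inclusive and that the final codon (TAA in the gene sequence) is a stop codon that is not translated:

```python
# Coordinates as listed in this record (assumed inclusive).
start, end = 1323927, 1325900

gene_length = end - start + 1            # number of bases in the gene
codons, remainder = divmod(gene_length, 3)
residues = codons - 1                    # last codon is the stop codon

print(gene_length, codons, remainder, residues)  # 1974 658 0 657
```

The result matches the 1974-base gene sequence and the 657 residues reported for both the translated and the mature protein.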
Protein sequence:
>657_residues MQPLTVQTMHLPNPTTFDDLLDQQIANSRDRDIFRLRMQRHFGDCLEALGALYAQHPAWPQLLEQLPERLITAYAQRRDA LKIHDLAREIQPDWFAEATMVGGIYYVDRLAGTLRGVIEHIDYLQELGLTYVHLMPLLQPRHGPNDGGYAVLDYRSIDQR LGNVADFIELSDLLRTNGISLCIDVVVNHTAKEHEWAVKARAGDAQYLDYYLSFADRSLPDAYEQHLPEVFPDFAPGNFT WYAELSEHGRWVWTTFNEFQWDLNYTNPMVWLEMLDILLYLANLGVDVLRLDAVPFMWKRLGTNCQNQPEVLDLLQAWRA AMRIVCPATIFKAEAIVAPDDLVQYLGLGRRTGKLCEIAYHNSLMVLLWSALASQRADLFTQSLLNMPATPSNAAWITYV RCHDDIGWAVTDHNAALVGEDGPLHRQFLSAWYSGEFAGSFARGEVFQYNPLTNDRRISGMTASLAGLEQALETTDPAAI ELTIRRIALLYAVIFSFGGIPLIYMGDELGMLNDHSYLHDPTKANDNRWLHRPAMDWCLAAQRHDPTTLAGRLWQVLRHL IQVRQHTPALHSAGQTLPIWTQQRHVLGVVRVHPLGRILILGNLSATPQRVSLAVIQQAGLVGRLYNLLDNDSLNIDTQS HEIILDAYQCCWLSIQA
Sequences:
>Translated_657_residues MQPLTVQTMHLPNPTTFDDLLDQQIANSRDRDIFRLRMQRHFGDCLEALGALYAQHPAWPQLLEQLPERLITAYAQRRDA LKIHDLAREIQPDWFAEATMVGGIYYVDRLAGTLRGVIEHIDYLQELGLTYVHLMPLLQPRHGPNDGGYAVLDYRSIDQR LGNVADFIELSDLLRTNGISLCIDVVVNHTAKEHEWAVKARAGDAQYLDYYLSFADRSLPDAYEQHLPEVFPDFAPGNFT WYAELSEHGRWVWTTFNEFQWDLNYTNPMVWLEMLDILLYLANLGVDVLRLDAVPFMWKRLGTNCQNQPEVLDLLQAWRA AMRIVCPATIFKAEAIVAPDDLVQYLGLGRRTGKLCEIAYHNSLMVLLWSALASQRADLFTQSLLNMPATPSNAAWITYV RCHDDIGWAVTDHNAALVGEDGPLHRQFLSAWYSGEFAGSFARGEVFQYNPLTNDRRISGMTASLAGLEQALETTDPAAI ELTIRRIALLYAVIFSFGGIPLIYMGDELGMLNDHSYLHDPTKANDNRWLHRPAMDWCLAAQRHDPTTLAGRLWQVLRHL IQVRQHTPALHSAGQTLPIWTQQRHVLGVVRVHPLGRILILGNLSATPQRVSLAVIQQAGLVGRLYNLLDNDSLNIDTQS HEIILDAYQCCWLSIQA >Mature_657_residues MQPLTVQTMHLPNPTTFDDLLDQQIANSRDRDIFRLRMQRHFGDCLEALGALYAQHPAWPQLLEQLPERLITAYAQRRDA LKIHDLAREIQPDWFAEATMVGGIYYVDRLAGTLRGVIEHIDYLQELGLTYVHLMPLLQPRHGPNDGGYAVLDYRSIDQR LGNVADFIELSDLLRTNGISLCIDVVVNHTAKEHEWAVKARAGDAQYLDYYLSFADRSLPDAYEQHLPEVFPDFAPGNFT WYAELSEHGRWVWTTFNEFQWDLNYTNPMVWLEMLDILLYLANLGVDVLRLDAVPFMWKRLGTNCQNQPEVLDLLQAWRA AMRIVCPATIFKAEAIVAPDDLVQYLGLGRRTGKLCEIAYHNSLMVLLWSALASQRADLFTQSLLNMPATPSNAAWITYV RCHDDIGWAVTDHNAALVGEDGPLHRQFLSAWYSGEFAGSFARGEVFQYNPLTNDRRISGMTASLAGLEQALETTDPAAI ELTIRRIALLYAVIFSFGGIPLIYMGDELGMLNDHSYLHDPTKANDNRWLHRPAMDWCLAAQRHDPTTLAGRLWQVLRHL IQVRQHTPALHSAGQTLPIWTQQRHVLGVVRVHPLGRILILGNLSATPQRVSLAVIQQAGLVGRLYNLLDNDSLNIDTQS HEIILDAYQCCWLSIQA
Specific function: Catalyzes the synthesis of alpha-glucan from sucrose. Catalyzes, in addition, sucrose hydrolysis, maltose and maltotriose synthesis by successive transfers of the glucosyl moiety of sucrose onto the released glucose, and finally turanose and trehalulose s
COG id: COG0366
COG function: function code G; Glycosidases
Gene ontology:
Cell location: Secreted [H]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the glycosyl hydrolase 13 family [H]
Homologues:
Organism=Homo sapiens, GI187423904, Length=212, Percent_Identity=27.8301886792453, Blast_Score=81, Evalue=3e-15
Organism=Escherichia coli, GI87081873, Length=529, Percent_Identity=24.952741020794, Blast_Score=132, Evalue=5e-32
Organism=Escherichia coli, GI1790687, Length=217, Percent_Identity=32.258064516129, Blast_Score=99, Evalue=1e-21
Organism=Drosophila melanogaster, GI24583749, Length=290, Percent_Identity=27.9310344827586, Blast_Score=93, Evalue=6e-19
Organism=Drosophila melanogaster, GI24583747, Length=290, Percent_Identity=27.9310344827586, Blast_Score=93, Evalue=7e-19
Organism=Drosophila melanogaster, GI45549022, Length=285, Percent_Identity=25.9649122807018, Blast_Score=92, Evalue=9e-19
Organism=Drosophila melanogaster, GI24586591, Length=186, Percent_Identity=30.6451612903226, Blast_Score=92, Evalue=1e-18
Organism=Drosophila melanogaster, GI24586593, Length=341, Percent_Identity=25.2199413489736, Blast_Score=90, Evalue=4e-18
Organism=Drosophila melanogaster, GI24586587, Length=281, Percent_Identity=26.3345195729537, Blast_Score=89, Evalue=7e-18
Organism=Drosophila melanogaster, GI24586589, Length=310, Percent_Identity=26.4516129032258, Blast_Score=87, Evalue=3e-17
Organism=Drosophila melanogaster, GI221330053, Length=198, Percent_Identity=30.8080808080808, Blast_Score=85, Evalue=2e-16
Organism=Drosophila melanogaster, GI24586599, Length=276, Percent_Identity=25, Blast_Score=84, Evalue=3e-16
Organism=Drosophila melanogaster, GI24583745, Length=288, Percent_Identity=25, Blast_Score=84, Evalue=4e-16
Organism=Drosophila melanogaster, GI24586597, Length=290, Percent_Identity=25.8620689655172, Blast_Score=78, Evalue=2e-14
Organism=Drosophila melanogaster, GI281360393, Length=307, Percent_Identity=24.1042345276873, Blast_Score=72, Evalue=9e-13
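Each homologue entry carries the same six fields, so the list can be turned into structured records for sorting or filtering. This is a minimal sketch, not the database's own tooling: the `Hit` type, the `parse_hits` function, and the regular expression are assumptions based only on the field layout visible in this record.

```python
import re
from typing import List, NamedTuple


class Hit(NamedTuple):
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


# Matches one "Organism=..., GI..., Length=..., ..." entry; the entries may
# be separated by commas, spaces, or line breaks.
HIT_RE = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*"
    r"Length=(?P<length>\d+),\s*Percent_Identity=(?P<pid>[\d.]+),\s*"
    r"Blast_Score=(?P<score>\d+),\s*Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_hits(text: str) -> List[Hit]:
    """Extract every homologue entry found in the given text."""
    return [
        Hit(
            organism=m.group("organism").strip(),
            gi=m.group("gi"),
            length=int(m.group("length")),
            percent_identity=float(m.group("pid")),
            blast_score=int(m.group("score")),
            evalue=float(m.group("evalue")),
        )
        for m in HIT_RE.finditer(text)
    ]


# Two entries copied from the homologue list above.
sample = (
    "Organism=Homo sapiens, GI187423904, Length=212, "
    "Percent_Identity=27.8301886792453, Blast_Score=81, Evalue=3e-15, "
    "Organism=Escherichia coli, GI87081873, Length=529, "
    "Percent_Identity=24.952741020794, Blast_Score=132, Evalue=5e-32,"
)
hits = parse_hits(sample)
best = min(hits, key=lambda h: h.evalue)
print(best.organism, best.gi, best.evalue)
```

Sorting by E-value rather than by percent identity is a design choice: the short alignments in this list can show higher identity while being less significant.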
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR006047
- InterPro: IPR017853
- InterPro: IPR013781 [H]
Pfam domain/function: PF00128 Alpha-amylase [H]
EC number: 2.4.1.4 [H]
Molecular weight: Translated: 74570; Mature: 74570
Theoretical pI: Translated: 5.69; Mature: 5.69
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.4% Cys (Translated Protein)
2.3% Met (Translated Protein)
3.7% Cys+Met (Translated Protein)
1.4% Cys (Mature Protein)
2.3% Met (Mature Protein)
3.7% Cys+Met (Mature Protein)
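The Cys/Met figures are presumably residue counts divided by the protein length. A small sketch of that calculation, assuming a plain percentage by residue count; `residue_percent` is a hypothetical helper, and the demonstration uses only the first 60 residues of the protein, so its numbers differ from the whole-protein values listed above.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    count = sum(seq.count(r) for r in residues.upper())
    return 100.0 * count / len(seq)


# First 60 residues of the protein sequence: 1 Cys and 3 Met.
n_terminus = "MQPLTVQTMHLPNPTTFDDLLDQQIANSRDRDIFRLRMQRHFGDCLEALGALYAQHPAWP"

cys = residue_percent(n_terminus, "C")
met = residue_percent(n_terminus, "M")
cys_met = residue_percent(n_terminus, "CM")
print(f"{cys:.1f} {met:.1f} {cys_met:.1f}")  # 1.7 5.0 6.7
```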
Secondary structure:
>Translated Secondary Structure
MQPLTVQTMHLPNPTTFDDLLDQQIANSRDRDIFRLRMQRHFGDCLEALGALYAQHPAWP
CCCCEEEEEECCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHH
QLLEQLPERLITAYAQRRDALKIHDLAREIQPDWFAEATMVGGIYYVDRLAGTLRGVIEH
HHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
IDYLQELGLTYVHLMPLLQPRHGPNDGGYAVLDYRSIDQRLGNVADFIELSDLLRTNGIS
HHHHHHHCHHHHHHHHHHCCCCCCCCCCEEEEEHHHHHHHHHHHHHHHHHHHHHHHCCHH
LCIDVVVNHTAKEHEWAVKARAGDAQYLDYYLSFADRSLPDAYEQHLPEVFPDFAPGNFT
HHHHHHHHCCCHHCCEEEEECCCCHHHHHHHHHHHCCCCCHHHHHHCHHHCCCCCCCCEE
WYAELSEHGRWVWTTFNEFQWDLNYTNPMVWLEMLDILLYLANLGVDVLRLDAVPFMWKR
EEEEHHHCCCEEEEECCCEEEECCCCCHHHHHHHHHHHHHHHHCCCCEEEHHHHHHHHHH
LGTNCQNQPEVLDLLQAWRAAMRIVCPATIFKAEAIVAPDDLVQYLGLGRRTGKLCEIAY
HCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHCCEEECHHHHHHHHCCCCCCCCEEEEEE
HNSLMVLLWSALASQRADLFTQSLLNMPATPSNAAWITYVRCHDDIGWAVTDHNAALVGE
CCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEECCCCEEEECCCEEEECC
DGPLHRQFLSAWYSGEFAGSFARGEVFQYNPLTNDRRISGMTASLAGLEQALETTDPAAI
CCCHHHHHHHHHHCCCCCCCHHCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHCCCCHHH
ELTIRRIALLYAVIFSFGGIPLIYMGDELGMLNDHSYLHDPTKANDNRWLHRPAMDWCLA
HHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCEEECHHHHHHHH
AQRHDPTTLAGRLWQVLRHLIQVRQHTPALHSAGQTLPIWTQQRHVLGVVRVHPLGRILI
HCCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHCCCCCCCCCCCCHHEEEEEECCCCEEEE
LGNLSATPQRVSLAVIQQAGLVGRLYNLLDNDSLNIDTQSHEIILDAYQCCWLSIQA
EECCCCCCHHHHHHHHHHCCHHHHHHHHHCCCCCCCCCCCCHHHHEEHHHHEEEECC
>Mature Secondary Structure
MQPLTVQTMHLPNPTTFDDLLDQQIANSRDRDIFRLRMQRHFGDCLEALGALYAQHPAWP
CCCCEEEEEECCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHH
QLLEQLPERLITAYAQRRDALKIHDLAREIQPDWFAEATMVGGIYYVDRLAGTLRGVIEH
HHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
IDYLQELGLTYVHLMPLLQPRHGPNDGGYAVLDYRSIDQRLGNVADFIELSDLLRTNGIS
HHHHHHHCHHHHHHHHHHCCCCCCCCCCEEEEEHHHHHHHHHHHHHHHHHHHHHHHCCHH
LCIDVVVNHTAKEHEWAVKARAGDAQYLDYYLSFADRSLPDAYEQHLPEVFPDFAPGNFT
HHHHHHHHCCCHHCCEEEEECCCCHHHHHHHHHHHCCCCCHHHHHHCHHHCCCCCCCCEE
WYAELSEHGRWVWTTFNEFQWDLNYTNPMVWLEMLDILLYLANLGVDVLRLDAVPFMWKR
EEEEHHHCCCEEEEECCCEEEECCCCCHHHHHHHHHHHHHHHHCCCCEEEHHHHHHHHHH
LGTNCQNQPEVLDLLQAWRAAMRIVCPATIFKAEAIVAPDDLVQYLGLGRRTGKLCEIAY
HCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHCCEEECHHHHHHHHCCCCCCCCEEEEEE
HNSLMVLLWSALASQRADLFTQSLLNMPATPSNAAWITYVRCHDDIGWAVTDHNAALVGE
CCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEECCCCEEEECCCEEEECC
DGPLHRQFLSAWYSGEFAGSFARGEVFQYNPLTNDRRISGMTASLAGLEQALETTDPAAI
CCCHHHHHHHHHHCCCCCCCHHCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHCCCCHHH
ELTIRRIALLYAVIFSFGGIPLIYMGDELGMLNDHSYLHDPTKANDNRWLHRPAMDWCLA
HHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCEEECHHHHHHHH
AQRHDPTTLAGRLWQVLRHLIQVRQHTPALHSAGQTLPIWTQQRHVLGVVRVHPLGRILI
HCCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHCCCCCCCCCCCCHHEEEEEECCCCEEEE
LGNLSATPQRVSLAVIQQAGLVGRLYNLLDNDSLNIDTQSHEIILDAYQCCWLSIQA
EECCCCCCHHHHHHHHHHCCHHHHHHHHHCCCCCCCCCCCCHHHHEEHHHHEEEECC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 12517860 [H]