Definition Nostoc sp. PCC 7120, complete genome.
Accession NC_003272
Length 6,413,771

The map label for this gene is 17228805

Identifier: 17228805

GI number: 17228805

Start: 1552356

End: 1554590

Strand: Direct

Name: 17228805

Synonym: alr1310

Alternate gene names: NA

Gene position: 1552356-1554590 (Clockwise)

Preceding gene: 17228804

Following gene: 17228806

Centisome position: 24.2
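
The centisome value is consistent with the gene start coordinate expressed as a percentage of the genome length, and the coordinates imply the 2235-base gene length reported in the sequence header. The record does not state the formula, so the calculation below is an assumption, and the function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 6_413_771   # Length
GENE_START = 1_552_356      # Start
GENE_END = 1_554_590        # End


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given inclusive 1-based coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


if __name__ == "__main__":
    print(gene_length(GENE_START, GENE_END))                # 2235
    print(round(centisome(GENE_START, GENOME_LENGTH), 1))   # 24.2
```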

GC content: 44.74
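
GC content is conventionally the percentage of G and C bases in a sequence. The sketch below shows one way to compute it from a FASTA-style body; it assumes the record's figure is calculated over the gene sequence alone, which the record does not confirm. `gc_percent` is a hypothetical helper name.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks (as found in a wrapped FASTA body) are ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # First ten bases of the gene sequence: 1 G and 3 C out of 10.
    print(gc_percent("ATGACTCATC"))  # 40.0
```

To check the reported value, pass the full gene sequence (without the `>2235_bases` header line) to the function.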

Gene sequence:

>2235_bases
ATGACTCATCCGCTATACGTTGCTTTTATTTGGCATCAACATCAGCCATTGTATAAATCTCCTGGTAGCAGCCTTTCAAC
GCCTTCTAGTCAGCAATATCGCCTGCCTTGGGTGCGCTTACATGGTACAAAGGATTATTTAGATTTAATACTAATTTTGG
AAAAGTATCCAAAATTACATCAGACAGTAAACTTAGTTCCTTCCTTAATTCTGCAACTGGAAGATTATATTGCTGGTACA
GCGTTTGACCCTTACCTCAAAGCCAGTCTGACACCAACTGAGCAACTGACTCAGCAACAGCGAGAGTTTATCATTCAGCA
CTTTTTTGATGCCAATCACCACACCTTGATTGACCCTCACCCCCGCTATGCCGAGTTGTACTATCAAAGGCAGGAAAAAG
GACAGTCTTGGTGTTTGGCGAATTGGCAATTAGCAGATTACAGCGATTTGTTGGCGTGGCATAATTTGGCATGGATAGAC
CCACTGTTTTGGGATGACCCAGAAATTGCGGCTTGGTTACAGCAGGGGCGTAACTTTACATTAGGCGATCGCCAGCGCAT
TTATTCCAAACAACGTGATATTCTCAGCCGCATTATTCCTCAACACCGGAAAATGCAGGAATCGGGGCAATTAGAAGTTA
CCACCACACCCTATACTCACCCGATTTTGCCTTTGTTGGCTGATACTAATTCTGGTCGGGTAGCTGTGCCAAATATGGCG
TTACCAGAATCTCGGTTTCAGTGGTCAGAAGATATTCCTCGTCATTTAAGAAAAGCTTGGGAACTTTATACAGAAAGATT
CGGGCAGGAACCAAAGGGTTTATGGCCTTCCGAACAATCAGTTAGTCCAGATATATTACCGTATATTATTAAACAAGGAT
TTCAGTGGATTTGCTCAGATGAAGCAGTCTTGGGGTGGACACTAAAACACTTCTTTCATCGGGATGGGGCAGGGAATGTA
CAACAGCCAGAACTGTTGTATCGTCCTTATCGCCTAGCAACTCCAGCCGGAGATTTAGCTATTGTCTTCCGTGACCACAG
ATTATCAGATTTGATTGGCTTTACCTATGGGGCAATGCCCGCCAAACAGGCAGCCGCCGATCTGGTGGGACACCTGCAAG
CGATCGCCAAAATGCAACGAGAGCGACCAAGTGAACAGCCTTGGTTAGTGACTATCGCCTTAGATGGCGAAAACTGTTGG
GAATTTTACCCCCAAGATGGCAAACCATTCCTAGAAGCTTTATATCAAAGTTTAAGCAACGAACCCCACATCAAACTCGT
CACTGTCTCCGAATTTATCGAAGAATTTCCTGCCACAGCGACCATTCCCGCAGAACAACTACATAGCGGTTCTTGGGTTG
ATGGCAGTTTTACCACCTGGATTGGTGATCCTGCCAAAAATCGAGCTTGGGATTACCTTACAGAAGCGAGAATCATGTTG
GCAAATCATCCCGAAGCAACAGAAGAAAATAACCCCGAAGCTTGGGAAGCTTTATATGCTGCCGAAGGTTCAGACTGGTT
TTGGTGGTTTGGGGAAGGGCATTCCTCAAATCAAGATGCCATTTTTGACCAATTATTTCGAGAACATTTGTGTGGCATCT
ATAAAGCTTTGAATGAACCCATACCCGCATATCTCAAGCATCCAGTGGAGGTTCATGCAGCCAGAGCCGATCATTCCCCT
GAAGGCTTCATTCATCCTGTAATTGATGGCAGAGGAGATGAGCAAGATTGGGACAAAGCTGGACGGATAGAAATTGGTGG
GGCGAGGGGGACAATGCACAACAGCAGCATTGTTCAGCGCCTATGGTATGGGGTAGATCACCTGAATTTCTATTTGCGAG
TAGATTTTAAAAGTGGTGTTACCCCTGGACATGGTTTGCCTCCAGAGTTAAACCTGTTGTGGTTTTATCCAGACCAAACA
ATGCACAACAGTCCGATTCCTTTAGCTGATGTGCCGGATACAGCCCCACTTAATTATTTATTCCATCATCATTTGGAAAT
TAACTTGCTGACCCAATCAATTCAATTTCGGGAAGCAGCAGAAAATTATCAATGGCATCCCCGTTTCAGCCGCGCTCAAG
TCGCTTTAGAAAATTGTTTAGAAGTAGCAATACCCTGGGCAGATTTGCAAGTTCCGCCAGATTATCCTCTGCGGCTAATT
CTAGTACTTGCTGATGAGGGACGTTTTAGTAAATATTTACCAGAAAACACTTTGATTCCGATTGAAGTGCCGTAG

Upstream 100 bases:

>100_bases
CTGAGGATTAGGAAAGAGGCAATTTACCCAATCCCCAATACCCAATACCCAACACCCAATACCCAATACCCAATACCCAA
CACCCAATCCCTAAGACTCT

Downstream 100 bases:

>100_bases
GTAGCAATAGGGGGGTAGGGTATAAAATCTCACTGATTTAGAGTGTAGGGATTAATGAAACTATGATTATGTCCTTACTT
TTATGGTGATCACTCATGGC

Product: hypothetical protein

Products: NA

Alternate protein names: Glycoside Hydrolase Family Protein; Amylopullulanase; Glycoside Hydrolase; Glycosyl Hydrolase Family Protein; Pullulanase; Family; Glycosyl Hydrolase Family; Alpha-Amylase/Alpha-Mannosidase; Membrane Bound Alpha-Amylase; Pullulanase Glycoside Hydrolase Family; Alpha-Dextran Endo-1,6-Alpha-Glucosidase; Alpha-Mannosidase; Amylopullulanase Related Protein

Number of amino acids: Translated: 744; Mature: 743
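
The two residue counts follow from the 2235-base gene length: 745 codons, less the terminal stop codon, gives 744 translated residues. The mature count of 743 is consistent with removal of the initiator methionine, which is an inference from the mature sequence in this record starting at THPLY rather than something the record states. A minimal sketch of the arithmetic, with a hypothetical function name:

```python
def residue_counts(orf_length_bases: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for an ORF.

    Assumes the ORF length includes the stop codon and that the mature
    protein differs only by loss of the N-terminal methionine.
    """
    if orf_length_bases % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    codons = orf_length_bases // 3
    translated = codons - 1   # the stop codon encodes no residue
    mature = translated - 1   # initiator Met removed (assumption)
    return translated, mature


if __name__ == "__main__":
    print(residue_counts(2235))  # (744, 743)
```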

Protein sequence:

>744_residues
MTHPLYVAFIWHQHQPLYKSPGSSLSTPSSQQYRLPWVRLHGTKDYLDLILILEKYPKLHQTVNLVPSLILQLEDYIAGT
AFDPYLKASLTPTEQLTQQQREFIIQHFFDANHHTLIDPHPRYAELYYQRQEKGQSWCLANWQLADYSDLLAWHNLAWID
PLFWDDPEIAAWLQQGRNFTLGDRQRIYSKQRDILSRIIPQHRKMQESGQLEVTTTPYTHPILPLLADTNSGRVAVPNMA
LPESRFQWSEDIPRHLRKAWELYTERFGQEPKGLWPSEQSVSPDILPYIIKQGFQWICSDEAVLGWTLKHFFHRDGAGNV
QQPELLYRPYRLATPAGDLAIVFRDHRLSDLIGFTYGAMPAKQAAADLVGHLQAIAKMQRERPSEQPWLVTIALDGENCW
EFYPQDGKPFLEALYQSLSNEPHIKLVTVSEFIEEFPATATIPAEQLHSGSWVDGSFTTWIGDPAKNRAWDYLTEARIML
ANHPEATEENNPEAWEALYAAEGSDWFWWFGEGHSSNQDAIFDQLFREHLCGIYKALNEPIPAYLKHPVEVHAARADHSP
EGFIHPVIDGRGDEQDWDKAGRIEIGGARGTMHNSSIVQRLWYGVDHLNFYLRVDFKSGVTPGHGLPPELNLLWFYPDQT
MHNSPIPLADVPDTAPLNYLFHHHLEINLLTQSIQFREAAENYQWHPRFSRAQVALENCLEVAIPWADLQVPPDYPLRLI
LVLADEGRFSKYLPENTLIPIEVP

Sequences:

>Translated_744_residues
MTHPLYVAFIWHQHQPLYKSPGSSLSTPSSQQYRLPWVRLHGTKDYLDLILILEKYPKLHQTVNLVPSLILQLEDYIAGT
AFDPYLKASLTPTEQLTQQQREFIIQHFFDANHHTLIDPHPRYAELYYQRQEKGQSWCLANWQLADYSDLLAWHNLAWID
PLFWDDPEIAAWLQQGRNFTLGDRQRIYSKQRDILSRIIPQHRKMQESGQLEVTTTPYTHPILPLLADTNSGRVAVPNMA
LPESRFQWSEDIPRHLRKAWELYTERFGQEPKGLWPSEQSVSPDILPYIIKQGFQWICSDEAVLGWTLKHFFHRDGAGNV
QQPELLYRPYRLATPAGDLAIVFRDHRLSDLIGFTYGAMPAKQAAADLVGHLQAIAKMQRERPSEQPWLVTIALDGENCW
EFYPQDGKPFLEALYQSLSNEPHIKLVTVSEFIEEFPATATIPAEQLHSGSWVDGSFTTWIGDPAKNRAWDYLTEARIML
ANHPEATEENNPEAWEALYAAEGSDWFWWFGEGHSSNQDAIFDQLFREHLCGIYKALNEPIPAYLKHPVEVHAARADHSP
EGFIHPVIDGRGDEQDWDKAGRIEIGGARGTMHNSSIVQRLWYGVDHLNFYLRVDFKSGVTPGHGLPPELNLLWFYPDQT
MHNSPIPLADVPDTAPLNYLFHHHLEINLLTQSIQFREAAENYQWHPRFSRAQVALENCLEVAIPWADLQVPPDYPLRLI
LVLADEGRFSKYLPENTLIPIEVP
>Mature_743_residues
THPLYVAFIWHQHQPLYKSPGSSLSTPSSQQYRLPWVRLHGTKDYLDLILILEKYPKLHQTVNLVPSLILQLEDYIAGTA
FDPYLKASLTPTEQLTQQQREFIIQHFFDANHHTLIDPHPRYAELYYQRQEKGQSWCLANWQLADYSDLLAWHNLAWIDP
LFWDDPEIAAWLQQGRNFTLGDRQRIYSKQRDILSRIIPQHRKMQESGQLEVTTTPYTHPILPLLADTNSGRVAVPNMAL
PESRFQWSEDIPRHLRKAWELYTERFGQEPKGLWPSEQSVSPDILPYIIKQGFQWICSDEAVLGWTLKHFFHRDGAGNVQ
QPELLYRPYRLATPAGDLAIVFRDHRLSDLIGFTYGAMPAKQAAADLVGHLQAIAKMQRERPSEQPWLVTIALDGENCWE
FYPQDGKPFLEALYQSLSNEPHIKLVTVSEFIEEFPATATIPAEQLHSGSWVDGSFTTWIGDPAKNRAWDYLTEARIMLA
NHPEATEENNPEAWEALYAAEGSDWFWWFGEGHSSNQDAIFDQLFREHLCGIYKALNEPIPAYLKHPVEVHAARADHSPE
GFIHPVIDGRGDEQDWDKAGRIEIGGARGTMHNSSIVQRLWYGVDHLNFYLRVDFKSGVTPGHGLPPELNLLWFYPDQTM
HNSPIPLADVPDTAPLNYLFHHHLEINLLTQSIQFREAAENYQWHPRFSRAQVALENCLEVAIPWADLQVPPDYPLRLIL
VLADEGRFSKYLPENTLIPIEVP

Specific function: Unknown

COG id: COG1449

COG function: function code G; Alpha-amylase/alpha-mannosidase

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 85773; Mature: 85641

Theoretical pI: Translated: 5.49; Mature: 5.49

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.7 %Cys     (Translated Protein)
1.1 %Met     (Translated Protein)
1.7 %Cys+Met (Translated Protein)
0.7 %Cys     (Mature Protein)
0.9 %Met     (Mature Protein)
1.6 %Cys+Met (Mature Protein)
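
Percentages of this kind can be reproduced by counting residues in the one-letter protein sequence. The helper below is an illustrative sketch, not the tool used to generate the record; `residue_percent` is a hypothetical name.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any letter in `residues`."""
    sequence = "".join(protein.split()).upper()
    if not sequence:
        raise ValueError("empty sequence")
    hits = sum(1 for aa in sequence if aa in residues.upper())
    return 100.0 * hits / len(sequence)


if __name__ == "__main__":
    # First 20 residues of the translated protein: one Met, no Cys.
    fragment = "MTHPLYVAFIWHQHQPLYKS"
    print(residue_percent(fragment, "C"))   # 0.0
    print(residue_percent(fragment, "M"))   # 5.0
    print(residue_percent(fragment, "CM"))  # 5.0
```

Because each reported figure is rounded separately, the Cys and Met percentages need not sum exactly to the combined Cys+Met value.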

Secondary structure:

>Translated Secondary Structure
MTHPLYVAFIWHQHQPLYKSPGSSLSTPSSQQYRLPWVRLHGTKDYLDLILILEKYPKLH
CCCCEEEEEEEECCCCHHCCCCCCCCCCCCCCEECCEEEECCCHHHHHHHHHHHHCHHHH
QTVNLVPSLILQLEDYIAGTAFDPYLKASLTPTEQLTQQQREFIIQHFFDANHHTLIDPH
HHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCHHHHHHHHHHHHHHHHHCCCCCEECCCC
PRYAELYYQRQEKGQSWCLANWQLADYSDLLAWHNLAWIDPLFWDDPEIAAWLQQGRNFT
CHHHHHHHHHHHCCCCEEEECCEECCHHHHHHHCCHHHCCCCCCCCHHHHHHHHCCCCCC
LGDRQRIYSKQRDILSRIIPQHRKMQESGQLEVTTTPYTHPILPLLADTNSGRVAVPNMA
CCHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCEEEEECCCCCEEEECCCC
LPESRFQWSEDIPRHLRKAWELYTERFGQEPKGLWPSEQSVSPDILPYIIKQGFQWICSD
CCHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHCCCCEEECC
EAVLGWTLKHFFHRDGAGNVQQPELLYRPYRLATPAGDLAIVFRDHRLSDLIGFTYGAMP
CHHHHHHHHHHHHCCCCCCCCCCHHHHCCEEECCCCCCEEEEEECCCHHHHHHHHHCCCC
AKQAAADLVGHLQAIAKMQRERPSEQPWLVTIALDGENCWEFYPQDGKPFLEALYQSLSN
HHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEECCCCHHHCCCCCCCHHHHHHHHHHCC
EPHIKLVTVSEFIEEFPATATIPAEQLHSGSWVDGSFTTWIGDPAKNRAWDYLTEARIML
CCCEEEEEHHHHHHHCCCCCCCCHHHHCCCCCCCCCEEEECCCCCCCCHHHHHHHCEEEE
ANHPEATEENNPEAWEALYAAEGSDWFWWFGEGHSSNQDAIFDQLFREHLCGIYKALNEP
ECCCCCCCCCCCHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHCCC
IPAYLKHPVEVHAARADHSPEGFIHPVIDGRGDEQDWDKAGRIEIGGARGTMHNSSIVQR
CHHHHCCCHHEEEECCCCCCCCCEEECCCCCCCCCCCCCCCCEEECCCCCCCCHHHHHHH
LWYGVDHLNFYLRVDFKSGVTPGHGLPPELNLLWFYPDQTMHNSPIPLADVPDTAPLNYL
HHHCHHHEEEEEEEEECCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCHHHH
FHHHLEINLLTQSIQFREAAENYQWHPRFSRAQVALENCLEVAIPWADLQVPPDYPLRLI
HHHEEEEEEEHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEE
LVLADEGRFSKYLPENTLIPIEVP
EEEECCCCHHHCCCCCCEEEEECC
>Mature Secondary Structure 
THPLYVAFIWHQHQPLYKSPGSSLSTPSSQQYRLPWVRLHGTKDYLDLILILEKYPKLH
CCCEEEEEEEECCCCHHCCCCCCCCCCCCCCEECCEEEECCCHHHHHHHHHHHHCHHHH
QTVNLVPSLILQLEDYIAGTAFDPYLKASLTPTEQLTQQQREFIIQHFFDANHHTLIDPH
HHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCHHHHHHHHHHHHHHHHHCCCCCEECCCC
PRYAELYYQRQEKGQSWCLANWQLADYSDLLAWHNLAWIDPLFWDDPEIAAWLQQGRNFT
CHHHHHHHHHHHCCCCEEEECCEECCHHHHHHHCCHHHCCCCCCCCHHHHHHHHCCCCCC
LGDRQRIYSKQRDILSRIIPQHRKMQESGQLEVTTTPYTHPILPLLADTNSGRVAVPNMA
CCHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCEEEEECCCCCEEEECCCC
LPESRFQWSEDIPRHLRKAWELYTERFGQEPKGLWPSEQSVSPDILPYIIKQGFQWICSD
CCHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHCCCCEEECC
EAVLGWTLKHFFHRDGAGNVQQPELLYRPYRLATPAGDLAIVFRDHRLSDLIGFTYGAMP
CHHHHHHHHHHHHCCCCCCCCCCHHHHCCEEECCCCCCEEEEEECCCHHHHHHHHHCCCC
AKQAAADLVGHLQAIAKMQRERPSEQPWLVTIALDGENCWEFYPQDGKPFLEALYQSLSN
HHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEECCCCHHHCCCCCCCHHHHHHHHHHCC
EPHIKLVTVSEFIEEFPATATIPAEQLHSGSWVDGSFTTWIGDPAKNRAWDYLTEARIML
CCCEEEEEHHHHHHHCCCCCCCCHHHHCCCCCCCCCEEEECCCCCCCCHHHHHHHCEEEE
ANHPEATEENNPEAWEALYAAEGSDWFWWFGEGHSSNQDAIFDQLFREHLCGIYKALNEP
ECCCCCCCCCCCHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHCCC
IPAYLKHPVEVHAARADHSPEGFIHPVIDGRGDEQDWDKAGRIEIGGARGTMHNSSIVQR
CHHHHCCCHHEEEECCCCCCCCCEEECCCCCCCCCCCCCCCCEEECCCCCCCCHHHHHHH
LWYGVDHLNFYLRVDFKSGVTPGHGLPPELNLLWFYPDQTMHNSPIPLADVPDTAPLNYL
HHHCHHHEEEEEEEEECCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCHHHH
FHHHLEINLLTQSIQFREAAENYQWHPRFSRAQVALENCLEVAIPWADLQVPPDYPLRLI
HHHEEEEEEEHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEE
LVLADEGRFSKYLPENTLIPIEVP
EEEECCCCHHHCCCCCCEEEEECC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA