Definition Clostridium botulinum A str. Hall, complete genome.
Accession NC_009698
Length 3,760,560

Click here to switch to the map view.

The map label for this gene is polC [H]

Identifier: 153935522

GI number: 153935522

Start: 2397736

End: 2402034

Strand: Reverse

Name: polC [H]

Synonym: CLC_2270

Alternate gene names: 153935522

Gene position: 2402034-2397736 (Counterclockwise)

Preceding gene: 153934543

Following gene: 153935745

Centisome position: 63.87

GC content: 30.61

Gene sequence:

>4299_bases
ATGGAAAATACTTCAGTTGGTGATTTGCAAAAGGAAATAAATAAAGAAAGGTTTGATACAGATAAAGAAATAAAAATAGT
AAGAGTTCAGTATTTTAGAAAAAAAAATAAACTAAGAATAATACTAAAATCCATTGGTAATTTTACTAAAGAAAAAGAAG
ACCATATAAAAAATATATTAAAGAAGAGATTTTCTATGGTGGAGGATTTTGAAATAATCTGCTATAAAGATCTATCCAAT
ATTACCCTAGAGGAACTTTCAAAAAAATATTGGGTAGATATAGTGAATTTAGCTTCTTCTTCTGTACCTATTGCAAGGGA
TTGTCTTTTAAAATCTAAAAGAGAAGTGCTAGAAGATAGTATAAATATAACCTATAACAATGAATTTTTATGTAGGTTTC
TTTCAAAAAATAAATTTGAAGGCAAACTGAAATCCTATATAAGAGATATATTTGGAATAAAATGCAATATAAAATTAGAA
TATGATAAATCTTTTAATGAAGAAGATTATTTTAAAACTATAGAAACTATGGAAAAGTCTATGATAAAGAATGTTTTGAG
TGAGATTAAGTCTAAAGAAAAGAAAGTCATTAGAAAAGAAAATTCTCCAAAGACTAGGGAAGAGGAGAATAAGGATACTT
CTGTCATATTAGGAAGAAGTATAAAAGAAGATAGTATATTCATGAAGGATTTAAATGAAAATTCTGGTATAGTAGTGGTA
TGTGGAGATGTATTTAAAAAAGAAGTTATAGAAACAAAGACTGGGAGAAAAATAGTAACTTTTTTCATAACAGATTACAC
CAATTCTATGACAGTTAAACTTTTCCCAAGGCCTAGAGATGCAGATAGAATAATAGAGGAAATAAAAGAGGGGATTTATT
GCAAAGTTAGAGGAGAAGTAGTAAATGACCCTTACGCAAGAGAATGCGTAATAATGGCTAAGGATATAGTAAAAACTACA
AAAATAGAAAAAATGGATATAGCAGAGGAAAAAAGAGTAGAACTTCATATGCATAGTCAAATGAGTGCCATGGATGGAAT
GACTCCAGTATCAGAATTAGTGAAAAGAGCTGCTAAATGGGGGCATAAAGCAGTGGCTATTACAGATCATGGTGTAGTTC
AGGCTTATCCAGATGCAATGTCTGCAGCTAAAAAAGGTAATATAAAAGTTATATACGGAATAGAAGCTTATCTTGTAGAT
GATGGAGTACCTATTGTATCTAAAGCAGGGGATAAAACCATAGAGGATACTTTTGTAGTTTTTGATTTAGAAACTACAGG
CTTTTCCAGCAAAAACGATAAAATAATAGAAATAGGTGCAGTAAAGATAGAAAAGGGAAAAATAGTAGATAGGTTTAGTG
AATTTGTAAATCCAGAAAGGAGCATACCAGACAAAATAACAGAACTTACAAGCATAACAGATGATATGGTAGATGATAAA
GAAGCTATAGATAAAATACTTCCAAGATTTATAGATTTTATAGGAGACTCTGTGGTGGTTGCTCATAATGCAAGCTTTGA
TGTGTCTTTTATAAATAAAAATTGTAAAGATTTAAAAATAGAATTTGAAAATAGTGTAATGGATACGGTGACACTAAGCA
GATTTTTATTTCCTGAATTAAAGAGATATAAATTAAATGTTATAGCAAAGCATTTAGGTATATCCTTAGAAAATCATCAT
AGAGCTGTAGATGATGCTAAAGCCACAGCAGATATCTTATTAAAATCTTTTGAAATGTTAAGAGAAAAAGAAATAATTAC
CTTACATAGTTTAAATAAAGAGTTTTTAGGAAATATAAATATAAAAAAAGAGCCCACCCATCATTTAATAATTTTGGCTA
AAAATGCAGTAGGTCTTAAAAATTTATATAAATTAGTTTCTAAATCTAATTTGGAGTATTTCTTTAAAAGACCAAGAATG
GCTAAAAGTCTTATAAATGAGTATAAAGAAGGACTTATAATAGGTTCTGCCTGTGAGGCCGGTCAAATATATAAAGAAAT
ATTAAAGGGAAAAACAAAAGAAGAATTAAAAAATATGATAGATTTTTATGATTATTTAGAAATACAACCTCTAGGTAATA
ATGAATTCCTTGTAAAAAACGGTTCCGTAAAAGATGAGAAAGAGCTTGAAGAAATAAATAAAAAGATATGTGAATTAGGT
GAGTTTTATAATAAACCTGTAGTAGCTACCTGTGATGTGCATTTTTTAGATGAAGAACATGAAGTATTTAGAAGAATATT
AATGTCAGGTCAAGGCTTTAGTGATGCAGATAATCAACCACCACTTTATTTTAGAACCACAGAGGAAATGCTTAAAGAAT
TTGAATATTTAGGAGAGGAAAAGGCTAAAGAAGTAGTAATTTATAATACAAATAAGATAGCAGATATGATAGATGAAATA
AAACCAATTCCAGAGGAAACTTTTCCGCCTAAGATAGAGGGAGCAGAAGAAGATATAAGAAGAATGACTATAGAAAAGGC
TCATTCTATATATGGAGATCCGCTTCCTGAAATAGTAGAAAAAAGATTAACAAAAGAATTAAATTCTATAATAAACAATG
GTTATGCTGTACTTTATTTAATAGCTCATAAGCTAGTGGCCAAATCCTATGAAGATGGATATTTAGTAGGATCCAGAGGA
TCTGTTGGATCTTCCTTTGCAGCTACTATGTCAGATATTACAGAAGTTAATGGATTACCACCTCATTATGTTTGTCCTAA
ATGTAAATATAGTGAATTCTTTACGGATGGGTCTGTAGGTTCAGGAGCGGATTTAGAGGATAAGGATTGCCCAAATTGCG
GAACTAAATTAATAAAAGATGGTCATGATATACCTTTTGAAACTTTTTTAGGTTTCGAAGGGGATAAAGAACCAGATATA
GATTTAAATTTTTCAGGAGAATATCAAGGAGTAGTTCATAAATATACAGAAGTTCTATTTGGAAAAGGGCATGTTTTTAA
AGCAGGAACTATTGGTACTATAGCAGAAAAGACTGCCTATGGTTTTGTAAAAAAGTATTTAGATGAAAAGAATATAAAAG
CTAACAATGCAGAAATAGAAAGATTATCTAAAGGATGTACAGGAGTTAAAAGGACATCGGGACAACATCCAGGAGGAATT
ATGGTTGTGCCTGCAGATAATGAAATATTTAACTTTACCCCTATACAACATCCAGCAGATGATACAGAAAGTGACGTTAT
AACTACCCATTTTGATTATCATTCTATAAGTGGTAGACTTTTAAAATTAGATATATTAGGCCATGACGACCCTACAGTAC
TTAGAATGCTTCAGGATTTGACTCAGGTAGATCCTAAATCTGTACCTCTAGGAGATCCAAAGGTTATAAGTTTATTTACA
TCACCAGAGGCTTTAGGGGTTACAGAGGAGGATATAGATTGTCCTGTAGGAACTTATGGGCTCCCAGAGTTTGGAACTAA
ATTCGTAAGACAAATGTTATTAGATACCAAACCAAAAAGTTTTGCAGAATTGGTAAGAATATCAGGATTATCCCATGGTA
CAGATGTATGGCTTAATAACGCTCAGTATTTTATAAAAGAGGGGTATACTACTATAAAGGATTGTATATCTACAAGAGAT
GATATAATGGTATATTTACTTCAAAAGGAACTGCCACCAAAGAGCGCTTTTACAATAATGGAAAAAGTGAGAAAGGGTAA
AGGCCTTACAGAGGAACAGGAAGCTTTAATGAAAGAACACGATGTGCCAGATTGGTATATAGAGTCCTGTAAGAAGATAA
AGTATATGTTCCCTAAAGGTCATGCTGTAGCCTATGTAATGATGGCCATGAGAATAGCTTATTTTAAAGTTTATTATCCA
AGGGCTTATTATGCTACTTATTTTTCTGTTAGAGCGGATGATTTTGATGCAGATCTAATAGTAAAGGGAGAAGAAAAAAT
AAAAGAAAAAATGAATGAATTATATGCTATGGGCAATAATATAACTCAAAAGGATAAAGGTCTTTTAACTATACTTGAAA
TATGTTATGAAATGTATAAAAGGGGAATAAAGTTTTTAAAGGTAGATTTATATAAATCTAATGCTACTAAATTTTTAATA
GAAGGAGATAGTATAAGACCACCTTTAAGCGCCCTTCAAGGTGTAGGAAAAAATGCAGCTAAAAGTATAGAAGAAGCTAG
ATTAGAGGGCGAATTTATATCAAAAGAAGATTTAAGATTAAGAACTAAGGCTACAAAAACTGTTATAGAGACTTTAGAAA
ATCATGGTTGCTTAAGAGGAATGCAGGAAACTAACCAAATATCCTTATTTAGCCTTTAG

Upstream 100 bases:

>100_bases
TACTTTAAATTTTAAGTTGAAATTCATTTTTTTATTTTTATAAATGTGATAATATATTTATTATGTTTTACCGGATAACA
TTACTCTTAGGAGGAATAAT

Downstream 100 bases:

>100_bases
TGGAAAATTAAATTTAAACTATACTTATAATAAAATGGATATTTTTATTAAAAAGACTTATGTTGTCAATAGCTATTTTT
TTAGTTTCATAACTTTTCCT

Product: DNA polymerase III PolC

Products: NA

Alternate protein names: PolIII [H]

Number of amino acids: Translated: 1432; Mature: 1432

Protein sequence:

>1432_residues
MENTSVGDLQKEINKERFDTDKEIKIVRVQYFRKKNKLRIILKSIGNFTKEKEDHIKNILKKRFSMVEDFEIICYKDLSN
ITLEELSKKYWVDIVNLASSSVPIARDCLLKSKREVLEDSINITYNNEFLCRFLSKNKFEGKLKSYIRDIFGIKCNIKLE
YDKSFNEEDYFKTIETMEKSMIKNVLSEIKSKEKKVIRKENSPKTREEENKDTSVILGRSIKEDSIFMKDLNENSGIVVV
CGDVFKKEVIETKTGRKIVTFFITDYTNSMTVKLFPRPRDADRIIEEIKEGIYCKVRGEVVNDPYARECVIMAKDIVKTT
KIEKMDIAEEKRVELHMHSQMSAMDGMTPVSELVKRAAKWGHKAVAITDHGVVQAYPDAMSAAKKGNIKVIYGIEAYLVD
DGVPIVSKAGDKTIEDTFVVFDLETTGFSSKNDKIIEIGAVKIEKGKIVDRFSEFVNPERSIPDKITELTSITDDMVDDK
EAIDKILPRFIDFIGDSVVVAHNASFDVSFINKNCKDLKIEFENSVMDTVTLSRFLFPELKRYKLNVIAKHLGISLENHH
RAVDDAKATADILLKSFEMLREKEIITLHSLNKEFLGNINIKKEPTHHLIILAKNAVGLKNLYKLVSKSNLEYFFKRPRM
AKSLINEYKEGLIIGSACEAGQIYKEILKGKTKEELKNMIDFYDYLEIQPLGNNEFLVKNGSVKDEKELEEINKKICELG
EFYNKPVVATCDVHFLDEEHEVFRRILMSGQGFSDADNQPPLYFRTTEEMLKEFEYLGEEKAKEVVIYNTNKIADMIDEI
KPIPEETFPPKIEGAEEDIRRMTIEKAHSIYGDPLPEIVEKRLTKELNSIINNGYAVLYLIAHKLVAKSYEDGYLVGSRG
SVGSSFAATMSDITEVNGLPPHYVCPKCKYSEFFTDGSVGSGADLEDKDCPNCGTKLIKDGHDIPFETFLGFEGDKEPDI
DLNFSGEYQGVVHKYTEVLFGKGHVFKAGTIGTIAEKTAYGFVKKYLDEKNIKANNAEIERLSKGCTGVKRTSGQHPGGI
MVVPADNEIFNFTPIQHPADDTESDVITTHFDYHSISGRLLKLDILGHDDPTVLRMLQDLTQVDPKSVPLGDPKVISLFT
SPEALGVTEEDIDCPVGTYGLPEFGTKFVRQMLLDTKPKSFAELVRISGLSHGTDVWLNNAQYFIKEGYTTIKDCISTRD
DIMVYLLQKELPPKSAFTIMEKVRKGKGLTEEQEALMKEHDVPDWYIESCKKIKYMFPKGHAVAYVMMAMRIAYFKVYYP
RAYYATYFSVRADDFDADLIVKGEEKIKEKMNELYAMGNNITQKDKGLLTILEICYEMYKRGIKFLKVDLYKSNATKFLI
EGDSIRPPLSALQGVGKNAAKSIEEARLEGEFISKEDLRLRTKATKTVIETLENHGCLRGMQETNQISLFSL

Sequences:

>Translated_1432_residues
MENTSVGDLQKEINKERFDTDKEIKIVRVQYFRKKNKLRIILKSIGNFTKEKEDHIKNILKKRFSMVEDFEIICYKDLSN
ITLEELSKKYWVDIVNLASSSVPIARDCLLKSKREVLEDSINITYNNEFLCRFLSKNKFEGKLKSYIRDIFGIKCNIKLE
YDKSFNEEDYFKTIETMEKSMIKNVLSEIKSKEKKVIRKENSPKTREEENKDTSVILGRSIKEDSIFMKDLNENSGIVVV
CGDVFKKEVIETKTGRKIVTFFITDYTNSMTVKLFPRPRDADRIIEEIKEGIYCKVRGEVVNDPYARECVIMAKDIVKTT
KIEKMDIAEEKRVELHMHSQMSAMDGMTPVSELVKRAAKWGHKAVAITDHGVVQAYPDAMSAAKKGNIKVIYGIEAYLVD
DGVPIVSKAGDKTIEDTFVVFDLETTGFSSKNDKIIEIGAVKIEKGKIVDRFSEFVNPERSIPDKITELTSITDDMVDDK
EAIDKILPRFIDFIGDSVVVAHNASFDVSFINKNCKDLKIEFENSVMDTVTLSRFLFPELKRYKLNVIAKHLGISLENHH
RAVDDAKATADILLKSFEMLREKEIITLHSLNKEFLGNINIKKEPTHHLIILAKNAVGLKNLYKLVSKSNLEYFFKRPRM
AKSLINEYKEGLIIGSACEAGQIYKEILKGKTKEELKNMIDFYDYLEIQPLGNNEFLVKNGSVKDEKELEEINKKICELG
EFYNKPVVATCDVHFLDEEHEVFRRILMSGQGFSDADNQPPLYFRTTEEMLKEFEYLGEEKAKEVVIYNTNKIADMIDEI
KPIPEETFPPKIEGAEEDIRRMTIEKAHSIYGDPLPEIVEKRLTKELNSIINNGYAVLYLIAHKLVAKSYEDGYLVGSRG
SVGSSFAATMSDITEVNGLPPHYVCPKCKYSEFFTDGSVGSGADLEDKDCPNCGTKLIKDGHDIPFETFLGFEGDKEPDI
DLNFSGEYQGVVHKYTEVLFGKGHVFKAGTIGTIAEKTAYGFVKKYLDEKNIKANNAEIERLSKGCTGVKRTSGQHPGGI
MVVPADNEIFNFTPIQHPADDTESDVITTHFDYHSISGRLLKLDILGHDDPTVLRMLQDLTQVDPKSVPLGDPKVISLFT
SPEALGVTEEDIDCPVGTYGLPEFGTKFVRQMLLDTKPKSFAELVRISGLSHGTDVWLNNAQYFIKEGYTTIKDCISTRD
DIMVYLLQKELPPKSAFTIMEKVRKGKGLTEEQEALMKEHDVPDWYIESCKKIKYMFPKGHAVAYVMMAMRIAYFKVYYP
RAYYATYFSVRADDFDADLIVKGEEKIKEKMNELYAMGNNITQKDKGLLTILEICYEMYKRGIKFLKVDLYKSNATKFLI
EGDSIRPPLSALQGVGKNAAKSIEEARLEGEFISKEDLRLRTKATKTVIETLENHGCLRGMQETNQISLFSL
>Mature_1432_residues
MENTSVGDLQKEINKERFDTDKEIKIVRVQYFRKKNKLRIILKSIGNFTKEKEDHIKNILKKRFSMVEDFEIICYKDLSN
ITLEELSKKYWVDIVNLASSSVPIARDCLLKSKREVLEDSINITYNNEFLCRFLSKNKFEGKLKSYIRDIFGIKCNIKLE
YDKSFNEEDYFKTIETMEKSMIKNVLSEIKSKEKKVIRKENSPKTREEENKDTSVILGRSIKEDSIFMKDLNENSGIVVV
CGDVFKKEVIETKTGRKIVTFFITDYTNSMTVKLFPRPRDADRIIEEIKEGIYCKVRGEVVNDPYARECVIMAKDIVKTT
KIEKMDIAEEKRVELHMHSQMSAMDGMTPVSELVKRAAKWGHKAVAITDHGVVQAYPDAMSAAKKGNIKVIYGIEAYLVD
DGVPIVSKAGDKTIEDTFVVFDLETTGFSSKNDKIIEIGAVKIEKGKIVDRFSEFVNPERSIPDKITELTSITDDMVDDK
EAIDKILPRFIDFIGDSVVVAHNASFDVSFINKNCKDLKIEFENSVMDTVTLSRFLFPELKRYKLNVIAKHLGISLENHH
RAVDDAKATADILLKSFEMLREKEIITLHSLNKEFLGNINIKKEPTHHLIILAKNAVGLKNLYKLVSKSNLEYFFKRPRM
AKSLINEYKEGLIIGSACEAGQIYKEILKGKTKEELKNMIDFYDYLEIQPLGNNEFLVKNGSVKDEKELEEINKKICELG
EFYNKPVVATCDVHFLDEEHEVFRRILMSGQGFSDADNQPPLYFRTTEEMLKEFEYLGEEKAKEVVIYNTNKIADMIDEI
KPIPEETFPPKIEGAEEDIRRMTIEKAHSIYGDPLPEIVEKRLTKELNSIINNGYAVLYLIAHKLVAKSYEDGYLVGSRG
SVGSSFAATMSDITEVNGLPPHYVCPKCKYSEFFTDGSVGSGADLEDKDCPNCGTKLIKDGHDIPFETFLGFEGDKEPDI
DLNFSGEYQGVVHKYTEVLFGKGHVFKAGTIGTIAEKTAYGFVKKYLDEKNIKANNAEIERLSKGCTGVKRTSGQHPGGI
MVVPADNEIFNFTPIQHPADDTESDVITTHFDYHSISGRLLKLDILGHDDPTVLRMLQDLTQVDPKSVPLGDPKVISLFT
SPEALGVTEEDIDCPVGTYGLPEFGTKFVRQMLLDTKPKSFAELVRISGLSHGTDVWLNNAQYFIKEGYTTIKDCISTRD
DIMVYLLQKELPPKSAFTIMEKVRKGKGLTEEQEALMKEHDVPDWYIESCKKIKYMFPKGHAVAYVMMAMRIAYFKVYYP
RAYYATYFSVRADDFDADLIVKGEEKIKEKMNELYAMGNNITQKDKGLLTILEICYEMYKRGIKFLKVDLYKSNATKFLI
EGDSIRPPLSALQGVGKNAAKSIEEARLEGEFISKEDLRLRTKATKTVIETLENHGCLRGMQETNQISLFSL

Specific function: Required for replicative DNA synthesis. This DNA polymerase also exhibits 3' to 5' exonuclease activity [H]

COG id: COG2176

COG function: function code L; DNA polymerase III, alpha subunit (gram-positive type)

Gene ontology:

Cell location: Cytoplasm [H]

Metaboloic importance: Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 exonuclease domain [H]

Homologues:

Organism=Escherichia coli, GI1786381, Length=921, Percent_Identity=22.1498371335505, Blast_Score=108, Evalue=2e-24,
Organism=Escherichia coli, GI1788149, Length=160, Percent_Identity=31.875, Blast_Score=69, Evalue=2e-12,
Organism=Escherichia coli, GI1786409, Length=169, Percent_Identity=28.9940828402367, Blast_Score=67, Evalue=6e-12,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR011708
- InterPro:   IPR006054
- InterPro:   IPR006055
- InterPro:   IPR013520
- InterPro:   IPR012340
- InterPro:   IPR016027
- InterPro:   IPR004365
- InterPro:   IPR004013
- InterPro:   IPR003141
- InterPro:   IPR006308
- InterPro:   IPR012337 [H]

Pfam domain/function: PF07733 DNA_pol3_alpha; PF00929 Exonuc_X-T; PF02811 PHP; PF01336 tRNA_anti [H]

EC number: =2.7.7.7 [H]

Molecular weight: Translated: 163035; Mature: 163035

Theoretical pI: Translated: 6.13; Mature: 6.13

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.5 %Cys     (Translated Protein)
2.6 %Met     (Translated Protein)
4.1 %Cys+Met (Translated Protein)
1.5 %Cys     (Mature Protein)
2.6 %Met     (Mature Protein)
4.1 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MENTSVGDLQKEINKERFDTDKEIKIVRVQYFRKKNKLRIILKSIGNFTKEKEDHIKNIL
CCCCCHHHHHHHHHHHHCCCCCCEEEEEEHHHHCCCHHHHHHHHHCCHHHHHHHHHHHHH
KKRFSMVEDFEIICYKDLSNITLEELSKKYWVDIVNLASSSVPIARDCLLKSKREVLEDS
HHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCC
INITYNNEFLCRFLSKNKFEGKLKSYIRDIFGIKCNIKLEYDKSFNEEDYFKTIETMEKS
CEEEECCHHHHHHHHCCCCCHHHHHHHHHHHCCEEEEEEEECCCCCHHHHHHHHHHHHHH
MIKNVLSEIKSKEKKVIRKENSPKTREEENKDTSVILGRSIKEDSIFMKDLNENSGIVVV
HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEECCCCCCCCHHHHHCCCCCCEEEE
CGDVFKKEVIETKTGRKIVTFFITDYTNSMTVKLFPRPRDADRIIEEIKEGIYCKVRGEV
ECHHHHHHHHHCCCCCEEEEEEEEECCCCEEEEEECCCCCHHHHHHHHHCCCEEEEECCC
VNDPYARECVIMAKDIVKTTKIEKMDIAEEKRVELHMHSQMSAMDGMTPVSELVKRAAKW
CCCCCHHHHHHHHHHHHHHHCCHHHCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHC
GHKAVAITDHGVVQAYPDAMSAAKKGNIKVIYGIEAYLVDDGVPIVSKAGDKTIEDTFVV
CCCEEEEECCCCEEECCHHHHHHCCCCEEEEEEEEEEEEECCCCHHHCCCCCCCCCEEEE
FDLETTGFSSKNDKIIEIGAVKIEKGKIVDRFSEFVNPERSIPDKITELTSITDDMVDDK
EEEEECCCCCCCCCEEEEEEEEEECCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCH
EAIDKILPRFIDFIGDSVVVAHNASFDVSFINKNCKDLKIEFENSVMDTVTLSRFLFPEL
HHHHHHHHHHHHHHCCCEEEEECCCEEEHHHCCCCCEEEEEECCHHHHHHHHHHHHHHHH
KRYKLNVIAKHLGISLENHHRAVDDAKATADILLKSFEMLREKEIITLHSLNKEFLGNIN
HHHHHHHHHHHHCCCCHHCCCHHHHHHHHHHHHHHHHHHHHHCCEEEHHHCCHHHHCCCC
IKKEPTHHLIILAKNAVGLKNLYKLVSKSNLEYFFKRPRMAKSLINEYKEGLIIGSACEA
CCCCCCCEEEEEECCCCCHHHHHHHHHCCCCHHHHHCCHHHHHHHHHHHCCEEEECCCCH
GQIYKEILKGKTKEELKNMIDFYDYLEIQPLGNNEFLVKNGSVKDEKELEEINKKICELG
HHHHHHHHCCCCHHHHHHHHHHHHCEEEEECCCCEEEEECCCCCCHHHHHHHHHHHHHHH
EFYNKPVVATCDVHFLDEEHEVFRRILMSGQGFSDADNQPPLYFRTTEEMLKEFEYLGEE
HHHCCCEEEEEEEEECCCHHHHHHHHHHCCCCCCCCCCCCCEEEECHHHHHHHHHHHCCC
KAKEVVIYNTNKIADMIDEIKPIPEETFPPKIEGAEEDIRRMTIEKAHSIYGDPLPEIVE
CCCEEEEECCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHH
KRLTKELNSIINNGYAVLYLIAHKLVAKSYEDGYLVGSRGSVGSSFAATMSDITEVNGLP
HHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCEEECCCCCCCCHHHHHHHHHHHHCCCC
PHYVCPKCKYSEFFTDGSVGSGADLEDKDCPNCGTKLIKDGHDIPFETFLGFEGDKEPDI
CCCCCCCCCCHHHCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCHHHHCCCCCCCCCCE
DLNFSGEYQGVVHKYTEVLFGKGHVFKAGTIGTIAEKTAYGFVKKYLDEKNIKANNAEIE
EECCCCCCHHHHHHHHHHHHCCCCEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHH
RLSKGCTGVKRTSGQHPGGIMVVPADNEIFNFTPIQHPADDTESDVITTHFDYHSISGRL
HHHHCCCCCCCCCCCCCCCEEEEECCCCEEECCCCCCCCCCCCCCEEEEEECCCCCCCEE
LKLDILGHDDPTVLRMLQDLTQVDPKSVPLGDPKVISLFTSPEALGVTEEDIDCPVGTYG
EEEEEECCCCHHHHHHHHHHHHCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCC
LPEFGTKFVRQMLLDTKPKSFAELVRISGLSHGTDVWLNNAQYFIKEGYTTIKDCISTRD
CHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCCCCEEECCCHHHHHCCCHHHHHHHCCHH
DIMVYLLQKELPPKSAFTIMEKVRKGKGLTEEQEALMKEHDVPDWYIESCKKIKYMFPKG
HHEEHHHHHCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHCCCCHHHHHHHHHHHHCCCCC
HAVAYVMMAMRIAYFKVYYPRAYYATYFSVRADDFDADLIVKGEEKIKEKMNELYAMGNN
HHHHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCEEEECHHHHHHHHHHHHHHCCC
ITQKDKGLLTILEICYEMYKRGIKFLKVDLYKSNATKFLIEGDSIRPPLSALQGVGKNAA
CCCHHCHHHHHHHHHHHHHHCCCEEEEEEEEECCCCEEEEECCCCCCCHHHHHHCCCHHH
KSIEEARLEGEFISKEDLRLRTKATKTVIETLENHGCLRGMQETNQISLFSL
HHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCCHHCCCCCCCCEEEEEC
>Mature Secondary Structure
MENTSVGDLQKEINKERFDTDKEIKIVRVQYFRKKNKLRIILKSIGNFTKEKEDHIKNIL
CCCCCHHHHHHHHHHHHCCCCCCEEEEEEHHHHCCCHHHHHHHHHCCHHHHHHHHHHHHH
KKRFSMVEDFEIICYKDLSNITLEELSKKYWVDIVNLASSSVPIARDCLLKSKREVLEDS
HHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCC
INITYNNEFLCRFLSKNKFEGKLKSYIRDIFGIKCNIKLEYDKSFNEEDYFKTIETMEKS
CEEEECCHHHHHHHHCCCCCHHHHHHHHHHHCCEEEEEEEECCCCCHHHHHHHHHHHHHH
MIKNVLSEIKSKEKKVIRKENSPKTREEENKDTSVILGRSIKEDSIFMKDLNENSGIVVV
HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEECCCCCCCCHHHHHCCCCCCEEEE
CGDVFKKEVIETKTGRKIVTFFITDYTNSMTVKLFPRPRDADRIIEEIKEGIYCKVRGEV
ECHHHHHHHHHCCCCCEEEEEEEEECCCCEEEEEECCCCCHHHHHHHHHCCCEEEEECCC
VNDPYARECVIMAKDIVKTTKIEKMDIAEEKRVELHMHSQMSAMDGMTPVSELVKRAAKW
CCCCCHHHHHHHHHHHHHHHCCHHHCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHC
GHKAVAITDHGVVQAYPDAMSAAKKGNIKVIYGIEAYLVDDGVPIVSKAGDKTIEDTFVV
CCCEEEEECCCCEEECCHHHHHHCCCCEEEEEEEEEEEEECCCCHHHCCCCCCCCCEEEE
FDLETTGFSSKNDKIIEIGAVKIEKGKIVDRFSEFVNPERSIPDKITELTSITDDMVDDK
EEEEECCCCCCCCCEEEEEEEEEECCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCH
EAIDKILPRFIDFIGDSVVVAHNASFDVSFINKNCKDLKIEFENSVMDTVTLSRFLFPEL
HHHHHHHHHHHHHHCCCEEEEECCCEEEHHHCCCCCEEEEEECCHHHHHHHHHHHHHHHH
KRYKLNVIAKHLGISLENHHRAVDDAKATADILLKSFEMLREKEIITLHSLNKEFLGNIN
HHHHHHHHHHHHCCCCHHCCCHHHHHHHHHHHHHHHHHHHHHCCEEEHHHCCHHHHCCCC
IKKEPTHHLIILAKNAVGLKNLYKLVSKSNLEYFFKRPRMAKSLINEYKEGLIIGSACEA
CCCCCCCEEEEEECCCCCHHHHHHHHHCCCCHHHHHCCHHHHHHHHHHHCCEEEECCCCH
GQIYKEILKGKTKEELKNMIDFYDYLEIQPLGNNEFLVKNGSVKDEKELEEINKKICELG
HHHHHHHHCCCCHHHHHHHHHHHHCEEEEECCCCEEEEECCCCCCHHHHHHHHHHHHHHH
EFYNKPVVATCDVHFLDEEHEVFRRILMSGQGFSDADNQPPLYFRTTEEMLKEFEYLGEE
HHHCCCEEEEEEEEECCCHHHHHHHHHHCCCCCCCCCCCCCEEEECHHHHHHHHHHHCCC
KAKEVVIYNTNKIADMIDEIKPIPEETFPPKIEGAEEDIRRMTIEKAHSIYGDPLPEIVE
CCCEEEEECCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHH
KRLTKELNSIINNGYAVLYLIAHKLVAKSYEDGYLVGSRGSVGSSFAATMSDITEVNGLP
HHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCEEECCCCCCCCHHHHHHHHHHHHCCCC
PHYVCPKCKYSEFFTDGSVGSGADLEDKDCPNCGTKLIKDGHDIPFETFLGFEGDKEPDI
CCCCCCCCCCHHHCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCHHHHCCCCCCCCCCE
DLNFSGEYQGVVHKYTEVLFGKGHVFKAGTIGTIAEKTAYGFVKKYLDEKNIKANNAEIE
EECCCCCCHHHHHHHHHHHHCCCCEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHH
RLSKGCTGVKRTSGQHPGGIMVVPADNEIFNFTPIQHPADDTESDVITTHFDYHSISGRL
HHHHCCCCCCCCCCCCCCCEEEEECCCCEEECCCCCCCCCCCCCCEEEEEECCCCCCCEE
LKLDILGHDDPTVLRMLQDLTQVDPKSVPLGDPKVISLFTSPEALGVTEEDIDCPVGTYG
EEEEEECCCCHHHHHHHHHHHHCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCC
LPEFGTKFVRQMLLDTKPKSFAELVRISGLSHGTDVWLNNAQYFIKEGYTTIKDCISTRD
CHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCCCCEEECCCHHHHHCCCHHHHHHHCCHH
DIMVYLLQKELPPKSAFTIMEKVRKGKGLTEEQEALMKEHDVPDWYIESCKKIKYMFPKG
HHEEHHHHHCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHCCCCHHHHHHHHHHHHCCCCC
HAVAYVMMAMRIAYFKVYYPRAYYATYFSVRADDFDADLIVKGEEKIKEKMNELYAMGNN
HHHHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCEEEECHHHHHHHHHHHHHHCCC
ITQKDKGLLTILEICYEMYKRGIKFLKVDLYKSNATKFLIEGDSIRPPLSALQGVGKNAA
CCCHHCHHHHHHHHHHHHHHCCCEEEEEEEEECCCCEEEEECCCCCCCHHHHHHCCCHHH
KSIEEARLEGEFISKEDLRLRTKATKTVIETLENHGCLRGMQETNQISLFSL
HHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCCHHCCCCCCCCEEEEEC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 12552129 [H]