| Definition | Streptococcus pneumoniae D39, complete genome. |
|---|---|
| Accession | NC_008533 |
| Length | 2,046,115 |
The map label for this gene is polC
Identifier: 116515846
GI number: 116515846
Start: 251075
End: 255466
Strand: Direct
Name: polC
Synonym: SPD_0254
Alternate gene names: 116515846
Gene position: 251075-255466 (Clockwise)
Preceding gene: 116516752
Following gene: 116516886
Centisome position: 12.27
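The coordinate fields above are mutually consistent, and a short calculation can confirm this. The sketch below assumes 1-based inclusive coordinates and assumes that the centisome position is the gene start expressed as a percentage of the genome length. The function names are illustrative and not part of the database.

```python
GENOME_LENGTH = 2_046_115  # Length field of NC_008533
GENE_START = 251_075       # Start field
GENE_END = 255_466         # End field


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the genome length, rounded to 2 decimals."""
    return round(100 * start / genome_length, 2)


length = gene_length(GENE_START, GENE_END)
print(length)                                # 4392 bases
print(length // 3 - 1)                       # 1463 codons, excluding the stop codon
print(centisome(GENE_START, GENOME_LENGTH))  # 12.27
```

The 4392 bases correspond to 1464 codons, that is 1463 amino acids plus one stop codon, which agrees with the translated protein length listed in this record.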
GC content: 44.47
Gene sequence:
>4392_bases ATGTCAAATAGTTTTGAAATTTTGATGAATCAATTGGGGATGCCTGCTGAAATGAGACAGGCTCCTGCTTTAGCACAGGC CAATATTGAGCGAGTTGTGGTTCATAAAATTAGTAAGGTATGGGAGTTTCATTTCGTATTTTCTAATATTTTACCGATTG AAATCTTTTTAGAATTAAAGAAAGGTTTGAGCGAAGAATTTTCTAAGACAGGCAATAAAGCTGTTTTTGAAATTAAGGCT CGGTCTCAAGAATTTTCAAATCAGCTCTTGCAGTCCTACTATAGGGAGGCTTTCTCTGAAGGTCCATGTGCTAGTCAAGG TTTTAAGTCCCTTTATCAAAATTTGCAAGTTCGTGCTGAGGGTAATCAGCTATTTATTGAAGGATCTGAAGCGATTGATA AGGAACATTTTAAGAAGAATCATCTTCCTAATTTAGCCAAACAACTTGAAAAGTTTGGTTTTCCAACTTTTAACTGTCAA GTCGAGAAGAATGATGTCCTGACCCAAGAGCAGGAAGAGGCCTTTCATGCTGAAAATGAGCAGATTGTTCAAGCTGCCAA TGAGGAAGCGCTCCGTGCTATGGAACAACTGGAGCAGATGGCACCTCCTCCAGCGGAAGAGAAACCAGTCTTTGATTTTC AAGCGAAAAAAGCTGCAGCTAAACCCAAGCTGGATAAGGCGGAGATTACTCCTATGATCGAAGTGACGACAGAGGAAAAT CGTCTGGTATTTGAAGGGGTTGTTTTTGATGTGGAGCAAAAAGTGACTAGAACAGGTCGTGTTTTAATCAACTTTAAAAT GACGGACTATACTTCAAGTTTTTCTATGCAAAAGTGGGTTAAAAACGAGGAAGAGGCCCAGAAGTTTGACCTCATCAAGA AGAATTCTTGGCTCCGAGTTCGAGGGAATGTGGAGATGAATAACTTCACACGCGATTTGACTATGAACGTACAGGATCTG CAGGAAGTTGTTCACTATGAGCGGAAGGATTTGATGCCAGAAGGTGAGCGTCGGGTTGAGTTTCATGCTCATACTAACAT GTCGACTATGGATGCTTTGCCAGAGGTCGAAGAGATTGTTGCAACAGCTGCTAAGTGGGGACACAAGGCGGTTGCTATCA CAGACCATGGGAATGTCCAGTCCTTTCCACATGGCTATAAGGCGGCTAAGAAAGCGGGAATCCAGCTGATCTATGGGATA GAAGCCAATATCGTGGAGGACCGTGTCCCTATCGTCTATAACGAAGTGGAGATGGACTTATCAGAAGCAACCTACGTGGT CTTTGACGTGGAAACGACGGGACTTTCAGCTATCTATAATGACTTGATTCAGGTTGCGGCCTCTAAGATGTACAAGGGGA ATGTTATTGCTGAATTTGATGAATTTATCAATCCTGGGCATCCCTTGTCAGCTTTTACTACAGAGTTAACTGGAATTACA GATGATCATGTCAAAAATGCCAAACCACTAGAACAAGTTTTGCAAGAATTCCAAGAATTTTGCAAGGATACGGTCCTAGT TGCCCACAATGCTACCTTTGACGTTGGCTTTATGAATGCTAATTATGAGCGTCATGATCTTCCAAAGATTAGTCAGCCAG TTATTGATACGCTGGAGTTTGCTAGAAACCTCTATCCTGAGTATAAACGCCATGGTTTGGGGCCTTTGACCAAGCGTTTT GGTGTGGCCTTGGAACATCACCACATGGCCAACTACGATGCGGAAGCGACTGGTCGTCTGCTTTTCATCTTTATCAAAGA GGTAGCAGAAAAACATGGTGTGACCGATTTAGCTAGACTCAACATTGATCTAATCAGTCCAGATTCTTACAAAAAAGCTC GGATCAAGCATGCGACCATCTATGTCAAGAATCAGGTAGGTCTAAAAAATATCTTTAAGCTGGTTTCCTTGTCTAATACC 
AAGTATTTTGAAGGAGTGTCACGGATTCCGAGAACGGTTCTAGATGCCCATCGAGAGGGCTTGATTTTAGGTTCAGCCTG TTCAGAGGGTGAAGTTTTTGACGTGGTCGTTTCTCAAGGTGTGGATGCGGCGGTTGAGGTGGCCAAGTATTATGACTTTA TCGAGGTCATGCCACCGGCTATCTATGCGCCCTTGATTGCTAAAGAGCAGGTCAAGGATATGGAGGAACTCCAGACCATT ATCAAGAGTTTGATAGAGGTTGGAGACCGCCTTGGCAAGCCTGTTTTGGCTACGGGAAATGTTCACTATATCGAACCGGA AGAAGAGATTTATCGTGAAATTATCGTCCGTAGTTTGGGACAGGGTGCGATGATTAACCGAACTATCGGTCATGGTGAAC ATGCCCAACCAGCACCACTTCCAAAGGCTCATTTTCGAACGACTAATGAGATGTTGGATGAATTTGCCTTTTTGGGAGAG GAACTGGCTCGTAAACTGGTTATTGAAAACACCAATGCCTTGGCAGAAATATTTGAACCCGTTGAAGTCGTTAAGGGTGA CTTGTATACGCCTTTCATCGACAAGGCTGAAGAAACAGTTGCTGAGTTGACCTATAAGAAAGCTTTTGAGATTTATGGAA ATCCGCTGCCAGATATTGTTGATTTGCGGATTGAAAAAGAATTAACATCCATACTGGGGAATGGATTTGCTGTGATTTAT CTGGCATCGCAGATGCTGGTGCAACGTTCTAATGAACGGGGTTATTTGGTTGGTTCTCGTGGGTCTGTCGGATCTAGTTT CGTTGCGACCATGATTGGGATTACGGAGGTCAATCCTCTCTCTCCTCACTATGTCTGTGGTCAGTGTCAGTACAGTGAGT TTATCACAGATGGTTCGTACGGTTCAGGATTTGATATGCCCCATAAGGACTGTCCAAACTGTGGTCACAAACTCAGTAAA AACGGACAGGATATTCCGTTTGAGACCTTCCTTGGTTTTGATGGGGATAAGGTTCCTGATATTGACTTGAACTTCTCGGG AGAAGATCAGCCTAGCGCCCACTTGGATGTGCGTGATATCTTTGGTGAAGAATATGCCTTCCGTGCGGGAACAGTTGGTA CGGTAGCTGCCAAGACTGCCTATGGATTTGTCAAGGGTTACGAGCGAGATTATGGCAAGTTTTATCGTGATGCAGAAGTA GAACGCCTCGCTCAAGGAGCGGCGGGTGTCAAGCGGACAACAGGCCAACACCCGGGGGGAATCGTTGTTATTCCGAACTA CATGGATGTCTACGATTTTACGCCTGTCCAGTATCCAGCAGATGATGTCACGGCTGAATGGCAGACCACTCACTTTAACT TCCACGATATCGATGAGAACGTCCTCAAACTCGATGTACTGGGACATGATGATCCGACCATGATTCGAAAACTTCAGGAT TTGTCTGGTATTGACCCTAATAAAATTCCTATGGATGACGAAGGCGTGATGGCACTCTTTTCTGGGACTGATGTGCTAGG GGTAACACCTGAACAAATTGGAACGCCTACGGGTATGTTAGGGATTCCAGAGTTTGGAACAAATTTCGTACGTGGAATGG TAGACGAAACCCATCCGACAACCTTTGCGGAATTGCTTCAGCTGTCTGGTCTGTCCCACGGTACTGACGTTTGGTTGGGG AATGCTCAGGATCTGATTAAGCAAGGAATAGCGGACCTATCGACTGTTATCGGTTGTCGGGACGACATCATGGTTTACCT CATGCATGCGGGTCTGGAACCTAAGATGGCCTTTACCATTATGGAACGGGTACGTAAGGGTTTGTGGCTAAAGATTTCAG AAGAGGAGAGAAATGGCTATATCGAAGCCATGAAGGCTAATAAGGTGCCAGAGTGGTATATCGAATCCTGTGGGAAAATT 
AAGTACATGTTCCCTAAGGCCCATGCGGCAGCCTACGTTATGATGGCCTTGCGTGTAGCTTACTTCAAGGTTCACCATCC TATTTATTACTACTGTGCTTACTTCTCCATTCGTGCTAAGGCTTTTGATATCAAGACCATGGGTGCGGGCTTGGAGGCCA TCAAGCGCAGAATGGAAGAAATCTCTGAAAAACGGAAGAACAATGAAGCCTCTAATGTGGAAATCGATCTCTATACAACT CTTGAGATTGTCAATGAGATGTGGGAACGAGGTTTCAAGTTTGGCAAATTAGATCTCTACCGTAGTCAGGCGACAGAGTT CCTCATCGACGGGGATACCCTTATCCCACCATTTGTAGCAATGGATGGTCTGGGAGAGAACGTTGCCAAGCAACTGGTGC GGGCGCGTGAAGAGGGAGAATTCCTCTCTAAAACAGAACTACGCAAGCGTGGTGGACTCTCATCAACCTTGGTTGAAAAG ATGGATGAGATGGGTATTCTTGGAAATATGCCAGAGGATAACCAGTTGAGTTTGTTTGATGAGTTGTTTTAA
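The GC content field can be recomputed from the gene sequence. The sketch below assumes the value is the percentage of G and C bases over the gene sequence alone (the record does not state the method), and it strips the whitespace that the page inserts into the sequence. `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return round(100 * gc / len(bases), 2)


# Toy input: the first 21 bases of the gene, with a space as in the page layout.
# Paste the full gene sequence to compare against the GC content field.
print(gc_content("ATGTCAAATAGT TTTGAAATT"))  # 19.05
```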
Upstream 100 bases:
>100_bases GTCCTCACTCTAGAAGGAAGTCACTTAGTGGCTTCCTTTTGTCTTTAGAAAATACCTCTAAATATGGTAAAATAGTAGAA GAATAATGTGAGGAAAATGA
Downstream 100 bases:
>100_bases AAAATTGCTTAATAATCTATTAGAAGAGGCTAACGTATATCCAATAGATTTACATTAGCTTTCTTTTTTGTTAAAATAGT CTATGGAAAGAGGGTGAGAG
Product: DNA polymerase III PolC
Products: NA
Alternate protein names: PolIII
Number of amino acids: Translated: 1463; Mature: 1462
Protein sequence:
>1463_residues MSNSFEILMNQLGMPAEMRQAPALAQANIERVVVHKISKVWEFHFVFSNILPIEIFLELKKGLSEEFSKTGNKAVFEIKA RSQEFSNQLLQSYYREAFSEGPCASQGFKSLYQNLQVRAEGNQLFIEGSEAIDKEHFKKNHLPNLAKQLEKFGFPTFNCQ VEKNDVLTQEQEEAFHAENEQIVQAANEEALRAMEQLEQMAPPPAEEKPVFDFQAKKAAAKPKLDKAEITPMIEVTTEEN RLVFEGVVFDVEQKVTRTGRVLINFKMTDYTSSFSMQKWVKNEEEAQKFDLIKKNSWLRVRGNVEMNNFTRDLTMNVQDL QEVVHYERKDLMPEGERRVEFHAHTNMSTMDALPEVEEIVATAAKWGHKAVAITDHGNVQSFPHGYKAAKKAGIQLIYGI EANIVEDRVPIVYNEVEMDLSEATYVVFDVETTGLSAIYNDLIQVAASKMYKGNVIAEFDEFINPGHPLSAFTTELTGIT DDHVKNAKPLEQVLQEFQEFCKDTVLVAHNATFDVGFMNANYERHDLPKISQPVIDTLEFARNLYPEYKRHGLGPLTKRF GVALEHHHMANYDAEATGRLLFIFIKEVAEKHGVTDLARLNIDLISPDSYKKARIKHATIYVKNQVGLKNIFKLVSLSNT KYFEGVSRIPRTVLDAHREGLILGSACSEGEVFDVVVSQGVDAAVEVAKYYDFIEVMPPAIYAPLIAKEQVKDMEELQTI IKSLIEVGDRLGKPVLATGNVHYIEPEEEIYREIIVRSLGQGAMINRTIGHGEHAQPAPLPKAHFRTTNEMLDEFAFLGE ELARKLVIENTNALAEIFEPVEVVKGDLYTPFIDKAEETVAELTYKKAFEIYGNPLPDIVDLRIEKELTSILGNGFAVIY LASQMLVQRSNERGYLVGSRGSVGSSFVATMIGITEVNPLSPHYVCGQCQYSEFITDGSYGSGFDMPHKDCPNCGHKLSK NGQDIPFETFLGFDGDKVPDIDLNFSGEDQPSAHLDVRDIFGEEYAFRAGTVGTVAAKTAYGFVKGYERDYGKFYRDAEV ERLAQGAAGVKRTTGQHPGGIVVIPNYMDVYDFTPVQYPADDVTAEWQTTHFNFHDIDENVLKLDVLGHDDPTMIRKLQD LSGIDPNKIPMDDEGVMALFSGTDVLGVTPEQIGTPTGMLGIPEFGTNFVRGMVDETHPTTFAELLQLSGLSHGTDVWLG NAQDLIKQGIADLSTVIGCRDDIMVYLMHAGLEPKMAFTIMERVRKGLWLKISEEERNGYIEAMKANKVPEWYIESCGKI KYMFPKAHAAAYVMMALRVAYFKVHHPIYYYCAYFSIRAKAFDIKTMGAGLEAIKRRMEEISEKRKNNEASNVEIDLYTT LEIVNEMWERGFKFGKLDLYRSQATEFLIDGDTLIPPFVAMDGLGENVAKQLVRAREEGEFLSKTELRKRGGLSSTLVEK MDEMGILGNMPEDNQLSLFDELF
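The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is a minimal translator assuming the standard codon assignments (which the bacterial code, translation table 11, shares) and an ATG start. It does not handle alternative start codons or ambiguous bases, and the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]  # KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTCAAATAGTTTTGAAATT"))  # MSNSFEI (start of the gene)
print(translate("GATGAGTTGTTTTAA"))        # DELF (end of the gene, TAA stop)
```

Both fragments are taken from the gene sequence in this record and reproduce the first and last residues of the listed protein.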
Sequences:
>Translated_1463_residues MSNSFEILMNQLGMPAEMRQAPALAQANIERVVVHKISKVWEFHFVFSNILPIEIFLELKKGLSEEFSKTGNKAVFEIKA RSQEFSNQLLQSYYREAFSEGPCASQGFKSLYQNLQVRAEGNQLFIEGSEAIDKEHFKKNHLPNLAKQLEKFGFPTFNCQ VEKNDVLTQEQEEAFHAENEQIVQAANEEALRAMEQLEQMAPPPAEEKPVFDFQAKKAAAKPKLDKAEITPMIEVTTEEN RLVFEGVVFDVEQKVTRTGRVLINFKMTDYTSSFSMQKWVKNEEEAQKFDLIKKNSWLRVRGNVEMNNFTRDLTMNVQDL QEVVHYERKDLMPEGERRVEFHAHTNMSTMDALPEVEEIVATAAKWGHKAVAITDHGNVQSFPHGYKAAKKAGIQLIYGI EANIVEDRVPIVYNEVEMDLSEATYVVFDVETTGLSAIYNDLIQVAASKMYKGNVIAEFDEFINPGHPLSAFTTELTGIT DDHVKNAKPLEQVLQEFQEFCKDTVLVAHNATFDVGFMNANYERHDLPKISQPVIDTLEFARNLYPEYKRHGLGPLTKRF GVALEHHHMANYDAEATGRLLFIFIKEVAEKHGVTDLARLNIDLISPDSYKKARIKHATIYVKNQVGLKNIFKLVSLSNT KYFEGVSRIPRTVLDAHREGLILGSACSEGEVFDVVVSQGVDAAVEVAKYYDFIEVMPPAIYAPLIAKEQVKDMEELQTI IKSLIEVGDRLGKPVLATGNVHYIEPEEEIYREIIVRSLGQGAMINRTIGHGEHAQPAPLPKAHFRTTNEMLDEFAFLGE ELARKLVIENTNALAEIFEPVEVVKGDLYTPFIDKAEETVAELTYKKAFEIYGNPLPDIVDLRIEKELTSILGNGFAVIY LASQMLVQRSNERGYLVGSRGSVGSSFVATMIGITEVNPLSPHYVCGQCQYSEFITDGSYGSGFDMPHKDCPNCGHKLSK NGQDIPFETFLGFDGDKVPDIDLNFSGEDQPSAHLDVRDIFGEEYAFRAGTVGTVAAKTAYGFVKGYERDYGKFYRDAEV ERLAQGAAGVKRTTGQHPGGIVVIPNYMDVYDFTPVQYPADDVTAEWQTTHFNFHDIDENVLKLDVLGHDDPTMIRKLQD LSGIDPNKIPMDDEGVMALFSGTDVLGVTPEQIGTPTGMLGIPEFGTNFVRGMVDETHPTTFAELLQLSGLSHGTDVWLG NAQDLIKQGIADLSTVIGCRDDIMVYLMHAGLEPKMAFTIMERVRKGLWLKISEEERNGYIEAMKANKVPEWYIESCGKI KYMFPKAHAAAYVMMALRVAYFKVHHPIYYYCAYFSIRAKAFDIKTMGAGLEAIKRRMEEISEKRKNNEASNVEIDLYTT LEIVNEMWERGFKFGKLDLYRSQATEFLIDGDTLIPPFVAMDGLGENVAKQLVRAREEGEFLSKTELRKRGGLSSTLVEK MDEMGILGNMPEDNQLSLFDELF >Mature_1462_residues SNSFEILMNQLGMPAEMRQAPALAQANIERVVVHKISKVWEFHFVFSNILPIEIFLELKKGLSEEFSKTGNKAVFEIKAR SQEFSNQLLQSYYREAFSEGPCASQGFKSLYQNLQVRAEGNQLFIEGSEAIDKEHFKKNHLPNLAKQLEKFGFPTFNCQV EKNDVLTQEQEEAFHAENEQIVQAANEEALRAMEQLEQMAPPPAEEKPVFDFQAKKAAAKPKLDKAEITPMIEVTTEENR LVFEGVVFDVEQKVTRTGRVLINFKMTDYTSSFSMQKWVKNEEEAQKFDLIKKNSWLRVRGNVEMNNFTRDLTMNVQDLQ EVVHYERKDLMPEGERRVEFHAHTNMSTMDALPEVEEIVATAAKWGHKAVAITDHGNVQSFPHGYKAAKKAGIQLIYGIE 
ANIVEDRVPIVYNEVEMDLSEATYVVFDVETTGLSAIYNDLIQVAASKMYKGNVIAEFDEFINPGHPLSAFTTELTGITD DHVKNAKPLEQVLQEFQEFCKDTVLVAHNATFDVGFMNANYERHDLPKISQPVIDTLEFARNLYPEYKRHGLGPLTKRFG VALEHHHMANYDAEATGRLLFIFIKEVAEKHGVTDLARLNIDLISPDSYKKARIKHATIYVKNQVGLKNIFKLVSLSNTK YFEGVSRIPRTVLDAHREGLILGSACSEGEVFDVVVSQGVDAAVEVAKYYDFIEVMPPAIYAPLIAKEQVKDMEELQTII KSLIEVGDRLGKPVLATGNVHYIEPEEEIYREIIVRSLGQGAMINRTIGHGEHAQPAPLPKAHFRTTNEMLDEFAFLGEE LARKLVIENTNALAEIFEPVEVVKGDLYTPFIDKAEETVAELTYKKAFEIYGNPLPDIVDLRIEKELTSILGNGFAVIYL ASQMLVQRSNERGYLVGSRGSVGSSFVATMIGITEVNPLSPHYVCGQCQYSEFITDGSYGSGFDMPHKDCPNCGHKLSKN GQDIPFETFLGFDGDKVPDIDLNFSGEDQPSAHLDVRDIFGEEYAFRAGTVGTVAAKTAYGFVKGYERDYGKFYRDAEVE RLAQGAAGVKRTTGQHPGGIVVIPNYMDVYDFTPVQYPADDVTAEWQTTHFNFHDIDENVLKLDVLGHDDPTMIRKLQDL SGIDPNKIPMDDEGVMALFSGTDVLGVTPEQIGTPTGMLGIPEFGTNFVRGMVDETHPTTFAELLQLSGLSHGTDVWLGN AQDLIKQGIADLSTVIGCRDDIMVYLMHAGLEPKMAFTIMERVRKGLWLKISEEERNGYIEAMKANKVPEWYIESCGKIK YMFPKAHAAAYVMMALRVAYFKVHHPIYYYCAYFSIRAKAFDIKTMGAGLEAIKRRMEEISEKRKNNEASNVEIDLYTTL EIVNEMWERGFKFGKLDLYRSQATEFLIDGDTLIPPFVAMDGLGENVAKQLVRAREEGEFLSKTELRKRGGLSSTLVEKM DEMGILGNMPEDNQLSLFDELF
Specific function: Required for replicative DNA synthesis. This DNA polymerase also exhibits 3' to 5' exonuclease activity.
COG id: COG2176
COG function: function code L; DNA polymerase III, alpha subunit (gram-positive type)
Gene ontology:
Cell location: Cytoplasm
Metabolic importance: Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 exonuclease domain
Homologues:
Organism=Escherichia coli, GI1786381, Length=928, Percent_Identity=22.4137931034483, Blast_Score=104, Evalue=4e-23,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): DPO3_STRP2 (Q04MH6)
Other databases:
- EMBL: CP000410
- RefSeq: YP_815772.1
- ProteinModelPortal: Q04MH6
- STRING: Q04MH6
- EnsemblBacteria: EBSTRT00000019802
- GeneID: 4441537
- GenomeReviews: CP000410_GR
- KEGG: spd:SPD_0254
- eggNOG: COG2176
- GeneTree: EBGT00050000027831
- HOGENOM: HBG617116
- OMA: WKTTHFD
- ProtClustDB: PRK00448
- GO: GO:0005737
- HAMAP: MF_00356
- InterPro: IPR011708
- InterPro: IPR006054
- InterPro: IPR006055
- InterPro: IPR013520
- InterPro: IPR012340
- InterPro: IPR016027
- InterPro: IPR004013
- InterPro: IPR003141
- InterPro: IPR016195
- InterPro: IPR006308
- InterPro: IPR012337
- Gene3D: G3DSA:2.40.50.140
- SMART: SM00479
- SMART: SM00481
- TIGRFAMs: TIGR00573
- TIGRFAMs: TIGR01405
Pfam domain/function: PF07733 DNA_pol3_alpha; PF00929 Exonuc_X-T; PF02811 PHP; SSF50249 Nucleic_acid_OB; SSF89550 PHP-like; SSF53098 RNaseH_fold
EC number: 2.7.7.7
Molecular weight: Translated: 164777; Mature: 164646
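The translated and mature weights differ by 131, which matches the mass of one methionine residue (about 131.19 Da) and is consistent with the mature form lacking the initiator Met. The sketch below assumes average residue masses plus one water per chain, a common convention that is not necessarily the method this database used. `molecular_weight` and `mature_form` are hypothetical helper names.

```python
# Average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water is added back for the free N- and C-termini


def molecular_weight(protein: str) -> float:
    """Average molecular weight of an unmodified peptide chain."""
    seq = "".join(protein.split()).upper()
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER


def mature_form(protein: str) -> str:
    """Drop the initiator methionine, assuming that is the only processing step."""
    return protein[1:] if protein.startswith("M") else protein


translated = "MSNSFEILMNQLG"  # first residues of the listed protein
delta = molecular_weight(translated) - molecular_weight(mature_form(translated))
print(round(delta, 2))  # 131.19
```

Running `molecular_weight` on the full translated and mature sequences from this record gives values to compare against the Molecular weight field.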
Theoretical pI: Translated: 4.87; Mature: 4.87
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
- Translated protein: 0.8 % Cys, 3.1 % Met, 3.9 % Cys+Met
- Mature protein: 0.8 % Cys, 3.1 % Met, 3.8 % Cys+Met
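These percentages are residue counts divided by the protein length. The combined mature value (3.8) is lower than the sum of the two rounded parts (0.8 + 3.1), which suggests the combined figure is computed from raw counts and rounded afterwards. That is an inference, not something the record states. A minimal sketch under that assumption, with `residue_percent` as a hypothetical helper:

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the chain made up of the given residue letters."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    count = sum(seq.count(r) for r in residues.upper())
    return round(100 * count / len(seq), 1)


fragment = "MSNSFEILMNQLGMPAEMRQ"  # first 20 residues of the listed protein
print(residue_percent(fragment, "C"))   # 0.0
print(residue_percent(fragment, "M"))   # 20.0
print(residue_percent(fragment, "CM"))  # 20.0
```

The fragment is only a toy input; the values in this record come from the full 1463- and 1462-residue sequences.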
Secondary structure:
>Translated Secondary Structure MSNSFEILMNQLGMPAEMRQAPALAQANIERVVVHKISKVWEFHFVFSNILPIEIFLELK CCCHHHHHHHHCCCCHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHH KGLSEEFSKTGNKAVFEIKARSQEFSNQLLQSYYREAFSEGPCASQGFKSLYQNLQVRAE HHHHHHHHCCCCCEEEEEEHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCEEEEC GNQLFIEGSEAIDKEHFKKNHLPNLAKQLEKFGFPTFNCQVEKNDVLTQEQEEAFHAENE CCEEEEECHHHHHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHCCCH QIVQAANEEALRAMEQLEQMAPPPAEEKPVFDFQAKKAAAKPKLDKAEITPMIEVTTEEN HHHHHCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHCCCCCCHHHCCEEEEEECCCC RLVFEGVVFDVEQKVTRTGRVLINFKMTDYTSSFSMQKWVKNEEEAQKFDLIKKNSWLRV EEEEEHEEEHHHHHHHCCCEEEEEEEEECCCCCHHHHHHHCCHHHHHHHHHHCCCCEEEE RGNVEMNNFTRDLTMNVQDLQEVVHYERKDLMPEGERRVEFHAHTNMSTMDALPEVEEIV ECCEECCCCCHHHCCCHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCHHHHHCHHHHHHH ATAAKWGHKAVAITDHGNVQSFPHGYKAAKKAGIQLIYGIEANIVEDRVPIVYNEVEMDL HHHHHCCCEEEEEECCCCCCCCCCHHHHHHHCCCEEEEECCCCEECCCCCEEEEHHHCCC SEATYVVFDVETTGLSAIYNDLIQVAASKMYKGNVIAEFDEFINPGHPLSAFTTELTGIT CCCEEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCHHHHHHHHCCCC DDHVKNAKPLEQVLQEFQEFCKDTVLVAHNATFDVGFMNANYERHDLPKISQPVIDTLEF HHHHCCCHHHHHHHHHHHHHHHCCEEEEECCEEEEEEECCCCCCCCCCCCHHHHHHHHHH ARNLYPEYKRHGLGPLTKRFGVALEHHHMANYDAEATGRLLFIFIKEVAEKHGVTDLARL HHHHCHHHHHCCCCHHHHHHCCHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCEEHHE NIDLISPDSYKKARIKHATIYVKNQVGLKNIFKLVSLSNTKYFEGVSRIPRTVLDAHREG EEEEECCCCCHHHHEEEEEEEEECCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCC LILGSACSEGEVFDVVVSQGVDAAVEVAKYYDFIEVMPPAIYAPLIAKEQVKDMEELQTI EEEECCCCCCCEEHHHHHCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHH IKSLIEVGDRLGKPVLATGNVHYIEPEEEIYREIIVRSLGQGAMINRTIGHGEHAQPAPL HHHHHHHHHHHCCCEEECCCEEEECCHHHHHHHHHHHHCCCCCEEECCCCCCCCCCCCCC PKAHFRTTNEMLDEFAFLGEELARKLVIENTNALAEIFEPVEVVKGDLYTPFIDKAEETV CHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHH AELTYKKAFEIYGNPLPDIVDLRIEKELTSILGNGFAVIYLASQMLVQRSNERGYLVGSR HHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCEEEEHHHHHHHHHCCCCCCEEEECC GSVGSSFVATMIGITEVNPLSPHYVCGQCQYSEFITDGSYGSGFDMPHKDCPNCGHKLSK CCCCHHHHHHHHHHHCCCCCCCCEEECCCCHHHHHCCCCCCCCCCCCCCCCCCHHHHHCC 
NGQDIPFETFLGFDGDKVPDIDLNFSGEDQPSAHLDVRDIFGEEYAFRAGTVGTVAAKTA CCCCCCHHHHCCCCCCCCCCEECCCCCCCCCCCCCCHHHHCCCHHEECCCCCHHHHHHHH YGFVKGYERDYGKFYRDAEVERLAQGAAGVKRTTGQHPGGIVVIPNYMDVYDFTPVQYPA HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEECCCCCCCCCCCCCCCC DDVTAEWQTTHFNFHDIDENVLKLDVLGHDDPTMIRKLQDLSGIDPNKIPMDDEGVMALF CCCCCCEEEEECCCCCCCCCEEEEEEECCCCHHHHHHHHHHCCCCCCCCCCCCCCCEEEE SGTDVLGVTPEQIGTPTGMLGIPEFGTNFVRGMVDETHPTTFAELLQLSGLSHGTDVWLG ECCCEECCCHHHHCCCCCCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHCCCCCCCCEEEC NAQDLIKQGIADLSTVIGCRDDIMVYLMHAGLEPKMAFTIMERVRKGLWLKISEEERNGY CHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCHHHHHHHHHHHCCCEEEEECCCCCCE IEAMKANKVPEWYIESCGKIKYMFPKAHAAAYVMMALRVAYFKVHHPIYYYCAYFSIRAK EEEHHCCCCCHHHHHCCCCEEEECCHHHHHHHHHHHHHHHHHHHCCCCEEEHHHHHHHHE AFDIKTMGAGLEAIKRRMEEISEKRKNNEASNVEIDLYTTLEIVNEMWERGFKFGKLDLY EEEHHHHCCCHHHHHHHHHHHHHHHCCCCCCCEEEEEHHHHHHHHHHHHCCCCCCCHHHH RSQATEFLIDGDTLIPPFVAMDGLGENVAKQLVRAREEGEFLSKTELRKRGGLSSTLVEK HHHCCEEEECCCCCCCCHHHHCCCCHHHHHHHHHHHHCCCHHHHHHHHHHCCCHHHHHHH MDEMGILGNMPEDNQLSLFDELF HHHCCCCCCCCCCCCHHHHHHCC >Mature Secondary Structure SNSFEILMNQLGMPAEMRQAPALAQANIERVVVHKISKVWEFHFVFSNILPIEIFLELK CCHHHHHHHHCCCCHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHH KGLSEEFSKTGNKAVFEIKARSQEFSNQLLQSYYREAFSEGPCASQGFKSLYQNLQVRAE HHHHHHHHCCCCCEEEEEEHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCEEEEC GNQLFIEGSEAIDKEHFKKNHLPNLAKQLEKFGFPTFNCQVEKNDVLTQEQEEAFHAENE CCEEEEECHHHHHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHCCCH QIVQAANEEALRAMEQLEQMAPPPAEEKPVFDFQAKKAAAKPKLDKAEITPMIEVTTEEN HHHHHCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHCCCCCCHHHCCEEEEEECCCC RLVFEGVVFDVEQKVTRTGRVLINFKMTDYTSSFSMQKWVKNEEEAQKFDLIKKNSWLRV EEEEEHEEEHHHHHHHCCCEEEEEEEEECCCCCHHHHHHHCCHHHHHHHHHHCCCCEEEE RGNVEMNNFTRDLTMNVQDLQEVVHYERKDLMPEGERRVEFHAHTNMSTMDALPEVEEIV ECCEECCCCCHHHCCCHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCHHHHHCHHHHHHH ATAAKWGHKAVAITDHGNVQSFPHGYKAAKKAGIQLIYGIEANIVEDRVPIVYNEVEMDL HHHHHCCCEEEEEECCCCCCCCCCHHHHHHHCCCEEEEECCCCEECCCCCEEEEHHHCCC SEATYVVFDVETTGLSAIYNDLIQVAASKMYKGNVIAEFDEFINPGHPLSAFTTELTGIT 
CCCEEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCHHHHHHHHCCCC DDHVKNAKPLEQVLQEFQEFCKDTVLVAHNATFDVGFMNANYERHDLPKISQPVIDTLEF HHHHCCCHHHHHHHHHHHHHHHCCEEEEECCEEEEEEECCCCCCCCCCCCHHHHHHHHHH ARNLYPEYKRHGLGPLTKRFGVALEHHHMANYDAEATGRLLFIFIKEVAEKHGVTDLARL HHHHCHHHHHCCCCHHHHHHCCHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCEEHHE NIDLISPDSYKKARIKHATIYVKNQVGLKNIFKLVSLSNTKYFEGVSRIPRTVLDAHREG EEEEECCCCCHHHHEEEEEEEEECCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCC LILGSACSEGEVFDVVVSQGVDAAVEVAKYYDFIEVMPPAIYAPLIAKEQVKDMEELQTI EEEECCCCCCCEEHHHHHCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHH IKSLIEVGDRLGKPVLATGNVHYIEPEEEIYREIIVRSLGQGAMINRTIGHGEHAQPAPL HHHHHHHHHHHCCCEEECCCEEEECCHHHHHHHHHHHHCCCCCEEECCCCCCCCCCCCCC PKAHFRTTNEMLDEFAFLGEELARKLVIENTNALAEIFEPVEVVKGDLYTPFIDKAEETV CHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHH AELTYKKAFEIYGNPLPDIVDLRIEKELTSILGNGFAVIYLASQMLVQRSNERGYLVGSR HHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCEEEEHHHHHHHHHCCCCCCEEEECC GSVGSSFVATMIGITEVNPLSPHYVCGQCQYSEFITDGSYGSGFDMPHKDCPNCGHKLSK CCCCHHHHHHHHHHHCCCCCCCCEEECCCCHHHHHCCCCCCCCCCCCCCCCCCHHHHHCC NGQDIPFETFLGFDGDKVPDIDLNFSGEDQPSAHLDVRDIFGEEYAFRAGTVGTVAAKTA CCCCCCHHHHCCCCCCCCCCEECCCCCCCCCCCCCCHHHHCCCHHEECCCCCHHHHHHHH YGFVKGYERDYGKFYRDAEVERLAQGAAGVKRTTGQHPGGIVVIPNYMDVYDFTPVQYPA HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEECCCCCCCCCCCCCCCC DDVTAEWQTTHFNFHDIDENVLKLDVLGHDDPTMIRKLQDLSGIDPNKIPMDDEGVMALF CCCCCCEEEEECCCCCCCCCEEEEEEECCCCHHHHHHHHHHCCCCCCCCCCCCCCCEEEE SGTDVLGVTPEQIGTPTGMLGIPEFGTNFVRGMVDETHPTTFAELLQLSGLSHGTDVWLG ECCCEECCCHHHHCCCCCCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHCCCCCCCCEEEC NAQDLIKQGIADLSTVIGCRDDIMVYLMHAGLEPKMAFTIMERVRKGLWLKISEEERNGY CHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCHHHHHHHHHHHCCCEEEEECCCCCCE IEAMKANKVPEWYIESCGKIKYMFPKAHAAAYVMMALRVAYFKVHHPIYYYCAYFSIRAK EEEHHCCCCCHHHHHCCCCEEEECCHHHHHHHHHHHHHHHHHHHCCCCEEEHHHHHHHHE AFDIKTMGAGLEAIKRRMEEISEKRKNNEASNVEIDLYTTLEIVNEMWERGFKFGKLDLY EEEHHHHCCCHHHHHHHHHHHHHHHCCCCCCCEEEEEHHHHHHHHHHHHCCCCCCCHHHH RSQATEFLIDGDTLIPPFVAMDGLGENVAKQLVRAREEGEFLSKTELRKRGGLSSTLVEK 
HHHCCEEEECCCCCCCCHHHHCCCCHHHHHHHHHHHHCCCHHHHHHHHHHCCCHHHHHHH MDEMGILGNMPEDNQLSLFDELF HHHCCCCCCCCCCCCHHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA