Definition Trichodesmium erythraeum IMS101 chromosome, complete genome.
Accession NC_008312
Length 7,750,108

The map label for this gene is rpoC2

Identifier: 113476513

GI number: 113476513

Start: 4559674

End: 4563921

Strand: Reverse

Name: rpoC2

Synonym: Tery_2937

Alternate gene names: 113476513

Gene position: 4563921-4559674 (Counterclockwise)
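
Because the gene lies on the reverse strand, the gene sequence listed in this record corresponds to the reverse complement of chromosome positions 4559674 to 4563921 (1-based, inclusive). The following is a minimal sketch of that extraction, assuming the chromosome is already available as a plain string. The helper names `reverse_complement` and `extract_gene` and the toy chromosome are illustrative and not part of the record.

```python
# Complement table for unambiguous DNA bases (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def extract_gene(chromosome: str, start: int, end: int, strand: str) -> str:
    """Extract a gene given 1-based inclusive coordinates with start <= end.

    For a reverse-strand gene the forward-strand slice is reverse
    complemented so that the result reads 5' to 3' along the coding strand.
    """
    region = chromosome[start - 1:end]
    if strand.lower() == "reverse":
        return reverse_complement(region)
    return region


# Toy chromosome for illustration only (not the NC_008312 sequence).
toy = "AAACTAAATCATGGG"
print(extract_gene(toy, 4, 12, "Reverse"))  # ATGATTTAG
```

With the real chromosome loaded, the call would be `extract_gene(chromosome, 4559674, 4563921, "Reverse")`.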

Preceding gene: 113476514

Following gene: 113476509

Centisome position: 58.89
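
The coordinate fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the centisome position is the first coordinate of the gene position (4563921) expressed as a percentage of the chromosome length. That reading is inferred from the numbers and is not stated in the record; the function names are illustrative.

```python
GENOME_LENGTH = 7_750_108          # Length field of NC_008312
START, END = 4_559_674, 4_563_921  # Start and End fields (1-based, inclusive)


def gene_length(start: int, end: int) -> int:
    """Number of bases covered by an inclusive coordinate pair."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, rounded to 2 decimals."""
    return round(100.0 * position / genome_length, 2)


print(gene_length(START, END))        # 4248
print(centisome(END, GENOME_LENGTH))  # 58.89
```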

GC content: 40.02

Gene sequence:

>4248_bases
ATGATTTTCTATAATAGAGTTATCAATAAGGGTCAACTGAAAAAGCTAATTTCTTGGTCTTTTAATAATTACGGTACAGC
AATAACTGCACAAATGGCAAATAAGCTGAAAGATTTAGGATTCCGTTATGCAACAAAGGCAGGAGTTTCTATTAGTGTAG
ATGATTTACAAGTACCTCCTTCTAAACGACAACTTTTAGATGAAGCAGAGCAAGAAATTCGCAATACAACTGAACGTTAT
ACTAAAGGGAAAATCACGGAAGTAGAACGCTTTCAAAAAGTAATTGATACTTGGAATAGTACTAGTGAAAATCTCAAAAA
CGAAGTAGTTCGTAACTTTAGATCAACTGATCCCTTGAACTCAGTTTATATGATGGCATTTTCTGGAGCGCGGGGTAATA
TATCTCAGGTTCGCCAGTTAGTTGGGATGCGTGGTTTGATGGCAGATCCACAGGGGGAAATTATTGACTTACCAATTAAG
ACGAATTTCCGGGAGGGTTTAACTGTAACTGAGTATATTATCTCTTCTTACGGTGCGCGGAAAGGTTTAGTTGATACAGC
CTTGAGAACTGCTGACTCTGGTTATTTGACACGTCGTTTGGTTGATGTTTCTCAAGATGTAATAGTTCGGGAAGCAGACT
GTGGCACTAAGCGGGGAGTGATGGTAACTAGTATGAAAGATGGTGATCGTGTTTTGATTCCGGTCAAGGATAGACTATTA
GGAAGAGTTTTAGCTGATGATGTCAAGGATCCAAAAACAGGAGAAATTGTTACTCAAGGCCATATTCAGGCGGTAAAAAA
TCAAGTATTAACCGAGGATTTGGCAAAAGCTATAGGTAAAGCAGGTGTAGAAAATGTGTTTGTGCGATCGCCTTTAACTT
GTGAATCACCGCGATCTGTCTGTCAAACTTGCTATGGGTGGAGTCTCGCTCACGGTCATATGGTTGACATGGGAGAAGCT
ATTGGGATTATTGCTGCCCAGTCGATCGGTGAGCCAGGTACCCAATTAACTATGCGAACATTCCATACTGGAGGAGTCTT
TACAGGAGAAGTGGCCCGTCAATTCGTAGCATCTTTTGGAGGTGTAGTCAAATATCCATCTAATGTGCGTACCAGATCAT
TCCGAACTCGTCATGGGGATGAAGCAATGATTGCAGAGAATAATTTTGATATGATAATATTAGGAGTTGATGGTCGTAAA
GAAACTATACCTATCGCTCAAGGTTCTATTTTAATGGTGCAAAATAACCAGCAGGTATCAGCAGGCATAGTTTTAGCAGA
AGTACCAAAGGTAGGTCAGGTCAGAAAAACCACTGAAAAAGTAACAAAAGAAGTTGCTTCAGACCTAGCAGGAGAACTTA
AATTTGCTCAGTTGGTGCAAGAAGAAAAGGTAGATAAACAAGGAACAACAACTCGCATTGCTCAAAGAGGTGGTTTGATA
TGGATATTAGAAGGAGAAGTATATAATTTGCCACCAGGTGCAGAAGCAATGGTAAAAAATGGCACTAGGATTAATATTAA
CAGCGTACTTGCGGAAACTAAATTAGTAACCGAACATGGTGGTGTGATCAGAATCTCTTCTTCTCCTTTGTACAGTCCTG
AGAAGCAAAATGAGACAAATGTTTTGGCCCAAGCTACTCAAGAATCAACAATAGATGAGTTAGAAGCCTCAATGACGGGA
ACTTCTGCCCAACTGCCAATAAATACTGAAATGCCGGATAATTTGCCTCGTGAGATTGAGGTGATTACGGCAAGTGTGCG
ACTAGACCAGGCGAAAGTTCGGTTGGAAAGTATTTCCAACCGCGAGCAATATATCATTGAGACCCAAAATGATCAACGGT
TTGCCCTAAAAGTTACCCCTGGTAGTAAGGTTGGCAACCACGAAGTTATTGCTGAGCTTTTAGACGAAAATTATCAAACT
TCTACAGGTGGTATCATTAAATATGCTGGAGTAGAAGTTCTCAAACGTGGTAAAGCGAAACAAGGCTATGAGGTAGTAAA
AGGTGGTACGTTGCTATGGATTCCTGAAGAATGCCATGAAGTTAATAAAGACATATCTTTACTTTTGGTAGAAGACGGTC
AATATGTGGAAGCTGGTACAGAAATAGTTAAGGATATTTTCTGTCAGTCTAATGGAGTAACAGAAGTACACCAGAAAAAT
GATATTCTCAGAGAAGTTGTAATTAAACCAGGCAAACTACATAGTGGGAACTATGAAATAGATTTAGGGGATTTAACCCT
GATGGATGGTCAAATTGCTACACCGGGAGAGGAAGTAATTCCAGGATTAGTAACATCTGAGTTAAAGTACATTGAGTATA
TAGAAACACCAGAGGGTCCTGGTTTATTATTGCGACCTGTGACTGAATTTCATGTGCCGGATGAACCTGGTGTGCCTTCT
CAGAAATCAATTAATTCTTCTATTGAGTTACGGGCAGTACAACGAATTCCTTTTAAGGATGGGGAAAGAGTTAAATCTGT
AGAAGGTATAGAATTGCTGCGCACCCAGTTAGTATTGGAGATTGGAGAAGAAGCCCCTCAATTAGCTGCTGATATTGAGT
TATTACCAGACCTAGAGGAAGAGGGAATAATGCGTTTGCAGTTGGTAATTCTAGAGTCTTTAGTCATTCGACGAGATGTG
GTAGCAGATACAACTCAAGGTAGTACAAGTACTCGATTATTAGTTAAAGATGGAGATGTTATTGAAGCTGGAGGGGTAGT
ATCTCGCACACAGATATTATCTAAAAAATCTGGTGAGGTTCGCGGTATCCGAGAAGGTTCAGAAGCTATTCGTCGTATTT
TGATAGTTCGAGAAGCAGATTTAGTTAAAATACCAGTAAATACTTTACCTTCTGTAGTTGAAGGTGATTTGTTGGTTGCT
GGTACCGAAATTGCTCCGGGAATAGTTATTCCAAGGTCTGGTCTGGTGTCAAAAGTTGAGGAAACTGTTTTAAATGATGG
TAGTAAAGGTTATCAGGTAATCTTGCGTAAAGGCAGACCTTATCGTGTTTCTACGGGAGCAGTATTGCTGACTATGGATG
GAGATTTAGTGCAACGGGGAGATATTTTGGTACTGTTGGTGTTTGAGCGGACTAAGACTGGGGATATTATTCAGGGTTTA
CCTCGGATTGAAGAGTTGTTGGAGGCACGTAAACCCAAAGAGTCTTGTATTTTAGTAAAATATCCTGGTCAAGCTCAGGT
TAATGTTAATGATGATAATGTAGAGGTGAGCGTGGTCTCCAGTGATGGCACAATCACAGATTATCCTCTAGGTCATGGCC
AGAATGTGATTGTTGCTGATGGACAAAATGTTAACGTGGGTGAAGCTCTGACAGATGGACCTCAGAATCCCCATGAAATT
CTGGAGACTTTCTTCAACTATTACCGAGAACATGAAAGCGCTTATGAAGCCTGCCTGAGGAGTTTTGAGGCGTGTCAGAG
GTTTTTGGTTAATCAGGTGCAAGCAGTGTATCAGTCTCAGGGTATAGATATTTCTGATAAGCATATTGAGGTAATTGTGC
GTCAGATGACAGCTAAGGTGAGGATTGATGATGGTGGTGATACTACTATGTTACCTGGGGAATTGATAGAGTTGCGACAA
GTGGAACAGGTTAATGAGGCTATGTCTATTACGGGTGGAGCCACTGCTCAATATACACCAATGTTGTTGGGGATTACAAA
AGCGTCCCTAAATACAGATAGCTTTATTTCAGCAGCAAGTTTCCAAGAGACTACACGGGTGTTGACGGAAGCTGCAATTG
AGGGTAAGTCTGATTGGTTAAGGGGATTGAAAGAGAATGTAATTATTGGGCGATTGATTCCTGCTGGTACTGGTTTTAAT
GCTTATGAAGATGCCCTGAGTGCAGAGATTAATCGTTTGGAACAAAATTGGGATGATGATCTCGATATTTTTGAGGAAGG
TGATTTGCAGAGTGTGGTTTTAGATGATCAGACAGCTCGTTCATTAGAATTTGAAAACAGTCTAAATTTGTCATCTGCGA
ATCAGAATTTTGTAGATTCTCAGGGCAAACCTCAAAGTCAAAGTTCATTTATAGATGATAGTATGTCTGAGTTTTCTCCA
GTTAAAGATAAGTCTGGGTCAGTTTTAGATGATTCAGATTTCCCTCCTGGTAATTTTGATTCAGATTTCCCAGCTGATAA
TTATGACCTAGAGCATGAAATAGATTTAGAGGATGATGTTTATGATGGTTATGATGATTTTGATGAGAATACACCAGATT
TAATTTAG
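
The GC content field can in principle be recomputed from the FASTA block above. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C among all A/C/G/T bases; the record does not say how its value was derived. The helper names and the toy FASTA text are illustrative.

```python
def read_fasta_body(text: str) -> str:
    """Join the sequence lines of a single-record FASTA block, skipping the header."""
    return "".join(
        line.strip()
        for line in text.splitlines()
        if line and not line.startswith(">")
    )


def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(b in "GC" for b in bases)
    return round(100.0 * gc / len(bases), 2)


# Toy input for illustration; paste the full gene sequence block to check the record.
example = ">toy\nATGC\nGGAT\n"
print(gc_content(read_fasta_body(example)))  # 50.0
```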

Upstream 100 bases:

>100_bases
GATTAACCTGGTTTCTACGTAACTCCATAGGAAATAGGAAATAGGTAGGGATGTTTTATTTAACTACTCCCCGTCAACTA
ACAAATAAAGGTAAAAAAAC

Downstream 100 bases:

>100_bases
GTACAAAGTAATAAGTATTTAGCTCTTAGCAATAGACCTATCCAAAAATTCCAATTTTGGAACATTAGGGTAAGGGTGTG
AGAACCTTTTGTGGAGAGAT

Product: DNA-directed RNA polymerase subunit beta'

Products: NA

Alternate protein names: RNAP subunit beta'; RNA polymerase subunit beta'; Transcriptase subunit beta'

Number of amino acids: Translated: 1415; Mature: 1415
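
The residue count follows from the gene length: 4248 bases form 1416 codons, and the final codon (TAG) is a stop codon, leaving 1415 residues. The sketch below translates a coding sequence with the standard codon assignments; the `translate` helper is illustrative and is not the tool used to build this record. It does not handle alternative start codons, which is not an issue here because the gene starts with ATG.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the gene sequence in this record.
print(translate("ATGATTTTCTATAATAGA"))  # MIFYNR
# Codons minus the stop codon.
print(4248 // 3 - 1)                    # 1415
```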

Protein sequence:

>1415_residues
MIFYNRVINKGQLKKLISWSFNNYGTAITAQMANKLKDLGFRYATKAGVSISVDDLQVPPSKRQLLDEAEQEIRNTTERY
TKGKITEVERFQKVIDTWNSTSENLKNEVVRNFRSTDPLNSVYMMAFSGARGNISQVRQLVGMRGLMADPQGEIIDLPIK
TNFREGLTVTEYIISSYGARKGLVDTALRTADSGYLTRRLVDVSQDVIVREADCGTKRGVMVTSMKDGDRVLIPVKDRLL
GRVLADDVKDPKTGEIVTQGHIQAVKNQVLTEDLAKAIGKAGVENVFVRSPLTCESPRSVCQTCYGWSLAHGHMVDMGEA
IGIIAAQSIGEPGTQLTMRTFHTGGVFTGEVARQFVASFGGVVKYPSNVRTRSFRTRHGDEAMIAENNFDMIILGVDGRK
ETIPIAQGSILMVQNNQQVSAGIVLAEVPKVGQVRKTTEKVTKEVASDLAGELKFAQLVQEEKVDKQGTTTRIAQRGGLI
WILEGEVYNLPPGAEAMVKNGTRININSVLAETKLVTEHGGVIRISSSPLYSPEKQNETNVLAQATQESTIDELEASMTG
TSAQLPINTEMPDNLPREIEVITASVRLDQAKVRLESISNREQYIIETQNDQRFALKVTPGSKVGNHEVIAELLDENYQT
STGGIIKYAGVEVLKRGKAKQGYEVVKGGTLLWIPEECHEVNKDISLLLVEDGQYVEAGTEIVKDIFCQSNGVTEVHQKN
DILREVVIKPGKLHSGNYEIDLGDLTLMDGQIATPGEEVIPGLVTSELKYIEYIETPEGPGLLLRPVTEFHVPDEPGVPS
QKSINSSIELRAVQRIPFKDGERVKSVEGIELLRTQLVLEIGEEAPQLAADIELLPDLEEEGIMRLQLVILESLVIRRDV
VADTTQGSTSTRLLVKDGDVIEAGGVVSRTQILSKKSGEVRGIREGSEAIRRILIVREADLVKIPVNTLPSVVEGDLLVA
GTEIAPGIVIPRSGLVSKVEETVLNDGSKGYQVILRKGRPYRVSTGAVLLTMDGDLVQRGDILVLLVFERTKTGDIIQGL
PRIEELLEARKPKESCILVKYPGQAQVNVNDDNVEVSVVSSDGTITDYPLGHGQNVIVADGQNVNVGEALTDGPQNPHEI
LETFFNYYREHESAYEACLRSFEACQRFLVNQVQAVYQSQGIDISDKHIEVIVRQMTAKVRIDDGGDTTMLPGELIELRQ
VEQVNEAMSITGGATAQYTPMLLGITKASLNTDSFISAASFQETTRVLTEAAIEGKSDWLRGLKENVIIGRLIPAGTGFN
AYEDALSAEINRLEQNWDDDLDIFEEGDLQSVVLDDQTARSLEFENSLNLSSANQNFVDSQGKPQSQSSFIDDSMSEFSP
VKDKSGSVLDDSDFPPGNFDSDFPADNYDLEHEIDLEDDVYDGYDDFDENTPDLI

Sequences:

>Translated_1415_residues
MIFYNRVINKGQLKKLISWSFNNYGTAITAQMANKLKDLGFRYATKAGVSISVDDLQVPPSKRQLLDEAEQEIRNTTERY
TKGKITEVERFQKVIDTWNSTSENLKNEVVRNFRSTDPLNSVYMMAFSGARGNISQVRQLVGMRGLMADPQGEIIDLPIK
TNFREGLTVTEYIISSYGARKGLVDTALRTADSGYLTRRLVDVSQDVIVREADCGTKRGVMVTSMKDGDRVLIPVKDRLL
GRVLADDVKDPKTGEIVTQGHIQAVKNQVLTEDLAKAIGKAGVENVFVRSPLTCESPRSVCQTCYGWSLAHGHMVDMGEA
IGIIAAQSIGEPGTQLTMRTFHTGGVFTGEVARQFVASFGGVVKYPSNVRTRSFRTRHGDEAMIAENNFDMIILGVDGRK
ETIPIAQGSILMVQNNQQVSAGIVLAEVPKVGQVRKTTEKVTKEVASDLAGELKFAQLVQEEKVDKQGTTTRIAQRGGLI
WILEGEVYNLPPGAEAMVKNGTRININSVLAETKLVTEHGGVIRISSSPLYSPEKQNETNVLAQATQESTIDELEASMTG
TSAQLPINTEMPDNLPREIEVITASVRLDQAKVRLESISNREQYIIETQNDQRFALKVTPGSKVGNHEVIAELLDENYQT
STGGIIKYAGVEVLKRGKAKQGYEVVKGGTLLWIPEECHEVNKDISLLLVEDGQYVEAGTEIVKDIFCQSNGVTEVHQKN
DILREVVIKPGKLHSGNYEIDLGDLTLMDGQIATPGEEVIPGLVTSELKYIEYIETPEGPGLLLRPVTEFHVPDEPGVPS
QKSINSSIELRAVQRIPFKDGERVKSVEGIELLRTQLVLEIGEEAPQLAADIELLPDLEEEGIMRLQLVILESLVIRRDV
VADTTQGSTSTRLLVKDGDVIEAGGVVSRTQILSKKSGEVRGIREGSEAIRRILIVREADLVKIPVNTLPSVVEGDLLVA
GTEIAPGIVIPRSGLVSKVEETVLNDGSKGYQVILRKGRPYRVSTGAVLLTMDGDLVQRGDILVLLVFERTKTGDIIQGL
PRIEELLEARKPKESCILVKYPGQAQVNVNDDNVEVSVVSSDGTITDYPLGHGQNVIVADGQNVNVGEALTDGPQNPHEI
LETFFNYYREHESAYEACLRSFEACQRFLVNQVQAVYQSQGIDISDKHIEVIVRQMTAKVRIDDGGDTTMLPGELIELRQ
VEQVNEAMSITGGATAQYTPMLLGITKASLNTDSFISAASFQETTRVLTEAAIEGKSDWLRGLKENVIIGRLIPAGTGFN
AYEDALSAEINRLEQNWDDDLDIFEEGDLQSVVLDDQTARSLEFENSLNLSSANQNFVDSQGKPQSQSSFIDDSMSEFSP
VKDKSGSVLDDSDFPPGNFDSDFPADNYDLEHEIDLEDDVYDGYDDFDENTPDLI
>Mature_1415_residues
MIFYNRVINKGQLKKLISWSFNNYGTAITAQMANKLKDLGFRYATKAGVSISVDDLQVPPSKRQLLDEAEQEIRNTTERY
TKGKITEVERFQKVIDTWNSTSENLKNEVVRNFRSTDPLNSVYMMAFSGARGNISQVRQLVGMRGLMADPQGEIIDLPIK
TNFREGLTVTEYIISSYGARKGLVDTALRTADSGYLTRRLVDVSQDVIVREADCGTKRGVMVTSMKDGDRVLIPVKDRLL
GRVLADDVKDPKTGEIVTQGHIQAVKNQVLTEDLAKAIGKAGVENVFVRSPLTCESPRSVCQTCYGWSLAHGHMVDMGEA
IGIIAAQSIGEPGTQLTMRTFHTGGVFTGEVARQFVASFGGVVKYPSNVRTRSFRTRHGDEAMIAENNFDMIILGVDGRK
ETIPIAQGSILMVQNNQQVSAGIVLAEVPKVGQVRKTTEKVTKEVASDLAGELKFAQLVQEEKVDKQGTTTRIAQRGGLI
WILEGEVYNLPPGAEAMVKNGTRININSVLAETKLVTEHGGVIRISSSPLYSPEKQNETNVLAQATQESTIDELEASMTG
TSAQLPINTEMPDNLPREIEVITASVRLDQAKVRLESISNREQYIIETQNDQRFALKVTPGSKVGNHEVIAELLDENYQT
STGGIIKYAGVEVLKRGKAKQGYEVVKGGTLLWIPEECHEVNKDISLLLVEDGQYVEAGTEIVKDIFCQSNGVTEVHQKN
DILREVVIKPGKLHSGNYEIDLGDLTLMDGQIATPGEEVIPGLVTSELKYIEYIETPEGPGLLLRPVTEFHVPDEPGVPS
QKSINSSIELRAVQRIPFKDGERVKSVEGIELLRTQLVLEIGEEAPQLAADIELLPDLEEEGIMRLQLVILESLVIRRDV
VADTTQGSTSTRLLVKDGDVIEAGGVVSRTQILSKKSGEVRGIREGSEAIRRILIVREADLVKIPVNTLPSVVEGDLLVA
GTEIAPGIVIPRSGLVSKVEETVLNDGSKGYQVILRKGRPYRVSTGAVLLTMDGDLVQRGDILVLLVFERTKTGDIIQGL
PRIEELLEARKPKESCILVKYPGQAQVNVNDDNVEVSVVSSDGTITDYPLGHGQNVIVADGQNVNVGEALTDGPQNPHEI
LETFFNYYREHESAYEACLRSFEACQRFLVNQVQAVYQSQGIDISDKHIEVIVRQMTAKVRIDDGGDTTMLPGELIELRQ
VEQVNEAMSITGGATAQYTPMLLGITKASLNTDSFISAASFQETTRVLTEAAIEGKSDWLRGLKENVIIGRLIPAGTGFN
AYEDALSAEINRLEQNWDDDLDIFEEGDLQSVVLDDQTARSLEFENSLNLSSANQNFVDSQGKPQSQSSFIDDSMSEFSP
VKDKSGSVLDDSDFPPGNFDSDFPADNYDLEHEIDLEDDVYDGYDDFDENTPDLI

Specific function: DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates

COG id: COG0086

COG function: function code K; DNA-directed RNA polymerase, beta' subunit/160 kD subunit

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the RNA polymerase beta' chain family. RpoC2 subfamily

Homologues:

Organism=Escherichia coli, GI2367335, Length=477, Percent_Identity=39.622641509434, Blast_Score=335, Evalue=9e-93
Organism=Caenorhabditis elegans, GI25145495, Length=307, Percent_Identity=26.3843648208469, Blast_Score=73, Evalue=9e-13
Organism=Drosophila melanogaster, GI281360912, Length=140, Percent_Identity=36.4285714285714, Blast_Score=71, Evalue=5e-12
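
Each homologue line is a comma-separated list of `key=value` fields plus a bare GI identifier, so it can be turned into a structured record. The following is a minimal sketch, assuming that field layout holds for every line and that organism names contain no commas. The `parse_homologue` helper is illustrative.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict with typed numeric fields."""
    record = {}
    # A trailing comma, if present, is tolerated.
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]
    for key in ("Length", "Blast_Score"):
        if key in record:
            record[key] = int(record[key])
    for key in ("Percent_Identity", "Evalue"):
        if key in record:
            record[key] = float(record[key])
    return record


line = ("Organism=Escherichia coli, GI2367335, Length=477, "
        "Percent_Identity=39.622641509434, Blast_Score=335, Evalue=9e-93,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], hit["Length"], hit["Evalue"])
```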

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): RPOC2_TRIEI (Q110H3)

Other databases:

- EMBL:   CP000393
- RefSeq:   YP_722574.1
- ProteinModelPortal:   Q110H3
- STRING:   Q110H3
- GeneID:   4245279
- GenomeReviews:   CP000393_GR
- KEGG:   ter:Tery_2937
- NMPDR:   fig|203124.1.peg.5950
- eggNOG:   COG0086
- HOGENOM:   HBG621785
- OMA:   IEGKSDW
- ProtClustDB:   PRK02597
- BioCyc:   TERY203124:TERY_2937-MONOMER
- HAMAP:   MF_01324
- InterPro:   IPR007066
- InterPro:   IPR007083
- InterPro:   IPR007081
- InterPro:   IPR012756
- TIGRFAMs:   TIGR02388

Pfam domain/function: PF04983 RNA_pol_Rpb1_3; PF05000 RNA_pol_Rpb1_4; PF04998 RNA_pol_Rpb1_5

EC number: 2.7.7.6

Molecular weight: Translated: 155550; Mature: 155550
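
A molecular weight of this kind is typically estimated by summing average residue masses and adding one water molecule for the free termini. The sketch below uses commonly tabulated average residue masses in daltons; whether the record used the same table and rounding is an assumption, so small differences from 155550 would not be surprising. The `molecular_weight` helper is illustrative.

```python
# Average residue masses (amino acid minus water), in daltons.
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528


def molecular_weight(protein: str) -> float:
    """Approximate average molecular weight of an unmodified peptide chain."""
    return sum(RESIDUE_MASS[aa] for aa in protein.upper()) + WATER


# Glycine alone, then the first six residues of the protein in this record.
print(round(molecular_weight("G"), 2))
print(round(molecular_weight("MIFYNR"), 2))
```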

Theoretical pI: Translated: 4.50; Mature: 4.50

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
1.8 %Met     (Translated Protein)
2.4 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
1.8 %Met     (Mature Protein)
2.4 %Cys+Met (Mature Protein)
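
The percentages above are simple residue frequencies. The following is a minimal sketch, assuming each value is the count of the named residues divided by the sequence length and rounded to one decimal place. The `residue_percent` helper and the toy sequence are illustrative.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in protein occupied by any of the given residues."""
    protein = protein.upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in residues.upper())
    return round(100.0 * count / len(protein), 1)


# Toy 10-residue sequence for illustration: one Cys and two Met.
toy = "MACMKLLLLL"
print(residue_percent(toy, "C"))   # 10.0
print(residue_percent(toy, "M"))   # 20.0
print(residue_percent(toy, "CM"))  # 30.0
```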

Secondary structure:

>Translated Secondary Structure
MIFYNRVINKGQLKKLISWSFNNYGTAITAQMANKLKDLGFRYATKAGVSISVDDLQVPP
CEEHHHHCCHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCCHHEECCCCEEEECCCCCCC
SKRQLLDEAEQEIRNTTERYTKGKITEVERFQKVIDTWNSTSENLKNEVVRNFRSTDPLN
HHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCC
SVYMMAFSGARGNISQVRQLVGMRGLMADPQGEIIDLPIKTNFREGLTVTEYIISSYGAR
CEEEEEECCCCCCHHHHHHHHHHCCCCCCCCCCEEEEECCCCCCCCCHHHHHHHHHCCCC
KGLVDTALRTADSGYLTRRLVDVSQDVIVREADCGTKRGVMVTSMKDGDRVLIPVKDRLL
CCHHHHHHHHCCCCHHHHHHHHCCCCCEEEECCCCCCCCEEEEEECCCCEEEEECHHHHH
GRVLADDVKDPKTGEIVTQGHIQAVKNQVLTEDLAKAIGKAGVENVFVRSPLTCESPRSV
HHHHHHHCCCCCCCCEEEHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCCHHHH
CQTCYGWSLAHGHMVDMGEAIGIIAAQSIGEPGTQLTMRTFHTGGVFTGEVARQFVASFG
HHHHHCCCCCCCCHHHCCHHHHHHHHHCCCCCCCEEEEEEEECCCEEEHHHHHHHHHHCC
GVVKYPSNVRTRSFRTRHGDEAMIAENNFDMIILGVDGRKETIPIAQGSILMVQNNQQVS
CEEECCCCCCHHHHHCCCCCCEEEEECCCCEEEEECCCCCCEEEECCCCEEEEECCCCCC
AGIVLAEVPKVGQVRKTTEKVTKEVASDLAGELKFAQLVQEEKVDKQGTTTRIAQRGGLI
CCEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHCCCEE
WILEGEVYNLPPGAEAMVKNGTRININSVLAETKLVTEHGGVIRISSSPLYSPEKQNETN
EEECCEEEECCCCHHHHHCCCCEEEHHHHHHHHHEEECCCCEEEECCCCCCCCCCCCCHH
VLAQATQESTIDELEASMTGTSAQLPINTEMPDNLPREIEVITASVRLDQAKVRLESISN
HHHHHHHHHHHHHHHHHCCCCCEECEECCCCCCCCCCEEEEEEEEEEHHHHHHHHHHCCC
REQYIIETQNDQRFALKVTPGSKVGNHEVIAELLDENYQTSTGGIIKYAGVEVLKRGKAK
CCEEEEEECCCCEEEEEECCCCCCCCHHHHHHHHHCCCCCCCCCEEEEHHHHHHHCCCCC
QGYEVVKGGTLLWIPEECHEVNKDISLLLVEDGQYVEAGTEIVKDIFCQSNGVTEVHQKN
CCCEEECCCEEEECCHHHHHCCCCEEEEEEECCCCHHHHHHHHHHHHHCCCCCCHHHHHH
DILREVVIKPGKLHSGNYEIDLGDLTLMDGQIATPGEEVIPGLVTSELKYIEYIETPEGP
HHHHHHHCCCCCCCCCCEEEEECCEEEECCCCCCCHHHHCCCHHHHHHHHHHEEECCCCC
GLLLRPVTEFHVPDEPGVPSQKSINSSIELRAVQRIPFKDGERVKSVEGIELLRTQLVLE
CEEEECHHHCCCCCCCCCCCHHHCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHH
IGEEAPQLAADIELLPDLEEEGIMRLQLVILESLVIRRDVVADTTQGSTSTRLLVKDGDV
HCCCCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCE
IEAGGVVSRTQILSKKSGEVRGIREGSEAIRRILIVREADLVKIPVNTLPSVVEGDLLVA
EECCCCHHHHHHHHCCCCCCCCCHHHHHHHHHHHHEECCCEEEECHHHCCHHHCCCEEEE
GTEIAPGIVIPRSGLVSKVEETVLNDGSKGYQVILRKGRPYRVSTGAVLLTMDGDLVQRG
CCCCCCCEEECCCHHHHHHHHHHHCCCCCCCHHHEECCCCEEEECCEEEEEECCCEEECC
DILVLLVFERTKTGDIIQGLPRIEELLEARKPKESCILVKYPGQAQVNVNDDNVEVSVVS
CEEEEEEEECCCCCHHHHCCHHHHHHHHHCCCCCCEEEEECCCCEEEEECCCCEEEEEEE
SDGTITDYPLGHGQNVIVADGQNVNVGEALTDGPQNPHEILETFFNYYREHESAYEACLR
CCCCEEECCCCCCCEEEEECCCCCCCCCHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH
SFEACQRFLVNQVQAVYQSQGIDISDKHIEVIVRQMTAKVRIDDGGDTTMLPGELIELRQ
HHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHEEEEEECCCCCEECCHHHHHHHH
VEQVNEAMSITGGATAQYTPMLLGITKASLNTDSFISAASFQETTRVLTEAAIEGKSDWL
HHHHHHHHHCCCCCCCCCCEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHH
RGLKENVIIGRLIPAGTGFNAYEDALSAEINRLEQNWDDDLDIFEEGDLQSVVLDDQTAR
HHHHHCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEECCCHHH
SLEFENSLNLSSANQNFVDSQGKPQSQSSFIDDSMSEFSPVKDKSGSVLDDSDFPPGNFD
CCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCC
SDFPADNYDLEHEIDLEDDVYDGYDDFDENTPDLI
CCCCCCCCCCCEECCCCCCCCCCCCCCCCCCCCCC
>Mature Secondary Structure
MIFYNRVINKGQLKKLISWSFNNYGTAITAQMANKLKDLGFRYATKAGVSISVDDLQVPP
CEEHHHHCCHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCCHHEECCCCEEEECCCCCCC
SKRQLLDEAEQEIRNTTERYTKGKITEVERFQKVIDTWNSTSENLKNEVVRNFRSTDPLN
HHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCC
SVYMMAFSGARGNISQVRQLVGMRGLMADPQGEIIDLPIKTNFREGLTVTEYIISSYGAR
CEEEEEECCCCCCHHHHHHHHHHCCCCCCCCCCEEEEECCCCCCCCCHHHHHHHHHCCCC
KGLVDTALRTADSGYLTRRLVDVSQDVIVREADCGTKRGVMVTSMKDGDRVLIPVKDRLL
CCHHHHHHHHCCCCHHHHHHHHCCCCCEEEECCCCCCCCEEEEEECCCCEEEEECHHHHH
GRVLADDVKDPKTGEIVTQGHIQAVKNQVLTEDLAKAIGKAGVENVFVRSPLTCESPRSV
HHHHHHHCCCCCCCCEEEHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCCHHHH
CQTCYGWSLAHGHMVDMGEAIGIIAAQSIGEPGTQLTMRTFHTGGVFTGEVARQFVASFG
HHHHHCCCCCCCCHHHCCHHHHHHHHHCCCCCCCEEEEEEEECCCEEEHHHHHHHHHHCC
GVVKYPSNVRTRSFRTRHGDEAMIAENNFDMIILGVDGRKETIPIAQGSILMVQNNQQVS
CEEECCCCCCHHHHHCCCCCCEEEEECCCCEEEEECCCCCCEEEECCCCEEEEECCCCCC
AGIVLAEVPKVGQVRKTTEKVTKEVASDLAGELKFAQLVQEEKVDKQGTTTRIAQRGGLI
CCEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHCCCEE
WILEGEVYNLPPGAEAMVKNGTRININSVLAETKLVTEHGGVIRISSSPLYSPEKQNETN
EEECCEEEECCCCHHHHHCCCCEEEHHHHHHHHHEEECCCCEEEECCCCCCCCCCCCCHH
VLAQATQESTIDELEASMTGTSAQLPINTEMPDNLPREIEVITASVRLDQAKVRLESISN
HHHHHHHHHHHHHHHHHCCCCCEECEECCCCCCCCCCEEEEEEEEEEHHHHHHHHHHCCC
REQYIIETQNDQRFALKVTPGSKVGNHEVIAELLDENYQTSTGGIIKYAGVEVLKRGKAK
CCEEEEEECCCCEEEEEECCCCCCCCHHHHHHHHHCCCCCCCCCEEEEHHHHHHHCCCCC
QGYEVVKGGTLLWIPEECHEVNKDISLLLVEDGQYVEAGTEIVKDIFCQSNGVTEVHQKN
CCCEEECCCEEEECCHHHHHCCCCEEEEEEECCCCHHHHHHHHHHHHHCCCCCCHHHHHH
DILREVVIKPGKLHSGNYEIDLGDLTLMDGQIATPGEEVIPGLVTSELKYIEYIETPEGP
HHHHHHHCCCCCCCCCCEEEEECCEEEECCCCCCCHHHHCCCHHHHHHHHHHEEECCCCC
GLLLRPVTEFHVPDEPGVPSQKSINSSIELRAVQRIPFKDGERVKSVEGIELLRTQLVLE
CEEEECHHHCCCCCCCCCCCHHHCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHH
IGEEAPQLAADIELLPDLEEEGIMRLQLVILESLVIRRDVVADTTQGSTSTRLLVKDGDV
HCCCCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCE
IEAGGVVSRTQILSKKSGEVRGIREGSEAIRRILIVREADLVKIPVNTLPSVVEGDLLVA
EECCCCHHHHHHHHCCCCCCCCCHHHHHHHHHHHHEECCCEEEECHHHCCHHHCCCEEEE
GTEIAPGIVIPRSGLVSKVEETVLNDGSKGYQVILRKGRPYRVSTGAVLLTMDGDLVQRG
CCCCCCCEEECCCHHHHHHHHHHHCCCCCCCHHHEECCCCEEEECCEEEEEECCCEEECC
DILVLLVFERTKTGDIIQGLPRIEELLEARKPKESCILVKYPGQAQVNVNDDNVEVSVVS
CEEEEEEEECCCCCHHHHCCHHHHHHHHHCCCCCCEEEEECCCCEEEEECCCCEEEEEEE
SDGTITDYPLGHGQNVIVADGQNVNVGEALTDGPQNPHEILETFFNYYREHESAYEACLR
CCCCEEECCCCCCCEEEEECCCCCCCCCHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH
SFEACQRFLVNQVQAVYQSQGIDISDKHIEVIVRQMTAKVRIDDGGDTTMLPGELIELRQ
HHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHEEEEEECCCCCEECCHHHHHHHH
VEQVNEAMSITGGATAQYTPMLLGITKASLNTDSFISAASFQETTRVLTEAAIEGKSDWL
HHHHHHHHHCCCCCCCCCCEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHH
RGLKENVIIGRLIPAGTGFNAYEDALSAEINRLEQNWDDDLDIFEEGDLQSVVLDDQTAR
HHHHHCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEECCCHHH
SLEFENSLNLSSANQNFVDSQGKPQSQSSFIDDSMSEFSPVKDKSGSVLDDSDFPPGNFD
CCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCC
SDFPADNYDLEHEIDLEDDVYDGYDDFDENTPDLI
CCCCCCCCCCCEECCCCCCCCCCCCCCCCCCCCCC
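
The prediction lines above can be summarized as per-state percentages. The sketch below assumes the conventional three-state alphabet (H for helix, E for strand, C for coil), which the record does not define explicitly. The `ss_composition` helper is illustrative.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of H, E and C states in a secondary structure string."""
    states = "".join(states.split()).upper()  # drop line breaks and spaces
    if not states:
        raise ValueError("empty prediction string")
    counts = Counter(states)
    unknown = set(counts) - set("HEC")
    if unknown:
        raise ValueError(f"unexpected state symbols: {sorted(unknown)}")
    total = len(states)
    return {s: round(100.0 * counts[s] / total, 1) for s in "HEC"}


# First ten states of the prediction in this record.
print(ss_composition("CEEHHHHCCH"))  # {'H': 50.0, 'E': 20.0, 'C': 30.0}
```

To summarize the whole protein, concatenate only the state lines (every second line of the block) before calling the function.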

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA