Definition Candidatus Blochmannia pennsylvanicus str. BPEN, complete genome.
Accession NC_007292
Length 791,654

Click here to switch to the map view.

The map label for this gene is rpoB

Identifier: 71892326

GI number: 71892326

Start: 692991

End: 697016

Strand: Reverse

Name: rpoB

Synonym: BPEN_577

Alternate gene names: 71892326

Gene position: 697016-692991 (Counterclockwise)

Preceding gene: 71892327

Following gene: 71892325

Centisome position: 88.05

GC content: 33.56

Gene sequence:

>4026_bases
ATGGTGTATTCTTATACTGAAAAGAAACGTATTCGTAAAGATTTTGGAAAACGCCCTCAGGTTTTGGATATTCCATATCT
TCTTTCTATTCAGATTGATTCTTTTCAAAAATTTATTAAACAAGATCCAGAGGAACCATGTGGACTGGAAGCAGCTTTTA
GATCTGTTTTTCCAATTAAAAGTTATAATGGCAACGCTGAATTACAATACATCAAATATCAATTAGGAGAACCAACGTTT
GATGTGAAAGAATGTCAGACACGTGGTGCTACCTTTTCTGCGCCGTTACGTGTACGCCTATGTTTAATTATTTATGAACG
AGAAGGATTAAATAACATAATAAAAAATACCAAAGAACAAGAAGTATATATGGGTGAAATTCCTCTCATGACTAATAACG
GTACTTTTATTATCAATGGTATTGAACGGGTTATTGTATCTCAATTACATAGAAGTCCCGGTGTGTTTTTTGATAGCGAT
AAAGGTAAAACACATTCATCCGGAAAAGTATTATATAATGCACGTATTATTCCATATCGTGGGTCTTGGTTAGATTTTGA
ATTCGATTTAAAAGATAATTTATTTGTTCGAATTGATCGGCGACGTAAGTTACCAGTAACAGTAATATTGCGTGCTTTAA
ATTATACTACCGATCAAATTTTAAATATATTCTTTAACAAAGTAATATACGAAATTCAGAATAATACATTATATATGCAT
TTAATCCCTGAAAGATTACGCGGTGAAACAGCATCTTTTGATATTGCTGTTAATGGTGTTATATATGTAAAAAAAGGACG
TCGTATTGCAGCTAAACACATTCGTCAATTAAAAAAGGATAAAATTTCAAAAATTGAAGTACCCATGGATTATATAATTG
GTAAAGTTGTTATAAAAGACTATTTCGATAAGAACACGAATATACCTATTGTTACAGCTAATACAGAAATATCTTCTGAC
ATCTTGCATAACTTAATTCGATCAGGATATGAATCCATAGAAACATTATTCAGCAATGATTTAGATTACGGTAATTACAT
CTCTGAAACTTTGCGTATTGATGCAACTACAAATAAATTTGACGCATTAGTAGAAATTTATCGTGTAATGCGTCCAGGGG
AACCTCCTACTAAAGAAGCTGCCGAATATTTATTTGAAAATTTATTTTTTTCAGAAGAGCGTTATGACTTGTCTTCTGTA
GGAAGAATGAAATTCAATCGGTCATTACAACGAGTACAAATAGAAGATTTAGGAACATTAAAAAAGGATGATATTGTTGA
TGTAATAAAAAAATTGATTGATATTAGAAACGGTAAAGGCGAAGTAGATGATATCGATCATTTGGGGAATCGACGTATTC
GTTCTGTAGGTGAAATGGCAGAAAATCAATTTAGAATTGGATTAGTTAGAGTAGAACGCGCGGTAAAAGAACGCTTATCT
TTAGGTGATTTAGATGTGTTAACTCCTCAAGACTTGATTAATGCTAAACCAATTTCAGCTGCAGTAAGAGAATTTTTTAC
TTCTAGCCAATTATCTCAGTTTATGGATCAAAATAATCCATTATCAGAAATTACTCATAAACGTCGCATATCTGCTTTAG
GACCAGGAGGGCTAACTCGAGAACGTGCGGGATTTGAAGTACGTGATGTCCATCCTACACATTATGGTAGAGTCTGCCCA
ATCGAAACTCCAGAAGGACCAAATATAGGTTTAATTAATTCATTATCTGTATATGCTCGAGCTAATAAGTATGGATTTTT
AGAAACTCCATACCGAAAAGTCCAAAATGGCGTTGTTAGTAATGACATTCATTACTTATCTGCAATTGAAGAGGGTGATT
TCGTTATTGCTCAAGCAAATACAAACTTAAATTCAATAGGTGAATTTATTGACGATCTGGTAACTTGCAGAAATAAAGGT
GAATCTGGTCTTTTTAAAAAAGACCAAGTTGATTACATGGATGTATCTACACAACAAATAGTTTCAGTCGCTGCTTCATT
AATCCCTTTTCTTGAGCACGATGATGCTAATCGTGCTCTTATGGGCGCAAACATGCAACGCCAAGCCGTTCCTGTTTTAT
GTAGTGAAAAACCATTAGTAGGCACTGGAATGGAGCGCGCAGTAGCTATAGATTCTGGTGTTACCGTAGTAGCTAAACGC
GGCGGCGTCATTAAATATGTAGATGCATCACGTATTGTAATACATGTTAACAAGAATGAAACGCATACTGAAGAATCAGG
AATTGATATCTATCAATTAACAAAATATATTCGATCCAACCAAAATACTTGCATTAATCAACGTCCTTGCGTGTCTTTAG
GAGAATTAGTAGAACATGGAGATGTTATAGCAGATGGTCCATCTACTGATTTAGGAGAATTAGCTTTAGGTCAAAATATG
CGAATTGCATTTATGCCTTGGAATGGATATAATTTTGAAGATTCGATGTTGGTCTCGGAACGTGTAGTGCAAGAAGATAA
ATTTACAAGTATACATATTCAAGAGTTAACTTGTGTATCTCGTGATACTAAATTAGGGCCTGAAGAAATCACGGCTGATA
TTCCAAATGTAGGAGAGACAGCTTTATCTAAATTAGATGAATCTGGAATTATCTATATCGGCGCAGAAGTAATAGGAGGA
GATATCCTCGTCGGGAAAGTTACACCCAAAGGAGAAACTCAATTAACACCAGAAGAGAAATTATTGCGTGCTATTTTTGG
TGAAAAAGCATCCGATGTAAAAGATTCATCTTTACGTGTTCCCAACGGGGTTTGTGGTACTGTAATTGATGTACAAATAT
TTACTAGAGATGGTATTAATAAAGATAAACGTTCATTAATAATTGAATCCGAGAAATTAAAACAGGTTAAGAAAGATTTA
AGTGAAGAATTACAAATTTTTGAATCAGCTTTATTTGATCGTGTCTGCGATGTATTAATGACCAGTGGAATTGATAAAAA
GAAATTGTTTGAAACTAGCCGCAATGCTTGGTTAGATTTGGTGTTATCAGATCCAGAAAAACAATATCAATTATCTCAAT
TAACTAAACAATATTTTGATTTAAAACGCATGTTTGAAAAAAAATTAGAAATTCAACATCGTAAAATCACTCAAGGAGAT
GAATTAGCTCCCGGTATTTTAAAAATAGTTAAAGTATATTTAGCTGTAAAACGTCAGATACAACCTGGCGACAAAATGGC
AGGAAGACATGGAAATAAAGGAGTAATCTCTAAAATTAACCCCATTGAAGATATGCCATATGATCAACATGGAATACCGG
TAGATATCGTACTCAATCCTCTTGGAGTACCATCTCGAATGAATATTGGTCAAATTTTAGAAACACATCTTGGTATGGCG
GCAAAAGGTATCGGAGATAAAATAAACTTTATGTTGCAACAACATAAAGAAGCAAATCAATTAAGAAGATTTATGCAGCA
AGCGTATAATTTAGGAGAAGGATCGCGTCAACACATCAATCTTAATTCATTTTCAGATATAGAAATATTAAAATTAGCTA
AAAATTTAAAAAAAGGAATGCCTATTGCTACTCCGGTATTCGATGGGGCTACAGAAAAAGAAATTAAAGATCTTTTAAAA
TTATCCGGATTACCTAGTTCTGGTCAGATTACATTATTTGATGGGTGCACAGGAGAAGCATTCGAGAGACAAGTTACTGT
GGGCTATATGTACATGTTAAAACTAAATCATTTGGTAGACGATAAAATGCATGCACGTTCTACTGGTTCTTATAGTTTAG
TAACCCAACAACCACTGGGTGGGAAAGCTCAGTTTGGAGGTCAACGTTTTGGAGAAATGGAAGTATGGGCATTAGAAGCT
TATGGAGCATCATATACTTTACAAGAAATGTTAACTGTAAAATCAGATGATGTAAATGGACGTACTAAAATGTATAAAAA
TATTGTTGATGGCAATCATATGATGGAACCGGGAATGCCTGAGTCTTTTAATGTTTTATTAAAAGAAATTCGTTCTTTAG
CAATTAATATTGAACTAGAAGATTAA

Upstream 100 bases:

>100_bases
CTAAATCTAAAATAGATATAGAAATGTATAGCTATATTATATAGCAGCTAACTTATTTCTTTGAAAAAACAATCGCCCAT
ATTAACTATCGAGGAACTTT

Downstream 100 bases:

>100_bases
TATTTTTAATTAATCTCTATATAGACATATTTTATAAAGCATATAGTGCTTATGTGTTCAAAGGTATAACTTTAATAAAG
GCTCATCTGTGAAAGATTTA

Product: DNA-directed RNA polymerase subunit beta

Products: NA

Alternate protein names: RNAP subunit beta; RNA polymerase subunit beta; Transcriptase subunit beta

Number of amino acids: Translated: 1341; Mature: 1341

Protein sequence:

>1341_residues
MVYSYTEKKRIRKDFGKRPQVLDIPYLLSIQIDSFQKFIKQDPEEPCGLEAAFRSVFPIKSYNGNAELQYIKYQLGEPTF
DVKECQTRGATFSAPLRVRLCLIIYEREGLNNIIKNTKEQEVYMGEIPLMTNNGTFIINGIERVIVSQLHRSPGVFFDSD
KGKTHSSGKVLYNARIIPYRGSWLDFEFDLKDNLFVRIDRRRKLPVTVILRALNYTTDQILNIFFNKVIYEIQNNTLYMH
LIPERLRGETASFDIAVNGVIYVKKGRRIAAKHIRQLKKDKISKIEVPMDYIIGKVVIKDYFDKNTNIPIVTANTEISSD
ILHNLIRSGYESIETLFSNDLDYGNYISETLRIDATTNKFDALVEIYRVMRPGEPPTKEAAEYLFENLFFSEERYDLSSV
GRMKFNRSLQRVQIEDLGTLKKDDIVDVIKKLIDIRNGKGEVDDIDHLGNRRIRSVGEMAENQFRIGLVRVERAVKERLS
LGDLDVLTPQDLINAKPISAAVREFFTSSQLSQFMDQNNPLSEITHKRRISALGPGGLTRERAGFEVRDVHPTHYGRVCP
IETPEGPNIGLINSLSVYARANKYGFLETPYRKVQNGVVSNDIHYLSAIEEGDFVIAQANTNLNSIGEFIDDLVTCRNKG
ESGLFKKDQVDYMDVSTQQIVSVAASLIPFLEHDDANRALMGANMQRQAVPVLCSEKPLVGTGMERAVAIDSGVTVVAKR
GGVIKYVDASRIVIHVNKNETHTEESGIDIYQLTKYIRSNQNTCINQRPCVSLGELVEHGDVIADGPSTDLGELALGQNM
RIAFMPWNGYNFEDSMLVSERVVQEDKFTSIHIQELTCVSRDTKLGPEEITADIPNVGETALSKLDESGIIYIGAEVIGG
DILVGKVTPKGETQLTPEEKLLRAIFGEKASDVKDSSLRVPNGVCGTVIDVQIFTRDGINKDKRSLIIESEKLKQVKKDL
SEELQIFESALFDRVCDVLMTSGIDKKKLFETSRNAWLDLVLSDPEKQYQLSQLTKQYFDLKRMFEKKLEIQHRKITQGD
ELAPGILKIVKVYLAVKRQIQPGDKMAGRHGNKGVISKINPIEDMPYDQHGIPVDIVLNPLGVPSRMNIGQILETHLGMA
AKGIGDKINFMLQQHKEANQLRRFMQQAYNLGEGSRQHINLNSFSDIEILKLAKNLKKGMPIATPVFDGATEKEIKDLLK
LSGLPSSGQITLFDGCTGEAFERQVTVGYMYMLKLNHLVDDKMHARSTGSYSLVTQQPLGGKAQFGGQRFGEMEVWALEA
YGASYTLQEMLTVKSDDVNGRTKMYKNIVDGNHMMEPGMPESFNVLLKEIRSLAINIELED

Sequences:

>Translated_1341_residues
MVYSYTEKKRIRKDFGKRPQVLDIPYLLSIQIDSFQKFIKQDPEEPCGLEAAFRSVFPIKSYNGNAELQYIKYQLGEPTF
DVKECQTRGATFSAPLRVRLCLIIYEREGLNNIIKNTKEQEVYMGEIPLMTNNGTFIINGIERVIVSQLHRSPGVFFDSD
KGKTHSSGKVLYNARIIPYRGSWLDFEFDLKDNLFVRIDRRRKLPVTVILRALNYTTDQILNIFFNKVIYEIQNNTLYMH
LIPERLRGETASFDIAVNGVIYVKKGRRIAAKHIRQLKKDKISKIEVPMDYIIGKVVIKDYFDKNTNIPIVTANTEISSD
ILHNLIRSGYESIETLFSNDLDYGNYISETLRIDATTNKFDALVEIYRVMRPGEPPTKEAAEYLFENLFFSEERYDLSSV
GRMKFNRSLQRVQIEDLGTLKKDDIVDVIKKLIDIRNGKGEVDDIDHLGNRRIRSVGEMAENQFRIGLVRVERAVKERLS
LGDLDVLTPQDLINAKPISAAVREFFTSSQLSQFMDQNNPLSEITHKRRISALGPGGLTRERAGFEVRDVHPTHYGRVCP
IETPEGPNIGLINSLSVYARANKYGFLETPYRKVQNGVVSNDIHYLSAIEEGDFVIAQANTNLNSIGEFIDDLVTCRNKG
ESGLFKKDQVDYMDVSTQQIVSVAASLIPFLEHDDANRALMGANMQRQAVPVLCSEKPLVGTGMERAVAIDSGVTVVAKR
GGVIKYVDASRIVIHVNKNETHTEESGIDIYQLTKYIRSNQNTCINQRPCVSLGELVEHGDVIADGPSTDLGELALGQNM
RIAFMPWNGYNFEDSMLVSERVVQEDKFTSIHIQELTCVSRDTKLGPEEITADIPNVGETALSKLDESGIIYIGAEVIGG
DILVGKVTPKGETQLTPEEKLLRAIFGEKASDVKDSSLRVPNGVCGTVIDVQIFTRDGINKDKRSLIIESEKLKQVKKDL
SEELQIFESALFDRVCDVLMTSGIDKKKLFETSRNAWLDLVLSDPEKQYQLSQLTKQYFDLKRMFEKKLEIQHRKITQGD
ELAPGILKIVKVYLAVKRQIQPGDKMAGRHGNKGVISKINPIEDMPYDQHGIPVDIVLNPLGVPSRMNIGQILETHLGMA
AKGIGDKINFMLQQHKEANQLRRFMQQAYNLGEGSRQHINLNSFSDIEILKLAKNLKKGMPIATPVFDGATEKEIKDLLK
LSGLPSSGQITLFDGCTGEAFERQVTVGYMYMLKLNHLVDDKMHARSTGSYSLVTQQPLGGKAQFGGQRFGEMEVWALEA
YGASYTLQEMLTVKSDDVNGRTKMYKNIVDGNHMMEPGMPESFNVLLKEIRSLAINIELED
>Mature_1341_residues
MVYSYTEKKRIRKDFGKRPQVLDIPYLLSIQIDSFQKFIKQDPEEPCGLEAAFRSVFPIKSYNGNAELQYIKYQLGEPTF
DVKECQTRGATFSAPLRVRLCLIIYEREGLNNIIKNTKEQEVYMGEIPLMTNNGTFIINGIERVIVSQLHRSPGVFFDSD
KGKTHSSGKVLYNARIIPYRGSWLDFEFDLKDNLFVRIDRRRKLPVTVILRALNYTTDQILNIFFNKVIYEIQNNTLYMH
LIPERLRGETASFDIAVNGVIYVKKGRRIAAKHIRQLKKDKISKIEVPMDYIIGKVVIKDYFDKNTNIPIVTANTEISSD
ILHNLIRSGYESIETLFSNDLDYGNYISETLRIDATTNKFDALVEIYRVMRPGEPPTKEAAEYLFENLFFSEERYDLSSV
GRMKFNRSLQRVQIEDLGTLKKDDIVDVIKKLIDIRNGKGEVDDIDHLGNRRIRSVGEMAENQFRIGLVRVERAVKERLS
LGDLDVLTPQDLINAKPISAAVREFFTSSQLSQFMDQNNPLSEITHKRRISALGPGGLTRERAGFEVRDVHPTHYGRVCP
IETPEGPNIGLINSLSVYARANKYGFLETPYRKVQNGVVSNDIHYLSAIEEGDFVIAQANTNLNSIGEFIDDLVTCRNKG
ESGLFKKDQVDYMDVSTQQIVSVAASLIPFLEHDDANRALMGANMQRQAVPVLCSEKPLVGTGMERAVAIDSGVTVVAKR
GGVIKYVDASRIVIHVNKNETHTEESGIDIYQLTKYIRSNQNTCINQRPCVSLGELVEHGDVIADGPSTDLGELALGQNM
RIAFMPWNGYNFEDSMLVSERVVQEDKFTSIHIQELTCVSRDTKLGPEEITADIPNVGETALSKLDESGIIYIGAEVIGG
DILVGKVTPKGETQLTPEEKLLRAIFGEKASDVKDSSLRVPNGVCGTVIDVQIFTRDGINKDKRSLIIESEKLKQVKKDL
SEELQIFESALFDRVCDVLMTSGIDKKKLFETSRNAWLDLVLSDPEKQYQLSQLTKQYFDLKRMFEKKLEIQHRKITQGD
ELAPGILKIVKVYLAVKRQIQPGDKMAGRHGNKGVISKINPIEDMPYDQHGIPVDIVLNPLGVPSRMNIGQILETHLGMA
AKGIGDKINFMLQQHKEANQLRRFMQQAYNLGEGSRQHINLNSFSDIEILKLAKNLKKGMPIATPVFDGATEKEIKDLLK
LSGLPSSGQITLFDGCTGEAFERQVTVGYMYMLKLNHLVDDKMHARSTGSYSLVTQQPLGGKAQFGGQRFGEMEVWALEA
YGASYTLQEMLTVKSDDVNGRTKMYKNIVDGNHMMEPGMPESFNVLLKEIRSLAINIELED

Specific function: DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates

COG id: COG0085

COG function: function code K; DNA-directed RNA polymerase, beta subunit/140 kD subunit

Gene ontology:

Cell location: Cytoplasm [C]

Metaboloic importance: Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the RNA polymerase beta chain family

Homologues:

Organism=Homo sapiens, GI33469941, Length=297, Percent_Identity=30.3030303030303, Blast_Score=123, Evalue=1e-27,
Organism=Homo sapiens, GI212286172, Length=297, Percent_Identity=30.3030303030303, Blast_Score=123, Evalue=2e-27,
Organism=Homo sapiens, GI4505941, Length=305, Percent_Identity=29.5081967213115, Blast_Score=115, Evalue=3e-25,
Organism=Homo sapiens, GI238908505, Length=160, Percent_Identity=34.375, Blast_Score=84, Evalue=1e-15,
Organism=Homo sapiens, GI238908503, Length=160, Percent_Identity=34.375, Blast_Score=84, Evalue=1e-15,
Organism=Escherichia coli, GI1790419, Length=1341, Percent_Identity=81.1334824757644, Blast_Score=2276, Evalue=0.0,
Organism=Caenorhabditis elegans, GI17552304, Length=249, Percent_Identity=31.7269076305221, Blast_Score=116, Evalue=1e-25,
Organism=Caenorhabditis elegans, GI17506623, Length=237, Percent_Identity=31.2236286919831, Blast_Score=115, Evalue=2e-25,
Organism=Caenorhabditis elegans, GI25144348, Length=297, Percent_Identity=28.2828282828283, Blast_Score=92, Evalue=3e-18,
Organism=Saccharomyces cerevisiae, GI6324725, Length=250, Percent_Identity=32.4, Blast_Score=122, Evalue=5e-28,
Organism=Saccharomyces cerevisiae, GI6325267, Length=253, Percent_Identity=30.4347826086957, Blast_Score=107, Evalue=2e-23,
Organism=Saccharomyces cerevisiae, GI6324781, Length=168, Percent_Identity=33.3333333333333, Blast_Score=87, Evalue=1e-17,
Organism=Drosophila melanogaster, GI17136444, Length=249, Percent_Identity=30.5220883534137, Blast_Score=108, Evalue=2e-23,
Organism=Drosophila melanogaster, GI17136446, Length=256, Percent_Identity=31.640625, Blast_Score=107, Evalue=4e-23,
Organism=Drosophila melanogaster, GI17647877, Length=297, Percent_Identity=26.9360269360269, Blast_Score=94, Evalue=9e-19,

Paralogues:

None

Copy number: 4233 Molecules/Cell In: Growth Phase, Glucose-minimal MOPS Media. 2,500 Molecules/Cell In: Glucose minimal media [C]

Swissprot (AC and ID): RPOB_BLOPB (Q492B9)

Other databases:

- EMBL:   CP000016
- RefSeq:   YP_278060.1
- ProteinModelPortal:   Q492B9
- STRING:   Q492B9
- GeneID:   3563227
- GenomeReviews:   CP000016_GR
- KEGG:   bpn:BPEN_577
- eggNOG:   COG0085
- HOGENOM:   HBG285547
- OMA:   FMTWEGY
- ProtClustDB:   PRK00405
- BioCyc:   CBLO291272:BPEN_577-MONOMER
- HAMAP:   MF_01321
- InterPro:   IPR010243
- InterPro:   IPR019462
- InterPro:   IPR015712
- InterPro:   IPR007120
- InterPro:   IPR007121
- InterPro:   IPR007644
- InterPro:   IPR007642
- InterPro:   IPR007645
- InterPro:   IPR007641
- InterPro:   IPR014724
- Gene3D:   G3DSA:2.40.50.150
- PANTHER:   PTHR20856
- TIGRFAMs:   TIGR02013

Pfam domain/function: PF04563 RNA_pol_Rpb2_1; PF04561 RNA_pol_Rpb2_2; PF04565 RNA_pol_Rpb2_3; PF10385 RNA_pol_Rpb2_45; PF00562 RNA_pol_Rpb2_6; PF04560 RNA_pol_Rpb2_7

EC number: =2.7.7.6

Molecular weight: Translated: 151225; Mature: 151225

Theoretical pI: Translated: 6.66; Mature: 6.66

Prosite motif: PS01166 RNA_POL_BETA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.9 %Cys     (Translated Protein)
2.5 %Met     (Translated Protein)
3.4 %Cys+Met (Translated Protein)
0.9 %Cys     (Mature Protein)
2.5 %Met     (Mature Protein)
3.4 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MVYSYTEKKRIRKDFGKRPQVLDIPYLLSIQIDSFQKFIKQDPEEPCGLEAAFRSVFPIK
CCCCCHHHHHHHHHHCCCCCEEECCEEEEEEHHHHHHHHHCCCCCCCCHHHHHHHHCCEE
SYNGNAELQYIKYQLGEPTFDVKECQTRGATFSAPLRVRLCLIIYEREGLNNIIKNTKEQ
CCCCCCEEEEEEEECCCCCCCHHHHHHCCCCCCCCCEEEEEEEEEECCCHHHHHCCCCCC
EVYMGEIPLMTNNGTFIINGIERVIVSQLHRSPGVFFDSDKGKTHSSGKVLYNARIIPYR
EEEECCCEEEECCCEEEECHHHHHHHHHHHCCCCCEEECCCCCCCCCCEEEEEEEEEEEC
GSWLDFEFDLKDNLFVRIDRRRKLPVTVILRALNYTTDQILNIFFNKVIYEIQNNTLYMH
CCEEEEEEECCCCEEEEECCCCCCCHHHHHHHHCCCHHHHHHHHHHHHHHEECCCEEEEE
LIPERLRGETASFDIAVNGVIYVKKGRRIAAKHIRQLKKDKISKIEVPMDYIIGKVVIKD
ECCHHHCCCCCEEEEEEEEEEEEECCCHHHHHHHHHHHHHHHCEECCCHHHHHHHHHHHH
YFDKNTNIPIVTANTEISSDILHNLIRSGYESIETLFSNDLDYGNYISETLRIDATTNKF
HHCCCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHEEEECCCHHH
DALVEIYRVMRPGEPPTKEAAEYLFENLFFSEERYDLSSVGRMKFNRSLQRVQIEDLGTL
HHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCHHEEEHHHCCCC
KKDDIVDVIKKLIDIRNGKGEVDDIDHLGNRRIRSVGEMAENQFRIGLVRVERAVKERLS
CCCHHHHHHHHHHHCCCCCCCCCHHHHHCCHHHHHHHHHHHCCCEEHHHHHHHHHHHHCC
LGDLDVLTPQDLINAKPISAAVREFFTSSQLSQFMDQNNPLSEITHKRRISALGPGGLTR
CCCCCCCCCHHHCCCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCH
ERAGFEVRDVHPTHYGRVCPIETPEGPNIGLINSLSVYARANKYGFLETPYRKVQNGVVS
HHCCCEEEECCCCCCCCCCCCCCCCCCCCCEECCCEEEEECCCCCCCCCHHHHHHCCCCC
NDIHYLSAIEEGDFVIAQANTNLNSIGEFIDDLVTCRNKGESGLFKKDQVDYMDVSTQQI
CCCHHHEEECCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHH
VSVAASLIPFLEHDDANRALMGANMQRQAVPVLCSEKPLVGTGMERAVAIDSGVTVVAKR
HHHHHHHHHHHCCCCCCCEEECCCCCCCCCCEEECCCCCCCCCCCCEEEECCCCEEEEEC
GGVIKYVDASRIVIHVNKNETHTEESGIDIYQLTKYIRSNQNTCINQRPCVSLGELVEHG
CCEEEEECCCEEEEEECCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHCC
DVIADGPSTDLGELALGQNMRIAFMPWNGYNFEDSMLVSERVVQEDKFTSIHIQELTCVS
CEEECCCCCCHHHHHCCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCCEEEEEHHHEEC
RDTKLGPEEITADIPNVGETALSKLDESGIIYIGAEVIGGDILVGKVTPKGETQLTPEEK
CCCCCCHHHHHCCCCCCCHHHHHHCCCCCEEEEEHEEECCEEEEEEECCCCCCCCCHHHH
LLRAIFGEKASDVKDSSLRVPNGVCGTVIDVQIFTRDGINKDKRSLIIESEKLKQVKKDL
HHHHHHCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCHHHEEECHHHHHHHHHHH
SEELQIFESALFDRVCDVLMTSGIDKKKLFETSRNAWLDLVLSDPEKQYQLSQLTKQYFD
HHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHHH
LKRMFEKKLEIQHRKITQGDELAPGILKIVKVYLAVKRQIQPGDKMAGRHGNKGVISKIN
HHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHCCCCHHHCCCCCCCCCCCCCC
PIEDMPYDQHGIPVDIVLNPLGVPSRMNIGQILETHLGMAAKGIGDKINFMLQQHKEANQ
CCCCCCCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHCHHHHCCCHHHHHHHHHHHHHHH
LRRFMQQAYNLGEGSRQHINLNSFSDIEILKLAKNLKKGMPIATPVFDGATEKEIKDLLK
HHHHHHHHHCCCCCCCCEEECCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHH
LSGLPSSGQITLFDGCTGEAFERQVTVGYMYMLKLNHLVDDKMHARSTGSYSLVTQQPLG
HCCCCCCCCEEEEECCCCHHHHHHHHHHHHEEHHHHHHHCCHHHHCCCCCEEEEECCCCC
GKAQFGGQRFGEMEVWALEAYGASYTLQEMLTVKSDDVNGRTKMYKNIVDGNHMMEPGMP
CCCCCCCCCCCCEEEEEEEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCCCCC
ESFNVLLKEIRSLAINIELED
HHHHHHHHHHHHEEEEEEECC
>Mature Secondary Structure
MVYSYTEKKRIRKDFGKRPQVLDIPYLLSIQIDSFQKFIKQDPEEPCGLEAAFRSVFPIK
CCCCCHHHHHHHHHHCCCCCEEECCEEEEEEHHHHHHHHHCCCCCCCCHHHHHHHHCCEE
SYNGNAELQYIKYQLGEPTFDVKECQTRGATFSAPLRVRLCLIIYEREGLNNIIKNTKEQ
CCCCCCEEEEEEEECCCCCCCHHHHHHCCCCCCCCCEEEEEEEEEECCCHHHHHCCCCCC
EVYMGEIPLMTNNGTFIINGIERVIVSQLHRSPGVFFDSDKGKTHSSGKVLYNARIIPYR
EEEECCCEEEECCCEEEECHHHHHHHHHHHCCCCCEEECCCCCCCCCCEEEEEEEEEEEC
GSWLDFEFDLKDNLFVRIDRRRKLPVTVILRALNYTTDQILNIFFNKVIYEIQNNTLYMH
CCEEEEEEECCCCEEEEECCCCCCCHHHHHHHHCCCHHHHHHHHHHHHHHEECCCEEEEE
LIPERLRGETASFDIAVNGVIYVKKGRRIAAKHIRQLKKDKISKIEVPMDYIIGKVVIKD
ECCHHHCCCCCEEEEEEEEEEEEECCCHHHHHHHHHHHHHHHCEECCCHHHHHHHHHHHH
YFDKNTNIPIVTANTEISSDILHNLIRSGYESIETLFSNDLDYGNYISETLRIDATTNKF
HHCCCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHEEEECCCHHH
DALVEIYRVMRPGEPPTKEAAEYLFENLFFSEERYDLSSVGRMKFNRSLQRVQIEDLGTL
HHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCHHEEEHHHCCCC
KKDDIVDVIKKLIDIRNGKGEVDDIDHLGNRRIRSVGEMAENQFRIGLVRVERAVKERLS
CCCHHHHHHHHHHHCCCCCCCCCHHHHHCCHHHHHHHHHHHCCCEEHHHHHHHHHHHHCC
LGDLDVLTPQDLINAKPISAAVREFFTSSQLSQFMDQNNPLSEITHKRRISALGPGGLTR
CCCCCCCCCHHHCCCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCH
ERAGFEVRDVHPTHYGRVCPIETPEGPNIGLINSLSVYARANKYGFLETPYRKVQNGVVS
HHCCCEEEECCCCCCCCCCCCCCCCCCCCCEECCCEEEEECCCCCCCCCHHHHHHCCCCC
NDIHYLSAIEEGDFVIAQANTNLNSIGEFIDDLVTCRNKGESGLFKKDQVDYMDVSTQQI
CCCHHHEEECCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHH
VSVAASLIPFLEHDDANRALMGANMQRQAVPVLCSEKPLVGTGMERAVAIDSGVTVVAKR
HHHHHHHHHHHCCCCCCCEEECCCCCCCCCCEEECCCCCCCCCCCCEEEECCCCEEEEEC
GGVIKYVDASRIVIHVNKNETHTEESGIDIYQLTKYIRSNQNTCINQRPCVSLGELVEHG
CCEEEEECCCEEEEEECCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHCC
DVIADGPSTDLGELALGQNMRIAFMPWNGYNFEDSMLVSERVVQEDKFTSIHIQELTCVS
CEEECCCCCCHHHHHCCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCCEEEEEHHHEEC
RDTKLGPEEITADIPNVGETALSKLDESGIIYIGAEVIGGDILVGKVTPKGETQLTPEEK
CCCCCCHHHHHCCCCCCCHHHHHHCCCCCEEEEEHEEECCEEEEEEECCCCCCCCCHHHH
LLRAIFGEKASDVKDSSLRVPNGVCGTVIDVQIFTRDGINKDKRSLIIESEKLKQVKKDL
HHHHHHCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCHHHEEECHHHHHHHHHHH
SEELQIFESALFDRVCDVLMTSGIDKKKLFETSRNAWLDLVLSDPEKQYQLSQLTKQYFD
HHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHHH
LKRMFEKKLEIQHRKITQGDELAPGILKIVKVYLAVKRQIQPGDKMAGRHGNKGVISKIN
HHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHCCCCHHHCCCCCCCCCCCCCC
PIEDMPYDQHGIPVDIVLNPLGVPSRMNIGQILETHLGMAAKGIGDKINFMLQQHKEANQ
CCCCCCCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHCHHHHCCCHHHHHHHHHHHHHHH
LRRFMQQAYNLGEGSRQHINLNSFSDIEILKLAKNLKKGMPIATPVFDGATEKEIKDLLK
HHHHHHHHHCCCCCCCCEEECCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHH
LSGLPSSGQITLFDGCTGEAFERQVTVGYMYMLKLNHLVDDKMHARSTGSYSLVTQQPLG
HCCCCCCCCEEEEECCCCHHHHHHHHHHHHEEHHHHHHHCCHHHHCCCCCEEEEECCCCC
GKAQFGGQRFGEMEVWALEAYGASYTLQEMLTVKSDDVNGRTKMYKNIVDGNHMMEPGMP
CCCCCCCCCCCCEEEEEEEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCCCCC
ESFNVLLKEIRSLAINIELED
HHHHHHHHHHHHEEEEEEECC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA