Definition: Anaplasma phagocytophilum HZ, complete genome.
Accession: NC_007797
Length: 1,471,282

The map label for this gene is rpoB [H]

Identifier: 88606872

GI number: 88606872

Start: 1086938

End: 1091089

Strand: Reverse

Name: rpoB [H]

Synonym: APH_1024

Alternate gene names: 88606872

Gene position: 1091089-1086938 (Counterclockwise)

Preceding gene: 88607605

Following gene: 88607105

Centisome position: 74.16
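
The coordinate fields above can be cross-checked with a little arithmetic. This is a minimal sketch, not the database's own method: it assumes 1-based inclusive coordinates, and it assumes the centisome position is the gene's first base on its own strand (1091089 for this reverse-strand gene) expressed as a percentage of the genome length. The function names are illustrative only.

```python
GENOME_LENGTH = 1_471_282          # "Length" field of the record
START, END = 1_086_938, 1_091_089  # "Start" and "End" fields

def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1

def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, rounded to 2 decimals."""
    return round(100.0 * position / genome_length, 2)

print(gene_length(START, END))        # 4152, matching the ">4152_bases" header
print(centisome(END, GENOME_LENGTH))  # 74.16, matching the centisome field
```

Using the Start coordinate (1086938) instead gives about 73.88, so under these assumptions the reported 74.16 appears to refer to the 5' end of the gene on the reverse strand.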

GC content: 43.83

Gene sequence:

>4152_bases
ATGTCTTCTGCGGGTGATTCTGGTCCCGGGTATGTGCTGAATGATTTTGATGCTGTTCCTAGGCTTTCTTACGCGAGGTC
TATAGACATTCGCGATTCTTTGAGTGATTTGATAAGAATACAGAGAGATTCTTATGATGCCTTCATAGGGATTGATGAGG
GTAGTAGTGGCGGTATACAAAGCATATTCCAGTCGATGTTTCCCATACGAGATCCTTTGGGGAGGGCTGTACTCGAGTTC
GTGAGTTGTAATATTGGGGAGCCTCAATATGATGAATACGAGTGCATAAAACGTGGGATAACGTTTTCTGTGCCGATGCG
CATAACTCTTCGTTTTGTTGTGTGGAAGGTGCAGGAGGTTTCATTCAAGGAAGTTAAGTACGTCGTAGATGAGGGCACTC
TTGAGAGGAGTGTTAAGTACATGAAGGAGCAGGAGGTGTCTATTGGCGATCTTCCGATGATGACATCATATGGAACCTTC
ATCATCAATGGTATAGAGAGGGTTATTGTTTCCCAGATGCATCGGTCTCCTGGTGTCTTTTTTGACAGTGATAAGGGGAA
GACATACAGTTCTGGGAAGCTAATTTATTCTGCGCGTATCATCCCTTACAGGGGCTCGTGGCTTGATTTTGAATTTGACA
TCAAGGACATCATTTACTTCCGAATAGATAAAAAGCGTAAACTCCCGGTCACCTACTTATTAAAGGCCCTTGGGATGTCT
AACAATGATATCCTAGACACATTTTATGACAAAGTTCTTTACGTAAGAAGTGATAAGGGTTGGAAGGTACCCTTTGTAGT
GGATCGCTTTAAGGGTGTGAGGCTGTCTTATGATCTTATGGACGTAGACGGTAATGTGCTGATTAAGGCGAATACTAGGA
TTACTCTCAGGATTGCTAAAAAGTTATATGCTGATGGCTTAAGGGAATATTTGGTGCCTTTCGCGGGGATTTCTGGGTTG
TTCGTGGCAACTGACTTGGTGGATCCTGCTAGTGGTGCTGTGATTGTTTCTGCGGGTGAGGCGATAGCTGCGGAGCACAT
TGTAAAATTAGAATTGTTTGATATCAGTGAGATAGCGTTTTTGAACATAGACTTTTTGACAGTTGGTCCTTATGTTCTGA
ATACTCTATTTTTGGACAGGCATATAACGCAAGAGGATGCGCTGTTTGAGATATATAGGGTATTGCGTTCTGGAGAATCT
CCAAATTTAGAAGCTGTGAAGTCATTCTTTAAAGGGCTCTTTTTTGAGCCAGATAGGTATGATTTGTCAGTTGTTGGAAG
AATTAAGCTGAACAGTCATTTGCGCTTAGATATAGATGAGAATCTCACTGTCCTTACTAAGGATGACATAGTACACGTAA
TAAAGAAGTTGGTTTTATTGCCTGATGGTGAGGGTGTAGTTGATGATATAGACCATCTTGGTAATAGGCGGGTAAGGTCT
GTAGGGGAGTTCATAGAGAATCAATTCCGGGTTGGTATATTGCGGCTCGAGAGAATGATTATGGATTACATGTCCTCGGT
TAACTTTGATAATGCGGTGCCGTGTGACTTTGTTAATCCTAAGATCTTGGCTACGGTGTTGAAAGACTTTTTCAGTTCTT
CGCAGCTTTCGCAGTTTATGGATCAGACTAATCCTCTCTCAGAGGTTACGCATAAGCGTAGGTTATCGGCGTTGGGTCCT
GGTGGTTTAACGAGAGAAAGGGCTGGTTTTGAAGTTCGTGACGTTCACCCTACGCACTATGGTAGGATATGTCCTATTGA
AACTCCGGAAGGACAGAATATTGGTCTTATTAGTAGTCTCGCTATATATGCTAAGATAAACAAGTATGGCTTTATTGAAA
GCCCGTACAGGAAGGTTATTGATGGCGTAGTTACTGATTCTGTAGAGTATCTTTTGGCTACACAGGAGAGTGACTATTAT
ATTGCTGATGCGGGCGCTGCCCTTGATGAGAATAACCGTTTTGTGGATGATATGCTGTATTGTCGCCATGGCGGTAACTT
CGTGATGGTGAAGCGTGAAGACGTGAATTACATTGATGTATCTCCAAAGCAGATAGTTTCTGTTGCAGCGTCGCTGATTC
CGTTTTTGGAAAACAATGACGCGAACCGTGCTCTAATGGGTTCAAATATGCAGCGTCAGGCGGTGCCATTACTAAAGGCT
GAAGCGCCTTTGGTGGGTACTGGTATGGAATCAGTTGTTGCGGCAGGTTCAGGTGCCGTCGTATTGGCTAAGCGCGATGG
TGTAGTACACAGGGTTGACGGGTCTTATATCGTAATCAGGGCTTTTGATAAGAATAAGGACGAGTATCTTGGTGTTGATA
TATACAAACTGAGAAAGTTCCAGCGTTCAAATCATAATACATGTATCAATCAGAGGCCTATAGTTAAGATAGGGGATTAC
GTTAGGACTAATGATGTTATTGCCGATGGTGCCGCTATAGATCGTGGTGAGTTGGCGTTGGGTAAAAATGTCCTTGTTGC
CTTTATGTCATGGCAGGGTTACAACTTTGAGGACTCGATAGTAATATCCAGTGATGTTGTGAAGAGGGATGTCTTCACTT
CAATTCACATAGAAGAGTTTGAATGTGTTGTGCGGGATACTCCGCTTGGTCCTGAGAAGATCATGCGTTCGGTGCCGGAT
GTGAATGAGGAAAGTCTCAGTCATCTGGATGATGTTGGTATTGTAAATATCGGGGCTGAGGTTTCTGCTGGTAGTGTTTT
GGTGGGTAAAGTGACTCCGAGACCGCCGGTTTCATTGCCGCCTGAGACGAAGCTTTTGGTGACAATTTTCGGTGAGAAGG
TATTCGATTGCGTTGATTCTTCTTTATATTTGCCGCCGGATGTTGAGGGAACGGTTATAGACGTTCATGTGTTTGTTAGA
AGGGGCGTTGAGGAGAATGATAGATCTTTGCTTATCAAGCAAAGTGAGGTGAATAGCTTCAGGAAGGAGCGTGACTACGA
AATAGATGTTGTCAGTGAGTACTTCTATGATGAGCTGAAGAAGTTGCTTTGTAGTGCTGATTTGCCGTTGAATGGTCATG
CTGATGTAGAGAGTCTTCTAGCTGCGAAGTCTTTGGAAGCGTTATGGGAAATTGGTCTATCGAATCCTAAGATATCTGCC
AAAGTTGCTGATATGAAGGGTAAATTTGATGAGTTGATTACTGAAGCTCATAGTAAATTTGACCAAAAGATAGACAAGCT
TAATTACGGCTATGATTTGCCGCAGGGTGTTCTGACTATTGTAAAGGTCTTTGTGGCTGTTAAGCATAATTTGCAGCCAG
GAGATAAAATGGCTGGTCGGCATGGTAACAAGGGGGTTATTTCGAGAATAGTTCCTGTTGAGGATATGCCGCATCTAGAG
GATGGTACGCCCGTGGATATAATCTTGAATTCTCTGGGTGTACCTTCGCGTATGAATATAGGGCAGATCCTTGAAACTCA
CTTGGGTTGGGCTGCAGTGAATTTGGGTCATAGGGTAGGTAGGATGCTTGATTCAGGGGAAGAAGAAGGGCCGGTAGTTG
AGAGTATTCGTAGCTTTTTGAGCGAAGTATATGAAGGGCAAAAGCTTAAGGAAGATGTGGCTTCTATGTCGGATGAGGCT
TTGCTGAAGTTTGCCAATAGGCTCAGAAGAGGTGTTCCTATGGCTGCTCCGGTGTTTGAGGGTCCGAAGGATGCGCAGAT
TTCCCGGCTTTTGGAATTAGCGGATGTTGATCCGTCTGGGCAGGTGGATCTTTATGATGGGCGTTCAGGGCAGAAGTTTG
ATCGCAAGGTAACTGTTGGATACATTTACATGTTGAAGCTCCATCACTTGGTGGATGACAAGATACATGCTAGGTCTGTT
GGTCCGTATGGTCTGGTTACTCAGCAACCTCTTGGAGGAAAGTCGCACTTTGGTGGGCAGAGATTTGGGGAAATGGAATG
CTGGGCATTGCAGGCCTATGGTGCTGCTTATACTTTGCAGGAAATGCTAACTGTCAAATCTGACGATATCGTAGGTAGGG
TAAGAATCTATGAATCCATAATTAAGGGGGATAGCAACTTCGAGTGTGGTATTCCTGAGTCGTTTAATGTCATGGTCAAG
GAGTTACGCTCGCTGTGCCTTGATGTTGTTCTAAAGCAGGATAAAGAGTTTACTAGTAGCAAGGTGGAGTAG
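
The GC content field can be recomputed from a sequence block like the one above. The sketch below is one plausible way to do it, assuming GC content means the percentage of G and C bases over the full sequence length; `gc_content` is a hypothetical helper name, and the demo input is only the first 20 bases of the gene.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace and line breaks are ignored."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)

# Demo on the first 20 bases of the gene sequence only.
print(gc_content("ATGTCTTCTGCGGGTGATTC"))
```

Pasting the full 4152-base block into `gc_content` should allow a comparison with the reported value of 43.83.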

Upstream 100 bases:

>100_bases
TGAAGCTGAGGAAGTGAAGAAGAAGCTTGAAGATGCCGGGGCTCAGGTGAGTTTGAAGTAATCACGGGTATTTTTGTTGT
AATGGTTTTTGAGGGAATTG

Downstream 100 bases:

>100_bases
GGATTTACAATTATGAAGACGTTGGATTTGTATGGCTATACCAGTATAGCACAGTCGTTCGATAAGATTTGCATATCCAT
AGCTAGTCCAGAAAGTATAA

Product: DNA-directed RNA polymerase subunit beta

Products: NA

Alternate protein names: RNAP subunit beta; RNA polymerase subunit beta; Transcriptase subunit beta [H]

Number of amino acids: Translated: 1383; Mature: 1382
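
The residue counts follow from the gene length: 4152 bases are 1384 codons, and the last one (TAG) is a stop codon, leaving 1383 translated residues. The mature sequence listed in this record lacks the leading M, which accounts for 1382. The sketch below translates a coding sequence with the standard codon assignments; the helper names are illustrative, and start-codon handling is deliberately left out.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def residues_from_cds_length(n_bases: int) -> int:
    """Residue count for a CDS that ends in a single stop codon."""
    return n_bases // 3 - 1

print(translate("ATGTCTTCTGCGGGTGATTCTGGTCCCGGG"))  # MSSAGDSGPG
print(residues_from_cds_length(4152))               # 1383
```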

Protein sequence:

>1383_residues
MSSAGDSGPGYVLNDFDAVPRLSYARSIDIRDSLSDLIRIQRDSYDAFIGIDEGSSGGIQSIFQSMFPIRDPLGRAVLEF
VSCNIGEPQYDEYECIKRGITFSVPMRITLRFVVWKVQEVSFKEVKYVVDEGTLERSVKYMKEQEVSIGDLPMMTSYGTF
IINGIERVIVSQMHRSPGVFFDSDKGKTYSSGKLIYSARIIPYRGSWLDFEFDIKDIIYFRIDKKRKLPVTYLLKALGMS
NNDILDTFYDKVLYVRSDKGWKVPFVVDRFKGVRLSYDLMDVDGNVLIKANTRITLRIAKKLYADGLREYLVPFAGISGL
FVATDLVDPASGAVIVSAGEAIAAEHIVKLELFDISEIAFLNIDFLTVGPYVLNTLFLDRHITQEDALFEIYRVLRSGES
PNLEAVKSFFKGLFFEPDRYDLSVVGRIKLNSHLRLDIDENLTVLTKDDIVHVIKKLVLLPDGEGVVDDIDHLGNRRVRS
VGEFIENQFRVGILRLERMIMDYMSSVNFDNAVPCDFVNPKILATVLKDFFSSSQLSQFMDQTNPLSEVTHKRRLSALGP
GGLTRERAGFEVRDVHPTHYGRICPIETPEGQNIGLISSLAIYAKINKYGFIESPYRKVIDGVVTDSVEYLLATQESDYY
IADAGAALDENNRFVDDMLYCRHGGNFVMVKREDVNYIDVSPKQIVSVAASLIPFLENNDANRALMGSNMQRQAVPLLKA
EAPLVGTGMESVVAAGSGAVVLAKRDGVVHRVDGSYIVIRAFDKNKDEYLGVDIYKLRKFQRSNHNTCINQRPIVKIGDY
VRTNDVIADGAAIDRGELALGKNVLVAFMSWQGYNFEDSIVISSDVVKRDVFTSIHIEEFECVVRDTPLGPEKIMRSVPD
VNEESLSHLDDVGIVNIGAEVSAGSVLVGKVTPRPPVSLPPETKLLVTIFGEKVFDCVDSSLYLPPDVEGTVIDVHVFVR
RGVEENDRSLLIKQSEVNSFRKERDYEIDVVSEYFYDELKKLLCSADLPLNGHADVESLLAAKSLEALWEIGLSNPKISA
KVADMKGKFDELITEAHSKFDQKIDKLNYGYDLPQGVLTIVKVFVAVKHNLQPGDKMAGRHGNKGVISRIVPVEDMPHLE
DGTPVDIILNSLGVPSRMNIGQILETHLGWAAVNLGHRVGRMLDSGEEEGPVVESIRSFLSEVYEGQKLKEDVASMSDEA
LLKFANRLRRGVPMAAPVFEGPKDAQISRLLELADVDPSGQVDLYDGRSGQKFDRKVTVGYIYMLKLHHLVDDKIHARSV
GPYGLVTQQPLGGKSHFGGQRFGEMECWALQAYGAAYTLQEMLTVKSDDIVGRVRIYESIIKGDSNFECGIPESFNVMVK
ELRSLCLDVVLKQDKEFTSSKVE

Sequences:

>Translated_1383_residues
MSSAGDSGPGYVLNDFDAVPRLSYARSIDIRDSLSDLIRIQRDSYDAFIGIDEGSSGGIQSIFQSMFPIRDPLGRAVLEF
VSCNIGEPQYDEYECIKRGITFSVPMRITLRFVVWKVQEVSFKEVKYVVDEGTLERSVKYMKEQEVSIGDLPMMTSYGTF
IINGIERVIVSQMHRSPGVFFDSDKGKTYSSGKLIYSARIIPYRGSWLDFEFDIKDIIYFRIDKKRKLPVTYLLKALGMS
NNDILDTFYDKVLYVRSDKGWKVPFVVDRFKGVRLSYDLMDVDGNVLIKANTRITLRIAKKLYADGLREYLVPFAGISGL
FVATDLVDPASGAVIVSAGEAIAAEHIVKLELFDISEIAFLNIDFLTVGPYVLNTLFLDRHITQEDALFEIYRVLRSGES
PNLEAVKSFFKGLFFEPDRYDLSVVGRIKLNSHLRLDIDENLTVLTKDDIVHVIKKLVLLPDGEGVVDDIDHLGNRRVRS
VGEFIENQFRVGILRLERMIMDYMSSVNFDNAVPCDFVNPKILATVLKDFFSSSQLSQFMDQTNPLSEVTHKRRLSALGP
GGLTRERAGFEVRDVHPTHYGRICPIETPEGQNIGLISSLAIYAKINKYGFIESPYRKVIDGVVTDSVEYLLATQESDYY
IADAGAALDENNRFVDDMLYCRHGGNFVMVKREDVNYIDVSPKQIVSVAASLIPFLENNDANRALMGSNMQRQAVPLLKA
EAPLVGTGMESVVAAGSGAVVLAKRDGVVHRVDGSYIVIRAFDKNKDEYLGVDIYKLRKFQRSNHNTCINQRPIVKIGDY
VRTNDVIADGAAIDRGELALGKNVLVAFMSWQGYNFEDSIVISSDVVKRDVFTSIHIEEFECVVRDTPLGPEKIMRSVPD
VNEESLSHLDDVGIVNIGAEVSAGSVLVGKVTPRPPVSLPPETKLLVTIFGEKVFDCVDSSLYLPPDVEGTVIDVHVFVR
RGVEENDRSLLIKQSEVNSFRKERDYEIDVVSEYFYDELKKLLCSADLPLNGHADVESLLAAKSLEALWEIGLSNPKISA
KVADMKGKFDELITEAHSKFDQKIDKLNYGYDLPQGVLTIVKVFVAVKHNLQPGDKMAGRHGNKGVISRIVPVEDMPHLE
DGTPVDIILNSLGVPSRMNIGQILETHLGWAAVNLGHRVGRMLDSGEEEGPVVESIRSFLSEVYEGQKLKEDVASMSDEA
LLKFANRLRRGVPMAAPVFEGPKDAQISRLLELADVDPSGQVDLYDGRSGQKFDRKVTVGYIYMLKLHHLVDDKIHARSV
GPYGLVTQQPLGGKSHFGGQRFGEMECWALQAYGAAYTLQEMLTVKSDDIVGRVRIYESIIKGDSNFECGIPESFNVMVK
ELRSLCLDVVLKQDKEFTSSKVE
>Mature_1382_residues
SSAGDSGPGYVLNDFDAVPRLSYARSIDIRDSLSDLIRIQRDSYDAFIGIDEGSSGGIQSIFQSMFPIRDPLGRAVLEFV
SCNIGEPQYDEYECIKRGITFSVPMRITLRFVVWKVQEVSFKEVKYVVDEGTLERSVKYMKEQEVSIGDLPMMTSYGTFI
INGIERVIVSQMHRSPGVFFDSDKGKTYSSGKLIYSARIIPYRGSWLDFEFDIKDIIYFRIDKKRKLPVTYLLKALGMSN
NDILDTFYDKVLYVRSDKGWKVPFVVDRFKGVRLSYDLMDVDGNVLIKANTRITLRIAKKLYADGLREYLVPFAGISGLF
VATDLVDPASGAVIVSAGEAIAAEHIVKLELFDISEIAFLNIDFLTVGPYVLNTLFLDRHITQEDALFEIYRVLRSGESP
NLEAVKSFFKGLFFEPDRYDLSVVGRIKLNSHLRLDIDENLTVLTKDDIVHVIKKLVLLPDGEGVVDDIDHLGNRRVRSV
GEFIENQFRVGILRLERMIMDYMSSVNFDNAVPCDFVNPKILATVLKDFFSSSQLSQFMDQTNPLSEVTHKRRLSALGPG
GLTRERAGFEVRDVHPTHYGRICPIETPEGQNIGLISSLAIYAKINKYGFIESPYRKVIDGVVTDSVEYLLATQESDYYI
ADAGAALDENNRFVDDMLYCRHGGNFVMVKREDVNYIDVSPKQIVSVAASLIPFLENNDANRALMGSNMQRQAVPLLKAE
APLVGTGMESVVAAGSGAVVLAKRDGVVHRVDGSYIVIRAFDKNKDEYLGVDIYKLRKFQRSNHNTCINQRPIVKIGDYV
RTNDVIADGAAIDRGELALGKNVLVAFMSWQGYNFEDSIVISSDVVKRDVFTSIHIEEFECVVRDTPLGPEKIMRSVPDV
NEESLSHLDDVGIVNIGAEVSAGSVLVGKVTPRPPVSLPPETKLLVTIFGEKVFDCVDSSLYLPPDVEGTVIDVHVFVRR
GVEENDRSLLIKQSEVNSFRKERDYEIDVVSEYFYDELKKLLCSADLPLNGHADVESLLAAKSLEALWEIGLSNPKISAK
VADMKGKFDELITEAHSKFDQKIDKLNYGYDLPQGVLTIVKVFVAVKHNLQPGDKMAGRHGNKGVISRIVPVEDMPHLED
GTPVDIILNSLGVPSRMNIGQILETHLGWAAVNLGHRVGRMLDSGEEEGPVVESIRSFLSEVYEGQKLKEDVASMSDEAL
LKFANRLRRGVPMAAPVFEGPKDAQISRLLELADVDPSGQVDLYDGRSGQKFDRKVTVGYIYMLKLHHLVDDKIHARSVG
PYGLVTQQPLGGKSHFGGQRFGEMECWALQAYGAAYTLQEMLTVKSDDIVGRVRIYESIIKGDSNFECGIPESFNVMVKE
LRSLCLDVVLKQDKEFTSSKVE

Specific function: DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates [H]

COG id: COG0085

COG function: function code K; DNA-directed RNA polymerase, beta subunit/140 kD subunit

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the RNA polymerase beta chain family [H]

Homologues:

Organism=Homo sapiens, GI33469941, Length=267, Percent_Identity=32.2097378277154, Blast_Score=118, Evalue=4e-26,
Organism=Homo sapiens, GI212286172, Length=267, Percent_Identity=32.2097378277154, Blast_Score=117, Evalue=6e-26,
Organism=Homo sapiens, GI4505941, Length=237, Percent_Identity=32.0675105485232, Blast_Score=103, Evalue=2e-21,
Organism=Homo sapiens, GI238908503, Length=139, Percent_Identity=33.8129496402878, Blast_Score=72, Evalue=3e-12,
Organism=Homo sapiens, GI238908505, Length=139, Percent_Identity=33.8129496402878, Blast_Score=72, Evalue=3e-12,
Organism=Escherichia coli, GI1790419, Length=1346, Percent_Identity=53.0460624071322, Blast_Score=1419, Evalue=0.0,
Organism=Caenorhabditis elegans, GI17506623, Length=235, Percent_Identity=30.6382978723404, Blast_Score=107, Evalue=4e-23,
Organism=Caenorhabditis elegans, GI17552304, Length=237, Percent_Identity=30.8016877637131, Blast_Score=102, Evalue=1e-21,
Organism=Caenorhabditis elegans, GI25144348, Length=296, Percent_Identity=27.3648648648649, Blast_Score=97, Evalue=5e-20,
Organism=Saccharomyces cerevisiae, GI6324725, Length=318, Percent_Identity=30.188679245283, Blast_Score=113, Evalue=2e-25,
Organism=Saccharomyces cerevisiae, GI6325267, Length=251, Percent_Identity=29.8804780876494, Blast_Score=106, Evalue=3e-23,
Organism=Saccharomyces cerevisiae, GI6324781, Length=249, Percent_Identity=31.3253012048193, Blast_Score=105, Evalue=5e-23,
Organism=Drosophila melanogaster, GI17136444, Length=262, Percent_Identity=29.3893129770992, Blast_Score=102, Evalue=3e-21,
Organism=Drosophila melanogaster, GI17647877, Length=249, Percent_Identity=30.1204819277108, Blast_Score=98, Evalue=4e-20,
Organism=Drosophila melanogaster, GI17136446, Length=306, Percent_Identity=28.4313725490196, Blast_Score=98, Evalue=5e-20,
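
The homologue lines above share a fixed field layout, so they can be read into structured records for sorting or filtering. This is a minimal sketch that assumes every line follows exactly the pattern shown in this record; `Hit` and `parse_hit` are hypothetical names, not part of any database tooling.

```python
import re
from typing import NamedTuple

class Hit(NamedTuple):
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float

# Field order assumed from the lines in this record.
HIT_RE = re.compile(
    r"Organism=(?P<organism>[^,]+), GI(?P<gi>\d+), Length=(?P<length>\d+), "
    r"Percent_Identity=(?P<pid>[\d.]+), Blast_Score=(?P<score>\d+), "
    r"Evalue=(?P<evalue>[^,\s]+)"
)

def parse_hit(line: str) -> Hit:
    """Parse one homologue line; raise ValueError if the layout differs."""
    m = HIT_RE.match(line.strip())
    if m is None:
        raise ValueError(f"unrecognised homologue line: {line!r}")
    return Hit(
        organism=m["organism"],
        gi=m["gi"],
        length=int(m["length"]),
        percent_identity=float(m["pid"]),
        blast_score=int(m["score"]),
        evalue=float(m["evalue"]),
    )

line = ("Organism=Escherichia coli, GI1790419, Length=1346, "
        "Percent_Identity=53.0460624071322, Blast_Score=1419, Evalue=0.0,")
hit = parse_hit(line)
print(hit.organism, hit.length, round(hit.percent_identity, 1))
```

Sorting the parsed hits by `blast_score` would put the Escherichia coli match first, since it is the only near full-length alignment in the list.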

Paralogues:

None

Copy number: 4,233 molecules/cell in growth phase, glucose-minimal MOPS media; 2,500 molecules/cell in glucose minimal media [C]

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR010243
- InterPro:   IPR019462
- InterPro:   IPR015712
- InterPro:   IPR007120
- InterPro:   IPR007121
- InterPro:   IPR007644
- InterPro:   IPR007642
- InterPro:   IPR007645
- InterPro:   IPR007641
- InterPro:   IPR014724 [H]

Pfam domain/function: PF04563 RNA_pol_Rpb2_1; PF04561 RNA_pol_Rpb2_2; PF04565 RNA_pol_Rpb2_3; PF10385 RNA_pol_Rpb2_45; PF00562 RNA_pol_Rpb2_6; PF04560 RNA_pol_Rpb2_7 [H]

EC number: 2.7.7.6 [H]

Molecular weight: Translated: 154525; Mature: 154394
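
The two molecular weights differ by roughly the mass of one methionine residue, which is consistent with the mature form being the translated form without its initiator Met. The short check below assumes an average residue mass of about 131.19 Da for methionine (free amino acid mass minus one water); that constant is not taken from this record.

```python
MET_RESIDUE_MASS = 131.19  # Da, average mass of a Met residue (assumed constant)

TRANSLATED_MW = 154525  # "Translated" value from the record
MATURE_MW = 154394      # "Mature" value from the record

difference = TRANSLATED_MW - MATURE_MW
print(difference)                                 # 131
print(abs(difference - MET_RESIDUE_MASS) < 1.0)   # True
```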

Theoretical pI: Translated: 5.15; Mature: 5.15

Prosite motif: PS01166 RNA_POL_BETA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.9 %Cys     (Translated Protein)
2.2 %Met     (Translated Protein)
3.1 %Cys+Met (Translated Protein)
0.9 %Cys     (Mature Protein)
2.2 %Met     (Mature Protein)
3.0 %Cys+Met (Mature Protein)
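
Percentages like these can be recomputed from the protein sequences listed earlier. The sketch below assumes each value is the residue count divided by the sequence length, rounded to one decimal; `residue_percent` is a hypothetical helper, and the demo uses only the first 20 residues of the translated protein.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the given one-letter residues in a protein sequence."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in residues)
    return round(100.0 * count / len(protein), 1)

demo = "MSSAGDSGPGYVLNDFDAVP"  # first 20 residues only
print(residue_percent(demo, "C"))   # Cys
print(residue_percent(demo, "M"))   # Met
print(residue_percent(demo, "CM"))  # Cys+Met
```

Under that assumption, the small drop in Cys+Met from 3.1 to 3.0 between the translated and mature forms would be expected when a single Met is removed from the N-terminus.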

Secondary structure:

>Translated Secondary Structure
MSSAGDSGPGYVLNDFDAVPRLSYARSIDIRDSLSDLIRIQRDSYDAFIGIDEGSSGGIQ
CCCCCCCCCCCEECCCCCCCCHHHHHCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHH
SIFQSMFPIRDPLGRAVLEFVSCNIGEPQYDEYECIKRGITFSVPMRITLRFVVWKVQEV
HHHHHHCCCCCHHHHHHHHHHHCCCCCCCCCHHHHHHCCCEEECCHHEEEEEEEHHHHHC
SFKEVKYVVDEGTLERSVKYMKEQEVSIGDLPMMTSYGTFIINGIERVIVSQMHRSPGVF
CHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHCCCCCE
FDSDKGKTYSSGKLIYSARIIPYRGSWLDFEFDIKDIIYFRIDKKRKLPVTYLLKALGMS
EECCCCCEECCCEEEEEEEEEEECCCEEEEEECCCEEEEEEECCCCCCCHHHHHHHHCCC
NNDILDTFYDKVLYVRSDKGWKVPFVVDRFKGVRLSYDLMDVDGNVLIKANTRITLRIAK
CCCHHHHHHHCEEEEECCCCCCCCEEHHCCCCCEEEEEEEECCCCEEEEECCEEEEHHHH
KLYADGLREYLVPFAGISGLFVATDLVDPASGAVIVSAGEAIAAEHIVKLELFDISEIAF
HHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCEEEEECCCHHHHHHEEEEEEECCCEEEE
LNIDFLTVGPYVLNTLFLDRHITQEDALFEIYRVLRSGESPNLEAVKSFFKGLFFEPDRY
EEEEHEEECHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCCCC
DLSVVGRIKLNSHLRLDIDENLTVLTKDDIVHVIKKLVLLPDGEGVVDDIDHLGNRRVRS
CEEEEEEEEECCEEEEEECCCEEEEEHHHHHHHHHHHHCCCCCCCHHHHHHHCCCHHHHH
VGEFIENQFRVGILRLERMIMDYMSSVNFDNAVPCDFVNPKILATVLKDFFSSSQLSQFM
HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHCHHHHHHHH
DQTNPLSEVTHKRRLSALGPGGLTRERAGFEVRDVHPTHYGRICPIETPEGQNIGLISSL
HCCCCHHHHHHHHHHHHCCCCCCCHHHCCCEEEECCCCCCCCCCCCCCCCCCCCCHHHHH
AIYAKINKYGFIESPYRKVIDGVVTDSVEYLLATQESDYYIADAGAALDENNRFVDDMLY
EEEEECCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCEEEEECCCCCCCCCCHHHHHHH
CRHGGNFVMVKREDVNYIDVSPKQIVSVAASLIPFLENNDANRALMGSNMQRQAVPLLKA
HEECCCEEEEEECCCCEEECCHHHHHHHHHHHHHHHCCCCCCCCHHCCCCHHHHCCHHHC
EAPLVGTGMESVVAAGSGAVVLAKRDGVVHRVDGSYIVIRAFDKNKDEYLGVDIYKLRKF
CCCEECCCHHHHHHCCCCEEEEEECCCEEEEECCCEEEEEEECCCCCCEECCHHHHHHHH
QRSNHNTCINQRPIVKIGDYVRTNDVIADGAAIDRGELALGKNVLVAFMSWQGYNFEDSI
HHCCCCCCCCCCCCEEECCEEECCCEEECCCCCCCCCHHCCCCEEEEEEECCCCCCCCCE
VISSDVVKRDVFTSIHIEEFECVVRDTPLGPEKIMRSVPDVNEESLSHLDDVGIVNIGAE
EEEHHHHHHHHHHEEEHHHHEEEEECCCCCHHHHHHHCCCCCHHHHHHHCCCCEEEECCC
VSAGSVLVGKVTPRPPVSLPPETKLLVTIFGEKVFDCVDSSLYLPPDVEGTVIDVHVFVR
CCCCCEEEECCCCCCCCCCCCCCEEEEEEHHHHHHHHHCCCEECCCCCCCCEEEEEEEEC
RGVEENDRSLLIKQSEVNSFRKERDYEIDVVSEYFYDELKKLLCSADLPLNGHADVESLL
CCCCCCCCEEEEEHHHHHHHHHHCCCEEHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHH
AAKSLEALWEIGLSNPKISAKVADMKGKFDELITEAHSKFDQKIDKLNYGYDLPQGVLTI
HHHHHHHHHHHCCCCCCCEEEEEHHCCHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHH
VKVFVAVKHNLQPGDKMAGRHGNKGVISRIVPVEDMPHLEDGTPVDIILNSLGVPSRMNI
HHHHHHHHCCCCCCHHHCCCCCCCCHHHEECCCCCCCCCCCCCCHHHHHHHCCCCCCCCH
GQILETHLGWAAVNLGHRVGRMLDSGEEEGPVVESIRSFLSEVYEGQKLKEDVASMSDEA
HHHHHHHHCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHCCCHHH
LLKFANRLRRGVPMAAPVFEGPKDAQISRLLELADVDPSGQVDLYDGRSGQKFDRKVTVG
HHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCEEEHH
YIYMLKLHHLVDDKIHARSVGPYGLVTQQPLGGKSHFGGQRFGEMECWALQAYGAAYTLQ
HHHHHHHHHHHCCCHHHCCCCCCCCEECCCCCCCCCCCCCCCCCCEEEEEEHHCHHHHHH
EMLTVKSDDIVGRVRIYESIIKGDSNFECGIPESFNVMVKELRSLCLDVVLKQDKEFTSS
HHHHHCCCCHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHC
KVE
CCC
>Mature Secondary Structure 
SSAGDSGPGYVLNDFDAVPRLSYARSIDIRDSLSDLIRIQRDSYDAFIGIDEGSSGGIQ
CCCCCCCCCCEECCCCCCCCHHHHHCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHH
SIFQSMFPIRDPLGRAVLEFVSCNIGEPQYDEYECIKRGITFSVPMRITLRFVVWKVQEV
HHHHHHCCCCCHHHHHHHHHHHCCCCCCCCCHHHHHHCCCEEECCHHEEEEEEEHHHHHC
SFKEVKYVVDEGTLERSVKYMKEQEVSIGDLPMMTSYGTFIINGIERVIVSQMHRSPGVF
CHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHCCCCCE
FDSDKGKTYSSGKLIYSARIIPYRGSWLDFEFDIKDIIYFRIDKKRKLPVTYLLKALGMS
EECCCCCEECCCEEEEEEEEEEECCCEEEEEECCCEEEEEEECCCCCCCHHHHHHHHCCC
NNDILDTFYDKVLYVRSDKGWKVPFVVDRFKGVRLSYDLMDVDGNVLIKANTRITLRIAK
CCCHHHHHHHCEEEEECCCCCCCCEEHHCCCCCEEEEEEEECCCCEEEEECCEEEEHHHH
KLYADGLREYLVPFAGISGLFVATDLVDPASGAVIVSAGEAIAAEHIVKLELFDISEIAF
HHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCEEEEECCCHHHHHHEEEEEEECCCEEEE
LNIDFLTVGPYVLNTLFLDRHITQEDALFEIYRVLRSGESPNLEAVKSFFKGLFFEPDRY
EEEEHEEECHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCCCC
DLSVVGRIKLNSHLRLDIDENLTVLTKDDIVHVIKKLVLLPDGEGVVDDIDHLGNRRVRS
CEEEEEEEEECCEEEEEECCCEEEEEHHHHHHHHHHHHCCCCCCCHHHHHHHCCCHHHHH
VGEFIENQFRVGILRLERMIMDYMSSVNFDNAVPCDFVNPKILATVLKDFFSSSQLSQFM
HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHCHHHHHHHH
DQTNPLSEVTHKRRLSALGPGGLTRERAGFEVRDVHPTHYGRICPIETPEGQNIGLISSL
HCCCCHHHHHHHHHHHHCCCCCCCHHHCCCEEEECCCCCCCCCCCCCCCCCCCCCHHHHH
AIYAKINKYGFIESPYRKVIDGVVTDSVEYLLATQESDYYIADAGAALDENNRFVDDMLY
EEEEECCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCEEEEECCCCCCCCCCHHHHHHH
CRHGGNFVMVKREDVNYIDVSPKQIVSVAASLIPFLENNDANRALMGSNMQRQAVPLLKA
HEECCCEEEEEECCCCEEECCHHHHHHHHHHHHHHHCCCCCCCCHHCCCCHHHHCCHHHC
EAPLVGTGMESVVAAGSGAVVLAKRDGVVHRVDGSYIVIRAFDKNKDEYLGVDIYKLRKF
CCCEECCCHHHHHHCCCCEEEEEECCCEEEEECCCEEEEEEECCCCCCEECCHHHHHHHH
QRSNHNTCINQRPIVKIGDYVRTNDVIADGAAIDRGELALGKNVLVAFMSWQGYNFEDSI
HHCCCCCCCCCCCCEEECCEEECCCEEECCCCCCCCCHHCCCCEEEEEEECCCCCCCCCE
VISSDVVKRDVFTSIHIEEFECVVRDTPLGPEKIMRSVPDVNEESLSHLDDVGIVNIGAE
EEEHHHHHHHHHHEEEHHHHEEEEECCCCCHHHHHHHCCCCCHHHHHHHCCCCEEEECCC
VSAGSVLVGKVTPRPPVSLPPETKLLVTIFGEKVFDCVDSSLYLPPDVEGTVIDVHVFVR
CCCCCEEEECCCCCCCCCCCCCCEEEEEEHHHHHHHHHCCCEECCCCCCCCEEEEEEEEC
RGVEENDRSLLIKQSEVNSFRKERDYEIDVVSEYFYDELKKLLCSADLPLNGHADVESLL
CCCCCCCCEEEEEHHHHHHHHHHCCCEEHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHH
AAKSLEALWEIGLSNPKISAKVADMKGKFDELITEAHSKFDQKIDKLNYGYDLPQGVLTI
HHHHHHHHHHHCCCCCCCEEEEEHHCCHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHH
VKVFVAVKHNLQPGDKMAGRHGNKGVISRIVPVEDMPHLEDGTPVDIILNSLGVPSRMNI
HHHHHHHHCCCCCCHHHCCCCCCCCHHHEECCCCCCCCCCCCCCHHHHHHHCCCCCCCCH
GQILETHLGWAAVNLGHRVGRMLDSGEEEGPVVESIRSFLSEVYEGQKLKEDVASMSDEA
HHHHHHHHCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHCCCHHH
LLKFANRLRRGVPMAAPVFEGPKDAQISRLLELADVDPSGQVDLYDGRSGQKFDRKVTVG
HHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCEEEHH
YIYMLKLHHLVDDKIHARSVGPYGLVTQQPLGGKSHFGGQRFGEMECWALQAYGAAYTLQ
HHHHHHHHHHHCCCHHHCCCCCCCCEECCCCCCCCCCCCCCCCCCEEEEEEHHCHHHHHH
EMLTVKSDDIVGRVRIYESIIKGDSNFECGIPESFNVMVKELRSLCLDVVLKQDKEFTSS
HHHHHCCCCHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHC
KVE
CCC
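
The secondary structure blocks above interleave a row of residues with a row of per-residue state letters. The sketch below separates the two rows and summarises the state composition. It assumes the usual three-state alphabet (H for helix, E for strand, C for coil), which this record does not define explicitly, and the function names are illustrative.

```python
from collections import Counter

def split_interleaved(lines):
    """Split alternating sequence/state rows into two joined strings."""
    rows = [ln.strip() for ln in lines if ln.strip()]
    if len(rows) % 2:
        raise ValueError("expected an even number of rows")
    sequence = "".join(rows[0::2])
    states = "".join(rows[1::2])
    if len(sequence) != len(states):
        raise ValueError("sequence and state rows differ in length")
    return sequence, states

def state_fractions(states: str) -> dict:
    """Percentage of each state letter (H, E, C), rounded to one decimal."""
    counts = Counter(states)
    total = len(states)
    return {s: round(100.0 * counts.get(s, 0) / total, 1) for s in "HEC"}

# Demo on the final pair of rows from the block above.
sequence, states = split_interleaved(["KVE", "CCC"])
print(sequence, states)
print(state_fractions("HHEC"))
```

Feeding every row under a ">... Secondary Structure" header into `split_interleaved` gives a full-length state string whose composition can be compared with the "Structure class" field.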

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA