Definition: Akkermansia muciniphila ATCC BAA-835, complete genome.
Accession: NC_010655
Length: 2,664,102 bp

The map label for this gene is rpoC

Identifier: 187735535

GI number: 187735535

Start: 1238246

End: 1242442

Strand: Reverse

Name: rpoC

Synonym: Amuc_1040

Alternate gene names: 187735535

Gene position: 1242442-1238246 (Counterclockwise)

Preceding gene: 187735536

Following gene: 187735534

Centisome position: 46.64
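
The coordinates above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates, and it assumes that the centisome position is the first base of the gene in transcription order (1242442 on the reverse strand) expressed as a percentage of the genome length. The record does not define the term, so that reading is an assumption, and the function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 2_664_102
START, END = 1_238_246, 1_242_442


def gene_length(start, end):
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(position, genome_length):
    """Position as a percentage of the genome length (assumed definition)."""
    return round(100.0 * position / genome_length, 2)


print(gene_length(START, END))          # 4197
print(centisome(END, GENOME_LENGTH))    # 46.64
```

Under these assumptions the length agrees with the 4197-base gene sequence reported in this record, and the percentage agrees with the listed centisome value.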

GC content: 57.56
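
A GC content figure of this kind is usually the share of G and C bases in the gene sequence, as a percentage. The following is a minimal sketch under that assumption; `gc_content` is a hypothetical helper, not a function of any tool named in this record, and the short input is only the first 12 bases of the gene.

```python
def gc_content(sequence):
    """Percentage of G and C bases; whitespace and line breaks are ignored."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 12 bases of the gene sequence, used here only as a small example.
print(round(gc_content("ATGTCTGACACC"), 2))  # 50.0
```

Applying the same function to the full 4197-base sequence would be the way to compare against the value listed above.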

Gene sequence:

>4197_bases
ATGTCTGACACCCCCACCATCAGGGAAATGCACGGCCTGAGCGACAAGCCCCGGACCTTTGACCAGGTTGCCATCACCGT
GGCGGATCCGGATACCATCCGCAGCTGGTCATTCGGTGAAGTCGTCAACCCGGAAACCATCAACTACCGCACGTTCAAGC
CGGAAAAAGGCGGCCTGTTCTGCGAACGCATCTTCGGGCCCACCCGAGACATGGAATGCGCCTGCGGCAAGTACAAGCGC
ATCAAGCATAAGGGCATCACCTGCGACCGCTGCGGCGTGGAAGTAACCAACGCGCGCGTGCGCCGCGAACGAATGGGCCA
TATTGAACTGGCCGTTCCGGTTTCCCATATCTGGTTTTACAAATGCATGCCCAGCCGCATTGGCCTTATGCTGGACATGA
CAGCCCGCCATTTGGAACGCGTGATTTACTATGAAGACTACATCGTGGTAGATCCCGGCAGTACCCCTCTGGAAAAGGGG
GCCATCCTGACGGAAGAAGAATTCCGCAATGCGGAAGACGAATACGGCTATGACAGCTTTGAAGCCGGCATGGGCGCGGA
AGCCATCCAGAAAATGCTGGCGGCCATTGATCTGCCCACCCTCGTCGCGGATCTTCAGGAACAGCTGGACAATACCAACT
CCAAACAGAACAAGCGCAAGATTGCCAAGCGTCTGAAACTGGCCCAGGGGTTCCTTCAGTCCAACACACGCCCGGAATGG
ATGATTTTGAACGTTCTGCCCGTCATTCCTCCGGACCTGCGCCCGCTGGTTCCTCTGGAAGGCGGCCGTTTCGCAACGTC
CGACCTGAATGACCTGTACCGCCGCGTCATCAACCGCAACAACCGCCTGAAAACCCTCCTGAGCCTCAAAACTCCAGAAG
TCATCATCCGTAATGAAAAACGCATGCTTCAGGAAGCCGTGGATGCCCTGTTTGACAACGGCCGCCACGGCCGTGCCGTC
ACCGGCGCCGGCAACCGCCCCCTCAAATCCCTCTCCGACATGCTGAAGGGCAAGGGAGGCCGTTTCCGCCAGAACCTGCT
CGGCAAGCGCGTGGACTACTCCGGCCGCTCCGTTATCGTCATCGGCCCGGAATTGAAACTCAACCAGTGCGGTCTTCCCA
AGAAGATGGCGCTCATTCTGTTTGAACCCTTCATCATCCACCGTCTGAAAGAGCTGGGTTACGTGCACACGGTGCGCTCC
GCCAAGAAGCTCATTGACCGCAAGACGCCGGAAGTGTGGGATATTCTGGAAGAAGTGACCAAGGGCCACCCGGTCATGCT
CAACCGCGCGCCCACCCTGCACCGCCTCTCCATCCAGGCTTTTGAACCGGTTCTGATTGAAGGTTCCGCCATCCGTCTGC
ACCCGCTCGTCTGTAATGCGTACAACGCGGACTTCGACGGCGACCAGATGGCTGTGCACGTGCCTCTGTCCGTGGAAGCG
CAGATGGAAGCCCGGCAGCTCATGCTGGCGCCCAACAATATTTTCTCCCCTGCTTCCGGCAAGCCCATTGCCACACCCAC
GCAGGACATCATTCTGGGCGCGTACTTCCTGACGCATACCCGTGCTGCGGAAGTACAGAACAATCAGGATAATCATCACC
ATCTTCCCCTCTTCGAATCCATTGACGAGGTGGAATACGCCATTGCCGCCCGCAAAATCGGCTACCATGACTGGATCCGC
CTGCACAACCCGGACTACGGCAAAAAGCCTTCCGAAGTAGTGTATGGGGATGTCACCAAGAAGGTTATCATCACTACTGC
CGGACGCGTGCGTTTCAATGAAATCTGGCCCCGGGAACTCGGTTACATTAACCGCAACGTAGGCAAGAAACAGATGGGCG
ACATCATCTGGCGCTGCTACCAGACCGTCGGCAAGGAACGTACCGTGCAGACTCTGGACGCCCTGAAAAACCTGGGCTTC
AAGGAAGCAACCCGTTCCGGCTGCTCCATCGGCATCGTGGACATGGTGGTTCCCTCCCAGAAAAAGACGGAAATTGAAAA
AGCCTATGCGGAGCTGGACAAGGTGACCCGCCAGTATAAGAACGGTATTATCACGGATGGGGAACGCTACCAGAAGGTGG
TGGACATCTGGACCCAGACTACGGATGTCATCCAGGCGGCTCTGTACCGCAAGCTGGAACACAACGAAGGCTCCAAGATG
GCCAGCCCGCTCTTCATGATGGTGGACTCCGGAGCCCGAGGCAACAAGGCGCAGATCAAGCAGCTCTCCGGCATGCGCGG
TTTGATGGCGAAACCCAGCGGCGAAATTATCGAACGCCCCATCACGGCCAACTTCCGTGAAGGCCTTTCCGTGCTGGAAT
ACTTCATCTCCACCCACGGCGCCCGCAAGGGTCTGGCAGATACCGCGCTGAAAACGGCGGACTCCGGCTACATGACCCGC
AAACTCGTGGACGTGGCCCAGGATGTCATCGTCCATGCGGAAGATTGCGGCACCAGCAACGGCATCACCGTTCACGCCAT
CTATGACGGCGACGAAGAAGTGGCGTCCCTTTCCTCCCGTATCTACGGCCGGACTTCCTGCGAACGCATCGTTGACCCCG
TCAGCGGCGAGGTTATCGTAGACATCAACGACCTCATTAACGAAAAGCAGGCGGAACAACTGGAAAAAATCGGCATTGAA
CGGCTGAAAATCCGCTCCGTACTCACCTGCGAACTCAAAAAGGGCTGCTGTGCCAAGTGCTACGGCCTGAACCTGGCCAC
CGGACAGGAAGTGAAGATCGGGGAAGCGGTCGGCATTATTGCCGCCCAGTCCATCGGCGAACCCGGCACGCAGCTCACCA
TGCGTACGTTCCACGTGGGCGGAACGGCTACCACGGCGTTCAAGCAGCCCATCGTGAAAGCCAAGAACGACGGCCGCGTC
ATCTACACGGAAGATCTCCGCACGGTGGAAAACGCAGACGGCAACTTCGTCGTCCTGAATAAAAACTGCTCTGTCCGCAT
CGAAAACGAACAGGGCCGCGAACTGGAATCCTACCAGCCCGTCATCGGCACCATCCTGTACGTGCCCAACGGCGGCACTA
TCAAGAAGGATGAAACCCTCGCCACCTGGGATCCGTACAATGTGCCCGTGATTGCAGAAAAGGGCGGCATCGTCGAATTC
AAGGATATGATCGTCGGCATCACCGTTTCCAAGGAAACGGACCGGGAAACCGGTGCCTCCTCCCTTGTCGTGATGGAACA
CAAGCAGGAACTTCACCCGCAAGTGGTCATCCGCGATGCCAAGACCCGCGAAGTTCTGGCTCATCATGCCATTCCCGCAG
GCGCCAACCTCACTGTGAAGGACGGAGAAACCATCTCCGCCGGCACAATGGTGGCCAAGACGCCCCGCAAGGTAGCCAAG
ACGAAGGACATCACCGGCGGTCTGCCCCGCGTGGCGGAATTGTTCGAAGCCCGCAAGCCCAAGGACGCCTGCACCATTGC
ACGCGTGGAAGGCATTGTGCGCCTCAGCAGCAAGAATACTTCCCGCGGCAAGAAGGTCATTACCATTGAAACACCCACGG
GCGAACTGGTGGACCATCTGGTCCCGATGAACAAGCACGTCATCGTTCATGAAGACGACCACGTGCATCTGGGCGACCAG
CTTACGGAAGGCCCCGTTTCTCCGGAAGAAATTCTGGATGTCTGCGGCAAGGAACGTCTCCAGGAACACCTCGTTAACGA
AGTTCAGGAAGTGTACCGCCTCCAGGGGGTGGAAATCAACGACAAGCATGTGGAAATCATCGTGCGCCAGATGCTCCGCA
AGGTAGTCATCACGGAACCCGGAAATACCGAATTCCTGTGGGGAGACCAAGTGGACAAGACCACGTTCGACCGCATCAAT
GAACAAACCGTAGCCCAGGGCGGCCAACCGGCCGCAGCCAAGCCCGTTCTGCTCGGTATCACGAAGGCCTCCCTGGAAAC
GGAATCCTTCATTTCCGCGGCATCTTTCCAGGATACCACACGCGTTCTGACGGAAGCATCCACCCTCGGCAAGACCGATA
CTCTGGAAGGCTTCAAGGAAAACGTCATCATGGGCCACCTCATTCCCGCCGGCACCGGATTCTCCCGTTACAGCAAGATT
GAAGTGGAACCTGCAGAGGGCGCAGAAGAAATCGCGGCGGCCAGCGAAGAAGAGGAAGCGGCGGAACTTGCCGAAGACAT
GTTGAACGATACCATCAACTTCGACAACGAACGCTAA
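
The gene sequence is given in coding orientation (it begins with ATG and ends with the stop codon TAA), so it can be translated directly. The sketch below uses the standard codon assignments, which the bacterial genetic code shares; `translate` is an illustrative helper and assumes the input contains only A, C, G and T.

```python
from itertools import product

# Standard codon table built in TCAG order (last base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTCTGACACCCCCACCATCAGGGAAATG"))  # MSDTPTIREM
# 4197 bases are 1399 codons; dropping the stop codon leaves 1398 residues.
print(4197 // 3 - 1)  # 1398
```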

Upstream 100 bases:

>100_bases
CGTCGACTTCTCCGACCTCAAATTCTAATATCCACCGCATTCCGCCCCCGTGCCGCAAAACGCGGGGGCGGGCATCAAAC
ATCCTTTAAACTTTCAATAT

Downstream 100 bases:

>100_bases
CCTGTTCTCTCCACCCACACTTCAAGGGCCGTACCCCGTTTCCGGGGTACGGCTTTTTTTATGGACTCTCCTCGTTATTC
CACGAAAAAGGAAAAGCGGA

Product: DNA-directed RNA polymerase subunit beta'

Products: NA

Alternate protein names: RNAP subunit beta'; RNA polymerase subunit beta'; Transcriptase subunit beta'

Number of amino acids: Translated: 1398; Mature: 1397

Protein sequence:

>1398_residues
MSDTPTIREMHGLSDKPRTFDQVAITVADPDTIRSWSFGEVVNPETINYRTFKPEKGGLFCERIFGPTRDMECACGKYKR
IKHKGITCDRCGVEVTNARVRRERMGHIELAVPVSHIWFYKCMPSRIGLMLDMTARHLERVIYYEDYIVVDPGSTPLEKG
AILTEEEFRNAEDEYGYDSFEAGMGAEAIQKMLAAIDLPTLVADLQEQLDNTNSKQNKRKIAKRLKLAQGFLQSNTRPEW
MILNVLPVIPPDLRPLVPLEGGRFATSDLNDLYRRVINRNNRLKTLLSLKTPEVIIRNEKRMLQEAVDALFDNGRHGRAV
TGAGNRPLKSLSDMLKGKGGRFRQNLLGKRVDYSGRSVIVIGPELKLNQCGLPKKMALILFEPFIIHRLKELGYVHTVRS
AKKLIDRKTPEVWDILEEVTKGHPVMLNRAPTLHRLSIQAFEPVLIEGSAIRLHPLVCNAYNADFDGDQMAVHVPLSVEA
QMEARQLMLAPNNIFSPASGKPIATPTQDIILGAYFLTHTRAAEVQNNQDNHHHLPLFESIDEVEYAIAARKIGYHDWIR
LHNPDYGKKPSEVVYGDVTKKVIITTAGRVRFNEIWPRELGYINRNVGKKQMGDIIWRCYQTVGKERTVQTLDALKNLGF
KEATRSGCSIGIVDMVVPSQKKTEIEKAYAELDKVTRQYKNGIITDGERYQKVVDIWTQTTDVIQAALYRKLEHNEGSKM
ASPLFMMVDSGARGNKAQIKQLSGMRGLMAKPSGEIIERPITANFREGLSVLEYFISTHGARKGLADTALKTADSGYMTR
KLVDVAQDVIVHAEDCGTSNGITVHAIYDGDEEVASLSSRIYGRTSCERIVDPVSGEVIVDINDLINEKQAEQLEKIGIE
RLKIRSVLTCELKKGCCAKCYGLNLATGQEVKIGEAVGIIAAQSIGEPGTQLTMRTFHVGGTATTAFKQPIVKAKNDGRV
IYTEDLRTVENADGNFVVLNKNCSVRIENEQGRELESYQPVIGTILYVPNGGTIKKDETLATWDPYNVPVIAEKGGIVEF
KDMIVGITVSKETDRETGASSLVVMEHKQELHPQVVIRDAKTREVLAHHAIPAGANLTVKDGETISAGTMVAKTPRKVAK
TKDITGGLPRVAELFEARKPKDACTIARVEGIVRLSSKNTSRGKKVITIETPTGELVDHLVPMNKHVIVHEDDHVHLGDQ
LTEGPVSPEEILDVCGKERLQEHLVNEVQEVYRLQGVEINDKHVEIIVRQMLRKVVITEPGNTEFLWGDQVDKTTFDRIN
EQTVAQGGQPAAAKPVLLGITKASLETESFISAASFQDTTRVLTEASTLGKTDTLEGFKENVIMGHLIPAGTGFSRYSKI
EVEPAEGAEEIAAASEEEEAAELAEDMLNDTINFDNER

Sequences:

>Translated_1398_residues
MSDTPTIREMHGLSDKPRTFDQVAITVADPDTIRSWSFGEVVNPETINYRTFKPEKGGLFCERIFGPTRDMECACGKYKR
IKHKGITCDRCGVEVTNARVRRERMGHIELAVPVSHIWFYKCMPSRIGLMLDMTARHLERVIYYEDYIVVDPGSTPLEKG
AILTEEEFRNAEDEYGYDSFEAGMGAEAIQKMLAAIDLPTLVADLQEQLDNTNSKQNKRKIAKRLKLAQGFLQSNTRPEW
MILNVLPVIPPDLRPLVPLEGGRFATSDLNDLYRRVINRNNRLKTLLSLKTPEVIIRNEKRMLQEAVDALFDNGRHGRAV
TGAGNRPLKSLSDMLKGKGGRFRQNLLGKRVDYSGRSVIVIGPELKLNQCGLPKKMALILFEPFIIHRLKELGYVHTVRS
AKKLIDRKTPEVWDILEEVTKGHPVMLNRAPTLHRLSIQAFEPVLIEGSAIRLHPLVCNAYNADFDGDQMAVHVPLSVEA
QMEARQLMLAPNNIFSPASGKPIATPTQDIILGAYFLTHTRAAEVQNNQDNHHHLPLFESIDEVEYAIAARKIGYHDWIR
LHNPDYGKKPSEVVYGDVTKKVIITTAGRVRFNEIWPRELGYINRNVGKKQMGDIIWRCYQTVGKERTVQTLDALKNLGF
KEATRSGCSIGIVDMVVPSQKKTEIEKAYAELDKVTRQYKNGIITDGERYQKVVDIWTQTTDVIQAALYRKLEHNEGSKM
ASPLFMMVDSGARGNKAQIKQLSGMRGLMAKPSGEIIERPITANFREGLSVLEYFISTHGARKGLADTALKTADSGYMTR
KLVDVAQDVIVHAEDCGTSNGITVHAIYDGDEEVASLSSRIYGRTSCERIVDPVSGEVIVDINDLINEKQAEQLEKIGIE
RLKIRSVLTCELKKGCCAKCYGLNLATGQEVKIGEAVGIIAAQSIGEPGTQLTMRTFHVGGTATTAFKQPIVKAKNDGRV
IYTEDLRTVENADGNFVVLNKNCSVRIENEQGRELESYQPVIGTILYVPNGGTIKKDETLATWDPYNVPVIAEKGGIVEF
KDMIVGITVSKETDRETGASSLVVMEHKQELHPQVVIRDAKTREVLAHHAIPAGANLTVKDGETISAGTMVAKTPRKVAK
TKDITGGLPRVAELFEARKPKDACTIARVEGIVRLSSKNTSRGKKVITIETPTGELVDHLVPMNKHVIVHEDDHVHLGDQ
LTEGPVSPEEILDVCGKERLQEHLVNEVQEVYRLQGVEINDKHVEIIVRQMLRKVVITEPGNTEFLWGDQVDKTTFDRIN
EQTVAQGGQPAAAKPVLLGITKASLETESFISAASFQDTTRVLTEASTLGKTDTLEGFKENVIMGHLIPAGTGFSRYSKI
EVEPAEGAEEIAAASEEEEAAELAEDMLNDTINFDNER
>Mature_1397_residues
SDTPTIREMHGLSDKPRTFDQVAITVADPDTIRSWSFGEVVNPETINYRTFKPEKGGLFCERIFGPTRDMECACGKYKRI
KHKGITCDRCGVEVTNARVRRERMGHIELAVPVSHIWFYKCMPSRIGLMLDMTARHLERVIYYEDYIVVDPGSTPLEKGA
ILTEEEFRNAEDEYGYDSFEAGMGAEAIQKMLAAIDLPTLVADLQEQLDNTNSKQNKRKIAKRLKLAQGFLQSNTRPEWM
ILNVLPVIPPDLRPLVPLEGGRFATSDLNDLYRRVINRNNRLKTLLSLKTPEVIIRNEKRMLQEAVDALFDNGRHGRAVT
GAGNRPLKSLSDMLKGKGGRFRQNLLGKRVDYSGRSVIVIGPELKLNQCGLPKKMALILFEPFIIHRLKELGYVHTVRSA
KKLIDRKTPEVWDILEEVTKGHPVMLNRAPTLHRLSIQAFEPVLIEGSAIRLHPLVCNAYNADFDGDQMAVHVPLSVEAQ
MEARQLMLAPNNIFSPASGKPIATPTQDIILGAYFLTHTRAAEVQNNQDNHHHLPLFESIDEVEYAIAARKIGYHDWIRL
HNPDYGKKPSEVVYGDVTKKVIITTAGRVRFNEIWPRELGYINRNVGKKQMGDIIWRCYQTVGKERTVQTLDALKNLGFK
EATRSGCSIGIVDMVVPSQKKTEIEKAYAELDKVTRQYKNGIITDGERYQKVVDIWTQTTDVIQAALYRKLEHNEGSKMA
SPLFMMVDSGARGNKAQIKQLSGMRGLMAKPSGEIIERPITANFREGLSVLEYFISTHGARKGLADTALKTADSGYMTRK
LVDVAQDVIVHAEDCGTSNGITVHAIYDGDEEVASLSSRIYGRTSCERIVDPVSGEVIVDINDLINEKQAEQLEKIGIER
LKIRSVLTCELKKGCCAKCYGLNLATGQEVKIGEAVGIIAAQSIGEPGTQLTMRTFHVGGTATTAFKQPIVKAKNDGRVI
YTEDLRTVENADGNFVVLNKNCSVRIENEQGRELESYQPVIGTILYVPNGGTIKKDETLATWDPYNVPVIAEKGGIVEFK
DMIVGITVSKETDRETGASSLVVMEHKQELHPQVVIRDAKTREVLAHHAIPAGANLTVKDGETISAGTMVAKTPRKVAKT
KDITGGLPRVAELFEARKPKDACTIARVEGIVRLSSKNTSRGKKVITIETPTGELVDHLVPMNKHVIVHEDDHVHLGDQL
TEGPVSPEEILDVCGKERLQEHLVNEVQEVYRLQGVEINDKHVEIIVRQMLRKVVITEPGNTEFLWGDQVDKTTFDRINE
QTVAQGGQPAAAKPVLLGITKASLETESFISAASFQDTTRVLTEASTLGKTDTLEGFKENVIMGHLIPAGTGFSRYSKIE
VEPAEGAEEIAAASEEEEAAELAEDMLNDTINFDNER

Specific function: DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates

COG id: COG0086

COG function: function code K; DNA-directed RNA polymerase, beta' subunit/160 kD subunit

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the RNA polymerase beta' chain family

Homologues:

Organism=Homo sapiens, GI4505939, Length=925, Percent_Identity=26.8108108108108, Blast_Score=192, Evalue=2e-48
Organism=Homo sapiens, GI39725938, Length=508, Percent_Identity=27.1653543307087, Blast_Score=126, Evalue=1e-28
Organism=Homo sapiens, GI103471997, Length=622, Percent_Identity=25.4019292604502, Blast_Score=122, Evalue=2e-27
Organism=Escherichia coli, GI2367335, Length=1410, Percent_Identity=51.2765957446809, Blast_Score=1428, Evalue=0.0
Organism=Caenorhabditis elegans, GI25145495, Length=864, Percent_Identity=25.6944444444444, Blast_Score=190, Evalue=5e-48
Organism=Caenorhabditis elegans, GI71987878, Length=556, Percent_Identity=28.5971223021583, Blast_Score=166, Evalue=8e-41
Organism=Caenorhabditis elegans, GI71998295, Length=258, Percent_Identity=28.2945736434109, Blast_Score=92, Evalue=2e-18
Organism=Saccharomyces cerevisiae, GI6320061, Length=912, Percent_Identity=26.2061403508772, Blast_Score=204, Evalue=1e-52
Organism=Saccharomyces cerevisiae, GI6324690, Length=678, Percent_Identity=26.8436578171091, Blast_Score=184, Evalue=6e-47
Organism=Saccharomyces cerevisiae, GI6324917, Length=335, Percent_Identity=27.7611940298507, Blast_Score=99, Evalue=6e-21
Organism=Drosophila melanogaster, GI281360912, Length=953, Percent_Identity=25.6033578174187, Blast_Score=201, Evalue=3e-51
Organism=Drosophila melanogaster, GI17530899, Length=928, Percent_Identity=26.2931034482759, Blast_Score=186, Evalue=1e-46
Organism=Drosophila melanogaster, GI17647875, Length=334, Percent_Identity=26.0479041916168, Blast_Score=104, Evalue=4e-22
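
Each homologue entry is a comma-separated list of `key=value` fields, except for the GI number, which has no `=`. The sketch below is one possible way to read such a line into a dictionary; `parse_hit` and the output key names are hypothetical, and the parser assumes organism names contain no commas.

```python
def parse_hit(line):
    """Parse one homologue line into a dict with typed values."""
    fields = {}
    # A trailing comma, if present, is ignored.
    for part in line.strip().rstrip(",").split(","):
        part = part.strip()
        if not part:
            continue
        if "=" in part:
            key, value = part.split("=", 1)
            fields[key] = value
        elif part.startswith("GI"):
            # The GI number is written as "GI<digits>" without "=".
            fields["GI"] = part[2:]
    return {
        "organism": fields["Organism"],
        "gi": fields["GI"],
        "length": int(fields["Length"]),
        "percent_identity": float(fields["Percent_Identity"]),
        "blast_score": int(fields["Blast_Score"]),
        "evalue": float(fields["Evalue"]),
    }


line = ("Organism=Escherichia coli, GI2367335, Length=1410, "
        "Percent_Identity=51.2765957446809, Blast_Score=1428, Evalue=0.0")
hit = parse_hit(line)
print(hit["organism"], round(hit["percent_identity"], 1))  # Escherichia coli 51.3
```

Parsed this way, the entries can be sorted by `blast_score` or filtered by `evalue` without further string handling.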

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): RPOC_AKKM8 (B2UQY1)

Other databases:

- EMBL:   CP001071
- RefSeq:   YP_001877647.1
- GeneID:   6274073
- GenomeReviews:   CP001071_GR
- KEGG:   amu:Amuc_1040
- HOGENOM:   HBG621785
- OMA:   FEARVPK
- ProtClustDB:   PRK00566
- HAMAP:   MF_01322
- InterPro:   IPR000722
- InterPro:   IPR006592
- InterPro:   IPR007080
- InterPro:   IPR007066
- InterPro:   IPR007083
- InterPro:   IPR007081
- InterPro:   IPR012754
- SMART:   SM00663
- TIGRFAMs:   TIGR02386

Pfam domain/function: PF04997 RNA_pol_Rpb1_1; PF00623 RNA_pol_Rpb1_2; PF04983 RNA_pol_Rpb1_3; PF05000 RNA_pol_Rpb1_4; PF04998 RNA_pol_Rpb1_5

EC number: 2.7.7.6

Molecular weight: Translated: 155530; Mature: 155399
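
The translated and mature weights differ by 131 (155530 - 155399), which is consistent with removal of the initiator methionine, whose residue mass is about 131.19 Da. The sketch below estimates a weight by summing average residue masses and adding one water molecule. The mass table holds commonly tabulated average values and is an assumption here, since the record does not say how its figures were computed, so small differences from the listed numbers would not be surprising.

```python
# Average residue masses in daltons (commonly tabulated values; assumed here).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water for the free N- and C-termini


def molecular_weight(protein):
    """Approximate average molecular weight of a linear peptide."""
    return sum(RESIDUE_MASS[aa] for aa in protein) + WATER


# First ten residues of the translated protein, with and without Met1.
full = "MSDTPTIREM"
print(round(molecular_weight(full) - molecular_weight(full[1:]), 2))  # 131.19
```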

Theoretical pI: Translated: 6.73; Mature: 6.73

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.4 %Cys     (Translated Protein)
2.4 %Met     (Translated Protein)
3.7 %Cys+Met (Translated Protein)
1.4 %Cys     (Mature Protein)
2.3 %Met     (Mature Protein)
3.7 %Cys+Met (Mature Protein)
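
These percentages are residue counts divided by the protein length. In the translated figures the rounded Cys and Met values (1.4 and 2.4) add up to 3.8 while the combined value is listed as 3.7, which suggests the sum was taken before rounding. The sketch below follows that assumption; `cys_met_content` is a hypothetical helper.

```python
def cys_met_content(protein):
    """Percent Cys, Met and Cys+Met, each rounded to one decimal place."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    cys = 100.0 * seq.count("C") / len(seq)
    met = 100.0 * seq.count("M") / len(seq)
    # The combined figure is rounded from the unrounded sum (assumption).
    return {
        "Cys": round(cys, 1),
        "Met": round(met, 1),
        "Cys+Met": round(cys + met, 1),
    }


# First ten residues of the translated protein, as a small example.
print(cys_met_content("MSDTPTIREM"))  # {'Cys': 0.0, 'Met': 20.0, 'Cys+Met': 20.0}
```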

Secondary structure:

>Translated Secondary Structure
MSDTPTIREMHGLSDKPRTFDQVAITVADPDTIRSWSFGEVVNPETINYRTFKPEKGGLF
CCCCCCHHHHHCCCCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEEECCCCCCEE
CERIFGPTRDMECACGKYKRIKHKGITCDRCGVEVTNARVRRERMGHIELAVPVSHIWFY
HHHHCCCCCCCHHHHHHHHHHHHCCCCCHHCCCCHHHHHHHHHHCCCEEEEEEHHHHHHH
KCMPSRIGLMLDMTARHLERVIYYEDYIVVDPGSTPLEKGAILTEEEFRNAEDEYGYDSF
HCCHHHHCEEEHHHHHHHHHHEEEECEEEECCCCCCHHCCCEECHHHHCCCCCCCCCCHH
EAGMGAEAIQKMLAAIDLPTLVADLQEQLDNTNSKQNKRKIAKRLKLAQGFLQSNTRPEW
HCCCCHHHHHHHHHHHCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCE
MILNVLPVIPPDLRPLVPLEGGRFATSDLNDLYRRVINRNNRLKTLLSLKTPEVIIRNEK
EEEEEECCCCCCCCCCCCCCCCEEECCHHHHHHHHHHCCCCCEEHHHHCCCCHHEECCHH
RMLQEAVDALFDNGRHGRAVTGAGNRPLKSLSDMLKGKGGRFRQNLLGKRVDYSGRSVIV
HHHHHHHHHHHCCCCCCCEEECCCCCCHHHHHHHHHCCCCHHHHHHHCCCCCCCCCEEEE
IGPELKLNQCGLPKKMALILFEPFIIHRLKELGYVHTVRSAKKLIDRKTPEVWDILEEVT
ECCCCEECCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCHHHHHHHHHH
KGHPVMLNRAPTLHRLSIQAFEPVLIEGSAIRLHPLVCNAYNADFDGDQMAVHVPLSVEA
CCCCEEEECCCCCEEEEHHHCCCEEEECCEEEEEEEEEECCCCCCCCCCEEEEECCCCCH
QMEARQLMLAPNNIFSPASGKPIATPTQDIILGAYFLTHTRAAEVQNNQDNHHHLPLFES
HHHHHHEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHH
IDEVEYAIAARKIGYHDWIRLHNPDYGKKPSEVVYGDVTKKVIITTAGRVRFNEIWPREL
HHHHHHHHHHHHCCCHHEEEECCCCCCCCCCCEEECCCCEEEEEEECCCEEECCCCCHHH
GYINRNVGKKQMGDIIWRCYQTVGKERTVQTLDALKNLGFKEATRSGCSIGIVDMVVPSQ
HHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCHHHHHCCCCCEEEEEEECCCC
KKTEIEKAYAELDKVTRQYKNGIITDGERYQKVVDIWTQTTDVIQAALYRKLEHNEGSKM
HHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHH
ASPLFMMVDSGARGNKAQIKQLSGMRGLMAKPSGEIIERPITANFREGLSVLEYFISTHG
HCCEEEEECCCCCCCHHHHHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHCC
ARKGLADTALKTADSGYMTRKLVDVAQDVIVHAEDCGTSNGITVHAIYDGDEEVASLSSR
CCCCHHHHHHHHCCCCHHHHHHHHHHHHHHEECHHCCCCCCEEEEEEECCHHHHHHHHHH
IYGRTSCERIVDPVSGEVIVDINDLINEKQAEQLEKIGIERLKIRSVLTCELKKGCCAKC
HCCCCHHHHHHCCCCCEEEEEHHHHHCHHHHHHHHHHCHHHHHHHHHHHHHHCCCCCHHH
YGLNLATGQEVKIGEAVGIIAAQSIGEPGTQLTMRTFHVGGTATTAFKQPIVKAKNDGRV
HCCCCCCCCCEECCHHHHHHHHHCCCCCCCEEEEEEEEECCCCHHHHHCCHHCCCCCCEE
IYTEDLRTVENADGNFVVLNKNCSVRIENEQGRELESYQPVIGTILYVPNGGTIKKDETL
EEECCCHHHHCCCCCEEEECCCCEEEEECCCCCCHHHCCCEEEEEEECCCCCCCCCCCEE
ATWDPYNVPVIAEKGGIVEFKDMIVGITVSKETDRETGASSLVVMEHKQELHPQVVIRDA
ECCCCCCCCEEECCCCEEEEEEEEEEEEECCCCCCCCCCCEEEEEECHHHCCCEEEEECC
KTREVLAHHAIPAGANLTVKDGETISAGTMVAKTPRKVAKTKDITGGLPRVAELFEARKP
HHHHHHHHHCCCCCCEEEECCCCEEECCCEEECCCHHHHHCCCCCCCCHHHHHHHHCCCC
KDACTIARVEGIVRLSSKNTSRGKKVITIETPTGELVDHLVPMNKHVIVHEDDHVHLGDQ
CHHHHHHHHHHEEEECCCCCCCCCEEEEEECCCHHHHHHHCCCCCEEEEECCCCEECCCC
LTEGPVSPEEILDVCGKERLQEHLVNEVQEVYRLQGVEINDKHVEIIVRQMLRKVVITEP
CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHEEECC
GNTEFLWGDQVDKTTFDRINEQTVAQGGQPAAAKPVLLGITKASLETESFISAASFQDTT
CCCEEEECCCCCHHHHHHCCHHHHHCCCCCCCCCCEEEEEEHHHCCHHHHHHHHHHHHHH
RVLTEASTLGKTDTLEGFKENVIMGHLIPAGTGFSRYSKIEVEPAEGAEEIAAASEEEEA
HHHHHHHHCCCCHHHHHHHHCEEEEEEECCCCCCCCCEEEEECCCCCHHHHHHCCCHHHH
AELAEDMLNDTINFDNER
HHHHHHHHHHCCCCCCCC
>Mature Secondary Structure 
SDTPTIREMHGLSDKPRTFDQVAITVADPDTIRSWSFGEVVNPETINYRTFKPEKGGLF
CCCCCHHHHHCCCCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEEECCCCCCEE
CERIFGPTRDMECACGKYKRIKHKGITCDRCGVEVTNARVRRERMGHIELAVPVSHIWFY
HHHHCCCCCCCHHHHHHHHHHHHCCCCCHHCCCCHHHHHHHHHHCCCEEEEEEHHHHHHH
KCMPSRIGLMLDMTARHLERVIYYEDYIVVDPGSTPLEKGAILTEEEFRNAEDEYGYDSF
HCCHHHHCEEEHHHHHHHHHHEEEECEEEECCCCCCHHCCCEECHHHHCCCCCCCCCCHH
EAGMGAEAIQKMLAAIDLPTLVADLQEQLDNTNSKQNKRKIAKRLKLAQGFLQSNTRPEW
HCCCCHHHHHHHHHHHCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCE
MILNVLPVIPPDLRPLVPLEGGRFATSDLNDLYRRVINRNNRLKTLLSLKTPEVIIRNEK
EEEEEECCCCCCCCCCCCCCCCEEECCHHHHHHHHHHCCCCCEEHHHHCCCCHHEECCHH
RMLQEAVDALFDNGRHGRAVTGAGNRPLKSLSDMLKGKGGRFRQNLLGKRVDYSGRSVIV
HHHHHHHHHHHCCCCCCCEEECCCCCCHHHHHHHHHCCCCHHHHHHHCCCCCCCCCEEEE
IGPELKLNQCGLPKKMALILFEPFIIHRLKELGYVHTVRSAKKLIDRKTPEVWDILEEVT
ECCCCEECCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCHHHHHHHHHH
KGHPVMLNRAPTLHRLSIQAFEPVLIEGSAIRLHPLVCNAYNADFDGDQMAVHVPLSVEA
CCCCEEEECCCCCEEEEHHHCCCEEEECCEEEEEEEEEECCCCCCCCCCEEEEECCCCCH
QMEARQLMLAPNNIFSPASGKPIATPTQDIILGAYFLTHTRAAEVQNNQDNHHHLPLFES
HHHHHHEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHH
IDEVEYAIAARKIGYHDWIRLHNPDYGKKPSEVVYGDVTKKVIITTAGRVRFNEIWPREL
HHHHHHHHHHHHCCCHHEEEECCCCCCCCCCCEEECCCCEEEEEEECCCEEECCCCCHHH
GYINRNVGKKQMGDIIWRCYQTVGKERTVQTLDALKNLGFKEATRSGCSIGIVDMVVPSQ
HHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCHHHHHCCCCCEEEEEEECCCC
KKTEIEKAYAELDKVTRQYKNGIITDGERYQKVVDIWTQTTDVIQAALYRKLEHNEGSKM
HHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHH
ASPLFMMVDSGARGNKAQIKQLSGMRGLMAKPSGEIIERPITANFREGLSVLEYFISTHG
HCCEEEEECCCCCCCHHHHHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHCC
ARKGLADTALKTADSGYMTRKLVDVAQDVIVHAEDCGTSNGITVHAIYDGDEEVASLSSR
CCCCHHHHHHHHCCCCHHHHHHHHHHHHHHEECHHCCCCCCEEEEEEECCHHHHHHHHHH
IYGRTSCERIVDPVSGEVIVDINDLINEKQAEQLEKIGIERLKIRSVLTCELKKGCCAKC
HCCCCHHHHHHCCCCCEEEEEHHHHHCHHHHHHHHHHCHHHHHHHHHHHHHHCCCCCHHH
YGLNLATGQEVKIGEAVGIIAAQSIGEPGTQLTMRTFHVGGTATTAFKQPIVKAKNDGRV
HCCCCCCCCCEECCHHHHHHHHHCCCCCCCEEEEEEEEECCCCHHHHHCCHHCCCCCCEE
IYTEDLRTVENADGNFVVLNKNCSVRIENEQGRELESYQPVIGTILYVPNGGTIKKDETL
EEECCCHHHHCCCCCEEEECCCCEEEEECCCCCCHHHCCCEEEEEEECCCCCCCCCCCEE
ATWDPYNVPVIAEKGGIVEFKDMIVGITVSKETDRETGASSLVVMEHKQELHPQVVIRDA
ECCCCCCCCEEECCCCEEEEEEEEEEEEECCCCCCCCCCCEEEEEECHHHCCCEEEEECC
KTREVLAHHAIPAGANLTVKDGETISAGTMVAKTPRKVAKTKDITGGLPRVAELFEARKP
HHHHHHHHHCCCCCCEEEECCCCEEECCCEEECCCHHHHHCCCCCCCCHHHHHHHHCCCC
KDACTIARVEGIVRLSSKNTSRGKKVITIETPTGELVDHLVPMNKHVIVHEDDHVHLGDQ
CHHHHHHHHHHEEEECCCCCCCCCEEEEEECCCHHHHHHHCCCCCEEEEECCCCEECCCC
LTEGPVSPEEILDVCGKERLQEHLVNEVQEVYRLQGVEINDKHVEIIVRQMLRKVVITEP
CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHEEECC
GNTEFLWGDQVDKTTFDRINEQTVAQGGQPAAAKPVLLGITKASLETESFISAASFQDTT
CCCEEEECCCCCHHHHHHCCHHHHHCCCCCCCCCCEEEEEEHHHCCHHHHHHHHHHHHHH
RVLTEASTLGKTDTLEGFKENVIMGHLIPAGTGFSRYSKIEVEPAEGAEEIAAASEEEEA
HHHHHHHHCCCCHHHHHHHHCEEEEEEECCCCCCCCCEEEEECCCCCHHHHHHCCCHHHH
AELAEDMLNDTINFDNER
HHHHHHHHHHCCCCCCCC
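
The secondary structure blocks interleave a line of residues with a line of predicted states. The sketch below pairs the lines back up and reports the share of each state. It assumes the usual three-state convention (H = helix, E = strand, C = coil), which the record does not spell out, and `state_fractions` is an illustrative name.

```python
from collections import Counter


def state_fractions(block):
    """Percent of H, E and C states in an interleaved residue/state block."""
    lines = [ln.strip() for ln in block.splitlines()]
    lines = [ln for ln in lines if ln and not ln.startswith(">")]
    residues = "".join(lines[0::2])  # odd lines hold the amino acids
    states = "".join(lines[1::2])    # even lines hold the predicted states
    if len(residues) != len(states):
        raise ValueError("residue and state lines do not line up")
    counts = Counter(states)
    return {s: round(100.0 * counts[s] / len(states), 1) for s in "HEC"}


# First ten positions of the translated block, as a small example.
example = """>Translated Secondary Structure
MSDTPTIREM
CCCCCCHHHH
"""
print(state_fractions(example))  # {'H': 40.0, 'E': 0.0, 'C': 60.0}
```

A mix of helix and strand states across the full block would fit the "Alpha Beta" structure class listed in this record.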

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA