Definition Mycobacterium bovis BCG str. Pasteur 1173P2, complete genome.
Accession NC_008769
Length 4,374,522


The map label for this gene is dnaE

Identifier: 121637469

GI number: 121637469

Start: 1760025

End: 1763579

Strand: Direct

Name: dnaE

Synonym: BCG_1600

Alternate gene names: 121637469

Gene position: 1760025-1763579 (Clockwise)

Preceding gene: 121637468

Following gene: 121637471

Centisome position: 40.23

GC content: 64.16
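The coordinate fields above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive, and that the centisome position is the start coordinate expressed as a percentage of the genome length; the record does not state either definition, so both are assumptions. The function names are illustrative only.

```python
# Values copied from the Length, Start and End fields of this record.
GENOME_LENGTH = 4_374_522
START = 1_760_025
END = 1_763_579


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


length = gene_length(START, END)
position = round(centisome(START, GENOME_LENGTH), 2)
print(length, position)  # 3555 40.23
```

Under these assumptions the result matches both the 3555-base gene sequence header and the listed centisome position of 40.23.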

Gene sequence:

>3555_bases
ATGAGCGGTTCATCTGCGGGGTCCTCCTTCGTGCACCTGCACAACCACACCGAGTATTCGATGCTGGACGGTGCCGCGAA
GATCACGCCCATGCTCGCCGAGGTGGAGCGGCTGGGGATGCCCGCGGTGGGGATGACCGACCACGGAAACATGTTCGGTG
CCAGCGAGTTCTACAACTCCGCGACCAAGGCCGGGATCAAGCCGATCATCGGCGTGGAGGCATACATCGCGCCGGGCTCG
CGGTTCGACACCCGGCGCATCCTGTGGGGTGACCCCAGCCAAAAGGCCGACGACGTCTCCGGCAGCGGCTCCTACACGCA
CCTGACGATGATGGCCGAGAACGCCACCGGTCTGCGCAACCTGTTCAAGCTGTCCTCGCATGCTTCCTTCGAGGGCCAGC
TGAGCAAGTGGTCGCGCATGGACGCCGAGCTCATCGCCGAACACGCCGAGGGCATCATCATCACCACCGGATGCCCGTCG
GGGGAGGTGCAGACCCGCCTGCGGCTCGGCCAGGATCGGGAGGCGCTCGAAGCCGCGGCGAAGTGGCGGGAGATCGTCGG
ACCGGACAACTACTTCCTTGAGCTGATGGACCACGGGCTGACCATCGAACGCCGGGTCCGTGACGGTCTGCTCGAGATCG
GACGCGCGCTCAACATTCCGCCTCTTGCCACCAATGACTGCCACTACGTGACCCGCGACGCCGCCCACAACCATGAGGCT
TTGTTGTGTGTGCAGACCGGCAAGACCCTCTCGGATCCGAATCGCTTCAAGTTCGACGGTGACGGCTACTACCTGAAGTC
GGCCGCCGAGATGCGCCAGATCTGGGACGACGAAGTGCCGGGCGCGTGTGACTCCACCTTGTTGATCGCCGAACGGGTGC
AGTCCTACGCCGACGTGTGGACACCGCGCGACCGGATGCCCGTGTTTCCGGTGCCCGATGGGCATGACCAGGCGTCCTGG
CTGCGTCACGAGGTGGACGCCGGGCTTCGCCGGCGATTTCCGGCCGGTCCGCCGGACGGGTACCGCGAGCGCGCCGCCTA
CGAGATCGACGTCATCTGCTCCAAAGGTTTCCCATCGTACTTTCTGATCGTCGCCGACCTGATCAGCTACGCGCGGTCGG
CGGGCATAAGGGTGGGTCCCGGCCGCGGCTCGGCCGCCGGCTCGCTGGTCGCCTACGCGCTGGGCATCACCGACATCGAC
CCGATTCCACACGGTCTGCTGTTCGAGCGGTTCCTCAACCCCGAGCGCACCTCGATGCCCGACATCGATATCGACTTCGA
CGACCGGCGCCGCGGTGAGATGGTGCGCTACGCAGCCGACAAGTGGGGCCACGACCGGGTCGCGCAGGTCATCACCTTCG
GCACCATCAAAACCAAAGCGGCGCTGAAGGATTCGGCGCGAATCCACTACGGGCAGCCCGGGTTCGCCATCGCCGACCGG
ATCACCAAGGCGCTGCCGCCGGCGATCATGGCCAAAGACATCCCGCTGTCTGGGATCACCGATCCCAGCCACGAACGGTA
CAAGGAGGCCGCCGAGGTCCGCGGCCTGATCGAAACCGACCCGGACGTACGCACCATCTACCAGACCGCACGCGGGTTGG
AAGGCCTGATCCGCAACGCGGGTGTGCACGCCTGCGCGGTGATCATGAGCAGCGAGCCGCTGACTGAGGCCATCCCGTTG
TGGAAGCGGCCGCAGGACGGGGCCATCATCACCGGCTGGGATTACCCGGCGTGCGAGGCCATCGGTCTGCTGAAAATGGA
CTTCCTGGGCCTGCGGAACCTGACGATCATCGGCGACGCGATCGACAACGTCAGGGCCAACAGGGGTATCGACCTCGACC
TGGAATCCGTGCCGCTGGACGACAAGGCCACCTATGAGCTGCTGGGCCGCGGCGACACCCTGGGCGTGTTCCAGCTCGAC
GGCGGGCCCATGCGCGACCTGCTGCGCCGCATGCAGCCGACCGGGTTCGAAGACGTCGTCGCCGTTATCGCGCTGTACCG
GCCCGGCCCGATGGGCATGAACGCACACAACGACTATGCCGACCGCAAGAACAACCGGCAGGCCATCAAACCTATTCACC
CGGAACTCGAAGAACCGCTGCGCGAGATCCTCGCCGAGACCTACGGCCTCATCGTCTATCAAGAGCAGATCATGCGCATC
GCGCAGAAGGTGGCGAGCTACTCGTTGGCCCGCGCCGACATTCTACGCAAGGCCATGGGCAAGAAGAAACGCGAGGTGCT
GGAGAAGGAGTTCGAGGGCTTCTCCGATGGCATGCAGGCCAACGGGTTCTCTCCGGCGGCCATCAAGGCGCTGTGGGACA
CCATCCTGCCGTTCGCTGACTACGCGTTCAACAAGTCACATGCCGCCGGCTACGGCATGGTGTCCTACTGGACGGCCTAC
CTCAAGGCCAACTATCCCGCCGAGTACATGGCCGGTCTGTTGACGTCGGTCGGCGACGATAAAGACAAGGCCGCGGTTTA
TCTGGCCGACTGCCGCAAGCTCGGCATCACCGTGCTCCCGCCCGACGTCAACGAATCTGGCTTGAACTTCGCATCGGTCG
GCCAAGACATCCGCTACGGGCTGGGCGCGGTGCGCAACGTTGGCGCTAATGTCGTGGGCTCGTTGCTCCAAACCCGCAAC
GACAAGGGCAAGTTCACCGACTTTTCGGACTACCTGAACAAGATCGACATCTCGGCGTGCAACAAGAAGGTGACCGAATC
GCTGATCAAGGCGGGTGCGTTCGACTCGCTGGGGCATGCCCGCAAGGGTCTTTTCCTGGTGCACAGCGATGCGGTGGACT
CGGTGCTGGGCACCAAGAAGGCCGAGGCACTGGGGCAGTTCGATCTCTTCGGCAGCAATGATGATGGGACCGGCACCGCA
GATCCCGTGTTCACCATCAAGGTGCCCGATGATGAGTGGGAGGACAAACACAAACTCGCCCTAGAGCGCGAGATGCTGGG
ACTGTACGTCTCGGGGCATCCCCTCAACGGTGTGGCACACTTGCTGGCTGCCCAGGTCGACACCGCGATCCCAGCGATCC
TCGACGGCGATGTCCCCAACGATGCCCAAGTGCGGGTGGGCGGCATCCTGGCGTCGGTGAACCGGAGGGTCAACAAAAAC
GGAATGCCATGGGCTTCAGCGCAATTGGAGGATCTCACGGGCGGCATCGAGGTGATGTTCTTCCCGCACACCTACTCCAG
CTATGGTGCCGACATCGTCGACGATGCAGTCGTGCTGGTCAACGCCAAGGTGGCGGTCCGTGACGACCGCATCGCATTGA
TCGCCAATGACCTCACAGTGCCCGACTTTTCCAACGCCGAGGTGGAGCGGCCGCTGGCGGTCAGCTTGCCCACCCGGCAG
TGCACCTTTGACAAGGTGAGTGCGCTCAAACAGGTGTTGGCGCGCCACCCCGGCACCTCGCAGGTGCATCTGCGGCTCAT
CAGCGGAGACCGGATCACCACGCTGGCACTTGATCAGTCGTTGCGGGTGACGCCGTCGCCGGCGTTGATGGGTGACCTCA
AGGAGCTGCTCGGCCCTGGATGTCTGGGGAGTTAG
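A GC content figure such as the one listed above can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full sequence length; `gc_content` is a hypothetical helper, not part of any tool referenced by this record.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks are ignored so that a wrapped
    FASTA body can be passed in directly.
    """
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First ten bases of the gene sequence above.
first_ten = "ATGAGCGGTT"
print(gc_content(first_ten))  # 50.0
```

Passing the full 3555-base sequence instead of the ten-base prefix would give the value to compare against the GC content field.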

Upstream 100 bases:

>100_bases
TCGCTTAAGCAGTTCGCCGAGCTATACGGCTAGCCGCTAGAAGACACACTTTGCGACACGCCCGAACGGTGTCGGTCCTC
GGTCATAGACTGGCGTCCCT

Downstream 100 bases:

>100_bases
CGAGGCGACCGCCCCCAGCGGTTTCCGCACGATCGCCCGTGAGCGCCGCTAATGGATCCAGCCCGACGCCCGACTGTCCC
CGTTGAGATACCCCGAGACC

Product: DNA polymerase III subunit alpha

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 1184; Mature: 1183
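The two residue counts follow from the gene length. As a rough sketch: 3555 bases give 1185 codons, the last of which is the TAG stop codon, leaving 1184 translated residues; the mature count is one lower because the mature sequence listed in this record lacks the initial methionine. The variable names are illustrative.

```python
GENE_LENGTH_BASES = 3555  # from the gene sequence header in this record

# A coding sequence should be a whole number of codons.
codons, leftover = divmod(GENE_LENGTH_BASES, 3)

# The final codon (TAG in this record) is a stop codon and encodes no residue.
translated_residues = codons - 1

# The mature sequence in this record omits the initial methionine.
mature_residues = translated_residues - 1

print(codons, leftover, translated_residues, mature_residues)  # 1185 0 1184 1183
```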

Protein sequence:

>1184_residues
MSGSSAGSSFVHLHNHTEYSMLDGAAKITPMLAEVERLGMPAVGMTDHGNMFGASEFYNSATKAGIKPIIGVEAYIAPGS
RFDTRRILWGDPSQKADDVSGSGSYTHLTMMAENATGLRNLFKLSSHASFEGQLSKWSRMDAELIAEHAEGIIITTGCPS
GEVQTRLRLGQDREALEAAAKWREIVGPDNYFLELMDHGLTIERRVRDGLLEIGRALNIPPLATNDCHYVTRDAAHNHEA
LLCVQTGKTLSDPNRFKFDGDGYYLKSAAEMRQIWDDEVPGACDSTLLIAERVQSYADVWTPRDRMPVFPVPDGHDQASW
LRHEVDAGLRRRFPAGPPDGYRERAAYEIDVICSKGFPSYFLIVADLISYARSAGIRVGPGRGSAAGSLVAYALGITDID
PIPHGLLFERFLNPERTSMPDIDIDFDDRRRGEMVRYAADKWGHDRVAQVITFGTIKTKAALKDSARIHYGQPGFAIADR
ITKALPPAIMAKDIPLSGITDPSHERYKEAAEVRGLIETDPDVRTIYQTARGLEGLIRNAGVHACAVIMSSEPLTEAIPL
WKRPQDGAIITGWDYPACEAIGLLKMDFLGLRNLTIIGDAIDNVRANRGIDLDLESVPLDDKATYELLGRGDTLGVFQLD
GGPMRDLLRRMQPTGFEDVVAVIALYRPGPMGMNAHNDYADRKNNRQAIKPIHPELEEPLREILAETYGLIVYQEQIMRI
AQKVASYSLARADILRKAMGKKKREVLEKEFEGFSDGMQANGFSPAAIKALWDTILPFADYAFNKSHAAGYGMVSYWTAY
LKANYPAEYMAGLLTSVGDDKDKAAVYLADCRKLGITVLPPDVNESGLNFASVGQDIRYGLGAVRNVGANVVGSLLQTRN
DKGKFTDFSDYLNKIDISACNKKVTESLIKAGAFDSLGHARKGLFLVHSDAVDSVLGTKKAEALGQFDLFGSNDDGTGTA
DPVFTIKVPDDEWEDKHKLALEREMLGLYVSGHPLNGVAHLLAAQVDTAIPAILDGDVPNDAQVRVGGILASVNRRVNKN
GMPWASAQLEDLTGGIEVMFFPHTYSSYGADIVDDAVVLVNAKVAVRDDRIALIANDLTVPDFSNAEVERPLAVSLPTRQ
CTFDKVSALKQVLARHPGTSQVHLRLISGDRITTLALDQSLRVTPSPALMGDLKELLGPGCLGS

Sequences:

>Mature_1183_residues
SGSSAGSSFVHLHNHTEYSMLDGAAKITPMLAEVERLGMPAVGMTDHGNMFGASEFYNSATKAGIKPIIGVEAYIAPGSR
FDTRRILWGDPSQKADDVSGSGSYTHLTMMAENATGLRNLFKLSSHASFEGQLSKWSRMDAELIAEHAEGIIITTGCPSG
EVQTRLRLGQDREALEAAAKWREIVGPDNYFLELMDHGLTIERRVRDGLLEIGRALNIPPLATNDCHYVTRDAAHNHEAL
LCVQTGKTLSDPNRFKFDGDGYYLKSAAEMRQIWDDEVPGACDSTLLIAERVQSYADVWTPRDRMPVFPVPDGHDQASWL
RHEVDAGLRRRFPAGPPDGYRERAAYEIDVICSKGFPSYFLIVADLISYARSAGIRVGPGRGSAAGSLVAYALGITDIDP
IPHGLLFERFLNPERTSMPDIDIDFDDRRRGEMVRYAADKWGHDRVAQVITFGTIKTKAALKDSARIHYGQPGFAIADRI
TKALPPAIMAKDIPLSGITDPSHERYKEAAEVRGLIETDPDVRTIYQTARGLEGLIRNAGVHACAVIMSSEPLTEAIPLW
KRPQDGAIITGWDYPACEAIGLLKMDFLGLRNLTIIGDAIDNVRANRGIDLDLESVPLDDKATYELLGRGDTLGVFQLDG
GPMRDLLRRMQPTGFEDVVAVIALYRPGPMGMNAHNDYADRKNNRQAIKPIHPELEEPLREILAETYGLIVYQEQIMRIA
QKVASYSLARADILRKAMGKKKREVLEKEFEGFSDGMQANGFSPAAIKALWDTILPFADYAFNKSHAAGYGMVSYWTAYL
KANYPAEYMAGLLTSVGDDKDKAAVYLADCRKLGITVLPPDVNESGLNFASVGQDIRYGLGAVRNVGANVVGSLLQTRND
KGKFTDFSDYLNKIDISACNKKVTESLIKAGAFDSLGHARKGLFLVHSDAVDSVLGTKKAEALGQFDLFGSNDDGTGTAD
PVFTIKVPDDEWEDKHKLALEREMLGLYVSGHPLNGVAHLLAAQVDTAIPAILDGDVPNDAQVRVGGILASVNRRVNKNG
MPWASAQLEDLTGGIEVMFFPHTYSSYGADIVDDAVVLVNAKVAVRDDRIALIANDLTVPDFSNAEVERPLAVSLPTRQC
TFDKVSALKQVLARHPGTSQVHLRLISGDRITTLALDQSLRVTPSPALMGDLKELLGPGCLGS

Specific function: DNA polymerase III is a complex, multichain enzyme responsible for most of the replicative synthesis in bacteria. This DNA polymerase also exhibits 3' to 5' exonuclease activity. The alpha chain is the DNA polymerase.

COG id: COG0587

COG function: function code L; DNA polymerase III, alpha subunit

Gene ontology:

Cell location: Cytoplasm

Metabolic importance: Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the DNA polymerase type-C family. DnaE subfamily

Homologues:

Organism=Escherichia coli, GI1786381, Length=1196, Percent_Identity=36.0367892976589, Blast_Score=699, Evalue=0.0

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): DPO3A_MYCBO (P63978)

Other databases:

- EMBL:   BX248339
- RefSeq:   NP_855226.1
- ProteinModelPortal:   P63978
- EnsemblBacteria:   EBMYCT00000015260
- GeneID:   1092442
- GenomeReviews:   BX248333_GR
- KEGG:   mbo:Mb1574
- GeneTree:   EBGT00050000016568
- HOGENOM:   HBG734490
- OMA:   QIVTFGT
- ProtClustDB:   PRK05673
- BioCyc:   MBOV233413:MB1574-MONOMER
- BRENDA:   2.7.7.7
- GO:   GO:0005737
- InterPro:   IPR011708
- InterPro:   IPR004365
- InterPro:   IPR004013
- InterPro:   IPR003141
- InterPro:   IPR016195
- InterPro:   IPR004805
- SMART:   SM00481
- TIGRFAMs:   TIGR00594

Pfam domain/function: PF07733 DNA_pol3_alpha; PF02811 PHP; PF01336 tRNA_anti; SSF89550 PHP-like

EC number: 2.7.7.7

Molecular weight: Translated: 129324; Mature: 129193

Theoretical pI: Translated: 5.57; Mature: 5.57

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.9 %Cys     (Translated Protein)
2.5 %Met     (Translated Protein)
3.5 %Cys+Met (Translated Protein)
0.9 %Cys     (Mature Protein)
2.5 %Met     (Mature Protein)
3.4 %Cys+Met (Mature Protein)
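Percentages of this kind can be recomputed from either protein sequence. The sketch below assumes each value is the residue count divided by the sequence length, rounded to one decimal place; the rounding rule and the helper name `cys_met_content` are assumptions, not taken from this record.

```python
def cys_met_content(protein: str) -> dict:
    """Percentage of Cys (C), Met (M) and Cys+Met in a protein sequence."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    cys = seq.count("C")
    met = seq.count("M")
    total = len(seq)
    return {
        "Cys": round(100.0 * cys / total, 1),
        "Met": round(100.0 * met / total, 1),
        "Cys+Met": round(100.0 * (cys + met) / total, 1),
    }


# Toy input: 1 Cys and 1 Met in 4 residues.
print(cys_met_content("MCAA"))  # {'Cys': 25.0, 'Met': 25.0, 'Cys+Met': 50.0}
```

Because the combined figure is rounded from the combined count rather than summed from the two rounded values, it can differ in the last digit from the sum of the separate Cys and Met lines.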

Secondary structure:

>Translated Secondary Structure
MSGSSAGSSFVHLHNHTEYSMLDGAAKITPMLAEVERLGMPAVGMTDHGNMFGASEFYNS
CCCCCCCCCEEEEECCCCHHHHCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHH
ATKAGIKPIIGVEAYIAPGSRFDTRRILWGDPSQKADDVSGSGSYTHLTMMAENATGLRN
HHHHCCCCEECEEEEECCCCCCCCCEEEECCCCCCCCCCCCCCCEEEEEEEECCCHHHHH
LFKLSSHASFEGQLSKWSRMDAELIAEHAEGIIITTGCPSGEVQTRLRLGQDREALEAAA
HHHHHCCCCCCHHHHHHHHCCHHHHHHHCCCEEEECCCCCCCHHHHHHCCCCHHHHHHHH
KWREIVGPDNYFLELMDHGLTIERRVRDGLLEIGRALNIPPLATNDCHYVTRDAAHNHEA
HHHHHCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCEEECCCCCCCCE
LLCVQTGKTLSDPNRFKFDGDGYYLKSAAEMRQIWDDEVPGACDSTLLIAERVQSYADVW
EEEEECCCCCCCCCCEEECCCCEEEHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHC
TPRDRMPVFPVPDGHDQASWLRHEVDAGLRRRFPAGPPDGYRERAAYEIDVICSKGFPSY
CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCHHCCCEEEEEEECCCCCHH
FLIVADLISYARSAGIRVGPGRGSAAGSLVAYALGITDIDPIPHGLLFERFLNPERTSMP
HHHHHHHHHHHHHCCCEECCCCCCCHHHHHHHHHCCCCCCCCCHHHHHHHHCCCCCCCCC
DIDIDFDDRRRGEMVRYAADKWGHDRVAQVITFGTIKTKAALKDSARIHYGQPGFAIADR
CCCCCCCCCCCCHHHHHHHHHCCHHHHHHHHHHCCHHHHHHHCCCCEEEECCCCHHHHHH
ITKALPPAIMAKDIPLSGITDPSHERYKEAAEVRGLIETDPDVRTIYQTARGLEGLIRNA
HHHHCCHHHHCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCC
GVHACAVIMSSEPLTEAIPLWKRPQDGAIITGWDYPACEAIGLLKMDFLGLRNLTIIGDA
CHHEEEEEECCCCHHHHCCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHCCCCEEEEEHH
IDNVRANRGIDLDLESVPLDDKATYELLGRGDTLGVFQLDGGPMRDLLRRMQPTGFEDVV
HHHHHCCCCCCCCCCCCCCCCCHHHHHHCCCCEEEEEEECCCCHHHHHHHHCCCCHHHHH
AVIALYRPGPMGMNAHNDYADRKNNRQAIKPIHPELEEPLREILAETYGLIVYQEQIMRI
HHHHHCCCCCCCCCCCCCHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
AQKVASYSLARADILRKAMGKKKREVLEKEFEGFSDGMQANGFSPAAIKALWDTILPFAD
HHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHH
YAFNKSHAAGYGMVSYWTAYLKANYPAEYMAGLLTSVGDDKDKAAVYLADCRKLGITVLP
HHCCCCCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCHHEEEEEHHHHCCCEEEC
PDVNESGLNFASVGQDIRYGLGAVRNVGANVVGSLLQTRNDKGKFTDFSDYLNKIDISAC
CCCCCCCCCHHHHCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHH
NKKVTESLIKAGAFDSLGHARKGLFLVHSDAVDSVLGTKKAEALGQFDLFGSNDDGTGTA
HHHHHHHHHHCCCCHHHCHHHCCEEEEECHHHHHHHCCHHHHHHCCEEEECCCCCCCCCC
DPVFTIKVPDDEWEDKHKLALEREMLGLYVSGHPLNGVAHLLAAQVDTAIPAILDGDVPN
CCEEEEECCCCCCCCHHHHHHHHHHHEEEECCCCCHHHHHHHHHHHHHHCCHHCCCCCCC
DAQVRVGGILASVNRRVNKNGMPWASAQLEDLTGGIEVMFFPHTYSSYGADIVDDAVVLV
CCEEEEHHHHHHHHHHHCCCCCCCCCCHHHHCCCCEEEEEECCCCHHCCCHHHCCEEEEE
NAKVAVRDDRIALIANDLTVPDFSNAEVERPLAVSLPTRQCTFDKVSALKQVLARHPGTS
EEEEEEECCCEEEEECCCCCCCCCCCCCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCCC
QVHLRLISGDRITTLALDQSLRVTPSPALMGDLKELLGPGCLGS
EEEEEEECCCEEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCC
>Mature Secondary Structure 
SGSSAGSSFVHLHNHTEYSMLDGAAKITPMLAEVERLGMPAVGMTDHGNMFGASEFYNS
CCCCCCCCEEEEECCCCHHHHCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHH
ATKAGIKPIIGVEAYIAPGSRFDTRRILWGDPSQKADDVSGSGSYTHLTMMAENATGLRN
HHHHCCCCEECEEEEECCCCCCCCCEEEECCCCCCCCCCCCCCCEEEEEEEECCCHHHHH
LFKLSSHASFEGQLSKWSRMDAELIAEHAEGIIITTGCPSGEVQTRLRLGQDREALEAAA
HHHHHCCCCCCHHHHHHHHCCHHHHHHHCCCEEEECCCCCCCHHHHHHCCCCHHHHHHHH
KWREIVGPDNYFLELMDHGLTIERRVRDGLLEIGRALNIPPLATNDCHYVTRDAAHNHEA
HHHHHCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCEEECCCCCCCCE
LLCVQTGKTLSDPNRFKFDGDGYYLKSAAEMRQIWDDEVPGACDSTLLIAERVQSYADVW
EEEEECCCCCCCCCCEEECCCCEEEHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHC
TPRDRMPVFPVPDGHDQASWLRHEVDAGLRRRFPAGPPDGYRERAAYEIDVICSKGFPSY
CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCHHCCCEEEEEEECCCCCHH
FLIVADLISYARSAGIRVGPGRGSAAGSLVAYALGITDIDPIPHGLLFERFLNPERTSMP
HHHHHHHHHHHHHCCCEECCCCCCCHHHHHHHHHCCCCCCCCCHHHHHHHHCCCCCCCCC
DIDIDFDDRRRGEMVRYAADKWGHDRVAQVITFGTIKTKAALKDSARIHYGQPGFAIADR
CCCCCCCCCCCCHHHHHHHHHCCHHHHHHHHHHCCHHHHHHHCCCCEEEECCCCHHHHHH
ITKALPPAIMAKDIPLSGITDPSHERYKEAAEVRGLIETDPDVRTIYQTARGLEGLIRNA
HHHHCCHHHHCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCC
GVHACAVIMSSEPLTEAIPLWKRPQDGAIITGWDYPACEAIGLLKMDFLGLRNLTIIGDA
CHHEEEEEECCCCHHHHCCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHCCCCEEEEEHH
IDNVRANRGIDLDLESVPLDDKATYELLGRGDTLGVFQLDGGPMRDLLRRMQPTGFEDVV
HHHHHCCCCCCCCCCCCCCCCCHHHHHHCCCCEEEEEEECCCCHHHHHHHHCCCCHHHHH
AVIALYRPGPMGMNAHNDYADRKNNRQAIKPIHPELEEPLREILAETYGLIVYQEQIMRI
HHHHHCCCCCCCCCCCCCHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
AQKVASYSLARADILRKAMGKKKREVLEKEFEGFSDGMQANGFSPAAIKALWDTILPFAD
HHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHH
YAFNKSHAAGYGMVSYWTAYLKANYPAEYMAGLLTSVGDDKDKAAVYLADCRKLGITVLP
HHCCCCCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCHHEEEEEHHHHCCCEEEC
PDVNESGLNFASVGQDIRYGLGAVRNVGANVVGSLLQTRNDKGKFTDFSDYLNKIDISAC
CCCCCCCCCHHHHCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHH
NKKVTESLIKAGAFDSLGHARKGLFLVHSDAVDSVLGTKKAEALGQFDLFGSNDDGTGTA
HHHHHHHHHHCCCCHHHCHHHCCEEEEECHHHHHHHCCHHHHHHCCEEEECCCCCCCCCC
DPVFTIKVPDDEWEDKHKLALEREMLGLYVSGHPLNGVAHLLAAQVDTAIPAILDGDVPN
CCEEEEECCCCCCCCHHHHHHHHHHHEEEECCCCCHHHHHHHHHHHHHHCCHHCCCCCCC
DAQVRVGGILASVNRRVNKNGMPWASAQLEDLTGGIEVMFFPHTYSSYGADIVDDAVVLV
CCEEEEHHHHHHHHHHHCCCCCCCCCCHHHHCCCCEEEEEECCCCHHCCCHHHCCEEEEE
NAKVAVRDDRIALIANDLTVPDFSNAEVERPLAVSLPTRQCTFDKVSALKQVLARHPGTS
EEEEEEECCCEEEEECCCCCCCCCCCCCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCCC
QVHLRLISGDRITTLALDQSLRVTPSPALMGDLKELLGPGCLGS
EEEEEEECCCEEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCC
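The prediction strings above pair each residue with one state letter. A minimal sketch for summarising such a string follows, assuming the common three-state convention in which H is helix, E is strand and C is coil; the record does not define the letters, so that reading is an assumption, as is the helper name.

```python
from collections import Counter


def state_fractions(prediction: str) -> dict:
    """Percentage of each state letter in a secondary structure string."""
    states = "".join(prediction.split()).upper()
    if not states:
        raise ValueError("empty prediction")
    counts = Counter(states)
    return {state: 100.0 * n / len(states) for state, n in sorted(counts.items())}


# Toy input: 4 helix, 2 strand and 2 coil positions.
print(state_fractions("HHHHEECC"))  # {'C': 25.0, 'E': 25.0, 'H': 50.0}
```

To apply it to this record, concatenate only the state lines (every second line of a block), not the interleaved residue lines.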

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 12788972