Definition Trichodesmium erythraeum IMS101 chromosome, complete genome.
Accession NC_008312
Length 7,750,108

The map label for this gene is glpQ [H]

Identifier: 113476371

GI number: 113476371

Start: 4307061

End: 4311179

Strand: Reverse

Name: glpQ [H]

Synonym: Tery_2777

Alternate gene names: 113476371

Gene position: 4311179-4307061 (Counterclockwise)

Preceding gene: 113476373

Following gene: 113476367

Centisome position: 55.63
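
The coordinate fields above are related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming that the centisome position is the gene's first transcribed base (the higher coordinate for a reverse-strand gene) expressed as a percentage of the chromosome length. The function names are illustrative and not part of the record.

```python
GENOME_LENGTH = 7_750_108            # "Length" field of the record header
START, END = 4_307_061, 4_311_179    # "Start" and "End" fields
STRAND = "Reverse"                   # "Strand" field


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, end: int, strand: str, genome_length: int) -> float:
    """Percent of the way around the chromosome (assumed convention)."""
    # For a reverse-strand gene the first transcribed base is the higher coordinate.
    first_base = end if strand == "Reverse" else start
    return 100.0 * first_base / genome_length


print(gene_length(START, END))                                  # 4119
print(round(centisome(START, END, STRAND, GENOME_LENGTH), 2))   # 55.63
```

Under these assumptions the results agree with the 4119-base gene sequence and the reported centisome position of 55.63.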

GC content: 39.79

Gene sequence:

>4119_bases
ATGGCACGATTAAAAAGTTTTACCGTATTATCAGCGGATACATTTGCAGAAGGTCCCCCATCAGGAAAATTTATTGAAAC
AAATAATCGAGATGTACCATTTGATAGTCAACCAGTCCAAGGGTTTAGTGGTGTTGAGTTTGCTCCCGGTAGCGAAGATG
GCAGTACTTATTGGTTTCTATCTGATAATGGTTTTGGCTCTCAAGGAAATAGTGCTGATTATTTGCTCAGACTATATAAA
GTTAACCCGAACTTTGCTGGAGCAGAAAATGGCGATACTAAGGTTGAGGTTCAAGATTTTATTCAACTAGCAGATCCTAA
TAATCTTATACCCTTTGATATTTTCAATGAAGATACAAGCGATCGCTTCTTAACTGGAGCTGACTTTGATACAGAGTCTT
TTGTTATAGATAATAATGGAGATATTTGGGTCGGAGACGAATTTGGACCTTATATTTTACACTTCGATAGCACGGGAACT
TTGCTTGAGGCTCCCATTGCCACACCCAACATAGAAATTTCTTTTAACTCTCTGAGTGGGGAAGCGCCTTTAGTCATTGG
ACACCGGGGTGCAAGTGGTTCTCGTCCCGAACATACTCTAGAAGCTTATAGACTAGCGATCGAAAAAGGTGCAGATTTTA
TTGAACCAGACCTGGTAATTACTGCCGATGGTGTGCTTATTGCCCGACACGAACCACTGTTAGACGATACTACAAATGTT
GCTGAAGTATTCGCTCCAGAACGTCAAAGCACTAAGTTTCTTGATGGAGTAGAAACTACTGGCTATTTTGCTGAAGATTT
CACCCTTGAAGAAATCAAACAATTACGCGCTGTTCAACCTCGAGATTTCCGTGATCAATCTTTTAACGGTCAATTTGAAA
TTCCCACCTTCCAAGAAGTTATCGAGTTGGTACAAAAAATGGAAGAATCTGGGTTTGAGGTCGGTATCTATCCGGAAACT
AAACATCCTACGTTCTTTGACCAACAAGGTTTATCCTTAGAAGAACCTCTAATTCAAACCCTACAAAATACAAGTTTCAC
TGACCCAGATCGTATTTTCATCCAGTCGTTTGAGTTTGCTAACTTAATTGACTTACAAAAACAGTTGAATGCTCAAAGTC
TCGGTGATATTCCTTTAGTTCAACTGTATGGAAATACTACTGAAGGAGCAGATCCAAACAGTACCTTTTCTTCCCCCTTC
GACATTAGGTTTAATGTTGCTCAAGATAATGACTTAGCAGCTATCTATGGTCAAGAATTCCTAAATGCAGTAGAAGTTTC
TCTCTCAGAAAATACAGTTTATGCAGACCTTGACAGTGAAGAATTTCTGGAAGTTATTAGTGGTCTATATGCAGAAGGTG
TTGGAACTTGGAAGAATAACATTCTGTTGCGTGAGTCCCTTGATACTCGAGTCGATGGTAACGGAGATGGTGTAGCCGAA
ATAAGTACCCAGTTAACAGGAGAAATTACTTCCTTTATCGAAGATGCTCACGATGCAGGTTTACAGGTTCATCCCTACAC
TTTACGGGATGAAGAGCGTTACCTTACCCTCAAGCCTGATCTAGAAGCTTTTGGCACAGGAACACCTCAAACTGCGGAGG
AGGAATTTCAACAACTAATAGATATAGGTGTAGATGGTTTCTTTACTGACTTCCCTGGGACAGGTCGTAAAGTTCTAGAA
CAAAACGACTCTAATGATGAAGTTCGCTCTCCCCAAAACCCAGAAATTTCCTTTAACTCTCTGAGTGGGGAAGCGCCTTT
AGTCATTGGACACCGGGGTGCAAGTGGTTCTCGTCCCGAACATACTCTAGAAGCTTATAGACTAGCGATCGAAAAAGGTG
CAGATTTTATTGAACCAGACCTGGTAATTACTGCCGATGGTGTGCTTATTGCCCGACACGAGCCACTGTTAGACGATACT
ACAAATGTTGCTGAAGTATTCGCTTCAGAACGTCAAAGCACTAAGTTTCTTGATGGAGTAGAAACTACTGGCTATTTTGC
TGAAGATTTCACCCTTGAAGAAATCAAACAATTACGCGCTGTTCAACCTCGAGATTTCCGTGATCAATCTTTTAACGGTC
AATTTGAAATTCCCACCTTCCAAGAAGTTATCGAGTTGGTACAAAAAATGGAAGAATCTGGGTTTGAGGTCGGTATCTAT
CCGGAAACTAAACATCCTACGTTCTTTGACCAACAAGGTTTATCCTTAGAAGAACCTCTAATTCAAACCCTACAAAATAC
AAGTTTCACTGACCCAGATCGTATTTTCATCCAGTCGTTTGAGTTTGCTAACTTAATTGACTTACAAAAACAGTTGAATG
CTCAAAGTCTCGGTGATATTCCTTTAGTTCAACTGTATGGAAATACTACAGAAGGAGCAGATCCAAACAGTACCTTTTCT
TCCCCCTTCGACATTAGGTTTAATGTTGCTCAAGATAACGACTTAGCAGCTATCTATGGTCAAGCATTCCTAAATGCAGT
AGAAGTTTCTCTCTCAGAAAATACAGTTTATGCAGACCTTGACAGTGAAGAATTTCTGAAAGTTATTAGTGGTATATATG
CAGAAGGTGTTGGAACTTGGAAGAATAACATTCTGTTGCGTGAGTCCCTTGATACTCGAGTCGATGGTAACGGAGATGGT
GTAGCTGAAATAAGTACCCAGTTAACAGGAGAAATTACTTCCTTTATCGAAGATGCTCACGATGCAGGTTTACAGGTTCA
TCCCTACACTTTACGGGATGAAGAGCGTTACCTTACCCTCAAGTCTGATCTAGAAGCTTTTGGCACAGGAACACCTCAAA
CTGCGGAGGAGGAATTTCAACAACTAATAGATATAGGTGTAGATGGTTTCTTTACTGACTTCCCTGGGACAGGTCGTAAA
GTTCTAGAACAAAACGACCCTAATTTGGCTACTTCCAGCGGTTATAAAGGTATGGCATTCAGTCGCGATCGCCAAACTTT
CTATCCTTTATTAGGGGGTACAGTTGAAGGCGATCCAGATAATGCTTTACGAATTTACGAGTTCGACTCTGTTTCTAGCT
CTTTTGAGGAGGAGCTAGTTGGTTTCTATCCTACTGATGTTACTGGTCATACTATCGGTGACTTTACTCCTATTAATGAA
AGTGAATTTTTAGTTATTGAACTGGATGATCGTCAAGGGAATGAAGCTGAGTTCAAAAAAATATTTAAAATTGACATTTA
TGAAGTTGATGATCAGGGTTTTGTTGAAAAAGAGGAAATTGTAGACTTATTAGATATTGAGGATCCTAATGACCTTAATG
GTGATGGAGATACTAGCTTTAACTTCCCTTTTTCCAGCATTGAGAATATGTTAGTTATTGATGAAAACACTATTCTGGTT
GCCAACGACAACAACTATCCTTTCTCCAAAAGCCGAGGAGATGATATTGATAATACCGAAATAATTCAGATAGAGCTAGA
ACAACCTCTCCTTGTTGATAAGGGTTTGCTAATTAAAGGTACTCCTAAAACAGAGCGTCTCATTGGTGGTGTTGGAGACG
ATACTATTATCGGTTTGAATGGCGCGGATACCTTAGCAGGTCGTATTGGTAATGATCAAATAATTGGTAGTCGTGGCAAT
GACTTATTGTTGGGTCAGGATGGTAATGATGAACTTCAAGGTCGTCAAGGCAACGATCACCTCTTGGGTGGTGATGGTGA
TGATCAGTTAAATGGTGGTCAAGGGCGTGATCGCATCAATGGCGGTCCTGGTGATGATACTTTGACTGGTGGTGCAAGTA
TTGACCGCTTTATCTTCAATAGTAATGAAGAATTTGAGTCTGATAATTTTGGTATTGATACTATCAATAATTTCCAACTA
GACCTCAGGACAAATGAAGATGGAAACCAAGGCGACTTAATTCTTCTTGATAAGTCAAGCTTTACTGAGCTTAATAGTAC
TGCTGGAATTGGTTTTAGTGTTAATAATGATTTTGAAACCGTTAACACTGATAATAGTGTAGATGACTCTGATGCTTTCA
TTGTTTATAGTGAAAAGTCTGGAGGTCTATTCTACAATGTAGATGGTGAGTCTACTCAGTTTGCTATCCTTGATGGTGCT
CCCAATATTACTGAGGATAATTTCCAAATTAGAAATTAG
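
The GC content field is presumably the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation, assuming an unambiguous A/C/G/T sequence; `gc_content` is an illustrative helper, and the example uses only the first 20 bases of the sequence above rather than the full gene.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First 20 bases of the gene sequence (4 G + 2 C out of 20).
print(gc_content("ATGGCACGATTAAAAAGTTT"))  # 30.0
```

Applying the same function to the full sequence, with line breaks removed, should give a value comparable to the reported GC content.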

Upstream 100 bases:

>100_bases
TTGCACATAAACTAAATATTGACATTTACTTAAAACTGATCTAATGTAAGACAAAATAATTCTTAACAAGTTGTCAATTT
ACAAGGTAAAAAAAAATCAA

Downstream 100 bases:

>100_bases
GGAGTATATTTAACAACGGTTTTGAACTCACTAGACTCAGGTAGGTACGTTTCATAAAACGTCCGGTTCTGTTTTTAATA
TGATGTCCGGATAATAAGTA
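
Because this gene lies on the reverse strand, its upstream region sits at higher chromosome coordinates than the gene, and flanking sequences read in the gene's orientation are the reverse complement of the forward strand. The sketch below illustrates this under the assumption of 1-based inclusive coordinates; `flank_coords` and `reverse_complement` are hypothetical helpers, not functions of the source database.

```python
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Reverse-complement a DNA sequence (A/C/G/T only)."""
    return seq.translate(COMPLEMENT)[::-1]


def flank_coords(start: int, end: int, strand: str, n: int = 100):
    """Return (upstream, downstream) forward-strand ranges of n bases each."""
    left = (start - n, start - 1)    # lower-coordinate flank
    right = (end + 1, end + n)       # higher-coordinate flank
    # On the reverse strand, upstream is the higher-coordinate flank.
    return (right, left) if strand == "Reverse" else (left, right)


upstream, downstream = flank_coords(4307061, 4311179, "Reverse")
print(upstream)     # (4311180, 4311279)
print(downstream)   # (4306961, 4307060)
```

The forward-strand bases extracted from each range would then be passed through `reverse_complement` to obtain sequences oriented like the gene.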

Product: glycerophosphoryl diester phosphodiesterase

Products: NA

Alternate protein names: Glycerophosphodiester phosphodiesterase [H]

Number of amino acids: Translated: 1372; Mature: 1371

Protein sequence:

>1372_residues
MARLKSFTVLSADTFAEGPPSGKFIETNNRDVPFDSQPVQGFSGVEFAPGSEDGSTYWFLSDNGFGSQGNSADYLLRLYK
VNPNFAGAENGDTKVEVQDFIQLADPNNLIPFDIFNEDTSDRFLTGADFDTESFVIDNNGDIWVGDEFGPYILHFDSTGT
LLEAPIATPNIEISFNSLSGEAPLVIGHRGASGSRPEHTLEAYRLAIEKGADFIEPDLVITADGVLIARHEPLLDDTTNV
AEVFAPERQSTKFLDGVETTGYFAEDFTLEEIKQLRAVQPRDFRDQSFNGQFEIPTFQEVIELVQKMEESGFEVGIYPET
KHPTFFDQQGLSLEEPLIQTLQNTSFTDPDRIFIQSFEFANLIDLQKQLNAQSLGDIPLVQLYGNTTEGADPNSTFSSPF
DIRFNVAQDNDLAAIYGQEFLNAVEVSLSENTVYADLDSEEFLEVISGLYAEGVGTWKNNILLRESLDTRVDGNGDGVAE
ISTQLTGEITSFIEDAHDAGLQVHPYTLRDEERYLTLKPDLEAFGTGTPQTAEEEFQQLIDIGVDGFFTDFPGTGRKVLE
QNDSNDEVRSPQNPEISFNSLSGEAPLVIGHRGASGSRPEHTLEAYRLAIEKGADFIEPDLVITADGVLIARHEPLLDDT
TNVAEVFASERQSTKFLDGVETTGYFAEDFTLEEIKQLRAVQPRDFRDQSFNGQFEIPTFQEVIELVQKMEESGFEVGIY
PETKHPTFFDQQGLSLEEPLIQTLQNTSFTDPDRIFIQSFEFANLIDLQKQLNAQSLGDIPLVQLYGNTTEGADPNSTFS
SPFDIRFNVAQDNDLAAIYGQAFLNAVEVSLSENTVYADLDSEEFLKVISGIYAEGVGTWKNNILLRESLDTRVDGNGDG
VAEISTQLTGEITSFIEDAHDAGLQVHPYTLRDEERYLTLKSDLEAFGTGTPQTAEEEFQQLIDIGVDGFFTDFPGTGRK
VLEQNDPNLATSSGYKGMAFSRDRQTFYPLLGGTVEGDPDNALRIYEFDSVSSSFEEELVGFYPTDVTGHTIGDFTPINE
SEFLVIELDDRQGNEAEFKKIFKIDIYEVDDQGFVEKEEIVDLLDIEDPNDLNGDGDTSFNFPFSSIENMLVIDENTILV
ANDNNYPFSKSRGDDIDNTEIIQIELEQPLLVDKGLLIKGTPKTERLIGGVGDDTIIGLNGADTLAGRIGNDQIIGSRGN
DLLLGQDGNDELQGRQGNDHLLGGDGDDQLNGGQGRDRINGGPGDDTLTGGASIDRFIFNSNEEFESDNFGIDTINNFQL
DLRTNEDGNQGDLILLDKSSFTELNSTAGIGFSVNNDFETVNTDNSVDDSDAFIVYSEKSGGLFYNVDGESTQFAILDGA
PNITEDNFQIRN
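
The residue counts follow from the gene length: 4119 bases form 1373 codons, the last of which is a stop codon, leaving 1372 translated residues, and the mature form is one residue shorter because the initiator Met is dropped. A minimal translation sketch, assuming the standard genetic code; the table construction and the `translate` function are illustrative.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate in frame 1 until the first stop codon or end of sequence."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3].upper()]
        if aa == "*":          # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCACGATTAAAAAGT"))  # MARLKS (first six codons of the gene)
print(translate("ATTAGAAATTAG"))        # IRN (last three codons plus TAG stop)
print(4119 // 3 - 1)                    # 1372
```

The two fragments are taken from the start and end of the gene sequence and reproduce the first and last residues of the protein sequence listed above.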

Sequences:

>Translated_1372_residues
MARLKSFTVLSADTFAEGPPSGKFIETNNRDVPFDSQPVQGFSGVEFAPGSEDGSTYWFLSDNGFGSQGNSADYLLRLYK
VNPNFAGAENGDTKVEVQDFIQLADPNNLIPFDIFNEDTSDRFLTGADFDTESFVIDNNGDIWVGDEFGPYILHFDSTGT
LLEAPIATPNIEISFNSLSGEAPLVIGHRGASGSRPEHTLEAYRLAIEKGADFIEPDLVITADGVLIARHEPLLDDTTNV
AEVFAPERQSTKFLDGVETTGYFAEDFTLEEIKQLRAVQPRDFRDQSFNGQFEIPTFQEVIELVQKMEESGFEVGIYPET
KHPTFFDQQGLSLEEPLIQTLQNTSFTDPDRIFIQSFEFANLIDLQKQLNAQSLGDIPLVQLYGNTTEGADPNSTFSSPF
DIRFNVAQDNDLAAIYGQEFLNAVEVSLSENTVYADLDSEEFLEVISGLYAEGVGTWKNNILLRESLDTRVDGNGDGVAE
ISTQLTGEITSFIEDAHDAGLQVHPYTLRDEERYLTLKPDLEAFGTGTPQTAEEEFQQLIDIGVDGFFTDFPGTGRKVLE
QNDSNDEVRSPQNPEISFNSLSGEAPLVIGHRGASGSRPEHTLEAYRLAIEKGADFIEPDLVITADGVLIARHEPLLDDT
TNVAEVFASERQSTKFLDGVETTGYFAEDFTLEEIKQLRAVQPRDFRDQSFNGQFEIPTFQEVIELVQKMEESGFEVGIY
PETKHPTFFDQQGLSLEEPLIQTLQNTSFTDPDRIFIQSFEFANLIDLQKQLNAQSLGDIPLVQLYGNTTEGADPNSTFS
SPFDIRFNVAQDNDLAAIYGQAFLNAVEVSLSENTVYADLDSEEFLKVISGIYAEGVGTWKNNILLRESLDTRVDGNGDG
VAEISTQLTGEITSFIEDAHDAGLQVHPYTLRDEERYLTLKSDLEAFGTGTPQTAEEEFQQLIDIGVDGFFTDFPGTGRK
VLEQNDPNLATSSGYKGMAFSRDRQTFYPLLGGTVEGDPDNALRIYEFDSVSSSFEEELVGFYPTDVTGHTIGDFTPINE
SEFLVIELDDRQGNEAEFKKIFKIDIYEVDDQGFVEKEEIVDLLDIEDPNDLNGDGDTSFNFPFSSIENMLVIDENTILV
ANDNNYPFSKSRGDDIDNTEIIQIELEQPLLVDKGLLIKGTPKTERLIGGVGDDTIIGLNGADTLAGRIGNDQIIGSRGN
DLLLGQDGNDELQGRQGNDHLLGGDGDDQLNGGQGRDRINGGPGDDTLTGGASIDRFIFNSNEEFESDNFGIDTINNFQL
DLRTNEDGNQGDLILLDKSSFTELNSTAGIGFSVNNDFETVNTDNSVDDSDAFIVYSEKSGGLFYNVDGESTQFAILDGA
PNITEDNFQIRN
>Mature_1371_residues
ARLKSFTVLSADTFAEGPPSGKFIETNNRDVPFDSQPVQGFSGVEFAPGSEDGSTYWFLSDNGFGSQGNSADYLLRLYKV
NPNFAGAENGDTKVEVQDFIQLADPNNLIPFDIFNEDTSDRFLTGADFDTESFVIDNNGDIWVGDEFGPYILHFDSTGTL
LEAPIATPNIEISFNSLSGEAPLVIGHRGASGSRPEHTLEAYRLAIEKGADFIEPDLVITADGVLIARHEPLLDDTTNVA
EVFAPERQSTKFLDGVETTGYFAEDFTLEEIKQLRAVQPRDFRDQSFNGQFEIPTFQEVIELVQKMEESGFEVGIYPETK
HPTFFDQQGLSLEEPLIQTLQNTSFTDPDRIFIQSFEFANLIDLQKQLNAQSLGDIPLVQLYGNTTEGADPNSTFSSPFD
IRFNVAQDNDLAAIYGQEFLNAVEVSLSENTVYADLDSEEFLEVISGLYAEGVGTWKNNILLRESLDTRVDGNGDGVAEI
STQLTGEITSFIEDAHDAGLQVHPYTLRDEERYLTLKPDLEAFGTGTPQTAEEEFQQLIDIGVDGFFTDFPGTGRKVLEQ
NDSNDEVRSPQNPEISFNSLSGEAPLVIGHRGASGSRPEHTLEAYRLAIEKGADFIEPDLVITADGVLIARHEPLLDDTT
NVAEVFASERQSTKFLDGVETTGYFAEDFTLEEIKQLRAVQPRDFRDQSFNGQFEIPTFQEVIELVQKMEESGFEVGIYP
ETKHPTFFDQQGLSLEEPLIQTLQNTSFTDPDRIFIQSFEFANLIDLQKQLNAQSLGDIPLVQLYGNTTEGADPNSTFSS
PFDIRFNVAQDNDLAAIYGQAFLNAVEVSLSENTVYADLDSEEFLKVISGIYAEGVGTWKNNILLRESLDTRVDGNGDGV
AEISTQLTGEITSFIEDAHDAGLQVHPYTLRDEERYLTLKSDLEAFGTGTPQTAEEEFQQLIDIGVDGFFTDFPGTGRKV
LEQNDPNLATSSGYKGMAFSRDRQTFYPLLGGTVEGDPDNALRIYEFDSVSSSFEEELVGFYPTDVTGHTIGDFTPINES
EFLVIELDDRQGNEAEFKKIFKIDIYEVDDQGFVEKEEIVDLLDIEDPNDLNGDGDTSFNFPFSSIENMLVIDENTILVA
NDNNYPFSKSRGDDIDNTEIIQIELEQPLLVDKGLLIKGTPKTERLIGGVGDDTIIGLNGADTLAGRIGNDQIIGSRGND
LLLGQDGNDELQGRQGNDHLLGGDGDDQLNGGQGRDRINGGPGDDTLTGGASIDRFIFNSNEEFESDNFGIDTINNFQLD
LRTNEDGNQGDLILLDKSSFTELNSTAGIGFSVNNDFETVNTDNSVDDSDAFIVYSEKSGGLFYNVDGESTQFAILDGAP
NITEDNFQIRN

Specific function: Glycerophosphoryl diester phosphodiesterase hydrolyzes deacylated phospholipids to G3P and the corresponding alcohols [H]


COG id: COG0584

COG function: function code C; Glycerophosphoryl diester phosphodiesterase

Gene ontology:

Cell location: Periplasm [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the glycerophosphoryl diester phosphodiesterase family [H]

Homologues:

Organism=Escherichia coli, GI1788572, Length=395, Percent_Identity=30.6329113924051, Blast_Score=132, Evalue=1e-31
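
A homologue entry of this form is a comma-separated list of `key=value` pairs plus a bare GI token. A minimal parsing sketch, assuming that layout holds for every entry; `parse_homologue` is a hypothetical helper.

```python
def parse_homologue(line: str) -> dict:
    """Split a homologue line into a dict of string fields."""
    fields = {}
    for token in line.split(","):
        token = token.strip()
        if not token:                 # tolerate a trailing comma
            continue
        if "=" in token:
            key, value = token.split("=", 1)
            fields[key] = value
        elif token.startswith("GI"):  # bare GI number such as GI1788572
            fields["GI"] = token[2:]
    return fields


hit = parse_homologue(
    "Organism=Escherichia coli, GI1788572, Length=395, "
    "Percent_Identity=30.6329113924051, Blast_Score=132, Evalue=1e-31,"
)
print(hit["Organism"], hit["GI"], float(hit["Evalue"]))
```

Values are kept as strings so the caller can decide how to convert them, for example `float(hit["Percent_Identity"])`.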

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR004129
- InterPro:   IPR017946 [H]

Pfam domain/function: PF03009 GDPD [H]

EC number: 3.1.4.46 [H]

Molecular weight: Translated: 151422; Mature: 151291

Theoretical pI: Translated: 3.79; Mature: 3.79

Prosite motif: PS00330 HEMOLYSIN_CALCIUM

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.0 %Cys     (Translated Protein)
0.4 %Met     (Translated Protein)
0.4 %Cys+Met (Translated Protein)
0.0 %Cys     (Mature Protein)
0.3 %Met     (Mature Protein)
0.3 %Cys+Met (Mature Protein)
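
The Cys/Met percentages are residue counts divided by protein length, rounded to one decimal place. A minimal sketch, assuming that rounding; `residue_percent` is an illustrative helper and the example uses only the first ten residues of the protein.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    if not protein:
        raise ValueError("empty protein sequence")
    count = sum(protein.count(r) for r in residues)
    return round(100.0 * count / len(protein), 1)


fragment = "MARLKSFTVL"                   # first ten residues of the translated protein
print(residue_percent(fragment, "C"))     # 0.0
print(residue_percent(fragment, "M"))     # 10.0
print(residue_percent(fragment, "CM"))    # 10.0
```

Dropping the initiator Met lowers the Met percentage in the mature protein, which is consistent with the difference between the translated and mature values above.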

Secondary structure:

>Translated Secondary Structure
MARLKSFTVLSADTFAEGPPSGKFIETNNRDVPFDSQPVQGFSGVEFAPGSEDGSTYWFL
CCCCCEEEEEECCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCEECCCCCCCCEEEEE
SDNGFGSQGNSADYLLRLYKVNPNFAGAENGDTKVEVQDFIQLADPNNLIPFDIFNEDTS
ECCCCCCCCCCCEEEEEEEEECCCCCCCCCCCCEEEHHHHHEECCCCCCCEEEEECCCCC
DRFLTGADFDTESFVIDNNGDIWVGDEFGPYILHFDSTGTLLEAPIATPNIEISFNSLSG
CCEEECCCCCCCEEEEECCCCEEECCCCCCEEEEECCCCCEEECCCCCCCEEEEEECCCC
EAPLVIGHRGASGSRPEHTLEAYRLAIEKGADFIEPDLVITADGVLIARHEPLLDDTTNV
CCCEEEEECCCCCCCCHHHHHHHHHHHHCCCCCCCCCEEEEECCEEEEECCCCCCCCCCH
AEVFAPERQSTKFLDGVETTGYFAEDFTLEEIKQLRAVQPRDFRDQSFNGQFEIPTFQEV
HHHHCCCCCCCHHHCCCCCCCEEECCCCHHHHHHHHCCCCCCCCCCCCCCEECCCCHHHH
IELVQKMEESGFEVGIYPETKHPTFFDQQGLSLEEPLIQTLQNTSFTDPDRIFIQSFEFA
HHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCEEEEECCCHH
NLIDLQKQLNAQSLGDIPLVQLYGNTTEGADPNSTFSSPFDIRFNVAQDNDLAAIYGQEF
HHHHHHHHCCHHHCCCCEEEEEECCCCCCCCCCCCCCCCCEEEEEECCCCCEEEEHHHHH
LNAVEVSLSENTVYADLDSEEFLEVISGLYAEGVGTWKNNILLRESLDTRVDGNGDGVAE
HHHEEEEECCCEEEEECCHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCCCCCCCCHHH
ISTQLTGEITSFIEDAHDAGLQVHPYTLRDEERYLTLKPDLEAFGTGTPQTAEEEFQQLI
HHHHHHHHHHHHHHHHHHCCCEEECEEECCCCEEEEECCCHHHHCCCCCCHHHHHHHHHH
DIGVDGFFTDFPGTGRKVLEQNDSNDEVRSPQNPEISFNSLSGEAPLVIGHRGASGSRPE
HHCCCCEEECCCCCCHHHHHCCCCCCCCCCCCCCCEEEECCCCCCCEEEEECCCCCCCCH
HTLEAYRLAIEKGADFIEPDLVITADGVLIARHEPLLDDTTNVAEVFASERQSTKFLDGV
HHHHHHHHHHHCCCCCCCCCEEEEECCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCC
ETTGYFAEDFTLEEIKQLRAVQPRDFRDQSFNGQFEIPTFQEVIELVQKMEESGFEVGIY
CCCCEEECCCCHHHHHHHHCCCCCCCCCCCCCCEECCCCHHHHHHHHHHHHHCCCEEEEE
PETKHPTFFDQQGLSLEEPLIQTLQNTSFTDPDRIFIQSFEFANLIDLQKQLNAQSLGDI
CCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCEEEEECCCHHHHHHHHHHCCHHHCCCC
PLVQLYGNTTEGADPNSTFSSPFDIRFNVAQDNDLAAIYGQAFLNAVEVSLSENTVYADL
EEEEEECCCCCCCCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHEEEEECCCEEEEEC
DSEEFLKVISGIYAEGVGTWKNNILLRESLDTRVDGNGDGVAEISTQLTGEITSFIEDAH
CHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHH
DAGLQVHPYTLRDEERYLTLKSDLEAFGTGTPQTAEEEFQQLIDIGVDGFFTDFPGTGRK
HCCCEEECEEECCCCEEEEEHHHHHHHCCCCCCHHHHHHHHHHHHCCCCEEECCCCCCHH
VLEQNDPNLATSSGYKGMAFSRDRQTFYPLLGGTVEGDPDNALRIYEFDSVSSSFEEELV
HHHCCCCCCCCCCCCCCCEECCCCCEEHHHHCCCCCCCCCCCEEEEEECCCHHHHHHHHH
GFYPTDVTGHTIGDFTPINESEFLVIELDDRQGNEAEFKKIFKIDIYEVDDQGFVEKEEI
EECCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCHHHHEEEEEEEEEEECCCCCCCHHHH
VDLLDIEDPNDLNGDGDTSFNFPFSSIENMLVIDENTILVANDNNYPFSKSRGDDIDNTE
HHHHCCCCCCCCCCCCCCCCCCCHHHCCCEEEEECCEEEEECCCCCCCCCCCCCCCCCCE
IIQIELEQPLLVDKGLLIKGTPKTERLIGGVGDDTIIGLNGADTLAGRIGNDQIIGSRGN
EEEEEECCCEEECCCEEEECCCCHHHHHCCCCCCEEEECCCCHHHHCCCCCCEEECCCCC
DLLLGQDGNDELQGRQGNDHLLGGDGDDQLNGGQGRDRINGGPGDDTLTGGASIDRFIFN
EEEEECCCCHHCCCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEC
SNEEFESDNFGIDTINNFQLDLRTNEDGNQGDLILLDKSSFTELNSTAGIGFSVNNDFET
CCCCCCCCCCCCCEECCEEEEEEECCCCCCCCEEEEECCCCCCCCCCCCCEEEECCCCCC
VNTDNSVDDSDAFIVYSEKSGGLFYNVDGESTQFAILDGAPNITEDNFQIRN
CCCCCCCCCCCEEEEEEECCCCEEEEECCCCEEEEEECCCCCCCCCCCEECC
>Mature Secondary Structure 
ARLKSFTVLSADTFAEGPPSGKFIETNNRDVPFDSQPVQGFSGVEFAPGSEDGSTYWFL
CCCCEEEEEECCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCEECCCCCCCCEEEEE
SDNGFGSQGNSADYLLRLYKVNPNFAGAENGDTKVEVQDFIQLADPNNLIPFDIFNEDTS
ECCCCCCCCCCCEEEEEEEEECCCCCCCCCCCCEEEHHHHHEECCCCCCCEEEEECCCCC
DRFLTGADFDTESFVIDNNGDIWVGDEFGPYILHFDSTGTLLEAPIATPNIEISFNSLSG
CCEEECCCCCCCEEEEECCCCEEECCCCCCEEEEECCCCCEEECCCCCCCEEEEEECCCC
EAPLVIGHRGASGSRPEHTLEAYRLAIEKGADFIEPDLVITADGVLIARHEPLLDDTTNV
CCCEEEEECCCCCCCCHHHHHHHHHHHHCCCCCCCCCEEEEECCEEEEECCCCCCCCCCH
AEVFAPERQSTKFLDGVETTGYFAEDFTLEEIKQLRAVQPRDFRDQSFNGQFEIPTFQEV
HHHHCCCCCCCHHHCCCCCCCEEECCCCHHHHHHHHCCCCCCCCCCCCCCEECCCCHHHH
IELVQKMEESGFEVGIYPETKHPTFFDQQGLSLEEPLIQTLQNTSFTDPDRIFIQSFEFA
HHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCEEEEECCCHH
NLIDLQKQLNAQSLGDIPLVQLYGNTTEGADPNSTFSSPFDIRFNVAQDNDLAAIYGQEF
HHHHHHHHCCHHHCCCCEEEEEECCCCCCCCCCCCCCCCCEEEEEECCCCCEEEEHHHHH
LNAVEVSLSENTVYADLDSEEFLEVISGLYAEGVGTWKNNILLRESLDTRVDGNGDGVAE
HHHEEEEECCCEEEEECCHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCCCCCCCCHHH
ISTQLTGEITSFIEDAHDAGLQVHPYTLRDEERYLTLKPDLEAFGTGTPQTAEEEFQQLI
HHHHHHHHHHHHHHHHHHCCCEEECEEECCCCEEEEECCCHHHHCCCCCCHHHHHHHHHH
DIGVDGFFTDFPGTGRKVLEQNDSNDEVRSPQNPEISFNSLSGEAPLVIGHRGASGSRPE
HHCCCCEEECCCCCCHHHHHCCCCCCCCCCCCCCCEEEECCCCCCCEEEEECCCCCCCCH
HTLEAYRLAIEKGADFIEPDLVITADGVLIARHEPLLDDTTNVAEVFASERQSTKFLDGV
HHHHHHHHHHHCCCCCCCCCEEEEECCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCC
ETTGYFAEDFTLEEIKQLRAVQPRDFRDQSFNGQFEIPTFQEVIELVQKMEESGFEVGIY
CCCCEEECCCCHHHHHHHHCCCCCCCCCCCCCCEECCCCHHHHHHHHHHHHHCCCEEEEE
PETKHPTFFDQQGLSLEEPLIQTLQNTSFTDPDRIFIQSFEFANLIDLQKQLNAQSLGDI
CCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCEEEEECCCHHHHHHHHHHCCHHHCCCC
PLVQLYGNTTEGADPNSTFSSPFDIRFNVAQDNDLAAIYGQAFLNAVEVSLSENTVYADL
EEEEEECCCCCCCCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHEEEEECCCEEEEEC
DSEEFLKVISGIYAEGVGTWKNNILLRESLDTRVDGNGDGVAEISTQLTGEITSFIEDAH
CHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHH
DAGLQVHPYTLRDEERYLTLKSDLEAFGTGTPQTAEEEFQQLIDIGVDGFFTDFPGTGRK
HCCCEEECEEECCCCEEEEEHHHHHHHCCCCCCHHHHHHHHHHHHCCCCEEECCCCCCHH
VLEQNDPNLATSSGYKGMAFSRDRQTFYPLLGGTVEGDPDNALRIYEFDSVSSSFEEELV
HHHCCCCCCCCCCCCCCCEECCCCCEEHHHHCCCCCCCCCCCEEEEEECCCHHHHHHHHH
GFYPTDVTGHTIGDFTPINESEFLVIELDDRQGNEAEFKKIFKIDIYEVDDQGFVEKEEI
EECCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCHHHHEEEEEEEEEEECCCCCCCHHHH
VDLLDIEDPNDLNGDGDTSFNFPFSSIENMLVIDENTILVANDNNYPFSKSRGDDIDNTE
HHHHCCCCCCCCCCCCCCCCCCCHHHCCCEEEEECCEEEEECCCCCCCCCCCCCCCCCCE
IIQIELEQPLLVDKGLLIKGTPKTERLIGGVGDDTIIGLNGADTLAGRIGNDQIIGSRGN
EEEEEECCCEEECCCEEEECCCCHHHHHCCCCCCEEEECCCCHHHHCCCCCCEEECCCCC
DLLLGQDGNDELQGRQGNDHLLGGDGDDQLNGGQGRDRINGGPGDDTLTGGASIDRFIFN
EEEEECCCCHHCCCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEC
SNEEFESDNFGIDTINNFQLDLRTNEDGNQGDLILLDKSSFTELNSTAGIGFSVNNDFET
CCCCCCCCCCCCCEECCEEEEEEECCCCCCCCEEEEECCCCCCCCCCCCCEEEECCCCCC
VNTDNSVDDSDAFIVYSEKSGGLFYNVDGESTQFAILDGAPNITEDNFQIRN
CCCCCCCCCCCEEEEEEECCCCEEEEECCCCEEEEEECCCCCCCCCCCEECC
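
The secondary structure blocks interleave a sequence line with a prediction line of equal length. The sketch below separates the two and summarises the prediction, assuming the common convention that H is helix, E is strand and C is coil (the record itself does not define the letters); both helpers are illustrative.

```python
from collections import Counter


def split_interleaved(lines):
    """Join alternating sequence / prediction lines into two strings."""
    sequence = "".join(lines[0::2])
    states = "".join(lines[1::2])
    if len(sequence) != len(states):
        raise ValueError("sequence and prediction lengths differ")
    return sequence, states


def ss_composition(states: str) -> dict:
    """Percentage of H, E and C states in a prediction string."""
    counts = Counter(states)
    total = sum(counts[s] for s in "HEC")
    return {s: round(100.0 * counts[s] / total, 1) for s in "HEC"}


# Toy block in the same layout as the record (not real data).
seq, ss = split_interleaved(["MARL", "CCEE", "KSFT", "HHHC"])
print(seq, ss)              # MARLKSFT CCEEHHHC
print(ss_composition(ss))   # {'H': 37.5, 'E': 25.0, 'C': 37.5}
```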

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 1851953; 9205837; 9278503; 3329281; 8899705 [H]