Definition Herpetosiphon aurantiacus ATCC 23779 chromosome, complete genome.
Accession NC_009972
Length 6,346,587

The map label for this gene is cpt [H]

Identifier: 159897712

GI number: 159897712

Start: 1358498

End: 1361683

Strand: Reverse

Name: cpt [H]

Synonym: Haur_1183

Alternate gene names: 159897712

Gene position: 1361683-1358498 (Counterclockwise)

Preceding gene: 159897713

Following gene: 159897711

Centisome position: 21.46
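
The coordinate fields above are related by simple arithmetic, sketched below in Python. The rule used here (centisome position = 100 x first base of the gene / genome length, with the first base of a reverse-strand gene taken as the higher coordinate) is an assumption, not something the record states; it does reproduce the listed value. The function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 6_346_587
START, END = 1_358_498, 1_361_683   # 1-based, inclusive coordinates
STRAND = "Reverse"


def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive 1-based interval."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the genome length."""
    return round(100.0 * position / genome_length, 2)


# Assumption: on the reverse strand the gene begins at the higher coordinate.
gene_start = END if STRAND == "Reverse" else START

length_bp = gene_length(START, END)
codons = length_bp // 3        # includes the stop codon
residues = codons - 1          # stop codon is not translated

print(length_bp, residues, centisome(gene_start, GENOME_LENGTH))
```

The computed length agrees with the ">3186_bases" header of the gene sequence, and the residue count agrees with the 1061 amino acids reported further down.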

GC content: 50.44

Gene sequence:

>3186_bases
ATGGACCGTCTGAGGCATCGTTGGTCGTTAATTGGGACAATCCTAGCGTTAATTGGCTTGTGGAGTAGCTTGGTGGTGAT
TTCGCTGCCCCAACGAACCCAAGCCCAGCCTGTGGTCGAGGAACAACGGATTGTGGCGCGGATTGAGGCCAAAGATCGCG
CTGATTCACTGGCACTTAGTGCGCGAGGCCTCGATTTGTTGGAAATGCGCGATAAGCACGATTTGTTTGCATTGATTACG
CCAAGCGAATTGGCTAAATTGCAACAAGAAGGCTTTGTCGCTGAAATTGATCAAGAGCAAACTCGTTTGTTGCAAGAACC
TTCAATCATGCCAGTCCAAGGTGGATTCCGCACGGTTGAAGAAGGCTATGCCTTGCTTGATCAATGGCATGCAACCTATC
CCAACCTAACGGATTTGTTCACCTATGGGACTTCATGGGATAAAGTGACCGCTGGTGGGCCAGCAGGCTACGATTTGCGT
GGGATCACACTGACCAATTCGTTGATTCCTGGGCCAAAACCAACCTTCTTCTTAATGTCGGCGATTCATGCCCGTGAAAT
GTCAACTGCTGAATTGACCTTGCGCTATACCGAGTATTTGCTTTCGCGCTATGAAACCGACCCCGATGTGCATTGGTTGC
TTGATGAACACACAATTGTGATTGTGCCTTTTGTCAACCCCGATGGCCGCAAGATTGCCGAGCAAAGCTTATCGCAACGC
AAAAATCGCAACACGGTTGATACTTCAAGTTGTAGCGGCGTGAATATTGGGATCGACCTCAACCGCAACTCATCGTTCCA
CTGGGGCGAAGTTGATAGCCCGAATGGTGATCGTTGTGGCGCAACATGGCCTGGCGTTTCAGCTGCTTCAGAGCCAGAAG
TTGCCACCTTACAACAATGGATTCGTGGCGTATTTGCCGATCAACGTGGGCCAAGTGATACTGATCCTGCGCCAGATACC
ACAACCGGGGTCTATATCTCAATTCACTCATATAGCGATTTGGTCTTGTGGCCATATGGTCACTCAGCCCAACTTGCGCC
AAACGATGCCGATTTGCGTGGTTTGGGCAAGAAATTCGCCAGCTACAACGGCTACACACCGCAAAAATCCGACGAACTGT
ATCCAACCAGTGGTACAACCGACGATTGGGCCTATGGTGAGTTAGGGGTAGCGGCCTATACCTTTGAAATTGGGCCAGAA
TCAGGCACATGTAGCGGCTTCTTCCCAGCATTTACCTGTTTGGATGGCCAAGCTCCTGGTAATTTCTGGGGTCGCAACTT
GCCTGCCTTCTTGTATGCCTCGAAAGTTGCCCGTACACCATATTTGTTGCAACGTGGTCCCGATGCTTTGAATGTAACCG
CTCAATCGATGAGCAATGGTTACAAATTGCTGGCAACGATTAATGATGTAACCAATGGCAACCAAACAATTGCTGCTGCC
GAAGCCTATGTTGATACACCACCATGGCGAGCTGGGGCAACTGCAATTAGCCTGAGCGCAACCGATGGCAGTTTCAACAG
CACCCAAGAAGCGGTCAATGCAACCATTCCGCAAACCTTGAATGCTGGTCGCCACTTAGTCTATTTCCGTGGGCGCGATG
CTGCGGGCAACTGGGGGCCGGTTAGCGCTCAATGGCTCGATGTTGCACCGCAAGGCTTGGTTGGGTTTGTCCGCGCAAGC
GATAACAATCAGCCAATTGCCAATGCAACGGTCGTCGCCACAACTGGCACGTTTACCAGCACGACGACCAGCGGCGCTGA
TGGCAGTTATCGTTTGGAATTGCCAGTTGGTAGCTACACGCTCAAGGCCAGTGGCACAGGCTTGACTCCTGCTAGCTACA
ACCTGACTGTTAGCAGCAATAGTTTCACAACCCAAGATATTAGTTTGGCGCAGTTGGCAGTCTTGACGACCTCGCCCAGT
CCATTGACCTTCAACGTGGCCAGTGGCAGCCAAGATCGCACGTTGGTGGTGGGCAATGCTGGTGGTACAAGCTTGAATGC
AGCCATCTCACTCGCTCCAACTGGCTATGAAGTTAAGAGCAGTGACGATGCTGGTGGCCCAAGCTATACTTGGAACGACA
TTAGTAGCACAGGTACACGCCTCAGTTTGGGCGATGATACCTGTTCGGTGGTGAACTTGCCAAGCAGCTTCAATTACTAT
GGCACCGCCTATAGCAAATTGATTGTCAACAGCAATGGTTTTGTTAGCCCAACCAATGCCACTACCTGTAGCTCAACTGG
TACATCGACCAACGGCGTTGTGCCAAGCACGTCAACGCCCAACAATGTGATTGCAGCCTTGTGGGACGACCTTGATCCTG
AAGGTTTGACGGGCACGAACGGGGTCTTTACCTATAACGATAGTGCCAACAATCAATTTATTGTTGAATTTAGTGGTGTG
CCACACTGGGCAAACAATGGCAACTTCAGCCCCGAAGATTTCCAATTTGTGCTGAATCTGACAACTGGCGATGTTACGCT
CAATTATCAAAACATTGATACCCAAAATAGCGTCAGTGTTGGTATCGAAGATAGCACTGGGGCCAATGGCTATCAGTGGG
TCTATAACAGCACTGGCCGTTTACACGATAATTTGGCAATTCAATTTGCGGCCTATGCTGGCAGTGCCCCATGGCTCAGT
TGGACACCTAGCAACATTGATGTAGCAGCGCGTGGTTCGACGAACGTCCAAGTAACCGCCAATGCTACAGGCCTTGCCAA
TGGCACCTATCGCACTCGCTTACGAGTTAACGCAGGAGCCAACACGATCAACGGCGACCAAACGGTTCCAGTGGTTTTGA
ATGTTGGTAGCTCGACAGTCCATGATGTTGCAGTTAGTGCACCGCAAGCAGCCTTGAGTGGGTTCGTTGGTTCAACCATC
ACCTACACGCTGAGCGTCACTAACACAGGCAATGTCAGCGATAGCTTTAACTTGAGCTTGAGTGGCAATGTTTGGCCAAC
AACCTTGAGCCAAACCAGTGTTAATTTGGCGGCTGGTGCAAGCACTACAATTCAAGTGTCGGTGGCAATTCCGGCCAATG
CTGCCGCCAATAGCACGGATAGCGTGACTATTACGGCAACGTCGGCTGCCGATAGCAGTGCAACCAATAGCATCAGCTTG
GTTTCAACCGCTAATAGCATTCCAGTCAGCCAATATAAGGTATTTATGCCCTATATTGTGAAGTAG

Upstream 100 bases:

>100_bases
GCCTCAATGAAGAGCAGGTATTCCCCAAAACTTCCCTAGCGTAACAATCGAAGGAGATTCTGGCACGGCGCTAGGCTCAC
ACTGTGTGGAGGTCGCTTGT

Downstream 100 bases:

>100_bases
TTCGAGCAGTTGGGGATTGAGGAATTTGTTGTTGGGATCGGTAACGAGGTTGTTATCGATCCCTTTTTCGATCCACGAAG
GACACGAAGAGCACGAAGGA

Product: peptidase M14 carboxypeptidase A

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 1061; Mature: 1061

Protein sequence:

>1061_residues
MDRLRHRWSLIGTILALIGLWSSLVVISLPQRTQAQPVVEEQRIVARIEAKDRADSLALSARGLDLLEMRDKHDLFALIT
PSELAKLQQEGFVAEIDQEQTRLLQEPSIMPVQGGFRTVEEGYALLDQWHATYPNLTDLFTYGTSWDKVTAGGPAGYDLR
GITLTNSLIPGPKPTFFLMSAIHAREMSTAELTLRYTEYLLSRYETDPDVHWLLDEHTIVIVPFVNPDGRKIAEQSLSQR
KNRNTVDTSSCSGVNIGIDLNRNSSFHWGEVDSPNGDRCGATWPGVSAASEPEVATLQQWIRGVFADQRGPSDTDPAPDT
TTGVYISIHSYSDLVLWPYGHSAQLAPNDADLRGLGKKFASYNGYTPQKSDELYPTSGTTDDWAYGELGVAAYTFEIGPE
SGTCSGFFPAFTCLDGQAPGNFWGRNLPAFLYASKVARTPYLLQRGPDALNVTAQSMSNGYKLLATINDVTNGNQTIAAA
EAYVDTPPWRAGATAISLSATDGSFNSTQEAVNATIPQTLNAGRHLVYFRGRDAAGNWGPVSAQWLDVAPQGLVGFVRAS
DNNQPIANATVVATTGTFTSTTTSGADGSYRLELPVGSYTLKASGTGLTPASYNLTVSSNSFTTQDISLAQLAVLTTSPS
PLTFNVASGSQDRTLVVGNAGGTSLNAAISLAPTGYEVKSSDDAGGPSYTWNDISSTGTRLSLGDDTCSVVNLPSSFNYY
GTAYSKLIVNSNGFVSPTNATTCSSTGTSTNGVVPSTSTPNNVIAALWDDLDPEGLTGTNGVFTYNDSANNQFIVEFSGV
PHWANNGNFSPEDFQFVLNLTTGDVTLNYQNIDTQNSVSVGIEDSTGANGYQWVYNSTGRLHDNLAIQFAAYAGSAPWLS
WTPSNIDVAARGSTNVQVTANATGLANGTYRTRLRVNAGANTINGDQTVPVVLNVGSSTVHDVAVSAPQAALSGFVGSTI
TYTLSVTNTGNVSDSFNLSLSGNVWPTTLSQTSVNLAAGASTTIQVSVAIPANAAANSTDSVTITATSAADSSATNSISL
VSTANSIPVSQYKVFMPYIVK
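
As a rough check that the gene sequence and the protein sequence correspond, the sketch below translates the first 21 and last 15 bases of the gene sequence listed above. It assumes the standard codon assignments (which the bacterial code, NCBI table 11, shares) and that the listed gene sequence is already in coding orientation; `translate` is an illustrative helper, not a tool used by the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":          # stop codon: end of the reading frame
            break
        protein.append(aa)
    return "".join(protein)


first_21 = "ATGGACCGTCTGAGGCATCGT"   # start of the gene sequence above
last_15 = "TATATTGTGAAGTAG"          # end of the gene sequence above

print(translate(first_21))  # MDRLRHR
print(translate(last_15))   # YIVK
```

Both fragments match the corresponding ends of the protein sequence (MDRLRHR... and ...YIVK); translating the full 3186-base sequence the same way would be expected to give all 1061 residues.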

Sequences:

>Translated_1061_residues
MDRLRHRWSLIGTILALIGLWSSLVVISLPQRTQAQPVVEEQRIVARIEAKDRADSLALSARGLDLLEMRDKHDLFALIT
PSELAKLQQEGFVAEIDQEQTRLLQEPSIMPVQGGFRTVEEGYALLDQWHATYPNLTDLFTYGTSWDKVTAGGPAGYDLR
GITLTNSLIPGPKPTFFLMSAIHAREMSTAELTLRYTEYLLSRYETDPDVHWLLDEHTIVIVPFVNPDGRKIAEQSLSQR
KNRNTVDTSSCSGVNIGIDLNRNSSFHWGEVDSPNGDRCGATWPGVSAASEPEVATLQQWIRGVFADQRGPSDTDPAPDT
TTGVYISIHSYSDLVLWPYGHSAQLAPNDADLRGLGKKFASYNGYTPQKSDELYPTSGTTDDWAYGELGVAAYTFEIGPE
SGTCSGFFPAFTCLDGQAPGNFWGRNLPAFLYASKVARTPYLLQRGPDALNVTAQSMSNGYKLLATINDVTNGNQTIAAA
EAYVDTPPWRAGATAISLSATDGSFNSTQEAVNATIPQTLNAGRHLVYFRGRDAAGNWGPVSAQWLDVAPQGLVGFVRAS
DNNQPIANATVVATTGTFTSTTTSGADGSYRLELPVGSYTLKASGTGLTPASYNLTVSSNSFTTQDISLAQLAVLTTSPS
PLTFNVASGSQDRTLVVGNAGGTSLNAAISLAPTGYEVKSSDDAGGPSYTWNDISSTGTRLSLGDDTCSVVNLPSSFNYY
GTAYSKLIVNSNGFVSPTNATTCSSTGTSTNGVVPSTSTPNNVIAALWDDLDPEGLTGTNGVFTYNDSANNQFIVEFSGV
PHWANNGNFSPEDFQFVLNLTTGDVTLNYQNIDTQNSVSVGIEDSTGANGYQWVYNSTGRLHDNLAIQFAAYAGSAPWLS
WTPSNIDVAARGSTNVQVTANATGLANGTYRTRLRVNAGANTINGDQTVPVVLNVGSSTVHDVAVSAPQAALSGFVGSTI
TYTLSVTNTGNVSDSFNLSLSGNVWPTTLSQTSVNLAAGASTTIQVSVAIPANAAANSTDSVTITATSAADSSATNSISL
VSTANSIPVSQYKVFMPYIVK
>Mature_1061_residues
MDRLRHRWSLIGTILALIGLWSSLVVISLPQRTQAQPVVEEQRIVARIEAKDRADSLALSARGLDLLEMRDKHDLFALIT
PSELAKLQQEGFVAEIDQEQTRLLQEPSIMPVQGGFRTVEEGYALLDQWHATYPNLTDLFTYGTSWDKVTAGGPAGYDLR
GITLTNSLIPGPKPTFFLMSAIHAREMSTAELTLRYTEYLLSRYETDPDVHWLLDEHTIVIVPFVNPDGRKIAEQSLSQR
KNRNTVDTSSCSGVNIGIDLNRNSSFHWGEVDSPNGDRCGATWPGVSAASEPEVATLQQWIRGVFADQRGPSDTDPAPDT
TTGVYISIHSYSDLVLWPYGHSAQLAPNDADLRGLGKKFASYNGYTPQKSDELYPTSGTTDDWAYGELGVAAYTFEIGPE
SGTCSGFFPAFTCLDGQAPGNFWGRNLPAFLYASKVARTPYLLQRGPDALNVTAQSMSNGYKLLATINDVTNGNQTIAAA
EAYVDTPPWRAGATAISLSATDGSFNSTQEAVNATIPQTLNAGRHLVYFRGRDAAGNWGPVSAQWLDVAPQGLVGFVRAS
DNNQPIANATVVATTGTFTSTTTSGADGSYRLELPVGSYTLKASGTGLTPASYNLTVSSNSFTTQDISLAQLAVLTTSPS
PLTFNVASGSQDRTLVVGNAGGTSLNAAISLAPTGYEVKSSDDAGGPSYTWNDISSTGTRLSLGDDTCSVVNLPSSFNYY
GTAYSKLIVNSNGFVSPTNATTCSSTGTSTNGVVPSTSTPNNVIAALWDDLDPEGLTGTNGVFTYNDSANNQFIVEFSGV
PHWANNGNFSPEDFQFVLNLTTGDVTLNYQNIDTQNSVSVGIEDSTGANGYQWVYNSTGRLHDNLAIQFAAYAGSAPWLS
WTPSNIDVAARGSTNVQVTANATGLANGTYRTRLRVNAGANTINGDQTVPVVLNVGSSTVHDVAVSAPQAALSGFVGSTI
TYTLSVTNTGNVSDSFNLSLSGNVWPTTLSQTSVNLAAGASTTIQVSVAIPANAAANSTDSVTITATSAADSSATNSISL
VSTANSIPVSQYKVFMPYIVK

Specific function: Able to split off hydrophobic and basic amino acids with comparable efficiency [H]

COG id: COG2866

COG function: function code E; Predicted carboxypeptidase

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the peptidase M14 family [H]

Homologues:

Organism=Homo sapiens, GI54607080, Length=259, Percent_Identity=30.5019305019305, Blast_Score=99, Evalue=3e-20,
Organism=Homo sapiens, GI61743916, Length=315, Percent_Identity=28.8888888888889, Blast_Score=98, Evalue=4e-20,
Organism=Homo sapiens, GI254540196, Length=249, Percent_Identity=31.7269076305221, Blast_Score=95, Evalue=4e-19,
Organism=Homo sapiens, GI221316749, Length=235, Percent_Identity=29.7872340425532, Blast_Score=91, Evalue=6e-18,
Organism=Homo sapiens, GI217416390, Length=310, Percent_Identity=26.7741935483871, Blast_Score=87, Evalue=1e-16,
Organism=Homo sapiens, GI4502997, Length=303, Percent_Identity=26.4026402640264, Blast_Score=79, Evalue=3e-14,
Organism=Homo sapiens, GI188536053, Length=306, Percent_Identity=27.1241830065359, Blast_Score=79, Evalue=3e-14,
Organism=Homo sapiens, GI188536051, Length=306, Percent_Identity=27.1241830065359, Blast_Score=79, Evalue=3e-14,
Organism=Homo sapiens, GI126273569, Length=303, Percent_Identity=24.7524752475248, Blast_Score=74, Evalue=1e-12,
Organism=Homo sapiens, GI27436871, Length=248, Percent_Identity=25.8064516129032, Blast_Score=71, Evalue=5e-12,
Organism=Caenorhabditis elegans, GI212645216, Length=298, Percent_Identity=30.2013422818792, Blast_Score=115, Evalue=2e-25,
Organism=Caenorhabditis elegans, GI212645214, Length=298, Percent_Identity=30.2013422818792, Blast_Score=114, Evalue=2e-25,
Organism=Caenorhabditis elegans, GI71990310, Length=253, Percent_Identity=33.596837944664, Blast_Score=110, Evalue=3e-24,
Organism=Caenorhabditis elegans, GI71990304, Length=253, Percent_Identity=33.596837944664, Blast_Score=109, Evalue=6e-24,
Organism=Caenorhabditis elegans, GI25143424, Length=305, Percent_Identity=29.8360655737705, Blast_Score=109, Evalue=9e-24,
Organism=Caenorhabditis elegans, GI32563699, Length=314, Percent_Identity=32.1656050955414, Blast_Score=107, Evalue=3e-23,
Organism=Caenorhabditis elegans, GI71990283, Length=314, Percent_Identity=32.1656050955414, Blast_Score=107, Evalue=4e-23,
Organism=Caenorhabditis elegans, GI193207630, Length=302, Percent_Identity=30.4635761589404, Blast_Score=102, Evalue=9e-22,
Organism=Caenorhabditis elegans, GI32563693, Length=244, Percent_Identity=29.5081967213115, Blast_Score=92, Evalue=1e-18,
Organism=Caenorhabditis elegans, GI71994581, Length=315, Percent_Identity=25.7142857142857, Blast_Score=77, Evalue=4e-14,
Organism=Caenorhabditis elegans, GI71994573, Length=315, Percent_Identity=25.7142857142857, Blast_Score=76, Evalue=8e-14,
Organism=Caenorhabditis elegans, GI25146607, Length=232, Percent_Identity=27.5862068965517, Blast_Score=71, Evalue=2e-12,
Organism=Caenorhabditis elegans, GI193203483, Length=311, Percent_Identity=25.7234726688103, Blast_Score=71, Evalue=3e-12,
Organism=Drosophila melanogaster, GI24583124, Length=282, Percent_Identity=30.4964539007092, Blast_Score=115, Evalue=2e-25,
Organism=Drosophila melanogaster, GI20129775, Length=306, Percent_Identity=32.6797385620915, Blast_Score=113, Evalue=6e-25,
Organism=Drosophila melanogaster, GI24586446, Length=306, Percent_Identity=32.6797385620915, Blast_Score=113, Evalue=6e-25,
Organism=Drosophila melanogaster, GI85726410, Length=294, Percent_Identity=31.6326530612245, Blast_Score=113, Evalue=8e-25,
Organism=Drosophila melanogaster, GI20129321, Length=346, Percent_Identity=29.4797687861272, Blast_Score=107, Evalue=3e-23,
Organism=Drosophila melanogaster, GI45550795, Length=238, Percent_Identity=33.6134453781513, Blast_Score=105, Evalue=1e-22,
Organism=Drosophila melanogaster, GI24583126, Length=311, Percent_Identity=30.2250803858521, Blast_Score=104, Evalue=4e-22,
Organism=Drosophila melanogaster, GI21357369, Length=376, Percent_Identity=28.9893617021277, Blast_Score=99, Evalue=1e-20,
Organism=Drosophila melanogaster, GI28574958, Length=344, Percent_Identity=29.6511627906977, Blast_Score=98, Evalue=4e-20,
Organism=Drosophila melanogaster, GI24660095, Length=395, Percent_Identity=27.3417721518987, Blast_Score=94, Evalue=4e-19,
Organism=Drosophila melanogaster, GI221330951, Length=356, Percent_Identity=26.9662921348315, Blast_Score=87, Evalue=6e-17,
Organism=Drosophila melanogaster, GI28574373, Length=236, Percent_Identity=30.5084745762712, Blast_Score=86, Evalue=2e-16,
Organism=Drosophila melanogaster, GI19922132, Length=253, Percent_Identity=30.8300395256917, Blast_Score=86, Evalue=2e-16,
Organism=Drosophila melanogaster, GI24639970, Length=317, Percent_Identity=28.391167192429, Blast_Score=83, Evalue=1e-15,
Organism=Drosophila melanogaster, GI21356641, Length=304, Percent_Identity=27.9605263157895, Blast_Score=82, Evalue=3e-15,
Organism=Drosophila melanogaster, GI24639964, Length=317, Percent_Identity=26.4984227129338, Blast_Score=74, Evalue=5e-13,
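
The homologue lines above follow a regular "key=value" layout, so they can be read mechanically. The following sketch, with a hypothetical helper `parse_hit`, turns one line into a dictionary and rounds the percent identity to one decimal place; the two sample lines are copied from the list above.

```python
def parse_hit(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' homologue line."""
    raw = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            raw[key] = value
        elif field.startswith("GI"):
            raw["GI"] = field[2:]      # bare GI number has no '=' sign
    return {
        "organism": raw["Organism"],
        "gi": raw["GI"],
        "length": int(raw["Length"]),
        "identity": round(float(raw["Percent_Identity"]), 1),
        "score": int(raw["Blast_Score"]),
        "evalue": float(raw["Evalue"]),
    }


lines = [
    "Organism=Homo sapiens, GI54607080, Length=259, "
    "Percent_Identity=30.5019305019305, Blast_Score=99, Evalue=3e-20,",
    "Organism=Caenorhabditis elegans, GI212645216, Length=298, "
    "Percent_Identity=30.2013422818792, Blast_Score=115, Evalue=2e-25,",
]
hits = [parse_hit(line) for line in lines]
best = min(hits, key=lambda h: h["evalue"])   # lowest E-value first
print(best["organism"], best["gi"], best["identity"])
```

Sorting or filtering on the parsed `evalue` and `score` fields is then straightforward, for example to keep only the strongest hit per organism.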

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro: IPR000834 [H]

Pfam domain/function: PF00246 Peptidase_M14 [H]

EC number: 3.4.17.18 [H]

Molecular weight: Translated: 112685; Mature: 112685

Theoretical pI: Translated: 4.45; Mature: 4.45

Prosite motif: PS00133 CARBOXYPEPT_ZN_2; PS00178 AA_TRNA_LIGASE_I

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
0.7 %Met     (Translated Protein)
1.2 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
0.7 %Met     (Mature Protein)
1.2 %Cys+Met (Mature Protein)
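
The Cys and Met percentages above do not add up exactly to the combined figure (0.6 + 0.7 versus 1.2), which is what one would expect if each value is rounded independently from residue counts. The sketch below shows the calculation; the counts of 6 Cys and 7 Met are illustrative assumptions chosen to be consistent with the listed percentages, not values stated in the record, and the function names are hypothetical.

```python
def percent(count: int, total: int) -> float:
    """Share of `count` in `total`, as a percentage with one decimal."""
    return round(100.0 * count / total, 1)


def cys_met_content(protein: str) -> tuple:
    """Return (%Cys, %Met, %Cys+Met) for a one-letter protein sequence."""
    n = len(protein)
    cys = protein.count("C")
    met = protein.count("M")
    return percent(cys, n), percent(met, n), percent(cys + met, n)


# Toy sequence, only to show the call.
print(cys_met_content("MACDEFGHIK"))

# Illustrative counts (assumed, not from the record) for a 1061-residue protein.
N_RESIDUES = 1061
print(percent(6, N_RESIDUES), percent(7, N_RESIDUES), percent(6 + 7, N_RESIDUES))
```

With those assumed counts each figure is rounded on its own, which is why the combined value can differ from the sum of the two rounded parts.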

Secondary structure:

>Translated Secondary Structure
MDRLRHRWSLIGTILALIGLWSSLVVISLPQRTQAQPVVEEQRIVARIEAKDRADSLALS
CCHHHHHHHHHHHHHHHHHHHHCEEEEECCCCCCCCCCHHHHHEEEEEECCCCCCCEEEE
ARGLDLLEMRDKHDLFALITPSELAKLQQEGFVAEIDQEQTRLLQEPSIMPVQGGFRTVE
CCCCHHEEECCCCCEEEEECHHHHHHHHHCCCEEECCHHHHHHHCCCCEEEECCCCHHHH
EGYALLDQWHATYPNLTDLFTYGTSWDKVTAGGPAGYDLRGITLTNSLIPGPKPTFFLMS
HHHHHHHHHHCCCCCHHHHEECCCCCCEEECCCCCCCEEEEEEEECCCCCCCCCHHHHHH
AIHAREMSTAELTLRYTEYLLSRYETDPDVHWLLDEHTIVIVPFVNPDGRKIAEQSLSQR
HHHHHCCCCCEEEHHHHHHHHHHHCCCCCEEEEECCCEEEEEEEECCCCHHHHHHHHHHH
KNRNTVDTSSCSGVNIGIDLNRNSSFHWGEVDSPNGDRCGATWPGVSAASEPEVATLQQW
HCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHH
IRGVFADQRGPSDTDPAPDTTTGVYISIHSYSDLVLWPYGHSAQLAPNDADLRGLGKKFA
HHHHHCCCCCCCCCCCCCCCCCEEEEEEECCCCEEEEECCCCCCCCCCCCHHHHHHHHHH
SYNGYTPQKSDELYPTSGTTDDWAYGELGVAAYTFEIGPESGTCSGFFPAFTCLDGQAPG
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCEEEEECCCCCC
NFWGRNLPAFLYASKVARTPYLLQRGPDALNVTAQSMSNGYKLLATINDVTNGNQTIAAA
CCCCCCCCHHHHHHHHHCCCHHHHCCCCCEEEEHHHHCCCEEEEEEEECCCCCCEEEEEE
EAYVDTPPWRAGATAISLSATDGSFNSTQEAVNATIPQTLNAGRHLVYFRGRDAAGNWGP
EEECCCCCCCCCCEEEEEEECCCCCCCHHHHHHCCCCHHHCCCCEEEEEECCCCCCCCCC
VSAQWLDVAPQGLVGFVRASDNNQPIANATVVATTGTFTSTTTSGADGSYRLELPVGSYT
CCCEEEECCCCCEEEEEEECCCCCCCCCEEEEEECCCEEECCCCCCCCEEEEEEECCCEE
LKASGTGLTPASYNLTVSSNSFTTQDISLAQLAVLTTSPSPLTFNVASGSQDRTLVVGNA
EEECCCCCCCCEEEEEECCCCCEECCCCEEEEEEEECCCCCEEEEECCCCCCCEEEEECC
GGTSLNAAISLAPTGYEVKSSDDAGGPSYTWNDISSTGTRLSLGDDTCSVVNLPSSFNYY
CCCCEEEEEEECCCCEEEECCCCCCCCCEECCCCCCCCCEEEECCCCEEEEECCCCCCCC
GTAYSKLIVNSNGFVSPTNATTCSSTGTSTNGVVPSTSTPNNVIAALWDDLDPEGLTGTN
CEEEEEEEECCCCEECCCCCCEECCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCC
GVFTYNDSANNQFIVEFSGVPHWANNGNFSPEDFQFVLNLTTGDVTLNYQNIDTQNSVSV
CEEEECCCCCCEEEEEECCCCCCCCCCCCCCCCEEEEEEEEECEEEEEEECCCCCCCEEE
GIEDSTGANGYQWVYNSTGRLHDNLAIQFAAYAGSAPWLSWTPSNIDVAARGSTNVQVTA
EECCCCCCCCEEEEECCCCCEECCEEEEEEEECCCCCCEEECCCCEEEEECCCCEEEEEE
NATGLANGTYRTRLRVNAGANTINGDQTVPVVLNVGSSTVHDVAVSAPQAALSGFVGSTI
CCCCCCCCEEEEEEEEECCCCCCCCCCEEEEEEECCCCCHHHHHHCCCHHHHHHCCCCEE
TYTLSVTNTGNVSDSFNLSLSGNVWPTTLSQTSVNLAAGASTTIQVSVAIPANAAANSTD
EEEEEECCCCCCCCCEEEEECCCCCCCCCCCCEEEEECCCCEEEEEEEEECCCCCCCCCC
SVTITATSAADSSATNSISLVSTANSIPVSQYKVFMPYIVK
EEEEEEECCCCCCCCCCEEEEECCCCCCHHHEEEEEEEECC
>Mature Secondary Structure
MDRLRHRWSLIGTILALIGLWSSLVVISLPQRTQAQPVVEEQRIVARIEAKDRADSLALS
CCHHHHHHHHHHHHHHHHHHHHCEEEEECCCCCCCCCCHHHHHEEEEEECCCCCCCEEEE
ARGLDLLEMRDKHDLFALITPSELAKLQQEGFVAEIDQEQTRLLQEPSIMPVQGGFRTVE
CCCCHHEEECCCCCEEEEECHHHHHHHHHCCCEEECCHHHHHHHCCCCEEEECCCCHHHH
EGYALLDQWHATYPNLTDLFTYGTSWDKVTAGGPAGYDLRGITLTNSLIPGPKPTFFLMS
HHHHHHHHHHCCCCCHHHHEECCCCCCEEECCCCCCCEEEEEEEECCCCCCCCCHHHHHH
AIHAREMSTAELTLRYTEYLLSRYETDPDVHWLLDEHTIVIVPFVNPDGRKIAEQSLSQR
HHHHHCCCCCEEEHHHHHHHHHHHCCCCCEEEEECCCEEEEEEEECCCCHHHHHHHHHHH
KNRNTVDTSSCSGVNIGIDLNRNSSFHWGEVDSPNGDRCGATWPGVSAASEPEVATLQQW
HCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHH
IRGVFADQRGPSDTDPAPDTTTGVYISIHSYSDLVLWPYGHSAQLAPNDADLRGLGKKFA
HHHHHCCCCCCCCCCCCCCCCCEEEEEEECCCCEEEEECCCCCCCCCCCCHHHHHHHHHH
SYNGYTPQKSDELYPTSGTTDDWAYGELGVAAYTFEIGPESGTCSGFFPAFTCLDGQAPG
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCEEEEECCCCCC
NFWGRNLPAFLYASKVARTPYLLQRGPDALNVTAQSMSNGYKLLATINDVTNGNQTIAAA
CCCCCCCCHHHHHHHHHCCCHHHHCCCCCEEEEHHHHCCCEEEEEEEECCCCCCEEEEEE
EAYVDTPPWRAGATAISLSATDGSFNSTQEAVNATIPQTLNAGRHLVYFRGRDAAGNWGP
EEECCCCCCCCCCEEEEEEECCCCCCCHHHHHHCCCCHHHCCCCEEEEEECCCCCCCCCC
VSAQWLDVAPQGLVGFVRASDNNQPIANATVVATTGTFTSTTTSGADGSYRLELPVGSYT
CCCEEEECCCCCEEEEEEECCCCCCCCCEEEEEECCCEEECCCCCCCCEEEEEEECCCEE
LKASGTGLTPASYNLTVSSNSFTTQDISLAQLAVLTTSPSPLTFNVASGSQDRTLVVGNA
EEECCCCCCCCEEEEEECCCCCEECCCCEEEEEEEECCCCCEEEEECCCCCCCEEEEECC
GGTSLNAAISLAPTGYEVKSSDDAGGPSYTWNDISSTGTRLSLGDDTCSVVNLPSSFNYY
CCCCEEEEEEECCCCEEEECCCCCCCCCEECCCCCCCCCEEEECCCCEEEEECCCCCCCC
GTAYSKLIVNSNGFVSPTNATTCSSTGTSTNGVVPSTSTPNNVIAALWDDLDPEGLTGTN
CEEEEEEEECCCCEECCCCCCEECCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCC
GVFTYNDSANNQFIVEFSGVPHWANNGNFSPEDFQFVLNLTTGDVTLNYQNIDTQNSVSV
CEEEECCCCCCEEEEEECCCCCCCCCCCCCCCCEEEEEEEEECEEEEEEECCCCCCCEEE
GIEDSTGANGYQWVYNSTGRLHDNLAIQFAAYAGSAPWLSWTPSNIDVAARGSTNVQVTA
EECCCCCCCCEEEEECCCCCEECCEEEEEEEECCCCCCEEECCCCEEEEECCCCEEEEEE
NATGLANGTYRTRLRVNAGANTINGDQTVPVVLNVGSSTVHDVAVSAPQAALSGFVGSTI
CCCCCCCCEEEEEEEEECCCCCCCCCCEEEEEEECCCCCHHHHHHCCCHHHHHHCCCCEE
TYTLSVTNTGNVSDSFNLSLSGNVWPTTLSQTSVNLAAGASTTIQVSVAIPANAAANSTD
EEEEEECCCCCCCCCEEEEECCCCCCCCCCCCEEEEECCCCEEEEEEEEECCCCCCCCCC
SVTITATSAADSSATNSISLVSTANSIPVSQYKVFMPYIVK
EEEEEEECCCCCCCCCCEEEEECCCCCCHHHEEEEEEEECC
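
The secondary structure blocks above pair each 60-residue stretch of sequence with a line of per-residue state letters. A minimal sketch for summarising such a line follows; it assumes the common convention that H means helix, E means strand and C means coil (the record does not define the letters), and `state_fractions` is an illustrative name. The sample string is the final state line listed above.

```python
from collections import Counter


def state_fractions(states: str) -> dict:
    """Fraction of residues in each predicted state (H, E, C)."""
    counts = Counter(states)
    total = len(states)
    return {s: counts.get(s, 0) / total for s in "HEC"}


# Final state line of the secondary structure listing above.
last_line = "EEEEEEECCCCCCCCCCEEEEECCCCCCHHHEEEEEEEECC"

fractions = state_fractions(last_line)
for state, frac in fractions.items():
    print(state, f"{100 * frac:.1f}%")
```

Concatenating all state lines before calling the function would give the composition for the whole protein rather than for one stretch.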

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 1936254; 6424730; 1521526 [H]