Definition: Bacillus thuringiensis str. Al Hakam chromosome, complete genome.
Accession: NC_008600
Length: 5,257,091 bp

The map label for this gene is vpr [H]

Identifier: 118476320

GI number: 118476320

Start: 669123

End: 673469

Strand: Direct

Name: vpr [H]

Synonym: BALH_0575

Alternate gene names: 118476320

Gene position: 669123-673469 (Clockwise)

Preceding gene: 118476319

Following gene: 118476321

Centisome position: 12.73
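
The centisome value above is consistent with the gene start expressed as a percentage of the chromosome length. A minimal Python sketch, assuming that definition (the function name `centisome` is illustrative and not part of the database):

```python
def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of genome length, rounded to 2 decimals.

    Assumption: the position is measured from the gene start,
    not from the gene midpoint.
    """
    return round(100.0 * start / genome_length, 2)


# Values taken from this record: Start 669123, Length 5,257,091.
position = centisome(669123, 5257091)
print(position)
```

Using the gene midpoint instead of the start would give a slightly larger value, so the start-based reading is the one that matches this record.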

GC content: 36.46

Gene sequence:

>4347_bases
ATGATACAAAATAAAAGTTTAAGATTAATAGAATTTTCAATACTAAATGGGAATTTAAGAATCAATTTGTTTAGAAAAAT
AATTTTAGTAATGGGGAGATGCAATATGAAACGAGGGAAATTCGGAAGAATACTTATCGGAACGCTAACCGTTGGTATGT
TAATGTCTCAAGGAATTCCATATAACGTATTGGCAGAAGAAGTAAATACATCTACTTTAACTGGAATTGATGATGCAAGT
TCTATATTGAAAGGACTTACAAAGGAACAACGTAATGCTTTAAAAACATTAGATACAAAACCTGGTTTTGTTATTTCACC
AGGCATTAATACAGCAAGTCCTGATAATGTGAATGTGATTGTAGAATTTAAGCAAGCACCTAGCAAAATTGAGATGTTAA
AACAAGCAGCTAAAGGGAAAAAAATAGCTCTTTCAACTGCAGAACAAAAGGTGGAAGCATCCCATAAAGGATTTAAAACT
GAGCTTGAACAGCTTCAAAAAAAGAAAGATAAGGGGCCGGATTTTAAATCTGCCAAAATAACAAGAGAATATAAGAATGC
TTTTAATGGTGTGGCAATGTCGTTACCTGCGAATATGATCGAAGACTTAGTTCGCACTGGTATTGTTAAGCGTGTATGGG
AAGATCAAGAGGTTAAAATTGATTTACCAAAAGAGACAGCTAAGACTGCTGTTGAACCAAAAATGGCAGATAGTGTACCG
CAAATTGGTGTGGACAAGCTACACGATGAAAAAATAACAGGTAAAGGAATTAAGGTGGGGGTACTGGATACAGGTATTGA
TTATAACCACCCTGATTTAAAAGATGCATATAAAGGATATCGTGCAAAGCAAGGTGAAGATCCAAGCAAAATTGATCCAA
ACTCTATAAAGGGATGGGACTTTGTTAATAATGATGCTGATCCGATGGAGACAACGTATAAGGATTGGCAAAATTCTGGA
GGATATCCTGAGATTTATGATGGAAGTGCATATTATACATCCCATGGAACCCATGTAGCTGGGACAATTGCTGGAGATAA
ACAGAATAGTGTGGATTATGCAGTTAAGGGCGTTGCTCCAGATGTAGATTTATATTCATATCGTGTATTAGGTCCATATG
GAAGTGGACAAACAAGCGGTATTCTTGCTGCGATTGATAAAGCAGTGAAGGACGATATGGATGTTATCAATTTATCACTA
GGTGCGTCTATTAATGATCCTTTATATCCTACTTCTATCGCAGTGAACAACGCGATGTTAGCTGGTGTTGTTACAGTAGT
AGCAGCAGGTAATAGTGGTCCGGGAGAAGGTACCCTTGGATCACCTAGTGCAGCGGCACTTCCCATTACAGTGGGAGCAA
GTGACGCTGCGATGACTATACCAACGTTTTCCGCTGATGCAGGTGATTTACATGTGGATAAAATGATGCTACTTGGAAAA
AGCTTTACTGATAAGATTGAAGATTTGAAAGGTCAATCCTTATCGGTTGTATATGCAGGGCTTGGGAAATCAGGTGATTT
TACAGGGAAAGATGTAAAAGGGAAGTTAGCTCTTATCCAACGCGGTGAGATTACATTTGATGAAAAAATTAAAAATGCTA
AGGAAGCAGGCGCAAAGGCGGTAATTGTATACAACAATGTAGATGGGGAAATTACAAGTTATCTTGGGGAAAGTACTTCA
TCTATTCCATCGTTCCGCTTAACAAAAGTAGATGGTGAGAACTTGCAAGCAAAAGCTGTACAAGGAGATGTGTCATTAGC
GTTTGGGGAACTTAGTAATATAAAAACAGAGGGAGATCACTTAGCTGATTTCAGCTCCCGTGGTCCTGCAACGAAAACAG
ATGATATTAAGCCAGATATTGTAGCACCAGGTGTGTCTATCTTCTCAACTGTTCCTGAATATATTAATGATCCAAAGGAT
GGAGAAAATTATCCGGTAGCGTATGGACGTATGTCAGGTACATCTATGGCAACTCCTCATACAGCGGGGGTGGCAGCACT
CATTTTGCAAGAACATCCAAACTATAGCCCGTTTGAAGTAAAAGAAGCGCTTATGAATACTGCAGTTGACTTAAAAGAAG
CACGCTCTGTATTTGAGGTGGGATCGGGTCGAATTGATGCGTATCGTGCAGTTCATGCAGATACAGCTATCGAGGTTATC
GATAAAACATCAAACATTGTAAATGATGAAGAAGTAGATATTGAAGAAAAAACAGGTTCTATTGCATTCGGATATAAAAA
TCAATTGGGAAATGGACCAATTAAAGATAGTCGAAAAATCTTAATTAAGAACAGCAATAAAACAGACGAGAAAGAGTTTA
AATTAGAAGTAGAGTTTTCACCAACAAGTGTAGGTGTGCAAGATGCAGTGAAGAATGGTGTGAAGCTAAACGTACAAGAT
TCTATTAAAGTAGCTCCAGGTACATCAGGGGAAATTAGCCCTGAAATTATCATTCCAGAAAACGCCGAATTTGGTAGATA
TGAGGGATATATTCATATTTCAAATAAAAAAAATGAAAAAGAAGTATATCAAGTGCCATTTGCAGTTAAATTCACAGAAA
AAGGAATTGAATCTGTAGATTTGCTGAGAGATGCAATGGCAACAGATACATCTAACTTCCATCCATTCATGGAAAGGCCG
AGTTCACCGCTGACATTTAAATTGAACAGTCCACTTGAAACTATCGATGCAGTGGTGAAGGATCGAAAAACAGGAAAGGC
ATTAGGAATTGTGGGGACAATTAATGCTAGCAGTCTAACACCAAATATTGAATATATCATGTTCGATGGTATGGGTGGCT
ATGTATTCCCGTTTACAGGAGATCCAAATCATCCAATTGGAGACAAGCGAGTTACGTTGCCTGATGGAGATTACCAACTG
GATTTCATTGGATATGACAAAGAAGGGAAACCTTATACAAAAGGAGATAGCGTAATTATTGATAATATCAAACCTGAAAT
GAAATTTACCGATGTAAAACCTGGCGTTCATGAGGTAAATGAATCTATGTTCAAAGAAGAAGATGGTCAGCGTGCATTAT
GGGTACATGGTAACATTTATGATTCTACGATAGATGTATTAAACGCAAAAGGCCTTCAGTATGACCAAAAAACTAATGAA
ATTGTATATTACCAAAACTCTGCCTTTCCGTCAGGTTGGTTGAATACAATTCAAGCCAATGGTAATTTTAAATTTGGTGT
ACTTCCAGAGGAAATCAATGAACCGCTTAATTTAAGGTTATTTGGATATGATCTTGCAACTGCATCAAATATGGCAAATG
GATTTAAAGATTATGTCTTTGTTAAAGAGGGAACGGAATACGCTGTTCCAAGCTATGACAAGGATAAAGTTAAACTAGGA
GAAAAAATTACATTAACTTTAAATCTTAATAATGTAAAACAACTTATGTCAGGTACATTTGAGATTCCTTATAGTAAACA
GTTGTTTAAATTTGTAGATGTTAAACCAAATCCAGCACTTGCAGAATATGCAAAACAACATGGATTAAATATTAAATTAG
AAGATCCAGTGATAAATGAAGAAGGAAACTGGGAAAACAAGGTGAAAGTTGGTGCATCTTTAGAAGGAACAGAATTTAAA
GGTCTAGATGGAGACACACCATTTGTAGATGTTACATTCGAGACGACGAGTGACGAGTATTTTAATAATTTAACAGCATT
TGGAGTAGATAAATTCTCTTATACAAAAACAGGTGCATCAGAAGGCGTTGAGATTCCTGTGTTTAAAGATAAATCCTTTT
CTATTATTTCGAAACATGCAATGGTTACAGGATATATTGGACCAGAAGCTTTCTTAACTGAAGAGGGGTATTTAGGTAAG
AATGACTACACAAAACTAGGAGCAAAAGTGTATGCAGTTGGTAAGGATGGAAAGAAATATACAGGAACGGTTGATGATAA
TGGGCAATTTGAAATTCATAGTGTTCCGGTAAGCGATACAGAATATAATATTTTCGTAGAAATGCCAGGTCATTTAAATA
GTAAATTAACTACAAAAATAGGAAAAATGCAGGATGGAGAATTAGTGGGACAAAACTTTAGAGCATATATGGATGACAGC
CTTGCAGGTGATGTAAACGGCGATAAAATGGTAGATATTCAAGATGCTAGAATAGCAGCTTTATCGTATGGAAAAGGAAA
AGTATTTGTAAAAGATGGAGATATAAATCAAGATGGTGTTGTAGATGAAACAGATATTCGTTTTATTGAGAAAAACTTCT
TGAAAAAGGGGCCAGATGCTAAAGGAAATCAGAAACCGAAAGAAAATGTAGGGCCAGTGACATTAGATAAAATCTTACGC
TCTATTGGCTTAGAACCCAAAAAATAA

Upstream 100 bases:

>100_bases
TGAAAATCTTTTAGTGAAAATAAATTATGGGGGGCTTCTATAGAAAGATTAAAGTGAATGATTAGCTATATGGGGATGGT
AGTAGTGTTGGAAATTCTTA

Downstream 100 bases:

>100_bases
GTGGACAAACTTCTAGAGTAAATGAAATGAGATTAGCTAACAAATGACCGGTAATGATTCACGGTAATATTACCGGTCAT
TTTAAAAGTAATAGTCGTTT

Product: peptidase Vpr

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 1448; Mature: 1448
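
The residue count can be cross-checked against the gene coordinates. The sketch below assumes inclusive 1-based coordinates and a complete open reading frame ending in a single stop codon (the gene sequence above ends in TAA); the helper name `orf_lengths` is hypothetical.

```python
def orf_lengths(start: int, end: int) -> tuple:
    """Return (nucleotide length, amino-acid count) for an ORF.

    Assumptions: coordinates are 1-based and inclusive, and the span
    ends with exactly one stop codon that is not translated.
    """
    n_bases = end - start + 1
    if n_bases % 3 != 0:
        raise ValueError("span is not a whole number of codons")
    return n_bases, n_bases // 3 - 1


# Coordinates taken from this record: 669123-673469.
n_bases, n_residues = orf_lengths(669123, 673469)
print(n_bases, n_residues)
```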

Protein sequence:

>1448_residues
MIQNKSLRLIEFSILNGNLRINLFRKIILVMGRCNMKRGKFGRILIGTLTVGMLMSQGIPYNVLAEEVNTSTLTGIDDAS
SILKGLTKEQRNALKTLDTKPGFVISPGINTASPDNVNVIVEFKQAPSKIEMLKQAAKGKKIALSTAEQKVEASHKGFKT
ELEQLQKKKDKGPDFKSAKITREYKNAFNGVAMSLPANMIEDLVRTGIVKRVWEDQEVKIDLPKETAKTAVEPKMADSVP
QIGVDKLHDEKITGKGIKVGVLDTGIDYNHPDLKDAYKGYRAKQGEDPSKIDPNSIKGWDFVNNDADPMETTYKDWQNSG
GYPEIYDGSAYYTSHGTHVAGTIAGDKQNSVDYAVKGVAPDVDLYSYRVLGPYGSGQTSGILAAIDKAVKDDMDVINLSL
GASINDPLYPTSIAVNNAMLAGVVTVVAAGNSGPGEGTLGSPSAAALPITVGASDAAMTIPTFSADAGDLHVDKMMLLGK
SFTDKIEDLKGQSLSVVYAGLGKSGDFTGKDVKGKLALIQRGEITFDEKIKNAKEAGAKAVIVYNNVDGEITSYLGESTS
SIPSFRLTKVDGENLQAKAVQGDVSLAFGELSNIKTEGDHLADFSSRGPATKTDDIKPDIVAPGVSIFSTVPEYINDPKD
GENYPVAYGRMSGTSMATPHTAGVAALILQEHPNYSPFEVKEALMNTAVDLKEARSVFEVGSGRIDAYRAVHADTAIEVI
DKTSNIVNDEEVDIEEKTGSIAFGYKNQLGNGPIKDSRKILIKNSNKTDEKEFKLEVEFSPTSVGVQDAVKNGVKLNVQD
SIKVAPGTSGEISPEIIIPENAEFGRYEGYIHISNKKNEKEVYQVPFAVKFTEKGIESVDLLRDAMATDTSNFHPFMERP
SSPLTFKLNSPLETIDAVVKDRKTGKALGIVGTINASSLTPNIEYIMFDGMGGYVFPFTGDPNHPIGDKRVTLPDGDYQL
DFIGYDKEGKPYTKGDSVIIDNIKPEMKFTDVKPGVHEVNESMFKEEDGQRALWVHGNIYDSTIDVLNAKGLQYDQKTNE
IVYYQNSAFPSGWLNTIQANGNFKFGVLPEEINEPLNLRLFGYDLATASNMANGFKDYVFVKEGTEYAVPSYDKDKVKLG
EKITLTLNLNNVKQLMSGTFEIPYSKQLFKFVDVKPNPALAEYAKQHGLNIKLEDPVINEEGNWENKVKVGASLEGTEFK
GLDGDTPFVDVTFETTSDEYFNNLTAFGVDKFSYTKTGASEGVEIPVFKDKSFSIISKHAMVTGYIGPEAFLTEEGYLGK
NDYTKLGAKVYAVGKDGKKYTGTVDDNGQFEIHSVPVSDTEYNIFVEMPGHLNSKLTTKIGKMQDGELVGQNFRAYMDDS
LAGDVNGDKMVDIQDARIAALSYGKGKVFVKDGDINQDGVVDETDIRFIEKNFLKKGPDAKGNQKPKENVGPVTLDKILR
SIGLEPKK

Sequences:

>Translated_1448_residues
MIQNKSLRLIEFSILNGNLRINLFRKIILVMGRCNMKRGKFGRILIGTLTVGMLMSQGIPYNVLAEEVNTSTLTGIDDAS
SILKGLTKEQRNALKTLDTKPGFVISPGINTASPDNVNVIVEFKQAPSKIEMLKQAAKGKKIALSTAEQKVEASHKGFKT
ELEQLQKKKDKGPDFKSAKITREYKNAFNGVAMSLPANMIEDLVRTGIVKRVWEDQEVKIDLPKETAKTAVEPKMADSVP
QIGVDKLHDEKITGKGIKVGVLDTGIDYNHPDLKDAYKGYRAKQGEDPSKIDPNSIKGWDFVNNDADPMETTYKDWQNSG
GYPEIYDGSAYYTSHGTHVAGTIAGDKQNSVDYAVKGVAPDVDLYSYRVLGPYGSGQTSGILAAIDKAVKDDMDVINLSL
GASINDPLYPTSIAVNNAMLAGVVTVVAAGNSGPGEGTLGSPSAAALPITVGASDAAMTIPTFSADAGDLHVDKMMLLGK
SFTDKIEDLKGQSLSVVYAGLGKSGDFTGKDVKGKLALIQRGEITFDEKIKNAKEAGAKAVIVYNNVDGEITSYLGESTS
SIPSFRLTKVDGENLQAKAVQGDVSLAFGELSNIKTEGDHLADFSSRGPATKTDDIKPDIVAPGVSIFSTVPEYINDPKD
GENYPVAYGRMSGTSMATPHTAGVAALILQEHPNYSPFEVKEALMNTAVDLKEARSVFEVGSGRIDAYRAVHADTAIEVI
DKTSNIVNDEEVDIEEKTGSIAFGYKNQLGNGPIKDSRKILIKNSNKTDEKEFKLEVEFSPTSVGVQDAVKNGVKLNVQD
SIKVAPGTSGEISPEIIIPENAEFGRYEGYIHISNKKNEKEVYQVPFAVKFTEKGIESVDLLRDAMATDTSNFHPFMERP
SSPLTFKLNSPLETIDAVVKDRKTGKALGIVGTINASSLTPNIEYIMFDGMGGYVFPFTGDPNHPIGDKRVTLPDGDYQL
DFIGYDKEGKPYTKGDSVIIDNIKPEMKFTDVKPGVHEVNESMFKEEDGQRALWVHGNIYDSTIDVLNAKGLQYDQKTNE
IVYYQNSAFPSGWLNTIQANGNFKFGVLPEEINEPLNLRLFGYDLATASNMANGFKDYVFVKEGTEYAVPSYDKDKVKLG
EKITLTLNLNNVKQLMSGTFEIPYSKQLFKFVDVKPNPALAEYAKQHGLNIKLEDPVINEEGNWENKVKVGASLEGTEFK
GLDGDTPFVDVTFETTSDEYFNNLTAFGVDKFSYTKTGASEGVEIPVFKDKSFSIISKHAMVTGYIGPEAFLTEEGYLGK
NDYTKLGAKVYAVGKDGKKYTGTVDDNGQFEIHSVPVSDTEYNIFVEMPGHLNSKLTTKIGKMQDGELVGQNFRAYMDDS
LAGDVNGDKMVDIQDARIAALSYGKGKVFVKDGDINQDGVVDETDIRFIEKNFLKKGPDAKGNQKPKENVGPVTLDKILR
SIGLEPKK
>Mature_1448_residues
MIQNKSLRLIEFSILNGNLRINLFRKIILVMGRCNMKRGKFGRILIGTLTVGMLMSQGIPYNVLAEEVNTSTLTGIDDAS
SILKGLTKEQRNALKTLDTKPGFVISPGINTASPDNVNVIVEFKQAPSKIEMLKQAAKGKKIALSTAEQKVEASHKGFKT
ELEQLQKKKDKGPDFKSAKITREYKNAFNGVAMSLPANMIEDLVRTGIVKRVWEDQEVKIDLPKETAKTAVEPKMADSVP
QIGVDKLHDEKITGKGIKVGVLDTGIDYNHPDLKDAYKGYRAKQGEDPSKIDPNSIKGWDFVNNDADPMETTYKDWQNSG
GYPEIYDGSAYYTSHGTHVAGTIAGDKQNSVDYAVKGVAPDVDLYSYRVLGPYGSGQTSGILAAIDKAVKDDMDVINLSL
GASINDPLYPTSIAVNNAMLAGVVTVVAAGNSGPGEGTLGSPSAAALPITVGASDAAMTIPTFSADAGDLHVDKMMLLGK
SFTDKIEDLKGQSLSVVYAGLGKSGDFTGKDVKGKLALIQRGEITFDEKIKNAKEAGAKAVIVYNNVDGEITSYLGESTS
SIPSFRLTKVDGENLQAKAVQGDVSLAFGELSNIKTEGDHLADFSSRGPATKTDDIKPDIVAPGVSIFSTVPEYINDPKD
GENYPVAYGRMSGTSMATPHTAGVAALILQEHPNYSPFEVKEALMNTAVDLKEARSVFEVGSGRIDAYRAVHADTAIEVI
DKTSNIVNDEEVDIEEKTGSIAFGYKNQLGNGPIKDSRKILIKNSNKTDEKEFKLEVEFSPTSVGVQDAVKNGVKLNVQD
SIKVAPGTSGEISPEIIIPENAEFGRYEGYIHISNKKNEKEVYQVPFAVKFTEKGIESVDLLRDAMATDTSNFHPFMERP
SSPLTFKLNSPLETIDAVVKDRKTGKALGIVGTINASSLTPNIEYIMFDGMGGYVFPFTGDPNHPIGDKRVTLPDGDYQL
DFIGYDKEGKPYTKGDSVIIDNIKPEMKFTDVKPGVHEVNESMFKEEDGQRALWVHGNIYDSTIDVLNAKGLQYDQKTNE
IVYYQNSAFPSGWLNTIQANGNFKFGVLPEEINEPLNLRLFGYDLATASNMANGFKDYVFVKEGTEYAVPSYDKDKVKLG
EKITLTLNLNNVKQLMSGTFEIPYSKQLFKFVDVKPNPALAEYAKQHGLNIKLEDPVINEEGNWENKVKVGASLEGTEFK
GLDGDTPFVDVTFETTSDEYFNNLTAFGVDKFSYTKTGASEGVEIPVFKDKSFSIISKHAMVTGYIGPEAFLTEEGYLGK
NDYTKLGAKVYAVGKDGKKYTGTVDDNGQFEIHSVPVSDTEYNIFVEMPGHLNSKLTTKIGKMQDGELVGQNFRAYMDDS
LAGDVNGDKMVDIQDARIAALSYGKGKVFVKDGDINQDGVVDETDIRFIEKNFLKKGPDAKGNQKPKENVGPVTLDKILR
SIGLEPKK

Specific function: Not required for growth or sporulation [H]

COG id: COG1404

COG function: function code O; Subtilisin-like serine proteases

Gene ontology:

Cell location: Secreted [H]

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the peptidase S8 family [H]

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR000209
- InterPro:   IPR022398
- InterPro:   IPR015500
- InterPro:   IPR010259
- InterPro:   IPR003137 [H]

Pfam domain/function: PF05922 Inhibitor_I9; PF02225 PA; PF00082 Peptidase_S8 [H]

EC number: NA

Molecular weight: Translated: 157859; Mature: 157859

Theoretical pI: Translated: 5.00; Mature: 5.00

Prosite motif: PS00018 EF_HAND_1; PS00136 SUBTILASE_ASP; PS00137 SUBTILASE_HIS; PS00138 SUBTILASE_SER

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.1 %Cys     (Translated Protein)
2.1 %Met     (Translated Protein)
2.2 %Cys+Met (Translated Protein)
0.1 %Cys     (Mature Protein)
2.1 %Met     (Mature Protein)
2.2 %Cys+Met (Mature Protein)
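
Percentages like these can be derived by counting residues in the protein sequence. The following is a small sketch of that calculation, assuming one-letter amino-acid codes and rounding to one decimal place; `residue_percent` is an illustrative name, and the peptide is a toy example rather than the Vpr sequence.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` holding any letter in `residues`."""
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(1 for aa in protein.upper() if aa in residues)
    return round(100.0 * hits / len(protein), 1)


# Toy peptide for illustration only, not the Vpr sequence.
toy = "MKCAAAAAAA"
cys = residue_percent(toy, "C")
met = residue_percent(toy, "M")
cys_met = residue_percent(toy, "CM")
print(cys, met, cys_met)
```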

Secondary structure:

>Translated Secondary Structure
MIQNKSLRLIEFSILNGNLRINLFRKIILVMGRCNMKRGKFGRILIGTLTVGMLMSQGIP
CCCCCCEEEEEEEEECCCEEEHHHHHHHHHHCCCCCCCCCCCEEEHHHHHHHHHHHCCCC
YNVLAEEVNTSTLTGIDDASSILKGLTKEQRNALKTLDTKPGFVISPGINTASPDNVNVI
HHHHHHHCCCCEECCCCHHHHHHHHHHHHHHHHHHHCCCCCCEEECCCCCCCCCCCEEEE
VEFKQAPSKIEMLKQAAKGKKIALSTAEQKVEASHKGFKTELEQLQKKKDKGPDFKSAKI
EEECCCCHHHHHHHHHHCCCEEEEEHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCCHHH
TREYKNAFNGVAMSLPANMIEDLVRTGIVKRVWEDQEVKIDLPKETAKTAVEPKMADSVP
HHHHHHHHCCEEEECCHHHHHHHHHHHHHHHHCCCCCEEEECCHHHHHHHCCCHHHCCCC
QIGVDKLHDEKITGKGIKVGVLDTGIDYNHPDLKDAYKGYRAKQGEDPSKIDPNSIKGWD
CCCCHHHCCCCCCCCCEEEEEEECCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCC
FVNNDADPMETTYKDWQNSGGYPEIYDGSAYYTSHGTHVAGTIAGDKQNSVDYAVKGVAP
CCCCCCCCHHHHHHHHHCCCCCCEEECCCEEEECCCCEEEEEEECCCCCCCEEEEECCCC
DVDLYSYRVLGPYGSGQTSGILAAIDKAVKDDMDVINLSLGASINDPLYPTSIAVNNAML
CCCEEEEEEECCCCCCCCCHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCEEEECHHHH
AGVVTVVAAGNSGPGEGTLGSPSAAALPITVGASDAAMTIPTFSADAGDLHVDKMMLLGK
EEEEEEEECCCCCCCCCCCCCCCCCEEEEEECCCCCEEEEEECCCCCCCEEHHHHHHHCC
SFTDKIEDLKGQSLSVVYAGLGKSGDFTGKDVKGKLALIQRGEITFDEKIKNAKEAGAKA
HHHHHHHHHCCCEEEEEEEECCCCCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHCCCEE
VIVYNNVDGEITSYLGESTSSIPSFRLTKVDGENLQAKAVQGDVSLAFGELSNIKTEGDH
EEEEECCCCHHHHHHCCCCCCCCCEEEEEECCCCCEEEEEECCEEEEEHHHHCCCCCCCH
LADFSSRGPATKTDDIKPDIVAPGVSIFSTVPEYINDPKDGENYPVAYGRMSGTSMATPH
HHCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCCCC
TAGVAALILQEHPNYSPFEVKEALMNTAVDLKEARSVFEVGSGRIDAYRAVHADTAIEVI
CCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCHHHHHH
DKTSNIVNDEEVDIEEKTGSIAFGYKNQLGNGPIKDSRKILIKNSNKTDEKEFKLEVEFS
HHHCCCCCCCCCCEECCCCCEEEECHHCCCCCCCCCCCEEEEECCCCCCCEEEEEEEEEC
PTSVGVQDAVKNGVKLNVQDSIKVAPGTSGEISPEIIIPENAEFGRYEGYIHISNKKNEK
CCCCCHHHHHHCCCEEEECCCEEECCCCCCCCCCEEEECCCCCCCCEECEEEECCCCCCC
EVYQVPFAVKFTEKGIESVDLLRDAMATDTSNFHPFMERPSSPLTFKLNSPLETIDAVVK
EEEECCEEEEECHHCCHHHHHHHHHHHCCCCCCCHHHHCCCCCEEEEECCCHHHHHHHHH
DRKTGKALGIVGTINASSLTPNIEYIMFDGMGGYVFPFTGDPNHPIGDKRVTLPDGDYQL
CCCCCCEEEEEEECCCCCCCCCEEEEEEECCCCEEEEECCCCCCCCCCCEEECCCCCEEE
DFIGYDKEGKPYTKGDSVIIDNIKPEMKFTDVKPGVHEVNESMFKEEDGQRALWVHGNIY
EEEEECCCCCCCCCCCCEEEECCCCCCEEECCCCCHHHHHHHHHHCCCCCEEEEEECCEE
DSTIDVLNAKGLQYDQKTNEIVYYQNSAFPSGWLNTIQANGNFKFGVLPEEINEPLNLRL
CCHHHHHCCCCCEECCCCCEEEEEECCCCCCCCCEEEEECCCEEEEECHHHHCCCEEEEE
FGYDLATASNMANGFKDYVFVKEGTEYAVPSYDKDKVKLGEKITLTLNLNNVKQLMSGTF
EEEEHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCEECCCEEEEEEEHHHHHHHHCCCC
EIPYSKQLFKFVDVKPNPALAEYAKQHGLNIKLEDPVINEEGNWENKVKVGASLEGTEFK
CCCCHHHHHHEECCCCCCHHHHHHHHCCCEEEECCCEECCCCCCCCEEEECCCCCCCEEC
GLDGDTPFVDVTFETTSDEYFNNLTAFGVDKFSYTKTGASEGVEIPVFKDKSFSIISKHA
CCCCCCCEEEEEEECCCHHHHCCEEEEECCCCEEECCCCCCCCEEEEECCCCCCEEECCE
MVTGYIGPEAFLTEEGYLGKNDYTKLGAKVYAVGKDGKKYTGTVDDNGQFEIHSVPVSDT
EEEEEECCCCEEECCCCCCCCCHHCCCEEEEEECCCCCEEEEECCCCCCEEEEEECCCCC
EYNIFVEMPGHLNSKLTTKIGKMQDGELVGQNFRAYMDDSLAGDVNGDKMVDIQDARIAA
CEEEEEECCCCCCCHHHHHHCCCCCCCHHCCCHHHHHCCCCCCCCCCCEEEEECCCEEEE
LSYGKGKVFVKDGDINQDGVVDETDIRFIEKNFLKKGPDAKGNQKPKENVGPVTLDKILR
EECCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHH
SIGLEPKK
HHCCCCCC
>Mature Secondary Structure
MIQNKSLRLIEFSILNGNLRINLFRKIILVMGRCNMKRGKFGRILIGTLTVGMLMSQGIP
CCCCCCEEEEEEEEECCCEEEHHHHHHHHHHCCCCCCCCCCCEEEHHHHHHHHHHHCCCC
YNVLAEEVNTSTLTGIDDASSILKGLTKEQRNALKTLDTKPGFVISPGINTASPDNVNVI
HHHHHHHCCCCEECCCCHHHHHHHHHHHHHHHHHHHCCCCCCEEECCCCCCCCCCCEEEE
VEFKQAPSKIEMLKQAAKGKKIALSTAEQKVEASHKGFKTELEQLQKKKDKGPDFKSAKI
EEECCCCHHHHHHHHHHCCCEEEEEHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCCHHH
TREYKNAFNGVAMSLPANMIEDLVRTGIVKRVWEDQEVKIDLPKETAKTAVEPKMADSVP
HHHHHHHHCCEEEECCHHHHHHHHHHHHHHHHCCCCCEEEECCHHHHHHHCCCHHHCCCC
QIGVDKLHDEKITGKGIKVGVLDTGIDYNHPDLKDAYKGYRAKQGEDPSKIDPNSIKGWD
CCCCHHHCCCCCCCCCEEEEEEECCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCC
FVNNDADPMETTYKDWQNSGGYPEIYDGSAYYTSHGTHVAGTIAGDKQNSVDYAVKGVAP
CCCCCCCCHHHHHHHHHCCCCCCEEECCCEEEECCCCEEEEEEECCCCCCCEEEEECCCC
DVDLYSYRVLGPYGSGQTSGILAAIDKAVKDDMDVINLSLGASINDPLYPTSIAVNNAML
CCCEEEEEEECCCCCCCCCHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCEEEECHHHH
AGVVTVVAAGNSGPGEGTLGSPSAAALPITVGASDAAMTIPTFSADAGDLHVDKMMLLGK
EEEEEEEECCCCCCCCCCCCCCCCCEEEEEECCCCCEEEEEECCCCCCCEEHHHHHHHCC
SFTDKIEDLKGQSLSVVYAGLGKSGDFTGKDVKGKLALIQRGEITFDEKIKNAKEAGAKA
HHHHHHHHHCCCEEEEEEEECCCCCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHCCCEE
VIVYNNVDGEITSYLGESTSSIPSFRLTKVDGENLQAKAVQGDVSLAFGELSNIKTEGDH
EEEEECCCCHHHHHHCCCCCCCCCEEEEEECCCCCEEEEEECCEEEEEHHHHCCCCCCCH
LADFSSRGPATKTDDIKPDIVAPGVSIFSTVPEYINDPKDGENYPVAYGRMSGTSMATPH
HHCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCCCC
TAGVAALILQEHPNYSPFEVKEALMNTAVDLKEARSVFEVGSGRIDAYRAVHADTAIEVI
CCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCHHHHHH
DKTSNIVNDEEVDIEEKTGSIAFGYKNQLGNGPIKDSRKILIKNSNKTDEKEFKLEVEFS
HHHCCCCCCCCCCEECCCCCEEEECHHCCCCCCCCCCCEEEEECCCCCCCEEEEEEEEEC
PTSVGVQDAVKNGVKLNVQDSIKVAPGTSGEISPEIIIPENAEFGRYEGYIHISNKKNEK
CCCCCHHHHHHCCCEEEECCCEEECCCCCCCCCCEEEECCCCCCCCEECEEEECCCCCCC
EVYQVPFAVKFTEKGIESVDLLRDAMATDTSNFHPFMERPSSPLTFKLNSPLETIDAVVK
EEEECCEEEEECHHCCHHHHHHHHHHHCCCCCCCHHHHCCCCCEEEEECCCHHHHHHHHH
DRKTGKALGIVGTINASSLTPNIEYIMFDGMGGYVFPFTGDPNHPIGDKRVTLPDGDYQL
CCCCCCEEEEEEECCCCCCCCCEEEEEEECCCCEEEEECCCCCCCCCCCEEECCCCCEEE
DFIGYDKEGKPYTKGDSVIIDNIKPEMKFTDVKPGVHEVNESMFKEEDGQRALWVHGNIY
EEEEECCCCCCCCCCCCEEEECCCCCCEEECCCCCHHHHHHHHHHCCCCCEEEEEECCEE
DSTIDVLNAKGLQYDQKTNEIVYYQNSAFPSGWLNTIQANGNFKFGVLPEEINEPLNLRL
CCHHHHHCCCCCEECCCCCEEEEEECCCCCCCCCEEEEECCCEEEEECHHHHCCCEEEEE
FGYDLATASNMANGFKDYVFVKEGTEYAVPSYDKDKVKLGEKITLTLNLNNVKQLMSGTF
EEEEHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCEECCCEEEEEEEHHHHHHHHCCCC
EIPYSKQLFKFVDVKPNPALAEYAKQHGLNIKLEDPVINEEGNWENKVKVGASLEGTEFK
CCCCHHHHHHEECCCCCCHHHHHHHHCCCEEEECCCEECCCCCCCCEEEECCCCCCCEEC
GLDGDTPFVDVTFETTSDEYFNNLTAFGVDKFSYTKTGASEGVEIPVFKDKSFSIISKHA
CCCCCCCEEEEEEECCCHHHHCCEEEEECCCCEEECCCCCCCCEEEEECCCCCCEEECCE
MVTGYIGPEAFLTEEGYLGKNDYTKLGAKVYAVGKDGKKYTGTVDDNGQFEIHSVPVSDT
EEEEEECCCCEEECCCCCCCCCHHCCCEEEEEECCCCCEEEEECCCCCCEEEEEECCCCC
EYNIFVEMPGHLNSKLTTKIGKMQDGELVGQNFRAYMDDSLAGDVNGDKMVDIQDARIAA
CEEEEEECCCCCCCHHHHHHCCCCCCCHHCCCHHHHHCCCCCCCCCCCEEEEECCCEEEE
LSYGKGKVFVKDGDINQDGVVDETDIRFIEKNFLKKGPDAKGNQKPKENVGPVTLDKILR
EECCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHH
SIGLEPKK
HHCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 1938892; 7934828; 9384377; 10658653 [H]