Definition: Chloroflexus sp. Y-400-fl chromosome, complete genome.
Accession: NC_012032
Length: 5,268,950 bp

The map label for this gene is 222523563

Identifier: 222523563

GI number: 222523563

Start: 323360

End: 327775

Strand: Reverse

Name: 222523563

Synonym: Chy400_0269

Alternate gene names: NA

Gene position: 327775-323360 (Counterclockwise)
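
Because the gene lies on the reverse strand, the gene sequence listed in this record would be the reverse complement of the forward-strand interval 323360-327775. The Python sketch below illustrates that relationship and the length arithmetic. The function names and the toy input are illustrative assumptions, and the coordinates are assumed to be 1-based and inclusive, which is consistent with the 4416-base header.

```python
# Sketch only: names are illustrative, not taken from the source database.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string (A/C/G/T only)."""
    return seq.translate(_COMPLEMENT)[::-1]


def gene_length(start: int, end: int) -> int:
    """Length of a gene, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


# Coordinates from this record: 323360..327775, reverse strand.
print(gene_length(323360, 327775))  # 4416
print(reverse_complement("ATGC"))   # GCAT (toy input)
```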

Preceding gene: 222523564

Following gene: 222523562

Centisome position: 6.22
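
The centisome value can be reproduced as the gene's position expressed as a percentage of the chromosome length. The record does not state which coordinate the database uses; the sketch below assumes the 5' end of the gene (327775 on the reverse strand), which matches the listed 6.22. The function name is hypothetical.

```python
def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the chromosome length, rounded to 2 places."""
    return round(100.0 * position / genome_length, 2)


GENOME_LENGTH = 5_268_950  # from the "Length" field of this record

print(centisome(327775, GENOME_LENGTH))  # 6.22 (5' end of the reverse-strand gene)
print(centisome(323360, GENOME_LENGTH))  # 6.14 (other end, does not match the record)
```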

GC content: 64.06
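
GC content is conventionally the percentage of G and C bases in the sequence. A minimal sketch of that calculation follows; whether the database handles ambiguous bases differently is not stated in the record, so the function below simply counts G and C over the full length. The function name and the toy inputs are assumptions.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a DNA string, rounded to 2 places."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# Toy inputs, not the gene sequence from this record.
print(gc_content("ATGC"))    # 50.0
print(gc_content("ATGTCT"))  # 33.33
```

Passing the full gene sequence with line breaks removed would give the figure to compare against the value listed above.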

Gene sequence:

>4416_bases
ATGTCTATGGAACACCTACCACCGTGGTTGCGTAATGTACCCTTGCCTCCACCGCCAACCGACGCCGCTGATGAGACGCC
GGCCTGGTTGCGCGGTATTGATAGTTTTACCCCTCCAGCCCCGACGCCGGCAACAGACCATCCGCCTGCCTGGCTTACTG
AACCTGAACCAGCCTCTGAGTCGGCAGCCGAATCGGCAAGTGTGCCCGACTGGCTGGCCGAATTGCAAGCCGAGGTTGCC
GATCCGCTTGCTGATACCGGTTCCGCCGCTGAATGGTTAGGTGGTGTTGAGGCTGAACCGCCGGCTGAACGCCCTTCAAC
CTTTGGGGCCACCGGCTGGCTGCAAGGGTTGGGGGGTGAGACGTCGACTGGCCCGGTGACACCGTCTGAACCGCCACCAA
CGACGAGTTCGCGCCTGCGGATGCCGGTCGGCCCGACCGATTGGCTGCGCAGTATGGGTCATGAAGATGAGGTTGAGCCA
GCTTCAGCAGAACAGTCGCCAGAACCGGCACCCCCTGATCCAAACGCCGGTGTTCCCGATTGGTTGCGTGAGTTGAGTGA
AGAGGATGTCTCGCAAGCCTTAGCGTCTTTACCTCCGGCAGAGACGACCCCTCCGGTTGAAGCGAAGGAAGCGTCTGAAA
AGACTCCAACCGATTGGCTGGATGAGTTGGCCAGACTGGCCTCAGAGGATGATTGGGCGCGAGCCGGTGCTGCAAATGCT
GAGACGGTTGTCGCGACCGATAGTCCTGACTGGCTCTCCTCGGCTCAGCCACCGCCTGTTGATCCTTCGGCACCGGCTCT
ACCGGCCTGGTTACAGGATGTTGCTGGTGATGAGCCACCTGCGGCACCGGGTTCGGTGCGTCTTGATTTGCCTTCCTGGC
TGCTCGAAGATGAGTCTTCTCTATCGCCGACTTCACCTGCGATTGCACCTGATGCGCCAACGTTGCTGGCCGATAGTCGT
GCTGTTGTCCCCGGGCAGGAAGGTGTCGATCTGTCTGCTCCTTCGAGGTTGACGCCTGCCGGTTCTGAGCCTGAAGCCTC
TGCCGACATCCCAACCTGGCTGCGCGAGGCCGAGGTACCTGCGGCTGATGAATGGCCGGCTGCTGCCGCCAGTGAAGCGC
CAGCGTGGTTGCAGGAAGAAGGGACACCGGCAGAGCGCGCAGGTGACATCCCAACCTGGCTGCGCGAGGCCGAGGCACCT
GCGGCTGATGAATGGCCGGCTGCCGCTGCCAGTGAAGTGCCGGCCTGGCTGCAAGACGAAGCGGCCCCAGCAGAGCGAGA
GGGTGACATCCCAACCTGGCTGCGCGAGGCCGAGGCACCTACTGCCGACGAATGGCCGGCTGCTGCCAGTGAAGCGCCAG
CGTGGCTGCAAGACGAAGCGGCCCCAGCAGAGCGAGAGGGTGACATCCCAACCTGGCTGCGCGAGGCCGAGGCACCTGCT
GCCGACGAATGGCCGGCTGCTGCCAGTGAAGCGCCAGCGTGGCTGCAAGACGAAGCGGCTCCAGCAGAGCGAGAGGGTGA
CATCCCAGCCTGGCTGCGCGAGGCCGAGGCACCTGCGGCTGATGAATGGCCGGCTGCCGCCGCCAGTGAAGCGCCGGCGT
GGTTGCAAGACGAAGCGGCCCCAGCAGAGCGAGAGGGTGACATCCCAACCTGGCTGCGCGAGGCCGAGGCACCTGCGGCT
GATGAATGGCCGGCTGCCGCCGCCAGTGAAGCGCCGGCGTGGTTGCAAGACGAAGCGGCTCCAGCAGAGCGAGAGGGTGA
CATCCCAACCTGGCTGCGCGAGGCCGAGGCACCTGCGGCTGATGAATGGCCGGCTGCCGCCAGTGAAGCGCCGGCGTGGT
TGCAGGAAGAAGGGACACCGGCAGAGCGCGCAGGTGACATCCCAACCTGGCTGCGCGAGGCCGAGGCACCTGCTGCCGAC
GAATGGCCGGCTACCGCCGCCAGTGAAGCGCCGGCGTGGTTGCAAGACGAAGCGGCGCCGGCAGAGCGCGCAGGTGACAT
CCCAACCTGGCTGCGCGAAGCCGAGGCACCTGCGGCTGATGAATGGCCGGCTGCCGCCAGTGAAGCGCCGGCGTGGTTGC
AGGAAGAAGCCGCCCCAGCGACGGGTAGCGACGTACCGGCGTGGTTGCGGGAAGAGGCGGCCCCGGCAGAGCGAGAGGGT
GACATCCCAACCTGGCTGCGCGAGGCCGAGGCCCCCGCTGCCGACGAATGGCCGGCTGCCGCCAGTGAAGCGCCAGCGTG
GTTGCAGGAAGAGGCGGCACCGGCGGCGGGTAGCGACGTACCGGCGTGGTTGCGGGAAGAGGCGGCACCGGCGGCGGGTA
GCGACGTACCGGCGTGGTTGCGGGAAGAGGCCGCTCCAGCGGCTGGTAGCGACGTACCGGCGTGGTTGCGGGAAGAGGCA
GCCCCAGCGGCGGGTAGCGACGTGCCGGCGTGGTTGCGGGAAGAGGCAGCCCCAGCGGCTGGTAGCGACGTACCGGCGTG
GTTGCAGGAAGAGGCCGCTCCAGCGGCGGGTAGCGACGTACCGGCGTGGTTGCGGGAAGAGGCAGCCCCAGCGGCGGGTA
GCGACGTACCGGCGTGGTTGCGGGAAGAGGCGGCACCGGCGGCGGGTAGCGACGTACCGGCGTGGTTGCGGGAAGAGGCG
GCACCGGCGGCGGGTAGCGACGTACCGGCGTGGTTGCGGGAAGAGGCCGCTCCAGCGGCGGGTAGCGACGTACCGGCGTG
GTTGCGGGAAGAGGCCGCTCCAGCGGCGGGTAGCGACGTACCGGCGTGGTTGCGGGAAGAGGCCGCTCCAGCGGCGGGTA
GCGACGTACCGGCGTGGTTGCGGGAAGAGGCCGCTCCAGCGGCGGGTAGCGACGTACCGGCGTGGTTGCGGGAAGAGGCC
GCTCCAGCGGCGGGTAGCGACGTACCGGCGTGGTTGCGGGAAGAGGCGGCTCCAGCGGCGGGTAGCGACGTACCGGCGTG
GTTGCGGGAAGAGGCGGCTCCAGCGGCGGGTGGCGACGTACCGGCGTGGTTGCGGGAAGAGGCCGCTCCAGCGGCGGGTA
GCGACGTACCGGTGTGGTTGCAGGAAGAGTCGGCACCGGCGGCTGGAAGTGATGTTCCGGACTGGCTGCAGCAAGCTTCA
ACTGCTCCGGTTGATACTCCTGATTGGTTGACAGCAGATGTCTCAGCGGAGGCCATCTCCTGGTTACAATCAGCTCCAGC
CGAGAATGCACCAATCGCCGAACCTTCGCTCACTACAAGTTCAACCAGCGACGAATTCTTCAGCGGTGCTGAGCTACCGC
CATGGTTGCGGGCCTCGACCGAACGAGCTGTTGAGCCTGCCATTACTCCCTTGTCTGACTGGCTGGAACGTCTGCGCCAC
CGCGAAGCCGAGGAGGAAGAAGAGGTCGTTGCAGGGCCTGACGTGGTTGTTGTAAAGCCACCACCTCCTGCACCTGCACA
GCGCACCGATGATCAAATTGCCGCTGCTGCATTGCTTGAACGACTCCTGCAATCACCGCTACCGGCAATCCTGCCTGTTG
AACGTCCAGTAATCGTTCGGCGTCGTTTACCGACCCTGGAACAGTGGCTGGCAGTGGTACTGCTGATCGCTGTGTTGATC
GGTATCGCTATTCCCGGTCTCACTGCCACGTTTACGAATCGCGCTCAACCGTCGCCGGTGGCTGTTGCGTTGAACGAACA
ACTTGCCGGTCTGAGCAGTGAGGATGTGGTTTTAGTAGCGTATGAATGGGGTGCTCAGCGTGTCGCTGAGCTGCGTCCAC
TTGAAGATGTGCTGCTCACCAGATTAACTGCGGATCGGACAAAACTGATTATCGTCAGTACCGATCTCCAGGGATCATTG
CTTGCCTTTGATGTGATTGGGCCGTTACGATCAGCCGGCTACAACAACGAAAATGGTGTGATCTTCGGCGGTCGTGATTA
CGTCTTGCTGGGGTATCGTCCAGGCGGTGAACTGGCATTGCGCAGTATGGCGATAGATTTGCGGGCAGAGTTGCGTCGTG
ACTATACCGGCCAGGATGCGACTGCCGGCTTGCTGGCTACGCGCTCAGATGGAACGCCACGCATTCAGTCACTACGTGAT
CTGGCGATGATTGTGGTGATGGCCGATCAGGTTCAGGATGTACAGGCCTGGATGGAGCAGATTCATAGTGCCGCTCCGCA
GGTGCCGATTGCATTCTTGCTCCCGCAAGAGGTCTATCCACAAGTGCTACCCTATACGCGCCTACCCAATGTGTATGCGG
TTGCCGGTCAACGTGGTGCCAGTGATTTGCTGGCTGCCGGAAAGGCTGATAACCTGGCAACACCTGAGCTTTCCTACCAG
ACATGGGCAACGATAGCGTTTGTTGGCGTCCTGCTATTGGGGGCCTTCATTGTTGGGATTGGGCAGTTGCGACGCCTGGC
ACGAGGTAGAGGCTGA

Upstream 100 bases:

>100_bases
TTGGCTCGGCGATTGGGGCGCTGGTGGCCGGAGTCCGTCTGCTAATTGGTTTTGATATGCCGTATGCCGACCGTTGATCC
GGTTGGGTAGCGGCACCCGA

Downstream 100 bases:

>100_bases
TGGACATTGCAATCAACCTTATTCCGATCTTGCTTACGCTACTGGTCTTCAGCCGTGTTCTGGGCGATACACCGGCCTTT
CGGTTGGTCCAATACCTGTT

Product: hypothetical protein

Products: NA

Alternate protein names: None

Number of amino acids: Translated: 1471; Mature: 1470
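
The two residue counts follow from the gene length: 4416 bases are 1472 codons, the last of which is the stop codon (TGA), leaving 1471 translated residues. The mature count appears to assume removal of the initiator methionine, which is consistent with the mature sequence below starting at S. A short sketch of that arithmetic, with a hypothetical function name:

```python
def protein_lengths(gene_bases: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes the sequence includes one stop codon and that the mature
    protein differs only by removal of the initiator Met.
    """
    if gene_bases % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    codons = gene_bases // 3
    translated = codons - 1   # drop the stop codon
    mature = translated - 1   # drop the initiator Met
    return translated, mature


print(protein_lengths(4416))  # (1471, 1470)
```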

Protein sequence:

>1471_residues
MSMEHLPPWLRNVPLPPPPTDAADETPAWLRGIDSFTPPAPTPATDHPPAWLTEPEPASESAAESASVPDWLAELQAEVA
DPLADTGSAAEWLGGVEAEPPAERPSTFGATGWLQGLGGETSTGPVTPSEPPPTTSSRLRMPVGPTDWLRSMGHEDEVEP
ASAEQSPEPAPPDPNAGVPDWLRELSEEDVSQALASLPPAETTPPVEAKEASEKTPTDWLDELARLASEDDWARAGAANA
ETVVATDSPDWLSSAQPPPVDPSAPALPAWLQDVAGDEPPAAPGSVRLDLPSWLLEDESSLSPTSPAIAPDAPTLLADSR
AVVPGQEGVDLSAPSRLTPAGSEPEASADIPTWLREAEVPAADEWPAAAASEAPAWLQEEGTPAERAGDIPTWLREAEAP
AADEWPAAAASEVPAWLQDEAAPAEREGDIPTWLREAEAPTADEWPAAASEAPAWLQDEAAPAEREGDIPTWLREAEAPA
ADEWPAAASEAPAWLQDEAAPAEREGDIPAWLREAEAPAADEWPAAAASEAPAWLQDEAAPAEREGDIPTWLREAEAPAA
DEWPAAAASEAPAWLQDEAAPAEREGDIPTWLREAEAPAADEWPAAASEAPAWLQEEGTPAERAGDIPTWLREAEAPAAD
EWPATAASEAPAWLQDEAAPAERAGDIPTWLREAEAPAADEWPAAASEAPAWLQEEAAPATGSDVPAWLREEAAPAEREG
DIPTWLREAEAPAADEWPAAASEAPAWLQEEAAPAAGSDVPAWLREEAAPAAGSDVPAWLREEAAPAAGSDVPAWLREEA
APAAGSDVPAWLREEAAPAAGSDVPAWLQEEAAPAAGSDVPAWLREEAAPAAGSDVPAWLREEAAPAAGSDVPAWLREEA
APAAGSDVPAWLREEAAPAAGSDVPAWLREEAAPAAGSDVPAWLREEAAPAAGSDVPAWLREEAAPAAGSDVPAWLREEA
APAAGSDVPAWLREEAAPAAGSDVPAWLREEAAPAAGGDVPAWLREEAAPAAGSDVPVWLQEESAPAAGSDVPDWLQQAS
TAPVDTPDWLTADVSAEAISWLQSAPAENAPIAEPSLTTSSTSDEFFSGAELPPWLRASTERAVEPAITPLSDWLERLRH
REAEEEEEVVAGPDVVVVKPPPPAPAQRTDDQIAAAALLERLLQSPLPAILPVERPVIVRRRLPTLEQWLAVVLLIAVLI
GIAIPGLTATFTNRAQPSPVAVALNEQLAGLSSEDVVLVAYEWGAQRVAELRPLEDVLLTRLTADRTKLIIVSTDLQGSL
LAFDVIGPLRSAGYNNENGVIFGGRDYVLLGYRPGGELALRSMAIDLRAELRRDYTGQDATAGLLATRSDGTPRIQSLRD
LAMIVVMADQVQDVQAWMEQIHSAAPQVPIAFLLPQEVYPQVLPYTRLPNVYAVAGQRGASDLLAAGKADNLATPELSYQ
TWATIAFVGVLLLGAFIVGIGQLRRLARGRG

Sequences:

>Mature_1470_residues
SMEHLPPWLRNVPLPPPPTDAADETPAWLRGIDSFTPPAPTPATDHPPAWLTEPEPASESAAESASVPDWLAELQAEVAD
PLADTGSAAEWLGGVEAEPPAERPSTFGATGWLQGLGGETSTGPVTPSEPPPTTSSRLRMPVGPTDWLRSMGHEDEVEPA
SAEQSPEPAPPDPNAGVPDWLRELSEEDVSQALASLPPAETTPPVEAKEASEKTPTDWLDELARLASEDDWARAGAANAE
TVVATDSPDWLSSAQPPPVDPSAPALPAWLQDVAGDEPPAAPGSVRLDLPSWLLEDESSLSPTSPAIAPDAPTLLADSRA
VVPGQEGVDLSAPSRLTPAGSEPEASADIPTWLREAEVPAADEWPAAAASEAPAWLQEEGTPAERAGDIPTWLREAEAPA
ADEWPAAAASEVPAWLQDEAAPAEREGDIPTWLREAEAPTADEWPAAASEAPAWLQDEAAPAEREGDIPTWLREAEAPAA
DEWPAAASEAPAWLQDEAAPAEREGDIPAWLREAEAPAADEWPAAAASEAPAWLQDEAAPAEREGDIPTWLREAEAPAAD
EWPAAAASEAPAWLQDEAAPAEREGDIPTWLREAEAPAADEWPAAASEAPAWLQEEGTPAERAGDIPTWLREAEAPAADE
WPATAASEAPAWLQDEAAPAERAGDIPTWLREAEAPAADEWPAAASEAPAWLQEEAAPATGSDVPAWLREEAAPAEREGD
IPTWLREAEAPAADEWPAAASEAPAWLQEEAAPAAGSDVPAWLREEAAPAAGSDVPAWLREEAAPAAGSDVPAWLREEAA
PAAGSDVPAWLREEAAPAAGSDVPAWLQEEAAPAAGSDVPAWLREEAAPAAGSDVPAWLREEAAPAAGSDVPAWLREEAA
PAAGSDVPAWLREEAAPAAGSDVPAWLREEAAPAAGSDVPAWLREEAAPAAGSDVPAWLREEAAPAAGSDVPAWLREEAA
PAAGSDVPAWLREEAAPAAGSDVPAWLREEAAPAAGGDVPAWLREEAAPAAGSDVPVWLQEESAPAAGSDVPDWLQQAST
APVDTPDWLTADVSAEAISWLQSAPAENAPIAEPSLTTSSTSDEFFSGAELPPWLRASTERAVEPAITPLSDWLERLRHR
EAEEEEEVVAGPDVVVVKPPPPAPAQRTDDQIAAAALLERLLQSPLPAILPVERPVIVRRRLPTLEQWLAVVLLIAVLIG
IAIPGLTATFTNRAQPSPVAVALNEQLAGLSSEDVVLVAYEWGAQRVAELRPLEDVLLTRLTADRTKLIIVSTDLQGSLL
AFDVIGPLRSAGYNNENGVIFGGRDYVLLGYRPGGELALRSMAIDLRAELRRDYTGQDATAGLLATRSDGTPRIQSLRDL
AMIVVMADQVQDVQAWMEQIHSAAPQVPIAFLLPQEVYPQVLPYTRLPNVYAVAGQRGASDLLAAGKADNLATPELSYQT
WATIAFVGVLLLGAFIVGIGQLRRLARGRG

Specific function: Unknown

COG id: COG5164

COG function: function code K; Transcription elongation factor

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 154798; Mature: 154667
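
The difference between the two values (131) corresponds to the average mass of one methionine residue, about 131.19 Da, which is consistent with the mature form lacking only the initiator Met. A small sketch of that check; the constant and function names are illustrative, and the masses are assumed to be average masses in daltons.

```python
# Average mass of a methionine residue in a peptide chain
# (free methionine, 149.21 Da, minus one water, 18.02 Da).
MET_RESIDUE_MASS = 131.19


def mature_mass(translated_mass: float) -> float:
    """Mass after removing the N-terminal Met residue (assumed processing)."""
    return translated_mass - MET_RESIDUE_MASS


print(round(mature_mass(154798)))  # 154667
```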

Theoretical pI: Translated: 3.74; Mature: 3.74

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.0 %Cys     (Translated Protein)
0.5 %Met     (Translated Protein)
0.5 %Cys+Met (Translated Protein)
0.0 %Cys     (Mature Protein)
0.5 %Met     (Mature Protein)
0.5 %Cys+Met (Mature Protein)
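
These percentages are residue counts divided by the protein length. The sketch below shows the calculation on a toy peptide; the function name and rounding to one decimal place are assumptions chosen to mirror the formatting above.

```python
def cys_met_content(protein: str) -> dict[str, float]:
    """Percent Cys, Met and Cys+Met in a one-letter protein sequence."""
    protein = protein.upper()
    if not protein:
        raise ValueError("empty sequence")
    n = len(protein)
    cys = 100.0 * protein.count("C") / n
    met = 100.0 * protein.count("M") / n
    return {
        "Cys": round(cys, 1),
        "Met": round(met, 1),
        "Cys+Met": round(cys + met, 1),
    }


# Toy input: the first 10 residues of the translated protein.
print(cys_met_content("MSMEHLPPWL"))
# {'Cys': 0.0, 'Met': 20.0, 'Cys+Met': 20.0}
```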

Secondary structure:

>Translated Secondary Structure
MSMEHLPPWLRNVPLPPPPTDAADETPAWLRGIDSFTPPAPTPATDHPPAWLTEPEPASE
CCCCCCCHHHHCCCCCCCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCH
SAAESASVPDWLAELQAEVADPLADTGSAAEWLGGVEAEPPAERPSTFGATGWLQGLGGE
HHHHHCCCHHHHHHHHHHHHHHHHCCCCHHHHHCCCCCCCCCCCCCCCCHHHHHHHCCCC
TSTGPVTPSEPPPTTSSRLRMPVGPTDWLRSMGHEDEVEPASAEQSPEPAPPDPNAGVPD
CCCCCCCCCCCCCCCCCCEECCCCCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHH
WLRELSEEDVSQALASLPPAETTPPVEAKEASEKTPTDWLDELARLASEDDWARAGAANA
HHHHHHHHHHHHHHHHCCCCCCCCCCCHHHCCCCCCHHHHHHHHHHHCCCHHHHCCCCCC
ETVVATDSPDWLSSAQPPPVDPSAPALPAWLQDVAGDEPPAAPGSVRLDLPSWLLEDESS
CEEEECCCCHHHHCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCEEECCHHHHHCCCCC
LSPTSPAIAPDAPTLLADSRAVVPGQEGVDLSAPSRLTPAGSEPEASADIPTWLREAEVP
CCCCCCCCCCCCCHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCC
AADEWPAAAASEAPAWLQEEGTPAERAGDIPTWLREAEAPAADEWPAAAASEVPAWLQDE
CCCCCCCHHCCCCCHHHHCCCCCHHHCCCCHHHHHHHCCCCCCCCCCHHHHHCCHHHHHC
AAPAEREGDIPTWLREAEAPTADEWPAAASEAPAWLQDEAAPAEREGDIPTWLREAEAPA
CCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHHHCCCC
ADEWPAAASEAPAWLQDEAAPAEREGDIPAWLREAEAPAADEWPAAAASEAPAWLQDEAA
CCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHHHCCCCCCCCCCHHCCCCCCHHCCCCC
PAEREGDIPTWLREAEAPAADEWPAAAASEAPAWLQDEAAPAEREGDIPTWLREAEAPAA
CCCCCCCCHHHHHHHCCCCCCCCCCHHCCCCCCHHCCCCCCCCCCCCCHHHHHHHCCCCC
DEWPAAASEAPAWLQEEGTPAERAGDIPTWLREAEAPAADEWPATAASEAPAWLQDEAAP
CCCCCCCCCCCHHHHCCCCCHHHCCCCHHHHHHHCCCCCCCCCCCCCCCCCCHHHCCCCC
AERAGDIPTWLREAEAPAADEWPAAASEAPAWLQEEAAPATGSDVPAWLREEAAPAEREG
HHHCCCCHHHHHHHCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCHHHHHHHCCCCCCCC
DIPTWLREAEAPAADEWPAAASEAPAWLQEEAAPAAGSDVPAWLREEAAPAAGSDVPAWL
CCHHHHHHHCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCHHHHHHHCCCCCCCCCHHHH
REEAAPAAGSDVPAWLREEAAPAAGSDVPAWLREEAAPAAGSDVPAWLQEEAAPAAGSDV
HHHCCCCCCCCCHHHHHHHCCCCCCCCCHHHHHHHCCCCCCCCCHHHHHHHCCCCCCCCC
PAWLREEAAPAAGSDVPAWLREEAAPAAGSDVPAWLREEAAPAAGSDVPAWLREEAAPAA
HHHHHHHCCCCCCCCCHHHHHHHCCCCCCCCCHHHHHHHCCCCCCCCCHHHHHHHCCCCC
GSDVPAWLREEAAPAAGSDVPAWLREEAAPAAGSDVPAWLREEAAPAAGSDVPAWLREEA
CCCCHHHHHHHCCCCCCCCCHHHHHHHCCCCCCCCCHHHHHHHCCCCCCCCCHHHHHHHC
APAAGSDVPAWLREEAAPAAGSDVPAWLREEAAPAAGGDVPAWLREEAAPAAGSDVPVWL
CCCCCCCCHHHHHHHCCCCCCCCCHHHHHHHCCCCCCCCCCHHHHHHCCCCCCCCCEEEE
QEESAPAAGSDVPDWLQQASTAPVDTPDWLTADVSAEAISWLQSAPAENAPIAEPSLTTS
ECCCCCCCCCCCHHHHHHHCCCCCCCCCCEECCHHHHHHHHHHHCCCCCCCCCCCCCCCC
STSDEFFSGAELPPWLRASTERAVEPAITPLSDWLERLRHREAEEEEEVVAGPDVVVVKP
CCCHHHHCCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCHHHHHHCCCCCEEEECC
PPPAPAQRTDDQIAAAALLERLLQSPLPAILPVERPVIVRRRLPTLEQWLAVVLLIAVLI
CCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHCCCCHHHHHHHHHHHHHHH
GIAIPGLTATFTNRAQPSPVAVALNEQLAGLSSEDVVLVAYEWGAQRVAELRPLEDVLLT
HHHCCCCHHHHCCCCCCCCEEEEECHHHCCCCCCCEEEEEECCCHHHHHHHCCHHHHHHH
RLTADRTKLIIVSTDLQGSLLAFDVIGPLRSAGYNNENGVIFGGRDYVLLGYRPGGELAL
HHCCCCEEEEEEECCCCCCEEHHHHHHHHHHCCCCCCCCEEECCCCEEEEEECCCHHHHH
RSMAIDLRAELRRDYTGQDATAGLLATRSDGTPRIQSLRDLAMIVVMADQVQDVQAWMEQ
HHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH
IHSAAPQVPIAFLLPQEVYPQVLPYTRLPNVYAVAGQRGASDLLAAGKADNLATPELSYQ
HHHHCCCCCEEEECCHHHHHHHCCCCCCCCEEEECCCCCHHHHHHCCCCCCCCCCCCCHH
TWATIAFVGVLLLGAFIVGIGQLRRLARGRG
HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
>Mature Secondary Structure
SMEHLPPWLRNVPLPPPPTDAADETPAWLRGIDSFTPPAPTPATDHPPAWLTEPEPASE
CCCCCCHHHHCCCCCCCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCH
SAAESASVPDWLAELQAEVADPLADTGSAAEWLGGVEAEPPAERPSTFGATGWLQGLGGE
HHHHHCCCHHHHHHHHHHHHHHHHCCCCHHHHHCCCCCCCCCCCCCCCCHHHHHHHCCCC
TSTGPVTPSEPPPTTSSRLRMPVGPTDWLRSMGHEDEVEPASAEQSPEPAPPDPNAGVPD
CCCCCCCCCCCCCCCCCCEECCCCCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHH
WLRELSEEDVSQALASLPPAETTPPVEAKEASEKTPTDWLDELARLASEDDWARAGAANA
HHHHHHHHHHHHHHHHCCCCCCCCCCCHHHCCCCCCHHHHHHHHHHHCCCHHHHCCCCCC
ETVVATDSPDWLSSAQPPPVDPSAPALPAWLQDVAGDEPPAAPGSVRLDLPSWLLEDESS
CEEEECCCCHHHHCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCEEECCHHHHHCCCCC
LSPTSPAIAPDAPTLLADSRAVVPGQEGVDLSAPSRLTPAGSEPEASADIPTWLREAEVP
CCCCCCCCCCCCCHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCC
AADEWPAAAASEAPAWLQEEGTPAERAGDIPTWLREAEAPAADEWPAAAASEVPAWLQDE
CCCCCCCHHCCCCCHHHHCCCCCHHHCCCCHHHHHHHCCCCCCCCCCHHHHHCCHHHHHC
AAPAEREGDIPTWLREAEAPTADEWPAAASEAPAWLQDEAAPAEREGDIPTWLREAEAPA
CCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHHHCCCC
ADEWPAAASEAPAWLQDEAAPAEREGDIPAWLREAEAPAADEWPAAAASEAPAWLQDEAA
CCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHHHCCCCCCCCCCHHCCCCCCHHCCCCC
PAEREGDIPTWLREAEAPAADEWPAAAASEAPAWLQDEAAPAEREGDIPTWLREAEAPAA
CCCCCCCCHHHHHHHCCCCCCCCCCHHCCCCCCHHCCCCCCCCCCCCCHHHHHHHCCCCC
DEWPAAASEAPAWLQEEGTPAERAGDIPTWLREAEAPAADEWPATAASEAPAWLQDEAAP
CCCCCCCCCCCHHHHCCCCCHHHCCCCHHHHHHHCCCCCCCCCCCCCCCCCCHHHCCCCC
AERAGDIPTWLREAEAPAADEWPAAASEAPAWLQEEAAPATGSDVPAWLREEAAPAEREG
HHHCCCCHHHHHHHCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCHHHHHHHCCCCCCCC
DIPTWLREAEAPAADEWPAAASEAPAWLQEEAAPAAGSDVPAWLREEAAPAAGSDVPAWL
CCHHHHHHHCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCHHHHHHHCCCCCCCCCHHHH
REEAAPAAGSDVPAWLREEAAPAAGSDVPAWLREEAAPAAGSDVPAWLQEEAAPAAGSDV
HHHCCCCCCCCCHHHHHHHCCCCCCCCCHHHHHHHCCCCCCCCCHHHHHHHCCCCCCCCC
PAWLREEAAPAAGSDVPAWLREEAAPAAGSDVPAWLREEAAPAAGSDVPAWLREEAAPAA
HHHHHHHCCCCCCCCCHHHHHHHCCCCCCCCCHHHHHHHCCCCCCCCCHHHHHHHCCCCC
GSDVPAWLREEAAPAAGSDVPAWLREEAAPAAGSDVPAWLREEAAPAAGSDVPAWLREEA
CCCCHHHHHHHCCCCCCCCCHHHHHHHCCCCCCCCCHHHHHHHCCCCCCCCCHHHHHHHC
APAAGSDVPAWLREEAAPAAGSDVPAWLREEAAPAAGGDVPAWLREEAAPAAGSDVPVWL
CCCCCCCCHHHHHHHCCCCCCCCCHHHHHHHCCCCCCCCCCHHHHHHCCCCCCCCCEEEE
QEESAPAAGSDVPDWLQQASTAPVDTPDWLTADVSAEAISWLQSAPAENAPIAEPSLTTS
ECCCCCCCCCCCHHHHHHHCCCCCCCCCCEECCHHHHHHHHHHHCCCCCCCCCCCCCCCC
STSDEFFSGAELPPWLRASTERAVEPAITPLSDWLERLRHREAEEEEEVVAGPDVVVVKP
CCCHHHHCCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCHHHHHHCCCCCEEEECC
PPPAPAQRTDDQIAAAALLERLLQSPLPAILPVERPVIVRRRLPTLEQWLAVVLLIAVLI
CCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHCCCCHHHHHHHHHHHHHHH
GIAIPGLTATFTNRAQPSPVAVALNEQLAGLSSEDVVLVAYEWGAQRVAELRPLEDVLLT
HHHCCCCHHHHCCCCCCCCEEEEECHHHCCCCCCCEEEEEECCCHHHHHHHCCHHHHHHH
RLTADRTKLIIVSTDLQGSLLAFDVIGPLRSAGYNNENGVIFGGRDYVLLGYRPGGELAL
HHCCCCEEEEEEECCCCCCEEHHHHHHHHHHCCCCCCCCEEECCCCEEEEEECCCHHHHH
RSMAIDLRAELRRDYTGQDATAGLLATRSDGTPRIQSLRDLAMIVVMADQVQDVQAWMEQ
HHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH
IHSAAPQVPIAFLLPQEVYPQVLPYTRLPNVYAVAGQRGASDLLAAGKADNLATPELSYQ
HHHHCCCCCEEEECCHHHHHHHCCCCCCCCEEEECCCCCHHHHHHCCCCCCCCCCCCCHH
TWATIAFVGVLLLGAFIVGIGQLRRLARGRG
HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA