Definition Mesorhizobium loti MAFF303099 plasmid pMLb, complete sequence.
Accession NC_002682
Length 208,315

Click here to switch to the map view.

The map label for this gene is 13488538

Identifier: 13488538

GI number: 13488538

Start: 162490

End: 166881

Strand: Direct

Name: 13488538

Synonym: mlr9704

Alternate gene names: NA

Gene position: 162490-166881 (Clockwise)

Preceding gene: 13488537

Following gene: 13488539

Centisome position: 78.0

GC content: 65.21

Gene sequence:

>4392_bases
GTGTCCGGTCGGGGCGGGTTGAGCCCGTGCGGTCAAAGAGAGCGCCGGGGGCTTTTCCCGGTCTCGCTCTCCAGAGGATC
TCCAATGAATTTCTTGTCCCCGACAGCAACCGGCGATTCCGCCACGCCCGTCGATCCGGCCGCGGCAGTTTGTTCCGCAG
CCCTGCGTCTGCTCCCCCACCTCGAATGTGGCCGGCGCGTCGATGCCGTTGTCCTGCGAAGCGCGATGGAAGCCTCATTC
GGAGCCTCGGACACAAGCGGTGCCTGGAACTGGGCAATGGCGTATGACGCCTGCGAGGCCGCGACCGTCCTGTTCCTTCG
CAAATACGGCAGGTTGCTTCTGCGCAAGGCCGGCTCCATGGCGTCCGCCCAACCGCTCTTTAGCCGGATAGCGGACCTTC
TGCCCACGCATACCCGCCGCTCCGAGGAAAGCCAGGCGCTGCAGCAGTTCTCAACGCCTATCCCGCTGGGTCTCGCAGCC
GTCTCGGCGGCGGCCATAACGGCATCCGACCGCGTGCTAGAGCCTTCCGCCGGTACTGGTCTGCTCGCTGTTCTCGCCGA
GACGTCGGGCGGGTCGCTTGTCCTGAACGAGCTTGCGGAGGTTCGTGCGGGTCTTCTGTCTGTTCTCTTCCCGGGCCTTA
CCGTCACGCGGTTCGACGCGGCCCAGATCGATGATTATCTCGATCCCGCCAGCTTGCCCACCGTCGTGCTGATGAATCCG
CCGTTCTCGGTCCTCGCCAACGTACATGGTCGCGTCGCGGATGCGGCCTATCGCCATGTCGCCTCGGCACTCGCCCGGCT
TGCCGCCGGGGGGCGGCTGGTCACCATCACCAATGCGAGTTTCGGCCCGGAGACCCCGGCCTGGCGTAACGCCTTCGGGC
GCCTCCAGGAGCAGGGTCGTGTCGTTTTCACGGCCACGATCAGCGGCGCCGTCTATGCCAAGCATGGCACGACGATCGAC
ACCCGACTGACGGTCATCGACAAGACGCTGGCTGGCGATACGTCCTTCATTCCGGGCACGCCCGGCTTCGCGCCGGATGT
CGCCACGTTGCTCAGCTGGATCGAGAAGCATGTCCCGGCACGTCCGCCGATCGCCGAAGCGGCGAAGGTCGTTCCCGCCA
TCACCGCGCCCCGGACGGTTCGCGGTTATCTCGCCCGCCCTGTCGCAGGCTCAGTTCACCGGCAGCTGTCCGAGCCTGTC
GGCGTTGAACTGTCGTACGAGACGGTCGATTGGGCACCGGACGAGGGAGCGTGGCTGACCGACGCGATCTATGAAGACTA
TGGATTGCAGACGATCCGGATTCCCGGATCACAGGCGCACCCGACCAGGCTGGTCCAGTCCGCGGCGATGGCGTCGGTCG
CGCCGCCAAAGCCCACTTATCGGCCGACCCTGCCCAACCATATCTTGACCTCGCTCTCCGACGCGCAGCTCGAAACGGTG
ATCCTGGCCGGCGAGGCCCATTGCGGGTTTCTCACCGGATCGTGGTCAGTCGACCACACGCTCGATCTTGTCACCGCAGC
ACCCGAGGAGGCGCCAACAGCGGTGCGCTTCCGGCGTGGCTTCTTCATCGGCGACGGAACCGGCGTCGGCAAGGGCCGGC
AGTCGGCGAGCATTGTGCTCGACAACTGGCTGCAAGGCCGGCGCAAGGCCGTGTGGATTTCCAAATCAGACAAGCTGCTT
GAGGACGCGCAGCGCGACTGGGCCGCGCTCGGCACGGAGCGTCTGCTGGTCACGCCGCTCTCACGCTTCCCGCAGGGACA
TCCCGTCACGCTTCCAGAAGGCATCCTGTTTACAACCTATGCCACGCTACGCTCCGACGACCGTGGCGAGAAGGTTTCGC
GCGTCAGGCAGATCGTCGAATGGTTGGGCTCGGATTTCGACGGAGTGCTCATATTCGACGAGGCGCACGCCATGCAGAAT
GCCGGCGGCGGCAAGGGAGAACGTGGCGACGTCGCGCCCTCACAGCAAGGTCGCGCCGGCTTGCGCCTGCAACATGCGTT
GCCTGGCGCCCGTGTCGTCTATGTCTCCGCCACCGGCGCGACCACCGTCGCCAATCTGGCCTACGCCCAGAGGCTTGGCC
TCTGGGGTAGCGAGGACTTCCCGTTCTCGACCCGAGCGGAGTTCGTCGAGGCGATAGAGGCGGGTGGCGTGGCGGCAATG
GAGGTGCTCGCCCGGGACCTTCGCGCGCTCGGCCTCTACACGGCACGCTCGCTCTCCTTCGACGGCGTCGAATATGAACT
GGTTGAGCATGAACTGACGCTTGAGCAGCGGCGCATCTACGATGCCTACGCGGGCGCCTTCGCCGTCATTCACAATCATC
TCGACGCGGCCATGCGGGCCGCCAACATCACCGGCGACAGCGGCACGTTGAACCGCCAGGCCAAGTCCGCCGCGCGCTCG
GCCTTCGAAAGCGCCAAGCAGCGCTTCTTCGGTCACCTGCTGACGTCGATGAAGACGCCGACGCTGATCCGCTCGATCGA
GCAGGATCTCCAGTCCGGGCATTCGAGCGTTATTCAGATCGTCTCCACCGGCGAGGCGCTGATGGAACGCCGGCTGGCGG
AGGTGCCGACGCAGGAATGGAACGATGTCCGCGTCGACATCACGCCGCGCGAATATGTGCTCGATTACCTCGAGCATTCC
TTTCCAGTTCAACTCTACGAGCCCTTTACGGATTCGGAAGGGAACCTGTCGTCGAGGCCGGTGTTTCGCGACGGCCAGCC
CGTTGAAAGCCGCGAGGCGCTCGCGCGCCGCACGGCGCTGATTGAGAAGCTGGCGAGCTTGCCACCGGTTCCTGGGGCGC
TCGACCAGATCGTCCAGCTCTTCGGCACCGACATGGTCGCGGAGGTGACGGGTCGGTCGCGACGCATCGTCAGGAAGGGC
GAACGCCTGATGGTCGAGGGCCGCGCAGCCTCGGCCAATCTCGCCGAGACGCAGGCCTTCATGGATGACGTCAAGCGCAT
CCTGGTGTTCAGCGAGGCCGGCGGGACCGGCCGAAGCTACCATGCGGAGCTTTCGGCACGGAACACACGTCTGCGCGTCC
ACTATCTTCTTGAGCCTGGCTGGAAGGCCGATACCGCCATCCAGGGGCTTGGCCGCACGCATCGGACCAACCAGGCGCAG
CCGCCGCTGTTCCGGCCGATCGCCACCAATGTGAAAGCCGAGAAACGCTTCCTCTCCACGATTGCCCGGCGTCTCGATAC
GCTGGGCGCGATCACGCGAGGCCAGCGCCAGACCGGCGGGCAGGGCCTGTTCCGGCCGGAGGACAATCTGGAATCGCATT
ATGCGCGCGATGCGCTGCGCCAGCTTTATCTGCTGATCGTCCGCGGCAAGGTGGAGGGCTGCTCACTCAAGCTGTTCGAG
CAGACCACAGGGCTGACGCTGACGGACGAGAACGGCATCAAGGACGAGTTGCCGCCGGTCACCACTTTCCTCAATCGCCT
GCTGGCGCTTACCATCGAACTGCAGGACATCCTGTTCGCGGCTTTCGACCAGTTGCTCACAGCCAGGGTGCAAGGCGCCA
TCGCGGCTGGGGTTTATGATGTTGGGCTCGAAACGCTGCGAGCGGAGAGCTTTGTCGTCACCGATCGGCGGGCGATCTAC
ACCCATCCGGGCACGGGCGCCGAAACGCAGCTCCTCACCATCGATCAGCGCCAGAGCAACCGACCCGTGCCTGTCGAGGA
AGCTGTCGCCCAGCTCGACGACCAGCGCGCCATCCTGCTGGTCAACGAACGCTCAGGGCGTGCTGCCCTGCAGGTCCCCG
CGCCGTCGTTCATGCTCGATGATGGCGAAATCGAACGGCGGGTCAGGTTGATCCGGCCGATGGAGCAACACCATGCCTCC
CTGCGTATGATGGGGGACAGTCATTGGCAGCAGGCTGATCGGGAGACTTTCGCCGCCGCCTGGAGTGCGGAGGTCGCGGG
CGTCCCGGCATTCTCGGACTCGACCATCCACATCGTCGCGGGGCTGCTGTTGCCGATCTGGAAGCGACTACCAAACGAAT
CGAGCCGGGTCTATCGGCTCCAGACCGACGAGGGGGAGCGCATCATCGGACGCAGGGTTTCGGCCGCATGGGCCGCCGGC
GCGCTCGCAACCGGCGTCAGCACCCTTACGGCACGAGATGCCTTCAACGCCCTTGCGGACGGACGAACGATGCTCGACCT
CGCCGAGGGACTTCATCTGCGCCGCGTTCGTGTCATGGGCGCCAATCGCATCGAGCTGTCGGGCTTCACCGATAGCATGC
GCGAGCGGCTGACGGCCTACGGGCTTTTCCACGAGATCATCTCCTGGAAGCTGCGGATGTTCGTGCCTGCGGACGCGAGT
GGACCTGCGGTGCTCGCCAAGGTCATCGAGCGGTATCCCATACACCGCATCAGCGAGAAGGAGGCCACCTGA

Upstream 100 bases:

>100_bases
AAAGCAGCTCGACGCTGCCGAGTAGGCCCGGCGCCACTCGCCATCCGAACGCCTGGTACTCGCAAGAGTGCCGGGCGTTT
TTTGTTCTGTCTGAGAGTGA

Downstream 100 bases:

>100_bases
TGTCGCCTTGCGATGCATCGGAACTAGCAAGCCGTCTCGCCCATCGGGCGGAGGCGGTCTGCCGCCGCTACCTTTCCAAC
GGACGCCGTGAGGGCCGCTA

Product: methylase/helicase

Products: NA

Alternate protein names: Probably Methylase/Helicase; Helicase Domain Protein; Helicase

Number of amino acids: Translated: 1463; Mature: 1462

Protein sequence:

>1463_residues
MSGRGGLSPCGQRERRGLFPVSLSRGSPMNFLSPTATGDSATPVDPAAAVCSAALRLLPHLECGRRVDAVVLRSAMEASF
GASDTSGAWNWAMAYDACEAATVLFLRKYGRLLLRKAGSMASAQPLFSRIADLLPTHTRRSEESQALQQFSTPIPLGLAA
VSAAAITASDRVLEPSAGTGLLAVLAETSGGSLVLNELAEVRAGLLSVLFPGLTVTRFDAAQIDDYLDPASLPTVVLMNP
PFSVLANVHGRVADAAYRHVASALARLAAGGRLVTITNASFGPETPAWRNAFGRLQEQGRVVFTATISGAVYAKHGTTID
TRLTVIDKTLAGDTSFIPGTPGFAPDVATLLSWIEKHVPARPPIAEAAKVVPAITAPRTVRGYLARPVAGSVHRQLSEPV
GVELSYETVDWAPDEGAWLTDAIYEDYGLQTIRIPGSQAHPTRLVQSAAMASVAPPKPTYRPTLPNHILTSLSDAQLETV
ILAGEAHCGFLTGSWSVDHTLDLVTAAPEEAPTAVRFRRGFFIGDGTGVGKGRQSASIVLDNWLQGRRKAVWISKSDKLL
EDAQRDWAALGTERLLVTPLSRFPQGHPVTLPEGILFTTYATLRSDDRGEKVSRVRQIVEWLGSDFDGVLIFDEAHAMQN
AGGGKGERGDVAPSQQGRAGLRLQHALPGARVVYVSATGATTVANLAYAQRLGLWGSEDFPFSTRAEFVEAIEAGGVAAM
EVLARDLRALGLYTARSLSFDGVEYELVEHELTLEQRRIYDAYAGAFAVIHNHLDAAMRAANITGDSGTLNRQAKSAARS
AFESAKQRFFGHLLTSMKTPTLIRSIEQDLQSGHSSVIQIVSTGEALMERRLAEVPTQEWNDVRVDITPREYVLDYLEHS
FPVQLYEPFTDSEGNLSSRPVFRDGQPVESREALARRTALIEKLASLPPVPGALDQIVQLFGTDMVAEVTGRSRRIVRKG
ERLMVEGRAASANLAETQAFMDDVKRILVFSEAGGTGRSYHAELSARNTRLRVHYLLEPGWKADTAIQGLGRTHRTNQAQ
PPLFRPIATNVKAEKRFLSTIARRLDTLGAITRGQRQTGGQGLFRPEDNLESHYARDALRQLYLLIVRGKVEGCSLKLFE
QTTGLTLTDENGIKDELPPVTTFLNRLLALTIELQDILFAAFDQLLTARVQGAIAAGVYDVGLETLRAESFVVTDRRAIY
THPGTGAETQLLTIDQRQSNRPVPVEEAVAQLDDQRAILLVNERSGRAALQVPAPSFMLDDGEIERRVRLIRPMEQHHAS
LRMMGDSHWQQADRETFAAAWSAEVAGVPAFSDSTIHIVAGLLLPIWKRLPNESSRVYRLQTDEGERIIGRRVSAAWAAG
ALATGVSTLTARDAFNALADGRTMLDLAEGLHLRRVRVMGANRIELSGFTDSMRERLTAYGLFHEIISWKLRMFVPADAS
GPAVLAKVIERYPIHRISEKEAT

Sequences:

>Translated_1463_residues
MSGRGGLSPCGQRERRGLFPVSLSRGSPMNFLSPTATGDSATPVDPAAAVCSAALRLLPHLECGRRVDAVVLRSAMEASF
GASDTSGAWNWAMAYDACEAATVLFLRKYGRLLLRKAGSMASAQPLFSRIADLLPTHTRRSEESQALQQFSTPIPLGLAA
VSAAAITASDRVLEPSAGTGLLAVLAETSGGSLVLNELAEVRAGLLSVLFPGLTVTRFDAAQIDDYLDPASLPTVVLMNP
PFSVLANVHGRVADAAYRHVASALARLAAGGRLVTITNASFGPETPAWRNAFGRLQEQGRVVFTATISGAVYAKHGTTID
TRLTVIDKTLAGDTSFIPGTPGFAPDVATLLSWIEKHVPARPPIAEAAKVVPAITAPRTVRGYLARPVAGSVHRQLSEPV
GVELSYETVDWAPDEGAWLTDAIYEDYGLQTIRIPGSQAHPTRLVQSAAMASVAPPKPTYRPTLPNHILTSLSDAQLETV
ILAGEAHCGFLTGSWSVDHTLDLVTAAPEEAPTAVRFRRGFFIGDGTGVGKGRQSASIVLDNWLQGRRKAVWISKSDKLL
EDAQRDWAALGTERLLVTPLSRFPQGHPVTLPEGILFTTYATLRSDDRGEKVSRVRQIVEWLGSDFDGVLIFDEAHAMQN
AGGGKGERGDVAPSQQGRAGLRLQHALPGARVVYVSATGATTVANLAYAQRLGLWGSEDFPFSTRAEFVEAIEAGGVAAM
EVLARDLRALGLYTARSLSFDGVEYELVEHELTLEQRRIYDAYAGAFAVIHNHLDAAMRAANITGDSGTLNRQAKSAARS
AFESAKQRFFGHLLTSMKTPTLIRSIEQDLQSGHSSVIQIVSTGEALMERRLAEVPTQEWNDVRVDITPREYVLDYLEHS
FPVQLYEPFTDSEGNLSSRPVFRDGQPVESREALARRTALIEKLASLPPVPGALDQIVQLFGTDMVAEVTGRSRRIVRKG
ERLMVEGRAASANLAETQAFMDDVKRILVFSEAGGTGRSYHAELSARNTRLRVHYLLEPGWKADTAIQGLGRTHRTNQAQ
PPLFRPIATNVKAEKRFLSTIARRLDTLGAITRGQRQTGGQGLFRPEDNLESHYARDALRQLYLLIVRGKVEGCSLKLFE
QTTGLTLTDENGIKDELPPVTTFLNRLLALTIELQDILFAAFDQLLTARVQGAIAAGVYDVGLETLRAESFVVTDRRAIY
THPGTGAETQLLTIDQRQSNRPVPVEEAVAQLDDQRAILLVNERSGRAALQVPAPSFMLDDGEIERRVRLIRPMEQHHAS
LRMMGDSHWQQADRETFAAAWSAEVAGVPAFSDSTIHIVAGLLLPIWKRLPNESSRVYRLQTDEGERIIGRRVSAAWAAG
ALATGVSTLTARDAFNALADGRTMLDLAEGLHLRRVRVMGANRIELSGFTDSMRERLTAYGLFHEIISWKLRMFVPADAS
GPAVLAKVIERYPIHRISEKEAT
>Mature_1462_residues
SGRGGLSPCGQRERRGLFPVSLSRGSPMNFLSPTATGDSATPVDPAAAVCSAALRLLPHLECGRRVDAVVLRSAMEASFG
ASDTSGAWNWAMAYDACEAATVLFLRKYGRLLLRKAGSMASAQPLFSRIADLLPTHTRRSEESQALQQFSTPIPLGLAAV
SAAAITASDRVLEPSAGTGLLAVLAETSGGSLVLNELAEVRAGLLSVLFPGLTVTRFDAAQIDDYLDPASLPTVVLMNPP
FSVLANVHGRVADAAYRHVASALARLAAGGRLVTITNASFGPETPAWRNAFGRLQEQGRVVFTATISGAVYAKHGTTIDT
RLTVIDKTLAGDTSFIPGTPGFAPDVATLLSWIEKHVPARPPIAEAAKVVPAITAPRTVRGYLARPVAGSVHRQLSEPVG
VELSYETVDWAPDEGAWLTDAIYEDYGLQTIRIPGSQAHPTRLVQSAAMASVAPPKPTYRPTLPNHILTSLSDAQLETVI
LAGEAHCGFLTGSWSVDHTLDLVTAAPEEAPTAVRFRRGFFIGDGTGVGKGRQSASIVLDNWLQGRRKAVWISKSDKLLE
DAQRDWAALGTERLLVTPLSRFPQGHPVTLPEGILFTTYATLRSDDRGEKVSRVRQIVEWLGSDFDGVLIFDEAHAMQNA
GGGKGERGDVAPSQQGRAGLRLQHALPGARVVYVSATGATTVANLAYAQRLGLWGSEDFPFSTRAEFVEAIEAGGVAAME
VLARDLRALGLYTARSLSFDGVEYELVEHELTLEQRRIYDAYAGAFAVIHNHLDAAMRAANITGDSGTLNRQAKSAARSA
FESAKQRFFGHLLTSMKTPTLIRSIEQDLQSGHSSVIQIVSTGEALMERRLAEVPTQEWNDVRVDITPREYVLDYLEHSF
PVQLYEPFTDSEGNLSSRPVFRDGQPVESREALARRTALIEKLASLPPVPGALDQIVQLFGTDMVAEVTGRSRRIVRKGE
RLMVEGRAASANLAETQAFMDDVKRILVFSEAGGTGRSYHAELSARNTRLRVHYLLEPGWKADTAIQGLGRTHRTNQAQP
PLFRPIATNVKAEKRFLSTIARRLDTLGAITRGQRQTGGQGLFRPEDNLESHYARDALRQLYLLIVRGKVEGCSLKLFEQ
TTGLTLTDENGIKDELPPVTTFLNRLLALTIELQDILFAAFDQLLTARVQGAIAAGVYDVGLETLRAESFVVTDRRAIYT
HPGTGAETQLLTIDQRQSNRPVPVEEAVAQLDDQRAILLVNERSGRAALQVPAPSFMLDDGEIERRVRLIRPMEQHHASL
RMMGDSHWQQADRETFAAAWSAEVAGVPAFSDSTIHIVAGLLLPIWKRLPNESSRVYRLQTDEGERIIGRRVSAAWAAGA
LATGVSTLTARDAFNALADGRTMLDLAEGLHLRRVRVMGANRIELSGFTDSMRERLTAYGLFHEIISWKLRMFVPADASG
PAVLAKVIERYPIHRISEKEAT

Specific function: Unknown

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic

Metaboloic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

Organism=Homo sapiens, GI269846807, Length=416, Percent_Identity=34.6153846153846, Blast_Score=249, Evalue=1e-65,
Organism=Homo sapiens, GI269846812, Length=416, Percent_Identity=34.6153846153846, Blast_Score=249, Evalue=1e-65,
Organism=Homo sapiens, GI154355002, Length=433, Percent_Identity=37.4133949191686, Blast_Score=240, Evalue=7e-63,
Organism=Homo sapiens, GI154355004, Length=433, Percent_Identity=37.6443418013857, Blast_Score=239, Evalue=1e-62,
Organism=Caenorhabditis elegans, GI17553078, Length=523, Percent_Identity=32.6959847036329, Blast_Score=252, Evalue=1e-66,
Organism=Drosophila melanogaster, GI161077794, Length=472, Percent_Identity=34.9576271186441, Blast_Score=251, Evalue=3e-66,
Organism=Drosophila melanogaster, GI161077796, Length=436, Percent_Identity=36.4678899082569, Blast_Score=251, Evalue=4e-66,
Organism=Drosophila melanogaster, GI24641704, Length=436, Percent_Identity=36.4678899082569, Blast_Score=250, Evalue=4e-66,
Organism=Drosophila melanogaster, GI19921354, Length=422, Percent_Identity=35.781990521327, Blast_Score=233, Evalue=8e-61,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 159020; Mature: 158888

Theoretical pI: Translated: 7.12; Mature: 7.12

Prosite motif: PS00092 N6_MTASE

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.4 %Cys     (Translated Protein)
1.6 %Met     (Translated Protein)
2.0 %Cys+Met (Translated Protein)
0.4 %Cys     (Mature Protein)
1.5 %Met     (Mature Protein)
1.9 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MSGRGGLSPCGQRERRGLFPVSLSRGSPMNFLSPTATGDSATPVDPAAAVCSAALRLLPH
CCCCCCCCCCCHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCC
LECGRRVDAVVLRSAMEASFGASDTSGAWNWAMAYDACEAATVLFLRKYGRLLLRKAGSM
HHCCCHHHHHHHHHHHHHHCCCCCCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
ASAQPLFSRIADLLPTHTRRSEESQALQQFSTPIPLGLAAVSAAAITASDRVLEPSAGTG
CHHHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCHHHHHHHHHHEECCCCEECCCCCCC
LLAVLAETSGGSLVLNELAEVRAGLLSVLFPGLTVTRFDAAQIDDYLDPASLPTVVLMNP
EEEEEEECCCCHHHHHHHHHHHHHHHHHHHCCCEEEECCHHHHHHCCCCCCCCEEEEECC
PFSVLANVHGRVADAAYRHVASALARLAAGGRLVTITNASFGPETPAWRNAFGRLQEQGR
CHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCC
VVFTATISGAVYAKHGTTIDTRLTVIDKTLAGDTSFIPGTPGFAPDVATLLSWIEKHVPA
EEEEEEECCEEEECCCCEEEEEEEEEEHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCC
RPPIAEAAKVVPAITAPRTVRGYLARPVAGSVHRQLSEPVGVELSYETVDWAPDEGAWLT
CCCHHHHHHHHHCCCCCHHHHHHHCCCCHHHHHHHHCCCCCCEEEEEEECCCCCCCCCHH
DAIYEDYGLQTIRIPGSQAHPTRLVQSAAMASVAPPKPTYRPTLPNHILTSLSDAQLETV
HHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHCCCCCEEEE
ILAGEAHCGFLTGSWSVDHTLDLVTAAPEEAPTAVRFRRGFFIGDGTGVGKGRQSASIVL
EEECCCCCEEEECCCCCCHHHHHEECCCCCCCCHHEEECCEEEECCCCCCCCCCCHHHHH
DNWLQGRRKAVWISKSDKLLEDAQRDWAALGTERLLVTPLSRFPQGHPVTLPEGILFTTY
HHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEHHHCCCCCCCCCCCCEEEEEH
ATLRSDDRGEKVSRVRQIVEWLGSDFDGVLIFDEAHAMQNAGGGKGERGDVAPSQQGRAG
HHHHCCCCHHHHHHHHHHHHHHCCCCCCEEEEECHHHHHCCCCCCCCCCCCCCCCCCCCC
LRLQHALPGARVVYVSATGATTVANLAYAQRLGLWGSEDFPFSTRAEFVEAIEAGGVAAM
CEEHHCCCCCEEEEEECCCCHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHCCHHHH
EVLARDLRALGLYTARSLSFDGVEYELVEHELTLEQRRIYDAYAGAFAVIHNHLDAAMRA
HHHHHHHHHHHHHHHCCCCCCCCEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
ANITGDSGTLNRQAKSAARSAFESAKQRFFGHLLTSMKTPTLIRSIEQDLQSGHSSVIQI
HCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCHHHHHHH
VSTGEALMERRLAEVPTQEWNDVRVDITPREYVLDYLEHSFPVQLYEPFTDSEGNLSSRP
HHCCHHHHHHHHHHCCCCCCCCEEEEECHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCC
VFRDGQPVESREALARRTALIEKLASLPPVPGALDQIVQLFGTDMVAEVTGRSRRIVRKG
CCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCHHHHHCCCHHHHHHCC
ERLMVEGRAASANLAETQAFMDDVKRILVFSEAGGTGRSYHAELSARNTRLRVHYLLEPG
CEEEEECCCCCCCHHHHHHHHHHHHHHEEEECCCCCCCCEEEEEECCCCEEEEEEEECCC
WKADTAIQGLGRTHRTNQAQPPLFRPIATNVKAEKRFLSTIARRLDTLGAITRGQRQTGG
CCCHHHHHHCCCCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC
QGLFRPEDNLESHYARDALRQLYLLIVRGKVEGCSLKLFEQTTGLTLTDENGIKDELPPV
CCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCEEECCCCCCCCCCHH
TTFLNRLLALTIELQDILFAAFDQLLTARVQGAIAAGVYDVGLETLRAESFVVTDRRAIY
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCEEE
THPGTGAETQLLTIDQRQSNRPVPVEEAVAQLDDQRAILLVNERSGRAALQVPAPSFMLD
ECCCCCCCEEEEEEECCCCCCCCCHHHHHHHCCCCCEEEEEECCCCCEEEEECCCCEEEC
DGEIERRVRLIRPMEQHHASLRMMGDSHWQQADRETFAAAWSAEVAGVPAFSDSTIHIVA
CHHHHHHHHHHCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHH
GLLLPIWKRLPNESSRVYRLQTDEGERIIGRRVSAAWAAGALATGVSTLTARDAFNALAD
HHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC
GRTMLDLAEGLHLRRVRVMGANRIELSGFTDSMRERLTAYGLFHEIISWKLRMFVPADAS
CCHHHHHHCCCCEEEEEEECCCEEEECCCHHHHHHHHHHHHHHHHHHHCEEEEEECCCCC
GPAVLAKVIERYPIHRISEKEAT
CHHHHHHHHHHCCCCCCCCCCCC
>Mature Secondary Structure 
SGRGGLSPCGQRERRGLFPVSLSRGSPMNFLSPTATGDSATPVDPAAAVCSAALRLLPH
CCCCCCCCCCHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCC
LECGRRVDAVVLRSAMEASFGASDTSGAWNWAMAYDACEAATVLFLRKYGRLLLRKAGSM
HHCCCHHHHHHHHHHHHHHCCCCCCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
ASAQPLFSRIADLLPTHTRRSEESQALQQFSTPIPLGLAAVSAAAITASDRVLEPSAGTG
CHHHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCHHHHHHHHHHEECCCCEECCCCCCC
LLAVLAETSGGSLVLNELAEVRAGLLSVLFPGLTVTRFDAAQIDDYLDPASLPTVVLMNP
EEEEEEECCCCHHHHHHHHHHHHHHHHHHHCCCEEEECCHHHHHHCCCCCCCCEEEEECC
PFSVLANVHGRVADAAYRHVASALARLAAGGRLVTITNASFGPETPAWRNAFGRLQEQGR
CHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCC
VVFTATISGAVYAKHGTTIDTRLTVIDKTLAGDTSFIPGTPGFAPDVATLLSWIEKHVPA
EEEEEEECCEEEECCCCEEEEEEEEEEHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCC
RPPIAEAAKVVPAITAPRTVRGYLARPVAGSVHRQLSEPVGVELSYETVDWAPDEGAWLT
CCCHHHHHHHHHCCCCCHHHHHHHCCCCHHHHHHHHCCCCCCEEEEEEECCCCCCCCCHH
DAIYEDYGLQTIRIPGSQAHPTRLVQSAAMASVAPPKPTYRPTLPNHILTSLSDAQLETV
HHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHCCCCCEEEE
ILAGEAHCGFLTGSWSVDHTLDLVTAAPEEAPTAVRFRRGFFIGDGTGVGKGRQSASIVL
EEECCCCCEEEECCCCCCHHHHHEECCCCCCCCHHEEECCEEEECCCCCCCCCCCHHHHH
DNWLQGRRKAVWISKSDKLLEDAQRDWAALGTERLLVTPLSRFPQGHPVTLPEGILFTTY
HHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEHHHCCCCCCCCCCCCEEEEEH
ATLRSDDRGEKVSRVRQIVEWLGSDFDGVLIFDEAHAMQNAGGGKGERGDVAPSQQGRAG
HHHHCCCCHHHHHHHHHHHHHHCCCCCCEEEEECHHHHHCCCCCCCCCCCCCCCCCCCCC
LRLQHALPGARVVYVSATGATTVANLAYAQRLGLWGSEDFPFSTRAEFVEAIEAGGVAAM
CEEHHCCCCCEEEEEECCCCHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHCCHHHH
EVLARDLRALGLYTARSLSFDGVEYELVEHELTLEQRRIYDAYAGAFAVIHNHLDAAMRA
HHHHHHHHHHHHHHHCCCCCCCCEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
ANITGDSGTLNRQAKSAARSAFESAKQRFFGHLLTSMKTPTLIRSIEQDLQSGHSSVIQI
HCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCHHHHHHH
VSTGEALMERRLAEVPTQEWNDVRVDITPREYVLDYLEHSFPVQLYEPFTDSEGNLSSRP
HHCCHHHHHHHHHHCCCCCCCCEEEEECHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCC
VFRDGQPVESREALARRTALIEKLASLPPVPGALDQIVQLFGTDMVAEVTGRSRRIVRKG
CCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCHHHHHCCCHHHHHHCC
ERLMVEGRAASANLAETQAFMDDVKRILVFSEAGGTGRSYHAELSARNTRLRVHYLLEPG
CEEEEECCCCCCCHHHHHHHHHHHHHHEEEECCCCCCCCEEEEEECCCCEEEEEEEECCC
WKADTAIQGLGRTHRTNQAQPPLFRPIATNVKAEKRFLSTIARRLDTLGAITRGQRQTGG
CCCHHHHHHCCCCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC
QGLFRPEDNLESHYARDALRQLYLLIVRGKVEGCSLKLFEQTTGLTLTDENGIKDELPPV
CCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCEEECCCCCCCCCCHH
TTFLNRLLALTIELQDILFAAFDQLLTARVQGAIAAGVYDVGLETLRAESFVVTDRRAIY
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCEEE
THPGTGAETQLLTIDQRQSNRPVPVEEAVAQLDDQRAILLVNERSGRAALQVPAPSFMLD
ECCCCCCCEEEEEEECCCCCCCCCHHHHHHHCCCCCEEEEEECCCCCEEEEECCCCEEEC
DGEIERRVRLIRPMEQHHASLRMMGDSHWQQADRETFAAAWSAEVAGVPAFSDSTIHIVA
CHHHHHHHHHHCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHH
GLLLPIWKRLPNESSRVYRLQTDEGERIIGRRVSAAWAAGALATGVSTLTARDAFNALAD
HHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC
GRTMLDLAEGLHLRRVRVMGANRIELSGFTDSMRERLTAYGLFHEIISWKLRMFVPADAS
CCHHHHHHCCCCEEEEEEECCCEEEECCCHHHHHHHHHHHHHHHHHHHCEEEEEECCCCC
GPAVLAKVIERYPIHRISEKEAT
CHHHHHHHHHHCCCCCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA