Definition | Agrobacterium tumefaciens str. C58 chromosome circular, complete sequence. |
---|---|
Accession | NC_003062 |
Length | 2,841,580 |
The map label for this gene is smc
Identifier: 159184469
GI number: 159184469
Start: 796372
End: 799839
Strand: Direct
Name: smc
Synonym: Atu0801
Alternate gene names: NA
Gene position: 796372-799839 (Clockwise)
Preceding gene: 15888143
Following gene: 159184470
Centisome position: 28.03
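The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes that the centisome position is the gene start expressed as a percentage of the replicon length; the record does not state this, but it is consistent with the values shown. The function name `centisome` is introduced here for illustration only.

```python
def centisome(start: int, replicon_length: int) -> float:
    """Return a start coordinate as a percentage of the replicon length."""
    if replicon_length <= 0:
        raise ValueError("replicon_length must be positive")
    return 100.0 * start / replicon_length


# Values copied from this record (Start, End, Length).
start, end, replicon_length = 796372, 799839, 2841580

# Coordinates are inclusive, so the gene length is end - start + 1.
gene_length = end - start + 1

print(gene_length)                                   # 3468
print(f"{centisome(start, replicon_length):.2f}")    # 28.03
```

The gene length of 3468 bases agrees with the `>3468_bases` header of the gene sequence in this record.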
GC content: 64.5
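GC content is conventionally the percentage of G and C bases among all called bases. The following is a minimal sketch of that calculation, not the database's own code; `gc_content` is a hypothetical helper, and the demonstration uses only the first 30 bases of the gene sequence in this record, so its result is not expected to match the whole-gene figure of 64.5.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A, C, G and T bases in seq."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    total = gc + seq.count("A") + seq.count("T")
    if total == 0:
        raise ValueError("sequence contains no A, C, G or T bases")
    return 100.0 * gc / total


# First 30 bases of the smc gene sequence from this record.
prefix = "ATGAAGTTCAACAAGCTCCGCGTCGTCGGT"
print(f"{gc_content(prefix):.1f}")  # 53.3 for this short prefix only
```

To reproduce the reported value, pass the full 3468-base sequence with spaces and line breaks removed.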
Gene sequence:
>3468_bases ATGAAGTTCAACAAGCTCCGCGTCGTCGGTTTCAAGTCCTTCGTTGAACCCTCCGAATTCATCATCGAGCCCGGTCTGAC CGGCGTTGTCGGCCCGAATGGCTGCGGCAAGTCCAATCTGGTCGAAGCGCTGCGCTGGGTGATGGGCGAGAATTCCTACA AGAACATGCGCGCATCCGGCATGGATGACGTCATCTTTTCCGGCTCCGGCAACCGCCCGGCCAGAAACACCGCCGAAGTC GGCCTTTATCTCGACAATTCCGACCGCACCGCGCCCGCCGCTTTCAACGATGCCGATGAAATTCAGGTGACGCGCCGCAT CGAGCGCGAAAACGGCTCCGTTTACCGCATCAATGGCAAGGAAGCCCGCGCCAAGGATGTGCAACTGCTGTTTGCCGATG CCTCCACCGGCGCCCGCTCGCCCTCCATGGTGGGGCAAGGGCGTATCGGCGAGCTCATCAATGCCAAGCCACAAGCCCGT CGCCAGCTGCTGGAAGAGGCGGCCGGCATTTCCGGCCTGCATTCGCGCCGCCACGAGGCCGAGCTTCGCCTGCGCGCCGC CGAGACCAATCTGGAGCGTCTGGAAGACGTGACTGCCCAGCTGGAAAGCCAGATCGAAAGCCTGAAACGTCAGGCGCGCC AGGCCAACCGCTTCAAGATGCTGTCCGCCGATATCCGCGCTCGCGAGGCGACCCTTCTGCACATCCGCTGGGTGGAGGCG AAGGAAGCGGAAGGCGAGGCGGAAAGCGCGCTCAATCAGGCGACCAACATCGTCGCCGAAAAGGCTCAAGGCCAGATGGA AGCGGCCAAGCAGCAGGGCATCGCCAGCCTGAAATTGCCGGAACTGCGCGAGGACGAGGCCCGCGTGGCCGCCGCCCTGC AACGCCTGCAGATCGCCCGCACCCAGCTGGATGACGAGGCAAACCGCCTGCTGCGCCGTCGTGACGAACTGGCCCGCCGT CTCTCGCAGCTTGGCGAGGATATCGTCCGCGAGGAACGGCTGGTCGCCGATAATGCCCAGATACTCGCACGGCTGGACGA AGAAGAGGCCGAACTTCTCGACATCCTCTCCGATTCCGGTCGTCATGCGGATGAGATGCGCGAAGCCTTCGAGGCTGCGG CCGTCAAGCTGGCGGAAAGCGAAGCCGTTTTCACATCAATCACCGCCGAACGCGCCGAGGCGGCGGCCGGCCGTCAGCAG TTGGAGCGGGCGATCCGCGATCTTTCTGACCGCAAGCTGCGGCTGGAGCGGCAATCGCAGGAAGCCTCTGCCGAGATTGA CACCATCGACGAAAAACTCTCCGGTCTTCCCGACCCCGCGGAGCGGCGCGAAGCGGTGGAAGCGGCTGAGATTGCCGTCG AGGACGCGTTGATCGTGGCGGAGGAGGCCGAGGCCGCCGTTGCCGAGGCGCGTTCCGCCGAAGCGCTGGCGCGCGGGCCG CTGGAAACGGCGAAGAACCGGCTGAATGCGCTGGATACCGAAGCACGCACCATCACCAAAATGCTCGCCACCAGCGCTGC CGCCAATGGTAGTTTCACGCCGGTGGCGGAAGAAATGACGGTGGAGCGCGGTTATGAGGCCGCACTTGGCGCGGCGCTTG GCGACGATCTCGAAAGCCCGCTCGATGCGAGCGCCCCCGCCTATTGGGGCGGCAATGGAAACGGTGCGGATGATCCCGGC CTGCCGCAGGGCGCTAAACCGCTTCTGGACTATGCGCAGGCGCCGGATGCCCTCCGGCGCGCCCTTGCACAGATTGGTGT CGTTGCAGACGTATCGGAGGCCCGACGCCTTCTGCCGTCGCTGAAAGCCGGCCAGCGGCTGGTAACGCGCGAAGGCGCGC TGTTCCGCTGGGACGGCCATATCGCCAGCGCCGATGCGCCGGGTGCTGCGGCCCTTCGCCTGTCGCAGAAGAACCGCCTC 
GCCGAAATCGAAGCCGAACTGGACGAGGCCCGCTCCATTCTGGAAGAGGCCGAAGACCAGCTTGCCGCGAAAACCGAGGA CATCAGAAGCAGTGAATTGCGGCTCTCGGAGGTGCGTGACCGGAGCCGGCTCGCGACCCGTCAGCTTGCCGAGGCACGCG AGGCGCTGACATCCGCCGAACGGGCCTCGGGCGATCTGCTGCGCCGCCGGGATGTCGTTTCCGAAGCGCAGAACCAGATC GGCGCGCAGATCGACGAGATCGCCGTTCAGGAAGAAAATGCCCGCATCGAAATGGAAGATGCGCCGGATCTTTCCGTGCT TGATCTCCGGCTGCGTGAAAGCCAGCTGGAAGTCGCGACCGACCGCGGCCTGCTGGCGGAGGCCCGCGCTCGCCATGAAG GCGTGAGTCGCGAGGCGGAAAGCCGCCAGCGCAGAATTCAGGCCATAGGGCAGGAGCGTTCCACCTGGGCATCGCGCGCT GCAAGTGCGGCCGATCATATCGCCACATTGCGCGAACGCGAGGAAGAGGCGCGCGAGGAAATCGCCGAGCTTGATATAGC GCCGGAGGAATTCGACGAGAAACGCCGCAACCTCCTCAACGAATTGCAAAAGACCGAAGACGCCCGCCGCGCCGCCGCCG ACCGGCTGGCCGAGGCGGAAAACCTGCAGCGTGCCGCCGATCGGGTGGCGGCAACGGCGCTTTCCGAACTGGCCGAAGCC CGCGAAAAGCGCGGCCGTGCCGAAGAACGTCTGGTTTCCGCCCGCGAGAAACGGCTGGAAACCGAACACCGCATCCGCGA AACACTGAATACCGAGCCTCATATGGCATTTCGCCTGACCGGCCTTGGCCCGGATCAGCCGAAGCCCGATATCCGCGATG TCGAGCGCGATCTCGACCGGCTGAAGATCGAGCGCGAAAGGCTTGGTGCCGTCAATCTGCGCGCCGAGGAGGAACAGGCG GAGCTTTCCGGCAAGCTCGAGGCGCTGATCAAGGAGCGGGATGATATCATCGATGCCGTGCGCAAGCTGCGCGCCGGCAT CCAGAGCCTCAACCGCGAGGGTCGTGAGAGGCTGATTGCCGCCTTCGACGTGGTCAATTCACAGTTCCAGCGGCTGTTCA CCCATCTTTTCGGTGGCGGCACGGCAGAATTGCAGCTGATCGAATCCGACGACCCGCTGGAAGCCGGCCTCGAAATCCTC GCCCGCCCGCCCGGCAAGAAGCCGCAGACCATGACGCTGCTTTCCGGCGGCGAGCAGGCGCTGACGGCGATGGCGCTGAT CTTTGCGGTCTTCCTCACCAATCCCGCGCCCATCTGCGTGCTGGACGAGGTGGATGCGCCGCTCGACGACCACAATGTCG AGCGCTACTGCAACCTGATGGATGAGATGGTGGCCTCCACCGAGACGCGATTCGTTATCATCACCCATAATCCCATCACC ATGGCGCGCATGAACCGCCTGTTCGGTGTCACCATGGCCGAACAGGGCGTCTCGCAACTCGTCTCCGTGGATTTGCAGAC TGCCGAACAGCTGCGCGAAGCCGTCTGA
Upstream 100 bases:
>100_bases CGACAAGCTGCTCTGATCACCGAAGCCTTCAGAAAGCGGGCGGCGGATATTTCCGCACGCCCGTTTTTTATTGCCCATCT TCTTGCTGGGGTGTGACGGC
Downstream 100 bases:
>100_bases GGCGCCTTTTCCGGTTCTCTTCGCCAATTATGCTCATTTTGAATTTATCTTGCTATCTCACCCATTCGGGTGAGCGACAT CACGCGGCATTTTGTTATGG
Product: chromosome segregation protein
Products: NA
Alternate protein names: URF3 [H]
Number of amino acids: Translated: 1155; Mature: 1155
Protein sequence:
>1155_residues MKFNKLRVVGFKSFVEPSEFIIEPGLTGVVGPNGCGKSNLVEALRWVMGENSYKNMRASGMDDVIFSGSGNRPARNTAEV GLYLDNSDRTAPAAFNDADEIQVTRRIERENGSVYRINGKEARAKDVQLLFADASTGARSPSMVGQGRIGELINAKPQAR RQLLEEAAGISGLHSRRHEAELRLRAAETNLERLEDVTAQLESQIESLKRQARQANRFKMLSADIRAREATLLHIRWVEA KEAEGEAESALNQATNIVAEKAQGQMEAAKQQGIASLKLPELREDEARVAAALQRLQIARTQLDDEANRLLRRRDELARR LSQLGEDIVREERLVADNAQILARLDEEEAELLDILSDSGRHADEMREAFEAAAVKLAESEAVFTSITAERAEAAAGRQQ LERAIRDLSDRKLRLERQSQEASAEIDTIDEKLSGLPDPAERREAVEAAEIAVEDALIVAEEAEAAVAEARSAEALARGP LETAKNRLNALDTEARTITKMLATSAAANGSFTPVAEEMTVERGYEAALGAALGDDLESPLDASAPAYWGGNGNGADDPG LPQGAKPLLDYAQAPDALRRALAQIGVVADVSEARRLLPSLKAGQRLVTREGALFRWDGHIASADAPGAAALRLSQKNRL AEIEAELDEARSILEEAEDQLAAKTEDIRSSELRLSEVRDRSRLATRQLAEAREALTSAERASGDLLRRRDVVSEAQNQI GAQIDEIAVQEENARIEMEDAPDLSVLDLRLRESQLEVATDRGLLAEARARHEGVSREAESRQRRIQAIGQERSTWASRA ASAADHIATLREREEEAREEIAELDIAPEEFDEKRRNLLNELQKTEDARRAAADRLAEAENLQRAADRVAATALSELAEA REKRGRAEERLVSAREKRLETEHRIRETLNTEPHMAFRLTGLGPDQPKPDIRDVERDLDRLKIERERLGAVNLRAEEEQA ELSGKLEALIKERDDIIDAVRKLRAGIQSLNREGRERLIAAFDVVNSQFQRLFTHLFGGGTAELQLIESDDPLEAGLEIL ARPPGKKPQTMTLLSGGEQALTAMALIFAVFLTNPAPICVLDEVDAPLDDHNVERYCNLMDEMVASTETRFVIITHNPIT MARMNRLFGVTMAEQGVSQLVSVDLQTAEQLREAV
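The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the standard codon assignments (which the bacterial genetic code shares) and builds the table from the conventional TCAG ordering; `translate` is a name introduced here, not part of any database tool. Only the first 30 and last 18 bases of the gene are used.

```python
from itertools import product

# Standard codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.upper().replace(" ", "")
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAAGTTCAACAAGCTCCGCGTCGTCGGT"))  # MKFNKLRVVG
print(translate("CTGCGCGAAGCCGTCTGA"))              # LREAV
```

Both fragments match the start and end of the protein sequence above. The 3468 bases form 1156 codons, and the last one (TGA) is a stop codon, which gives the 1155 residues reported.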
Sequences:
>Translated_1155_residues MKFNKLRVVGFKSFVEPSEFIIEPGLTGVVGPNGCGKSNLVEALRWVMGENSYKNMRASGMDDVIFSGSGNRPARNTAEV GLYLDNSDRTAPAAFNDADEIQVTRRIERENGSVYRINGKEARAKDVQLLFADASTGARSPSMVGQGRIGELINAKPQAR RQLLEEAAGISGLHSRRHEAELRLRAAETNLERLEDVTAQLESQIESLKRQARQANRFKMLSADIRAREATLLHIRWVEA KEAEGEAESALNQATNIVAEKAQGQMEAAKQQGIASLKLPELREDEARVAAALQRLQIARTQLDDEANRLLRRRDELARR LSQLGEDIVREERLVADNAQILARLDEEEAELLDILSDSGRHADEMREAFEAAAVKLAESEAVFTSITAERAEAAAGRQQ LERAIRDLSDRKLRLERQSQEASAEIDTIDEKLSGLPDPAERREAVEAAEIAVEDALIVAEEAEAAVAEARSAEALARGP LETAKNRLNALDTEARTITKMLATSAAANGSFTPVAEEMTVERGYEAALGAALGDDLESPLDASAPAYWGGNGNGADDPG LPQGAKPLLDYAQAPDALRRALAQIGVVADVSEARRLLPSLKAGQRLVTREGALFRWDGHIASADAPGAAALRLSQKNRL AEIEAELDEARSILEEAEDQLAAKTEDIRSSELRLSEVRDRSRLATRQLAEAREALTSAERASGDLLRRRDVVSEAQNQI GAQIDEIAVQEENARIEMEDAPDLSVLDLRLRESQLEVATDRGLLAEARARHEGVSREAESRQRRIQAIGQERSTWASRA ASAADHIATLREREEEAREEIAELDIAPEEFDEKRRNLLNELQKTEDARRAAADRLAEAENLQRAADRVAATALSELAEA REKRGRAEERLVSAREKRLETEHRIRETLNTEPHMAFRLTGLGPDQPKPDIRDVERDLDRLKIERERLGAVNLRAEEEQA ELSGKLEALIKERDDIIDAVRKLRAGIQSLNREGRERLIAAFDVVNSQFQRLFTHLFGGGTAELQLIESDDPLEAGLEIL ARPPGKKPQTMTLLSGGEQALTAMALIFAVFLTNPAPICVLDEVDAPLDDHNVERYCNLMDEMVASTETRFVIITHNPIT MARMNRLFGVTMAEQGVSQLVSVDLQTAEQLREAV >Mature_1155_residues MKFNKLRVVGFKSFVEPSEFIIEPGLTGVVGPNGCGKSNLVEALRWVMGENSYKNMRASGMDDVIFSGSGNRPARNTAEV GLYLDNSDRTAPAAFNDADEIQVTRRIERENGSVYRINGKEARAKDVQLLFADASTGARSPSMVGQGRIGELINAKPQAR RQLLEEAAGISGLHSRRHEAELRLRAAETNLERLEDVTAQLESQIESLKRQARQANRFKMLSADIRAREATLLHIRWVEA KEAEGEAESALNQATNIVAEKAQGQMEAAKQQGIASLKLPELREDEARVAAALQRLQIARTQLDDEANRLLRRRDELARR LSQLGEDIVREERLVADNAQILARLDEEEAELLDILSDSGRHADEMREAFEAAAVKLAESEAVFTSITAERAEAAAGRQQ LERAIRDLSDRKLRLERQSQEASAEIDTIDEKLSGLPDPAERREAVEAAEIAVEDALIVAEEAEAAVAEARSAEALARGP LETAKNRLNALDTEARTITKMLATSAAANGSFTPVAEEMTVERGYEAALGAALGDDLESPLDASAPAYWGGNGNGADDPG LPQGAKPLLDYAQAPDALRRALAQIGVVADVSEARRLLPSLKAGQRLVTREGALFRWDGHIASADAPGAAALRLSQKNRL AEIEAELDEARSILEEAEDQLAAKTEDIRSSELRLSEVRDRSRLATRQLAEAREALTSAERASGDLLRRRDVVSEAQNQI 
GAQIDEIAVQEENARIEMEDAPDLSVLDLRLRESQLEVATDRGLLAEARARHEGVSREAESRQRRIQAIGQERSTWASRA ASAADHIATLREREEEAREEIAELDIAPEEFDEKRRNLLNELQKTEDARRAAADRLAEAENLQRAADRVAATALSELAEA REKRGRAEERLVSAREKRLETEHRIRETLNTEPHMAFRLTGLGPDQPKPDIRDVERDLDRLKIERERLGAVNLRAEEEQA ELSGKLEALIKERDDIIDAVRKLRAGIQSLNREGRERLIAAFDVVNSQFQRLFTHLFGGGTAELQLIESDDPLEAGLEIL ARPPGKKPQTMTLLSGGEQALTAMALIFAVFLTNPAPICVLDEVDAPLDDHNVERYCNLMDEMVASTETRFVIITHNPIT MARMNRLFGVTMAEQGVSQLVSVDLQTAEQLREAV
Specific function: Unknown
COG id: COG1196
COG function: function code D; Chromosome segregation ATPases
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
Organism | GI | Length | Percent identity | BLAST score | E-value |
---|---|---|---|---|---|
Homo sapiens | GI110347425 | 200 | 29 | 100 | 1e-20 |
Homo sapiens | GI110347420 | 200 | 29 | 100 | 1e-20 |
Homo sapiens | GI110347418 | 200 | 29 | 100 | 1e-20 |
Homo sapiens | GI50658065 | 202 | 30.1980198019802 | 93 | 1e-18 |
Homo sapiens | GI50658063 | 202 | 30.1980198019802 | 93 | 1e-18 |
Homo sapiens | GI30581135 | 205 | 27.3170731707317 | 81 | 8e-15 |
Homo sapiens | GI4885399 | 368 | 24.4565217391304 | 75 | 3e-13 |
Homo sapiens | GI71565160 | 208 | 27.4038461538462 | 70 | 9e-12 |
Caenorhabditis elegans | GI17553272 | 166 | 32.5301204819277 | 98 | 3e-20 |
Caenorhabditis elegans | GI17535279 | 189 | 29.6296296296296 | 92 | 1e-18 |
Caenorhabditis elegans | GI212656546 | 381 | 24.1469816272966 | 82 | 1e-15 |
Caenorhabditis elegans | GI193210872 | 381 | 24.1469816272966 | 82 | 1e-15 |
Caenorhabditis elegans | GI17552844 | 138 | 31.8840579710145 | 80 | 5e-15 |
Caenorhabditis elegans | GI193202684 | 219 | 26.027397260274 | 77 | 5e-14 |
Saccharomyces cerevisiae | GI6321144 | 245 | 30.2040816326531 | 116 | 2e-26 |
Saccharomyces cerevisiae | GI6323115 | 161 | 31.6770186335404 | 96 | 4e-20 |
Saccharomyces cerevisiae | GI6321104 | 204 | 29.4117647058824 | 94 | 2e-19 |
Saccharomyces cerevisiae | GI6322387 | 211 | 25.5924170616114 | 71 | 1e-12 |
Drosophila melanogaster | GI19922276 | 194 | 29.8969072164948 | 103 | 6e-22 |
Drosophila melanogaster | GI24584683 | 157 | 28.0254777070064 | 92 | 3e-18 |
Drosophila melanogaster | GI24642555 | 319 | 25.3918495297806 | 78 | 4e-14 |
Drosophila melanogaster | GI24649535 | 223 | 26.457399103139 | 75 | 3e-13 |
Drosophila melanogaster | GI24642557 | 248 | 23.7903225806452 | 74 | 4e-13 |
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR003395 [H]
Pfam domain/function: PF02463 SMC_N [H]
EC number: NA
Molecular weight: Translated: 127406; Mature: 127406
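A molecular weight like the one above is typically estimated by summing average residue masses and adding one water molecule for the free termini. The sketch below follows that convention; the mass table uses commonly quoted average values, and the record does not say which table or method the database used, so a small difference from 127406 would not be surprising. `molecular_weight` and `RESIDUE_MASS` are names introduced here.

```python
# Average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # added once for the N-terminal H and C-terminal OH


def molecular_weight(protein: str) -> float:
    """Estimate the average molecular weight of an unmodified peptide chain."""
    protein = protein.upper().replace(" ", "")
    if not protein:
        raise ValueError("empty sequence")
    return sum(RESIDUE_MASS[aa] for aa in protein) + WATER


# Small sanity check: glycylglycine is about 132.12 Da.
print(f"{molecular_weight('GG'):.2f}")
```

Applying the function to the full 1155-residue sequence (spaces removed) gives an estimate to compare against the reported value.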
Theoretical pI: Translated: 4.64; Mature: 4.64
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
- Translated protein: 0.3 %Cys, 1.6 %Met, 1.9 %Cys+Met
- Mature protein: 0.3 %Cys, 1.6 %Met, 1.9 %Cys+Met
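The Cys/Met figures are simple residue frequencies. As a rough sketch of how such percentages can be computed (the helper `cys_met_percent` is hypothetical), the example below counts C and M in the first 60 residues of the protein in this record. A short N-terminal window is not representative of the whole protein, so these numbers differ from the reported 0.3 and 1.6.

```python
def cys_met_percent(protein: str) -> dict:
    """Return the percentage of Cys, Met and Cys+Met residues in a protein."""
    protein = protein.upper().replace(" ", "")
    if not protein:
        raise ValueError("empty sequence")
    n = len(protein)
    cys = 100.0 * protein.count("C") / n
    met = 100.0 * protein.count("M") / n
    return {"Cys": cys, "Met": met, "Cys+Met": cys + met}


# First 60 residues of the smc protein from this record.
window = "MKFNKLRVVGFKSFVEPSEFIIEPGLTGVVGPNGCGKSNLVEALRWVMGENSYKNMRASG"
for name, value in cys_met_percent(window).items():
    print(f"{value:.1f} %{name}")
```

Running the same function on the full translated sequence is the way to compare against the table values.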
Secondary structure:
>Translated Secondary Structure MKFNKLRVVGFKSFVEPSEFIIEPGLTGVVGPNGCGKSNLVEALRWVMGENSYKNMRASG CCCCCEEEECHHHHCCCHHHEECCCCCCCCCCCCCCHHHHHHHHHHHHCCCCHHHCCCCC MDDVIFSGSGNRPARNTAEVGLYLDNSDRTAPAAFNDADEIQVTRRIERENGSVYRINGK CCCEEEECCCCCCCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEECCC EARAKDVQLLFADASTGARSPSMVGQGRIGELINAKPQARRQLLEEAAGISGLHSRRHEA CCCCCCEEEEEEECCCCCCCCCCCCCCCHHHHHCCCHHHHHHHHHHHCCCHHHHHHHHHH ELRLRAAETNLERLEDVTAQLESQIESLKRQARQANRFKMLSADIRAREATLLHIRWVEA HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEEEEEEECC KEAEGEAESALNQATNIVAEKAQGQMEAAKQQGIASLKLPELREDEARVAAALQRLQIAR CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHH TQLDDEANRLLRRRDELARRLSQLGEDIVREERLVADNAQILARLDEEEAELLDILSDSG HHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHCCHHHHHHHHHHHCCC RHADEMREAFEAAAVKLAESEAVFTSITAERAEAAAGRQQLERAIRDLSDRKLRLERQSQ CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH EASAEIDTIDEKLSGLPDPAERREAVEAAEIAVEDALIVAEEAEAAVAEARSAEALARGP HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC LETAKNRLNALDTEARTITKMLATSAAANGSFTPVAEEMTVERGYEAALGAALGDDLESP HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHCCHHHHHHHHHHHHHCCC LDASAPAYWGGNGNGADDPGLPQGAKPLLDYAQAPDALRRALAQIGVVADVSEARRLLPS CCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCCHHHHHHHHHHCHHHHHHHHHHHHHH LKAGQRLVTREGALFRWDGHIASADAPGAAALRLSQKNRLAEIEAELDEARSILEEAEDQ HHHCCHHHHCCCCEEEECCCEECCCCCCHHHEEEHHHHHHHHHHHHHHHHHHHHHHHHHH LAAKTEDIRSSELRLSEVRDRSRLATRQLAEAREALTSAERASGDLLRRRDVVSEAQNQI HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH GAQIDEIAVQEENARIEMEDAPDLSVLDLRLRESQLEVATDRGLLAEARARHEGVSREAE CCHHHHHHHCCCCCEEEECCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCHHHH SRQRRIQAIGQERSTWASRAASAADHIATLREREEEAREEIAELDIAPEEFDEKRRNLLN HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH ELQKTEDARRAAADRLAEAENLQRAADRVAATALSELAEAREKRGRAEERLVSAREKRLE HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHH TEHRIRETLNTEPHMAFRLTGLGPDQPKPDIRDVERDLDRLKIERERLGAVNLRAEEEQA HHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHH 
ELSGKLEALIKERDDIIDAVRKLRAGIQSLNREGRERLIAAFDVVNSQFQRLFTHLFGGG HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC TAELQLIESDDPLEAGLEILARPPGKKPQTMTLLSGGEQALTAMALIFAVFLTNPAPICV CEEEEEECCCCCHHHHHHHHCCCCCCCCCEEEEECCCHHHHHHHHHHHHHHHCCCCCEEE LDEVDAPLDDHNVERYCNLMDEMVASTETRFVIITHNPITMARMNRLFGVTMAEQGVSQL ECCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHHH VSVDLQTAEQLREAV HHHHHHHHHHHHHCC >Mature Secondary Structure MKFNKLRVVGFKSFVEPSEFIIEPGLTGVVGPNGCGKSNLVEALRWVMGENSYKNMRASG CCCCCEEEECHHHHCCCHHHEECCCCCCCCCCCCCCHHHHHHHHHHHHCCCCHHHCCCCC MDDVIFSGSGNRPARNTAEVGLYLDNSDRTAPAAFNDADEIQVTRRIERENGSVYRINGK CCCEEEECCCCCCCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEECCC EARAKDVQLLFADASTGARSPSMVGQGRIGELINAKPQARRQLLEEAAGISGLHSRRHEA CCCCCCEEEEEEECCCCCCCCCCCCCCCHHHHHCCCHHHHHHHHHHHCCCHHHHHHHHHH ELRLRAAETNLERLEDVTAQLESQIESLKRQARQANRFKMLSADIRAREATLLHIRWVEA HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEEEEEEECC KEAEGEAESALNQATNIVAEKAQGQMEAAKQQGIASLKLPELREDEARVAAALQRLQIAR CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHH TQLDDEANRLLRRRDELARRLSQLGEDIVREERLVADNAQILARLDEEEAELLDILSDSG HHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHCCHHHHHHHHHHHCCC RHADEMREAFEAAAVKLAESEAVFTSITAERAEAAAGRQQLERAIRDLSDRKLRLERQSQ CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH EASAEIDTIDEKLSGLPDPAERREAVEAAEIAVEDALIVAEEAEAAVAEARSAEALARGP HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC LETAKNRLNALDTEARTITKMLATSAAANGSFTPVAEEMTVERGYEAALGAALGDDLESP HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHCCHHHHHHHHHHHHHCCC LDASAPAYWGGNGNGADDPGLPQGAKPLLDYAQAPDALRRALAQIGVVADVSEARRLLPS CCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCCHHHHHHHHHHCHHHHHHHHHHHHHH LKAGQRLVTREGALFRWDGHIASADAPGAAALRLSQKNRLAEIEAELDEARSILEEAEDQ HHHCCHHHHCCCCEEEECCCEECCCCCCHHHEEEHHHHHHHHHHHHHHHHHHHHHHHHHH LAAKTEDIRSSELRLSEVRDRSRLATRQLAEAREALTSAERASGDLLRRRDVVSEAQNQI HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH GAQIDEIAVQEENARIEMEDAPDLSVLDLRLRESQLEVATDRGLLAEARARHEGVSREAE 
CCHHHHHHHCCCCCEEEECCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCHHHH SRQRRIQAIGQERSTWASRAASAADHIATLREREEEAREEIAELDIAPEEFDEKRRNLLN HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH ELQKTEDARRAAADRLAEAENLQRAADRVAATALSELAEAREKRGRAEERLVSAREKRLE HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHH TEHRIRETLNTEPHMAFRLTGLGPDQPKPDIRDVERDLDRLKIERERLGAVNLRAEEEQA HHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHH ELSGKLEALIKERDDIIDAVRKLRAGIQSLNREGRERLIAAFDVVNSQFQRLFTHLFGGG HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC TAELQLIESDDPLEAGLEILARPPGKKPQTMTLLSGGEQALTAMALIFAVFLTNPAPICV CEEEEEECCCCCHHHHHHHHCCCCCCCCCEEEEECCCHHHHHHHHHHHHHHHCCCCCEEE LDEVDAPLDDHNVERYCNLMDEMVASTETRFVIITHNPITMARMNRLFGVTMAEQGVSQL ECCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHHH VSVDLQTAEQLREAV HHHHHHHHHHHHHCC
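The secondary structure block above interleaves chunks of sequence with chunks of predicted states, separated by spaces. Assuming that layout (sequence chunk, then state chunk, repeating) and assuming the `>Translated Secondary Structure` header has already been removed, the two strings can be separated as sketched below. H, E and C are taken to mean helix, strand and coil, which is the usual convention but is not stated in the record; both function names are introduced here.

```python
from collections import Counter


def split_interleaved(text: str) -> tuple:
    """Split alternating sequence/state chunks into two joined strings."""
    tokens = text.split()
    sequence = "".join(tokens[0::2])  # even tokens: amino acid chunks
    states = "".join(tokens[1::2])    # odd tokens: predicted state chunks
    return sequence, states


def state_percentages(states: str) -> dict:
    """Percentage of each predicted state (for example H, E, C)."""
    counts = Counter(states)
    total = sum(counts.values())
    return {state: 100.0 * n / total for state, n in counts.items()}


# Final sequence/state pair of the prediction in this record.
seq, ss = split_interleaved("VSVDLQTAEQLREAV HHHHHHHHHHHHHCC")
print(seq)
print(ss)
print(state_percentages(ss))
```

Comparing `len(seq)` with `len(ss)` after splitting is a cheap way to catch a header or stray token that was left in the input.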
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 2902844 [H]