Definition | Agrobacterium tumefaciens str. C58 chromosome circular, complete sequence. |
---|---|
Accession | NC_003062 |
Length | 2,841,580 |
The map label for this gene is rpoB [H]
Identifier: 159184990
GI number: 159184990
Start: 1927198
End: 1931334
Strand: Reverse
Name: rpoB [H]
Synonym: Atu1956
Alternate gene names: 159184990
Gene position: 1931334-1927198 (Counterclockwise)
Preceding gene: 15889251
Following gene: 159184989
Centisome position: 67.97
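The coordinate fields above are mutually consistent, and a short check makes the relationship explicit. This is a minimal sketch, assuming 1-based inclusive coordinates and assuming the centisome value is the gene's first base (1931334 on the reverse strand) expressed as a percentage of the 2,841,580 bp chromosome. The function names are illustrative and not part of the record.

```python
def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(position: int, chromosome_length: int) -> float:
    """Position as a percentage of the chromosome length, rounded to 2 decimals."""
    return round(100.0 * position / chromosome_length, 2)


# Values taken from the record above.
length = gene_length(1927198, 1931334)      # matches the 4137-base gene sequence
position = centisome(1931334, 2841580)      # matches the listed 67.97
```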
GC content: 59.27
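A GC percentage such as the one listed can be recomputed from the gene sequence. The sketch below assumes GC content is the share of G and C bases in the coding strand, and it strips whitespace because the sequence in this record is broken into space-separated blocks. The function name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (2 decimals)."""
    # Remove the spaces and line breaks used to wrap the sequence.
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return round(100.0 * gc / len(seq), 2)


# Small illustrative inputs; pass the full gene sequence to check the record.
example_even = gc_content("ATGC")
example_wrapped = gc_content("GG CC\nAT")
```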
Gene sequence:
>4137_bases ATGGCTCAGACCCTTTCGTTTAACGGTCGCAGGCGCGTACGCAAGTTTTTCGGCAAAATTCCAGAAGTAGCGGAAATGCC GAACCTCATCGAGGTTCAGAAGGCTTCGTATGACCAGTTTCTCATGGTTGACGAGCCCAAGGGTGGCCGTCCCGACGAGG GATTGAATGCCGTATTCAAATCCGTATTCCCGATCACCGATTTTTCCGGCGCCTCCATGCTCGAATTCGTGTCTTACGAA TTCGAAGCGCCGAAGTTCGACGTCGAGGAATGCCGTCAGCGCGATCTGACCTATGCAGCGCCGCTGAAGGTGACGCTGCG CCTCATCGTGTTCGATATCGACGAAGATACCGGCGCGAAGTCCATCAAGGACATCAAGGAACAGTCCGTCTACATGGGCG ACATGCCGCTCATGACCAATAACGGCACGTTCATCGTCAACGGCACCGAGCGCGTCATCGTTTCGCAGATGCACCGTTCC CCGGGCGTGTTCTTCGACCACGACAAGGGCAAAAGCCATTCTTCCGGCAAGCTGCTTTTTGCTGCCCGCGTGATCCCGTA TCGCGGTTCTTGGCTCGACATCGAGTTCGACGCCAAGGACATCGTCTATGCGCGTATCGACCGTCGCCGCAAGCTGCCCG TGACCTCGCTGCTGATGGCGCTCGGCATGGATGGCGAAGAAATCCTGTCGACCTTCTACACCAAGGCGACCTATGAGCGC TCCGGCGATGGCTGGCGCATTCCGTTCCAGCCTGAAGCGCTGAAGAATGCCAAGGTCATCACCGACATGATCGACGCCGA CACCGGCGAAGTCGTTGTCGAAGGTGGCAAGAAGCTGACCCCGCGCCTCATCCGCCAGCTCGTCGACAAGGGCCTGAAGG CGCTGAAGGCGACCGACGAAGATCTCTACGGCAACTACCTGGCCGAAGACATCGTCAATTACTCGACGGGTGAGATCTAT CTCGAAGCCGGCGACGAAATCGACGAGAAGACGCTCGGCCTCATTCTGCAGTCCGGCTTTGACGAGATTCCGGTACTCAA CATCGACCACGTCAATGTTGGCGCCTATATCCGCAACACGCTGTCTGCGGACAAGAACCAGAACCGCCAGGAAGCGCTGT TCGACATCTACCGCGTCATGCGTCCCGGCGAGCCGCCGACCATGGATTCGGCGGAGGCGATGTTCAACTCGCTGTTCTTC GATGCCGAACGTTACGATCTCTCGGCTGTTGGCCGCGTGAAGATGAACATGCGTCTCGACCTCGACGCGGAAGACACCGT GCGCACGCTGCGCAAGGAAGACATCCTCGCGGTCGTGAAGATGCTGGTCGAACTGCGTGACGGCAAGGGCGAAATCGACG ACATCGACAACCTCGGCAACCGCCGTGTCCGTTCTGTCGGCGAGCTGATGGAAAACCAGTATCGTCTGGGGCTTCTGCGC ATGGAACGTGCGATCAAGGAACGTATGTCCTCGATCGAAATCGACACCGTGATGCCGCAGGACCTGATCAACGCGAAGCC GGCAGCCGCCGCCGTTCGCGAATTCTTCGGTTCCTCGCAGCTGTCGCAGTTCATGGACCAGGTGAACCCGCTTTCGGAAA TCACCCACAAGCGCCGTCTTTCGGCTCTTGGACCGGGTGGTCTGACCCGTGAGCGCGCCGGCTTCGAAGTCCGCGACGTT CACCCGACCCATTACGGCCGTATTTGCCCGATCGAAACGCCTGAAGGCCCGAACATCGGTCTGATCAACTCGCTTGCAAC CTTTGCCCGTGTGAACAAGTACGGCTTTATCGAAAGCCCGTACCGCAAGATCATCGACGGCAAGGTGACAACCGACGTGA TCTACCTCTCCGCCATGGAAGAGGCCAAGTACTACGTCGCACAGGCCAATGCCGAACTCGATGGCGAAGGCGCCTTCACG 
GAAGAGTTCGTTGTTTGCCGTCATTCGGGCGAAGTCATGCTCGCACCGCGCGACAACATCAACCTGATGGACGTTTCGCC GAAGCAGCTCGTTTCGGTCGCAGCGGCTCTCATTCCGTTCCTGGAAAACGACGACGCCAACCGCGCTCTCATGGGCTCGA ACATGCAGCGTCAGGCCGTTCCGCTTCTGCGCGCCGAGGCACCGTTCGTCGGTACCGGCATGGAGCCGATCGTTGCCCGT GACTCCGGCGCTGCCATTGCAGCCCGTCGTGGCGGTGTTGTCGATCAGGTGGATGCGACCCGTATCGTTATCCGGGCTAC GGAAGATCTCGATGCCGGCAAATCCGGTGTTGATATCTACCGTCTGCAGAAGTTCCAGCGTTCGAACCAGAACACCTGCG TCAACCAGCGTCCGCTGGTTTCCGTCGGTGACGCCATCTCCAAGGGTGACATCATCGCGGACGGTCCGTCGACCGACCTC GGCGATCTGGCACTCGGCCGTAACGCGCTCGTCGCGTTCATGCCCTGGAATGGCTACAACTACGAAGACTCGATTCTGAT GTCGGAGCGTATCGTTTCCGACGACGTGTTCACCTCCATCCACATCGAAGAATTCGAAGTGATGGCGCGTGACACGAAGC TTGGTCCGGAAGAAATCACGCGCGACATTCCGAACGTTTCGGAAGAAGCGCTGAAGAACCTCGACGAAGCCGGTATCGTC TACATCGGTGCGGAAGTTCAGCCGGGCGACATCCTCGTCGGCAAGATCACGCCGAAGGGCGAAAGCCCGATGACGCCGGA AGAAAAGCTTCTGCGCGCCATCTTCGGTGAAAAGGCTTCCGACGTTCGCGACACGTCCATGCGCATGCCTCCGGGCACGT TCGGTACGATCGTGGAAGTCCGCGTCTTCAACCGTCACGGTGTGGAGAAGGACGAGCGCGCGATGGCTATCGAGCGCGAA GAGATCGAGCGTCTGGCGAAGGACCGCGACGACGAGCAGGCAATTCTCGACCGTAACGTCTACGGCCGCCTGATCGACAT GCTGCGTGGCCACGTTTCCATCGCTGGTCCGAAGGGCTTCAAGAAGGGCGTCGAGCTTTCCAACGCCGTCGTCTCCGAAT ATCCCCGCTCGCAGTGGTGGATGTTCGCGGTCGAGGACGAGAAGGCCCAGTCCGAACTGGAAGCACTTCGCGGCCAGTAC GACGAATCCAAGTCGCGCCTTGAACAGCGCTTCATGGACAAGGTCGAAAAGGTCCAGCGCGGCGATGAAATGCCTCCGGG TGTCATGAAGATGGTCAAGGTCTTCGTCGCTGTGAAGCGCAAGATCCAGCCGGGCGACAAGATGGCCGGCCGTCACGGTA ACAAGGGCGTCGTCTCGCGTATCGTTCCGGTCGAGGACATGCCGTTCCTCGAAGACGGCACGCATGTCGACATCTGCTTG AACCCGCTTGGCGTGCCTTCGCGCATGAACGTCGGCCAGATCCTCGAAACCCACCTCGCATGGGCATGCGCAGGCATGGG CAAGAAGATCGGCGAGATGCTCGAAGAGTATCGCAAGACGATGGACATCAGCGAGCTTCGTAGCGAGCTGACGGAAATCT ACGCGTCCGAGGCTAATGATGAGGTTCAGCGTTTCGATGACGACTCGCTGGTGAAGCTTGCCGAAGAAGCCAAGCGCGGT GTTTCCATCGCGACCCCGGTTTTCGACGGTGCGCATGAGCCTGACGTCGCCGCGATGCTGAAGAAGGCAGGTCTGCATGA ATCCGGTCAGTCCGTCCTTTATGACGGTCGTACCGGTGAGCCGTTCGACCGCAAGGTCACCGTCGGCTACATGTACATGA TCAAGCTGAACCACCTTGTCGACGACAAGATCCACGCTCGCTCGATCGGTCCTTACTCGCTCGTTACCCAGCAGCCGCTG 
GGCGGCAAGGCGCAGTTCGGCGGACAGCGCTTCGGGGAAATGGAAGTCTGGGCTCTGGAAGCATACGGCGCGGCTTACAC GCTGCAGGAAATGCTCACCGTCAAGTCGGACGACGTTGCCGGCCGCACCAAGGTCTACGAAGCGATCGTCCGTGGCGACG ACACCTTCGAGGCCGGTATCCCTGAGAGCTTCAACGTTCTCGTCAAGGAAATGCGGTCGCTCGGTCTGTCGGTCGAACTG GAGAACTCGAAGATCGAGAACCAGTCCGAGGACCAGCTGCCCGACGCGGCGGAATAA
Upstream 100 bases:
>100_bases GATGTGGAACGAGATTGATGACACCCATTGCTTGACGGGATCGACTGGCCATCGGTTCCCGTCCGTTGCAGGCCCGGATG CAATTTTTAAAGGAGCGACG
Downstream 100 bases:
>100_bases ACACGATAGAGGCGGGCGCCTTCCAGGGCGCCTGCCGGTTTCCCCGTCAGCCCGGCTGGCGTGGGACACTTTCGCCGCAT TGTGCGGTTAAATCCGTATT
Product: DNA-directed RNA polymerase subunit beta
Products: NA
Alternate protein names: RNAP subunit beta; RNA polymerase subunit beta; Transcriptase subunit beta [H]
Number of amino acids: Translated: 1378; Mature: 1377
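The two residue counts follow from the gene length: 4137 bases are 1379 codons, the last of which is the TAA stop codon, leaving 1378 translated residues. The mature sequence listed in this record starts at the second residue, so its count is one lower. A minimal sketch of that arithmetic, assuming a single terminal stop codon and removal of only the initiator methionine (the function name is illustrative):

```python
from typing import Tuple


def residue_counts(cds_length: int) -> Tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes the CDS ends with one stop codon and that the mature protein
    lacks only the initiator methionine.
    """
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    translated = cds_length // 3 - 1   # drop the stop codon
    mature = translated - 1            # drop the initiator Met
    return translated, mature


counts = residue_counts(4137)
```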
Protein sequence:
>1378_residues MAQTLSFNGRRRVRKFFGKIPEVAEMPNLIEVQKASYDQFLMVDEPKGGRPDEGLNAVFKSVFPITDFSGASMLEFVSYE FEAPKFDVEECRQRDLTYAAPLKVTLRLIVFDIDEDTGAKSIKDIKEQSVYMGDMPLMTNNGTFIVNGTERVIVSQMHRS PGVFFDHDKGKSHSSGKLLFAARVIPYRGSWLDIEFDAKDIVYARIDRRRKLPVTSLLMALGMDGEEILSTFYTKATYER SGDGWRIPFQPEALKNAKVITDMIDADTGEVVVEGGKKLTPRLIRQLVDKGLKALKATDEDLYGNYLAEDIVNYSTGEIY LEAGDEIDEKTLGLILQSGFDEIPVLNIDHVNVGAYIRNTLSADKNQNRQEALFDIYRVMRPGEPPTMDSAEAMFNSLFF DAERYDLSAVGRVKMNMRLDLDAEDTVRTLRKEDILAVVKMLVELRDGKGEIDDIDNLGNRRVRSVGELMENQYRLGLLR MERAIKERMSSIEIDTVMPQDLINAKPAAAAVREFFGSSQLSQFMDQVNPLSEITHKRRLSALGPGGLTRERAGFEVRDV HPTHYGRICPIETPEGPNIGLINSLATFARVNKYGFIESPYRKIIDGKVTTDVIYLSAMEEAKYYVAQANAELDGEGAFT EEFVVCRHSGEVMLAPRDNINLMDVSPKQLVSVAAALIPFLENDDANRALMGSNMQRQAVPLLRAEAPFVGTGMEPIVAR DSGAAIAARRGGVVDQVDATRIVIRATEDLDAGKSGVDIYRLQKFQRSNQNTCVNQRPLVSVGDAISKGDIIADGPSTDL GDLALGRNALVAFMPWNGYNYEDSILMSERIVSDDVFTSIHIEEFEVMARDTKLGPEEITRDIPNVSEEALKNLDEAGIV YIGAEVQPGDILVGKITPKGESPMTPEEKLLRAIFGEKASDVRDTSMRMPPGTFGTIVEVRVFNRHGVEKDERAMAIERE EIERLAKDRDDEQAILDRNVYGRLIDMLRGHVSIAGPKGFKKGVELSNAVVSEYPRSQWWMFAVEDEKAQSELEALRGQY DESKSRLEQRFMDKVEKVQRGDEMPPGVMKMVKVFVAVKRKIQPGDKMAGRHGNKGVVSRIVPVEDMPFLEDGTHVDICL NPLGVPSRMNVGQILETHLAWACAGMGKKIGEMLEEYRKTMDISELRSELTEIYASEANDEVQRFDDDSLVKLAEEAKRG VSIATPVFDGAHEPDVAAMLKKAGLHESGQSVLYDGRTGEPFDRKVTVGYMYMIKLNHLVDDKIHARSIGPYSLVTQQPL GGKAQFGGQRFGEMEVWALEAYGAAYTLQEMLTVKSDDVAGRTKVYEAIVRGDDTFEAGIPESFNVLVKEMRSLGLSVEL ENSKIENQSEDQLPDAAE
Sequences:
>Translated_1378_residues MAQTLSFNGRRRVRKFFGKIPEVAEMPNLIEVQKASYDQFLMVDEPKGGRPDEGLNAVFKSVFPITDFSGASMLEFVSYE FEAPKFDVEECRQRDLTYAAPLKVTLRLIVFDIDEDTGAKSIKDIKEQSVYMGDMPLMTNNGTFIVNGTERVIVSQMHRS PGVFFDHDKGKSHSSGKLLFAARVIPYRGSWLDIEFDAKDIVYARIDRRRKLPVTSLLMALGMDGEEILSTFYTKATYER SGDGWRIPFQPEALKNAKVITDMIDADTGEVVVEGGKKLTPRLIRQLVDKGLKALKATDEDLYGNYLAEDIVNYSTGEIY LEAGDEIDEKTLGLILQSGFDEIPVLNIDHVNVGAYIRNTLSADKNQNRQEALFDIYRVMRPGEPPTMDSAEAMFNSLFF DAERYDLSAVGRVKMNMRLDLDAEDTVRTLRKEDILAVVKMLVELRDGKGEIDDIDNLGNRRVRSVGELMENQYRLGLLR MERAIKERMSSIEIDTVMPQDLINAKPAAAAVREFFGSSQLSQFMDQVNPLSEITHKRRLSALGPGGLTRERAGFEVRDV HPTHYGRICPIETPEGPNIGLINSLATFARVNKYGFIESPYRKIIDGKVTTDVIYLSAMEEAKYYVAQANAELDGEGAFT EEFVVCRHSGEVMLAPRDNINLMDVSPKQLVSVAAALIPFLENDDANRALMGSNMQRQAVPLLRAEAPFVGTGMEPIVAR DSGAAIAARRGGVVDQVDATRIVIRATEDLDAGKSGVDIYRLQKFQRSNQNTCVNQRPLVSVGDAISKGDIIADGPSTDL GDLALGRNALVAFMPWNGYNYEDSILMSERIVSDDVFTSIHIEEFEVMARDTKLGPEEITRDIPNVSEEALKNLDEAGIV YIGAEVQPGDILVGKITPKGESPMTPEEKLLRAIFGEKASDVRDTSMRMPPGTFGTIVEVRVFNRHGVEKDERAMAIERE EIERLAKDRDDEQAILDRNVYGRLIDMLRGHVSIAGPKGFKKGVELSNAVVSEYPRSQWWMFAVEDEKAQSELEALRGQY DESKSRLEQRFMDKVEKVQRGDEMPPGVMKMVKVFVAVKRKIQPGDKMAGRHGNKGVVSRIVPVEDMPFLEDGTHVDICL NPLGVPSRMNVGQILETHLAWACAGMGKKIGEMLEEYRKTMDISELRSELTEIYASEANDEVQRFDDDSLVKLAEEAKRG VSIATPVFDGAHEPDVAAMLKKAGLHESGQSVLYDGRTGEPFDRKVTVGYMYMIKLNHLVDDKIHARSIGPYSLVTQQPL GGKAQFGGQRFGEMEVWALEAYGAAYTLQEMLTVKSDDVAGRTKVYEAIVRGDDTFEAGIPESFNVLVKEMRSLGLSVEL ENSKIENQSEDQLPDAAE >Mature_1377_residues AQTLSFNGRRRVRKFFGKIPEVAEMPNLIEVQKASYDQFLMVDEPKGGRPDEGLNAVFKSVFPITDFSGASMLEFVSYEF EAPKFDVEECRQRDLTYAAPLKVTLRLIVFDIDEDTGAKSIKDIKEQSVYMGDMPLMTNNGTFIVNGTERVIVSQMHRSP GVFFDHDKGKSHSSGKLLFAARVIPYRGSWLDIEFDAKDIVYARIDRRRKLPVTSLLMALGMDGEEILSTFYTKATYERS GDGWRIPFQPEALKNAKVITDMIDADTGEVVVEGGKKLTPRLIRQLVDKGLKALKATDEDLYGNYLAEDIVNYSTGEIYL EAGDEIDEKTLGLILQSGFDEIPVLNIDHVNVGAYIRNTLSADKNQNRQEALFDIYRVMRPGEPPTMDSAEAMFNSLFFD AERYDLSAVGRVKMNMRLDLDAEDTVRTLRKEDILAVVKMLVELRDGKGEIDDIDNLGNRRVRSVGELMENQYRLGLLRM 
ERAIKERMSSIEIDTVMPQDLINAKPAAAAVREFFGSSQLSQFMDQVNPLSEITHKRRLSALGPGGLTRERAGFEVRDVH PTHYGRICPIETPEGPNIGLINSLATFARVNKYGFIESPYRKIIDGKVTTDVIYLSAMEEAKYYVAQANAELDGEGAFTE EFVVCRHSGEVMLAPRDNINLMDVSPKQLVSVAAALIPFLENDDANRALMGSNMQRQAVPLLRAEAPFVGTGMEPIVARD SGAAIAARRGGVVDQVDATRIVIRATEDLDAGKSGVDIYRLQKFQRSNQNTCVNQRPLVSVGDAISKGDIIADGPSTDLG DLALGRNALVAFMPWNGYNYEDSILMSERIVSDDVFTSIHIEEFEVMARDTKLGPEEITRDIPNVSEEALKNLDEAGIVY IGAEVQPGDILVGKITPKGESPMTPEEKLLRAIFGEKASDVRDTSMRMPPGTFGTIVEVRVFNRHGVEKDERAMAIEREE IERLAKDRDDEQAILDRNVYGRLIDMLRGHVSIAGPKGFKKGVELSNAVVSEYPRSQWWMFAVEDEKAQSELEALRGQYD ESKSRLEQRFMDKVEKVQRGDEMPPGVMKMVKVFVAVKRKIQPGDKMAGRHGNKGVVSRIVPVEDMPFLEDGTHVDICLN PLGVPSRMNVGQILETHLAWACAGMGKKIGEMLEEYRKTMDISELRSELTEIYASEANDEVQRFDDDSLVKLAEEAKRGV SIATPVFDGAHEPDVAAMLKKAGLHESGQSVLYDGRTGEPFDRKVTVGYMYMIKLNHLVDDKIHARSIGPYSLVTQQPLG GKAQFGGQRFGEMEVWALEAYGAAYTLQEMLTVKSDDVAGRTKVYEAIVRGDDTFEAGIPESFNVLVKEMRSLGLSVELE NSKIENQSEDQLPDAAE
Specific function: DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates [H]
COG id: COG0085
COG function: function code K; DNA-directed RNA polymerase, beta subunit/140 kD subunit
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the RNA polymerase beta chain family [H]
Homologues:
Organism=Homo sapiens, GI238908503, Length=304, Percent_Identity=30.9210526315789, Blast_Score=130, Evalue=1e-29
Organism=Homo sapiens, GI33469941, Length=321, Percent_Identity=29.9065420560748, Blast_Score=130, Evalue=1e-29
Organism=Homo sapiens, GI238908505, Length=304, Percent_Identity=30.9210526315789, Blast_Score=130, Evalue=1e-29
Organism=Homo sapiens, GI212286172, Length=321, Percent_Identity=29.9065420560748, Blast_Score=129, Evalue=1e-29
Organism=Homo sapiens, GI4505941, Length=248, Percent_Identity=32.6612903225806, Blast_Score=124, Evalue=5e-28
Organism=Escherichia coli, GI1790419, Length=1362, Percent_Identity=58.7371512481645, Blast_Score=1618, Evalue=0.0
Organism=Caenorhabditis elegans, GI17552304, Length=322, Percent_Identity=29.5031055900621, Blast_Score=125, Evalue=1e-28
Organism=Caenorhabditis elegans, GI17506623, Length=237, Percent_Identity=32.0675105485232, Blast_Score=116, Evalue=9e-26
Organism=Caenorhabditis elegans, GI25144348, Length=305, Percent_Identity=28.5245901639344, Blast_Score=99, Evalue=1e-20
Organism=Saccharomyces cerevisiae, GI6324725, Length=309, Percent_Identity=28.4789644012945, Blast_Score=125, Evalue=6e-29
Organism=Saccharomyces cerevisiae, GI6324781, Length=308, Percent_Identity=29.5454545454545, Blast_Score=124, Evalue=1e-28
Organism=Saccharomyces cerevisiae, GI6325267, Length=249, Percent_Identity=31.7269076305221, Blast_Score=119, Evalue=4e-27
Organism=Drosophila melanogaster, GI17647877, Length=247, Percent_Identity=32.7935222672065, Blast_Score=124, Evalue=4e-28
Organism=Drosophila melanogaster, GI17136444, Length=249, Percent_Identity=32.5301204819277, Blast_Score=120, Evalue=1e-26
Organism=Drosophila melanogaster, GI17136446, Length=248, Percent_Identity=32.6612903225806, Blast_Score=110, Evalue=7e-24
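The homologue entries share a fixed `Organism=…, GI…, Length=…, Percent_Identity=…, Blast_Score=…, Evalue=…` layout, so they can be read into structured records for sorting or filtering. The following is a sketch only: the regular expression and the `parse_homologues` helper are assumptions based on the field layout visible in this record, not a documented export format.

```python
import re
from typing import Dict, List, Union

# One BLAST hit; whitespace between fields may be spaces or line breaks.
HIT = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*"
    r"GI(?P<gi>\d+),\s*"
    r"Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<identity>[\d.]+),\s*"
    r"Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_homologues(text: str) -> List[Dict[str, Union[str, int, float]]]:
    """Extract every homologue hit from the flattened Homologues field."""
    hits = []
    for m in HIT.finditer(text):
        hits.append({
            "organism": m.group("organism").strip(),
            "gi": m.group("gi"),
            "length": int(m.group("length")),
            "identity": float(m.group("identity")),
            "score": int(m.group("score")),
            "evalue": float(m.group("evalue")),
        })
    return hits


# Two entries copied from the record above.
sample = (
    "Organism=Escherichia coli, GI1790419, Length=1362, "
    "Percent_Identity=58.7371512481645, Blast_Score=1618, Evalue=0.0, "
    "Organism=Homo sapiens, GI4505941, Length=248, "
    "Percent_Identity=32.6612903225806, Blast_Score=124, Evalue=5e-28,"
)
hits = parse_homologues(sample)
best = max(hits, key=lambda h: h["score"])
```

Sorting by `score` in this way picks out the Escherichia coli entry, the only listed hit whose alignment spans nearly the full length of the protein.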
Paralogues:
None
Copy number: 4,233 molecules/cell (growth phase, glucose-minimal MOPS media); 2,500 molecules/cell (glucose minimal media) [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR010243
- InterPro: IPR019462
- InterPro: IPR015712
- InterPro: IPR007120
- InterPro: IPR007121
- InterPro: IPR007644
- InterPro: IPR007642
- InterPro: IPR007645
- InterPro: IPR007641
- InterPro: IPR014724 [H]
Pfam domain/function: PF04563 RNA_pol_Rpb2_1; PF04561 RNA_pol_Rpb2_2; PF04565 RNA_pol_Rpb2_3; PF10385 RNA_pol_Rpb2_45; PF00562 RNA_pol_Rpb2_6; PF04560 RNA_pol_Rpb2_7 [H]
EC number: 2.7.7.6 [H]
Molecular weight: Translated: 153647; Mature: 153516
Theoretical pI: Translated: 4.73; Mature: 4.73
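The translated and mature molecular weights above differ by 131, which is about the mass of one methionine residue and agrees with the mature form lacking the initiator Met. A rough sketch of how such a weight can be estimated is shown here. It assumes standard average residue masses plus one water molecule; the method the database actually used is not stated in the record, so small differences from the listed values are possible.

```python
# Average residue masses in daltons (amino acid minus one water).
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water is added back for the free N- and C-termini


def molecular_weight(protein: str) -> float:
    """Estimate the average molecular weight of an unmodified protein chain."""
    protein = "".join(protein.split()).upper()
    return sum(AVERAGE_RESIDUE_MASS[aa] for aa in protein) + WATER


# First residues of the translated and mature sequences from this record.
delta = molecular_weight("MAQTLSF") - molecular_weight("AQTLSF")
```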
Prosite motif: PS01166 RNA_POL_BETA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
- 0.4 %Cys (Translated Protein)
- 3.8 %Met (Translated Protein)
- 4.3 %Cys+Met (Translated Protein)
- 0.4 %Cys (Mature Protein)
- 3.8 %Met (Mature Protein)
- 4.2 %Cys+Met (Mature Protein)
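These percentages can be recomputed from either protein sequence. The sketch below assumes each value is a residue count divided by the sequence length and rounded to one decimal. If the combined figure is rounded from the raw counts rather than summed from the rounded parts, that could explain why 0.4 and 3.8 are listed alongside 4.3 for the translated protein. The helper name is hypothetical.

```python
from typing import Dict


def cys_met_content(protein: str) -> Dict[str, float]:
    """Percent Cys, Met, and Cys+Met in a protein sequence (1 decimal)."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    n = len(protein)
    cys = protein.count("C")
    met = protein.count("M")
    return {
        "Cys": round(100.0 * cys / n, 1),
        "Met": round(100.0 * met / n, 1),
        # Rounded from the combined count, not from the two rounded values.
        "Cys+Met": round(100.0 * (cys + met) / n, 1),
    }


# Toy input; pass the translated or mature sequence to check the record.
toy = cys_met_content("MACM")
```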
Secondary structure:
>Translated Secondary Structure MAQTLSFNGRRRVRKFFGKIPEVAEMPNLIEVQKASYDQFLMVDEPKGGRPDEGLNAVFK CCCCCCCCCHHHHHHHHCCCCCHHHCCCHHHHHHCCCCCEEEEECCCCCCCCHHHHHHHH SVFPITDFSGASMLEFVSYEFEAPKFDVEECRQRDLTYAAPLKVTLRLIVFDIDEDTGAK HHCCCCCCCCCHHHHHHHHHCCCCCCCHHHHHHCCCCEECCCEEEEEEEEEEECCCCCHH SIKDIKEQSVYMGDMPLMTNNGTFIVNGTERVIVSQMHRSPGVFFDHDKGKSHSSGKLLF HHHHHHHHCEEECCCCEEECCCEEEECCCHHHHHHHHHCCCCCEEECCCCCCCCCCCEEE AARVIPYRGSWLDIEFDAKDIVYARIDRRRKLPVTSLLMALGMDGEEILSTFYTKATYER EEEEECCCCCEEEEEECCCHHHHHHHHCCCCCCHHHHHHHHCCCHHHHHHHHHHHHHHCC SGDGWRIPFQPEALKNAKVITDMIDADTGEVVVEGGKKLTPRLIRQLVDKGLKALKATDE CCCCCCCCCCCHHHCCHHHHHHHHCCCCCCEEECCCCCCCHHHHHHHHHHHHHHHHCCCH DLYGNYLAEDIVNYSTGEIYLEAGDEIDEKTLGLILQSGFDEIPVLNIDHVNVGAYIRNT HHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHCCCCCCCEEECCCCCHHHHHHHH LSADKNQNRQEALFDIYRVMRPGEPPTMDSAEAMFNSLFFDAERYDLSAVGRVKMNMRLD HHCCCCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCHHCCHHEECEEEEEEEEC LDAEDTVRTLRKEDILAVVKMLVELRDGKGEIDDIDNLGNRRVRSVGELMENQYRLGLLR CCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHH MERAIKERMSSIEIDTVMPQDLINAKPAAAAVREFFGSSQLSQFMDQVNPLSEITHKRRL HHHHHHHHHHCCEEHHCCCHHHHCCCHHHHHHHHHHCHHHHHHHHHHCCHHHHHHHHHHH SALGPGGLTRERAGFEVRDVHPTHYGRICPIETPEGPNIGLINSLATFARVNKYGFIESP HHCCCCCCCHHHCCCEEEECCCCCCCCEECCCCCCCCCCCHHHHHHHHHHHHHCCCCCCH YRKIIDGKVTTDVIYLSAMEEAKYYVAQANAELDGEGAFTEEFVVCRHSGEVMLAPRDNI HHHHHCCCHHHHEEEEEHHHHHHEEEEECCCCCCCCCCCCCEEEEEECCCCEEEECCCCC NLMDVSPKQLVSVAAALIPFLENDDANRALMGSNMQRQAVPLLRAEAPFVGTGMEPIVAR EEEECCHHHHHHHHHHHHHHHCCCCCCCHHHCCCCHHHHCHHHHCCCCCCCCCCCCEEEC DSGAAIAARRGGVVDQVDATRIVIRATEDLDAGKSGVDIYRLQKFQRSNQNTCVNQRPLV CCCCEEEECCCCCCCCCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCC SVGDAISKGDIIADGPSTDLGDLALGRNALVAFMPWNGYNYEDSILMSERIVSDDVFTSI HHCCCCCCCCEEECCCCCCHHHHHCCCCCEEEEECCCCCCCCCCHHHHHHHHCCHHHHEE HIEEFEVMARDTKLGPEEITRDIPNVSEEALKNLDEAGIVYIGAEVQPGDILVGKITPKG EHHHHHHHHHHCCCCHHHHHHHCCCCHHHHHHCCCCCCEEEEECCCCCCCEEEEEECCCC ESPMTPEEKLLRAIFGEKASDVRDTSMRMPPGTFGTIVEVRVFNRHGVEKDERAMAIERE CCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCHHHHHHHHHHH 
EIERLAKDRDDEQAILDRNVYGRLIDMLRGHVSIAGPKGFKKGVELSNAVVSEYPRSQWW HHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCEEECCCCCHHHHHHHHHHHHHHCCCCCEE MFAVEDEKAQSELEALRGQYDESKSRLEQRFMDKVEKVQRGDEMPPGVMKMVKVFVAVKR EEEECCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH KIQPGDKMAGRHGNKGVVSRIVPVEDMPFLEDGTHVDICLNPLGVPSRMNVGQILETHLA HCCCCHHHCCCCCCCCHHHEECCCCCCCCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHH WACAGMGKKIGEMLEEYRKTMDISELRSELTEIYASEANDEVQRFDDDSLVKLAEEAKRG HHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHCCCHHHHHHHHHHHCC VSIATPVFDGAHEPDVAAMLKKAGLHESGQSVLYDGRTGEPFDRKVTVGYMYMIKLNHLV CEEECCCCCCCCCCHHHHHHHHCCCCCCCCEEEECCCCCCCCCCEEEEEEEEEEEEHHHH DDKIHARSIGPYSLVTQQPLGGKAQFGGQRFGEMEVWALEAYGAAYTLQEMLTVKSDDVA CCCHHHCCCCCHHHEECCCCCCCCCCCCCCCCCEEEEEEEHHCHHHHHHHHHHHCCCCCC GRTKVYEAIVRGDDTFEAGIPESFNVLVKEMRSLGLSVELENSKIENQSEDQLPDAAE HHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHCCCEEEECCCCCCCCCHHCCCCCCC >Mature Secondary Structure AQTLSFNGRRRVRKFFGKIPEVAEMPNLIEVQKASYDQFLMVDEPKGGRPDEGLNAVFK CCCCCCCCHHHHHHHHCCCCCHHHCCCHHHHHHCCCCCEEEEECCCCCCCCHHHHHHHH SVFPITDFSGASMLEFVSYEFEAPKFDVEECRQRDLTYAAPLKVTLRLIVFDIDEDTGAK HHCCCCCCCCCHHHHHHHHHCCCCCCCHHHHHHCCCCEECCCEEEEEEEEEEECCCCCHH SIKDIKEQSVYMGDMPLMTNNGTFIVNGTERVIVSQMHRSPGVFFDHDKGKSHSSGKLLF HHHHHHHHCEEECCCCEEECCCEEEECCCHHHHHHHHHCCCCCEEECCCCCCCCCCCEEE AARVIPYRGSWLDIEFDAKDIVYARIDRRRKLPVTSLLMALGMDGEEILSTFYTKATYER EEEEECCCCCEEEEEECCCHHHHHHHHCCCCCCHHHHHHHHCCCHHHHHHHHHHHHHHCC SGDGWRIPFQPEALKNAKVITDMIDADTGEVVVEGGKKLTPRLIRQLVDKGLKALKATDE CCCCCCCCCCCHHHCCHHHHHHHHCCCCCCEEECCCCCCCHHHHHHHHHHHHHHHHCCCH DLYGNYLAEDIVNYSTGEIYLEAGDEIDEKTLGLILQSGFDEIPVLNIDHVNVGAYIRNT HHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHCCCCCCCEEECCCCCHHHHHHHH LSADKNQNRQEALFDIYRVMRPGEPPTMDSAEAMFNSLFFDAERYDLSAVGRVKMNMRLD HHCCCCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCHHCCHHEECEEEEEEEEC LDAEDTVRTLRKEDILAVVKMLVELRDGKGEIDDIDNLGNRRVRSVGELMENQYRLGLLR CCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHH MERAIKERMSSIEIDTVMPQDLINAKPAAAAVREFFGSSQLSQFMDQVNPLSEITHKRRL HHHHHHHHHHCCEEHHCCCHHHHCCCHHHHHHHHHHCHHHHHHHHHHCCHHHHHHHHHHH 
SALGPGGLTRERAGFEVRDVHPTHYGRICPIETPEGPNIGLINSLATFARVNKYGFIESP HHCCCCCCCHHHCCCEEEECCCCCCCCEECCCCCCCCCCCHHHHHHHHHHHHHCCCCCCH YRKIIDGKVTTDVIYLSAMEEAKYYVAQANAELDGEGAFTEEFVVCRHSGEVMLAPRDNI HHHHHCCCHHHHEEEEEHHHHHHEEEEECCCCCCCCCCCCCEEEEEECCCCEEEECCCCC NLMDVSPKQLVSVAAALIPFLENDDANRALMGSNMQRQAVPLLRAEAPFVGTGMEPIVAR EEEECCHHHHHHHHHHHHHHHCCCCCCCHHHCCCCHHHHCHHHHCCCCCCCCCCCCEEEC DSGAAIAARRGGVVDQVDATRIVIRATEDLDAGKSGVDIYRLQKFQRSNQNTCVNQRPLV CCCCEEEECCCCCCCCCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCC SVGDAISKGDIIADGPSTDLGDLALGRNALVAFMPWNGYNYEDSILMSERIVSDDVFTSI HHCCCCCCCCEEECCCCCCHHHHHCCCCCEEEEECCCCCCCCCCHHHHHHHHCCHHHHEE HIEEFEVMARDTKLGPEEITRDIPNVSEEALKNLDEAGIVYIGAEVQPGDILVGKITPKG EHHHHHHHHHHCCCCHHHHHHHCCCCHHHHHHCCCCCCEEEEECCCCCCCEEEEEECCCC ESPMTPEEKLLRAIFGEKASDVRDTSMRMPPGTFGTIVEVRVFNRHGVEKDERAMAIERE CCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCHHHHHHHHHHH EIERLAKDRDDEQAILDRNVYGRLIDMLRGHVSIAGPKGFKKGVELSNAVVSEYPRSQWW HHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCEEECCCCCHHHHHHHHHHHHHHCCCCCEE MFAVEDEKAQSELEALRGQYDESKSRLEQRFMDKVEKVQRGDEMPPGVMKMVKVFVAVKR EEEECCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH KIQPGDKMAGRHGNKGVVSRIVPVEDMPFLEDGTHVDICLNPLGVPSRMNVGQILETHLA HCCCCHHHCCCCCCCCHHHEECCCCCCCCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHH WACAGMGKKIGEMLEEYRKTMDISELRSELTEIYASEANDEVQRFDDDSLVKLAEEAKRG HHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHCCCHHHHHHHHHHHCC VSIATPVFDGAHEPDVAAMLKKAGLHESGQSVLYDGRTGEPFDRKVTVGYMYMIKLNHLV CEEECCCCCCCCCCHHHHHHHHCCCCCCCCEEEECCCCCCCCCCEEEEEEEEEEEEHHHH DDKIHARSIGPYSLVTQQPLGGKAQFGGQRFGEMEVWALEAYGAAYTLQEMLTVKSDDVA CCCHHHCCCCCHHHEECCCCCCCCCCCCCCCCCEEEEEEEHHCHHHHHHHHHHHCCCCCC GRTKVYEAIVRGDDTFEAGIPESFNVLVKEMRSLGLSVELENSKIENQSEDQLPDAAE HHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHCCCEEEECCCCCCCCCHHCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA