Definition | Acidovorax citrulli AAC00-1 chromosome, complete genome. |
---|---|
Accession | NC_008752 |
Length | 5,352,772 |
The map label for this gene is ytfN [H]
Identifier: 120612595
GI number: 120612595
Start: 4382634
End: 4387055
Strand: Direct
Name: ytfN [H]
Synonym: Aave_3956
Alternate gene names: 120612595
Gene position: 4382634-4387055 (Clockwise)
Preceding gene: 120612594
Following gene: 120612598
Centisome position: 81.88
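The coordinate fields above are related by simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the centisome position is taken from the start coordinate as a percentage of the chromosome length. That assumption reproduces the listed value of 81.88. The function names are hypothetical and not part of the database.

```python
# Values copied from the record above; coordinates assumed 1-based, inclusive.
GENOME_LENGTH = 5_352_772
START, END = 4_382_634, 4_387_055


def gene_length(start: int, end: int) -> int:
    """Number of bases spanned by an inclusive coordinate pair."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the chromosome length."""
    return round(100 * position / genome_length, 2)


length = gene_length(START, END)   # bases in the gene
codons = length // 3               # includes the terminal stop codon
residues = codons - 1              # translated protein length

print(length, residues, centisome(START, GENOME_LENGTH))
# -> 4422 1473 81.88
```

The 4422 bases match the FASTA header of the gene sequence, and the 1473 residues match the translated protein length reported further down.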
GC content: 77.57
Gene sequence:
>4422_bases ATGGCCACGCACCAGACCCCGAACCCCTATGCCGCCGCCGCGGCCCCGCCACCCCGCGCACCCCGGCGGCGTGCCCTGCG CATCGCGCTCTGGTCCCTCGCCAGCCTCATCGCACTGCTGCTCGTGCTGCTCGCCGCGGCCTGGTGGTGGACGGGCACCG GCAACTCCCTGGCCACCGCGCTCGCCCAGGTCGCCCGCTACCTGCCGGCCGGGCAGACCCTGGAAACGCGCGACGTCTCC GGCTCCGTGCGCGCGGGCGGGCAGATCGGCTGGCTGCGCTGGAGCAGCCCCACCCTCGTGGTGGAAGTGCAGGACGCGCG CATCGGCTGGAGCCTGCCGCCGCTGCTGCAGCGCGAACTCTCGCTGGGCGAAGTGCATGCCGCGCGCATCGTCGCCACGC CGCGCACGCCCGCGGAGCCCGAGCCGGACAAGCCGGTGCAGCCCCTGGAACAGCTGGTGCTGCCGCTGCGCATCGACATT CCCCTGCGCGTGGACGAAATCACCTGGGCCGCCGCCAGCCCCGCCACCGTGCGCAACCTCGCCGGCCGCTACCGCTTCGA CGGCGACCAGCACAGCGCCACGCTCGACAACCTGGAGCTGGCCCGGGGCCGCTACAGCGGCAGCGCCACGCTGCAGGCAC AGGCCCCCATGGCCCTGGAACTCACGGCCGACGGCGTGGTCCGCACACCCGCTCCCGGCGGCGGCGCGGACCTGGAAGCC ACTGCCCGCGCCGAGGTGCGCGGCACCCTCGCCACGGCCGCCGCGCGCCTGGACGTGTCCGGGCGCCTGCAGGCCGTGGC CGCGCAGGCCGCAGCGCCGCGGGGCCCCGCGGCGTCCGCGCCCCGCCGCGCCGGCTCCGCGCCCCGTCCGGCCCCCGCCA CGCCCACCACGCCGTCCGCTTCGGTAGCTGCTTCATCCCCCGCCGGCACGACGCCTTCCGGCACGGAGCCCATGCAGGCC CAGGTCCAGGCCCGCGTCGCCCCCTGGGCGCCGCAGCCCCTGCTGCAGGCACAGGCGCAGGTGCAGGCCCTGGACGTGGC CGCGATCTGGCCCCAGGCCCCTGCCACGCGCCTGAGCGGCCGCATCGAAGCCGCGCCCGCGCCGGCCACCGCGCCAGGAA CCGGGGCTCCGCAGGCCGTGCCGCAGCCGGCGCCTGCCCGGGCACCGGCATCGGCCGCTTCCGCGCCCTCCGCGTCCGCC GGCACGCCGCCAGCGACCGGCTGGGCCCTGTCCGCCCAGCTGGAAAACGCCCTGCCCGGCCCCTGGGACAAGGGCCGGCT GCCGGTCGAATCCATCGACGCGCGCGCCGGTTTCGATGGCGCACGCTGGAGCGTGCCCCAGGCCACCGCCCGCGTGGGCA GCGGCACGATCGCGCTGGAAGGCGCCTTCACGCCCGCCACGCATGCCCTGCAGGGCGACCTGGAACTGCGCGGCGTGCGG CCCGACGCCGTGCTCAGCACCCTGGACGCCGCGCCGCTGTCCGGCCGGGCGCGCGCGCGCAGCGACGGCGCCCCCGCCGC CGACCACAACGATGCCGCGCCCGTCCCGCTGGTGCGCTTCTCGGCCGACATCCGCTCCGCAGGCACCGGCCGCGCGCCAC GCGAGGGCGGCACGTCCACCGCGCAGGCCGCCCCGCCGCTGCGCATCGACCGGCTCGCCACCGAAGGCACCTGGCAGGGC ACCCTGCTCACCCTGGCGCAGCTGCAGCTGGACGCGCTGCAGGTGCAGGCCAGTGCGCGCCAGTTGCGCATCGACACCGC CGACCGCTCCGCCCAGGGCCAGTTGCAGGCCACGCTGCCCGGCGCCACCGCACAGGTGGACGGCCGCATCGCCCCGCGTG CCGGCGCGGGCACGCTGGACCTGCGCGTCGCGGACGCGCAGCGCGTGCAGCGCTGGATCGAATCCGTTCCCGGCCTGGGC 
ACCGCCCTCGGCGGCGCGGCGCTGCAGGGCGAGGCGCGCCTCGACGCCCGCTGGAACGGCGGCTGGGAATCCCTGCTGGG CCAGTTGCAGACCGCCGGACTCGTGGCAAAGCCCCGGACCGGCGGCGCCACCGCCGCCGGCCCCTTCGAGCTGCAGGCCC GCCTCGCCGCCCCGCGCTGGGAAGTGGCCCTGCCGCCGCGGCCCGGCACCGGCGCGGGCCCCGCGACGCTCAGGCTCACG GCCGTGCGGGCCGACGTGGCCGGCTCCGTGCCGCGCGCCACGCTTTCCCTGGACGGCGAGGCGCGGCTCGACGAGCGCCG GCTGGACCTGCGCCTGCGCGGCTCCGGCGGCGCCGCCGGCCCGGACCAGTGGCGCGCGCAGATCGACGAACTGCGCCTGC AGGCCCGCGACGGACAGCGCCCGGGCCCCTGGACGGTGCAGTTCGCGCAGCCGCTCACCGTGTCGGCACGCACCGCGCCC ACGCTGCAGGTGGAAACCTCCGGCGGGCAGGCGCGCGTGTCCGCGCCCGCGCCGGGCGATGTCACCCTGCGCTGGGAGCC CGTGCGCTTCGCGCGCACCGCATCGGGCGGCCTCCAGCTGCGCACGCGCGGCCAGCTGCAGGGCCTGCCCATGGCCTGGG CCGAAGCGCTGGCGCAGGGCAGCGACGCCCTGGACCGCCTGGGCGTGCAGGGCAACCTCGTCTTCGACGGCGACTGGGAC GTGGACGCGGGCGACACCCTGCGCGCGTCCGCCAGCCTGCGCCGCACCTCCGGCGACCTGCGCCTGCTGACCGGCAACGC GCCGGCGGCCACCGTCGTGCGCAGCAGCGGCCCGAGCGCGGGCACCGGCACGGGGCCGGCCGCCGGCATGGTCCGTGGAA CGGCTGCCGCCGCCAACGCCTCCGCGACCGACACGCCGCGCGGCGCCGGCACCCCCGCCGGCGTGCGCGCCGCCGAAGTG CGCGTGCAGGCGGAAGGCGACACCGTGCGCGCGCGCCTGCAATGGGACAGCGAGCGCGCCGGCCAGGTCCAGGCCGAGGC CTCCACGCACCTGGCGCGCGTGGGCGAAGGGTGGGAATGGCCCGCCGACGCGCCCCTCTCCGCCACCGCGCGCGCGCGGC TGCCCGACGTGGGCGTGTGGTCCACGCTCGCCCCGCCGGGCTGGCGCGTGCAGGGCACGCTCGATGCCGACGTGGTGCTG TCCGGCACCCGCACCGCGCCGCGCTGGAGCGGCACGCTCGCCGCCGACCAGCTGGCCGTGCGCTCGCTGCTCGACGGCGT GGACCTGCAGAACGGCCGCCTGCGCGCGGCGCTGCGCGGCGACCGGCTCGAGATCACCGAGTTCCGCGTGAACGGCGGCC CCGGAAGCAGCGCACGCATCGCGGGCTTCAGCGGCAACCGCACCGCCGCGCCCAAGGACGGCGGCACCCTCTCGGGCACC GGCACCGTGTCCTGGGCCGGCATGGGCCGGGAGGCCGCAGCGGGCGGCTCGGGCATCGCCATGGACTTCACCGCGGATGC GCGCGCCCTGCAGGTGCTGGTGCGCGCCGACCGGCAGGTGAGCGTCTCGGGCCAGGTGCGCGCGCAGCTGCGCCAAGGCC AGTTGTCGCTGCGCGGCAAGCTCACCGCCGACCGGGCCACCATCATCCTCCCGGAGGCCGGCGCGCCCAGCCTGGGCAGC GACGTGGTCGTACGCTCCGCCGCCACCGACCGTGCCCGTGCCGAGGCCGCCCAGCGCGAAGGCGCCCAGGCCGGCCGCGT GGAAGCCGCGCGCCCACCGGACATCGCCCTCACCTTCGACCTCGGCGACGACTTCGCGCTGCAGGGCCACGGCGTGACCA CGCGCCTGGCCGGCGAGCTGGACATCCGCGGCGCCACCACGGCCGGCGGCCCGCCGCGCATCACCGGCGAGGTCCGCACC 
GTCGAGGGCCGCTACCGCGCATGGGGGCAGGCGCTCAACATCGAGACCGGCCTCGCGCGCTTCAACGGCCCCTACGACAA CCCCGCGCTGGACGTGCTGGCCATCCGGCCCAACATCAGCGTGCGCGCGGGCGTGCAGGTCTCCGGCACGGCCAAGGCGC CGCGCGTGGCGCTCTACTCCGACCCCGAGCTGCCCGACGCCGAAAAACTCTCCTGGGTGGTCCTGGGCCGCAGCACCGCC GCCGGCGGCGCCGAGGCCGCGCTGCTGCAGCAGGCCGCGCTGGCCCTGCTGGGCGGCGGGGGCGGCAACGCCGGTGCGGG CAATTTCGCGAGCCGCCTGGGGCTGGACGAGATCGGCTTCCGCGGACCGAACTCCGGCGCCAGCGGCGAAGACGCCTCGG GCGCCGCGCTCACCTTCGGCAAGCGGCTGTCCAAGGATCTGTACGTCACCTACGAGCGCAGCCTGTCGGGCGCCCTGGGC ACGCTCTACATCTTCTACGACCTCACGCAGCGGCTCACGCTGCGCGGCCAGACGGGCGTGCAGAGCGCGGTGGACCTGAT CTACACGCTGCGGTACGACTGA
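A GC content figure like the one listed above can be recomputed from the sequence text. This is a minimal sketch, assuming the sequence is pasted as shown here, with a `>`-prefixed header token and whitespace between blocks. The helper name `gc_percent` is hypothetical, and the database's own rounding rules are not documented in this record.

```python
def gc_percent(fasta_text: str) -> float:
    """Percentage of G and C bases, ignoring header tokens and whitespace."""
    # Drop any token that starts with '>' (the header), then join the rest.
    seq = "".join(
        token for token in fasta_text.split() if not token.startswith(">")
    ).upper()
    if not seq:
        raise ValueError("no sequence found")
    gc = seq.count("G") + seq.count("C")
    return round(100 * gc / len(seq), 2)


# Toy input, not the gene itself: 4 of 6 bases are G or C.
print(gc_percent(">6_bases ATGG CC"))
# -> 66.67
```

Passing the full gene sequence text to `gc_percent` is the intended use. The toy string only demonstrates the parsing.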
Upstream 100 bases:
>100_bases CCGACGTGGCCTACGGCGTGCAGGCCAAGGCCCTGCGGCTGCACCTGCGCCTGGGTTTCAGCTTCTGACCGGCTCCGGCC GCTTTTATCCGCGCGATCCG
Downstream 100 bases:
>100_bases CGCAGTGCGGCGCGGCGCGGCGCGAGGCGCGGCCGGCCGGCGGTCAGCCCCGGCCGCCGGGCGCCAGCTGGATGGTGCCC AGCCCCGCAGGCGCATCGCC
Product: hypothetical protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 1473; Mature: 1472
Protein sequence:
>1473_residues MATHQTPNPYAAAAAPPPRAPRRRALRIALWSLASLIALLLVLLAAAWWWTGTGNSLATALAQVARYLPAGQTLETRDVS GSVRAGGQIGWLRWSSPTLVVEVQDARIGWSLPPLLQRELSLGEVHAARIVATPRTPAEPEPDKPVQPLEQLVLPLRIDI PLRVDEITWAAASPATVRNLAGRYRFDGDQHSATLDNLELARGRYSGSATLQAQAPMALELTADGVVRTPAPGGGADLEA TARAEVRGTLATAAARLDVSGRLQAVAAQAAAPRGPAASAPRRAGSAPRPAPATPTTPSASVAASSPAGTTPSGTEPMQA QVQARVAPWAPQPLLQAQAQVQALDVAAIWPQAPATRLSGRIEAAPAPATAPGTGAPQAVPQPAPARAPASAASAPSASA GTPPATGWALSAQLENALPGPWDKGRLPVESIDARAGFDGARWSVPQATARVGSGTIALEGAFTPATHALQGDLELRGVR PDAVLSTLDAAPLSGRARARSDGAPAADHNDAAPVPLVRFSADIRSAGTGRAPREGGTSTAQAAPPLRIDRLATEGTWQG TLLTLAQLQLDALQVQASARQLRIDTADRSAQGQLQATLPGATAQVDGRIAPRAGAGTLDLRVADAQRVQRWIESVPGLG TALGGAALQGEARLDARWNGGWESLLGQLQTAGLVAKPRTGGATAAGPFELQARLAAPRWEVALPPRPGTGAGPATLRLT AVRADVAGSVPRATLSLDGEARLDERRLDLRLRGSGGAAGPDQWRAQIDELRLQARDGQRPGPWTVQFAQPLTVSARTAP TLQVETSGGQARVSAPAPGDVTLRWEPVRFARTASGGLQLRTRGQLQGLPMAWAEALAQGSDALDRLGVQGNLVFDGDWD VDAGDTLRASASLRRTSGDLRLLTGNAPAATVVRSSGPSAGTGTGPAAGMVRGTAAAANASATDTPRGAGTPAGVRAAEV RVQAEGDTVRARLQWDSERAGQVQAEASTHLARVGEGWEWPADAPLSATARARLPDVGVWSTLAPPGWRVQGTLDADVVL SGTRTAPRWSGTLAADQLAVRSLLDGVDLQNGRLRAALRGDRLEITEFRVNGGPGSSARIAGFSGNRTAAPKDGGTLSGT GTVSWAGMGREAAAGGSGIAMDFTADARALQVLVRADRQVSVSGQVRAQLRQGQLSLRGKLTADRATIILPEAGAPSLGS DVVVRSAATDRARAEAAQREGAQAGRVEAARPPDIALTFDLGDDFALQGHGVTTRLAGELDIRGATTAGGPPRITGEVRT VEGRYRAWGQALNIETGLARFNGPYDNPALDVLAIRPNISVRAGVQVSGTAKAPRVALYSDPELPDAEKLSWVVLGRSTA AGGAEAALLQQAALALLGGGGGNAGAGNFASRLGLDEIGFRGPNSGASGEDASGAALTFGKRLSKDLYVTYERSLSGALG TLYIFYDLTQRLTLRGQTGVQSAVDLIYTLRYD
Sequences:
>Translated_1473_residues MATHQTPNPYAAAAAPPPRAPRRRALRIALWSLASLIALLLVLLAAAWWWTGTGNSLATALAQVARYLPAGQTLETRDVS GSVRAGGQIGWLRWSSPTLVVEVQDARIGWSLPPLLQRELSLGEVHAARIVATPRTPAEPEPDKPVQPLEQLVLPLRIDI PLRVDEITWAAASPATVRNLAGRYRFDGDQHSATLDNLELARGRYSGSATLQAQAPMALELTADGVVRTPAPGGGADLEA TARAEVRGTLATAAARLDVSGRLQAVAAQAAAPRGPAASAPRRAGSAPRPAPATPTTPSASVAASSPAGTTPSGTEPMQA QVQARVAPWAPQPLLQAQAQVQALDVAAIWPQAPATRLSGRIEAAPAPATAPGTGAPQAVPQPAPARAPASAASAPSASA GTPPATGWALSAQLENALPGPWDKGRLPVESIDARAGFDGARWSVPQATARVGSGTIALEGAFTPATHALQGDLELRGVR PDAVLSTLDAAPLSGRARARSDGAPAADHNDAAPVPLVRFSADIRSAGTGRAPREGGTSTAQAAPPLRIDRLATEGTWQG TLLTLAQLQLDALQVQASARQLRIDTADRSAQGQLQATLPGATAQVDGRIAPRAGAGTLDLRVADAQRVQRWIESVPGLG TALGGAALQGEARLDARWNGGWESLLGQLQTAGLVAKPRTGGATAAGPFELQARLAAPRWEVALPPRPGTGAGPATLRLT AVRADVAGSVPRATLSLDGEARLDERRLDLRLRGSGGAAGPDQWRAQIDELRLQARDGQRPGPWTVQFAQPLTVSARTAP TLQVETSGGQARVSAPAPGDVTLRWEPVRFARTASGGLQLRTRGQLQGLPMAWAEALAQGSDALDRLGVQGNLVFDGDWD VDAGDTLRASASLRRTSGDLRLLTGNAPAATVVRSSGPSAGTGTGPAAGMVRGTAAAANASATDTPRGAGTPAGVRAAEV RVQAEGDTVRARLQWDSERAGQVQAEASTHLARVGEGWEWPADAPLSATARARLPDVGVWSTLAPPGWRVQGTLDADVVL SGTRTAPRWSGTLAADQLAVRSLLDGVDLQNGRLRAALRGDRLEITEFRVNGGPGSSARIAGFSGNRTAAPKDGGTLSGT GTVSWAGMGREAAAGGSGIAMDFTADARALQVLVRADRQVSVSGQVRAQLRQGQLSLRGKLTADRATIILPEAGAPSLGS DVVVRSAATDRARAEAAQREGAQAGRVEAARPPDIALTFDLGDDFALQGHGVTTRLAGELDIRGATTAGGPPRITGEVRT VEGRYRAWGQALNIETGLARFNGPYDNPALDVLAIRPNISVRAGVQVSGTAKAPRVALYSDPELPDAEKLSWVVLGRSTA AGGAEAALLQQAALALLGGGGGNAGAGNFASRLGLDEIGFRGPNSGASGEDASGAALTFGKRLSKDLYVTYERSLSGALG TLYIFYDLTQRLTLRGQTGVQSAVDLIYTLRYD >Mature_1472_residues ATHQTPNPYAAAAAPPPRAPRRRALRIALWSLASLIALLLVLLAAAWWWTGTGNSLATALAQVARYLPAGQTLETRDVSG SVRAGGQIGWLRWSSPTLVVEVQDARIGWSLPPLLQRELSLGEVHAARIVATPRTPAEPEPDKPVQPLEQLVLPLRIDIP LRVDEITWAAASPATVRNLAGRYRFDGDQHSATLDNLELARGRYSGSATLQAQAPMALELTADGVVRTPAPGGGADLEAT ARAEVRGTLATAAARLDVSGRLQAVAAQAAAPRGPAASAPRRAGSAPRPAPATPTTPSASVAASSPAGTTPSGTEPMQAQ VQARVAPWAPQPLLQAQAQVQALDVAAIWPQAPATRLSGRIEAAPAPATAPGTGAPQAVPQPAPARAPASAASAPSASAG 
TPPATGWALSAQLENALPGPWDKGRLPVESIDARAGFDGARWSVPQATARVGSGTIALEGAFTPATHALQGDLELRGVRP DAVLSTLDAAPLSGRARARSDGAPAADHNDAAPVPLVRFSADIRSAGTGRAPREGGTSTAQAAPPLRIDRLATEGTWQGT LLTLAQLQLDALQVQASARQLRIDTADRSAQGQLQATLPGATAQVDGRIAPRAGAGTLDLRVADAQRVQRWIESVPGLGT ALGGAALQGEARLDARWNGGWESLLGQLQTAGLVAKPRTGGATAAGPFELQARLAAPRWEVALPPRPGTGAGPATLRLTA VRADVAGSVPRATLSLDGEARLDERRLDLRLRGSGGAAGPDQWRAQIDELRLQARDGQRPGPWTVQFAQPLTVSARTAPT LQVETSGGQARVSAPAPGDVTLRWEPVRFARTASGGLQLRTRGQLQGLPMAWAEALAQGSDALDRLGVQGNLVFDGDWDV DAGDTLRASASLRRTSGDLRLLTGNAPAATVVRSSGPSAGTGTGPAAGMVRGTAAAANASATDTPRGAGTPAGVRAAEVR VQAEGDTVRARLQWDSERAGQVQAEASTHLARVGEGWEWPADAPLSATARARLPDVGVWSTLAPPGWRVQGTLDADVVLS GTRTAPRWSGTLAADQLAVRSLLDGVDLQNGRLRAALRGDRLEITEFRVNGGPGSSARIAGFSGNRTAAPKDGGTLSGTG TVSWAGMGREAAAGGSGIAMDFTADARALQVLVRADRQVSVSGQVRAQLRQGQLSLRGKLTADRATIILPEAGAPSLGSD VVVRSAATDRARAEAAQREGAQAGRVEAARPPDIALTFDLGDDFALQGHGVTTRLAGELDIRGATTAGGPPRITGEVRTV EGRYRAWGQALNIETGLARFNGPYDNPALDVLAIRPNISVRAGVQVSGTAKAPRVALYSDPELPDAEKLSWVVLGRSTAA GGAEAALLQQAALALLGGGGGNAGAGNFASRLGLDEIGFRGPNSGASGEDASGAALTFGKRLSKDLYVTYERSLSGALGT LYIFYDLTQRLTLRGQTGVQSAVDLIYTLRYD
Specific function: Unknown
COG id: COG2911
COG function: function code S; Uncharacterized protein conserved in bacteria
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: To H.influenzae HI_0696 [H]
Homologues:
Organism=Escherichia coli, GI1790667, Length=327, Percent_Identity=28.1345565749235, Blast_Score=93, Evalue=1e-19,
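A homologue entry in this comma-separated form can be split into named fields for downstream filtering. The parser below is a sketch under the assumption that every field is either `key=value` or a bare `GI<number>` token, as in the line above. `parse_hit` is a hypothetical helper, not a database API.

```python
def parse_hit(line: str) -> dict:
    """Split one homologue line into a dict of string fields."""
    fields = {}
    for part in line.strip().rstrip(",").split(","):
        part = part.strip()
        if "=" in part:
            key, value = part.split("=", 1)
            fields[key] = value
        elif part.startswith("GI"):
            # Bare GI token: keep only the numeric identifier.
            fields["GI"] = part[2:]
    return fields


hit = parse_hit(
    "Organism=Escherichia coli, GI1790667, Length=327, "
    "Percent_Identity=28.1345565749235, Blast_Score=93, Evalue=1e-19,"
)
print(hit["Organism"], hit["GI"], int(hit["Length"]), float(hit["Evalue"]))
```

Values are kept as strings so that the caller decides how to convert them, for example `float(hit["Percent_Identity"])`.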
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR007452 [H]
Pfam domain/function: PF04357 DUF490 [H]
EC number: NA
Molecular weight: Translated: 151882; Mature: 151751
Theoretical pI: Translated: 10.59; Mature: 10.59
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.0 %Cys (Translated Protein)
0.5 %Met (Translated Protein)
0.5 %Cys+Met (Translated Protein)
0.0 %Cys (Mature Protein)
0.4 %Met (Mature Protein)
0.4 %Cys+Met (Mature Protein)
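The translated and mature percentages differ because the mature form is one residue shorter (1472 versus 1473), which is consistent with removal of the initiator methionine. The sketch below illustrates the calculation on the first 60 residues of the protein sequence given above, not on the full protein. The helper names and the rounding to one decimal place are assumptions.

```python
def mature_form(translated: str) -> str:
    """Drop a leading initiator Met, if present (assumed processing step)."""
    return translated[1:] if translated.startswith("M") else translated


def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    hits = sum(seq.count(r) for r in residues)
    return round(100 * hits / len(seq), 1)


# First 60 residues of the translated protein, copied from the record.
fragment = "MATHQTPNPYAAAAAPPPRAPRRRALRIALWSLASLIALLLVLLAAAWWWTGTGNSLATA"
mature = mature_form(fragment)

print(residue_percent(fragment, "M"), residue_percent(mature, "M"))
# -> 1.7 0.0
print(residue_percent(fragment, "CM"))
# -> 1.7
```

Running the same functions on the full 1473-residue sequence is the intended use. The fragment keeps the example short.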
Secondary structure:
>Translated Secondary Structure MATHQTPNPYAAAAAPPPRAPRRRALRIALWSLASLIALLLVLLAAAWWWTGTGNSLATA CCCCCCCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHEECCCCHHHHHH LAQVARYLPAGQTLETRDVSGSVRAGGQIGWLRWSSPTLVVEVQDARIGWSLPPLLQREL HHHHHHHCCCCCCEEECCCCCCCCCCCEEEEEEECCCEEEEEEECCCCCCCCCHHHHHHC SLGEVHAARIVATPRTPAEPEPDKPVQPLEQLVLPLRIDIPLRVDEITWAAASPATVRNL CCCCEEEEEEEECCCCCCCCCCCCCHHHHHHHHCCEEECCCEEECCEEEECCCCHHHHHH AGRYRFDGDQHSATLDNLELARGRYSGSATLQAQAPMALELTADGVVRTPAPGGGADLEA CCCCCCCCCCCCCCHHHHHHHCCCCCCCEEEEECCCEEEEEECCCEEECCCCCCCCCCCH TARAEVRGTLATAAARLDVSGRLQAVAAQAAAPRGPAASAPRRAGSAPRPAPATPTTPSA HHHHHHHHHHHHHHHEECCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC SVAASSPAGTTPSGTEPMQAQVQARVAPWAPQPLLQAQAQVQALDVAAIWPQAPATRLSG CEECCCCCCCCCCCCCCHHHHHHEEECCCCCCHHHHHHHHHEEEEEEEECCCCCHHHHCC RIEAAPAPATAPGTGAPQAVPQPAPARAPASAASAPSASAGTPPATGWALSAQLENALPG EEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEHHHCCCC PWDKGRLPVESIDARAGFDGARWSVPQATARVGSGTIALEGAFTPATHALQGDLELRGVR CCCCCCCCHHHCCCCCCCCCCCCCCCHHHHHCCCCEEEEECCCCCCCHHEECCEEEECCC PDAVLSTLDAAPLSGRARARSDGAPAADHNDAAPVPLVRFSADIRSAGTGRAPREGGTST HHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCHHCCCCCCCCCCCCCCC AQAAPPLRIDRLATEGTWQGTLLTLAQLQLDALQVQASARQLRIDTADRSAQGQLQATLP CCCCCCCEEEHCCCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCEEEEECC GATAQVDGRIAPRAGAGTLDLRVADAQRVQRWIESVPGLGTALGGAALQGEARLDARWNG CCCEEECCEECCCCCCCEEEEEECCHHHHHHHHHHCCCCCHHHCCCEECCCEEEEEEECC GWESLLGQLQTAGLVAKPRTGGATAAGPFELQARLAAPRWEVALPPRPGTGAGPATLRLT CHHHHHHHHHHCCEEECCCCCCCCCCCCCEEEEEECCCCEEEEECCCCCCCCCCCEEEEE AVRADVAGSVPRATLSLDGEARLDERRLDLRLRGSGGAAGPDQWRAQIDELRLQARDGQR EEEECCCCCCCCEEEECCCCCCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCC PGPWTVQFAQPLTVSARTAPTLQVETSGGQARVSAPAPGDVTLRWEPVRFARTASGGLQL CCCEEEEECCCEEEECCCCCEEEEECCCCEEEECCCCCCCEEEEEECCEEEEECCCCEEE RTRGQLQGLPMAWAEALAQGSDALDRLGVQGNLVFDGDWDVDAGDTLRASASLRRTSGDL EECCCCCCCCHHHHHHHHCCCHHHHHCCCCCEEEECCCCCCCCCCCEECCCHHEECCCCE RLLTGNAPAATVVRSSGPSAGTGTGPAAGMVRGTAAAANASATDTPRGAGTPAGVRAAEV EEEECCCCCEEEEECCCCCCCCCCCCCCHHEECCHHHCCCCCCCCCCCCCCCCCCEEEEE 
RVQAEGDTVRARLQWDSERAGQVQAEASTHLARVGEGWEWPADAPLSATARARLPDVGVW EEEECCCEEEEEEEECCCCCCEEEHHHHHHHHHCCCCCCCCCCCCCCCHHHCCCCCCCCC STLAPPGWRVQGTLDADVVLSGTRTAPRWSGTLAADQLAVRSLLDGVDLQNGRLRAALRG CCCCCCCEEEEEECCCEEEEECCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCEEEEEECC DRLEITEFRVNGGPGSSARIAGFSGNRTAAPKDGGTLSGTGTVSWAGMGREAAAGGSGIA CEEEEEEEEECCCCCCCCEEEECCCCCCCCCCCCCEECCCCEEEECCCCCHHCCCCCCEE MDFTADARALQVLVRADRQVSVSGQVRAQLRQGQLSLRGKLTADRATIILPEAGAPSLGS EEECCCHHHHHHHHHCCCEEECCHHHHHHHHCCCEEEEEEEEECCEEEEEECCCCCCCCC DVVVRSAATDRARAEAAQREGAQAGRVEAARPPDIALTFDLGDDFALQGHGVTTRLAGEL CEEEECCCCHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCCEEEECCCEEEEEECEE DIRGATTAGGPPRITGEVRTVEGRYRAWGQALNIETGLARFNGPYDNPALDVLAIRPNIS EECCCCCCCCCCEECCEEEEECCHHHHCCCEECCCHHHHHCCCCCCCCCEEEEEECCCCE VRAGVQVSGTAKAPRVALYSDPELPDAEKLSWVVLGRSTAAGGAEAALLQQAALALLGGG EEECEEEECCCCCCEEEEECCCCCCCHHCEEEEEEECCCCCCCHHHHHHHHHHHHEEECC GGNAGAGNFASRLGLDEIGFRGPNSGASGEDASGAALTFGKRLSKDLYVTYERSLSGALG CCCCCCCHHHHHCCCHHHCCCCCCCCCCCCCCCCCEEHHHHHCCCCEEEEEECCCCCCCE TLYIFYDLTQRLTLRGQTGVQSAVDLIYTLRYD EEEEEEEHHHHHEECCCCCHHHHHHEEEEEECC >Mature Secondary Structure ATHQTPNPYAAAAAPPPRAPRRRALRIALWSLASLIALLLVLLAAAWWWTGTGNSLATA CCCCCCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHEECCCCHHHHHH LAQVARYLPAGQTLETRDVSGSVRAGGQIGWLRWSSPTLVVEVQDARIGWSLPPLLQREL HHHHHHHCCCCCCEEECCCCCCCCCCCEEEEEEECCCEEEEEEECCCCCCCCCHHHHHHC SLGEVHAARIVATPRTPAEPEPDKPVQPLEQLVLPLRIDIPLRVDEITWAAASPATVRNL CCCCEEEEEEEECCCCCCCCCCCCCHHHHHHHHCCEEECCCEEECCEEEECCCCHHHHHH AGRYRFDGDQHSATLDNLELARGRYSGSATLQAQAPMALELTADGVVRTPAPGGGADLEA CCCCCCCCCCCCCCHHHHHHHCCCCCCCEEEEECCCEEEEEECCCEEECCCCCCCCCCCH TARAEVRGTLATAAARLDVSGRLQAVAAQAAAPRGPAASAPRRAGSAPRPAPATPTTPSA HHHHHHHHHHHHHHHEECCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC SVAASSPAGTTPSGTEPMQAQVQARVAPWAPQPLLQAQAQVQALDVAAIWPQAPATRLSG CEECCCCCCCCCCCCCCHHHHHHEEECCCCCCHHHHHHHHHEEEEEEEECCCCCHHHHCC RIEAAPAPATAPGTGAPQAVPQPAPARAPASAASAPSASAGTPPATGWALSAQLENALPG EEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEHHHCCCC PWDKGRLPVESIDARAGFDGARWSVPQATARVGSGTIALEGAFTPATHALQGDLELRGVR 
CCCCCCCCHHHCCCCCCCCCCCCCCCHHHHHCCCCEEEEECCCCCCCHHEECCEEEECCC PDAVLSTLDAAPLSGRARARSDGAPAADHNDAAPVPLVRFSADIRSAGTGRAPREGGTST HHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCHHCCCCCCCCCCCCCCC AQAAPPLRIDRLATEGTWQGTLLTLAQLQLDALQVQASARQLRIDTADRSAQGQLQATLP CCCCCCCEEEHCCCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCEEEEECC GATAQVDGRIAPRAGAGTLDLRVADAQRVQRWIESVPGLGTALGGAALQGEARLDARWNG CCCEEECCEECCCCCCCEEEEEECCHHHHHHHHHHCCCCCHHHCCCEECCCEEEEEEECC GWESLLGQLQTAGLVAKPRTGGATAAGPFELQARLAAPRWEVALPPRPGTGAGPATLRLT CHHHHHHHHHHCCEEECCCCCCCCCCCCCEEEEEECCCCEEEEECCCCCCCCCCCEEEEE AVRADVAGSVPRATLSLDGEARLDERRLDLRLRGSGGAAGPDQWRAQIDELRLQARDGQR EEEECCCCCCCCEEEECCCCCCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCC PGPWTVQFAQPLTVSARTAPTLQVETSGGQARVSAPAPGDVTLRWEPVRFARTASGGLQL CCCEEEEECCCEEEECCCCCEEEEECCCCEEEECCCCCCCEEEEEECCEEEEECCCCEEE RTRGQLQGLPMAWAEALAQGSDALDRLGVQGNLVFDGDWDVDAGDTLRASASLRRTSGDL EECCCCCCCCHHHHHHHHCCCHHHHHCCCCCEEEECCCCCCCCCCCEECCCHHEECCCCE RLLTGNAPAATVVRSSGPSAGTGTGPAAGMVRGTAAAANASATDTPRGAGTPAGVRAAEV EEEECCCCCEEEEECCCCCCCCCCCCCCHHEECCHHHCCCCCCCCCCCCCCCCCCEEEEE RVQAEGDTVRARLQWDSERAGQVQAEASTHLARVGEGWEWPADAPLSATARARLPDVGVW EEEECCCEEEEEEEECCCCCCEEEHHHHHHHHHCCCCCCCCCCCCCCCHHHCCCCCCCCC STLAPPGWRVQGTLDADVVLSGTRTAPRWSGTLAADQLAVRSLLDGVDLQNGRLRAALRG CCCCCCCEEEEEECCCEEEEECCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCEEEEEECC DRLEITEFRVNGGPGSSARIAGFSGNRTAAPKDGGTLSGTGTVSWAGMGREAAAGGSGIA CEEEEEEEEECCCCCCCCEEEECCCCCCCCCCCCCEECCCCEEEECCCCCHHCCCCCCEE MDFTADARALQVLVRADRQVSVSGQVRAQLRQGQLSLRGKLTADRATIILPEAGAPSLGS EEECCCHHHHHHHHHCCCEEECCHHHHHHHHCCCEEEEEEEEECCEEEEEECCCCCCCCC DVVVRSAATDRARAEAAQREGAQAGRVEAARPPDIALTFDLGDDFALQGHGVTTRLAGEL CEEEECCCCHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCCEEEECCCEEEEEECEE DIRGATTAGGPPRITGEVRTVEGRYRAWGQALNIETGLARFNGPYDNPALDVLAIRPNIS EECCCCCCCCCCEECCEEEEECCHHHHCCCEECCCHHHHHCCCCCCCCCEEEEEECCCCE VRAGVQVSGTAKAPRVALYSDPELPDAEKLSWVVLGRSTAAGGAEAALLQQAALALLGGG EEECEEEECCCCCCEEEEECCCCCCCHHCEEEEEEECCCCCCCHHHHHHHHHHHHEEECC GGNAGAGNFASRLGLDEIGFRGPNSGASGEDASGAALTFGKRLSKDLYVTYERSLSGALG 
CCCCCCCHHHHHCCCHHHCCCCCCCCCCCCCCCCCEEHHHHHCCCCEEEEEECCCCCCCE TLYIFYDLTQRLTLRGQTGVQSAVDLIYTLRYD EEEEEEEHHHHHEECCCCCHHHHHHEEEEEECC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 7610040; 9278503 [H]