Definition: Mycobacterium tuberculosis H37Ra, complete genome.
Accession: NC_009525
Length: 4,419,977


The map label for this gene is ppe54 [H]

Identifier: 148660070

GI number: 148660070

Start: 367512

End: 374633

Strand: Reverse

Name: ppe54 [H]

Synonym: MRA_0313

Alternate gene names: 148660070

Gene position: 374633-367512 (Counterclockwise)

Preceding gene: 148660071

Following gene: 148660062

Centisome position: 8.48
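
The coordinate fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not the database's own method: it assumes coordinates are 1-based and inclusive, and that the centisome position is the gene's first transcribed base (the higher coordinate on the reverse strand) expressed as a percentage of genome length. The function names are illustrative only.

```python
# Values copied from this record.
GENOME_LENGTH = 4_419_977
START, END = 367_512, 374_633  # assumed 1-based, inclusive
STRAND = "reverse"


def gene_length(start: int, end: int) -> int:
    """Number of bases spanned by an inclusive coordinate pair."""
    return end - start + 1


def centisome(start: int, end: int, strand: str, genome_length: int) -> float:
    """Position of the gene's first base as a percentage of the genome.

    Assumption: for a reverse-strand gene the first transcribed base is
    the higher coordinate, so that coordinate is used as the reference.
    """
    first_base = end if strand == "reverse" else start
    return 100.0 * first_base / genome_length


print(gene_length(START, END))                                  # 7122
print(round(centisome(START, END, STRAND, GENOME_LENGTH), 2))   # 8.48
```

Under these assumptions both results agree with the record: 7122 bases and a centisome position of 8.48.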

GC content: 60.88
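
A GC percentage such as the one above is conventionally the share of G and C bases in the sequence. The sketch below shows that calculation under the assumption that the input holds only sequence lines (no FASTA header) and may contain line breaks; `gc_content` is a hypothetical helper, not part of any tool referenced by this record.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Remove whitespace so wrapped, multi-line sequences can be passed as-is.
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = cleaned.count("G") + cleaned.count("C")
    return 100.0 * gc / len(cleaned)


print(gc_content("ATGC"))  # 50.0
```

Passing the gene sequence lines listed below (without the `>7122_bases` header) would give the value to compare against the GC content reported here.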

Gene sequence:

>7122_bases
ATGGCGAACACCGGCAATATCAACACCGGCGCTTTCATCTCCGGCAACCACAGCAACGGCCTTCTGTGGCGGGGCGACAA
CCAGGGTCTGATCGACCTCGCCATCGGCGTCGACATTCCCGAAATCCCGATTGTGAGCGTCGACGTGAATATCCCGATTC
ACATACCGATCACCGCCAGCTTCACGGACATCGTATACAGCGGGCTCGATCTTCCACCGAACACAGCCGTCACTGTTATT
TTTTTCGGACCCGTCGATATCGACCCCTTCACCGTCCCAGTGATACGGATAACCGGTCCCACACCTGTGGTCATGGTGGG
TGGACCCACTACCGCGATCAATATCGGCGCCACTGTGGTCGTCGACGCCATCAACATCCCGATTATCCATATTCCAGCGA
CTCCAGGCTTCGGCAACTCGACCGGCGGACTGTCGTCGGGCTTCTTCAATAGCGGCGCTGGCAGCGCCTCGGGCTTCGGC
AACTTCGGGGGCGCCGCGTCGGGCTTTATGAACCTGGTCTCCACAACGTCGGGAATGTCGGGCTTCCTCAACGTCGGCGC
GCTGGGATCGGGTGTGGCGAATGTGGGCAACACCATCTCGGGTATCTACAACGTGGGCACGTCGGACCTCTCGACGCCCG
CCGTTAACTCCGGGTTGGCAAATATCGGAACCAATATTGCCGGCCTGCTGCGCGACGGCGCGGGTACTGCGGCTATTAAC
TTGGGCTTGGCCAACCACGGCAACCTCAACGTGGGCTTCGCAAGTCTCGGCGGCTTTAACTTCGGCGGCGCCACCATCGG
CCACAACAACGTCGGCATCGGGAACACCGGAATCTTCGATGTCGGCCTGGCGAACCTGGGCAGCTACAACATCGGCTTCG
GAAACCTTGGCGACGACAACCTGGGCTTCGGCAACTTCGGCAGCTACAACATCGGCTTCGGCAACGTCGGCAACGACAAT
CTGGGTTTCGCTAACGCGGGCGGCGGCAACATCGGCTTTGCGAACACCGGCAGCAACAATGTCGGCTTTGGGAACACGGG
CAGCAACAATGTCGGCATCGGGCTCACGGGCAACGGACAGATCGGGTTCGGCAGCTTCAACTCGGGCAGCGGAAACATCG
GCCTGTTCAACTCGGGCAGCAACAACATCGGATTCTTCAATTCCGGCAGCGGCAACTTCGGCATCGCAAACTCGGGCAGC
TTCAACACTGGCATCGGAAACACCGGCAACACCAATACCGGCCTATTCAACTCCGGCGACGTCAACACGGGCGCCTTCAA
CCCGGGCAGCTTCAACACCGGTAGCTTCAACACCGGCAGCTTCAACACCGGTGGCTTCAATCCGGGCAATACCAACACCG
GCTACCTCAACATTGGCAACTACAACACCGGCATCGCCAACACCGGCGACGTTGACACCGGGGCTTTCATCACCGGAAAC
TACAGCAACGGGTTGTTCTTAAGCGGCGATTACCAGGGCCTGGTCGGCCTCAACCTGGTGATCGATATGCCTCTCCCCAT
AAGCCTCGGCGTGAATATTCCCATCGATATCCCGATCACCGCCTCGGCCGGCAACATCACCCTTATGGGCGTCACGATTC
CGCCCACCGGCGATATCGTCCTTTCGTCAATAGCGGGCCAGCGAGCCCACTTTGGCCCCATTACCATTCCGAACATCACG
GTTGTCGGCCCCACGACGACAGTCGCCATAGGAGGGCCGAATACCGCGATCACCATAACTGGCGGTGGCGCCATTAGGAT
CCCGCTCATCAGTATCCCGGCGGCGCCAGGTTTCGGAAACTCGACCACCAACCCGTCGTCAGGTTTCTTCAATACCGGCG
CCGGCGGCGCCTCGGGCTTCGGCAACTTCGGCGGCGCCAATTCGGGCTTTTGGAACCTGGCCTCCGCGACCTCGGGGGCG
TCGGGGCTCCTCAACGTCGGCGCCCTGGGATCAGGTCTGGCGAACGTGGGCACCACCGTCTCGGGGTTCTACAACACCAG
CACGTCGGACCTCGCGACGCCGGCCTTCAATTCAGGCCTGGCCAACATCAGCACCAGTATCGCCGGCCTGCTGCGCGACA
GCACGGGCACCATGGTCCTCAACCTGGGCTTGGCAAACCACGGCACCCTCAACGTCGGCATTGCAAACCTCGGCGACTAC
AACATCGGCTTTGCAAACCTCGGCAGCGCCAACTTCGGCAGCGCCAATATCGGTGGCAACAACATCGGCGGCGCAAACAC
CGGAATATTCGACATCGGTTTGGCAAATCTGGGCAGTTACAACATCGGCTTCGGAAACTTCGGCGATGACAACCTGGGCT
TCGGAAACCTCGGCAGCTACAACGTCGGCTTCGGAAACTTGGGCAACGACAACCTGGGCTTCGCCAACACCGGCAGCAAC
AATATCGGGTTCGCGAACACCGGCAGCAACAATATCGGCATTGGGCTCACGGGCGACGGCCAGATCGGGTTCGGCTCCCT
GAATTCTGGCAGCGGAAACATCGGCTTGTTCAACTCGGGCAGCGGAAACATCGGCTTTTTCAACTCGGGCAACGGAAACG
TTGGCATCGGCAACACCGGCACCGCAAACTTCGGGCTTGGAAACACCGGCAGCACCAACACCGGCTTCTTCAACTCCGGC
GACGTCAATACCGGTATCGGCAACACCGGCAGCTTCAACACCGGCAGCTTCAATCCGGGCGATTCCAACACCGGGGATTT
CAACCCAGGCAGCTACAACACGGGACTCGGAAACACCGGCGATGTTGACACCGGCGCCTTCATCTCCGGCAGCTACAGCA
ACGGGTTCTTGTGGAGTGGAAATTATCAGGGCCTCATTGGCTTGCACGCGGCGCTAGCGATTCCCGAAATCGCCCTAACC
TTTGGCGTCGACATCCCGATACATATACCCATCAACATCGACGCCGGGGTCGTCACCCTCCAGGGCTTCAGCATCGTAGC
TGCCGAAAATAATATCGACTTCACCCCCATCATCATCCCGACCATCAATATCACCTTGCCCACGGCGGCGATCACCGTGG
GCGGACCCACCACCTCGATCGGTATCACCGCCAGCGCCGGTATCGGCTCCATCACCATCCCGATCATCGACATTCCCGCG
ACATCGGGCTTCGGCAACTCGACCACTAGTCCGTCGTCGGGCTTCTTCAACTCCGGAGCGGGCAGCGCGTCGGGCTTTTT
GAACGTGGTCGCCGGCGCCTCAGGGATTTCGGGTTATCTCAATGTCGGTGCGCTGGGGTCGGGTGTGACTAACGTGGGTC
ACACCGTCTCGGGTTTCTACAACGCGAGCGCGTTGGACCTCGTGACGCCGGCCTTTGCCTCCGGTCTCATGCGCGACGGT
ATGGGCACGATGACTCTGAACCTTGGGCTGGCAAACCTGGGCAGCAATAACGCCGGCTTCGGCAACACCGGGATCTTTGA
CGTCGGCGTGGCGAATCTGGGCAACTACAACATCGGCTTCGGAAACTTCGGCGACGACAACCTGGGCTTTGCCAACCTAG
GCAGCTACAACATCGGCGTTGCCAACACCGGCAGCAACAATATCGGCTTTGCCAACACCGGCAGCAACAATATCGGCATC
GGGCTCACCGGTACCGGCCAAATCGGGATCGGCGCTCTGAACTCGGGCAGCGGAAACATCGGCTTGTTCAACTCGGGCGA
CGGAAACATCGGCTTCTTTAACTCGGGCACCGGGAACTTCGGCATCGGCAACACCGGCACCGGAAACTTCGGCATCGGCA
ACTCGGGCAGCACCAGCACGGGCTTGTTCAACTCGGGCGACGGCAACACCGGCGGCTTCAACCCCGGTAACTTCAACACC
GGCAATTTCAATACCGGCAGCTTCAACACCGGCGGCTTCAACGCGGGTAACACCAACACCGGCCACTTCAACACCGGGAA
CTACAACACCGGCATCGCGAATACGGGCGACGTCAGCACCGGCGCTTTCATCTCCGGCAACTACAGCAACGGCATCTTGT
GGCGGGGCGACTACCAGGGCCTGATCGGTTACTCCTACGCGCTGACTATTCCGGAGATTCCGGCGCACTTGGACGTCAAT
ATCCCAATCGACATACCGATCACCGGCAGTTTCACCGACCTCGTGGTGGACAATTTCACTATCCCCATCATCGGCTTCGA
ATCCTTCGCGTTTAGCTTTCACATCCATACCGAGCCGGACATCGGTCCCATCATTGTCCCGAGCTTCGTGCTCAGCGTTC
CCACGTTCGCGATCGCCGTGGGCGGACCCACGACCGCGATCAACATCAGCGCCACCGCCGGCCTCGGCCCCATCACCATC
CCGATCATCGACATTCCGGCAGCGCCGGGCATCGGAAACTCGACCACCAGCCCGTCGTCAGGCTTCTTCAACACCGGCGC
CGGCACCGCATCCGGGTTCGGCAACGTCGGCGGCAACACATCGGGCCTGTGGAACCTTGCGTCGGCAGCCTCAGGAGTCT
CGGGCTTGCTCAACGTCGGCGCGTTGGGATCGGGTGTGGCGAATGTGGGCAACACCATCTCGGGTATCTACAACACGAGC
CCGCTGGACCTCGGGACGCCGGCCTTCGGCTCCGGCCTCGCAAACATCGCCGGCCTGCTGCAGGGCGGCGCCGGCACGAC
GATCCTCGACTTGGCCGGCCTCGGCAACCTCAATGTCGGCTTGGCAAACCTCGGGGGCTCTAACTTCGGGATCGGGAACA
CCGGAATCTTCAATGTCGGTTTCGCAAACGTGGGCAACCACAACATTGGCTTGGCAAACCTGGGCAACTACAGCGTCGGC
TTCGCCAACTCGGGCAACTACCATATCGGCATTGCTAACACCGGCAGTGCCAATATCGGCTTCGCCAACACCGGTAGCGG
CAATATCGGCATCGGGCTCACCGGCACCGGTCAGATCGGGTTCGGCAGCTTCAACTCGGGCAGCCACAACATCGGCTTGT
TCAACTCCGGTGACGGAAACGTAGGATTCTTCAACTCGGGCACCGGCAACGTGGGCATCGGAAACACCGGCACCGCAAAC
TTCGGCATCGCAAACTCGGGCAGCTTCAACACCGGCCTCGGGAACACGGGCAGCACCAACACGGGCCTGTTCAACCCGGG
CAACGTCAACACCGGCGTCGGCAACACCGGCAGCATCAACACCGGCAGCATCAACACCGGCAGCTTCAACACTGGCAGCA
CCAATACCGGCAGCTTCAACCTCGGCGATCACAACACCGGCAGCTTCAACTCCGGTGACTACAACACGGGCTACTTCAAC
GCGGGTGACTACAACACGGGTGTGGCCAACACGGGCAACGTCAACACCGGCGCGTTCATCTCCGGCAATTACAGCAACGG
GTTCTTCTGGCGAGGTGACTACCAGGGGTTGATTGGCCTTTCCACAACGATCACCATTCCCGAAATCCCCTACCGCTACG
ACTTGAGTGTTCCAATCGACATACCCATCACCGGCACCGTCGTCGCCACCACGCCAAACAGTTTCACCATTCCCGGTTTC
CAGATACGAGTCTTGCTTGGTCCTGCGGCGGTGCTTGTCAACGAGATGATCGGCCCCATCACGATCGATGTCAATCAAGT
CATCGCCATCGATTCGCCCATTCAGCAAACCATCAGCATGGTTGGCACCGGCGGCTTCGGCCCGATCCCCATCGGCATCA
GCATCGGTGGTACCCCGGGTTTCGGCAACTCGACCACCGGCCCGTCGTCGGGTTTCTTCCACACCGGCGCCGGCCATGTA
TCGGGCTTCGGGAACTTCGGCGCCGGCAACATGTCGGGCTCCGGGAACTTCGGCGCTGGCAATTCGGGCTTCTTTAACGC
CGGCGGCTTGGGCAATTCGGGCCTACTGAATTTCGGCGCGCTGCAGTCGGGTCTGGCGAACCTGGGCAACACCATCTCGG
GCGTCTACAACACGAGCACGCTGGACCTCGCGACGCCCGCCTTCGGCTCGGGCATCGCAAACATCGGCGCCAACCTGGCC
GGCCTGTTCCTCGACAACACCGGCAACCTGACGCTGAACTTCGGCGTCGCAAACCAGGGCGGCCTCAACGCGGGCATCGG
GAACCTGGGCAGCGTCAACATCGGCTTCGTTAATACCGGCGACTCCAACCTGGGCATCGGCAACCTCGGCGACCTCAACT
TCGGCGGGGTCAACATCGGCGGTAACAACATCGGCATCGCCAACACCGGGATCTTCGATATCGGCTTGGCGAACCTGGGC
AGCTACAACATCGGGTTGGCAAATCTGGGCGACGACAACCTGGGCTTTGGCAACGCCGGCAGCTACAACATCGGCTTCGC
GAACTTCGGCAGCGACAACCTGGGCTTTGCCAACACCGGCAGCTACAACATCGGCTTCGCGAATACCGGTAACAACAACA
TCGGCGTCGGGCTCACCGGCAACGGCCAGATCGGGATCGGCAGCCTCAACTCGGGCAGCAACAACATCGGGCTGTTCAAC
TCCGGCAGCGGAAACATCGGGTTCTTCAACTCGGGCACCGGCAACGTCGGCATCTTTAACACCGGCACCGGCAACTTCGG
TCTCGCGAACTCGGGCGGCTTCAACACCGGCATCGGCAACGCGGGCAGCACCAACACGGGCGTGTTCAACCCCGGGGACC
TCAACACCGGCAGCTTCAACCCGGGCAGCTTCAACACCGGCGGCTTCAACCCGGGCAGTGGCAACACGGGCTACCTCAAC
ACCGGTGACTACAACACGGGCGTGGCGAACACGGGCGATGTGGACACCGGTGCGTTCATTACCGGCAGCTACAGCAACGG
CTTCTTGGTGAGTGGCGACTATCAGGGCCTGATCGGCCTGCCGCTGTTGGGCATTCCGGTGACCCCCGGCTACTTCAACC
TCACTGGCGGCCCGTCGTCGGGCTTCTTCAACAGCGGCGCCGGAAGCGTATCGGGATTCGTGAACTCCGGTGCCGGCCTG
TCGGGCTACCTCAATACCGGCGCGCTGGGATCGGGTGTCGCCAACGTGGGCAACACCATCTCGGGCTGGTTGAACGCCAG
CGCGCTGGATCTCGCGACGCCGGGGTTCCTTTCCGGCATCGGTAACTTTGGCACCAACCTGGCGGGTTTCTTTAGGGGAT
AA

Upstream 100 bases:

>100_bases
AACACCGGCAGCGTCAACACCGGCAGCGTCAACACCGGTGGCTTCAATGCAGGTAACTACAACACCGGCTACTTCAACAC
CGGCGACTTACAACACCGGC

Downstream 100 bases:

>100_bases
CGCGCAGCTATGTCACCTGGGCCAGCAGCCGTTCACTGGTCTCCCAGAGGTCGAGCGCCTTTGCCCGATCGTAGGACTCG
GCGGAAGACCGGATCGCCTT

Product: PPE family protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 2373; Mature: 2372
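
The two residue counts follow from the gene length. The sketch below assumes that the 7122-base gene ends in a single stop codon (the sequence above ends in TAA) and that "Mature" means the initiator methionine has been removed, which matches the mature sequence starting at ANTG rather than MANTG. The function names are illustrative.

```python
GENE_LENGTH_BASES = 7122  # from the ">7122_bases" header in this record


def translated_length(n_bases: int) -> int:
    """Residues encoded by a coding sequence that includes one stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return n_bases // 3 - 1  # the stop codon encodes no residue


def mature_length(n_translated: int) -> int:
    """Residues left after removing the initiator Met (assumption)."""
    return n_translated - 1


translated = translated_length(GENE_LENGTH_BASES)
print(translated)                 # 2373
print(mature_length(translated))  # 2372
```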

Protein sequence:

>2373_residues
MANTGNINTGAFISGNHSNGLLWRGDNQGLIDLAIGVDIPEIPIVSVDVNIPIHIPITASFTDIVYSGLDLPPNTAVTVI
FFGPVDIDPFTVPVIRITGPTPVVMVGGPTTAINIGATVVVDAINIPIIHIPATPGFGNSTGGLSSGFFNSGAGSASGFG
NFGGAASGFMNLVSTTSGMSGFLNVGALGSGVANVGNTISGIYNVGTSDLSTPAVNSGLANIGTNIAGLLRDGAGTAAIN
LGLANHGNLNVGFASLGGFNFGGATIGHNNVGIGNTGIFDVGLANLGSYNIGFGNLGDDNLGFGNFGSYNIGFGNVGNDN
LGFANAGGGNIGFANTGSNNVGFGNTGSNNVGIGLTGNGQIGFGSFNSGSGNIGLFNSGSNNIGFFNSGSGNFGIANSGS
FNTGIGNTGNTNTGLFNSGDVNTGAFNPGSFNTGSFNTGSFNTGGFNPGNTNTGYLNIGNYNTGIANTGDVDTGAFITGN
YSNGLFLSGDYQGLVGLNLVIDMPLPISLGVNIPIDIPITASAGNITLMGVTIPPTGDIVLSSIAGQRAHFGPITIPNIT
VVGPTTTVAIGGPNTAITITGGGAIRIPLISIPAAPGFGNSTTNPSSGFFNTGAGGASGFGNFGGANSGFWNLASATSGA
SGLLNVGALGSGLANVGTTVSGFYNTSTSDLATPAFNSGLANISTSIAGLLRDSTGTMVLNLGLANHGTLNVGIANLGDY
NIGFANLGSANFGSANIGGNNIGGANTGIFDIGLANLGSYNIGFGNFGDDNLGFGNLGSYNVGFGNLGNDNLGFANTGSN
NIGFANTGSNNIGIGLTGDGQIGFGSLNSGSGNIGLFNSGSGNIGFFNSGNGNVGIGNTGTANFGLGNTGSTNTGFFNSG
DVNTGIGNTGSFNTGSFNPGDSNTGDFNPGSYNTGLGNTGDVDTGAFISGSYSNGFLWSGNYQGLIGLHAALAIPEIALT
FGVDIPIHIPINIDAGVVTLQGFSIVAAENNIDFTPIIIPTINITLPTAAITVGGPTTSIGITASAGIGSITIPIIDIPA
TSGFGNSTTSPSSGFFNSGAGSASGFLNVVAGASGISGYLNVGALGSGVTNVGHTVSGFYNASALDLVTPAFASGLMRDG
MGTMTLNLGLANLGSNNAGFGNTGIFDVGVANLGNYNIGFGNFGDDNLGFANLGSYNIGVANTGSNNIGFANTGSNNIGI
GLTGTGQIGIGALNSGSGNIGLFNSGDGNIGFFNSGTGNFGIGNTGTGNFGIGNSGSTSTGLFNSGDGNTGGFNPGNFNT
GNFNTGSFNTGGFNAGNTNTGHFNTGNYNTGIANTGDVSTGAFISGNYSNGILWRGDYQGLIGYSYALTIPEIPAHLDVN
IPIDIPITGSFTDLVVDNFTIPIIGFESFAFSFHIHTEPDIGPIIVPSFVLSVPTFAIAVGGPTTAINISATAGLGPITI
PIIDIPAAPGIGNSTTSPSSGFFNTGAGTASGFGNVGGNTSGLWNLASAASGVSGLLNVGALGSGVANVGNTISGIYNTS
PLDLGTPAFGSGLANIAGLLQGGAGTTILDLAGLGNLNVGLANLGGSNFGIGNTGIFNVGFANVGNHNIGLANLGNYSVG
FANSGNYHIGIANTGSANIGFANTGSGNIGIGLTGTGQIGFGSFNSGSHNIGLFNSGDGNVGFFNSGTGNVGIGNTGTAN
FGIANSGSFNTGLGNTGSTNTGLFNPGNVNTGVGNTGSINTGSINTGSFNTGSTNTGSFNLGDHNTGSFNSGDYNTGYFN
AGDYNTGVANTGNVNTGAFISGNYSNGFFWRGDYQGLIGLSTTITIPEIPYRYDLSVPIDIPITGTVVATTPNSFTIPGF
QIRVLLGPAAVLVNEMIGPITIDVNQVIAIDSPIQQTISMVGTGGFGPIPIGISIGGTPGFGNSTTGPSSGFFHTGAGHV
SGFGNFGAGNMSGSGNFGAGNSGFFNAGGLGNSGLLNFGALQSGLANLGNTISGVYNTSTLDLATPAFGSGIANIGANLA
GLFLDNTGNLTLNFGVANQGGLNAGIGNLGSVNIGFVNTGDSNLGIGNLGDLNFGGVNIGGNNIGIANTGIFDIGLANLG
SYNIGLANLGDDNLGFGNAGSYNIGFANFGSDNLGFANTGSYNIGFANTGNNNIGVGLTGNGQIGIGSLNSGSNNIGLFN
SGSGNIGFFNSGTGNVGIFNTGTGNFGLANSGGFNTGIGNAGSTNTGVFNPGDLNTGSFNPGSFNTGGFNPGSGNTGYLN
TGDYNTGVANTGDVDTGAFITGSYSNGFLVSGDYQGLIGLPLLGIPVTPGYFNLTGGPSSGFFNSGAGSVSGFVNSGAGL
SGYLNTGALGSGVANVGNTISGWLNASALDLATPGFLSGIGNFGTNLAGFFRG

Sequences:

>Translated_2373_residues
MANTGNINTGAFISGNHSNGLLWRGDNQGLIDLAIGVDIPEIPIVSVDVNIPIHIPITASFTDIVYSGLDLPPNTAVTVI
FFGPVDIDPFTVPVIRITGPTPVVMVGGPTTAINIGATVVVDAINIPIIHIPATPGFGNSTGGLSSGFFNSGAGSASGFG
NFGGAASGFMNLVSTTSGMSGFLNVGALGSGVANVGNTISGIYNVGTSDLSTPAVNSGLANIGTNIAGLLRDGAGTAAIN
LGLANHGNLNVGFASLGGFNFGGATIGHNNVGIGNTGIFDVGLANLGSYNIGFGNLGDDNLGFGNFGSYNIGFGNVGNDN
LGFANAGGGNIGFANTGSNNVGFGNTGSNNVGIGLTGNGQIGFGSFNSGSGNIGLFNSGSNNIGFFNSGSGNFGIANSGS
FNTGIGNTGNTNTGLFNSGDVNTGAFNPGSFNTGSFNTGSFNTGGFNPGNTNTGYLNIGNYNTGIANTGDVDTGAFITGN
YSNGLFLSGDYQGLVGLNLVIDMPLPISLGVNIPIDIPITASAGNITLMGVTIPPTGDIVLSSIAGQRAHFGPITIPNIT
VVGPTTTVAIGGPNTAITITGGGAIRIPLISIPAAPGFGNSTTNPSSGFFNTGAGGASGFGNFGGANSGFWNLASATSGA
SGLLNVGALGSGLANVGTTVSGFYNTSTSDLATPAFNSGLANISTSIAGLLRDSTGTMVLNLGLANHGTLNVGIANLGDY
NIGFANLGSANFGSANIGGNNIGGANTGIFDIGLANLGSYNIGFGNFGDDNLGFGNLGSYNVGFGNLGNDNLGFANTGSN
NIGFANTGSNNIGIGLTGDGQIGFGSLNSGSGNIGLFNSGSGNIGFFNSGNGNVGIGNTGTANFGLGNTGSTNTGFFNSG
DVNTGIGNTGSFNTGSFNPGDSNTGDFNPGSYNTGLGNTGDVDTGAFISGSYSNGFLWSGNYQGLIGLHAALAIPEIALT
FGVDIPIHIPINIDAGVVTLQGFSIVAAENNIDFTPIIIPTINITLPTAAITVGGPTTSIGITASAGIGSITIPIIDIPA
TSGFGNSTTSPSSGFFNSGAGSASGFLNVVAGASGISGYLNVGALGSGVTNVGHTVSGFYNASALDLVTPAFASGLMRDG
MGTMTLNLGLANLGSNNAGFGNTGIFDVGVANLGNYNIGFGNFGDDNLGFANLGSYNIGVANTGSNNIGFANTGSNNIGI
GLTGTGQIGIGALNSGSGNIGLFNSGDGNIGFFNSGTGNFGIGNTGTGNFGIGNSGSTSTGLFNSGDGNTGGFNPGNFNT
GNFNTGSFNTGGFNAGNTNTGHFNTGNYNTGIANTGDVSTGAFISGNYSNGILWRGDYQGLIGYSYALTIPEIPAHLDVN
IPIDIPITGSFTDLVVDNFTIPIIGFESFAFSFHIHTEPDIGPIIVPSFVLSVPTFAIAVGGPTTAINISATAGLGPITI
PIIDIPAAPGIGNSTTSPSSGFFNTGAGTASGFGNVGGNTSGLWNLASAASGVSGLLNVGALGSGVANVGNTISGIYNTS
PLDLGTPAFGSGLANIAGLLQGGAGTTILDLAGLGNLNVGLANLGGSNFGIGNTGIFNVGFANVGNHNIGLANLGNYSVG
FANSGNYHIGIANTGSANIGFANTGSGNIGIGLTGTGQIGFGSFNSGSHNIGLFNSGDGNVGFFNSGTGNVGIGNTGTAN
FGIANSGSFNTGLGNTGSTNTGLFNPGNVNTGVGNTGSINTGSINTGSFNTGSTNTGSFNLGDHNTGSFNSGDYNTGYFN
AGDYNTGVANTGNVNTGAFISGNYSNGFFWRGDYQGLIGLSTTITIPEIPYRYDLSVPIDIPITGTVVATTPNSFTIPGF
QIRVLLGPAAVLVNEMIGPITIDVNQVIAIDSPIQQTISMVGTGGFGPIPIGISIGGTPGFGNSTTGPSSGFFHTGAGHV
SGFGNFGAGNMSGSGNFGAGNSGFFNAGGLGNSGLLNFGALQSGLANLGNTISGVYNTSTLDLATPAFGSGIANIGANLA
GLFLDNTGNLTLNFGVANQGGLNAGIGNLGSVNIGFVNTGDSNLGIGNLGDLNFGGVNIGGNNIGIANTGIFDIGLANLG
SYNIGLANLGDDNLGFGNAGSYNIGFANFGSDNLGFANTGSYNIGFANTGNNNIGVGLTGNGQIGIGSLNSGSNNIGLFN
SGSGNIGFFNSGTGNVGIFNTGTGNFGLANSGGFNTGIGNAGSTNTGVFNPGDLNTGSFNPGSFNTGGFNPGSGNTGYLN
TGDYNTGVANTGDVDTGAFITGSYSNGFLVSGDYQGLIGLPLLGIPVTPGYFNLTGGPSSGFFNSGAGSVSGFVNSGAGL
SGYLNTGALGSGVANVGNTISGWLNASALDLATPGFLSGIGNFGTNLAGFFRG
>Mature_2372_residues
ANTGNINTGAFISGNHSNGLLWRGDNQGLIDLAIGVDIPEIPIVSVDVNIPIHIPITASFTDIVYSGLDLPPNTAVTVIF
FGPVDIDPFTVPVIRITGPTPVVMVGGPTTAINIGATVVVDAINIPIIHIPATPGFGNSTGGLSSGFFNSGAGSASGFGN
FGGAASGFMNLVSTTSGMSGFLNVGALGSGVANVGNTISGIYNVGTSDLSTPAVNSGLANIGTNIAGLLRDGAGTAAINL
GLANHGNLNVGFASLGGFNFGGATIGHNNVGIGNTGIFDVGLANLGSYNIGFGNLGDDNLGFGNFGSYNIGFGNVGNDNL
GFANAGGGNIGFANTGSNNVGFGNTGSNNVGIGLTGNGQIGFGSFNSGSGNIGLFNSGSNNIGFFNSGSGNFGIANSGSF
NTGIGNTGNTNTGLFNSGDVNTGAFNPGSFNTGSFNTGSFNTGGFNPGNTNTGYLNIGNYNTGIANTGDVDTGAFITGNY
SNGLFLSGDYQGLVGLNLVIDMPLPISLGVNIPIDIPITASAGNITLMGVTIPPTGDIVLSSIAGQRAHFGPITIPNITV
VGPTTTVAIGGPNTAITITGGGAIRIPLISIPAAPGFGNSTTNPSSGFFNTGAGGASGFGNFGGANSGFWNLASATSGAS
GLLNVGALGSGLANVGTTVSGFYNTSTSDLATPAFNSGLANISTSIAGLLRDSTGTMVLNLGLANHGTLNVGIANLGDYN
IGFANLGSANFGSANIGGNNIGGANTGIFDIGLANLGSYNIGFGNFGDDNLGFGNLGSYNVGFGNLGNDNLGFANTGSNN
IGFANTGSNNIGIGLTGDGQIGFGSLNSGSGNIGLFNSGSGNIGFFNSGNGNVGIGNTGTANFGLGNTGSTNTGFFNSGD
VNTGIGNTGSFNTGSFNPGDSNTGDFNPGSYNTGLGNTGDVDTGAFISGSYSNGFLWSGNYQGLIGLHAALAIPEIALTF
GVDIPIHIPINIDAGVVTLQGFSIVAAENNIDFTPIIIPTINITLPTAAITVGGPTTSIGITASAGIGSITIPIIDIPAT
SGFGNSTTSPSSGFFNSGAGSASGFLNVVAGASGISGYLNVGALGSGVTNVGHTVSGFYNASALDLVTPAFASGLMRDGM
GTMTLNLGLANLGSNNAGFGNTGIFDVGVANLGNYNIGFGNFGDDNLGFANLGSYNIGVANTGSNNIGFANTGSNNIGIG
LTGTGQIGIGALNSGSGNIGLFNSGDGNIGFFNSGTGNFGIGNTGTGNFGIGNSGSTSTGLFNSGDGNTGGFNPGNFNTG
NFNTGSFNTGGFNAGNTNTGHFNTGNYNTGIANTGDVSTGAFISGNYSNGILWRGDYQGLIGYSYALTIPEIPAHLDVNI
PIDIPITGSFTDLVVDNFTIPIIGFESFAFSFHIHTEPDIGPIIVPSFVLSVPTFAIAVGGPTTAINISATAGLGPITIP
IIDIPAAPGIGNSTTSPSSGFFNTGAGTASGFGNVGGNTSGLWNLASAASGVSGLLNVGALGSGVANVGNTISGIYNTSP
LDLGTPAFGSGLANIAGLLQGGAGTTILDLAGLGNLNVGLANLGGSNFGIGNTGIFNVGFANVGNHNIGLANLGNYSVGF
ANSGNYHIGIANTGSANIGFANTGSGNIGIGLTGTGQIGFGSFNSGSHNIGLFNSGDGNVGFFNSGTGNVGIGNTGTANF
GIANSGSFNTGLGNTGSTNTGLFNPGNVNTGVGNTGSINTGSINTGSFNTGSTNTGSFNLGDHNTGSFNSGDYNTGYFNA
GDYNTGVANTGNVNTGAFISGNYSNGFFWRGDYQGLIGLSTTITIPEIPYRYDLSVPIDIPITGTVVATTPNSFTIPGFQ
IRVLLGPAAVLVNEMIGPITIDVNQVIAIDSPIQQTISMVGTGGFGPIPIGISIGGTPGFGNSTTGPSSGFFHTGAGHVS
GFGNFGAGNMSGSGNFGAGNSGFFNAGGLGNSGLLNFGALQSGLANLGNTISGVYNTSTLDLATPAFGSGIANIGANLAG
LFLDNTGNLTLNFGVANQGGLNAGIGNLGSVNIGFVNTGDSNLGIGNLGDLNFGGVNIGGNNIGIANTGIFDIGLANLGS
YNIGLANLGDDNLGFGNAGSYNIGFANFGSDNLGFANTGSYNIGFANTGNNNIGVGLTGNGQIGIGSLNSGSNNIGLFNS
GSGNIGFFNSGTGNVGIFNTGTGNFGLANSGGFNTGIGNAGSTNTGVFNPGDLNTGSFNPGSFNTGGFNPGSGNTGYLNT
GDYNTGVANTGDVDTGAFITGSYSNGFLVSGDYQGLIGLPLLGIPVTPGYFNLTGGPSSGFFNSGAGSVSGFVNSGAGLS
GYLNTGALGSGVANVGNTISGWLNASALDLATPGFLSGIGNFGTNLAGFFRG

Specific function: Unknown

COG id: COG5651

COG function: function code N; PPE-repeat proteins

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the mycobacterial PPE family [H]

Homologues:

Organism=Homo sapiens, GI14149793, Length=331, Percent_Identity=24.4712990936556, Blast_Score=89, Evalue=5e-17

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR002989
- InterPro:   IPR000030 [H]

Pfam domain/function: PF01469 Pentapeptide_2; PF00823 PPE [H]

EC number: NA

Molecular weight: Translated: 232676; Mature: 232545

Theoretical pI: Translated: 3.69; Mature: 3.69

Prosite motif: PS00178 AA_TRNA_LIGASE_I

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.0 %Cys     (Translated Protein)
0.5 %Met     (Translated Protein)
0.5 %Cys+Met (Translated Protein)
0.0 %Cys     (Mature Protein)
0.5 %Met     (Mature Protein)
0.5 %Cys+Met (Mature Protein)
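
Percentages like these can be reproduced by counting residues in the protein sequence. This is a minimal sketch, assuming one-letter amino acid codes and a sequence passed without its FASTA header; `residue_percentages` is a hypothetical helper name.

```python
def residue_percentages(protein: str, residues=("C", "M")) -> dict:
    """Return the percentage of each requested residue in a protein."""
    # Remove whitespace so wrapped, multi-line sequences can be passed as-is.
    cleaned = "".join(protein.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    return {r: 100.0 * cleaned.count(r) / len(cleaned) for r in residues}


# First 20 residues of the translated protein in this record.
print(residue_percentages("MANTGNINTGAFISGNHSNG"))  # {'C': 0.0, 'M': 5.0}
```

Summing the two values gives the combined Cys+Met figure; applying the function to the full translated and mature sequences would give the values to compare against the table above.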

Secondary structure:

>Translated Secondary Structure
MANTGNINTGAFISGNHSNGLLWRGDNQGLIDLAIGVDIPEIPIVSVDVNIPIHIPITAS
CCCCCCCCCCEEEECCCCCCEEEECCCCCEEEEEECCCCCCCCEEEEEECCEEEEEEECC
FTDIVYSGLDLPPNTAVTVIFFGPVDIDPFTVPVIRITGPTPVVMVGGPTTAINIGATVV
HHHHHHCCCCCCCCCEEEEEEECCCCCCCEEEEEEEECCCCCEEEECCCCEEEECCEEEE
VDAINIPIIHIPATPGFGNSTGGLSSGFFNSGAGSASGFGNFGGAASGFMNLVSTTSGMS
EEEECCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCC
GFLNVGALGSGVANVGNTISGIYNVGTSDLSTPAVNSGLANIGTNIAGLLRDGAGTAAIN
CEEECCCCCCCHHHCCCHHHHEEECCCCCCCCCCCCCCHHHHCCCCEEHHCCCCCEEEEE
LGLANHGNLNVGFASLGGFNFGGATIGHNNVGIGNTGIFDVGLANLGSYNIGFGNLGDDN
EEECCCCCEEEEEEECCCCCCCCEEECCCCCCCCCCCEEEEHHHCCCCCCCCCCCCCCCC
LGFGNFGSYNIGFGNVGNDNLGFANAGGGNIGFANTGSNNVGFGNTGSNNVGIGLTGNGQ
CCCCCCCCEECCCCCCCCCCCCEECCCCCCEEEECCCCCCCCCCCCCCCCEEEEEECCCE
IGFGSFNSGSGNIGLFNSGSNNIGFFNSGSGNFGIANSGSFNTGIGNTGNTNTGLFNSGD
EEECCCCCCCCCEEEEECCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCC
VNTGAFNPGSFNTGSFNTGSFNTGGFNPGNTNTGYLNIGNYNTGIANTGDVDTGAFITGN
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCEEEEC
YSNGLFLSGDYQGLVGLNLVIDMPLPISLGVNIPIDIPITASAGNITLMGVTIPPTGDIV
CCCCEEEECCCCCEEEEEEEEECCCCEEECCCCCEEEEEECCCCCEEEEEEEECCCCCCC
LSSIAGQRAHFGPITIPNITVVGPTTTVAIGGPNTAITITGGGAIRIPLISIPAAPGFGN
EEHHCCCCCCCCCEECCCEEEECCCEEEEECCCCEEEEEECCCEEEEEEEEECCCCCCCC
STTNPSSGFFNTGAGGASGFGNFGGANSGFWNLASATSGASGLLNVGALGSGLANVGTTV
CCCCCCCCCEECCCCCCCCCCCCCCCCCCCEEECCCCCCCCCEEEECCCCCCHHHCCCEE
SGFYNTSTSDLATPAFNSGLANISTSIAGLLRDSTGTMVLNLGLANHGTLNVGIANLGDY
EEEECCCCCCCCCCCCCCCHHHHHHHHHHHEECCCCEEEEEEECCCCCEEEEEEEECCCC
NIGFANLGSANFGSANIGGNNIGGANTGIFDIGLANLGSYNIGFGNFGDDNLGFGNLGSY
CCCEEECCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCEECCCCCCCCCCCCCCCCCCC
NVGFGNLGNDNLGFANTGSNNIGFANTGSNNIGIGLTGDGQIGFGSLNSGSGNIGLFNSG
CCCCCCCCCCCCCEEECCCCCCEEEECCCCCEEEEEECCCCEEECCCCCCCCCEEEEECC
SGNIGFFNSGNGNVGIGNTGTANFGLGNTGSTNTGFFNSGDVNTGIGNTGSFNTGSFNPG
CCCEEEEECCCCCEECCCCCCCCCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCCCCCC
DSNTGDFNPGSYNTGLGNTGDVDTGAFISGSYSNGFLWSGNYQGLIGLHAALAIPEIALT
CCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCEEECCCCCCEEEHHHHHHCCEEEEE
FGVDIPIHIPINIDAGVVTLQGFSIVAAENNIDFTPIIIPTINITLPTAAITVGGPTTSI
ECCCEEEEEEEECCCCEEEEECEEEEEECCCCCEEEEEEEEEEEEECEEEEEECCCCCEE
GITASAGIGSITIPIIDIPATSGFGNSTTSPSSGFFNSGAGSASGFLNVVAGASGISGYL
EEEECCCCCEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCEE
NVGALGSGVTNVGHTVSGFYNASALDLVTPAFASGLMRDGMGTMTLNLGLANLGSNNAGF
EECCCCCCCCCCCCEECCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCC
GNTGIFDVGVANLGNYNIGFGNFGDDNLGFANLGSYNIGVANTGSNNIGFANTGSNNIGI
CCCCEEEEECCCCCCCCCCCCCCCCCCCCEEECCCEEEEEEECCCCCCEEEECCCCCEEE
GLTGTGQIGIGALNSGSGNIGLFNSGDGNIGFFNSGTGNFGIGNTGTGNFGIGNSGSTST
EEECCCCEEEEEEECCCCCEEEEECCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCC
GLFNSGDGNTGGFNPGNFNTGNFNTGSFNTGGFNAGNTNTGHFNTGNYNTGIANTGDVST
CEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCCCCCCCCCCCC
GAFISGNYSNGILWRGDYQGLIGYSYALTIPEIPAHLDVNIPIDIPITGSFTDLVVDNFT
CEEEECCCCCCCEECCCCCCEEEEEEEEECCCCCCEEEEECEEEEEECCCHHHEEECCEE
IPIIGFESFAFSFHIHTEPDIGPIIVPSFVLSVPTFAIAVGGPTTAINISATAGLGPITI
EEEEEEEEEEEEEEEECCCCCCCEEHHHHHHHCCEEEEEECCCCEEEEEEECCCCCCEEE
PIIDIPAAPGIGNSTTSPSSGFFNTGAGTASGFGNVGGNTSGLWNLASAASGVSGLLNVG
EEEECCCCCCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCCEEEHHHHHCCHHHHHHHC
ALGSGVANVGNTISGIYNTSPLDLGTPAFGSGLANIAGLLQGGAGTTILDLAGLGNLNVG
CCCCCHHHCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCEEEEECCCCCCCEE
LANLGGSNFGIGNTGIFNVGFANVGNHNIGLANLGNYSVGFANSGNYHIGIANTGSANIG
EEECCCCCCCCCCCCEEEEEEECCCCCCEEEEECCCEEEEECCCCCEEEEEECCCCCCCC
FANTGSGNIGIGLTGTGQIGFGSFNSGSHNIGLFNSGDGNVGFFNSGTGNVGIGNTGTAN
EEECCCCCEEEEEEECCCEECCCCCCCCCEEEEEECCCCCEEEEECCCCCCCCCCCCCCC
FGIANSGSFNTGLGNTGSTNTGLFNPGNVNTGVGNTGSINTGSINTGSFNTGSTNTGSFN
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEE
LGDHNTGSFNSGDYNTGYFNAGDYNTGVANTGNVNTGAFISGNYSNGFFWRGDYQGLIGL
CCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCEEEECCCCCCCEECCCCCEEEEE
STTITIPEIPYRYDLSVPIDIPITGTVVATTPNSFTIPGFQIRVLLGPAAVLVNEMIGPI
EEEEECCCCCEEEECCEEEECCCCCEEEEECCCCEECCCEEEEEEECCHHHHHHHHCCCE
TIDVNQVIAIDSPIQQTISMVGTGGFGPIPIGISIGGTPGFGNSTTGPSSGFFHTGAGHV
EEECCEEEEECCHHHHHHHHHCCCCCCCEEEEEEECCCCCCCCCCCCCCCCEEECCCCCC
SGFGNFGAGNMSGSGNFGAGNSGFFNAGGLGNSGLLNFGALQSGLANLGNTISGVYNTST
CCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCEEHHHHHHHHHHHCCHHCCEECCCE
LDLATPAFGSGIANIGANLAGLFLDNTGNLTLNFGVANQGGLNAGIGNLGSVNIGFVNTG
EEECCCCCCCCHHHHCCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCEEEEEEEECC
DSNLGIGNLGDLNFGGVNIGGNNIGIANTGIFDIGLANLGSYNIGLANLGDDNLGFGNAG
CCCCCCCCCCCCCCCCEEECCCEEEEECCCEEEEECCCCCCCCEEEEECCCCCCCCCCCC
SYNIGFANFGSDNLGFANTGSYNIGFANTGNNNIGVGLTGNGQIGIGSLNSGSNNIGLFN
CEEEEEEECCCCCCCCCCCCCEEEEEEECCCCCEEEEEECCCCEEEEECCCCCCCEEEEE
SGSGNIGFFNSGTGNVGIFNTGTGNFGLANSGGFNTGIGNAGSTNTGVFNPGDLNTGSFN
CCCCCEEEEECCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
PGSFNTGGFNPGSGNTGYLNTGDYNTGVANTGDVDTGAFITGSYSNGFLVSGDYQGLIGL
CCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCEEEECCCCCEEEECCCCCEECC
PLLGIPVTPGYFNLTGGPSSGFFNSGAGSVSGFVNSGAGLSGYLNTGALGSGVANVGNTI
CEEEEECCCCEEEECCCCCCCCCCCCCCCCCHHCCCCCCCEEEECCCCCCCCHHHCCCCH
SGWLNASALDLATPGFLSGIGNFGTNLAGFFRG
HHHCCCCEEECCCCHHHHHHHHCCCCCHHCCCC
>Mature Secondary Structure 
ANTGNINTGAFISGNHSNGLLWRGDNQGLIDLAIGVDIPEIPIVSVDVNIPIHIPITAS
CCCCCCCCCEEEECCCCCCEEEECCCCCEEEEEECCCCCCCCEEEEEECCEEEEEEECC
FTDIVYSGLDLPPNTAVTVIFFGPVDIDPFTVPVIRITGPTPVVMVGGPTTAINIGATVV
HHHHHHCCCCCCCCCEEEEEEECCCCCCCEEEEEEEECCCCCEEEECCCCEEEECCEEEE
VDAINIPIIHIPATPGFGNSTGGLSSGFFNSGAGSASGFGNFGGAASGFMNLVSTTSGMS
EEEECCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCC
GFLNVGALGSGVANVGNTISGIYNVGTSDLSTPAVNSGLANIGTNIAGLLRDGAGTAAIN
CEEECCCCCCCHHHCCCHHHHEEECCCCCCCCCCCCCCHHHHCCCCEEHHCCCCCEEEEE
LGLANHGNLNVGFASLGGFNFGGATIGHNNVGIGNTGIFDVGLANLGSYNIGFGNLGDDN
EEECCCCCEEEEEEECCCCCCCCEEECCCCCCCCCCCEEEEHHHCCCCCCCCCCCCCCCC
LGFGNFGSYNIGFGNVGNDNLGFANAGGGNIGFANTGSNNVGFGNTGSNNVGIGLTGNGQ
CCCCCCCCEECCCCCCCCCCCCEECCCCCCEEEECCCCCCCCCCCCCCCCEEEEEECCCE
IGFGSFNSGSGNIGLFNSGSNNIGFFNSGSGNFGIANSGSFNTGIGNTGNTNTGLFNSGD
EEECCCCCCCCCEEEEECCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCC
VNTGAFNPGSFNTGSFNTGSFNTGGFNPGNTNTGYLNIGNYNTGIANTGDVDTGAFITGN
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCEEEEC
YSNGLFLSGDYQGLVGLNLVIDMPLPISLGVNIPIDIPITASAGNITLMGVTIPPTGDIV
CCCCEEEECCCCCEEEEEEEEECCCCEEECCCCCEEEEEECCCCCEEEEEEEECCCCCCC
LSSIAGQRAHFGPITIPNITVVGPTTTVAIGGPNTAITITGGGAIRIPLISIPAAPGFGN
EEHHCCCCCCCCCEECCCEEEECCCEEEEECCCCEEEEEECCCEEEEEEEEECCCCCCCC
STTNPSSGFFNTGAGGASGFGNFGGANSGFWNLASATSGASGLLNVGALGSGLANVGTTV
CCCCCCCCCEECCCCCCCCCCCCCCCCCCCEEECCCCCCCCCEEEECCCCCCHHHCCCEE
SGFYNTSTSDLATPAFNSGLANISTSIAGLLRDSTGTMVLNLGLANHGTLNVGIANLGDY
EEEECCCCCCCCCCCCCCCHHHHHHHHHHHEECCCCEEEEEEECCCCCEEEEEEEECCCC
NIGFANLGSANFGSANIGGNNIGGANTGIFDIGLANLGSYNIGFGNFGDDNLGFGNLGSY
CCCEEECCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCEECCCCCCCCCCCCCCCCCCC
NVGFGNLGNDNLGFANTGSNNIGFANTGSNNIGIGLTGDGQIGFGSLNSGSGNIGLFNSG
CCCCCCCCCCCCCEEECCCCCCEEEECCCCCEEEEEECCCCEEECCCCCCCCCEEEEECC
SGNIGFFNSGNGNVGIGNTGTANFGLGNTGSTNTGFFNSGDVNTGIGNTGSFNTGSFNPG
CCCEEEEECCCCCEECCCCCCCCCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCCCCCC
DSNTGDFNPGSYNTGLGNTGDVDTGAFISGSYSNGFLWSGNYQGLIGLHAALAIPEIALT
CCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCEEECCCCCCEEEHHHHHHCCEEEEE
FGVDIPIHIPINIDAGVVTLQGFSIVAAENNIDFTPIIIPTINITLPTAAITVGGPTTSI
ECCCEEEEEEEECCCCEEEEECEEEEEECCCCCEEEEEEEEEEEEECEEEEEECCCCCEE
GITASAGIGSITIPIIDIPATSGFGNSTTSPSSGFFNSGAGSASGFLNVVAGASGISGYL
EEEECCCCCEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCEE
NVGALGSGVTNVGHTVSGFYNASALDLVTPAFASGLMRDGMGTMTLNLGLANLGSNNAGF
EECCCCCCCCCCCCEECCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCC
GNTGIFDVGVANLGNYNIGFGNFGDDNLGFANLGSYNIGVANTGSNNIGFANTGSNNIGI
CCCCEEEEECCCCCCCCCCCCCCCCCCCCEEECCCEEEEEEECCCCCCEEEECCCCCEEE
GLTGTGQIGIGALNSGSGNIGLFNSGDGNIGFFNSGTGNFGIGNTGTGNFGIGNSGSTST
EEECCCCEEEEEEECCCCCEEEEECCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCC
GLFNSGDGNTGGFNPGNFNTGNFNTGSFNTGGFNAGNTNTGHFNTGNYNTGIANTGDVST
CEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCCCCCCCCCCCC
GAFISGNYSNGILWRGDYQGLIGYSYALTIPEIPAHLDVNIPIDIPITGSFTDLVVDNFT
CEEEECCCCCCCEECCCCCCEEEEEEEEECCCCCCEEEEECEEEEEECCCHHHEEECCEE
IPIIGFESFAFSFHIHTEPDIGPIIVPSFVLSVPTFAIAVGGPTTAINISATAGLGPITI
EEEEEEEEEEEEEEEECCCCCCCEEHHHHHHHCCEEEEEECCCCEEEEEEECCCCCCEEE
PIIDIPAAPGIGNSTTSPSSGFFNTGAGTASGFGNVGGNTSGLWNLASAASGVSGLLNVG
EEEECCCCCCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCCEEEHHHHHCCHHHHHHHC
ALGSGVANVGNTISGIYNTSPLDLGTPAFGSGLANIAGLLQGGAGTTILDLAGLGNLNVG
CCCCCHHHCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCEEEEECCCCCCCEE
LANLGGSNFGIGNTGIFNVGFANVGNHNIGLANLGNYSVGFANSGNYHIGIANTGSANIG
EEECCCCCCCCCCCCEEEEEEECCCCCCEEEEECCCEEEEECCCCCEEEEEECCCCCCCC
FANTGSGNIGIGLTGTGQIGFGSFNSGSHNIGLFNSGDGNVGFFNSGTGNVGIGNTGTAN
EEECCCCCEEEEEEECCCEECCCCCCCCCEEEEEECCCCCEEEEECCCCCCCCCCCCCCC
FGIANSGSFNTGLGNTGSTNTGLFNPGNVNTGVGNTGSINTGSINTGSFNTGSTNTGSFN
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEE
LGDHNTGSFNSGDYNTGYFNAGDYNTGVANTGNVNTGAFISGNYSNGFFWRGDYQGLIGL
CCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCEEEECCCCCCCEECCCCCEEEEE
STTITIPEIPYRYDLSVPIDIPITGTVVATTPNSFTIPGFQIRVLLGPAAVLVNEMIGPI
EEEEECCCCCEEEECCEEEECCCCCEEEEECCCCEECCCEEEEEEECCHHHHHHHHCCCE
TIDVNQVIAIDSPIQQTISMVGTGGFGPIPIGISIGGTPGFGNSTTGPSSGFFHTGAGHV
EEECCEEEEECCHHHHHHHHHCCCCCCCEEEEEEECCCCCCCCCCCCCCCCEEECCCCCC
SGFGNFGAGNMSGSGNFGAGNSGFFNAGGLGNSGLLNFGALQSGLANLGNTISGVYNTST
CCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCEEHHHHHHHHHHHCCHHCCEECCCE
LDLATPAFGSGIANIGANLAGLFLDNTGNLTLNFGVANQGGLNAGIGNLGSVNIGFVNTG
EEECCCCCCCCHHHHCCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCEEEEEEEECC
DSNLGIGNLGDLNFGGVNIGGNNIGIANTGIFDIGLANLGSYNIGLANLGDDNLGFGNAG
CCCCCCCCCCCCCCCCEEECCCEEEEECCCEEEEECCCCCCCCEEEEECCCCCCCCCCCC
SYNIGFANFGSDNLGFANTGSYNIGFANTGNNNIGVGLTGNGQIGIGSLNSGSNNIGLFN
CEEEEEEECCCCCCCCCCCCCEEEEEEECCCCCEEEEEECCCCEEEEECCCCCCCEEEEE
SGSGNIGFFNSGTGNVGIFNTGTGNFGLANSGGFNTGIGNAGSTNTGVFNPGDLNTGSFN
CCCCCEEEEECCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
PGSFNTGGFNPGSGNTGYLNTGDYNTGVANTGDVDTGAFITGSYSNGFLVSGDYQGLIGL
CCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCEEEECCCCCEEEECCCCCEECC
PLLGIPVTPGYFNLTGGPSSGFFNSGAGSVSGFVNSGAGLSGYLNTGALGSGVANVGNTI
CEEEEECCCCEEEECCCCCCCCCCCCCCCCCHHCCCCCCCEEEECCCCCCCCHHHCCCCH
SGWLNASALDLATPGFLSGIGNFGTNLAGFFRG
HHHCCCCEEECCCCHHHHHHHHCCCCCHHCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9634230 [H]