Definition: Salmonella enterica subsp. enterica serovar Typhi str. Ty2 chromosome, complete genome.
Accession: NC_004631
Length: 4,791,961 bp

The map label for this gene is rhsA [H]

Identifier: 29142946

GI number: 29142946

Start: 2645735

End: 2649799

Strand: Reverse

Name: rhsA [H]

Synonym: t2569

Alternate gene names: 29142946

Gene position: 2649799-2645735 (Counterclockwise)

Preceding gene: 29142947

Following gene: 29142945

Centisome position: 55.3
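
The coordinate fields above are related by simple arithmetic. The following is a minimal sketch, assuming inclusive 1-based coordinates and assuming that the centisome value is the first base of the gene (the End coordinate for a reverse-strand gene) expressed as a percentage of the genome length. The function names are illustrative and not part of the database.

```python
GENOME_LENGTH = 4_791_961  # Length field of NC_004631


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given inclusive Start/End coordinates."""
    return end - start + 1


def centisome(first_base: int, genome_length: int) -> float:
    """Position of a base as a percentage of the genome length (assumed definition)."""
    return round(100.0 * first_base / genome_length, 1)


print(gene_length(2645735, 2649799))      # 4065, matching the ">4065_bases" header
print(centisome(2649799, GENOME_LENGTH))  # 55.3, matching the Centisome position field
```

Under these assumptions both derived values agree with the record, which suggests the centisome position is measured from the gene's first base on the reverse strand rather than from the Start coordinate.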

GC content: 59.83
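
The GC content field is presumably the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows; the function name and the two-decimal rounding are assumptions chosen to match the format of the field, not a documented method of the database.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks (as in a wrapped FASTA body) are ignored.
    """
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# Small illustrative input; pass the full 4065-base sequence to check the field.
print(gc_percent("ATGC"))  # 50.0
```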

Gene sequence:

>4065_bases
ATGTATGAAGCAGCCCGTGTGGATGACCCTATCTACCACACCAGTGCGCTCGCCGGGTTTCTTATCGGCGCTATCATCGG
CATCGCCATTATCGCGCTTGCCGCCTTTGCCTTCTTTAGCTGCGGTTTTCTTGCCGGGCTGATTCTGGGTTTTATGGCCG
ATCAAATAGCCTCCGGGGTATTGCAACTGGGCGAGGCCATCGGGCGCTCCATCCACCACACGGCAGGAAAAATCCTCACC
GGTTCGGAGAATGTCAGCACCAACAGTCGCCCGGCGGCGCGCGCGGTACTGAGTACGGTGAAATGCGATAACCATATCGC
AGAAAAACGCATCGCCCAGGGTTCCGGCAATATTTACATCAACAGCCAGCCTGCCGCCCGTAAGGATGACCATACCGAAT
GCGATGCGGTGGTTGAAGACGGTTCGCCGAATGTGTTTCTCGGCGGCGGCACACAGACGGTACTGGAAATCAGTTCTGAA
ATTCCGGACTGGCTGCGCAAGGTGGTGGATGTATTGTTTGTCGTGGCGAGTCTGCTCGGCGGGCTGGCCGGGGCGTGGCG
GCAGGCGGCAAAGCTGGGGACGAAATTTGGCACTAAATGTGCCGCTAAGTTTATCGGCGGGGAGCTTACCGGGATGGCCG
TGAGTGAGGCTATCAGCGGGCTGTTCAGCAATCCGGTGGATGTGACCACCGGGCAGAAAATCCTGCTGCCGGAAACGGAC
TTCACCCTGCCCGGTCGCCTGCCGGTCACCTGCTCGCGTTTTTACGCCAGCCACCTGGAAACCGTGGGACTGTTGGGACA
GGGCTGGCGGCTGAACTGGGAAACCAGCCTGCGCGAGGACGATGAACACATCACGCTGACCGGCGTACAGGGGCGGGAAC
TGCGTTACCCGAAAATGATGCTGACGCCCGGCCACCAGATATTTGGCCCGAAAGAACAGTTATACCTCAGCCGCCTGCAT
GACGGGCGTTACGTGCTGCATTACACCGATCGCAGCTATTACGTATTTGGTGATTTTGACAGTGACGGCATGGCATACCT
GCTGTTTATGGAGACGCCGCACCGCCAGCGCATTGTCTTCGGGCACGAAGGAGGCAGACTGGTACGGATAGCCTCCAGCA
GCGGGCATCACCTGTTACTGCACCGCACACAGACCCCGGCAGGGGAGCGGCTGTCGCGAATTGAACTGGTGCAGGGCGGC
ACCTGTGGCAATCTGGTGGAGTACCGGTATGACGATAACGGTCAACTGACCGGCGTGGTGAACCGGGCGGGAACGCAGGT
GCGGCAGTTTGCTTATGAAAACGGGCTGATGACGGCGCACAGCAATGCGGCGGGGTTCACCTGCCGCTACCGCTGGCAGG
AACTCGACGGCGCGCCGCGCGTGACGGAGCACGACACCAGTGACGGTGAACATTACCGCTTTGACTATGATTTTGCCGCA
GGCACCACCACCGTCACCGGCAGGCAGGGGGAGTCATGGCAGTGGTGGTACGACAGGGAGACGTATATCACCGCGCACCG
GACGCCGGGCGGTGGAATGTACCGCTTCACGTACAACGAAGACCACTTCCCTGTCAACATTGGGCTGCCCGGCGGTCGCA
CGGTGGCGTATGAATATGACATCCAGAACCGGGTGGTGAAGACGACAGATCCGGAAGGCCGGGTGACGCAGACGCAGTGG
AACGGCGAGTTCGACGAAATCACGCGCACGGCGCTGGACGATGACGCTGTCTGGAAAACGCAGTACAACGCCCACGGCCA
GCCAGTGCAGGAGACGGACCCGGAAGGGCGGGTGACGCAGTACGCTTACGATGAACAGGGGCAGATGTGCAGCCGGACGG
ATGCGGCGGGCGGCACGGTGGTGACGGCGTTCGACAGCCGGGGGCAGATGACGCGGTACACCGACTGTTCAGGGCGTAGC
ACAGGATATGACCACGATGAGGACGGCAACCTGACGCGGGTGACGGACGCGGAAGGGAAGGTGGTACGCATCAGCTACAA
CCGACTTGGGTTGCCGGAGACGGTGAACTCACCGGGGAAACAGCAGGACAGGTATACCTGGAATGCGCTGGGGCTGATGA
GCAGCCACCGGCGCATCACGGGGAGCGTGGAGAGCTGGCGGTATACGCCGCGCGGTCTGCTGGCGGCGCACACGGATGAG
GAGAAGCGCGAGACGCGCTGGCAGTACACGCCGGAAGGCCGGGTGACCGCGCTGACCAACGGCAACGGGGCGCAGTACCG
GTTCAGTCACGATGCGGACGGCAGGCTGATGCGTGAGGTGCGCCCGGACGGACTGAGCCGTACTTTTATCCTGGACGACA
GCGGTTATCTGACGGCGATACAGACCACGGGTACGCAGGGCGGCGTGCGGCGGGAGACGCAGCAGCGGGATGCGCTGGGC
CGTCTGTTACGGACGGAGAATGAACACGGCCAGCGGACGTTCAGCTACAACCGGCTGGACCAGATAACGGCAGTGACGCT
CACGCCCACGGAGGCGGGGCAACAGCAGCACCGGATGCAGGCCGACACGGTGCGTTTTGAGTATGACCGCAGCGGCTGGC
TGACGGCGGAGCACGCGGGGAACGGTAGCATATGTTATCAGCGCGACGCGCTGGGCAACCCGACGGACATCACGCTGCCG
GACGGGCAGCACCTGACGCATCTGTATTACGGGAGCGGGCATCTGTTACAGACGGCGCTGGACGGCCTGACGGTGAGCGA
GTATGAGCGCGACAGCCTGCACCGTCAGATAATGCGCACGCAGGGGCAGCTTGCGACGTACAGCGGCTATGACGACGACG
GGCTGCTGAGCTGGCAGCGCAGCCTGGCGCCCGGCAGTGCCCCTGTTCTTCCCGGCCAGCGCCCGGCGCGGCAGGGCTGC
GTGACGTCGAGGGACTATTACTGGAACAACCACGGCGAGGTGGGCACGATTGACGACGGCCTGCGTGGCAGCGTGGTGTA
CAGCTATGACAGAAGCGGTTACCTGACCGGGCGCTCAGGTCAGATGTATGACCATGACCGTTATTATTACGATAAGGCGG
GCAACCTGCTGGATAACGAAGGGCAGGGTCCGGTGATGAACAACCGGCTGCCGGGCTGTGGTCGTGACCGTTACGGTTAT
AACGAGTGGGGCGAGCTGACCACGCGGCGCGACCAGCAACTGGAGTGGAACGCGCAGGGGCAGCTGACGCGGGTCATCAG
CGGCAACACGGAGACGCACTACGGCTACGATGCGCTGGGGAGGCGAATCCGCAAGGCGACGTACGGGCGGCACACGGGCC
ATACGGCGCGGAGCCGGACGGACTTTGTGTGGGAGGGGTTCAGGCTGTTGCAGGAGAACGTGCAGCAGCAGGGCTGGCGG
ACATATGTGTACGATGCGGAACAGCCGTACACGCCGGTGGCGAGCGTGACGGGGCGGGGAGAAAGCAGGCAGGTGTGGTA
TTACCACACGGACGTGACGGGCACGCCGCAGGAGGTGACGGCGGCGGACGGAACGCTGGTGTGGGCGGGGTATATCAGGG
GGTTTGGAGAGAATGCGGCGGACATCAGCAACAGCGGGGCGTACTTTCACCAGCCGCTGCGGCTGCCGGGGCAGTATTTT
GACGACGAGACGGGGCTGCATTACAATCTGTTCAGATATTATGCACCGGAGTGTGGACGGTTCGTCAGTCAGGATCCGAT
TGGGTTAAGGGGCGGGTTAAACCTGTATGCGTACTGCCCTAATCCACTGACATGGATAGATCCTCTGGGACTGGATTTAC
ACCATATAATTCCGCAAGAGGTTTGGAAAACTTTTAAAACGGATTTAAAAAAAGTCACCGGATATGTGCAAAACGTCACG
AAGAAAGCAATGGATGTAACAAACTTAATTGATCTGGATAAACCTTTCCATGGTAATCATCCTGCCTATAGTGCTTATGT
AAAAGATAAAATTAGTGAACTTATAAAAAATAAAAAATTAAGTTTAACTGAGCTAAGAAATTTGCAGGATGAGTTGAGGT
CTAAAATTAATGCTGCAAAAGCATCTGGTAAAAATCTGAATGACTATTTTAAAGGAGGTTGCTAA

Upstream 100 bases:

>100_bases
CAGTCCCCGTCCTTTTGATGATAAAGCAGACCTACTCTGGAACACCTGGCTGGCAGGCTTTCAGCCGGATAAAAACGAAT
AATCACACGGAGGTGTGACC

Downstream 100 bases:

>100_bases
TAATGAAATATAAAAAAATAAGAGTATCTTATGATACCGAAGTTACTGGAAATGTTAATGGAGTCTACTCGGTAGAGATA
AAAGATAGTCTTTCCTTTAA

Product: Rhs-family protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 1354; Mature: 1354
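
The residue count follows from the gene length: the 4065-base sequence ends in the stop codon TAA, so one of its codons is not translated. A minimal sketch, assuming the reported gene sequence is a complete open reading frame that includes its stop codon (the function name is illustrative):

```python
def expected_residues(n_bases: int) -> int:
    """Number of amino acids encoded by an ORF whose length includes the stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    return n_bases // 3 - 1  # the final codon is the stop codon


print(expected_residues(4065))  # 1354
```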

Protein sequence:

>1354_residues
MYEAARVDDPIYHTSALAGFLIGAIIGIAIIALAAFAFFSCGFLAGLILGFMADQIASGVLQLGEAIGRSIHHTAGKILT
GSENVSTNSRPAARAVLSTVKCDNHIAEKRIAQGSGNIYINSQPAARKDDHTECDAVVEDGSPNVFLGGGTQTVLEISSE
IPDWLRKVVDVLFVVASLLGGLAGAWRQAAKLGTKFGTKCAAKFIGGELTGMAVSEAISGLFSNPVDVTTGQKILLPETD
FTLPGRLPVTCSRFYASHLETVGLLGQGWRLNWETSLREDDEHITLTGVQGRELRYPKMMLTPGHQIFGPKEQLYLSRLH
DGRYVLHYTDRSYYVFGDFDSDGMAYLLFMETPHRQRIVFGHEGGRLVRIASSSGHHLLLHRTQTPAGERLSRIELVQGG
TCGNLVEYRYDDNGQLTGVVNRAGTQVRQFAYENGLMTAHSNAAGFTCRYRWQELDGAPRVTEHDTSDGEHYRFDYDFAA
GTTTVTGRQGESWQWWYDRETYITAHRTPGGGMYRFTYNEDHFPVNIGLPGGRTVAYEYDIQNRVVKTTDPEGRVTQTQW
NGEFDEITRTALDDDAVWKTQYNAHGQPVQETDPEGRVTQYAYDEQGQMCSRTDAAGGTVVTAFDSRGQMTRYTDCSGRS
TGYDHDEDGNLTRVTDAEGKVVRISYNRLGLPETVNSPGKQQDRYTWNALGLMSSHRRITGSVESWRYTPRGLLAAHTDE
EKRETRWQYTPEGRVTALTNGNGAQYRFSHDADGRLMREVRPDGLSRTFILDDSGYLTAIQTTGTQGGVRRETQQRDALG
RLLRTENEHGQRTFSYNRLDQITAVTLTPTEAGQQQHRMQADTVRFEYDRSGWLTAEHAGNGSICYQRDALGNPTDITLP
DGQHLTHLYYGSGHLLQTALDGLTVSEYERDSLHRQIMRTQGQLATYSGYDDDGLLSWQRSLAPGSAPVLPGQRPARQGC
VTSRDYYWNNHGEVGTIDDGLRGSVVYSYDRSGYLTGRSGQMYDHDRYYYDKAGNLLDNEGQGPVMNNRLPGCGRDRYGY
NEWGELTTRRDQQLEWNAQGQLTRVISGNTETHYGYDALGRRIRKATYGRHTGHTARSRTDFVWEGFRLLQENVQQQGWR
TYVYDAEQPYTPVASVTGRGESRQVWYYHTDVTGTPQEVTAADGTLVWAGYIRGFGENAADISNSGAYFHQPLRLPGQYF
DDETGLHYNLFRYYAPECGRFVSQDPIGLRGGLNLYAYCPNPLTWIDPLGLDLHHIIPQEVWKTFKTDLKKVTGYVQNVT
KKAMDVTNLIDLDKPFHGNHPAYSAYVKDKISELIKNKKLSLTELRNLQDELRSKINAAKASGKNLNDYFKGGC

Sequences: the translated and mature sequences (1354 residues each) are identical to the protein sequence listed above.

Specific function: Rhs elements have a nonessential function. They may play an important role in the natural ecology of the cell [H]

COG id: COG3209

COG function: function code M; Rhs family protein

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the RHS family [H]

Homologues:

Organism=Escherichia coli, GI1790020, Length=825, Percent_Identity=26.9090909090909, Blast_Score=167, Evalue=4e-42,
Organism=Escherichia coli, GI48994942, Length=825, Percent_Identity=26.7878787878788, Blast_Score=166, Evalue=7e-42,
Organism=Escherichia coli, GI1786917, Length=822, Percent_Identity=26.8856447688564, Blast_Score=164, Evalue=3e-41,
Organism=Escherichia coli, GI1786706, Length=853, Percent_Identity=26.0257913247362, Blast_Score=146, Evalue=8e-36,
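
Each homologue line is a comma-separated list of key=value pairs, with the GI number given as a bare `GI<number>` token. A minimal sketch of a parser for this layout follows; the function name and the returned dictionary keys are assumptions made for illustration, and organism names containing commas are not handled.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for part in line.strip().rstrip(",").split(","):
        part = part.strip()
        if "=" in part:
            key, value = part.split("=", 1)
            record[key] = value
        elif part.startswith("GI"):
            record["GI"] = part[2:]  # bare token such as GI1790020
    # Convert the numeric fields to numbers.
    for key in ("Length", "Blast_Score"):
        record[key] = int(record[key])
    for key in ("Percent_Identity", "Evalue"):
        record[key] = float(record[key])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1790020, Length=825, "
    "Percent_Identity=26.9090909090909, Blast_Score=167, Evalue=4e-42,"
)
print(hit["GI"], hit["Length"], round(hit["Percent_Identity"], 2))
```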

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001826
- InterPro:   IPR022385
- InterPro:   IPR006530 [H]

Pfam domain/function: PF03527 RHS; PF05593 RHS_repeat [H]

EC number: NA

Molecular weight: Translated: 151459; Mature: 151459

Theoretical pI: Translated: 6.55; Mature: 6.55

Prosite motif: PS00995 TCP1_3

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.1 %Cys     (Translated Protein)
1.3 %Met     (Translated Protein)
2.4 %Cys+Met (Translated Protein)
1.1 %Cys     (Mature Protein)
1.3 %Met     (Mature Protein)
2.4 %Cys+Met (Mature Protein)
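
The Cys/Met percentages are presumably residue counts divided by the protein length. A minimal sketch under that assumption (the function name and the one-decimal rounding are illustrative):

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of any of the given one-letter residues."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in residues.upper())
    return round(100.0 * count / len(protein), 1)


# Small illustrative input; pass the 1354-residue sequence to check the fields.
toy = "MCAAAAAAAA"
print(residue_percent(toy, "C"))   # 10.0
print(residue_percent(toy, "M"))   # 10.0
print(residue_percent(toy, "CM"))  # 20.0
```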

Secondary structure:

>Translated Secondary Structure
MYEAARVDDPIYHTSALAGFLIGAIIGIAIIALAAFAFFSCGFLAGLILGFMADQIASGV
CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
LQLGEAIGRSIHHTAGKILTGSENVSTNSRPAARAVLSTVKCDNHIAEKRIAQGSGNIYI
HHHHHHHHHHHHHCCCCEEECCCCCCCCCCHHHHHHHHHHHHCHHHHHHHHHCCCCCEEE
NSQPAARKDDHTECDAVVEDGSPNVFLGGGTQTVLEISSEIPDWLRKVVDVLFVVASLLG
CCCCCCCCCCCCCCCEEEECCCCCEEECCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHH
GLAGAWRQAAKLGTKFGTKCAAKFIGGELTGMAVSEAISGLFSNPVDVTTGQKILLPETD
HHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCC
FTLPGRLPVTCSRFYASHLETVGLLGQGWRLNWETSLREDDEHITLTGVQGRELRYPKMM
CCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCEEEEEECCCCCCCCCEEE
LTPGHQIFGPKEQLYLSRLHDGRYVLHYTDRSYYVFGDFDSDGMAYLLFMETPHRQRIVF
ECCCCHHCCCHHHHHHHHHCCCEEEEEEECCEEEEEEEECCCCEEEEEEEECCCCCEEEE
GHEGGRLVRIASSSGHHLLLHRTQTPAGERLSRIELVQGGTCGNLVEYRYDDNGQLTGVV
ECCCCEEEEEECCCCCEEEEEECCCCCHHHHHHEEEECCCCCCCEEEEEECCCCCEEEEE
NRAGTQVRQFAYENGLMTAHSNAAGFTCRYRWQELDGAPRVTEHDTSDGEHYRFDYDFAA
HHHHHHHHHHHHHCCCEEECCCCCCCEEEEEEHHCCCCCCCCCCCCCCCCEEEEEEEECC
GTTTVTGRQGESWQWWYDRETYITAHRTPGGGMYRFTYNEDHFPVNIGLPGGRTVAYEYD
CCEEEECCCCCCEEEEECCEEEEEEEECCCCCEEEEEECCCCEEEEEECCCCCEEEEEEC
IQNRVVKTTDPEGRVTQTQWNGEFDEITRTALDDDAVWKTQYNAHGQPVQETDPEGRVTQ
CCCCEEECCCCCCCEEEEECCCCHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCCCEEE
YAYDEQGQMCSRTDAAGGTVVTAFDSRGQMTRYTDCSGRSTGYDHDEDGNLTRVTDAEGK
EEECCCCCHHCCCCCCCCEEEEEECCCCCEEEEECCCCCCCCCCCCCCCCEEEEECCCCC
VVRISYNRLGLPETVNSPGKQQDRYTWNALGLMSSHRRITGSVESWRYTPRGLLAAHTDE
EEEEECCCCCCCHHCCCCCCCCCCEEEHHHHHHHCCCEEECCHHHCCCCCCCEEEECCCC
EKRETRWQYTPEGRVTALTNGNGAQYRFSHDADGRLMREVRPDGLSRTFILDDSGYLTAI
HHHHCCCEECCCCCEEEEECCCCCEEEECCCCCCHHHHHCCCCCCCEEEEEECCCCEEEE
QTTGTQGGVRRETQQRDALGRLLRTENEHGQRTFSYNRLDQITAVTLTPTEAGQQQHRMQ
EECCCCCCCHHHHHHHHHHHHHHHCCCCCCCEEECCCCCCCEEEEEECCCCCCHHHHHHH
ADTVRFEYDRSGWLTAEHAGNGSICYQRDALGNPTDITLPDGQHLTHLYYGSGHLLQTAL
HHCEEEEECCCCCEEEECCCCCCEEEECCCCCCCCEEECCCCCCEEEEEECCCHHHHHHH
DGLTVSEYERDSLHRQIMRTQGQLATYSGYDDDGLLSWQRSLAPGSAPVLPGQRPARQGC
CCCCHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHCCCCCCCCCCCCCCHHHCC
VTSRDYYWNNHGEVGTIDDGLRGSVVYSYDRSGYLTGRSGQMYDHDRYYYDKAGNLLDNE
CCCCCCEECCCCCEECCCCCCCCCEEEEECCCCEEECCCCCCCCCCCEEECCCCCCCCCC
GQGPVMNNRLPGCGRDRYGYNEWGELTTRRDQQLEWNAQGQLTRVISGNTETHYGYDALG
CCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCEEECCCCCEEEEEECCCCCCCCHHHHH
RRIRKATYGRHTGHTARSRTDFVWEGFRLLQENVQQQGWRTYVYDAEQPYTPVASVTGRG
HHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCHHHCCCCC
ESRQVWYYHTDVTGTPQEVTAADGTLVWAGYIRGFGENAADISNSGAYFHQPLRLPGQYF
CCCEEEEEECCCCCCCCCEECCCCCEEEEHHHHCCCCCCCCCCCCCCEEECCCCCCCCCC
DDETGLHYNLFRYYAPECGRFVSQDPIGLRGGLNLYAYCPNPLTWIDPLGLDLHHIIPQE
CCCCCCEEEHHHHHCCHHHHHCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCHHHHCHHH
VWKTFKTDLKKVTGYVQNVTKKAMDVTNLIDLDKPFHGNHPAYSAYVKDKISELIKNKKL
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHCCCC
SLTELRNLQDELRSKINAAKASGKNLNDYFKGGC
CHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCC
>Mature Secondary Structure
Identical to the translated secondary structure above.

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta
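
The predicted secondary structure strings listed above can be summarized as a per-state composition, which gives a rough sense of why a protein might be classed as Alpha Beta. A minimal sketch, assuming the conventional three-state alphabet (H = helix, E = strand, C = coil); the record does not define the letters, and the function name is illustrative.

```python
from collections import Counter


def ss_composition(ss: str) -> dict:
    """Percentage of each state (H, E, C) in a secondary structure string."""
    ss = "".join(ss.split()).upper()
    if not ss:
        raise ValueError("empty secondary structure string")
    counts = Counter(ss)
    return {state: round(100.0 * counts[state] / len(ss), 1) for state in "HEC"}


# Small illustrative input; concatenate the H/E/C lines of the record for the real values.
print(ss_composition("HHEC"))  # {'H': 50.0, 'E': 25.0, 'C': 25.0}
```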

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 2403547; 8041620; 9278503; 7934896 [H]