Definition Acidovorax citrulli AAC00-1 chromosome, complete genome.
Accession NC_008752
Length 5,352,772

The map label for this gene is rhsD [H]

Identifier: 120610728

GI number: 120610728

Start: 2237201

End: 2242513

Strand: Direct

Name: rhsD [H]

Synonym: Aave_2049

Alternate gene names: 120610728

Gene position: 2237201-2242513 (Clockwise)

Preceding gene: 120610727

Following gene: 120610729

Centisome position: 41.8
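
The centisome value can be reproduced from the Start and Length fields above. The sketch below assumes that centisome position means the gene start expressed as a percentage of the chromosome length; the record does not state its formula, and the function name is illustrative only.

```python
def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the total chromosome length.

    Assumption: this is how the record derives its value; the source
    does not give the formula.
    """
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    return 100.0 * start / genome_length


# Start and Length copied from this record.
print(round(centisome(2237201, 5352772), 1))  # -> 41.8
```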

GC content: 69.21
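
A minimal sketch of how a GC percentage like the one above could be recomputed from the gene sequence. It assumes GC content means (G + C) divided by the total number of bases, times 100; how the source database treats ambiguous bases is not stated, so this version simply counts them in the denominator.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks (as in a wrapped FASTA body) are ignored.
    Assumption: every remaining character counts toward the length.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# Toy input, not the gene itself: 4 of 6 bases are G or C.
print(round(gc_content("GGCC\nAT"), 2))  # -> 66.67
```

To check the record, pass the full 5313-base sequence (without the ">5313_bases" header line) to gc_content.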

Gene sequence:

>5313_bases
ATGAGCGGCAAACCCGCAGCCCGGCAGGGCGACTTGACAAAAAAAGGCGGCCCGATCGTCCAGGGCTCGGCGACGGTGCT
GATCGGCTCGGCGGGAGGCGTGGCGTGCTCGGTGTGCCCCGGGGGCATGGCGGTGGGCAACCCGGTGAACCCGGCGCTGG
GCGCGAAGGTGCTCACGGGGGGCGATGAACTGGACTTCGCGCTGCCCGGGCCATTGCCGCTGGCGTGGCAGCGGGTGTAC
AGCAGCTATGTGAACGCGGAGCATGGCGCGGCGTGCGGGCTGCTGGGGTATGGGTGGAAGCTGCCGCTGGAACTGCGGTT
GGTACTGCAGGATGAGCGCGCGGTGCTCTTCGATGCATCCGGACGGGCGATCACGTTCGAGGAGCCCTTGCAGCCGGGGC
AGGCGCTGCACAGCAGCAGCGAGGACCTGTGGCTGTTGCGGGGTGGGGGCATGGCTGGCTCCACGGCCCCATCCTCCACG
CCTGCCGAACTGCTGCCCTGGGCGCAGCAACCGCGCTGGTCGCACGTGCCGGCGGTGCTGCGTGCGGACCCGGGCTGCGT
GATCGCCGTGCCGGGCGCGGGCGGTGCTGGTGCGCCGGTCTGGGTGTTCCTGACGGCGGGCGTATCGGGCGACGGAACAG
CTTCGGGCCATGTCCTGCATGCCGTGATCGACCGCTTCGGCCGCAGCCAGCGCTACCAGTGGGGCACGGAAGGCGAACAG
CAGGGCCGCGTCGTAGGCATCACCGATGGCAGCGGGCGACGCTATGCGCTGCGCTACGAGCGCATCGCGCAGGATGCCGG
GACAGCACAGACAAGTGCCCATCCCCTGCTGCAACCCGACGACGGTGTAAGGCTGGTGGGCGTGGACTGCACCTTCAGCC
CGCTCGACCCTGCAGTGATACCCGGCGCGGCCCCGCGCCCGCAACCCCTGGTGGGCTACCGCTACGACAGCGCAGGCAAC
CTGGCCGAGGTGCTGGGCGCGGACGGCACCGTGCTGCGCCGCTTCGGCTACGACGCATTGCACCGGATGACGGAGCACCA
GGTGCGCCAGGGCCCCAGGCACCGCTACGTCTATGAGGACCAGACCGCGCAGGGCCGTCGCCAGGGGCTGGCGGCCCGCC
CGGGTGCACGCGTGGCCGAGCAGCACAACGAAGAGGGGCTGTCATACTTCTTCGAGTACAGCCAGGCATCCGCGCAACCT
GCGCCATCCACGCACACCACGCAGGCGACCCAGGGCCCGGAGGATGACCAGGATGGCCCGACTCCGGCCACCGCGTCTTT
GCCCAGCAATCGCCAAAGCAGCACGTTGGTGCACGACAGCCTGGGCCGTACCACGGCCTACCACTTCGAAGGCGAAGGCG
GCCTCAAGCGCCTGGTGCGTCTCGTGGCCCCGGATGGGACCGAACAGAGCTACCGCCACGACAGCGCCGGCCGTCGCCTC
TCGGCCACCGATGCGCTGGGCCGCACGACGTGGTGGCGCTACGACGGCGCGGGCCGGCTGCTGGGTGTGCAGGGCCCGGA
CGGGCGCAGCACGCAACAGCACTGGGGCGCGGCGGGCAGCGCGCAGGACGGCTTGCTGTTGGCCAGCCAGGACGCGGCGG
GGCTGCGCACGCACTTCCGCTACGACGACTGGGGCCGGCTGGTGGAGGTGGCGATGGCACCAGCCGGGAGCGGGGACGGA
GCCACGAACACCCAGGTCCTCACCACCCGCTTCGAATACGAACAACCCCGGCAGGATGCCGCCACCAGCACAACTGCCGG
CAACATCGCCTTCCCACCGCACACGCTCGCCTGGTGGGACCAGCCCGTGGCGATGATCGACGCGCAGGGCGGGCGTAGCC
TCTACGCCTACAACGCCTGCGGCCAGCTCGCGCGGCACATCGACTGCTCCGGCCGCAGCCAGTCCTGGCGCCACGGCGCC
TGGGGCGAGGTGCTGGAGGCGACGGACGCGCTGGGCCAGCGCACCCAGCTGCACCACGTGCTGGAGCATGGCGCGCTGCG
GCTGGTGGGCGTGCAGCAGCCGGGCAACACGGCGGTGCGCCACCGCTGGTCGGCCGCCGGCACGCTGGAGGCGACCACCT
ACGGCGCGCACGACGTGCTGGAAGGCACGGGCGAGCCCGCGGGCACCAGCACGACAGTCACCTACCGCCACGACCTCTGG
GGCCGGGTGGTGGAGCAGGTGCAGGCGGGCCGGGGCGTGCAGCTGCGCTACGACGTGGCCGGGCGGCTGCAGGAACTCGT
CAACGAGAACGGCGATGTCACACGCTTCGTGCACGACGCGGCCGACCGGCTGGTGCAGGAAGTGGGCTTCGATGGGCGCA
GCCAGGTGTATGGCTACGACGCGGCCGGCCAGCTCACGCACACGGGCGACGGGCATGGCGAAGGCCACCACCCGGGAGCG
GCGGCGCGGCCGGAACTCGGGGCGGTGGTGCGCACGCGGCTGCACTACGACCTGGGCGGCCGGCTGGTGGCGCGTGTGGC
CGTACGGCTGCCGGCAGCGGCATTGGCCGGTGTTGGCGTCAGCGTCGCAGGTGATGCGCCCGACAACCCCACGGCTGAAC
CGCATGCCGTGCTGCAGATCCAGCGCTTCGGGCACACCGCGGCCGGAGCCCTGCTGCAGGCGCGGACCTGGGAGACAGAA
CTGCCCCACGGGATCGAGCCCATGGCCCTGCCAACGGCCCTGTCTCCCGACTCCTCGCGCCCAGGCGCTGCCACGGACCG
ACCCATCCCCCTCGCCGAGCGCTGGCTGGCACTGGACACGCAGGCACTGTTGTCCCTGCTGGACCGCCCCAGCGATCCGA
CCCAGGCCCCGCTGGCAGCCGCGCTGCAGGCGCAGCGGCTGCAGCCCGAGGCCCGCGTGGCCCTGGCGCGCGATGCCTTC
GGGCGTGTCTGTGGAGAGACCCAGACGCTGTACCGGCAGGCCACCCAGCCGCAAGCGCCCCATGCCGGCGGCGAGCCGCC
CGTGGAGTTCGAGCACGCGATCACCCACACGCTGGGCCCGCTGGGCCAGCGCACGGCCACTCAGGCGCAGGGCCTGGGCA
CGCTGCAGTGGCTGGCCTACGGCTCGGGCCATGTGCACGGGCTGCTGCTGGACGGCCAGCCGCTGGTGGACTGGGAGCGC
GATGCGCTGCACCGCGAGGTGGGGCGTACGCTGCACGTGCTCGAAGGCAAAGACAACGAAGACCTGCACGCCATCGTGCA
TGCGCGGCAGCTCGACCCGATGGGCCGCATGCTGCACCAGGACTGGCGCGGCCTGCGCCATGCGACGCCGGTGGTTCCTG
CCGACCCGGCCAACGCCTTCGGAACGGGTCCTTCGTCCGCCGCGGGCCGCATCGCCCCGGCCCTGGGCCCGCTGTCCACG
CTCGCGCAGCGCCGCTACTGGTACGACCCGCTGGGCCAACTGGTCGGCGTGCAGACCCCGGGCGAAGCCACGCGCTACGG
CTACGACGCCTGGCAGCGCCTGAGCGGGCTGCACCGCGCGGGCCAGGGTACGCCGGAAGTGCAAGCGCACTGGGCGCTGG
ACGCGGCGGGCAACCGCCTGCCCGCGTCCCTGGATGCGCGCGCGGCCGGGTCTTCTGCGAGCGAGCGCCAGGGCTGGGCA
CGGCAGGTGCGCGAGAACCTGCACGATGCGGACTTCGACCTGCTGCGCGCGGGCGACGGGCCCGGCGAAGGTGCAGGCCC
GGTCACGCGCTGGCCGGGCAACCGCATCGGGTGGAGCACCCCCGAACCGGATGCCGATGGCACCAGCATCGCTGGCAGCG
ACGGCAACGGCAACGGCAACGGCAACGGCAACGGCAACGGCAACGGCATCCTGATCCGCTACCGCTACGACGCCTTCGGC
AACCGGGTGCAGGCGCTGCACGCCGACGGCCGGGCGCAGCGGCTGCGCTACGACGCGCTGCACCAATTGCGCGAGGTCTG
GCAGCGCGAGGCAGCGGGTGGCGCCTGGCAGCGGGTGGCCTGCTACCGCTACGACCCCTTCGGGCGGCGGCTGGCCAAGA
CCGTCTTCGGGCACGGCAACAGAAGCCGAAACGGCAACGCACCTGCATCGGGACCGGCGACCACCACCTATGCCGGCTGG
GACGGCGACCGCCTCGTGCACAACGAAGGCCCGCAGGGGCTGCAGCACGTGCTCTACGAGCCCGGCTCCTTCGCGCCGCT
GCTGCGCCTGGAGCGCGAGCAGGCCATTCCCACGGCCATGCAGGCCATGCTGGTGATGGAAGAAAACCGTGGAAGCATCG
CCAATGATGGTCCGGATGGCGGTGAATTCCCCGCCGCAGCCCTGTTCGCGGGCCTGCCGCGAGCCCAGCGCGAACTGCTT
GAGCGTGCCCTGCACGACGCCACGGGCCCCCAGGGGGATGCACTACTGGCGCGCCTGCGGGCAGGCCTACCCGATGAGGC
CGGCGCATTGCTGGCTGCGGGGGTGCAGTCCGTGCGCCGGCAACAGCAGGTCGCCACGCAAGCCCACTCGACGCGTATCC
GCCATTTCCTGTGCGATCACCTGGGCACGCCTATTGCCTTGGTGGACGCCAACGGCCCGCAGGCGGGCCTGGTCACCTGG
GCCGCGACCTACCACGCCTGGGGTGCCGTGCGCGAGGAATACGACCCGCACGGCATCGGCCAGGACATCCGTTTCCAGGG
CCAGCAGCTCGATGTGGAGACAGGGTTGCACTACAACCGGTTCCGGTACTACGACCCCCTGCTGGGGCAGTATGTGACGC
AGGATCCGATTGGGCTGCTTGGCGGCTTGAATAAGTTCACCTATCCGGGAAACCCGATAAGCTGGATAGACCCCCTGGGA
TTGTCTGGAATATTGACCATCCAATCTTCAGGCAATGGAAATCCGTTGTCTGGCCATTCATGGATAACGTACGCTCCAGA
TGGCGGTTATCCGACTACCTATGGTACTTGGGGGAATAATCCCACGGGCCAAGGAAATGGGCTCTTCGAGAATTTAGAGA
TTGGCCGTAATGCAGATGCCACGCGATCTATGCGAATCAACGATGAACAAGAGAAGGAGTTGATGGCGAAAATTGAAGAC
TACAAGAATAGGAAAGAAAACGCCTGGAAGCTTGGTGCTCCATGCTCGAGCTTTGCAAGAGATGCTTGGCAGACTGCAAC
GGGTGAAAATTTGAACTCGAACCTTGGCCCCATCAGTAACCCGAGTACTCTAAAGAATTCGATTATTAATGCGAATGGAG
GGAAGGTAAGTGAATTCGCAACCAAGCCCAACGGGACTTCAAGTGCGTCCTTCGGTTCTTTTGGTCGCTCTTCAGGTAGT
GTTTCACTAAACTCACTTGGCAGTTCTTTATGA

Upstream 100 bases:

>100_bases
GAGAGCCACAGGTACTGCTTGCTTGAAGGAGCGCAGTGCCTTGGCCCCGTGGGCGCCACGACGAAGAACAAATAAATAAG
CAAGCAAGGGGAACCAGAAA

Downstream 100 bases:

>100_bases
GATTTATACGCTTATTTTTTATGGGACTGATCTCTTTTATTTTATCGGCTTGCTCTGCGACTGTGCTTGAGTACAAGGAC
GTCGGAGATTTTGAGGTTCG

Product: YD repeat-containing protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 1770; Mature: 1769
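
The two residue counts follow from the gene coordinates. The sketch below assumes inclusive coordinates, one terminal stop codon that is not translated, and a mature form that differs only by removal of the initiator methionine (consistent with the Mature sequence in this record starting at SGKP...). The function name is hypothetical.

```python
def coding_lengths(start: int, end: int) -> tuple[int, int, int]:
    """Return (bases, translated residues, mature residues) for a CDS.

    Assumptions: coordinates are inclusive, the last codon is a stop
    codon, and maturation removes only the initiator Met.
    """
    n_bases = end - start + 1
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = n_bases // 3
    translated = codons - 1   # drop the stop codon
    mature = translated - 1   # drop the initiator methionine
    return n_bases, translated, mature


# Start and End copied from this record.
print(coding_lengths(2237201, 2242513))  # -> (5313, 1770, 1769)
```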

Protein sequence:

>1770_residues
MSGKPAARQGDLTKKGGPIVQGSATVLIGSAGGVACSVCPGGMAVGNPVNPALGAKVLTGGDELDFALPGPLPLAWQRVY
SSYVNAEHGAACGLLGYGWKLPLELRLVLQDERAVLFDASGRAITFEEPLQPGQALHSSSEDLWLLRGGGMAGSTAPSST
PAELLPWAQQPRWSHVPAVLRADPGCVIAVPGAGGAGAPVWVFLTAGVSGDGTASGHVLHAVIDRFGRSQRYQWGTEGEQ
QGRVVGITDGSGRRYALRYERIAQDAGTAQTSAHPLLQPDDGVRLVGVDCTFSPLDPAVIPGAAPRPQPLVGYRYDSAGN
LAEVLGADGTVLRRFGYDALHRMTEHQVRQGPRHRYVYEDQTAQGRRQGLAARPGARVAEQHNEEGLSYFFEYSQASAQP
APSTHTTQATQGPEDDQDGPTPATASLPSNRQSSTLVHDSLGRTTAYHFEGEGGLKRLVRLVAPDGTEQSYRHDSAGRRL
SATDALGRTTWWRYDGAGRLLGVQGPDGRSTQQHWGAAGSAQDGLLLASQDAAGLRTHFRYDDWGRLVEVAMAPAGSGDG
ATNTQVLTTRFEYEQPRQDAATSTTAGNIAFPPHTLAWWDQPVAMIDAQGGRSLYAYNACGQLARHIDCSGRSQSWRHGA
WGEVLEATDALGQRTQLHHVLEHGALRLVGVQQPGNTAVRHRWSAAGTLEATTYGAHDVLEGTGEPAGTSTTVTYRHDLW
GRVVEQVQAGRGVQLRYDVAGRLQELVNENGDVTRFVHDAADRLVQEVGFDGRSQVYGYDAAGQLTHTGDGHGEGHHPGA
AARPELGAVVRTRLHYDLGGRLVARVAVRLPAAALAGVGVSVAGDAPDNPTAEPHAVLQIQRFGHTAAGALLQARTWETE
LPHGIEPMALPTALSPDSSRPGAATDRPIPLAERWLALDTQALLSLLDRPSDPTQAPLAAALQAQRLQPEARVALARDAF
GRVCGETQTLYRQATQPQAPHAGGEPPVEFEHAITHTLGPLGQRTATQAQGLGTLQWLAYGSGHVHGLLLDGQPLVDWER
DALHREVGRTLHVLEGKDNEDLHAIVHARQLDPMGRMLHQDWRGLRHATPVVPADPANAFGTGPSSAAGRIAPALGPLST
LAQRRYWYDPLGQLVGVQTPGEATRYGYDAWQRLSGLHRAGQGTPEVQAHWALDAAGNRLPASLDARAAGSSASERQGWA
RQVRENLHDADFDLLRAGDGPGEGAGPVTRWPGNRIGWSTPEPDADGTSIAGSDGNGNGNGNGNGNGNGILIRYRYDAFG
NRVQALHADGRAQRLRYDALHQLREVWQREAAGGAWQRVACYRYDPFGRRLAKTVFGHGNRSRNGNAPASGPATTTYAGW
DGDRLVHNEGPQGLQHVLYEPGSFAPLLRLEREQAIPTAMQAMLVMEENRGSIANDGPDGGEFPAAALFAGLPRAQRELL
ERALHDATGPQGDALLARLRAGLPDEAGALLAAGVQSVRRQQQVATQAHSTRIRHFLCDHLGTPIALVDANGPQAGLVTW
AATYHAWGAVREEYDPHGIGQDIRFQGQQLDVETGLHYNRFRYYDPLLGQYVTQDPIGLLGGLNKFTYPGNPISWIDPLG
LSGILTIQSSGNGNPLSGHSWITYAPDGGYPTTYGTWGNNPTGQGNGLFENLEIGRNADATRSMRINDEQEKELMAKIED
YKNRKENAWKLGAPCSSFARDAWQTATGENLNSNLGPISNPSTLKNSIINANGGKVSEFATKPNGTSSASFGSFGRSSGS
VSLNSLGSSL

Sequences:

>Translated_1770_residues
MSGKPAARQGDLTKKGGPIVQGSATVLIGSAGGVACSVCPGGMAVGNPVNPALGAKVLTGGDELDFALPGPLPLAWQRVY
SSYVNAEHGAACGLLGYGWKLPLELRLVLQDERAVLFDASGRAITFEEPLQPGQALHSSSEDLWLLRGGGMAGSTAPSST
PAELLPWAQQPRWSHVPAVLRADPGCVIAVPGAGGAGAPVWVFLTAGVSGDGTASGHVLHAVIDRFGRSQRYQWGTEGEQ
QGRVVGITDGSGRRYALRYERIAQDAGTAQTSAHPLLQPDDGVRLVGVDCTFSPLDPAVIPGAAPRPQPLVGYRYDSAGN
LAEVLGADGTVLRRFGYDALHRMTEHQVRQGPRHRYVYEDQTAQGRRQGLAARPGARVAEQHNEEGLSYFFEYSQASAQP
APSTHTTQATQGPEDDQDGPTPATASLPSNRQSSTLVHDSLGRTTAYHFEGEGGLKRLVRLVAPDGTEQSYRHDSAGRRL
SATDALGRTTWWRYDGAGRLLGVQGPDGRSTQQHWGAAGSAQDGLLLASQDAAGLRTHFRYDDWGRLVEVAMAPAGSGDG
ATNTQVLTTRFEYEQPRQDAATSTTAGNIAFPPHTLAWWDQPVAMIDAQGGRSLYAYNACGQLARHIDCSGRSQSWRHGA
WGEVLEATDALGQRTQLHHVLEHGALRLVGVQQPGNTAVRHRWSAAGTLEATTYGAHDVLEGTGEPAGTSTTVTYRHDLW
GRVVEQVQAGRGVQLRYDVAGRLQELVNENGDVTRFVHDAADRLVQEVGFDGRSQVYGYDAAGQLTHTGDGHGEGHHPGA
AARPELGAVVRTRLHYDLGGRLVARVAVRLPAAALAGVGVSVAGDAPDNPTAEPHAVLQIQRFGHTAAGALLQARTWETE
LPHGIEPMALPTALSPDSSRPGAATDRPIPLAERWLALDTQALLSLLDRPSDPTQAPLAAALQAQRLQPEARVALARDAF
GRVCGETQTLYRQATQPQAPHAGGEPPVEFEHAITHTLGPLGQRTATQAQGLGTLQWLAYGSGHVHGLLLDGQPLVDWER
DALHREVGRTLHVLEGKDNEDLHAIVHARQLDPMGRMLHQDWRGLRHATPVVPADPANAFGTGPSSAAGRIAPALGPLST
LAQRRYWYDPLGQLVGVQTPGEATRYGYDAWQRLSGLHRAGQGTPEVQAHWALDAAGNRLPASLDARAAGSSASERQGWA
RQVRENLHDADFDLLRAGDGPGEGAGPVTRWPGNRIGWSTPEPDADGTSIAGSDGNGNGNGNGNGNGNGILIRYRYDAFG
NRVQALHADGRAQRLRYDALHQLREVWQREAAGGAWQRVACYRYDPFGRRLAKTVFGHGNRSRNGNAPASGPATTTYAGW
DGDRLVHNEGPQGLQHVLYEPGSFAPLLRLEREQAIPTAMQAMLVMEENRGSIANDGPDGGEFPAAALFAGLPRAQRELL
ERALHDATGPQGDALLARLRAGLPDEAGALLAAGVQSVRRQQQVATQAHSTRIRHFLCDHLGTPIALVDANGPQAGLVTW
AATYHAWGAVREEYDPHGIGQDIRFQGQQLDVETGLHYNRFRYYDPLLGQYVTQDPIGLLGGLNKFTYPGNPISWIDPLG
LSGILTIQSSGNGNPLSGHSWITYAPDGGYPTTYGTWGNNPTGQGNGLFENLEIGRNADATRSMRINDEQEKELMAKIED
YKNRKENAWKLGAPCSSFARDAWQTATGENLNSNLGPISNPSTLKNSIINANGGKVSEFATKPNGTSSASFGSFGRSSGS
VSLNSLGSSL
>Mature_1769_residues
SGKPAARQGDLTKKGGPIVQGSATVLIGSAGGVACSVCPGGMAVGNPVNPALGAKVLTGGDELDFALPGPLPLAWQRVYS
SYVNAEHGAACGLLGYGWKLPLELRLVLQDERAVLFDASGRAITFEEPLQPGQALHSSSEDLWLLRGGGMAGSTAPSSTP
AELLPWAQQPRWSHVPAVLRADPGCVIAVPGAGGAGAPVWVFLTAGVSGDGTASGHVLHAVIDRFGRSQRYQWGTEGEQQ
GRVVGITDGSGRRYALRYERIAQDAGTAQTSAHPLLQPDDGVRLVGVDCTFSPLDPAVIPGAAPRPQPLVGYRYDSAGNL
AEVLGADGTVLRRFGYDALHRMTEHQVRQGPRHRYVYEDQTAQGRRQGLAARPGARVAEQHNEEGLSYFFEYSQASAQPA
PSTHTTQATQGPEDDQDGPTPATASLPSNRQSSTLVHDSLGRTTAYHFEGEGGLKRLVRLVAPDGTEQSYRHDSAGRRLS
ATDALGRTTWWRYDGAGRLLGVQGPDGRSTQQHWGAAGSAQDGLLLASQDAAGLRTHFRYDDWGRLVEVAMAPAGSGDGA
TNTQVLTTRFEYEQPRQDAATSTTAGNIAFPPHTLAWWDQPVAMIDAQGGRSLYAYNACGQLARHIDCSGRSQSWRHGAW
GEVLEATDALGQRTQLHHVLEHGALRLVGVQQPGNTAVRHRWSAAGTLEATTYGAHDVLEGTGEPAGTSTTVTYRHDLWG
RVVEQVQAGRGVQLRYDVAGRLQELVNENGDVTRFVHDAADRLVQEVGFDGRSQVYGYDAAGQLTHTGDGHGEGHHPGAA
ARPELGAVVRTRLHYDLGGRLVARVAVRLPAAALAGVGVSVAGDAPDNPTAEPHAVLQIQRFGHTAAGALLQARTWETEL
PHGIEPMALPTALSPDSSRPGAATDRPIPLAERWLALDTQALLSLLDRPSDPTQAPLAAALQAQRLQPEARVALARDAFG
RVCGETQTLYRQATQPQAPHAGGEPPVEFEHAITHTLGPLGQRTATQAQGLGTLQWLAYGSGHVHGLLLDGQPLVDWERD
ALHREVGRTLHVLEGKDNEDLHAIVHARQLDPMGRMLHQDWRGLRHATPVVPADPANAFGTGPSSAAGRIAPALGPLSTL
AQRRYWYDPLGQLVGVQTPGEATRYGYDAWQRLSGLHRAGQGTPEVQAHWALDAAGNRLPASLDARAAGSSASERQGWAR
QVRENLHDADFDLLRAGDGPGEGAGPVTRWPGNRIGWSTPEPDADGTSIAGSDGNGNGNGNGNGNGNGILIRYRYDAFGN
RVQALHADGRAQRLRYDALHQLREVWQREAAGGAWQRVACYRYDPFGRRLAKTVFGHGNRSRNGNAPASGPATTTYAGWD
GDRLVHNEGPQGLQHVLYEPGSFAPLLRLEREQAIPTAMQAMLVMEENRGSIANDGPDGGEFPAAALFAGLPRAQRELLE
RALHDATGPQGDALLARLRAGLPDEAGALLAAGVQSVRRQQQVATQAHSTRIRHFLCDHLGTPIALVDANGPQAGLVTWA
ATYHAWGAVREEYDPHGIGQDIRFQGQQLDVETGLHYNRFRYYDPLLGQYVTQDPIGLLGGLNKFTYPGNPISWIDPLGL
SGILTIQSSGNGNPLSGHSWITYAPDGGYPTTYGTWGNNPTGQGNGLFENLEIGRNADATRSMRINDEQEKELMAKIEDY
KNRKENAWKLGAPCSSFARDAWQTATGENLNSNLGPISNPSTLKNSIINANGGKVSEFATKPNGTSSASFGSFGRSSGSV
SLNSLGSSL

Specific function: Rhs elements have a nonessential function. They may play an important role in the natural ecology of the cell [H]

COG id: COG3209

COG function: function code M; Rhs family protein

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the RHS family [H]

Homologues:

Organism=Escherichia coli, GI1786706, Length=820, Percent_Identity=35.1219512195122, Blast_Score=359, Evalue=1e-100
Organism=Escherichia coli, GI1786917, Length=853, Percent_Identity=33.5287221570926, Blast_Score=345, Evalue=1e-95
Organism=Escherichia coli, GI1790020, Length=860, Percent_Identity=33.2558139534884, Blast_Score=339, Evalue=1e-93
Organism=Escherichia coli, GI48994942, Length=860, Percent_Identity=33.2558139534884, Blast_Score=338, Evalue=1e-93
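
The homologue lines above follow a simple comma-separated key=value layout, with the GI number as a bare "GI..." token. A minimal parsing sketch, assuming that layout holds for every line; the function name and the output dictionary keys are illustrative, not part of any database API.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line of this record into a dictionary.

    Assumption: fields are separated by commas, key=value pairs use '=',
    and the GI number appears as a bare token starting with 'GI'.
    A trailing comma, if present, is ignored.
    """
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]
    # Convert the numeric fields to numbers.
    record["Length"] = int(record["Length"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Evalue"] = float(record["Evalue"])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1786706, Length=820, "
    "Percent_Identity=35.1219512195122, Blast_Score=359, Evalue=1e-100,"
)
print(hit["GI"], hit["Length"], round(hit["Percent_Identity"], 1))
```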

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001826
- InterPro:   IPR022385
- InterPro:   IPR006530 [H]

Pfam domain/function: PF03527 RHS; PF05593 RHS_repeat [H]

EC number: NA

Molecular weight: Translated: 188932; Mature: 188801

Theoretical pI: Translated: 6.64; Mature: 6.64

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
0.8 %Met     (Translated Protein)
1.4 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
0.7 %Met     (Mature Protein)
1.4 %Cys+Met (Mature Protein)
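
Percentages of this kind can be recomputed from the protein sequences with a short residue counter. The sketch below assumes each figure is the residue count divided by the sequence length, rounded to one decimal place independently. Under that assumption a combined Cys+Met value need not equal the sum of the two rounded values, which is consistent with the Mature figures above (0.6 and 0.7 against 1.4).

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the given one-letter residues in a protein sequence.

    Whitespace is ignored. Assumption: the result is rounded to one
    decimal place, as the values in this record appear to be.
    """
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    count = sum(1 for aa in seq if aa in residues.upper())
    return round(100.0 * count / len(seq), 1)


# Toy sequence, not the protein in this record.
toy = "MACMKLGG"
print(residue_percent(toy, "C"))   # -> 12.5
print(residue_percent(toy, "M"))   # -> 25.0
print(residue_percent(toy, "CM"))  # -> 37.5
```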

Secondary structure:

>Translated Secondary Structure
MSGKPAARQGDLTKKGGPIVQGSATVLIGSAGGVACSVCPGGMAVGNPVNPALGAKVLTG
CCCCCCCCCCCCCCCCCCEEECCCEEEEECCCCCEEEECCCCCCCCCCCCHHHCCEEEEC
GDELDFALPGPLPLAWQRVYSSYVNAEHGAACGLLGYGWKLPLELRLVLQDERAVLFDAS
CCCCCEECCCCCHHHHHHHHHHHHCCCCCCEEEEECCCCCCCEEEEEEEECCCEEEEECC
GRAITFEEPLQPGQALHSSSEDLWLLRGGGMAGSTAPSSTPAELLPWAQQPRWSHVPAVL
CCEEEECCCCCCHHHHHCCCCCEEEEECCCCCCCCCCCCCCHHHCCCCCCCCCCCCCCCE
RADPGCVIAVPGAGGAGAPVWVFLTAGVSGDGTASGHVLHAVIDRFGRSQRYQWGTEGEQ
ECCCCEEEEECCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCC
QGRVVGITDGSGRRYALRYERIAQDAGTAQTSAHPLLQPDDGVRLVGVDCTFSPLDPAVI
CCCEEEEECCCCCEEEHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCC
PGAAPRPQPLVGYRYDSAGNLAEVLGADGTVLRRFGYDALHRMTEHQVRQGPRHRYVYED
CCCCCCCCCCEEEEECCCCCHHHHHCCCCHHHHHCCHHHHHHHHHHHHHCCCCCCEEECC
QTAQGRRQGLAARPGARVAEQHNEEGLSYFFEYSQASAQPAPSTHTTQATQGPEDDQDGP
CHHHHHHCCCCCCCCCHHHHHHCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCC
TPATASLPSNRQSSTLVHDSLGRTTAYHFEGEGGLKRLVRLVAPDGTEQSYRHDSAGRRL
CCCCCCCCCCCCCCHHHHHCCCCCEEEEECCCHHHHHHHHHHCCCCCCHHHHHHHCCCEE
SATDALGRTTWWRYDGAGRLLGVQGPDGRSTQQHWGAAGSAQDGLLLASQDAAGLRTHFR
EHHHHHCCCEEEEECCCCCEEEECCCCCCCHHHHCCCCCCCCCCEEEECCCCCCHHHHCC
YDDWGRLVEVAMAPAGSGDGATNTQVLTTRFEYEQPRQDAATSTTAGNIAFPPHTLAWWD
CCHHHHHHHHHCCCCCCCCCCCCCEEEEEEECCCCCHHHHCCCCCCCCCCCCCCCHHHCC
QPVAMIDAQGGRSLYAYNACGQLARHIDCSGRSQSWRHGAWGEVLEATDALGQRTQLHHV
CCEEEEECCCCCEEEEEHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHH
LEHGALRLVGVQQPGNTAVRHRWSAAGTLEATTYGAHDVLEGTGEPAGTSTTVTYRHDLW
HHCCCEEEEEECCCCCHHHHEECCCCCCEEEECCCCHHHHCCCCCCCCCCEEEEEHHHHH
GRVVEQVQAGRGVQLRYDVAGRLQELVNENGDVTRFVHDAADRLVQEVGFDGRSQVYGYD
HHHHHHHHCCCCCEEEEHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCCCEEEEEC
AAGQLTHTGDGHGEGHHPGAAARPELGAVVRTRLHYDLGGRLVARVAVRLPAAALAGVGV
CCCCEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHCCHHHHHCCCE
SVAGDAPDNPTAEPHAVLQIQRFGHTAAGALLQARTWETELPHGIEPMALPTALSPDSSR
EEECCCCCCCCCCCHHEEEEHHHCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCC
PGAATDRPIPLAERWLALDTQALLSLLDRPSDPTQAPLAAALQAQRLQPEARVALARDAF
CCCCCCCCCCHHHHHHHHCHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHH
GRVCGETQTLYRQATQPQAPHAGGEPPVEFEHAITHTLGPLGQRTATQAQGLGTLQWLAY
HHHHCHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHCCCCCHHHHHHCCCCEEEEEEE
GSGHVHGLLLDGQPLVDWERDALHREVGRTLHVLEGKDNEDLHAIVHARQLDPMGRMLHQ
CCCCEEEEEECCCCCCCCHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHCCHHHHHHHH
DWRGLRHATPVVPADPANAFGTGPSSAAGRIAPALGPLSTLAQRRYWYDPLGQLVGVQTP
HHHHHHHCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHCCCCCC
GEATRYGYDAWQRLSGLHRAGQGTPEVQAHWALDAAGNRLPASLDARAAGSSASERQGWA
CCHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHH
RQVRENLHDADFDLLRAGDGPGEGAGPVTRWPGNRIGWSTPEPDADGTSIAGSDGNGNGN
HHHHHHHCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCCCCCC
GNGNGNGNGILIRYRYDAFGNRVQALHADGRAQRLRYDALHQLREVWQREAAGGAWQRVA
CCCCCCCCEEEEEEEECCCCCEEEEEECCCHHHHHHHHHHHHHHHHHHHHHCCCHHHEEE
CYRYDPFGRRLAKTVFGHGNRSRNGNAPASGPATTTYAGWDGDRLVHNEGPQGLQHVLYE
EEEECHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEECCCCCCEEECCCCHHHHHHHHCC
PGSFAPLLRLEREQAIPTAMQAMLVMEENRGSIANDGPDGGEFPAAALFAGLPRAQRELL
CCCCCHHHHHHHHHHHHHHHHHHHHEECCCCCCCCCCCCCCCCHHHHHHHCCCHHHHHHH
ERALHDATGPQGDALLARLRAGLPDEAGALLAAGVQSVRRQQQVATQAHSTRIRHFLCDH
HHHHHHCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
LGTPIALVDANGPQAGLVTWAATYHAWGAVREEYDPHGIGQDIRFQGQQLDVETGLHYNR
CCCCEEEEECCCCCCCEEEEHHHHHHHHHHHHCCCCCCCCCCCEECCCEEEHHCCCCCCH
FRYYDPLLGQYVTQDPIGLLGGLNKFTYPGNPISWIDPLGLSGILTIQSSGNGNPLSGHS
HHHHCHHHHHHHCCCCHHHHHCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCC
WITYAPDGGYPTTYGTWGNNPTGQGNGLFENLEIGRNADATRSMRINDEQEKELMAKIED
EEEECCCCCCCCCCCCCCCCCCCCCCCCEECCCCCCCCCCCCEECCCCHHHHHHHHHHHH
YKNRKENAWKLGAPCSSFARDAWQTATGENLNSNLGPISNPSTLKNSIINANGGKVSEFA
HHCCCCCCEECCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHCCCCCCEEHHHC
TKPNGTSSASFGSFGRSSGSVSLNSLGSSL
CCCCCCCCCCCCCCCCCCCCEEHHHCCCCC
>Mature Secondary Structure 
SGKPAARQGDLTKKGGPIVQGSATVLIGSAGGVACSVCPGGMAVGNPVNPALGAKVLTG
CCCCCCCCCCCCCCCCCEEECCCEEEEECCCCCEEEECCCCCCCCCCCCHHHCCEEEEC
GDELDFALPGPLPLAWQRVYSSYVNAEHGAACGLLGYGWKLPLELRLVLQDERAVLFDAS
CCCCCEECCCCCHHHHHHHHHHHHCCCCCCEEEEECCCCCCCEEEEEEEECCCEEEEECC
GRAITFEEPLQPGQALHSSSEDLWLLRGGGMAGSTAPSSTPAELLPWAQQPRWSHVPAVL
CCEEEECCCCCCHHHHHCCCCCEEEEECCCCCCCCCCCCCCHHHCCCCCCCCCCCCCCCE
RADPGCVIAVPGAGGAGAPVWVFLTAGVSGDGTASGHVLHAVIDRFGRSQRYQWGTEGEQ
ECCCCEEEEECCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCC
QGRVVGITDGSGRRYALRYERIAQDAGTAQTSAHPLLQPDDGVRLVGVDCTFSPLDPAVI
CCCEEEEECCCCCEEEHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCC
PGAAPRPQPLVGYRYDSAGNLAEVLGADGTVLRRFGYDALHRMTEHQVRQGPRHRYVYED
CCCCCCCCCCEEEEECCCCCHHHHHCCCCHHHHHCCHHHHHHHHHHHHHCCCCCCEEECC
QTAQGRRQGLAARPGARVAEQHNEEGLSYFFEYSQASAQPAPSTHTTQATQGPEDDQDGP
CHHHHHHCCCCCCCCCHHHHHHCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCC
TPATASLPSNRQSSTLVHDSLGRTTAYHFEGEGGLKRLVRLVAPDGTEQSYRHDSAGRRL
CCCCCCCCCCCCCCHHHHHCCCCCEEEEECCCHHHHHHHHHHCCCCCCHHHHHHHCCCEE
SATDALGRTTWWRYDGAGRLLGVQGPDGRSTQQHWGAAGSAQDGLLLASQDAAGLRTHFR
EHHHHHCCCEEEEECCCCCEEEECCCCCCCHHHHCCCCCCCCCCEEEECCCCCCHHHHCC
YDDWGRLVEVAMAPAGSGDGATNTQVLTTRFEYEQPRQDAATSTTAGNIAFPPHTLAWWD
CCHHHHHHHHHCCCCCCCCCCCCCEEEEEEECCCCCHHHHCCCCCCCCCCCCCCCHHHCC
QPVAMIDAQGGRSLYAYNACGQLARHIDCSGRSQSWRHGAWGEVLEATDALGQRTQLHHV
CCEEEEECCCCCEEEEEHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHH
LEHGALRLVGVQQPGNTAVRHRWSAAGTLEATTYGAHDVLEGTGEPAGTSTTVTYRHDLW
HHCCCEEEEEECCCCCHHHHEECCCCCCEEEECCCCHHHHCCCCCCCCCCEEEEEHHHHH
GRVVEQVQAGRGVQLRYDVAGRLQELVNENGDVTRFVHDAADRLVQEVGFDGRSQVYGYD
HHHHHHHHCCCCCEEEEHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCCCEEEEEC
AAGQLTHTGDGHGEGHHPGAAARPELGAVVRTRLHYDLGGRLVARVAVRLPAAALAGVGV
CCCCEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHCCHHHHHCCCE
SVAGDAPDNPTAEPHAVLQIQRFGHTAAGALLQARTWETELPHGIEPMALPTALSPDSSR
EEECCCCCCCCCCCHHEEEEHHHCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCC
PGAATDRPIPLAERWLALDTQALLSLLDRPSDPTQAPLAAALQAQRLQPEARVALARDAF
CCCCCCCCCCHHHHHHHHCHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHH
GRVCGETQTLYRQATQPQAPHAGGEPPVEFEHAITHTLGPLGQRTATQAQGLGTLQWLAY
HHHHCHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHCCCCCHHHHHHCCCCEEEEEEE
GSGHVHGLLLDGQPLVDWERDALHREVGRTLHVLEGKDNEDLHAIVHARQLDPMGRMLHQ
CCCCEEEEEECCCCCCCCHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHCCHHHHHHHH
DWRGLRHATPVVPADPANAFGTGPSSAAGRIAPALGPLSTLAQRRYWYDPLGQLVGVQTP
HHHHHHHCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHCCCCCC
GEATRYGYDAWQRLSGLHRAGQGTPEVQAHWALDAAGNRLPASLDARAAGSSASERQGWA
CCHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHH
RQVRENLHDADFDLLRAGDGPGEGAGPVTRWPGNRIGWSTPEPDADGTSIAGSDGNGNGN
HHHHHHHCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCCCCCC
GNGNGNGNGILIRYRYDAFGNRVQALHADGRAQRLRYDALHQLREVWQREAAGGAWQRVA
CCCCCCCCEEEEEEEECCCCCEEEEEECCCHHHHHHHHHHHHHHHHHHHHHCCCHHHEEE
CYRYDPFGRRLAKTVFGHGNRSRNGNAPASGPATTTYAGWDGDRLVHNEGPQGLQHVLYE
EEEECHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEECCCCCCEEECCCCHHHHHHHHCC
PGSFAPLLRLEREQAIPTAMQAMLVMEENRGSIANDGPDGGEFPAAALFAGLPRAQRELL
CCCCCHHHHHHHHHHHHHHHHHHHHEECCCCCCCCCCCCCCCCHHHHHHHCCCHHHHHHH
ERALHDATGPQGDALLARLRAGLPDEAGALLAAGVQSVRRQQQVATQAHSTRIRHFLCDH
HHHHHHCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
LGTPIALVDANGPQAGLVTWAATYHAWGAVREEYDPHGIGQDIRFQGQQLDVETGLHYNR
CCCCEEEEECCCCCCCEEEEHHHHHHHHHHHHCCCCCCCCCCCEECCCEEEHHCCCCCCH
FRYYDPLLGQYVTQDPIGLLGGLNKFTYPGNPISWIDPLGLSGILTIQSSGNGNPLSGHS
HHHHCHHHHHHHCCCCHHHHHCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCC
WITYAPDGGYPTTYGTWGNNPTGQGNGLFENLEIGRNADATRSMRINDEQEKELMAKIED
EEEECCCCCCCCCCCCCCCCCCCCCCCCEECCCCCCCCCCCCEECCCCHHHHHHHHHHHH
YKNRKENAWKLGAPCSSFARDAWQTATGENLNSNLGPISNPSTLKNSIINANGGKVSEFA
HHCCCCCCEECCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHCCCCCCEEHHHC
TKPNGTSSASFGSFGRSSGSVSLNSLGSSL
CCCCCCCCCCCCCCCCCCCCEEHHHCCCCC
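
The prediction strings above use one letter per residue. The sketch below assumes the common convention H = helix, E = strand, C = coil (the record does not define the letters) and summarizes a prediction string as percentages per state. Names are illustrative.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of H, E and C states in a secondary-structure string.

    Assumption: H = helix, E = strand, C = coil; the record itself does
    not define these letters. Whitespace is ignored.
    """
    s = "".join(states.split()).upper()
    if not s:
        raise ValueError("empty prediction string")
    counts = Counter(s)
    return {state: round(100.0 * counts.get(state, 0) / len(s), 1)
            for state in "HEC"}


# Toy string, not taken from the record.
print(ss_composition("CCHHEECC"))  # -> {'H': 25.0, 'E': 25.0, 'C': 50.0}
```

To summarize the record's prediction, concatenate only the state lines (every second line of the block) before calling ss_composition.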

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 1766878; 9278503; 2644231; 2403547; 7934896 [H]