Definition Akkermansia muciniphila ATCC BAA-835, complete genome.
Accession NC_010655
Length 2,664,102

Click here to switch to the map view.

The map label for this gene is wapA [H]

Identifier: 187735481

GI number: 187735481

Start: 1171299

End: 1177088

Strand: Reverse

Name: wapA [H]

Synonym: Amuc_0983

Alternate gene names: 187735481

Gene position: 1177088-1171299 (Counterclockwise)

Preceding gene: 187735482

Following gene: 187735480

Centisome position: 44.18

GC content: 60.0

Gene sequence:

>5790_bases
ATGAATCCGCCTCATCGTTCCTCCTCCGCCCATCGTTCTGCCGTTCCCGCTTACCGGAAATGGTACTTTGGTTCCGGTGC
CAGCCAATCCGGCTCTACTCGCCGCGTGCGCTTATCCGGCCAGCCGGATCCCGATCCGCTGCCGTTTGAAGCGATCCGGA
TCCATGAGGAAAGGAAAGTGGGGCCGCCGAGCACGTCGGCGGACGGGCACAGCGCGCACGGGACATTCACCATCCCGGAA
GGAGCGGAGGGCAAAAAACTCTACGGCACCTGCTCGCTCTTCCTTGGGGTGGACGACTGGGGAATCCTGGAGGTGAAGGA
CTCCGGCGGCAACGTGGTGGCGCAGGTGGATCTGAAAGAAAACCCGCAGACGGCGGGCGAACAGGGCGGGCACAAATACC
ACACGGGGACTGGCGGGGCGCAGCTCCCCTCCGGCACCTACAGCTGGGAAGTCAGCCAGACCAACATCGACTACAATCCG
GCAAGCGGCAACACCTCCATCTGCAACTACAGCATCGACGTGGTGCCGACGGAACCGGGGGGCAGGAAAGAACCCGAACC
GTGTCCCTGCGAGGGAGACACGTGCGACAACAGCGGCGGAACGCCGCCCTCCCCGCCGCAGGCCCGGTCGTGCCCTGAGG
CGGGCATGGAAAGCGGCGCCCTGGGAAACTACAGCTCGGCGGGGTGCAGCGTGACGGCGGAAAGCACGGCCACGCTGATG
TACTGGTCCTGCAACTTCGGAGCGTTCCGTGGACTTGGGGGCCTTCCGGCCGGAAGGGTGGAACTGAGGGCTGAACAAAA
CGTCTCCGGCCTGGAAAGCCCCTCCTCGCTGGCCTACAACCATCCCCTGAACAGCCGTCTGGACGTGCCGGAAGGGGGCA
TTGCGCCGGGAGTGCGGTTCAACCTGGTGCAGGGAGACCGGGTGATTGCCATGCGCTGCTACACGGACGGGTCCGTGCTG
CCCATCGGGGTGGACACGTCGGGCGGAGGGCGTGCGGCGCTGGCCACGGTGGAAGGACAATCCTGCCTGCGCTGGGTGGT
GGAAGACGGCAGCCAATACCTCTTCTCGGCGGAAACGGGAACGCTCCTCTCCTACACCACCACGGACAGGCAGGTCATCT
CCAACGCGTCATCCTATCTGGACGTCAGGCATGCCGGAGACGGCTCGCTGAGGCAAATCTGGAACCTGTGGGACGGCCTG
CTCAACGTGGAAAACGTCACCTCCACGGGCTACACCATCGCGCTCTATACCCCTGGGCAAATCACCGGAACGGACGAACA
GGGATTTTATACCGTTACGGGCGCTCCCTTGAAAACATTTATTCTTTCCCTGGATGCTGAGGAAAAGTTCACCATCACGG
AACAGGCGCCTGCCAGGCAGCCCTACGCCGTCACCTGGTGGAACGACGGCCTGGCGTGGAACATGCGGCAGGGCACGGGG
GAAGACGCCCTCACGACCCTCCGCACGCGCACGGAGCTGGAACCGGAAAACTCGGTCTGGCAGCTGGTCACGGAAATCTC
CAAAAACGGAATCGTGGCGGCGCGCACCTGCGCCATCTACCAGACCACGGACGTGGGCGACCTGCTGCTCACGCTGGCGG
AAGGCTACGGAAGCCCGGAGGAGCAAACCACGCAATACGCCTACGACCAGTGCGGACGGCTCAGAACGGAAACGGCCCCG
GGCGGCAGCCAGACTCATTACGCCTATGACCTCTACGGCCGCCTGCTCAGCCGGGACGAACCGTGGGCGGAAAGCGGCAG
GCGCATCACGCGCTACACCTACGCCTGTTCGGGAGAAGCCGACTTCAGCAACGAACCCGCCACGGAAACGGCAGACCTGC
TTCCGCTGGAAGGACACGTCAAAACGCTGACATCCACCACCTGGAAATACACGACGGCCAACCACATCAAAAGAACGGAA
CGGCGGGTCACCGGACTGGGCGTGACGGGCACGCGCCTGACGGCGGAGGAACAATGGCTGGCCGGAGCCGCCAACATCCA
TGCCCGCGGACGCACGCGGTTCAGCCGGGACCTCGACGGCGTGCAAACGTGGCACGACTACGCGGCCACGACGGAGCACG
GCGCCCTCTACACGGAAACGGTGGAAACGCGCATCAACGGAGAAGCCGTGCCGGGACAAAGCACGCGCGCCGTCACCTGG
ATCACGGCGGAAGGGCAGCGCGTCAGGGAAGAAAACTACCTCCTGCTTTCCACCGGGCAATGGGCGCTCACGGGCAGCGC
CGTCTACGAATTTGACACGCAGAACCGGTGGGTGAAGCGGACGGCGGGCAACGGCCGGCTCACGGAACGCGAACTGATGT
GCGACGGAGGCCTGCTGTGGGAAATCGATGAAAACGGCATCAGGACGGACTACGCCTACGACACGGCGCGCCAACTGGTG
GAAGTCACGCGTTCCGCCGTGATGGACGGGGAAACCGTCATCACGCCGGAAACCATCACCACCTACGTCCGGGATGCGGC
AGGGCGCGTACTCTCCACGCGTCAAGACACGGGGGCGATGACCACGCGGGAAAGCGCCACCTACGACCTTCTTGGCAGAA
CAACCTCCACCACGGACGTCCTGGGCCGGGTCACTACCTACGCCTACAGCCAGGACGGCTTGACGGTCACGCAAACCGTC
CCTTCCGGGGCTACATTCATCACGCGCAGCGCGCCGGACGGAACGGTGATGGAAGAATCCGGCACGGGGCAGCGGCACGT
CATCTACGCCATCGACCTGGTCAGCGACGGTGTGCGGACCTTCACGAAAGCCGTCTCCGGGGAAACGCAAACCGAGCTGC
AGCGCAGCATTGTCAACGGAGCCGGGGAAACCCTGCGCACGGGCGTCCCCAACACCACCGGTGGCGTCATTTACACGAGG
AACACCTACAACGCCAGGGGGCAGCTCACCAAAACGCAGACGGACGCGGGCAATGCGGCCACGACGATGGCCCCGACCCT
GTGGGAATACGACGCCTTCGGCAACAAAACGAAAGAAACCTGGAAACTCGCCGATCCGGCCACGACATCCAACTCGCGCA
TCACCACGTGGAGCTACGGCGTGGAACAGGCCCAGGATGAAGTATACCGCGTTGTTACGGCGACCAGGAACAACAGCCGG
GGAACGACCTATAACGAAACGCAGAAAACGCTGGCTTCCTCCCTCTCGTCCACGCTGGAAAGCAAAGTCATTTCCATCGA
CCCCAGGGGAAACGCTTCCGAACAATGGAGCGAATACGGTCCGGGCGCCGTCCGGACGCAGAAAAGCAGCATCCCCACCT
CCGACATCACGGCCGCCGCTACGGTCATCGACGGTTTTATCATCTCGCAAACGGACCATGCGGGCGTCACGGCCACGCAT
ACCCGCGCCTACACGGAAACCGGCGTCATCTACGCCAGCACGGACGGCCGGGGCAACACGGTCACGACGCACACCGACCT
TACCGGGCGCACGATCTCGGTGACGGACGCGGCGGGCAACACGACTTCTACCGCCTACGGCCCCTGGTTTGACCAGCCTG
CCGTCGTCACCAACGCCCTGGGCAACACGACCTGCTACGGCTACGACCTCCGGGGCCGCAACACGGCGCAATGGGGAACG
AGGGCCCAGCCCCTGCTCTTCGGCTATGACGAGGCGGACAGGATGATAAGCCTCACCACGTTCCGGGAGGACGCGGGCGA
CATCACCGCCGACCCCACGGGACGCACGGACGGGGACGTCACTACGTGGAGCTACGATGACGCCACGGGCCTGCTCATCC
GCAAAACCTGGGCGGACGGCACCCATGAAGACACCGCCTACAATGCCCTGAACTTCAAATCCACGCTCATGGACGCGCGG
GGGGTGGTCACCACCTGGGGCTACAACCTGAAGAAGGGGGTCAACAACTCCGTCTCCTACAGCGACTCCACGCCCGGCAT
CCAGTACGCCTACAACCACCTCAACCAGCTGACCCAGGTCACGGACGCCTCCGGCTCGCGCGTCCTCACGTACACCCCCT
GCAACGAACCGGACACCGACAGCATCACCATCGGAGGGAGCTCTTACCAGCTCCAGGAACACTACGACACTTACGGACGC
TCCTCCGGCTATACCCTGAAACAGGGAACCGACGTCCTCCAGGAAGCCAGCCAGGGCTATGAAACCGACGGAAGGCTGGC
CAGCGCCGGAATCAGGCACGGGGGAACGGAGCAAAGCTTCGCCTACGGCTACCTGGCAGGAAGCAGCCTGCTCTCCAGCC
TTGCGATGCCCGACGGCATCGTCCGGGAACTTGCCTATGAACAGCGCCGCAACCTGGTCACGGCAATCAACTGCCGCCTG
GGGGAAACCGTGCTGGTCTCCCGCAGCCAGGGCTACGATGCCCTGGGACGCCCGGTCACCCGCACCCAGCAGCGTGGAAC
GGAACCCGCCCGCAGCGACAGCTTCAGCTACAACGGCAGAAACGAACTCACCGCCGCTACCCTGGGCGCCGCCCCCTACG
GCTACAGCTACGACAACATCGGCAACCGCAAGACGGCACGGGAACCGGCCGAAGAACTCGCCTACGCGGCCAACGGGCTC
AACCAGTACACCGGCATTGAAGAAAGCGGGGAAGCTCCTTTTGTGCCGACGTACGACGCCTCGGGCAACCAGACCCTCAT
CAAGACGTCAACGGGCATCTGGACGGCCGTGTACAACGCGGCCAACCGCGCGGTGAGCTTCACCAGCCGGGACGGCGCGA
CAGTCGTGGAATGCGGCTACGATTACCAGGGACGCCGCTACATGAAGAAAGTGACCCAAAACGGCACGGTCGCCAGCCAC
GAACGCTATCTATACCGCGGCTATTTACAAATAGCGGCATTGGATATGCTGGACAACCGTAACGTGCTTCGCACGCTGTT
GTGGGATCCTCTGGAACCGGTGGCCACCCGCCCCCTGGCCCTCGCGCAGGGCGCTTCCCTGTACTGCTACGGCATGGACT
TCAACAAGAATGTGTCGGAGGTCTTCGACGCACAGGGAACGATCGCGGCGGCTTACGACTACTCGCCCTATGGGATAGTT
GGCAGCACAGGCAACCTCGTCCAACCCGTACAGTGGTCCGGCGAGATGCACGACGAAGAACCCACCCTGGCCTATTATAA
TTACCGCTTTTACAACCCCAAAGACGGCAGGTGGATCAATAGGGATCCCATCGCTGAACAGGGAGGGTGGAATTTGTACG
GGTTTGTTGATAATGGAGTGGTATTTTCTATTGATTATCTCGGAAAAGAAACTAACGAATTTGGCTACACAATGACTATT
CCTGCGGGTTACATACCCATTCTTCTTATTATTGAGGGAAGTATATCAATAGAAAAAAAGAAAAATTGTGTATGCATTGA
AGCAAAATTACGAACAGATGTTGGTATAGGTGTAGGTATAGGGCTTAAAATTAAACAAAAATGGTGGCCTATTCTTCCTG
ATATTGAATATTCTTTAAAAACTATGATTGCTGGATTAAGTAATGAAAAAATACTTAAAATTGACAATTGTAATGGCAAA
ACTTTAACATCCTATCGGGAAGAATTATTAGGATTTTCTCACAGAATTGATGCAGGCATTACACTATCATCTTATATACA
AGCATCTTATTCTATCGATTTTAAGATAAGTTCTGGATTAACTTTGAACGCAAAACCAATATATTTAACTTTAGATGCAA
CGGAATATATTAAGATTGACGCTACTTTAAAAATCCCATTTATCATAGATCAAAACGAAGAACTTGTTAACAAATCTTTT
AATGAAAATATAAAAATATGGTCTAAATAA

Upstream 100 bases:

>100_bases
ATTCTATCGACACCGACTTCGGATCATACTATTGATAAAACGATCAGGAAAGGGCGGCCACCCCTCGCTTCATTCATTAC
TCCCAACTCTTGAATACCTG

Downstream 100 bases:

>100_bases
TGAACGAAATATAATTAGTTACCTGTATTATGATTTACCCTGAGTCTCCTTTAACTGTATATGAATATTTTGAATTAGTT
TATGGATGTGTTTTTGTAAT

Product: YD repeat protein

Products: NA

Alternate protein names: Cell wall-associated polypeptide CWBP200; CWBP200 [H]

Number of amino acids: Translated: 1929; Mature: 1929

Protein sequence:

>1929_residues
MNPPHRSSSAHRSAVPAYRKWYFGSGASQSGSTRRVRLSGQPDPDPLPFEAIRIHEERKVGPPSTSADGHSAHGTFTIPE
GAEGKKLYGTCSLFLGVDDWGILEVKDSGGNVVAQVDLKENPQTAGEQGGHKYHTGTGGAQLPSGTYSWEVSQTNIDYNP
ASGNTSICNYSIDVVPTEPGGRKEPEPCPCEGDTCDNSGGTPPSPPQARSCPEAGMESGALGNYSSAGCSVTAESTATLM
YWSCNFGAFRGLGGLPAGRVELRAEQNVSGLESPSSLAYNHPLNSRLDVPEGGIAPGVRFNLVQGDRVIAMRCYTDGSVL
PIGVDTSGGGRAALATVEGQSCLRWVVEDGSQYLFSAETGTLLSYTTTDRQVISNASSYLDVRHAGDGSLRQIWNLWDGL
LNVENVTSTGYTIALYTPGQITGTDEQGFYTVTGAPLKTFILSLDAEEKFTITEQAPARQPYAVTWWNDGLAWNMRQGTG
EDALTTLRTRTELEPENSVWQLVTEISKNGIVAARTCAIYQTTDVGDLLLTLAEGYGSPEEQTTQYAYDQCGRLRTETAP
GGSQTHYAYDLYGRLLSRDEPWAESGRRITRYTYACSGEADFSNEPATETADLLPLEGHVKTLTSTTWKYTTANHIKRTE
RRVTGLGVTGTRLTAEEQWLAGAANIHARGRTRFSRDLDGVQTWHDYAATTEHGALYTETVETRINGEAVPGQSTRAVTW
ITAEGQRVREENYLLLSTGQWALTGSAVYEFDTQNRWVKRTAGNGRLTERELMCDGGLLWEIDENGIRTDYAYDTARQLV
EVTRSAVMDGETVITPETITTYVRDAAGRVLSTRQDTGAMTTRESATYDLLGRTTSTTDVLGRVTTYAYSQDGLTVTQTV
PSGATFITRSAPDGTVMEESGTGQRHVIYAIDLVSDGVRTFTKAVSGETQTELQRSIVNGAGETLRTGVPNTTGGVIYTR
NTYNARGQLTKTQTDAGNAATTMAPTLWEYDAFGNKTKETWKLADPATTSNSRITTWSYGVEQAQDEVYRVVTATRNNSR
GTTYNETQKTLASSLSSTLESKVISIDPRGNASEQWSEYGPGAVRTQKSSIPTSDITAAATVIDGFIISQTDHAGVTATH
TRAYTETGVIYASTDGRGNTVTTHTDLTGRTISVTDAAGNTTSTAYGPWFDQPAVVTNALGNTTCYGYDLRGRNTAQWGT
RAQPLLFGYDEADRMISLTTFREDAGDITADPTGRTDGDVTTWSYDDATGLLIRKTWADGTHEDTAYNALNFKSTLMDAR
GVVTTWGYNLKKGVNNSVSYSDSTPGIQYAYNHLNQLTQVTDASGSRVLTYTPCNEPDTDSITIGGSSYQLQEHYDTYGR
SSGYTLKQGTDVLQEASQGYETDGRLASAGIRHGGTEQSFAYGYLAGSSLLSSLAMPDGIVRELAYEQRRNLVTAINCRL
GETVLVSRSQGYDALGRPVTRTQQRGTEPARSDSFSYNGRNELTAATLGAAPYGYSYDNIGNRKTAREPAEELAYAANGL
NQYTGIEESGEAPFVPTYDASGNQTLIKTSTGIWTAVYNAANRAVSFTSRDGATVVECGYDYQGRRYMKKVTQNGTVASH
ERYLYRGYLQIAALDMLDNRNVLRTLLWDPLEPVATRPLALAQGASLYCYGMDFNKNVSEVFDAQGTIAAAYDYSPYGIV
GSTGNLVQPVQWSGEMHDEEPTLAYYNYRFYNPKDGRWINRDPIAEQGGWNLYGFVDNGVVFSIDYLGKETNEFGYTMTI
PAGYIPILLIIEGSISIEKKKNCVCIEAKLRTDVGIGVGIGLKIKQKWWPILPDIEYSLKTMIAGLSNEKILKIDNCNGK
TLTSYREELLGFSHRIDAGITLSSYIQASYSIDFKISSGLTLNAKPIYLTLDATEYIKIDATLKIPFIIDQNEELVNKSF
NENIKIWSK

Sequences:

>Translated_1929_residues
MNPPHRSSSAHRSAVPAYRKWYFGSGASQSGSTRRVRLSGQPDPDPLPFEAIRIHEERKVGPPSTSADGHSAHGTFTIPE
GAEGKKLYGTCSLFLGVDDWGILEVKDSGGNVVAQVDLKENPQTAGEQGGHKYHTGTGGAQLPSGTYSWEVSQTNIDYNP
ASGNTSICNYSIDVVPTEPGGRKEPEPCPCEGDTCDNSGGTPPSPPQARSCPEAGMESGALGNYSSAGCSVTAESTATLM
YWSCNFGAFRGLGGLPAGRVELRAEQNVSGLESPSSLAYNHPLNSRLDVPEGGIAPGVRFNLVQGDRVIAMRCYTDGSVL
PIGVDTSGGGRAALATVEGQSCLRWVVEDGSQYLFSAETGTLLSYTTTDRQVISNASSYLDVRHAGDGSLRQIWNLWDGL
LNVENVTSTGYTIALYTPGQITGTDEQGFYTVTGAPLKTFILSLDAEEKFTITEQAPARQPYAVTWWNDGLAWNMRQGTG
EDALTTLRTRTELEPENSVWQLVTEISKNGIVAARTCAIYQTTDVGDLLLTLAEGYGSPEEQTTQYAYDQCGRLRTETAP
GGSQTHYAYDLYGRLLSRDEPWAESGRRITRYTYACSGEADFSNEPATETADLLPLEGHVKTLTSTTWKYTTANHIKRTE
RRVTGLGVTGTRLTAEEQWLAGAANIHARGRTRFSRDLDGVQTWHDYAATTEHGALYTETVETRINGEAVPGQSTRAVTW
ITAEGQRVREENYLLLSTGQWALTGSAVYEFDTQNRWVKRTAGNGRLTERELMCDGGLLWEIDENGIRTDYAYDTARQLV
EVTRSAVMDGETVITPETITTYVRDAAGRVLSTRQDTGAMTTRESATYDLLGRTTSTTDVLGRVTTYAYSQDGLTVTQTV
PSGATFITRSAPDGTVMEESGTGQRHVIYAIDLVSDGVRTFTKAVSGETQTELQRSIVNGAGETLRTGVPNTTGGVIYTR
NTYNARGQLTKTQTDAGNAATTMAPTLWEYDAFGNKTKETWKLADPATTSNSRITTWSYGVEQAQDEVYRVVTATRNNSR
GTTYNETQKTLASSLSSTLESKVISIDPRGNASEQWSEYGPGAVRTQKSSIPTSDITAAATVIDGFIISQTDHAGVTATH
TRAYTETGVIYASTDGRGNTVTTHTDLTGRTISVTDAAGNTTSTAYGPWFDQPAVVTNALGNTTCYGYDLRGRNTAQWGT
RAQPLLFGYDEADRMISLTTFREDAGDITADPTGRTDGDVTTWSYDDATGLLIRKTWADGTHEDTAYNALNFKSTLMDAR
GVVTTWGYNLKKGVNNSVSYSDSTPGIQYAYNHLNQLTQVTDASGSRVLTYTPCNEPDTDSITIGGSSYQLQEHYDTYGR
SSGYTLKQGTDVLQEASQGYETDGRLASAGIRHGGTEQSFAYGYLAGSSLLSSLAMPDGIVRELAYEQRRNLVTAINCRL
GETVLVSRSQGYDALGRPVTRTQQRGTEPARSDSFSYNGRNELTAATLGAAPYGYSYDNIGNRKTAREPAEELAYAANGL
NQYTGIEESGEAPFVPTYDASGNQTLIKTSTGIWTAVYNAANRAVSFTSRDGATVVECGYDYQGRRYMKKVTQNGTVASH
ERYLYRGYLQIAALDMLDNRNVLRTLLWDPLEPVATRPLALAQGASLYCYGMDFNKNVSEVFDAQGTIAAAYDYSPYGIV
GSTGNLVQPVQWSGEMHDEEPTLAYYNYRFYNPKDGRWINRDPIAEQGGWNLYGFVDNGVVFSIDYLGKETNEFGYTMTI
PAGYIPILLIIEGSISIEKKKNCVCIEAKLRTDVGIGVGIGLKIKQKWWPILPDIEYSLKTMIAGLSNEKILKIDNCNGK
TLTSYREELLGFSHRIDAGITLSSYIQASYSIDFKISSGLTLNAKPIYLTLDATEYIKIDATLKIPFIIDQNEELVNKSF
NENIKIWSK
>Mature_1929_residues
MNPPHRSSSAHRSAVPAYRKWYFGSGASQSGSTRRVRLSGQPDPDPLPFEAIRIHEERKVGPPSTSADGHSAHGTFTIPE
GAEGKKLYGTCSLFLGVDDWGILEVKDSGGNVVAQVDLKENPQTAGEQGGHKYHTGTGGAQLPSGTYSWEVSQTNIDYNP
ASGNTSICNYSIDVVPTEPGGRKEPEPCPCEGDTCDNSGGTPPSPPQARSCPEAGMESGALGNYSSAGCSVTAESTATLM
YWSCNFGAFRGLGGLPAGRVELRAEQNVSGLESPSSLAYNHPLNSRLDVPEGGIAPGVRFNLVQGDRVIAMRCYTDGSVL
PIGVDTSGGGRAALATVEGQSCLRWVVEDGSQYLFSAETGTLLSYTTTDRQVISNASSYLDVRHAGDGSLRQIWNLWDGL
LNVENVTSTGYTIALYTPGQITGTDEQGFYTVTGAPLKTFILSLDAEEKFTITEQAPARQPYAVTWWNDGLAWNMRQGTG
EDALTTLRTRTELEPENSVWQLVTEISKNGIVAARTCAIYQTTDVGDLLLTLAEGYGSPEEQTTQYAYDQCGRLRTETAP
GGSQTHYAYDLYGRLLSRDEPWAESGRRITRYTYACSGEADFSNEPATETADLLPLEGHVKTLTSTTWKYTTANHIKRTE
RRVTGLGVTGTRLTAEEQWLAGAANIHARGRTRFSRDLDGVQTWHDYAATTEHGALYTETVETRINGEAVPGQSTRAVTW
ITAEGQRVREENYLLLSTGQWALTGSAVYEFDTQNRWVKRTAGNGRLTERELMCDGGLLWEIDENGIRTDYAYDTARQLV
EVTRSAVMDGETVITPETITTYVRDAAGRVLSTRQDTGAMTTRESATYDLLGRTTSTTDVLGRVTTYAYSQDGLTVTQTV
PSGATFITRSAPDGTVMEESGTGQRHVIYAIDLVSDGVRTFTKAVSGETQTELQRSIVNGAGETLRTGVPNTTGGVIYTR
NTYNARGQLTKTQTDAGNAATTMAPTLWEYDAFGNKTKETWKLADPATTSNSRITTWSYGVEQAQDEVYRVVTATRNNSR
GTTYNETQKTLASSLSSTLESKVISIDPRGNASEQWSEYGPGAVRTQKSSIPTSDITAAATVIDGFIISQTDHAGVTATH
TRAYTETGVIYASTDGRGNTVTTHTDLTGRTISVTDAAGNTTSTAYGPWFDQPAVVTNALGNTTCYGYDLRGRNTAQWGT
RAQPLLFGYDEADRMISLTTFREDAGDITADPTGRTDGDVTTWSYDDATGLLIRKTWADGTHEDTAYNALNFKSTLMDAR
GVVTTWGYNLKKGVNNSVSYSDSTPGIQYAYNHLNQLTQVTDASGSRVLTYTPCNEPDTDSITIGGSSYQLQEHYDTYGR
SSGYTLKQGTDVLQEASQGYETDGRLASAGIRHGGTEQSFAYGYLAGSSLLSSLAMPDGIVRELAYEQRRNLVTAINCRL
GETVLVSRSQGYDALGRPVTRTQQRGTEPARSDSFSYNGRNELTAATLGAAPYGYSYDNIGNRKTAREPAEELAYAANGL
NQYTGIEESGEAPFVPTYDASGNQTLIKTSTGIWTAVYNAANRAVSFTSRDGATVVECGYDYQGRRYMKKVTQNGTVASH
ERYLYRGYLQIAALDMLDNRNVLRTLLWDPLEPVATRPLALAQGASLYCYGMDFNKNVSEVFDAQGTIAAAYDYSPYGIV
GSTGNLVQPVQWSGEMHDEEPTLAYYNYRFYNPKDGRWINRDPIAEQGGWNLYGFVDNGVVFSIDYLGKETNEFGYTMTI
PAGYIPILLIIEGSISIEKKKNCVCIEAKLRTDVGIGVGIGLKIKQKWWPILPDIEYSLKTMIAGLSNEKILKIDNCNGK
TLTSYREELLGFSHRIDAGITLSSYIQASYSIDFKISSGLTLNAKPIYLTLDATEYIKIDATLKIPFIIDQNEELVNKSF
NENIKIWSK

Specific function: Still unknown. Not involved in cell membrane metabolism, motility, secretion or differentiation [H]

COG id: NA

COG function: NA

Gene ontology:

Cell location: Secreted, cell wall. Note=Released into the medium [H]

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR008979
- InterPro:   IPR022385
- InterPro:   IPR006530 [H]

Pfam domain/function: PF05593 RHS_repeat [H]

EC number: NA

Molecular weight: Translated: 210173; Mature: 210173

Theoretical pI: Translated: 4.83; Mature: 4.83

Prosite motif: PS00178 AA_TRNA_LIGASE_I

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.1 %Cys     (Translated Protein)
1.0 %Met     (Translated Protein)
2.1 %Cys+Met (Translated Protein)
1.1 %Cys     (Mature Protein)
1.0 %Met     (Mature Protein)
2.1 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MNPPHRSSSAHRSAVPAYRKWYFGSGASQSGSTRRVRLSGQPDPDPLPFEAIRIHEERKV
CCCCCCCCCCHHHCCCHHHHEEECCCCCCCCCEEEEEECCCCCCCCCCHHEEEEEHHCCC
GPPSTSADGHSAHGTFTIPEGAEGKKLYGTCSLFLGVDDWGILEVKDSGGNVVAQVDLKE
CCCCCCCCCCCCCCEEECCCCCCCCEEEEEEEEEEECCCCEEEEEECCCCCEEEEEECCC
NPQTAGEQGGHKYHTGTGGAQLPSGTYSWEVSQTNIDYNPASGNTSICNYSIDVVPTEPG
CCCCCHHCCCCEEECCCCCCCCCCCCEEEEEECCCCCCCCCCCCCEEEEEEEEEEECCCC
GRKEPEPCPCEGDTCDNSGGTPPSPPQARSCPEAGMESGALGNYSSAGCSVTAESTATLM
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCEEEECCCEEEE
YWSCNFGAFRGLGGLPAGRVELRAEQNVSGLESPSSLAYNHPLNSRLDVPEGGIAPGVRF
EEECCCCHHCCCCCCCCCEEEEEECCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCEE
NLVQGDRVIAMRCYTDGSVLPIGVDTSGGGRAALATVEGQSCLRWVVEDGSQYLFSAETG
EEEECCEEEEEEEECCCCEEEEEECCCCCCEEEEEEECCHHHHHHHHHCCCEEEEEECCC
TLLSYTTTDRQVISNASSYLDVRHAGDGSLRQIWNLWDGLLNVENVTSTGYTIALYTPGQ
CEEEEECCHHHHHHCCHHEEEEEECCCCHHHHHHHHHHHHCCCCEECCCCEEEEEECCCE
ITGTDEQGFYTVTGAPLKTFILSLDAEEKFTITEQAPARQPYAVTWWNDGLAWNMRQGTG
EECCCCCCEEEEECCCCEEEEEEECCCCCEEEECCCCCCCCEEEEEECCCEEEEECCCCC
EDALTTLRTRTELEPENSVWQLVTEISKNGIVAARTCAIYQTTDVGDLLLTLAEGYGSPE
HHHHHHHHHHCCCCCCHHHHHHHHHHHCCCEEEEEEEEEEEECCHHHHHHHHHHCCCCCH
EQTTQYAYDQCGRLRTETAPGGSQTHYAYDLYGRLLSRDEPWAESGRRITRYTYACSGEA
HHHHHHHHHHHCCEEECCCCCCCCCEEEHHHHHHHHCCCCCHHHCCCEEEEEEEEECCCC
DFSNEPATETADLLPLEGHVKTLTSTTWKYTTANHIKRTERRVTGLGVTGTRLTAEEQWL
CCCCCCCCCCHHEEECCCCCEEEECCEEEEECHHHHHHHHHHHEECCCCCCEEECCHHHH
AGAANIHARGRTRFSRDLDGVQTWHDYAATTEHGALYTETVETRINGEAVPGQSTRAVTW
CCCCEEEECCCCHHHHCCCHHHHHHHHHCCCCCCEEEEEEEHEEECCCCCCCCCCEEEEE
ITAEGQRVREENYLLLSTGQWALTGSAVYEFDTQNRWVKRTAGNGRLTERELMCDGGLLW
EECCCCEECCCCEEEEECCCEEEECCEEEEECCCCCEEEECCCCCCCCHHEEEECCCEEE
EIDENGIRTDYAYDTARQLVEVTRSAVMDGETVITPETITTYVRDAAGRVLSTRQDTGAM
EECCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEECCHHHHHHHHHHHHHHEECCCCCCCE
TTRESATYDLLGRTTSTTDVLGRVTTYAYSQDGLTVTQTVPSGATFITRSAPDGTVMEES
EECCCCCEEECCCCCCHHHHHHHHEEEEECCCCEEEEEECCCCCEEEEECCCCCCEEECC
GTGQRHVIYAIDLVSDGVRTFTKAVSGETQTELQRSIVNGAGETLRTGVPNTTGGVIYTR
CCCCEEEEEEEEEHHCHHHHHHHHHCCCHHHHHHHHHHHCCCCHHHCCCCCCCCCEEEEC
NTYNARGQLTKTQTDAGNAATTMAPTLWEYDAFGNKTKETWKLADPATTSNSRITTWSYG
CCCCCCCEEEEECCCCCCCCCCCCCCCEEHHCCCCCCCCEEEECCCCCCCCCEEEEEECC
VEQAQDEVYRVVTATRNNSRGTTYNETQKTLASSLSSTLESKVISIDPRGNASEQWSEYG
HHHHHHHEEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCEEEECCCCCCHHHHHHCC
PGAVRTQKSSIPTSDITAAATVIDGFIISQTDHAGVTATHTRAYTETGVIYASTDGRGNT
CCCEEECCCCCCCHHHHHHHHHHCCEEEECCCCCCCEEECCEEEEECCEEEEECCCCCCE
VTTHTDLTGRTISVTDAAGNTTSTAYGPWFDQPAVVTNALGNTTCYGYDLRGRNTAQWGT
EEEECCCCCCEEEEEECCCCCCCCCCCCCCCCCHHHHHCCCCCEEEEEECCCCCCCCCCC
RAQPLLFGYDEADRMISLTTFREDAGDITADPTGRTDGDVTTWSYDDATGLLIRKTWADG
CCCCEEEECCCCCCEEEEEEECCCCCCCCCCCCCCCCCCEEEEECCCCCCEEEEEECCCC
THEDTAYNALNFKSTLMDARGVVTTWGYNLKKGVNNSVSYSDSTPGIQYAYNHLNQLTQV
CCCCCCEEEECHHHHHHHCCCEEEECCCHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHH
TDASGSRVLTYTPCNEPDTDSITIGGSSYQLQEHYDTYGRSSGYTLKQGTDVLQEASQGY
HCCCCCEEEEECCCCCCCCCEEEECCCCEEHHHHHHHCCCCCCEEEHHHHHHHHHHHCCC
ETDGRLASAGIRHGGTEQSFAYGYLAGSSLLSSLAMPDGIVRELAYEQRRNLVTAINCRL
CCCCCEEHHHCCCCCCCCCEEEEEEHHHHHHHHHCCCHHHHHHHHHHHHCCEEEEEECCC
GETVLVSRSQGYDALGRPVTRTQQRGTEPARSDSFSYNGRNELTAATLGAAPYGYSYDNI
CCEEEEECCCCCHHHCCCHHHHHHCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCC
GNRKTAREPAEELAYAANGLNQYTGIEESGEAPFVPTYDASGNQTLIKTSTGIWTAVYNA
CCCCHHCCHHHHHHHHHCCCHHHCCCCCCCCCCEEEEECCCCCEEEEEECCCEEEHHHHH
ANRAVSFTSRDGATVVECGYDYQGRRYMKKVTQNGTVASHERYLYRGYLQIAALDMLDNR
CCCEEEEECCCCCEEEEECCCCCCHHHHHHHHCCCCEECCCHHHHHHEEEEEEEEHHCCC
NVLRTLLWDPLEPVATRPLALAQGASLYCYGMDFNKNVSEVFDAQGTIAAAYDYSPYGIV
HHHHHHHCCCCCHHHCCCEEEECCCEEEEEECCCCCCHHHHHCCCCCEEEEEECCCCEEE
GSTGNLVQPVQWSGEMHDEEPTLAYYNYRFYNPKDGRWINRDPIAEQGGWNLYGFVDNGV
CCCCCCCCCEECCCCCCCCCCCEEEEEEEEECCCCCCEECCCCCCCCCCCEEEEEECCCE
VFSIDYLGKETNEFGYTMTIPAGYIPILLIIEGSISIEKKKNCVCIEAKLRTDVGIGVGI
EEEEEECCCCCCCCCEEEEECCCCEEEEEEEECCEEEECCCCEEEEEEEECCCCCEEEEC
GLKIKQKWWPILPDIEYSLKTMIAGLSNEKILKIDNCNGKTLTSYREELLGFSHRIDAGI
CEEECCCCCCCCCCCHHHHHHHHHCCCCCCEEEEECCCCCCHHHHHHHHCCCHHHCCCCC
TLSSYIQASYSIDFKISSGLTLNAKPIYLTLDATEYIKIDATLKIPFIIDQNEELVNKSF
CHHHHEEEEEEEEEEECCCEEECCCEEEEEECCCEEEEEEEEEEEEEEECCCHHHHHCCC
NENIKIWSK
CCCEEEEEC
>Mature Secondary Structure
MNPPHRSSSAHRSAVPAYRKWYFGSGASQSGSTRRVRLSGQPDPDPLPFEAIRIHEERKV
CCCCCCCCCCHHHCCCHHHHEEECCCCCCCCCEEEEEECCCCCCCCCCHHEEEEEHHCCC
GPPSTSADGHSAHGTFTIPEGAEGKKLYGTCSLFLGVDDWGILEVKDSGGNVVAQVDLKE
CCCCCCCCCCCCCCEEECCCCCCCCEEEEEEEEEEECCCCEEEEEECCCCCEEEEEECCC
NPQTAGEQGGHKYHTGTGGAQLPSGTYSWEVSQTNIDYNPASGNTSICNYSIDVVPTEPG
CCCCCHHCCCCEEECCCCCCCCCCCCEEEEEECCCCCCCCCCCCCEEEEEEEEEEECCCC
GRKEPEPCPCEGDTCDNSGGTPPSPPQARSCPEAGMESGALGNYSSAGCSVTAESTATLM
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCEEEECCCEEEE
YWSCNFGAFRGLGGLPAGRVELRAEQNVSGLESPSSLAYNHPLNSRLDVPEGGIAPGVRF
EEECCCCHHCCCCCCCCCEEEEEECCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCEE
NLVQGDRVIAMRCYTDGSVLPIGVDTSGGGRAALATVEGQSCLRWVVEDGSQYLFSAETG
EEEECCEEEEEEEECCCCEEEEEECCCCCCEEEEEEECCHHHHHHHHHCCCEEEEEECCC
TLLSYTTTDRQVISNASSYLDVRHAGDGSLRQIWNLWDGLLNVENVTSTGYTIALYTPGQ
CEEEEECCHHHHHHCCHHEEEEEECCCCHHHHHHHHHHHHCCCCEECCCCEEEEEECCCE
ITGTDEQGFYTVTGAPLKTFILSLDAEEKFTITEQAPARQPYAVTWWNDGLAWNMRQGTG
EECCCCCCEEEEECCCCEEEEEEECCCCCEEEECCCCCCCCEEEEEECCCEEEEECCCCC
EDALTTLRTRTELEPENSVWQLVTEISKNGIVAARTCAIYQTTDVGDLLLTLAEGYGSPE
HHHHHHHHHHCCCCCCHHHHHHHHHHHCCCEEEEEEEEEEEECCHHHHHHHHHHCCCCCH
EQTTQYAYDQCGRLRTETAPGGSQTHYAYDLYGRLLSRDEPWAESGRRITRYTYACSGEA
HHHHHHHHHHHCCEEECCCCCCCCCEEEHHHHHHHHCCCCCHHHCCCEEEEEEEEECCCC
DFSNEPATETADLLPLEGHVKTLTSTTWKYTTANHIKRTERRVTGLGVTGTRLTAEEQWL
CCCCCCCCCCHHEEECCCCCEEEECCEEEEECHHHHHHHHHHHEECCCCCCEEECCHHHH
AGAANIHARGRTRFSRDLDGVQTWHDYAATTEHGALYTETVETRINGEAVPGQSTRAVTW
CCCCEEEECCCCHHHHCCCHHHHHHHHHCCCCCCEEEEEEEHEEECCCCCCCCCCEEEEE
ITAEGQRVREENYLLLSTGQWALTGSAVYEFDTQNRWVKRTAGNGRLTERELMCDGGLLW
EECCCCEECCCCEEEEECCCEEEECCEEEEECCCCCEEEECCCCCCCCHHEEEECCCEEE
EIDENGIRTDYAYDTARQLVEVTRSAVMDGETVITPETITTYVRDAAGRVLSTRQDTGAM
EECCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEECCHHHHHHHHHHHHHHEECCCCCCCE
TTRESATYDLLGRTTSTTDVLGRVTTYAYSQDGLTVTQTVPSGATFITRSAPDGTVMEES
EECCCCCEEECCCCCCHHHHHHHHEEEEECCCCEEEEEECCCCCEEEEECCCCCCEEECC
GTGQRHVIYAIDLVSDGVRTFTKAVSGETQTELQRSIVNGAGETLRTGVPNTTGGVIYTR
CCCCEEEEEEEEEHHCHHHHHHHHHCCCHHHHHHHHHHHCCCCHHHCCCCCCCCCEEEEC
NTYNARGQLTKTQTDAGNAATTMAPTLWEYDAFGNKTKETWKLADPATTSNSRITTWSYG
CCCCCCCEEEEECCCCCCCCCCCCCCCEEHHCCCCCCCCEEEECCCCCCCCCEEEEEECC
VEQAQDEVYRVVTATRNNSRGTTYNETQKTLASSLSSTLESKVISIDPRGNASEQWSEYG
HHHHHHHEEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCEEEECCCCCCHHHHHHCC
PGAVRTQKSSIPTSDITAAATVIDGFIISQTDHAGVTATHTRAYTETGVIYASTDGRGNT
CCCEEECCCCCCCHHHHHHHHHHCCEEEECCCCCCCEEECCEEEEECCEEEEECCCCCCE
VTTHTDLTGRTISVTDAAGNTTSTAYGPWFDQPAVVTNALGNTTCYGYDLRGRNTAQWGT
EEEECCCCCCEEEEEECCCCCCCCCCCCCCCCCHHHHHCCCCCEEEEEECCCCCCCCCCC
RAQPLLFGYDEADRMISLTTFREDAGDITADPTGRTDGDVTTWSYDDATGLLIRKTWADG
CCCCEEEECCCCCCEEEEEEECCCCCCCCCCCCCCCCCCEEEEECCCCCCEEEEEECCCC
THEDTAYNALNFKSTLMDARGVVTTWGYNLKKGVNNSVSYSDSTPGIQYAYNHLNQLTQV
CCCCCCEEEECHHHHHHHCCCEEEECCCHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHH
TDASGSRVLTYTPCNEPDTDSITIGGSSYQLQEHYDTYGRSSGYTLKQGTDVLQEASQGY
HCCCCCEEEEECCCCCCCCCEEEECCCCEEHHHHHHHCCCCCCEEEHHHHHHHHHHHCCC
ETDGRLASAGIRHGGTEQSFAYGYLAGSSLLSSLAMPDGIVRELAYEQRRNLVTAINCRL
CCCCCEEHHHCCCCCCCCCEEEEEEHHHHHHHHHCCCHHHHHHHHHHHHCCEEEEEECCC
GETVLVSRSQGYDALGRPVTRTQQRGTEPARSDSFSYNGRNELTAATLGAAPYGYSYDNI
CCEEEEECCCCCHHHCCCHHHHHHCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCC
GNRKTAREPAEELAYAANGLNQYTGIEESGEAPFVPTYDASGNQTLIKTSTGIWTAVYNA
CCCCHHCCHHHHHHHHHCCCHHHCCCCCCCCCCEEEEECCCCCEEEEEECCCEEEHHHHH
ANRAVSFTSRDGATVVECGYDYQGRRYMKKVTQNGTVASHERYLYRGYLQIAALDMLDNR
CCCEEEEECCCCCEEEEECCCCCCHHHHHHHHCCCCEECCCHHHHHHEEEEEEEEHHCCC
NVLRTLLWDPLEPVATRPLALAQGASLYCYGMDFNKNVSEVFDAQGTIAAAYDYSPYGIV
HHHHHHHCCCCCHHHCCCEEEECCCEEEEEECCCCCCHHHHHCCCCCEEEEEECCCCEEE
GSTGNLVQPVQWSGEMHDEEPTLAYYNYRFYNPKDGRWINRDPIAEQGGWNLYGFVDNGV
CCCCCCCCCEECCCCCCCCCCCEEEEEEEEECCCCCCEECCCCCCCCCCCEEEEEECCCE
VFSIDYLGKETNEFGYTMTIPAGYIPILLIIEGSISIEKKKNCVCIEAKLRTDVGIGVGI
EEEEEECCCCCCCCCEEEEECCCCEEEEEEEECCEEEECCCCEEEEEEEECCCCCEEEEC
GLKIKQKWWPILPDIEYSLKTMIAGLSNEKILKIDNCNGKTLTSYREELLGFSHRIDAGI
CEEECCCCCCCCCCCHHHHHHHHHCCCCCCEEEEECCCCCCHHHHHHHHCCCHHHCCCCC
TLSSYIQASYSIDFKISSGLTLNAKPIYLTLDATEYIKIDATLKIPFIIDQNEELVNKSF
CHHHHEEEEEEEEEEECCCEEECCCEEEEEECCCEEEEEEEEEEEEEEECCCHHHHHCCC
NENIKIWSK
CCCEEEEEC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8316082; 7704263; 8969509; 9384377; 10658653 [H]