Definition | Akkermansia muciniphila ATCC BAA-835, complete genome. |
---|---|
Accession | NC_010655 |
Length | 2,664,102 |
The map label for this gene is wapA [H]
Identifier: 187735481
GI number: 187735481
Start: 1171299
End: 1177088
Strand: Reverse
Name: wapA [H]
Synonym: Amuc_0983
Alternate gene names: 187735481
Gene position: 1177088-1171299 (Counterclockwise)
Preceding gene: 187735482
Following gene: 187735480
Centisome position: 44.18
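The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates, and assumes that the centisome position is the gene's own start (the higher coordinate for a reverse-strand gene) expressed as a percentage of the genome length. That convention is inferred from the numbers in this record rather than stated by it, and the function names are illustrative only.

```python
# Values copied from the Length, Start, End and Strand fields of this record.
GENOME_LENGTH = 2_664_102
START, END = 1_171_299, 1_177_088
STRAND = "Reverse"


def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, end: int, strand: str, genome_length: int) -> float:
    """Gene start as a percentage of genome length (assumed convention)."""
    # Assumption: a reverse-strand gene begins at its higher coordinate.
    origin = end if strand == "Reverse" else start
    return round(100 * origin / genome_length, 2)


length = gene_length(START, END)
print(length)                                            # 5790
print(length // 3 - 1)                                   # 1929 codons before the stop
print(centisome(START, END, STRAND, GENOME_LENGTH))      # 44.18
```

These values agree with the 5790-base gene sequence, the 1929-residue protein, and the centisome position listed in this entry.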
GC content: 60.0
Gene sequence:
>5790_bases ATGAATCCGCCTCATCGTTCCTCCTCCGCCCATCGTTCTGCCGTTCCCGCTTACCGGAAATGGTACTTTGGTTCCGGTGC CAGCCAATCCGGCTCTACTCGCCGCGTGCGCTTATCCGGCCAGCCGGATCCCGATCCGCTGCCGTTTGAAGCGATCCGGA TCCATGAGGAAAGGAAAGTGGGGCCGCCGAGCACGTCGGCGGACGGGCACAGCGCGCACGGGACATTCACCATCCCGGAA GGAGCGGAGGGCAAAAAACTCTACGGCACCTGCTCGCTCTTCCTTGGGGTGGACGACTGGGGAATCCTGGAGGTGAAGGA CTCCGGCGGCAACGTGGTGGCGCAGGTGGATCTGAAAGAAAACCCGCAGACGGCGGGCGAACAGGGCGGGCACAAATACC ACACGGGGACTGGCGGGGCGCAGCTCCCCTCCGGCACCTACAGCTGGGAAGTCAGCCAGACCAACATCGACTACAATCCG GCAAGCGGCAACACCTCCATCTGCAACTACAGCATCGACGTGGTGCCGACGGAACCGGGGGGCAGGAAAGAACCCGAACC GTGTCCCTGCGAGGGAGACACGTGCGACAACAGCGGCGGAACGCCGCCCTCCCCGCCGCAGGCCCGGTCGTGCCCTGAGG CGGGCATGGAAAGCGGCGCCCTGGGAAACTACAGCTCGGCGGGGTGCAGCGTGACGGCGGAAAGCACGGCCACGCTGATG TACTGGTCCTGCAACTTCGGAGCGTTCCGTGGACTTGGGGGCCTTCCGGCCGGAAGGGTGGAACTGAGGGCTGAACAAAA CGTCTCCGGCCTGGAAAGCCCCTCCTCGCTGGCCTACAACCATCCCCTGAACAGCCGTCTGGACGTGCCGGAAGGGGGCA TTGCGCCGGGAGTGCGGTTCAACCTGGTGCAGGGAGACCGGGTGATTGCCATGCGCTGCTACACGGACGGGTCCGTGCTG CCCATCGGGGTGGACACGTCGGGCGGAGGGCGTGCGGCGCTGGCCACGGTGGAAGGACAATCCTGCCTGCGCTGGGTGGT GGAAGACGGCAGCCAATACCTCTTCTCGGCGGAAACGGGAACGCTCCTCTCCTACACCACCACGGACAGGCAGGTCATCT CCAACGCGTCATCCTATCTGGACGTCAGGCATGCCGGAGACGGCTCGCTGAGGCAAATCTGGAACCTGTGGGACGGCCTG CTCAACGTGGAAAACGTCACCTCCACGGGCTACACCATCGCGCTCTATACCCCTGGGCAAATCACCGGAACGGACGAACA GGGATTTTATACCGTTACGGGCGCTCCCTTGAAAACATTTATTCTTTCCCTGGATGCTGAGGAAAAGTTCACCATCACGG AACAGGCGCCTGCCAGGCAGCCCTACGCCGTCACCTGGTGGAACGACGGCCTGGCGTGGAACATGCGGCAGGGCACGGGG GAAGACGCCCTCACGACCCTCCGCACGCGCACGGAGCTGGAACCGGAAAACTCGGTCTGGCAGCTGGTCACGGAAATCTC CAAAAACGGAATCGTGGCGGCGCGCACCTGCGCCATCTACCAGACCACGGACGTGGGCGACCTGCTGCTCACGCTGGCGG AAGGCTACGGAAGCCCGGAGGAGCAAACCACGCAATACGCCTACGACCAGTGCGGACGGCTCAGAACGGAAACGGCCCCG GGCGGCAGCCAGACTCATTACGCCTATGACCTCTACGGCCGCCTGCTCAGCCGGGACGAACCGTGGGCGGAAAGCGGCAG GCGCATCACGCGCTACACCTACGCCTGTTCGGGAGAAGCCGACTTCAGCAACGAACCCGCCACGGAAACGGCAGACCTGC TTCCGCTGGAAGGACACGTCAAAACGCTGACATCCACCACCTGGAAATACACGACGGCCAACCACATCAAAAGAACGGAA 
CGGCGGGTCACCGGACTGGGCGTGACGGGCACGCGCCTGACGGCGGAGGAACAATGGCTGGCCGGAGCCGCCAACATCCA TGCCCGCGGACGCACGCGGTTCAGCCGGGACCTCGACGGCGTGCAAACGTGGCACGACTACGCGGCCACGACGGAGCACG GCGCCCTCTACACGGAAACGGTGGAAACGCGCATCAACGGAGAAGCCGTGCCGGGACAAAGCACGCGCGCCGTCACCTGG ATCACGGCGGAAGGGCAGCGCGTCAGGGAAGAAAACTACCTCCTGCTTTCCACCGGGCAATGGGCGCTCACGGGCAGCGC CGTCTACGAATTTGACACGCAGAACCGGTGGGTGAAGCGGACGGCGGGCAACGGCCGGCTCACGGAACGCGAACTGATGT GCGACGGAGGCCTGCTGTGGGAAATCGATGAAAACGGCATCAGGACGGACTACGCCTACGACACGGCGCGCCAACTGGTG GAAGTCACGCGTTCCGCCGTGATGGACGGGGAAACCGTCATCACGCCGGAAACCATCACCACCTACGTCCGGGATGCGGC AGGGCGCGTACTCTCCACGCGTCAAGACACGGGGGCGATGACCACGCGGGAAAGCGCCACCTACGACCTTCTTGGCAGAA CAACCTCCACCACGGACGTCCTGGGCCGGGTCACTACCTACGCCTACAGCCAGGACGGCTTGACGGTCACGCAAACCGTC CCTTCCGGGGCTACATTCATCACGCGCAGCGCGCCGGACGGAACGGTGATGGAAGAATCCGGCACGGGGCAGCGGCACGT CATCTACGCCATCGACCTGGTCAGCGACGGTGTGCGGACCTTCACGAAAGCCGTCTCCGGGGAAACGCAAACCGAGCTGC AGCGCAGCATTGTCAACGGAGCCGGGGAAACCCTGCGCACGGGCGTCCCCAACACCACCGGTGGCGTCATTTACACGAGG AACACCTACAACGCCAGGGGGCAGCTCACCAAAACGCAGACGGACGCGGGCAATGCGGCCACGACGATGGCCCCGACCCT GTGGGAATACGACGCCTTCGGCAACAAAACGAAAGAAACCTGGAAACTCGCCGATCCGGCCACGACATCCAACTCGCGCA TCACCACGTGGAGCTACGGCGTGGAACAGGCCCAGGATGAAGTATACCGCGTTGTTACGGCGACCAGGAACAACAGCCGG GGAACGACCTATAACGAAACGCAGAAAACGCTGGCTTCCTCCCTCTCGTCCACGCTGGAAAGCAAAGTCATTTCCATCGA CCCCAGGGGAAACGCTTCCGAACAATGGAGCGAATACGGTCCGGGCGCCGTCCGGACGCAGAAAAGCAGCATCCCCACCT CCGACATCACGGCCGCCGCTACGGTCATCGACGGTTTTATCATCTCGCAAACGGACCATGCGGGCGTCACGGCCACGCAT ACCCGCGCCTACACGGAAACCGGCGTCATCTACGCCAGCACGGACGGCCGGGGCAACACGGTCACGACGCACACCGACCT TACCGGGCGCACGATCTCGGTGACGGACGCGGCGGGCAACACGACTTCTACCGCCTACGGCCCCTGGTTTGACCAGCCTG CCGTCGTCACCAACGCCCTGGGCAACACGACCTGCTACGGCTACGACCTCCGGGGCCGCAACACGGCGCAATGGGGAACG AGGGCCCAGCCCCTGCTCTTCGGCTATGACGAGGCGGACAGGATGATAAGCCTCACCACGTTCCGGGAGGACGCGGGCGA CATCACCGCCGACCCCACGGGACGCACGGACGGGGACGTCACTACGTGGAGCTACGATGACGCCACGGGCCTGCTCATCC GCAAAACCTGGGCGGACGGCACCCATGAAGACACCGCCTACAATGCCCTGAACTTCAAATCCACGCTCATGGACGCGCGG 
GGGGTGGTCACCACCTGGGGCTACAACCTGAAGAAGGGGGTCAACAACTCCGTCTCCTACAGCGACTCCACGCCCGGCAT CCAGTACGCCTACAACCACCTCAACCAGCTGACCCAGGTCACGGACGCCTCCGGCTCGCGCGTCCTCACGTACACCCCCT GCAACGAACCGGACACCGACAGCATCACCATCGGAGGGAGCTCTTACCAGCTCCAGGAACACTACGACACTTACGGACGC TCCTCCGGCTATACCCTGAAACAGGGAACCGACGTCCTCCAGGAAGCCAGCCAGGGCTATGAAACCGACGGAAGGCTGGC CAGCGCCGGAATCAGGCACGGGGGAACGGAGCAAAGCTTCGCCTACGGCTACCTGGCAGGAAGCAGCCTGCTCTCCAGCC TTGCGATGCCCGACGGCATCGTCCGGGAACTTGCCTATGAACAGCGCCGCAACCTGGTCACGGCAATCAACTGCCGCCTG GGGGAAACCGTGCTGGTCTCCCGCAGCCAGGGCTACGATGCCCTGGGACGCCCGGTCACCCGCACCCAGCAGCGTGGAAC GGAACCCGCCCGCAGCGACAGCTTCAGCTACAACGGCAGAAACGAACTCACCGCCGCTACCCTGGGCGCCGCCCCCTACG GCTACAGCTACGACAACATCGGCAACCGCAAGACGGCACGGGAACCGGCCGAAGAACTCGCCTACGCGGCCAACGGGCTC AACCAGTACACCGGCATTGAAGAAAGCGGGGAAGCTCCTTTTGTGCCGACGTACGACGCCTCGGGCAACCAGACCCTCAT CAAGACGTCAACGGGCATCTGGACGGCCGTGTACAACGCGGCCAACCGCGCGGTGAGCTTCACCAGCCGGGACGGCGCGA CAGTCGTGGAATGCGGCTACGATTACCAGGGACGCCGCTACATGAAGAAAGTGACCCAAAACGGCACGGTCGCCAGCCAC GAACGCTATCTATACCGCGGCTATTTACAAATAGCGGCATTGGATATGCTGGACAACCGTAACGTGCTTCGCACGCTGTT GTGGGATCCTCTGGAACCGGTGGCCACCCGCCCCCTGGCCCTCGCGCAGGGCGCTTCCCTGTACTGCTACGGCATGGACT TCAACAAGAATGTGTCGGAGGTCTTCGACGCACAGGGAACGATCGCGGCGGCTTACGACTACTCGCCCTATGGGATAGTT GGCAGCACAGGCAACCTCGTCCAACCCGTACAGTGGTCCGGCGAGATGCACGACGAAGAACCCACCCTGGCCTATTATAA TTACCGCTTTTACAACCCCAAAGACGGCAGGTGGATCAATAGGGATCCCATCGCTGAACAGGGAGGGTGGAATTTGTACG GGTTTGTTGATAATGGAGTGGTATTTTCTATTGATTATCTCGGAAAAGAAACTAACGAATTTGGCTACACAATGACTATT CCTGCGGGTTACATACCCATTCTTCTTATTATTGAGGGAAGTATATCAATAGAAAAAAAGAAAAATTGTGTATGCATTGA AGCAAAATTACGAACAGATGTTGGTATAGGTGTAGGTATAGGGCTTAAAATTAAACAAAAATGGTGGCCTATTCTTCCTG ATATTGAATATTCTTTAAAAACTATGATTGCTGGATTAAGTAATGAAAAAATACTTAAAATTGACAATTGTAATGGCAAA ACTTTAACATCCTATCGGGAAGAATTATTAGGATTTTCTCACAGAATTGATGCAGGCATTACACTATCATCTTATATACA AGCATCTTATTCTATCGATTTTAAGATAAGTTCTGGATTAACTTTGAACGCAAAACCAATATATTTAACTTTAGATGCAA CGGAATATATTAAGATTGACGCTACTTTAAAAATCCCATTTATCATAGATCAAAACGAAGAACTTGTTAACAAATCTTTT AATGAAAATATAAAAATATGGTCTAAATAA
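The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is the percentage of G and C among the A, C, G and T characters, with the spaces from the line-wrapped listing ignored. The function name `gc_content` is hypothetical, and the short input is just the first four codons of the gene, not the full sequence.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]  # drops spaces and newlines
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc = sum(b in "GC" for b in bases)
    return round(100 * gc / len(bases), 1)


# First four codons of the gene sequence, with a space as in the wrapped listing.
print(gc_content("ATGAAT CCGCCT"))  # 50.0
```

Passing the full 5790-base sequence to the same function would give the value to compare against the reported 60.0.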
Upstream 100 bases:
>100_bases ATTCTATCGACACCGACTTCGGATCATACTATTGATAAAACGATCAGGAAAGGGCGGCCACCCCTCGCTTCATTCATTAC TCCCAACTCTTGAATACCTG
Downstream 100 bases:
>100_bases TGAACGAAATATAATTAGTTACCTGTATTATGATTTACCCTGAGTCTCCTTTAACTGTATATGAATATTTTGAATTAGTT TATGGATGTGTTTTTGTAAT
Product: YD repeat protein
Products: NA
Alternate protein names: Cell wall-associated polypeptide CWBP200; CWBP200 [H]
Number of amino acids: Translated: 1929; Mature: 1929
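The residue count follows from the gene sequence: 5790 bases form 1930 codons, of which the last is a stop. As a hedged illustration, the sketch below translates a coding sequence using the standard codon assignments (an assumption; the record does not state which translation table was used). The names `CODON_TABLE` and `translate` are illustrative, and the two inputs are the first six and last five codons of the gene sequence listed in this entry.

```python
from itertools import product

# Standard codon assignments, ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # remove spaces from wrapped listings
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAATCCGCCTCATCGT"))  # MNPPHR
print(translate("ATATGGTCTAAATAA"))     # IWSK
```

Both outputs match the first and last residues of the protein sequence in this record.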
Protein sequence:
>1929_residues MNPPHRSSSAHRSAVPAYRKWYFGSGASQSGSTRRVRLSGQPDPDPLPFEAIRIHEERKVGPPSTSADGHSAHGTFTIPE GAEGKKLYGTCSLFLGVDDWGILEVKDSGGNVVAQVDLKENPQTAGEQGGHKYHTGTGGAQLPSGTYSWEVSQTNIDYNP ASGNTSICNYSIDVVPTEPGGRKEPEPCPCEGDTCDNSGGTPPSPPQARSCPEAGMESGALGNYSSAGCSVTAESTATLM YWSCNFGAFRGLGGLPAGRVELRAEQNVSGLESPSSLAYNHPLNSRLDVPEGGIAPGVRFNLVQGDRVIAMRCYTDGSVL PIGVDTSGGGRAALATVEGQSCLRWVVEDGSQYLFSAETGTLLSYTTTDRQVISNASSYLDVRHAGDGSLRQIWNLWDGL LNVENVTSTGYTIALYTPGQITGTDEQGFYTVTGAPLKTFILSLDAEEKFTITEQAPARQPYAVTWWNDGLAWNMRQGTG EDALTTLRTRTELEPENSVWQLVTEISKNGIVAARTCAIYQTTDVGDLLLTLAEGYGSPEEQTTQYAYDQCGRLRTETAP GGSQTHYAYDLYGRLLSRDEPWAESGRRITRYTYACSGEADFSNEPATETADLLPLEGHVKTLTSTTWKYTTANHIKRTE RRVTGLGVTGTRLTAEEQWLAGAANIHARGRTRFSRDLDGVQTWHDYAATTEHGALYTETVETRINGEAVPGQSTRAVTW ITAEGQRVREENYLLLSTGQWALTGSAVYEFDTQNRWVKRTAGNGRLTERELMCDGGLLWEIDENGIRTDYAYDTARQLV EVTRSAVMDGETVITPETITTYVRDAAGRVLSTRQDTGAMTTRESATYDLLGRTTSTTDVLGRVTTYAYSQDGLTVTQTV PSGATFITRSAPDGTVMEESGTGQRHVIYAIDLVSDGVRTFTKAVSGETQTELQRSIVNGAGETLRTGVPNTTGGVIYTR NTYNARGQLTKTQTDAGNAATTMAPTLWEYDAFGNKTKETWKLADPATTSNSRITTWSYGVEQAQDEVYRVVTATRNNSR GTTYNETQKTLASSLSSTLESKVISIDPRGNASEQWSEYGPGAVRTQKSSIPTSDITAAATVIDGFIISQTDHAGVTATH TRAYTETGVIYASTDGRGNTVTTHTDLTGRTISVTDAAGNTTSTAYGPWFDQPAVVTNALGNTTCYGYDLRGRNTAQWGT RAQPLLFGYDEADRMISLTTFREDAGDITADPTGRTDGDVTTWSYDDATGLLIRKTWADGTHEDTAYNALNFKSTLMDAR GVVTTWGYNLKKGVNNSVSYSDSTPGIQYAYNHLNQLTQVTDASGSRVLTYTPCNEPDTDSITIGGSSYQLQEHYDTYGR SSGYTLKQGTDVLQEASQGYETDGRLASAGIRHGGTEQSFAYGYLAGSSLLSSLAMPDGIVRELAYEQRRNLVTAINCRL GETVLVSRSQGYDALGRPVTRTQQRGTEPARSDSFSYNGRNELTAATLGAAPYGYSYDNIGNRKTAREPAEELAYAANGL NQYTGIEESGEAPFVPTYDASGNQTLIKTSTGIWTAVYNAANRAVSFTSRDGATVVECGYDYQGRRYMKKVTQNGTVASH ERYLYRGYLQIAALDMLDNRNVLRTLLWDPLEPVATRPLALAQGASLYCYGMDFNKNVSEVFDAQGTIAAAYDYSPYGIV GSTGNLVQPVQWSGEMHDEEPTLAYYNYRFYNPKDGRWINRDPIAEQGGWNLYGFVDNGVVFSIDYLGKETNEFGYTMTI PAGYIPILLIIEGSISIEKKKNCVCIEAKLRTDVGIGVGIGLKIKQKWWPILPDIEYSLKTMIAGLSNEKILKIDNCNGK TLTSYREELLGFSHRIDAGITLSSYIQASYSIDFKISSGLTLNAKPIYLTLDATEYIKIDATLKIPFIIDQNEELVNKSF NENIKIWSK
Sequences:
>Translated_1929_residues MNPPHRSSSAHRSAVPAYRKWYFGSGASQSGSTRRVRLSGQPDPDPLPFEAIRIHEERKVGPPSTSADGHSAHGTFTIPE GAEGKKLYGTCSLFLGVDDWGILEVKDSGGNVVAQVDLKENPQTAGEQGGHKYHTGTGGAQLPSGTYSWEVSQTNIDYNP ASGNTSICNYSIDVVPTEPGGRKEPEPCPCEGDTCDNSGGTPPSPPQARSCPEAGMESGALGNYSSAGCSVTAESTATLM YWSCNFGAFRGLGGLPAGRVELRAEQNVSGLESPSSLAYNHPLNSRLDVPEGGIAPGVRFNLVQGDRVIAMRCYTDGSVL PIGVDTSGGGRAALATVEGQSCLRWVVEDGSQYLFSAETGTLLSYTTTDRQVISNASSYLDVRHAGDGSLRQIWNLWDGL LNVENVTSTGYTIALYTPGQITGTDEQGFYTVTGAPLKTFILSLDAEEKFTITEQAPARQPYAVTWWNDGLAWNMRQGTG EDALTTLRTRTELEPENSVWQLVTEISKNGIVAARTCAIYQTTDVGDLLLTLAEGYGSPEEQTTQYAYDQCGRLRTETAP GGSQTHYAYDLYGRLLSRDEPWAESGRRITRYTYACSGEADFSNEPATETADLLPLEGHVKTLTSTTWKYTTANHIKRTE RRVTGLGVTGTRLTAEEQWLAGAANIHARGRTRFSRDLDGVQTWHDYAATTEHGALYTETVETRINGEAVPGQSTRAVTW ITAEGQRVREENYLLLSTGQWALTGSAVYEFDTQNRWVKRTAGNGRLTERELMCDGGLLWEIDENGIRTDYAYDTARQLV EVTRSAVMDGETVITPETITTYVRDAAGRVLSTRQDTGAMTTRESATYDLLGRTTSTTDVLGRVTTYAYSQDGLTVTQTV PSGATFITRSAPDGTVMEESGTGQRHVIYAIDLVSDGVRTFTKAVSGETQTELQRSIVNGAGETLRTGVPNTTGGVIYTR NTYNARGQLTKTQTDAGNAATTMAPTLWEYDAFGNKTKETWKLADPATTSNSRITTWSYGVEQAQDEVYRVVTATRNNSR GTTYNETQKTLASSLSSTLESKVISIDPRGNASEQWSEYGPGAVRTQKSSIPTSDITAAATVIDGFIISQTDHAGVTATH TRAYTETGVIYASTDGRGNTVTTHTDLTGRTISVTDAAGNTTSTAYGPWFDQPAVVTNALGNTTCYGYDLRGRNTAQWGT RAQPLLFGYDEADRMISLTTFREDAGDITADPTGRTDGDVTTWSYDDATGLLIRKTWADGTHEDTAYNALNFKSTLMDAR GVVTTWGYNLKKGVNNSVSYSDSTPGIQYAYNHLNQLTQVTDASGSRVLTYTPCNEPDTDSITIGGSSYQLQEHYDTYGR SSGYTLKQGTDVLQEASQGYETDGRLASAGIRHGGTEQSFAYGYLAGSSLLSSLAMPDGIVRELAYEQRRNLVTAINCRL GETVLVSRSQGYDALGRPVTRTQQRGTEPARSDSFSYNGRNELTAATLGAAPYGYSYDNIGNRKTAREPAEELAYAANGL NQYTGIEESGEAPFVPTYDASGNQTLIKTSTGIWTAVYNAANRAVSFTSRDGATVVECGYDYQGRRYMKKVTQNGTVASH ERYLYRGYLQIAALDMLDNRNVLRTLLWDPLEPVATRPLALAQGASLYCYGMDFNKNVSEVFDAQGTIAAAYDYSPYGIV GSTGNLVQPVQWSGEMHDEEPTLAYYNYRFYNPKDGRWINRDPIAEQGGWNLYGFVDNGVVFSIDYLGKETNEFGYTMTI PAGYIPILLIIEGSISIEKKKNCVCIEAKLRTDVGIGVGIGLKIKQKWWPILPDIEYSLKTMIAGLSNEKILKIDNCNGK TLTSYREELLGFSHRIDAGITLSSYIQASYSIDFKISSGLTLNAKPIYLTLDATEYIKIDATLKIPFIIDQNEELVNKSF NENIKIWSK 
>Mature_1929_residues MNPPHRSSSAHRSAVPAYRKWYFGSGASQSGSTRRVRLSGQPDPDPLPFEAIRIHEERKVGPPSTSADGHSAHGTFTIPE GAEGKKLYGTCSLFLGVDDWGILEVKDSGGNVVAQVDLKENPQTAGEQGGHKYHTGTGGAQLPSGTYSWEVSQTNIDYNP ASGNTSICNYSIDVVPTEPGGRKEPEPCPCEGDTCDNSGGTPPSPPQARSCPEAGMESGALGNYSSAGCSVTAESTATLM YWSCNFGAFRGLGGLPAGRVELRAEQNVSGLESPSSLAYNHPLNSRLDVPEGGIAPGVRFNLVQGDRVIAMRCYTDGSVL PIGVDTSGGGRAALATVEGQSCLRWVVEDGSQYLFSAETGTLLSYTTTDRQVISNASSYLDVRHAGDGSLRQIWNLWDGL LNVENVTSTGYTIALYTPGQITGTDEQGFYTVTGAPLKTFILSLDAEEKFTITEQAPARQPYAVTWWNDGLAWNMRQGTG EDALTTLRTRTELEPENSVWQLVTEISKNGIVAARTCAIYQTTDVGDLLLTLAEGYGSPEEQTTQYAYDQCGRLRTETAP GGSQTHYAYDLYGRLLSRDEPWAESGRRITRYTYACSGEADFSNEPATETADLLPLEGHVKTLTSTTWKYTTANHIKRTE RRVTGLGVTGTRLTAEEQWLAGAANIHARGRTRFSRDLDGVQTWHDYAATTEHGALYTETVETRINGEAVPGQSTRAVTW ITAEGQRVREENYLLLSTGQWALTGSAVYEFDTQNRWVKRTAGNGRLTERELMCDGGLLWEIDENGIRTDYAYDTARQLV EVTRSAVMDGETVITPETITTYVRDAAGRVLSTRQDTGAMTTRESATYDLLGRTTSTTDVLGRVTTYAYSQDGLTVTQTV PSGATFITRSAPDGTVMEESGTGQRHVIYAIDLVSDGVRTFTKAVSGETQTELQRSIVNGAGETLRTGVPNTTGGVIYTR NTYNARGQLTKTQTDAGNAATTMAPTLWEYDAFGNKTKETWKLADPATTSNSRITTWSYGVEQAQDEVYRVVTATRNNSR GTTYNETQKTLASSLSSTLESKVISIDPRGNASEQWSEYGPGAVRTQKSSIPTSDITAAATVIDGFIISQTDHAGVTATH TRAYTETGVIYASTDGRGNTVTTHTDLTGRTISVTDAAGNTTSTAYGPWFDQPAVVTNALGNTTCYGYDLRGRNTAQWGT RAQPLLFGYDEADRMISLTTFREDAGDITADPTGRTDGDVTTWSYDDATGLLIRKTWADGTHEDTAYNALNFKSTLMDAR GVVTTWGYNLKKGVNNSVSYSDSTPGIQYAYNHLNQLTQVTDASGSRVLTYTPCNEPDTDSITIGGSSYQLQEHYDTYGR SSGYTLKQGTDVLQEASQGYETDGRLASAGIRHGGTEQSFAYGYLAGSSLLSSLAMPDGIVRELAYEQRRNLVTAINCRL GETVLVSRSQGYDALGRPVTRTQQRGTEPARSDSFSYNGRNELTAATLGAAPYGYSYDNIGNRKTAREPAEELAYAANGL NQYTGIEESGEAPFVPTYDASGNQTLIKTSTGIWTAVYNAANRAVSFTSRDGATVVECGYDYQGRRYMKKVTQNGTVASH ERYLYRGYLQIAALDMLDNRNVLRTLLWDPLEPVATRPLALAQGASLYCYGMDFNKNVSEVFDAQGTIAAAYDYSPYGIV GSTGNLVQPVQWSGEMHDEEPTLAYYNYRFYNPKDGRWINRDPIAEQGGWNLYGFVDNGVVFSIDYLGKETNEFGYTMTI PAGYIPILLIIEGSISIEKKKNCVCIEAKLRTDVGIGVGIGLKIKQKWWPILPDIEYSLKTMIAGLSNEKILKIDNCNGK TLTSYREELLGFSHRIDAGITLSSYIQASYSIDFKISSGLTLNAKPIYLTLDATEYIKIDATLKIPFIIDQNEELVNKSF NENIKIWSK
Specific function: Still unknown. Not involved in cell membrane metabolism, motility, secretion or differentiation [H]
COG id: NA
COG function: NA
Gene ontology:
Cell location: Secreted, cell wall. Note=Released into the medium [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
None
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR008979
- InterPro: IPR022385
- InterPro: IPR006530 [H]
Pfam domain/function: PF05593 RHS_repeat [H]
EC number: NA
Molecular weight: Translated: 210173; Mature: 210173
Theoretical pI: Translated: 4.83; Mature: 4.83
Prosite motif: PS00178 AA_TRNA_LIGASE_I
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.1 %Cys (Translated Protein)
1.0 %Met (Translated Protein)
2.1 %Cys+Met (Translated Protein)
1.1 %Cys (Mature Protein)
1.0 %Met (Mature Protein)
2.1 %Cys+Met (Mature Protein)
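The Cys/Met percentages are simple residue frequencies. A minimal sketch of that calculation follows, assuming each percentage is the count of the named residues divided by the total protein length, rounded to one decimal place. The function name `residue_percent` is hypothetical, and the input is a ten-residue fragment of the protein (EPCPCEGDTC), not the full sequence.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the given one-letter residues in a protein sequence."""
    seq = "".join(protein.split()).upper()  # remove spaces from wrapped listings
    if not seq:
        raise ValueError("empty protein sequence")
    count = sum(seq.count(r) for r in residues)
    return round(100 * count / len(seq), 1)


fragment = "EPCPCEGDTC"  # short fragment taken from the protein sequence
print(residue_percent(fragment, "C"))   # 30.0
print(residue_percent(fragment, "M"))   # 0.0
print(residue_percent(fragment, "CM"))  # 30.0
```

Applied to the full 1929-residue sequence, the same function would yield the values to compare against those listed above.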
Secondary structure:
>Translated Secondary Structure MNPPHRSSSAHRSAVPAYRKWYFGSGASQSGSTRRVRLSGQPDPDPLPFEAIRIHEERKV CCCCCCCCCCHHHCCCHHHHEEECCCCCCCCCEEEEEECCCCCCCCCCHHEEEEEHHCCC GPPSTSADGHSAHGTFTIPEGAEGKKLYGTCSLFLGVDDWGILEVKDSGGNVVAQVDLKE CCCCCCCCCCCCCCEEECCCCCCCCEEEEEEEEEEECCCCEEEEEECCCCCEEEEEECCC NPQTAGEQGGHKYHTGTGGAQLPSGTYSWEVSQTNIDYNPASGNTSICNYSIDVVPTEPG CCCCCHHCCCCEEECCCCCCCCCCCCEEEEEECCCCCCCCCCCCCEEEEEEEEEEECCCC GRKEPEPCPCEGDTCDNSGGTPPSPPQARSCPEAGMESGALGNYSSAGCSVTAESTATLM CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCEEEECCCEEEE YWSCNFGAFRGLGGLPAGRVELRAEQNVSGLESPSSLAYNHPLNSRLDVPEGGIAPGVRF EEECCCCHHCCCCCCCCCEEEEEECCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCEE NLVQGDRVIAMRCYTDGSVLPIGVDTSGGGRAALATVEGQSCLRWVVEDGSQYLFSAETG EEEECCEEEEEEEECCCCEEEEEECCCCCCEEEEEEECCHHHHHHHHHCCCEEEEEECCC TLLSYTTTDRQVISNASSYLDVRHAGDGSLRQIWNLWDGLLNVENVTSTGYTIALYTPGQ CEEEEECCHHHHHHCCHHEEEEEECCCCHHHHHHHHHHHHCCCCEECCCCEEEEEECCCE ITGTDEQGFYTVTGAPLKTFILSLDAEEKFTITEQAPARQPYAVTWWNDGLAWNMRQGTG EECCCCCCEEEEECCCCEEEEEEECCCCCEEEECCCCCCCCEEEEEECCCEEEEECCCCC EDALTTLRTRTELEPENSVWQLVTEISKNGIVAARTCAIYQTTDVGDLLLTLAEGYGSPE HHHHHHHHHHCCCCCCHHHHHHHHHHHCCCEEEEEEEEEEEECCHHHHHHHHHHCCCCCH EQTTQYAYDQCGRLRTETAPGGSQTHYAYDLYGRLLSRDEPWAESGRRITRYTYACSGEA HHHHHHHHHHHCCEEECCCCCCCCCEEEHHHHHHHHCCCCCHHHCCCEEEEEEEEECCCC DFSNEPATETADLLPLEGHVKTLTSTTWKYTTANHIKRTERRVTGLGVTGTRLTAEEQWL CCCCCCCCCCHHEEECCCCCEEEECCEEEEECHHHHHHHHHHHEECCCCCCEEECCHHHH AGAANIHARGRTRFSRDLDGVQTWHDYAATTEHGALYTETVETRINGEAVPGQSTRAVTW CCCCEEEECCCCHHHHCCCHHHHHHHHHCCCCCCEEEEEEEHEEECCCCCCCCCCEEEEE ITAEGQRVREENYLLLSTGQWALTGSAVYEFDTQNRWVKRTAGNGRLTERELMCDGGLLW EECCCCEECCCCEEEEECCCEEEECCEEEEECCCCCEEEECCCCCCCCHHEEEECCCEEE EIDENGIRTDYAYDTARQLVEVTRSAVMDGETVITPETITTYVRDAAGRVLSTRQDTGAM EECCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEECCHHHHHHHHHHHHHHEECCCCCCCE TTRESATYDLLGRTTSTTDVLGRVTTYAYSQDGLTVTQTVPSGATFITRSAPDGTVMEES EECCCCCEEECCCCCCHHHHHHHHEEEEECCCCEEEEEECCCCCEEEEECCCCCCEEECC GTGQRHVIYAIDLVSDGVRTFTKAVSGETQTELQRSIVNGAGETLRTGVPNTTGGVIYTR CCCCEEEEEEEEEHHCHHHHHHHHHCCCHHHHHHHHHHHCCCCHHHCCCCCCCCCEEEEC 
NTYNARGQLTKTQTDAGNAATTMAPTLWEYDAFGNKTKETWKLADPATTSNSRITTWSYG CCCCCCCEEEEECCCCCCCCCCCCCCCEEHHCCCCCCCCEEEECCCCCCCCCEEEEEECC VEQAQDEVYRVVTATRNNSRGTTYNETQKTLASSLSSTLESKVISIDPRGNASEQWSEYG HHHHHHHEEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCEEEECCCCCCHHHHHHCC PGAVRTQKSSIPTSDITAAATVIDGFIISQTDHAGVTATHTRAYTETGVIYASTDGRGNT CCCEEECCCCCCCHHHHHHHHHHCCEEEECCCCCCCEEECCEEEEECCEEEEECCCCCCE VTTHTDLTGRTISVTDAAGNTTSTAYGPWFDQPAVVTNALGNTTCYGYDLRGRNTAQWGT EEEECCCCCCEEEEEECCCCCCCCCCCCCCCCCHHHHHCCCCCEEEEEECCCCCCCCCCC RAQPLLFGYDEADRMISLTTFREDAGDITADPTGRTDGDVTTWSYDDATGLLIRKTWADG CCCCEEEECCCCCCEEEEEEECCCCCCCCCCCCCCCCCCEEEEECCCCCCEEEEEECCCC THEDTAYNALNFKSTLMDARGVVTTWGYNLKKGVNNSVSYSDSTPGIQYAYNHLNQLTQV CCCCCCEEEECHHHHHHHCCCEEEECCCHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHH TDASGSRVLTYTPCNEPDTDSITIGGSSYQLQEHYDTYGRSSGYTLKQGTDVLQEASQGY HCCCCCEEEEECCCCCCCCCEEEECCCCEEHHHHHHHCCCCCCEEEHHHHHHHHHHHCCC ETDGRLASAGIRHGGTEQSFAYGYLAGSSLLSSLAMPDGIVRELAYEQRRNLVTAINCRL CCCCCEEHHHCCCCCCCCCEEEEEEHHHHHHHHHCCCHHHHHHHHHHHHCCEEEEEECCC GETVLVSRSQGYDALGRPVTRTQQRGTEPARSDSFSYNGRNELTAATLGAAPYGYSYDNI CCEEEEECCCCCHHHCCCHHHHHHCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCC GNRKTAREPAEELAYAANGLNQYTGIEESGEAPFVPTYDASGNQTLIKTSTGIWTAVYNA CCCCHHCCHHHHHHHHHCCCHHHCCCCCCCCCCEEEEECCCCCEEEEEECCCEEEHHHHH ANRAVSFTSRDGATVVECGYDYQGRRYMKKVTQNGTVASHERYLYRGYLQIAALDMLDNR CCCEEEEECCCCCEEEEECCCCCCHHHHHHHHCCCCEECCCHHHHHHEEEEEEEEHHCCC NVLRTLLWDPLEPVATRPLALAQGASLYCYGMDFNKNVSEVFDAQGTIAAAYDYSPYGIV HHHHHHHCCCCCHHHCCCEEEECCCEEEEEECCCCCCHHHHHCCCCCEEEEEECCCCEEE GSTGNLVQPVQWSGEMHDEEPTLAYYNYRFYNPKDGRWINRDPIAEQGGWNLYGFVDNGV CCCCCCCCCEECCCCCCCCCCCEEEEEEEEECCCCCCEECCCCCCCCCCCEEEEEECCCE VFSIDYLGKETNEFGYTMTIPAGYIPILLIIEGSISIEKKKNCVCIEAKLRTDVGIGVGI EEEEEECCCCCCCCCEEEEECCCCEEEEEEEECCEEEECCCCEEEEEEEECCCCCEEEEC GLKIKQKWWPILPDIEYSLKTMIAGLSNEKILKIDNCNGKTLTSYREELLGFSHRIDAGI CEEECCCCCCCCCCCHHHHHHHHHCCCCCCEEEEECCCCCCHHHHHHHHCCCHHHCCCCC TLSSYIQASYSIDFKISSGLTLNAKPIYLTLDATEYIKIDATLKIPFIIDQNEELVNKSF CHHHHEEEEEEEEEEECCCEEECCCEEEEEECCCEEEEEEEEEEEEEEECCCHHHHHCCC NENIKIWSK CCCEEEEEC >Mature Secondary Structure 
MNPPHRSSSAHRSAVPAYRKWYFGSGASQSGSTRRVRLSGQPDPDPLPFEAIRIHEERKV CCCCCCCCCCHHHCCCHHHHEEECCCCCCCCCEEEEEECCCCCCCCCCHHEEEEEHHCCC GPPSTSADGHSAHGTFTIPEGAEGKKLYGTCSLFLGVDDWGILEVKDSGGNVVAQVDLKE CCCCCCCCCCCCCCEEECCCCCCCCEEEEEEEEEEECCCCEEEEEECCCCCEEEEEECCC NPQTAGEQGGHKYHTGTGGAQLPSGTYSWEVSQTNIDYNPASGNTSICNYSIDVVPTEPG CCCCCHHCCCCEEECCCCCCCCCCCCEEEEEECCCCCCCCCCCCCEEEEEEEEEEECCCC GRKEPEPCPCEGDTCDNSGGTPPSPPQARSCPEAGMESGALGNYSSAGCSVTAESTATLM CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCEEEECCCEEEE YWSCNFGAFRGLGGLPAGRVELRAEQNVSGLESPSSLAYNHPLNSRLDVPEGGIAPGVRF EEECCCCHHCCCCCCCCCEEEEEECCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCEE NLVQGDRVIAMRCYTDGSVLPIGVDTSGGGRAALATVEGQSCLRWVVEDGSQYLFSAETG EEEECCEEEEEEEECCCCEEEEEECCCCCCEEEEEEECCHHHHHHHHHCCCEEEEEECCC TLLSYTTTDRQVISNASSYLDVRHAGDGSLRQIWNLWDGLLNVENVTSTGYTIALYTPGQ CEEEEECCHHHHHHCCHHEEEEEECCCCHHHHHHHHHHHHCCCCEECCCCEEEEEECCCE ITGTDEQGFYTVTGAPLKTFILSLDAEEKFTITEQAPARQPYAVTWWNDGLAWNMRQGTG EECCCCCCEEEEECCCCEEEEEEECCCCCEEEECCCCCCCCEEEEEECCCEEEEECCCCC EDALTTLRTRTELEPENSVWQLVTEISKNGIVAARTCAIYQTTDVGDLLLTLAEGYGSPE HHHHHHHHHHCCCCCCHHHHHHHHHHHCCCEEEEEEEEEEEECCHHHHHHHHHHCCCCCH EQTTQYAYDQCGRLRTETAPGGSQTHYAYDLYGRLLSRDEPWAESGRRITRYTYACSGEA HHHHHHHHHHHCCEEECCCCCCCCCEEEHHHHHHHHCCCCCHHHCCCEEEEEEEEECCCC DFSNEPATETADLLPLEGHVKTLTSTTWKYTTANHIKRTERRVTGLGVTGTRLTAEEQWL CCCCCCCCCCHHEEECCCCCEEEECCEEEEECHHHHHHHHHHHEECCCCCCEEECCHHHH AGAANIHARGRTRFSRDLDGVQTWHDYAATTEHGALYTETVETRINGEAVPGQSTRAVTW CCCCEEEECCCCHHHHCCCHHHHHHHHHCCCCCCEEEEEEEHEEECCCCCCCCCCEEEEE ITAEGQRVREENYLLLSTGQWALTGSAVYEFDTQNRWVKRTAGNGRLTERELMCDGGLLW EECCCCEECCCCEEEEECCCEEEECCEEEEECCCCCEEEECCCCCCCCHHEEEECCCEEE EIDENGIRTDYAYDTARQLVEVTRSAVMDGETVITPETITTYVRDAAGRVLSTRQDTGAM EECCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEECCHHHHHHHHHHHHHHEECCCCCCCE TTRESATYDLLGRTTSTTDVLGRVTTYAYSQDGLTVTQTVPSGATFITRSAPDGTVMEES EECCCCCEEECCCCCCHHHHHHHHEEEEECCCCEEEEEECCCCCEEEEECCCCCCEEECC GTGQRHVIYAIDLVSDGVRTFTKAVSGETQTELQRSIVNGAGETLRTGVPNTTGGVIYTR CCCCEEEEEEEEEHHCHHHHHHHHHCCCHHHHHHHHHHHCCCCHHHCCCCCCCCCEEEEC 
NTYNARGQLTKTQTDAGNAATTMAPTLWEYDAFGNKTKETWKLADPATTSNSRITTWSYG CCCCCCCEEEEECCCCCCCCCCCCCCCEEHHCCCCCCCCEEEECCCCCCCCCEEEEEECC VEQAQDEVYRVVTATRNNSRGTTYNETQKTLASSLSSTLESKVISIDPRGNASEQWSEYG HHHHHHHEEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCEEEECCCCCCHHHHHHCC PGAVRTQKSSIPTSDITAAATVIDGFIISQTDHAGVTATHTRAYTETGVIYASTDGRGNT CCCEEECCCCCCCHHHHHHHHHHCCEEEECCCCCCCEEECCEEEEECCEEEEECCCCCCE VTTHTDLTGRTISVTDAAGNTTSTAYGPWFDQPAVVTNALGNTTCYGYDLRGRNTAQWGT EEEECCCCCCEEEEEECCCCCCCCCCCCCCCCCHHHHHCCCCCEEEEEECCCCCCCCCCC RAQPLLFGYDEADRMISLTTFREDAGDITADPTGRTDGDVTTWSYDDATGLLIRKTWADG CCCCEEEECCCCCCEEEEEEECCCCCCCCCCCCCCCCCCEEEEECCCCCCEEEEEECCCC THEDTAYNALNFKSTLMDARGVVTTWGYNLKKGVNNSVSYSDSTPGIQYAYNHLNQLTQV CCCCCCEEEECHHHHHHHCCCEEEECCCHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHH TDASGSRVLTYTPCNEPDTDSITIGGSSYQLQEHYDTYGRSSGYTLKQGTDVLQEASQGY HCCCCCEEEEECCCCCCCCCEEEECCCCEEHHHHHHHCCCCCCEEEHHHHHHHHHHHCCC ETDGRLASAGIRHGGTEQSFAYGYLAGSSLLSSLAMPDGIVRELAYEQRRNLVTAINCRL CCCCCEEHHHCCCCCCCCCEEEEEEHHHHHHHHHCCCHHHHHHHHHHHHCCEEEEEECCC GETVLVSRSQGYDALGRPVTRTQQRGTEPARSDSFSYNGRNELTAATLGAAPYGYSYDNI CCEEEEECCCCCHHHCCCHHHHHHCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCC GNRKTAREPAEELAYAANGLNQYTGIEESGEAPFVPTYDASGNQTLIKTSTGIWTAVYNA CCCCHHCCHHHHHHHHHCCCHHHCCCCCCCCCCEEEEECCCCCEEEEEECCCEEEHHHHH ANRAVSFTSRDGATVVECGYDYQGRRYMKKVTQNGTVASHERYLYRGYLQIAALDMLDNR CCCEEEEECCCCCEEEEECCCCCCHHHHHHHHCCCCEECCCHHHHHHEEEEEEEEHHCCC NVLRTLLWDPLEPVATRPLALAQGASLYCYGMDFNKNVSEVFDAQGTIAAAYDYSPYGIV HHHHHHHCCCCCHHHCCCEEEECCCEEEEEECCCCCCHHHHHCCCCCEEEEEECCCCEEE GSTGNLVQPVQWSGEMHDEEPTLAYYNYRFYNPKDGRWINRDPIAEQGGWNLYGFVDNGV CCCCCCCCCEECCCCCCCCCCCEEEEEEEEECCCCCCEECCCCCCCCCCCEEEEEECCCE VFSIDYLGKETNEFGYTMTIPAGYIPILLIIEGSISIEKKKNCVCIEAKLRTDVGIGVGI EEEEEECCCCCCCCCEEEEECCCCEEEEEEEECCEEEECCCCEEEEEEEECCCCCEEEEC GLKIKQKWWPILPDIEYSLKTMIAGLSNEKILKIDNCNGKTLTSYREELLGFSHRIDAGI CEEECCCCCCCCCCCHHHHHHHHHCCCCCCEEEEECCCCCCHHHHHHHHCCCHHHCCCCC TLSSYIQASYSIDFKISSGLTLNAKPIYLTLDATEYIKIDATLKIPFIIDQNEELVNKSF CHHHHEEEEEEEEEEECCCEEECCCEEEEEECCCEEEEEEEEEEEEEEECCCHHHHHCCC NENIKIWSK CCCEEEEEC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 8316082; 7704263; 8969509; 9384377; 10658653 [H]