Definition | Akkermansia muciniphila ATCC BAA-835, complete genome. |
---|---|
Accession | NC_010655 |
Length | 2,664,102 |
The map label for this gene is aas [H]
Identifier: 187736207
GI number: 187736207
Start: 2091197
End: 2093359
Strand: Reverse
Name: aas [H]
Synonym: Amuc_1720
Alternate gene names: 187736207
Gene position: 2093359-2091197 (Counterclockwise)
Preceding gene: 187736212
Following gene: 187736206
Centisome position: 78.58
GC content: 56.59
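The positional fields above are mutually consistent, which can be checked with a short sketch. It assumes that coordinates are 1-based and inclusive, and that the centisome position is 100 × position / genome length taken at the higher coordinate (2093359, the 5' end of a reverse-strand gene). That formula is an inference that happens to reproduce the listed 78.58, not a rule stated by the record. The function names are hypothetical.

```python
# Values copied from the record; the formulas below are assumptions.
GENOME_LENGTH = 2_664_102  # "Length" field of NC_010655
START = 2_091_197          # "Start" field
END = 2_093_359            # "End" field


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length (assumed definition)."""
    return round(100 * position / genome_length, 2)


length = gene_length(START, END)
codons = length // 3  # includes the terminal stop codon

print(length)                         # bases in the gene
print(codons - 1)                     # translated residues, stop codon excluded
print(centisome(END, GENOME_LENGTH))  # compare with "Centisome position"
```

Under these assumptions the gene spans 2163 bases, matching the `>2163_bases` header, and 721 codons minus the stop codon gives the 720 translated residues reported below.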
Gene sequence:
>2163_bases ATGGCCATTTTTGGTTCCTCCAATTTATCCGACTCAGGTGATCTGTTGATCTTCAACAAAGTATCTTATAAAACCCTACT GATGTTCGAAAAAGAACTGGGCAGGGACCGCATGACCTACCTGGTGGAAGAAGGTCTCCCCCCGGATACGGCCACCGCCT CCCATCTGGAATCCACTAAAGCGGACGGCATCCTGTTCCAGGCCGGAGGCAGCGACCCTGTCGCGCTGCGTTCCGCCATC ATGGAACGCATTAACGAAGGCAGAAGAGTAGTATTTCTGCCCGGTCCCGTCTCCCATGTGAAAGGTTCCATCAGCCAAAT TCCTCCGCGGGTCATCAAGGCCCTGGAAGCCCTGCATATTTCCCCCGTACCGGTCTATGCCGGTTTTTACACGAACTCCG TGCTGGATGCGGAGGCCGATACGGACGCCCAGGCGGACATCCAGATACATATTCTCCCCAAACTGGCCCCCGGGGCCGAG ATGGCAGCACGCCTGACCTCCGCGTGGCTGGAATGTTCCGCCCAGGCATACGCCACGCTGCCTCAGCTCCACGGTTCCCT GTCCGCCCTCCTTTTCCGCAGCCTCAAACTTCATTCCGATTGCCGGGTCATCGACGGCATTGATGACACTACGCTGACTT ACGGCCAGCTGCTGGCCATTTCCGTAGCCTTTGCCAAGAGGTTGAAAAAAATTACCTCGAATCGCCGGGTCGGCATCATT CTTCCGCCGGGCAAGGGAGCGGCCATAGCCAATCTGGGCTGCCTGTTCGCCGGGAAAACACCGGTGAATTTCAATTATTC CGCTTCGGAAGGAGCCTTTGCCAGCTCTGTAAAGCAATCCGGCGTGGACTGGTTTATTACTGCGGATACCTTCATGCGAA AGCTCCAGAATTTCCCGTGGCCCCCCCAGCGGGATCTGATCCTCATGGAGCGGGAAATTCCCCTGCTTAAAGGTTCCGCC AAACGCTGGGGCCTCGCCATCAAATTCCTGACAGCGGGGTTCATGATTAAAAAACTGGGGCTGGACGCGCCTACAGGCGC GGACGAAGCCGTTCTGATGTTCACTTCCGGCTCTTCCGGGGAACCCAAGGGCGTGCCGCTGACCCACCATAACCTTCTTT CCAACATCTCTCAATGTTCCTCCCGCATTACGCTGGAACCGCAAAACAGGTTTCTGGGAAGCCTGCCCGTATTCCACTGC TTCGGCATCACCATCGGGCTATGGTATCCGATGATCGGCGGGTACGACATGGTCACCTACCCCTCCCCTCTTGAGGCCAA ACGGCTGGGAGCCCTTATCAAGCAGTACGGAATCAGTCTGGTAGTCACCACGCCCACTTTCCTGCGCGGTTTCATGAAAC GCTGCGAACCGGACACCTTTAAAACCGTCCGCTACCTGATCGTTGGCGCGGAAAAACTGCCGGAAGACCTTTCCATCGCT TTCCGGGAAAAATTCGGCATTATTCCATGTGAAGGCTACGGCCTGACGGAAGCCTCTCCCGTCTGTTCCGTCAACTTCAT TGACCCGGCGCCATCCAATGCCGCCGGAGACTTCATTCCCGGCATGAAAAAGAGTTCCGTAGGAGCCCTCCTTCCCGGAA TCGCCATACGCATCACCAGCCCTCACACGGGACGCGTGGTTCCCATCACTACTTCCGGCATGATCTGGCTGAAAGGACCG AACATCTTTCCCGGCTATCTGGGCGGTCCGGAGACGGATCGCGATATTTTCGTGGACGGCTGGCTAAAAACCGGAGACAT AGGTTCCGCAGACGAATTCGGCTTCCTGAAGATTGAAGGACGCATTTCCCGCTTTTCCAAAATAGGCGGGGAAATGGTGC CTCATGAAGCCCTGGAAGCAGCCATTATGAACATTTGGAATCTGGATCCGGCGGACGAAGAACGGCGGATAGCCGTCGTC 
ACCATTCCGGATCCCGTAAAAGGGGAAGCCGTAGCCCTGCTCACCACCCTGGTGACGGATTACGTGCACCAGGCGCGAAC CCTCATCAGGCACGGCCTGATTGACCAGGGTCTGCCGGCCCTCTGGTGCCCCAAGGAGATTATTCCCGTAGAACGCATCC CGGTGCTCCCATCCGGCAAGCTGGATATCAAGCAATGCAGGATGCTGGCGTATGAAGCGCTGAACATCCCCTTTGAACCG TAA
Upstream 100 bases:
>100_bases ATCCGGATAAAGGAAAAGCAACCGGGAATGCACCTTTCCTCCTTGCTCCCGGTTCTTCAAGTTGCTAAAATGCTTCAAGC AAATTTTTTATCCTTTTCCC
Downstream 100 bases:
>100_bases TTTATGGAATTCCCGGAGGAAACGCCCGCCGCGCCCAAACCCATCACAGTCAAACAGCTCGTTTACCGCCTGAGGGACAC GGTCAGCATCGCTATGGGCA
Product: AMP-dependent synthetase and ligase
Products: NA
Alternate protein names: 2-acylglycerophosphoethanolamine acyltransferase; 2-acyl-GPE acyltransferase; Acyl-[acyl-carrier-protein]--phospholipid O-acyltransferase; Acyl-[acyl-carrier-protein] synthetase; Acyl-ACP synthetase; Long-chain-fatty-acid--[acyl-carrier-protein] ligase [H]
Number of amino acids: Translated: 720; Mature: 719
Protein sequence:
>720_residues MAIFGSSNLSDSGDLLIFNKVSYKTLLMFEKELGRDRMTYLVEEGLPPDTATASHLESTKADGILFQAGGSDPVALRSAI MERINEGRRVVFLPGPVSHVKGSISQIPPRVIKALEALHISPVPVYAGFYTNSVLDAEADTDAQADIQIHILPKLAPGAE MAARLTSAWLECSAQAYATLPQLHGSLSALLFRSLKLHSDCRVIDGIDDTTLTYGQLLAISVAFAKRLKKITSNRRVGII LPPGKGAAIANLGCLFAGKTPVNFNYSASEGAFASSVKQSGVDWFITADTFMRKLQNFPWPPQRDLILMEREIPLLKGSA KRWGLAIKFLTAGFMIKKLGLDAPTGADEAVLMFTSGSSGEPKGVPLTHHNLLSNISQCSSRITLEPQNRFLGSLPVFHC FGITIGLWYPMIGGYDMVTYPSPLEAKRLGALIKQYGISLVVTTPTFLRGFMKRCEPDTFKTVRYLIVGAEKLPEDLSIA FREKFGIIPCEGYGLTEASPVCSVNFIDPAPSNAAGDFIPGMKKSSVGALLPGIAIRITSPHTGRVVPITTSGMIWLKGP NIFPGYLGGPETDRDIFVDGWLKTGDIGSADEFGFLKIEGRISRFSKIGGEMVPHEALEAAIMNIWNLDPADEERRIAVV TIPDPVKGEAVALLTTLVTDYVHQARTLIRHGLIDQGLPALWCPKEIIPVERIPVLPSGKLDIKQCRMLAYEALNIPFEP
Sequences:
>Translated_720_residues MAIFGSSNLSDSGDLLIFNKVSYKTLLMFEKELGRDRMTYLVEEGLPPDTATASHLESTKADGILFQAGGSDPVALRSAI MERINEGRRVVFLPGPVSHVKGSISQIPPRVIKALEALHISPVPVYAGFYTNSVLDAEADTDAQADIQIHILPKLAPGAE MAARLTSAWLECSAQAYATLPQLHGSLSALLFRSLKLHSDCRVIDGIDDTTLTYGQLLAISVAFAKRLKKITSNRRVGII LPPGKGAAIANLGCLFAGKTPVNFNYSASEGAFASSVKQSGVDWFITADTFMRKLQNFPWPPQRDLILMEREIPLLKGSA KRWGLAIKFLTAGFMIKKLGLDAPTGADEAVLMFTSGSSGEPKGVPLTHHNLLSNISQCSSRITLEPQNRFLGSLPVFHC FGITIGLWYPMIGGYDMVTYPSPLEAKRLGALIKQYGISLVVTTPTFLRGFMKRCEPDTFKTVRYLIVGAEKLPEDLSIA FREKFGIIPCEGYGLTEASPVCSVNFIDPAPSNAAGDFIPGMKKSSVGALLPGIAIRITSPHTGRVVPITTSGMIWLKGP NIFPGYLGGPETDRDIFVDGWLKTGDIGSADEFGFLKIEGRISRFSKIGGEMVPHEALEAAIMNIWNLDPADEERRIAVV TIPDPVKGEAVALLTTLVTDYVHQARTLIRHGLIDQGLPALWCPKEIIPVERIPVLPSGKLDIKQCRMLAYEALNIPFEP >Mature_719_residues AIFGSSNLSDSGDLLIFNKVSYKTLLMFEKELGRDRMTYLVEEGLPPDTATASHLESTKADGILFQAGGSDPVALRSAIM ERINEGRRVVFLPGPVSHVKGSISQIPPRVIKALEALHISPVPVYAGFYTNSVLDAEADTDAQADIQIHILPKLAPGAEM AARLTSAWLECSAQAYATLPQLHGSLSALLFRSLKLHSDCRVIDGIDDTTLTYGQLLAISVAFAKRLKKITSNRRVGIIL PPGKGAAIANLGCLFAGKTPVNFNYSASEGAFASSVKQSGVDWFITADTFMRKLQNFPWPPQRDLILMEREIPLLKGSAK RWGLAIKFLTAGFMIKKLGLDAPTGADEAVLMFTSGSSGEPKGVPLTHHNLLSNISQCSSRITLEPQNRFLGSLPVFHCF GITIGLWYPMIGGYDMVTYPSPLEAKRLGALIKQYGISLVVTTPTFLRGFMKRCEPDTFKTVRYLIVGAEKLPEDLSIAF REKFGIIPCEGYGLTEASPVCSVNFIDPAPSNAAGDFIPGMKKSSVGALLPGIAIRITSPHTGRVVPITTSGMIWLKGPN IFPGYLGGPETDRDIFVDGWLKTGDIGSADEFGFLKIEGRISRFSKIGGEMVPHEALEAAIMNIWNLDPADEERRIAVVT IPDPVKGEAVALLTTLVTDYVHQARTLIRHGLIDQGLPALWCPKEIIPVERIPVLPSGKLDIKQCRMLAYEALNIPFEP
Specific function: Plays a role in lysophospholipid acylation. Transfers fatty acids to the 1-position via an enzyme-bound acyl-ACP intermediate in the presence of ATP and magnesium. Its physiological function is to regenerate phosphatidylethanolamine from 2-acyl-glycero-3-
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: In the C-terminal section; belongs to the ATP-dependent AMP-binding enzyme family [H]
Homologues:
Organism=Homo sapiens, GI156151445, Length=381, Percent_Identity=25.9842519685039, Blast_Score=102, Evalue=1e-21
Organism=Homo sapiens, GI187761345, Length=391, Percent_Identity=27.3657289002558, Blast_Score=92, Evalue=2e-18
Organism=Homo sapiens, GI187761343, Length=391, Percent_Identity=27.3657289002558, Blast_Score=92, Evalue=2e-18
Organism=Homo sapiens, GI40807491, Length=329, Percent_Identity=24.9240121580547, Blast_Score=83, Evalue=1e-15
Organism=Homo sapiens, GI42544132, Length=534, Percent_Identity=22.4719101123595, Blast_Score=81, Evalue=3e-15
Organism=Escherichia coli, GI1789201, Length=671, Percent_Identity=28.7630402384501, Blast_Score=238, Evalue=1e-63
Organism=Escherichia coli, GI1788107, Length=504, Percent_Identity=24.6031746031746, Blast_Score=116, Evalue=5e-27
Organism=Escherichia coli, GI145693145, Length=359, Percent_Identity=26.4623955431755, Blast_Score=111, Evalue=2e-25
Organism=Escherichia coli, GI221142682, Length=364, Percent_Identity=27.1978021978022, Blast_Score=84, Evalue=3e-17
Organism=Escherichia coli, GI1786801, Length=283, Percent_Identity=24.3816254416961, Blast_Score=65, Evalue=1e-11
Organism=Caenorhabditis elegans, GI17558820, Length=509, Percent_Identity=26.3261296660118, Blast_Score=122, Evalue=7e-28
Organism=Caenorhabditis elegans, GI17559526, Length=317, Percent_Identity=29.9684542586751, Blast_Score=117, Evalue=3e-26
Organism=Caenorhabditis elegans, GI32563687, Length=533, Percent_Identity=27.2045028142589, Blast_Score=116, Evalue=3e-26
Organism=Caenorhabditis elegans, GI17557194, Length=390, Percent_Identity=27.4358974358974, Blast_Score=107, Evalue=3e-23
Organism=Caenorhabditis elegans, GI17538037, Length=384, Percent_Identity=25.5208333333333, Blast_Score=96, Evalue=8e-20
Organism=Caenorhabditis elegans, GI71985884, Length=316, Percent_Identity=25.6329113924051, Blast_Score=92, Evalue=1e-18
Organism=Caenorhabditis elegans, GI17560308, Length=527, Percent_Identity=22.9601518026565, Blast_Score=87, Evalue=4e-17
Organism=Caenorhabditis elegans, GI17560140, Length=356, Percent_Identity=23.314606741573, Blast_Score=82, Evalue=7e-16
Organism=Caenorhabditis elegans, GI71994690, Length=451, Percent_Identity=21.5077605321508, Blast_Score=81, Evalue=2e-15
Organism=Caenorhabditis elegans, GI71994703, Length=451, Percent_Identity=21.5077605321508, Blast_Score=81, Evalue=2e-15
Organism=Caenorhabditis elegans, GI71994694, Length=451, Percent_Identity=21.5077605321508, Blast_Score=80, Evalue=3e-15
Organism=Caenorhabditis elegans, GI17531443, Length=369, Percent_Identity=23.8482384823848, Blast_Score=75, Evalue=1e-13
Organism=Caenorhabditis elegans, GI71996755, Length=464, Percent_Identity=23.4913793103448, Blast_Score=73, Evalue=5e-13
Organism=Saccharomyces cerevisiae, GI6319699, Length=367, Percent_Identity=23.1607629427793, Blast_Score=100, Evalue=1e-21
Organism=Drosophila melanogaster, GI18859661, Length=528, Percent_Identity=27.6515151515151, Blast_Score=126, Evalue=5e-29
Organism=Drosophila melanogaster, GI21355181, Length=364, Percent_Identity=27.7472527472527, Blast_Score=114, Evalue=3e-25
Organism=Drosophila melanogaster, GI24581924, Length=359, Percent_Identity=27.8551532033426, Blast_Score=92, Evalue=1e-18
Organism=Drosophila melanogaster, GI161076582, Length=556, Percent_Identity=23.5611510791367, Blast_Score=75, Evalue=1e-13
Organism=Drosophila melanogaster, GI24653035, Length=347, Percent_Identity=24.7838616714697, Blast_Score=74, Evalue=3e-13
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR002123 - InterPro: IPR020845 - InterPro: IPR000873 [H]
Pfam domain/function: PF01553 Acyltransferase; PF00501 AMP-binding [H]
EC number: 2.3.1.40; 6.2.1.20 [H]
Molecular weight: Translated: 78243; Mature: 78112
Theoretical pI: Translated: 7.53; Mature: 7.53
Prosite motif: PS00455 AMP_BINDING
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.4 %Cys (Translated Protein)
2.4 %Met (Translated Protein)
3.8 %Cys+Met (Translated Protein)
1.4 %Cys (Mature Protein)
2.2 %Met (Mature Protein)
3.6 %Cys+Met (Mature Protein)
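Percentages of this kind can be obtained by counting residues and dividing by the sequence length. The sketch below shows that calculation on a short made-up peptide, not on the protein in this record. The function name `residue_percent` and the toy sequence are hypothetical, and rounding to one decimal place is assumed from the values listed here.

```python
def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    seq = seq.upper()
    count = sum(seq.count(r) for r in residues)
    return round(100 * count / len(seq), 1)


# Hypothetical 10-residue peptide, used only to illustrate the arithmetic.
toy_translated = "MACKMLCGGA"
# Assumption: the mature form is the translated form without the initiator Met.
toy_mature = toy_translated[1:]

for label, seq in (("Translated", toy_translated), ("Mature", toy_mature)):
    print(label,
          residue_percent(seq, "C"),   # %Cys
          residue_percent(seq, "M"),   # %Met
          residue_percent(seq, "CM"))  # %Cys+Met
```

Removing the initiator methionine lowers the Met percentage while leaving the Cys count unchanged, the same pattern seen between the translated and mature values in this record.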
Secondary structure:
>Translated Secondary Structure MAIFGSSNLSDSGDLLIFNKVSYKTLLMFEKELGRDRMTYLVEEGLPPDTATASHLESTK CEEECCCCCCCCCCEEEEECCCCCHHHHHHHHHCCHHHHHHHHCCCCCCCCCHHHHHHCC ADGILFQAGGSDPVALRSAIMERINEGRRVVFLPGPVSHVKGSISQIPPRVIKALEALHI CCCEEEECCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHCCCHHHCCHHHHHHHHHHCC SPVPVYAGFYTNSVLDAEADTDAQADIQIHILPKLAPGAEMAARLTSAWLECSAQAYATL CCCCEEECCCCCCEECCCCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCHHHHHHH PQLHGSLSALLFRSLKLHSDCRVIDGIDDTTLTYGQLLAISVAFAKRLKKITSNRRVGII HHHHHHHHHHHHHHHHHCCCCEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCEEEEE LPPGKGAAIANLGCLFAGKTPVNFNYSASEGAFASSVKQSGVDWFITADTFMRKLQNFPW ECCCCCCCEEECEEEEECCCCCCCCCCCCCCHHHHHHHHCCCCEEEEHHHHHHHHHCCCC PPQRDLILMEREIPLLKGSAKRWGLAIKFLTAGFMIKKLGLDAPTGADEAVLMFTSGSSG CCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCC EPKGVPLTHHNLLSNISQCSSRITLEPQNRFLGSLPVFHCFGITIGLWYPMIGGYDMVTY CCCCCCEEHHHHHHHHHHHHHEEEECCHHHHHCCCHHHHHHHHHHHHHHHHCCCEEEECC PSPLEAKRLGALIKQYGISLVVTTPTFLRGFMKRCEPDTFKTVRYLIVGAEKLPEDLSIA CCCCCHHHHHHHHHHHCCEEEEECHHHHHHHHHHCCCCHHHHHHHHEECHHHCCHHHHHH FREKFGIIPCEGYGLTEASPVCSVNFIDPAPSNAAGDFIPGMKKSSVGALLPGIAIRITS HHHHCCCEECCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCHHHCCCEEEEEEC PHTGRVVPITTSGMIWLKGPNIFPGYLGGPETDRDIFVDGWLKTGDIGSADEFGFLKIEG CCCCCEEEEECCCEEEEECCCCCCCCCCCCCCCCEEEEECCEECCCCCCCCCCEEEEEEC RISRFSKIGGEMVPHEALEAAIMNIWNLDPADEERRIAVVTIPDPVKGEAVALLTTLVTD HHHHHHHCCCCCCCHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCCHHHHHHHHHHH YVHQARTLIRHGLIDQGLPALWCPKEIIPVERIPVLPSGKLDIKQCRMLAYEALNIPFEP HHHHHHHHHHHCHHHCCCCCCCCCHHCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCC >Mature Secondary Structure AIFGSSNLSDSGDLLIFNKVSYKTLLMFEKELGRDRMTYLVEEGLPPDTATASHLESTK EEECCCCCCCCCCEEEEECCCCCHHHHHHHHHCCHHHHHHHHCCCCCCCCCHHHHHHCC ADGILFQAGGSDPVALRSAIMERINEGRRVVFLPGPVSHVKGSISQIPPRVIKALEALHI CCCEEEECCCCCHHHHHHHHHHHHCCCCEEEEECCCHHHHCCCHHHCCHHHHHHHHHHCC SPVPVYAGFYTNSVLDAEADTDAQADIQIHILPKLAPGAEMAARLTSAWLECSAQAYATL CCCCEEECCCCCCEECCCCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCHHHHHHH PQLHGSLSALLFRSLKLHSDCRVIDGIDDTTLTYGQLLAISVAFAKRLKKITSNRRVGII 
HHHHHHHHHHHHHHHHHCCCCEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCEEEEE LPPGKGAAIANLGCLFAGKTPVNFNYSASEGAFASSVKQSGVDWFITADTFMRKLQNFPW ECCCCCCCEEECEEEEECCCCCCCCCCCCCCHHHHHHHHCCCCEEEEHHHHHHHHHCCCC PPQRDLILMEREIPLLKGSAKRWGLAIKFLTAGFMIKKLGLDAPTGADEAVLMFTSGSSG CCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCC EPKGVPLTHHNLLSNISQCSSRITLEPQNRFLGSLPVFHCFGITIGLWYPMIGGYDMVTY CCCCCCEEHHHHHHHHHHHHHEEEECCHHHHHCCCHHHHHHHHHHHHHHHHCCCEEEECC PSPLEAKRLGALIKQYGISLVVTTPTFLRGFMKRCEPDTFKTVRYLIVGAEKLPEDLSIA CCCCCHHHHHHHHHHHCCEEEEECHHHHHHHHHHCCCCHHHHHHHHEECHHHCCHHHHHH FREKFGIIPCEGYGLTEASPVCSVNFIDPAPSNAAGDFIPGMKKSSVGALLPGIAIRITS HHHHCCCEECCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCHHHCCCEEEEEEC PHTGRVVPITTSGMIWLKGPNIFPGYLGGPETDRDIFVDGWLKTGDIGSADEFGFLKIEG CCCCCEEEEECCCEEEEECCCCCCCCCCCCCCCCEEEEECCEECCCCCCCCCCEEEEEEC RISRFSKIGGEMVPHEALEAAIMNIWNLDPADEERRIAVVTIPDPVKGEAVALLTTLVTD HHHHHHHCCCCCCCHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCCHHHHHHHHHHH YVHQARTLIRHGLIDQGLPALWCPKEIIPVERIPVLPSGKLDIKQCRMLAYEALNIPFEP HHHHHHHHHHHCHHHCCCCCCCCCHHCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: NA