Definition Akkermansia muciniphila ATCC BAA-835, complete genome.
Accession NC_010655
Length 2,664,102

The map label for this gene is sdcS [H]

Identifier: 187735361

GI number: 187735361

Start: 1024464

End: 1026203

Strand: Reverse

Name: sdcS [H]

Synonym: Amuc_0859

Alternate gene names: 187735361

Gene position: 1026203-1024464 (Counterclockwise)

Preceding gene: 187735362

Following gene: 187735360

Centisome position: 38.52
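
The gene length and centisome position can be cross-checked from the coordinates above. This is a minimal sketch, assuming the centisome value is the gene's 5' coordinate (1026203 on the reverse strand) expressed as a percentage of the genome length; the function names are illustrative and not part of the database.

```python
GENOME_LENGTH = 2_664_102  # "Length" field of this record


def gene_length(start: int, end: int) -> int:
    """Number of bases spanned by inclusive, 1-based coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int = GENOME_LENGTH) -> float:
    """Position as a percentage of the genome length, to two decimals."""
    return round(100.0 * position / genome_length, 2)


length = gene_length(1024464, 1026203)
centi = centisome(1026203)  # assumed: 5' end of the reverse-strand gene
print(length, centi)
```

Under that assumption the results agree with the 1740-base gene sequence and the centisome position listed in this record.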

GC content: 56.38
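
GC content is conventionally the percentage of G and C bases in the sequence. The helper below is a sketch of that calculation; it assumes the value is computed over the gene sequence only, and the name gc_percent is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return round(100.0 * gc / len(seq), 2)


# Usage: pass the gene sequence from this record, with or without line breaks.
print(gc_percent("ATGAGTGAAACC"))
```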

Gene sequence:

>1740_bases
ATGAGTGAAACCCAAAACAAGCTGCTCAAGGCAGCTATCAGCCTGGCCGCCGCAGGAGTCGTCTTCTTCCTGCCTTTCGC
TTCATGGGGAATCCAATTAAGCCCTATTGAAATCCGCGTCATCGCCATGTTTGTCATGGCGGCGCTCTTCTGGATTTTGG
AACCCATCCCTATCTGGACCACCTCCGTGATGGTCATTACCCTGTCCCTGCTGTGCGTCTCCAACGGCTCTCTCTCATTC
TTGATGCCGGAGCGCTATGACAAGGCGGCGGTATCCTCCATTCTGGATGACGCCATAGGCAAAGGGATCAATCCGGAGAT
AGTCGGCAAATTGAAGGAAAATGTCGAAAACCGGTTAAATAAGAAAACCAAGCTGGACGCGGAAGAAGTGAGGATGACCC
TGGGGTTCCAGCTCATGGACGCTTATGAAAAAATAGACTTGAACGCACAGGAACTGTCCCGTGAAGGAAAAACGGAAGAA
GCCGCCGGGCAGGAATCCATAGCCGCCCAGCTCAAGACCGCGGCCGGGCGCCTGTACAGCAAGGAAATAACCGCGCGCAT
CCAGGGGCTGCAATTTGTCAACACCATGCAGCAAAAATCCACGATGGCCACCTTCGCGGATCCCATCATCATGCTTTTCC
TGGGCGGCTTTTTCCTAGCTGCGGCCGCAACCAAATACAGGCTGGACATGAACCTGGCCAAAGTACTTTTAAAACCCTTC
GGCACCAATCCCAAATTCGTACTTCTGGGGCTGATGTCCGTAACGGCTCTTTTCTCCATGTTTATGAGCAATACGGCTAC
GGCCGCCATGATGCTCGCCATCCTGACGCCGGTGCTGGCCCTGTTCACTCCGGAAGACAAAGGCCGCGCCGCCTTTGCCC
TGGCCATCCCCATCGCCGCCAACCTCGGCGGCATTGGAACGCCTATCGGCACGCCCCCCAACGCAATCGCGCTGAAGGCG
CTGCAGGGTATGGGGCTGGACGTCTCCTTCGGCAAATGGCTCATGTTCGGCATTCCTTTCGTCATCGTCATGATTCTGAT
TGCGTGGCTTCTCCTGCTGTGGCTGTTCCCCATCTCTCAAAAGAAACTGGAACTTCAGGTGGGAGGGAAATTCCTGAGAA
CCCCCAAGGCCATTATCGTTTACGTCACCTTTGCCGTTACCGTCCTGCTGTGGGTAACGGGCAAGGGAGTACACGGGCTG
GACTCCAATACCATCGCCATGATTCCCATCGCCGTCTTTGCCATTACGGAAACCATCACCAAGGAAGACTTGAAGAAAAT
GGGCTGGGACGTGCTCTGGCTGGTAGCCGGCGGCTTTGCCCTGGGGCTGGCCCTGCAAGACACCGGACTGGCAAAAAACC
TGATCGGCTCCATTCCCTTTGCCCAATGGTCCCCCTTCCCGCTCATGGTGGGTACGGGAATCATCTGCCTGTTCATGGCC
ACCTTCATGAGCCATACGGCTACGGCCTCCCTGCTCATCCCGATTATCGCCGTTGTCGGCGTCAACATGGGTGACAACCT
GGCCCCCCTGGGCGGCGTTACCGCCCTGCTGGTGTCCGTGGCCTTCGCCTCCTCCCTGGGCATGAGCCTTCCCATCAGCA
CCCCCCCCAACGCCCTGGCCTATGCCACCGGCCTGGTCAGCTCCAAGGGAATGGCCATTTCCGGCGTCATCCTGGGCATC
CTGGGCATGATTTTAACCTACGTCATGATGATGATTCTTGCCCAATGCCATGCTTTTTAA

Upstream 100 bases:

>100_bases
GCAGCCGTCCCCTCTGACATAATTTTTTTAAATAAAAGTTTCTATTTGTAAAATCGGGAAGTACCCTTCCGCCCGCCAAC
CATATTACCAGCTATTATTC

Downstream 100 bases:

>100_bases
GTAAGGTAATGTTCCATGCCGGGACATGTTGCCCCGGCATGGCCAACATCAACGGGAAAAGCGCCCTATGATCATTGAAC
AGGAAGAATTCCAGAAAAAA

Product: anion transporter

Products: NA

Alternate protein names: Na(+)/dicarboxylate symporter [H]

Number of amino acids: Translated: 579; Mature: 578
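
The residue counts follow from the gene length: 1740 bases form 580 codons, the last of which (TAA) is a stop codon. The sketch below reproduces that arithmetic; it assumes the mature form differs only by removal of the N-terminal methionine, as the mature sequence in this record suggests. The function name is illustrative.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def residue_counts(cds_length: int, last_codon: str) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence."""
    if cds_length % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = cds_length // 3
    # A terminal stop codon does not contribute a residue.
    translated = codons - 1 if last_codon.upper() in STOP_CODONS else codons
    # Assumption: the mature protein lacks only the initiator methionine.
    mature = translated - 1
    return translated, mature


translated, mature = residue_counts(1740, "TAA")
print(translated, mature)
```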

Protein sequence:

>579_residues
MSETQNKLLKAAISLAAAGVVFFLPFASWGIQLSPIEIRVIAMFVMAALFWILEPIPIWTTSVMVITLSLLCVSNGSLSF
LMPERYDKAAVSSILDDAIGKGINPEIVGKLKENVENRLNKKTKLDAEEVRMTLGFQLMDAYEKIDLNAQELSREGKTEE
AAGQESIAAQLKTAAGRLYSKEITARIQGLQFVNTMQQKSTMATFADPIIMLFLGGFFLAAAATKYRLDMNLAKVLLKPF
GTNPKFVLLGLMSVTALFSMFMSNTATAAMMLAILTPVLALFTPEDKGRAAFALAIPIAANLGGIGTPIGTPPNAIALKA
LQGMGLDVSFGKWLMFGIPFVIVMILIAWLLLLWLFPISQKKLELQVGGKFLRTPKAIIVYVTFAVTVLLWVTGKGVHGL
DSNTIAMIPIAVFAITETITKEDLKKMGWDVLWLVAGGFALGLALQDTGLAKNLIGSIPFAQWSPFPLMVGTGIICLFMA
TFMSHTATASLLIPIIAVVGVNMGDNLAPLGGVTALLVSVAFASSLGMSLPISTPPNALAYATGLVSSKGMAISGVILGI
LGMILTYVMMMILAQCHAF

Sequences:

>Translated_579_residues
MSETQNKLLKAAISLAAAGVVFFLPFASWGIQLSPIEIRVIAMFVMAALFWILEPIPIWTTSVMVITLSLLCVSNGSLSF
LMPERYDKAAVSSILDDAIGKGINPEIVGKLKENVENRLNKKTKLDAEEVRMTLGFQLMDAYEKIDLNAQELSREGKTEE
AAGQESIAAQLKTAAGRLYSKEITARIQGLQFVNTMQQKSTMATFADPIIMLFLGGFFLAAAATKYRLDMNLAKVLLKPF
GTNPKFVLLGLMSVTALFSMFMSNTATAAMMLAILTPVLALFTPEDKGRAAFALAIPIAANLGGIGTPIGTPPNAIALKA
LQGMGLDVSFGKWLMFGIPFVIVMILIAWLLLLWLFPISQKKLELQVGGKFLRTPKAIIVYVTFAVTVLLWVTGKGVHGL
DSNTIAMIPIAVFAITETITKEDLKKMGWDVLWLVAGGFALGLALQDTGLAKNLIGSIPFAQWSPFPLMVGTGIICLFMA
TFMSHTATASLLIPIIAVVGVNMGDNLAPLGGVTALLVSVAFASSLGMSLPISTPPNALAYATGLVSSKGMAISGVILGI
LGMILTYVMMMILAQCHAF
>Mature_578_residues
SETQNKLLKAAISLAAAGVVFFLPFASWGIQLSPIEIRVIAMFVMAALFWILEPIPIWTTSVMVITLSLLCVSNGSLSFL
MPERYDKAAVSSILDDAIGKGINPEIVGKLKENVENRLNKKTKLDAEEVRMTLGFQLMDAYEKIDLNAQELSREGKTEEA
AGQESIAAQLKTAAGRLYSKEITARIQGLQFVNTMQQKSTMATFADPIIMLFLGGFFLAAAATKYRLDMNLAKVLLKPFG
TNPKFVLLGLMSVTALFSMFMSNTATAAMMLAILTPVLALFTPEDKGRAAFALAIPIAANLGGIGTPIGTPPNAIALKAL
QGMGLDVSFGKWLMFGIPFVIVMILIAWLLLLWLFPISQKKLELQVGGKFLRTPKAIIVYVTFAVTVLLWVTGKGVHGLD
SNTIAMIPIAVFAITETITKEDLKKMGWDVLWLVAGGFALGLALQDTGLAKNLIGSIPFAQWSPFPLMVGTGIICLFMAT
FMSHTATASLLIPIIAVVGVNMGDNLAPLGGVTALLVSVAFASSLGMSLPISTPPNALAYATGLVSSKGMAISGVILGIL
GMILTYVMMMILAQCHAF

Specific function: Mediates the transport of dicarboxylates across the cytoplasmic membrane via a Na(+)-electrochemical gradient [H]

COG id: COG0471

COG function: function code P; Di- and tricarboxylate transporters

Gene ontology:

Cell location: Cell membrane; Multi-pass membrane protein (Potential) [H]

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the SLC13A transporter (TC 2.A.47) family. NADC subfamily [H]

Homologues:

Organism=Homo sapiens, GI301069349, Length=321, Percent_Identity=25.8566978193146, Blast_Score=86, Evalue=7e-17,
Organism=Homo sapiens, GI225637549, Length=475, Percent_Identity=26.5263157894737, Blast_Score=86, Evalue=9e-17,
Organism=Homo sapiens, GI225637552, Length=475, Percent_Identity=26.5263157894737, Blast_Score=86, Evalue=1e-16,
Organism=Homo sapiens, GI4506979, Length=475, Percent_Identity=26.5263157894737, Blast_Score=85, Evalue=2e-16,
Organism=Homo sapiens, GI301069353, Length=156, Percent_Identity=26.9230769230769, Blast_Score=67, Evalue=5e-11,
Organism=Homo sapiens, GI58761541, Length=156, Percent_Identity=26.9230769230769, Blast_Score=67, Evalue=6e-11,
Organism=Homo sapiens, GI31377715, Length=156, Percent_Identity=26.9230769230769, Blast_Score=67, Evalue=6e-11,
Organism=Homo sapiens, GI301069346, Length=156, Percent_Identity=26.9230769230769, Blast_Score=66, Evalue=7e-11,
Organism=Caenorhabditis elegans, GI71988385, Length=438, Percent_Identity=29.4520547945205, Blast_Score=138, Evalue=9e-33,
Organism=Caenorhabditis elegans, GI32565804, Length=439, Percent_Identity=28.4738041002278, Blast_Score=137, Evalue=2e-32,
Organism=Caenorhabditis elegans, GI71989118, Length=472, Percent_Identity=24.5762711864407, Blast_Score=107, Evalue=2e-23,
Organism=Saccharomyces cerevisiae, GI6324340, Length=348, Percent_Identity=32.183908045977, Blast_Score=152, Evalue=1e-37,
Organism=Saccharomyces cerevisiae, GI6322263, Length=348, Percent_Identity=32.7586206896552, Blast_Score=144, Evalue=3e-35,
Organism=Saccharomyces cerevisiae, GI6319885, Length=344, Percent_Identity=32.8488372093023, Blast_Score=130, Evalue=7e-31,
Organism=Drosophila melanogaster, GI281366409, Length=462, Percent_Identity=24.4588744588745, Blast_Score=89, Evalue=6e-18,
Organism=Drosophila melanogaster, GI17737623, Length=462, Percent_Identity=24.4588744588745, Blast_Score=89, Evalue=6e-18,
Organism=Drosophila melanogaster, GI24666468, Length=462, Percent_Identity=24.4588744588745, Blast_Score=89, Evalue=6e-18,
Organism=Drosophila melanogaster, GI45551633, Length=462, Percent_Identity=24.2424242424242, Blast_Score=89, Evalue=9e-18,
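
Each homologue line is a comma-separated list of key=value fields plus a bare GI number. The following sketch turns one such line into a dictionary with numeric fields converted; the function name and the "GI" key are assumptions made for illustration.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare GI number without a key
    # Convert numeric fields; key names follow the record's own labels.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


example = ("Organism=Saccharomyces cerevisiae, GI6324340, Length=348, "
           "Percent_Identity=32.183908045977, Blast_Score=152, Evalue=1e-37,")
hit = parse_homologue(example)
print(hit["Organism"], hit["Blast_Score"], hit["Evalue"])
```

Parsed this way, the hits can be sorted by Evalue or Blast_Score to rank the closest homologues.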

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001898 [H]

Pfam domain/function: PF00939 Na_sulph_symp [H]

EC number: NA

Molecular weight: Translated: 61969; Mature: 61838
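
The translated and mature weights differ by 131 Da, which matches the mass of one methionine residue. The sketch below estimates a molecular weight from standard average residue masses plus one water molecule; the exact method used by the database is not stated, so treat this as an approximation under that assumption.

```python
# Standard average residue masses in daltons (amino acid minus water).
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # added once for the free N- and C-termini


def molecular_weight(protein: str) -> float:
    """Approximate average molecular weight of an unmodified peptide."""
    seq = "".join(protein.split()).upper()
    return sum(AVERAGE_RESIDUE_MASS[aa] for aa in seq) + WATER


delta = 61969 - 61838  # translated minus mature, from this record
print(delta, round(AVERAGE_RESIDUE_MASS["M"]))
```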

Theoretical pI: Translated: 9.42; Mature: 9.42

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.5 %Cys     (Translated Protein)
5.4 %Met     (Translated Protein)
5.9 %Cys+Met (Translated Protein)
0.5 %Cys     (Mature Protein)
5.2 %Met     (Mature Protein)
5.7 %Cys+Met (Mature Protein)
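
These percentages are residue counts divided by the protein length. A minimal sketch of that calculation follows, assuming one-decimal rounding as shown above; the function name is hypothetical.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given residues."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    count = sum(1 for aa in seq if aa in residues.upper())
    return round(100.0 * count / len(seq), 1)


# Usage: pass the translated or mature sequence from this record.
sample = "MSETQNKLLK"
print(residue_percent(sample, "C"), residue_percent(sample, "M"),
      residue_percent(sample, "CM"))
```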

Secondary structure:

>Translated Secondary Structure
MSETQNKLLKAAISLAAAGVVFFLPFASWGIQLSPIEIRVIAMFVMAALFWILEPIPIWT
CCHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEECCHHHHHHHHHHHHHHHHHHCCCCHHH
TSVMVITLSLLCVSNGSLSFLMPERYDKAAVSSILDDAIGKGINPEIVGKLKENVENRLN
HHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHC
KKTKLDAEEVRMTLGFQLMDAYEKIDLNAQELSREGKTEEAAGQESIAAQLKTAAGRLYS
HHHCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHCCCCCHHHCCHHHHHHHHHHHHHHHHH
KEITARIQGLQFVNTMQQKSTMATFADPIIMLFLGGFFLAAAATKYRLDMNLAKVLLKPF
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC
GTNPKFVLLGLMSVTALFSMFMSNTATAAMMLAILTPVLALFTPEDKGRAAFALAIPIAA
CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEEEHHH
NLGGIGTPIGTPPNAIALKALQGMGLDVSFGKWLMFGIPFVIVMILIAWLLLLWLFPISQ
CCCCCCCCCCCCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
KKLELQVGGKFLRTPKAIIVYVTFAVTVLLWVTGKGVHGLDSNTIAMIPIAVFAITETIT
CEEEHEECCHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHH
KEDLKKMGWDVLWLVAGGFALGLALQDTGLAKNLIGSIPFAQWSPFPLMVGTGIICLFMA
HHHHHHCCCHHHHHHHHHHHHHHEECCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHH
TFMSHTATASLLIPIIAVVGVNMGDNLAPLGGVTALLVSVAFASSLGMSLPISTPPNALA
HHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHH
YATGLVSSKGMAISGVILGILGMILTYVMMMILAQCHAF
HHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCC
>Mature Secondary Structure 
SETQNKLLKAAISLAAAGVVFFLPFASWGIQLSPIEIRVIAMFVMAALFWILEPIPIWT
CHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEECCHHHHHHHHHHHHHHHHHHCCCCHHH
TSVMVITLSLLCVSNGSLSFLMPERYDKAAVSSILDDAIGKGINPEIVGKLKENVENRLN
HHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHC
KKTKLDAEEVRMTLGFQLMDAYEKIDLNAQELSREGKTEEAAGQESIAAQLKTAAGRLYS
HHHCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHCCCCCHHHCCHHHHHHHHHHHHHHHHH
KEITARIQGLQFVNTMQQKSTMATFADPIIMLFLGGFFLAAAATKYRLDMNLAKVLLKPF
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC
GTNPKFVLLGLMSVTALFSMFMSNTATAAMMLAILTPVLALFTPEDKGRAAFALAIPIAA
CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEEEHHH
NLGGIGTPIGTPPNAIALKALQGMGLDVSFGKWLMFGIPFVIVMILIAWLLLLWLFPISQ
CCCCCCCCCCCCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
KKLELQVGGKFLRTPKAIIVYVTFAVTVLLWVTGKGVHGLDSNTIAMIPIAVFAITETIT
CEEEHEECCHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHH
KEDLKKMGWDVLWLVAGGFALGLALQDTGLAKNLIGSIPFAQWSPFPLMVGTGIICLFMA
HHHHHHCCCHHHHHHHHHHHHHHEECCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHH
TFMSHTATASLLIPIIAVVGVNMGDNLAPLGGVTALLVSVAFASSLGMSLPISTPPNALA
HHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHH
YATGLVSSKGMAISGVILGILGMILTYVMMMILAQCHAF
HHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha
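
The structure class presumably reflects the predominance of helix (H) over strand (E) and coil (C) states in the predicted secondary structure; how the database assigns the class is not stated. The sketch below simply summarizes the composition of a prediction string, with an illustrative function name.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of H (helix), E (strand) and C (coil) states."""
    s = "".join(states.split()).upper()
    if not s:
        raise ValueError("empty prediction string")
    counts = Counter(s)
    return {state: round(100.0 * counts.get(state, 0) / len(s), 1)
            for state in "HEC"}


# Usage: concatenate the H/E/C lines of the secondary structure block.
print(ss_composition("HHHHEECCCC"))
```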

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: NA