Definition Akkermansia muciniphila ATCC BAA-835, complete genome.
Accession NC_010655
Length 2,664,102

The map label for this gene is yvdB [H]

Identifier: 187735342

GI number: 187735342

Start: 998943

End: 1000622

Strand: Reverse

Name: yvdB [H]

Synonym: Amuc_0840

Alternate gene names: 187735342

Gene position: 1000622-998943 (Counterclockwise)

Preceding gene: 187735343

Following gene: 187735336

Centisome position: 37.56
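
Note on coordinates: the sketch below shows how the gene length and the centisome position can be derived from the Start, End, Strand and genome Length fields of this record. The formula (5' end of the gene divided by genome length, times 100) is an assumption that happens to reproduce the listed value; the function names are illustrative and not part of the record.

```python
# Values copied from this record.
GENOME_LENGTH = 2_664_102
START, END = 998_943, 1_000_622  # 1-based, inclusive coordinates
STRAND = "Reverse"


def gene_length(start: int, end: int) -> int:
    """Length of a feature given inclusive 1-based coordinates."""
    return end - start + 1


def centisome(start: int, end: int, strand: str, genome_length: int) -> float:
    """Position of the gene's 5' end as a percentage of the genome length.

    Assumption: for a reverse-strand gene the 5' end is the larger coordinate.
    """
    five_prime = end if strand == "Reverse" else start
    return 100.0 * five_prime / genome_length


length = gene_length(START, END)
position = centisome(START, END, STRAND, GENOME_LENGTH)
print(length, round(position, 2))  # prints: 1680 37.56
```

Using the smaller coordinate (998943) instead would give about 37.50, so the listed 37.56 appears to refer to the start of the reverse-strand gene.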

GC content: 58.39

Gene sequence:

>1680_bases
ATGTTCAAACCTGCTCTTCTTTCCTCTCTCAAAACTTATACCAAACAAACCTTTCTGGCGGACCTTTTTGCGGGACTGAC
CGTCGGTGTAGTAGCCATTCCGCTGGCCATGGCCTTTGCCATCGCATGCGGACTCTCCCCAACCCAGGGCCTCATCACCG
CCATTGTGGCCGGGTTCCTCATCTCCCTGTTCAGCGGAAGCAAATATCAAATAGGCGGCCCCACCGGAGCCTTCGTGATC
ATTATCATGGGCGTCCTGGAGCAATACCACGCATCCGGTCTGCTGGTCTGCACATTGATGGCGGGCCTCTTCCTCATCAT
CTTTGGGTTCTGCCGCATGGGGGCGCTCATCCGCTTTATTCCATTCCCTGTCACCACAGGGTTCACCTCCGGCATCGCCG
TGGTAATCTTTTCCACGCAAATTAAAGACATCTTCGGCCTCACCATCACGGAAAAAATTCCCGGAGAGTTCATTGAAAAA
TGGGCGTGTTACTTCCATTACTTCCACACCATCAACTGGGCGGCGCTGGGGCTGGCCGCCGGCACCGTAATCATTACCCT
GCTGAGCCGCCGCTTCTGGCCCAGAATACCGGCCATGCTAGTGGGCATGCTGGGCATGACGGCCGTTTCCGTGGCGTTTT
CGTTGCCTGTGACAACCATCGGGCAAGCCTTCGGCAGCCTCCCGAATACACTCCCCCTGCCCTCCCTGCCCAGCATTGAC
TGGAGTACCCTGGGGGCGCTGACGGCCCCTGCTTTCACCATCGCGCTGCTGGCGGCGATCGAATCCCTGTTAAGCGCCTC
CGTGGCGGACGGCATGACCGGAGGGCGCCACAAGCCCAACATGGAGCTGATTGCACAAGGCATCGGCAACATCGGCTCCG
CCCTGTTTGGCGGCATTCCGGCCACCGGAGCCATTGCCCGCACCGCCACTAACATCAAGGCTGGAGCTAAAAGCCCGGTT
TCCGGCATGATTCACGCCCTGACCCTGCTAGCCATTCTGATGGCCTTTGCCCACTATGCCCAGCAGATTCCCCTGGCTGT
CCTGGCGGGCATTCTGACGGTAGTGTGCTACAACATGAGTGAAATACACACGTTCAGCCGTCTGCTGAAAGGGCCCAGGC
AGGATGCGGCGGTGCTGGTAATCACCTTCCTGCTGACCGTGTTTGTGGACCTCGTTGTAGCCGTGGAAGTAGGCGTGGTG
CTGGCCGCCCTGCTCTTCATGGGCCGCATGGCCCAAATCAGCGATGTTTCCGCCATCAAAAACGAACTGCTGGAAAATGA
TGAGGAAGATGATGGAAACCGCTCTGCCGCCAAGCTGGACATCCCGGAAGGTGTGGAAGTTTTCGACGTGAAAGGTCCCT
TCTTCTTCGGTGCCGTGGAGCAATTCAAGGACCAGGTGCTGGAAACGCTGGAACATGATACCAAGGTGGTTATCCTGCGC
ATGCGCCTGGTTCCCGCGCTGGACGCCACCGGCCTGAACGTCCTTTCCGACTTCTGCCACCAGTGCCGGGAACACGGTTC
CACCCTGCTGGTTTGCGGCGTGCAGCCCCAGCCTCTGGACGTCATCCGCCACGCGCCCTTTTACCGGGAGCTGAAACGCT
ACAATATCTGCGAGAATATTGACGCCGCCCTGAACCGGGCCTGCAAAATCATCAACGGCCCTGCGCCCAAACACCTGTAA
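
Note on GC content: the GC content field is presumably the percentage of G and C bases in the gene sequence above. A minimal sketch of that calculation follows; gc_percent is a hypothetical helper, and the result for the full 1680-base sequence should be checked against the listed 58.39 rather than taken for granted.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks are ignored, so a multi-line FASTA body
    can be passed in directly.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = cleaned.count("G") + cleaned.count("C")
    return 100.0 * gc / len(cleaned)


# Toy inputs only; paste the gene sequence from the record to check 58.39.
print(gc_percent("ATGC"))
print(gc_percent("gg\ncc"))
```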

Upstream 100 bases:

>100_bases
TGCCGTCCGTCTCCGCGGGGCGCCTCCGGGTGCAGGAAGGGGACGCTTCAACCCCGCCGGCGGCGGCCTCCACCGCAGCC
GGCCAACCATCCCGGCAGCC

Downstream 100 bases:

>100_bases
CGGCGGCCGTCCCCCCGTGCGGTCAGGGAGCAAGACCTCACTCTGCCTTATTCACGGCCTTCCGGTACGGGTGAGGCTTC
TCCTGCTGCAAACAGCCAGA

Product: sulfate transporter

Products: Proton [Cytoplasm]; SO4(2-) [Cytoplasm] [C]

Alternate protein names: NA

Number of amino acids: Translated: 559; Mature: 559
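
Note on protein length: 1680 bases correspond to 560 codons, and the last codon (TAA) is a stop, which leaves 559 residues. The sketch below checks this on the first 15 and last 12 bases of the gene sequence. The codon table is deliberately partial (only the codons needed here), and translate is an illustrative helper, not a tool used to build this record.

```python
# Partial standard codon table: only the codons that occur in the two
# fragments below. "*" marks a stop codon.
CODONS = {
    "ATG": "M", "TTC": "F", "AAA": "K", "CCT": "P",
    "GCT": "A", "CAC": "H", "CTG": "L", "TAA": "*",
}

FIRST_15 = "ATGTTCAAACCTGCT"  # start of the gene sequence in this record
LAST_12 = "AAACACCTGTAA"      # end of the gene sequence in this record
GENE_LENGTH = 1680


def translate(dna: str) -> str:
    """Translate an in-frame DNA fragment, one letter per codon."""
    if len(dna) % 3 != 0:
        raise ValueError("fragment length must be a multiple of 3")
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, len(dna), 3))


n_terminus = translate(FIRST_15)
c_terminus = translate(LAST_12)
residues = GENE_LENGTH // 3 - 1  # subtract the stop codon

print(n_terminus, c_terminus, residues)  # prints: MFKPA KHL* 559
```

The translated ends agree with the protein sequence in this record, which begins MFKPA and ends KHL.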

Protein sequence:

>559_residues
MFKPALLSSLKTYTKQTFLADLFAGLTVGVVAIPLAMAFAIACGLSPTQGLITAIVAGFLISLFSGSKYQIGGPTGAFVI
IIMGVLEQYHASGLLVCTLMAGLFLIIFGFCRMGALIRFIPFPVTTGFTSGIAVVIFSTQIKDIFGLTITEKIPGEFIEK
WACYFHYFHTINWAALGLAAGTVIITLLSRRFWPRIPAMLVGMLGMTAVSVAFSLPVTTIGQAFGSLPNTLPLPSLPSID
WSTLGALTAPAFTIALLAAIESLLSASVADGMTGGRHKPNMELIAQGIGNIGSALFGGIPATGAIARTATNIKAGAKSPV
SGMIHALTLLAILMAFAHYAQQIPLAVLAGILTVVCYNMSEIHTFSRLLKGPRQDAAVLVITFLLTVFVDLVVAVEVGVV
LAALLFMGRMAQISDVSAIKNELLENDEEDDGNRSAAKLDIPEGVEVFDVKGPFFFGAVEQFKDQVLETLEHDTKVVILR
MRLVPALDATGLNVLSDFCHQCREHGSTLLVCGVQPQPLDVIRHAPFYRELKRYNICENIDAALNRACKIINGPAPKHL

Sequences:

>Translated_559_residues
MFKPALLSSLKTYTKQTFLADLFAGLTVGVVAIPLAMAFAIACGLSPTQGLITAIVAGFLISLFSGSKYQIGGPTGAFVI
IIMGVLEQYHASGLLVCTLMAGLFLIIFGFCRMGALIRFIPFPVTTGFTSGIAVVIFSTQIKDIFGLTITEKIPGEFIEK
WACYFHYFHTINWAALGLAAGTVIITLLSRRFWPRIPAMLVGMLGMTAVSVAFSLPVTTIGQAFGSLPNTLPLPSLPSID
WSTLGALTAPAFTIALLAAIESLLSASVADGMTGGRHKPNMELIAQGIGNIGSALFGGIPATGAIARTATNIKAGAKSPV
SGMIHALTLLAILMAFAHYAQQIPLAVLAGILTVVCYNMSEIHTFSRLLKGPRQDAAVLVITFLLTVFVDLVVAVEVGVV
LAALLFMGRMAQISDVSAIKNELLENDEEDDGNRSAAKLDIPEGVEVFDVKGPFFFGAVEQFKDQVLETLEHDTKVVILR
MRLVPALDATGLNVLSDFCHQCREHGSTLLVCGVQPQPLDVIRHAPFYRELKRYNICENIDAALNRACKIINGPAPKHL
>Mature_559_residues
MFKPALLSSLKTYTKQTFLADLFAGLTVGVVAIPLAMAFAIACGLSPTQGLITAIVAGFLISLFSGSKYQIGGPTGAFVI
IIMGVLEQYHASGLLVCTLMAGLFLIIFGFCRMGALIRFIPFPVTTGFTSGIAVVIFSTQIKDIFGLTITEKIPGEFIEK
WACYFHYFHTINWAALGLAAGTVIITLLSRRFWPRIPAMLVGMLGMTAVSVAFSLPVTTIGQAFGSLPNTLPLPSLPSID
WSTLGALTAPAFTIALLAAIESLLSASVADGMTGGRHKPNMELIAQGIGNIGSALFGGIPATGAIARTATNIKAGAKSPV
SGMIHALTLLAILMAFAHYAQQIPLAVLAGILTVVCYNMSEIHTFSRLLKGPRQDAAVLVITFLLTVFVDLVVAVEVGVV
LAALLFMGRMAQISDVSAIKNELLENDEEDDGNRSAAKLDIPEGVEVFDVKGPFFFGAVEQFKDQVLETLEHDTKVVILR
MRLVPALDATGLNVLSDFCHQCREHGSTLLVCGVQPQPLDVIRHAPFYRELKRYNICENIDAALNRACKIINGPAPKHL

Specific function: Possible sulfate transporter. [C]

COG id: COG0659

COG function: function code P; Sulfate permease and related transporters (MFS superfamily)

Gene ontology:

Cell location: Cell membrane; Multi-pass membrane protein [H]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 STAS domain [H]

Homologues:

Organism=Homo sapiens, GI262206105, Length=578, Percent_Identity=24.7404844290657, Blast_Score=112, Evalue=1e-24,
Organism=Homo sapiens, GI262206075, Length=578, Percent_Identity=24.7404844290657, Blast_Score=112, Evalue=1e-24,
Organism=Homo sapiens, GI262206069, Length=578, Percent_Identity=24.7404844290657, Blast_Score=112, Evalue=1e-24,
Organism=Homo sapiens, GI262206063, Length=578, Percent_Identity=24.7404844290657, Blast_Score=112, Evalue=1e-24,
Organism=Homo sapiens, GI4557535, Length=513, Percent_Identity=22.6120857699805, Blast_Score=107, Evalue=3e-23,
Organism=Homo sapiens, GI39752683, Length=487, Percent_Identity=22.7926078028747, Blast_Score=103, Evalue=3e-22,
Organism=Homo sapiens, GI94721259, Length=479, Percent_Identity=23.5908141962422, Blast_Score=103, Evalue=3e-22,
Organism=Homo sapiens, GI45827800, Length=487, Percent_Identity=22.7926078028747, Blast_Score=103, Evalue=3e-22,
Organism=Homo sapiens, GI94721257, Length=479, Percent_Identity=23.5908141962422, Blast_Score=103, Evalue=4e-22,
Organism=Homo sapiens, GI94721253, Length=479, Percent_Identity=23.5908141962422, Blast_Score=103, Evalue=5e-22,
Organism=Homo sapiens, GI94721255, Length=479, Percent_Identity=23.5908141962422, Blast_Score=103, Evalue=5e-22,
Organism=Homo sapiens, GI4505697, Length=505, Percent_Identity=23.1683168316832, Blast_Score=95, Evalue=2e-19,
Organism=Homo sapiens, GI45827802, Length=432, Percent_Identity=23.3796296296296, Blast_Score=93, Evalue=6e-19,
Organism=Homo sapiens, GI269784651, Length=483, Percent_Identity=20.9109730848861, Blast_Score=87, Evalue=3e-17,
Organism=Homo sapiens, GI20336282, Length=533, Percent_Identity=21.0131332082552, Blast_Score=87, Evalue=4e-17,
Organism=Homo sapiens, GI16418457, Length=444, Percent_Identity=23.4234234234234, Blast_Score=87, Evalue=4e-17,
Organism=Homo sapiens, GI301601599, Length=444, Percent_Identity=23.4234234234234, Blast_Score=87, Evalue=4e-17,
Organism=Homo sapiens, GI16306483, Length=533, Percent_Identity=21.0131332082552, Blast_Score=86, Evalue=6e-17,
Organism=Homo sapiens, GI16418413, Length=521, Percent_Identity=22.6487523992322, Blast_Score=86, Evalue=1e-16,
Organism=Homo sapiens, GI217272867, Length=521, Percent_Identity=22.6487523992322, Blast_Score=85, Evalue=2e-16,
Organism=Homo sapiens, GI100913030, Length=513, Percent_Identity=23.1968810916179, Blast_Score=84, Evalue=4e-16,
Organism=Homo sapiens, GI47131207, Length=524, Percent_Identity=22.7099236641221, Blast_Score=74, Evalue=4e-13,
Organism=Homo sapiens, GI20336272, Length=524, Percent_Identity=22.7099236641221, Blast_Score=74, Evalue=4e-13,
Organism=Escherichia coli, GI87081859, Length=545, Percent_Identity=40.7339449541284, Blast_Score=321, Evalue=9e-89,
Organism=Caenorhabditis elegans, GI17566848, Length=570, Percent_Identity=22.4561403508772, Blast_Score=122, Evalue=4e-28,
Organism=Caenorhabditis elegans, GI193203292, Length=506, Percent_Identity=24.3083003952569, Blast_Score=114, Evalue=1e-25,
Organism=Caenorhabditis elegans, GI17551690, Length=498, Percent_Identity=23.0923694779116, Blast_Score=112, Evalue=7e-25,
Organism=Caenorhabditis elegans, GI86565215, Length=545, Percent_Identity=24.4036697247706, Blast_Score=106, Evalue=4e-23,
Organism=Caenorhabditis elegans, GI86564196, Length=506, Percent_Identity=23.9130434782609, Blast_Score=98, Evalue=1e-20,
Organism=Saccharomyces cerevisiae, GI6319771, Length=452, Percent_Identity=25.2212389380531, Blast_Score=91, Evalue=5e-19,
Organism=Saccharomyces cerevisiae, GI6325260, Length=523, Percent_Identity=23.3269598470363, Blast_Score=82, Evalue=3e-16,
Organism=Drosophila melanogaster, GI85815873, Length=515, Percent_Identity=27.378640776699, Blast_Score=138, Evalue=1e-32,
Organism=Drosophila melanogaster, GI19922482, Length=529, Percent_Identity=27.0321361058601, Blast_Score=134, Evalue=2e-31,
Organism=Drosophila melanogaster, GI24651449, Length=566, Percent_Identity=23.6749116607774, Blast_Score=126, Evalue=4e-29,
Organism=Drosophila melanogaster, GI24663084, Length=445, Percent_Identity=25.8426966292135, Blast_Score=114, Evalue=2e-25,
Organism=Drosophila melanogaster, GI21357695, Length=445, Percent_Identity=25.8426966292135, Blast_Score=114, Evalue=2e-25,
Organism=Drosophila melanogaster, GI24647160, Length=430, Percent_Identity=25.3488372093023, Blast_Score=114, Evalue=2e-25,
Organism=Drosophila melanogaster, GI21355087, Length=430, Percent_Identity=25.3488372093023, Blast_Score=113, Evalue=3e-25,
Organism=Drosophila melanogaster, GI24649801, Length=578, Percent_Identity=23.1833910034602, Blast_Score=108, Evalue=7e-24,
Organism=Drosophila melanogaster, GI21358633, Length=548, Percent_Identity=22.992700729927, Blast_Score=105, Evalue=6e-23,
Organism=Drosophila melanogaster, GI21358229, Length=388, Percent_Identity=25.7731958762887, Blast_Score=91, Evalue=3e-18,
Organism=Drosophila melanogaster, GI24666186, Length=288, Percent_Identity=25.6944444444444, Blast_Score=87, Evalue=4e-17,
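
Note on the homologue lines: each line is a comma-separated list of key=value fields plus a bare GI identifier. The sketch below parses one line into a typed record. The Hit class and parse_hit function are hypothetical names, and the identities property assumes that Length is the alignment length, which the record does not state.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float

    @property
    def identities(self) -> int:
        """Identical positions, assuming Length is the alignment length."""
        return round(self.percent_identity * self.length / 100)


def parse_hit(line: str) -> Hit:
    """Parse one 'Organism=..., GI..., Length=..., ...' line."""
    fields = {}
    for part in line.strip().rstrip(",").split(","):
        part = part.strip()
        if "=" in part:
            key, value = part.split("=", 1)
            fields[key] = value
        elif part.startswith("GI"):
            fields["GI"] = part[2:]  # bare identifier without a key
    return Hit(
        organism=fields["Organism"],
        gi=fields["GI"],
        length=int(fields["Length"]),
        percent_identity=float(fields["Percent_Identity"]),
        blast_score=int(fields["Blast_Score"]),
        evalue=float(fields["Evalue"]),
    )


# The Escherichia coli line from the list above, the strongest hit by score.
line = ("Organism=Escherichia coli, GI87081859, Length=545, "
        "Percent_Identity=40.7339449541284, Blast_Score=321, Evalue=9e-89,")
hit = parse_hit(line)
print(hit.organism, hit.gi, round(hit.percent_identity, 1), hit.identities)
```

Parsing every line this way makes it straightforward to sort the hits by score or E-value.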

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR002645
- InterPro:   IPR001902
- InterPro:   IPR011547 [H]

Pfam domain/function: PF01740 STAS; PF00916 Sulfate_transp [H]

EC number: NA

Molecular weight: Translated: 59709; Mature: 59709

Theoretical pI: Translated: 7.30; Mature: 7.30

Prosite motif: PS50801 STAS

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.8 %Cys     (Translated Protein)
2.9 %Met     (Translated Protein)
4.7 %Cys+Met (Translated Protein)
1.8 %Cys     (Mature Protein)
2.9 %Met     (Mature Protein)
4.7 %Cys+Met (Mature Protein)
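
Note on Cys/Met content: the percentages above can be recomputed by counting C and M residues in the 559-residue protein sequence. The sketch below does so with the sequence copied from this record; residue_percent is an illustrative helper name.

```python
# Protein sequence copied from this record (559 residues).
PROTEIN = "".join([
    "MFKPALLSSLKTYTKQTFLADLFAGLTVGVVAIPLAMAFAIACGLSPTQGLITAIVAGFLISLFSGSKYQIGGPTGAFVI",
    "IIMGVLEQYHASGLLVCTLMAGLFLIIFGFCRMGALIRFIPFPVTTGFTSGIAVVIFSTQIKDIFGLTITEKIPGEFIEK",
    "WACYFHYFHTINWAALGLAAGTVIITLLSRRFWPRIPAMLVGMLGMTAVSVAFSLPVTTIGQAFGSLPNTLPLPSLPSID",
    "WSTLGALTAPAFTIALLAAIESLLSASVADGMTGGRHKPNMELIAQGIGNIGSALFGGIPATGAIARTATNIKAGAKSPV",
    "SGMIHALTLLAILMAFAHYAQQIPLAVLAGILTVVCYNMSEIHTFSRLLKGPRQDAAVLVITFLLTVFVDLVVAVEVGVV",
    "LAALLFMGRMAQISDVSAIKNELLENDEEDDGNRSAAKLDIPEGVEVFDVKGPFFFGAVEQFKDQVLETLEHDTKVVILR",
    "MRLVPALDATGLNVLSDFCHQCREHGSTLLVCGVQPQPLDVIRHAPFYRELKRYNICENIDAALNRACKIINGPAPKHL",
])


def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    count = sum(seq.count(r) for r in residues)
    return 100.0 * count / len(seq)


for label, residues in (("Cys", "C"), ("Met", "M"), ("Cys+Met", "CM")):
    print(f"{residue_percent(PROTEIN, residues):.1f} %{label}")
```

Because the translated and mature sequences in this record are identical, both sets of percentages are the same.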

Secondary structure:

>Translated Secondary Structure
MFKPALLSSLKTYTKQTFLADLFAGLTVGVVAIPLAMAFAIACGLSPTQGLITAIVAGFL
CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH
ISLFSGSKYQIGGPTGAFVIIIMGVLEQYHASGLLVCTLMAGLFLIIFGFCRMGALIRFI
HHHHCCCCEECCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH
PFPVTTGFTSGIAVVIFSTQIKDIFGLTITEKIPGEFIEKWACYFHYFHTINWAALGLAA
CCCCCCCCCCCEEEEEEHHHHHHHHCCHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHH
GTVIITLLSRRFWPRIPAMLVGMLGMTAVSVAFSLPVTTIGQAFGSLPNTLPLPSLPSID
HHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCCCCCCCCCCCCCC
WSTLGALTAPAFTIALLAAIESLLSASVADGMTGGRHKPNMELIAQGIGNIGSALFGGIP
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCC
ATGAIARTATNIKAGAKSPVSGMIHALTLLAILMAFAHYAQQIPLAVLAGILTVVCYNMS
CCCHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHCHH
EIHTFSRLLKGPRQDAAVLVITFLLTVFVDLVVAVEVGVVLAALLFMGRMAQISDVSAIK
HHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH
NELLENDEEDDGNRSAAKLDIPEGVEVFDVKGPFFFGAVEQFKDQVLETLEHDTKVVILR
HHHHCCCCCCCCCCCCEEECCCCCCEEEECCCCHHHHHHHHHHHHHHHHHCCCCCEEEHH
MRLVPALDATGLNVLSDFCHQCREHGSTLLVCGVQPQPLDVIRHAPFYRELKRYNICENI
HHHHHCCCCCCHHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHCCHHHHHHHHCCHHHHH
DAALNRACKIINGPAPKHL
HHHHHHHHHHHCCCCCCCC
>Mature Secondary Structure
MFKPALLSSLKTYTKQTFLADLFAGLTVGVVAIPLAMAFAIACGLSPTQGLITAIVAGFL
CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH
ISLFSGSKYQIGGPTGAFVIIIMGVLEQYHASGLLVCTLMAGLFLIIFGFCRMGALIRFI
HHHHCCCCEECCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH
PFPVTTGFTSGIAVVIFSTQIKDIFGLTITEKIPGEFIEKWACYFHYFHTINWAALGLAA
CCCCCCCCCCCEEEEEEHHHHHHHHCCHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHH
GTVIITLLSRRFWPRIPAMLVGMLGMTAVSVAFSLPVTTIGQAFGSLPNTLPLPSLPSID
HHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCCCCCCCCCCCCCC
WSTLGALTAPAFTIALLAAIESLLSASVADGMTGGRHKPNMELIAQGIGNIGSALFGGIP
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCC
ATGAIARTATNIKAGAKSPVSGMIHALTLLAILMAFAHYAQQIPLAVLAGILTVVCYNMS
CCCHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHCHH
EIHTFSRLLKGPRQDAAVLVITFLLTVFVDLVVAVEVGVVLAALLFMGRMAQISDVSAIK
HHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH
NELLENDEEDDGNRSAAKLDIPEGVEVFDVKGPFFFGAVEQFKDQVLETLEHDTKVVILR
HHHHCCCCCCCCCCCCEEECCCCCCEEEECCCCHHHHHHHHHHHHHHHHHCCCCCEEEHH
MRLVPALDATGLNVLSDFCHQCREHGSTLLVCGVQPQPLDVIRHAPFYRELKRYNICENI
HHHHHCCCCCCHHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHCCHHHHHHHHCCHHHHH
DAALNRACKIINGPAPKHL
HHHHHHHHHHHCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha
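
Note on the secondary structure blocks: residue lines and state lines alternate, with H, E and C most likely standing for helix, strand and coil (an assumption, since the record gives no legend). The sketch below separates the two tracks and reports the percentage of each state. The function names are illustrative, and only the last two line pairs of the prediction are used as input.

```python
from collections import Counter

STATES = "HEC"  # assumed meaning: helix, strand, coil


def deinterleave(lines):
    """Split alternating residue/state lines into two joined strings."""
    residues = "".join(lines[0::2])
    states = "".join(lines[1::2])
    return residues, states


def state_composition(states: str) -> dict:
    """Percentage of positions assigned to each state."""
    unknown = set(states) - set(STATES)
    if unknown or not states:
        raise ValueError(f"unexpected state track: {sorted(unknown)}")
    counts = Counter(states)
    return {s: 100.0 * counts[s] / len(states) for s in STATES}


# Last two residue/state pairs of the prediction in this record.
lines = [
    "MRLVPALDATGLNVLSDFCHQCREHGSTLLVCGVQPQPLDVIRHAPFYRELKRYNICENI",
    "HHHHHCCCCCCHHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHCCHHHHHHHHCCHHHHH",
    "DAALNRACKIINGPAPKHL",
    "HHHHHHHHHHHCCCCCCCC",
]
residues, states = deinterleave(lines)
for state, percent in state_composition(states).items():
    print(f"{state}: {percent:.1f}%")
```

Run over the full prediction, a helix-dominated composition would be consistent with the structure class Alpha listed in this record.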

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: Proton [Periplasm]; SO4(2-) [Periplasm] [C]

Specific reaction: Proton [Periplasm] + SO4(2-) [Periplasm] = Proton [Cytoplasm] + SO4(2-) [Cytoplasm] [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 9384377 [H]