Definition Akkermansia muciniphila ATCC BAA-835, complete genome.
Accession NC_010655
Length 2,664,102

Click here to switch to the map view.

The map label for this gene is cysNC [H]

Identifier: 187735791

GI number: 187735791

Start: 1573952

End: 1575679

Strand: Reverse

Name: cysNC [H]

Synonym: Amuc_1298

Alternate gene names: 187735791

Gene position: 1575679-1573952 (Counterclockwise)

Preceding gene: 187735792

Following gene: 187735790

Centisome position: 59.14

GC content: 51.56

Gene sequence:

>1728_bases
ATGGACATCGACTCATACCTGAACGAACACGAAAATAAAAGCCTGCTACGCGTACTTACCTGCGGTTCCGTGGACGACGG
GAAATCTACACTCATCGGACGCCTCCTTTATGACAGCAAACTGATTTTTGACGACCAGCTGGCAGAGCTGCGCAAAGCCA
GTGAAAAAAATGGAACTGCTGGAGCAGGTAAAATTGATTACGCCCTGTTGCTGGACGGCCTTAGAGCGGAACGGGAACAG
GGAATAACCATTGATGTAGCCTACCGGTACTTCACCACCCCACGCCGCAAATTCATCATTGCCGACTGCCCCGGACATGA
ACAATACACCCGGAACATGGCCACCGGAGCTTCCACGGCAGATGCCGCCATCATCCTGATTGATGCTCGCCATGGAGTAC
TCACACAAACGAAACGGCATGCGTTCATCGTCTCTCTTCTGAAAATACGGCACCTCATCGTAGCCGTCAACAAGATGGAT
CTTTTGAAATACTCTGAAGAAAAATTCCGGAAAATTGAAGAAGAATTCGGAAGCTTCACGCAACAGTTGAATATCCCGGA
TGTTCGTTTCGTTCCCATTTCCGCCATTGAAGGGGAAAATGTGACGCAAACAACAGGAAAAACGCCTTGGTACCAGGGCG
ATCATCTGCTTTCCATTCTGGAAACGCTGGATGCCAGCGACAGCAGGAATCTCCGGGATTTCCGCTTTCCGGTACAGACA
GTCATACGGCCCAATCTCGATTTCCGGGGGTTCGCCGGCTCCATTACCTCCGGCTCCATCCGCAGGGGTGATCCTATCGT
GACGTTGCCTTCGTTTCAAAACAGCCGGATCAAAAGAATCGTTACTCCGGACGGAGAACTGGAAGAAGCATTCTCTCCCC
AAGCCGTCGTATTGGAACTTGAGGATGAAATAGATATCAGCAGCGGTGACATGATTGTCAAAAAAGGGAATCTCCCCCAT
ATAGAGGATCGGCTGGAAGCCCGTGTCATCTGGATGTCTGAAAAACCCCTTCTTCCCAGAAGCAAATACATTATGCGCCA
TGCGGGCAGAAATATCCAGGGGAGGATAGTAGAACTCCAATACGACATAGACGTCAATACACTGGAAAGCCGCCATGCAA
CGCAGCTTCCTCTGAATCATGTCGGCCGTATCGTCCTGGAAACCAGTTCCCCCTTGTTCTATGATTATTACCGGGATAAC
CGTTCCGGAGGAGCTTTCATCCTGATTGACCCGCTGAATAACGTCACAGCGGGAGCTGGTATGCTCCGCCCCCCTCACAG
AGATAAGGTTCCCGAAAAAGAAAAGGAACAACTCCAAACATTCGTTTCAAGCGATGAACGCGCTGAAACCTTCGGGCATG
GTGGAAAACAAATTTACGTAGCGGGAGAAGACAGCGAACTGGCACGCAGCTTCGCCAAACAGCTGGAACGGGAACTCCAT
CGGCTCAAGGCTCATACCTACGGTCTGGATTTCAAGGCAGAAGGCGTATGGGGCAGATCCGCTAGAGAAATCGTCAATGC
CTCAGGCCTGCTGGCCGAAGCGGGGCTCATGAGCATTGCAGTGCTGCCGGGCCTTCCCGTCCTGCCCAGAAAAGCAAAGG
GAACCTACTGCATCTGGCTTGGGAATGTCGTCTCCGCGCCAGAAACGGCAGACCGCATCCTCCCCCCTGCGAAAGCAAAC
GAAAATACTGCGTTTCTTCTGGCGCGCACTCTCTATGTGGAATTTTAA

Upstream 100 bases:

>100_bases
TGACCACGACGGTGATGCCTCCATGGAACAGAAAAAACGAGAAGGATATTTTTAACCCCTATTATGAACCATTCTGACGA
TACCTCTTATTTTTACCATT

Downstream 100 bases:

>100_bases
TTTTATCCCATTTAAATACCTGTTCTACCATTTTATGAAAACTCTGCCTCTTGCATTAGATGCTCTTCTTTGTATTGGCG
CTCTCCTGGTTCCAGCATTT

Product: sulfate adenylyltransferase, large subunit

Products: NA

Alternate protein names: Sulfate adenylyltransferase subunit 1; ATP-sulfurylase large subunit; Sulfate adenylate transferase; SAT; Adenylyl-sulfate kinase; APS kinase; ATP adenosine-5'-phosphosulfate 3'-phosphotransferase [H]

Number of amino acids: Translated: 575; Mature: 575

Protein sequence:

>575_residues
MDIDSYLNEHENKSLLRVLTCGSVDDGKSTLIGRLLYDSKLIFDDQLAELRKASEKNGTAGAGKIDYALLLDGLRAEREQ
GITIDVAYRYFTTPRRKFIIADCPGHEQYTRNMATGASTADAAIILIDARHGVLTQTKRHAFIVSLLKIRHLIVAVNKMD
LLKYSEEKFRKIEEEFGSFTQQLNIPDVRFVPISAIEGENVTQTTGKTPWYQGDHLLSILETLDASDSRNLRDFRFPVQT
VIRPNLDFRGFAGSITSGSIRRGDPIVTLPSFQNSRIKRIVTPDGELEEAFSPQAVVLELEDEIDISSGDMIVKKGNLPH
IEDRLEARVIWMSEKPLLPRSKYIMRHAGRNIQGRIVELQYDIDVNTLESRHATQLPLNHVGRIVLETSSPLFYDYYRDN
RSGGAFILIDPLNNVTAGAGMLRPPHRDKVPEKEKEQLQTFVSSDERAETFGHGGKQIYVAGEDSELARSFAKQLERELH
RLKAHTYGLDFKAEGVWGRSAREIVNASGLLAEAGLMSIAVLPGLPVLPRKAKGTYCIWLGNVVSAPETADRILPPAKAN
ENTAFLLARTLYVEF

Sequences:

>Translated_575_residues
MDIDSYLNEHENKSLLRVLTCGSVDDGKSTLIGRLLYDSKLIFDDQLAELRKASEKNGTAGAGKIDYALLLDGLRAEREQ
GITIDVAYRYFTTPRRKFIIADCPGHEQYTRNMATGASTADAAIILIDARHGVLTQTKRHAFIVSLLKIRHLIVAVNKMD
LLKYSEEKFRKIEEEFGSFTQQLNIPDVRFVPISAIEGENVTQTTGKTPWYQGDHLLSILETLDASDSRNLRDFRFPVQT
VIRPNLDFRGFAGSITSGSIRRGDPIVTLPSFQNSRIKRIVTPDGELEEAFSPQAVVLELEDEIDISSGDMIVKKGNLPH
IEDRLEARVIWMSEKPLLPRSKYIMRHAGRNIQGRIVELQYDIDVNTLESRHATQLPLNHVGRIVLETSSPLFYDYYRDN
RSGGAFILIDPLNNVTAGAGMLRPPHRDKVPEKEKEQLQTFVSSDERAETFGHGGKQIYVAGEDSELARSFAKQLERELH
RLKAHTYGLDFKAEGVWGRSAREIVNASGLLAEAGLMSIAVLPGLPVLPRKAKGTYCIWLGNVVSAPETADRILPPAKAN
ENTAFLLARTLYVEF
>Mature_575_residues
MDIDSYLNEHENKSLLRVLTCGSVDDGKSTLIGRLLYDSKLIFDDQLAELRKASEKNGTAGAGKIDYALLLDGLRAEREQ
GITIDVAYRYFTTPRRKFIIADCPGHEQYTRNMATGASTADAAIILIDARHGVLTQTKRHAFIVSLLKIRHLIVAVNKMD
LLKYSEEKFRKIEEEFGSFTQQLNIPDVRFVPISAIEGENVTQTTGKTPWYQGDHLLSILETLDASDSRNLRDFRFPVQT
VIRPNLDFRGFAGSITSGSIRRGDPIVTLPSFQNSRIKRIVTPDGELEEAFSPQAVVLELEDEIDISSGDMIVKKGNLPH
IEDRLEARVIWMSEKPLLPRSKYIMRHAGRNIQGRIVELQYDIDVNTLESRHATQLPLNHVGRIVLETSSPLFYDYYRDN
RSGGAFILIDPLNNVTAGAGMLRPPHRDKVPEKEKEQLQTFVSSDERAETFGHGGKQIYVAGEDSELARSFAKQLERELH
RLKAHTYGLDFKAEGVWGRSAREIVNASGLLAEAGLMSIAVLPGLPVLPRKAKGTYCIWLGNVVSAPETADRILPPAKAN
ENTAFLLARTLYVEF

Specific function: APS kinase catalyzes the synthesis of activated sulfate [H]

COG id: COG2895

COG function: function code P; GTPases - Sulfate adenylate transferase subunit 1

Gene ontology:

Cell location: Cytoplasm [C]

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: In the C-terminal section; belongs to the APS kinase family [H]

Homologues:

Organism=Homo sapiens, GI223555963, Length=436, Percent_Identity=28.6697247706422, Blast_Score=166, Evalue=5e-41,
Organism=Homo sapiens, GI5729864, Length=436, Percent_Identity=28.6697247706422, Blast_Score=165, Evalue=9e-41,
Organism=Homo sapiens, GI194018520, Length=440, Percent_Identity=27.2727272727273, Blast_Score=146, Evalue=5e-35,
Organism=Homo sapiens, GI194097354, Length=440, Percent_Identity=27.2727272727273, Blast_Score=146, Evalue=5e-35,
Organism=Homo sapiens, GI194018522, Length=440, Percent_Identity=27.2727272727273, Blast_Score=146, Evalue=6e-35,
Organism=Homo sapiens, GI46094014, Length=444, Percent_Identity=26.1261261261261, Blast_Score=143, Evalue=5e-34,
Organism=Homo sapiens, GI4503475, Length=331, Percent_Identity=29.9093655589124, Blast_Score=142, Evalue=6e-34,
Organism=Homo sapiens, GI4503471, Length=453, Percent_Identity=27.3730684326711, Blast_Score=142, Evalue=1e-33,
Organism=Homo sapiens, GI34147630, Length=345, Percent_Identity=28.1159420289855, Blast_Score=101, Evalue=2e-21,
Organism=Escherichia coli, GI1789108, Length=426, Percent_Identity=53.0516431924883, Blast_Score=451, Evalue=1e-128,
Organism=Escherichia coli, GI1790412, Length=150, Percent_Identity=31.3333333333333, Blast_Score=78, Evalue=2e-15,
Organism=Escherichia coli, GI1789737, Length=150, Percent_Identity=31.3333333333333, Blast_Score=78, Evalue=2e-15,
Organism=Escherichia coli, GI2367247, Length=387, Percent_Identity=24.5478036175711, Blast_Score=78, Evalue=2e-15,
Organism=Caenorhabditis elegans, GI115532067, Length=316, Percent_Identity=30.6962025316456, Blast_Score=164, Evalue=1e-40,
Organism=Caenorhabditis elegans, GI115532065, Length=316, Percent_Identity=30.6962025316456, Blast_Score=164, Evalue=1e-40,
Organism=Caenorhabditis elegans, GI32566629, Length=433, Percent_Identity=26.5588914549654, Blast_Score=156, Evalue=3e-38,
Organism=Caenorhabditis elegans, GI17552884, Length=440, Percent_Identity=27.9545454545455, Blast_Score=148, Evalue=1e-35,
Organism=Caenorhabditis elegans, GI17569207, Length=440, Percent_Identity=27.9545454545455, Blast_Score=148, Evalue=1e-35,
Organism=Caenorhabditis elegans, GI32566303, Length=433, Percent_Identity=26.7898383371824, Blast_Score=125, Evalue=7e-29,
Organism=Caenorhabditis elegans, GI25141371, Length=257, Percent_Identity=26.8482490272374, Blast_Score=95, Evalue=8e-20,
Organism=Caenorhabditis elegans, GI32566301, Length=150, Percent_Identity=34.6666666666667, Blast_Score=94, Evalue=2e-19,
Organism=Caenorhabditis elegans, GI17556456, Length=345, Percent_Identity=23.1884057971014, Blast_Score=77, Evalue=2e-14,
Organism=Saccharomyces cerevisiae, GI6325337, Length=308, Percent_Identity=33.4415584415584, Blast_Score=162, Evalue=1e-40,
Organism=Saccharomyces cerevisiae, GI6319594, Length=308, Percent_Identity=33.4415584415584, Blast_Score=162, Evalue=1e-40,
Organism=Saccharomyces cerevisiae, GI6322937, Length=400, Percent_Identity=30, Blast_Score=154, Evalue=3e-38,
Organism=Saccharomyces cerevisiae, GI6320377, Length=429, Percent_Identity=27.039627039627, Blast_Score=120, Evalue=5e-28,
Organism=Saccharomyces cerevisiae, GI6324761, Length=259, Percent_Identity=28.957528957529, Blast_Score=100, Evalue=7e-22,
Organism=Drosophila melanogaster, GI45550900, Length=429, Percent_Identity=27.7389277389277, Blast_Score=152, Evalue=5e-37,
Organism=Drosophila melanogaster, GI24652838, Length=302, Percent_Identity=31.4569536423841, Blast_Score=145, Evalue=5e-35,
Organism=Drosophila melanogaster, GI17137572, Length=302, Percent_Identity=31.4569536423841, Blast_Score=145, Evalue=5e-35,
Organism=Drosophila melanogaster, GI45553807, Length=271, Percent_Identity=32.4723247232472, Blast_Score=144, Evalue=2e-34,
Organism=Drosophila melanogaster, GI45553816, Length=271, Percent_Identity=32.4723247232472, Blast_Score=144, Evalue=2e-34,
Organism=Drosophila melanogaster, GI24651721, Length=271, Percent_Identity=32.4723247232472, Blast_Score=144, Evalue=2e-34,
Organism=Drosophila melanogaster, GI17864154, Length=271, Percent_Identity=32.4723247232472, Blast_Score=144, Evalue=2e-34,
Organism=Drosophila melanogaster, GI17137380, Length=356, Percent_Identity=25.2808988764045, Blast_Score=122, Evalue=9e-28,
Organism=Drosophila melanogaster, GI281363316, Length=352, Percent_Identity=27.5568181818182, Blast_Score=105, Evalue=7e-23,
Organism=Drosophila melanogaster, GI17864358, Length=352, Percent_Identity=27.5568181818182, Blast_Score=105, Evalue=7e-23,
Organism=Drosophila melanogaster, GI19921738, Length=165, Percent_Identity=31.5151515151515, Blast_Score=81, Evalue=2e-15,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR002891
- InterPro:   IPR000795
- InterPro:   IPR011779
- InterPro:   IPR009001
- InterPro:   IPR004161
- InterPro:   IPR009000 [H]

Pfam domain/function: PF01583 APS_kinase; PF00009 GTP_EFTU; PF03144 GTP_EFTU_D2 [H]

EC number: =2.7.7.4; =2.7.1.25 [H]

Molecular weight: Translated: 64385; Mature: 64385

Theoretical pI: Translated: 6.76; Mature: 6.76

Prosite motif: PS00301 EFACTOR_GTP

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.5 %Cys     (Translated Protein)
1.4 %Met     (Translated Protein)
1.9 %Cys+Met (Translated Protein)
0.5 %Cys     (Mature Protein)
1.4 %Met     (Mature Protein)
1.9 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MDIDSYLNEHENKSLLRVLTCGSVDDGKSTLIGRLLYDSKLIFDDQLAELRKASEKNGTA
CCHHHHHHHCCCCCEEEEEECCCCCCCHHHHHHHHHHCCEEEEHHHHHHHHHHHCCCCCC
GAGKIDYALLLDGLRAEREQGITIDVAYRYFTTPRRKFIIADCPGHEQYTRNMATGASTA
CCCCCCHHHHHHHHHHHHHCCCEEEEEEEEECCCCCEEEEEECCCCHHHHHHHHCCCCCC
DAAIILIDARHGVLTQTKRHAFIVSLLKIRHLIVAVNKMDLLKYSEEKFRKIEEEFGSFT
CEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
QQLNIPDVRFVPISAIEGENVTQTTGKTPWYQGDHLLSILETLDASDSRNLRDFRFPVQT
HHCCCCCEEEEEEEEECCCCCEECCCCCCCCCCHHHHHHHHHHCCCCCCCCHHHHCCHHH
VIRPNLDFRGFAGSITSGSIRRGDPIVTLPSFQNSRIKRIVTPDGELEEAFSPQAVVLEL
HHCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCEEEEECCCCCHHHHCCCCEEEEEE
EDEIDISSGDMIVKKGNLPHIEDRLEARVIWMSEKPLLPRSKYIMRHAGRNIQGRIVELQ
CCCCCCCCCCEEEECCCCCCHHHCCCEEEEEEECCCCCCHHHHHHHHCCCCCCEEEEEEE
YDIDVNTLESRHATQLPLNHVGRIVLETSSPLFYDYYRDNRSGGAFILIDPLNNVTAGAG
EECCCCCCCCCCCCCCCHHHCCEEEEECCCCEEEEEEECCCCCCEEEEECCCCCCCCCCC
MLRPPHRDKVPEKEKEQLQTFVSSDERAETFGHGGKQIYVAGEDSELARSFAKQLERELH
CCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHCCCCCEEEEECCCHHHHHHHHHHHHHHHH
RLKAHTYGLDFKAEGVWGRSAREIVNASGLLAEAGLMSIAVLPGLPVLPRKAKGTYCIWL
HHHHHHCCCCEEECCCCCCHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEE
GNVVSAPETADRILPPAKANENTAFLLARTLYVEF
CCCCCCCCHHHHCCCCCCCCCCEEEEEEEEEEECC
>Mature Secondary Structure
MDIDSYLNEHENKSLLRVLTCGSVDDGKSTLIGRLLYDSKLIFDDQLAELRKASEKNGTA
CCHHHHHHHCCCCCEEEEEECCCCCCCHHHHHHHHHHCCEEEEHHHHHHHHHHHCCCCCC
GAGKIDYALLLDGLRAEREQGITIDVAYRYFTTPRRKFIIADCPGHEQYTRNMATGASTA
CCCCCCHHHHHHHHHHHHHCCCEEEEEEEEECCCCCEEEEEECCCCHHHHHHHHCCCCCC
DAAIILIDARHGVLTQTKRHAFIVSLLKIRHLIVAVNKMDLLKYSEEKFRKIEEEFGSFT
CEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
QQLNIPDVRFVPISAIEGENVTQTTGKTPWYQGDHLLSILETLDASDSRNLRDFRFPVQT
HHCCCCCEEEEEEEEECCCCCEECCCCCCCCCCHHHHHHHHHHCCCCCCCCHHHHCCHHH
VIRPNLDFRGFAGSITSGSIRRGDPIVTLPSFQNSRIKRIVTPDGELEEAFSPQAVVLEL
HHCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCEEEEECCCCCHHHHCCCCEEEEEE
EDEIDISSGDMIVKKGNLPHIEDRLEARVIWMSEKPLLPRSKYIMRHAGRNIQGRIVELQ
CCCCCCCCCCEEEECCCCCCHHHCCCEEEEEEECCCCCCHHHHHHHHCCCCCCEEEEEEE
YDIDVNTLESRHATQLPLNHVGRIVLETSSPLFYDYYRDNRSGGAFILIDPLNNVTAGAG
EECCCCCCCCCCCCCCCHHHCCEEEEECCCCEEEEEEECCCCCCEEEEECCCCCCCCCCC
MLRPPHRDKVPEKEKEQLQTFVSSDERAETFGHGGKQIYVAGEDSELARSFAKQLERELH
CCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHCCCCCEEEEECCCHHHHHHHHHHHHHHHH
RLKAHTYGLDFKAEGVWGRSAREIVNASGLLAEAGLMSIAVLPGLPVLPRKAKGTYCIWL
HHHHHHCCCCEEECCCCCCHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEE
GNVVSAPETADRILPPAKANENTAFLLARTLYVEF
CCCCCCCCHHHHCCCCCCCCCCEEEEEEEEEEECC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 12835416 [H]