Definition | Akkermansia muciniphila ATCC BAA-835, complete genome. |
---|---|
Accession | NC_010655 |
Length | 2,664,102 |
Click here to switch to the map view.
The map label for this gene is nikA [C]
Identifier: 187736172
GI number: 187736172
Start: 2046109
End: 2048115
Strand: Direct
Name: nikA [C]
Synonym: Amuc_1685
Alternate gene names: 187736172
Gene position: 2046109-2048115 (Clockwise)
Preceding gene: 187736171
Following gene: 187736173
Centisome position: 76.8
GC content: 55.41
Gene sequence:
>2007_bases ATGGACGGCAGGGGGATGCCTGCCGTATTTGACGGTTTTGTGGAACGCTGGAACAGGGCCGTGGAACGGGAACTGCGCCG TGAATTGGCTGAAGGGGAGAGGAAGGAAAAACTTCTGGTCGTTTCCCGCGGGGAAGGGGACGGCCGGCGTTTGGCGGAAC AGGCTCGTTCCGTGGAACGCATCAGGAAAAGGCTGTTATTGAAAGACCATATCCGCCTTCTTCCTTTGGAAGACGTTCCG GACGGATTGGAATGGCAAACCGGAATGGAAGAGCCGGATGTGGGTGATCCGCGCGCCGTCAAGGGTGGAAAAGTCCGGCT GTGGATCAATACTCCGTTTCCCGGAACTTTGCGTGCTTTCGGGCCGGGAAGTGAAAATTTTTTCAATTATTCCGCCATTG ACAACGTGTGGCTTCCCCTCGTGGGGCTTCATCCGGAAACCTTCCGTCCCATTCCGGGCCTGGCGGACCGCTGGGCTTTG TCCGCAGACGGAAAAACCGTTTTTTATCATCTGGATCCGGAAGCGGCCTATTCGGACGGACGCATCGTGAAAGCGCAGGA TTTTCTGCTGAACATTTGCCTCCGTACGTCCGGTTTTGCCCGGGATCCGTTCTGGACTACCCTGTTCCGTTCCATGTATG ATCAGATTACGGTATACGGGGATTCCGTCATTGCCCTTACGCTGCCGTCCCGCGGGCCGCTGCTGCCTTACATGGCCTGT GCGGATTTTCATCCGGCCCATCCCGGTTTTTACCATGATTTCAACGCTCTGTTTCTGGAACGCTACCAGTGGAAAGTCCC TCCAAATACGGGTGGCTACATGGTTGTTCCCTCTAAAATCCGGATAGGAGAAAGCATTACCCTGGCCCGCGTGAAAAATT GGTGGGCGAAGGAGAGGAAGTATTACCGCCATTCCTGCAATGCGGACCAGGTGGAACATGTGTTTGTGAACATGGAAAAC AAGGCGGTTGAAATGTTCCGCCGCGGGGAACTGGACATCATGAACGTTCGCAAGCCGGAGGTGTGGGAAACCGGGCTGGA GCTGCCGGAAGTGCACCGGGGCTATATAGATAAATACAGCCTGGAAGCCAATTACGCCTGTCCCCCGTACGGACTGTATC TGAATTGCTCGGACAAGTTGTTGAAAAATCCGGATATCCGCCGCGGTCTGGCGCATTCCGTGAACATGGGCCTGGTGATT GATACTTTGTTCCGCGGGAATATGAGAAGGCTCGGTTCCTACATGGAGGGGTACGGAGACCTGACTCTTCCGTTGAAAGC TCCGGAATACAGCAAGAAAAAAGCCATGGAATATTTTGCCCGCGCCGGCTACCGGGAAATGGGGACTGACGGAGTGCTGA AAAATGAACGGGGAGAGCGGCTGGTGGTGGAACTGACGTTTGCGGATTCATCCACCCTGATGACCAACGTCTGTTCCATC CTCCGGCAGGAGGCCCTGAAGTGCGGGGTGGACCTGCGTCTGGATTCCCTTACCTACAGTGTCTGTTCCCGTAAGGTTTT TGAAAAACGGTATCAGGCGGCTCTATGGGCGTGGCCGCTGCAGACGCCGTTCCCCCGGCTGTATGAAACATTCTCTTCCG AACTGGCGTATGATGCCCGCGGAAATCCCGTAGGCAATACGAATAATATTATGGCCGTGTCGGACGCTGGACTGGATGCC GCCCTGGATGCGGAACGCAATGCCCCGGATACCGGTTCCCTGAAACTGGCCCTTCACCGTGCCCAGCAGCGTCTTCATGA ACTTTGCGTCTGGATTCCCGGCTGGAGGGAACCCTACACGCATATCGCCTGCTGGCGCTGGATACGCTGGCCGGAATCTC CCACCCGGTTCTGTTCCCCCAGAATTTACAATCCGCTGGAGTCGCATCTGTACTGGGTGGATGAAGAAATGAAGAAGGAG ACGCTGGAGGCTCGGAGCCGGGGCGTACCGTTTGAGGAAAAGCATCAAATAATATATCTTGAAAGACAGGATGGTGCCGC TCCGTAA
Upstream 100 bases:
>100_bases GATATCCATACTGGGGGCATTGGCCCTTTCCGTCTTTCTGGGAGGTGTCTTTTACTTTTCCCAGCGTGAGAAGAATGTTT GGCATGAATTTTCCGGACGG
Downstream 100 bases:
>100_bases TAAAACCGGGTCACCGTCTCCTTTTTAAATTGAAAGGAGGTGGTTTTTCATCATAATCTCCCTCTCATTTACCGTAATTA ATTATTAATTCCATAATCAT
Product: extracellular solute-binding protein family 5
Products: Ni2; [Cytoplasm]; ADP; phosphate [C]
Alternate protein names: NA
Number of amino acids: Translated: 668; Mature: 668
Protein sequence:
>668_residues MDGRGMPAVFDGFVERWNRAVERELRRELAEGERKEKLLVVSRGEGDGRRLAEQARSVERIRKRLLLKDHIRLLPLEDVP DGLEWQTGMEEPDVGDPRAVKGGKVRLWINTPFPGTLRAFGPGSENFFNYSAIDNVWLPLVGLHPETFRPIPGLADRWAL SADGKTVFYHLDPEAAYSDGRIVKAQDFLLNICLRTSGFARDPFWTTLFRSMYDQITVYGDSVIALTLPSRGPLLPYMAC ADFHPAHPGFYHDFNALFLERYQWKVPPNTGGYMVVPSKIRIGESITLARVKNWWAKERKYYRHSCNADQVEHVFVNMEN KAVEMFRRGELDIMNVRKPEVWETGLELPEVHRGYIDKYSLEANYACPPYGLYLNCSDKLLKNPDIRRGLAHSVNMGLVI DTLFRGNMRRLGSYMEGYGDLTLPLKAPEYSKKKAMEYFARAGYREMGTDGVLKNERGERLVVELTFADSSTLMTNVCSI LRQEALKCGVDLRLDSLTYSVCSRKVFEKRYQAALWAWPLQTPFPRLYETFSSELAYDARGNPVGNTNNIMAVSDAGLDA ALDAERNAPDTGSLKLALHRAQQRLHELCVWIPGWREPYTHIACWRWIRWPESPTRFCSPRIYNPLESHLYWVDEEMKKE TLEARSRGVPFEEKHQIIYLERQDGAAP
Sequences:
>Translated_668_residues MDGRGMPAVFDGFVERWNRAVERELRRELAEGERKEKLLVVSRGEGDGRRLAEQARSVERIRKRLLLKDHIRLLPLEDVP DGLEWQTGMEEPDVGDPRAVKGGKVRLWINTPFPGTLRAFGPGSENFFNYSAIDNVWLPLVGLHPETFRPIPGLADRWAL SADGKTVFYHLDPEAAYSDGRIVKAQDFLLNICLRTSGFARDPFWTTLFRSMYDQITVYGDSVIALTLPSRGPLLPYMAC ADFHPAHPGFYHDFNALFLERYQWKVPPNTGGYMVVPSKIRIGESITLARVKNWWAKERKYYRHSCNADQVEHVFVNMEN KAVEMFRRGELDIMNVRKPEVWETGLELPEVHRGYIDKYSLEANYACPPYGLYLNCSDKLLKNPDIRRGLAHSVNMGLVI DTLFRGNMRRLGSYMEGYGDLTLPLKAPEYSKKKAMEYFARAGYREMGTDGVLKNERGERLVVELTFADSSTLMTNVCSI LRQEALKCGVDLRLDSLTYSVCSRKVFEKRYQAALWAWPLQTPFPRLYETFSSELAYDARGNPVGNTNNIMAVSDAGLDA ALDAERNAPDTGSLKLALHRAQQRLHELCVWIPGWREPYTHIACWRWIRWPESPTRFCSPRIYNPLESHLYWVDEEMKKE TLEARSRGVPFEEKHQIIYLERQDGAAP >Mature_668_residues MDGRGMPAVFDGFVERWNRAVERELRRELAEGERKEKLLVVSRGEGDGRRLAEQARSVERIRKRLLLKDHIRLLPLEDVP DGLEWQTGMEEPDVGDPRAVKGGKVRLWINTPFPGTLRAFGPGSENFFNYSAIDNVWLPLVGLHPETFRPIPGLADRWAL SADGKTVFYHLDPEAAYSDGRIVKAQDFLLNICLRTSGFARDPFWTTLFRSMYDQITVYGDSVIALTLPSRGPLLPYMAC ADFHPAHPGFYHDFNALFLERYQWKVPPNTGGYMVVPSKIRIGESITLARVKNWWAKERKYYRHSCNADQVEHVFVNMEN KAVEMFRRGELDIMNVRKPEVWETGLELPEVHRGYIDKYSLEANYACPPYGLYLNCSDKLLKNPDIRRGLAHSVNMGLVI DTLFRGNMRRLGSYMEGYGDLTLPLKAPEYSKKKAMEYFARAGYREMGTDGVLKNERGERLVVELTFADSSTLMTNVCSI LRQEALKCGVDLRLDSLTYSVCSRKVFEKRYQAALWAWPLQTPFPRLYETFSSELAYDARGNPVGNTNNIMAVSDAGLDA ALDAERNAPDTGSLKLALHRAQQRLHELCVWIPGWREPYTHIACWRWIRWPESPTRFCSPRIYNPLESHLYWVDEEMKKE TLEARSRGVPFEEKHQIIYLERQDGAAP
Specific function: Involved In A Nickel Transport System, Probably Represents The Nickel Binder. [C]
COG id: COG4166
COG function: function code E; ABC-type oligopeptide transport system, periplasmic component
Gene ontology:
Cell location: Periplasm (Potential) [H]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the bacterial solute-binding protein 5 family [H]
Homologues:
None
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000914 [H]
Pfam domain/function: PF00496 SBP_bac_5 [H]
EC number: NA
Molecular weight: Translated: 76762; Mature: 76762
Theoretical pI: Translated: 7.73; Mature: 7.73
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.6 %Cys (Translated Protein) 2.5 %Met (Translated Protein) 4.2 %Cys+Met (Translated Protein) 1.6 %Cys (Mature Protein) 2.5 %Met (Mature Protein) 4.2 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MDGRGMPAVFDGFVERWNRAVERELRRELAEGERKEKLLVVSRGEGDGRRLAEQARSVER CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCHHHHHHHHHHHHH IRKRLLLKDHIRLLPLEDVPDGLEWQTGMEEPDVGDPRAVKGGKVRLWINTPFPGTLRAF HHHHHHHHHCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCEEEC GPGSENFFNYSAIDNVWLPLVGLHPETFRPIPGLADRWALSADGKTVFYHLDPEAAYSDG CCCCCCCEECHHHCCCHHCEECCCCCCCCCCCCCHHHEEEECCCCEEEEEECCCCCCCCC RIVKAQDFLLNICLRTSGFARDPFWTTLFRSMYDQITVYGDSVIALTLPSRGPLLPYMAC CEEEHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHEEEECCEEEEEECCCCCCCCCHHHH ADFHPAHPGFYHDFNALFLERYQWKVPPNTGGYMVVPSKIRIGESITLARVKNWWAKERK CCCCCCCCCCHHHHHHHHHHHHEEECCCCCCCEEEECCEEECCCCCHHHHHHHHHHHHHH YYRHSCNADQVEHVFVNMENKAVEMFRRGELDIMNVRKPEVWETGLELPEVHRGYIDKYS HHHHCCCCCCEEEEEEECCHHHHHHHHCCCCEEEECCCCHHHHCCCCCCHHHHCCCCCEE LEANYACPPYGLYLNCSDKLLKNPDIRRGLAHSVNMGLVIDTLFRGNMRRLGSYMEGYGD CCCCCCCCCCCEEECCCHHHHCCCCHHHHHHHHCCCCHHHHHHHHCCHHHHHHHHHCCCC LTLPLKAPEYSKKKAMEYFARAGYREMGTDGVLKNERGERLVVELTFADSSTLMTNVCSI EEEEECCCCHHHHHHHHHHHHCCCHHCCCCCCEECCCCCEEEEEEEECCCHHHHHHHHHH LRQEALKCGVDLRLDSLTYSVCSRKVFEKRYQAALWAWPLQTPFPRLYETFSSELAYDAR HHHHHHHCCCCEEEHHHHHHHHHHHHHHHHHHHHHEECCCCCCCHHHHHHHHHHHEECCC GNPVGNTNNIMAVSDAGLDAALDAERNAPDTGSLKLALHRAQQRLHELCVWIPGWREPYT CCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHEECCCCCCCHH HIACWRWIRWPESPTRFCSPRIYNPLESHLYWVDEEMKKETLEARSRGVPFEEKHQIIYL HHHHHHHCCCCCCCHHHCCCCCCCCHHHHEEECCHHHHHHHHHHHHCCCCCCCCCCEEEE ERQDGAAP ECCCCCCC >Mature Secondary Structure MDGRGMPAVFDGFVERWNRAVERELRRELAEGERKEKLLVVSRGEGDGRRLAEQARSVER CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCHHHHHHHHHHHHH IRKRLLLKDHIRLLPLEDVPDGLEWQTGMEEPDVGDPRAVKGGKVRLWINTPFPGTLRAF HHHHHHHHHCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCEEEC GPGSENFFNYSAIDNVWLPLVGLHPETFRPIPGLADRWALSADGKTVFYHLDPEAAYSDG CCCCCCCEECHHHCCCHHCEECCCCCCCCCCCCCHHHEEEECCCCEEEEEECCCCCCCCC RIVKAQDFLLNICLRTSGFARDPFWTTLFRSMYDQITVYGDSVIALTLPSRGPLLPYMAC CEEEHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHEEEECCEEEEEECCCCCCCCCHHHH ADFHPAHPGFYHDFNALFLERYQWKVPPNTGGYMVVPSKIRIGESITLARVKNWWAKERK CCCCCCCCCCHHHHHHHHHHHHEEECCCCCCCEEEECCEEECCCCCHHHHHHHHHHHHHH YYRHSCNADQVEHVFVNMENKAVEMFRRGELDIMNVRKPEVWETGLELPEVHRGYIDKYS HHHHCCCCCCEEEEEEECCHHHHHHHHCCCCEEEECCCCHHHHCCCCCCHHHHCCCCCEE LEANYACPPYGLYLNCSDKLLKNPDIRRGLAHSVNMGLVIDTLFRGNMRRLGSYMEGYGD CCCCCCCCCCCEEECCCHHHHCCCCHHHHHHHHCCCCHHHHHHHHCCHHHHHHHHHCCCC LTLPLKAPEYSKKKAMEYFARAGYREMGTDGVLKNERGERLVVELTFADSSTLMTNVCSI EEEEECCCCHHHHHHHHHHHHCCCHHCCCCCCEECCCCCEEEEEEEECCCHHHHHHHHHH LRQEALKCGVDLRLDSLTYSVCSRKVFEKRYQAALWAWPLQTPFPRLYETFSSELAYDAR HHHHHHHCCCCEEEHHHHHHHHHHHHHHHHHHHHHEECCCCCCCHHHHHHHHHHHEECCC GNPVGNTNNIMAVSDAGLDAALDAERNAPDTGSLKLALHRAQQRLHELCVWIPGWREPYT CCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHEECCCCCCCHH HIACWRWIRWPESPTRFCSPRIYNPLESHLYWVDEEMKKETLEARSRGVPFEEKHQIIYL HHHHHHHCCCCCCCHHHCCCCCCCCHHHHEEECCHHHHHHHHHHHHCCCCCCCCCCEEEE ERQDGAAP ECCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: Ni2; [Periplasm]; ATP; H2O [C]
Specific reaction: Ni2+ [Periplasm] + ATP + H2O = Ni2+ [Cytoplasm] + ADP + phosphate [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA