Definition | Akkermansia muciniphila ATCC BAA-835, complete genome. |
---|---|
Accession | NC_010655 |
Length | 2,664,102 |
The map label for this gene is yggB [C]
Identifier: 187735813
GI number: 187735813
Start: 1597950
End: 1598852
Strand: Reverse
Name: yggB [C]
Synonym: Amuc_1320
Alternate gene names: 187735813
Gene position: 1598852-1597950 (Counterclockwise)
Preceding gene: 187735814
Following gene: 187735812
Centisome position: 60.01
GC content: 54.82
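The coordinate fields above appear to be related by simple arithmetic. The following is a minimal Python sketch, assuming that the centisome position is the gene's 5' coordinate (1598852, since the gene lies on the reverse strand) expressed as a percentage of the genome length. That formula is an assumption that happens to reproduce the listed value, not a documented definition, and the function names are hypothetical.

```python
# Values copied from this record.
GENOME_LENGTH = 2_664_102
GENE_START = 1_597_950  # lower coordinate
GENE_END = 1_598_852    # higher coordinate; assumed 5' end on the reverse strand


def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive coordinate range."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of genome length (assumed definition)."""
    return round(100.0 * position / genome_length, 2)


length = gene_length(GENE_START, GENE_END)
# One codon is the stop codon, so the protein has (length / 3) - 1 residues.
residues = length // 3 - 1

print(length)                                 # 903
print(residues)                               # 300
print(centisome(GENE_END, GENOME_LENGTH))     # 60.01
```

Using the lower coordinate (1597950) instead gives about 59.98, which is why the sketch assumes the higher coordinate is the reference point.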
Gene sequence:
>903_bases ATGAATCTCATGGCAGACGCATCCTCCAGGGAAGATATCTGGACAAAAACCATGGACTTCCTGAACATGCCGGACTGGCT CAAGCTGGAAATGGTGCAGTCCGTGGGAATCAAAATCCTGCATGTGGCGGTATGGCTGGTGATCAGCTGGATACTCCTGA AACTGTTCTGCCAGGTGCTCCGGAAAATCACTGCGTTGAAAATGTCGCCCCAGGCCTCCCGCCTGACGGTAAAAATCGTC AAAAACGTCGGCTACATTATTATAGGCGTGGAGGCCTTCGCCCTGATGGGGTTCGATATCCTGACCCTGCTGGGCGCGGC CAGCATCATCGGCGTCGCCGTAGGCTTCGCCTCCCAGACCTCCCTGTCCAACATCATCAGCGGCCTCTTCCTGGTAGGGG AAAAGCAGATCAACCTGGGGGATATGATTGAAGTGAATGGAATCACCGGGAACGTGGACTCCATCAACCTGATGTCCGTC CAGCTCCGGCTGCCGAACAACACCATGGTGCGCATCCCCAATGAAATAATCATCAAGAATCCGGTCAGCAATATCACCCG TTTTTCCACGCGCCGGTGCGACCTGAGCCTGGGCGTGGACTATAACTGCGACATTGAGCATGTCGTGAACGTCCTCCGGG AAGTAGTCAAACAAAACAAATTCTGCCTGGATGACCCCGCGCCCCTCATCTCGTTTTCAGGGTTCCAGGACTCTTCCCTG GGCTTTACCGTAGGGGCCTGGTGCCGGAAAGACAATTTCCTGGACTGCCAGAAAACGCTCGCCCATGACATCAAGCGCCG CTTTGAGGAAGAAGGCATTTCCTTCCCCTTCCCCACCCGCAGCCTGGAGAGCAGAAGCCCCATCAAGGTGGAAATCTCCG GACCTCCGGAGAAAAATCAATAA
Upstream 100 bases:
>100_bases GAACACCCTGCCCTTTCCGCTGCCGGAGGAACTGTCATAAACCGGGGGGAATACGCGCGCGCCCCTTGAGCCGGGGAGAA GCGTCTGCTACATCCTTCAT
Downstream 100 bases:
>100_bases GGAATCTCTTTTTGCTTTCTTTTTGCCCCGTGAAATTTCAATCTTGTCTGCGTAAAATCCGCATCTATGTCTGACACATC CGCACCAGCCTATCGAAGAA
Product: MscS Mechanosensitive ion channel
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 300; Mature: 300
Protein sequence:
>300_residues MNLMADASSREDIWTKTMDFLNMPDWLKLEMVQSVGIKILHVAVWLVISWILLKLFCQVLRKITALKMSPQASRLTVKIV KNVGYIIIGVEAFALMGFDILTLLGAASIIGVAVGFASQTSLSNIISGLFLVGEKQINLGDMIEVNGITGNVDSINLMSV QLRLPNNTMVRIPNEIIIKNPVSNITRFSTRRCDLSLGVDYNCDIEHVVNVLREVVKQNKFCLDDPAPLISFSGFQDSSL GFTVGAWCRKDNFLDCQKTLAHDIKRRFEEEGISFPFPTRSLESRSPIKVEISGPPEKNQ
Sequences:
>Translated_300_residues MNLMADASSREDIWTKTMDFLNMPDWLKLEMVQSVGIKILHVAVWLVISWILLKLFCQVLRKITALKMSPQASRLTVKIV KNVGYIIIGVEAFALMGFDILTLLGAASIIGVAVGFASQTSLSNIISGLFLVGEKQINLGDMIEVNGITGNVDSINLMSV QLRLPNNTMVRIPNEIIIKNPVSNITRFSTRRCDLSLGVDYNCDIEHVVNVLREVVKQNKFCLDDPAPLISFSGFQDSSL GFTVGAWCRKDNFLDCQKTLAHDIKRRFEEEGISFPFPTRSLESRSPIKVEISGPPEKNQ
>Mature_300_residues MNLMADASSREDIWTKTMDFLNMPDWLKLEMVQSVGIKILHVAVWLVISWILLKLFCQVLRKITALKMSPQASRLTVKIV KNVGYIIIGVEAFALMGFDILTLLGAASIIGVAVGFASQTSLSNIISGLFLVGEKQINLGDMIEVNGITGNVDSINLMSV QLRLPNNTMVRIPNEIIIKNPVSNITRFSTRRCDLSLGVDYNCDIEHVVNVLREVVKQNKFCLDDPAPLISFSGFQDSSL GFTVGAWCRKDNFLDCQKTLAHDIKRRFEEEGISFPFPTRSLESRSPIKVEISGPPEKNQ
Specific function: Unknown
COG id: COG0668
COG function: function code M; Small-conductance mechanosensitive channel
Gene ontology:
Cell location: Cell membrane; Multi-pass membrane protein (Potential) [H]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the mscS (TC 1.A.23) family [H]
Homologues:
Organism=Escherichia coli, GI1789291, Length=179, Percent_Identity=31.8435754189944, Blast_Score=106, Evalue=2e-24
Organism=Escherichia coli, GI1786670, Length=212, Percent_Identity=26.8867924528302, Blast_Score=76, Evalue=3e-15
Organism=Escherichia coli, GI2367355, Length=247, Percent_Identity=23.4817813765182, Blast_Score=75, Evalue=7e-15
Organism=Escherichia coli, GI1787591, Length=247, Percent_Identity=25.1012145748988, Blast_Score=69, Evalue=3e-13
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR010920
- InterPro: IPR011066
- InterPro: IPR006685
- InterPro: IPR006686
- InterPro: IPR011014 [H]
Pfam domain/function: PF00924 MS_channel [H]
EC number: NA
Molecular weight: Translated: 33448; Mature: 33448
Theoretical pI: Translated: 7.79; Mature: 7.79
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
2.0% Cys (Translated Protein)
3.3% Met (Translated Protein)
5.3% Cys+Met (Translated Protein)
2.0% Cys (Mature Protein)
3.3% Met (Mature Protein)
5.3% Cys+Met (Mature Protein)
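The Cys/Met percentages above can be reproduced by counting residues in the 300-residue protein sequence listed in this record. The following is a minimal Python sketch, assuming each percentage is the residue count divided by the sequence length and rounded to one decimal place. `residue_percent` is a hypothetical helper name, not part of any tool used to build this record.

```python
# Protein sequence copied from the "Protein sequence" field of this record.
PROTEIN = (
    "MNLMADASSREDIWTKTMDFLNMPDWLKLEMVQSVGIKILHVAVWLVISWILLKLFCQVLRKITALKMSPQASRLTVKIV"
    "KNVGYIIIGVEAFALMGFDILTLLGAASIIGVAVGFASQTSLSNIISGLFLVGEKQINLGDMIEVNGITGNVDSINLMSV"
    "QLRLPNNTMVRIPNEIIIKNPVSNITRFSTRRCDLSLGVDYNCDIEHVVNVLREVVKQNKFCLDDPAPLISFSGFQDSSL"
    "GFTVGAWCRKDNFLDCQKTLAHDIKRRFEEEGISFPFPTRSLESRSPIKVEISGPPEKNQ"
)


def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given one-letter residues."""
    hits = sum(seq.count(r) for r in residues)
    return round(100.0 * hits / len(seq), 1)


print(len(PROTEIN))                    # 300
print(residue_percent(PROTEIN, "C"))   # 2.0  (6 of 300)
print(residue_percent(PROTEIN, "M"))   # 3.3  (10 of 300)
print(residue_percent(PROTEIN, "CM"))  # 5.3  (16 of 300)
```

Because the translated and mature sequences in this record are identical, the same figures apply to both.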
Secondary structure:
>Translated Secondary Structure
MNLMADASSREDIWTKTMDFLNMPDWLKLEMVQSVGIKILHVAVWLVISWILLKLFCQVL
CCCCCCCCCCHHHHHHHHHHCCCCCHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHH
RKITALKMSPQASRLTVKIVKNVGYIIIGVEAFALMGFDILTLLGAASIIGVAVGFASQT
HHHHHHCCCCCHHHHHHEEHHHCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHH
SLSNIISGLFLVGEKQINLGDMIEVNGITGNVDSINLMSVQLRLPNNTMVRIPNEIIIKN
HHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCCEEEEEEEEECCCCCEEECCCEEEEEC
PVSNITRFSTRRCDLSLGVDYNCDIEHVVNVLREVVKQNKFCLDDPAPLISFSGFQDSSL
CHHHHHHHHHCCCCEEECCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCEEECCCCCCCC
GFTVGAWCRKDNFLDCQKTLAHDIKRRFEEEGISFPFPTRSLESRSPIKVEISGPPEKNQ
CEEEEHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEECCCCCCCC
>Mature Secondary Structure
MNLMADASSREDIWTKTMDFLNMPDWLKLEMVQSVGIKILHVAVWLVISWILLKLFCQVL
CCCCCCCCCCHHHHHHHHHHCCCCCHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHH
RKITALKMSPQASRLTVKIVKNVGYIIIGVEAFALMGFDILTLLGAASIIGVAVGFASQT
HHHHHHCCCCCHHHHHHEEHHHCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHH
SLSNIISGLFLVGEKQINLGDMIEVNGITGNVDSINLMSVQLRLPNNTMVRIPNEIIIKN
HHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCCEEEEEEEEECCCCCEEECCCEEEEEC
PVSNITRFSTRRCDLSLGVDYNCDIEHVVNVLREVVKQNKFCLDDPAPLISFSGFQDSSL
CHHHHHHHHHCCCCEEECCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCEEECCCCCCCC
GFTVGAWCRKDNFLDCQKTLAHDIKRRFEEEGISFPFPTRSLESRSPIKVEISGPPEKNQ
CEEEEHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEECCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 7.0
TargetDB status: NA
Availability: NA
References: 9389475 [H]