Definition Akkermansia muciniphila ATCC BAA-835, complete genome.
Accession NC_010655
Length 2,664,102 bp

The map label for this gene is yhaI [C]

Identifier: 187735973

GI number: 187735973

Start: 1772177

End: 1773292

Strand: Reverse

Name: yhaI [C]

Synonym: Amuc_1482

Alternate gene names: 187735973

Gene position: 1773292-1772177 (Counterclockwise)

Preceding gene: 187735974

Following gene: 187735970

Centisome position: 66.56
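
The coordinate fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the centisome position is the gene's 5' coordinate (the higher coordinate on the reverse strand) expressed as a percentage of the genome length. That assumption reproduces the listed value, but the record itself does not state the formula.

```python
# Values copied from the record (1-based, inclusive coordinates).
GENOME_LENGTH = 2_664_102   # Length field of NC_010655
START, END = 1_772_177, 1_773_292


def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive 1-based interval."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, to 2 decimals."""
    return round(100.0 * position / genome_length, 2)


length_bp = gene_length(START, END)   # 1116, matches ">1116_bases"
codons = length_bp // 3               # 372 = 371 residues + stop codon

# Assumption: on the reverse strand the gene begins at the higher coordinate.
gene_start = END
print(length_bp, codons, centisome(gene_start, GENOME_LENGTH))
```

Under these assumptions the script prints 1116, 372 and 66.56, consistent with the sequence length, the 371-residue translation plus one stop codon, and the listed centisome position.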

GC content: 54.75

Gene sequence:

>1116_bases
ATGCCGGAATCTCCTGCTTCTCCTGCTTCTCCTGCTGTTTCCTCCGTTCCTCCAACCTCCACGCAGGCATCTTCCCTCCT
TTCCCCCCTCTCCTGCTGGAAAAAAGGTTTTCTTCATTATGCGGACTTTCGGGGCTGCGCTTCCCGTGCGGAATTCTGGT
GGTTCATGGCTCTTCCCCTCCTGGCGCTGATTCCAGCCCTGGCAGGGTATATCCTGACGGACTGGCTGCACATCCCTGAT
ACAAGACTGAGCATCTACGGAGACGCCTTAACCATCCTCCTGTGGGCGGTGTTATTCATCCCAAGCATATCAGCCGCGTT
CAGACGCCTGCATGATACAGGCAGGAGCGGCCTCTGGCTTTTTTCCCTTTTCATTCCCTTCGGGCTGGGGCATCTGATCT
TTTTTTATCTGACGCTAGGAGAAAGCAAGGCGGACGGCAACAAATACAGCCGCCGTCCGGAGCCCCAACCGGCTGATCCC
CCTGCCGGAAAACTGAAAGAGCAGCCATTGACTCCGTTTTACCTTTACTGGCTCATCAGCCTGCGGAAATTGAATACGGT
GGCAGGCCGCGCGTCCCGGACGGAATTCTGGTCCTTTTTCCTCCTTTCCGTCCTCCTGTTCCTTCCGCTGGGCTACAGCA
TGATAGACGTTGACAGCCAGCCGGCGGGTTTTTATGTCTCTCCTTCCCTCCAAATCCTGTTATATGCCGCCCATCCGCAA
GATGCTCTGATCCTGCTGGCTCACTCCTGCTTCAATCCCACCTTTTACTTTTTCTACCAATCCGGAGAGCTGAGCATGCT
TTCCCTGGAGCTTCTGGCAGCCGTGGCGGGGCTCAATATCCTCTTCAATCTGCCGGTCGCCGTGCGCCGCCTGCATGACA
GCAATCTGAGCGGAAAATTCATCCTGATTCCCATTCTTATTTTCATCGTCACTTTCCTGCTGATTTTCCTGCTGCGCCTG
GTCCCGGAGGACATGGCCCCCTATCTGGACTACCTGGGAATGGTGTCCAGCCTGATGGATCTGCTTTCCATCCTCTTCCT
GTCCATGATGCTTCTTAAAAGCTCGCCAGGCCCCAATGAATACGGCGTGCTTCCGCAAAAAATAACCGTATCCTGA
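
The GC content field can be recomputed from the gene sequence above. The following is a minimal sketch with a hypothetical helper name; it assumes GC content is simply the percentage of G and C bases over all bases, which may differ from the method the database used.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G/C bases in a nucleotide sequence, to 2 decimals.

    Whitespace and line breaks (as in a wrapped FASTA body) are ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return round(100.0 * gc / len(bases), 2)


# Toy example; paste the 1116-base gene sequence here to check the record.
print(gc_content("ATGC\nGGCC"))
```

For reference, the listed value of 54.75 would correspond to 611 G or C bases out of 1116 under this definition.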

Upstream 100 bases:

>100_bases
TCTGAACCGGCTGCAGGAGGGAACTTTCCATGTCCTGGGCACAGACGCATTGAGCATTCCCCGGCATCAGGGAGTCTAAA
CGGAAACCCCCTTTTCATCC

Downstream 100 bases:

>100_bases
TTCCAGGATACGGTTGAAAAGCAATCAAAACCCTATCGGGTACAATCAATTTCCCGTGACGCAGACGGAGAACGTCAGTC
CCCGAAGCATACGCCCAAAT

Product: hypothetical protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 371; Mature: 370

Protein sequence:

>371_residues
MPESPASPASPAVSSVPPTSTQASSLLSPLSCWKKGFLHYADFRGCASRAEFWWFMALPLLALIPALAGYILTDWLHIPD
TRLSIYGDALTILLWAVLFIPSISAAFRRLHDTGRSGLWLFSLFIPFGLGHLIFFYLTLGESKADGNKYSRRPEPQPADP
PAGKLKEQPLTPFYLYWLISLRKLNTVAGRASRTEFWSFFLLSVLLFLPLGYSMIDVDSQPAGFYVSPSLQILLYAAHPQ
DALILLAHSCFNPTFYFFYQSGELSMLSLELLAAVAGLNILFNLPVAVRRLHDSNLSGKFILIPILIFIVTFLLIFLLRL
VPEDMAPYLDYLGMVSSLMDLLSILFLSMMLLKSSPGPNEYGVLPQKITVS

Sequences:

>Translated_371_residues
MPESPASPASPAVSSVPPTSTQASSLLSPLSCWKKGFLHYADFRGCASRAEFWWFMALPLLALIPALAGYILTDWLHIPD
TRLSIYGDALTILLWAVLFIPSISAAFRRLHDTGRSGLWLFSLFIPFGLGHLIFFYLTLGESKADGNKYSRRPEPQPADP
PAGKLKEQPLTPFYLYWLISLRKLNTVAGRASRTEFWSFFLLSVLLFLPLGYSMIDVDSQPAGFYVSPSLQILLYAAHPQ
DALILLAHSCFNPTFYFFYQSGELSMLSLELLAAVAGLNILFNLPVAVRRLHDSNLSGKFILIPILIFIVTFLLIFLLRL
VPEDMAPYLDYLGMVSSLMDLLSILFLSMMLLKSSPGPNEYGVLPQKITVS
>Mature_370_residues
PESPASPASPAVSSVPPTSTQASSLLSPLSCWKKGFLHYADFRGCASRAEFWWFMALPLLALIPALAGYILTDWLHIPDT
RLSIYGDALTILLWAVLFIPSISAAFRRLHDTGRSGLWLFSLFIPFGLGHLIFFYLTLGESKADGNKYSRRPEPQPADPP
AGKLKEQPLTPFYLYWLISLRKLNTVAGRASRTEFWSFFLLSVLLFLPLGYSMIDVDSQPAGFYVSPSLQILLYAAHPQD
ALILLAHSCFNPTFYFFYQSGELSMLSLELLAAVAGLNILFNLPVAVRRLHDSNLSGKFILIPILIFIVTFLLIFLLRLV
PEDMAPYLDYLGMVSSLMDLLSILFLSMMLLKSSPGPNEYGVLPQKITVS
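
In this record the mature sequence differs from the translated one only by the missing N-terminal methionine (371 versus 370 residues). A minimal sketch of that relationship is shown below; the function name is hypothetical, and removal of the initiator methionine is inferred from the two sequences rather than stated by the record.

```python
def mature_from_translated(protein: str) -> str:
    """Drop a leading initiator Met, if present; otherwise return unchanged."""
    residues = "".join(protein.split()).upper()
    return residues[1:] if residues.startswith("M") else residues


# The first residues of the translated sequence in this record.
translated_prefix = "MPESPASPASPAVSSVPPTSTQ"
print(mature_from_translated(translated_prefix))
```

Applied to the full translated sequence, this should yield the 370-residue mature sequence listed above.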

Specific function: Unknown

COG id: COG3152

COG function: function code S; Predicted membrane protein

Gene ontology:

Cell location: Integral Membrane Protein [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 41353; Mature: 41222

Theoretical pI: Translated: 7.62; Mature: 7.62

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.8 %Cys     (Translated Protein)
2.4 %Met     (Translated Protein)
3.2 %Cys+Met (Translated Protein)
0.8 %Cys     (Mature Protein)
2.2 %Met     (Mature Protein)
3.0 %Cys+Met (Mature Protein)
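
The percentages above follow from residue counts. The sketch below uses a hypothetical helper and assumes the values are plain residue fractions rounded to one decimal place; the counts of 3 Cys and 9 Met in the translated protein are inferred from the listed sequence.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    hits = sum(1 for aa in seq if aa in residues.upper())
    return round(100.0 * hits / len(seq), 1)


# Toy example: 1 Cys and 1 Met in 10 residues.
toy = "MCAAAAAAAA"
print(residue_percent(toy, "C"), residue_percent(toy, "M"),
      residue_percent(toy, "CM"))

# Count-based check against the record: 3 Cys and 9 Met in 371 residues,
# 3 Cys and 8 Met in 370 residues once the initiator Met is removed.
print(round(100 * 3 / 371, 1), round(100 * 9 / 371, 1), round(100 * 12 / 371, 1))
print(round(100 * 3 / 370, 1), round(100 * 8 / 370, 1), round(100 * 11 / 370, 1))
```

The last two lines reproduce the listed values (0.8, 2.4, 3.2 for the translated protein and 0.8, 2.2, 3.0 for the mature protein).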

Secondary structure:

>Translated Secondary Structure
MPESPASPASPAVSSVPPTSTQASSLLSPLSCWKKGFLHYADFRGCASRAEFWWFMALPL
CCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHH
LALIPALAGYILTDWLHIPDTRLSIYGDALTILLWAVLFIPSISAAFRRLHDTGRSGLWL
HHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHH
FSLFIPFGLGHLIFFYLTLGESKADGNKYSRRPEPQPADPPAGKLKEQPLTPFYLYWLIS
HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHH
LRKLNTVAGRASRTEFWSFFLLSVLLFLPLGYSMIDVDSQPAGFYVSPSLQILLYAAHPQ
HHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCEEECCCEEEEEEECCCC
DALILLAHSCFNPTFYFFYQSGELSMLSLELLAAVAGLNILFNLPVAVRRLHDSNLSGKF
CHHHHHHHHHCCCEEEEEEECCCCHHHHHHHHHHHHHHHHHHHCHHHHHHHHCCCCCCCC
ILIPILIFIVTFLLIFLLRLVPEDMAPYLDYLGMVSSLMDLLSILFLSMMLLKSSPGPNE
HHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC
YGVLPQKITVS
CCCCCCEEECC
>Mature Secondary Structure 
PESPASPASPAVSSVPPTSTQASSLLSPLSCWKKGFLHYADFRGCASRAEFWWFMALPL
CCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHH
LALIPALAGYILTDWLHIPDTRLSIYGDALTILLWAVLFIPSISAAFRRLHDTGRSGLWL
HHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHH
FSLFIPFGLGHLIFFYLTLGESKADGNKYSRRPEPQPADPPAGKLKEQPLTPFYLYWLIS
HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHH
LRKLNTVAGRASRTEFWSFFLLSVLLFLPLGYSMIDVDSQPAGFYVSPSLQILLYAAHPQ
HHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCEEECCCEEEEEEECCCC
DALILLAHSCFNPTFYFFYQSGELSMLSLELLAAVAGLNILFNLPVAVRRLHDSNLSGKF
CHHHHHHHHHCCCEEEEEEECCCCHHHHHHHHHHHHHHHHHHHCHHHHHHHHCCCCCCCC
ILIPILIFIVTFLLIFLLRLVPEDMAPYLDYLGMVSSLMDLLSILFLSMMLLKSSPGPNE
HHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC
YGVLPQKITVS
CCCCCCEEECC
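
The prediction strings above use one letter per residue. Assuming the usual three-state convention (H for helix, E for strand, C for coil), which the record does not spell out, the composition of a prediction can be summarized as in this sketch; the function name is hypothetical.

```python
from collections import Counter


def ss_composition(prediction: str) -> dict[str, float]:
    """Percentage of H, E and C states in a secondary-structure string."""
    states = "".join(prediction.split()).upper()
    if not states:
        raise ValueError("empty prediction")
    counts = Counter(states)
    return {s: round(100.0 * counts.get(s, 0) / len(states), 1) for s in "HEC"}


# Toy example; concatenate the state lines of the record to summarize it.
print(ss_composition("HHHHCCEECC"))
```

Only the state lines (the rows of H, E and C) should be passed in, not the interleaved amino acid rows.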

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: NA