Definition | Akkermansia muciniphila ATCC BAA-835, complete genome. |
---|---|
Accession | NC_010655 |
Length | 2,664,102 |
Click here to switch to the map view.
The map label for this gene is yfbS [C]
Identifier: 187735981
GI number: 187735981
Start: 1780160
End: 1782034
Strand: Reverse
Name: yfbS [C]
Synonym: Amuc_1490
Alternate gene names: 187735981
Gene position: 1782034-1780160 (Counterclockwise)
Preceding gene: 187735983
Following gene: 187735977
Centisome position: 66.89
GC content: 56.91
Gene sequence:
>1875_bases ATGCTTCCTCTTCTCAACATCAACTCCGCCTCCGTCATAGGCTGGCTGGAATCAGCGGCCGCCCAGCAATGGATTGTAGG CATTCTGCTGGTCCTGTTATTCATCAGCTTCATCAAGGAATGGATGCCCGTGGAAATCACAGCCCTGACAGGCACCGCCG TCCTCATGCTCACGGGAATTCTGAGCACGCGTGATGTGCTGTCCAGCTTTGCCAACAGCGGACCGCTGACAGTGGTGTGC ATGTTCATTCTGAGTGCTTCCCTGGAAAGAACGGGATTAATAGGAGACCTTTCCAAACTGTTCAACAAGGTAGCCAAGGG GAGGGAACTGACCGCCCTGCTGGTCATTACACTGGGAGCGTTCATGGTTTCCCCCTTCGTCAACAACACCCCGGTGGTCG TCATCCTGATGCCCATCGTGCTGGCCTTCTGCAGGGATCATAACATCGCGGCCTCCAAGCTGCTCATCCCCCTCTCCTAC GCCACCATTCTGGGAGGCACCTGCTCCGTGGTCGGAACATCCACCAACGTAGTTGTGCTCGGCCAGGTGCAGAAACTGGG TTATGACGGCATCCAGATGTTCACGGTAACGCCCATGGGCCTGATTTATGCGGCGGCGGGCCTGCTCTACCTCTGGACAT TAGGCCGCAAATGGCTTCCATCCCGCCCCACTTTGTCCACCATGCTTCCGGGCGGCATCCAGCGCGATTTCCTGCTCCAG GTCAGAATTCCCGCGGATTCCCCCCACATCGGCACCACTCCCATCAACCTGATGCAAACCGAGTTGCTGGGCACCAAAAT CGTGGAAGTTCGCCGCAGGGGATTCTCCATGCAGGAAGAGCTGCAGCACATCAACCTGGAAGAAGGAGACCGCATTCTTT TCCTGTGCAACGCCAGAAAAGTCAACCAGGTGAGGGAAGCCAAAGGCGTGGACCTGGGCTGGGATGACAGCCGCGGGCTG GAAACGCTGGAGCAGCGCGATGTGCAAATCGTGGAAGGCATGATCGCCAACAATTCCGAATTCGCCGGGCTGTCCCTGTC CGAACTCAAGCTGCGCCAGAGATTCAACATCTTCGTGCTCGCCATCCACAGGCAGGGCAAAAACATCACGGACATGGGGC CGAACACCAAGCTGGCGGCAGGGGACACGCTTCTTCTGGAAGGACCGCAGGAAGGAATGAACCGCATCCTGACCAAGCAG CGCATCATCCCCCTGAGCCAGCGTCCTGCGGAAGCGCACAACCGCAGCAAACAGGGCTGGGCCATCCTGGCCATGGGGCT GTTCATTTTTATCGGCCTGCTGGGCTCCTTTGAGCAATACGGGGAATTCTTCAAATTCTTCGCGCGCTTCAATCCCTTCT ATCTCGCCTACATCGGAGCTCTCATCGTCATCATCTCCGGCTGCATCAAGCCGAAGGAAGCCTACCAGGCGGTGGACTGG GGCATTATTTTTCTGATTCTGGGAATGCTGTGCGTGGGAGAAGCCATGAGCAAAACCGGGCTTGCCAAAGCCATTGCTTT CGGCGTAGTGGATAATATAGGCCCGTTGGGGTGCCTGGTCGCCATCTCCGGCCTGTACCTGATCTGCTCTATCCTGACGG AGATGATCTCCAACAATGCCGTAGCGGCCGTCATGGGGCCTCTGGCTTATGAAATGGCCCTGCAATTCGACGCCAACCCC ATTCCCTTCATTCTGGCTGTCATGTTCGGCGCCAGCGCCAGCTTCTCCACCCCCATCGGCTACCAGACCAACACTTACGT GTACAATGCGGGCGGTTACAAATTTAAGGACTTCGTCAAAGTGGGACTCCCCCTCAACCTGCTCCTCTGGGTCATTTTTA CCTGCGCCATCGGCTGGTTGTATCCGCTCAAGTAG
Upstream 100 bases:
>100_bases CCTTTTCAAACCCTTTCCGGAAAATTATTGCGGCTTGACAGATAGAAGCCGGGAAAGTAGGAGAGTTGGACCCTTTCCCC ACACCTTCAACTTTCAAACC
Downstream 100 bases:
>100_bases AGCGATAATAGCTTTGGAAGGCCCGGTCAACGGAAAGGAAACCAAGCACCCGGTCAAAGGCGCCATCATTCCCACGGGAA CAAAAAACAATATCCCTGTT
Product: TrkA-C domain protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 624; Mature: 624
Protein sequence:
>624_residues MLPLLNINSASVIGWLESAAAQQWIVGILLVLLFISFIKEWMPVEITALTGTAVLMLTGILSTRDVLSSFANSGPLTVVC MFILSASLERTGLIGDLSKLFNKVAKGRELTALLVITLGAFMVSPFVNNTPVVVILMPIVLAFCRDHNIAASKLLIPLSY ATILGGTCSVVGTSTNVVVLGQVQKLGYDGIQMFTVTPMGLIYAAAGLLYLWTLGRKWLPSRPTLSTMLPGGIQRDFLLQ VRIPADSPHIGTTPINLMQTELLGTKIVEVRRRGFSMQEELQHINLEEGDRILFLCNARKVNQVREAKGVDLGWDDSRGL ETLEQRDVQIVEGMIANNSEFAGLSLSELKLRQRFNIFVLAIHRQGKNITDMGPNTKLAAGDTLLLEGPQEGMNRILTKQ RIIPLSQRPAEAHNRSKQGWAILAMGLFIFIGLLGSFEQYGEFFKFFARFNPFYLAYIGALIVIISGCIKPKEAYQAVDW GIIFLILGMLCVGEAMSKTGLAKAIAFGVVDNIGPLGCLVAISGLYLICSILTEMISNNAVAAVMGPLAYEMALQFDANP IPFILAVMFGASASFSTPIGYQTNTYVYNAGGYKFKDFVKVGLPLNLLLWVIFTCAIGWLYPLK
Sequences:
>Translated_624_residues MLPLLNINSASVIGWLESAAAQQWIVGILLVLLFISFIKEWMPVEITALTGTAVLMLTGILSTRDVLSSFANSGPLTVVC MFILSASLERTGLIGDLSKLFNKVAKGRELTALLVITLGAFMVSPFVNNTPVVVILMPIVLAFCRDHNIAASKLLIPLSY ATILGGTCSVVGTSTNVVVLGQVQKLGYDGIQMFTVTPMGLIYAAAGLLYLWTLGRKWLPSRPTLSTMLPGGIQRDFLLQ VRIPADSPHIGTTPINLMQTELLGTKIVEVRRRGFSMQEELQHINLEEGDRILFLCNARKVNQVREAKGVDLGWDDSRGL ETLEQRDVQIVEGMIANNSEFAGLSLSELKLRQRFNIFVLAIHRQGKNITDMGPNTKLAAGDTLLLEGPQEGMNRILTKQ RIIPLSQRPAEAHNRSKQGWAILAMGLFIFIGLLGSFEQYGEFFKFFARFNPFYLAYIGALIVIISGCIKPKEAYQAVDW GIIFLILGMLCVGEAMSKTGLAKAIAFGVVDNIGPLGCLVAISGLYLICSILTEMISNNAVAAVMGPLAYEMALQFDANP IPFILAVMFGASASFSTPIGYQTNTYVYNAGGYKFKDFVKVGLPLNLLLWVIFTCAIGWLYPLK >Mature_624_residues MLPLLNINSASVIGWLESAAAQQWIVGILLVLLFISFIKEWMPVEITALTGTAVLMLTGILSTRDVLSSFANSGPLTVVC MFILSASLERTGLIGDLSKLFNKVAKGRELTALLVITLGAFMVSPFVNNTPVVVILMPIVLAFCRDHNIAASKLLIPLSY ATILGGTCSVVGTSTNVVVLGQVQKLGYDGIQMFTVTPMGLIYAAAGLLYLWTLGRKWLPSRPTLSTMLPGGIQRDFLLQ VRIPADSPHIGTTPINLMQTELLGTKIVEVRRRGFSMQEELQHINLEEGDRILFLCNARKVNQVREAKGVDLGWDDSRGL ETLEQRDVQIVEGMIANNSEFAGLSLSELKLRQRFNIFVLAIHRQGKNITDMGPNTKLAAGDTLLLEGPQEGMNRILTKQ RIIPLSQRPAEAHNRSKQGWAILAMGLFIFIGLLGSFEQYGEFFKFFARFNPFYLAYIGALIVIISGCIKPKEAYQAVDW GIIFLILGMLCVGEAMSKTGLAKAIAFGVVDNIGPLGCLVAISGLYLICSILTEMISNNAVAAVMGPLAYEMALQFDANP IPFILAVMFGASASFSTPIGYQTNTYVYNAGGYKFKDFVKVGLPLNLLLWVIFTCAIGWLYPLK
Specific function: Unknown
COG id: COG0471
COG function: function code P; Di- and tricarboxylate transporters
Gene ontology:
Cell location: Cell membrane; Multi-pass membrane protein (Potential) [H]
Metaboloic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 2 RCK C-terminal domains [H]
Homologues:
Organism=Escherichia coli, GI1788629, Length=621, Percent_Identity=27.0531400966184, Blast_Score=184, Evalue=1e-47,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001898 - InterPro: IPR006037 [H]
Pfam domain/function: PF00939 Na_sulph_symp; PF02080 TrkA_C [H]
EC number: NA
Molecular weight: Translated: 68178; Mature: 68178
Theoretical pI: Translated: 8.44; Mature: 8.44
Prosite motif: PS00178 AA_TRNA_LIGASE_I
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.4 %Cys (Translated Protein) 3.4 %Met (Translated Protein) 4.8 %Cys+Met (Translated Protein) 1.4 %Cys (Mature Protein) 3.4 %Met (Mature Protein) 4.8 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MLPLLNINSASVIGWLESAAAQQWIVGILLVLLFISFIKEWMPVEITALTGTAVLMLTGI CCCEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEECHHHHHHHHHH LSTRDVLSSFANSGPLTVVCMFILSASLERTGLIGDLSKLFNKVAKGRELTALLVITLGA HHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHHHHH FMVSPFVNNTPVVVILMPIVLAFCRDHNIAASKLLIPLSYATILGGTCSVVGTSTNVVVL HHHHCCCCCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCEEEEECCCCEEEE GQVQKLGYDGIQMFTVTPMGLIYAAAGLLYLWTLGRKWLPSRPTLSTMLPGGIQRDFLLQ ECHHHCCCCCEEEEEECHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCCCCCEEEE VRIPADSPHIGTTPINLMQTELLGTKIVEVRRRGFSMQEELQHINLEEGDRILFLCNARK EEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCCCCEEEEEECCHH VNQVREAKGVDLGWDDSRGLETLEQRDVQIVEGMIANNSEFAGLSLSELKLRQRFNIFVL HHHHHHHCCCCCCCCCCCCHHHHHHCCHHHHHHHHCCCCCCCCCCHHHHHHHHCCCEEEE AIHRQGKNITDMGPNTKLAAGDTLLLEGPQEGMNRILTKQRIIPLSQRPAEAHNRSKQGW EEEECCCCCCCCCCCCEEECCCEEEEECCHHHHHHHHHHHHCCCCCCCCCHHHCCCCCCH AILAMGLFIFIGLLGSFEQYGEFFKFFARFNPFYLAYIGALIVIISGCIKPKEAYQAVDW HHHHHHHHHHHHHHHCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHH GIIFLILGMLCVGEAMSKTGLAKAIAFGVVDNIGPLGCLVAISGLYLICSILTEMISNNA HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHCCCC VAAVMGPLAYEMALQFDANPIPFILAVMFGASASFSTPIGYQTNTYVYNAGGYKFKDFVK HHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCEEECCCCCCHHHHHH VGLPLNLLLWVIFTCAIGWLYPLK HCCCHHHHHHHHHHHHHHHHCCCC >Mature Secondary Structure MLPLLNINSASVIGWLESAAAQQWIVGILLVLLFISFIKEWMPVEITALTGTAVLMLTGI CCCEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEECHHHHHHHHHH LSTRDVLSSFANSGPLTVVCMFILSASLERTGLIGDLSKLFNKVAKGRELTALLVITLGA HHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHHHHH FMVSPFVNNTPVVVILMPIVLAFCRDHNIAASKLLIPLSYATILGGTCSVVGTSTNVVVL HHHHCCCCCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCEEEEECCCCEEEE GQVQKLGYDGIQMFTVTPMGLIYAAAGLLYLWTLGRKWLPSRPTLSTMLPGGIQRDFLLQ ECHHHCCCCCEEEEEECHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCCCCCEEEE VRIPADSPHIGTTPINLMQTELLGTKIVEVRRRGFSMQEELQHINLEEGDRILFLCNARK EEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCCCCEEEEEECCHH VNQVREAKGVDLGWDDSRGLETLEQRDVQIVEGMIANNSEFAGLSLSELKLRQRFNIFVL HHHHHHHCCCCCCCCCCCCHHHHHHCCHHHHHHHHCCCCCCCCCCHHHHHHHHCCCEEEE AIHRQGKNITDMGPNTKLAAGDTLLLEGPQEGMNRILTKQRIIPLSQRPAEAHNRSKQGW EEEECCCCCCCCCCCCEEECCCEEEEECCHHHHHHHHHHHHCCCCCCCCCHHHCCCCCCH AILAMGLFIFIGLLGSFEQYGEFFKFFARFNPFYLAYIGALIVIISGCIKPKEAYQAVDW HHHHHHHHHHHHHHHCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHH GIIFLILGMLCVGEAMSKTGLAKAIAFGVVDNIGPLGCLVAISGLYLICSILTEMISNNA HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHCCCC VAAVMGPLAYEMALQFDANPIPFILAVMFGASASFSTPIGYQTNTYVYNAGGYKFKDFVK HHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCEEECCCCCCHHHHHH VGLPLNLLLWVIFTCAIGWLYPLK HCCCHHHHHHHHHHHHHHHHCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 8905231 [H]