Definition | Akkermansia muciniphila ATCC BAA-835, complete genome. |
---|---|
Accession | NC_010655 |
Length | 2,664,102 |
The map label for this gene is lysS [H]
Identifier: 187735301
GI number: 187735301
Start: 935258
End: 936739
Strand: Reverse
Name: lysS [H]
Synonym: Amuc_0798
Alternate gene names: 187735301
Gene position: 936739-935258 (Counterclockwise)
Preceding gene: 187735303
Following gene: 187735300
Centisome position: 35.16
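The gene length and centisome position can be reproduced from the coordinates listed above. The following is a minimal sketch, assuming the centisome value is the gene's 5' coordinate (936739 on the reverse strand) expressed as a percentage of the genome length. That convention is inferred from the arithmetic and is not stated in the record; the function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 2_664_102   # Length
START, END = 935_258, 936_739  # Start / End (lower and upper coordinates)


def gene_length(start: int, end: int) -> int:
    """Number of bases spanned by inclusive 1-based coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the genome length."""
    return 100.0 * position / genome_length


print(gene_length(START, END))                      # 1482
# Assumption: the reverse-strand gene is anchored at its upper coordinate.
print(round(centisome(END, GENOME_LENGTH), 2))      # 35.16
```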
GC content: 57.15
Gene sequence:
>1482_bases ATGTCCGAACAACAGCAGGCACATCCGACTACGACGGAATCCGAACTTATTGCCGTGCGCCGCGACAAGCTCGCCAAAAT CCGTGAGCTGGGAATTGACCCCTACGGCGCAAGATTTGACGTGACCACCACTCCCGCCGGATTGAAAGCCGATTTCCAGG AAGACAAACAGGTAGCCGTCGCCGGCCGCCTTCTGGCTATTCGCGACATGGGCAAATCCCAGTTTTTCGTCATTGGGGAC GTACGGGGAAAAATCCAGGGGTTCCTGCATAAAAATGAGGTGGATGAAACCACCTGGAAGCTTTGGAAGCTCCTGGACCG CGGGGACTGGGTCGGCATCAGGGGAACCACGTTCCTGACCCGTACCGGAGAACCTACCGTAAAGGTATCCGGACTGACCA TTCTTTCCAAAAGCCTCCGCCCCCTGCCGGACAAGTGGCACGGGCTGGCGGACAAGGAGGTGACCTACCGCAAGCGCCAC CTGGACCTCATTTCCAATGAGGAAAGCGCCTCCCTGTTCGTTACGCGCTCCCTCATGATTGCGGAAATCCGCCGCTTTCT CCAGGAACGCGGTTATCTGGAAGTGGAAACCCCCATGCTTCAGGATGTGGCCGGTGGGGCCGCCGCCAAGCCGTTTGAAA CGTACCATAACGCGCTGGATATGCCGCTGACGCTGCGCATTGCCCCGGAGCTTTTCCTCAAACGCCTGATGGTGGGCGGC TTCACGAAAATTTTCGAACTCAACCGCAGCTTCCGCAATGAAGGAATTGACCGCCGCCACAACCCGGAATTCACCATGCT GGAAGCCTATTGCGCCTGCGGTGATTTTGAAACCATGGCGAATATGGTGGAGGAGCTCATCTGCCATCTGGCGGAAAAAT TCTGCGGTGGCCTTCAGATTGACCACAAGGACGCGGAAGGCAACGTTCTTTACACAATCGACCTCAGCCGGCCCTGGAGA CGCGCCGACTACCAGGACCTGATCCGGGGCGTGGCGGGGGAAGACTGGTTTGACATCTCCCCCGAAGCGCGCCGCGCCCG CTGTGAAGAGCTGGGAGTGGAAATCAGCCCGGATATGAAAGATGTGGATGTTTCCCAGCAGGTGTATGAAAAACTGGTGG AGGAAAAGACCATGAACCCCTGCTTTGTTACCCACGTGGCCAAGGACTTGGTTCCCCTGGCCAAGCTGAACCGGGAAAAT CCGGACGTAGTGGATGTATATGAACTGGTGATCAACGGGCAGGAAATTTCCCCCGGCTATTCGGAATTGAACGATCCTGA CGTGCAAAAGGAACGCCTGGAGCACCAGGCTGCCGGGGAGACCCAGCGTGTGGACTATGATTTCATTGAAACGCTGGAAT ACGGAATGCCCTCCGCAGGCGGCATCGGCATCGGCATCGACCGCGTCGTCATGATGCTGACGGGGGCCTCTTCCATCCGT GACGTGCTGCTCTTCCCGCAGTTGAAACGTAAGGACAGTTAA
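A GC content figure such as the one reported above is the share of G and C bases in the sequence. The sketch below shows one way to compute it; it is not the database's own procedure, and whether the reported 57.15 includes the stop codon is not stated in the record. The function name `gc_percent` is hypothetical, and the filter on A/C/G/T exists only to skip the spaces that wrap the sequence text.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq."""
    # Keep nucleotide letters only, so wrap spaces and newlines are ignored.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Usage on the first 30 bases of the gene sequence above.
print(round(gc_percent("ATGTCCGAACAACAGCAGGCACATCCGACT"), 2))
```

Passing the full 1482-base string from the record to `gc_percent` would give the value to compare against the reported GC content.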
Upstream 100 bases:
>100_bases GGGGGAGGAACCTTCGGGTCAAAACCGGCCGCCAAAAATGTCCCCTCTGCCTGACAATGCGCTAGCCAAAACGGAAGCAA GCCTTTAGAATCCCCGCACT
Downstream 100 bases:
>100_bases TCCGAACCACCGGCCGCATGGCTCCCCGCACCCCGGATATTCTGGACTGCGCTTCCGCCCAATGGATCGGCAAACGCCGC GAGCAGGAAGACGTGGTTAA
Product: lysyl-tRNA synthetase
Products: NA
Alternate protein names: Lysine--tRNA ligase; LysRS [H]
Number of amino acids: Translated: 493; Mature: 492
Protein sequence:
>493_residues MSEQQQAHPTTTESELIAVRRDKLAKIRELGIDPYGARFDVTTTPAGLKADFQEDKQVAVAGRLLAIRDMGKSQFFVIGD VRGKIQGFLHKNEVDETTWKLWKLLDRGDWVGIRGTTFLTRTGEPTVKVSGLTILSKSLRPLPDKWHGLADKEVTYRKRH LDLISNEESASLFVTRSLMIAEIRRFLQERGYLEVETPMLQDVAGGAAAKPFETYHNALDMPLTLRIAPELFLKRLMVGG FTKIFELNRSFRNEGIDRRHNPEFTMLEAYCACGDFETMANMVEELICHLAEKFCGGLQIDHKDAEGNVLYTIDLSRPWR RADYQDLIRGVAGEDWFDISPEARRARCEELGVEISPDMKDVDVSQQVYEKLVEEKTMNPCFVTHVAKDLVPLAKLNREN PDVVDVYELVINGQEISPGYSELNDPDVQKERLEHQAAGETQRVDYDFIETLEYGMPSAGGIGIGIDRVVMMLTGASSIR DVLLFPQLKRKDS
Sequences:
>Translated_493_residues MSEQQQAHPTTTESELIAVRRDKLAKIRELGIDPYGARFDVTTTPAGLKADFQEDKQVAVAGRLLAIRDMGKSQFFVIGD VRGKIQGFLHKNEVDETTWKLWKLLDRGDWVGIRGTTFLTRTGEPTVKVSGLTILSKSLRPLPDKWHGLADKEVTYRKRH LDLISNEESASLFVTRSLMIAEIRRFLQERGYLEVETPMLQDVAGGAAAKPFETYHNALDMPLTLRIAPELFLKRLMVGG FTKIFELNRSFRNEGIDRRHNPEFTMLEAYCACGDFETMANMVEELICHLAEKFCGGLQIDHKDAEGNVLYTIDLSRPWR RADYQDLIRGVAGEDWFDISPEARRARCEELGVEISPDMKDVDVSQQVYEKLVEEKTMNPCFVTHVAKDLVPLAKLNREN PDVVDVYELVINGQEISPGYSELNDPDVQKERLEHQAAGETQRVDYDFIETLEYGMPSAGGIGIGIDRVVMMLTGASSIR DVLLFPQLKRKDS >Mature_492_residues SEQQQAHPTTTESELIAVRRDKLAKIRELGIDPYGARFDVTTTPAGLKADFQEDKQVAVAGRLLAIRDMGKSQFFVIGDV RGKIQGFLHKNEVDETTWKLWKLLDRGDWVGIRGTTFLTRTGEPTVKVSGLTILSKSLRPLPDKWHGLADKEVTYRKRHL DLISNEESASLFVTRSLMIAEIRRFLQERGYLEVETPMLQDVAGGAAAKPFETYHNALDMPLTLRIAPELFLKRLMVGGF TKIFELNRSFRNEGIDRRHNPEFTMLEAYCACGDFETMANMVEELICHLAEKFCGGLQIDHKDAEGNVLYTIDLSRPWRR ADYQDLIRGVAGEDWFDISPEARRARCEELGVEISPDMKDVDVSQQVYEKLVEEKTMNPCFVTHVAKDLVPLAKLNRENP DVVDVYELVINGQEISPGYSELNDPDVQKERLEHQAAGETQRVDYDFIETLEYGMPSAGGIGIGIDRVVMMLTGASSIRD VLLFPQLKRKDS
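The translated and mature lengths follow from the gene length. The sketch below assumes that the final codon is a stop codon (the gene sequence above ends in TAA) and that the mature form differs from the translated form only by removal of the initiator methionine, as the two sequences above suggest. The function name is illustrative.

```python
def residue_counts(n_bases: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes the sequence includes one terminal stop codon and that the
    mature protein lacks only the initiator methionine.
    """
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    translated = n_bases // 3 - 1   # drop the stop codon
    mature = translated - 1         # drop the initiator Met
    return translated, mature


print(residue_counts(1482))  # (493, 492)
```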
Specific function: Unknown
COG id: COG1190
COG function: function code J; Lysyl-tRNA synthetase (class II)
Gene ontology:
Cell location: Cytoplasm [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the class-II aminoacyl-tRNA synthetase family [H]
Homologues:
Organism | GI | Length | Percent_Identity | Blast_Score | Evalue |
---|---|---|---|---|---|
Homo sapiens | GI5031815 | 507 | 35.1084812623274 | 294 | 1e-79 |
Homo sapiens | GI194272210 | 507 | 35.1084812623274 | 294 | 1e-79 |
Homo sapiens | GI45439306 | 332 | 23.7951807228916 | 77 | 5e-14 |
Escherichia coli | GI1789256 | 508 | 40.748031496063 | 387 | 1e-109 |
Escherichia coli | GI1790571 | 508 | 40.1574803149606 | 386 | 1e-108 |
Escherichia coli | GI87082379 | 348 | 32.183908045977 | 141 | 1e-34 |
Escherichia coli | GI1788173 | 297 | 27.2727272727273 | 79 | 6e-16 |
Caenorhabditis elegans | GI17535925 | 517 | 35.7833655705996 | 290 | 1e-78 |
Caenorhabditis elegans | GI17535927 | 517 | 35.7833655705996 | 290 | 1e-78 |
Caenorhabditis elegans | GI71994340 | 494 | 36.6396761133603 | 290 | 2e-78 |
Caenorhabditis elegans | GI32566633 | 332 | 25.9036144578313 | 80 | 3e-15 |
Saccharomyces cerevisiae | GI6320242 | 515 | 36.8932038834951 | 299 | 7e-82 |
Saccharomyces cerevisiae | GI6324256 | 530 | 28.4905660377358 | 162 | 8e-41 |
Saccharomyces cerevisiae | GI6323011 | 472 | 21.6101694915254 | 74 | 8e-14 |
Drosophila melanogaster | GI24640849 | 495 | 36.969696969697 | 305 | 5e-83 |
Drosophila melanogaster | GI24640851 | 495 | 36.969696969697 | 305 | 6e-83 |
Paralogues:
None
Copy number: 1200 Molecules/Cell In: Glucose minimal media [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR004364
- InterPro: IPR018150
- InterPro: IPR006195
- InterPro: IPR002313
- InterPro: IPR018149
- InterPro: IPR012340
- InterPro: IPR016027
- InterPro: IPR004365 [H]
Pfam domain/function: PF00152 tRNA-synt_2; PF01336 tRNA_anti [H]
EC number: 6.1.1.6 [H]
Molecular weight: Translated: 55829; Mature: 55698
Theoretical pI: Translated: 5.02; Mature: 5.02
Prosite motif: PS50862 AA_TRNA_LIGASE_II
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
- 1.2 %Cys (Translated Protein)
- 2.8 %Met (Translated Protein)
- 4.1 %Cys+Met (Translated Protein)
- 1.2 %Cys (Mature Protein)
- 2.6 %Met (Mature Protein)
- 3.9 %Cys+Met (Mature Protein)
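Cys/Met percentages of this kind are residue counts divided by protein length. The sketch below is one way to derive them from a one-letter protein string; the function name `cys_met_percent` and the rounding to one decimal place are assumptions chosen to match the precision shown in the record, not a description of how the database computes the values.

```python
def cys_met_percent(protein: str) -> dict[str, float]:
    """Percent Cys, Met, and Cys+Met in a one-letter protein sequence."""
    # Keep letters only, so wrap spaces in pasted sequence text are ignored.
    residues = [r for r in protein.upper() if r.isalpha()]
    n = len(residues)
    if n == 0:
        raise ValueError("empty protein sequence")
    cys = residues.count("C")
    met = residues.count("M")
    return {
        "Cys": round(100.0 * cys / n, 1),
        "Met": round(100.0 * met / n, 1),
        "Cys+Met": round(100.0 * (cys + met) / n, 1),
    }


# Toy input: one Met and one Cys in four residues.
print(cys_met_percent("MCAA"))  # {'Cys': 25.0, 'Met': 25.0, 'Cys+Met': 50.0}
```

Applying the function to the translated and mature sequences listed in the record gives the two sets of values to compare against the figures above.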
Secondary structure:
>Translated Secondary Structure
MSEQQQAHPTTTESELIAVRRDKLAKIRELGIDPYGARFDVTTTPAGLKADFQEDKQVAV
CCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCCCCCHHCCHHHH
AGRLLAIRDMGKSQFFVIGDVRGKIQGFLHKNEVDETTWKLWKLLDRGDWVGIRGTTFLT
HHHHHHEECCCCCCEEEEECCCHHHHHHHHHCCCHHHHHHHHHHHHCCCEEEECCCEEEE
RTGEPTVKVSGLTILSKSLRPLPDKWHGLADKEVTYRKRHLDLISNEESASLFVTRSLMI
ECCCCEEEECCHHHHHHHCCCCCHHHCCCCCCHHHHHHHHHHHHCCCCCCEEHHHHHHHH
AEIRRFLQERGYLEVETPMLQDVAGGAAAKPFETYHNALDMPLTLRIAPELFLKRLMVGG
HHHHHHHHHCCCEEECCHHHHHHCCCCCCCCHHHHHHHCCCCEEEEECHHHHHHHHHHCC
FTKIFELNRSFRNEGIDRRHNPEFTMLEAYCACGDFETMANMVEELICHLAEKFCGGLQI
HHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCEE
DHKDAEGNVLYTIDLSRPWRRADYQDLIRGVAGEDWFDISPEARRARCEELGVEISPDMK
ECCCCCCCEEEEEECCCCCCCCCHHHHHHHCCCCCCCCCCCHHHHHHHHHCCCCCCCCCC
DVDVSQQVYEKLVEEKTMNPCFVTHVAKDLVPLAKLNRENPDVVDVYELVINGQEISPGY
CCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCCCCCH
SELNDPDVQKERLEHQAAGETQRVDYDFIETLEYGMPSAGGIGIGIDRVVMMLTGASSIR
HHCCCCCHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHHCCHHHH
DVLLFPQLKRKDS
HHHHCCCHHCCCC
>Mature Secondary Structure
SEQQQAHPTTTESELIAVRRDKLAKIRELGIDPYGARFDVTTTPAGLKADFQEDKQVAV
CCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCCCCCHHCCHHHH
AGRLLAIRDMGKSQFFVIGDVRGKIQGFLHKNEVDETTWKLWKLLDRGDWVGIRGTTFLT
HHHHHHEECCCCCCEEEEECCCHHHHHHHHHCCCHHHHHHHHHHHHCCCEEEECCCEEEE
RTGEPTVKVSGLTILSKSLRPLPDKWHGLADKEVTYRKRHLDLISNEESASLFVTRSLMI
ECCCCEEEECCHHHHHHHCCCCCHHHCCCCCCHHHHHHHHHHHHCCCCCCEEHHHHHHHH
AEIRRFLQERGYLEVETPMLQDVAGGAAAKPFETYHNALDMPLTLRIAPELFLKRLMVGG
HHHHHHHHHCCCEEECCHHHHHHCCCCCCCCHHHHHHHCCCCEEEEECHHHHHHHHHHCC
FTKIFELNRSFRNEGIDRRHNPEFTMLEAYCACGDFETMANMVEELICHLAEKFCGGLQI
HHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCEE
DHKDAEGNVLYTIDLSRPWRRADYQDLIRGVAGEDWFDISPEARRARCEELGVEISPDMK
ECCCCCCCEEEEEECCCCCCCCCHHHHHHHCCCCCCCCCCCHHHHHHHHHCCCCCCCCCC
DVDVSQQVYEKLVEEKTMNPCFVTHVAKDLVPLAKLNRENPDVVDVYELVINGQEISPGY
CCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCCCCCH
SELNDPDVQKERLEHQAAGETQRVDYDFIETLEYGMPSAGGIGIGIDRVVMMLTGASSIR
HHCCCCCHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHHCCHHHH
DVLLFPQLKRKDS
HHHHCCCHHCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA