Definition | Akkermansia muciniphila ATCC BAA-835, complete genome. |
---|---|
Accession | NC_010655 |
Length | 2,664,102 |
Click here to switch to the map view.
The map label for this gene is aspS [H]
Identifier: 187735889
GI number: 187735889
Start: 1667480
End: 1669273
Strand: Reverse
Name: aspS [H]
Synonym: Amuc_1396
Alternate gene names: 187735889
Gene position: 1669273-1667480 (Counterclockwise)
Preceding gene: 187735890
Following gene: 187735884
Centisome position: 62.66
GC content: 60.03
Gene sequence:
>1794_bases ATGAACTCATACCGCACTCATACCTGCAGCGAATTGCGCGCTGCGAACATCGGCAAGCCGACCACCCTCATCGGCTGGGT AGATTCCGTCCGGGACCATGGCGGCGTCATATTTATCGACTTGCGCGACCGCTCCGGCATCACGCAGGTAGTCTTTCACC CGGAAGTCAATCAGGATGTGGCGAAAGCCTCCCAGCAGCTCCGCTCCGAAGACATGATCCAGATTTCCGGCACCGTGGCC GCCCGCCTGAAAACGGATACGGTGGACACGACCAATGCGGACCTCCCCACCGGGGAAATCGAAGTCTCCGCAGACACCCT GAACGTCATCAACAAGGCTGACGTGCTCCCCTTCCAGCTGGATCGGGCCCTCTCCAACGAAGACCTGCGCCTCAAATACC GCTTTCTGGACCTGCGCCGCCCGACCATGGCCCGCAACATGCAGATCCGCCACCGCGTCACCAAATCCACGCGCGACTAT CTGGACGAGCACGGCTTTCTGGAAATTGAGACGCCCATCCTCTCCAAATCCACGCCGGAAGGCGCACGGGACTTCCTGGT GCCCTCCCGCCTGGCGCCCGGCAAATTCTACGCGCTGCCCCAGGCCCCGCAGCAATACAAGCAGCTGCTCATGGTAGCCG GCATGGAACGCTATTTCCAAATTGCCCGCTGCTTCCGTGACGAAGACCTGCGCGCGGACCGCCAGCCGGAATTCACCCAG GTGGACATTGAAGCTTCCTTCATCACGCCGGAAGACATCTACAACCTGGTGGAAGGCCTGCTCAAACGCGTGTACAAGGA ATCCCTGGGCGTGGACATTCCCACCCCCTTCCCCCGCATGACCTGGAAGGAAGCAATGGATCAGTACGGTTCCGACAAGC CGGAACGCCGCTTCGGCATGAAGCTAACGGACGTCTCCTCCATCTTTGAAAACAGCGGCTTCAAGGTATTCGCCTCAGCC GTAAGCAACGGCGGCGTGGTCAAGGCCATCAACGCCAAGGGATTCGGTTCCGCCTCCGTTGGCCAGATTGACGCCCTGAC GAAAACTGCCGTGGAAGCCGGAGCCAAAGGCCTGGCCTACATCAAGGTGCGCGAGGAAGACTGGAGAAGCCCCATCTCCA AATTCCTTTCCGACGAAGAAAAACAAAAACTCACGGAAGCCCTGGACATCGAAACAGGAGACCTCGTTCTCTTTGCCGCC GGCCCGTGGGAACCCTCCTGCGATATCCTGGGCCGCGTGCGCCTGCAATGCGCCGAATTCATGGAACTGCTCAAGGACAA CAAGGAACGCGACTTCCTGTGGGTTATTGAATTCCCGCTGGTGGGCTGGGATGAGGAAGAGCAGCGCTGGGTCGCCATCC ACCACCCCTTTACCCGCCCCGTCAAGGAAGACGAACAAAAACTGCTCAGCGGGGAACTCTCCGCAGACCTTCGCGCGCAG GCCTACGATGTGGTGCTCAACGGCACGGAGCTGGGAGGCGGTTCCATCCGCATCCATGAACGCGACCTGCAGTCCGCCAT GTTCAAGGCGCTCGGCATCACGGAAGAACAGGCCCGCGAACAATTCGGCCACATTCTGGACGCTTTCAGCTTCGGCGCTC CACCCCACGGAGGCCTTGCGCTGGGCCTCGACCGCCTGGTCATGATGATTTGCAACGCGGAATCCATCCGCGAAGTCATC GCATTCCCGAAAAACAACCGGGGAGCGGATCTGATGAGCGACTCCCCCGCCGCCGCGGAAGACCGTCAGCTGCGCGACAT TCACATTCAGGTGAAACTGCCCGCTAAAAAATAG
Upstream 100 bases:
>100_bases CCTCATCCCCCGTCGCCATGGACGACCTGTTTGACGCCGTGGCTTCACTCACCGGAAGCAATTGACACCGTTCTTTCCCC TTCTTCCATTTTCAATAAAG
Downstream 100 bases:
>100_bases CCGGATCCCTTAAACAAACCCCCGGATGGCAATCATTCCATCCGGGGTTTTTTCATGCAGGAAAAACAAAACTCCCTTTC CCGGCAGGCAGCGCACCATT
Product: aspartyl-tRNA synthetase
Products: NA
Alternate protein names: Aspartate--tRNA ligase; AspRS [H]
Number of amino acids: Translated: 597; Mature: 597
Protein sequence:
>597_residues MNSYRTHTCSELRAANIGKPTTLIGWVDSVRDHGGVIFIDLRDRSGITQVVFHPEVNQDVAKASQQLRSEDMIQISGTVA ARLKTDTVDTTNADLPTGEIEVSADTLNVINKADVLPFQLDRALSNEDLRLKYRFLDLRRPTMARNMQIRHRVTKSTRDY LDEHGFLEIETPILSKSTPEGARDFLVPSRLAPGKFYALPQAPQQYKQLLMVAGMERYFQIARCFRDEDLRADRQPEFTQ VDIEASFITPEDIYNLVEGLLKRVYKESLGVDIPTPFPRMTWKEAMDQYGSDKPERRFGMKLTDVSSIFENSGFKVFASA VSNGGVVKAINAKGFGSASVGQIDALTKTAVEAGAKGLAYIKVREEDWRSPISKFLSDEEKQKLTEALDIETGDLVLFAA GPWEPSCDILGRVRLQCAEFMELLKDNKERDFLWVIEFPLVGWDEEEQRWVAIHHPFTRPVKEDEQKLLSGELSADLRAQ AYDVVLNGTELGGGSIRIHERDLQSAMFKALGITEEQAREQFGHILDAFSFGAPPHGGLALGLDRLVMMICNAESIREVI AFPKNNRGADLMSDSPAAAEDRQLRDIHIQVKLPAKK
Sequences:
>Translated_597_residues MNSYRTHTCSELRAANIGKPTTLIGWVDSVRDHGGVIFIDLRDRSGITQVVFHPEVNQDVAKASQQLRSEDMIQISGTVA ARLKTDTVDTTNADLPTGEIEVSADTLNVINKADVLPFQLDRALSNEDLRLKYRFLDLRRPTMARNMQIRHRVTKSTRDY LDEHGFLEIETPILSKSTPEGARDFLVPSRLAPGKFYALPQAPQQYKQLLMVAGMERYFQIARCFRDEDLRADRQPEFTQ VDIEASFITPEDIYNLVEGLLKRVYKESLGVDIPTPFPRMTWKEAMDQYGSDKPERRFGMKLTDVSSIFENSGFKVFASA VSNGGVVKAINAKGFGSASVGQIDALTKTAVEAGAKGLAYIKVREEDWRSPISKFLSDEEKQKLTEALDIETGDLVLFAA GPWEPSCDILGRVRLQCAEFMELLKDNKERDFLWVIEFPLVGWDEEEQRWVAIHHPFTRPVKEDEQKLLSGELSADLRAQ AYDVVLNGTELGGGSIRIHERDLQSAMFKALGITEEQAREQFGHILDAFSFGAPPHGGLALGLDRLVMMICNAESIREVI AFPKNNRGADLMSDSPAAAEDRQLRDIHIQVKLPAKK >Mature_597_residues MNSYRTHTCSELRAANIGKPTTLIGWVDSVRDHGGVIFIDLRDRSGITQVVFHPEVNQDVAKASQQLRSEDMIQISGTVA ARLKTDTVDTTNADLPTGEIEVSADTLNVINKADVLPFQLDRALSNEDLRLKYRFLDLRRPTMARNMQIRHRVTKSTRDY LDEHGFLEIETPILSKSTPEGARDFLVPSRLAPGKFYALPQAPQQYKQLLMVAGMERYFQIARCFRDEDLRADRQPEFTQ VDIEASFITPEDIYNLVEGLLKRVYKESLGVDIPTPFPRMTWKEAMDQYGSDKPERRFGMKLTDVSSIFENSGFKVFASA VSNGGVVKAINAKGFGSASVGQIDALTKTAVEAGAKGLAYIKVREEDWRSPISKFLSDEEKQKLTEALDIETGDLVLFAA GPWEPSCDILGRVRLQCAEFMELLKDNKERDFLWVIEFPLVGWDEEEQRWVAIHHPFTRPVKEDEQKLLSGELSADLRAQ AYDVVLNGTELGGGSIRIHERDLQSAMFKALGITEEQAREQFGHILDAFSFGAPPHGGLALGLDRLVMMICNAESIREVI AFPKNNRGADLMSDSPAAAEDRQLRDIHIQVKLPAKK
Specific function: Unknown
COG id: COG0173
COG function: function code J; Aspartyl-tRNA synthetase
Gene ontology:
Cell location: Cytoplasm [H]
Metaboloic importance: Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the class-II aminoacyl-tRNA synthetase family [H]
Homologues:
Organism=Homo sapiens, GI40789249, Length=602, Percent_Identity=41.8604651162791, Blast_Score=464, Evalue=1e-131, Organism=Homo sapiens, GI5031815, Length=310, Percent_Identity=26.1290322580645, Blast_Score=87, Evalue=3e-17, Organism=Homo sapiens, GI194272210, Length=311, Percent_Identity=26.0450160771704, Blast_Score=87, Evalue=4e-17, Organism=Homo sapiens, GI45439306, Length=238, Percent_Identity=27.3109243697479, Blast_Score=78, Evalue=2e-14, Organism=Homo sapiens, GI4758762, Length=289, Percent_Identity=26.2975778546713, Blast_Score=76, Evalue=1e-13, Organism=Escherichia coli, GI1788173, Length=596, Percent_Identity=46.9798657718121, Blast_Score=532, Evalue=1e-152, Organism=Escherichia coli, GI1789256, Length=316, Percent_Identity=27.5316455696203, Blast_Score=99, Evalue=7e-22, Organism=Escherichia coli, GI1790571, Length=300, Percent_Identity=28, Blast_Score=95, Evalue=1e-20, Organism=Caenorhabditis elegans, GI32566633, Length=598, Percent_Identity=34.7826086956522, Blast_Score=333, Evalue=2e-91, Organism=Caenorhabditis elegans, GI71994340, Length=330, Percent_Identity=23.6363636363636, Blast_Score=85, Evalue=1e-16, Organism=Caenorhabditis elegans, GI17535927, Length=330, Percent_Identity=23.6363636363636, Blast_Score=85, Evalue=1e-16, Organism=Caenorhabditis elegans, GI17535925, Length=330, Percent_Identity=23.6363636363636, Blast_Score=84, Evalue=1e-16, Organism=Caenorhabditis elegans, GI17551876, Length=260, Percent_Identity=25, Blast_Score=77, Evalue=3e-14, Organism=Caenorhabditis elegans, GI71984122, Length=313, Percent_Identity=25.8785942492013, Blast_Score=74, Evalue=2e-13, Organism=Saccharomyces cerevisiae, GI6325153, Length=646, Percent_Identity=33.1269349845201, Blast_Score=298, Evalue=1e-81, Organism=Saccharomyces cerevisiae, GI6321807, Length=296, Percent_Identity=24.3243243243243, Blast_Score=74, Evalue=6e-14, Organism=Saccharomyces cerevisiae, GI6323011, Length=241, Percent_Identity=27.8008298755187, Blast_Score=71, Evalue=6e-13, Organism=Saccharomyces cerevisiae, GI6324256, Length=120, Percent_Identity=29.1666666666667, Blast_Score=66, Evalue=2e-11, Organism=Saccharomyces cerevisiae, GI6320242, Length=153, Percent_Identity=27.4509803921569, Blast_Score=64, Evalue=5e-11, Organism=Drosophila melanogaster, GI24584738, Length=600, Percent_Identity=39.6666666666667, Blast_Score=404, Evalue=1e-113, Organism=Drosophila melanogaster, GI24640851, Length=252, Percent_Identity=26.984126984127, Blast_Score=83, Evalue=6e-16, Organism=Drosophila melanogaster, GI24640849, Length=252, Percent_Identity=26.984126984127, Blast_Score=83, Evalue=6e-16, Organism=Drosophila melanogaster, GI17136276, Length=258, Percent_Identity=27.906976744186, Blast_Score=83, Evalue=7e-16,
Paralogues:
None
Copy number: 1320 Molecules/Cell In: Glucose minimal media [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR004364 - InterPro: IPR018150 - InterPro: IPR006195 - InterPro: IPR020564 - InterPro: IPR004524 - InterPro: IPR018153 - InterPro: IPR002312 - InterPro: IPR004115 - InterPro: IPR012340 - InterPro: IPR016027 - InterPro: IPR004365 [H]
Pfam domain/function: PF02938 GAD; PF00152 tRNA-synt_2; PF01336 tRNA_anti [H]
EC number: =6.1.1.12 [H]
Molecular weight: Translated: 67033; Mature: 67033
Theoretical pI: Translated: 5.20; Mature: 5.20
Prosite motif: PS50862 AA_TRNA_LIGASE_II
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.8 %Cys (Translated Protein) 2.3 %Met (Translated Protein) 3.2 %Cys+Met (Translated Protein) 0.8 %Cys (Mature Protein) 2.3 %Met (Mature Protein) 3.2 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MNSYRTHTCSELRAANIGKPTTLIGWVDSVRDHGGVIFIDLRDRSGITQVVFHPEVNQDV CCCCCCCCHHHHHHCCCCCCCEEEHHHHHHHCCCCEEEEEECCCCCCEEEEECCCCCHHH AKASQQLRSEDMIQISGTVAARLKTDTVDTTNADLPTGEIEVSADTLNVINKADVLPFQL HHHHHHHCCCCEEEEECEEEEEEEECCCCCCCCCCCCCEEEEEHHHHHHHHCCCCCHHHH DRALSNEDLRLKYRFLDLRRPTMARNMQIRHRVTKSTRDYLDEHGFLEIETPILSKSTPE HHHCCCCCEEEEEEEEECCCCHHHHCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCC GARDFLVPSRLAPGKFYALPQAPQQYKQLLMVAGMERYFQIARCFRDEDLRADRQPEFTQ CHHHCCCCCCCCCCCEEECCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEE VDIEASFITPEDIYNLVEGLLKRVYKESLGVDIPTPFPRMTWKEAMDQYGSDKPERRFGM EEEEEEECCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHCC KLTDVSSIFENSGFKVFASAVSNGGVVKAINAKGFGSASVGQIDALTKTAVEAGAKGLAY CHHHHHHHHHCCCCEEEEEHHCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHCCCCEEE IKVREEDWRSPISKFLSDEEKQKLTEALDIETGDLVLFAAGPWEPSCDILGRVRLQCAEF EEECCHHHHHHHHHHHCCHHHHHHHHHHCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHH MELLKDNKERDFLWVIEFPLVGWDEEEQRWVAIHHPFTRPVKEDEQKLLSGELSADLRAQ HHHHHCCCCCCEEEEEEECCCCCCCCCCCEEEEECCCCCCCHHHHHHHHHCCCCCCHHCE AYDVVLNGTELGGGSIRIHERDLQSAMFKALGITEEQAREQFGHILDAFSFGAPPHGGLA EEEEEEECCEECCCEEEEEHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCCHH LGLDRLVMMICNAESIREVIAFPKNNRGADLMSDSPAAAEDRQLRDIHIQVKLPAKK HHHHHHHHHHHCHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEECCCC >Mature Secondary Structure MNSYRTHTCSELRAANIGKPTTLIGWVDSVRDHGGVIFIDLRDRSGITQVVFHPEVNQDV CCCCCCCCHHHHHHCCCCCCCEEEHHHHHHHCCCCEEEEEECCCCCCEEEEECCCCCHHH AKASQQLRSEDMIQISGTVAARLKTDTVDTTNADLPTGEIEVSADTLNVINKADVLPFQL HHHHHHHCCCCEEEEECEEEEEEEECCCCCCCCCCCCCEEEEEHHHHHHHHCCCCCHHHH DRALSNEDLRLKYRFLDLRRPTMARNMQIRHRVTKSTRDYLDEHGFLEIETPILSKSTPE HHHCCCCCEEEEEEEEECCCCHHHHCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCC GARDFLVPSRLAPGKFYALPQAPQQYKQLLMVAGMERYFQIARCFRDEDLRADRQPEFTQ CHHHCCCCCCCCCCCEEECCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEE VDIEASFITPEDIYNLVEGLLKRVYKESLGVDIPTPFPRMTWKEAMDQYGSDKPERRFGM EEEEEEECCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHCC KLTDVSSIFENSGFKVFASAVSNGGVVKAINAKGFGSASVGQIDALTKTAVEAGAKGLAY CHHHHHHHHHCCCCEEEEEHHCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHCCCCEEE IKVREEDWRSPISKFLSDEEKQKLTEALDIETGDLVLFAAGPWEPSCDILGRVRLQCAEF EEECCHHHHHHHHHHHCCHHHHHHHHHHCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHH MELLKDNKERDFLWVIEFPLVGWDEEEQRWVAIHHPFTRPVKEDEQKLLSGELSADLRAQ HHHHHCCCCCCEEEEEEECCCCCCCCCCCEEEEECCCCCCCHHHHHHHHHCCCCCCHHCE AYDVVLNGTELGGGSIRIHERDLQSAMFKALGITEEQAREQFGHILDAFSFGAPPHGGLA EEEEEEECCEECCCEEEEEHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCCHH LGLDRLVMMICNAESIREVIAFPKNNRGADLMSDSPAAAEDRQLRDIHIQVKLPAKK HHHHHHHHHHHCHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEECCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA