| Definition | Bacteroides thetaiotaomicron VPI-5482 chromosome, complete genome. |
|---|---|
| Accession | NC_004663 |
| Length | 6,260,361 |
Click here to switch to the map view.
The map label for this gene is alaS [H]
Identifier: 29349403
GI number: 29349403
Start: 5205400
End: 5208018
Strand: Reverse
Name: alaS [H]
Synonym: BT_3995
Alternate gene names: 29349403
Gene position: 5208018-5205400 (Counterclockwise)
Preceding gene: 29349406
Following gene: 29349402
Centisome position: 83.19
GC content: 48.87
Gene sequence:
>2619_bases ATGTTGACTGCAAAAGAGATCAGAGATTCATTCAAGAATTTCTTTGAGTCGAAAGGACACCACATTGTTCCCTCGGCTCC GATGGTGATAAAAGATGACCCTACCCTGATGTTTACCAACGCAGGGATGAACCAGTTTAAAGATATTATTTTGGGTAACC ACCCGGCCAAATATCACAGAGTCGCGGACTCTCAGAAATGTCTGCGTGTAAGCGGAAAGCATAATGACCTGGAAGAAGTG GGACACGATACGTACCACCACACCATGTTCGAAATGCTGGGCAACTGGTCATTCGGTGACTACTTCAAGAAAGAAGCCAT CAACTGGGCATGGGAATATCTGGTAGAAGTATTGAAACTGAATCCGGAACACCTGTACGCCACTGTATTCGAAGGAAGTC CCGAAGAAGGACTGAGCCGTGACGACGAAGCCGCTTCTTACTGGGAACAGTATCTGCCGAAAGATCACATCATCAACGGC AACAAGCACGACAACTTCTGGGAAATGGGTGATACAGGTCCTTGCGGACCTTGTTCGGAAATCCACATCGACCTCCGTCC GGCAGAAGAACGTGCCAAAATCTCCGGCCGTGATCTCGTCAACCACGATCATCCCCAAGTGATTGAAATATGGAACCTTG TGTTCATGCAATATAACCGTAAAGCTGACGGCAGCCTCGAACCACTTCCTGCAAAGGTTATCGATACAGGTATGGGATTC GAACGCCTCTGTATGGCTTTGCAGGGCAAGACTTCCAACTACGATACAGACGTATTCCAGCCGATGCTGAAAGCGATTGC AGCAATGTCAGGCACAGAGTACGGAAAAGACAAACAACAGGACATCGCCATGCGTGTAATCGCTGACCACATCCGTACGA TTGCTTTCTCCATTACGGACGGTCAGCTGCCTTCCAATGCCAAGGCAGGTTATGTAATCCGCCGTATCCTCCGCCGCGCC GTTCGCTACGGATATACTTTCCTCGGACAGAAACAGTCATTCATGTACAAGCTGCTTCCGGTGTTGATCGACAACATGGG AGACGCTTATCCGGAACTGATCGCACAGAAAGGTCTGATCGAAAAAGTAATCAAGGAAGAAGAAGAAGCCTTCCTTCGCA CGCTGGAAACAGGTATCCGTCTGCTGGATAAAACAATGGGAGATACAAAGGCCGCCGGAAAGACAGAAATCAGCGGTAAA GATGCCTTTACCTTATATGATACTTTCGGTTTTCCGTTAGACCTGACAGAACTGATTCTTCGCGAAAACGGAATGACAGT CAACATCGAAGAGTTCAACGCGGAAATGCAACAGCAGAAACAGCGTGCACGTAACGCTGCCGCCATCGAAACCGGTGACT GGGTGACTCTGAGAGAAGGAACAACCGAATTCGTAGGTTACGACTATACTGAATACGAAGCATCTATTCTCCGCTATCGT CAGATCAAGCAGAAAAACCAGACACTGTATCAGATCGTACTTGACTGTACTCCGTTCTATGCAGAAAGCGGTGGTCAGGT AGGCGACACCGGTGTATTGGTCAGCGAATTCGAGACCATCGAAGTGATTGACACCAAGAAAGAGAACAATCTACCGATAC ATATTACCAAAAAGTTGCCGGAACATCCGGAAGCTCCGATGATGGCTTGTGTAGACACCGACAAACGGGCAGCCTGCGCA GCCAATCACTCGGCAACTCACCTGCTGGACTCTGCCCTGCGTGAAGTACTGGGCGAGCATATCGAACAGAAAGGTTCGTT AGTAACACCGGATTCACTGCGTTTCGACTTCTCTCACTTCCAGAAGGTGACCGACGAAGAAATCCGCCAGGTAGAACACC TGGTGAATGCCAAGATTCGTGCCAACATACCTTTGAAGGAATACCGCAACATTCCTATCGAAGAAGCGAAAGAGCTAGGA GCTATCGCCCTCTTCGGTGAAAAGTACGGTGAAAGAGTACGTGTCATCCAGTTCGGTTCTTCTATCGAATTCTGTGGAGG TATCCACGTAGCCGCAACCGGAAATATCGGTATGGTGAAGATCATTTCCGAAAGCTCTGTTGCAGCCGGTGTACGCCGTA TCGAAGCATATACGGGAGCCCGTGTGGAAGAAATGCTGGATACCATTCAGGATACGATCAGTGAATTGAAATCGCTCTTC AACAACGCACCCGACTTGGGTATTGCCATCCGCAAGTATATTGAGGAAAACGCAGGACTGAAAAAGCAGGTAGAAGACTA CATGAAAGAGAAAGAAGCCTCGCTGAAAGAAAGACTGCTGAAGAATATACAGGAAATTCACGGTATCAAGGTAATCAAGT TCTGTGCTCCGTTGCCGGCGGAAGTGGTGAAGAATATCGCCTTCCAGCTTCGCGGTGAAATCACAGAAAACCTGTTCTTT GTGGCCGGAAGCCTCGACAACGGCAAGCCTATGCTGACCGTCATGCTAAGTGACAATCTGGTGGCCGGTGGCTTGAAAGC AGGCAACCTGGTGAAAGAAGCAGCCAAACTGATTCAAGGCGGCGGAGGCGGTCAACCTCATTTCGCTACTGCCGGAGGCA AGAATACAGACGGACTGAATGCCGCTATCGAAAAGGTACTGGAACTCGCAGGTATCTAA
Upstream 100 bases:
>100_bases GAAATATTTCGCTTGTTAATACGGCAAAAGTAATAAAAACAAGCTATCTTTGCACAGTTTTTTGAGAATATTAACCTCAA CAATAGATATATAGAAGAAT
Downstream 100 bases:
>100_bases AAAAGCAGTAACATACATCCAATACAAGAACCGTGTCTTAAAGTCACCCGCACTTTAAGGCACGTTTTTTTATTTCAGAT TCCAGTGTCAACAGCAACTA
Product: alanyl-tRNA synthetase
Products: NA
Alternate protein names: Alanine--tRNA ligase; AlaRS [H]
Number of amino acids: Translated: 872; Mature: 872
Protein sequence:
>872_residues MLTAKEIRDSFKNFFESKGHHIVPSAPMVIKDDPTLMFTNAGMNQFKDIILGNHPAKYHRVADSQKCLRVSGKHNDLEEV GHDTYHHTMFEMLGNWSFGDYFKKEAINWAWEYLVEVLKLNPEHLYATVFEGSPEEGLSRDDEAASYWEQYLPKDHIING NKHDNFWEMGDTGPCGPCSEIHIDLRPAEERAKISGRDLVNHDHPQVIEIWNLVFMQYNRKADGSLEPLPAKVIDTGMGF ERLCMALQGKTSNYDTDVFQPMLKAIAAMSGTEYGKDKQQDIAMRVIADHIRTIAFSITDGQLPSNAKAGYVIRRILRRA VRYGYTFLGQKQSFMYKLLPVLIDNMGDAYPELIAQKGLIEKVIKEEEEAFLRTLETGIRLLDKTMGDTKAAGKTEISGK DAFTLYDTFGFPLDLTELILRENGMTVNIEEFNAEMQQQKQRARNAAAIETGDWVTLREGTTEFVGYDYTEYEASILRYR QIKQKNQTLYQIVLDCTPFYAESGGQVGDTGVLVSEFETIEVIDTKKENNLPIHITKKLPEHPEAPMMACVDTDKRAACA ANHSATHLLDSALREVLGEHIEQKGSLVTPDSLRFDFSHFQKVTDEEIRQVEHLVNAKIRANIPLKEYRNIPIEEAKELG AIALFGEKYGERVRVIQFGSSIEFCGGIHVAATGNIGMVKIISESSVAAGVRRIEAYTGARVEEMLDTIQDTISELKSLF NNAPDLGIAIRKYIEENAGLKKQVEDYMKEKEASLKERLLKNIQEIHGIKVIKFCAPLPAEVVKNIAFQLRGEITENLFF VAGSLDNGKPMLTVMLSDNLVAGGLKAGNLVKEAAKLIQGGGGGQPHFATAGGKNTDGLNAAIEKVLELAGI
Sequences:
>Translated_872_residues MLTAKEIRDSFKNFFESKGHHIVPSAPMVIKDDPTLMFTNAGMNQFKDIILGNHPAKYHRVADSQKCLRVSGKHNDLEEV GHDTYHHTMFEMLGNWSFGDYFKKEAINWAWEYLVEVLKLNPEHLYATVFEGSPEEGLSRDDEAASYWEQYLPKDHIING NKHDNFWEMGDTGPCGPCSEIHIDLRPAEERAKISGRDLVNHDHPQVIEIWNLVFMQYNRKADGSLEPLPAKVIDTGMGF ERLCMALQGKTSNYDTDVFQPMLKAIAAMSGTEYGKDKQQDIAMRVIADHIRTIAFSITDGQLPSNAKAGYVIRRILRRA VRYGYTFLGQKQSFMYKLLPVLIDNMGDAYPELIAQKGLIEKVIKEEEEAFLRTLETGIRLLDKTMGDTKAAGKTEISGK DAFTLYDTFGFPLDLTELILRENGMTVNIEEFNAEMQQQKQRARNAAAIETGDWVTLREGTTEFVGYDYTEYEASILRYR QIKQKNQTLYQIVLDCTPFYAESGGQVGDTGVLVSEFETIEVIDTKKENNLPIHITKKLPEHPEAPMMACVDTDKRAACA ANHSATHLLDSALREVLGEHIEQKGSLVTPDSLRFDFSHFQKVTDEEIRQVEHLVNAKIRANIPLKEYRNIPIEEAKELG AIALFGEKYGERVRVIQFGSSIEFCGGIHVAATGNIGMVKIISESSVAAGVRRIEAYTGARVEEMLDTIQDTISELKSLF NNAPDLGIAIRKYIEENAGLKKQVEDYMKEKEASLKERLLKNIQEIHGIKVIKFCAPLPAEVVKNIAFQLRGEITENLFF VAGSLDNGKPMLTVMLSDNLVAGGLKAGNLVKEAAKLIQGGGGGQPHFATAGGKNTDGLNAAIEKVLELAGI >Mature_872_residues MLTAKEIRDSFKNFFESKGHHIVPSAPMVIKDDPTLMFTNAGMNQFKDIILGNHPAKYHRVADSQKCLRVSGKHNDLEEV GHDTYHHTMFEMLGNWSFGDYFKKEAINWAWEYLVEVLKLNPEHLYATVFEGSPEEGLSRDDEAASYWEQYLPKDHIING NKHDNFWEMGDTGPCGPCSEIHIDLRPAEERAKISGRDLVNHDHPQVIEIWNLVFMQYNRKADGSLEPLPAKVIDTGMGF ERLCMALQGKTSNYDTDVFQPMLKAIAAMSGTEYGKDKQQDIAMRVIADHIRTIAFSITDGQLPSNAKAGYVIRRILRRA VRYGYTFLGQKQSFMYKLLPVLIDNMGDAYPELIAQKGLIEKVIKEEEEAFLRTLETGIRLLDKTMGDTKAAGKTEISGK DAFTLYDTFGFPLDLTELILRENGMTVNIEEFNAEMQQQKQRARNAAAIETGDWVTLREGTTEFVGYDYTEYEASILRYR QIKQKNQTLYQIVLDCTPFYAESGGQVGDTGVLVSEFETIEVIDTKKENNLPIHITKKLPEHPEAPMMACVDTDKRAACA ANHSATHLLDSALREVLGEHIEQKGSLVTPDSLRFDFSHFQKVTDEEIRQVEHLVNAKIRANIPLKEYRNIPIEEAKELG AIALFGEKYGERVRVIQFGSSIEFCGGIHVAATGNIGMVKIISESSVAAGVRRIEAYTGARVEEMLDTIQDTISELKSLF NNAPDLGIAIRKYIEENAGLKKQVEDYMKEKEASLKERLLKNIQEIHGIKVIKFCAPLPAEVVKNIAFQLRGEITENLFF VAGSLDNGKPMLTVMLSDNLVAGGLKAGNLVKEAAKLIQGGGGGQPHFATAGGKNTDGLNAAIEKVLELAGI
Specific function: Catalyzes the attachment of alanine to tRNA(Ala) in a two-step reaction:alanine is first activated by ATP to form Ala- AMP and then transferred to the acceptor end of tRNA(Ala). Also edits incorrectly charged Ser-tRNA(Ala) and Gly-tRNA(Ala) via its editin
COG id: COG0013
COG function: function code J; Alanyl-tRNA synthetase
Gene ontology:
Cell location: Cytoplasm [H]
Metaboloic importance: Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the class-II aminoacyl-tRNA synthetase family [H]
Homologues:
Organism=Homo sapiens, GI109148542, Length=973, Percent_Identity=37.8211716341213, Blast_Score=568, Evalue=1e-161, Organism=Homo sapiens, GI38569417, Length=783, Percent_Identity=36.7816091954023, Blast_Score=474, Evalue=1e-133, Organism=Escherichia coli, GI1789048, Length=885, Percent_Identity=43.2768361581921, Blast_Score=661, Evalue=0.0, Organism=Caenorhabditis elegans, GI17506981, Length=978, Percent_Identity=36.8098159509202, Blast_Score=564, Evalue=1e-161, Organism=Caenorhabditis elegans, GI17536681, Length=755, Percent_Identity=36.4238410596026, Blast_Score=425, Evalue=1e-119, Organism=Saccharomyces cerevisiae, GI6324911, Length=763, Percent_Identity=43.1192660550459, Blast_Score=591, Evalue=1e-169, Organism=Drosophila melanogaster, GI24582809, Length=974, Percent_Identity=37.4743326488706, Blast_Score=565, Evalue=1e-161, Organism=Drosophila melanogaster, GI45552267, Length=974, Percent_Identity=37.4743326488706, Blast_Score=565, Evalue=1e-161, Organism=Drosophila melanogaster, GI24658214, Length=816, Percent_Identity=33.9460784313725, Blast_Score=389, Evalue=1e-108,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR002318 - InterPro: IPR018162 - InterPro: IPR018165 - InterPro: IPR018164 - InterPro: IPR023033 - InterPro: IPR003156 - InterPro: IPR018163 - InterPro: IPR012947 [H]
Pfam domain/function: PF02272 DHHA1; PF01411 tRNA-synt_2c; PF07973 tRNA_SAD [H]
EC number: =6.1.1.7 [H]
Molecular weight: Translated: 97555; Mature: 97555
Theoretical pI: Translated: 5.49; Mature: 5.49
Prosite motif: PS50860 AA_TRNA_LIGASE_II_ALA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.0 %Cys (Translated Protein) 2.9 %Met (Translated Protein) 3.9 %Cys+Met (Translated Protein) 1.0 %Cys (Mature Protein) 2.9 %Met (Mature Protein) 3.9 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MLTAKEIRDSFKNFFESKGHHIVPSAPMVIKDDPTLMFTNAGMNQFKDIILGNHPAKYHR CCCHHHHHHHHHHHHHCCCCEECCCCCEEEECCCCEEEECCCHHHHHHHHCCCCCHHHHH VADSQKCLRVSGKHNDLEEVGHDTYHHTMFEMLGNWSFGDYFKKEAINWAWEYLVEVLKL HCCCHHHHHCCCCCCCHHHHCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHC NPEHLYATVFEGSPEEGLSRDDEAASYWEQYLPKDHIINGNKHDNFWEMGDTGPCGPCSE CCCCEEEEEECCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCC IHIDLRPAEERAKISGRDLVNHDHPQVIEIWNLVFMQYNRKADGSLEPLPAKVIDTGMGF EEEEECCCHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHCCCCH ERLCMALQGKTSNYDTDVFQPMLKAIAAMSGTEYGKDKQQDIAMRVIADHIRTIAFSITD HHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHEEEEEEC GQLPSNAKAGYVIRRILRRAVRYGYTFLGQKQSFMYKLLPVLIDNMGDAYPELIAQKGLI CCCCCCCCHHHHHHHHHHHHHHHCHHHHCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH EKVIKEEEEAFLRTLETGIRLLDKTMGDTKAAGKTEISGKDAFTLYDTFGFPLDLTELIL HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEECCCCCHHHHHHHH RENGMTVNIEEFNAEMQQQKQRARNAAAIETGDWVTLREGTTEFVGYDYTEYEASILRYR HCCCCEEEHHHHHHHHHHHHHHHHCCCEEECCCEEEECCCCCHHCCCCHHHHHHHHHHHH QIKQKNQTLYQIVLDCTPFYAESGGQVGDTGVLVSEFETIEVIDTKKENNLPIHITKKLP HHHHHHHHHHHHHHHCCCHHHCCCCCCCCCCEEEECCCEEEEEECCCCCCCCEEEECCCC EHPEAPMMACVDTDKRAACAANHSATHLLDSALREVLGEHIEQKGSLVTPDSLRFDFSHF CCCCCCEEEEECCCCHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHH QKVTDEEIRQVEHLVNAKIRANIPLKEYRNIPIEEAKELGAIALFGEKYGERVRVIQFGS HHCCHHHHHHHHHHHCCHHEECCCHHHHCCCCHHHHHHCCEEEEECHHHCCEEEEEEECC SIEFCGGIHVAATGNIGMVKIISESSVAAGVRRIEAYTGARVEEMLDTIQDTISELKSLF CCCCCCCEEEEEECCCCEEEEECCCHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHH NNAPDLGIAIRKYIEENAGLKKQVEDYMKEKEASLKERLLKNIQEIHGIKVIKFCAPLPA CCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHCCCHH EVVKNIAFQLRGEITENLFFVAGSLDNGKPMLTVMLSDNLVAGGLKAGNLVKEAAKLIQG HHHHHHHHHHHHHHHCCEEEEEECCCCCCCEEEEEEECCEEECCCCHHHHHHHHHHHHHC GGGGQPHFATAGGKNTDGLNAAIEKVLELAGI CCCCCCCEECCCCCCCCHHHHHHHHHHHHHCC >Mature Secondary Structure MLTAKEIRDSFKNFFESKGHHIVPSAPMVIKDDPTLMFTNAGMNQFKDIILGNHPAKYHR CCCHHHHHHHHHHHHHCCCCEECCCCCEEEECCCCEEEECCCHHHHHHHHCCCCCHHHHH VADSQKCLRVSGKHNDLEEVGHDTYHHTMFEMLGNWSFGDYFKKEAINWAWEYLVEVLKL HCCCHHHHHCCCCCCCHHHHCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHC NPEHLYATVFEGSPEEGLSRDDEAASYWEQYLPKDHIINGNKHDNFWEMGDTGPCGPCSE CCCCEEEEEECCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCC IHIDLRPAEERAKISGRDLVNHDHPQVIEIWNLVFMQYNRKADGSLEPLPAKVIDTGMGF EEEEECCCHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHCCCCH ERLCMALQGKTSNYDTDVFQPMLKAIAAMSGTEYGKDKQQDIAMRVIADHIRTIAFSITD HHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHEEEEEEC GQLPSNAKAGYVIRRILRRAVRYGYTFLGQKQSFMYKLLPVLIDNMGDAYPELIAQKGLI CCCCCCCCHHHHHHHHHHHHHHHCHHHHCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH EKVIKEEEEAFLRTLETGIRLLDKTMGDTKAAGKTEISGKDAFTLYDTFGFPLDLTELIL HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEECCCCCHHHHHHHH RENGMTVNIEEFNAEMQQQKQRARNAAAIETGDWVTLREGTTEFVGYDYTEYEASILRYR HCCCCEEEHHHHHHHHHHHHHHHHCCCEEECCCEEEECCCCCHHCCCCHHHHHHHHHHHH QIKQKNQTLYQIVLDCTPFYAESGGQVGDTGVLVSEFETIEVIDTKKENNLPIHITKKLP HHHHHHHHHHHHHHHCCCHHHCCCCCCCCCCEEEECCCEEEEEECCCCCCCCEEEECCCC EHPEAPMMACVDTDKRAACAANHSATHLLDSALREVLGEHIEQKGSLVTPDSLRFDFSHF CCCCCCEEEEECCCCHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHH QKVTDEEIRQVEHLVNAKIRANIPLKEYRNIPIEEAKELGAIALFGEKYGERVRVIQFGS HHCCHHHHHHHHHHHCCHHEECCCHHHHCCCCHHHHHHCCEEEEECHHHCCEEEEEEECC SIEFCGGIHVAATGNIGMVKIISESSVAAGVRRIEAYTGARVEEMLDTIQDTISELKSLF CCCCCCCEEEEEECCCCEEEEECCCHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHH NNAPDLGIAIRKYIEENAGLKKQVEDYMKEKEASLKERLLKNIQEIHGIKVIKFCAPLPA CCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHCCCHH EVVKNIAFQLRGEITENLFFVAGSLDNGKPMLTVMLSDNLVAGGLKAGNLVKEAAKLIQG HHHHHHHHHHHHHHHCCEEEEEECCCCCCCEEEEEEECCEEECCCCHHHHHHHHHHHHHC GGGGQPHFATAGGKNTDGLNAAIEKVLELAGI CCCCCCCEECCCCCCCCHHHHHHHHHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA