Definition | Escherichia coli O157:H7 str. EC4115, complete genome. |
---|---|
Accession | NC_011353 |
Length | 5,572,075 |
Click here to switch to the map view.
The map label for this gene is alaS [H]
Identifier: 209398791
GI number: 209398791
Start: 3647545
End: 3650175
Strand: Reverse
Name: alaS [H]
Synonym: ECH74115_3944
Alternate gene names: 209398791
Gene position: 3650175-3647545 (Counterclockwise)
Preceding gene: 209397890
Following gene: 209399950
Centisome position: 65.51
GC content: 53.59
Gene sequence:
>2631_bases ATGAGCAAGAGCACCGCTGAGATCCGTCAGGCGTTTCTCGACTTTTTCCATAGTAAGGGACATCAGGTAGTTGCCAGCAG CTCCCTGGTACCCCATAACGACCCAACTTTGTTGTTTACCAACGCCGGGATGAACCAGTTCAAGGATGTGTTCCTTGGGC TCGACAAGCGTAATTATTCCCGCGCTACCACTTCCCAACGCTGCGTGCGTGCGGGTGGTAAACACAACGACCTGGAAAAC GTCGGTTACACCGCGCGTCACCATACCTTCTTCGAAATGCTGGGCAACTTCAGCTTCGGCGACTATTTCAAACACGATGC CATTCAGTTTGCATGGGAACTGCTGACCAGCGAAAAATGGTTTGCCCTGCCGAAAGAGCGTCTGTGGGTTACCGTCTATG AAAGCGACGACGAAGCCTACGAAATCTGGGAAAAAGAAGTAGGGATCCCGCGCGAACGTATTATTCGCATCGGCGATAAC AAAGGTGCGCCATACGCATCTGACAACTTCTGGCAGATGGGTGACACTGGTCCGTGCGGCCCGTGCACCGAAATCTTCTA CGATCACGGCGACCACATTTGGGGTGGCCCTCCGGGAAGTCCGGAAGAAGACGGCGACCGCTACATTGAGATCTGGAACA TCGTCTTCATGCAGTTCAACCGCCAGGCCGATGGCACGATGGAACCGCTGCCGAAGCCGTCTGTAGATACCGGTATGGGT CTGGAGCGTATTGCTGCGGTGCTGCAACACGTTAACTCTAACTATGACATCGACCTGTTCCGCACGTTGATCCAGGCGGT AGCGAAAGTCACTGGCGCAACCGATCTGAGCAATAAATCGCTGCGCGTAATCGCTGACCACATTCGTTCTTGTGCGTTCC TGATCGCGGATGGCGTAATGCCGTCCAATGAAAACCGTGGTTATGTACTGCGTCGTATCATTCGTCGCGCAGTGCGTCAC GGTAATATGCTCGGCGCGAAAGAAACCTTCTTCTACAAACTGGTTGGTCCGCTGATCGACGTTATGGGCTCTGCGGGTGA AGACCTGAAACGCCAGCAGGCGCAGGTTGAGCAGGTGCTGAAGACTGAAGAAGAGCAGTTTGCTCGTACTCTGGAGCGCG GTCTGGCGTTGCTGGATGAAGAGCTGGCAAAACTTTCTGGTGATACGCTGGATGGTGAAACTGCTTTCCGTCTGTACGAC ACCTATGGCTTCCCGGTTGACCTGACGGCTGATGTTTGTCGTGAGCGCAACATCAAAGTTGACGAAGCTGGTTTTGAAGC AGCAATGGAAGAGCAGCGTCGTCGGGCGCGCGAAGCCAGCGGCTTTGGTGCCGATTACAACGCAATGATCCGTGTTGACA GTGCATCTGAATTTAAAGGCTATGACCATCTGGAACTGAACGGCAAAGTGACCGCGCTGTTTGTTGATGGTAAAGCGGTT GATGCCATCAATGCAGGCCAGGAAGCTGTGGTCGTGCTGGATCAAACGCCATTCTATGCGGAATCCGGCGGTCAGGTTGG TGATAAAGGCGAACTGAAAGGCGCTAACTTCTCCTTCGTGGTGGAAGATACGCAGAAATACGGCCAGGCGATTGGTCACA TCGGTAAACTTGCTGCGGGTTCTCTGAAAGTGGGCGACGCTGTGCAGGCTGATGTTGATGAGGCTCGTCGCGCCCGTATT CGTTTGAATCACTCCGCAACGCACCTGATGCACGCTGCGCTGCGCCAGGTTCTGGGGACTCATGTATCGCAGAAAGGTTC ACTGGTTAACGACAAGGTGCTGCGCTTCGACTTCTCACACAACGAAGCGATGAAACCGGAAGAGATTCGTGCGGTCGAAG ACCTGGTGAACGCACAGATTCGTCGCAATTTGCCGATCGAAACCAACATCATGGATCTCGAAGCGGCGAAAGCGAAAGGT GCGATGGCGCTGTTCGGCGAGAAGTATGATGAGCGTGTACGCGTGCTGAGCATGGGCGATTTCTCCACCGAGCTGTGTGG CGGTACTCACGCCAGCCGCACTGGTGATATTGGTCTGTTCCGCATCATCTCTGAATCGGGTACTGCTGCAGGCGTTCGTC GTATCGAAGCGGTAACCGGAGAAGGCGCTATCACCACCGTTCATGCAGACAGCGATCGCTTAAGCGAAGTCGCGCATCTG CTGAAAGGCGATAGCAATAATCTGGCGGATAAAGTGCGCTCAGTACTGGAACGTACGCGTCAGTTAGAAAAAGAACTACA ACAGCTTAAAGAACAAGCTGCCGCACAGGAGAGCGCAAATCTTTCCAGTAAGGCAATTGATGTTAATGGTGTTAAGCTGT TGGTTAGCGAGCTTAGCGGTGTTGAGCCGAAAATGTTGCGTACCATGGTTGACGATTTAAAAAATCAGCTGGGGTCGACA ATTATCGTGCTGGCAACGGTAGCCGAAGGTAAGGTTTCTCTGATTGCAGGCGTATCTAAGGACGTCACAGATCGTGTGAA AGCAGGGGAACTGATTGGTATGGTCGCTCAGCAGGTGGGCGGCAAGGGTGGTGGACGTCCTGACATGGCGCAAGCCGGTG GTACGGATGCTGCGGCCTTACCTGCAGCGTTAGCCAGTGTGAAAGGCTGGGTCAGCGCGAAATTGCAATAA
Upstream 100 bases:
>100_bases AAGAAAACTTATCTTATTCCCACTTTTCAGTTACCAGCCCGGCGGTTAAGACACGCTGGAGCTGGTGGCGATATTTCGTT AGCTTGATTTCAGGATAATT
Downstream 100 bases:
>100_bases TATAAGCGTCAGGCAATGCCGTGGACTCGCTTCACGGCATTCGCATTAACGCTATCGACAACGATAAAGTCAGGTTGAAG TTGTGTATATCGGCTAAACT
Product: alanyl-tRNA synthetase
Products: NA
Alternate protein names: Alanine--tRNA ligase; AlaRS [H]
Number of amino acids: Translated: 876; Mature: 875
Protein sequence:
>876_residues MSKSTAEIRQAFLDFFHSKGHQVVASSSLVPHNDPTLLFTNAGMNQFKDVFLGLDKRNYSRATTSQRCVRAGGKHNDLEN VGYTARHHTFFEMLGNFSFGDYFKHDAIQFAWELLTSEKWFALPKERLWVTVYESDDEAYEIWEKEVGIPRERIIRIGDN KGAPYASDNFWQMGDTGPCGPCTEIFYDHGDHIWGGPPGSPEEDGDRYIEIWNIVFMQFNRQADGTMEPLPKPSVDTGMG LERIAAVLQHVNSNYDIDLFRTLIQAVAKVTGATDLSNKSLRVIADHIRSCAFLIADGVMPSNENRGYVLRRIIRRAVRH GNMLGAKETFFYKLVGPLIDVMGSAGEDLKRQQAQVEQVLKTEEEQFARTLERGLALLDEELAKLSGDTLDGETAFRLYD TYGFPVDLTADVCRERNIKVDEAGFEAAMEEQRRRAREASGFGADYNAMIRVDSASEFKGYDHLELNGKVTALFVDGKAV DAINAGQEAVVVLDQTPFYAESGGQVGDKGELKGANFSFVVEDTQKYGQAIGHIGKLAAGSLKVGDAVQADVDEARRARI RLNHSATHLMHAALRQVLGTHVSQKGSLVNDKVLRFDFSHNEAMKPEEIRAVEDLVNAQIRRNLPIETNIMDLEAAKAKG AMALFGEKYDERVRVLSMGDFSTELCGGTHASRTGDIGLFRIISESGTAAGVRRIEAVTGEGAITTVHADSDRLSEVAHL LKGDSNNLADKVRSVLERTRQLEKELQQLKEQAAAQESANLSSKAIDVNGVKLLVSELSGVEPKMLRTMVDDLKNQLGST IIVLATVAEGKVSLIAGVSKDVTDRVKAGELIGMVAQQVGGKGGGRPDMAQAGGTDAAALPAALASVKGWVSAKLQ
Sequences:
>Translated_876_residues MSKSTAEIRQAFLDFFHSKGHQVVASSSLVPHNDPTLLFTNAGMNQFKDVFLGLDKRNYSRATTSQRCVRAGGKHNDLEN VGYTARHHTFFEMLGNFSFGDYFKHDAIQFAWELLTSEKWFALPKERLWVTVYESDDEAYEIWEKEVGIPRERIIRIGDN KGAPYASDNFWQMGDTGPCGPCTEIFYDHGDHIWGGPPGSPEEDGDRYIEIWNIVFMQFNRQADGTMEPLPKPSVDTGMG LERIAAVLQHVNSNYDIDLFRTLIQAVAKVTGATDLSNKSLRVIADHIRSCAFLIADGVMPSNENRGYVLRRIIRRAVRH GNMLGAKETFFYKLVGPLIDVMGSAGEDLKRQQAQVEQVLKTEEEQFARTLERGLALLDEELAKLSGDTLDGETAFRLYD TYGFPVDLTADVCRERNIKVDEAGFEAAMEEQRRRAREASGFGADYNAMIRVDSASEFKGYDHLELNGKVTALFVDGKAV DAINAGQEAVVVLDQTPFYAESGGQVGDKGELKGANFSFVVEDTQKYGQAIGHIGKLAAGSLKVGDAVQADVDEARRARI RLNHSATHLMHAALRQVLGTHVSQKGSLVNDKVLRFDFSHNEAMKPEEIRAVEDLVNAQIRRNLPIETNIMDLEAAKAKG AMALFGEKYDERVRVLSMGDFSTELCGGTHASRTGDIGLFRIISESGTAAGVRRIEAVTGEGAITTVHADSDRLSEVAHL LKGDSNNLADKVRSVLERTRQLEKELQQLKEQAAAQESANLSSKAIDVNGVKLLVSELSGVEPKMLRTMVDDLKNQLGST IIVLATVAEGKVSLIAGVSKDVTDRVKAGELIGMVAQQVGGKGGGRPDMAQAGGTDAAALPAALASVKGWVSAKLQ >Mature_875_residues SKSTAEIRQAFLDFFHSKGHQVVASSSLVPHNDPTLLFTNAGMNQFKDVFLGLDKRNYSRATTSQRCVRAGGKHNDLENV GYTARHHTFFEMLGNFSFGDYFKHDAIQFAWELLTSEKWFALPKERLWVTVYESDDEAYEIWEKEVGIPRERIIRIGDNK GAPYASDNFWQMGDTGPCGPCTEIFYDHGDHIWGGPPGSPEEDGDRYIEIWNIVFMQFNRQADGTMEPLPKPSVDTGMGL ERIAAVLQHVNSNYDIDLFRTLIQAVAKVTGATDLSNKSLRVIADHIRSCAFLIADGVMPSNENRGYVLRRIIRRAVRHG NMLGAKETFFYKLVGPLIDVMGSAGEDLKRQQAQVEQVLKTEEEQFARTLERGLALLDEELAKLSGDTLDGETAFRLYDT YGFPVDLTADVCRERNIKVDEAGFEAAMEEQRRRAREASGFGADYNAMIRVDSASEFKGYDHLELNGKVTALFVDGKAVD AINAGQEAVVVLDQTPFYAESGGQVGDKGELKGANFSFVVEDTQKYGQAIGHIGKLAAGSLKVGDAVQADVDEARRARIR LNHSATHLMHAALRQVLGTHVSQKGSLVNDKVLRFDFSHNEAMKPEEIRAVEDLVNAQIRRNLPIETNIMDLEAAKAKGA MALFGEKYDERVRVLSMGDFSTELCGGTHASRTGDIGLFRIISESGTAAGVRRIEAVTGEGAITTVHADSDRLSEVAHLL KGDSNNLADKVRSVLERTRQLEKELQQLKEQAAAQESANLSSKAIDVNGVKLLVSELSGVEPKMLRTMVDDLKNQLGSTI IVLATVAEGKVSLIAGVSKDVTDRVKAGELIGMVAQQVGGKGGGRPDMAQAGGTDAAALPAALASVKGWVSAKLQ
Specific function: Catalyzes the attachment of alanine to tRNA(Ala) in a two-step reaction:alanine is first activated by ATP to form Ala- AMP and then transferred to the acceptor end of tRNA(Ala). Also edits incorrectly charged Ser-tRNA(Ala) and Gly-tRNA(Ala) via its editin
COG id: COG0013
COG function: function code J; Alanyl-tRNA synthetase
Gene ontology:
Cell location: Cytoplasm [H]
Metaboloic importance: Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the class-II aminoacyl-tRNA synthetase family [H]
Homologues:
Organism=Homo sapiens, GI109148542, Length=979, Percent_Identity=38.6108273748723, Blast_Score=567, Evalue=1e-161, Organism=Homo sapiens, GI38569417, Length=965, Percent_Identity=35.3367875647668, Blast_Score=483, Evalue=1e-136, Organism=Escherichia coli, GI1789048, Length=876, Percent_Identity=99.5433789954338, Blast_Score=1798, Evalue=0.0, Organism=Caenorhabditis elegans, GI17506981, Length=982, Percent_Identity=37.4745417515275, Blast_Score=551, Evalue=1e-157, Organism=Caenorhabditis elegans, GI17536681, Length=740, Percent_Identity=33.7837837837838, Blast_Score=382, Evalue=1e-106, Organism=Saccharomyces cerevisiae, GI6324911, Length=778, Percent_Identity=42.8020565552699, Blast_Score=583, Evalue=1e-167, Organism=Drosophila melanogaster, GI24582809, Length=956, Percent_Identity=37.9707112970711, Blast_Score=575, Evalue=1e-164, Organism=Drosophila melanogaster, GI45552267, Length=956, Percent_Identity=37.9707112970711, Blast_Score=575, Evalue=1e-164, Organism=Drosophila melanogaster, GI24658214, Length=806, Percent_Identity=32.0099255583127, Blast_Score=358, Evalue=8e-99,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR002318 - InterPro: IPR018162 - InterPro: IPR018165 - InterPro: IPR018164 - InterPro: IPR023033 - InterPro: IPR003156 - InterPro: IPR018163 - InterPro: IPR012947 [H]
Pfam domain/function: PF02272 DHHA1; PF01411 tRNA-synt_2c; PF07973 tRNA_SAD [H]
EC number: =6.1.1.7 [H]
Molecular weight: Translated: 96033; Mature: 95902
Theoretical pI: Translated: 5.57; Mature: 5.57
Prosite motif: PS50860 AA_TRNA_LIGASE_II_ALA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.7 %Cys (Translated Protein) 2.4 %Met (Translated Protein) 3.1 %Cys+Met (Translated Protein) 0.7 %Cys (Mature Protein) 2.3 %Met (Mature Protein) 3.0 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MSKSTAEIRQAFLDFFHSKGHQVVASSSLVPHNDPTLLFTNAGMNQFKDVFLGLDKRNYS CCCCHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCEEEEECCCHHHHHHHHHCCCCCCCC RATTSQRCVRAGGKHNDLENVGYTARHHTFFEMLGNFSFGDYFKHDAIQFAWELLTSEKW HHHHHHHHHHCCCCCCCHHHCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCC FALPKERLWVTVYESDDEAYEIWEKEVGIPRERIIRIGDNKGAPYASDNFWQMGDTGPCG CCCCCCCEEEEEECCCCHHHHHHHHHHCCCHHHEEEECCCCCCCCCCCCCEECCCCCCCC PCTEIFYDHGDHIWGGPPGSPEEDGDRYIEIWNIVFMQFNRQADGTMEPLPKPSVDTGMG HHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCC LERIAAVLQHVNSNYDIDLFRTLIQAVAKVTGATDLSNKSLRVIADHIRSCAFLIADGVM HHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCCC PSNENRGYVLRRIIRRAVRHGNMLGAKETFFYKLVGPLIDVMGSAGEDLKRQQAQVEQVL CCCCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH KTEEEQFARTLERGLALLDEELAKLSGDTLDGETAFRLYDTYGFPVDLTADVCRERNIKV HCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCCCCCHHHHHCCCCCC DEAGFEAAMEEQRRRAREASGFGADYNAMIRVDSASEFKGYDHLELNGKVTALFVDGKAV CHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEECCCHHCCCCCEEEECCEEEEEEECCCEE DAINAGQEAVVVLDQTPFYAESGGQVGDKGELKGANFSFVVEDTQKYGQAIGHIGKLAAG CCCCCCCCEEEEEECCCCEECCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHHHHHHHCCC SLKVGDAVQADVDEARRARIRLNHSATHLMHAALRQVLGTHVSQKGSLVNDKVLRFDFSH CCCCCCHHHHHHHHHHHHHEEECCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEECCC NEAMKPEEIRAVEDLVNAQIRRNLPIETNIMDLEAAKAKGAMALFGEKYDERVRVLSMGD CCCCCHHHHHHHHHHHHHHHHCCCCCCCCEEECHHHHHCCCHHHHHCCHHCCEEEEEECC FSTELCGGTHASRTGDIGLFRIISESGTAAGVRRIEAVTGEGAITTVHADSDRLSEVAHL CCHHHCCCCCCCCCCCCEEEEEECCCCCHHHHHHHHHCCCCCCEEEEECCHHHHHHHHHH LKGDSNNLADKVRSVLERTRQLEKELQQLKEQAAAQESANLSSKAIDVNGVKLLVSELSG HCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEECCCHHHHHHHHHCC VEPKMLRTMVDDLKNQLGSTIIVLATVAEGKVSLIAGVSKDVTDRVKAGELIGMVAQQVG CCHHHHHHHHHHHHHHHCCEEEEEEEECCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHC GKGGGRPDMAQAGGTDAAALPAALASVKGWVSAKLQ CCCCCCCCHHHCCCCCHHHHHHHHHHHHHHHHCCCC >Mature Secondary Structure SKSTAEIRQAFLDFFHSKGHQVVASSSLVPHNDPTLLFTNAGMNQFKDVFLGLDKRNYS CCCHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCEEEEECCCHHHHHHHHHCCCCCCCC RATTSQRCVRAGGKHNDLENVGYTARHHTFFEMLGNFSFGDYFKHDAIQFAWELLTSEKW HHHHHHHHHHCCCCCCCHHHCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCC FALPKERLWVTVYESDDEAYEIWEKEVGIPRERIIRIGDNKGAPYASDNFWQMGDTGPCG CCCCCCCEEEEEECCCCHHHHHHHHHHCCCHHHEEEECCCCCCCCCCCCCEECCCCCCCC PCTEIFYDHGDHIWGGPPGSPEEDGDRYIEIWNIVFMQFNRQADGTMEPLPKPSVDTGMG HHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCC LERIAAVLQHVNSNYDIDLFRTLIQAVAKVTGATDLSNKSLRVIADHIRSCAFLIADGVM HHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCCC PSNENRGYVLRRIIRRAVRHGNMLGAKETFFYKLVGPLIDVMGSAGEDLKRQQAQVEQVL CCCCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH KTEEEQFARTLERGLALLDEELAKLSGDTLDGETAFRLYDTYGFPVDLTADVCRERNIKV HCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCCCCCHHHHHCCCCCC DEAGFEAAMEEQRRRAREASGFGADYNAMIRVDSASEFKGYDHLELNGKVTALFVDGKAV CHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEECCCHHCCCCCEEEECCEEEEEEECCCEE DAINAGQEAVVVLDQTPFYAESGGQVGDKGELKGANFSFVVEDTQKYGQAIGHIGKLAAG CCCCCCCCEEEEEECCCCEECCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHHHHHHHCCC SLKVGDAVQADVDEARRARIRLNHSATHLMHAALRQVLGTHVSQKGSLVNDKVLRFDFSH CCCCCCHHHHHHHHHHHHHEEECCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEECCC NEAMKPEEIRAVEDLVNAQIRRNLPIETNIMDLEAAKAKGAMALFGEKYDERVRVLSMGD CCCCCHHHHHHHHHHHHHHHHCCCCCCCCEEECHHHHHCCCHHHHHCCHHCCEEEEEECC FSTELCGGTHASRTGDIGLFRIISESGTAAGVRRIEAVTGEGAITTVHADSDRLSEVAHL CCHHHCCCCCCCCCCCCEEEEEECCCCCHHHHHHHHHCCCCCCEEEEECCHHHHHHHHHH LKGDSNNLADKVRSVLERTRQLEKELQQLKEQAAAQESANLSSKAIDVNGVKLLVSELSG HCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEECCCHHHHHHHHHCC VEPKMLRTMVDDLKNQLGSTIIVLATVAEGKVSLIAGVSKDVTDRVKAGELIGMVAQQVG CCHHHHHHHHHHHHHHHCCEEEEEEEECCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHC GKGGGRPDMAQAGGTDAAALPAALASVKGWVSAKLQ CCCCCCCCHHHCCCCCHHHHHHHHHHHHHHHHCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA