Definition | Agrobacterium tumefaciens str. C58 chromosome circular, complete sequence. |
---|---|
Accession | NC_003062 |
Length | 2,841,580 |
Click here to switch to the map view.
The map label for this gene is hisS
Identifier: 15888019
GI number: 15888019
Start: 670894
End: 672417
Strand: Direct
Name: hisS
Synonym: Atu0676
Alternate gene names: 15888019
Gene position: 670894-672417 (Clockwise)
Preceding gene: 159184417
Following gene: 15888020
Centisome position: 23.61
GC content: 60.43
Gene sequence:
>1524_bases ATGAGCGAAAAAGCAAAAAAGCCTCAGAAACTGAAAGCCCGCCTGCCGCGCGGCTTCGTGGATCGTTCGGCTGCCGATAT CCATGCCACCAATGAGATGGTCGACAAGATCCGTAGGGTCTACGAGCTTTACGGCTTCGATCCGATCGAGACCCCTCTGT TCGAATATACCGATGCGCTCGGCAAGTTCCTGCCCGATAGCGACCGCCCGAACGAAGGCGTGTTTTCGCTGCAGGACGAC GACGACCAATGGATGAGCCTGCGGTACGATCTGACCGCGCCGCTCGCCCGTCATGTTGCGGAAAATTTCAACGAAATACA GCTGCCCTACCGCACCTATCGCGCCGGTTACGTCTTCCGCAATGAAAAGCCCGGCCCCGGCCGCTTCCGGCAATTCATGC AGTTCGATGCCGATACGGTGGGTGCTGCCGGTGTGCAGGCCGATGCCGAAATGTGCATGATGATGGCCGATACCATGGAA GCGCTCGGTATCGCGCGCGGCGACTATGTCATCCGCGTCAACAACCGCAAGGTGCTCGACGGCGTCATGGAAGCCATCGG TCTCGGCGGAGAGGACAATGCCGGTCGCCGCCTGAATGTGCTGCGCGCCATCGACAAGCTCGACAAGTTCGGCCCGGAAG GCGTGAAGCTGCTGCTCGGGCCTGGCCGCAAGGATGAATCCGGTGATTTCACCAAGGGTGCGGGTCTTGGCGATGAACAG ATCGAAAAAGTGCTGTTCTTCGTTGGTATCAAGGATTATGCGGCAAGCGCTGATGATCTCGCAAAACTGGTTGCGGGCAC ATCCAAAGGCGAGGAAGGCGTTGATGAACTGAACACCATAGGCGCTCTGGTTTCCGGTGCCGGTTATGACGCAACACGCA TCAAGATCGATCCCTCTGTCGTTCGCGGTCTCGAATATTACACCGGCCCGGTCTATGAGGCTGAGCTGACCTTCGACGTC ACCAATGAAAAGGGCGAAAAGGTCGTGTTCGGCTCGGTCGGCGGTGGCGGTCGTTACGATGGTCTCGTCTCGCGCTTCAT GGGTCAGCCGGTTCCAGCCACGGGCTTCTCCATCGGTGTCTCGCGCCTGATGACGGCGCTGAAGAACCTCGGCAAGCTTG GGCAGGTCAAGCCGCTCGCACCCGTCCTCATCACCGTCATGGACGGCGATGTGGAGAGCATGGGCCGTTACCAGCGCTTC ACCCAGGCGCTGCGCGCCGAGGGCATCCGCGCCGAAATGTACCAGGGCAACTGGAAGAAATTCGGCAACCAGCTGAAATA TGCCGATCGCCTCGGCTCTCCCATCGCCATCATCCAGGGCGGTGACGAGCGCGCCGAAGGCGTCGTGCAGATCAAGGATC TGATCGAAGGCAAGCGCCTCTCCGGCGAAATCGAGGACAATGCGAGCTGGCGCGAGGCGCGTGTGGCACAGGTCAGCGTG CCGGAAGCCGAGCTGGTGGCGAAGGTCCGCGAGATTCTGGAGCATCAGGCAGAGGACGTTCGCCGCGCCGCCGAAGGGCG CTGA
Upstream 100 bases:
>100_bases CCCTGCGCTGCCAGTAAAACCAAAGCCTTTGCGGCTTGCCGCACAGGGCTTCATCCTCTAATACACCGGCATAATTTCAG GAAAATATCGGATCGGCATC
Downstream 100 bases:
>100_bases AGCCATGGCGATCCTCGCGCTCGATCATGTTCAACTGGCGATGCCGGCGGGCCGCGAAGAAGAAGCGCGCCGGTTTTACG GTGACCTGCTCGGTTTCGCG
Product: histidyl-tRNA synthetase
Products: NA
Alternate protein names: Histidine--tRNA ligase; HisRS
Number of amino acids: Translated: 507; Mature: 506
Protein sequence:
>507_residues MSEKAKKPQKLKARLPRGFVDRSAADIHATNEMVDKIRRVYELYGFDPIETPLFEYTDALGKFLPDSDRPNEGVFSLQDD DDQWMSLRYDLTAPLARHVAENFNEIQLPYRTYRAGYVFRNEKPGPGRFRQFMQFDADTVGAAGVQADAEMCMMMADTME ALGIARGDYVIRVNNRKVLDGVMEAIGLGGEDNAGRRLNVLRAIDKLDKFGPEGVKLLLGPGRKDESGDFTKGAGLGDEQ IEKVLFFVGIKDYAASADDLAKLVAGTSKGEEGVDELNTIGALVSGAGYDATRIKIDPSVVRGLEYYTGPVYEAELTFDV TNEKGEKVVFGSVGGGGRYDGLVSRFMGQPVPATGFSIGVSRLMTALKNLGKLGQVKPLAPVLITVMDGDVESMGRYQRF TQALRAEGIRAEMYQGNWKKFGNQLKYADRLGSPIAIIQGGDERAEGVVQIKDLIEGKRLSGEIEDNASWREARVAQVSV PEAELVAKVREILEHQAEDVRRAAEGR
Sequences:
>Translated_507_residues MSEKAKKPQKLKARLPRGFVDRSAADIHATNEMVDKIRRVYELYGFDPIETPLFEYTDALGKFLPDSDRPNEGVFSLQDD DDQWMSLRYDLTAPLARHVAENFNEIQLPYRTYRAGYVFRNEKPGPGRFRQFMQFDADTVGAAGVQADAEMCMMMADTME ALGIARGDYVIRVNNRKVLDGVMEAIGLGGEDNAGRRLNVLRAIDKLDKFGPEGVKLLLGPGRKDESGDFTKGAGLGDEQ IEKVLFFVGIKDYAASADDLAKLVAGTSKGEEGVDELNTIGALVSGAGYDATRIKIDPSVVRGLEYYTGPVYEAELTFDV TNEKGEKVVFGSVGGGGRYDGLVSRFMGQPVPATGFSIGVSRLMTALKNLGKLGQVKPLAPVLITVMDGDVESMGRYQRF TQALRAEGIRAEMYQGNWKKFGNQLKYADRLGSPIAIIQGGDERAEGVVQIKDLIEGKRLSGEIEDNASWREARVAQVSV PEAELVAKVREILEHQAEDVRRAAEGR >Mature_506_residues SEKAKKPQKLKARLPRGFVDRSAADIHATNEMVDKIRRVYELYGFDPIETPLFEYTDALGKFLPDSDRPNEGVFSLQDDD DQWMSLRYDLTAPLARHVAENFNEIQLPYRTYRAGYVFRNEKPGPGRFRQFMQFDADTVGAAGVQADAEMCMMMADTMEA LGIARGDYVIRVNNRKVLDGVMEAIGLGGEDNAGRRLNVLRAIDKLDKFGPEGVKLLLGPGRKDESGDFTKGAGLGDEQI EKVLFFVGIKDYAASADDLAKLVAGTSKGEEGVDELNTIGALVSGAGYDATRIKIDPSVVRGLEYYTGPVYEAELTFDVT NEKGEKVVFGSVGGGGRYDGLVSRFMGQPVPATGFSIGVSRLMTALKNLGKLGQVKPLAPVLITVMDGDVESMGRYQRFT QALRAEGIRAEMYQGNWKKFGNQLKYADRLGSPIAIIQGGDERAEGVVQIKDLIEGKRLSGEIEDNASWREARVAQVSVP EAELVAKVREILEHQAEDVRRAAEGR
Specific function: Unknown
COG id: COG0124
COG function: function code J; Histidyl-tRNA synthetase
Gene ontology:
Cell location: Cytoplasm
Metaboloic importance: Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the class-II aminoacyl-tRNA synthetase family
Homologues:
Organism=Homo sapiens, GI6996014, Length=483, Percent_Identity=30.6418219461698, Blast_Score=183, Evalue=4e-46, Organism=Homo sapiens, GI15029520, Length=477, Percent_Identity=29.559748427673, Blast_Score=172, Evalue=6e-43, Organism=Escherichia coli, GI1788861, Length=460, Percent_Identity=26.5217391304348, Blast_Score=124, Evalue=1e-29, Organism=Caenorhabditis elegans, GI71993693, Length=470, Percent_Identity=27.8723404255319, Blast_Score=162, Evalue=3e-40, Organism=Caenorhabditis elegans, GI71993686, Length=470, Percent_Identity=27.8723404255319, Blast_Score=162, Evalue=3e-40, Organism=Saccharomyces cerevisiae, GI6325290, Length=490, Percent_Identity=30, Blast_Score=163, Evalue=5e-41, Organism=Drosophila melanogaster, GI24643061, Length=487, Percent_Identity=29.7741273100616, Blast_Score=174, Evalue=1e-43, Organism=Drosophila melanogaster, GI24643059, Length=507, Percent_Identity=28.7968441814596, Blast_Score=164, Evalue=2e-40,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): SYH_AGRT5 (Q8UHK4)
Other databases:
- EMBL: AE007869 - PIR: AF2659 - PIR: D97441 - RefSeq: NP_353700.1 - ProteinModelPortal: Q8UHK4 - STRING: Q8UHK4 - GeneID: 1132714 - GenomeReviews: AE007869_GR - KEGG: atu:Atu0676 - eggNOG: COG0124 - HOGENOM: HBG616575 - OMA: IGKVFRD - PhylomeDB: Q8UHK4 - ProtClustDB: PRK00037 - BioCyc: ATUM176299-1:ATU0676-MONOMER - GO: GO:0005737 - HAMAP: MF_00127 - InterPro: IPR002314 - InterPro: IPR006195 - InterPro: IPR004154 - InterPro: IPR015807 - InterPro: IPR004516 - Gene3D: G3DSA:3.40.50.800 - PANTHER: PTHR11476 - PIRSF: PIRSF001549 - TIGRFAMs: TIGR00442
Pfam domain/function: PF03129 HGTP_anticodon; PF00587 tRNA-synt_2b; SSF52954 Anticodon_bd
EC number: =6.1.1.21
Molecular weight: Translated: 55729; Mature: 55598
Theoretical pI: Translated: 4.96; Mature: 4.96
Prosite motif: PS50862 AA_TRNA_LIGASE_II
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.2 %Cys (Translated Protein) 3.0 %Met (Translated Protein) 3.2 %Cys+Met (Translated Protein) 0.2 %Cys (Mature Protein) 2.8 %Met (Mature Protein) 3.0 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MSEKAKKPQKLKARLPRGFVDRSAADIHATNEMVDKIRRVYELYGFDPIETPLFEYTDAL CCCCCCCCHHHHHHCCCCHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHH GKFLPDSDRPNEGVFSLQDDDDQWMSLRYDLTAPLARHVAENFNEIQLPYRTYRAGYVFR HHHCCCCCCCCCCEEEECCCCCCCEEEEECHHHHHHHHHHCCCCEEECCHHHHHCCEEEE NEKPGPGRFRQFMQFDADTVGAAGVQADAEMCMMMADTMEALGIARGDYVIRVNNRKVLD CCCCCHHHHHHHHHHCCHHCCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCHHHH GVMEAIGLGGEDNAGRRLNVLRAIDKLDKFGPEGVKLLLGPGRKDESGDFTKGAGLGDEQ HHHHHHCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCCCCCHHH IEKVLFFVGIKDYAASADDLAKLVAGTSKGEEGVDELNTIGALVSGAGYDATRIKIDPSV HHHHHHHHHHHHHHCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCCCEEEEECHHH VRGLEYYTGPVYEAELTFDVTNEKGEKVVFGSVGGGGRYDGLVSRFMGQPVPATGFSIGV HHHHHHHCCCEEEEEEEEEEECCCCCEEEEECCCCCCHHHHHHHHHHCCCCCCCCHHHHH SRLMTALKNLGKLGQVKPLAPVLITVMDGDVESMGRYQRFTQALRAEGIRAEMYQGNWKK HHHHHHHHHHHCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCHHHHHCCCHHH FGNQLKYADRLGSPIAIIQGGDERAEGVVQIKDLIEGKRLSGEIEDNASWREARVAQVSV HHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHEEEECC PEAELVAKVREILEHQAEDVRRAAEGR CHHHHHHHHHHHHHHHHHHHHHHHCCC >Mature Secondary Structure SEKAKKPQKLKARLPRGFVDRSAADIHATNEMVDKIRRVYELYGFDPIETPLFEYTDAL CCCCCCCHHHHHHCCCCHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHH GKFLPDSDRPNEGVFSLQDDDDQWMSLRYDLTAPLARHVAENFNEIQLPYRTYRAGYVFR HHHCCCCCCCCCCEEEECCCCCCCEEEEECHHHHHHHHHHCCCCEEECCHHHHHCCEEEE NEKPGPGRFRQFMQFDADTVGAAGVQADAEMCMMMADTMEALGIARGDYVIRVNNRKVLD CCCCCHHHHHHHHHHCCHHCCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCHHHH GVMEAIGLGGEDNAGRRLNVLRAIDKLDKFGPEGVKLLLGPGRKDESGDFTKGAGLGDEQ HHHHHHCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCCCCCHHH IEKVLFFVGIKDYAASADDLAKLVAGTSKGEEGVDELNTIGALVSGAGYDATRIKIDPSV HHHHHHHHHHHHHHCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCCCEEEEECHHH VRGLEYYTGPVYEAELTFDVTNEKGEKVVFGSVGGGGRYDGLVSRFMGQPVPATGFSIGV HHHHHHHCCCEEEEEEEEEEECCCCCEEEEECCCCCCHHHHHHHHHHCCCCCCCCHHHHH SRLMTALKNLGKLGQVKPLAPVLITVMDGDVESMGRYQRFTQALRAEGIRAEMYQGNWKK HHHHHHHHHHHCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCHHHHHCCCHHH FGNQLKYADRLGSPIAIIQGGDERAEGVVQIKDLIEGKRLSGEIEDNASWREARVAQVSV HHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHEEEECC PEAELVAKVREILEHQAEDVRRAAEGR CHHHHHHHHHHHHHHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 11743193; 11743194