Definition Agrobacterium tumefaciens str. C58 chromosome circular, complete sequence.
Accession NC_003062
Length 2,841,580

Click here to switch to the map view.

The map label for this gene is hisS

Identifier: 15888019

GI number: 15888019

Start: 670894

End: 672417

Strand: Direct

Name: hisS

Synonym: Atu0676

Alternate gene names: 15888019

Gene position: 670894-672417 (Clockwise)

Preceding gene: 159184417

Following gene: 15888020

Centisome position: 23.61

GC content: 60.43

Gene sequence:

>1524_bases
ATGAGCGAAAAAGCAAAAAAGCCTCAGAAACTGAAAGCCCGCCTGCCGCGCGGCTTCGTGGATCGTTCGGCTGCCGATAT
CCATGCCACCAATGAGATGGTCGACAAGATCCGTAGGGTCTACGAGCTTTACGGCTTCGATCCGATCGAGACCCCTCTGT
TCGAATATACCGATGCGCTCGGCAAGTTCCTGCCCGATAGCGACCGCCCGAACGAAGGCGTGTTTTCGCTGCAGGACGAC
GACGACCAATGGATGAGCCTGCGGTACGATCTGACCGCGCCGCTCGCCCGTCATGTTGCGGAAAATTTCAACGAAATACA
GCTGCCCTACCGCACCTATCGCGCCGGTTACGTCTTCCGCAATGAAAAGCCCGGCCCCGGCCGCTTCCGGCAATTCATGC
AGTTCGATGCCGATACGGTGGGTGCTGCCGGTGTGCAGGCCGATGCCGAAATGTGCATGATGATGGCCGATACCATGGAA
GCGCTCGGTATCGCGCGCGGCGACTATGTCATCCGCGTCAACAACCGCAAGGTGCTCGACGGCGTCATGGAAGCCATCGG
TCTCGGCGGAGAGGACAATGCCGGTCGCCGCCTGAATGTGCTGCGCGCCATCGACAAGCTCGACAAGTTCGGCCCGGAAG
GCGTGAAGCTGCTGCTCGGGCCTGGCCGCAAGGATGAATCCGGTGATTTCACCAAGGGTGCGGGTCTTGGCGATGAACAG
ATCGAAAAAGTGCTGTTCTTCGTTGGTATCAAGGATTATGCGGCAAGCGCTGATGATCTCGCAAAACTGGTTGCGGGCAC
ATCCAAAGGCGAGGAAGGCGTTGATGAACTGAACACCATAGGCGCTCTGGTTTCCGGTGCCGGTTATGACGCAACACGCA
TCAAGATCGATCCCTCTGTCGTTCGCGGTCTCGAATATTACACCGGCCCGGTCTATGAGGCTGAGCTGACCTTCGACGTC
ACCAATGAAAAGGGCGAAAAGGTCGTGTTCGGCTCGGTCGGCGGTGGCGGTCGTTACGATGGTCTCGTCTCGCGCTTCAT
GGGTCAGCCGGTTCCAGCCACGGGCTTCTCCATCGGTGTCTCGCGCCTGATGACGGCGCTGAAGAACCTCGGCAAGCTTG
GGCAGGTCAAGCCGCTCGCACCCGTCCTCATCACCGTCATGGACGGCGATGTGGAGAGCATGGGCCGTTACCAGCGCTTC
ACCCAGGCGCTGCGCGCCGAGGGCATCCGCGCCGAAATGTACCAGGGCAACTGGAAGAAATTCGGCAACCAGCTGAAATA
TGCCGATCGCCTCGGCTCTCCCATCGCCATCATCCAGGGCGGTGACGAGCGCGCCGAAGGCGTCGTGCAGATCAAGGATC
TGATCGAAGGCAAGCGCCTCTCCGGCGAAATCGAGGACAATGCGAGCTGGCGCGAGGCGCGTGTGGCACAGGTCAGCGTG
CCGGAAGCCGAGCTGGTGGCGAAGGTCCGCGAGATTCTGGAGCATCAGGCAGAGGACGTTCGCCGCGCCGCCGAAGGGCG
CTGA

Upstream 100 bases:

>100_bases
CCCTGCGCTGCCAGTAAAACCAAAGCCTTTGCGGCTTGCCGCACAGGGCTTCATCCTCTAATACACCGGCATAATTTCAG
GAAAATATCGGATCGGCATC

Downstream 100 bases:

>100_bases
AGCCATGGCGATCCTCGCGCTCGATCATGTTCAACTGGCGATGCCGGCGGGCCGCGAAGAAGAAGCGCGCCGGTTTTACG
GTGACCTGCTCGGTTTCGCG

Product: histidyl-tRNA synthetase

Products: NA

Alternate protein names: Histidine--tRNA ligase; HisRS

Number of amino acids: Translated: 507; Mature: 506

Protein sequence:

>507_residues
MSEKAKKPQKLKARLPRGFVDRSAADIHATNEMVDKIRRVYELYGFDPIETPLFEYTDALGKFLPDSDRPNEGVFSLQDD
DDQWMSLRYDLTAPLARHVAENFNEIQLPYRTYRAGYVFRNEKPGPGRFRQFMQFDADTVGAAGVQADAEMCMMMADTME
ALGIARGDYVIRVNNRKVLDGVMEAIGLGGEDNAGRRLNVLRAIDKLDKFGPEGVKLLLGPGRKDESGDFTKGAGLGDEQ
IEKVLFFVGIKDYAASADDLAKLVAGTSKGEEGVDELNTIGALVSGAGYDATRIKIDPSVVRGLEYYTGPVYEAELTFDV
TNEKGEKVVFGSVGGGGRYDGLVSRFMGQPVPATGFSIGVSRLMTALKNLGKLGQVKPLAPVLITVMDGDVESMGRYQRF
TQALRAEGIRAEMYQGNWKKFGNQLKYADRLGSPIAIIQGGDERAEGVVQIKDLIEGKRLSGEIEDNASWREARVAQVSV
PEAELVAKVREILEHQAEDVRRAAEGR

Sequences:

>Translated_507_residues
MSEKAKKPQKLKARLPRGFVDRSAADIHATNEMVDKIRRVYELYGFDPIETPLFEYTDALGKFLPDSDRPNEGVFSLQDD
DDQWMSLRYDLTAPLARHVAENFNEIQLPYRTYRAGYVFRNEKPGPGRFRQFMQFDADTVGAAGVQADAEMCMMMADTME
ALGIARGDYVIRVNNRKVLDGVMEAIGLGGEDNAGRRLNVLRAIDKLDKFGPEGVKLLLGPGRKDESGDFTKGAGLGDEQ
IEKVLFFVGIKDYAASADDLAKLVAGTSKGEEGVDELNTIGALVSGAGYDATRIKIDPSVVRGLEYYTGPVYEAELTFDV
TNEKGEKVVFGSVGGGGRYDGLVSRFMGQPVPATGFSIGVSRLMTALKNLGKLGQVKPLAPVLITVMDGDVESMGRYQRF
TQALRAEGIRAEMYQGNWKKFGNQLKYADRLGSPIAIIQGGDERAEGVVQIKDLIEGKRLSGEIEDNASWREARVAQVSV
PEAELVAKVREILEHQAEDVRRAAEGR
>Mature_506_residues
SEKAKKPQKLKARLPRGFVDRSAADIHATNEMVDKIRRVYELYGFDPIETPLFEYTDALGKFLPDSDRPNEGVFSLQDDD
DQWMSLRYDLTAPLARHVAENFNEIQLPYRTYRAGYVFRNEKPGPGRFRQFMQFDADTVGAAGVQADAEMCMMMADTMEA
LGIARGDYVIRVNNRKVLDGVMEAIGLGGEDNAGRRLNVLRAIDKLDKFGPEGVKLLLGPGRKDESGDFTKGAGLGDEQI
EKVLFFVGIKDYAASADDLAKLVAGTSKGEEGVDELNTIGALVSGAGYDATRIKIDPSVVRGLEYYTGPVYEAELTFDVT
NEKGEKVVFGSVGGGGRYDGLVSRFMGQPVPATGFSIGVSRLMTALKNLGKLGQVKPLAPVLITVMDGDVESMGRYQRFT
QALRAEGIRAEMYQGNWKKFGNQLKYADRLGSPIAIIQGGDERAEGVVQIKDLIEGKRLSGEIEDNASWREARVAQVSVP
EAELVAKVREILEHQAEDVRRAAEGR

Specific function: Unknown

COG id: COG0124

COG function: function code J; Histidyl-tRNA synthetase

Gene ontology:

Cell location: Cytoplasm

Metaboloic importance: Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the class-II aminoacyl-tRNA synthetase family

Homologues:

Organism=Homo sapiens, GI6996014, Length=483, Percent_Identity=30.6418219461698, Blast_Score=183, Evalue=4e-46,
Organism=Homo sapiens, GI15029520, Length=477, Percent_Identity=29.559748427673, Blast_Score=172, Evalue=6e-43,
Organism=Escherichia coli, GI1788861, Length=460, Percent_Identity=26.5217391304348, Blast_Score=124, Evalue=1e-29,
Organism=Caenorhabditis elegans, GI71993693, Length=470, Percent_Identity=27.8723404255319, Blast_Score=162, Evalue=3e-40,
Organism=Caenorhabditis elegans, GI71993686, Length=470, Percent_Identity=27.8723404255319, Blast_Score=162, Evalue=3e-40,
Organism=Saccharomyces cerevisiae, GI6325290, Length=490, Percent_Identity=30, Blast_Score=163, Evalue=5e-41,
Organism=Drosophila melanogaster, GI24643061, Length=487, Percent_Identity=29.7741273100616, Blast_Score=174, Evalue=1e-43,
Organism=Drosophila melanogaster, GI24643059, Length=507, Percent_Identity=28.7968441814596, Blast_Score=164, Evalue=2e-40,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): SYH_AGRT5 (Q8UHK4)

Other databases:

- EMBL:   AE007869
- PIR:   AF2659
- PIR:   D97441
- RefSeq:   NP_353700.1
- ProteinModelPortal:   Q8UHK4
- STRING:   Q8UHK4
- GeneID:   1132714
- GenomeReviews:   AE007869_GR
- KEGG:   atu:Atu0676
- eggNOG:   COG0124
- HOGENOM:   HBG616575
- OMA:   IGKVFRD
- PhylomeDB:   Q8UHK4
- ProtClustDB:   PRK00037
- BioCyc:   ATUM176299-1:ATU0676-MONOMER
- GO:   GO:0005737
- HAMAP:   MF_00127
- InterPro:   IPR002314
- InterPro:   IPR006195
- InterPro:   IPR004154
- InterPro:   IPR015807
- InterPro:   IPR004516
- Gene3D:   G3DSA:3.40.50.800
- PANTHER:   PTHR11476
- PIRSF:   PIRSF001549
- TIGRFAMs:   TIGR00442

Pfam domain/function: PF03129 HGTP_anticodon; PF00587 tRNA-synt_2b; SSF52954 Anticodon_bd

EC number: =6.1.1.21

Molecular weight: Translated: 55729; Mature: 55598

Theoretical pI: Translated: 4.96; Mature: 4.96

Prosite motif: PS50862 AA_TRNA_LIGASE_II

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.2 %Cys     (Translated Protein)
3.0 %Met     (Translated Protein)
3.2 %Cys+Met (Translated Protein)
0.2 %Cys     (Mature Protein)
2.8 %Met     (Mature Protein)
3.0 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MSEKAKKPQKLKARLPRGFVDRSAADIHATNEMVDKIRRVYELYGFDPIETPLFEYTDAL
CCCCCCCCHHHHHHCCCCHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHH
GKFLPDSDRPNEGVFSLQDDDDQWMSLRYDLTAPLARHVAENFNEIQLPYRTYRAGYVFR
HHHCCCCCCCCCCEEEECCCCCCCEEEEECHHHHHHHHHHCCCCEEECCHHHHHCCEEEE
NEKPGPGRFRQFMQFDADTVGAAGVQADAEMCMMMADTMEALGIARGDYVIRVNNRKVLD
CCCCCHHHHHHHHHHCCHHCCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCHHHH
GVMEAIGLGGEDNAGRRLNVLRAIDKLDKFGPEGVKLLLGPGRKDESGDFTKGAGLGDEQ
HHHHHHCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCCCCCHHH
IEKVLFFVGIKDYAASADDLAKLVAGTSKGEEGVDELNTIGALVSGAGYDATRIKIDPSV
HHHHHHHHHHHHHHCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCCCEEEEECHHH
VRGLEYYTGPVYEAELTFDVTNEKGEKVVFGSVGGGGRYDGLVSRFMGQPVPATGFSIGV
HHHHHHHCCCEEEEEEEEEEECCCCCEEEEECCCCCCHHHHHHHHHHCCCCCCCCHHHHH
SRLMTALKNLGKLGQVKPLAPVLITVMDGDVESMGRYQRFTQALRAEGIRAEMYQGNWKK
HHHHHHHHHHHCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCHHHHHCCCHHH
FGNQLKYADRLGSPIAIIQGGDERAEGVVQIKDLIEGKRLSGEIEDNASWREARVAQVSV
HHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHEEEECC
PEAELVAKVREILEHQAEDVRRAAEGR
CHHHHHHHHHHHHHHHHHHHHHHHCCC
>Mature Secondary Structure 
SEKAKKPQKLKARLPRGFVDRSAADIHATNEMVDKIRRVYELYGFDPIETPLFEYTDAL
CCCCCCCHHHHHHCCCCHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHH
GKFLPDSDRPNEGVFSLQDDDDQWMSLRYDLTAPLARHVAENFNEIQLPYRTYRAGYVFR
HHHCCCCCCCCCCEEEECCCCCCCEEEEECHHHHHHHHHHCCCCEEECCHHHHHCCEEEE
NEKPGPGRFRQFMQFDADTVGAAGVQADAEMCMMMADTMEALGIARGDYVIRVNNRKVLD
CCCCCHHHHHHHHHHCCHHCCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCHHHH
GVMEAIGLGGEDNAGRRLNVLRAIDKLDKFGPEGVKLLLGPGRKDESGDFTKGAGLGDEQ
HHHHHHCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCCCCCHHH
IEKVLFFVGIKDYAASADDLAKLVAGTSKGEEGVDELNTIGALVSGAGYDATRIKIDPSV
HHHHHHHHHHHHHHCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCCCEEEEECHHH
VRGLEYYTGPVYEAELTFDVTNEKGEKVVFGSVGGGGRYDGLVSRFMGQPVPATGFSIGV
HHHHHHHCCCEEEEEEEEEEECCCCCEEEEECCCCCCHHHHHHHHHHCCCCCCCCHHHHH
SRLMTALKNLGKLGQVKPLAPVLITVMDGDVESMGRYQRFTQALRAEGIRAEMYQGNWKK
HHHHHHHHHHHCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCHHHHHCCCHHH
FGNQLKYADRLGSPIAIIQGGDERAEGVVQIKDLIEGKRLSGEIEDNASWREARVAQVSV
HHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHEEEECC
PEAELVAKVREILEHQAEDVRRAAEGR
CHHHHHHHHHHHHHHHHHHHHHHHCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 11743193; 11743194