Definition: Mycobacterium bovis BCG str. Pasteur 1173P2, complete genome.
Accession: NC_008769
Length: 4,374,522

The map label for this gene is pheS

Identifier: 121637557

GI number: 121637557

Start: 1863979

End: 1865004

Strand: Direct

Name: pheS

Synonym: BCG_1688

Alternate gene names: 121637557

Gene position: 1863979-1865004 (Clockwise)

Preceding gene: 121637556

Following gene: 121637558

Centisome position: 42.61
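
The gene length and centisome position can be reproduced from the coordinates above. This is a minimal sketch, assuming 1-based inclusive coordinates and assuming the centisome position is the gene start expressed as a percentage of the genome length (the record does not state its formula). The function names are illustrative and not part of the source database.

```python
GENOME_LENGTH = 4_374_522           # "Length" field of this record
START, END = 1_863_979, 1_865_004   # "Start" and "End" fields of this record


def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(START, END))                    # 1026
print(round(centisome(START, GENOME_LENGTH), 2))  # 42.61
```

Both values agree with the 1026-base gene sequence and the centisome position reported in this record.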

GC content: 66.18

Gene sequence:

>1026_bases
ATGTTGTCGCCGGAGGCATTGACCACGGCGGTCGACGCCGCCCAGCAGGCCATCGCCCTAGCGGACACCCTGGACGTCCT
GGCGCGCGTCAAGACGGAGCATCTCGGCGACCGCTCGCCGTTGGCGCTGGCGCGGCAGGCGCTGGCCGTGCTGCCCAAAG
AACAGCGAGCCGAGGCCGGTAAGCGCGTCAACGCCGCCCGCAATGCCGCTCAGCGCAGCTACGACGAACGGCTGGCGACG
CTGCGTGCCGAGCGCGACGCGGCCGTGCTGGTGGCCGAAGGTATCGATGTCACATTGCCCTCGACTCGGGTGCCGGCCGG
CGCCCGGCACCCGATCATCATGTTGGCCGAACACGTCGCCGACACGTTCATCGCGATGGGATGGGAACTGGCCGAGGGGC
CCGAGGTGGAGACCGAGCAGTTCAACTTCGACGCCCTCAACTTCCCTGCCGACCACCCTGCGCGCGGCGAACAAGATACC
TTCTACATCGCGCCGGAGGATTCGCGGCAGCTGCTGCGCACCCATACCTCACCGGTGCAGATTCGCACCCTGCTAGCGCG
TGAGCTGCCGGTCTACATCATCTCGATCGGTCGTACCTTTCGCACCGACGAACTCGACGCCACCCACACGCCCATCTTCC
ATCAGGTGGAAGGCCTAGCGGTGGACCGCGGTCTGTCGATGGCTCACCTACGTGGAACGCTGGACGCTTTTGCGCGCGCC
GAGTTCGGGCCGTCTGCGCGGACCCGGATCCGGCCACACTTCTTCCCCTTCACCGAACCGTCCGCCGAGGTCGATGTGTG
GTTTGCCAACAAGATTGGCGGCGCCGACTGGGTGGAGTGGGGCGGGTGCGGAATGGTGCATCCGAACGTGTTGCGGGCCA
CCGGCATTGATCCCGATCTCTACTCCGGTTTCGCGTTCGGGATGGGGTTGGAACGCACCCTGCAGTTTCGCAACGGCATT
CCTGACATGCGCGACATGGTCGAAGGCGACGTCCGATTCTCGTTGCCGTTCGGGGTGGGTGCCTGA
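
The GC content field can be checked against the gene sequence above. The following is a sketch only, assuming GC content is defined as the percentage of G and C bases over the full sequence; `gc_content` is a hypothetical helper name.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases; whitespace and line breaks are ignored."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_bases = seq.count("G") + seq.count("C")
    return 100.0 * gc_bases / len(seq)


# Toy input: two of the four bases are G or C.
print(gc_content("ATGC"))  # 50.0
```

Passing the full 1026-base sequence (with or without line breaks) gives a value that can be compared with the reported 66.18.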

Upstream 100 bases:

>100_bases
TCCAGTTTCCGCTCCGCGACGATGCGGGCGGTCCGAATAGCCTCGTCAGCAAGGAGAGTGGCGCCGCGTGGGTGATCCCC
CCCTCGAGTCGATTGTGTCG

Downstream 100 bases:

>100_bases
TGCGGCTACCCTACAGCTGGCTGCGCGAGGTGGTTGCGGTCGGCGCTTCGGGCTGGGACGTTACCCCAGGCGAACTCGAG
CAGACGCTGTTGCGCATCGG
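
The upstream and downstream blocks correspond to fixed-width windows on either side of the gene. A minimal sketch of that slicing, assuming the genome is available as a Python string, the gene is on the direct strand, coordinates are 1-based and inclusive, and wraparound at the origin of the circular chromosome can be ignored (`flanking_regions` is a hypothetical helper):

```python
def flanking_regions(genome: str, start: int, end: int, width: int = 100):
    """Return (upstream, gene, downstream) for a direct-strand gene.

    start and end are 1-based, inclusive coordinates; windows are
    truncated at the ends of the string rather than wrapped around.
    """
    gene_from = start - 1                 # convert to 0-based slice start
    upstream = genome[max(0, gene_from - width):gene_from]
    gene = genome[gene_from:end]
    downstream = genome[end:end + width]
    return upstream, gene, downstream


# Toy genome: the "gene" CCCGGG occupies positions 6-11.
print(flanking_regions("AAAAACCCGGGTTTTT", 6, 11, width=3))
# ('AAA', 'CCCGGG', 'TTT')
```

For this record the call would use start 1863979, end 1865004, and the default width of 100.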

Product: phenylalanyl-tRNA synthetase subunit alpha

Products: NA

Alternate protein names: Phenylalanine--tRNA ligase alpha chain; PheRS

Number of amino acids: Translated: 341; Mature: 341

Protein sequence:

>341_residues
MLSPEALTTAVDAAQQAIALADTLDVLARVKTEHLGDRSPLALARQALAVLPKEQRAEAGKRVNAARNAAQRSYDERLAT
LRAERDAAVLVAEGIDVTLPSTRVPAGARHPIIMLAEHVADTFIAMGWELAEGPEVETEQFNFDALNFPADHPARGEQDT
FYIAPEDSRQLLRTHTSPVQIRTLLARELPVYIISIGRTFRTDELDATHTPIFHQVEGLAVDRGLSMAHLRGTLDAFARA
EFGPSARTRIRPHFFPFTEPSAEVDVWFANKIGGADWVEWGGCGMVHPNVLRATGIDPDLYSGFAFGMGLERTLQFRNGI
PDMRDMVEGDVRFSLPFGVGA
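
The protein sequence above follows from translating the gene sequence codon by codon. A minimal sketch using the standard genetic code, with translation stopping at the first stop codon; the table construction and the `translate` name are illustrative, not taken from the source database.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1 until the first stop codon (not included)."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGTTGTCGCCGGAGGCATTGACC"))  # MLSPEALT (start of the gene)
print(translate("TTCGGGGTGGGTGCCTGA"))        # FGVGA (end of the gene)
```

The 1026 bases make up 342 codons: 341 amino acids plus the terminal TGA stop codon, consistent with the reported protein length.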

Sequences:

>Translated_341_residues
MLSPEALTTAVDAAQQAIALADTLDVLARVKTEHLGDRSPLALARQALAVLPKEQRAEAGKRVNAARNAAQRSYDERLAT
LRAERDAAVLVAEGIDVTLPSTRVPAGARHPIIMLAEHVADTFIAMGWELAEGPEVETEQFNFDALNFPADHPARGEQDT
FYIAPEDSRQLLRTHTSPVQIRTLLARELPVYIISIGRTFRTDELDATHTPIFHQVEGLAVDRGLSMAHLRGTLDAFARA
EFGPSARTRIRPHFFPFTEPSAEVDVWFANKIGGADWVEWGGCGMVHPNVLRATGIDPDLYSGFAFGMGLERTLQFRNGI
PDMRDMVEGDVRFSLPFGVGA
>Mature_341_residues
MLSPEALTTAVDAAQQAIALADTLDVLARVKTEHLGDRSPLALARQALAVLPKEQRAEAGKRVNAARNAAQRSYDERLAT
LRAERDAAVLVAEGIDVTLPSTRVPAGARHPIIMLAEHVADTFIAMGWELAEGPEVETEQFNFDALNFPADHPARGEQDT
FYIAPEDSRQLLRTHTSPVQIRTLLARELPVYIISIGRTFRTDELDATHTPIFHQVEGLAVDRGLSMAHLRGTLDAFARA
EFGPSARTRIRPHFFPFTEPSAEVDVWFANKIGGADWVEWGGCGMVHPNVLRATGIDPDLYSGFAFGMGLERTLQFRNGI
PDMRDMVEGDVRFSLPFGVGA

Specific function: Unknown

COG id: COG0016

COG function: function code J; Phenylalanyl-tRNA synthetase alpha subunit

Gene ontology:

Cell location: Cytoplasm

Metabolic importance: Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the class-II aminoacyl-tRNA synthetase family. Phe-tRNA synthetase alpha chain type 1 subfamily

Homologues:

Organism=Homo sapiens, GI4758340, Length=274, Percent_Identity=32.4817518248175, Blast_Score=131, Evalue=1e-30
Organism=Homo sapiens, GI5729820, Length=353, Percent_Identity=28.6118980169972, Blast_Score=104, Evalue=1e-22
Organism=Escherichia coli, GI1788007, Length=331, Percent_Identity=46.5256797583082, Blast_Score=291, Evalue=5e-80
Organism=Caenorhabditis elegans, GI32563657, Length=274, Percent_Identity=33.2116788321168, Blast_Score=137, Evalue=1e-32
Organism=Caenorhabditis elegans, GI17508957, Length=274, Percent_Identity=33.2116788321168, Blast_Score=136, Evalue=1e-32
Organism=Caenorhabditis elegans, GI32566635, Length=285, Percent_Identity=30.8771929824561, Blast_Score=111, Evalue=5e-25
Organism=Saccharomyces cerevisiae, GI6321087, Length=266, Percent_Identity=31.9548872180451, Blast_Score=127, Evalue=2e-30
Organism=Saccharomyces cerevisiae, GI6325304, Length=285, Percent_Identity=27.719298245614, Blast_Score=102, Evalue=6e-23
Organism=Drosophila melanogaster, GI18858079, Length=271, Percent_Identity=33.5793357933579, Blast_Score=125, Evalue=5e-29
Organism=Drosophila melanogaster, GI17137424, Length=251, Percent_Identity=29.0836653386454, Blast_Score=105, Evalue=4e-23
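
Each homologue line is a comma-separated list of `key=value` fields plus a bare GI identifier, so it can be loaded into a dictionary for sorting or filtering. A rough sketch, assuming every line follows the layout shown above; `parse_homologue` and the `GI` key are names introduced here for illustration.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=...' line into a dict."""
    record = {}
    # A trailing comma, if present, is dropped before splitting.
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare identifier without a key
    # Convert the numeric fields; the names match the record's own keys.
    casts = (("Length", int), ("Percent_Identity", float),
             ("Blast_Score", int), ("Evalue", float))
    for key, cast in casts:
        if key in record:
            record[key] = cast(record[key])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1788007, Length=331, "
    "Percent_Identity=46.5256797583082, Blast_Score=291, Evalue=5e-80,"
)
print(hit["Organism"], hit["Blast_Score"])  # Escherichia coli 291
```

With all lines parsed, sorting by `Blast_Score` or `Evalue` puts the Escherichia coli hit first, since it has the highest score and lowest E-value in the list.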

Paralogues:

None

Copy number: 2,000 molecules/cell in glucose minimal media [C]

Swissprot (AC and ID): SYFA_MYCBO (Q7VEV4)

Other databases:

- EMBL:   BX248339
- RefSeq:   NP_855329.1
- ProteinModelPortal:   Q7VEV4
- SMR:   Q7VEV4
- EnsemblBacteria:   EBMYCT00000018116
- GeneID:   1092619
- GenomeReviews:   BX248333_GR
- KEGG:   mbo:Mb1676
- GeneTree:   EBGT00050000016426
- HOGENOM:   HBG284353
- OMA:   FRASYFP
- ProtClustDB:   PRK00488
- BioCyc:   MBOV233413:MB1676-MONOMER
- BRENDA:   6.1.1.20
- GO:   GO:0005737
- HAMAP:   MF_00281
- InterPro:   IPR006195
- InterPro:   IPR004529
- InterPro:   IPR004188
- InterPro:   IPR022911
- InterPro:   IPR002319
- InterPro:   IPR010978
- PANTHER:   PTHR11538
- TIGRFAMs:   TIGR00468

Pfam domain/function: PF02912 Phe_tRNA-synt_N; PF01409 tRNA-synt_2d; SSF46589 tRNA_binding_arm

EC number: 6.1.1.20

Molecular weight: Translated: 37416; Mature: 37416

Theoretical pI: Translated: 5.12; Mature: 5.12

Prosite motif: PS50862 AA_TRNA_LIGASE_II

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.3 %Cys     (Translated Protein)
2.3 %Met     (Translated Protein)
2.6 %Cys+Met (Translated Protein)
0.3 %Cys     (Mature Protein)
2.3 %Met     (Mature Protein)
2.6 %Cys+Met (Mature Protein)
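
The Cys/Met percentages are residue counts divided by protein length. A short sketch of that arithmetic, assuming values are rounded to one decimal place as displayed; `residue_percent` is a hypothetical helper. The 341-residue sequence in this record contains 1 Cys and 8 Met, and the synthetic string below has the same composition.

```python
def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given residues."""
    seq = "".join(sequence.split()).upper()
    hits = sum(seq.count(residue) for residue in residues)
    return round(100.0 * hits / len(seq), 1)


# Synthetic 341-residue stand-in: 1 Cys, 8 Met, 332 Ala.
stand_in = "C" + "M" * 8 + "A" * 332
print(residue_percent(stand_in, "C"))   # 0.3
print(residue_percent(stand_in, "M"))   # 2.3
print(residue_percent(stand_in, "CM"))  # 2.6
```

The translated and mature proteins are identical here, which is why both sets of percentages match.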

Secondary structure:

>Translated Secondary Structure
MLSPEALTTAVDAAQQAIALADTLDVLARVKTEHLGDRSPLALARQALAVLPKEQRAEAG
CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCHHHHHHH
KRVNAARNAAQRSYDERLATLRAERDAAVLVAEGIDVTLPSTRVPAGARHPIIMLAEHVA
HHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCEEECCCCCCCCCCCCCHHHHHHHHH
DTFIAMGWELAEGPEVETEQFNFDALNFPADHPARGEQDTFYIAPEDSRQLLRTHTSPVQ
HHHHHHCCCCCCCCCCCCCCCCCCEECCCCCCCCCCCCCEEEECCCCHHHHHHHCCCHHH
IRTLLARELPVYIISIGRTFRTDELDATHTPIFHQVEGLAVDRGLSMAHLRGTLDAFARA
HHHHHHHHCCEEEEEECCEECCCCCCCCCCCHHHHHCCEEHHCCCCHHHHHHHHHHHHHH
EFGPSARTRIRPHFFPFTEPSAEVDVWFANKIGGADWVEWGGCGMVHPNVLRATGIDPDL
CCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCEECCCCCCCCCCHHEECCCCHHH
YSGFAFGMGLERTLQFRNGIPDMRDMVEGDVRFSLPFGVGA
HHHHHHCCCHHHHHHHHCCCCHHHHHHCCCEEEECCCCCCC
>Mature Secondary Structure
MLSPEALTTAVDAAQQAIALADTLDVLARVKTEHLGDRSPLALARQALAVLPKEQRAEAG
CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCHHHHHHH
KRVNAARNAAQRSYDERLATLRAERDAAVLVAEGIDVTLPSTRVPAGARHPIIMLAEHVA
HHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCEEECCCCCCCCCCCCCHHHHHHHHH
DTFIAMGWELAEGPEVETEQFNFDALNFPADHPARGEQDTFYIAPEDSRQLLRTHTSPVQ
HHHHHHCCCCCCCCCCCCCCCCCCEECCCCCCCCCCCCCEEEECCCCHHHHHHHCCCHHH
IRTLLARELPVYIISIGRTFRTDELDATHTPIFHQVEGLAVDRGLSMAHLRGTLDAFARA
HHHHHHHHCCEEEEEECCEECCCCCCCCCCCHHHHHCCEEHHCCCCHHHHHHHHHHHHHH
EFGPSARTRIRPHFFPFTEPSAEVDVWFANKIGGADWVEWGGCGMVHPNVLRATGIDPDL
CCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCEECCCCCCCCCCHHEECCCCHHH
YSGFAFGMGLERTLQFRNGIPDMRDMVEGDVRFSLPFGVGA
HHHHHHCCCHHHHHHHHCCCCHHHHHHCCCEEEECCCCCCC
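
The prediction strings above can be summarized as a per-state composition. A minimal sketch, assuming the conventional three-state notation (H for helix, E for strand, C for coil), which the record itself does not define; `state_composition` is an illustrative name.

```python
from collections import Counter


def state_composition(states: str) -> dict:
    """Percentage of residues assigned to each secondary-structure state."""
    cleaned = "".join(states.split()).upper()
    counts = Counter(cleaned)
    return {state: 100.0 * n / len(cleaned) for state, n in counts.items()}


# Toy prediction string: 2 helix, 1 strand, 1 coil.
print(state_composition("HHEC"))  # {'H': 50.0, 'E': 25.0, 'C': 25.0}
```

To apply it to this record, concatenate only the state lines (the H/E/C rows), not the interleaved amino-acid rows.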

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 12788972