Definition | Burkholderia cenocepacia HI2424 chromosome 1, complete sequence. |
---|---|
Accession | NC_008542 |
Length | 3,483,902 |
The map label for this gene is pcm [H]
Identifier: 116689843
GI number: 116689843
Start: 2024167
End: 2025099
Strand: Reverse
Name: pcm [H]
Synonym: Bcen2424_1822
Alternate gene names: 116689843
Gene position: 2025099-2024167 (Counterclockwise)
Preceding gene: 116689844
Following gene: 116689842
Centisome position: 58.13
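The coordinate fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates, and assumes that the centisome position is the gene's 5' coordinate (the End value for this reverse-strand gene) expressed as a percentage of the chromosome length. That convention is inferred from the listed numbers, not stated in the record, and the function names are illustrative.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, replicon_length: int) -> float:
    """Position as a percentage of the replicon length, rounded to 2 decimals."""
    return round(100.0 * position / replicon_length, 2)


# Values copied from the record above.
CHROMOSOME_LENGTH = 3_483_902
START, END = 2_024_167, 2_025_099

length = gene_length(START, END)
# Assumption: the centisome value is computed from the End coordinate,
# which is the 5' end of a gene on the reverse strand.
position_pct = centisome(END, CHROMOSOME_LENGTH)

print(length, position_pct)
```

Under these assumptions the length agrees with the 933 bases reported for the gene sequence, and 933 is divisible by 3, as expected for a coding sequence that includes its stop codon.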
GC content: 70.53
Gene sequence:
>933_bases ATGAGCGGCGAGCGCGCGAAGCGGTTCCCGCTCGCGCTCGAAGATCTCAAGCGAGCGCCACGCAAGTCGGAAGGTCGGCC CGGTGAACGCCAGACGGCGGGGGCGGTGCCGAAGGCTGCCGACAAACCCGCGGCCGTGCTGAAGCCGGTTGCGGTGAAGC CGGCCGCCGTGCGGGCGCCGCTGCCGGGTATCGCCGCCGCGAAGCCGGCGACCGCACCGAAGCCCACCGCGCTGAAGCCT GCGCTGCCGAAGCCGGCCGCGCCGAGCATCGCGCCGGCCGGCGCGTTCGCCCTCACGTCGGAACGTGTGCGCGAGCGGAT GGTCGAACGCCTGCGCGCGAACGGCGTGACCGATGCGCGCGTGCTGGACGCAATGGCCGCGGTGCCGCGCCACCTGTTCG TGGATCCCGGGCTCGCGACGCAGGCCTACGAGGATTCGGCATTGCCGATCGGCCATCAGCAGACCATTTCAAAGCCGTCG GTCGTCGCGCGCATGATCGAGCTCGCGATGGCCGGCCGCACCCTCGAGCGCGTGCTCGAGATCGGCACGGGTTGCGGCTA TCAGGCCGCCGTGCTGAGTCACGTGGCACGCGACGTGTATTCGATTGAACGCATCAAGCCGCTTTACGAGCGCGCGAAGC TGAACCTGCGGCCGCTGCGCGTGCCGAACATCCGTCTGCACTACGGCGACGGGCGTGTCGGCTTGCCGTCCGCGGCCCCG TTCGACGCGATCGTGATCGCGGCGGCGGGGCTCGACGTGCCGCAGGCGCTGCTCGAGCAGCTCGCGATCGGCGGGCGGCT CGTCGCGCCGGTCGGCGCGCAGAGCGGGCAGCACCAGGTGCTCACGCTCGTCGAGCGCGTCGCGCACGCGCAATGGCGAG AGTCCCGGCTTGATCGCGTTTTCTTTGTCCCTTTAAAATCCGGAGTGATTTGA
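A GC content figure like the one above is conventionally the percentage of G and C bases in the sequence. The following is a minimal sketch of that calculation, assuming the record uses this plain definition; `gc_content` is an illustrative name, and the example input is only the first 15 bases of the gene sequence.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace, rounded to 2 decimals."""
    seq = "".join(seq.split()).upper()  # drop the spaces used to chunk the sequence
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return round(100.0 * gc / len(seq), 2)


# First five codons of the gene sequence listed above.
first_codons = "ATGAGCGGCGAGCGC"
print(gc_content(first_codons))
```

Passing the full 933-base sequence (with or without the embedded spaces) to the same function would give the whole-gene value for comparison with the GC content field.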
Upstream 100 bases:
>100_bases CAAACGGTTTCGTCTCGATCACACCGCTGCAACTCGATCTCACGCATACGCAGATGCTGCCCGCGATGCGCGAATGGGCG CGCGCCGGAGGGCGGGCTTC
Downstream 100 bases:
>100_bases AACCGATGAGTATGTTGCGCGCGATGCAAAACAACCGATCCAGGGAACCGCTCACGTTCGCCCAGCGCGCGATCTGTGTG GCTGCGTTCTCCACGCTACT
Product: protein-L-isoaspartate O-methyltransferase
Products: NA
Alternate protein names: L-isoaspartyl protein carboxyl methyltransferase; Protein L-isoaspartyl methyltransferase; Protein-beta-aspartate methyltransferase; PIMT [H]
Number of amino acids: Translated: 310; Mature: 309
Protein sequence:
>310_residues MSGERAKRFPLALEDLKRAPRKSEGRPGERQTAGAVPKAADKPAAVLKPVAVKPAAVRAPLPGIAAAKPATAPKPTALKP ALPKPAAPSIAPAGAFALTSERVRERMVERLRANGVTDARVLDAMAAVPRHLFVDPGLATQAYEDSALPIGHQQTISKPS VVARMIELAMAGRTLERVLEIGTGCGYQAAVLSHVARDVYSIERIKPLYERAKLNLRPLRVPNIRLHYGDGRVGLPSAAP FDAIVIAAAGLDVPQALLEQLAIGGRLVAPVGAQSGQHQVLTLVERVAHAQWRESRLDRVFFVPLKSGVI
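The protein sequence can be checked against the gene sequence by translating the coding strand codon by codon. The sketch below uses the standard genetic code, on the assumption that it is adequate here (the bacterial code differs from it in the permitted start codons, not in how internal codons are read). The names `CODON_TABLE` and `translate` are illustrative and not part of the record.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces used to chunk the sequence
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First five codons and last seven codons of the gene sequence in this record.
print(translate("ATGAGCGGCGAGCGC"))
print(translate("TTAAAATCCGGAGTGATTTGA"))
```

The two fragments correspond to the start (`MSGER`) and end (`LKSGVI`) of the listed protein; the final `TGA` is the stop codon, which is why 933 bases yield 310 residues.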
Sequences:
>Translated_310_residues MSGERAKRFPLALEDLKRAPRKSEGRPGERQTAGAVPKAADKPAAVLKPVAVKPAAVRAPLPGIAAAKPATAPKPTALKP ALPKPAAPSIAPAGAFALTSERVRERMVERLRANGVTDARVLDAMAAVPRHLFVDPGLATQAYEDSALPIGHQQTISKPS VVARMIELAMAGRTLERVLEIGTGCGYQAAVLSHVARDVYSIERIKPLYERAKLNLRPLRVPNIRLHYGDGRVGLPSAAP FDAIVIAAAGLDVPQALLEQLAIGGRLVAPVGAQSGQHQVLTLVERVAHAQWRESRLDRVFFVPLKSGVI
>Mature_309_residues SGERAKRFPLALEDLKRAPRKSEGRPGERQTAGAVPKAADKPAAVLKPVAVKPAAVRAPLPGIAAAKPATAPKPTALKPA LPKPAAPSIAPAGAFALTSERVRERMVERLRANGVTDARVLDAMAAVPRHLFVDPGLATQAYEDSALPIGHQQTISKPSV VARMIELAMAGRTLERVLEIGTGCGYQAAVLSHVARDVYSIERIKPLYERAKLNLRPLRVPNIRLHYGDGRVGLPSAAPF DAIVIAAAGLDVPQALLEQLAIGGRLVAPVGAQSGQHQVLTLVERVAHAQWRESRLDRVFFVPLKSGVI
Specific function: Catalyzes the methyl esterification of L-isoaspartyl residues in peptides and proteins that result from spontaneous decomposition of normal L-aspartyl and L-asparaginyl residues. It plays a role in the repair and/or degradation of damaged proteins [H]
COG id: COG2518
COG function: function code O; Protein-L-isoaspartate carboxylmethyltransferase
Gene ontology:
Cell location: Cytoplasm [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the methyltransferase superfamily. L-isoaspartyl/D-aspartyl protein methyltransferase family [H]
Homologues:
Organism | GI | Length | Percent identity | BLAST score | E-value |
---|---|---|---|---|---|
Homo sapiens | GI226530908 | 214 | 33.6448598130841 | 89 | 5e-18 |
Escherichia coli | GI1789100 | 211 | 45.4976303317536 | 161 | 4e-41 |
Caenorhabditis elegans | GI71983477 | 223 | 32.2869955156951 | 85 | 4e-17 |
Caenorhabditis elegans | GI193207222 | 219 | 31.5068493150685 | 79 | 4e-15 |
Drosophila melanogaster | GI17981723 | 218 | 32.5688073394495 | 83 | 2e-16 |
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000682 [H]
Pfam domain/function: PF01135 PCMT [H]
EC number: 2.1.1.77 [H]
Molecular weight: Translated: 32970; Mature: 32839
Theoretical pI: Translated: 11.04; Mature: 11.04
Prosite motif: PS01279 PCMT
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.3 %Cys (Translated Protein)
1.6 %Met (Translated Protein)
1.9 %Cys+Met (Translated Protein)
0.3 %Cys (Mature Protein)
1.3 %Met (Mature Protein)
1.6 %Cys+Met (Mature Protein)
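The Cys/Met percentages can be recomputed from the protein sequence by counting residues. The sketch below embeds the translated sequence from this record and assumes, as the record's Mature_309_residues entry indicates, that the mature form differs only by loss of the initiator methionine. The helper name `percent` is illustrative.

```python
# Translated protein sequence copied from this record (310 residues).
TRANSLATED = (
    "MSGERAKRFPLALEDLKRAPRKSEGRPGERQTAGAVPKAADKPAAVLKPVAVKPAAVRAPLPGIAAAKPATAPKPTALKP"
    "ALPKPAAPSIAPAGAFALTSERVRERMVERLRANGVTDARVLDAMAAVPRHLFVDPGLATQAYEDSALPIGHQQTISKPS"
    "VVARMIELAMAGRTLERVLEIGTGCGYQAAVLSHVARDVYSIERIKPLYERAKLNLRPLRVPNIRLHYGDGRVGLPSAAP"
    "FDAIVIAAAGLDVPQALLEQLAIGGRLVAPVGAQSGQHQVLTLVERVAHAQWRESRLDRVFFVPLKSGVI"
)
# Assumption: the mature protein is the translated one minus its first residue (Met).
MATURE = TRANSLATED[1:]


def percent(seq: str, residues: str) -> float:
    """Share of the given one-letter residues in seq, as a percentage (1 decimal)."""
    count = sum(seq.count(r) for r in residues)
    return round(100.0 * count / len(seq), 1)


for label, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(label, percent(seq, "C"), percent(seq, "M"), percent(seq, "CM"))
```

The sequence contains one cysteine and five methionines, so removing the initiator methionine lowers the Met share from 1.6 % to 1.3 % while the Cys share stays at 0.3 %, in line with the listed values.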
Secondary structure:
>Translated Secondary Structure
MSGERAKRFPLALEDLKRAPRKSEGRPGERQTAGAVPKAADKPAAVLKPVAVKPAAVRAP
CCCCCCHHCCCHHHHHHCCCCCCCCCCCCCHHCCCCCCCCCCCHHHHCCCCCCCHHHCCC
LPGIAAAKPATAPKPTALKPALPKPAAPSIAPAGAFALTSERVRERMVERLRANGVTDAR
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCHHH
VLDAMAAVPRHLFVDPGLATQAYEDSALPIGHQQTISKPSVVARMIELAMAGRTLERVLE
HHHHHHHHHHHHEECCCCCCCCCCCCCCCCCCHHHCCCHHHHHHHHHHHHHHHHHHHHHH
IGTGCGYQAAVLSHVARDVYSIERIKPLYERAKLNLRPLRVPNIRLHYGDGRVGLPSAAP
HHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEECCEEEEEECCCCCCCCCCCC
FDAIVIAAAGLDVPQALLEQLAIGGRLVAPVGAQSGQHQVLTLVERVAHAQWRESRLDRV
HHHHHHHHCCCCHHHHHHHHHHHCCEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHCCEE
FFVPLKSGVI
EEEECCCCCC
>Mature Secondary Structure
SGERAKRFPLALEDLKRAPRKSEGRPGERQTAGAVPKAADKPAAVLKPVAVKPAAVRAP
CCCCCHHCCCHHHHHHCCCCCCCCCCCCCHHCCCCCCCCCCCHHHHCCCCCCCHHHCCC
LPGIAAAKPATAPKPTALKPALPKPAAPSIAPAGAFALTSERVRERMVERLRANGVTDAR
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCHHH
VLDAMAAVPRHLFVDPGLATQAYEDSALPIGHQQTISKPSVVARMIELAMAGRTLERVLE
HHHHHHHHHHHHEECCCCCCCCCCCCCCCCCCHHHCCCHHHHHHHHHHHHHHHHHHHHHH
IGTGCGYQAAVLSHVARDVYSIERIKPLYERAKLNLRPLRVPNIRLHYGDGRVGLPSAAP
HHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEECCEEEEEECCCCCCCCCCCC
FDAIVIAAAGLDVPQALLEQLAIGGRLVAPVGAQSGQHQVLTLVERVAHAQWRESRLDRV
HHHHHHHHCCCCHHHHHHHHHHHCCEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHCCEE
FFVPLKSGVI
EEEECCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: NA