Definition | Chromohalobacter salexigens DSM 3043 chromosome, complete genome. |
---|---|
Accession | NC_007963 |
Length | 3,696,649 bp |
The map label for this gene is ytxM [H]
Identifier: 92112274
GI number: 92112274
Start: 160347
End: 161201
Strand: Direct
Name: ytxM [H]
Synonym: Csal_0139
Alternate gene names: 92112274
Gene position: 160347-161201 (Clockwise)
Preceding gene: 92112273
Following gene: 92112275
Centisome position: 4.34
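The coordinate fields above are related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming that the centisome position is the start coordinate expressed as a percentage of the chromosome length (the record does not state its definition; the function names are illustrative only):

```python
GENOME_LENGTH = 3_696_649        # chromosome length from the record header
START, END = 160_347, 161_201    # gene coordinates from the record

def gene_length(start, end):
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1

def centisome(start, genome_length):
    """Start position as a percentage of the chromosome (assumed definition)."""
    return round(100 * start / genome_length, 2)

print(gene_length(START, END))          # 855
print(centisome(START, GENOME_LENGTH))  # 4.34
```

Both values agree with the 855-base gene sequence and the centisome position listed in this record.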
GC content: 68.3%
Gene sequence:
>855_bases ATGAACGTGCCGGACGACGCCCTCGAGGCGGCGGAAGTCCAGTGTCGCGAGGTGTCGCTTGCGTCCGGACGTCAGGCGTT TTGCGACATGGGCAGTGGCATCCCCGTCGTGCTGTTGCATGGCATCAGTTCCGGAGCCCGTTCCTGGGCACCGCTGATGC ATCAGGCGACGGGAGTCCGCTGGCTGGCCTGGGACGCGCCGGGCTACGGCGAGAGCTCGGCGCTGGCCGAGCCCCATCCC ACGGCACGGGACTATGCCTTGCGGCTGGCGGCATGGCTGGAAGCGCTGGCACTCGAGCGCGTGGTGCTGATCGGCCACTC ACTGGGTGCGTTGATCGCGAGTGCCTATGCCCGTGACTTCCCTGACCGGGTCTCGGGGCTGCTGCTGGCCGACCCTGCCC AGGGCTATCGCCATGCCGATCCCGACAAGCGCGATGCCGTTTACCGGAGTCGTTGGACGCAACTCGCCGCGCAAGGGCAC GCGGCCTACGCAGCGGCGCGAGCGCCCAGGCTGTTGCGCGAGAACGCTCGCGTGGAAGATATCGCGCGTGTCCAGGCCGG CATGCACCGCCTCGAGGTGTCGGGATTCGCCCAGGCCAGCTGGATGCTGGCCAACGACTCGCTGGAGGACCATGCGAGCG GCACGTCCGTGCCCACTCGGGTGCTATGCGGCGACGAGGACCGCATCACGCCGCCGAGCGGTGCGCGTGCCCTGGCGGAG CGTCTCGGGGTCGCGTATCGCGACATCCCGTGTGCCGGGCACATCAGTTATATCGACGCTCCCGCTGCTTTCGCCGCCGC CGTTGCGGATTTCATGGCGACGCTGTCGCCGAGCCAGGAGAAAAGCGACTTATGA
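A GC content figure like the one listed for this gene can be recomputed from the sequence itself. The sketch below is an illustration only: it assumes the value is the share of G and C among unambiguous bases, and `gc_percent` is a hypothetical helper, not part of the database. Whitespace is stripped first because the record breaks the sequence into space-separated blocks.

```python
def gc_percent(seq):
    """Return G+C content as a percentage of the A/C/G/T bases in seq."""
    seq = "".join(seq.split()).upper()          # drop spaces and newlines
    counted = [base for base in seq if base in "ACGT"]
    if not counted:
        raise ValueError("no A/C/G/T bases in sequence")
    gc = sum(base in "GC" for base in counted)
    return round(100 * gc / len(counted), 1)

# Usage on the first 18 bases of the gene sequence above
print(gc_percent("ATGAACGTGCCGGACGAC"))
```

Passing the full 855-base sequence to the same function would give the gene-level figure, provided the database used the same definition.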
Upstream 100 bases:
>100_bases GCGAGGAAGAGGCACTCATGTGCGTCATGCTGGGCACACCCAAACCGCAGATTCCGACGTATCCGGACGACCATCCGCTA TCCAAGATCAAGCGGCAATC
Downstream 100 bases:
>100_bases GTTTTCAGATCGAACGACGTGTGGCCGTGGTCACCGGTGGCTCCTCGGGAATCGGACTCGAAACCGTGCGCCTGCTGCTC GAATCGGGCGCCAGCGTGGC
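The coordinates of the 100-base flanking regions follow from the gene coordinates and strand. This is a rough sketch, assuming 1-based inclusive coordinates, flanks that sit immediately next to the gene, and no wrap-around at the chromosome origin. `flank_coords` is a hypothetical helper, and any strand label other than "Direct" is treated as the complementary strand.

```python
def flank_coords(start, end, strand, n=100):
    """Return ((up_start, up_end), (down_start, down_end)) in genome coordinates."""
    left = (start - n, start - 1)    # region before the lower coordinate
    right = (end + 1, end + n)       # region after the higher coordinate
    if strand == "Direct":
        return left, right
    # On the complementary strand the gene reads right to left,
    # so upstream lies at the higher coordinates.
    return right, left

upstream, downstream = flank_coords(160347, 161201, "Direct")
print(upstream)    # (160247, 160346)
print(downstream)  # (161202, 161301)
```

For a complementary-strand gene, the extracted flanks would also need to be reverse-complemented before display.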
Product: alpha/beta hydrolase
Products: 2-oxopent-4-enoate; succinate [C]
Alternate protein names: NA
Number of amino acids: Translated: 284; Mature: 284
Protein sequence:
>284_residues MNVPDDALEAAEVQCREVSLASGRQAFCDMGSGIPVVLLHGISSGARSWAPLMHQATGVRWLAWDAPGYGESSALAEPHP TARDYALRLAAWLEALALERVVLIGHSLGALIASAYARDFPDRVSGLLLADPAQGYRHADPDKRDAVYRSRWTQLAAQGH AAYAAARAPRLLRENARVEDIARVQAGMHRLEVSGFAQASWMLANDSLEDHASGTSVPTRVLCGDEDRITPPSGARALAE RLGVAYRDIPCAGHISYIDAPAAFAAAVADFMATLSPSQEKSDL
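The 284-residue protein corresponds to the 855-base gene: 285 codons, the last of which is a stop codon. The sketch below translates DNA with the standard genetic code, which is an assumption here since the record does not name a translation table. `translate` is an illustrative helper.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGAACGTGCCGGACGAC"))  # first 6 codons: MNVPDD
print(translate("AGCGACTTATGA"))        # last 4 codons: SDL, then stop
print(855 // 3 - 1)                     # 284 residues after removing the stop
```

The first and last codons of the gene sequence reproduce the start (MNVPDD) and end (SDL) of the listed protein sequence.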
Sequences:
>Translated_284_residues MNVPDDALEAAEVQCREVSLASGRQAFCDMGSGIPVVLLHGISSGARSWAPLMHQATGVRWLAWDAPGYGESSALAEPHP TARDYALRLAAWLEALALERVVLIGHSLGALIASAYARDFPDRVSGLLLADPAQGYRHADPDKRDAVYRSRWTQLAAQGH AAYAAARAPRLLRENARVEDIARVQAGMHRLEVSGFAQASWMLANDSLEDHASGTSVPTRVLCGDEDRITPPSGARALAE RLGVAYRDIPCAGHISYIDAPAAFAAAVADFMATLSPSQEKSDL
>Mature_284_residues MNVPDDALEAAEVQCREVSLASGRQAFCDMGSGIPVVLLHGISSGARSWAPLMHQATGVRWLAWDAPGYGESSALAEPHP TARDYALRLAAWLEALALERVVLIGHSLGALIASAYARDFPDRVSGLLLADPAQGYRHADPDKRDAVYRSRWTQLAAQGH AAYAAARAPRLLRENARVEDIARVQAGMHRLEVSGFAQASWMLANDSLEDHASGTSVPTRVLCGDEDRITPPSGARALAE RLGVAYRDIPCAGHISYIDAPAAFAAAVADFMATLSPSQEKSDL
Specific function: 3-hydroxyphenylpropionate degradation. [C]
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the lipase/esterase LIP3/BchO family [H]
Homologues:
Organism=Homo sapiens, GI50658087, Length=141, Percent_Identity=29.7872340425532, Blast_Score=67, Evalue=1e-11
Organism=Caenorhabditis elegans, GI17558492, Length=112, Percent_Identity=33.0357142857143, Blast_Score=73, Evalue=2e-13
Organism=Caenorhabditis elegans, GI32566936, Length=112, Percent_Identity=33.0357142857143, Blast_Score=72, Evalue=3e-13
Organism=Caenorhabditis elegans, GI25146278, Length=118, Percent_Identity=30.5084745762712, Blast_Score=70, Evalue=1e-12
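Homologue entries in the `Organism=..., GI..., Length=..., Percent_Identity=..., Blast_Score=..., Evalue=...` form can be parsed into structured records for sorting or filtering. This is a small sketch under the assumption that every entry follows exactly this field order. The regular expression and `parse_homologues` are illustrative and not part of the database.

```python
import re

HOMOLOGUE_RE = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),"
    r"\s*Percent_Identity=(?P<identity>[\d.]+),\s*Blast_Score=(?P<score>\d+),"
    r"\s*Evalue=(?P<evalue>[\d.eE+-]+)"
)

def parse_homologues(text):
    """Return one dict per homologue entry found in text."""
    hits = []
    for m in HOMOLOGUE_RE.finditer(text):
        hits.append({
            "organism": m["organism"].strip(),
            "gi": m["gi"],
            "length": int(m["length"]),
            "identity": float(m["identity"]),
            "score": int(m["score"]),
            "evalue": float(m["evalue"]),
        })
    return hits

# Two entries copied from this record
sample = (
    "Organism=Homo sapiens, GI50658087, Length=141, "
    "Percent_Identity=29.7872340425532, Blast_Score=67, Evalue=1e-11, "
    "Organism=Caenorhabditis elegans, GI17558492, Length=112, "
    "Percent_Identity=33.0357142857143, Blast_Score=73, Evalue=2e-13,"
)
for hit in sorted(parse_homologues(sample), key=lambda h: h["evalue"]):
    print(hit["organism"], hit["gi"], round(hit["identity"], 1), hit["evalue"])
```

Sorting by E-value puts the most significant hit first. The parser works whether the entries sit on one line or on separate lines.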
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000073
- InterPro: IPR000639
- InterPro: IPR022485 [H]
Pfam domain/function: PF00561 Abhydrolase_1 [H]
EC number: 3.7.1.- [C]
Molecular weight: Translated: 30362; Mature: 30362
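A molecular weight of this kind is commonly estimated by summing average residue masses and adding one water molecule for the free termini. The sketch below assumes that approach and uses standard average residue masses. The method behind the listed value is not stated in the record, the result has not been checked here against 30362, and `molecular_weight` is a hypothetical helper.

```python
# Average residue masses in daltons (amino acid minus one water)
RESIDUE_MASS = {
    "A": 71.0788, "R": 156.1875, "N": 114.1038, "D": 115.0886,
    "C": 103.1388, "E": 129.1155, "Q": 128.1307, "G": 57.0519,
    "H": 137.1411, "I": 113.1594, "L": 113.1594, "K": 128.1741,
    "M": 131.1926, "F": 147.1766, "P": 97.1167, "S": 87.0782,
    "T": 101.1051, "W": 186.2132, "Y": 163.1760, "V": 99.1326,
}
WATER = 18.01528  # added once for the N- and C-terminal groups

def molecular_weight(protein):
    """Estimate the average molecular weight of an unmodified protein chain."""
    protein = "".join(protein.split()).upper()
    return sum(RESIDUE_MASS[aa] for aa in protein) + WATER

# Usage on the first six residues of the protein
print(round(molecular_weight("MNVPDD"), 1))
```

Applying the function to the full 284-residue sequence gives an estimate that can be compared with the listed value. Small differences would be expected if the database used a different mass table.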
Theoretical pI: Translated: 5.80; Mature: 5.80
Prosite motif: PS00120 LIPASE_SER
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.4 %Cys (Translated Protein)
2.1 %Met (Translated Protein)
3.5 %Cys+Met (Translated Protein)
1.4 %Cys (Mature Protein)
2.1 %Met (Mature Protein)
3.5 %Cys+Met (Mature Protein)
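The Cys/Met percentages can be reproduced by counting residues in the protein sequence given in this record. A minimal sketch, assuming the figures are residue counts divided by the total length and rounded to one decimal place (`percent` is an illustrative helper):

```python
# 284-residue protein sequence from this record (translated = mature)
PROTEIN = (
    "MNVPDDALEAAEVQCREVSLASGRQAFCDMGSGIPVVLLHGISSGARSWAPLMHQATGVRWLAWDAPGYGESSALAEPHP"
    "TARDYALRLAAWLEALALERVVLIGHSLGALIASAYARDFPDRVSGLLLADPAQGYRHADPDKRDAVYRSRWTQLAAQGH"
    "AAYAAARAPRLLRENARVEDIARVQAGMHRLEVSGFAQASWMLANDSLEDHASGTSVPTRVLCGDEDRITPPSGARALAE"
    "RLGVAYRDIPCAGHISYIDAPAAFAAAVADFMATLSPSQEKSDL"
)

def percent(seq, residues):
    """Percentage of positions in seq occupied by any of the given residues."""
    hits = sum(seq.count(r) for r in residues)
    return round(100 * hits / len(seq), 1)

print(len(PROTEIN))            # 284
print(percent(PROTEIN, "C"))   # 1.4  (4 Cys)
print(percent(PROTEIN, "M"))   # 2.1  (6 Met)
print(percent(PROTEIN, "CM"))  # 3.5  (10 Cys+Met)
```

Because the translated and mature sequences are identical, the same percentages apply to both.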
Secondary structure:
>Translated Secondary Structure
MNVPDDALEAAEVQCREVSLASGRQAFCDMGSGIPVVLLHGISSGARSWAPLMHQATGVR
CCCCHHHHHHHHHHHHHHHHHCCCHHHHHCCCCCCEEEEECCCCCHHHHHHHHHHHCCCE
WLAWDAPGYGESSALAEPHPTARDYALRLAAWLEALALERVVLIGHSLGALIASAYARDF
EEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC
PDRVSGLLLADPAQGYRHADPDKRDAVYRSRWTQLAAQGHAAYAAARAPRLLRENARVED
HHHHCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCHHH
IARVQAGMHRLEVSGFAQASWMLANDSLEDHASGTSVPTRVLCGDEDRITPPSGARALAE
HHHHHHHHHHHHHCCHHHHEEEEECCCCHHHCCCCCCCEEEEECCCCCCCCCCHHHHHHH
RLGVAYRDIPCAGHISYIDAPAAFAAAVADFMATLSPSQEKSDL
HHCCHHHCCCCCCCCHHHCCHHHHHHHHHHHHHHCCCCCCCCCC
>Mature Secondary Structure
MNVPDDALEAAEVQCREVSLASGRQAFCDMGSGIPVVLLHGISSGARSWAPLMHQATGVR
CCCCHHHHHHHHHHHHHHHHHCCCHHHHHCCCCCCEEEEECCCCCHHHHHHHHHHHCCCE
WLAWDAPGYGESSALAEPHPTARDYALRLAAWLEALALERVVLIGHSLGALIASAYARDF
EEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC
PDRVSGLLLADPAQGYRHADPDKRDAVYRSRWTQLAAQGHAAYAAARAPRLLRENARVED
HHHHCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCHHH
IARVQAGMHRLEVSGFAQASWMLANDSLEDHASGTSVPTRVLCGDEDRITPPSGARALAE
HHHHHHHHHHHHHCCHHHHEEEEECCCCHHHCCCCCCCEEEEECCCCCCCCCCHHHHHHH
RLGVAYRDIPCAGHISYIDAPAAFAAAVADFMATLSPSQEKSDL
HHCCHHHCCCCCCCCHHHCCHHHHHHHHHHHHHHCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: 2-hydroxy-6-ketononadienedicarboxylate; H2O [C]
Specific reaction: 2-hydroxy-6-ketononadienedicarboxylate + H2O = 2-oxopent-4-enoate + succinate [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 8566759; 9387221; 9384377 [H]