Definition Novosphingobium aromaticivorans DSM 12444 chromosome, complete genome.
Accession NC_007794
Length 3,561,584

Click here to switch to the map view.

The map label for this gene is lip2 [H]

Identifier: 87199773

GI number: 87199773

Start: 1850377

End: 1851333

Strand: Reverse

Name: lip2 [H]

Synonym: Saro_1756

Alternate gene names: 87199773

Gene position: 1851333-1850377 (Counterclockwise)

Preceding gene: 87199778

Following gene: 87199772

Centisome position: 51.98

GC content: 64.68

Gene sequence:

>957_bases
ATGTCTGATCAGACCACCCACTACGTGCGCCCCGATGTTAAGGCTTTCCTCGATTTCCTGAACGGCACCGATGCCCCGCC
GATGAGCCAGCTCGGCCTCGAAGCGGCGCGCGCCAGCTATATCGCAATGGGCCAGCTCGCCGAAGAGCCGCCACGCGACC
TTGCCGTGATCCGCGACCTCACCTGCCCCGGCCCCGCCGGTGACATCCCGCTGCGCCTCTATGATCCGCGCGAGACCCGC
GAGCCCGGTCCCGCGGTCGTCTTCTTCCACGGCGGCGGTTTCGTGATCGGCGACCTGGAAAGCCATCACAGCCTTTGCAC
CGAAATCGCGGCGGAACTCGACATGCCGGTGATCGCGGTGGACTACCGTCTCGCCCCTGAACATCCCTTCCCCGCCGCGC
CAGACGATTGCGAGGCCGCTGCCCGCTGGATCGCCGCCAATTCCGCGACCGCGCTTGGCCGGAAGGTCACCGGGCTCATC
CCGATGGGCGATTCGGCCGGCGGCAACCTGACCATCGTCGTGACGCAGGCCCTGGTCGACGAGCCTGCGGCCGTCCCCGT
CATCATCCAGGTGCCGATCTATCCACTCAGCGACGACAAGCCCGACCACAAGTCGTTCCTCGACTTTGCCGATGGCTTCC
TTCTCACTGCCGATTCCATGCAGTGGTTCGCCAATGCCTATCAGGCGGTTACCGGACACAAGCGGGCTTACCCGATCTAT
GGCGAGCATTCGACCACGCCGCCGACAGTGCTAGTGACCGCCAGCCTCGATCCGATTCGCGACAGCGGGCGGGTCTATGG
CGCGGAACTGATCCGCGCAGGCGCCGAAGTCGTGTTCCTCGAAATGAAGGGCACGATCCACGGCTTCACGCAGGTGCGCA
AGGCGATCCCCAGCGCACAGGCCGATATGCAGTCGATTTTCGCAGCCATCAGGCTGCTGCTGGAAAGGTTGAAATGA

Upstream 100 bases:

>100_bases
CACTCGGTCTTGATTCCTAACCGGCAGGCTAGTCAAGTTGCAAACGCGGACCTACCCCGTTGTAGACAACGAATCCCCAA
GCGAGGGACGAGGAGAGAGC

Downstream 100 bases:

>100_bases
ACGACGCGTTTGCCGGGCTTCCCTATCGTCCCTGTGTCGGGGTCATGCTGGTCAATTCGCAGGGCCGCGTGTTCGTCGGT
CGGCGCATCGACGACAAGGA

Product: lipolytic protein

Products: NA

Alternate protein names: Triacylglycerol lipase [H]

Number of amino acids: Translated: 318; Mature: 317

Protein sequence:

>318_residues
MSDQTTHYVRPDVKAFLDFLNGTDAPPMSQLGLEAARASYIAMGQLAEEPPRDLAVIRDLTCPGPAGDIPLRLYDPRETR
EPGPAVVFFHGGGFVIGDLESHHSLCTEIAAELDMPVIAVDYRLAPEHPFPAAPDDCEAAARWIAANSATALGRKVTGLI
PMGDSAGGNLTIVVTQALVDEPAAVPVIIQVPIYPLSDDKPDHKSFLDFADGFLLTADSMQWFANAYQAVTGHKRAYPIY
GEHSTTPPTVLVTASLDPIRDSGRVYGAELIRAGAEVVFLEMKGTIHGFTQVRKAIPSAQADMQSIFAAIRLLLERLK

Sequences:

>Translated_318_residues
MSDQTTHYVRPDVKAFLDFLNGTDAPPMSQLGLEAARASYIAMGQLAEEPPRDLAVIRDLTCPGPAGDIPLRLYDPRETR
EPGPAVVFFHGGGFVIGDLESHHSLCTEIAAELDMPVIAVDYRLAPEHPFPAAPDDCEAAARWIAANSATALGRKVTGLI
PMGDSAGGNLTIVVTQALVDEPAAVPVIIQVPIYPLSDDKPDHKSFLDFADGFLLTADSMQWFANAYQAVTGHKRAYPIY
GEHSTTPPTVLVTASLDPIRDSGRVYGAELIRAGAEVVFLEMKGTIHGFTQVRKAIPSAQADMQSIFAAIRLLLERLK
>Mature_317_residues
SDQTTHYVRPDVKAFLDFLNGTDAPPMSQLGLEAARASYIAMGQLAEEPPRDLAVIRDLTCPGPAGDIPLRLYDPRETRE
PGPAVVFFHGGGFVIGDLESHHSLCTEIAAELDMPVIAVDYRLAPEHPFPAAPDDCEAAARWIAANSATALGRKVTGLIP
MGDSAGGNLTIVVTQALVDEPAAVPVIIQVPIYPLSDDKPDHKSFLDFADGFLLTADSMQWFANAYQAVTGHKRAYPIYG
EHSTTPPTVLVTASLDPIRDSGRVYGAELIRAGAEVVFLEMKGTIHGFTQVRKAIPSAQADMQSIFAAIRLLLERLK

Specific function: Has An Esterase Activity. Triacetyl Glycerol (Triacetin) Is A Substrate Of The Enzyme. [C]

COG id: COG0657

COG function: function code I; Esterase/lipase

Gene ontology:

Cell location: Cytoplasm [C]

Metaboloic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the 'GDXG' lipolytic enzyme family [H]

Homologues:

Organism=Homo sapiens, GI157041239, Length=131, Percent_Identity=38.1679389312977, Blast_Score=89, Evalue=5e-18,
Organism=Homo sapiens, GI206597554, Length=288, Percent_Identity=28.8194444444444, Blast_Score=84, Evalue=1e-16,
Organism=Homo sapiens, GI68299767, Length=327, Percent_Identity=25.3822629969419, Blast_Score=83, Evalue=4e-16,
Organism=Homo sapiens, GI61966717, Length=132, Percent_Identity=33.3333333333333, Blast_Score=76, Evalue=4e-14,
Organism=Homo sapiens, GI21328446, Length=89, Percent_Identity=44.9438202247191, Blast_Score=74, Evalue=2e-13,
Organism=Escherichia coli, GI1786682, Length=285, Percent_Identity=25.2631578947368, Blast_Score=78, Evalue=6e-16,
Organism=Caenorhabditis elegans, GI71996133, Length=176, Percent_Identity=31.8181818181818, Blast_Score=96, Evalue=3e-20,
Organism=Caenorhabditis elegans, GI72001146, Length=147, Percent_Identity=37.4149659863946, Blast_Score=93, Evalue=2e-19,
Organism=Caenorhabditis elegans, GI17540028, Length=131, Percent_Identity=36.6412213740458, Blast_Score=91, Evalue=6e-19,
Organism=Caenorhabditis elegans, GI17567059, Length=262, Percent_Identity=24.8091603053435, Blast_Score=89, Evalue=4e-18,
Organism=Caenorhabditis elegans, GI115533410, Length=94, Percent_Identity=39.3617021276596, Blast_Score=72, Evalue=3e-13,
Organism=Caenorhabditis elegans, GI115533412, Length=94, Percent_Identity=39.3617021276596, Blast_Score=72, Evalue=3e-13,
Organism=Drosophila melanogaster, GI24656084, Length=106, Percent_Identity=39.622641509434, Blast_Score=77, Evalue=1e-14,
Organism=Drosophila melanogaster, GI24656076, Length=106, Percent_Identity=39.622641509434, Blast_Score=77, Evalue=1e-14,
Organism=Drosophila melanogaster, GI20130169, Length=106, Percent_Identity=39.622641509434, Blast_Score=77, Evalue=1e-14,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR013094
- InterPro:   IPR002168 [H]

Pfam domain/function: PF07859 Abhydrolase_3 [H]

EC number: =3.1.1.3 [H]

Molecular weight: Translated: 34270; Mature: 34139

Theoretical pI: Translated: 4.73; Mature: 4.73

Prosite motif: PS01173 LIPASE_GDXG_HIS

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.9 %Cys     (Translated Protein)
2.5 %Met     (Translated Protein)
3.5 %Cys+Met (Translated Protein)
0.9 %Cys     (Mature Protein)
2.2 %Met     (Mature Protein)
3.2 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MSDQTTHYVRPDVKAFLDFLNGTDAPPMSQLGLEAARASYIAMGQLAEEPPRDLAVIRDL
CCCCCCCEECCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHC
TCPGPAGDIPLRLYDPRETREPGPAVVFFHGGGFVIGDLESHHSLCTEIAAELDMPVIAV
CCCCCCCCCCEEEECCHHCCCCCCEEEEEECCCEEEECCHHHHHHHHHHHHHCCCCEEEE
DYRLAPEHPFPAAPDDCEAAARWIAANSATALGRKVTGLIPMGDSAGGNLTIVVTQALVD
EEEECCCCCCCCCCCHHHHHHHHHHCCCHHHHCCCEEEEEECCCCCCCEEEEEEEEHHHC
EPAAVPVIIQVPIYPLSDDKPDHKSFLDFADGFLLTADSMQWFANAYQAVTGHKRAYPIY
CCCCCEEEEEEEEEECCCCCCCHHHHHHHCCCEEEEECHHHHHHHHHHHHHCCCCCCEEC
GEHSTTPPTVLVTASLDPIRDSGRVYGAELIRAGAEVVFLEMKGTIHGFTQVRKAIPSAQ
CCCCCCCCEEEEEECCCCCCCCCCCHHHHHHHCCCEEEEEEECCCHHHHHHHHHHCCCHH
ADMQSIFAAIRLLLERLK
HHHHHHHHHHHHHHHHCC
>Mature Secondary Structure 
SDQTTHYVRPDVKAFLDFLNGTDAPPMSQLGLEAARASYIAMGQLAEEPPRDLAVIRDL
CCCCCCEECCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHC
TCPGPAGDIPLRLYDPRETREPGPAVVFFHGGGFVIGDLESHHSLCTEIAAELDMPVIAV
CCCCCCCCCCEEEECCHHCCCCCCEEEEEECCCEEEECCHHHHHHHHHHHHHCCCCEEEE
DYRLAPEHPFPAAPDDCEAAARWIAANSATALGRKVTGLIPMGDSAGGNLTIVVTQALVD
EEEECCCCCCCCCCCHHHHHHHHHHCCCHHHHCCCEEEEEECCCCCCCEEEEEEEEHHHC
EPAAVPVIIQVPIYPLSDDKPDHKSFLDFADGFLLTADSMQWFANAYQAVTGHKRAYPIY
CCCCCEEEEEEEEEECCCCCCCHHHHHHHCCCEEEEECHHHHHHHHHHHHHCCCCCCEEC
GEHSTTPPTVLVTASLDPIRDSGRVYGAELIRAGAEVVFLEMKGTIHGFTQVRKAIPSAQ
CCCCCCCCEEEEEECCCCCCCCCCCHHHHHHHCCCEEEEEEECCCHHHHHHHHHHCCCHH
ADMQSIFAAIRLLLERLK
HHHHHHHHHHHHHHHHCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 1907455 [H]