Definition: Novosphingobium aromaticivorans DSM 12444 chromosome, complete genome.
Accession: NC_007794
Length: 3,561,584

The map label for this gene is 87198821

Identifier: 87198821

GI number: 87198821

Start: 849016

End: 850071

Strand: Direct

Name: 87198821

Synonym: Saro_0799

Alternate gene names: NA

Gene position: 849016-850071 (Clockwise)

Preceding gene: 87198820

Following gene: 87198822

Centisome position: 23.84
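
The gene length and centisome position can be checked from the coordinates above. This is a minimal sketch that assumes the centisome position is the start coordinate expressed as a percentage of the genome length; the record does not state the formula, and the function names are illustrative.

```python
# Coordinates taken from this record.
GENOME_LENGTH = 3_561_584   # Length
GENE_START = 849_016        # Start
GENE_END = 850_071          # End


def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the genome length (assumed formula)."""
    return 100.0 * start / genome_length


print(gene_length(GENE_START, GENE_END))                # 1056
print(round(centisome(GENE_START, GENOME_LENGTH), 2))   # 23.84
```

The 1056 bases correspond to 352 codons: 351 amino acids plus the stop codon.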

GC content: 63.35
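
GC content is conventionally the percentage of G and C bases in a sequence. The sketch below shows that calculation on a short fragment; `gc_content` is an illustrative name, and the record does not say which tool produced the reported value. A value of 63.35 over 1056 bases corresponds to 669 G or C bases.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 12 bases of the gene sequence in this record.
print(gc_content("ATGACACAAGAC"))
```

To reproduce the figure for the whole gene, pass the full 1056-base sequence with line breaks removed.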

Gene sequence:

>1056_bases
ATGACACAAGACCTTAAGACCGGCGGCGAGCAGGGCTACCTGCGCATCGCCACCGAGGAAGCCTTCGCCACGCGCGAGAT
CATCGACGTCTACCTGCGCATGATCCGCGATGGCACTGCCGACAAGGGCATGGTCTCGCTCTGGGGCTTCTACGCCCAGT
CCCCCTCAGAGCGCGCCACCCAGATCCTCGAACGCCTGCTCGATCTTGGCGAGCGCCGCATCGCCGACATGGACGCGACC
GGCATCGACAAGGCTATCCTCGCGCTGACCTCGCCCGGCGTCCAGCCGCTGCACGACCTTGACGAGGCCAGGACGCTCGC
CACCCGCGCCAACGACACGCTTGCCGACGCGTGCCAAAAGTACCCAGACCGCTTCATCGGCATGGGCACCGTCGCCCCGC
AGGACCCGGAATGGTCCGCGCGCGAGATCCATCGTGGTGCCAGGGAACTGGGCTTCAAGGGCATCCAGATCAACAGCCAC
ACGCAAGGGCGCTACCTCGACGAGGAGTTCTTCGACCCGATCTTCCGCGCCCTCGTTGAAGTCGACCAGCCGCTCTACAT
CCACCCTGCCACTTCGCCCGATTCCATGATCGACCCGATGCTCGAAGCGGGCCTCGACGGCGCCATCTTCGGCTTCGGCG
TGGAGACGGGCATGCACCTGCTGCGCCTCATCACCATCGGCATCTTCGACAAGTATCCCAGCCTTCAGATCATGGTCGGC
CACATGGGCGAGGCGCTGCCCTACTGGCTCTACCGCCTGGACTACATGCACCAGGCCGGTGTCCGCTCGCAGCGCTACGA
ACGCATGAAGCCCCTGAAGAAGACCATCGAGGGCTACCTCAAGTCCAACGTCCTCGTCACCAATTCGGGCGTCGCGTGGG
AACCTGCGATCAAGTTCTGCCAGCAGGTCATGGGCGAGGACCGCGTTATGTACGCGATGGACTACCCCTACCAGTACGTT
GCCGACGAGGTGCGCGCGATGGACGCCATGGACATGAGTGCGCAAACGAAGAAGAAGTTCTTCCAGACCAACGCGGAGAA
GTGGTTCAAGCTTTGA

Upstream 100 bases:

>100_bases
CGCTTTGCGCGGCCGGATCGCATGAATCACATCGGCAGTCTCACGCTCACCGTCCGCAGGTGACGCGGCGCAGGCTGCCC
TTCCCGGGAAGGAAGCCTGC

Downstream 100 bases:

>100_bases
CCAGCGCCGGGAGGGAGAGCCTATGACCACAGAACCGCGCCGCTCACTGGCCCCCGGCGCATGGTACGCGCTCGTCCTCG
TCGCGCTGACCAATGCGATG

Product: amidohydrolase 2

Products: NA

Alternate protein names: Amidohydrolase; Amidohydrolase/Decarboxylase; Metal-Dependent Hydrolase; Amidohydrolase Family Protein; 4-Oxalomesaconate Hydratase; 2-Amino-3-Carboxylmuconate-6-Semialdehyde Decarboxylase; O-Pyrocatechuate Decarboxylase; 2-Amino-3-Carboxymuconate-6-Semialdehyde Decarboxylase; Hydrolase; 2-Amino-3-Carboxymuconate 6-Semialdehyde Decarboxylase; 2-Amino-3-Carboxymuconate-6- Semialdehyde Decarboxylase; Tryptophan 2 3-Dioxygenase; Decarboxylase; 2 3-Dihydroxybenzoic Acid Decarboxylase; Amidohydrolase Family; Amidase; 5-Carboxyvanillate Decarboxylase; Metal-Dependent Hydrolase Of TIM-Barrel Fold; Barh Protein; Amidohydrolase 2 Family Protein; Metal Dependent Hydrolase; 5-Carboxy-2-Hydroxymuconate-6-Semialdehyde Decarboxylase

Number of amino acids: Translated: 351; Mature: 350

Protein sequence:

>351_residues
MTQDLKTGGEQGYLRIATEEAFATREIIDVYLRMIRDGTADKGMVSLWGFYAQSPSERATQILERLLDLGERRIADMDAT
GIDKAILALTSPGVQPLHDLDEARTLATRANDTLADACQKYPDRFIGMGTVAPQDPEWSAREIHRGARELGFKGIQINSH
TQGRYLDEEFFDPIFRALVEVDQPLYIHPATSPDSMIDPMLEAGLDGAIFGFGVETGMHLLRLITIGIFDKYPSLQIMVG
HMGEALPYWLYRLDYMHQAGVRSQRYERMKPLKKTIEGYLKSNVLVTNSGVAWEPAIKFCQQVMGEDRVMYAMDYPYQYV
ADEVRAMDAMDMSAQTKKKFFQTNAEKWFKL
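
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below assumes the standard genetic code (the record does not name the translation table) and translates the first ten and last five codons of the gene; `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

# Standard genetic code in the usual T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence codon by codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 and last 15 bases of the gene sequence in this record.
print(translate("ATGACACAAGACCTTAAGACCGGCGGCGAG"))  # MTQDLKTGGE
print(translate("TGGTTCAAGCTTTGA"))                 # WFKL*
```

Both fragments match the start and end of the protein sequence, with the final TGA read as the stop codon.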

Sequences:

>Translated_351_residues
MTQDLKTGGEQGYLRIATEEAFATREIIDVYLRMIRDGTADKGMVSLWGFYAQSPSERATQILERLLDLGERRIADMDAT
GIDKAILALTSPGVQPLHDLDEARTLATRANDTLADACQKYPDRFIGMGTVAPQDPEWSAREIHRGARELGFKGIQINSH
TQGRYLDEEFFDPIFRALVEVDQPLYIHPATSPDSMIDPMLEAGLDGAIFGFGVETGMHLLRLITIGIFDKYPSLQIMVG
HMGEALPYWLYRLDYMHQAGVRSQRYERMKPLKKTIEGYLKSNVLVTNSGVAWEPAIKFCQQVMGEDRVMYAMDYPYQYV
ADEVRAMDAMDMSAQTKKKFFQTNAEKWFKL
>Mature_350_residues
TQDLKTGGEQGYLRIATEEAFATREIIDVYLRMIRDGTADKGMVSLWGFYAQSPSERATQILERLLDLGERRIADMDATG
IDKAILALTSPGVQPLHDLDEARTLATRANDTLADACQKYPDRFIGMGTVAPQDPEWSAREIHRGARELGFKGIQINSHT
QGRYLDEEFFDPIFRALVEVDQPLYIHPATSPDSMIDPMLEAGLDGAIFGFGVETGMHLLRLITIGIFDKYPSLQIMVGH
MGEALPYWLYRLDYMHQAGVRSQRYERMKPLKKTIEGYLKSNVLVTNSGVAWEPAIKFCQQVMGEDRVMYAMDYPYQYVA
DEVRAMDAMDMSAQTKKKFFQTNAEKWFKL

Specific function: Unknown

COG id: COG2159

COG function: function code R; Predicted metal-dependent hydrolase of the TIM-barrel fold

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

Organism=Homo sapiens, GI109715856, Length=280, Percent_Identity=27.1428571428571, Blast_Score=108, Evalue=7e-24
Organism=Caenorhabditis elegans, GI71995651, Length=280, Percent_Identity=25, Blast_Score=85, Evalue=5e-17
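
Each homologue entry is a comma-separated list of `key=value` fields plus a bare GI identifier. The sketch below is one way to read such a line into a dictionary; `parse_homologue` is a hypothetical helper, not part of any database tooling, and it tolerates an optional trailing comma.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue entry into a dictionary with typed values."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            # The GI number appears without a key, e.g. "GI109715856".
            record["GI"] = field[2:]
    # Convert the numeric fields that are present.
    for key, cast in (("Length", int), ("Percent_Identity", float),
                      ("Blast_Score", int), ("Evalue", float)):
        if key in record:
            record[key] = cast(record[key])
    return record


entry = parse_homologue(
    "Organism=Homo sapiens, GI109715856, Length=280, "
    "Percent_Identity=27.1428571428571, Blast_Score=108, Evalue=7e-24,"
)
print(entry["Organism"], entry["GI"], round(entry["Percent_Identity"], 2))
```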

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 39852; Mature: 39721

Theoretical pI: Translated: 5.04; Mature: 5.04

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
5.1 %Met     (Translated Protein)
5.7 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
4.9 %Met     (Mature Protein)
5.4 %Cys+Met (Mature Protein)
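
The percentages above can be reproduced by counting C and M residues in the protein sequence. This sketch takes the mature protein to be the translated protein without its initiator methionine, as the Mature sequence in this record indicates; `cys_met_percent` is an illustrative name.

```python
# Translated protein sequence copied from this record (351 residues).
TRANSLATED = (
    "MTQDLKTGGEQGYLRIATEEAFATREIIDVYLRMIRDGTADKGMVSLWGFYAQSPSERATQILERLLDLGERRIADMDAT"
    "GIDKAILALTSPGVQPLHDLDEARTLATRANDTLADACQKYPDRFIGMGTVAPQDPEWSAREIHRGARELGFKGIQINSH"
    "TQGRYLDEEFFDPIFRALVEVDQPLYIHPATSPDSMIDPMLEAGLDGAIFGFGVETGMHLLRLITIGIFDKYPSLQIMVG"
    "HMGEALPYWLYRLDYMHQAGVRSQRYERMKPLKKTIEGYLKSNVLVTNSGVAWEPAIKFCQQVMGEDRVMYAMDYPYQYV"
    "ADEVRAMDAMDMSAQTKKKFFQTNAEKWFKL"
)
MATURE = TRANSLATED[1:]  # initiator Met removed


def cys_met_percent(protein: str) -> tuple:
    """Return (%Cys, %Met, %Cys+Met), each rounded to one decimal place."""
    n = len(protein)
    cys = protein.count("C")
    met = protein.count("M")
    return (
        round(100.0 * cys / n, 1),
        round(100.0 * met / n, 1),
        round(100.0 * (cys + met) / n, 1),
    )


print(cys_met_percent(TRANSLATED))  # (0.6, 5.1, 5.7)
print(cys_met_percent(MATURE))      # (0.6, 4.9, 5.4)
```

The combined figure is computed from the summed counts, not from the two rounded percentages, which is why 0.6 and 4.9 give 5.4 for the mature protein.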

Secondary structure:

>Translated Secondary Structure
MTQDLKTGGEQGYLRIATEEAFATREIIDVYLRMIRDGTADKGMVSLWGFYAQSPSERAT
CCCCCCCCCCCCEEEEECHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHCCCHHHHH
QILERLLDLGERRIADMDATGIDKAILALTSPGVQPLHDLDEARTLATRANDTLADACQK
HHHHHHHHHHHHHHHCCCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHH
YPDRFIGMGTVAPQDPEWSAREIHRGARELGFKGIQINSHTQGRYLDEEFFDPIFRALVE
HHHHHCCCCCCCCCCCCCCHHHHHHHHHHHCCCCEEECCCCCCCCCCHHHHHHHHHHHHH
VDQPLYIHPATSPDSMIDPMLEAGLDGAIFGFGVETGMHLLRLITIGIFDKYPSLQIMVG
CCCCEEEECCCCCHHHHHHHHHCCCCCCEEHHHHHHHHHHHHHHHHHHHCCCCCCEEEHH
HMGEALPYWLYRLDYMHQAGVRSQRYERMKPLKKTIEGYLKSNVLVTNSGVAWEPAIKFC
HHCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCEEEECCCCCHHHHHHHH
QQVMGEDRVMYAMDYPYQYVADEVRAMDAMDMSAQTKKKFFQTNAEKWFKL
HHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCHHHHHHHHHHCCHHHHHCC
>Mature Secondary Structure 
TQDLKTGGEQGYLRIATEEAFATREIIDVYLRMIRDGTADKGMVSLWGFYAQSPSERAT
CCCCCCCCCCCEEEEECHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHCCCHHHHH
QILERLLDLGERRIADMDATGIDKAILALTSPGVQPLHDLDEARTLATRANDTLADACQK
HHHHHHHHHHHHHHHCCCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHH
YPDRFIGMGTVAPQDPEWSAREIHRGARELGFKGIQINSHTQGRYLDEEFFDPIFRALVE
HHHHHCCCCCCCCCCCCCCHHHHHHHHHHHCCCCEEECCCCCCCCCCHHHHHHHHHHHHH
VDQPLYIHPATSPDSMIDPMLEAGLDGAIFGFGVETGMHLLRLITIGIFDKYPSLQIMVG
CCCCEEEECCCCCHHHHHHHHHCCCCCCEEHHHHHHHHHHHHHHHHHHHCCCCCCEEEHH
HMGEALPYWLYRLDYMHQAGVRSQRYERMKPLKKTIEGYLKSNVLVTNSGVAWEPAIKFC
HHCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCEEEECCCCCHHHHHHHH
QQVMGEDRVMYAMDYPYQYVADEVRAMDAMDMSAQTKKKFFQTNAEKWFKL
HHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCHHHHHHHHHHCCHHHHHCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: NA