Definition: Rhodopseudomonas palustris HaA2, complete genome.
Accession: NC_007778
Length: 5,331,656 bp

The map label for this gene is 86748035

Identifier: 86748035

GI number: 86748035

Start: 1046404

End: 1047672

Strand: Direct

Name: 86748035

Synonym: RPB_0909

Alternate gene names: NA

Gene position: 1046404-1047672 (Clockwise)

Preceding gene: 86748034

Following gene: 86748040

Centisome position: 19.63
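
The gene length and centisome position can be rechecked from the coordinates listed above. The sketch below assumes 1-based, inclusive coordinates and assumes the centisome value is the start position expressed as a percentage of the genome length; the function names are illustrative and not part of the record.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


if __name__ == "__main__":
    print(gene_length(1046404, 1047672))          # 1269
    print(round(centisome(1046404, 5331656), 2))  # 19.63
```

Both results agree with the 1269-base gene sequence and the centisome position of 19.63 given in this record.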

GC content: 65.96

Gene sequence:

>1269_bases
ATGCACCCCAACGATCCGCGCCTGCGCTCCTTCGTCGATGTGAAACCGGAATCGGACTTTCCGATCCAGAACCTTCCCTA
CGGCGTGATCTCGACCGCGTCCGACCCTTCCCCGCGTGTCGGCGTCGCGATCGGCGATTTCGTGCTCGATCTCGCGGCGC
TGCAGGCGGCCAAGCTGCTCGATCTGCCGGACGGCGTGTTCGCGCAATCGTCGATCAACGCCTTCATGGCGCTCGGGCTC
GCAATGTGGAGCACGACACGGGCGCGGATCAGTGCGTTGCTGCGTCACGACAATCCCGAGCTGCGCGACGACGCCGCGCT
GCGCGCGCGGGCGCTTGTTCCGATGAGTGACGCGAAGTTGCATCTGCCGCTGCGCGTCGAAGGCTTCACCGATTTCTACT
CGTCGAAGGAACACGCCACCAATGTCGGCACGATGTTCCGCGACAAGACCAATCCGCTGCTGCCGAACTGGCTGCACATC
CCGATCGGCTACAACGGCCGCGCCTCGACCGTCGTGGTCAGCGGCACCCAGATCCATCGTCCGCGCGGGCAGCTCAAGCC
ACCATCCGCCGAGCTGCCGAGCTTCGGCCCGTGCAAGCGGCTCGATTTCGAGCTGGAGATCGGCGTCGTGATCGGGCAGC
CGTCGGCGATGGGCACGACGCTGACCGAACAGCAGGCCGAGGAGATGATCTTCGGCTTCACGCTGTTGAACGACTGGAGC
GCGCGCGACATCCAGCAATGGGAGTATGTGCCGCTCGGGCCGTTCCAGGCGAAAGCGTTCGCCACCTCGATCAGCCCGTG
GATCGTGACGCGCGAGGCGCTGGAGCCGTTTCGGGTTCACGGGCCCACGCAGGATCCTGTGCCTCTGCCCTATCTGCAGC
AGCAGGGGCCTAACAACTACGACATGGCGCTGGAAGTGAACCTGCGCACGCCGGCCATGAACGCGCCGGCGCGGATCAGC
GCGACGAATTTCAAATACATGTACTGGTCGTCAGTGCAGCAGCTGGTGCACCATGCCTCCAGCGGCTGCGCGATGAATGT
CGGCGACCTGCTCGGCTCCGGCACCGTCTCGGGGCCGGCGAAGGATCAGCTCGGCAGCCTGCTGGAGCTGAGCTGGAACG
GCGCCGAACCGGTGCAGCTCCCCGGCGGCGAGACCCGCGGCTTCCTCGACGACGGCGATTCGCTGATCATGCGCGGCTGG
TGCCAGGCCGACGGCTACCGCGTCGGTTTCGGCGAGGTCGAGGGGACGATTCTGGCGGCGAAGAGCTGA
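
A GC percentage such as the one reported above can be recomputed from the sequence text. This is a minimal sketch, assuming GC content means the share of G and C among the A, C, G and T characters; `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among A/C/G/T; any other character is ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # First ten bases of the gene sequence: one G and five C.
    print(gc_content("ATGCACCCCA"))  # 60.0
```

For a 1269-base gene, a GC content of 65.96 corresponds to 837 G or C bases, since 837 / 1269 = 65.96 % to two decimals.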

Upstream 100 bases:

>100_bases
GGCGCGCCAAGCTGCGCTTGCCGCGCCACCCTCTCCCCGCGCGCGGGGAGAGGGTACATCCGCCGACCCGATAGAACCGC
CCAACGTCCCAGGACCCTTC

Downstream 100 bases:

>100_bases
CGCTCTTTCCGCTCGCACGACTCTCGTTCGTCATTCCGGGGCGCTCACGTAGTGAGCGAACCCGGAATCCATAACCCCTG
CACGGGTGTTCAGGAAGAGC
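
The genome coordinates of the two 100-base flanks follow from the gene position. The sketch below assumes 1-based, inclusive coordinates, a gene on the direct strand, and a gene far enough from the origin that no wrap-around on the circular genome is needed; `flank_coords` is a hypothetical helper.

```python
def flank_coords(start: int, end: int, width: int = 100):
    """Return ((up_start, up_end), (down_start, down_end)) for a direct-strand gene."""
    upstream = (start - width, start - 1)
    downstream = (end + 1, end + width)
    return upstream, downstream


if __name__ == "__main__":
    print(flank_coords(1046404, 1047672))
    # ((1046304, 1046403), (1047673, 1047772))
```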

Product: fumarylacetoacetase

Products: acetoacetate; fumarate

Alternate protein names: Fumarylacetoacetate Hydrolase; Hydroxylase; 2-Hydroxyhepta-2,4-Diene-1,7-Dioate Isomerase; Fumarylacetoacetate Hydrolase Family Protein; Fumarylacetoacetase Hydrolase; LOW QUALITY PROTEIN Fumarylacetoacetase

Number of amino acids: Translated: 422; Mature: 422
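
The residue count is consistent with the gene length: 1269 bases form 423 codons, and the last codon of the gene sequence (TGA) is a stop codon, leaving 422 residues. A minimal sketch of that check, assuming the standard stop codons and using a hypothetical helper name:

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def residue_count(cds: str) -> int:
    """Number of encoded residues in a coding sequence, excluding a final stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = len(cds) // 3
    return codons - 1 if cds[-3:] in STOP_CODONS else codons


if __name__ == "__main__":
    print(residue_count("ATGCACTGA"))  # 2
    print(1269 // 3 - 1)               # 422
```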

Protein sequence:

>422_residues
MHPNDPRLRSFVDVKPESDFPIQNLPYGVISTASDPSPRVGVAIGDFVLDLAALQAAKLLDLPDGVFAQSSINAFMALGL
AMWSTTRARISALLRHDNPELRDDAALRARALVPMSDAKLHLPLRVEGFTDFYSSKEHATNVGTMFRDKTNPLLPNWLHI
PIGYNGRASTVVVSGTQIHRPRGQLKPPSAELPSFGPCKRLDFELEIGVVIGQPSAMGTTLTEQQAEEMIFGFTLLNDWS
ARDIQQWEYVPLGPFQAKAFATSISPWIVTREALEPFRVHGPTQDPVPLPYLQQQGPNNYDMALEVNLRTPAMNAPARIS
ATNFKYMYWSSVQQLVHHASSGCAMNVGDLLGSGTVSGPAKDQLGSLLELSWNGAEPVQLPGGETRGFLDDGDSLIMRGW
CQADGYRVGFGEVEGTILAAKS

Sequences:

>Translated_422_residues
MHPNDPRLRSFVDVKPESDFPIQNLPYGVISTASDPSPRVGVAIGDFVLDLAALQAAKLLDLPDGVFAQSSINAFMALGL
AMWSTTRARISALLRHDNPELRDDAALRARALVPMSDAKLHLPLRVEGFTDFYSSKEHATNVGTMFRDKTNPLLPNWLHI
PIGYNGRASTVVVSGTQIHRPRGQLKPPSAELPSFGPCKRLDFELEIGVVIGQPSAMGTTLTEQQAEEMIFGFTLLNDWS
ARDIQQWEYVPLGPFQAKAFATSISPWIVTREALEPFRVHGPTQDPVPLPYLQQQGPNNYDMALEVNLRTPAMNAPARIS
ATNFKYMYWSSVQQLVHHASSGCAMNVGDLLGSGTVSGPAKDQLGSLLELSWNGAEPVQLPGGETRGFLDDGDSLIMRGW
CQADGYRVGFGEVEGTILAAKS
>Mature_422_residues
MHPNDPRLRSFVDVKPESDFPIQNLPYGVISTASDPSPRVGVAIGDFVLDLAALQAAKLLDLPDGVFAQSSINAFMALGL
AMWSTTRARISALLRHDNPELRDDAALRARALVPMSDAKLHLPLRVEGFTDFYSSKEHATNVGTMFRDKTNPLLPNWLHI
PIGYNGRASTVVVSGTQIHRPRGQLKPPSAELPSFGPCKRLDFELEIGVVIGQPSAMGTTLTEQQAEEMIFGFTLLNDWS
ARDIQQWEYVPLGPFQAKAFATSISPWIVTREALEPFRVHGPTQDPVPLPYLQQQGPNNYDMALEVNLRTPAMNAPARIS
ATNFKYMYWSSVQQLVHHASSGCAMNVGDLLGSGTVSGPAKDQLGSLLELSWNGAEPVQLPGGETRGFLDDGDSLIMRGW
CQADGYRVGFGEVEGTILAAKS

Specific function: Unknown

COG id: COG0179

COG function: function code Q; 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway)

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

Organism=Homo sapiens, GI4557587, Length=414, Percent_Identity=52.8985507246377, Blast_Score=467, Evalue=1e-131,
Organism=Caenorhabditis elegans, GI17568863, Length=419, Percent_Identity=54.4152744630072, Blast_Score=453, Evalue=1e-128,
Organism=Drosophila melanogaster, GI24657182, Length=413, Percent_Identity=50.1210653753027, Blast_Score=413, Evalue=1e-115,
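
Each homologue line is a comma-separated list of `key=value` fields plus a bare GI identifier. The sketch below shows one way such a line might be read into a dictionary; the parser and the key name `GI` are assumptions made for illustration, not a format defined by the source database.

```python
def parse_homologue(line: str) -> dict:
    """Split one homologue line into a dict of string fields."""
    record = {}
    # Tolerate a trailing comma if one is present.
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key.strip()] = value.strip()
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare identifier such as GI4557587
    return record


if __name__ == "__main__":
    hit = parse_homologue(
        "Organism=Homo sapiens, GI4557587, Length=414, "
        "Percent_Identity=52.8985507246377, Blast_Score=467, Evalue=1e-131,"
    )
    print(hit["Organism"], round(float(hit["Percent_Identity"]), 1))
    # Homo sapiens 52.9
```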

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: 3.7.1.2

Molecular weight: Translated: 46051; Mature: 46051

Theoretical pI: Translated: 5.40; Mature: 5.40

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.7 %Cys     (Translated Protein)
2.8 %Met     (Translated Protein)
3.6 %Cys+Met (Translated Protein)
0.7 %Cys     (Mature Protein)
2.8 %Met     (Mature Protein)
3.6 %Cys+Met (Mature Protein)
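
These percentages are residue counts divided by the protein length. A minimal sketch of the calculation follows; `residue_percent` is a hypothetical helper, and the toy sequence uses counts of 3 Cys and 12 Met in 422 residues, which come from a manual tally of the protein sequence in this record rather than from a stated field.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any letter in `residues`."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty protein sequence")
    hits = sum(protein.count(r) for r in set(residues.upper()))
    return 100.0 * hits / len(protein)


if __name__ == "__main__":
    # Toy sequence with the same composition: 3 C, 12 M, 407 other residues.
    toy = "C" * 3 + "M" * 12 + "A" * 407
    print(round(residue_percent(toy, "C"), 1))   # 0.7
    print(round(residue_percent(toy, "M"), 1))   # 2.8
    print(round(residue_percent(toy, "CM"), 1))  # 3.6
```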

Secondary structure:

>Translated Secondary Structure
MHPNDPRLRSFVDVKPESDFPIQNLPYGVISTASDPSPRVGVAIGDFVLDLAALQAAKLL
CCCCCCCHHHEEECCCCCCCCCCCCCCCEEECCCCCCCEEEEEHHHHHHHHHHHHHHHHH
DLPDGVFAQSSINAFMALGLAMWSTTRARISALLRHDNPELRDDAALRARALVPMSDAKL
CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHEEECCCCCCEE
HLPLRVEGFTDFYSSKEHATNVGTMFRDKTNPLLPNWLHIPIGYNGRASTVVVSGTQIHR
EEEEEECCCHHHHCCCHHHCHHHHHHCCCCCCCCCCEEEEECCCCCCCCEEEEECCEEEC
PRGQLKPPSAELPSFGPCKRLDFELEIGVVIGQPSAMGTTLTEQQAEEMIFGFTLLNDWS
CCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEECCCCCCCCHHHHHHHHHEEEEEEECCCC
ARDIQQWEYVPLGPFQAKAFATSISPWIVTREALEPFRVHGPTQDPVPLPYLQQQGPNNY
CCCCCCCCCCCCCCCCHHHHHHCCCCCEEEHHHCCCEEECCCCCCCCCCCHHHHCCCCCC
DMALEVNLRTPAMNAPARISATNFKYMYWSSVQQLVHHASSGCAMNVGDLLGSGTVSGPA
CEEEEEEECCCCCCCCCEEEECCEEEEHHHHHHHHHHHCCCCCEEEHHHHHCCCCCCCCC
KDQLGSLLELSWNGAEPVQLPGGETRGFLDDGDSLIMRGWCQADGYRVGFGEVEGTILAA
HHHHCCEEEEECCCCCCEECCCCCCCCCCCCCCCEEEEEEECCCCEEEECCCCCCEEEEE
KS
CC
>Mature Secondary Structure
MHPNDPRLRSFVDVKPESDFPIQNLPYGVISTASDPSPRVGVAIGDFVLDLAALQAAKLL
CCCCCCCHHHEEECCCCCCCCCCCCCCCEEECCCCCCCEEEEEHHHHHHHHHHHHHHHHH
DLPDGVFAQSSINAFMALGLAMWSTTRARISALLRHDNPELRDDAALRARALVPMSDAKL
CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHEEECCCCCCEE
HLPLRVEGFTDFYSSKEHATNVGTMFRDKTNPLLPNWLHIPIGYNGRASTVVVSGTQIHR
EEEEEECCCHHHHCCCHHHCHHHHHHCCCCCCCCCCEEEEECCCCCCCCEEEEECCEEEC
PRGQLKPPSAELPSFGPCKRLDFELEIGVVIGQPSAMGTTLTEQQAEEMIFGFTLLNDWS
CCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEECCCCCCCCHHHHHHHHHEEEEEEECCCC
ARDIQQWEYVPLGPFQAKAFATSISPWIVTREALEPFRVHGPTQDPVPLPYLQQQGPNNY
CCCCCCCCCCCCCCCCHHHHHHCCCCCEEEHHHCCCEEECCCCCCCCCCCHHHHCCCCCC
DMALEVNLRTPAMNAPARISATNFKYMYWSSVQQLVHHASSGCAMNVGDLLGSGTVSGPA
CEEEEEEECCCCCCCCCEEEECCEEEEHHHHHHHHHHHCCCCCEEEHHHHHCCCCCCCCC
KDQLGSLLELSWNGAEPVQLPGGETRGFLDDGDSLIMRGWCQADGYRVGFGEVEGTILAA
HHHHCCEEEEECCCCCCEECCCCCCCCCCCCCCCEEEEEEECCCCEEEECCCCCCEEEEE
KS
CC
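
The state lines above can be summarised as a percentage per state. The sketch below assumes the conventional reading of the letters (H for helix, E for strand, C for coil), which this record does not define explicitly, and expects only the state lines as input, not the interleaved residue lines.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of H, E and C characters in a secondary-structure state string."""
    states = "".join(states.split()).upper()
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {s: 100.0 * counts.get(s, 0) / len(states) for s in "HEC"}


if __name__ == "__main__":
    print(ss_composition("HHEC"))  # {'H': 50.0, 'E': 25.0, 'C': 25.0}
```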

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: 4-fumarylacetoacetate; H2O

Specific reaction: 4-fumarylacetoacetate + H2O = acetoacetate + fumarate
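
The reaction string can be split into the same substrate and product lists that appear in the Substrates and Products fields. This is a minimal sketch, assuming " = " separates the two sides and " + " separates compounds; `parse_reaction` is a hypothetical helper.

```python
def parse_reaction(equation: str):
    """Split 'A + B = C + D' into (substrates, products) lists."""
    left, right = equation.split(" = ", 1)
    substrates = [name.strip() for name in left.split(" + ")]
    products = [name.strip() for name in right.split(" + ")]
    return substrates, products


if __name__ == "__main__":
    print(parse_reaction("4-fumarylacetoacetate + H2O = acetoacetate + fumarate"))
    # (['4-fumarylacetoacetate', 'H2O'], ['acetoacetate', 'fumarate'])
```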

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA