Definition Sinorhizobium fredii NGR234 plasmid pNGR234a, complete sequence.
Accession NC_000914
Length 536,165

The map label for this gene is cpxP

Identifier: 16519835

GI number: 16519835

Start: 358229

End: 359431

Strand: Direct

Name: cpxP

Synonym: NGR_a02700

Alternate gene names: 16519835

Gene position: 358229-359431 (Clockwise)

Preceding gene: 16519838

Following gene: 16519834

Centisome position: 66.81
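
The coordinate fields above are related by simple arithmetic. The sketch below assumes 1-based inclusive coordinates, and assumes that the centisome position is the start coordinate expressed as a percentage of the plasmid length. The record does not state that definition, but it is consistent with the values given here. The function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, replicon_length: int) -> float:
    """Start position as a percentage of the replicon length (assumed definition)."""
    return 100.0 * start / replicon_length


# Values taken from this record.
START, END, REPLICON_LENGTH = 358229, 359431, 536165

length = gene_length(START, END)
print(length)                                       # 1203 bases
print(length // 3 - 1)                              # 400 codons before the stop codon
print(round(centisome(START, REPLICON_LENGTH), 2))  # 66.81
```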

GC content: 66.25
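
GC content is conventionally the percentage of G and C bases in a sequence. A minimal sketch follows, assuming that convention is what the field above reports. The helper name gc_percent is hypothetical, and the demonstration input is only the first ten bases of the gene sequence listed below, not the full gene.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace and line breaks are ignored."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First ten bases of the gene sequence in this record: 6 of 10 are G or C.
print(gc_percent("ATGCCCGAAC"))  # 60.0
```

Passing the full 1203-base sequence (with or without its line breaks) would give the figure to compare against the value reported above.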

Gene sequence:

>1203_bases
ATGCCCGAACAACCCTTGCCGACGCTGCCGATGTGGCGGGTTGATCACATCGAGCCCTCGCCCACGATGTTGGCGCTACG
CGCCAACGGTCCGATTCACAACGTGCGCTTCCCGCGGGGGCACGAAGGCTGGTGGGTGACAGGCTACGACGAGGCGAAGG
CGGTGCTGTCCGACGCGGCGTTCCGGCCTGCGGGAATGCCGCCGGCGGCATTCACCCCGGATTGTGTGATTCTCGGTTCG
CCGGGGTGGCTGGTCTCGCACGAAGGGGGCGAGCATGCCCGGTTACGCACGATCGTTGCGCCGGCCTTCAGCGACCGCAG
GGTGAAGCTGCTCGCGCAGCAAGTCGAGGCGATCGCAGCGCAGTTGTTCGAGACTCTGGCGGCCCAGCCCCAGCCCGCCG
ACCTGCGGCGCCACCTCTCCTTTCCGCTTCCGGCCATGGTCATCAGCGCGCTGATGGGCGTGCTCTACGAGGATCACGCC
TTTTTTGCCGGGCTGTCCGACGAGGTGATGACGCACCAGCATGAAAGCGGCCCGCGCAGCGCGTCGCGCCTGGCCTGGGA
AGAACTGCGCGCCTACATTCGCGGCAAGATGCGGGACAAGCGCCAGGATCCGGGCGACAACCTGCTGACGGATCTGCTCG
CGGCGGTCGACCGGGGCGAGGCGACCGAGGAAGAGGCGATCGGCCTCGCGGCGGGCATGCTGGTGGCAGGACACGAGAGC
ACCGTCGCGCAGATCGAATTCGGCCTCCTGGCCATGTTGCGCCATCCGCAACAGCGCGAACGTCTGGTCGGCAATCCATC
CCTGGTGGACAAGGCGGTGGAGGAAATCCTGCGCATGTACCCGCCGGGCGCGGGCTGGGACGGCATCATGCGCTATCCGA
GGACCGACGTGACCATCGCGGGCGTGCATATTCCCGCGGAGAGCAAGGTGCTGGTCGGCCTGCCGGCGACATCGTTCGAT
CCACGCCATTTCGAGGACCCTGAAATCTTCGACATCGGACGCGACGCAAAGCCGCACCTGGCGTTTTCCTACGGGCCGCA
CTACTGCATAGGCATGGCGCTGGCCAGGCTGGAACTCAAGGTGGTGTTCGGTTCGATCTTCCAGCGCTTTCCCGCGCTGC
GCCTGGCCGTGGCGCCCGAAGAACTGAAGTTGCGCAAGGAGATCATCACTGGCGGATTCGAGGAGTTCCCGGTGCTCTGG
TGA

Upstream 100 bases:

>100_bases
TGAGGCGCTCTCTGCGCAGTTGGCGCGGTTCTTGCTGCCCTAGTTGTACGGCGGCACACGCCGGCATGCCCTTCCATCGC
TTAGAGAGTGGAGTGCCACC

Downstream 100 bases:

>100_bases
TGCGCGGACGCCGCCGGGAATCGCGATCTTCTCCGCAATTTGCAGGCGCGCCTGGCGCGCCTGTCAGATCAGCCAGCCGA
CAGGTAACCAAGATGGACAT

Product: cytochrome P450 protein CpxP

Products: NA

Alternate protein names: Cytochrome P450 112 [H]

Number of amino acids: Translated: 400; Mature: 399

Protein sequence:

>400_residues
MPEQPLPTLPMWRVDHIEPSPTMLALRANGPIHNVRFPRGHEGWWVTGYDEAKAVLSDAAFRPAGMPPAAFTPDCVILGS
PGWLVSHEGGEHARLRTIVAPAFSDRRVKLLAQQVEAIAAQLFETLAAQPQPADLRRHLSFPLPAMVISALMGVLYEDHA
FFAGLSDEVMTHQHESGPRSASRLAWEELRAYIRGKMRDKRQDPGDNLLTDLLAAVDRGEATEEEAIGLAAGMLVAGHES
TVAQIEFGLLAMLRHPQQRERLVGNPSLVDKAVEEILRMYPPGAGWDGIMRYPRTDVTIAGVHIPAESKVLVGLPATSFD
PRHFEDPEIFDIGRDAKPHLAFSYGPHYCIGMALARLELKVVFGSIFQRFPALRLAVAPEELKLRKEIITGGFEEFPVLW

Sequences:

>Translated_400_residues
MPEQPLPTLPMWRVDHIEPSPTMLALRANGPIHNVRFPRGHEGWWVTGYDEAKAVLSDAAFRPAGMPPAAFTPDCVILGS
PGWLVSHEGGEHARLRTIVAPAFSDRRVKLLAQQVEAIAAQLFETLAAQPQPADLRRHLSFPLPAMVISALMGVLYEDHA
FFAGLSDEVMTHQHESGPRSASRLAWEELRAYIRGKMRDKRQDPGDNLLTDLLAAVDRGEATEEEAIGLAAGMLVAGHES
TVAQIEFGLLAMLRHPQQRERLVGNPSLVDKAVEEILRMYPPGAGWDGIMRYPRTDVTIAGVHIPAESKVLVGLPATSFD
PRHFEDPEIFDIGRDAKPHLAFSYGPHYCIGMALARLELKVVFGSIFQRFPALRLAVAPEELKLRKEIITGGFEEFPVLW
>Mature_399_residues
PEQPLPTLPMWRVDHIEPSPTMLALRANGPIHNVRFPRGHEGWWVTGYDEAKAVLSDAAFRPAGMPPAAFTPDCVILGSP
GWLVSHEGGEHARLRTIVAPAFSDRRVKLLAQQVEAIAAQLFETLAAQPQPADLRRHLSFPLPAMVISALMGVLYEDHAF
FAGLSDEVMTHQHESGPRSASRLAWEELRAYIRGKMRDKRQDPGDNLLTDLLAAVDRGEATEEEAIGLAAGMLVAGHEST
VAQIEFGLLAMLRHPQQRERLVGNPSLVDKAVEEILRMYPPGAGWDGIMRYPRTDVTIAGVHIPAESKVLVGLPATSFDP
RHFEDPEIFDIGRDAKPHLAFSYGPHYCIGMALARLELKVVFGSIFQRFPALRLAVAPEELKLRKEIITGGFEEFPVLW

Specific function: Cytochromes P450 are a group of heme-thiolate monooxygenases. They oxidize a variety of structurally unrelated compounds, including steroids, fatty acids, and xenobiotics [H]

COG id: COG2124

COG function: function code Q; Cytochrome P450

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the cytochrome P450 family [H]

Homologues:

Organism=Homo sapiens, GI5729796, Length=328, Percent_Identity=24.0853658536585, Blast_Score=69, Evalue=6e-12,
Organism=Caenorhabditis elegans, GI17552522, Length=200, Percent_Identity=26, Blast_Score=68, Evalue=8e-12,
Organism=Drosophila melanogaster, GI17933518, Length=196, Percent_Identity=28.0612244897959, Blast_Score=74, Evalue=1e-13,
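
Each homologue line above is a comma-separated list of key=value fields, plus a bare GI token and a trailing comma. The sketch below shows one way to read such a line into a dictionary. The function name and the choice of output types are assumptions, and it presumes that organism names contain no commas.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            # The GI number is the only field without a key.
            record["GI"] = field[2:]
    # Convert the numeric fields used in this record.
    record["Length"] = int(record["Length"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Evalue"] = float(record["Evalue"])
    return record


hit = parse_homologue(
    "Organism=Drosophila melanogaster, GI17933518, Length=196, "
    "Percent_Identity=28.0612244897959, Blast_Score=74, Evalue=1e-13,"
)
print(hit["Organism"], hit["GI"], hit["Evalue"])
```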

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001128
- InterPro:   IPR002397
- InterPro:   IPR017972 [H]

Pfam domain/function: PF00067 p450 [H]

EC number: NA

Molecular weight: Translated: 44248; Mature: 44117

Theoretical pI: Translated: 6.12; Mature: 6.12

Prosite motif: PS00086 CYTOCHROME_P450

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.5 %Cys     (Translated Protein)
3.2 %Met     (Translated Protein)
3.8 %Cys+Met (Translated Protein)
0.5 %Cys     (Mature Protein)
3.0 %Met     (Mature Protein)
3.5 %Cys+Met (Mature Protein)
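
The percentages above can be recomputed from the sequences in this record. The sketch below uses the 400-residue translated sequence and derives the mature form by dropping the initiator Met, which matches the 399-residue mature sequence listed earlier. The helper name residue_percent is illustrative. Formatted to one decimal place, the results should agree with the six figures above.

```python
TRANSLATED = (
    "MPEQPLPTLPMWRVDHIEPSPTMLALRANGPIHNVRFPRGHEGWWVTGYDEAKAVLSDAAFRPAGMPPAAFTPDCVILGS"
    "PGWLVSHEGGEHARLRTIVAPAFSDRRVKLLAQQVEAIAAQLFETLAAQPQPADLRRHLSFPLPAMVISALMGVLYEDHA"
    "FFAGLSDEVMTHQHESGPRSASRLAWEELRAYIRGKMRDKRQDPGDNLLTDLLAAVDRGEATEEEAIGLAAGMLVAGHES"
    "TVAQIEFGLLAMLRHPQQRERLVGNPSLVDKAVEEILRMYPPGAGWDGIMRYPRTDVTIAGVHIPAESKVLVGLPATSFD"
    "PRHFEDPEIFDIGRDAKPHLAFSYGPHYCIGMALARLELKVVFGSIFQRFPALRLAVAPEELKLRKEIITGGFEEFPVLW"
)

# Mature form as listed in this record: the translated form without its first Met.
MATURE = TRANSLATED[1:]


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given one-letter residues."""
    count = sum(protein.count(r) for r in residues)
    return 100.0 * count / len(protein)


for label, protein in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(f"{residue_percent(protein, 'C'):.1f} %Cys     ({label} Protein)")
    print(f"{residue_percent(protein, 'M'):.1f} %Met     ({label} Protein)")
    print(f"{residue_percent(protein, 'CM'):.1f} %Cys+Met ({label} Protein)")
```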

Secondary structure:

>Translated Secondary Structure
MPEQPLPTLPMWRVDHIEPSPTMLALRANGPIHNVRFPRGHEGWWVTGYDEAKAVLSDAA
CCCCCCCCCCCCEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCEEECHHHHHHHHHHHC
FRPAGMPPAAFTPDCVILGSPGWLVSHEGGEHARLRTIVAPAFSDRRVKLLAQQVEAIAA
CCCCCCCCCCCCCCEEEECCCCEEEECCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHH
QLFETLAAQPQPADLRRHLSFPLPAMVISALMGVLYEDHAFFAGLSDEVMTHQHESGPRS
HHHHHHHCCCCCHHHHHHCCCCCHHHHHHHHHHHHHHCCHHCCCCCHHHHHHCCCCCCCH
ASRLAWEELRAYIRGKMRDKRQDPGDNLLTDLLAAVDRGEATEEEAIGLAAGMLVAGHES
HHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHEEECCCC
TVAQIEFGLLAMLRHPQQRERLVGNPSLVDKAVEEILRMYPPGAGWDGIMRYPRTDVTIA
HHHHHHHHHHHHHHCCHHHHHHCCCHHHHHHHHHHHHHHCCCCCCCHHHHCCCCCCEEEE
GVHIPAESKVLVGLPATSFDPRHFEDPEIFDIGRDAKPHLAFSYGPHYCIGMALARLELK
EEEECCCCEEEEECCCCCCCCCCCCCCCEEECCCCCCCCEEEECCCHHHHHHHHHHHHHH
VVFGSIFQRFPALRLAVAPEELKLRKEIITGGFEEFPVLW
HHHHHHHHHCCHHEEEECHHHHHHHHHHHHCCHHHCCCCC
>Mature Secondary Structure 
PEQPLPTLPMWRVDHIEPSPTMLALRANGPIHNVRFPRGHEGWWVTGYDEAKAVLSDAA
CCCCCCCCCCCEECCCCCCCCEEEEECCCCCCCCCCCCCCCCCEEECHHHHHHHHHHHC
FRPAGMPPAAFTPDCVILGSPGWLVSHEGGEHARLRTIVAPAFSDRRVKLLAQQVEAIAA
CCCCCCCCCCCCCCEEEECCCCEEEECCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHH
QLFETLAAQPQPADLRRHLSFPLPAMVISALMGVLYEDHAFFAGLSDEVMTHQHESGPRS
HHHHHHHCCCCCHHHHHHCCCCCHHHHHHHHHHHHHHCCHHCCCCCHHHHHHCCCCCCCH
ASRLAWEELRAYIRGKMRDKRQDPGDNLLTDLLAAVDRGEATEEEAIGLAAGMLVAGHES
HHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHEEECCCC
TVAQIEFGLLAMLRHPQQRERLVGNPSLVDKAVEEILRMYPPGAGWDGIMRYPRTDVTIA
HHHHHHHHHHHHHHCCHHHHHHCCCHHHHHHHHHHHHHHCCCCCCCHHHHCCCCCCEEEE
GVHIPAESKVLVGLPATSFDPRHFEDPEIFDIGRDAKPHLAFSYGPHYCIGMALARLELK
EEEECCCCEEEEECCCCCCCCCCCCCCCEEECCCCCCCCEEEECCCHHHHHHHHHHHHHH
VVFGSIFQRFPALRLAVAPEELKLRKEIITGGFEEFPVLW
HHHHHHHHHCCHHEEEECHHHHHHHHHHHHCCHHHCCCCC
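
The state strings above pair one letter with each residue. The sketch below summarises such a string, assuming the conventional three-state code (H = helix, E = strand, C = coil); the record itself does not define the letters. Function names are illustrative, and the demonstration input is a toy string, not data from this record.

```python
from collections import Counter
from itertools import groupby


def state_composition(states: str) -> dict:
    """Percentage of positions assigned to each state letter."""
    counts = Counter(states)
    return {s: 100.0 * n / len(states) for s, n in counts.items()}


def state_segments(states: str) -> list:
    """Runs of identical states as (state, first, last), 1-based inclusive."""
    segments = []
    position = 1
    for state, run in groupby(states):
        length = len(list(run))
        segments.append((state, position, position + length - 1))
        position += length
    return segments


demo = "CCHHHHEECC"  # toy input
print(state_composition(demo))
print(state_segments(demo))
```

To apply this to the record, the state lines of one block (every second line after the header) would first be joined into a single string.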

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9655913; 12597275 [H]