Definition: Acaryochloris marina MBIC11017 chromosome, complete genome.
Accession: NC_009925
Length: 6,503,724

The map label for this gene is cph2 [H]

Identifier: 158337687

GI number: 158337687

Start: 4606843

End: 4608474

Strand: Reverse

Name: cph2 [H]

Synonym: AM1_4571

Alternate gene names: 158337687

Gene position: 4608474-4606843 (Counterclockwise)

Preceding gene: 158337688

Following gene: 158337686

Centisome position: 70.86
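
The coordinates above determine both the gene length and the centisome position. A minimal sketch, assuming the centisome is the gene's first base on its coding strand (4608474 for this reverse-strand gene) expressed as a percentage of the 6,503,724-base chromosome. That convention is inferred from the numbers in this record, and the function names are illustrative only:

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given inclusive 1-based coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position on the chromosome as a percentage of its total length."""
    return round(100.0 * position / genome_length, 2)


# Values taken from this record; the choice of 4608474 as the reference
# base (the 5' end of a reverse-strand gene) is an assumption.
print(gene_length(4606843, 4608474))   # 1632
print(centisome(4608474, 6503724))     # 70.86
```

Using the lower coordinate (4606843) instead would give 70.83, so the listed value of 70.86 appears to be measured from the higher coordinate.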

GC content: 44.85

Gene sequence:

>1632_bases
TTGAACAGAAAAATAGCGTTAATTTCTATTCCCACAGGAGTATTGGGAATTGGTCTATCAGCGTTATTAGCATTAAAGTT
CTATTCCCTAGAAACTCAAGCTATTCAAAAAGATTTTCAGCAGGATATCAATGAGCAGGTCCACCGCTTAGAAGCCAAAA
TTGATGCCAAACTGGAAGCTATTAATAGCCTCAAGCTGCTGTTTGATAGTTCTGAGCAGGTCACACCCAAAGAGTTCCAG
CAATTCACGCACAATCTCTTAGCTCGTCATAAAGATATTCAAGCCCTGGAATGGGTGCCTAAGGTTAAACATGCGGATCG
GGCAACATTTATCAAACAGCGACAACAAAACTATCCTGCTTTTGAAATTACCCAACAGGTTAGCCAAGGCAAGATGGTTC
GTGCTCAAAAACGAGCAGAATACTACCCGGTATCATTTTTAGAACCTTTTGCCGGTAATGAATTGGCTTTAGGATTTGAT
TTGGCTTCTGATGCGACTCGTAAACGGGCCATCACCTTAGCCACCGATACGGGAATGGTTCAATCCACCAGTAATTTGAC
CTTGGTCCAAGAACAAGAAGAGCAAAAAGGCTTCATCACCTTTATTCCGGTCTATCAAGACCAGCCCAACACATTAGATA
GCAGACGCCAACGGTTGGAAGGTTTGGTGTTGGGGGTCTTTCGGATCAGTGATTTAGTTAATGGGGCGATTCAGCCTGGG
GCGATAGATGCAATTAACCTGCAACTGATGGATACATCCAATCCAGAAGATATTCCCTATGTGCGGCAATCGAGATTAGG
TCAGCCGATGCCTGAACATGAATATCGTTCTGATCTCAAATCGATTGCTGGGCAGCAATGGACGATTGCGGCTATCCCTT
CTAATGTTTATTTCAACGAGAAGCGAAGTGGCCTGCCCTGGGCAGTGTTCTGGGTGGGTTTAGTTTTTACGGTTCTGACT
GAAGCCTATGTATTCTTTATCCTTCGGCAGTCAAAGCTTGTGGAGACCGTTGTCCGCGATCGCACGAAGGAATTAGAAGA
AGCGAACAAAAAACTCTCTCTAATCTCTACCACAGATGAATTAACCCATATTGCCAATCGCCGACACTTTAACGATTGTT
TGGATAAAGAGTGGAAGCGGGCGATCCGTGAGCAAACCCCCATGACGTTATTTCTGATCAATCTAGATTTTTTCCGACAG
TTTAATGAGGGATATGGGTTTGTCGCTGGCGATGAGTGCCTAAAAAAAATCGCGTCTCAACTGGAATCTCTGCTCAAGCG
TCCTGCAGACCTAGTGGCCAGATTTGAAGGGGAAACCTTTGCACTACTGCTACCGAATACCGCAAATGCGGAACCCCTTG
CTCAGCGCTGTATCGAATCCATAGAAGCTTTGAAGATCAAACATATTTACTCCCCCATTAGTGAATATGTCACCGTCAGT
ATTGGGGTTGGGTTTGTACGACCAGCTCATGATACACCCATGACTAAGTTAGTGGAGAAAGCCACGCAAGCCTTATTACA
AGCGAAGGATGCTGGGCGCAATCAGCATGCCTTTATTCATATCCCTTCCTTTGCCGATACCTTGGTGCCATCTGACCCAG
CATCTCAAGAGTTACAATTTCAGTCCCAATAA
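
The GC content field is presumably the percentage of G and C bases in the gene sequence above. A minimal sketch of that calculation, assuming FASTA-style input where lines beginning with ">" are headers; `gc_content` is a hypothetical helper, not part of any database tooling:

```python
def gc_content(fasta_text: str) -> float:
    """Percent G+C in a FASTA-style block; '>' header lines are skipped."""
    bases = "".join(
        line.strip()
        for line in fasta_text.splitlines()
        if not line.startswith(">")
    ).upper()
    if not bases:
        raise ValueError("no sequence data found")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# Toy input: ATGCGGCC has 6 G/C bases out of 8.
example = ">toy\nATGC\nGGCC\n"
print(round(gc_content(example), 2))  # 75.0
```

Pasting the 1632-base block above into `gc_content` should give a value comparable to the listed one, provided the record uses the same definition.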

Upstream 100 bases:

>100_bases
TACAGAATGACAATTATTACTTACTCAATAAAACAGGATAGATAACGAACATAGCTTGATTGGCCTGTTTTCGTAAGTAA
TCCAGAGGGGGATATTCATT

Downstream 100 bases:

>100_bases
TATCGACTCAAAAGCGAGTGGTTTGCCAAGATAAGGGGAGTACGCTCATTTTCCCCTATGACAGGCCCTCTCCAACATAC
TCTTAAGGCTTTACAGACGT

Product: diguanylate cyclase

Products: NA

Alternate protein names: Bacteriophytochrome cph2 [H]

Number of amino acids: Translated: 543; Mature: 543
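
The residue count follows from the gene length: 1632 bases form 544 codons, and the final codon (TAA in the gene sequence above) is a stop codon that adds no residue. The leading TTG is read as an alternative start codon, which is why the protein sequence begins with M. A small check, using a hypothetical helper name:

```python
def protein_length(n_bases: int, ends_with_stop: bool = True) -> int:
    """Number of residues encoded by a coding sequence of n_bases."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = n_bases // 3
    # The terminal stop codon is counted as a codon but not as a residue.
    return codons - 1 if ends_with_stop else codons


print(protein_length(1632))  # 543
```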

Protein sequence:

>543_residues
MNRKIALISIPTGVLGIGLSALLALKFYSLETQAIQKDFQQDINEQVHRLEAKIDAKLEAINSLKLLFDSSEQVTPKEFQ
QFTHNLLARHKDIQALEWVPKVKHADRATFIKQRQQNYPAFEITQQVSQGKMVRAQKRAEYYPVSFLEPFAGNELALGFD
LASDATRKRAITLATDTGMVQSTSNLTLVQEQEEQKGFITFIPVYQDQPNTLDSRRQRLEGLVLGVFRISDLVNGAIQPG
AIDAINLQLMDTSNPEDIPYVRQSRLGQPMPEHEYRSDLKSIAGQQWTIAAIPSNVYFNEKRSGLPWAVFWVGLVFTVLT
EAYVFFILRQSKLVETVVRDRTKELEEANKKLSLISTTDELTHIANRRHFNDCLDKEWKRAIREQTPMTLFLINLDFFRQ
FNEGYGFVAGDECLKKIASQLESLLKRPADLVARFEGETFALLLPNTANAEPLAQRCIESIEALKIKHIYSPISEYVTVS
IGVGFVRPAHDTPMTKLVEKATQALLQAKDAGRNQHAFIHIPSFADTLVPSDPASQELQFQSQ

Sequences:

>Translated_543_residues
MNRKIALISIPTGVLGIGLSALLALKFYSLETQAIQKDFQQDINEQVHRLEAKIDAKLEAINSLKLLFDSSEQVTPKEFQ
QFTHNLLARHKDIQALEWVPKVKHADRATFIKQRQQNYPAFEITQQVSQGKMVRAQKRAEYYPVSFLEPFAGNELALGFD
LASDATRKRAITLATDTGMVQSTSNLTLVQEQEEQKGFITFIPVYQDQPNTLDSRRQRLEGLVLGVFRISDLVNGAIQPG
AIDAINLQLMDTSNPEDIPYVRQSRLGQPMPEHEYRSDLKSIAGQQWTIAAIPSNVYFNEKRSGLPWAVFWVGLVFTVLT
EAYVFFILRQSKLVETVVRDRTKELEEANKKLSLISTTDELTHIANRRHFNDCLDKEWKRAIREQTPMTLFLINLDFFRQ
FNEGYGFVAGDECLKKIASQLESLLKRPADLVARFEGETFALLLPNTANAEPLAQRCIESIEALKIKHIYSPISEYVTVS
IGVGFVRPAHDTPMTKLVEKATQALLQAKDAGRNQHAFIHIPSFADTLVPSDPASQELQFQSQ
>Mature_543_residues
MNRKIALISIPTGVLGIGLSALLALKFYSLETQAIQKDFQQDINEQVHRLEAKIDAKLEAINSLKLLFDSSEQVTPKEFQ
QFTHNLLARHKDIQALEWVPKVKHADRATFIKQRQQNYPAFEITQQVSQGKMVRAQKRAEYYPVSFLEPFAGNELALGFD
LASDATRKRAITLATDTGMVQSTSNLTLVQEQEEQKGFITFIPVYQDQPNTLDSRRQRLEGLVLGVFRISDLVNGAIQPG
AIDAINLQLMDTSNPEDIPYVRQSRLGQPMPEHEYRSDLKSIAGQQWTIAAIPSNVYFNEKRSGLPWAVFWVGLVFTVLT
EAYVFFILRQSKLVETVVRDRTKELEEANKKLSLISTTDELTHIANRRHFNDCLDKEWKRAIREQTPMTLFLINLDFFRQ
FNEGYGFVAGDECLKKIASQLESLLKRPADLVARFEGETFALLLPNTANAEPLAQRCIESIEALKIKHIYSPISEYVTVS
IGVGFVRPAHDTPMTKLVEKATQALLQAKDAGRNQHAFIHIPSFADTLVPSDPASQELQFQSQ

Specific function: Photoreceptor that exists in two forms reversibly interconvertible by light: the R form, which absorbs maximally in the red region of the spectrum, and the FR form, which absorbs maximally in the far-red region [H]

COG id: COG2199

COG function: function code T; FOG: GGDEF domain

Gene ontology:

Cell location: Integral Membrane Protein [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 2 GGDEF domains [H]

Homologues:

Organism=Escherichia coli, GI1786584, Length=176, Percent_Identity=31.25, Blast_Score=84, Evalue=2e-17
Organism=Escherichia coli, GI1787262, Length=267, Percent_Identity=26.97, Blast_Score=78, Evalue=1e-15
Organism=Escherichia coli, GI87081881, Length=186, Percent_Identity=31.72, Blast_Score=75, Evalue=1e-14
Organism=Escherichia coli, GI145693134, Length=159, Percent_Identity=29.56, Blast_Score=74, Evalue=2e-14
Organism=Escherichia coli, GI1787541, Length=174, Percent_Identity=27.01, Blast_Score=73, Evalue=4e-14
Organism=Escherichia coli, GI1788085, Length=192, Percent_Identity=30.21, Blast_Score=73, Evalue=4e-14
Organism=Escherichia coli, GI1787816, Length=170, Percent_Identity=29.41, Blast_Score=70, Evalue=3e-13
Organism=Escherichia coli, GI1788381, Length=181, Percent_Identity=29.83, Blast_Score=69, Evalue=5e-13
Organism=Escherichia coli, GI1787802, Length=171, Percent_Identity=28.07, Blast_Score=66, Evalue=6e-12
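
Each homologue line above is a comma-separated list of `key=value` fields, with the GI number given as a bare `GI…` token. A minimal parsing sketch under that assumption; `parse_homologue` and the `"GI"` dictionary key are illustrative names, and a trailing comma on the line is tolerated:

```python
def parse_homologue(line: str) -> dict:
    """Turn one homologue line from this record into a typed dictionary."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            # The GI number has no '=' in this record format.
            record["GI"] = field[2:]
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1786584, Length=176, "
    "Percent_Identity=31.25, Blast_Score=84, Evalue=2e-17,"
)
print(hit["GI"], hit["Length"], hit["Evalue"])  # 1786584 176 2e-17
```

Parsed this way, the hits can be sorted or filtered by E-value or BLAST score.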

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001054
- InterPro:   IPR000160
- InterPro:   IPR001633
- InterPro:   IPR003018
- InterPro:   IPR016132
- InterPro:   IPR001294
- InterPro:   IPR013515 [H]

Pfam domain/function: PF00563 EAL; PF01590 GAF; PF00990 GGDEF; PF00360 Phytochrome [H]

EC number: NA

Molecular weight: Translated: 61478; Mature: 61478

Theoretical pI: Translated: 6.75; Mature: 6.75

Prosite motif: PS50839 CHASE; PS50887 GGDEF

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
1.3 %Met     (Translated Protein)
1.8 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
1.3 %Met     (Mature Protein)
1.8 %Cys+Met (Mature Protein)
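
The percentages above are residue counts divided by the protein length. A minimal sketch of that calculation, assuming one-letter amino acid codes and rounding to one decimal place; `residue_percent` is a hypothetical helper shown on a toy sequence:

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percent of positions in `protein` occupied by any of `residues`."""
    protein = protein.upper()
    if not protein:
        raise ValueError("empty protein sequence")
    hits = sum(protein.count(r) for r in residues.upper())
    return round(100.0 * hits / len(protein), 1)


toy = "MKCLLMAAAA"  # 10 residues: 2 Met, 1 Cys
print(residue_percent(toy, "C"))   # 10.0
print(residue_percent(toy, "M"))   # 20.0
print(residue_percent(toy, "CM"))  # 30.0
```

For the 543-residue protein in this record, a hand count gives 3 Cys and 7 Met, so 3/543, 7/543 and 10/543 round to 0.6 %, 1.3 % and 1.8 %, in line with the listed values.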

Secondary structure:

>Translated Secondary Structure
MNRKIALISIPTGVLGIGLSALLALKFYSLETQAIQKDFQQDINEQVHRLEAKIDAKLEA
CCCEEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
INSLKLLFDSSEQVTPKEFQQFTHNLLARHKDIQALEWVPKVKHADRATFIKQRQQNYPA
HHHHHHEECCCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCHHHHHHHHHHCCCCH
FEITQQVSQGKMVRAQKRAEYYPVSFLEPFAGNELALGFDLASDATRKRAITLATDTGMV
HHHHHHHHCCHHHHHHHHCCCCCHHHHCCCCCCCEEEEEECCCCCCCCCEEEEEECCCCE
QSTSNLTLVQEQEEQKGFITFIPVYQDQPNTLDSRRQRLEGLVLGVFRISDLVNGAIQPG
ECCCCEEEEECCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCC
AIDAINLQLMDTSNPEDIPYVRQSRLGQPMPEHEYRSDLKSIAGQQWTIAAIPSNVYFNE
CEEEEEEEEEECCCCCCCCHHHHHHCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEECC
KRSGLPWAVFWVGLVFTVLTEAYVFFILRQSKLVETVVRDRTKELEEANKKLSLISTTDE
CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEECHHH
LTHIANRRHFNDCLDKEWKRAIREQTPMTLFLINLDFFRQFNEGYGFVAGDECLKKIASQ
HHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEHHHHHHHHCCCCEEEHHHHHHHHHHH
LESLLKRPADLVARFEGETFALLLPNTANAEPLAQRCIESIEALKIKHIYSPISEYVTVS
HHHHHHCHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHEEEE
IGVGFVRPAHDTPMTKLVEKATQALLQAKDAGRNQHAFIHIPSFADTLVPSDPASQELQF
ECCCEECCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCHHHHCCCCCCCCCCCCC
QSQ
CCC
>Mature Secondary Structure
MNRKIALISIPTGVLGIGLSALLALKFYSLETQAIQKDFQQDINEQVHRLEAKIDAKLEA
CCCEEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
INSLKLLFDSSEQVTPKEFQQFTHNLLARHKDIQALEWVPKVKHADRATFIKQRQQNYPA
HHHHHHEECCCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCHHHHHHHHHHCCCCH
FEITQQVSQGKMVRAQKRAEYYPVSFLEPFAGNELALGFDLASDATRKRAITLATDTGMV
HHHHHHHHCCHHHHHHHHCCCCCHHHHCCCCCCCEEEEEECCCCCCCCCEEEEEECCCCE
QSTSNLTLVQEQEEQKGFITFIPVYQDQPNTLDSRRQRLEGLVLGVFRISDLVNGAIQPG
ECCCCEEEEECCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCC
AIDAINLQLMDTSNPEDIPYVRQSRLGQPMPEHEYRSDLKSIAGQQWTIAAIPSNVYFNE
CEEEEEEEEEECCCCCCCCHHHHHHCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCEECC
KRSGLPWAVFWVGLVFTVLTEAYVFFILRQSKLVETVVRDRTKELEEANKKLSLISTTDE
CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEECHHH
LTHIANRRHFNDCLDKEWKRAIREQTPMTLFLINLDFFRQFNEGYGFVAGDECLKKIASQ
HHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEHHHHHHHHCCCCEEEHHHHHHHHHHH
LESLLKRPADLVARFEGETFALLLPNTANAEPLAQRCIESIEALKIKHIYSPISEYVTVS
HHHHHHCHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHEEEE
IGVGFVRPAHDTPMTKLVEKATQALLQAKDAGRNQHAFIHIPSFADTLVPSDPASQELQF
ECCCEECCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCHHHHCCCCCCCCCCCCC
QSQ
CCC
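
The state strings above can be summarized as a composition. A minimal sketch, assuming the common three-state legend (H = helix, E = strand, C = coil), which this record does not spell out; `ss_composition` is an illustrative name:

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percent of positions in each of the three states H, E and C."""
    counts = Counter(states)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("empty state string")
    return {code: round(100.0 * counts.get(code, 0) / total, 1) for code in "HEC"}


# First 20 states of the prediction above.
print(ss_composition("CCCEEEEEECCCHHHHHHHH"))  # {'H': 40.0, 'E': 30.0, 'C': 30.0}
```

Concatenating all state lines before calling the function gives the composition of the whole protein.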

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 8590279; 8905231; 10978170; 11063585 [H]