Definition | Vibrio cholerae O395 chromosome 2, complete sequence. |
---|---|
Accession | NC_009457 |
Length | 3,024,069 |
The map label for this gene is cph2 [H]
Identifier: 147673373
GI number: 147673373
Start: 2081021
End: 2082760
Strand: Reverse
Name: cph2 [H]
Synonym: VC0395_A1949
Alternate gene names: 147673373
Gene position: 2082760-2081021 (Counterclockwise)
Preceding gene: 147675264
Following gene: 147673942
Centisome position: 68.87
GC content: 49.43
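The coordinate fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes 1-based inclusive coordinates, and that the centisome is measured at the first base of the gene (the higher coordinate, since the gene is on the reverse strand) relative to the chromosome length of 3,024,069. That convention is inferred from the numbers and is not stated in the record. The variable names are illustrative.

```python
# Values copied from the record; names are illustrative, not part of the record.
CHROMOSOME_LENGTH = 3_024_069
START, END = 2_081_021, 2_082_760  # assumed 1-based, inclusive

gene_length = END - START + 1   # bases spanned by the gene
codons = gene_length // 3       # includes the terminal stop codon
residues = codons - 1           # length of the translated protein

# Assumption: centisome = 100 * (first base of the gene) / chromosome length.
# On the reverse strand the first base is the higher coordinate (END).
centisome = round(100 * END / CHROMOSOME_LENGTH, 2)

print(gene_length, codons, residues, centisome)
```

Under these assumptions the sketch gives a gene length of 1740 bases, 579 translated residues, and a centisome position of 68.87, in agreement with the fields in this record.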
Gene sequence:
>1740_bases ATGCCTGAATTTCTCTCTGAATTTGTACGTTTCTTGTTCGCTGCCGGGCTTGTGCTTGGTGGTGGGTTATGGCTTTTTTC AGGATGGCAGCGCTATGTTCAACCTCAACAGTGGATTCAGTTACTTCACCATGCACCGTCAGGGATGCTCTTGGTAGGAG AGGATCGCGTTTTACGTGCCAATCTTGCCGCGTATTTGTTACTGGGGATCCGCTTGGTGGGACGTCACTATCTGTTTTCT GCCGAGCAGAGTGAAGAGAGTCAGCAAGCTTTTTATCGAGCGCTCGCCAGCAGTGCACAGCAAAAGCGTTCCGTCCCTCT GCTTTGGCCTGTGCCGGGCAATTTGACCCAAACTCTAGAGATCTCAGCTTCGCTCTTACGTCGTTGGCCGAAGAAATTAT GGCTAGTGAATGTGATTGGTTTTGAGTGTCCCAGCCATGACATTCAACAAGAGCGCCACTCACTGGCGATAGCGCGCACG GCACTTGATTCCCTCTCCGAGCTGATTTTTATTAAAAGTACCGAAGGCCACTTAATCGCAACCAACCGAGCGTTTGATCA GTTTTGGCAAGGCCGGATTGAAGAGGGCAGCGCTACTTTTAAAGGCATTATGAAAGGGCGCACGAGTCAGCGTTGCTGGA CTGTGACGCCTGATGGGCGCAGCTGTCTGTTAGAAACCTACCAACGGGTATTGATGTCGCCGCAAGGTGAAAATATTGGG CTACTTGGCATCAGTCATGATGTGACCGACTGGTACAACATGCAGCGTCAATTAAGAGAAGAGATGGAAAAACGCCGTGA CACCGAAGTGGCATTGGCACAGCGCGATACGATTTTACAAAACATCTTAGAATCTAGCCCCGATTCGATTGGTATCTTCA ATGAAAACATGGTCTACCAAGCCTGTAACCAGCCGTTTGTGGAAGCTCTCGGGATCGCGGAAGTGTCAGATCTGGTTGGT AAACGGCTGCAAGATGTGATCCCCGAGCACATCTATGCGCGTCTTTCCGATACGGATAGCCAAGTCCTGCACCAAGGTAA GTCTCTGCGCTACATCGACAGAATTGAACGCTCAGATGGTGAGTTTATCTGGTTTGATGTTGTGAAATCGCCTTTTCGAG ATCCGGCTTCGGGCACCAATGGCGTGCTGATCATGGCGCGAGATGTGTCGGAGCGCTATCTCGCCGCTGAACAATTAGAA GCCGCCAACCAAGAGCTGGAACGCCTAAGCTTTTTAGATAGCTTGACTCATGTTGCCAATCGTCGTCGTTTTGATGAACA ACTGCATACCCTCTGGCATTTGCATGTGCGTGAAGGCAAACCATTAAGCATCATTCTGTGTGATGTCGATTATTTCAAAG ATTACAACGACGCTTATGGCCATTTGATGGGCGATGAGACGCTCAAACAGATAGCGATTGCCTTTACTCAAGTCGCCAAT CGCCATTCTGATTGTGTTGCCCGCTACGGGGGAGAAGAGTTTGGTATTTTGCTGCCCAATACACCACAGTCCGGAGCAAT ACTGGTCGCAGAGCGAATCCATGAGAAAGTTCGTGGATTAGCGATTCCACATGATCATTCTAAGGTTGCCGATAGGATTA CCGTCAGCTTAGGCATAGTGACGCTTATTCCTCGGCCTGAGGATGTACCTGAGCAAATGGTTGAGCTAGCGGATCGGGCT TTATACCAAGCCAAAGCGAATGGCCGCAATCAGACCTCAATTTATCAACCAAACCACTAA
Upstream 100 bases:
>100_bases CCTTTTGAGTATTCACCTTTGAACTTAAAGTAATAATCCGCAGCAAAGTCAGATAATCTTTTGTACTATTTAAAGAATCA TCAATAACACCGCTGAATGC
Downstream 100 bases:
>100_bases AACTGTGCGGTTCGCCCTGTCACAGCGCCAAGAAGTGGATAAGTGCTGCTAACGTTTGTGTGGGAAAAAGGCTAGACTAA GTGCTCTTTATCTGACGGAG
Product: sensory box/GGDEF family protein
Products: NA
Alternate protein names: Bacteriophytochrome cph2 [H]
Number of amino acids: Translated: 579; Mature: 578
Protein sequence:
>579_residues MPEFLSEFVRFLFAAGLVLGGGLWLFSGWQRYVQPQQWIQLLHHAPSGMLLVGEDRVLRANLAAYLLLGIRLVGRHYLFS AEQSEESQQAFYRALASSAQQKRSVPLLWPVPGNLTQTLEISASLLRRWPKKLWLVNVIGFECPSHDIQQERHSLAIART ALDSLSELIFIKSTEGHLIATNRAFDQFWQGRIEEGSATFKGIMKGRTSQRCWTVTPDGRSCLLETYQRVLMSPQGENIG LLGISHDVTDWYNMQRQLREEMEKRRDTEVALAQRDTILQNILESSPDSIGIFNENMVYQACNQPFVEALGIAEVSDLVG KRLQDVIPEHIYARLSDTDSQVLHQGKSLRYIDRIERSDGEFIWFDVVKSPFRDPASGTNGVLIMARDVSERYLAAEQLE AANQELERLSFLDSLTHVANRRRFDEQLHTLWHLHVREGKPLSIILCDVDYFKDYNDAYGHLMGDETLKQIAIAFTQVAN RHSDCVARYGGEEFGILLPNTPQSGAILVAERIHEKVRGLAIPHDHSKVADRITVSLGIVTLIPRPEDVPEQMVELADRA LYQAKANGRNQTSIYQPNH
Sequences:
>Translated_579_residues MPEFLSEFVRFLFAAGLVLGGGLWLFSGWQRYVQPQQWIQLLHHAPSGMLLVGEDRVLRANLAAYLLLGIRLVGRHYLFS AEQSEESQQAFYRALASSAQQKRSVPLLWPVPGNLTQTLEISASLLRRWPKKLWLVNVIGFECPSHDIQQERHSLAIART ALDSLSELIFIKSTEGHLIATNRAFDQFWQGRIEEGSATFKGIMKGRTSQRCWTVTPDGRSCLLETYQRVLMSPQGENIG LLGISHDVTDWYNMQRQLREEMEKRRDTEVALAQRDTILQNILESSPDSIGIFNENMVYQACNQPFVEALGIAEVSDLVG KRLQDVIPEHIYARLSDTDSQVLHQGKSLRYIDRIERSDGEFIWFDVVKSPFRDPASGTNGVLIMARDVSERYLAAEQLE AANQELERLSFLDSLTHVANRRRFDEQLHTLWHLHVREGKPLSIILCDVDYFKDYNDAYGHLMGDETLKQIAIAFTQVAN RHSDCVARYGGEEFGILLPNTPQSGAILVAERIHEKVRGLAIPHDHSKVADRITVSLGIVTLIPRPEDVPEQMVELADRA LYQAKANGRNQTSIYQPNH
>Mature_578_residues PEFLSEFVRFLFAAGLVLGGGLWLFSGWQRYVQPQQWIQLLHHAPSGMLLVGEDRVLRANLAAYLLLGIRLVGRHYLFSA EQSEESQQAFYRALASSAQQKRSVPLLWPVPGNLTQTLEISASLLRRWPKKLWLVNVIGFECPSHDIQQERHSLAIARTA LDSLSELIFIKSTEGHLIATNRAFDQFWQGRIEEGSATFKGIMKGRTSQRCWTVTPDGRSCLLETYQRVLMSPQGENIGL LGISHDVTDWYNMQRQLREEMEKRRDTEVALAQRDTILQNILESSPDSIGIFNENMVYQACNQPFVEALGIAEVSDLVGK RLQDVIPEHIYARLSDTDSQVLHQGKSLRYIDRIERSDGEFIWFDVVKSPFRDPASGTNGVLIMARDVSERYLAAEQLEA ANQELERLSFLDSLTHVANRRRFDEQLHTLWHLHVREGKPLSIILCDVDYFKDYNDAYGHLMGDETLKQIAIAFTQVANR HSDCVARYGGEEFGILLPNTPQSGAILVAERIHEKVRGLAIPHDHSKVADRITVSLGIVTLIPRPEDVPEQMVELADRAL YQAKANGRNQTSIYQPNH
Specific function: Photoreceptor that exists in two forms reversibly interconvertible by light: the R form, which absorbs maximally in the red region of the spectrum, and the FR form, which absorbs maximally in the far-red region [H]
COG id: NA
COG function: NA
Gene ontology:
Cell location: Integral Membrane Protein [C]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 2 GGDEF domains [H]
Homologues:
Organism | GI | Length | Percent_Identity | Blast_Score | Evalue |
---|---|---|---|---|---|
Escherichia coli | GI1786584 | 185 | 34.0540540540541 | 103 | 3e-23 |
Escherichia coli | GI1788381 | 354 | 27.1186440677966 | 97 | 2e-21 |
Escherichia coli | GI145693134 | 171 | 35.0877192982456 | 97 | 2e-21 |
Escherichia coli | GI1787262 | 162 | 33.9506172839506 | 96 | 4e-21 |
Escherichia coli | GI87081881 | 194 | 34.5360824742268 | 94 | 3e-20 |
Escherichia coli | GI1787541 | 214 | 29.9065420560748 | 91 | 2e-19 |
Escherichia coli | GI1787802 | 212 | 30.6603773584906 | 86 | 7e-18 |
Escherichia coli | GI87082007 | 159 | 37.1069182389937 | 86 | 8e-18 |
Escherichia coli | GI1787816 | 167 | 37.125748502994 | 85 | 2e-17 |
Escherichia coli | GI1788085 | 190 | 30.5263157894737 | 80 | 5e-16 |
Escherichia coli | GI87081977 | 183 | 32.2404371584699 | 77 | 4e-15 |
Escherichia coli | GI48994928 | 161 | 27.9503105590062 | 75 | 1e-14 |
Escherichia coli | GI1787056 | 167 | 31.1377245508982 | 74 | 4e-14 |
Escherichia coli | GI1788956 | 170 | 31.1764705882353 | 66 | 6e-12 |
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001054
- InterPro: IPR000160
- InterPro: IPR001633
- InterPro: IPR003018
- InterPro: IPR016132
- InterPro: IPR001294
- InterPro: IPR013515 [H]
Pfam domain/function: PF00563 EAL; PF01590 GAF; PF00990 GGDEF; PF00360 Phytochrome [H]
EC number: NA
Molecular weight: Translated: 65764; Mature: 65632
Theoretical pI: Translated: 6.30; Mature: 6.30
Prosite motif: PS50113 PAC; PS50887 GGDEF
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.0 %Cys (Translated Protein)
1.7 %Met (Translated Protein)
2.8 %Cys+Met (Translated Protein)
1.0 %Cys (Mature Protein)
1.6 %Met (Mature Protein)
2.6 %Cys+Met (Mature Protein)
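The Cys/Met percentages above can be recomputed from the translated protein sequence given in this record. The sketch below is only an illustration: the function name `cys_met_percent` is hypothetical, and it assumes that the mature form is the translated form with the initiator methionine removed, which is consistent with the 579 and 578 residue counts but is not stated explicitly in the record.

```python
# Translated protein sequence copied from this record (579 residues).
TRANSLATED = (
    "MPEFLSEFVRFLFAAGLVLGGGLWLFSGWQRYVQPQQWIQLLHHAPSGMLLVGEDRVLRANLAAYLLLGIRLVGRHYLFS"
    "AEQSEESQQAFYRALASSAQQKRSVPLLWPVPGNLTQTLEISASLLRRWPKKLWLVNVIGFECPSHDIQQERHSLAIART"
    "ALDSLSELIFIKSTEGHLIATNRAFDQFWQGRIEEGSATFKGIMKGRTSQRCWTVTPDGRSCLLETYQRVLMSPQGENIG"
    "LLGISHDVTDWYNMQRQLREEMEKRRDTEVALAQRDTILQNILESSPDSIGIFNENMVYQACNQPFVEALGIAEVSDLVG"
    "KRLQDVIPEHIYARLSDTDSQVLHQGKSLRYIDRIERSDGEFIWFDVVKSPFRDPASGTNGVLIMARDVSERYLAAEQLE"
    "AANQELERLSFLDSLTHVANRRRFDEQLHTLWHLHVREGKPLSIILCDVDYFKDYNDAYGHLMGDETLKQIAIAFTQVAN"
    "RHSDCVARYGGEEFGILLPNTPQSGAILVAERIHEKVRGLAIPHDHSKVADRITVSLGIVTLIPRPEDVPEQMVELADRA"
    "LYQAKANGRNQTSIYQPNH"
)

# Assumption: the mature protein is the translated protein minus the initiator Met.
MATURE = TRANSLATED[1:]


def cys_met_percent(seq):
    """Return (%Cys, %Met, %Cys+Met) for a protein, rounded to one decimal."""
    n = len(seq)
    cys = seq.count("C")
    met = seq.count("M")
    return (
        round(100 * cys / n, 1),
        round(100 * met / n, 1),
        round(100 * (cys + met) / n, 1),
    )


print("Translated:", cys_met_percent(TRANSLATED))
print("Mature:", cys_met_percent(MATURE))
```

With 6 cysteines and 10 methionines in the translated sequence, this yields (1.0, 1.7, 2.8) for the translated protein and (1.0, 1.6, 2.6) for the mature protein, matching the values listed.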
Secondary structure:
>Translated Secondary Structure
MPEFLSEFVRFLFAAGLVLGGGLWLFSGWQRYVQPQQWIQLLHHAPSGMLLVGEDRVLRA
CCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCHHHHHHHHHCCCCCEEEECCCCHHHH
NLAAYLLLGIRLVGRHYLFSAEQSEESQQAFYRALASSAQQKRSVPLLWPVPGNLTQTLE
HHHHHHHHHHHHHHHHHHCCCHHCHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCHHHHH
ISASLLRRWPKKLWLVNVIGFECPSHDIQQERHSLAIARTALDSLSELIFIKSTEGHLIA
HHHHHHHHHHHHHEEHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHEEEEECCCCCEEE
TNRAFDQFWQGRIEEGSATFKGIMKGRTSQRCWTVTPDGRSCLLETYQRVLMSPQGENIG
ECHHHHHHHCCCCCCCCHHHHHHHCCCCCCCEEEECCCCHHHHHHHHHHHHCCCCCCCEE
LLGISHDVTDWYNMQRQLREEMEKRRDTEVALAQRDTILQNILESSPDSIGIFNENMVYQ
EEECCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCEEECCCCHHHH
ACNQPFVEALGIAEVSDLVGKRLQDVIPEHIYARLSDTDSQVLHQGKSLRYIDRIERSDG
HCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHCCCCHHHHHHHHCCCC
EFIWFDVVKSPFRDPASGTNGVLIMARDVSERYLAAEQLEAANQELERLSFLDSLTHVAN
CEEEEEHHHCCCCCCCCCCCCEEEEEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
RRRFDEQLHTLWHLHVREGKPLSIILCDVDYFKDYNDAYGHLMGDETLKQIAIAFTQVAN
HHHHHHHHHHHHHEEECCCCCEEEEEECCHHHHCCCHHHHCCCCHHHHHHHHHHHHHHHC
RHSDCVARYGGEEFGILLPNTPQSGAILVAERIHEKVRGLAIPHDHSKVADRITVSLGIV
CHHHHHHHCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHEEEEEEE
TLIPRPEDVPEQMVELADRALYQAKANGRNQTSIYQPNH
EECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCC
>Mature Secondary Structure
PEFLSEFVRFLFAAGLVLGGGLWLFSGWQRYVQPQQWIQLLHHAPSGMLLVGEDRVLRA
CHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCHHHHHHHHHCCCCCEEEECCCCHHHH
NLAAYLLLGIRLVGRHYLFSAEQSEESQQAFYRALASSAQQKRSVPLLWPVPGNLTQTLE
HHHHHHHHHHHHHHHHHHCCCHHCHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCHHHHH
ISASLLRRWPKKLWLVNVIGFECPSHDIQQERHSLAIARTALDSLSELIFIKSTEGHLIA
HHHHHHHHHHHHHEEHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHEEEEECCCCCEEE
TNRAFDQFWQGRIEEGSATFKGIMKGRTSQRCWTVTPDGRSCLLETYQRVLMSPQGENIG
ECHHHHHHHCCCCCCCCHHHHHHHCCCCCCCEEEECCCCHHHHHHHHHHHHCCCCCCCEE
LLGISHDVTDWYNMQRQLREEMEKRRDTEVALAQRDTILQNILESSPDSIGIFNENMVYQ
EEECCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCEEECCCCHHHH
ACNQPFVEALGIAEVSDLVGKRLQDVIPEHIYARLSDTDSQVLHQGKSLRYIDRIERSDG
HCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHCCCCHHHHHHHHCCCC
EFIWFDVVKSPFRDPASGTNGVLIMARDVSERYLAAEQLEAANQELERLSFLDSLTHVAN
CEEEEEHHHCCCCCCCCCCCCEEEEEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
RRRFDEQLHTLWHLHVREGKPLSIILCDVDYFKDYNDAYGHLMGDETLKQIAIAFTQVAN
HHHHHHHHHHHHHEEECCCCCEEEEEECCHHHHCCCHHHHCCCCHHHHHHHHHHHHHHHC
RHSDCVARYGGEEFGILLPNTPQSGAILVAERIHEKVRGLAIPHDHSKVADRITVSLGIV
CHHHHHHHCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHEEEEEEE
TLIPRPEDVPEQMVELADRALYQAKANGRNQTSIYQPNH
EECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 8590279; 8905231; 10978170; 11063585 [H]