Definition Herpetosiphon aurantiacus ATCC 23779 chromosome, complete genome.
Accession NC_009972
Length 6,346,587

Click here to switch to the map view.

The map label for this gene is ykoW [H]

Identifier: 159897159

GI number: 159897159

Start: 727128

End: 729176

Strand: Direct

Name: ykoW [H]

Synonym: Haur_0630

Alternate gene names: 159897159

Gene position: 727128-729176 (Clockwise)

Preceding gene: 159897158

Following gene: 159897161

Centisome position: 11.46

GC content: 47.73

Gene sequence:

>2049_bases
ATGAGTAATCCACGAATCTTGATTCTTGATCATACGCCTACAAGTACTCAAGACCTCAGCACTCAATTGATGGCGGCTGG
TTGCAGCGTTATGGCCCATCTGAGCACATGGGAAGCCGCCGCCTGCCTGCTTGGCGAGCAGAGTGTTGATCTGGTCTTGG
CAACGCTGCATTTTGTGCCTCAAATCAATCAGCAACCCTGCCCAGTTCCGGTGGTCTATCTGAGCCAAACCAATGAGCAA
GCCACCCAAATACCCAATCAACCCACCGCGATCGATATTTTAACCTTGCCAATTAGCACCGAAAGCCTTGTGCTCACGCT
TAAAACGATCATCGAGCGCAGCAATTTGACCCAGCGGCTTAGCCGCGTTGAGGTGTGGATGCAAACGATGTTGGCCAATG
TCAGCGATGGCGTGGTGGCAATCGATCAGCATGGCAAAATTCAGTGGATTAATTCTGCCGCCGAGCACATGACTGGCTGG
GATTATCGTAGTGCGCTCCAGCAGGATTTTAATCAAGTGGTGGTAATTCGCAGCAGTCTCAACGATCAACGCATCGATGT
CATTGCGGCGGCCTTACGCAACGAGCCAGTGTTTGCCTTGCCCTTTGAGCGTTATTTGCATGCTCGCGATGGCCATGCAA
CCTCAATTACTGAACATGTTACGCCATTGCTCAATAACGACGGCCAAAATAATGGAGCAATCGTGATTTTGCGCGATCAT
ACAGCCCAATTACAAATGGAAGAGGCGCTGTATTATCAATCGCTGCACGATTCATTAACTGGCTTGCCCAATCGCCGTTC
ATTTCAATTACATTTATCACGGGCTTTAGAATACCAACGCCATCATCATGATTATAGCTTTGCGATCATTTTGCTCGATA
TCGATGAATTTAAAATGGTTAATGATGGCTTGGGTTATCACATCGGCGATACGATGCTGACCGAAATTGCCCAACGTTTA
CGTCGTGCCCTCTATTTACCAGGCGATGTGGTAGCACGCTTCGATGGCGATGAATTTGCAATCTTTTTTGATCGGCTGCC
CGATTTGCCTGCGGCCTTCAACGCCGCCCAACGCATCCGCCAACTCTTTGAAGATCCATTTATGATTGAAAATGGTCAGG
AAATTTTCTGTAATGTCAGCATTGGCTTGGAATTAATCACCAGCGAAGTGCCGATTGAAACCGTGATGCGAAACGCTGAC
CTGGCACTGTATCGGGCCAAACATACAGGTCGCGGCGGCATCGAAATTTTCGATCAAACCTTGTATGCCAATTTTAGCAC
CCGTTTGCACAACGAAACAGCGTTACGCCTAGCCTTGCAACGCCAAGAATTTCGGTTGTTTGCCCAGCCAATTATCGATT
TTGAGCATAGCCATTGCACAGGCTTTGAAATTTTGATTCGTTGGGCACACCCCGATGGGCGTTTGCGCTCGCCTGGCCAA
TTTTTGGATATTGCCGAGGAAACTGGCTTAATTATTCCATTGGGCTGGTGGATGCTGGAAGTTGCAGCCGAGCAGCTTGA
GCGGTGGCAAGCAGACTCAATGATGCAGCATATGACCTTGGCGATTAATCTCTCGCCGCGTCAATTATTGCATAGCCAGC
TTTTGCCAACGCTTAAATCAATATTCGAGCGCTATCAATTTCCACGTCAACAATTGCATTTAGAAATTACCGAAGGTGCA
TTATTAAATACCGAACGAGCTGAACCAATTTTGAATGCCTTGCGTCATTTTGGCTTACATTTACATATTGATGATTTTGG
CACTGGCTATTCGTCATTAACCTATTTACATCGTTTTCCGTTGAATACAATCAAAATTGACCGTTCGTTTATTCAAGCAG
CGCTCGAAGATCAACGGAGTTTAGCGATTGTGCGCACAATTATCAATTTGGCCCAAACCATGCAATTGGCCACGATTGCC
GAGGGCATCGAAACCGCTGAGCATATTCAAGTGCTGCGTGAACTTGGCTGTCAAGCAGGCCAAGGCTACTTCTTCTCGCC
GCCCGTTCCACTCGAACAGGCGGCGGACTTCGCATGTTCGTTGAATTAG

Upstream 100 bases:

>100_bases
AGTGTTGGTGAGGTGGCATGGTATTGCGACACATTTGATCTAAGCGCTCCATGGGCTGAGTACCTGTGCCAGCGTAACGA
TTTCCATTGGAGTGCATCCC

Downstream 100 bases:

>100_bases
GCTTGGCTGGCCTCGACCATTACATTGGCAACGAGCGTATAGGCAGCAACCCATGCTGCTTGCACTTCGTCATTCCACTG
CTCGCCCAGCTGTTGCGATA

Product: PAS/PAC sensor-containing diguanylate cyclase/phosphodiesterase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 682; Mature: 681

Protein sequence:

>682_residues
MSNPRILILDHTPTSTQDLSTQLMAAGCSVMAHLSTWEAAACLLGEQSVDLVLATLHFVPQINQQPCPVPVVYLSQTNEQ
ATQIPNQPTAIDILTLPISTESLVLTLKTIIERSNLTQRLSRVEVWMQTMLANVSDGVVAIDQHGKIQWINSAAEHMTGW
DYRSALQQDFNQVVVIRSSLNDQRIDVIAAALRNEPVFALPFERYLHARDGHATSITEHVTPLLNNDGQNNGAIVILRDH
TAQLQMEEALYYQSLHDSLTGLPNRRSFQLHLSRALEYQRHHHDYSFAIILLDIDEFKMVNDGLGYHIGDTMLTEIAQRL
RRALYLPGDVVARFDGDEFAIFFDRLPDLPAAFNAAQRIRQLFEDPFMIENGQEIFCNVSIGLELITSEVPIETVMRNAD
LALYRAKHTGRGGIEIFDQTLYANFSTRLHNETALRLALQRQEFRLFAQPIIDFEHSHCTGFEILIRWAHPDGRLRSPGQ
FLDIAEETGLIIPLGWWMLEVAAEQLERWQADSMMQHMTLAINLSPRQLLHSQLLPTLKSIFERYQFPRQQLHLEITEGA
LLNTERAEPILNALRHFGLHLHIDDFGTGYSSLTYLHRFPLNTIKIDRSFIQAALEDQRSLAIVRTIINLAQTMQLATIA
EGIETAEHIQVLRELGCQAGQGYFFSPPVPLEQAADFACSLN

Sequences:

>Translated_682_residues
MSNPRILILDHTPTSTQDLSTQLMAAGCSVMAHLSTWEAAACLLGEQSVDLVLATLHFVPQINQQPCPVPVVYLSQTNEQ
ATQIPNQPTAIDILTLPISTESLVLTLKTIIERSNLTQRLSRVEVWMQTMLANVSDGVVAIDQHGKIQWINSAAEHMTGW
DYRSALQQDFNQVVVIRSSLNDQRIDVIAAALRNEPVFALPFERYLHARDGHATSITEHVTPLLNNDGQNNGAIVILRDH
TAQLQMEEALYYQSLHDSLTGLPNRRSFQLHLSRALEYQRHHHDYSFAIILLDIDEFKMVNDGLGYHIGDTMLTEIAQRL
RRALYLPGDVVARFDGDEFAIFFDRLPDLPAAFNAAQRIRQLFEDPFMIENGQEIFCNVSIGLELITSEVPIETVMRNAD
LALYRAKHTGRGGIEIFDQTLYANFSTRLHNETALRLALQRQEFRLFAQPIIDFEHSHCTGFEILIRWAHPDGRLRSPGQ
FLDIAEETGLIIPLGWWMLEVAAEQLERWQADSMMQHMTLAINLSPRQLLHSQLLPTLKSIFERYQFPRQQLHLEITEGA
LLNTERAEPILNALRHFGLHLHIDDFGTGYSSLTYLHRFPLNTIKIDRSFIQAALEDQRSLAIVRTIINLAQTMQLATIA
EGIETAEHIQVLRELGCQAGQGYFFSPPVPLEQAADFACSLN
>Mature_681_residues
SNPRILILDHTPTSTQDLSTQLMAAGCSVMAHLSTWEAAACLLGEQSVDLVLATLHFVPQINQQPCPVPVVYLSQTNEQA
TQIPNQPTAIDILTLPISTESLVLTLKTIIERSNLTQRLSRVEVWMQTMLANVSDGVVAIDQHGKIQWINSAAEHMTGWD
YRSALQQDFNQVVVIRSSLNDQRIDVIAAALRNEPVFALPFERYLHARDGHATSITEHVTPLLNNDGQNNGAIVILRDHT
AQLQMEEALYYQSLHDSLTGLPNRRSFQLHLSRALEYQRHHHDYSFAIILLDIDEFKMVNDGLGYHIGDTMLTEIAQRLR
RALYLPGDVVARFDGDEFAIFFDRLPDLPAAFNAAQRIRQLFEDPFMIENGQEIFCNVSIGLELITSEVPIETVMRNADL
ALYRAKHTGRGGIEIFDQTLYANFSTRLHNETALRLALQRQEFRLFAQPIIDFEHSHCTGFEILIRWAHPDGRLRSPGQF
LDIAEETGLIIPLGWWMLEVAAEQLERWQADSMMQHMTLAINLSPRQLLHSQLLPTLKSIFERYQFPRQQLHLEITEGAL
LNTERAEPILNALRHFGLHLHIDDFGTGYSSLTYLHRFPLNTIKIDRSFIQAALEDQRSLAIVRTIINLAQTMQLATIAE
GIETAEHIQVLRELGCQAGQGYFFSPPVPLEQAADFACSLN

Specific function: Probable signaling protein whose physiological role is not yet known [H]

COG id: COG5001

COG function: function code T; Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain

Gene ontology:

Cell location: Cell membrane; Multi-pass membrane protein (Potential) [H]

Metaboloic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 PAS (PER-ARNT-SIM) domain [H]

Homologues:

Organism=Escherichia coli, GI1787541, Length=627, Percent_Identity=28.2296650717703, Blast_Score=234, Evalue=2e-62,
Organism=Escherichia coli, GI87081921, Length=450, Percent_Identity=30.2222222222222, Blast_Score=196, Evalue=4e-51,
Organism=Escherichia coli, GI226510982, Length=367, Percent_Identity=28.6103542234332, Blast_Score=165, Evalue=9e-42,
Organism=Escherichia coli, GI1790496, Length=242, Percent_Identity=34.297520661157, Blast_Score=151, Evalue=2e-37,
Organism=Escherichia coli, GI1788502, Length=254, Percent_Identity=32.2834645669291, Blast_Score=143, Evalue=3e-35,
Organism=Escherichia coli, GI87081743, Length=244, Percent_Identity=33.6065573770492, Blast_Score=142, Evalue=1e-34,
Organism=Escherichia coli, GI1786507, Length=245, Percent_Identity=34.6938775510204, Blast_Score=140, Evalue=2e-34,
Organism=Escherichia coli, GI87081980, Length=249, Percent_Identity=36.144578313253, Blast_Score=135, Evalue=1e-32,
Organism=Escherichia coli, GI1788381, Length=306, Percent_Identity=33.0065359477124, Blast_Score=135, Evalue=1e-32,
Organism=Escherichia coli, GI87081845, Length=248, Percent_Identity=33.4677419354839, Blast_Score=134, Evalue=2e-32,
Organism=Escherichia coli, GI1787055, Length=246, Percent_Identity=31.7073170731707, Blast_Score=116, Evalue=4e-27,
Organism=Escherichia coli, GI1788849, Length=238, Percent_Identity=33.1932773109244, Blast_Score=113, Evalue=4e-26,
Organism=Escherichia coli, GI87082096, Length=239, Percent_Identity=33.4728033472803, Blast_Score=108, Evalue=1e-24,
Organism=Escherichia coli, GI87081881, Length=270, Percent_Identity=27.7777777777778, Blast_Score=86, Evalue=6e-18,
Organism=Escherichia coli, GI1786584, Length=182, Percent_Identity=30.7692307692308, Blast_Score=84, Evalue=2e-17,
Organism=Escherichia coli, GI1787816, Length=164, Percent_Identity=34.1463414634146, Blast_Score=78, Evalue=2e-15,
Organism=Escherichia coli, GI1788956, Length=167, Percent_Identity=32.3353293413174, Blast_Score=78, Evalue=2e-15,
Organism=Escherichia coli, GI87081977, Length=161, Percent_Identity=31.6770186335404, Blast_Score=77, Evalue=4e-15,
Organism=Escherichia coli, GI145693134, Length=180, Percent_Identity=30, Blast_Score=76, Evalue=9e-15,
Organism=Escherichia coli, GI87082007, Length=185, Percent_Identity=31.8918918918919, Blast_Score=75, Evalue=1e-14,
Organism=Escherichia coli, GI1788085, Length=181, Percent_Identity=30.3867403314917, Blast_Score=64, Evalue=3e-11,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001054
- InterPro:   IPR000160
- InterPro:   IPR001633
- InterPro:   IPR005330
- InterPro:   IPR000014 [H]

Pfam domain/function: PF00563 EAL; PF00990 GGDEF; PF03707 MHYT [H]

EC number: NA

Molecular weight: Translated: 77235; Mature: 77104

Theoretical pI: Translated: 5.32; Mature: 5.32

Prosite motif: PS50112 PAS ; PS50113 PAC ; PS50883 EAL ; PS50887 GGDEF

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.0 %Cys     (Translated Protein)
2.3 %Met     (Translated Protein)
3.4 %Cys+Met (Translated Protein)
1.0 %Cys     (Mature Protein)
2.2 %Met     (Mature Protein)
3.2 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MSNPRILILDHTPTSTQDLSTQLMAAGCSVMAHLSTWEAAACLLGEQSVDLVLATLHFVP
CCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHC
QINQQPCPVPVVYLSQTNEQATQIPNQPTAIDILTLPISTESLVLTLKTIIERSNLTQRL
CCCCCCCCEEEEEEECCCCHHHCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHH
SRVEVWMQTMLANVSDGVVAIDQHGKIQWINSAAEHMTGWDYRSALQQDFNQVVVIRSSL
HHHHHHHHHHHHCCCCCEEEEECCCCEEEECHHHHHCCCCHHHHHHHHCCCEEEEEECCC
NDQRIDVIAAALRNEPVFALPFERYLHARDGHATSITEHVTPLLNNDGQNNGAIVILRDH
CCHHHHHHHHHHCCCCEEEECHHHHHHCCCCCCCHHHHHHHHHHCCCCCCCCEEEEEECC
TAQLQMEEALYYQSLHDSLTGLPNRRSFQLHLSRALEYQRHHHDYSFAIILLDIDEFKMV
CCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEECHHHHHH
NDGLGYHIGDTMLTEIAQRLRRALYLPGDVVARFDGDEFAIFFDRLPDLPAAFNAAQRIR
CCCCCCCCCHHHHHHHHHHHHHHHCCCCHHEEEECCCCEEEEECCCCCCCHHHHHHHHHH
QLFEDPFMIENGQEIFCNVSIGLELITSEVPIETVMRNADLALYRAKHTGRGGIEIFDQT
HHHCCCCEEECCCEEEEEEEECEEEEECCCCHHHHHCCCCEEEEEECCCCCCCHHHHHHH
LYANFSTRLHNETALRLALQRQEFRLFAQPIIDFEHSHCTGFEILIRWAHPDGRLRSPGQ
HHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCCCCCCH
FLDIAEETGLIIPLGWWMLEVAAEQLERWQADSMMQHMTLAINLSPRQLLHSQLLPTLKS
HHHHHHHCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHEEEEEECCHHHHHHHHHHHHHHH
IFERYQFPRQQLHLEITEGALLNTERAEPILNALRHFGLHLHIDDFGTGYSSLTYLHRFP
HHHHHHCCHHHEEEEEECCCEECCHHHHHHHHHHHHCCEEEEEECCCCCHHHHHHHHHCC
LNTIKIDRSFIQAALEDQRSLAIVRTIINLAQTMQLATIAEGIETAEHIQVLRELGCQAG
CCCEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
QGYFFSPPVPLEQAADFACSLN
CCEEECCCCCHHHHCCCEECCC
>Mature Secondary Structure 
SNPRILILDHTPTSTQDLSTQLMAAGCSVMAHLSTWEAAACLLGEQSVDLVLATLHFVP
CCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHC
QINQQPCPVPVVYLSQTNEQATQIPNQPTAIDILTLPISTESLVLTLKTIIERSNLTQRL
CCCCCCCCEEEEEEECCCCHHHCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHH
SRVEVWMQTMLANVSDGVVAIDQHGKIQWINSAAEHMTGWDYRSALQQDFNQVVVIRSSL
HHHHHHHHHHHHCCCCCEEEEECCCCEEEECHHHHHCCCCHHHHHHHHCCCEEEEEECCC
NDQRIDVIAAALRNEPVFALPFERYLHARDGHATSITEHVTPLLNNDGQNNGAIVILRDH
CCHHHHHHHHHHCCCCEEEECHHHHHHCCCCCCCHHHHHHHHHHCCCCCCCCEEEEEECC
TAQLQMEEALYYQSLHDSLTGLPNRRSFQLHLSRALEYQRHHHDYSFAIILLDIDEFKMV
CCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEECHHHHHH
NDGLGYHIGDTMLTEIAQRLRRALYLPGDVVARFDGDEFAIFFDRLPDLPAAFNAAQRIR
CCCCCCCCCHHHHHHHHHHHHHHHCCCCHHEEEECCCCEEEEECCCCCCCHHHHHHHHHH
QLFEDPFMIENGQEIFCNVSIGLELITSEVPIETVMRNADLALYRAKHTGRGGIEIFDQT
HHHCCCCEEECCCEEEEEEEECEEEEECCCCHHHHHCCCCEEEEEECCCCCCCHHHHHHH
LYANFSTRLHNETALRLALQRQEFRLFAQPIIDFEHSHCTGFEILIRWAHPDGRLRSPGQ
HHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCCCCCCH
FLDIAEETGLIIPLGWWMLEVAAEQLERWQADSMMQHMTLAINLSPRQLLHSQLLPTLKS
HHHHHHHCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHEEEEEECCHHHHHHHHHHHHHHH
IFERYQFPRQQLHLEITEGALLNTERAEPILNALRHFGLHLHIDDFGTGYSSLTYLHRFP
HHHHHHCCHHHEEEEEECCCEECCHHHHHHHHHHHHCCEEEEEECCCCCHHHHHHHHHCC
LNTIKIDRSFIQAALEDQRSLAIVRTIINLAQTMQLATIAEGIETAEHIQVLRELGCQAG
CCCEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
QGYFFSPPVPLEQAADFACSLN
CCEEECCCCCHHHHCCCEECCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 9384377; 11728710 [H]