Definition Novosphingobium aromaticivorans DSM 12444 chromosome, complete genome.
Accession NC_007794
Length 3,561,584

The map label for this gene is yciR [C]

Identifier: 87199053

GI number: 87199053

Start: 1069183

End: 1071492

Strand: Direct

Name: yciR [C]

Synonym: Saro_1031

Alternate gene names: 87199053

Gene position: 1069183-1071492 (Clockwise)

Preceding gene: 87199052

Following gene: 87199056

Centisome position: 30.02
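
The coordinates listed above are enough to re-derive the gene length and the centisome position. The following is a minimal sketch, assuming the centisome value is the gene start expressed as a percentage of the genome length (the record does not state the formula, and the function names are illustrative only):

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


# Values taken from this record.
START, END, GENOME_LENGTH = 1069183, 1071492, 3561584

print(gene_length(START, END))                    # 2310
print(round(centisome(START, GENOME_LENGTH), 2))  # 30.02
```

Both results agree with the 2310-base gene sequence and the centisome position reported in this record.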

GC content: 65.67
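
The GC content is presumably the percentage of G and C bases in the gene sequence. A minimal sketch under that assumption (the helper name `gc_content` is hypothetical and not part of the record):

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (case-insensitive)."""
    # Remove line breaks and spaces left over from FASTA-style wrapping.
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First ten bases of the gene sequence in this record.
print(gc_content("ATGGCTGTCA"))  # 50.0
```

Passing the full 2310-base sequence (with or without line breaks) gives a value that can be compared against the figure above.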

Gene sequence:

>2310_bases
ATGGCTGTCATTGGCCTTCGAGATCCCGTCGAGGGTGATTGGGGCCGGTTGCGCGGCCTGCAGTATTCCAGCCTCGCCCG
CCTGACCTTCGCGCGGCTCATGGCGCATGCGGTCGCCGCGCTGCTCGTGTTGCAGACCTTTGGCGGACGGGTTCATCCCG
CGCTTCTCGTCGGCTGGATGGCGCTGCTGGTCAGCGTGCTTATCTCGGGCGCGCGGTTCGACCGCAGTCTAGTCGATGCC
GACCGGCGGCGCGTCAGCAGGCAGGAGATGCATCGCCAGACGATCAGCTCGCTCTTCGTCGCACTCGCCTGGGCGTTGCC
CATGCTGGTATTCGGTCCGTTTGGCGATGCTGCCGGCCGCATGACCTTGTGGACGGTGCTTGCCATGCTGATGACTGGCA
TGGCCGTCACGTTCGCGGCGATGCCGATGGCGACCGTGCTCTTTTCCGGCGTGGTCGGCGTCTCGGCGGTCGCGGCATTC
CTGTTCGAGGGCGAATACACCGCAGCGGCCGTGTCGACCGTCTTCGTGGTGATCGTCAGCGTCGGCGCGGTCGAGGTCGC
GCGGACCTTCCTCGGTGCGCGCGTGGCCGAAGCGGGGATGGCGGAAAAGAGCGAGGTTGTGTCGCTCCTGCTGCGCGAGT
TCGAGGAAGGCGATGCCGACTGGCTGTGGCAGGTCGATGCCAACCGCCGCATCCGTTCGGTCAGTCCGCGCTTCGCCTTC
GCGCTTGGCGCCGATCCCGAGGATCTTGAAGGCAAGCCGCTGATCCAGGTCATCGCCGGCCCGGCCTGGGAAGATGGACA
CTTCCATTCCAGTCTCCACGATCTTGCCGAGCGGCTGAAGCGGCGCGAGAGCTTTTCCAATCTGCTCGTCCGGGCGACAC
TTCACGGCAGCCACCGCTGGTGGGAACTCTCGGCGTCGCCCAAGGTCGACGACAACGGGACGTTCGTGGGCTTCCGCGGC
GTAGGTTCCGACGTGACCGAGCAGCGCGAAAGCGCCGAAAAGATCGCCTATCTCGCCCGCTATGACACGCTCACCGGGCT
GCCCAACCGCCTGATGCTTACCGAGGCGCTTGGCGATGCGATGGGCTATTCCGAGAAGTGGCGGTCGAACTGCGCCTTCC
TGATGATCGACCTCGATCGCTTCAAGGCGGTCAACGATACGCTGGGTCACCTGGTGGGCGACCAGCTCCTCGCCATGGTC
TCGGACCGCATGACGCGCATCATCAAGGATGGCGAAGTCTGCGGTCGTCTGGGGGGTGATGAGTTCGCCGTGGTCGTGCG
CGACGTTGCGGACAGCCAGCGCATCGCCGAGCTTGCAGATGCGATCATCGCCACCCTGTCGCAGCCCTACGAAGTCGATC
ACCACATGCTCTATGTCGGCGCCAGCGTCGGCTCGGCGATCGGCCCGCGCGATGGCGCTACGGTCGAGACCCTGCTACGC
AACGCCGACCTCGCGCTCTATCGGGCCAAGGACGAGGGCGGGGGGAGCCACTGCACGTACGAACCCGCACTCCACGCCCA
CGCCGAGGAGCGCCGCAAGCTCGAGTTCTCGCTGCGCCACGCTCTCGAACGCAGCGAATTCGGTCTGGTGTTCCAGCCTG
TCGTCGATGCGACAAGCGAAGCCGTGGTAAGTTTCGAGGCGCTGCTGCGCTGGAACAGCGAGGAGCACGGCTCCGTCAGC
CCGGCGAAGTTCATTCCGCTGGCCGAGGACACCCGCCTGATCGTGCCGATCGGCGAATGGGTGCTGCGCATGGCTTGCCA
GGAGGCGATGAACTGGCCGCCGCACGTCAAGATCGCGGTCAACGTATCGGGCGAGCAGTTGCTCGATCCCTACTTTGCCG
AAACCGTCGTAGGTGCGCTGGCGGCAAGTGGGTTGCCCCCGCACCGGCTCGAGATCGAGGTGACCGAGAGCATCTTCGTG
CGCGATGCCACGGTCGCGCAGATGACCCTCGAGAACCTCATGGGCATCGGGTGCGGCGTCGCGCTTGACGATTTCGGCAC
CGGATACTCGTCGCTCGGCTACCTGCGAAAGATGCGCTTCTCGACCATCAAGGTCGACCGCAGCTTTGTTCAGGGGGCCG
CCAAGGACAACCCCGAGAGCCTTGCGATCGTGCGCGCTGTTGTGGCGATGGCCGACAGCCTCGACATGTCGACCACAGCC
GAGGGCGTCGAGACCGAGGCCGAGCTGGAAACGATCCGCCGCCTCGGCTGCAAGAAGATCCAGGGCTACTACTTCGGTCG
CCCGATGAGCGCGGCGGATGCGCGCGGCCTGTTCAGCCAGACCCGCATCCTCGAGCGGAAGGCGGGTTAG

Upstream 100 bases:

>100_bases
TCGCGGCTTACGATTTGGGTAATGAATTGCCTGTATTTCTGCTGACGTGAAGAATCCGTCTGCTCCCGTCGCCGAAGCTT
TGTCGGCCAAGTTGCCGATA

Downstream 100 bases:

>100_bases
GCCTCTACCAGGCCCCGCACGACGCTTTCGAACAGCGCGCGCCCGTCGCTGCCGCCGTGCGCGGCTTCGATCATGCGTTC
CGGGTGCGGCATCATGCCCA

Product: PAS/PAC sensor-containing diguanylate cyclase/phosphodiesterase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 769; Mature: 768
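
These counts follow from the gene length: 2310 bases make 770 codons, the last of which is the stop codon. The sketch below assumes that the mature form differs from the translated form only by removal of the initiator methionine, which matches the mature sequence listed in this record; the function name is illustrative.

```python
def residue_counts(gene_length_bases: int):
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes the sequence includes one stop codon and that the mature
    protein lacks only the initiator methionine.
    """
    if gene_length_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = gene_length_bases // 3
    translated = codons - 1   # the stop codon encodes no residue
    mature = translated - 1   # initiator Met removed (assumption)
    return translated, mature


print(residue_counts(2310))  # (769, 768)
```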

Protein sequence:

>769_residues
MAVIGLRDPVEGDWGRLRGLQYSSLARLTFARLMAHAVAALLVLQTFGGRVHPALLVGWMALLVSVLISGARFDRSLVDA
DRRRVSRQEMHRQTISSLFVALAWALPMLVFGPFGDAAGRMTLWTVLAMLMTGMAVTFAAMPMATVLFSGVVGVSAVAAF
LFEGEYTAAAVSTVFVVIVSVGAVEVARTFLGARVAEAGMAEKSEVVSLLLREFEEGDADWLWQVDANRRIRSVSPRFAF
ALGADPEDLEGKPLIQVIAGPAWEDGHFHSSLHDLAERLKRRESFSNLLVRATLHGSHRWWELSASPKVDDNGTFVGFRG
VGSDVTEQRESAEKIAYLARYDTLTGLPNRLMLTEALGDAMGYSEKWRSNCAFLMIDLDRFKAVNDTLGHLVGDQLLAMV
SDRMTRIIKDGEVCGRLGGDEFAVVVRDVADSQRIAELADAIIATLSQPYEVDHHMLYVGASVGSAIGPRDGATVETLLR
NADLALYRAKDEGGGSHCTYEPALHAHAEERRKLEFSLRHALERSEFGLVFQPVVDATSEAVVSFEALLRWNSEEHGSVS
PAKFIPLAEDTRLIVPIGEWVLRMACQEAMNWPPHVKIAVNVSGEQLLDPYFAETVVGALAASGLPPHRLEIEVTESIFV
RDATVAQMTLENLMGIGCGVALDDFGTGYSSLGYLRKMRFSTIKVDRSFVQGAAKDNPESLAIVRAVVAMADSLDMSTTA
EGVETEAELETIRRLGCKKIQGYYFGRPMSAADARGLFSQTRILERKAG

Sequences:

>Translated_769_residues
MAVIGLRDPVEGDWGRLRGLQYSSLARLTFARLMAHAVAALLVLQTFGGRVHPALLVGWMALLVSVLISGARFDRSLVDA
DRRRVSRQEMHRQTISSLFVALAWALPMLVFGPFGDAAGRMTLWTVLAMLMTGMAVTFAAMPMATVLFSGVVGVSAVAAF
LFEGEYTAAAVSTVFVVIVSVGAVEVARTFLGARVAEAGMAEKSEVVSLLLREFEEGDADWLWQVDANRRIRSVSPRFAF
ALGADPEDLEGKPLIQVIAGPAWEDGHFHSSLHDLAERLKRRESFSNLLVRATLHGSHRWWELSASPKVDDNGTFVGFRG
VGSDVTEQRESAEKIAYLARYDTLTGLPNRLMLTEALGDAMGYSEKWRSNCAFLMIDLDRFKAVNDTLGHLVGDQLLAMV
SDRMTRIIKDGEVCGRLGGDEFAVVVRDVADSQRIAELADAIIATLSQPYEVDHHMLYVGASVGSAIGPRDGATVETLLR
NADLALYRAKDEGGGSHCTYEPALHAHAEERRKLEFSLRHALERSEFGLVFQPVVDATSEAVVSFEALLRWNSEEHGSVS
PAKFIPLAEDTRLIVPIGEWVLRMACQEAMNWPPHVKIAVNVSGEQLLDPYFAETVVGALAASGLPPHRLEIEVTESIFV
RDATVAQMTLENLMGIGCGVALDDFGTGYSSLGYLRKMRFSTIKVDRSFVQGAAKDNPESLAIVRAVVAMADSLDMSTTA
EGVETEAELETIRRLGCKKIQGYYFGRPMSAADARGLFSQTRILERKAG
>Mature_768_residues
AVIGLRDPVEGDWGRLRGLQYSSLARLTFARLMAHAVAALLVLQTFGGRVHPALLVGWMALLVSVLISGARFDRSLVDAD
RRRVSRQEMHRQTISSLFVALAWALPMLVFGPFGDAAGRMTLWTVLAMLMTGMAVTFAAMPMATVLFSGVVGVSAVAAFL
FEGEYTAAAVSTVFVVIVSVGAVEVARTFLGARVAEAGMAEKSEVVSLLLREFEEGDADWLWQVDANRRIRSVSPRFAFA
LGADPEDLEGKPLIQVIAGPAWEDGHFHSSLHDLAERLKRRESFSNLLVRATLHGSHRWWELSASPKVDDNGTFVGFRGV
GSDVTEQRESAEKIAYLARYDTLTGLPNRLMLTEALGDAMGYSEKWRSNCAFLMIDLDRFKAVNDTLGHLVGDQLLAMVS
DRMTRIIKDGEVCGRLGGDEFAVVVRDVADSQRIAELADAIIATLSQPYEVDHHMLYVGASVGSAIGPRDGATVETLLRN
ADLALYRAKDEGGGSHCTYEPALHAHAEERRKLEFSLRHALERSEFGLVFQPVVDATSEAVVSFEALLRWNSEEHGSVSP
AKFIPLAEDTRLIVPIGEWVLRMACQEAMNWPPHVKIAVNVSGEQLLDPYFAETVVGALAASGLPPHRLEIEVTESIFVR
DATVAQMTLENLMGIGCGVALDDFGTGYSSLGYLRKMRFSTIKVDRSFVQGAAKDNPESLAIVRAVVAMADSLDMSTTAE
GVETEAELETIRRLGCKKIQGYYFGRPMSAADARGLFSQTRILERKAG

Specific function: Unknown

COG id: COG5001

COG function: function code T; Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 2 PAS (PER-ARNT-SIM) domains [H]

Homologues:

Organism=Escherichia coli, GI1787541, Length=433, Percent_Identity=35.5658198614319, Blast_Score=272, Evalue=7e-74,
Organism=Escherichia coli, GI87081921, Length=437, Percent_Identity=32.7231121281465, Blast_Score=230, Evalue=3e-61,
Organism=Escherichia coli, GI226510982, Length=446, Percent_Identity=27.3542600896861, Blast_Score=159, Evalue=5e-40,
Organism=Escherichia coli, GI1787055, Length=299, Percent_Identity=31.7725752508361, Blast_Score=142, Evalue=1e-34,
Organism=Escherichia coli, GI87081845, Length=283, Percent_Identity=32.5088339222615, Blast_Score=135, Evalue=1e-32,
Organism=Escherichia coli, GI1788502, Length=239, Percent_Identity=31.7991631799163, Blast_Score=130, Evalue=2e-31,
Organism=Escherichia coli, GI1788849, Length=442, Percent_Identity=27.3755656108597, Blast_Score=130, Evalue=4e-31,
Organism=Escherichia coli, GI1790496, Length=236, Percent_Identity=30.9322033898305, Blast_Score=127, Evalue=3e-30,
Organism=Escherichia coli, GI87081980, Length=236, Percent_Identity=31.7796610169492, Blast_Score=122, Evalue=1e-28,
Organism=Escherichia coli, GI87081743, Length=239, Percent_Identity=28.8702928870293, Blast_Score=116, Evalue=6e-27,
Organism=Escherichia coli, GI1786507, Length=233, Percent_Identity=32.618025751073, Blast_Score=110, Evalue=4e-25,
Organism=Escherichia coli, GI1788381, Length=215, Percent_Identity=33.4883720930233, Blast_Score=106, Evalue=5e-24,
Organism=Escherichia coli, GI87082096, Length=426, Percent_Identity=22.5352112676056, Blast_Score=106, Evalue=6e-24,
Organism=Escherichia coli, GI87081977, Length=183, Percent_Identity=33.3333333333333, Blast_Score=96, Evalue=7e-21,
Organism=Escherichia coli, GI1788956, Length=155, Percent_Identity=32.9032258064516, Blast_Score=83, Evalue=5e-17,
Organism=Escherichia coli, GI1787802, Length=167, Percent_Identity=31.7365269461078, Blast_Score=78, Evalue=2e-15,
Organism=Escherichia coli, GI1786584, Length=170, Percent_Identity=29.4117647058824, Blast_Score=78, Evalue=2e-15,
Organism=Escherichia coli, GI1787816, Length=165, Percent_Identity=32.7272727272727, Blast_Score=77, Evalue=5e-15,
Organism=Escherichia coli, GI1789650, Length=436, Percent_Identity=22.2477064220184, Blast_Score=75, Evalue=1e-14,
Organism=Escherichia coli, GI1787262, Length=162, Percent_Identity=31.4814814814815, Blast_Score=74, Evalue=2e-14,
Organism=Escherichia coli, GI87081881, Length=171, Percent_Identity=28.6549707602339, Blast_Score=72, Evalue=2e-13,
Organism=Escherichia coli, GI145693134, Length=156, Percent_Identity=30.7692307692308, Blast_Score=67, Evalue=3e-12,
Organism=Escherichia coli, GI87081974, Length=101, Percent_Identity=35.6435643564356, Blast_Score=67, Evalue=5e-12,
Organism=Escherichia coli, GI1787410, Length=221, Percent_Identity=23.0769230769231, Blast_Score=67, Evalue=5e-12,
Organism=Escherichia coli, GI87082007, Length=162, Percent_Identity=30.2469135802469, Blast_Score=65, Evalue=1e-11,
Organism=Escherichia coli, GI1788085, Length=204, Percent_Identity=24.5098039215686, Blast_Score=64, Evalue=5e-11,
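
Each homologue line above uses the same comma-separated layout, so it can be turned into a structured record for sorting or filtering. This is a minimal sketch, assuming every line follows exactly the layout shown here; the pattern and the dictionary keys are illustrative choices, not part of the source database.

```python
import re

# One hit per line: Organism=..., GI<number>, Length=..., Percent_Identity=...,
# Blast_Score=..., Evalue=..., with an optional trailing comma.
HIT_PATTERN = re.compile(
    r"Organism=(?P<organism>[^,]+), "
    r"GI(?P<gi>\d+), "
    r"Length=(?P<length>\d+), "
    r"Percent_Identity=(?P<identity>[\d.]+), "
    r"Blast_Score=(?P<score>\d+), "
    r"Evalue=(?P<evalue>[\d.eE+-]+),?\s*$"
)


def parse_hit(line: str) -> dict:
    """Parse one homologue line into a dictionary with typed values."""
    match = HIT_PATTERN.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised homologue line: {line!r}")
    return {
        "organism": match["organism"],
        "gi": match["gi"],
        "length": int(match["length"]),
        "percent_identity": float(match["identity"]),
        "blast_score": int(match["score"]),
        "evalue": float(match["evalue"]),
    }


line = ("Organism=Escherichia coli, GI1787541, Length=433, "
        "Percent_Identity=35.5658198614319, Blast_Score=272, Evalue=7e-74,")
hit = parse_hit(line)
print(hit["gi"], round(hit["percent_identity"], 1), hit["evalue"])
```

With the hits parsed this way, the long percent-identity values can be rounded for display without altering the underlying record.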

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001054
- InterPro:   IPR000160
- InterPro:   IPR001633
- InterPro:   IPR001610
- InterPro:   IPR000014
- InterPro:   IPR000700
- InterPro:   IPR013656
- InterPro:   IPR013655 [H]

Pfam domain/function: PF00563 EAL; PF00990 GGDEF; PF08447 PAS_3; PF08448 PAS_4 [H]

EC number: NA

Molecular weight: Translated: 83999; Mature: 83868

Theoretical pI: Translated: 5.45; Mature: 5.45

Prosite motif: PS50113 PAC ; PS50883 EAL ; PS50887 GGDEF

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.8 %Cys     (Translated Protein)
3.4 %Met     (Translated Protein)
4.2 %Cys+Met (Translated Protein)
0.8 %Cys     (Mature Protein)
3.3 %Met     (Mature Protein)
4.0 %Cys+Met (Mature Protein)
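
These percentages are presumably residue counts divided by the sequence length. The sketch below computes them that way, with the combined value taken from the unrounded counts; this would explain why the mature Cys+Met figure (4.0) can differ by 0.1 from the sum of the two rounded values (0.8 + 3.3). The function name is illustrative.

```python
def cys_met_content(protein: str) -> dict:
    """Percent Cys, Met and Cys+Met in a one-letter protein sequence."""
    # Remove line breaks left over from FASTA-style wrapping.
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    cys = seq.count("C")
    met = seq.count("M")
    n = len(seq)
    return {
        "Cys": 100.0 * cys / n,
        "Met": 100.0 * met / n,
        "Cys+Met": 100.0 * (cys + met) / n,  # from counts, not rounded parts
    }


# First ten residues of the translated protein in this record.
print(cys_met_content("MAVIGLRDPV"))
```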

Secondary structure:

>Translated Secondary Structure
MAVIGLRDPVEGDWGRLRGLQYSSLARLTFARLMAHAVAALLVLQTFGGRVHPALLVGWM
CEEECCCCCCCCCHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH
ALLVSVLISGARFDRSLVDADRRRVSRQEMHRQTISSLFVALAWALPMLVFGPFGDAAGR
HHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCH
MTLWTVLAMLMTGMAVTFAAMPMATVLFSGVVGVSAVAAFLFEGEYTAAAVSTVFVVIVS
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH
VGAVEVARTFLGARVAEAGMAEKSEVVSLLLREFEEGDADWLWQVDANRRIRSVSPRFAF
HHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCEEEEECCCCCHHCCCCCEEE
ALGADPEDLEGKPLIQVIAGPAWEDGHFHSSLHDLAERLKRRESFSNLLVRATLHGSHRW
EECCCCCCCCCCCEEHEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHEEEEEECCCCEE
WELSASPKVDDNGTFVGFRGVGSDVTEQRESAEKIAYLARYDTLTGLPNRLMLTEALGDA
EEECCCCCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHH
MGYSEKWRSNCAFLMIDLDRFKAVNDTLGHLVGDQLLAMVSDRMTRIIKDGEVCGRLGGD
HCCCHHHHCCCEEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHCCCCC
EFAVVVRDVADSQRIAELADAIIATLSQPYEVDHHMLYVGASVGSAIGPRDGATVETLLR
HHEEHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCCHHHHHHH
NADLALYRAKDEGGGSHCTYEPALHAHAEERRKLEFSLRHALERSEFGLVFQPVVDATSE
CCCEEEEEECCCCCCCCCEECCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEHHHHCCHHH
AVVSFEALLRWNSEEHGSVSPAKFIPLAEDTRLIVPIGEWVLRMACQEAMNWPPHVKIAV
HHHHHHHHHHCCCCCCCCCCCHHCCCCCCCCEEEEEHHHHHHHHHHHHHCCCCCCEEEEE
NVSGEQLLDPYFAETVVGALAASGLPPHRLEIEVTESIFVRDATVAQMTLENLMGIGCGV
ECCCHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEECHHHHCCHHHHHHHHHHHHHCCCCC
ALDDFGTGYSSLGYLRKMRFSTIKVDRSFVQGAAKDNPESLAIVRAVVAMADSLDMSTTA
EECCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCCCCC
EGVETEAELETIRRLGCKKIQGYYFGRPMSAADARGLFSQTRILERKAG
CCCCHHHHHHHHHHHCHHHHCCEECCCCCCCHHHHHHHHHHHHHHHCCC
>Mature Secondary Structure
AVIGLRDPVEGDWGRLRGLQYSSLARLTFARLMAHAVAALLVLQTFGGRVHPALLVGWM
EEECCCCCCCCCHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH
ALLVSVLISGARFDRSLVDADRRRVSRQEMHRQTISSLFVALAWALPMLVFGPFGDAAGR
HHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCH
MTLWTVLAMLMTGMAVTFAAMPMATVLFSGVVGVSAVAAFLFEGEYTAAAVSTVFVVIVS
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH
VGAVEVARTFLGARVAEAGMAEKSEVVSLLLREFEEGDADWLWQVDANRRIRSVSPRFAF
HHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCEEEEECCCCCHHCCCCCEEE
ALGADPEDLEGKPLIQVIAGPAWEDGHFHSSLHDLAERLKRRESFSNLLVRATLHGSHRW
EECCCCCCCCCCCEEHEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHEEEEEECCCCEE
WELSASPKVDDNGTFVGFRGVGSDVTEQRESAEKIAYLARYDTLTGLPNRLMLTEALGDA
EEECCCCCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHH
MGYSEKWRSNCAFLMIDLDRFKAVNDTLGHLVGDQLLAMVSDRMTRIIKDGEVCGRLGGD
HCCCHHHHCCCEEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHCCCCC
EFAVVVRDVADSQRIAELADAIIATLSQPYEVDHHMLYVGASVGSAIGPRDGATVETLLR
HHEEHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCCCCHHHHHHH
NADLALYRAKDEGGGSHCTYEPALHAHAEERRKLEFSLRHALERSEFGLVFQPVVDATSE
CCCEEEEEECCCCCCCCCEECCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEHHHHCCHHH
AVVSFEALLRWNSEEHGSVSPAKFIPLAEDTRLIVPIGEWVLRMACQEAMNWPPHVKIAV
HHHHHHHHHHCCCCCCCCCCCHHCCCCCCCCEEEEEHHHHHHHHHHHHHCCCCCCEEEEE
NVSGEQLLDPYFAETVVGALAASGLPPHRLEIEVTESIFVRDATVAQMTLENLMGIGCGV
ECCCHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEECHHHHCCHHHHHHHHHHHHHCCCCC
ALDDFGTGYSSLGYLRKMRFSTIKVDRSFVQGAAKDNPESLAIVRAVVAMADSLDMSTTA
EECCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCCCCC
EGVETEAELETIRRLGCKKIQGYYFGRPMSAADARGLFSQTRILERKAG
CCCCHHHHHHHHHHHCHHHHCCEECCCCCCCHHHHHHHHHHHHHHHCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9163424 [H]