Definition Kosmotoga olearia TBF 19.5.1, complete genome.
Accession NC_012785
Length 2,302,126

The map label for this gene is dosC [H]

Identifier: 239616993

GI number: 239616993

Start: 640686

End: 642095

Strand: Reverse

Name: dosC [H]

Synonym: Kole_0593

Alternate gene names: 239616993

Gene position: 642095-640686 (Counterclockwise)

Preceding gene: 239616994

Following gene: 239616991

Centisome position: 27.89
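
The coordinate fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes the centisome value is the gene's 5' coordinate (642095 on the reverse strand) expressed as a percentage of the genome length; the function names are illustrative and not part of the record.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, rounded to 2 decimals."""
    return round(100.0 * position / genome_length, 2)


length = gene_length(640686, 642095)      # Start and End fields
residues = length // 3 - 1                # codons minus the stop codon
position = centisome(642095, 2302126)     # assumed: 5' end of the reverse-strand gene

print(length, residues, position)
```

Under these assumptions the values agree with the record: 1410 bases, 469 residues, and a centisome position of 27.89.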

GC content: 38.58

Gene sequence:

>1410_bases
ATGGATAAAGTGTTAGTGGTTGATGACAGTGATACTTGGAGAACTTTGCTGGAAAAAGTACTGATAGAATACGGGTTTAC
TGTTGAAACTGCAGCCGATGGAATAGACGGTCTTAATAAGTTTTTTGATTTTCTTCCAGATGTGGTAATAACTGACTACG
TTATGCCCAGGATGAATGGAGTACATCTTTGCCGGCTAATAAGAAGCTATACCAGCTTTAAGAATGTTGGTATCCTAATT
TTAACCGGAGCGGATGAGGCGATAAATGATTTTTGGGCCAAAAAGAGTGGTGCAAATAAATTCCTGAAGAAATCAGGAGA
TGTGGAATCGATCACAAAAGAGATTCTTAACTTTTTGAAAGGGAATTATATTTCGGGTTGGTCAAAAGAGGTATATAAAA
CCCGCACTGAACCATTTGGCGAACTTGTGGATGTAATAGAAGAAACCCTGAAAACCGAGGTCATAAATAAAGAACTTCTT
TCGCTGGTTCAGTACGTGGATGATGAAGAATACATGATGAGAAAACTAAAAAGCTTGATCCTTGACTTTGTACGCGCTGA
TGGATTGTGTATTTTATTAATCTCTTCAAGTATAGGAAGAGTATATTCCTTTGGATTTTATTCTGATACTACGCCTCATG
AATATGTTAAAAGCAATTTATTGAGGGCTATGGAAAAACCTGTTACTCCATCTGAGTGGCTTTTGAAGTTTGATTCGGAT
GCTTCAGAAGATGGATTTGATACGAATAATGTCAAAAATTTTGTCATATCATTTGGAGATCAGGAATTGGGTGTTATGTC
TTTCAAAAGGCCGCGGAACGCAGAAAGCCTTTATTATTTCATGGATAGCATAGGAGAGACGCTTGGAATAATTGCAAAAA
CCGTAAACAATTTTTGTGATCAAAGAATCGCTACGGAAATCGACTTTCTAACTTCCCTTTTCAATCGAAAAGCTACACTT
AGTCGTTTGAAGGAGTACATAGAGCTGGCAAAGAGAAGAATGCTTTCCTTAACTGTAGCTATGCTGGACGTAGATGACTT
CAAGGAAATCAACGATACTTATGGGCATCTCTTTGGAGATACGGTTCTGAAGGAGATAGGAAAGATTTTCAGAAATTCTA
TGAGAAAAAGTGATATTGTGGGTAGATATGGCGGGGAAGAATTTTTGATAATTTTTCCGGGCATAAAAGCAGCAGAGGCA
GTTGAAGCTCTTGAAAGAGTTCTTGATGAATTGAGAAACTACGATTGGAGAGCGGTAACGGGCAAGCCGATAAAGGTAAC
TGTCAGCGCGGGAGTCGCCGAATTTTCCGATGATACGACTTTGAGGGAACTGGTAAATAAGGCTGATAACGAGCTCTACA
AAGCTAAACGTTCTGGAAAAGATAGAATAAGTGTCTACAGGGAGAATTAG
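
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the value is the percentage of G and C bases over the full 1410-base sequence; `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence.

    Whitespace and line breaks (as in FASTA-style blocks) are ignored.
    """
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Toy input; paste the 1410-base gene sequence to check the record's value.
print(round(gc_content("ATGGATAAAGTG"), 2))
```

A reported value of 38.58 for 1410 bases corresponds to roughly 544 G or C bases.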

Upstream 100 bases:

>100_bases
ATACTGGTTAATTGAGGAATCAGAGCTTAGTTGATATACGATGTTTGCTGGGAAGGGGGATTCTTTTAGAGTAATGAGAA
TACGAAAGTGAGGAAAGTTT

Downstream 100 bases:

>100_bases
GAGATAGACGTGCGGTGATGATCAGGGTTCTTGTGGGTGACGATCGAGCGAATGCAATTGTAAAAATCAAAAACCCCGGA
GGTTATACCATTACTGAGTC
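
Because the gene lies on the reverse strand, its sequence and flanking regions correspond to reverse complements of forward-strand coordinates. The sketch below shows one way such sequences could be extracted from a genome string. It assumes 1-based inclusive coordinates, assumes the flanks are reported in the gene's own orientation, and ignores wrap-around at the origin of a circular genome; the function names are illustrative.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def reverse_strand_gene(genome: str, start: int, end: int, flank: int):
    """Return (upstream, gene, downstream) for a reverse-strand gene.

    start/end are 1-based inclusive forward-strand coordinates.
    Upstream of a reverse-strand gene lies after `end` on the forward strand.
    """
    gene = reverse_complement(genome[start - 1:end])
    upstream = reverse_complement(genome[end:end + flank])
    downstream = reverse_complement(genome[max(0, start - 1 - flank):start - 1])
    return upstream, gene, downstream


# Toy genome for illustration only.
print(reverse_strand_gene("ACGTTGCAAGGC", start=5, end=7, flank=2))
```

For this record the call would use start=640686, end=642095 and flank=100 against the NC_012785 sequence.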

Product: response regulator receiver modulated diguanylate cyclase

Products: NA

Alternate protein names: DGC [H]

Number of amino acids: Translated: 469; Mature: 469

Protein sequence:

>469_residues
MDKVLVVDDSDTWRTLLEKVLIEYGFTVETAADGIDGLNKFFDFLPDVVITDYVMPRMNGVHLCRLIRSYTSFKNVGILI
LTGADEAINDFWAKKSGANKFLKKSGDVESITKEILNFLKGNYISGWSKEVYKTRTEPFGELVDVIEETLKTEVINKELL
SLVQYVDDEEYMMRKLKSLILDFVRADGLCILLISSSIGRVYSFGFYSDTTPHEYVKSNLLRAMEKPVTPSEWLLKFDSD
ASEDGFDTNNVKNFVISFGDQELGVMSFKRPRNAESLYYFMDSIGETLGIIAKTVNNFCDQRIATEIDFLTSLFNRKATL
SRLKEYIELAKRRMLSLTVAMLDVDDFKEINDTYGHLFGDTVLKEIGKIFRNSMRKSDIVGRYGGEEFLIIFPGIKAAEA
VEALERVLDELRNYDWRAVTGKPIKVTVSAGVAEFSDDTTLRELVNKADNELYKAKRSGKDRISVYREN
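
The protein sequence above should follow from translating the gene sequence codon by codon. The sketch below uses the standard codon assignments and stops at the first stop codon; it does not handle alternative start codons, and `translate` is an illustrative helper rather than the tool used to build the record.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Map each codon (TTT, TTC, TTA, ...) to its one-letter amino acid code.
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGATAAAGTGTTAGTGGTTGATGACAGT"))  # MDKVLVVDDS
```

The first ten residues produced this way match the start of the listed protein sequence.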

Sequences:

>Translated_469_residues
MDKVLVVDDSDTWRTLLEKVLIEYGFTVETAADGIDGLNKFFDFLPDVVITDYVMPRMNGVHLCRLIRSYTSFKNVGILI
LTGADEAINDFWAKKSGANKFLKKSGDVESITKEILNFLKGNYISGWSKEVYKTRTEPFGELVDVIEETLKTEVINKELL
SLVQYVDDEEYMMRKLKSLILDFVRADGLCILLISSSIGRVYSFGFYSDTTPHEYVKSNLLRAMEKPVTPSEWLLKFDSD
ASEDGFDTNNVKNFVISFGDQELGVMSFKRPRNAESLYYFMDSIGETLGIIAKTVNNFCDQRIATEIDFLTSLFNRKATL
SRLKEYIELAKRRMLSLTVAMLDVDDFKEINDTYGHLFGDTVLKEIGKIFRNSMRKSDIVGRYGGEEFLIIFPGIKAAEA
VEALERVLDELRNYDWRAVTGKPIKVTVSAGVAEFSDDTTLRELVNKADNELYKAKRSGKDRISVYREN
>Mature_469_residues
MDKVLVVDDSDTWRTLLEKVLIEYGFTVETAADGIDGLNKFFDFLPDVVITDYVMPRMNGVHLCRLIRSYTSFKNVGILI
LTGADEAINDFWAKKSGANKFLKKSGDVESITKEILNFLKGNYISGWSKEVYKTRTEPFGELVDVIEETLKTEVINKELL
SLVQYVDDEEYMMRKLKSLILDFVRADGLCILLISSSIGRVYSFGFYSDTTPHEYVKSNLLRAMEKPVTPSEWLLKFDSD
ASEDGFDTNNVKNFVISFGDQELGVMSFKRPRNAESLYYFMDSIGETLGIIAKTVNNFCDQRIATEIDFLTSLFNRKATL
SRLKEYIELAKRRMLSLTVAMLDVDDFKEINDTYGHLFGDTVLKEIGKIFRNSMRKSDIVGRYGGEEFLIIFPGIKAAEA
VEALERVLDELRNYDWRAVTGKPIKVTVSAGVAEFSDDTTLRELVNKADNELYKAKRSGKDRISVYREN

Specific function: Overexpression increases the level of c-di-GMP, which leads to changes in the cell surface, abnormal cell division, increased biofilm formation, and decreased swimming (the latter two in strain W3110). In a strain able to produce cellulose (strain

COG id: COG3706

COG function: function code T; Response regulator containing a CheY-like receiver domain and a GGDEF domain

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: Non Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 GGDEF domain [H]

Homologues:

Organism=Escherichia coli, GI145693134, Length=161, Percent_Identity=36.6459627329193, Blast_Score=114, Evalue=2e-26,
Organism=Escherichia coli, GI87082007, Length=158, Percent_Identity=35.4430379746835, Blast_Score=107, Evalue=2e-24,
Organism=Escherichia coli, GI1786584, Length=159, Percent_Identity=33.3333333333333, Blast_Score=100, Evalue=2e-22,
Organism=Escherichia coli, GI1787816, Length=160, Percent_Identity=34.375, Blast_Score=93, Evalue=4e-20,
Organism=Escherichia coli, GI87081881, Length=164, Percent_Identity=31.7073170731707, Blast_Score=89, Evalue=7e-19,
Organism=Escherichia coli, GI1787262, Length=166, Percent_Identity=33.7349397590361, Blast_Score=87, Evalue=2e-18,
Organism=Escherichia coli, GI1787802, Length=184, Percent_Identity=30.9782608695652, Blast_Score=87, Evalue=2e-18,
Organism=Escherichia coli, GI1788381, Length=162, Percent_Identity=29.0123456790123, Blast_Score=85, Evalue=1e-17,
Organism=Escherichia coli, GI1787541, Length=169, Percent_Identity=31.3609467455621, Blast_Score=77, Evalue=2e-15,
Organism=Escherichia coli, GI1787056, Length=127, Percent_Identity=36.2204724409449, Blast_Score=77, Evalue=2e-15,
Organism=Escherichia coli, GI87081974, Length=160, Percent_Identity=30.625, Blast_Score=72, Evalue=8e-14,
Organism=Escherichia coli, GI87081977, Length=154, Percent_Identity=28.5714285714286, Blast_Score=70, Evalue=3e-13,
Organism=Escherichia coli, GI1788085, Length=164, Percent_Identity=28.0487804878049, Blast_Score=69, Evalue=6e-13,
Organism=Escherichia coli, GI1788956, Length=156, Percent_Identity=30.1282051282051, Blast_Score=67, Evalue=3e-12,
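
Each homologue line above is a comma-separated list of fields, most of them in key=value form. A minimal parsing sketch follows; the key names are taken from the lines themselves, while the `GI` key for the bare GI field and the rounding of the identity to one decimal are choices made here for illustration.

```python
def parse_hit(line: str) -> dict:
    """Parse one homologue line into a dictionary with typed values."""
    fields = [f.strip() for f in line.strip().rstrip(",").split(",")]
    hit = {}
    for field in fields:
        if "=" in field:
            key, value = field.split("=", 1)
            hit[key] = value
        elif field.startswith("GI"):
            hit["GI"] = field[2:]  # bare field such as GI145693134
    hit["Length"] = int(hit["Length"])
    hit["Percent_Identity"] = round(float(hit["Percent_Identity"]), 1)
    hit["Blast_Score"] = int(hit["Blast_Score"])
    hit["Evalue"] = float(hit["Evalue"])
    return hit


line = ("Organism=Escherichia coli, GI145693134, Length=161, "
        "Percent_Identity=36.6459627329193, Blast_Score=114, Evalue=2e-26,")
print(parse_hit(line))
```

Parsed this way, the hits can be sorted by `Evalue` or `Blast_Score` without relying on their order in the record.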

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001054
- InterPro:   IPR000160
- InterPro:   IPR012292
- InterPro:   IPR009050 [H]

Pfam domain/function: PF00990 GGDEF [H]

EC number: 2.7.7.65 [H]

Molecular weight: Translated: 53446; Mature: 53446

Theoretical pI: Translated: 4.87; Mature: 4.87

Prosite motif: PS50110 RESPONSE_REGULATORY; PS50887 GGDEF

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
2.3 %Met     (Translated Protein)
3.0 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
2.3 %Met     (Mature Protein)
3.0 %Cys+Met (Mature Protein)
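
The Cys/Met percentages are residue counts divided by the protein length. A small sketch, assuming one-letter amino acid codes and rounding to one decimal as in the record; `residue_percent` is a hypothetical helper name.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in set(residues.upper()))
    return round(100.0 * count / len(protein), 1)


# Toy sequence; paste the 469-residue sequence to check the record's values.
print(residue_percent("MCAAAAAAAA", "C"), residue_percent("MCAAAAAAAA", "CM"))
```

The reported values are consistent with 3 Cys and 11 Met in 469 residues (0.6 %, 2.3 %, and 3.0 % combined).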

Secondary structure:

>Translated Secondary Structure
MDKVLVVDDSDTWRTLLEKVLIEYGFTVETAADGIDGLNKFFDFLPDVVITDYVMPRMNG
CCCEEEEECCHHHHHHHHHHHHHHCCCHHHHHCCHHHHHHHHHHCCHHHHHHHHHHCCCH
VHLCRLIRSYTSFKNVGILILTGADEAINDFWAKKSGANKFLKKSGDVESITKEILNFLK
HHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCHHHHHHCCCCHHHHHHHHHHHHC
GNYISGWSKEVYKTRTEPFGELVDVIEETLKTEVINKELLSLVQYVDDEEYMMRKLKSLI
CCCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH
LDFVRADGLCILLISSSIGRVYSFGFYSDTTPHEYVKSNLLRAMEKPVTPSEWLLKFDSD
HHHHHCCCEEEEEEHHCCHHHEECCCCCCCCHHHHHHHHHHHHHHCCCCCHHHHEEECCC
ASEDGFDTNNVKNFVISFGDQELGVMSFKRPRNAESLYYFMDSIGETLGIIAKTVNNFCD
CCCCCCCCCCHHHHHHCCCCCCCCHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
QRIATEIDFLTSLFNRKATLSRLKEYIELAKRRMLSLTVAMLDVDDFKEINDTYGHLFGD
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHH
TVLKEIGKIFRNSMRKSDIVGRYGGEEFLIIFPGIKAAEAVEALERVLDELRNYDWRAVT
HHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCCCEEEC
GKPIKVTVSAGVAEFSDDTTLRELVNKADNELYKAKRSGKDRISVYREN
CCCEEEEEECCCHHCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEECC
>Mature Secondary Structure
MDKVLVVDDSDTWRTLLEKVLIEYGFTVETAADGIDGLNKFFDFLPDVVITDYVMPRMNG
CCCEEEEECCHHHHHHHHHHHHHHCCCHHHHHCCHHHHHHHHHHCCHHHHHHHHHHCCCH
VHLCRLIRSYTSFKNVGILILTGADEAINDFWAKKSGANKFLKKSGDVESITKEILNFLK
HHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCHHHHHHCCCCHHHHHHHHHHHHC
GNYISGWSKEVYKTRTEPFGELVDVIEETLKTEVINKELLSLVQYVDDEEYMMRKLKSLI
CCCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH
LDFVRADGLCILLISSSIGRVYSFGFYSDTTPHEYVKSNLLRAMEKPVTPSEWLLKFDSD
HHHHHCCCEEEEEEHHCCHHHEECCCCCCCCHHHHHHHHHHHHHHCCCCCHHHHEEECCC
ASEDGFDTNNVKNFVISFGDQELGVMSFKRPRNAESLYYFMDSIGETLGIIAKTVNNFCD
CCCCCCCCCCHHHHHHCCCCCCCCHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
QRIATEIDFLTSLFNRKATLSRLKEYIELAKRRMLSLTVAMLDVDDFKEINDTYGHLFGD
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHH
TVLKEIGKIFRNSMRKSDIVGRYGGEEFLIIFPGIKAAEAVEALERVLDELRNYDWRAVT
HHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCCCEEEC
GKPIKVTVSAGVAEFSDDTTLRELVNKADNELYKAKRSGKDRISVYREN
CCCEEEEEECCCHHCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEECC
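
The secondary structure strings pair each residue with a one-letter state. The sketch below summarizes such a string as percentages per state, assuming the common convention H = helix, E = strand, C = coil (the record does not define the letters); `ss_composition` is an illustrative name.

```python
from collections import Counter


def ss_composition(ss: str) -> dict:
    """Percentage of each state (H, E, C) in a secondary structure string."""
    ss = "".join(ss.split()).upper()
    if not ss:
        raise ValueError("empty prediction")
    counts = Counter(ss)
    return {state: round(100.0 * counts[state] / len(ss), 1) for state in "HEC"}


# First 20 states of the prediction in this record.
print(ss_composition("CCCEEEEECCHHHHHHHHHH"))
```

Concatenating the state lines (every second line of the block) before calling the function gives the composition for the whole protein.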

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9097039; 9278503 [H]