Definition: Exiguobacterium sp. AT1b, complete genome.
Accession: NC_012673
Length: 2,999,895 bp

The map label for this gene is yciR [C]

Identifier: 229917997

GI number: 229917997

Start: 2231507

End: 2233561

Strand: Reverse

Name: yciR [C]

Synonym: EAT1b_2276

Alternate gene names: 229917997

Gene position: 2233561-2231507 (Counterclockwise)

Preceding gene: 229918001

Following gene: 229917995

Centisome position: 74.45
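
The coordinates above can be cross-checked with a little arithmetic. The sketch below assumes inclusive 1-based coordinates and assumes the centisome position is the gene's 5' coordinate (the End value, since the gene is on the reverse strand) as a percentage of the genome length. The record does not state this convention; it is inferred because it reproduces the listed value. The function names are illustrative.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given inclusive 1-based coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position on the chromosome expressed as a percentage of its length."""
    return 100.0 * position / genome_length


GENOME_LENGTH = 2_999_895          # Length field of this record
START, END = 2_231_507, 2_233_561  # Start and End fields of this record

# Assumption: for a reverse-strand gene the 5' end is the larger coordinate.
five_prime = END

print(gene_length(START, END))                           # 2055
print(round(centisome(five_prime, GENOME_LENGTH), 2))    # 74.45
```

The computed length agrees with the 2055-base gene sequence listed in this record.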

GC content: 45.79
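
The GC content is presumably the percentage of G and C bases in the gene sequence; the record does not say how it was calculated. A minimal sketch of that calculation, with a hypothetical helper name and a toy input rather than the full gene:

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks (as in FASTA-style blocks) are ignored.
    """
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Toy input for illustration only.
print(gc_percent("ATGC"))  # 50.0
```

Passing the concatenated lines of the gene sequence to the same function would give the value to compare against the GC content field.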

Gene sequence:

>2055_bases
GTGGCTTCATTGCGTGATTTACCAGTAGATGAATTCATCGATGCTTTATCTGTTCCGTGTTGTTATATTTCTCAAGATGG
TACGGTCCTATTATGGAATACGCATGCCGAAGTTTTATTCGGCTGGGAACGTGACGAAGTCCTCCATCAACCGCTTCCGG
TCGTCCGGCAATTAGATCATGCTCTCGAAACATGGTCTCCGCTCCTCCTTCAATATCCATTCTCGACAACGGCGATGACG
CCGTTTCAACATAAGGATGGCTCCGTCATCTATGCGACTGCCTACCTGCAATCTTTTTCGCTCGAAGGGATTTATGGTTA
TTTTTTAGCGTTTCTCCCAAAAGGATTTGGACCGCTCATCAGTACAGAACAGTTCAATCTACTTCATCATTTTCAAGAGA
TGATCAAACAGTCCACGCATTACTTGGCGACAAATGCGAAAGGCGAGATCGTGGACATCAATCCTTCGTTATGCCGTCTA
CTCGATGCGACGGAGCAACAGCTAATTGGACGCGAATGGTTCGAACTCATTCATGATTCAAGTGAACAGGACGGGATTTC
GCGTAACGTCTTAAAATCACTTGCGACTAATCGGATTTGGAACGGTGAGATGCCGATCTCCTGTAAGCAACATGAGTCAG
AGCCGTGTTGGCTTAACTTGACTGTCGTACCGATTGTTAGTGAGAAGAATGAAGTGATTCAATATACCGCATTCGGATTT
GATGTGTCGGAAAAGAAGCGTCTTGAAAAAGAAGTCCAGTTCCTCGCTTATCAAAATGAGTTGACCGGACTTTATAATAA
AAAAGGGTTTTTACGTCGCTACGACTCAATCCTTCGACAAATCGATGAGCGAGAAGGTCTCCTTCATATTGCCTTGTTTG
ATATCGACCGCTTCAAAATTATCAATGAGTCATTCGGGTCTCGCGTTGGGAACGAACTTCTCATCCAAATCAAAGAGCGC
GCACTGCATCTCATTCCGGAATCCGCTCTTTTATTCCATCCTACGGGCGGACTGTTCGGTGTCATGTTCTTCGAGGAATC
GAAAGAAGAAGTGTTCCAAGTACTCCGACATCTGCAACACGAACTGCAACGTCCGTTCCGGATTTACCATCACTCCATCA
TGGTATCGATTTCAATCGGATGCGTATTTTATCCGTCTTCGACAAGTTCACTCGAGGAACTGTATACACGTGCTGAAAGT
GCCCTCTTCAAAGGGAAACAGATCGGCGTCGGTACGATTCAATTCGTGACAAAAGATATGGATGCCGCTTTCTCGAGACA
AATTCATATTGAGAAGGCGATGTACCGAGCCCTTGAAGAAAAGCATTTTTATCTTGAGTATCAACCAAAGTATGAACTCG
CCACAGACCGGCTCATTGGTTTTGAGGCACTCCTTCGTTGGCATCACGACGAACTGGGACAAATCCCACCTTCAGAGTTC
ATTCCGCTCGCGGAAGAGATGGCACTCATCGTTCCTATCAACAACTGGGTCATCTTAGAGGCAACGAAACAACTGAAAGC
CTGGAAAGCCGAGTTCGATCAACCGCTCTCGATGGCAATCAACATTTCGCCTAATCAATTCCGAAGCGACAGTTTTTTGA
ATACGTTACGAAACATCAAGCAACAACTCCAGCTCGACCCGGCAGACATCATCTTGGAGATCACGGAAAGTCTCGTCATG
CAACAGACCGATGAAATCATCGAACGCATGGAACAAATCAAGCGCTTGAACTATCGACTGTCGATTGACGATTTCGGGAC
AGGTTTCTCGTCCTTACAGTATTTGAAGTCCTTCCCGGTCGATGAACTCAAAATCGATAAAGTCTTTCTCGATGATTGGA
TGGCGTCCAAATCTCATCTCCTCGATGTCATCGTCCATCTTGGGAAGAGCTTAAATCTCCACGTCGTCGCGGAGGGGGTC
GAAGATGAAGAGATGTTGGCACACTTAAAGTCGACAGACTGCGATTCTTTCCAAGGCTATTTATACGCAAGACCTGCTTC
ACCAAGTGCAATCGAACAATTACTCCGATCATTAAAAAACTCACCGGACGCTTGA

Upstream 100 bases:

>100_bases
TTCTTTTGACAATGACGAACTTAAGCATTCCTTGAATTGGCAAAATCAGGTACACTGACATTAGAATATGCATTTAGCAT
GACAGATTGGAGGAAATGAT

Downstream 100 bases:

>100_bases
AGCGCCGGTGAGCTTTCTTATAGAGCGATTGTGGCTGTCCAAAGCCCATAAATGCCATAGGTCGCACATAATAGAATAAT
CAAGGCAATCATCGACTGGG

Product: diguanylate cyclase/phosphodiesterase with PAS/PAC sensor(s)

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 684; Mature: 683

Protein sequence:

>684_residues
MASLRDLPVDEFIDALSVPCCYISQDGTVLLWNTHAEVLFGWERDEVLHQPLPVVRQLDHALETWSPLLLQYPFSTTAMT
PFQHKDGSVIYATAYLQSFSLEGIYGYFLAFLPKGFGPLISTEQFNLLHHFQEMIKQSTHYLATNAKGEIVDINPSLCRL
LDATEQQLIGREWFELIHDSSEQDGISRNVLKSLATNRIWNGEMPISCKQHESEPCWLNLTVVPIVSEKNEVIQYTAFGF
DVSEKKRLEKEVQFLAYQNELTGLYNKKGFLRRYDSILRQIDEREGLLHIALFDIDRFKIINESFGSRVGNELLIQIKER
ALHLIPESALLFHPTGGLFGVMFFEESKEEVFQVLRHLQHELQRPFRIYHHSIMVSISIGCVFYPSSTSSLEELYTRAES
ALFKGKQIGVGTIQFVTKDMDAAFSRQIHIEKAMYRALEEKHFYLEYQPKYELATDRLIGFEALLRWHHDELGQIPPSEF
IPLAEEMALIVPINNWVILEATKQLKAWKAEFDQPLSMAINISPNQFRSDSFLNTLRNIKQQLQLDPADIILEITESLVM
QQTDEIIERMEQIKRLNYRLSIDDFGTGFSSLQYLKSFPVDELKIDKVFLDDWMASKSHLLDVIVHLGKSLNLHVVAEGV
EDEEMLAHLKSTDCDSFQGYLYARPASPSAIEQLLRSLKNSPDA
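
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first codons. The sketch below uses the standard codon assignments and assumes, as is usual for bacterial genes, that a GTG or TTG initiation codon is read as methionine. That is why the gene starts with GTG while the protein starts with M. The function and constant names are illustrative.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: these initiation codons are all translated as Met.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        codon = cds[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 bases of the gene sequence in this record.
print(translate("GTGGCTTCATTGCGTGATTTACCA"))  # MASLRDLP
```

The 2055 bases form 685 codons; the last one (TGA) is a stop codon, which leaves the 684 translated residues reported above.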

Sequences:

>Translated_684_residues
MASLRDLPVDEFIDALSVPCCYISQDGTVLLWNTHAEVLFGWERDEVLHQPLPVVRQLDHALETWSPLLLQYPFSTTAMT
PFQHKDGSVIYATAYLQSFSLEGIYGYFLAFLPKGFGPLISTEQFNLLHHFQEMIKQSTHYLATNAKGEIVDINPSLCRL
LDATEQQLIGREWFELIHDSSEQDGISRNVLKSLATNRIWNGEMPISCKQHESEPCWLNLTVVPIVSEKNEVIQYTAFGF
DVSEKKRLEKEVQFLAYQNELTGLYNKKGFLRRYDSILRQIDEREGLLHIALFDIDRFKIINESFGSRVGNELLIQIKER
ALHLIPESALLFHPTGGLFGVMFFEESKEEVFQVLRHLQHELQRPFRIYHHSIMVSISIGCVFYPSSTSSLEELYTRAES
ALFKGKQIGVGTIQFVTKDMDAAFSRQIHIEKAMYRALEEKHFYLEYQPKYELATDRLIGFEALLRWHHDELGQIPPSEF
IPLAEEMALIVPINNWVILEATKQLKAWKAEFDQPLSMAINISPNQFRSDSFLNTLRNIKQQLQLDPADIILEITESLVM
QQTDEIIERMEQIKRLNYRLSIDDFGTGFSSLQYLKSFPVDELKIDKVFLDDWMASKSHLLDVIVHLGKSLNLHVVAEGV
EDEEMLAHLKSTDCDSFQGYLYARPASPSAIEQLLRSLKNSPDA
>Mature_683_residues
ASLRDLPVDEFIDALSVPCCYISQDGTVLLWNTHAEVLFGWERDEVLHQPLPVVRQLDHALETWSPLLLQYPFSTTAMTP
FQHKDGSVIYATAYLQSFSLEGIYGYFLAFLPKGFGPLISTEQFNLLHHFQEMIKQSTHYLATNAKGEIVDINPSLCRLL
DATEQQLIGREWFELIHDSSEQDGISRNVLKSLATNRIWNGEMPISCKQHESEPCWLNLTVVPIVSEKNEVIQYTAFGFD
VSEKKRLEKEVQFLAYQNELTGLYNKKGFLRRYDSILRQIDEREGLLHIALFDIDRFKIINESFGSRVGNELLIQIKERA
LHLIPESALLFHPTGGLFGVMFFEESKEEVFQVLRHLQHELQRPFRIYHHSIMVSISIGCVFYPSSTSSLEELYTRAESA
LFKGKQIGVGTIQFVTKDMDAAFSRQIHIEKAMYRALEEKHFYLEYQPKYELATDRLIGFEALLRWHHDELGQIPPSEFI
PLAEEMALIVPINNWVILEATKQLKAWKAEFDQPLSMAINISPNQFRSDSFLNTLRNIKQQLQLDPADIILEITESLVMQ
QTDEIIERMEQIKRLNYRLSIDDFGTGFSSLQYLKSFPVDELKIDKVFLDDWMASKSHLLDVIVHLGKSLNLHVVAEGVE
DEEMLAHLKSTDCDSFQGYLYARPASPSAIEQLLRSLKNSPDA

Specific function: Unknown

COG id: COG5001

COG function: function code T; Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain

Gene ontology:

Cell location: Cell membrane; Multi-pass membrane protein (Potential) [H]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 MHYT domain [H]

Homologues:

Organism=Escherichia coli, GI1787541, Length=547, Percent_Identity=28.1535648994516, Blast_Score=231, Evalue=1e-61,
Organism=Escherichia coli, GI87081921, Length=436, Percent_Identity=30.5045871559633, Blast_Score=211, Evalue=1e-55,
Organism=Escherichia coli, GI226510982, Length=292, Percent_Identity=31.8493150684932, Blast_Score=148, Evalue=1e-36,
Organism=Escherichia coli, GI87081980, Length=259, Percent_Identity=36.2934362934363, Blast_Score=141, Evalue=1e-34,
Organism=Escherichia coli, GI1786507, Length=239, Percent_Identity=35.1464435146443, Blast_Score=137, Evalue=2e-33,
Organism=Escherichia coli, GI1790496, Length=250, Percent_Identity=34, Blast_Score=135, Evalue=6e-33,
Organism=Escherichia coli, GI87081845, Length=250, Percent_Identity=29.2, Blast_Score=134, Evalue=2e-32,
Organism=Escherichia coli, GI1788502, Length=244, Percent_Identity=32.3770491803279, Blast_Score=130, Evalue=2e-31,
Organism=Escherichia coli, GI1787055, Length=246, Percent_Identity=30.8943089430894, Blast_Score=129, Evalue=6e-31,
Organism=Escherichia coli, GI87081743, Length=230, Percent_Identity=31.7391304347826, Blast_Score=115, Evalue=1e-26,
Organism=Escherichia coli, GI1788849, Length=282, Percent_Identity=30.8510638297872, Blast_Score=109, Evalue=6e-25,
Organism=Escherichia coli, GI87082096, Length=243, Percent_Identity=26.3374485596708, Blast_Score=93, Evalue=6e-20,
Organism=Escherichia coli, GI1787410, Length=153, Percent_Identity=33.3333333333333, Blast_Score=67, Evalue=5e-12,
Organism=Escherichia coli, GI1788381, Length=300, Percent_Identity=18.3333333333333, Blast_Score=62, Evalue=9e-11,
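
Each homologue line is a comma-separated list of `key=value` fields, with the GI number given as a bare `GI…` token. A minimal parsing sketch follows; the function name and the choice of output types are assumptions made for illustration, not part of the record.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line of this record into a dictionary."""
    record = {}
    # Drop the trailing comma, then split into individual fields.
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare token such as "GI1787541"
    # Convert the numeric fields.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = float(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


line = ("Organism=Escherichia coli, GI1787541, Length=547, "
        "Percent_Identity=28.1535648994516, Blast_Score=231, Evalue=1e-61,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 2))
# Escherichia coli 1787541 28.15
```

Parsed this way, the hits can be sorted or filtered by E-value or identity without relying on the order in which they are listed.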

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro: IPR001054
- InterPro: IPR000160
- InterPro: IPR001633
- InterPro: IPR005330 [H]

Pfam domain/function: PF00563 EAL; PF00990 GGDEF; PF03707 MHYT [H]

EC number: NA

Molecular weight: Translated: 78718; Mature: 78587
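
The two molecular weights differ by 131, which matches the average mass of one methionine residue. This is consistent with the mature form being the translated form without its N-terminal Met. The sketch below checks that arithmetic; the record does not state how the masses were calculated, so the residue mass used here (methionine minus one water) is an assumption.

```python
# Average mass of a Met residue in a peptide chain, in Da:
# free methionine (about 149.21) minus one water (about 18.02).
MET_RESIDUE_MASS = 131.19


def mature_mass(translated_mass: float) -> int:
    """Mass after removing the N-terminal Met residue, rounded to 1 Da."""
    return round(translated_mass - MET_RESIDUE_MASS)


print(mature_mass(78718))  # 78587
```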

Theoretical pI: Translated: 5.03; Mature: 5.03

Prosite motif: PS50112 PAS; PS50113 PAC; PS50883 EAL; PS50887 GGDEF

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.0 %Cys     (Translated Protein)
2.0 %Met     (Translated Protein)
3.1 %Cys+Met (Translated Protein)
1.0 %Cys     (Mature Protein)
1.9 %Met     (Mature Protein)
2.9 %Cys+Met (Mature Protein)
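
These percentages follow from simple residue counts. The sketch below shows the calculation on a toy sequence and then repeats the arithmetic with the counts found in the protein sequence of this record (7 Cys and 14 Met in the translated form, one Met fewer in the mature form). The helper name and the rounding to one decimal place are assumptions.

```python
def percent_content(seq: str, residues: str) -> float:
    """Percentage of the given one-letter residues in a protein sequence."""
    seq = "".join(seq.split()).upper()
    hits = sum(seq.count(r) for r in residues)
    return round(100.0 * hits / len(seq), 1)


# Toy sequence for illustration only.
print(percent_content("MCAAAAAAAA", "C"))   # 10.0
print(percent_content("MCAAAAAAAA", "CM"))  # 20.0

# Same arithmetic with the counts from this record.
print(round(100 * 7 / 684, 1), round(100 * 14 / 684, 1),
      round(100 * 21 / 684, 1))  # 1.0 2.0 3.1
print(round(100 * 7 / 683, 1), round(100 * 13 / 683, 1),
      round(100 * 20 / 683, 1))  # 1.0 1.9 2.9
```

The combined Cys+Met value is rounded from the combined count, which is why 3.1 is not the sum of the rounded 1.0 and 2.0.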

Secondary structure:

>Translated Secondary Structure
MASLRDLPVDEFIDALSVPCCYISQDGTVLLWNTHAEVLFGWERDEVLHQPLPVVRQLDH
CCCCCCCCHHHHHHHHCCCEEEECCCCEEEEEECCEEEEECCCHHHHHHCCCHHHHHHHH
ALETWSPLLLQYPFSTTAMTPFQHKDGSVIYATAYLQSFSLEGIYGYFLAFLPKGFGPLI
HHHHCCCCEEECCCCCCCCCCCCCCCCCEEEEEEHHHHCCCCHHHHHHHHHHCCCCCCCC
STEQFNLLHHFQEMIKQSTHYLATNAKGEIVDINPSLCRLLDATEQQLIGREWFELIHDS
CCHHHHHHHHHHHHHHCCCCEEEECCCCCEEECCHHHHHHHHHHHHHHHHHHHHHHHCCC
SEQDGISRNVLKSLATNRIWNGEMPISCKQHESEPCWLNLTVVPIVSEKNEVIQYTAFGF
CCCCCHHHHHHHHHHHCCEECCCCCCCCCCCCCCCCEEEEEEEEEECCCCCEEEEEEECC
DVSEKKRLEKEVQFLAYQNELTGLYNKKGFLRRYDSILRQIDEREGLLHIALFDIDRFKI
CCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCEEEEEEEECHHHHH
INESFGSRVGNELLIQIKERALHLIPESALLFHPTGGLFGVMFFEESKEEVFQVLRHLQH
HHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCEEEEEEECCCHHHHHHHHHHHHH
ELQRPFRIYHHSIMVSISIGCVFYPSSTSSLEELYTRAESALFKGKQIGVGTIQFVTKDM
HHHCHHHHHHHEEEEEEEEEEEEECCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHH
DAAFSRQIHIEKAMYRALEEKHFYLEYQPKYELATDRLIGFEALLRWHHDELGQIPPSEF
HHHHHHHHHHHHHHHHHHHHCCEEEEECCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHC
IPLAEEMALIVPINNWVILEATKQLKAWKAEFDQPLSMAINISPNQFRSDSFLNTLRNIK
CCCHHHCEEEEECCCEEEEEEHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHH
QQLQLDPADIILEITESLVMQQTDEIIERMEQIKRLNYRLSIDDFGTGFSSLQYLKSFPV
HHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEHHCCCCHHHHHHHHHCCC
DELKIDKVFLDDWMASKSHLLDVIVHLGKSLNLHVVAEGVEDEEMLAHLKSTDCDSFQGY
CCEEEHHHHHHHHHCCHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHCCCCCCCCCE
LYARPASPSAIEQLLRSLKNSPDA
EEECCCCCHHHHHHHHHHCCCCCC
>Mature Secondary Structure 
ASLRDLPVDEFIDALSVPCCYISQDGTVLLWNTHAEVLFGWERDEVLHQPLPVVRQLDH
CCCCCCCHHHHHHHHCCCEEEECCCCEEEEEECCEEEEECCCHHHHHHCCCHHHHHHHH
ALETWSPLLLQYPFSTTAMTPFQHKDGSVIYATAYLQSFSLEGIYGYFLAFLPKGFGPLI
HHHHCCCCEEECCCCCCCCCCCCCCCCCEEEEEEHHHHCCCCHHHHHHHHHHCCCCCCCC
STEQFNLLHHFQEMIKQSTHYLATNAKGEIVDINPSLCRLLDATEQQLIGREWFELIHDS
CCHHHHHHHHHHHHHHCCCCEEEECCCCCEEECCHHHHHHHHHHHHHHHHHHHHHHHCCC
SEQDGISRNVLKSLATNRIWNGEMPISCKQHESEPCWLNLTVVPIVSEKNEVIQYTAFGF
CCCCCHHHHHHHHHHHCCEECCCCCCCCCCCCCCCCEEEEEEEEEECCCCCEEEEEEECC
DVSEKKRLEKEVQFLAYQNELTGLYNKKGFLRRYDSILRQIDEREGLLHIALFDIDRFKI
CCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCEEEEEEEECHHHHH
INESFGSRVGNELLIQIKERALHLIPESALLFHPTGGLFGVMFFEESKEEVFQVLRHLQH
HHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCEEEEEEECCCHHHHHHHHHHHHH
ELQRPFRIYHHSIMVSISIGCVFYPSSTSSLEELYTRAESALFKGKQIGVGTIQFVTKDM
HHHCHHHHHHHEEEEEEEEEEEEECCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHH
DAAFSRQIHIEKAMYRALEEKHFYLEYQPKYELATDRLIGFEALLRWHHDELGQIPPSEF
HHHHHHHHHHHHHHHHHHHHCCEEEEECCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHC
IPLAEEMALIVPINNWVILEATKQLKAWKAEFDQPLSMAINISPNQFRSDSFLNTLRNIK
CCCHHHCEEEEECCCEEEEEEHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHH
QQLQLDPADIILEITESLVMQQTDEIIERMEQIKRLNYRLSIDDFGTGFSSLQYLKSFPV
HHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEHHCCCCHHHHHHHHHCCC
DELKIDKVFLDDWMASKSHLLDVIVHLGKSLNLHVVAEGVEDEEMLAHLKSTDCDSFQGY
CCEEEHHHHHHHHHCCHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHCCCCCCCCCE
LYARPASPSAIEQLLRSLKNSPDA
EEECCCCCHHHHHHHHHHCCCCCC
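
The prediction strings above pair each residue with one state letter. Assuming the common three-state convention (H for helix, E for strand, C for coil), which the record does not spell out, the overall composition can be summarised as follows. The function name is illustrative and the input is a toy string.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of H, E and C states in a secondary-structure string."""
    states = "".join(states.split()).upper()
    counts = Counter(states)
    unknown = set(counts) - set("HEC")
    if unknown:
        raise ValueError(f"unexpected state letters: {sorted(unknown)}")
    return {s: round(100.0 * counts[s] / len(states), 1) for s in "HEC"}


# Toy string for illustration only.
print(ss_composition("HHHHEECCCC"))  # {'H': 40.0, 'E': 20.0, 'C': 40.0}
```

Only the state lines should be passed in; the residue lines that alternate with them in the listing contain other letters and would be rejected.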

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 10984043 [H]