Definition Azoarcus sp. BH72 chromosome, complete genome.
Accession NC_008702
Length 4,376,040

The map label for this gene is yciR [C]

Identifier: 119899816

GI number: 119899816

Start: 3870767

End: 3872938

Strand: Direct

Name: yciR [C]

Synonym: azo3527

Alternate gene names: 119899816

Gene position: 3870767-3872938 (Clockwise)

Preceding gene: 119899815

Following gene: 119899817

Centisome position: 88.45
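
The length and centisome fields can be cross-checked from the coordinates above. The sketch below is illustrative only: the function names are hypothetical, and it assumes the centisome position is the start coordinate expressed as a percentage of the chromosome length, which is consistent with the values in this record but is not stated by it.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of genome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


# Values taken from this record.
print(gene_length(3870767, 3872938))   # 2172
print(centisome(3870767, 4376040))     # 88.45
```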

GC content: 65.84

Gene sequence:

>2172_bases
ATGCCACCGCTCATTCCCCCGACCTCGCAGGACTCGACGCTCACGCCGCACCGCCTGCGGGTCATCGGCGAACTCGCGCG
CCTGCTCGAATCGCAGACCCGCACGCGCTACTTCAGCGAGGCGCTGGTCGGCACCACGCGCGCGATGCACGCCGAAGCGA
TGGAGCGCCTGACCAGTGCGATCGGCGAGACCAGCAGCCCGCACTTCGTCGAAACCCTGGAACAGGCGCGCGACATCTGC
GGGCGGCTGTCCGACAGCTTCGACCTGCTGCTGGCCGAGCGCAACCGCCAGTCGGCCGCCAGCCATGAGGGGCTGCGCGA
CCTGATCAGCGAGTTCGACGACATGTTCACGCTGTTGTCGAACACCCTGGTCGAGCGCGAACTGCTCGAACGCCAGAGCC
AGGTGCTCGAGCAGATCATCCTGTCGCACGAGCGCATCGGGCAGTGGAAGGCCTTCGTCCAGGCCATCCTCAGCGACTTC
CACGCGATCTTCCCGTTCGACCTCTTCTTCGTCGCCTTCGCCGAAGAGAACGGGCTGATGCTCAACGTGTATTACTTCGG
CGAATGTGACGAGGCTTACCGCCTCGCCACGCGCCAGCAGCTCGCCCGGAGCATGATCGAAAGCCTCGGTCTGCCGGCGG
ATTCGCCGCTGGACTACGAGGAATTCCAGGTCCAGCGCAACCAGAGCATCCGCCCGGTGCGGCCGGAGCAGATGATCACG
GTGCGGGTGCCGGAGCACTCCTCGCAGCTCGCCGGCCTGCTCGGCGTCACCTTCCTGTCGTCGCAGGAGCTGAACGCGCG
CGAGGAAAGCGTCATCCGCTCCATCCTCGCGGTGATGGTGATGGTGGTCGGTTCCAGCAAGGTGCTGTCGCGCACGCTGG
CCGAGCTGGAGTACTACTCCATGCACGACCCGCTCACCGGCCTGTACAACCGGCGCCACTTCAACAACATGCTGGAGTAC
GAAATCGGCCGCTCGGAGCGCCACGAGCACGAATTCGCGCTGCTGCTGCTCGACCTCGACGACTTCAAGGACGTCAACGA
CTCCTACGGCCACCCCACCGGCGACAGCGTGCTGGTGCGGGTCGCCGAGATCCTGCGCGGCCACATCCGCAAGGGCGATC
TGGCCACCCGCATCGGCGGCGACGAGTTCGCCATCATGCTGATGGAAACCGGCACCGAGGGCGCGGTATCGGTGGCGGAA
AAGCTGGGCGCCGCGCTGCGCGCCACCAGCTTCGAAAGCCCCAACGGCAAGCGCTTCCACATCACCACCTCGATCGGCGT
GGTGGTCTATCCGCGCGACGCGCGCACCGAGCACGACCTGCTCGCCGGCGTGGACATCGCGATGTACCGCGCCAAGGAGC
TGGGCAAGGACAGCGCATGCACGCTGGCCTCGATGCCCGGCCAGCTGAAGGCCACCCGCGTCACCCGCGACTACGCCGAG
AAGCTGCGCGAGGCGCTGCGCGAGAACCGCATGGTTCCGTACTTCCAGCCCATCGTCGATTGCCACACCGGCGTGCCCTT
CGCCTGCGAGACGGTCGCCCGCCTGAAGGAGAAGACCGGCGAAACCATCGCCGCCGGCGCCTTCATCGACACCATCGAGA
AATACGGCCTCGGCCGCGAACTCGACCGCGTCATCATCCGCCAGACGCTCGAAGCCGCGGCCGCGCGCGCCCGCACCGGC
GCACCGCCGCTGCGCGTGTTCATCAACCTGTCGGCGCAGGAAATCCAGGGCCGCGGCATCCTCGGCTATGCCGAAGAGCT
GTGCAACGAACTCGGCATCCCGCCCAGCCAGGTGGTGTTCGAAATCCTCGAACGCGACGCCATCGGCGACATGACGAACA
TGCGCAAATTCCTCGCGAACCTGCGCAAAAAGGGCTTCGCCTTCGCGCTGGACGACTTCGGCAGCGGCTACAACTCCTTC
CACTACCTGCGCGAACTGCACTTCGAATTCGTCAAGATCGACGGCGCCTTCGTGCGCAGCATCGTCGAATCGCCGATCGA
CCGTGCGCTGGTGCGCAACCTCACCAACCTGTGCAAGGAAATCGGCATCCTCACCGTCGCCGAGTTCGTCGAATCCGAGG
AAATCCTCGAGATGCTGCGCGAGATGGGCATCGACTACGTCCAGGGCTACCACATCGGCATGCCGCTGCCGCAGATGCCC
GATGTCGCTTGA
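
The GC content field can in principle be recomputed from a FASTA-style block such as the one above. This is a minimal sketch under assumptions: the helper names are hypothetical, GC content is taken as the percentage of G and C bases over all bases, and the short record in the example is made up for illustration rather than taken from this gene.

```python
def read_fasta_sequence(text: str) -> str:
    """Join the sequence lines of one FASTA-style record, skipping the '>' header."""
    return "".join(
        line.strip().upper()
        for line in text.splitlines()
        if line.strip() and not line.startswith(">")
    )


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Made-up toy record; paste the real block in its place to check the field.
toy_record = ">8_bases\nATGC\nGGCC\n"
toy_seq = read_fasta_sequence(toy_record)
print(len(toy_seq), gc_content(toy_seq))   # 8 75.0
```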

Upstream 100 bases:

>100_bases
GCGGCAAGGCGGTGGGCGACTTCCTCGCCTGGCTGGGCAAGCTCGAATGCGGCATCGACCTGCTCGAAGCCGGCGGGTGT
TTCCGGAAGGCCTAAGCCGC

Downstream 100 bases:

>100_bases
ATGATGCCCGATGGCTGCCGGTTGACGTAACGGCAGCTTGCACGCCGCCGCGGCGCTCGACTATCCTCGCGCTGCATTCG
ATCCGGGTGCCCTGGAATGC

Product: GGDEF/EAL-domain-containing protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 723; Mature: 722
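
The two residue counts follow from the 2172-base gene length: 724 codons, one of which is the stop codon, give 723 translated residues, and the mature form listed in this record lacks the initiator methionine. A short sketch of that arithmetic, with a hypothetical function name and assuming a single stop codon and removal of only the first residue:

```python
def protein_lengths(cds_bases: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes the CDS includes exactly one stop codon and that the mature
    protein differs only by loss of the initiator methionine.
    """
    if cds_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    translated = cds_bases // 3 - 1   # drop the stop codon
    mature = translated - 1           # drop the initiator Met
    return translated, mature


print(protein_lengths(2172))   # (723, 722)
```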

Protein sequence:

>723_residues
MPPLIPPTSQDSTLTPHRLRVIGELARLLESQTRTRYFSEALVGTTRAMHAEAMERLTSAIGETSSPHFVETLEQARDIC
GRLSDSFDLLLAERNRQSAASHEGLRDLISEFDDMFTLLSNTLVERELLERQSQVLEQIILSHERIGQWKAFVQAILSDF
HAIFPFDLFFVAFAEENGLMLNVYYFGECDEAYRLATRQQLARSMIESLGLPADSPLDYEEFQVQRNQSIRPVRPEQMIT
VRVPEHSSQLAGLLGVTFLSSQELNAREESVIRSILAVMVMVVGSSKVLSRTLAELEYYSMHDPLTGLYNRRHFNNMLEY
EIGRSERHEHEFALLLLDLDDFKDVNDSYGHPTGDSVLVRVAEILRGHIRKGDLATRIGGDEFAIMLMETGTEGAVSVAE
KLGAALRATSFESPNGKRFHITTSIGVVVYPRDARTEHDLLAGVDIAMYRAKELGKDSACTLASMPGQLKATRVTRDYAE
KLREALRENRMVPYFQPIVDCHTGVPFACETVARLKEKTGETIAAGAFIDTIEKYGLGRELDRVIIRQTLEAAAARARTG
APPLRVFINLSAQEIQGRGILGYAEELCNELGIPPSQVVFEILERDAIGDMTNMRKFLANLRKKGFAFALDDFGSGYNSF
HYLRELHFEFVKIDGAFVRSIVESPIDRALVRNLTNLCKEIGILTVAEFVESEEILEMLREMGIDYVQGYHIGMPLPQMP
DVA

Sequences:

>Translated_723_residues
MPPLIPPTSQDSTLTPHRLRVIGELARLLESQTRTRYFSEALVGTTRAMHAEAMERLTSAIGETSSPHFVETLEQARDIC
GRLSDSFDLLLAERNRQSAASHEGLRDLISEFDDMFTLLSNTLVERELLERQSQVLEQIILSHERIGQWKAFVQAILSDF
HAIFPFDLFFVAFAEENGLMLNVYYFGECDEAYRLATRQQLARSMIESLGLPADSPLDYEEFQVQRNQSIRPVRPEQMIT
VRVPEHSSQLAGLLGVTFLSSQELNAREESVIRSILAVMVMVVGSSKVLSRTLAELEYYSMHDPLTGLYNRRHFNNMLEY
EIGRSERHEHEFALLLLDLDDFKDVNDSYGHPTGDSVLVRVAEILRGHIRKGDLATRIGGDEFAIMLMETGTEGAVSVAE
KLGAALRATSFESPNGKRFHITTSIGVVVYPRDARTEHDLLAGVDIAMYRAKELGKDSACTLASMPGQLKATRVTRDYAE
KLREALRENRMVPYFQPIVDCHTGVPFACETVARLKEKTGETIAAGAFIDTIEKYGLGRELDRVIIRQTLEAAAARARTG
APPLRVFINLSAQEIQGRGILGYAEELCNELGIPPSQVVFEILERDAIGDMTNMRKFLANLRKKGFAFALDDFGSGYNSF
HYLRELHFEFVKIDGAFVRSIVESPIDRALVRNLTNLCKEIGILTVAEFVESEEILEMLREMGIDYVQGYHIGMPLPQMP
DVA
>Mature_722_residues
PPLIPPTSQDSTLTPHRLRVIGELARLLESQTRTRYFSEALVGTTRAMHAEAMERLTSAIGETSSPHFVETLEQARDICG
RLSDSFDLLLAERNRQSAASHEGLRDLISEFDDMFTLLSNTLVERELLERQSQVLEQIILSHERIGQWKAFVQAILSDFH
AIFPFDLFFVAFAEENGLMLNVYYFGECDEAYRLATRQQLARSMIESLGLPADSPLDYEEFQVQRNQSIRPVRPEQMITV
RVPEHSSQLAGLLGVTFLSSQELNAREESVIRSILAVMVMVVGSSKVLSRTLAELEYYSMHDPLTGLYNRRHFNNMLEYE
IGRSERHEHEFALLLLDLDDFKDVNDSYGHPTGDSVLVRVAEILRGHIRKGDLATRIGGDEFAIMLMETGTEGAVSVAEK
LGAALRATSFESPNGKRFHITTSIGVVVYPRDARTEHDLLAGVDIAMYRAKELGKDSACTLASMPGQLKATRVTRDYAEK
LREALRENRMVPYFQPIVDCHTGVPFACETVARLKEKTGETIAAGAFIDTIEKYGLGRELDRVIIRQTLEAAAARARTGA
PPLRVFINLSAQEIQGRGILGYAEELCNELGIPPSQVVFEILERDAIGDMTNMRKFLANLRKKGFAFALDDFGSGYNSFH
YLRELHFEFVKIDGAFVRSIVESPIDRALVRNLTNLCKEIGILTVAEFVESEEILEMLREMGIDYVQGYHIGMPLPQMPD
VA

Specific function: Unknown

COG id: COG5001

COG function: function code T; Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain

Gene ontology:

Cell location: Cell membrane; Multi-pass membrane protein (Potential) [H]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 MHYT domain [H]

Homologues:

Organism=Escherichia coli, GI1787541, Length=415, Percent_Identity=26.9879518072289, Blast_Score=154, Evalue=2e-38,
Organism=Escherichia coli, GI1788849, Length=238, Percent_Identity=33.1932773109244, Blast_Score=134, Evalue=2e-32,
Organism=Escherichia coli, GI87081921, Length=418, Percent_Identity=26.555023923445, Blast_Score=130, Evalue=4e-31,
Organism=Escherichia coli, GI1790496, Length=248, Percent_Identity=28.2258064516129, Blast_Score=120, Evalue=2e-28,
Organism=Escherichia coli, GI87082096, Length=236, Percent_Identity=32.6271186440678, Blast_Score=119, Evalue=6e-28,
Organism=Escherichia coli, GI87081845, Length=237, Percent_Identity=30.8016877637131, Blast_Score=117, Evalue=3e-27,
Organism=Escherichia coli, GI1786507, Length=238, Percent_Identity=29.8319327731092, Blast_Score=116, Evalue=4e-27,
Organism=Escherichia coli, GI1788381, Length=432, Percent_Identity=23.3796296296296, Blast_Score=115, Evalue=1e-26,
Organism=Escherichia coli, GI1786584, Length=164, Percent_Identity=34.1463414634146, Blast_Score=111, Evalue=2e-25,
Organism=Escherichia coli, GI226510982, Length=438, Percent_Identity=23.972602739726, Blast_Score=107, Evalue=2e-24,
Organism=Escherichia coli, GI87081743, Length=250, Percent_Identity=26.8, Blast_Score=107, Evalue=3e-24,
Organism=Escherichia coli, GI1787262, Length=158, Percent_Identity=34.1772151898734, Blast_Score=106, Evalue=6e-24,
Organism=Escherichia coli, GI87081881, Length=185, Percent_Identity=36.7567567567568, Blast_Score=102, Evalue=7e-23,
Organism=Escherichia coli, GI145693134, Length=210, Percent_Identity=30, Blast_Score=98, Evalue=1e-21,
Organism=Escherichia coli, GI1788502, Length=237, Percent_Identity=27.0042194092827, Blast_Score=97, Evalue=5e-21,
Organism=Escherichia coli, GI1788956, Length=165, Percent_Identity=37.5757575757576, Blast_Score=86, Evalue=7e-18,
Organism=Escherichia coli, GI1788085, Length=161, Percent_Identity=31.055900621118, Blast_Score=85, Evalue=1e-17,
Organism=Escherichia coli, GI87082007, Length=165, Percent_Identity=30.3030303030303, Blast_Score=85, Evalue=1e-17,
Organism=Escherichia coli, GI87081980, Length=250, Percent_Identity=26.4, Blast_Score=84, Evalue=3e-17,
Organism=Escherichia coli, GI87081977, Length=154, Percent_Identity=29.8701298701299, Blast_Score=76, Evalue=8e-15,
Organism=Escherichia coli, GI1787802, Length=219, Percent_Identity=28.310502283105, Blast_Score=75, Evalue=2e-14,
Organism=Escherichia coli, GI87081974, Length=144, Percent_Identity=27.7777777777778, Blast_Score=74, Evalue=4e-14,
Organism=Escherichia coli, GI1787055, Length=244, Percent_Identity=26.2295081967213, Blast_Score=67, Evalue=4e-12,
Organism=Escherichia coli, GI1787816, Length=157, Percent_Identity=31.8471337579618, Blast_Score=65, Evalue=1e-11,
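
The homologue lines above follow a regular comma-separated `key=value` layout, with the GI number as a bare `GI…` token. The parser below is a sketch with hypothetical names, assuming every line carries the same six fields as the entries listed here.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a typed dict."""
    raw = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            raw[key] = value
        elif field.startswith("GI"):
            raw["GI"] = field[2:]   # bare token such as GI1787541
    return {
        "organism": raw["Organism"],
        "gi": raw["GI"],
        "length": int(raw["Length"]),
        "percent_identity": float(raw["Percent_Identity"]),
        "blast_score": int(raw["Blast_Score"]),
        "evalue": float(raw["Evalue"]),
    }


hit = parse_homologue(
    "Organism=Escherichia coli, GI87081743, Length=250, "
    "Percent_Identity=26.8, Blast_Score=107, Evalue=3e-24,"
)
print(hit["gi"], hit["length"], hit["evalue"])
```

Parsed records can then be sorted or filtered, for example by `evalue`, without relying on the order in which the page lists them.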

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro: IPR001054
- InterPro: IPR000160
- InterPro: IPR001633
- InterPro: IPR005330 [H]

Pfam domain/function: PF00563 EAL; PF00990 GGDEF; PF03707 MHYT [H]

EC number: NA

Molecular weight: Translated: 81239; Mature: 81108

Theoretical pI: Translated: 5.04; Mature: 5.04

Prosite motif: PS50883 EAL; PS50887 GGDEF

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.0 %Cys     (Translated Protein)
3.0 %Met     (Translated Protein)
4.0 %Cys+Met (Translated Protein)
1.0 %Cys     (Mature Protein)
2.9 %Met     (Mature Protein)
3.9 %Cys+Met (Mature Protein)
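
Percentages like those above can be derived from a protein sequence by counting residues. A minimal sketch, assuming the values are residue counts divided by sequence length and rounded to one decimal place; the function name and the ten-residue example are hypothetical and not taken from this protein.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` whose residue is in `residues`."""
    if not protein:
        raise ValueError("empty sequence")
    count = sum(1 for aa in protein if aa in residues)
    return round(100.0 * count / len(protein), 1)


toy = "MCAAAAAAAA"   # made-up sequence: 1 Met, 1 Cys, 8 Ala
print(residue_percent(toy, "C"))    # 10.0
print(residue_percent(toy, "M"))    # 10.0
print(residue_percent(toy, "CM"))   # 20.0
```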

Secondary structure:

>Translated Secondary Structure
MPPLIPPTSQDSTLTPHRLRVIGELARLLESQTRTRYFSEALVGTTRAMHAEAMERLTSA
CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
IGETSSPHFVETLEQARDICGRLSDSFDLLLAERNRQSAASHEGLRDLISEFDDMFTLLS
HCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHH
NTLVERELLERQSQVLEQIILSHERIGQWKAFVQAILSDFHAIFPFDLFFVAFAEENGLM
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEECCCCEE
LNVYYFGECDEAYRLATRQQLARSMIESLGLPADSPLDYEEFQVQRNQSIRPVRPEQMIT
EEEEEECCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCCCCCCCCCEEE
VRVPEHSSQLAGLLGVTFLSSQELNAREESVIRSILAVMVMVVGSSKVLSRTLAELEYYS
EECCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH
MHDPLTGLYNRRHFNNMLEYEIGRSERHEHEFALLLLDLDDFKDVNDSYGHPTGDSVLVR
HCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECHHHCCCCCCCCCCCCHHHHHH
VAEILRGHIRKGDLATRIGGDEFAIMLMETGTEGAVSVAEKLGAALRATSFESPNGKRFH
HHHHHHHHCCCCCHHHHCCCCCEEEEEEECCCCHHHHHHHHHHHHHHHCCCCCCCCCEEE
ITTSIGVVVYPRDARTEHDLLAGVDIAMYRAKELGKDSACTLASMPGQLKATRVTRDYAE
EEEECEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCHHCCCCCHHHHHHHHHHHH
KLREALRENRMVPYFQPIVDCHTGVPFACETVARLKEKTGETIAAGAFIDTIEKYGLGRE
HHHHHHHHCCCCCHHHHHHHHCCCCCHHHHHHHHHHHHCCCEEEHHHHHHHHHHHCCCHH
LDRVIIRQTLEAAAARARTGAPPLRVFINLSAQEIQGRGILGYAEELCNELGIPPSQVVF
HHHHHHHHHHHHHHHHHCCCCCCEEEEEECCHHHHCCCCCHHHHHHHHHHHCCCHHHHHH
EILERDAIGDMTNMRKFLANLRKKGFAFALDDFGSGYNSFHYLRELHFEFVKIDGAFVRS
HHHHHHHCCCHHHHHHHHHHHHHCCCEEEEHHHCCCHHHHHHHHHHHHHHHEECHHHHHH
IVESPIDRALVRNLTNLCKEIGILTVAEFVESEEILEMLREMGIDYVQGYHIGMPLPQMP
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHCCCHHHCCEECCCCCCCCC
DVA
CCC
>Mature Secondary Structure
PPLIPPTSQDSTLTPHRLRVIGELARLLESQTRTRYFSEALVGTTRAMHAEAMERLTSA
CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
IGETSSPHFVETLEQARDICGRLSDSFDLLLAERNRQSAASHEGLRDLISEFDDMFTLLS
HCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHH
NTLVERELLERQSQVLEQIILSHERIGQWKAFVQAILSDFHAIFPFDLFFVAFAEENGLM
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEECCCCEE
LNVYYFGECDEAYRLATRQQLARSMIESLGLPADSPLDYEEFQVQRNQSIRPVRPEQMIT
EEEEEECCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCCCCCCCCCEEE
VRVPEHSSQLAGLLGVTFLSSQELNAREESVIRSILAVMVMVVGSSKVLSRTLAELEYYS
EECCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH
MHDPLTGLYNRRHFNNMLEYEIGRSERHEHEFALLLLDLDDFKDVNDSYGHPTGDSVLVR
HCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECHHHCCCCCCCCCCCCHHHHHH
VAEILRGHIRKGDLATRIGGDEFAIMLMETGTEGAVSVAEKLGAALRATSFESPNGKRFH
HHHHHHHHCCCCCHHHHCCCCCEEEEEEECCCCHHHHHHHHHHHHHHHCCCCCCCCCEEE
ITTSIGVVVYPRDARTEHDLLAGVDIAMYRAKELGKDSACTLASMPGQLKATRVTRDYAE
EEEECEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCHHCCCCCHHHHHHHHHHHH
KLREALRENRMVPYFQPIVDCHTGVPFACETVARLKEKTGETIAAGAFIDTIEKYGLGRE
HHHHHHHHCCCCCHHHHHHHHCCCCCHHHHHHHHHHHHCCCEEEHHHHHHHHHHHCCCHH
LDRVIIRQTLEAAAARARTGAPPLRVFINLSAQEIQGRGILGYAEELCNELGIPPSQVVF
HHHHHHHHHHHHHHHHHCCCCCCEEEEEECCHHHHCCCCCHHHHHHHHHHHCCCHHHHHH
EILERDAIGDMTNMRKFLANLRKKGFAFALDDFGSGYNSFHYLRELHFEFVKIDGAFVRS
HHHHHHHCCCHHHHHHHHHHHHHCCCEEEEHHHCCCHHHHHHHHHHHHHHHEECHHHHHH
IVESPIDRALVRNLTNLCKEIGILTVAEFVESEEILEMLREMGIDYVQGYHIGMPLPQMP
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHCCCHHHCCEECCCCCCCCC
DVA
CCC
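
The state strings above can be summarised as a composition. The sketch below assumes the common three-state convention (H = helix, E = strand, C = coil), which this record does not define explicitly; the function name and the example string are hypothetical.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of H, E and C states in a secondary-structure string."""
    counts = Counter(states)
    total = sum(counts[s] for s in "HEC")
    if total == 0:
        raise ValueError("no H/E/C states found")
    return {s: round(100.0 * counts[s] / total, 1) for s in "HEC"}


# Made-up ten-state string; join the real state lines to summarise this protein.
print(ss_composition("HHHHEECCCC"))   # {'H': 40.0, 'E': 20.0, 'C': 40.0}
```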

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 10984043 [H]