Definition | Azoarcus sp. BH72 chromosome, complete genome. |
---|---|
Accession | NC_008702 |
Length | 4,376,040 |
The map label for this gene is yciR [C]
Identifier: 119899816
GI number: 119899816
Start: 3870767
End: 3872938
Strand: Direct
Name: yciR [C]
Synonym: azo3527
Alternate gene names: 119899816
Gene position: 3870767-3872938 (Clockwise)
Preceding gene: 119899815
Following gene: 119899817
Centisome position: 88.45
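The coordinate fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the centisome position is the start coordinate expressed as a percentage of the genome length. Neither convention is stated in this record, although both reproduce the listed values. The function names are hypothetical.

```python
# Values copied from this record.
GENOME_LENGTH = 4_376_040   # Length
GENE_START = 3_870_767      # Start
GENE_END = 3_872_938        # End


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


length_nt = gene_length(GENE_START, GENE_END)
codons = length_nt // 3
residues = codons - 1  # the final codon is a stop codon and encodes no residue

print(length_nt)                                       # bases in the gene
print(residues)                                        # translated residues
print(round(centisome(GENE_START, GENOME_LENGTH), 2))  # centisome position
```

Under these assumptions the gene spans 2172 bases, which matches the `>2172_bases` header of the gene sequence, and 724 codons less one stop codon gives the 723 translated residues reported in this record.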
GC content: 65.84
Gene sequence:
>2172_bases ATGCCACCGCTCATTCCCCCGACCTCGCAGGACTCGACGCTCACGCCGCACCGCCTGCGGGTCATCGGCGAACTCGCGCG CCTGCTCGAATCGCAGACCCGCACGCGCTACTTCAGCGAGGCGCTGGTCGGCACCACGCGCGCGATGCACGCCGAAGCGA TGGAGCGCCTGACCAGTGCGATCGGCGAGACCAGCAGCCCGCACTTCGTCGAAACCCTGGAACAGGCGCGCGACATCTGC GGGCGGCTGTCCGACAGCTTCGACCTGCTGCTGGCCGAGCGCAACCGCCAGTCGGCCGCCAGCCATGAGGGGCTGCGCGA CCTGATCAGCGAGTTCGACGACATGTTCACGCTGTTGTCGAACACCCTGGTCGAGCGCGAACTGCTCGAACGCCAGAGCC AGGTGCTCGAGCAGATCATCCTGTCGCACGAGCGCATCGGGCAGTGGAAGGCCTTCGTCCAGGCCATCCTCAGCGACTTC CACGCGATCTTCCCGTTCGACCTCTTCTTCGTCGCCTTCGCCGAAGAGAACGGGCTGATGCTCAACGTGTATTACTTCGG CGAATGTGACGAGGCTTACCGCCTCGCCACGCGCCAGCAGCTCGCCCGGAGCATGATCGAAAGCCTCGGTCTGCCGGCGG ATTCGCCGCTGGACTACGAGGAATTCCAGGTCCAGCGCAACCAGAGCATCCGCCCGGTGCGGCCGGAGCAGATGATCACG GTGCGGGTGCCGGAGCACTCCTCGCAGCTCGCCGGCCTGCTCGGCGTCACCTTCCTGTCGTCGCAGGAGCTGAACGCGCG CGAGGAAAGCGTCATCCGCTCCATCCTCGCGGTGATGGTGATGGTGGTCGGTTCCAGCAAGGTGCTGTCGCGCACGCTGG CCGAGCTGGAGTACTACTCCATGCACGACCCGCTCACCGGCCTGTACAACCGGCGCCACTTCAACAACATGCTGGAGTAC GAAATCGGCCGCTCGGAGCGCCACGAGCACGAATTCGCGCTGCTGCTGCTCGACCTCGACGACTTCAAGGACGTCAACGA CTCCTACGGCCACCCCACCGGCGACAGCGTGCTGGTGCGGGTCGCCGAGATCCTGCGCGGCCACATCCGCAAGGGCGATC TGGCCACCCGCATCGGCGGCGACGAGTTCGCCATCATGCTGATGGAAACCGGCACCGAGGGCGCGGTATCGGTGGCGGAA AAGCTGGGCGCCGCGCTGCGCGCCACCAGCTTCGAAAGCCCCAACGGCAAGCGCTTCCACATCACCACCTCGATCGGCGT GGTGGTCTATCCGCGCGACGCGCGCACCGAGCACGACCTGCTCGCCGGCGTGGACATCGCGATGTACCGCGCCAAGGAGC TGGGCAAGGACAGCGCATGCACGCTGGCCTCGATGCCCGGCCAGCTGAAGGCCACCCGCGTCACCCGCGACTACGCCGAG AAGCTGCGCGAGGCGCTGCGCGAGAACCGCATGGTTCCGTACTTCCAGCCCATCGTCGATTGCCACACCGGCGTGCCCTT CGCCTGCGAGACGGTCGCCCGCCTGAAGGAGAAGACCGGCGAAACCATCGCCGCCGGCGCCTTCATCGACACCATCGAGA AATACGGCCTCGGCCGCGAACTCGACCGCGTCATCATCCGCCAGACGCTCGAAGCCGCGGCCGCGCGCGCCCGCACCGGC GCACCGCCGCTGCGCGTGTTCATCAACCTGTCGGCGCAGGAAATCCAGGGCCGCGGCATCCTCGGCTATGCCGAAGAGCT GTGCAACGAACTCGGCATCCCGCCCAGCCAGGTGGTGTTCGAAATCCTCGAACGCGACGCCATCGGCGACATGACGAACA TGCGCAAATTCCTCGCGAACCTGCGCAAAAAGGGCTTCGCCTTCGCGCTGGACGACTTCGGCAGCGGCTACAACTCCTTC 
CACTACCTGCGCGAACTGCACTTCGAATTCGTCAAGATCGACGGCGCCTTCGTGCGCAGCATCGTCGAATCGCCGATCGA CCGTGCGCTGGTGCGCAACCTCACCAACCTGTGCAAGGAAATCGGCATCCTCACCGTCGCCGAGTTCGTCGAATCCGAGG AAATCCTCGAGATGCTGCGCGAGATGGGCATCGACTACGTCCAGGGCTACCACATCGGCATGCCGCTGCCGCAGATGCCC GATGTCGCTTGA
Upstream 100 bases:
>100_bases GCGGCAAGGCGGTGGGCGACTTCCTCGCCTGGCTGGGCAAGCTCGAATGCGGCATCGACCTGCTCGAAGCCGGCGGGTGT TTCCGGAAGGCCTAAGCCGC
Downstream 100 bases:
>100_bases ATGATGCCCGATGGCTGCCGGTTGACGTAACGGCAGCTTGCACGCCGCCGCGGCGCTCGACTATCCTCGCGCTGCATTCG ATCCGGGTGCCCTGGAATGC
Product: GGDEF/EAL-domain-containing protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 723; Mature: 722
Protein sequence:
>723_residues MPPLIPPTSQDSTLTPHRLRVIGELARLLESQTRTRYFSEALVGTTRAMHAEAMERLTSAIGETSSPHFVETLEQARDIC GRLSDSFDLLLAERNRQSAASHEGLRDLISEFDDMFTLLSNTLVERELLERQSQVLEQIILSHERIGQWKAFVQAILSDF HAIFPFDLFFVAFAEENGLMLNVYYFGECDEAYRLATRQQLARSMIESLGLPADSPLDYEEFQVQRNQSIRPVRPEQMIT VRVPEHSSQLAGLLGVTFLSSQELNAREESVIRSILAVMVMVVGSSKVLSRTLAELEYYSMHDPLTGLYNRRHFNNMLEY EIGRSERHEHEFALLLLDLDDFKDVNDSYGHPTGDSVLVRVAEILRGHIRKGDLATRIGGDEFAIMLMETGTEGAVSVAE KLGAALRATSFESPNGKRFHITTSIGVVVYPRDARTEHDLLAGVDIAMYRAKELGKDSACTLASMPGQLKATRVTRDYAE KLREALRENRMVPYFQPIVDCHTGVPFACETVARLKEKTGETIAAGAFIDTIEKYGLGRELDRVIIRQTLEAAAARARTG APPLRVFINLSAQEIQGRGILGYAEELCNELGIPPSQVVFEILERDAIGDMTNMRKFLANLRKKGFAFALDDFGSGYNSF HYLRELHFEFVKIDGAFVRSIVESPIDRALVRNLTNLCKEIGILTVAEFVESEEILEMLREMGIDYVQGYHIGMPLPQMP DVA
Sequences:
>Translated_723_residues MPPLIPPTSQDSTLTPHRLRVIGELARLLESQTRTRYFSEALVGTTRAMHAEAMERLTSAIGETSSPHFVETLEQARDIC GRLSDSFDLLLAERNRQSAASHEGLRDLISEFDDMFTLLSNTLVERELLERQSQVLEQIILSHERIGQWKAFVQAILSDF HAIFPFDLFFVAFAEENGLMLNVYYFGECDEAYRLATRQQLARSMIESLGLPADSPLDYEEFQVQRNQSIRPVRPEQMIT VRVPEHSSQLAGLLGVTFLSSQELNAREESVIRSILAVMVMVVGSSKVLSRTLAELEYYSMHDPLTGLYNRRHFNNMLEY EIGRSERHEHEFALLLLDLDDFKDVNDSYGHPTGDSVLVRVAEILRGHIRKGDLATRIGGDEFAIMLMETGTEGAVSVAE KLGAALRATSFESPNGKRFHITTSIGVVVYPRDARTEHDLLAGVDIAMYRAKELGKDSACTLASMPGQLKATRVTRDYAE KLREALRENRMVPYFQPIVDCHTGVPFACETVARLKEKTGETIAAGAFIDTIEKYGLGRELDRVIIRQTLEAAAARARTG APPLRVFINLSAQEIQGRGILGYAEELCNELGIPPSQVVFEILERDAIGDMTNMRKFLANLRKKGFAFALDDFGSGYNSF HYLRELHFEFVKIDGAFVRSIVESPIDRALVRNLTNLCKEIGILTVAEFVESEEILEMLREMGIDYVQGYHIGMPLPQMP DVA >Mature_722_residues PPLIPPTSQDSTLTPHRLRVIGELARLLESQTRTRYFSEALVGTTRAMHAEAMERLTSAIGETSSPHFVETLEQARDICG RLSDSFDLLLAERNRQSAASHEGLRDLISEFDDMFTLLSNTLVERELLERQSQVLEQIILSHERIGQWKAFVQAILSDFH AIFPFDLFFVAFAEENGLMLNVYYFGECDEAYRLATRQQLARSMIESLGLPADSPLDYEEFQVQRNQSIRPVRPEQMITV RVPEHSSQLAGLLGVTFLSSQELNAREESVIRSILAVMVMVVGSSKVLSRTLAELEYYSMHDPLTGLYNRRHFNNMLEYE IGRSERHEHEFALLLLDLDDFKDVNDSYGHPTGDSVLVRVAEILRGHIRKGDLATRIGGDEFAIMLMETGTEGAVSVAEK LGAALRATSFESPNGKRFHITTSIGVVVYPRDARTEHDLLAGVDIAMYRAKELGKDSACTLASMPGQLKATRVTRDYAEK LREALRENRMVPYFQPIVDCHTGVPFACETVARLKEKTGETIAAGAFIDTIEKYGLGRELDRVIIRQTLEAAAARARTGA PPLRVFINLSAQEIQGRGILGYAEELCNELGIPPSQVVFEILERDAIGDMTNMRKFLANLRKKGFAFALDDFGSGYNSFH YLRELHFEFVKIDGAFVRSIVESPIDRALVRNLTNLCKEIGILTVAEFVESEEILEMLREMGIDYVQGYHIGMPLPQMPD VA
Specific function: Unknown
COG id: COG5001
COG function: function code T; Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain
Gene ontology:
Cell location: Cell membrane; Multi-pass membrane protein (Potential) [H]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 MHYT domain [H]
Homologues:
Organism=Escherichia coli, GI1787541, Length=415, Percent_Identity=26.9879518072289, Blast_Score=154, Evalue=2e-38
Organism=Escherichia coli, GI1788849, Length=238, Percent_Identity=33.1932773109244, Blast_Score=134, Evalue=2e-32
Organism=Escherichia coli, GI87081921, Length=418, Percent_Identity=26.555023923445, Blast_Score=130, Evalue=4e-31
Organism=Escherichia coli, GI1790496, Length=248, Percent_Identity=28.2258064516129, Blast_Score=120, Evalue=2e-28
Organism=Escherichia coli, GI87082096, Length=236, Percent_Identity=32.6271186440678, Blast_Score=119, Evalue=6e-28
Organism=Escherichia coli, GI87081845, Length=237, Percent_Identity=30.8016877637131, Blast_Score=117, Evalue=3e-27
Organism=Escherichia coli, GI1786507, Length=238, Percent_Identity=29.8319327731092, Blast_Score=116, Evalue=4e-27
Organism=Escherichia coli, GI1788381, Length=432, Percent_Identity=23.3796296296296, Blast_Score=115, Evalue=1e-26
Organism=Escherichia coli, GI1786584, Length=164, Percent_Identity=34.1463414634146, Blast_Score=111, Evalue=2e-25
Organism=Escherichia coli, GI226510982, Length=438, Percent_Identity=23.972602739726, Blast_Score=107, Evalue=2e-24
Organism=Escherichia coli, GI87081743, Length=250, Percent_Identity=26.8, Blast_Score=107, Evalue=3e-24
Organism=Escherichia coli, GI1787262, Length=158, Percent_Identity=34.1772151898734, Blast_Score=106, Evalue=6e-24
Organism=Escherichia coli, GI87081881, Length=185, Percent_Identity=36.7567567567568, Blast_Score=102, Evalue=7e-23
Organism=Escherichia coli, GI145693134, Length=210, Percent_Identity=30, Blast_Score=98, Evalue=1e-21
Organism=Escherichia coli, GI1788502, Length=237, Percent_Identity=27.0042194092827, Blast_Score=97, Evalue=5e-21
Organism=Escherichia coli, GI1788956, Length=165, Percent_Identity=37.5757575757576, Blast_Score=86, Evalue=7e-18
Organism=Escherichia coli, GI1788085, Length=161, Percent_Identity=31.055900621118, Blast_Score=85, Evalue=1e-17
Organism=Escherichia coli, GI87082007, Length=165, Percent_Identity=30.3030303030303, Blast_Score=85, Evalue=1e-17
Organism=Escherichia coli, GI87081980, Length=250, Percent_Identity=26.4, Blast_Score=84, Evalue=3e-17
Organism=Escherichia coli, GI87081977, Length=154, Percent_Identity=29.8701298701299, Blast_Score=76, Evalue=8e-15
Organism=Escherichia coli, GI1787802, Length=219, Percent_Identity=28.310502283105, Blast_Score=75, Evalue=2e-14
Organism=Escherichia coli, GI87081974, Length=144, Percent_Identity=27.7777777777778, Blast_Score=74, Evalue=4e-14
Organism=Escherichia coli, GI1787055, Length=244, Percent_Identity=26.2295081967213, Blast_Score=67, Evalue=4e-12
Organism=Escherichia coli, GI1787816, Length=157, Percent_Identity=31.8471337579618, Blast_Score=65, Evalue=1e-11
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001054
- InterPro: IPR000160
- InterPro: IPR001633
- InterPro: IPR005330 [H]
Pfam domain/function: PF00563 EAL; PF00990 GGDEF; PF03707 MHYT [H]
EC number: NA
Molecular weight: Translated: 81239; Mature: 81108
Theoretical pI: Translated: 5.04; Mature: 5.04
Prosite motif: PS50883 EAL ; PS50887 GGDEF
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.0 %Cys (Translated Protein)
3.0 %Met (Translated Protein)
4.0 %Cys+Met (Translated Protein)
1.0 %Cys (Mature Protein)
2.9 %Met (Mature Protein)
3.9 %Cys+Met (Mature Protein)
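Percentages like these can be derived by counting residues in the one-letter protein sequence. The following is a minimal sketch, not the database's own procedure: the function name `cys_met_content` and the short toy sequence are hypothetical, and rounding to one decimal place is an assumption based on how the values are printed here. The mature form is modelled as the translated sequence without its initiator methionine, which is how the mature sequence in this record differs from the translated one.

```python
def cys_met_content(protein: str) -> dict:
    """Return %Cys, %Met and %Cys+Met for a one-letter protein sequence."""
    seq = protein.replace(" ", "").upper()
    if not seq:
        raise ValueError("empty sequence")
    n = len(seq)
    cys = seq.count("C")
    met = seq.count("M")
    return {
        "Cys": round(100.0 * cys / n, 1),
        "Met": round(100.0 * met / n, 1),
        "Cys+Met": round(100.0 * (cys + met) / n, 1),
    }


# Toy sequence for illustration only (not the protein in this record).
translated = "MKCAAAAAAA"
mature = translated[1:]  # drop the initiator Met

print(cys_met_content(translated))
print(cys_met_content(mature))
```

To apply this to the record, pass the full 723-residue translated sequence and the 722-residue mature sequence in place of the toy string.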
Secondary structure:
>Translated Secondary Structure MPPLIPPTSQDSTLTPHRLRVIGELARLLESQTRTRYFSEALVGTTRAMHAEAMERLTSA CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH IGETSSPHFVETLEQARDICGRLSDSFDLLLAERNRQSAASHEGLRDLISEFDDMFTLLS HCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHH NTLVERELLERQSQVLEQIILSHERIGQWKAFVQAILSDFHAIFPFDLFFVAFAEENGLM HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEECCCCEE LNVYYFGECDEAYRLATRQQLARSMIESLGLPADSPLDYEEFQVQRNQSIRPVRPEQMIT EEEEEECCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCCCCCCCCCEEE VRVPEHSSQLAGLLGVTFLSSQELNAREESVIRSILAVMVMVVGSSKVLSRTLAELEYYS EECCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH MHDPLTGLYNRRHFNNMLEYEIGRSERHEHEFALLLLDLDDFKDVNDSYGHPTGDSVLVR HCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECHHHCCCCCCCCCCCCHHHHHH VAEILRGHIRKGDLATRIGGDEFAIMLMETGTEGAVSVAEKLGAALRATSFESPNGKRFH HHHHHHHHCCCCCHHHHCCCCCEEEEEEECCCCHHHHHHHHHHHHHHHCCCCCCCCCEEE ITTSIGVVVYPRDARTEHDLLAGVDIAMYRAKELGKDSACTLASMPGQLKATRVTRDYAE EEEECEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCHHCCCCCHHHHHHHHHHHH KLREALRENRMVPYFQPIVDCHTGVPFACETVARLKEKTGETIAAGAFIDTIEKYGLGRE HHHHHHHHCCCCCHHHHHHHHCCCCCHHHHHHHHHHHHCCCEEEHHHHHHHHHHHCCCHH LDRVIIRQTLEAAAARARTGAPPLRVFINLSAQEIQGRGILGYAEELCNELGIPPSQVVF HHHHHHHHHHHHHHHHHCCCCCCEEEEEECCHHHHCCCCCHHHHHHHHHHHCCCHHHHHH EILERDAIGDMTNMRKFLANLRKKGFAFALDDFGSGYNSFHYLRELHFEFVKIDGAFVRS HHHHHHHCCCHHHHHHHHHHHHHCCCEEEEHHHCCCHHHHHHHHHHHHHHHEECHHHHHH IVESPIDRALVRNLTNLCKEIGILTVAEFVESEEILEMLREMGIDYVQGYHIGMPLPQMP HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHCCCHHHCCEECCCCCCCCC DVA CCC >Mature Secondary Structure PPLIPPTSQDSTLTPHRLRVIGELARLLESQTRTRYFSEALVGTTRAMHAEAMERLTSA CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH IGETSSPHFVETLEQARDICGRLSDSFDLLLAERNRQSAASHEGLRDLISEFDDMFTLLS HCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHH NTLVERELLERQSQVLEQIILSHERIGQWKAFVQAILSDFHAIFPFDLFFVAFAEENGLM HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEECCCCEE LNVYYFGECDEAYRLATRQQLARSMIESLGLPADSPLDYEEFQVQRNQSIRPVRPEQMIT 
EEEEEECCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCCCCCCCCCEEE VRVPEHSSQLAGLLGVTFLSSQELNAREESVIRSILAVMVMVVGSSKVLSRTLAELEYYS EECCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH MHDPLTGLYNRRHFNNMLEYEIGRSERHEHEFALLLLDLDDFKDVNDSYGHPTGDSVLVR HCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECHHHCCCCCCCCCCCCHHHHHH VAEILRGHIRKGDLATRIGGDEFAIMLMETGTEGAVSVAEKLGAALRATSFESPNGKRFH HHHHHHHHCCCCCHHHHCCCCCEEEEEEECCCCHHHHHHHHHHHHHHHCCCCCCCCCEEE ITTSIGVVVYPRDARTEHDLLAGVDIAMYRAKELGKDSACTLASMPGQLKATRVTRDYAE EEEECEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCHHCCCCCHHHHHHHHHHHH KLREALRENRMVPYFQPIVDCHTGVPFACETVARLKEKTGETIAAGAFIDTIEKYGLGRE HHHHHHHHCCCCCHHHHHHHHCCCCCHHHHHHHHHHHHCCCEEEHHHHHHHHHHHCCCHH LDRVIIRQTLEAAAARARTGAPPLRVFINLSAQEIQGRGILGYAEELCNELGIPPSQVVF HHHHHHHHHHHHHHHHHCCCCCCEEEEEECCHHHHCCCCCHHHHHHHHHHHCCCHHHHHH EILERDAIGDMTNMRKFLANLRKKGFAFALDDFGSGYNSFHYLRELHFEFVKIDGAFVRS HHHHHHHCCCHHHHHHHHHHHHHCCCEEEEHHHCCCHHHHHHHHHHHHHHHEECHHHHHH IVESPIDRALVRNLTNLCKEIGILTVAEFVESEEILEMLREMGIDYVQGYHIGMPLPQMP HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHCCCHHHCCEECCCCCCCCC DVA CCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 10984043 [H]