Definition | Xanthomonas oryzae pv. oryzae MAFF 311018, complete genome. |
---|---|
Accession | NC_007705 |
Length | 4,940,217 |
Click here to switch to the map view.
The map label for this gene is yegE [H]
Identifier: 84625449
GI number: 84625449
Start: 4287131
End: 4289755
Strand: Reverse
Name: yegE [H]
Synonym: XOO_3792
Alternate gene names: 84625449
Gene position: 4289755-4287131 (Counterclockwise)
Preceding gene: 84625452
Following gene: 84625445
Centisome position: 86.83
GC content: 66.02
Gene sequence:
>2625_bases ATGACCGTCCTGTTGCCGCCCGTGCCGATCCCCGTCGACGATGCCTTGCGCGTGGACGCGGTGCGCCGGCTCGGGGTACT GGACACCGAGGCCGAGGCCGAATTCGACGACATCGCCTGGCTGGCCGCGCACGTCACCGGCGCGCCGGTGGCACTGGTGT CGCTGCTGGATGCCGACCGCCAGTGGTTCAAGGCGCGTTGCGGCAGCGACCTGGAAGGCACTCCGCGCAGCGCCTCGTTT TGCTCCCACGCGATCATGGGCACCGAGTTGATGGAAGTGCCCGATGCCGAAGCCGACCCGCGCTTCGTCAACAACCCGTT GGTGGTCGGCGCCCCCGGCGTGCGCTCGTACGTGGGGGTACCGTTGATCGGGCGCGAAGGCTATGCCTACGGGACCTTGT GCACGCTCAGCACCAGGCCGCGCGTGCTCGACGAACAGCAGAAGCAGGCGTTGATCCGCCTTGCGCGCCAGGCTGCCAAA CAGTTGGAAGCGCGCCGCGACCGCCTGGAAGCGCAGGCGCAGCGGCAGACCTTGAGCATGTTGCTGGAAGCCATGCCCGA CGGCGTGGTGGCCTGCGGCACCGACGGATTGCTGCGCGAGTTCAATCACGCCGCCCGGCAATGGCATGGCACCGATCCGC GCCTGTTGCCACCGGAGCAATGGGCGTTGCATTTCGATCTGTACACGGCCGACGGCACTCACCTGCTGCCGACCGAAGCG ATTCCGCTGCTGCGCGCCTGGCGCGGCGAACACGTGCGCAATGCCGAAATGGTGATCCGTGCCACCGGCCAGCCGCCGCG CAGCGTGCTGTGCAACGCAGACCCGGTGGTCGGCGACAACGGCACGGCCTTGGGCGCGGTCTGCGTGATGCACGACATCA CCCAGCTCAAGGATGCGTCGGCGGCACTGGCCGGCGAACGCGCGCGGCTGCAGGCACTGGTGGACGCGTCGCAAGATGTG GCCATCATGGCGTTCGACCCGCAGGGACGTCTGGAACTGTTCAATCCCGGTGCCGAGCGCCTGCTCGGTTACAGTGCCGA CGAGGTGCTGGGCACCTGTCCGGCGCAGTTCCATCTACCGGCCGAGCTTGAGCGCCACATGGCGACGCTGGCAGTGGCGC CATTCTCATACGCGACGCTGGCTGCCGCCGCCGCTGGCGATGTGCTGGCCGAGGAACTCTGGACACTGGTACGCAAGGAC GGCGATCTGCGGCGCGTGCGCCTGTGTTTCAACGTCATCCACGATGCGCAGCAGGGCCTGGCCGGATATCTGGCAATGGC CATCGACGTCACTGCCGAATTGCAGGCGCAAGCGGCCGCGCAGCTGGCCGCCGAGCGCTTCACCGGTGCATTCGAGACAG CGCCGCAAGGCATGGCGATCGTGTCGCTGGAAGGTGAGTGGTGCGACGTCAATCCGGCGTTGTGCGGCATCCTGGGCTAT GCGCGCGAGCAATTGTTGCGCACGACTTTCCAGCAGATTACCCATCCCGACGATCTGGAGATGGACCTGCAACTCGTGCA GGATCTCGTCGACGGAAAACGCGCCAGTTACAGCCTGGCCAAGCGCTACATCAGCCACCAGGGCGCGGTGATCTGGGCGC AGTTGTCGGTATCGCTGGTGCGCGACAGCAGCGGTGCGCCGGTGCATTTCGTTTCGCAGATTCAAGACGTGACCGAGCGC CACGTGGCGGCCGAACGGCTGGCCGAAAGCGAGGCGCGTCTGCGCGCGATCAGCGATGCCACGCCCACGTTGGTTGCCCA GTTCGATGCCGGGCAGCGCTACCTGTTCGCCAATAAAGCGCATCGCGCCTGGCTCGGTATGGAACCTGCCAGCCTGGTCG GTCGCCATATCACCCACGTGTTGGGCGAAGACCTCAGCGCCGCCGCGCGCGCTGCATTGGTACAGGCGGTGGCCGGGCAG CGCGCCAGCTTCGAACATGTCTTGCATGGCGGCACCAACCCGCGCGATGTGGAAGTGAGCCTGGTGCCGGAAATGCACGG CACTGCGACGCAGGCGCAGGGTTTCTTCCTGATGGCGCACGATGTCACTGCGCACAAGACACTGCACCGTTTGATGCACC AACGCGCTACCCGTGATGCGTTGACCGGACTGCCGAACCGGCATGCCTGGTCCGAAGCGCTGCAAGTGGCAATGACACAG GCGCATCAACAGCAGCTCGCGGTGGCGGTGATGTTCCTGGATTTGGATGGCTTCAAACGCATCAACGACATCTACAGCCA CCGTGCCGGCGATGCAGTGTTGGTGGCGTTCGGCCATTGCCTGCAGCGCGCGGCTGGTGAGCGCTATTTGGTGGCGCGCT TGGCCGGCGACGAATTCGTGGTGTTGCTGGATGCGCTGCAGGATCCGCAAGCCGAATGCGCCACGATGGCAGATCGCATC CGCGCGCTTGCGGCGGAAGGCGCCATGTTCGGCGATCAGCATCTACCGATCCAGCCCAGCATCGGTGTGGCCTGGCAACA CGGCGCACAGGCCGAAGCGGCCAGCTTGATGCATGCCGCTGACGAGGCGATGTATGCGGCCAAACGCGCACGGCGGCCAG CTGAGCAGGCCGTTGCGGCAGTAAGCGTTGTGGGTTGCTGGTGCAAGCCAGTGCGTTGCTGCTGA
Upstream 100 bases:
>100_bases TTTTCCGAGCGAGCTCGGCAGCACGCTGCTCAAGTTCCTCGTCGCCTCGCCGTTACTTGGGGCGACCTATCCTGACCAGA CGCTACCCGCCGATCACGCC
Downstream 100 bases:
>100_bases CGCGTGCGCACGGCTGGCGCGAATCGCGAACTTCCAACTCTCAACCCCTGCAGCGTTCGCGTTGTTACAACGCAACGATG CGTGCGTTCTTGCCAGGGAA
Product: histidine kinase-response regulator hybrid protein
Products: NA
Alternate protein names: DGC [H]
Number of amino acids: Translated: 874; Mature: 873
Protein sequence:
>874_residues MTVLLPPVPIPVDDALRVDAVRRLGVLDTEAEAEFDDIAWLAAHVTGAPVALVSLLDADRQWFKARCGSDLEGTPRSASF CSHAIMGTELMEVPDAEADPRFVNNPLVVGAPGVRSYVGVPLIGREGYAYGTLCTLSTRPRVLDEQQKQALIRLARQAAK QLEARRDRLEAQAQRQTLSMLLEAMPDGVVACGTDGLLREFNHAARQWHGTDPRLLPPEQWALHFDLYTADGTHLLPTEA IPLLRAWRGEHVRNAEMVIRATGQPPRSVLCNADPVVGDNGTALGAVCVMHDITQLKDASAALAGERARLQALVDASQDV AIMAFDPQGRLELFNPGAERLLGYSADEVLGTCPAQFHLPAELERHMATLAVAPFSYATLAAAAAGDVLAEELWTLVRKD GDLRRVRLCFNVIHDAQQGLAGYLAMAIDVTAELQAQAAAQLAAERFTGAFETAPQGMAIVSLEGEWCDVNPALCGILGY AREQLLRTTFQQITHPDDLEMDLQLVQDLVDGKRASYSLAKRYISHQGAVIWAQLSVSLVRDSSGAPVHFVSQIQDVTER HVAAERLAESEARLRAISDATPTLVAQFDAGQRYLFANKAHRAWLGMEPASLVGRHITHVLGEDLSAAARAALVQAVAGQ RASFEHVLHGGTNPRDVEVSLVPEMHGTATQAQGFFLMAHDVTAHKTLHRLMHQRATRDALTGLPNRHAWSEALQVAMTQ AHQQQLAVAVMFLDLDGFKRINDIYSHRAGDAVLVAFGHCLQRAAGERYLVARLAGDEFVVLLDALQDPQAECATMADRI RALAAEGAMFGDQHLPIQPSIGVAWQHGAQAEAASLMHAADEAMYAAKRARRPAEQAVAAVSVVGCWCKPVRCC
Sequences:
>Translated_874_residues MTVLLPPVPIPVDDALRVDAVRRLGVLDTEAEAEFDDIAWLAAHVTGAPVALVSLLDADRQWFKARCGSDLEGTPRSASF CSHAIMGTELMEVPDAEADPRFVNNPLVVGAPGVRSYVGVPLIGREGYAYGTLCTLSTRPRVLDEQQKQALIRLARQAAK QLEARRDRLEAQAQRQTLSMLLEAMPDGVVACGTDGLLREFNHAARQWHGTDPRLLPPEQWALHFDLYTADGTHLLPTEA IPLLRAWRGEHVRNAEMVIRATGQPPRSVLCNADPVVGDNGTALGAVCVMHDITQLKDASAALAGERARLQALVDASQDV AIMAFDPQGRLELFNPGAERLLGYSADEVLGTCPAQFHLPAELERHMATLAVAPFSYATLAAAAAGDVLAEELWTLVRKD GDLRRVRLCFNVIHDAQQGLAGYLAMAIDVTAELQAQAAAQLAAERFTGAFETAPQGMAIVSLEGEWCDVNPALCGILGY AREQLLRTTFQQITHPDDLEMDLQLVQDLVDGKRASYSLAKRYISHQGAVIWAQLSVSLVRDSSGAPVHFVSQIQDVTER HVAAERLAESEARLRAISDATPTLVAQFDAGQRYLFANKAHRAWLGMEPASLVGRHITHVLGEDLSAAARAALVQAVAGQ RASFEHVLHGGTNPRDVEVSLVPEMHGTATQAQGFFLMAHDVTAHKTLHRLMHQRATRDALTGLPNRHAWSEALQVAMTQ AHQQQLAVAVMFLDLDGFKRINDIYSHRAGDAVLVAFGHCLQRAAGERYLVARLAGDEFVVLLDALQDPQAECATMADRI RALAAEGAMFGDQHLPIQPSIGVAWQHGAQAEAASLMHAADEAMYAAKRARRPAEQAVAAVSVVGCWCKPVRCC >Mature_873_residues TVLLPPVPIPVDDALRVDAVRRLGVLDTEAEAEFDDIAWLAAHVTGAPVALVSLLDADRQWFKARCGSDLEGTPRSASFC SHAIMGTELMEVPDAEADPRFVNNPLVVGAPGVRSYVGVPLIGREGYAYGTLCTLSTRPRVLDEQQKQALIRLARQAAKQ LEARRDRLEAQAQRQTLSMLLEAMPDGVVACGTDGLLREFNHAARQWHGTDPRLLPPEQWALHFDLYTADGTHLLPTEAI PLLRAWRGEHVRNAEMVIRATGQPPRSVLCNADPVVGDNGTALGAVCVMHDITQLKDASAALAGERARLQALVDASQDVA IMAFDPQGRLELFNPGAERLLGYSADEVLGTCPAQFHLPAELERHMATLAVAPFSYATLAAAAAGDVLAEELWTLVRKDG DLRRVRLCFNVIHDAQQGLAGYLAMAIDVTAELQAQAAAQLAAERFTGAFETAPQGMAIVSLEGEWCDVNPALCGILGYA REQLLRTTFQQITHPDDLEMDLQLVQDLVDGKRASYSLAKRYISHQGAVIWAQLSVSLVRDSSGAPVHFVSQIQDVTERH VAAERLAESEARLRAISDATPTLVAQFDAGQRYLFANKAHRAWLGMEPASLVGRHITHVLGEDLSAAARAALVQAVAGQR ASFEHVLHGGTNPRDVEVSLVPEMHGTATQAQGFFLMAHDVTAHKTLHRLMHQRATRDALTGLPNRHAWSEALQVAMTQA HQQQLAVAVMFLDLDGFKRINDIYSHRAGDAVLVAFGHCLQRAAGERYLVARLAGDEFVVLLDALQDPQAECATMADRIR ALAAEGAMFGDQHLPIQPSIGVAWQHGAQAEAASLMHAADEAMYAAKRARRPAEQAVAAVSVVGCWCKPVRCC
Specific function: Cyclic-di-GMP is a second messenger which controls cell surface-associated traits in bacteria [H]
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 2 PAS (PER-ARNT-SIM) domains [H]
Homologues:
Organism=Escherichia coli, GI1788381, Length=113, Percent_Identity=48.6725663716814, Blast_Score=124, Evalue=3e-29, Organism=Escherichia coli, GI87081977, Length=157, Percent_Identity=33.7579617834395, Blast_Score=89, Evalue=1e-18, Organism=Escherichia coli, GI87081881, Length=216, Percent_Identity=33.3333333333333, Blast_Score=85, Evalue=2e-17, Organism=Escherichia coli, GI1787802, Length=200, Percent_Identity=31, Blast_Score=84, Evalue=5e-17, Organism=Escherichia coli, GI1787541, Length=193, Percent_Identity=30.5699481865285, Blast_Score=82, Evalue=2e-16, Organism=Escherichia coli, GI1786584, Length=170, Percent_Identity=27.0588235294118, Blast_Score=73, Evalue=6e-14, Organism=Escherichia coli, GI145693134, Length=207, Percent_Identity=26.5700483091787, Blast_Score=72, Evalue=2e-13, Organism=Escherichia coli, GI1788956, Length=166, Percent_Identity=34.3373493975904, Blast_Score=71, Evalue=3e-13, Organism=Escherichia coli, GI1787816, Length=163, Percent_Identity=31.2883435582822, Blast_Score=67, Evalue=4e-12, Organism=Escherichia coli, GI1787262, Length=181, Percent_Identity=29.2817679558011, Blast_Score=67, Evalue=5e-12,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001054 - InterPro: IPR000160 - InterPro: IPR001633 - InterPro: IPR007895 - InterPro: IPR001610 - InterPro: IPR000014 - InterPro: IPR000700 - InterPro: IPR013656 - InterPro: IPR013655 [H]
Pfam domain/function: PF00563 EAL; PF00990 GGDEF; PF05231 MASE1; PF08447 PAS_3; PF08448 PAS_4 [H]
EC number: =2.7.7.65 [H]
Molecular weight: Translated: 94769; Mature: 94638
Theoretical pI: Translated: 5.89; Mature: 5.89
Prosite motif: PS50112 PAS ; PS50113 PAC ; PS50887 GGDEF
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.8 %Cys (Translated Protein) 2.5 %Met (Translated Protein) 4.3 %Cys+Met (Translated Protein) 1.8 %Cys (Mature Protein) 2.4 %Met (Mature Protein) 4.2 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MTVLLPPVPIPVDDALRVDAVRRLGVLDTEAEAEFDDIAWLAAHVTGAPVALVSLLDADR CEEECCCCCCCCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHH QWFKARCGSDLEGTPRSASFCSHAIMGTELMEVPDAEADPRFVNNPLVVGAPGVRSYVGV HHHHHHCCCCCCCCCCCHHHHHHHHHCCHHHHCCCCCCCCCCCCCCEEEECCCHHHHCCC PLIGREGYAYGTLCTLSTRPRVLDEQQKQALIRLARQAAKQLEARRDRLEAQAQRQTLSM CEECCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH LLEAMPDGVVACGTDGLLREFNHAARQWHGTDPRLLPPEQWALHFDLYTADGTHLLPTEA HHHHCCCCEEEECCCHHHHHHHHHHHHCCCCCCCCCCCCCEEEEEEEEECCCCEECCHHH IPLLRAWRGEHVRNAEMVIRATGQPPRSVLCNADPVVGDNGTALGAVCVMHDITQLKDAS HHHHHHHCCCCCCCCEEEEEECCCCCHHEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHH AALAGERARLQALVDASQDVAIMAFDPQGRLELFNPGAERLLGYSADEVLGTCPAQFHLP HHHHCHHHHHHHHHCCCCCEEEEEECCCCCEEEECCCHHHHHCCCHHHHHHCCCCCCCCC AELERHMATLAVAPFSYATLAAAAAGDVLAEELWTLVRKDGDLRRVRLCFNVIHDAQQGL HHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHH AGYLAMAIDVTAELQAQAAAQLAAERFTGAFETAPQGMAIVSLEGEWCDVNPALCGILGY HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHCCCCCEEEEEECCCCCCCCHHHHHHHHH AREQLLRTTFQQITHPDDLEMDLQLVQDLVDGKRASYSLAKRYISHQGAVIWAQLSVSLV HHHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCEEEEEEEEEEE RDSSGAPVHFVSQIQDVTERHVAAERLAESEARLRAISDATPTLVAQFDAGQRYLFANKA ECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCEEEEECCC HRAWLGMEPASLVGRHITHVLGEDLSAAARAALVQAVAGQRASFEHVLHGGTNPRDVEVS HHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHCCCCCCCEEEEE LVPEMHGTATQAQGFFLMAHDVTAHKTLHRLMHQRATRDALTGLPNRHAWSEALQVAMTQ ECCCCCCCCCCCCCEEEEEECHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHH AHQQQLAVAVMFLDLDGFKRINDIYSHRAGDAVLVAFGHCLQRAAGERYLVARLAGDEFV HHHHHHHHHHEEECCCHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCEEEEEEECCCCEE VLLDALQDPQAECATMADRIRALAAEGAMFGDQHLPIQPSIGVAWQHGAQAEAASLMHAA EEEEHHCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEECCCCHHHHHHHHHHH DEAMYAAKRARRPAEQAVAAVSVVGCWCKPVRCC HHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCC >Mature Secondary Structure TVLLPPVPIPVDDALRVDAVRRLGVLDTEAEAEFDDIAWLAAHVTGAPVALVSLLDADR EEECCCCCCCCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHH QWFKARCGSDLEGTPRSASFCSHAIMGTELMEVPDAEADPRFVNNPLVVGAPGVRSYVGV HHHHHHCCCCCCCCCCCHHHHHHHHHCCHHHHCCCCCCCCCCCCCCEEEECCCHHHHCCC PLIGREGYAYGTLCTLSTRPRVLDEQQKQALIRLARQAAKQLEARRDRLEAQAQRQTLSM CEECCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH LLEAMPDGVVACGTDGLLREFNHAARQWHGTDPRLLPPEQWALHFDLYTADGTHLLPTEA HHHHCCCCEEEECCCHHHHHHHHHHHHCCCCCCCCCCCCCEEEEEEEEECCCCEECCHHH IPLLRAWRGEHVRNAEMVIRATGQPPRSVLCNADPVVGDNGTALGAVCVMHDITQLKDAS HHHHHHHCCCCCCCCEEEEEECCCCCHHEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHH AALAGERARLQALVDASQDVAIMAFDPQGRLELFNPGAERLLGYSADEVLGTCPAQFHLP HHHHCHHHHHHHHHCCCCCEEEEEECCCCCEEEECCCHHHHHCCCHHHHHHCCCCCCCCC AELERHMATLAVAPFSYATLAAAAAGDVLAEELWTLVRKDGDLRRVRLCFNVIHDAQQGL HHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHH AGYLAMAIDVTAELQAQAAAQLAAERFTGAFETAPQGMAIVSLEGEWCDVNPALCGILGY HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHCCCCCEEEEEECCCCCCCCHHHHHHHHH AREQLLRTTFQQITHPDDLEMDLQLVQDLVDGKRASYSLAKRYISHQGAVIWAQLSVSLV HHHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCEEEEEEEEEEE RDSSGAPVHFVSQIQDVTERHVAAERLAESEARLRAISDATPTLVAQFDAGQRYLFANKA ECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCEEEEECCC HRAWLGMEPASLVGRHITHVLGEDLSAAARAALVQAVAGQRASFEHVLHGGTNPRDVEVS HHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHCCCCCCCEEEEE LVPEMHGTATQAQGFFLMAHDVTAHKTLHRLMHQRATRDALTGLPNRHAWSEALQVAMTQ ECCCCCCCCCCCCCEEEEEECHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHH AHQQQLAVAVMFLDLDGFKRINDIYSHRAGDAVLVAFGHCLQRAAGERYLVARLAGDEFV HHHHHHHHHHEEECCCHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCEEEEEEECCCCEE VLLDALQDPQAECATMADRIRALAAEGAMFGDQHLPIQPSIGVAWQHGAQAEAASLMHAA EEEEHHCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEECCCCHHHHHHHHHHH DEAMYAAKRARRPAEQAVAAVSVVGCWCKPVRCC HHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9097040; 9278503; 6094528; 7984428 [H]