Definition | Acidovorax citrulli AAC00-1 chromosome, complete genome. |
---|---|
Accession | NC_008752 |
Length | 5,352,772 |
Click here to switch to the map view.
The map label for this gene is yegE [H]
Identifier: 120609817
GI number: 120609817
Start: 1235598
End: 1237826
Strand: Direct
Name: yegE [H]
Synonym: Aave_1127
Alternate gene names: 120609817
Gene position: 1235598-1237826 (Clockwise)
Preceding gene: 120609815
Following gene: 120609818
Centisome position: 23.08
GC content: 74.16
Gene sequence:
>2229_bases ATGAGCGCGCCGCCCGCTCCGCCCGCGGTGGCCGGGCGCGGCCGCATGCGTGCCTTCGCCACGGTGCACGACGGGCGCGA GGCGCACTGGCACATGCCCGGTCCGCTGGTGCTGCGGATGATCGCCGCTGCCGTGGCCGCCGTGCTGCTGGCGGGAGGAC TGGCCGCGTGGATGGTGGCGCGCTCCGCCCAGCACGAGGCGCTGCAGCGCCTCATGGAGCGGCAGACCGACGAAGTCGAA GCGGTCGCGCTGCTGCTGTCGGGCAAGATCGAGCAGAACCAGCGCGTGCTGGCCACGCTGGCCGAAAGCATCCCCGCCCC GCTGCCCGAGGCGCCCGGAGCGCTCGATGCCCTGCTGGGCCAGGGCCTGCCCGCGGCGCGCTTCTTCGACGCCCTGCAGA TCGCGCGGCAGGACGGCAGCCTGGGTCTGTATTGGCAGAACGGCCGGCTGCATCCCGGCGCGCAACTCGATCCTGCCGAG CGCGACCAGTTGCGGCGCACGCTGCTGGAAGGCAAGCCGCTCCTGTCCGGCGTGATCGGCAACGGCCCCGCCGACGGGCG CATCATGTTCAGCCTGCCCTTGCGCACGGCCGGAGACGGCGGCGGCGTGCAGGGTGCGCTGGGCGCCACGCTGCGGCTGC AGTCGCAGGGGCTGTTGCCGCCATCGATGGCGCTGCCGGCACGGGCCGGATCGAGGCTGGTGGTCTTCGCACGCGATGGA ACCATCCTCGCGCATCCCGACCCGGCCCGCGTGCTCGGGCAGGTGCGGGACGAGGCCGGGCTGGCGCAGGTGTACGAGCG GGCGCTGCAGGGCGGACAGGCGGCAGTGGCCGGGCGCGGATTCTCGGACGTGGTCGCCGGGCAGGTGGTGAGCGTGGCGG GCGTGCCGCTGCCCCAGTGGCTGGTGGCGCGCGTGAGCGATGCGCGCGCCTCGGATCTGGACCTGCTGGCCGGCGGTGGC CAGCGTGGACTGGGGGGCGCCGTGGCCGCCACGGCCTGCGCGGCGCTGCTGGCGGCCATCGCCATCGCCGTGCTGGCCCA GCCCCTGGCACGGCTGTGCCGGCGCGTGGCGCGCGCCCATCCCACGGGCATCGCCGACGATGCGCCCTGGCCGGCGGGAG CCGGCGAGGTGGACGCTCTCGCGCAGGTCTGCCGGGCGCTGGTGGACGATTGCCGCATCGCCCGGCACCGGGCGGGCGTG TTGCAGGACCAGTTCCAGGCCGTGCTGGACCATGCCCCCGCCGGCATCGTCATCACCCGCGCCGGGCAGATGGTGGTGGT CGGGCTGGAGGCCTGCCGCCTGCTCGGCTATGAACCGCATGAACTGCAGGGGCAGCCCGCGCGGCTGCTCTACGCGAGCG ATGCGGCCTATGCCGAGCTGGGGCGGCGTGTGCGTGCCGAGTTCGCCGCGCACGGTGCCTTCGACGGAGATGTCTGCTTC GTCCGCAAGGACGGCGGAGCGGTCTGGGCGCGGGTGCTCGGCCGTGCCGTGCGCGACGGCGCCGTGGCGGAAGGGGGCGG CACGGTATGGATGCTGGAGGACCTGACGGCAGCGCGTGAGGCGCGGCGCCAGCAGGACTGGCCCGCGGGGCATGATTCCC TCACGCAGCTCGCCAACCGCGACGCGTTCCAGCAGCGCCTGCACGCCCTGCTCGCGGACCATGCGAAACGCTCCATGGTG GTGCGCGCCTCGGTCCGAAATGGTGCATTGCCTTCGGAGTCCCTGCAGGAGTCCCCCGGGCGTGCGGAGGCCGTGGCACC ACCTCCTGCACCGGAATCGGGCGTTTTGCTGTTCCTCGATCTCGACCATTTCACGGTGGTGAACGCGATCGCCGGGCACG ACGCGGGCGACGACGTGCTGCGCCACGTGGGCCGGCTCATCGAGACGCAGGTGCGGCATGTGGGGTGGGCCGCCCGGCTG GGCGGGGACGAGTTCGCGGTGGTGCTGCCCGGCTGCGCCCTGCCGCGTGGCCTGGCGATCGCCACCCAGCTCTGCGCGGC CGTGCACGCCTGGGAGCCGGCCTATGCCGGCCGCACCTATACGCTGGGCGTGAGCATCGGGCTCGTGCCGCTCGATGGCG GCGCCTACGACGTGCCCACCGTGCTGCATGCGGCCGACATGGCCTGCTACGAAGCCAAGCGCGCGGGCCGCAACCGCGTC GAGGTGCGGTGGCTGCGGCCGGGCCGTCCGCACGAGAGCACGACGGCGGACGGCGACGTGCAGGCCTGA
Upstream 100 bases:
>100_bases CTGTCCCGGGTATGTATTCCCCCGCAACGAACCGCCCGCTCCGCATGCTCCTTCGCTCCCGCTGTTCCCGGCCGATTCGC GCGCTGAGCGTGGGGGGCGC
Downstream 100 bases:
>100_bases CCACTGGCGCGGCAGCAGGGCCTCCCGAGCCTTGCCGCGGAGCGGCCGTCTTGTCCATGACGGCGGTGGAAACATTCTGT TCTTCGGAACACGACAACAT
Product: PAS/PAC sensor-containing diguanylate cyclase
Products: NA
Alternate protein names: DGC [H]
Number of amino acids: Translated: 742; Mature: 741
Protein sequence:
>742_residues MSAPPAPPAVAGRGRMRAFATVHDGREAHWHMPGPLVLRMIAAAVAAVLLAGGLAAWMVARSAQHEALQRLMERQTDEVE AVALLLSGKIEQNQRVLATLAESIPAPLPEAPGALDALLGQGLPAARFFDALQIARQDGSLGLYWQNGRLHPGAQLDPAE RDQLRRTLLEGKPLLSGVIGNGPADGRIMFSLPLRTAGDGGGVQGALGATLRLQSQGLLPPSMALPARAGSRLVVFARDG TILAHPDPARVLGQVRDEAGLAQVYERALQGGQAAVAGRGFSDVVAGQVVSVAGVPLPQWLVARVSDARASDLDLLAGGG QRGLGGAVAATACAALLAAIAIAVLAQPLARLCRRVARAHPTGIADDAPWPAGAGEVDALAQVCRALVDDCRIARHRAGV LQDQFQAVLDHAPAGIVITRAGQMVVVGLEACRLLGYEPHELQGQPARLLYASDAAYAELGRRVRAEFAAHGAFDGDVCF VRKDGGAVWARVLGRAVRDGAVAEGGGTVWMLEDLTAAREARRQQDWPAGHDSLTQLANRDAFQQRLHALLADHAKRSMV VRASVRNGALPSESLQESPGRAEAVAPPPAPESGVLLFLDLDHFTVVNAIAGHDAGDDVLRHVGRLIETQVRHVGWAARL GGDEFAVVLPGCALPRGLAIATQLCAAVHAWEPAYAGRTYTLGVSIGLVPLDGGAYDVPTVLHAADMACYEAKRAGRNRV EVRWLRPGRPHESTTADGDVQA
Sequences:
>Translated_742_residues MSAPPAPPAVAGRGRMRAFATVHDGREAHWHMPGPLVLRMIAAAVAAVLLAGGLAAWMVARSAQHEALQRLMERQTDEVE AVALLLSGKIEQNQRVLATLAESIPAPLPEAPGALDALLGQGLPAARFFDALQIARQDGSLGLYWQNGRLHPGAQLDPAE RDQLRRTLLEGKPLLSGVIGNGPADGRIMFSLPLRTAGDGGGVQGALGATLRLQSQGLLPPSMALPARAGSRLVVFARDG TILAHPDPARVLGQVRDEAGLAQVYERALQGGQAAVAGRGFSDVVAGQVVSVAGVPLPQWLVARVSDARASDLDLLAGGG QRGLGGAVAATACAALLAAIAIAVLAQPLARLCRRVARAHPTGIADDAPWPAGAGEVDALAQVCRALVDDCRIARHRAGV LQDQFQAVLDHAPAGIVITRAGQMVVVGLEACRLLGYEPHELQGQPARLLYASDAAYAELGRRVRAEFAAHGAFDGDVCF VRKDGGAVWARVLGRAVRDGAVAEGGGTVWMLEDLTAAREARRQQDWPAGHDSLTQLANRDAFQQRLHALLADHAKRSMV VRASVRNGALPSESLQESPGRAEAVAPPPAPESGVLLFLDLDHFTVVNAIAGHDAGDDVLRHVGRLIETQVRHVGWAARL GGDEFAVVLPGCALPRGLAIATQLCAAVHAWEPAYAGRTYTLGVSIGLVPLDGGAYDVPTVLHAADMACYEAKRAGRNRV EVRWLRPGRPHESTTADGDVQA >Mature_741_residues SAPPAPPAVAGRGRMRAFATVHDGREAHWHMPGPLVLRMIAAAVAAVLLAGGLAAWMVARSAQHEALQRLMERQTDEVEA VALLLSGKIEQNQRVLATLAESIPAPLPEAPGALDALLGQGLPAARFFDALQIARQDGSLGLYWQNGRLHPGAQLDPAER DQLRRTLLEGKPLLSGVIGNGPADGRIMFSLPLRTAGDGGGVQGALGATLRLQSQGLLPPSMALPARAGSRLVVFARDGT ILAHPDPARVLGQVRDEAGLAQVYERALQGGQAAVAGRGFSDVVAGQVVSVAGVPLPQWLVARVSDARASDLDLLAGGGQ RGLGGAVAATACAALLAAIAIAVLAQPLARLCRRVARAHPTGIADDAPWPAGAGEVDALAQVCRALVDDCRIARHRAGVL QDQFQAVLDHAPAGIVITRAGQMVVVGLEACRLLGYEPHELQGQPARLLYASDAAYAELGRRVRAEFAAHGAFDGDVCFV RKDGGAVWARVLGRAVRDGAVAEGGGTVWMLEDLTAAREARRQQDWPAGHDSLTQLANRDAFQQRLHALLADHAKRSMVV RASVRNGALPSESLQESPGRAEAVAPPPAPESGVLLFLDLDHFTVVNAIAGHDAGDDVLRHVGRLIETQVRHVGWAARLG GDEFAVVLPGCALPRGLAIATQLCAAVHAWEPAYAGRTYTLGVSIGLVPLDGGAYDVPTVLHAADMACYEAKRAGRNRVE VRWLRPGRPHESTTADGDVQA
Specific function: Cyclic-di-GMP is a second messenger which controls cell surface-associated traits in bacteria [H]
COG id: COG2199
COG function: function code T; FOG: GGDEF domain
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 2 PAS (PER-ARNT-SIM) domains [H]
Homologues:
Organism=Escherichia coli, GI1788381, Length=354, Percent_Identity=27.9661016949153, Blast_Score=128, Evalue=1e-30, Organism=Escherichia coli, GI87082007, Length=197, Percent_Identity=31.9796954314721, Blast_Score=80, Evalue=5e-16, Organism=Escherichia coli, GI1786584, Length=174, Percent_Identity=31.0344827586207, Blast_Score=79, Evalue=8e-16, Organism=Escherichia coli, GI145693134, Length=128, Percent_Identity=35.9375, Blast_Score=78, Evalue=2e-15, Organism=Escherichia coli, GI1787802, Length=147, Percent_Identity=33.3333333333333, Blast_Score=71, Evalue=2e-13, Organism=Escherichia coli, GI1787262, Length=218, Percent_Identity=28.4403669724771, Blast_Score=69, Evalue=2e-12,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001054 - InterPro: IPR000160 - InterPro: IPR001633 - InterPro: IPR007895 - InterPro: IPR001610 - InterPro: IPR000014 - InterPro: IPR000700 - InterPro: IPR013656 - InterPro: IPR013655 [H]
Pfam domain/function: PF00563 EAL; PF00990 GGDEF; PF05231 MASE1; PF08447 PAS_3; PF08448 PAS_4 [H]
EC number: =2.7.7.65 [H]
Molecular weight: Translated: 77804; Mature: 77673
Theoretical pI: Translated: 7.21; Mature: 7.21
Prosite motif: PS50887 GGDEF
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.2 %Cys (Translated Protein) 1.6 %Met (Translated Protein) 2.8 %Cys+Met (Translated Protein) 1.2 %Cys (Mature Protein) 1.5 %Met (Mature Protein) 2.7 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MSAPPAPPAVAGRGRMRAFATVHDGREAHWHMPGPLVLRMIAAAVAAVLLAGGLAAWMVA CCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH RSAQHEALQRLMERQTDEVEAVALLLSGKIEQNQRVLATLAESIPAPLPEAPGALDALLG HHHHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCCCCCHHHHHHH QGLPAARFFDALQIARQDGSLGLYWQNGRLHPGAQLDPAERDQLRRTLLEGKPLLSGVIG CCCCHHHHHHHHHHHHCCCCEEEEEECCEECCCCCCCCHHHHHHHHHHHCCCHHHHHHCC NGPADGRIMFSLPLRTAGDGGGVQGALGATLRLQSQGLLPPSMALPARAGSRLVVFARDG CCCCCCEEEEEECCCCCCCCCCCCHHCCCEEEECCCCCCCCCCCCCCCCCCEEEEEEECC TILAHPDPARVLGQVRDEAGLAQVYERALQGGQAAVAGRGFSDVVAGQVVSVAGVPLPQW EEEECCCHHHHHHHHHHHCCHHHHHHHHHCCCCEEECCCCHHHHHHHHHHHHCCCCCHHH LVARVSDARASDLDLLAGGGQRGLGGAVAATACAALLAAIAIAVLAQPLARLCRRVARAH HHHHHHCCCCCCCCEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC PTGIADDAPWPAGAGEVDALAQVCRALVDDCRIARHRAGVLQDQFQAVLDHAPAGIVITR CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEE AGQMVVVGLEACRLLGYEPHELQGQPARLLYASDAAYAELGRRVRAEFAAHGAFDGDVCF CCCEEEEEHHHHHHCCCCCHHCCCCCCEEEEECCHHHHHHHHHHHHHHHHCCCCCCCEEE VRKDGGAVWARVLGRAVRDGAVAEGGGTVWMLEDLTAAREARRQQDWPAGHDSLTQLANR EEECCCHHHHHHHHHHHHCCCEECCCCEEEEEHHHHHHHHHHHHCCCCCCHHHHHHHHHH DAFQQRLHALLADHAKRSMVVRASVRNGALPSESLQESPGRAEAVAPPPAPESGVLLFLD HHHHHHHHHHHHHHHHHCEEEEEECCCCCCCCHHHHCCCCCCCCCCCCCCCCCCEEEEEE LDHFTVVNAIAGHDAGDDVLRHVGRLIETQVRHVGWAARLGGDEFAVVLPGCALPRGLAI CCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCHHHHCCCCCEEEEECCCCCCCHHHH ATQLCAAVHAWEPAYAGRTYTLGVSIGLVPLDGGAYDVPTVLHAADMACYEAKRAGRNRV HHHHHHHHHCCCCCCCCCEEEEEEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCE EVRWLRPGRPHESTTADGDVQA EEEEECCCCCCCCCCCCCCCCC >Mature Secondary Structure SAPPAPPAVAGRGRMRAFATVHDGREAHWHMPGPLVLRMIAAAVAAVLLAGGLAAWMVA CCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH RSAQHEALQRLMERQTDEVEAVALLLSGKIEQNQRVLATLAESIPAPLPEAPGALDALLG HHHHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCCCCCHHHHHHH QGLPAARFFDALQIARQDGSLGLYWQNGRLHPGAQLDPAERDQLRRTLLEGKPLLSGVIG CCCCHHHHHHHHHHHHCCCCEEEEEECCEECCCCCCCCHHHHHHHHHHHCCCHHHHHHCC NGPADGRIMFSLPLRTAGDGGGVQGALGATLRLQSQGLLPPSMALPARAGSRLVVFARDG CCCCCCEEEEEECCCCCCCCCCCCHHCCCEEEECCCCCCCCCCCCCCCCCCEEEEEEECC TILAHPDPARVLGQVRDEAGLAQVYERALQGGQAAVAGRGFSDVVAGQVVSVAGVPLPQW EEEECCCHHHHHHHHHHHCCHHHHHHHHHCCCCEEECCCCHHHHHHHHHHHHCCCCCHHH LVARVSDARASDLDLLAGGGQRGLGGAVAATACAALLAAIAIAVLAQPLARLCRRVARAH HHHHHHCCCCCCCCEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC PTGIADDAPWPAGAGEVDALAQVCRALVDDCRIARHRAGVLQDQFQAVLDHAPAGIVITR CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEE AGQMVVVGLEACRLLGYEPHELQGQPARLLYASDAAYAELGRRVRAEFAAHGAFDGDVCF CCCEEEEEHHHHHHCCCCCHHCCCCCCEEEEECCHHHHHHHHHHHHHHHHCCCCCCCEEE VRKDGGAVWARVLGRAVRDGAVAEGGGTVWMLEDLTAAREARRQQDWPAGHDSLTQLANR EEECCCHHHHHHHHHHHHCCCEECCCCEEEEEHHHHHHHHHHHHCCCCCCHHHHHHHHHH DAFQQRLHALLADHAKRSMVVRASVRNGALPSESLQESPGRAEAVAPPPAPESGVLLFLD HHHHHHHHHHHHHHHHHCEEEEEECCCCCCCCHHHHCCCCCCCCCCCCCCCCCCEEEEEE LDHFTVVNAIAGHDAGDDVLRHVGRLIETQVRHVGWAARLGGDEFAVVLPGCALPRGLAI CCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCHHHHCCCCCEEEEECCCCCCCHHHH ATQLCAAVHAWEPAYAGRTYTLGVSIGLVPLDGGAYDVPTVLHAADMACYEAKRAGRNRV HHHHHHHHHCCCCCCCCCEEEEEEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCE EVRWLRPGRPHESTTADGDVQA EEEEECCCCCCCCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9097040; 9278503; 6094528; 7984428 [H]