| Definition | Azorhizobium caulinodans ORS 571, complete genome. |
|---|---|
| Accession | NC_009937 |
| Length | 5,369,772 |
The map label for this gene is yagR [H]
Identifier: 158423024
GI number: 158423024
Start: 1590925
End: 1593135
Strand: Reverse
Name: yagR [H]
Synonym: AZC_1400
Alternate gene names: 158423024
Gene position: 1593135-1590925 (Counterclockwise)
Preceding gene: 158423025
Following gene: 158423023
Centisome position: 29.67
GC content: 67.89
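The positional fields above are related by simple arithmetic, and a short sketch can make that explicit. It assumes the coordinates are 1-based and inclusive, and that the centisome position is the gene's higher coordinate (the 5' end of a reverse-strand gene) expressed as a percentage of the genome length. That assumption reproduces the listed 29.67, but the record does not state the formula. The function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 5_369_772            # Length of NC_009937
START, END = 1_590_925, 1_593_135    # Start / End fields


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, to two decimals."""
    return round(100 * position / genome_length, 2)


length_bp = gene_length(START, END)
codons = length_bp // 3              # includes the stop codon
print(length_bp, codons, centisome(END, GENOME_LENGTH))  # prints: 2211 737 29.67
```

The 737 codons correspond to the 736 amino acids reported for the product plus one stop codon.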
Gene sequence:
>2211_bases ATGAAATTCGAGACGCCCGCAACCACCAATCCCATCGACCGCCTGCGCGTGGTGGGCCAGCCGCTCGACCGCATCGACGG TCCGCTGAAGACCACCGGAACCGCGCCCTATGCCTATGAGCGGCACGATGTGGCGGCCGATCCCGCCTATGGCTATGTGA TCGGCGCTGGCATCGCCAAGGGGCGCATCACCCGCATGGATATCGAGGCGGCCCGACGCGCGCCGGGTGTCATCGCCATC GTGACGGCCGAGACCGCCGGGAAGCTCGGCAAGGGCAAGCACAATACCGCGCGCCTGCTGGCCGGGCCCCACATCGAACA TTACCACCAGGCCGTCGCGCTGGTGGTCGCGGAGACGTTCGAGCAGGCCCGTGCAGCGGCCGGGCGAGTCCGGATCGACT ATGCCGCCGAGCCGGGGCGCTTCGATCTCGAAGCCGCGCGCTGGTCCGCCGCCAAGCCCTCGGACGAGGCGACGGGCGGT GCCGCCGATACCAAGGTCGGCGACTTCGCCCGCGCCTTTGACGCGGCCACGGTGCGCCTGGACGAGACCTACAGCACGCC GGACGAGACCCACGCCATGATGGAGCCGCATGCTTCGCTGGCGCGGTGGGAAGGCGACCGGCTGACCGTCTGGACCTCTA ACCAGATGATCGGATGGGCCCATACGGATCTGGCCGAGACCCTCGGCGTACCGCCCGAGAAGGTACGCGTAATCTCACCC TTCATCGGCGGCGGCTTCGGCGGCAAGCTGTTCCTGCGCGCCGACATTGTGCTCGCAGCACTCGGCGCGCGGGCGGCGGG ACGGCCGGTGAAGGTGGCGCTCCATCGCCCGCTGATGATCAACAACACAACCCACCGTCCGGCCACCATCCAGCGTATCC GCATCGGCGCAGAGCGGGATGGCATGATCACCGCCATCGCCCACGAAAGCTGGTCCGGCGACCTGGAGGGCGGCGGACCG GAAGTAGCGGTGAACCAGACACGCCTCCTGTATGCGGGCGCCAATCGCCTCACCTCCATGCGTCTCGCTAGTCTCGACCT GCCGGAAGGCAACTCCATGCGCGCGCCGGGCGAAGCTCCGGGCATGATGGCGCTGGAGATCGCCATGGACGAAATGGCCG AAAAGCTCGGCATCGACCCGGTGGACTTCCGCATCCGCAACGACACGCAGAAAGATCCCGAAAAGCCCGGCCGGCCGTTC TCCGCCCGCCCCTTCGTGGAGTGCCTGCGGCTCGGCGCGGAACGCTTCGGCTGGAGCAGCCGCAATCCCGTCCCCGGGCA GCGGCGGGAGGGTCGTTGGCTGATCGGAGCCGGCATGGCGGGCGCTTTTCGCAACAACCTGCTCGTGAAGTCGGCCGCCC GCGTGCGCCTGTCCGCCGATGGCCGGGTCACGGTCGAGACCGACATGACCGACATCGGCACCGGCAGCTACACCATCATC GCCCAGACCGCCGCGGAGATGATGGGCGTCTCCCTCGAGCAGGTGGAGGTGCGTCTCGGAGATTCCGCCTATCCCGTCTC GGCGGGGTCCGGCGGCCAGTTCGGCGCCAACAATGCCACCGCTGGCGTCTATGCCGCCTGCGTGAAACTTCGCGAGGCGG TGGCCCGGAAGCTCGGCCTAAATGCCGCGGATGCCGAGTTCGTCCATGGCACGGTGCGCAGCGGCGAGCGGTCCATTCCG CTGGCGCTCGCGGCCTCAAGCGGGGAGCTCACCGGCGAGGACGGCATCGAATATGGCGACCTTCACAAGACCTACCAGCA GTCCACCTTCGGCAGCCACTTCGTTGAAGTGGCGGTGGACGCGGCGACGGGCGAAGCCCGCATCCGCCGCATGCTGGCGG TCTGCGCGGCCGGGCGCATCCTCAACCCGAAATCCGCCCGCAGCCAGGTGATCGGAGCCATGACCATGGGCGCAGGCGCG 
GCCTTGATGGAGGAACTGGCCGTCGACAAGAAGCGCGGTTTCTTCGTCAATCACGACCTCGCCGGATATGAGGTGCCCGT GCATGCGGATATCCCGCACCAGGAGGTGATCTTCCTTGATGAGACCGATCCGATCTCCTCGCCCATGAAGGCGAAGGGCG TCGGGGAACTGGGCATCTGCGGCGTCGGCGCCGCGGTTGCCAACGCGCTCTACAATGCGACGGGTGTGCGCGTGCGCAAC TACCCGATCACGCTCGACAAATATCTCGACCGGCTGCCGGACGTCGCCTGA
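A GC content figure such as the 67.89 listed for this gene can be checked from the sequence itself. The following is a minimal sketch, assuming GC content is the percentage of G and C bases over all bases and that the FASTA-style header (`>2211_bases`) has been removed before the call. `gc_content` is a hypothetical helper, not part of the source database.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, rounded to two decimals.

    Whitespace (the spaces separating sequence chunks) is ignored.
    """
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100 * gc / len(seq), 2)


# First twelve bases of the gene sequence above, with a chunk separator.
print(gc_content("ATGAAA TTCGAG"))  # prints: 33.33
```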
Upstream 100 bases:
>100_bases CTGCTCGCCGACGCGCGGCCGACGCGGGACAACGCCTTCAAGGTCGTTCTGGTGGAGCGCACGCTCGGTGCGGTGCTCAA CGAGGCAAGGGGCTGAACCC
Downstream 100 bases:
>100_bases TCTGCCGTCGTCTTCCGCCTCCTGTGCATGCGTGCAGGAGGCGTGAGCCAAGCAGTTTCAGATAAAAATTGGGGCCACGC CGAGCAAAGCGTGATACAAG
Product: oxidoreductase
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 736; Mature: 736
Protein sequence:
>736_residues MKFETPATTNPIDRLRVVGQPLDRIDGPLKTTGTAPYAYERHDVAADPAYGYVIGAGIAKGRITRMDIEAARRAPGVIAI VTAETAGKLGKGKHNTARLLAGPHIEHYHQAVALVVAETFEQARAAAGRVRIDYAAEPGRFDLEAARWSAAKPSDEATGG AADTKVGDFARAFDAATVRLDETYSTPDETHAMMEPHASLARWEGDRLTVWTSNQMIGWAHTDLAETLGVPPEKVRVISP FIGGGFGGKLFLRADIVLAALGARAAGRPVKVALHRPLMINNTTHRPATIQRIRIGAERDGMITAIAHESWSGDLEGGGP EVAVNQTRLLYAGANRLTSMRLASLDLPEGNSMRAPGEAPGMMALEIAMDEMAEKLGIDPVDFRIRNDTQKDPEKPGRPF SARPFVECLRLGAERFGWSSRNPVPGQRREGRWLIGAGMAGAFRNNLLVKSAARVRLSADGRVTVETDMTDIGTGSYTII AQTAAEMMGVSLEQVEVRLGDSAYPVSAGSGGQFGANNATAGVYAACVKLREAVARKLGLNAADAEFVHGTVRSGERSIP LALAASSGELTGEDGIEYGDLHKTYQQSTFGSHFVEVAVDAATGEARIRRMLAVCAAGRILNPKSARSQVIGAMTMGAGA ALMEELAVDKKRGFFVNHDLAGYEVPVHADIPHQEVIFLDETDPISSPMKAKGVGELGICGVGAAVANALYNATGVRVRN YPITLDKYLDRLPDVA
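The protein sequence can be related back to the gene sequence by translating codons. The sketch below assumes the standard genetic code (for sense and stop codons the bacterial table gives the same assignments) and that the listed gene sequence is already the coding strand, which is consistent with it starting at ATG and ending at TGA. `translate` is an illustrative helper written for this example.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAAATTCGAGACGCCC"))  # first 18 bases -> MKFETP
print(translate("CTGCCGGACGTCGCCTGA"))  # last 18 bases  -> LPDVA
```

Both fragments agree with the start (MKFETP...) and end (...LPDVA) of the 736-residue product.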
Sequences:
>Translated_736_residues MKFETPATTNPIDRLRVVGQPLDRIDGPLKTTGTAPYAYERHDVAADPAYGYVIGAGIAKGRITRMDIEAARRAPGVIAI VTAETAGKLGKGKHNTARLLAGPHIEHYHQAVALVVAETFEQARAAAGRVRIDYAAEPGRFDLEAARWSAAKPSDEATGG AADTKVGDFARAFDAATVRLDETYSTPDETHAMMEPHASLARWEGDRLTVWTSNQMIGWAHTDLAETLGVPPEKVRVISP FIGGGFGGKLFLRADIVLAALGARAAGRPVKVALHRPLMINNTTHRPATIQRIRIGAERDGMITAIAHESWSGDLEGGGP EVAVNQTRLLYAGANRLTSMRLASLDLPEGNSMRAPGEAPGMMALEIAMDEMAEKLGIDPVDFRIRNDTQKDPEKPGRPF SARPFVECLRLGAERFGWSSRNPVPGQRREGRWLIGAGMAGAFRNNLLVKSAARVRLSADGRVTVETDMTDIGTGSYTII AQTAAEMMGVSLEQVEVRLGDSAYPVSAGSGGQFGANNATAGVYAACVKLREAVARKLGLNAADAEFVHGTVRSGERSIP LALAASSGELTGEDGIEYGDLHKTYQQSTFGSHFVEVAVDAATGEARIRRMLAVCAAGRILNPKSARSQVIGAMTMGAGA ALMEELAVDKKRGFFVNHDLAGYEVPVHADIPHQEVIFLDETDPISSPMKAKGVGELGICGVGAAVANALYNATGVRVRN YPITLDKYLDRLPDVA
>Mature_736_residues MKFETPATTNPIDRLRVVGQPLDRIDGPLKTTGTAPYAYERHDVAADPAYGYVIGAGIAKGRITRMDIEAARRAPGVIAI VTAETAGKLGKGKHNTARLLAGPHIEHYHQAVALVVAETFEQARAAAGRVRIDYAAEPGRFDLEAARWSAAKPSDEATGG AADTKVGDFARAFDAATVRLDETYSTPDETHAMMEPHASLARWEGDRLTVWTSNQMIGWAHTDLAETLGVPPEKVRVISP FIGGGFGGKLFLRADIVLAALGARAAGRPVKVALHRPLMINNTTHRPATIQRIRIGAERDGMITAIAHESWSGDLEGGGP EVAVNQTRLLYAGANRLTSMRLASLDLPEGNSMRAPGEAPGMMALEIAMDEMAEKLGIDPVDFRIRNDTQKDPEKPGRPF SARPFVECLRLGAERFGWSSRNPVPGQRREGRWLIGAGMAGAFRNNLLVKSAARVRLSADGRVTVETDMTDIGTGSYTII AQTAAEMMGVSLEQVEVRLGDSAYPVSAGSGGQFGANNATAGVYAACVKLREAVARKLGLNAADAEFVHGTVRSGERSIP LALAASSGELTGEDGIEYGDLHKTYQQSTFGSHFVEVAVDAATGEARIRRMLAVCAAGRILNPKSARSQVIGAMTMGAGA ALMEELAVDKKRGFFVNHDLAGYEVPVHADIPHQEVIFLDETDPISSPMKAKGVGELGICGVGAAVANALYNATGVRVRN YPITLDKYLDRLPDVA
Specific function: Unknown
COG id: COG1529
COG function: function code C; Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the xanthine dehydrogenase family [H]
Homologues:
- Organism=Homo sapiens, GI71773480, Length=739, Percent_Identity=24.2219215155616, Blast_Score=137, Evalue=3e-32
- Organism=Homo sapiens, GI91823271, Length=709, Percent_Identity=24.1184767277856, Blast_Score=133, Evalue=7e-31
- Organism=Escherichia coli, GI1786478, Length=735, Percent_Identity=68.0272108843537, Blast_Score=1009, Evalue=0.0
- Organism=Escherichia coli, GI1789230, Length=749, Percent_Identity=31.6421895861148, Blast_Score=261, Evalue=8e-71
- Organism=Escherichia coli, GI1789246, Length=811, Percent_Identity=26.7570900123305, Blast_Score=200, Evalue=3e-52
- Organism=Caenorhabditis elegans, GI17540638, Length=727, Percent_Identity=23.6588720770289, Blast_Score=152, Evalue=5e-37
- Organism=Drosophila melanogaster, GI24647199, Length=724, Percent_Identity=24.0331491712707, Blast_Score=132, Evalue=7e-31
- Organism=Drosophila melanogaster, GI24647195, Length=751, Percent_Identity=24.5006657789614, Blast_Score=123, Evalue=4e-28
- Organism=Drosophila melanogaster, GI24647201, Length=739, Percent_Identity=24.2219215155616, Blast_Score=122, Evalue=8e-28
- Organism=Drosophila melanogaster, GI24647197, Length=691, Percent_Identity=24.8914616497829, Blast_Score=114, Evalue=2e-25
- Organism=Drosophila melanogaster, GI17737937, Length=699, Percent_Identity=24.0343347639485, Blast_Score=110, Evalue=3e-24
- Organism=Drosophila melanogaster, GI24647193, Length=729, Percent_Identity=23.8683127572016, Blast_Score=110, Evalue=4e-24
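The homologue entries follow a regular `key=value` pattern, so they can be turned into structured records for sorting or filtering. The sketch below is one possible parser, assuming every entry carries the same six fields in the same order. The `Homologue` type and `parse_homologues` function are names invented for this example. The sample text is two entries copied from the list above.

```python
import re
from typing import List, NamedTuple


class Homologue(NamedTuple):
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


# One entry: Organism=..., GI<digits>, Length=..., Percent_Identity=..., ...
_RECORD = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),"
    r"\s*Percent_Identity=(?P<pid>[\d.]+),\s*Blast_Score=(?P<score>\d+),"
    r"\s*Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_homologues(text: str) -> List[Homologue]:
    """Extract every homologue entry found in `text`, in order of appearance."""
    return [
        Homologue(
            m["organism"].strip(), m["gi"], int(m["length"]),
            float(m["pid"]), int(m["score"]), float(m["evalue"]),
        )
        for m in _RECORD.finditer(text)
    ]


sample = (
    "Organism=Homo sapiens, GI71773480, Length=739, "
    "Percent_Identity=24.2219215155616, Blast_Score=137, Evalue=3e-32, "
    "Organism=Escherichia coli, GI1786478, Length=735, "
    "Percent_Identity=68.0272108843537, Blast_Score=1009, Evalue=0.0,"
)
hits = parse_homologues(sample)
best = max(hits, key=lambda h: h.blast_score)
print(best.organism, best.gi, round(best.percent_identity, 2))
```

Because the pattern keys on `Organism=`, the same function works whether the entries are on separate lines or run together on one line.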
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000674
- InterPro: IPR008274 [H]
Pfam domain/function: PF01315 Ald_Xan_dh_C; PF02738 Ald_Xan_dh_C2 [H]
EC number: 1.17.1.4 [H]
Molecular weight: Translated: 78483; Mature: 78483
Theoretical pI: Translated: 6.88; Mature: 6.88
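A molecular weight like the one listed can be estimated from the protein sequence alone. The record does not say how its value was derived, so the following is only a sketch of one common approach: sum the average masses of the amino-acid residues and add one water molecule for the free termini. The mass table holds standard average residue masses in daltons, and `molecular_weight` is a hypothetical helper.

```python
# Average residue masses in Da (amino acid minus one water).
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one H2O for the N- and C-terminal groups


def molecular_weight(protein: str) -> float:
    """Average molecular weight in Da of an unmodified peptide chain."""
    protein = "".join(protein.split()).upper()
    return sum(AVERAGE_RESIDUE_MASS[aa] for aa in protein) + WATER


# Sanity check on a dipeptide: glycylglycine is about 132.12 Da.
print(round(molecular_weight("GG"), 2))  # prints: 132.12
```

Applying the function to the full 736-residue sequence gives an estimate to compare against the listed value. Small differences can arise from rounding or from a different mass table.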
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
- 0.5 %Cys (Translated Protein)
- 3.0 %Met (Translated Protein)
- 3.5 %Cys+Met (Translated Protein)
- 0.5 %Cys (Mature Protein)
- 3.0 %Met (Mature Protein)
- 3.5 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure
MKFETPATTNPIDRLRVVGQPLDRIDGPLKTTGTAPYAYERHDVAADPAYGYVIGAGIAK
CCCCCCCCCCHHHHHHHHCCCHHHCCCCCCCCCCCCCEECCCCCCCCCCCCEEEECCCCC
GRITRMDIEAARRAPGVIAIVTAETAGKLGKGKHNTARLLAGPHIEHYHQAVALVVAETF
CCEEEEEHHHHHCCCCEEEEEEECCCCCCCCCCCCCEEEEECCCHHHHHHHHHHHHHHHH
EQARAAAGRVRIDYAAEPGRFDLEAARWSAAKPSDEATGGAADTKVGDFARAFDAATVRL
HHHHHHCCCEEEEECCCCCCCCCHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHCCEEEEE
DETYSTPDETHAMMEPHASLARWEGDRLTVWTSNQMIGWAHTDLAETLGVPPEKVRVISP
CCCCCCCCHHHHHHCCCHHHHHCCCCEEEEEECCCEEEEEHHHHHHHHCCCHHHEEEECH
FIGGGFGGKLFLRADIVLAALGARAAGRPVKVALHRPLMINNTTHRPATIQRIRIGAERD
HHCCCCCCEEEEEHHHHHHHHCCCCCCCCEEEEEECCEEECCCCCCCCEEEEEEECCCCC
GMITAIAHESWSGDLEGGGPEVAVNQTRLLYAGANRLTSMRLASLDLPEGNSMRAPGEAP
CEEEEEEECCCCCCCCCCCCEEEECCEEEEEECCCHHHHEEEEEECCCCCCCCCCCCCCC
GMMALEIAMDEMAEKLGIDPVDFRIRNDTQKDPEKPGRPFSARPFVECLRLGAERFGWSS
CCEEEHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCC
RNPVPGQRREGRWLIGAGMAGAFRNNLLVKSAARVRLSADGRVTVETDMTDIGTGSYTII
CCCCCCCCCCCCEEEECCCCHHHHCCEEEECCEEEEEECCCEEEEEECCCCCCCCCEEEE
AQTAAEMMGVSLEQVEVRLGDSAYPVSAGSGGQFGANNATAGVYAACVKLREAVARKLGL
EHHHHHHHCCCHHHEEEEECCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCC
NAADAEFVHGTVRSGERSIPLALAASSGELTGEDGIEYGDLHKTYQQSTFGSHFVEVAVD
CCCCHHHHHHHHCCCCCCCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHCCCEEEEEEEE
AATGEARIRRMLAVCAAGRILNPKSARSQVIGAMTMGAGAALMEELAVDKKRGFFVNHDL
CCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCEEEECCC
AGYEVPVHADIPHQEVIFLDETDPISSPMKAKGVGELGICGVGAAVANALYNATGVRVRN
CCEEEEEECCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEC
YPITLDKYLDRLPDVA
CCEEHHHHHHHCCCCC
>Mature Secondary Structure
MKFETPATTNPIDRLRVVGQPLDRIDGPLKTTGTAPYAYERHDVAADPAYGYVIGAGIAK
CCCCCCCCCCHHHHHHHHCCCHHHCCCCCCCCCCCCCEECCCCCCCCCCCCEEEECCCCC
GRITRMDIEAARRAPGVIAIVTAETAGKLGKGKHNTARLLAGPHIEHYHQAVALVVAETF
CCEEEEEHHHHHCCCCEEEEEEECCCCCCCCCCCCCEEEEECCCHHHHHHHHHHHHHHHH
EQARAAAGRVRIDYAAEPGRFDLEAARWSAAKPSDEATGGAADTKVGDFARAFDAATVRL
HHHHHHCCCEEEEECCCCCCCCCHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHCCEEEEE
DETYSTPDETHAMMEPHASLARWEGDRLTVWTSNQMIGWAHTDLAETLGVPPEKVRVISP
CCCCCCCCHHHHHHCCCHHHHHCCCCEEEEEECCCEEEEEHHHHHHHHCCCHHHEEEECH
FIGGGFGGKLFLRADIVLAALGARAAGRPVKVALHRPLMINNTTHRPATIQRIRIGAERD
HHCCCCCCEEEEEHHHHHHHHCCCCCCCCEEEEEECCEEECCCCCCCCEEEEEEECCCCC
GMITAIAHESWSGDLEGGGPEVAVNQTRLLYAGANRLTSMRLASLDLPEGNSMRAPGEAP
CEEEEEEECCCCCCCCCCCCEEEECCEEEEEECCCHHHHEEEEEECCCCCCCCCCCCCCC
GMMALEIAMDEMAEKLGIDPVDFRIRNDTQKDPEKPGRPFSARPFVECLRLGAERFGWSS
CCEEEHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCC
RNPVPGQRREGRWLIGAGMAGAFRNNLLVKSAARVRLSADGRVTVETDMTDIGTGSYTII
CCCCCCCCCCCCEEEECCCCHHHHCCEEEECCEEEEEECCCEEEEEECCCCCCCCCEEEE
AQTAAEMMGVSLEQVEVRLGDSAYPVSAGSGGQFGANNATAGVYAACVKLREAVARKLGL
EHHHHHHHCCCHHHEEEEECCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCC
NAADAEFVHGTVRSGERSIPLALAASSGELTGEDGIEYGDLHKTYQQSTFGSHFVEVAVD
CCCCHHHHHHHHCCCCCCCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHCCCEEEEEEEE
AATGEARIRRMLAVCAAGRILNPKSARSQVIGAMTMGAGAALMEELAVDKKRGFFVNHDL
CCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCEEEECCC
AGYEVPVHADIPHQEVIFLDETDPISSPMKAKGVGELGICGVGAAVANALYNATGVRVRN
CCEEEEEECCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEC
YPITLDKYLDRLPDVA
CCEEHHHHHHHCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]