Definition | Mycobacterium sp. MCS chromosome, complete genome. |
---|---|
Accession | NC_008146 |
Length | 5,705,448 |
The map label for this gene is yagR [H]
Identifier: 108797417
GI number: 108797417
Start: 484004
End: 486094
Strand: Reverse
Name: yagR [H]
Synonym: Mmcs_0437
Alternate gene names: 108797417
Gene position: 486094-484004 (Counterclockwise)
Preceding gene: 108797418
Following gene: 108797406
Centisome position: 8.52
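The coordinate fields above can be cross-checked with a little arithmetic. The sketch below is a minimal illustration, not the database's own method: it assumes inclusive 1-based coordinates and assumes the centisome position is the gene's 5' end (coordinate 486094 on the reverse strand) expressed as a percentage of the 5,705,448 bp chromosome. The function names are hypothetical.

```python
# Values copied from the record above.
GENOME_LENGTH = 5_705_448   # chromosome length in bp
START = 484_004             # lower coordinate of the gene
END = 486_094               # higher coordinate; 5' end for a reverse-strand gene


def gene_length(start: int, end: int) -> int:
    """Length of a feature given inclusive 1-based coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the chromosome length (assumed definition)."""
    return round(100.0 * position / genome_length, 2)


length = gene_length(START, END)
position_pct = centisome(END, GENOME_LENGTH)
print(length, position_pct)
```

Under these assumptions the length agrees with the 2091 bases listed for the gene sequence, and the percentage matches the listed centisome position only when the higher coordinate is used.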
GC content: 69.63
Gene sequence:
>2091_bases ATGACACTCCTCGACCCGCAGGCGATCGGCGCCCCGATGGCACGGCTGGACGGCCGCGCCAAGGTCACCGGCGCCGCGCC GTACGCCTTCGAACAGCGCGTCGACGATCCGGCGTATCTGCATCCGATCCAGTCGACGATCGCCCGTGGTCGGGTCGCGG CCGTCGACACCGACGCCGCCAGGGCGATCGACGGTGTGCTCGATGTGCTGACGGTGTTCGACGCCCCGGAACTCGCCGAC ACCTCCGATGGTGAGCTGTCGATCCTGCAGGACGACCGCGTGCACTTCCGCGGTCAGATCATCGGCGGGGTCGTCGCGGA GACGGCCGAGATCGCCCGGCACGCAGCAGGTCTCGTGCAGGTGAGGTACACCGAGGAATCGCACGACGTGGAGCTGACCG CCGACCACCCCGGGCTCTACACCCCTGAGCAGGTCAACGCGGGGTATCCGTCGGACACCGACGAGGGTGACGTGGAGGCG GCGCTGGCCTCGGCCGAGGTCACCGTCGACCAGACCTATTCGACACCGATCGAGCACAACAACCCGATGGAGCCACACGC CGCGATCGCGATATGGACCGCTGACGGGGTGACGATGTTCGACTCCACACAGGGTGTGCACGCCGCGCGGAAGGCACTGG CACCGCTCTTCGGACTCGAACCCGACCAGTTGCGGGTCATCGCACCGCATGTCGGCGGCGGCTTCGGCTCGAAGGGGGCC CCGCACGCGCACGACGTCCTGGCCCTGCTGGCCGCCCAGCGCAGCGGTGGGCGGCCGGTGAAGCTGGCGCTGACCCGTCA GCAGATGTTCGCCCTGGTGGGCTATCGGACACCGACCATCCAGCGCATCCGGCTCGGCGCGAACCAGGACGGCACCCTGA CCGCACTGGCGCACGAGGTCGTCGAACAGACCTCGGCCGTCAAGGAATACGCCGAACAGACGGCGGTGACGTCACGCAAG ATGTACGCCGCACCGAACCGGCGCACCGCCCACCGACTCGCCGCCCTCGACGTGCCGGTACCGTTCTGGTTCCGCGCCCC GGGTGAGTGTCCCGGCGCCTACGCCGCCGAGGTGGCGATGGACGAACTGGCGGCGGCCTGCGGGGTCGACCCGATCGAAC TGCGGGTGCGCAACGATCCCGAGGTCGACCCCGAGACCGGCAACCCGTGGTCGGGCCGCCACCTCGTGGAATGCCTGCGG CTGGGGGCCGAGCGGTTCGGCTGGTCGTCGCGTGATCCTGTTCCCGCACAACGTTTTTCCGGTGACTGGTACGTCGGGCT CGGTGTCGCGTCGGCGACCTATCCGGCCATGCAGATGCCGGGTAACTCCGCGCGGATCACCTACGCCGACGAGGGCCGCT ACCTGGTGCAGATCGGCGCCGCCGACATCGGCACCGGCACCTGGACGACGCTGACGCAGATCGCCGCCGACGCGTTGGGT TGCGATGTGGCCGCGGTCGACCTGCAGATCGGCGACAGCGCGCTGCCGGAGGCATCGGTGGCCGGCGGATCGTCGGGCAT CAACTCGTGGGGACGGGCGATCGTCACGGCCGCACGGCAGTTCCGTCGCGACCACGGCGATCCCCCGGCGATCGGCGCGA CCACGGTGGCCGAGGCACCCGAGAACCCCGAGTCCGAGAAGTTCACCATGCAGTCCTTCGGCGCCCACTTCGTCGAGGCG AGGGTCAACCGGGACACCGGGGAGATCCGCGTGCCGCGGATGTTGGGCGTCTTCTCCATCGGCCGGGCGATCAACGCCCG TACGCTGCGCTCGCAGCTCATCGGCGGAATGACGATGGGCCTGTCGATGGCGTTGCACGAGGAGAGCGTGCGCGATCCGC GGTTCGGCCACGTCGTCACACAGGATTTCGCGACCTACCACATCAGCGCGCACGCCGACGTGGCCGACATCGACGCGATC 
TGGCTCGACGAGGCCGACGAACACGCCAACCCGATGGGATCGCGCGGCGCCGGGGAGATCGGCATCGTCGGATCGGCCGC CGCGGTGGTCAACGCCGTCTACAACGCGACCGGTGTGCGCGTGCGCGATCTGCCGGTCACCCTCGACAAGGTGCTGTCCG GGCTGCCCTGA
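The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the value is the percentage of G and C among the A/C/G/T bases and that the spaces in the listing are only layout; `gc_content` is a hypothetical helper name.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Whitespace and any other characters (for example the line-wrap spaces
    in the listing above) are ignored.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return round(100.0 * gc / len(bases), 2)


# Small illustrative input, not the gene itself.
print(gc_content("ATGC GC"))
```

To check the record, pass the full 2091-base sequence as one string; the result should be compared with the listed 69.63 rather than assumed.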
Upstream 100 bases:
>100_bases TGTCGGCCGCGGACCCGCAACCGGGCAACGAGTTCAAGGTTGCGCTGGCCCGCCGCACGCTGATCGCCGAACTGCGCGCG CTGACCGGACGGGGACGGCC
Downstream 100 bases:
>100_bases TCAGCGGTCGAGCAACTCCAGCAGATAGGCGCCGTACCCGGATTTGAGCAGGCTGTGCCCGCGGGCGGCCAACTGCTCGT CGTCGATGAACCCGACGCGC
Product: xanthine dehydrogenase, molybdenum binding subunit apoprotein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 696; Mature: 695
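The two residue counts follow from the gene length. A short sketch of that arithmetic, assuming the coding sequence ends in a single stop codon (the listed sequence ends in TGA) and that the mature form lacks only the initiator methionine, as the Mature sequence in this record suggests; `residue_counts` is a hypothetical name.

```python
def residue_counts(n_bases: int, has_stop: bool = True, met_removed: bool = True):
    """Return (translated, mature) residue counts for a coding sequence."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = n_bases // 3
    # The stop codon does not encode a residue.
    translated = codons - 1 if has_stop else codons
    # Removing the initiator Met shortens the mature protein by one residue.
    mature = translated - 1 if met_removed else translated
    return translated, mature


print(residue_counts(2091))
```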
Protein sequence:
>696_residues MTLLDPQAIGAPMARLDGRAKVTGAAPYAFEQRVDDPAYLHPIQSTIARGRVAAVDTDAARAIDGVLDVLTVFDAPELAD TSDGELSILQDDRVHFRGQIIGGVVAETAEIARHAAGLVQVRYTEESHDVELTADHPGLYTPEQVNAGYPSDTDEGDVEA ALASAEVTVDQTYSTPIEHNNPMEPHAAIAIWTADGVTMFDSTQGVHAARKALAPLFGLEPDQLRVIAPHVGGGFGSKGA PHAHDVLALLAAQRSGGRPVKLALTRQQMFALVGYRTPTIQRIRLGANQDGTLTALAHEVVEQTSAVKEYAEQTAVTSRK MYAAPNRRTAHRLAALDVPVPFWFRAPGECPGAYAAEVAMDELAAACGVDPIELRVRNDPEVDPETGNPWSGRHLVECLR LGAERFGWSSRDPVPAQRFSGDWYVGLGVASATYPAMQMPGNSARITYADEGRYLVQIGAADIGTGTWTTLTQIAADALG CDVAAVDLQIGDSALPEASVAGGSSGINSWGRAIVTAARQFRRDHGDPPAIGATTVAEAPENPESEKFTMQSFGAHFVEA RVNRDTGEIRVPRMLGVFSIGRAINARTLRSQLIGGMTMGLSMALHEESVRDPRFGHVVTQDFATYHISAHADVADIDAI WLDEADEHANPMGSRGAGEIGIVGSAAAVVNAVYNATGVRVRDLPVTLDKVLSGLP
Sequences:
>Translated_696_residues MTLLDPQAIGAPMARLDGRAKVTGAAPYAFEQRVDDPAYLHPIQSTIARGRVAAVDTDAARAIDGVLDVLTVFDAPELAD TSDGELSILQDDRVHFRGQIIGGVVAETAEIARHAAGLVQVRYTEESHDVELTADHPGLYTPEQVNAGYPSDTDEGDVEA ALASAEVTVDQTYSTPIEHNNPMEPHAAIAIWTADGVTMFDSTQGVHAARKALAPLFGLEPDQLRVIAPHVGGGFGSKGA PHAHDVLALLAAQRSGGRPVKLALTRQQMFALVGYRTPTIQRIRLGANQDGTLTALAHEVVEQTSAVKEYAEQTAVTSRK MYAAPNRRTAHRLAALDVPVPFWFRAPGECPGAYAAEVAMDELAAACGVDPIELRVRNDPEVDPETGNPWSGRHLVECLR LGAERFGWSSRDPVPAQRFSGDWYVGLGVASATYPAMQMPGNSARITYADEGRYLVQIGAADIGTGTWTTLTQIAADALG CDVAAVDLQIGDSALPEASVAGGSSGINSWGRAIVTAARQFRRDHGDPPAIGATTVAEAPENPESEKFTMQSFGAHFVEA RVNRDTGEIRVPRMLGVFSIGRAINARTLRSQLIGGMTMGLSMALHEESVRDPRFGHVVTQDFATYHISAHADVADIDAI WLDEADEHANPMGSRGAGEIGIVGSAAAVVNAVYNATGVRVRDLPVTLDKVLSGLP >Mature_695_residues TLLDPQAIGAPMARLDGRAKVTGAAPYAFEQRVDDPAYLHPIQSTIARGRVAAVDTDAARAIDGVLDVLTVFDAPELADT SDGELSILQDDRVHFRGQIIGGVVAETAEIARHAAGLVQVRYTEESHDVELTADHPGLYTPEQVNAGYPSDTDEGDVEAA LASAEVTVDQTYSTPIEHNNPMEPHAAIAIWTADGVTMFDSTQGVHAARKALAPLFGLEPDQLRVIAPHVGGGFGSKGAP HAHDVLALLAAQRSGGRPVKLALTRQQMFALVGYRTPTIQRIRLGANQDGTLTALAHEVVEQTSAVKEYAEQTAVTSRKM YAAPNRRTAHRLAALDVPVPFWFRAPGECPGAYAAEVAMDELAAACGVDPIELRVRNDPEVDPETGNPWSGRHLVECLRL GAERFGWSSRDPVPAQRFSGDWYVGLGVASATYPAMQMPGNSARITYADEGRYLVQIGAADIGTGTWTTLTQIAADALGC DVAAVDLQIGDSALPEASVAGGSSGINSWGRAIVTAARQFRRDHGDPPAIGATTVAEAPENPESEKFTMQSFGAHFVEAR VNRDTGEIRVPRMLGVFSIGRAINARTLRSQLIGGMTMGLSMALHEESVRDPRFGHVVTQDFATYHISAHADVADIDAIW LDEADEHANPMGSRGAGEIGIVGSAAAVVNAVYNATGVRVRDLPVTLDKVLSGLP
Specific function: Unknown
COG id: COG1529
COG function: function code C; Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the xanthine dehydrogenase family [H]
Homologues:
- Organism=Homo sapiens, GI91823271, Length=747, Percent_Identity=23.6947791164659, Blast_Score=134, Evalue=3e-31
- Organism=Homo sapiens, GI71773480, Length=746, Percent_Identity=23.3243967828418, Blast_Score=125, Evalue=1e-28
- Organism=Escherichia coli, GI1786478, Length=722, Percent_Identity=38.781163434903, Blast_Score=439, Evalue=1e-124
- Organism=Escherichia coli, GI1789230, Length=755, Percent_Identity=27.5496688741722, Blast_Score=210, Evalue=2e-55
- Organism=Escherichia coli, GI1789246, Length=802, Percent_Identity=25.6857855361596, Blast_Score=183, Evalue=3e-47
- Organism=Caenorhabditis elegans, GI17540638, Length=659, Percent_Identity=23.5204855842185, Blast_Score=112, Evalue=5e-25
- Organism=Drosophila melanogaster, GI17737937, Length=686, Percent_Identity=24.4897959183673, Blast_Score=141, Evalue=2e-33
- Organism=Drosophila melanogaster, GI24647199, Length=685, Percent_Identity=21.8978102189781, Blast_Score=100, Evalue=3e-21
- Organism=Drosophila melanogaster, GI24647195, Length=709, Percent_Identity=21.8617771509168, Blast_Score=99, Evalue=9e-21
- Organism=Drosophila melanogaster, GI24647201, Length=649, Percent_Identity=21.1093990755008, Blast_Score=96, Evalue=9e-20
- Organism=Drosophila melanogaster, GI24647197, Length=664, Percent_Identity=21.8373493975904, Blast_Score=94, Evalue=3e-19
- Organism=Drosophila melanogaster, GI24647193, Length=669, Percent_Identity=21.3751868460389, Blast_Score=89, Evalue=1e-17
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000674
- InterPro: IPR008274 [H]
Pfam domain/function: PF01315 Ald_Xan_dh_C; PF02738 Ald_Xan_dh_C2 [H]
EC number: 1.17.1.4 [H]
Molecular weight: Translated: 73961; Mature: 73829
Theoretical pI: Translated: 4.79; Mature: 4.79
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
- 0.6 %Cys (Translated Protein)
- 2.2 %Met (Translated Protein)
- 2.7 %Cys+Met (Translated Protein)
- 0.6 %Cys (Mature Protein)
- 2.0 %Met (Mature Protein)
- 2.6 %Cys+Met (Mature Protein)
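The Cys/Met percentages can be recomputed from either protein sequence. This is a minimal sketch under the assumption that each figure is a residue count divided by the sequence length, rounded to one decimal place; `cys_met_content` is a hypothetical helper. Rounding the combined figure from the unrounded sum is one possible reason a listed Cys+Met value can differ by 0.1 from the sum of the two rounded components.

```python
def cys_met_content(protein: str):
    """Return (%Cys, %Met, %Cys+Met) for a one-letter protein sequence.

    Non-letter characters such as spaces are ignored. The combined value
    is rounded from the unrounded sum, not from the rounded components.
    """
    residues = [r for r in protein.upper() if r.isalpha()]
    if not residues:
        raise ValueError("empty protein sequence")
    n = len(residues)
    cys = 100.0 * residues.count("C") / n
    met = 100.0 * residues.count("M") / n
    return round(cys, 1), round(met, 1), round(cys + met, 1)


# Small illustrative peptide, not the protein in this record.
print(cys_met_content("MCAAAAAAAA"))
```

Running it on the Translated and Mature sequences in this record, each passed as a single string, gives values to compare against the six figures listed above.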
Secondary structure:
>Translated Secondary Structure MTLLDPQAIGAPMARLDGRAKVTGAAPYAFEQRVDDPAYLHPIQSTIARGRVAAVDTDAA CCCCCCHHHCCCHHHCCCCEEEECCCCHHHHHCCCCCHHHHHHHHHHHCCCEEEECCHHH RAIDGVLDVLTVFDAPELADTSDGELSILQDDRVHFRGQIIGGVVAETAEIARHAAGLVQ HHHHHHHHHHHHCCCCCCCCCCCCCEEEEECCCEEEEEEEHHHHHHHHHHHHHHHCCEEE VRYTEESHDVELTADHPGLYTPEQVNAGYPSDTDEGDVEAALASAEVTVDQTYSTPIEHN EEEECCCCCEEEEECCCCCCCCHHCCCCCCCCCCCCHHHHHHHHCEEEEECCCCCCCCCC NPMEPHAAIAIWTADGVTMFDSTQGVHAARKALAPLFGLEPDQLRVIAPHVGGGFGSKGA CCCCCCEEEEEEECCCEEEECCCCCHHHHHHHHHHHHCCCCCCEEEECCCCCCCCCCCCC PHAHDVLALLAAQRSGGRPVKLALTRQQMFALVGYRTPTIQRIRLGANQDGTLTALAHEV CHHHHHHHHHHHHCCCCCEEEEEEEHHHHHHHHCCCCCCEEEEEECCCCCCHHHHHHHHH VEQTSAVKEYAEQTAVTSRKMYAAPNRRTAHRLAALDVPVPFWFRAPGECPGAYAAEVAM HHHHHHHHHHHHHHHHHHHHEECCCCCCHHHHHEEECCCCCCEECCCCCCCCHHHHHHHH DELAAACGVDPIELRVRNDPEVDPETGNPWSGRHLVECLRLGAERFGWSSRDPVPAQRFS HHHHHHHCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCHHHCC GDWYVGLGVASATYPAMQMPGNSARITYADEGRYLVQIGAADIGTGTWTTLTQIAADALG CCEEEEEEECCCCCCHHCCCCCCCEEEEECCCCEEEEECCCCCCCCCHHHHHHHHHHHHC CDVAAVDLQIGDSALPEASVAGGSSGINSWGRAIVTAARQFRRDHGDPPAIGATTVAEAP CCEEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHCCC ENPESEKFTMQSFGAHFVEARVNRDTGEIRVPRMLGVFSIGRAINARTLRSQLIGGMTMG CCCCCCCEEHHHHCCEEEEHEECCCCCCEECCHHHHHHHHHHHHHHHHHHHHHHCCHHHH LSMALHEESVRDPRFGHVVTQDFATYHISAHADVADIDAIWLDEADEHANPMGSRGAGEI HHHHHHHHHCCCCCCCCEEECCCEEEEEECCCCCHHCCEEECCCCHHHCCCCCCCCCCCE GIVGSAAAVVNAVYNATGVRVRDLPVTLDKVLSGLP EEECHHHHHHHHHHCCCCCEEEECCCCHHHHHCCCC >Mature Secondary Structure TLLDPQAIGAPMARLDGRAKVTGAAPYAFEQRVDDPAYLHPIQSTIARGRVAAVDTDAA CCCCCHHHCCCHHHCCCCEEEECCCCHHHHHCCCCCHHHHHHHHHHHCCCEEEECCHHH RAIDGVLDVLTVFDAPELADTSDGELSILQDDRVHFRGQIIGGVVAETAEIARHAAGLVQ HHHHHHHHHHHHCCCCCCCCCCCCCEEEEECCCEEEEEEEHHHHHHHHHHHHHHHCCEEE VRYTEESHDVELTADHPGLYTPEQVNAGYPSDTDEGDVEAALASAEVTVDQTYSTPIEHN EEEECCCCCEEEEECCCCCCCCHHCCCCCCCCCCCCHHHHHHHHCEEEEECCCCCCCCCC NPMEPHAAIAIWTADGVTMFDSTQGVHAARKALAPLFGLEPDQLRVIAPHVGGGFGSKGA CCCCCCEEEEEEECCCEEEECCCCCHHHHHHHHHHHHCCCCCCEEEECCCCCCCCCCCCC 
PHAHDVLALLAAQRSGGRPVKLALTRQQMFALVGYRTPTIQRIRLGANQDGTLTALAHEV CHHHHHHHHHHHHCCCCCEEEEEEEHHHHHHHHCCCCCCEEEEEECCCCCCHHHHHHHHH VEQTSAVKEYAEQTAVTSRKMYAAPNRRTAHRLAALDVPVPFWFRAPGECPGAYAAEVAM HHHHHHHHHHHHHHHHHHHHEECCCCCCHHHHHEEECCCCCCEECCCCCCCCHHHHHHHH DELAAACGVDPIELRVRNDPEVDPETGNPWSGRHLVECLRLGAERFGWSSRDPVPAQRFS HHHHHHHCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCHHHCC GDWYVGLGVASATYPAMQMPGNSARITYADEGRYLVQIGAADIGTGTWTTLTQIAADALG CCEEEEEEECCCCCCHHCCCCCCCEEEEECCCCEEEEECCCCCCCCCHHHHHHHHHHHHC CDVAAVDLQIGDSALPEASVAGGSSGINSWGRAIVTAARQFRRDHGDPPAIGATTVAEAP CCEEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHCCC ENPESEKFTMQSFGAHFVEARVNRDTGEIRVPRMLGVFSIGRAINARTLRSQLIGGMTMG CCCCCCCEEHHHHCCEEEEHEECCCCCCEECCHHHHHHHHHHHHHHHHHHHHHHCCHHHH LSMALHEESVRDPRFGHVVTQDFATYHISAHADVADIDAIWLDEADEHANPMGSRGAGEI HHHHHHHHHCCCCCCCCEEECCCEEEEEECCCCCHHCCEEECCCCHHHCCCCCCCCCCCE GIVGSAAAVVNAVYNATGVRVRDLPVTLDKVLSGLP EEECHHHHHHHHHHCCCCCEEEECCCCHHHHHCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]