| Definition | Mycobacterium sp. KMS chromosome, complete genome. |
|---|---|
| Accession | NC_008705 |
| Length | 5,737,227 |
The map label for this gene is 119867149
Identifier: 119867149
GI number: 119867149
Start: 1184185
End: 1186224
Strand: Reverse
Name: 119867149
Synonym: Mkms_1097
Alternate gene names: NA
Gene position: 1186224-1184185 (Counterclockwise)
Preceding gene: 119867150
Following gene: 119867148
Centisome position: 20.68
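The coordinate fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates, and assumes that the centisome position is the gene's first base (the higher coordinate for a reverse-strand gene) expressed as a percentage of the chromosome length. Neither convention is stated in this record, and the function names are illustrative only.

```python
GENOME_LENGTH = 5_737_227            # chromosome length from the record header
START, END = 1_184_185, 1_186_224    # gene coordinates from the record


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of genome length (assumed definition)."""
    return round(100 * position / genome_length, 2)


length = gene_length(START, END)
# On the reverse strand the gene begins at the higher coordinate (END).
position = centisome(END, GENOME_LENGTH)
print(length, position)
```

Under these assumptions the results agree with the 2040-base gene sequence and the centisome position of 20.68 listed in this record.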
GC content: 70.2
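A GC content figure like the one above is conventionally the percentage of G and C bases in the sequence. The following is a minimal sketch of that calculation, assuming the value is computed over the gene sequence alone and that whitespace in the pasted sequence should be ignored. The record does not say how its figure was derived, and `gc_content` is a hypothetical helper name.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return round(100 * gc / len(bases), 1)


# Toy inputs only; paste the gene sequence from this record to check it.
toy_result = gc_content("ATGC GGCC")
print(toy_result)
```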
Gene sequence:
>2040_bases ATGACCCGGACCCGCCCACCGCAGAGCCCTCGGCGCGTGTTCCGCGACCGCCGCGAGGCCGGTCGGGTGCTGGCGGACCT GCTCACCGCATACCGCGGGCGAGATGACGTCATCGTGCTGGGGCTGGCACGCGGCGGTGTCCCGGTCGCGTGGGAGGTGG CCGCGGCCCTCGGCGCTCCCCTGGACGCGTTCATCGTGCGCAAGCTGGGGGCGCCGGGGCATGAGGAGTTCGCGATGGGG GCGCTGGCCACCGGCGGGCGCGTCGTGGTCAACGACGACATCGTGCGCGCGCTGCGCGTCAGCCCGCAGCAACTCCGAGA CATCGCCGAGCGGGAAGGGCACGAGCTCTTCCGCAGGGAAGCCGCCTACCGGGCCGGGCGTCCCCCGCTCGACGTGTCCG GGAAGACGGTCGTCCTCGTCGACGACGGGCTGGCGACCGGGGCCAGCATGATGGCGGCGATACAGGCCCTGCGTGACGCC GGCCCGGCCGAGATCGTGGTCGCGGTGCCCGCCGCGCCGGAGTCGACCTGCCACGAGATACTCGGCGTCGCCGACGATCT GGTCTGCGCGAGCATGCCGACGCCGTTCGTCGCTGTCGGTGAATCGTATTGGGATTTCCGACAGGTAAGCGACGAAGAAG TCCGCGAGCACCTCGCCACCCCAACGACCGGCAGCGCCGCCACACCCGCGCCGGCCGCCCTGACACCGACCGCCATCGTG GGTGGGTGTGCGGTGGACGCGCCGGGCGGCGTGCCGCCACTCGACGCGCTGGAAGCGATCGTCGGTGACGCCAGGGTGGT CCTGATCGGCGAGGGCTCACACGGGACCCACGAGTTCTACGCTGCCCGCGCCGCGATCACCAGGTGGCTCATCGAGCAGA AGGGCTTCTGCGCGGTCGCCGCGGAAGCCGACTGGCCCGACGCCTACCGGGTCAACCGGTACGTCCGCGGCGAGGGCGAG GACACCACCGCCGACGCGGCGTTGCGCGGATTCCAGCGCTTCCCGGCGTGGATGTGGCGCAACGTGGTGGTGCGGGACTT CGCCGAGTGGCTGCAGGCCCACAATCGGCAGCGCCGATCCCTCGGTCAGCGCCAGACCGGCTTCTACGGCCTGGACCTCT ACAGCCTGCATCGCTCCATGGAAGAGGTGATCTCCTATCTCGACGGGGTGGACCCGCGCGCGGCAGATCGTGCGCGCCGC CGTTACGCCTGCTTCGACCACGCGACCGCCGACGACGGTCAGGCCTACGGTTACGCCGCCGCCTTCGGCGCGGGTCTGTC CTGTGAGCGTGAGGCGGTCGATCAGCTGATCGACATGCACCACAGCGCAATCGATTACCTCCACCACGACGGCCTGGTCG CCGAGGACGAGTTGTTCTACGCCCAGCAGAACGCCCAGACCGTCCGCGACGCCGAGGTCTACTACCGGGCGATGTTCAGC GGCCGTGTGACGTCGTGGAATCTGCGAGACGAACACATGGCCAGAACACTGGAGTCGCTGCTGACCCACCTCGATCGCCA CCCCGGCGCGGGCCCGGCACGAATCGTGTTGTGGGCGCACAACTCCCACGTCGGTGACGCCAGGGCCACCGAGGTGTCCT CCGACGGCCAGCTCACCCTGGGGCAGCTGGCGCGTCAACGGTTCGGCGACGACTGCCGCCTGATCGGCCTGACCACCTAC ACGGGCACCGTCACCGCGGCCAGTGAGTGGGGCGGTGTCGCCGAGCGGAAGGTCGTCCGGCCTGCACTGAACGGCAGCGT GGAAGAGCTGTTCCACGAAACCGATCGGCCGGAGTTCGTCATCTCGGCGTTGATCGACCGTGCCGCCGAGGAACCGCTGT CGACGGTGCGGTTGGGCCGGGCGATCGGCGTCATCTACCTCCCGGCCACCGAGCGGCAGAGCCACTACTACCACGTGCGG 
CCCGCCGATCAGTACGACGCCATCATCCACATCGACCGGACGCGGGCGCTCGAGCCGCTGGAGGTCACCAGCGAATGGGT GGCGGGCGAGACCCCCGAGACGTATCCCAGCGGTTTGTGA
Upstream 100 bases:
>100_bases TCACGGTGACCGTGCCGCTCAAGGAGCCCGCCAGCCCGGAGAAGCACGTCACGATCAAGTCGACGGACTGACCGTCCCGG TCACGGATGAGCAGACGAGG
Downstream 100 bases:
>100_bases ACCACCCGAAAACCCCTGAAAACCCTGCAATGCGGGACCTTCGACCCTGGCGCAGCGGACCAGGCCGTTCCTAGGGTCGA GGTATGGGAAGGCAACGACC
Product: erythromycin esterase
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 679; Mature: 678
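The two residue counts can be related to the 2040-base gene length. The sketch below assumes that the gene length includes the terminal stop codon (the listed gene sequence ends in TGA) and that the mature form differs from the translated form only by removal of the initiator methionine. The second point is an assumption, consistent with the Mature sequence in this record lacking the leading M.

```python
GENE_LENGTH_BASES = 2040  # from the gene sequence header in this record

codons = GENE_LENGTH_BASES // 3      # 3 bases per codon
translated_residues = codons - 1     # stop codon encodes no residue
mature_residues = translated_residues - 1  # initiator Met removed (assumed)

print(translated_residues, mature_residues)
```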
Protein sequence:
>679_residues MTRTRPPQSPRRVFRDRREAGRVLADLLTAYRGRDDVIVLGLARGGVPVAWEVAAALGAPLDAFIVRKLGAPGHEEFAMG ALATGGRVVVNDDIVRALRVSPQQLRDIAEREGHELFRREAAYRAGRPPLDVSGKTVVLVDDGLATGASMMAAIQALRDA GPAEIVVAVPAAPESTCHEILGVADDLVCASMPTPFVAVGESYWDFRQVSDEEVREHLATPTTGSAATPAPAALTPTAIV GGCAVDAPGGVPPLDALEAIVGDARVVLIGEGSHGTHEFYAARAAITRWLIEQKGFCAVAAEADWPDAYRVNRYVRGEGE DTTADAALRGFQRFPAWMWRNVVVRDFAEWLQAHNRQRRSLGQRQTGFYGLDLYSLHRSMEEVISYLDGVDPRAADRARR RYACFDHATADDGQAYGYAAAFGAGLSCEREAVDQLIDMHHSAIDYLHHDGLVAEDELFYAQQNAQTVRDAEVYYRAMFS GRVTSWNLRDEHMARTLESLLTHLDRHPGAGPARIVLWAHNSHVGDARATEVSSDGQLTLGQLARQRFGDDCRLIGLTTY TGTVTAASEWGGVAERKVVRPALNGSVEELFHETDRPEFVISALIDRAAEEPLSTVRLGRAIGVIYLPATERQSHYYHVR PADQYDAIIHIDRTRALEPLEVTSEWVAGETPETYPSGL
Sequences:
>Translated_679_residues MTRTRPPQSPRRVFRDRREAGRVLADLLTAYRGRDDVIVLGLARGGVPVAWEVAAALGAPLDAFIVRKLGAPGHEEFAMG ALATGGRVVVNDDIVRALRVSPQQLRDIAEREGHELFRREAAYRAGRPPLDVSGKTVVLVDDGLATGASMMAAIQALRDA GPAEIVVAVPAAPESTCHEILGVADDLVCASMPTPFVAVGESYWDFRQVSDEEVREHLATPTTGSAATPAPAALTPTAIV GGCAVDAPGGVPPLDALEAIVGDARVVLIGEGSHGTHEFYAARAAITRWLIEQKGFCAVAAEADWPDAYRVNRYVRGEGE DTTADAALRGFQRFPAWMWRNVVVRDFAEWLQAHNRQRRSLGQRQTGFYGLDLYSLHRSMEEVISYLDGVDPRAADRARR RYACFDHATADDGQAYGYAAAFGAGLSCEREAVDQLIDMHHSAIDYLHHDGLVAEDELFYAQQNAQTVRDAEVYYRAMFS GRVTSWNLRDEHMARTLESLLTHLDRHPGAGPARIVLWAHNSHVGDARATEVSSDGQLTLGQLARQRFGDDCRLIGLTTY TGTVTAASEWGGVAERKVVRPALNGSVEELFHETDRPEFVISALIDRAAEEPLSTVRLGRAIGVIYLPATERQSHYYHVR PADQYDAIIHIDRTRALEPLEVTSEWVAGETPETYPSGL >Mature_678_residues TRTRPPQSPRRVFRDRREAGRVLADLLTAYRGRDDVIVLGLARGGVPVAWEVAAALGAPLDAFIVRKLGAPGHEEFAMGA LATGGRVVVNDDIVRALRVSPQQLRDIAEREGHELFRREAAYRAGRPPLDVSGKTVVLVDDGLATGASMMAAIQALRDAG PAEIVVAVPAAPESTCHEILGVADDLVCASMPTPFVAVGESYWDFRQVSDEEVREHLATPTTGSAATPAPAALTPTAIVG GCAVDAPGGVPPLDALEAIVGDARVVLIGEGSHGTHEFYAARAAITRWLIEQKGFCAVAAEADWPDAYRVNRYVRGEGED TTADAALRGFQRFPAWMWRNVVVRDFAEWLQAHNRQRRSLGQRQTGFYGLDLYSLHRSMEEVISYLDGVDPRAADRARRR YACFDHATADDGQAYGYAAAFGAGLSCEREAVDQLIDMHHSAIDYLHHDGLVAEDELFYAQQNAQTVRDAEVYYRAMFSG RVTSWNLRDEHMARTLESLLTHLDRHPGAGPARIVLWAHNSHVGDARATEVSSDGQLTLGQLARQRFGDDCRLIGLTTYT GTVTAASEWGGVAERKVVRPALNGSVEELFHETDRPEFVISALIDRAAEEPLSTVRLGRAIGVIYLPATERQSHYYHVRP ADQYDAIIHIDRTRALEPLEVTSEWVAGETPETYPSGL
Specific function: Unknown
COG id: COG2312
COG function: function code R; Erythromycin esterase homolog
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: In the N-terminal section; belongs to the purine/pyrimidine phosphoribosyltransferase family [H]
Homologues:
None
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR007815
- InterPro: IPR000836 [H]
Pfam domain/function: PF05139 Erythro_esteras; PF00156 Pribosyltran [H]
EC number: NA
Molecular weight: Translated: 74239; Mature: 74108
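The gap between the two molecular weights can be checked against the mass of a single methionine residue. This is a sketch under the assumption that the mature protein is the translated protein minus its initiator methionine. The residue mass used (about 131.19 Da, average mass of Met minus water) is a standard value and is not taken from this record.

```python
TRANSLATED_MW = 74239  # Da, from this record
MATURE_MW = 74108      # Da, from this record
MET_RESIDUE_MASS = 131.19  # Da, average mass of a Met residue (standard value)

difference = TRANSLATED_MW - MATURE_MW
# The record reports whole daltons, so allow a tolerance of 1 Da.
consistent = abs(difference - MET_RESIDUE_MASS) < 1.0
print(difference, consistent)
```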
Theoretical pI: Translated: 5.33; Mature: 5.33
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
- 1.0 %Cys (Translated Protein)
- 1.5 %Met (Translated Protein)
- 2.5 %Cys+Met (Translated Protein)
- 1.0 %Cys (Mature Protein)
- 1.3 %Met (Mature Protein)
- 2.4 %Cys+Met (Mature Protein)
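Percentages of this kind are residue counts divided by sequence length. The sketch below shows the calculation on a short made-up peptide. The helper name `residue_percent` is hypothetical, and the assumption is that the record's figures are simple percentages rounded to one decimal place.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    sequence = "".join(protein.split()).upper()
    if not sequence:
        raise ValueError("empty sequence")
    hits = sum(1 for aa in sequence if aa in residues.upper())
    return round(100 * hits / len(sequence), 1)


toy_peptide = "MACMKLLTAG"  # made-up 10-residue example, not from this record
cys = residue_percent(toy_peptide, "C")
met = residue_percent(toy_peptide, "M")
both = residue_percent(toy_peptide, "CM")
print(cys, met, both)
```

As a plausibility check on the reported values, counts of 7 Cys and 10 Met in 679 residues would round to 1.0 % and 1.5 %.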
Secondary structure:
>Translated Secondary Structure MTRTRPPQSPRRVFRDRREAGRVLADLLTAYRGRDDVIVLGLARGGVPVAWEVAAALGAP CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHHHCCC LDAFIVRKLGAPGHEEFAMGALATGGRVVVNDDIVRALRVSPQQLRDIAEREGHELFRRE HHHHHHHHHCCCCHHHHHHHHCCCCCEEEECHHHHHHHHCCHHHHHHHHHHHHHHHHHHH AAYRAGRPPLDVSGKTVVLVDDGLATGASMMAAIQALRDAGPAEIVVAVPAAPESTCHEI HHHHCCCCCCCCCCCEEEEECCCCCCHHHHHHHHHHHHCCCCCEEEEEECCCCHHHHHHH LGVADDLVCASMPTPFVAVGESYWDFRQVSDEEVREHLATPTTGSAATPAPAALTPTAIV HCCHHHHHHHCCCCCEEEECCCHHHHHHCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHHC GGCAVDAPGGVPPLDALEAIVGDARVVLIGEGSHGTHEFYAARAAITRWLIEQKGFCAVA CCCEECCCCCCCCHHHHHHHHCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCEEEE AEADWPDAYRVNRYVRGEGEDTTADAALRGFQRFPAWMWRNVVVRDFAEWLQAHNRQRRS ECCCCCCHHHHHHEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH LGQRQTGFYGLDLYSLHRSMEEVISYLDGVDPRAADRARRRYACFDHATADDGQAYGYAA CCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHH AFGAGLSCEREAVDQLIDMHHSAIDYLHHDGLVAEDELFYAQQNAQTVRDAEVYYRAMFS HHCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHCCHHHHHHHHHHHHHHC GRVTSWNLRDEHMARTLESLLTHLDRHPGAGPARIVLWAHNSHVGDARATEVSSDGQLTL CCCEECCCCHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCEEH GQLARQRFGDDCRLIGLTTYTGTVTAASEWGGVAERKVVRPALNGSVEELFHETDRPEFV HHHHHHHCCCCEEEEEEEEECCEEEECHHCCCHHHHHHHHHHHCCCHHHHHHHCCCHHHH ISALIDRAAEEPLSTVRLGRAIGVIYLPATERQSHYYHVRPADQYDAIIHIDRTRALEPL HHHHHHHHHHCHHHHHHHHHHEEEEEECCCCCCCCEEEECCCCCCCEEEEEECCCCCCCH EVTSEWVAGETPETYPSGL HHHHHHHCCCCCCCCCCCC >Mature Secondary Structure TRTRPPQSPRRVFRDRREAGRVLADLLTAYRGRDDVIVLGLARGGVPVAWEVAAALGAP CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHHHCCC LDAFIVRKLGAPGHEEFAMGALATGGRVVVNDDIVRALRVSPQQLRDIAEREGHELFRRE HHHHHHHHHCCCCHHHHHHHHCCCCCEEEECHHHHHHHHCCHHHHHHHHHHHHHHHHHHH AAYRAGRPPLDVSGKTVVLVDDGLATGASMMAAIQALRDAGPAEIVVAVPAAPESTCHEI HHHHCCCCCCCCCCCEEEEECCCCCCHHHHHHHHHHHHCCCCCEEEEEECCCCHHHHHHH LGVADDLVCASMPTPFVAVGESYWDFRQVSDEEVREHLATPTTGSAATPAPAALTPTAIV HCCHHHHHHHCCCCCEEEECCCHHHHHHCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHHC GGCAVDAPGGVPPLDALEAIVGDARVVLIGEGSHGTHEFYAARAAITRWLIEQKGFCAVA 
CCCEECCCCCCCCHHHHHHHHCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCEEEE AEADWPDAYRVNRYVRGEGEDTTADAALRGFQRFPAWMWRNVVVRDFAEWLQAHNRQRRS ECCCCCCHHHHHHEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH LGQRQTGFYGLDLYSLHRSMEEVISYLDGVDPRAADRARRRYACFDHATADDGQAYGYAA CCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHH AFGAGLSCEREAVDQLIDMHHSAIDYLHHDGLVAEDELFYAQQNAQTVRDAEVYYRAMFS HHCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHCCHHHHHHHHHHHHHHC GRVTSWNLRDEHMARTLESLLTHLDRHPGAGPARIVLWAHNSHVGDARATEVSSDGQLTL CCCEECCCCHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCEEH GQLARQRFGDDCRLIGLTTYTGTVTAASEWGGVAERKVVRPALNGSVEELFHETDRPEFV HHHHHHHCCCCEEEEEEEEECCEEEECHHCCCHHHHHHHHHHHCCCHHHHHHHCCCHHHH ISALIDRAAEEPLSTVRLGRAIGVIYLPATERQSHYYHVRPADQYDAIIHIDRTRALEPL HHHHHHHHHHCHHHHHHHHHHEEEEEECCCCCCCCEEEECCCCCCCEEEEEECCCCCCCH EVTSEWVAGETPETYPSGL HHHHHHHCCCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9634230; 12218036 [H]