Definition | Mycobacterium bovis BCG str. Pasteur 1173P2, complete genome. |
---|---|
Accession | NC_008769 |
Length | 4,374,522 |
The map label for this gene is ybjS [C]
Identifier: 121637037
GI number: 121637037
Start: 1264259
End: 1265371
Strand: Reverse
Name: ybjS [C]
Synonym: BCG_1166c
Alternate gene names: 121637037
Gene position: 1265371-1264259 (Counterclockwise)
Preceding gene: 121637038
Following gene: 121637034
Centisome position: 28.93
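The coordinate fields above are mutually consistent, and a short sketch can make the arithmetic explicit. It assumes the length is the inclusive span End - Start + 1 and that the centisome position is the gene's 5' coordinate (1265371 for this reverse-strand gene) as a percentage of the genome length. The second assumption is inferred from the listed numbers, not stated by the record, and the function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 4_374_522
START, END = 1_264_259, 1_265_371


def gene_length(start: int, end: int) -> int:
    """Inclusive length of a feature given 1-based coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the genome length."""
    return 100.0 * position / genome_length


if __name__ == "__main__":
    print(gene_length(START, END))                   # 1113
    # Assumption: the reverse-strand gene is anchored at its 5' end (END).
    print(round(centisome(END, GENOME_LENGTH), 2))   # 28.93
```

Using Start instead of End gives about 28.90, so the listed 28.93 matches the higher coordinate.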
GC content: 64.87
Gene sequence:
>1113_bases ATGCTTCGCCGCATGGGTGATGCATCGCTGACAACCGAGCTCGGCCGCGTTCTGGTCACCGGCGGCGCGGGCTTCGTGGG CGCCAACCTGGTGACCACCTTGCTGGACCGCGGGCACTGGGTGCGTTCCTTCGACCGCGCGCCGTCGCTGTTGCCTGCGC ATCCGCAACTGGAGGTGCTGCAAGGGGACATCACCGACGCGGACGTCTGCGCCGCGGCCGTGGACGGCATCGACACGATC TTCCACACCGCAGCGATCATCGAGCTGATGGGCGGCGCGTCGGTCACCGACGAGTACCGCCAACGTAGCTTTGCGGTCAA CGTCGGCGGCACCGAGAACCTGCTGCACGCCGGCCAGCGGGCCGGGGTGCAGCGGTTCGTCTACACGTCATCCAACAGTG TGGTGATGGGCGGCCAGAACATCGCCGGCGGTGACGAGACGCTGCCCTATACCGACCGGTTCAACGACCTCTACACCGAG ACCAAGGTGGTTGCCGAGCGATTCGTGTTGGCCCAGAACGGTGTCGACGGCATGCTGACGTGCGCGATCCGGCCCAGCGG CATCTGGGGAAACGGCGATCAGACGATGTTCCGCAAGCTGTTCGAAAGTGTGCTCAAGGGCCACGTCAAGGTGCTGGTCG GGCGCAAGTCGGCCCGGCTGGATAACTCTTACGTGCACAACCTGATTCACGGTTTCATCTTGGCCGCTGCCCATCTGGTG CCGGACGGCACAGCGCCCGGGCAGGCTTACTTCATCAACGACGCAGAGCCGATCAATATGTTCGAGTTCGCTCGGCCGGT GCTCGAGGCGTGCGGGCAGCGCTGGCCGAAGATGCGGATTTCCGGCCCCGCGGTCCGCTGGGTAATGACGGGGTGGCAGC GGCTGCACTTCCGGTTCGGATTCCCCGCGCCGCTGCTCGAGCCGCTGGCCGTCGAACGACTGTACCTGGACAACTACTTT TCGATCGCTAAGGCACGCCGCGACCTGGGCTATGAGCCGCTGTTCACCACCCAGCAGGCGCTGACCGAATGCCTGCCGTA CTACGTGAGTCTGTTTGAGCAGATGAAGAACGAGGCCCGGGCGGAAAAAACGGCCGCCACAGTCAAGCCGTAG
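The GC content field can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming GC content is simply the percentage of G and C bases; `gc_content` is a hypothetical helper, and it strips whitespace because the sequence here is broken into space-separated blocks.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace is removed first, so block-wrapped sequences can be pasted in.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


if __name__ == "__main__":
    # Toy input only; paste the 1113-base sequence to check the listed value.
    print(gc_content("ATGC TTCG"))  # 50.0
```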
Upstream 100 bases:
>100_bases GCAACTCGCCAAAAGGTGCGAGGAGCACTTAGCTGGTGCACGCCAGCGGGTGTCCGATGTGCTGGCCGGCGACGAGGCCC AAAACGGCTAAGGCAAGGTT
Downstream 100 bases:
>100_bases CCAGAATTATCTGAAACTCACCACTTGCTGCCCCAGGTCGCTCGGATGTGTGCGTCGACGTCGTGCACGACGGCGTCTCG CCTGCCGATAATCAGGCAGG
Product: putative cholesterol dehydrogenase
Products: NA
Alternate protein names: Cholesterol dehydrogenase; 3-beta-hydroxy-Delta(5)-steroid dehydrogenase; 3-beta-HSD; 3BHSD; 3-beta hydroxysterol dehydrogenase; 3-beta-hydroxy-5-ene steroid dehydrogenase; Progesterone reductase; Steroid Delta-isomerase; Delta-5-3-ketosteroid isomerase
Number of amino acids: Translated: 370; Mature: 370
Protein sequence:
>370_residues MLRRMGDASLTTELGRVLVTGGAGFVGANLVTTLLDRGHWVRSFDRAPSLLPAHPQLEVLQGDITDADVCAAAVDGIDTI FHTAAIIELMGGASVTDEYRQRSFAVNVGGTENLLHAGQRAGVQRFVYTSSNSVVMGGQNIAGGDETLPYTDRFNDLYTE TKVVAERFVLAQNGVDGMLTCAIRPSGIWGNGDQTMFRKLFESVLKGHVKVLVGRKSARLDNSYVHNLIHGFILAAAHLV PDGTAPGQAYFINDAEPINMFEFARPVLEACGQRWPKMRISGPAVRWVMTGWQRLHFRFGFPAPLLEPLAVERLYLDNYF SIAKARRDLGYEPLFTTQQALTECLPYYVSLFEQMKNEARAEKTAATVKP
Sequences:
>Translated_370_residues MLRRMGDASLTTELGRVLVTGGAGFVGANLVTTLLDRGHWVRSFDRAPSLLPAHPQLEVLQGDITDADVCAAAVDGIDTI FHTAAIIELMGGASVTDEYRQRSFAVNVGGTENLLHAGQRAGVQRFVYTSSNSVVMGGQNIAGGDETLPYTDRFNDLYTE TKVVAERFVLAQNGVDGMLTCAIRPSGIWGNGDQTMFRKLFESVLKGHVKVLVGRKSARLDNSYVHNLIHGFILAAAHLV PDGTAPGQAYFINDAEPINMFEFARPVLEACGQRWPKMRISGPAVRWVMTGWQRLHFRFGFPAPLLEPLAVERLYLDNYF SIAKARRDLGYEPLFTTQQALTECLPYYVSLFEQMKNEARAEKTAATVKP >Mature_370_residues MLRRMGDASLTTELGRVLVTGGAGFVGANLVTTLLDRGHWVRSFDRAPSLLPAHPQLEVLQGDITDADVCAAAVDGIDTI FHTAAIIELMGGASVTDEYRQRSFAVNVGGTENLLHAGQRAGVQRFVYTSSNSVVMGGQNIAGGDETLPYTDRFNDLYTE TKVVAERFVLAQNGVDGMLTCAIRPSGIWGNGDQTMFRKLFESVLKGHVKVLVGRKSARLDNSYVHNLIHGFILAAAHLV PDGTAPGQAYFINDAEPINMFEFARPVLEACGQRWPKMRISGPAVRWVMTGWQRLHFRFGFPAPLLEPLAVERLYLDNYF SIAKARRDLGYEPLFTTQQALTECLPYYVSLFEQMKNEARAEKTAATVKP
Specific function: 3-beta-HSD is a bifunctional enzyme that catalyzes the oxidation and isomerization of cholesterol, pregnenolone, and dehydroepiandrosterone (DHEA) into cholest-4-en-3-one, progesterone, and androstenedione, respectively
COG id: COG0451
COG function: function code MG; Nucleoside-diphosphate-sugar epimerases
Gene ontology:
Cell location: Cytoplasm
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the 3-beta-HSD family
Homologues:
Organism=Homo sapiens, GI193211614, Length=335, Percent_Identity=32.5373134328358, Blast_Score=170, Evalue=2e-42
Organism=Homo sapiens, GI8393516, Length=335, Percent_Identity=32.5373134328358, Blast_Score=170, Evalue=2e-42
Organism=Homo sapiens, GI116268111, Length=341, Percent_Identity=35.7771260997067, Blast_Score=168, Evalue=6e-42
Organism=Homo sapiens, GI310132178, Length=332, Percent_Identity=32.8313253012048, Blast_Score=166, Evalue=4e-41
Organism=Homo sapiens, GI310113012, Length=332, Percent_Identity=32.8313253012048, Blast_Score=166, Evalue=4e-41
Organism=Homo sapiens, GI239745448, Length=332, Percent_Identity=32.8313253012048, Blast_Score=166, Evalue=4e-41
Organism=Homo sapiens, GI260763931, Length=376, Percent_Identity=29.5212765957447, Blast_Score=120, Evalue=1e-27
Organism=Homo sapiens, GI4504509, Length=376, Percent_Identity=29.5212765957447, Blast_Score=120, Evalue=1e-27
Organism=Homo sapiens, GI4504507, Length=378, Percent_Identity=29.8941798941799, Blast_Score=119, Evalue=5e-27
Organism=Homo sapiens, GI19923621, Length=373, Percent_Identity=32.9758713136729, Blast_Score=115, Evalue=9e-26
Organism=Homo sapiens, GI218563686, Length=209, Percent_Identity=33.9712918660287, Blast_Score=77, Evalue=3e-14
Organism=Homo sapiens, GI218563684, Length=209, Percent_Identity=33.9712918660287, Blast_Score=77, Evalue=3e-14
Organism=Escherichia coli, GI87081792, Length=335, Percent_Identity=23.5820895522388, Blast_Score=71, Evalue=1e-13
Organism=Caenorhabditis elegans, GI17570557, Length=369, Percent_Identity=29.5392953929539, Blast_Score=125, Evalue=3e-29
Organism=Caenorhabditis elegans, GI17509709, Length=372, Percent_Identity=29.5698924731183, Blast_Score=122, Evalue=2e-28
Organism=Caenorhabditis elegans, GI193211272, Length=334, Percent_Identity=27.5449101796407, Blast_Score=109, Evalue=2e-24
Organism=Saccharomyces cerevisiae, GI6321437, Length=361, Percent_Identity=28.808864265928, Blast_Score=119, Evalue=5e-28
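Each homologue record above follows the same pattern (Organism=, GI, Length=, Percent_Identity=, Blast_Score=, Evalue=), so the list can be read into structured rows for sorting or filtering. The following is a minimal sketch; the regular expression and the `parse_homologues` name are assumptions made for illustration, and the pattern tolerates records separated by commas, spaces, or line breaks.

```python
import re

# One homologue record in the key=value layout used by this page.
RECORD = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*"
    r"GI(?P<gi>\d+),\s*"
    r"Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<identity>[\d.]+),\s*"
    r"Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[0-9.eE+-]+)"
)


def parse_homologues(text: str) -> list[dict]:
    """Return one dict per homologue record found in the text."""
    rows = []
    for m in RECORD.finditer(text):
        rows.append({
            "organism": m["organism"].strip(),
            "gi": m["gi"],
            "length": int(m["length"]),
            "identity": float(m["identity"]),
            "score": int(m["score"]),
            "evalue": float(m["evalue"]),
        })
    return rows


if __name__ == "__main__":
    sample = (
        "Organism=Homo sapiens, GI193211614, Length=335, "
        "Percent_Identity=32.5373134328358, Blast_Score=170, Evalue=2e-42, "
        "Organism=Escherichia coli, GI87081792, Length=335, "
        "Percent_Identity=23.5820895522388, Blast_Score=71, Evalue=1e-13,"
    )
    for row in parse_homologues(sample):
        print(row["organism"], row["score"], row["evalue"])
```

Keeping the GI as a string avoids treating an identifier as a number; the numeric fields are converted so that hits can be ranked by score or E-value.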
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): 3BHS_MYCTU (O53454)
Other databases:
- EMBL: AE000516
- EMBL: BX842575
- PIR: H70897
- RefSeq: NP_215622.1
- RefSeq: NP_335580.1
- ProteinModelPortal: O53454
- EnsemblBacteria: EBMYCT00000002215
- EnsemblBacteria: EBMYCT00000071008
- GeneID: 886004
- GeneID: 924962
- GenomeReviews: AE000516_GR
- GenomeReviews: AL123456_GR
- KEGG: mtc:MT1137
- KEGG: mtu:Rv1106c
- TIGR: MT1137
- TubercuList: Rv1106c
- GeneTree: EBGT00050000015184
- HOGENOM: HBG755066
- OMA: TKFIQAD
- ProtClustDB: CLSK790953
- GO: GO:0005829
- Gene3D: G3DSA:3.40.50.720
Pfam domain/function: NA
EC number: 1.1.1.145; 5.3.3.1
Molecular weight: Translated: 40742; Mature: 40742
Theoretical pI: Translated: 7.01; Mature: 7.01
Prosite motif: NA
Important sites: ACT_SITE 158-158 BINDING 162-162
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.1 %Cys (Translated Protein)
2.7 %Met (Translated Protein)
3.8 %Cys+Met (Translated Protein)
1.1 %Cys (Mature Protein)
2.7 %Met (Mature Protein)
3.8 %Cys+Met (Mature Protein)
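The Cys/Met percentages above are simple residue fractions. Counting residues in the 370-residue protein sequence listed earlier gives 4 Cys and 10 Met, which is consistent with the rounded values. The sketch below shows the calculation; `residue_percent` is a hypothetical helper, and the counts in the usage lines come from that manual count rather than from the record itself.

```python
def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    hits = sum(1 for aa in cleaned if aa in residues.upper())
    return 100.0 * hits / len(cleaned)


if __name__ == "__main__":
    # Toy sequence: 1 Cys and 1 Met in 4 residues.
    print(residue_percent("MCAA", "C"))    # 25.0
    print(residue_percent("MCAA", "CM"))   # 50.0

    # Arithmetic for this protein, using the manual counts (assumption):
    print(round(100 * 4 / 370, 1))         # 1.1  (%Cys)
    print(round(100 * 10 / 370, 1))        # 2.7  (%Met)
    print(round(100 * 14 / 370, 1))        # 3.8  (%Cys+Met)
```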
Secondary structure:
>Translated Secondary Structure MLRRMGDASLTTELGRVLVTGGAGFVGANLVTTLLDRGHWVRSFDRAPSLLPAHPQLEVL CCCCCCCHHHHHHHCCEEEECCCCHHHHHHHHHHHHCCCHHHHHHCCCCCCCCCCCHHHH QGDITDADVCAAAVDGIDTIFHTAAIIELMGGASVTDEYRQRSFAVNVGGTENLLHAGQR CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCEEEEECCCHHHHHHHHHH AGVQRFVYTSSNSVVMGGQNIAGGDETLPYTDRFNDLYTETKVVAERFVLAQNGVDGMLT HCCEEEEEECCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCEEE CAIRPSGIWGNGDQTMFRKLFESVLKGHVKVLVGRKSARLDNSYVHNLIHGFILAAAHLV EEEECCCCCCCCCHHHHHHHHHHHHHHHHEEEECCCHHHCCHHHHHHHHHHHHHHHHHHC PDGTAPGQAYFINDAEPINMFEFARPVLEACGQRWPKMRISGPAVRWVMTGWQRLHFRFG CCCCCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHEECC FPAPLLEPLAVERLYLDNYFSIAKARRDLGYEPLFTTQQALTECLPYYVSLFEQMKNEAR CCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH AEKTAATVKP HHHHHCCCCC >Mature Secondary Structure MLRRMGDASLTTELGRVLVTGGAGFVGANLVTTLLDRGHWVRSFDRAPSLLPAHPQLEVL CCCCCCCHHHHHHHCCEEEECCCCHHHHHHHHHHHHCCCHHHHHHCCCCCCCCCCCHHHH QGDITDADVCAAAVDGIDTIFHTAAIIELMGGASVTDEYRQRSFAVNVGGTENLLHAGQR CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCEEEEECCCHHHHHHHHHH AGVQRFVYTSSNSVVMGGQNIAGGDETLPYTDRFNDLYTETKVVAERFVLAQNGVDGMLT HCCEEEEEECCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCEEE CAIRPSGIWGNGDQTMFRKLFESVLKGHVKVLVGRKSARLDNSYVHNLIHGFILAAAHLV EEEECCCCCCCCCHHHHHHHHHHHHHHHHEEEECCCHHHCCHHHHHHHHHHHHHHHHHHC PDGTAPGQAYFINDAEPINMFEFARPVLEACGQRWPKMRISGPAVRWVMTGWQRLHFRFG CCCCCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHEECC FPAPLLEPLAVERLYLDNYFSIAKARRDLGYEPLFTTQQALTECLPYYVSLFEQMKNEAR CCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH AEKTAATVKP HHHHHCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9634230; 12218036