Definition | Mycobacterium bovis BCG str. Pasteur 1173P2, complete genome. |
---|---|
Accession | NC_008769 |
Length | 4,374,522 |
The map label for this gene is pks17 [H]
Identifier: 121637571
GI number: 121637571
Start: 1891776
End: 1893284
Strand: Direct
Name: pks17 [H]
Synonym: BCG_1702
Alternate gene names: 121637571
Gene position: 1891776-1893284 (Clockwise)
Preceding gene: 121637570
Following gene: 121637572
Centisome position: 43.25
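The coordinate fields above are related by simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the centisome position is the start coordinate expressed as a percentage of the genome length. That definition is an assumption that happens to reproduce the listed value, and the function names are hypothetical.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


# Values taken from this record.
START, END, GENOME_LENGTH = 1891776, 1893284, 4374522

print(gene_length(START, END))                    # 1509
print(round(centisome(START, GENOME_LENGTH), 2))  # 43.25
```

The computed length of 1509 bases agrees with the header of the gene sequence in this record.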
GC content: 62.96
Gene sequence:
>1509_bases ATGGAGGCCGGACCGCAGCGGATTGCGCAGATGCTGGCCGAGTTAGTCGAGTTGTTCAAAACTGAAGCGCTGCATCGGCT TCCAGTCAAGTCATGGGATGTGCGGCACGCTCGGGAGGCGTATCGGTTCTTGAGCCAGGCGCGCCATGTCGGCAAAGTGG TGCTGACCATGCCGGACGCGTGGGCCGCGGGCACGGTGCTGATCACCGGTGGCACTGGGATGGCAGGTTCTGCGGTGGCG CGTCATCTGGTGAGTCGATACGGGGTGCGGCAGGTGGTGTTGGCCAGTCGTGCTGGTGAGCACACGGAGAGCGTCGCAGC ATTGGTGGACGAGCTCGGCTCGGCCGGCGCCCGAGTGCAGGTGGTGTCTTGCGATGTGGCCGATCGTGATGCGGTGGCGG GTTTGGTGGCAAGCCAACCAGATCTGACTGCAGTGTTTCATGCGGCTGGGGTTCTTGACGATGCGGTAATCACCGGATTG ACGCCGGAGCGGGTGGATAAGGTATTGCGGGCCAAGGTCGATGGGGCCTGGAATTTGCATGAGCTCACCCGGCACCTGGA TGTGTCAGCGTTTGTGTTGTTTTCGTCGATGGCCGGGATTGTGGGTGCGCCGGGCCAGGCCAATTATGCTGCAGCGAACG CGTTTTTGGACGGGTTGGCGGCCTATCGGCGATCACGTGGACTGGCCGCGTTGTCGGTGGCGTGGGGATTGTGGGAGCAG GCTTCGGCGATGACCGAGCATTTAGGCGAGCGGGATCGGGTCCGGATGAGTCGGGTTGGACTGGCGCCGTTGCCTACCAA CCAGGCGATGGGATTCCTGGATGCCGCGTTGCTGGCGGATCGGCCCGTGGTGGTGGCTGCTCGGCTGGATCGTGCCGCGC TGGCCGGTGCCGAGCTGCCGGCACTATTTAGCCAGTTGGTTGCCGGTCCGATCCGACGGATCATCGACGGCGCCGATGAG GTGTCGGGGTCGGGATTGGCGTCGCGGCTGCACGGGCTGACTCCCGAGCAGCGGCACCGCGAACTCACCGAGTTAGTATG TAGCAACGCCGCGATCGTGTTGGGGCATTCCGGCACTGAGATCGACGCGCACAAGGCATTCCAGGATCTCGGGTTTGATT CGCTGACAGCGGTGGAGCTGCGCAACCGGCTCAAGACTGCGACCGGGTTGACCTTGCCACCGACCTTGATCTTTGACTAC CCCACGGCCGCCGAGTTGGCCGAACACCTCGACATCCAGCTGGCGAACGCCCCTGCCGTCACGGTCGACCAACCCAACCC GTCGACTCGTTTCAACGAGGTCACCCGCGAACTACAAGCATTGCTCGACCAACCCAACTGGAACCCCGACGACAAAACGC GCCTGATCAAGCGATTGCAAGCGATTTTGACCGATTGCACCGCTCCACCGGCCAGCTCCGGCCCGTCTACCACCCATGAC GACGAGGACATCACCACCGCCACTGAAAGCCAGCTTTTTGCCATCCTCGACGACGAACTTGGACCTTAG
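A GC content figure like the one listed for this gene can be recomputed from a nucleotide sequence. The sketch below assumes the value is the percentage of G and C bases over all bases, with the spacing in the sequence line ignored. The function name is hypothetical and the example input is a toy sequence, not the gene above.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Toy input, not the gene above: 6 of 8 bases are G or C.
print(gc_content("ATGG CGCC"))  # 75.0
```

If the listed 62.96 refers to this 1509-base gene, it would correspond to 950 G or C bases.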
Upstream 100 bases:
>100_bases CGCGAGGCGGTCGCTTCATCGAGATGGGCAAAACCGAGTTCGGGACGCCCAGGTCGTTGCCCAGGACCATCCTGGGGTGG CCTACCGGGCTTTCGACTTG
Downstream 100 bases:
>100_bases CGCACGTGCAACCGACAGGCATCGCAATCATCGGGCTGGCATGCAGGTTTCCCACCGTCGTCAGCCCCGGCGACCTCTGG GACCTGTTGCGCGACGGGCG
Product: putative polyketide synthase pks17
Products: NA
Alternate protein names: 6-deoxyerythronolide B synthase II; DEBS 2; ORF 2 [H]
Number of amino acids: Translated: 502; Mature: 502
Protein sequence:
>502_residues MEAGPQRIAQMLAELVELFKTEALHRLPVKSWDVRHAREAYRFLSQARHVGKVVLTMPDAWAAGTVLITGGTGMAGSAVA RHLVSRYGVRQVVLASRAGEHTESVAALVDELGSAGARVQVVSCDVADRDAVAGLVASQPDLTAVFHAAGVLDDAVITGL TPERVDKVLRAKVDGAWNLHELTRHLDVSAFVLFSSMAGIVGAPGQANYAAANAFLDGLAAYRRSRGLAALSVAWGLWEQ ASAMTEHLGERDRVRMSRVGLAPLPTNQAMGFLDAALLADRPVVVAARLDRAALAGAELPALFSQLVAGPIRRIIDGADE VSGSGLASRLHGLTPEQRHRELTELVCSNAAIVLGHSGTEIDAHKAFQDLGFDSLTAVELRNRLKTATGLTLPPTLIFDY PTAAELAEHLDIQLANAPAVTVDQPNPSTRFNEVTRELQALLDQPNWNPDDKTRLIKRLQAILTDCTAPPASSGPSTTHD DEDITTATESQLFAILDDELGP
Sequences:
>Translated_502_residues MEAGPQRIAQMLAELVELFKTEALHRLPVKSWDVRHAREAYRFLSQARHVGKVVLTMPDAWAAGTVLITGGTGMAGSAVA RHLVSRYGVRQVVLASRAGEHTESVAALVDELGSAGARVQVVSCDVADRDAVAGLVASQPDLTAVFHAAGVLDDAVITGL TPERVDKVLRAKVDGAWNLHELTRHLDVSAFVLFSSMAGIVGAPGQANYAAANAFLDGLAAYRRSRGLAALSVAWGLWEQ ASAMTEHLGERDRVRMSRVGLAPLPTNQAMGFLDAALLADRPVVVAARLDRAALAGAELPALFSQLVAGPIRRIIDGADE VSGSGLASRLHGLTPEQRHRELTELVCSNAAIVLGHSGTEIDAHKAFQDLGFDSLTAVELRNRLKTATGLTLPPTLIFDY PTAAELAEHLDIQLANAPAVTVDQPNPSTRFNEVTRELQALLDQPNWNPDDKTRLIKRLQAILTDCTAPPASSGPSTTHD DEDITTATESQLFAILDDELGP
>Mature_502_residues MEAGPQRIAQMLAELVELFKTEALHRLPVKSWDVRHAREAYRFLSQARHVGKVVLTMPDAWAAGTVLITGGTGMAGSAVA RHLVSRYGVRQVVLASRAGEHTESVAALVDELGSAGARVQVVSCDVADRDAVAGLVASQPDLTAVFHAAGVLDDAVITGL TPERVDKVLRAKVDGAWNLHELTRHLDVSAFVLFSSMAGIVGAPGQANYAAANAFLDGLAAYRRSRGLAALSVAWGLWEQ ASAMTEHLGERDRVRMSRVGLAPLPTNQAMGFLDAALLADRPVVVAARLDRAALAGAELPALFSQLVAGPIRRIIDGADE VSGSGLASRLHGLTPEQRHRELTELVCSNAAIVLGHSGTEIDAHKAFQDLGFDSLTAVELRNRLKTATGLTLPPTLIFDY PTAAELAEHLDIQLANAPAVTVDQPNPSTRFNEVTRELQALLDQPNWNPDDKTRLIKRLQAILTDCTAPPASSGPSTTHD DEDITTATESQLFAILDDELGP
Specific function: Unknown
COG id: COG3321
COG function: function code Q; Polyketide synthase modules and related proteins
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: Contains 2 acyl carrier domains [H]
Homologues:
Organism=Homo sapiens, GI41872631, Length=385, Percent_Identity=27.012987012987, Blast_Score=100, Evalue=6e-21
Organism=Caenorhabditis elegans, GI212642053, Length=408, Percent_Identity=25.2450980392157, Blast_Score=90, Evalue=2e-18
Organism=Caenorhabditis elegans, GI17550940, Length=324, Percent_Identity=23.4567901234568, Blast_Score=90, Evalue=3e-18
Organism=Drosophila melanogaster, GI24581345, Length=278, Percent_Identity=26.978417266187, Blast_Score=89, Evalue=6e-18
Organism=Drosophila melanogaster, GI221330659, Length=350, Percent_Identity=26, Blast_Score=89, Evalue=6e-18
Organism=Drosophila melanogaster, GI19920632, Length=350, Percent_Identity=26, Blast_Score=89, Evalue=7e-18
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001227
- InterPro: IPR009081
- InterPro: IPR014043
- InterPro: IPR016035
- InterPro: IPR013149
- InterPro: IPR013154
- InterPro: IPR000794
- InterPro: IPR002198
- InterPro: IPR015357
- InterPro: IPR011032
- InterPro: IPR018201
- InterPro: IPR014031
- InterPro: IPR014030
- InterPro: IPR016036
- InterPro: IPR016040
- InterPro: IPR006163
- InterPro: IPR020842
- InterPro: IPR020801
- InterPro: IPR020841
- InterPro: IPR020807
- InterPro: IPR020843
- InterPro: IPR020806
- InterPro: IPR015083
- InterPro: IPR006162
- InterPro: IPR016039
- InterPro: IPR016038 [H]
Pfam domain/function: PF00698 Acyl_transf_1; PF08240 ADH_N; PF00106 adh_short; PF00107 ADH_zinc_N; PF08990 Docking; PF09277 Erythro-docking; PF00109 ketoacyl-synt; PF02801 Ketoacyl-synt_C; PF00550 PP-binding [H]
EC number: 2.3.1.94 [H]
Molecular weight: Translated: 53525; Mature: 53525
Theoretical pI: Translated: 5.43; Mature: 5.43
Prosite motif: PS00012 PHOSPHOPANTETHEINE ; PS50075 ACP_DOMAIN
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.6 %Cys (Translated Protein)
1.6 %Met (Translated Protein)
2.2 %Cys+Met (Translated Protein)
0.6 %Cys (Mature Protein)
1.6 %Met (Mature Protein)
2.2 %Cys+Met (Mature Protein)
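The Cys/Met percentages are residue counts divided by protein length. The sketch below assumes the values are rounded to one decimal place, as the listed figures suggest. The function name is hypothetical and the example input is a toy peptide, not the protein above.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any letter in `residues`."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    hits = sum(1 for aa in seq if aa in residues)
    return round(100.0 * hits / len(seq), 1)


# Toy peptide, not the protein above: 1 Cys and 2 Met in 10 residues.
toy = "MACDEFGHMK"
print(residue_percent(toy, "C"))   # 10.0
print(residue_percent(toy, "M"))   # 20.0
print(residue_percent(toy, "CM"))  # 30.0
```

For a 502-residue protein, counts of 3 Cys and 8 Met would reproduce the listed 0.6 %, 1.6 % and 2.2 %.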
Secondary structure:
>Translated Secondary Structure
MEAGPQRIAQMLAELVELFKTEALHRLPVKSWDVRHAREAYRFLSQARHVGKVVLTMPDA
CCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCEEEEECCCC
WAAGTVLITGGTGMAGSAVARHLVSRYGVRQVVLASRAGEHTESVAALVDELGSAGARVQ
CCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCEEE
VVSCDVADRDAVAGLVASQPDLTAVFHAAGVLDDAVITGLTPERVDKVLRAKVDGAWNLH
EEEECCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCHH
ELTRHLDVSAFVLFSSMAGIVGAPGQANYAAANAFLDGLAAYRRSRGLAALSVAWGLWEQ
HHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH
ASAMTEHLGERDRVRMSRVGLAPLPTNQAMGFLDAALLADRPVVVAARLDRAALAGAELP
HHHHHHHCCCHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCEEEEEHHHHHHHCCCHHH
ALFSQLVAGPIRRIIDGADEVSGSGLASRLHGLTPEQRHRELTELVCSNAAIVLGHSGTE
HHHHHHHHHHHHHHHCCCHHCCCCHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCC
IDAHKAFQDLGFDSLTAVELRNRLKTATGLTLPPTLIFDYPTAAELAEHLDIQLANAPAV
CHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCCCEEECCCCHHHHHHHCCEEEECCCEE
TVDQPNPSTRFNEVTRELQALLDQPNWNPDDKTRLIKRLQAILTDCTAPPASSGPSTTHD
EECCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCC
DEDITTATESQLFAILDDELGP
CCCCCHHHHHHHHEECCCCCCC
>Mature Secondary Structure
MEAGPQRIAQMLAELVELFKTEALHRLPVKSWDVRHAREAYRFLSQARHVGKVVLTMPDA
CCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCEEEEECCCC
WAAGTVLITGGTGMAGSAVARHLVSRYGVRQVVLASRAGEHTESVAALVDELGSAGARVQ
CCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCEEE
VVSCDVADRDAVAGLVASQPDLTAVFHAAGVLDDAVITGLTPERVDKVLRAKVDGAWNLH
EEEECCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCHH
ELTRHLDVSAFVLFSSMAGIVGAPGQANYAAANAFLDGLAAYRRSRGLAALSVAWGLWEQ
HHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH
ASAMTEHLGERDRVRMSRVGLAPLPTNQAMGFLDAALLADRPVVVAARLDRAALAGAELP
HHHHHHHCCCHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCEEEEEHHHHHHHCCCHHH
ALFSQLVAGPIRRIIDGADEVSGSGLASRLHGLTPEQRHRELTELVCSNAAIVLGHSGTE
HHHHHHHHHHHHHHHCCCHHCCCCHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCC
IDAHKAFQDLGFDSLTAVELRNRLKTATGLTLPPTLIFDYPTAAELAEHLDIQLANAPAV
CHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCCCEEECCCCHHHHHHHCCEEEECCCEE
TVDQPNPSTRFNEVTRELQALLDQPNWNPDDKTRLIKRLQAILTDCTAPPASSGPSTTHD
EECCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCC
DEDITTATESQLFAILDDELGP
CCCCCHHHHHHHHEECCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 2024119; 1740151 [H]