Definition | Yersinia pestis CO92 chromosome, complete genome. |
---|---|
Accession | NC_003143 |
Length | 4,653,728 |
The map label for this gene is galK
Identifier: 218928302
GI number: 218928302
Start: 1283086
End: 1284237
Strand: Reverse
Name: galK
Synonym: YPO1137
Alternate gene names: 218928302
Gene position: 1284237-1283086 (Counterclockwise)
Preceding gene: 218928303
Following gene: 218928301
Centisome position: 27.6
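The coordinate fields above are related by simple arithmetic, which the following minimal sketch illustrates. The function names are hypothetical, and the record does not state which coordinate the database used for the centisome value, so taking the Start coordinate here is an assumption.

```python
def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the total genome length."""
    return 100.0 * position / genome_length


# Values copied from the record above.
START, END = 1283086, 1284237
GENOME_LENGTH = 4653728

print(gene_length(START, END))                     # 1152, matching the 1152-base gene sequence
print(round(centisome(START, GENOME_LENGTH), 1))   # 27.6
```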
GC content: 51.22
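GC content is conventionally the percentage of G and C bases in the sequence. A minimal sketch of that calculation follows; `gc_content` is a hypothetical helper, and how the database handled ambiguous bases is not stated, so skipping non-ACGT characters is an assumption.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the unambiguous bases of a DNA sequence."""
    bases = [b for b in seq.upper() if b in "ACGT"]  # assumption: ignore N, gaps, whitespace
    if not bases:
        raise ValueError("sequence contains no A, C, G or T")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First six bases of the gene sequence in this record, used only as a toy input.
print(round(gc_content("ATGAGT"), 2))
```

Applying the same function to the full 1152-base gene sequence would be the way to check the reported figure.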
Gene sequence:
>1152_bases ATGAGTTTAAAACAACATACCCAGACTATTTTCCGCCAACAGTTTGACCGCGAGTCTGACATCACCATTAAAGCGCCGGG CCGCGTCAATCTGATTGGCGAACATACCGACTATAACGATGGCTTTGTTCTGCCCTGCGCCATTAATTATGAAACGGTGA TCAGTTGTGGCAAACGCGACGATCGCCAGATTCGTGTTATTGCCGCCGACTATGAAAACCAGCAGGATATATTCTCTCTT GATGCACCGATTGTCCCGCATCCTGAATATCGCTGGGCTGACTACGTGCGTGGTGTGGTGAAACATCTACAAATGCGCAA CGCTGATTTTGGTGGGGCCGATCTGGTTATCTGTGGCAATGTCCCGCAGGGTGCTGGCCTCAGTTCCTCTGCATCGTTGG AAGTGGCCGTGGGCCAAGCCCTGCAATCACTCTATCAACTCCCTCTTAGCGGTGTAGAACTGGCGCTGAATGGGCAAGAG GCAGAAAACCAATTTGTCGGCTGTAACTGCGGCATTATGGATCAGTTAATCTCAGCATTGGGTAAAAAAGACCATGCGTT GCTGATTGATTGTCGGACCTTGGAAACCCGTGCCGTGCCAATGCCGGAAAACATGGCCGTCGTTATTATCAACTCAAACA TTCAACGTGGCCTGGTTGACAGCGAATACAATACTCGCCGCCAACAGTGTGAAGCTGCCGCCCGTTTCTTTGGCGTCAAA GCATTGCGTGATGTCGAACCGAGCCTCTTCTTCTCAATACAAGACGAGCTAGATCCGGTCGTCGCTAAACGCGCCCGCCA TGTGATCAGCGAGAATGCACGCACGCTGGCAGCCGCAGATGCCTTGGCCGCCGGGAACTTGAAATTGATGGGGCAATTGA TGCAAGAGTCTCATATTTCTATGCGTGATGACTTTGAGATCACGGTTCCACCAATAGATAGACTCGTCGAGATTGTGAAA TCAGTGATTGGTGATCAAGGTGGGGTGCGCATGACGGGTGGCGGTTTTGGCGGTTGTATTATCGCGTTAATGCCGCTTGA ATTAGTCGAGCAGGTTCGCACCACCGTTGCGCAAGAATACCCGGCACACAGCGGCGGCAAGAAAGAGACTTTTTATGTCT GTCAGGCTTCACAAGGAGCGGGTTTATGCTGA
Upstream 100 bases:
>100_bases AACTGCTGGCCGAAACCCAGCGAGACCTTACAGCAGAACAGGCGGCAGCACTCCTGCGGGCAGTAAGTGATGTTCACTAT AAAGAGGCCGGAGCCAAATC
Downstream 100 bases:
>100_bases AAAACGGCGCGGCATCACCAAACAGTGTTGACCCGTTAGCACCGGATGGTCACCCTTTTGAATTCACCAAATTGCAGAAT AAAAGCGGTATGACCGTCAC
Product: galactokinase
Products: NA
Alternate protein names: Galactose kinase
Number of amino acids: Translated: 383; Mature: 382
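The two residue counts follow from the gene length: 1152 bases give 384 codons, the last of which is the stop codon (TGA in this record), and the mature form listed in this record lacks the initial Met. A short sketch of that arithmetic, with hypothetical function names:

```python
def translated_length(n_bases: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return n_bases // 3 - 1


def mature_length(n_translated: int) -> int:
    """Residues left after removing the initiator Met (assumed to be the only processing)."""
    return n_translated - 1


n_translated = translated_length(1152)
print(n_translated)                  # 383
print(mature_length(n_translated))   # 382
```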
Protein sequence:
>383_residues MSLKQHTQTIFRQQFDRESDITIKAPGRVNLIGEHTDYNDGFVLPCAINYETVISCGKRDDRQIRVIAADYENQQDIFSL DAPIVPHPEYRWADYVRGVVKHLQMRNADFGGADLVICGNVPQGAGLSSSASLEVAVGQALQSLYQLPLSGVELALNGQE AENQFVGCNCGIMDQLISALGKKDHALLIDCRTLETRAVPMPENMAVVIINSNIQRGLVDSEYNTRRQQCEAAARFFGVK ALRDVEPSLFFSIQDELDPVVAKRARHVISENARTLAAADALAAGNLKLMGQLMQESHISMRDDFEITVPPIDRLVEIVK SVIGDQGGVRMTGGGFGGCIIALMPLELVEQVRTTVAQEYPAHSGGKKETFYVCQASQGAGLC
Sequences:
>Translated_383_residues MSLKQHTQTIFRQQFDRESDITIKAPGRVNLIGEHTDYNDGFVLPCAINYETVISCGKRDDRQIRVIAADYENQQDIFSL DAPIVPHPEYRWADYVRGVVKHLQMRNADFGGADLVICGNVPQGAGLSSSASLEVAVGQALQSLYQLPLSGVELALNGQE AENQFVGCNCGIMDQLISALGKKDHALLIDCRTLETRAVPMPENMAVVIINSNIQRGLVDSEYNTRRQQCEAAARFFGVK ALRDVEPSLFFSIQDELDPVVAKRARHVISENARTLAAADALAAGNLKLMGQLMQESHISMRDDFEITVPPIDRLVEIVK SVIGDQGGVRMTGGGFGGCIIALMPLELVEQVRTTVAQEYPAHSGGKKETFYVCQASQGAGLC >Mature_382_residues SLKQHTQTIFRQQFDRESDITIKAPGRVNLIGEHTDYNDGFVLPCAINYETVISCGKRDDRQIRVIAADYENQQDIFSLD APIVPHPEYRWADYVRGVVKHLQMRNADFGGADLVICGNVPQGAGLSSSASLEVAVGQALQSLYQLPLSGVELALNGQEA ENQFVGCNCGIMDQLISALGKKDHALLIDCRTLETRAVPMPENMAVVIINSNIQRGLVDSEYNTRRQQCEAAARFFGVKA LRDVEPSLFFSIQDELDPVVAKRARHVISENARTLAAADALAAGNLKLMGQLMQESHISMRDDFEITVPPIDRLVEIVKS VIGDQGGVRMTGGGFGGCIIALMPLELVEQVRTTVAQEYPAHSGGKKETFYVCQASQGAGLC
Specific function: Galactose metabolism; first step. [C]
COG id: COG0153
COG function: function code G; Galactokinase
Gene ontology:
Cell location: Cytoplasm (Potential)
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the GHMP kinase family. GalK subfamily
Homologues:
Organism=Homo sapiens, GI4503895, Length=382, Percent_Identity=43.717277486911, Blast_Score=281, Evalue=8e-76
Organism=Homo sapiens, GI48527957, Length=440, Percent_Identity=27.0454545454545, Blast_Score=132, Evalue=4e-31
Organism=Homo sapiens, GI4503897, Length=440, Percent_Identity=27.0454545454545, Blast_Score=132, Evalue=4e-31
Organism=Escherichia coli, GI1786972, Length=383, Percent_Identity=72.5848563968668, Blast_Score=575, Evalue=1e-165
Organism=Caenorhabditis elegans, GI71989053, Length=404, Percent_Identity=28.960396039604, Blast_Score=108, Evalue=3e-24
Organism=Saccharomyces cerevisiae, GI6319494, Length=252, Percent_Identity=30.1587301587302, Blast_Score=86, Evalue=1e-17
Organism=Saccharomyces cerevisiae, GI6320212, Length=328, Percent_Identity=26.219512195122, Blast_Score=79, Evalue=9e-16
Organism=Drosophila melanogaster, GI24661292, Length=412, Percent_Identity=27.6699029126214, Blast_Score=108, Evalue=6e-24
Organism=Drosophila melanogaster, GI21355577, Length=412, Percent_Identity=27.6699029126214, Blast_Score=108, Evalue=6e-24
Organism=Drosophila melanogaster, GI24661285, Length=412, Percent_Identity=27.6699029126214, Blast_Score=108, Evalue=6e-24
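The homologue entries share a fixed `key=value` layout, so they can be parsed with a regular expression. The sketch below is one possible approach, not the database's own tooling; the `Hit` type and `parse_hits` function are hypothetical names, and the sample text is copied from two entries in this record.

```python
import re
from typing import List, NamedTuple


class Hit(NamedTuple):
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: float
    evalue: float


# One entry: Organism=..., GI..., Length=..., Percent_Identity=..., Blast_Score=..., Evalue=...
HIT_PATTERN = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<pid>[\d.]+),\s*Blast_Score=(?P<score>[\d.]+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_hits(text: str) -> List[Hit]:
    """Extract every homologue entry found in the text."""
    return [
        Hit(m["organism"], m["gi"], int(m["length"]),
            float(m["pid"]), float(m["score"]), float(m["evalue"]))
        for m in HIT_PATTERN.finditer(text)
    ]


sample = (
    "Organism=Homo sapiens, GI4503895, Length=382, Percent_Identity=43.717277486911, "
    "Blast_Score=281, Evalue=8e-76, "
    "Organism=Escherichia coli, GI1786972, Length=383, Percent_Identity=72.5848563968668, "
    "Blast_Score=575, Evalue=1e-165,"
)
hits = parse_hits(sample)
best = max(hits, key=lambda h: h.percent_identity)
print(best.organism, round(best.percent_identity, 1))  # Escherichia coli 72.6
```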
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): GAL1_YERP3 (A7FKP2)
Other databases:
- EMBL: CP000720
- RefSeq: YP_001401820.1
- ProteinModelPortal: A7FKP2
- STRING: A7FKP2
- GeneID: 5385326
- GenomeReviews: CP000720_GR
- KEGG: ypi:YpsIP31758_2857
- eggNOG: COG0153
- HOGENOM: HBG725121
- OMA: CREMIAR
- ProtClustDB: PRK05101
- BioCyc: YPSE349747:YPSIP31758_2857-MONOMER
- GO: GO:0005737
- HAMAP: MF_00246
- InterPro: IPR000705
- InterPro: IPR022963
- InterPro: IPR019741
- InterPro: IPR019539
- InterPro: IPR006204
- InterPro: IPR013750
- InterPro: IPR006203
- InterPro: IPR006206
- InterPro: IPR020568
- InterPro: IPR014721
- Gene3D: G3DSA:3.30.230.10
- PIRSF: PIRSF000530
- PRINTS: PR00473
- PRINTS: PR00959
- TIGRFAMs: TIGR00131
Pfam domain/function: PF10509 GalKase_gal_bdg; PF08544 GHMP_kinases_C; PF00288 GHMP_kinases_N; SSF54211 Ribosomal_S5_D2-typ_fold
EC number: 2.7.1.6
Molecular weight: Translated: 41866; Mature: 41735
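The two weights differ by 131 (41866 vs. 41735), which is about the mass of one Met residue and fits the mature form lacking the initial Met. A minimal sketch of an average-mass calculation follows. The residue mass table holds standard average values and `molecular_weight` is a hypothetical helper; the exact method the database used is not stated.

```python
# Average masses of amino acid residues (free amino acid minus one water), in Da.
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water is added back for the free N- and C-termini


def molecular_weight(protein: str) -> float:
    """Average molecular weight of an unmodified linear peptide."""
    return sum(AVERAGE_RESIDUE_MASS[aa] for aa in protein.upper()) + WATER


# Removing an N-terminal Met lowers the weight by one Met residue mass.
toy = "MSLKQ"  # first five residues of the protein in this record
print(round(molecular_weight(toy) - molecular_weight(toy[1:]), 2))
```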
Theoretical pI: Translated: 5.01; Mature: 5.01
Prosite motif: PS00106 GALACTOKINASE; PS00627 GHMP_KINASES_ATP
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
2.6 %Cys (Translated Protein)
2.6 %Met (Translated Protein)
5.2 %Cys+Met (Translated Protein)
2.6 %Cys (Mature Protein)
2.4 %Met (Mature Protein)
5.0 %Cys+Met (Mature Protein)
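Percentages like these can be obtained by counting residues and dividing by the protein length. The sketch below assumes that is how the figures were derived; `residue_percent` is a hypothetical helper, and the input is a toy peptide rather than the full sequence.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in the protein occupied by any of the given residues."""
    protein = protein.upper()
    if not protein:
        raise ValueError("empty protein sequence")
    count = sum(protein.count(r) for r in residues.upper())
    return 100.0 * count / len(protein)


toy = "MSLKC"  # toy peptide with one Met and one Cys in five residues
print(residue_percent(toy, "C"))    # 20.0
print(residue_percent(toy, "M"))    # 20.0
print(residue_percent(toy, "CM"))   # 40.0
```

Running the function on the translated and mature sequences separately would show how dropping the initial Met changes the Met percentage.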
Secondary structure:
>Translated Secondary Structure
MSLKQHTQTIFRQQFDRESDITIKAPGRVNLIGEHTDYNDGFVLPCAINYETVISCGKRD
CCHHHHHHHHHHHHCCCCCCEEEECCCEEEEEECCCCCCCCEEEEEEECHHHHHHCCCCC
DRQIRVIAADYENQQDIFSLDAPIVPHPEYRWADYVRGVVKHLQMRNADFGGADLVICGN
CCEEEEEEECCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCEEEECC
VPQGAGLSSSASLEVAVGQALQSLYQLPLSGVELALNGQEAENQFVGCNCGIMDQLISAL
CCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCEECCCCCHHHHHHHHH
GKKDHALLIDCRTLETRAVPMPENMAVVIINSNIQRGLVDSEYNTRRQQCEAAARFFGVK
CCCCCEEEEEEEECCCCCCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHH
ALRDVEPSLFFSIQDELDPVVAKRARHVISENARTLAAADALAAGNLKLMGQLMQESHIS
HHHHCCCHHEEEEHHHCCHHHHHHHHHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHCCC
MRDDFEITVPPIDRLVEIVKSVIGDQGGVRMTGGGFGGCIIALMPLELVEQVRTTVAQEY
CCCCCEEECCCHHHHHHHHHHHHCCCCCEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHC
PAHSGGKKETFYVCQASQGAGLC
CCCCCCCCCEEEEEECCCCCCCC
>Mature Secondary Structure
SLKQHTQTIFRQQFDRESDITIKAPGRVNLIGEHTDYNDGFVLPCAINYETVISCGKRD
CHHHHHHHHHHHHCCCCCCEEEECCCEEEEEECCCCCCCCEEEEEEECHHHHHHCCCCC
DRQIRVIAADYENQQDIFSLDAPIVPHPEYRWADYVRGVVKHLQMRNADFGGADLVICGN
CCEEEEEEECCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCEEEECC
VPQGAGLSSSASLEVAVGQALQSLYQLPLSGVELALNGQEAENQFVGCNCGIMDQLISAL
CCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCEECCCCCHHHHHHHHH
GKKDHALLIDCRTLETRAVPMPENMAVVIINSNIQRGLVDSEYNTRRQQCEAAARFFGVK
CCCCCEEEEEEEECCCCCCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHH
ALRDVEPSLFFSIQDELDPVVAKRARHVISENARTLAAADALAAGNLKLMGQLMQESHIS
HHHHCCCHHEEEEHHHCCHHHHHHHHHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHCCC
MRDDFEITVPPIDRLVEIVKSVIGDQGGVRMTGGGFGGCIIALMPLELVEQVRTTVAQEY
CCCCCEEECCCHHHHHHHHHHHHCCCCCEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHC
PAHSGGKKETFYVCQASQGAGLC
CCCCCCCCCEEEEEECCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA