Definition | Clostridium botulinum A2 str. Kyoto chromosome, complete genome. |
---|---|
Accession | NC_012563 |
Length | 4,155,278 |
Click here to switch to the map view.
The map label for this gene is gap [H]
Identifier: 226948370
GI number: 226948370
Start: 1306492
End: 1307508
Strand: Direct
Name: gap [H]
Synonym: CLM_1253
Alternate gene names: 226948370
Gene position: 1306492-1307508 (Clockwise)
Preceding gene: 226948368
Following gene: 226948372
Centisome position: 31.44
GC content: 31.96
Gene sequence:
>1017_bases ATGGGTATAAAAGTTGGACTAAATGGTTTTGGAAGAATAGGAAGAGCAGTATTAAGAATAGCACAGGACAAGTTCCCAGA AGATGTAGAAATAGTAGCATTAAATGCGAGAGCCACTACTGAAACATTAGTACATCTATTTAAATACGACTCCTGCTACG GAAGATTTAATGGACAAGTGGAAATAATTTCTAATGATAAAATGAGAGTAAATGGAAAAGAAATAAAAATATTTAGAGAA AATGATCCTCAAAATTTACCGTGGAAAGAACTTGGAGTAGATATAGTTATAGAATCTACAGGAATATTTAAAGATAGAGA AAAAGCTATGAAGCATATAGAAGCTGGAGCTAAAAAAGTTATAATAACAGCACCAGGAAAAAATGAAGATATAACTATAG TTTTAGGAGTAAATGAAAATGAATATAATCCAGAAGAGCATAATATAATTTCAAATGCATCTTGTACTACAAATTGTTTA GCTCCTTTTGCTAAAGTACTTGACGATAATTTTGGTATAGTTAGAGGTTTAATGACAACAGTACATTCTTATACAAATGA CCAAAGAATATTAGATAAAACTCATAAAGATTTAAGAAGGGCAAGAGCTGCAGGAGAATCAATAATACCAACTACTACAG GAGCTGCAAAGGCAGTAGCTAAAGTATTACCTAGTCTTAAAGGTAAATTAAATGGATTTGCTTTAAGAGTTCCGACCCCA ACAGTATCTTTAACAGATTTAGTTTGTGAATTAAAAACTAAAGTAACTGCTGAAGAAATTAATGCAGCGTTTAAAAAAGC AGCAGAGGGCGAAATGAATGGTGTTCTAGGATATTCAGAAGAACCACTAGTTTCAGTAGATTACAAGGGAGATCCAAGAT CCTCTATAGTGGATGGATTATCTACTATGGTTTTAGAAGACGACATGGTTAAAGTTGTGTCTTGGTATGATAATGAATGG GGATACTCATGTAGAACTGTAGATTTAGTTAATTATGTAGCTAAATTTATGAAATAA
Upstream 100 bases:
>100_bases ATATAATAAGGAATCATACAACACATATATTAATTTAATGAATTATAAATAAAGTATTTTAAAGATTATTTAAATAATAT ATTAATAGGGGGAAAAAACA
Downstream 100 bases:
>100_bases AAATAATTAATTAGCTTATATAAATTAATTATATATGAATTAATTTTATATATAAGTACCTCTTTTATAAAATAATTATT TATATTTATAATAACTTTAG
Product: glyceraldehyde-3-phosphate dehydrogenase
Products: NA
Alternate protein names: GAPDH [H]
Number of amino acids: Translated: 338; Mature: 337
Protein sequence:
>338_residues MGIKVGLNGFGRIGRAVLRIAQDKFPEDVEIVALNARATTETLVHLFKYDSCYGRFNGQVEIISNDKMRVNGKEIKIFRE NDPQNLPWKELGVDIVIESTGIFKDREKAMKHIEAGAKKVIITAPGKNEDITIVLGVNENEYNPEEHNIISNASCTTNCL APFAKVLDDNFGIVRGLMTTVHSYTNDQRILDKTHKDLRRARAAGESIIPTTTGAAKAVAKVLPSLKGKLNGFALRVPTP TVSLTDLVCELKTKVTAEEINAAFKKAAEGEMNGVLGYSEEPLVSVDYKGDPRSSIVDGLSTMVLEDDMVKVVSWYDNEW GYSCRTVDLVNYVAKFMK
Sequences:
>Translated_338_residues MGIKVGLNGFGRIGRAVLRIAQDKFPEDVEIVALNARATTETLVHLFKYDSCYGRFNGQVEIISNDKMRVNGKEIKIFRE NDPQNLPWKELGVDIVIESTGIFKDREKAMKHIEAGAKKVIITAPGKNEDITIVLGVNENEYNPEEHNIISNASCTTNCL APFAKVLDDNFGIVRGLMTTVHSYTNDQRILDKTHKDLRRARAAGESIIPTTTGAAKAVAKVLPSLKGKLNGFALRVPTP TVSLTDLVCELKTKVTAEEINAAFKKAAEGEMNGVLGYSEEPLVSVDYKGDPRSSIVDGLSTMVLEDDMVKVVSWYDNEW GYSCRTVDLVNYVAKFMK >Mature_337_residues GIKVGLNGFGRIGRAVLRIAQDKFPEDVEIVALNARATTETLVHLFKYDSCYGRFNGQVEIISNDKMRVNGKEIKIFREN DPQNLPWKELGVDIVIESTGIFKDREKAMKHIEAGAKKVIITAPGKNEDITIVLGVNENEYNPEEHNIISNASCTTNCLA PFAKVLDDNFGIVRGLMTTVHSYTNDQRILDKTHKDLRRARAAGESIIPTTTGAAKAVAKVLPSLKGKLNGFALRVPTPT VSLTDLVCELKTKVTAEEINAAFKKAAEGEMNGVLGYSEEPLVSVDYKGDPRSSIVDGLSTMVLEDDMVKVVSWYDNEWG YSCRTVDLVNYVAKFMK
Specific function: Second phase of glycolysis; first step. [C]
COG id: COG0057
COG function: function code G; Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase
Gene ontology:
Cell location: Cytoplasm [H]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the glyceraldehyde-3-phosphate dehydrogenase family [H]
Homologues:
Organism=Homo sapiens, GI7657116, Length=333, Percent_Identity=50.4504504504504, Blast_Score=338, Evalue=5e-93, Organism=Homo sapiens, GI7669492, Length=334, Percent_Identity=49.1017964071856, Blast_Score=315, Evalue=3e-86, Organism=Escherichia coli, GI1788079, Length=336, Percent_Identity=53.2738095238095, Blast_Score=358, Evalue=1e-100, Organism=Escherichia coli, GI1789295, Length=330, Percent_Identity=44.2424242424242, Blast_Score=303, Evalue=8e-84, Organism=Caenorhabditis elegans, GI17534677, Length=336, Percent_Identity=49.1071428571429, Blast_Score=326, Evalue=1e-89, Organism=Caenorhabditis elegans, GI17534679, Length=336, Percent_Identity=49.4047619047619, Blast_Score=325, Evalue=1e-89, Organism=Caenorhabditis elegans, GI32566163, Length=336, Percent_Identity=49.1071428571429, Blast_Score=317, Evalue=5e-87, Organism=Caenorhabditis elegans, GI17568413, Length=336, Percent_Identity=49.1071428571429, Blast_Score=317, Evalue=5e-87, Organism=Saccharomyces cerevisiae, GI6322468, Length=336, Percent_Identity=53.5714285714286, Blast_Score=345, Evalue=7e-96, Organism=Saccharomyces cerevisiae, GI6321631, Length=336, Percent_Identity=53.8690476190476, Blast_Score=343, Evalue=2e-95, Organism=Saccharomyces cerevisiae, GI6322409, Length=336, Percent_Identity=51.4880952380952, Blast_Score=341, Evalue=7e-95, Organism=Drosophila melanogaster, GI85725000, Length=332, Percent_Identity=49.0963855421687, Blast_Score=315, Evalue=2e-86, Organism=Drosophila melanogaster, GI22023983, Length=332, Percent_Identity=49.0963855421687, Blast_Score=315, Evalue=2e-86, Organism=Drosophila melanogaster, GI17933600, Length=332, Percent_Identity=48.7951807228916, Blast_Score=315, Evalue=2e-86, Organism=Drosophila melanogaster, GI18110149, Length=332, Percent_Identity=48.7951807228916, Blast_Score=315, Evalue=2e-86, Organism=Drosophila melanogaster, GI19922412, Length=330, Percent_Identity=46.969696969697, Blast_Score=299, Evalue=2e-81,
Paralogues:
None
Copy number: 220 Molecules/Cell In: Growth Phase, Minimal Media (Based on E. coli). 1840 Molecules/Cell In: Growth Phase, Minimal Media (Based on E. coli). 740 Molecules/Cell In: Growth Phase, Minimal Media (Based on E. coli). 1660 Molecules/Cell In: Growth Phase, Min
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR020831 - InterPro: IPR020830 - InterPro: IPR020829 - InterPro: IPR020828 - InterPro: IPR006424 - InterPro: IPR016040 [H]
Pfam domain/function: PF02800 Gp_dh_C; PF00044 Gp_dh_N [H]
EC number: =1.2.1.12 [H]
Molecular weight: Translated: 37331; Mature: 37200
Theoretical pI: Translated: 6.68; Mature: 6.68
Prosite motif: PS00071 GAPDH
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.5 %Cys (Translated Protein) 2.4 %Met (Translated Protein) 3.8 %Cys+Met (Translated Protein) 1.5 %Cys (Mature Protein) 2.1 %Met (Mature Protein) 3.6 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MGIKVGLNGFGRIGRAVLRIAQDKFPEDVEIVALNARATTETLVHLFKYDSCYGRFNGQV CCEEECCCCHHHHHHHHHHHHHHCCCCCEEEEEEECCHHHHHHHHHHHHHCHHCCCCCEE EIISNDKMRVNGKEIKIFRENDPQNLPWKELGVDIVIESTGIFKDREKAMKHIEAGAKKV EEEECCEEEECCCEEEEEECCCCCCCCHHHCCEEEEEECCCCCHHHHHHHHHHHCCCCEE IITAPGKNEDITIVLGVNENEYNPEEHNIISNASCTTNCLAPFAKVLDDNFGIVRGLMTT EEECCCCCCCEEEEEECCCCCCCCHHHCEECCCCCCHHHHHHHHHHHCCCHHHHHHHHHH VHSYTNDQRILDKTHKDLRRARAAGESIIPTTTGAAKAVAKVLPSLKGKLNGFALRVPTP HHHHCCCHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCCCCCEEEECCCC TVSLTDLVCELKTKVTAEEINAAFKKAAEGEMNGVLGYSEEPLVSVDYKGDPRSSIVDGL CCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEECCCCCHHHHHHHH STMVLEDDMVKVVSWYDNEWGYSCRTVDLVNYVAKFMK HHHHHHHHHHHHHHHHCCCCCCCEEHHHHHHHHHHHCC >Mature Secondary Structure GIKVGLNGFGRIGRAVLRIAQDKFPEDVEIVALNARATTETLVHLFKYDSCYGRFNGQV CEEECCCCHHHHHHHHHHHHHHCCCCCEEEEEEECCHHHHHHHHHHHHHCHHCCCCCEE EIISNDKMRVNGKEIKIFRENDPQNLPWKELGVDIVIESTGIFKDREKAMKHIEAGAKKV EEEECCEEEECCCEEEEEECCCCCCCCHHHCCEEEEEECCCCCHHHHHHHHHHHCCCCEE IITAPGKNEDITIVLGVNENEYNPEEHNIISNASCTTNCLAPFAKVLDDNFGIVRGLMTT EEECCCCCCCEEEEEECCCCCCCCHHHCEECCCCCCHHHHHHHHHHHCCCHHHHHHHHHH VHSYTNDQRILDKTHKDLRRARAAGESIIPTTTGAAKAVAKVLPSLKGKLNGFALRVPTP HHHHCCCHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCCCCCEEEECCCC TVSLTDLVCELKTKVTAEEINAAFKKAAEGEMNGVLGYSEEPLVSVDYKGDPRSSIVDGL CCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEECCCCCHHHHHHHH STMVLEDDMVKVVSWYDNEWGYSCRTVDLVNYVAKFMK HHHHHHHHHHHHHHHHCCCCCCCEEHHHHHHHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 2684782; 2227448; 2656407; 7408868; 193030; 3586018; 9175858 [H]