| Definition | Kosmotoga olearia TBF 19.5.1, complete genome. |
|---|---|
| Accession | NC_012785 |
| Length | 2,302,126 |
The map label for this gene is folC [H]
Identifier: 239618104
GI number: 239618104
Start: 1863658
End: 1864989
Strand: Reverse
Name: folC [H]
Synonym: Kole_1736
Alternate gene names: 239618104
Gene position: 1864989-1863658 (Counterclockwise)
Preceding gene: 239618105
Following gene: 239618103
Centisome position: 81.01
GC content: 41.82
Gene sequence:
>1332_bases GTGACTGTTATGGAATACAGAGAAGCTCTGGAGTACCTTTATTCTTCGAGACCGTACGGAAAAATCAAGTACGGTCTCTT TCGTATTGAAGAATTGATGGAACGCCTGGGGAACCCTCAAGAAAGCTATCCTACCATACACATAACAGGTACAAACGGGA AAGGAAGTGTAGCAACGATACTCAAAGGGGTTCTCGAAGCCCATGGATTACACGTTGGTATGAATATATCACCGCATATC GTTTCCTTCAGGGAGCGAATTCAATTAGACAACAGGTACATAACAGAAGAAGAGGTATGTGAAACCCTGAAAGAAATTCT GCCAGCAATTGAAACGATGGATAAAAAAGGGCCAGAATATGCCCCGAGTTTCTTTGAGGTTGTAACCGCCATGGCCTTCC ACTTTTTCAGAAAGAAAAAGGTGGATGTTGCCGTCATCGAAGTTGGCTTAGGCGGTAGATACGATGCAACCAATATCATT AAGAAACCTCTTGTTTCAGTCATAACAACTGTTTCTCTGGATCACAAAAACATACTGGGGAAGAACGAAGAAAAAATTGC TATTGAAAAAGCCGGGATCATCAAAGAAGGTGTTCCGGTTGTTTCAGGTGTAACACGTCCTTCCATTCGATATACAATCG AAGAAAAAGCCGGTGAAAAGAACGCTCCTTCATACTTTCTATGGAAAGATTTTCAGGTTGAGACGAAGGAATTAAAAATC AATGAAAACGTGTACAATTATTCTGGAGAGGAAACGTACCCCAACCTGGTCCTCTCTTTGAACGGTACCCATCAGGCAAG AAACCTGGCGGTAGCTATGAAAACCTTAGAGATCGCTTTTAATGGATTGAAAAAGAAGATCGATAATGCCAAACTGCGAG ACAGCCTCAAAAAAACAAGCTGGCCCGGAAGGTTTGAGGTTTTAACCCATAAAAACAGGAAAATAATCCTTGATGGTGCT CATAATATCGACGGTGCTTATGCTCTACGGAACAGTCTTGAGATTTACTTCCCCGGTCAAAAGCTCGATATAATTTTCGG AAGTCTCGATGACAAAGACTACGAAAGCGTGATTTCAATTCTTGCACCTATCTCCGGGAAAGTTGTCGTAACAAAGGTTC CGAGTCACAGAAGCATAAATCCCGAGCGGGTGCGGGAAATATGGAAAGTGTACCACGGTAATGTTGAGTTTATAACGGAA CCGGATAGAGCCTTTGAAAAATTTTTCAACAGCACACAAAATACTTTACTCATCACGGGTTCTCTGTATCTGGTGAGTTA TCTTAGAAATCTAATCGTTGAGGGAGTTGGAGATATTGATAAAAGGCGTTGA
Upstream 100 bases:
>100_bases TCGAAAGAGCTCCGGAAGAGGTTGTTGAGGAAACAAAAGAAAAACTGAAAGCTGCCAAAAGCAATTGTGATAGGCTCAAA AAAATTATAGATGATTTGAA
Downstream 100 bases:
>100_bases AGGAGAAATAACAGAGATATCCGGAAACGAGGTCACACTGAAGGCAGGACAGTTCTATCTGAACATTCTGTGTTCGACCA ATACAATAAAGGAACTTTCA
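The coordinate fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and the centisome convention (position of the gene's 5' end as a percentage of genome length) is an assumption that happens to reproduce the reported 81.01, not something the record states.

```python
def gene_length(start: int, end: int) -> int:
    """Length of a gene given inclusive 1-based coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length (assumed convention)."""
    return 100.0 * position / genome_length


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace from line wrapping."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(bases.count(b) for b in "GC") / len(bases)


# Values taken from the record: Start, End, and the genome Length.
print(gene_length(1863658, 1864989))            # matches the 1332_bases header
print(round(centisome(1864989, 2302126), 2))    # compare with Centisome position
```

Because the gene lies on the reverse strand, the higher coordinate (1864989) is the one used for the centisome calculation here. Passing the full gene sequence to `gc_percent` gives a value to compare with the GC content field.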
Product: FolC bifunctional protein
Products: NA
Alternate protein names: Folylpoly-gamma-glutamate synthetase; FPGS; Tetrahydrofolate synthase; Tetrahydrofolylpolyglutamate synthase [H]
Number of amino acids: Translated: 443; Mature: 442
Protein sequence:
>443_residues MTVMEYREALEYLYSSRPYGKIKYGLFRIEELMERLGNPQESYPTIHITGTNGKGSVATILKGVLEAHGLHVGMNISPHI VSFRERIQLDNRYITEEEVCETLKEILPAIETMDKKGPEYAPSFFEVVTAMAFHFFRKKKVDVAVIEVGLGGRYDATNII KKPLVSVITTVSLDHKNILGKNEEKIAIEKAGIIKEGVPVVSGVTRPSIRYTIEEKAGEKNAPSYFLWKDFQVETKELKI NENVYNYSGEETYPNLVLSLNGTHQARNLAVAMKTLEIAFNGLKKKIDNAKLRDSLKKTSWPGRFEVLTHKNRKIILDGA HNIDGAYALRNSLEIYFPGQKLDIIFGSLDDKDYESVISILAPISGKVVVTKVPSHRSINPERVREIWKVYHGNVEFITE PDRAFEKFFNSTQNTLLITGSLYLVSYLRNLIVEGVGDIDKRR
Sequences:
>Translated_443_residues MTVMEYREALEYLYSSRPYGKIKYGLFRIEELMERLGNPQESYPTIHITGTNGKGSVATILKGVLEAHGLHVGMNISPHI VSFRERIQLDNRYITEEEVCETLKEILPAIETMDKKGPEYAPSFFEVVTAMAFHFFRKKKVDVAVIEVGLGGRYDATNII KKPLVSVITTVSLDHKNILGKNEEKIAIEKAGIIKEGVPVVSGVTRPSIRYTIEEKAGEKNAPSYFLWKDFQVETKELKI NENVYNYSGEETYPNLVLSLNGTHQARNLAVAMKTLEIAFNGLKKKIDNAKLRDSLKKTSWPGRFEVLTHKNRKIILDGA HNIDGAYALRNSLEIYFPGQKLDIIFGSLDDKDYESVISILAPISGKVVVTKVPSHRSINPERVREIWKVYHGNVEFITE PDRAFEKFFNSTQNTLLITGSLYLVSYLRNLIVEGVGDIDKRR
>Mature_442_residues TVMEYREALEYLYSSRPYGKIKYGLFRIEELMERLGNPQESYPTIHITGTNGKGSVATILKGVLEAHGLHVGMNISPHIV SFRERIQLDNRYITEEEVCETLKEILPAIETMDKKGPEYAPSFFEVVTAMAFHFFRKKKVDVAVIEVGLGGRYDATNIIK KPLVSVITTVSLDHKNILGKNEEKIAIEKAGIIKEGVPVVSGVTRPSIRYTIEEKAGEKNAPSYFLWKDFQVETKELKIN ENVYNYSGEETYPNLVLSLNGTHQARNLAVAMKTLEIAFNGLKKKIDNAKLRDSLKKTSWPGRFEVLTHKNRKIILDGAH NIDGAYALRNSLEIYFPGQKLDIIFGSLDDKDYESVISILAPISGKVVVTKVPSHRSINPERVREIWKVYHGNVEFITEP DRAFEKFFNSTQNTLLITGSLYLVSYLRNLIVEGVGDIDKRR
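The relationship between the gene sequence, the translated protein, and the mature protein can be sketched as follows. This is a minimal illustration, not the database's own pipeline: `translate_cds` and `mature_form` are hypothetical names, the codon table is the standard one (whose codon assignments bacterial table 11 shares), and the code assumes the first codon is a valid initiator that is read as Met even when it is GTG, as it is for this gene. It also assumes the listed gene sequence is already in coding orientation, which is consistent with its first codons matching the protein's N-terminus.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(cds: str) -> str:
    """Translate a coding sequence up to (not including) the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for index in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[index:index + 3]]
        if amino_acid == "*":
            break
        # Assumption: the initiator codon (e.g. GTG) is decoded as Met.
        protein.append("M" if index == 0 else amino_acid)
    return "".join(protein)


def mature_form(protein: str) -> str:
    """Drop the initiator Met, which is how the Mature entry differs here."""
    return protein[1:] if protein.startswith("M") else protein


start_of_gene = "GTGACTGTTATGGAATAC"   # first 18 bases of the gene sequence
print(translate_cds(start_of_gene))     # first six residues of the protein
print(mature_form(translate_cds(start_of_gene)))
```

The 1332-base gene holds 444 codons: 443 coding codons plus the TGA stop, which agrees with the 443-residue translated length and the 442-residue mature length.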
Specific function: Converts folates to polyglutamate derivatives. It prefers 5,10-methylenetetrahydrofolate over 10-formyltetrahydrofolate as the folate substrate [H]
COG id: COG0285
COG function: function code H; Folylpolyglutamate synthase
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the folylpolyglutamate synthase family [H]
Homologues:
Organism=Homo sapiens, GI66932990, Length=349, Percent_Identity=30.6590257879656, Blast_Score=174, Evalue=2e-43
Organism=Homo sapiens, GI66932984, Length=349, Percent_Identity=30.6590257879656, Blast_Score=174, Evalue=2e-43
Organism=Escherichia coli, GI1788654, Length=428, Percent_Identity=28.7383177570093, Blast_Score=148, Evalue=8e-37
Organism=Caenorhabditis elegans, GI17553150, Length=439, Percent_Identity=26.879271070615, Blast_Score=155, Evalue=3e-38
Organism=Caenorhabditis elegans, GI17553148, Length=439, Percent_Identity=26.879271070615, Blast_Score=155, Evalue=3e-38
Organism=Caenorhabditis elegans, GI71984923, Length=439, Percent_Identity=26.879271070615, Blast_Score=155, Evalue=3e-38
Organism=Saccharomyces cerevisiae, GI6323760, Length=442, Percent_Identity=30.9954751131222, Blast_Score=182, Evalue=8e-47
Organism=Saccharomyces cerevisiae, GI6324815, Length=327, Percent_Identity=31.8042813455657, Blast_Score=129, Evalue=7e-31
Organism=Saccharomyces cerevisiae, GI6322718, Length=288, Percent_Identity=26.0416666666667, Blast_Score=84, Evalue=6e-17
Organism=Drosophila melanogaster, GI24641571, Length=301, Percent_Identity=30.8970099667774, Blast_Score=147, Evalue=2e-35
Organism=Drosophila melanogaster, GI24581568, Length=306, Percent_Identity=27.1241830065359, Blast_Score=103, Evalue=2e-22
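For downstream use, the homologue entries can be read into structured records. The following is a rough sketch that assumes every entry follows the `Organism=…, GI…, Length=…, Percent_Identity=…, Blast_Score=…, Evalue=…` pattern seen in this record. `Hit` and `parse_hits` are hypothetical names introduced for illustration.

```python
import re
from typing import List, NamedTuple


class Hit(NamedTuple):
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


# Matches one homologue entry, whether entries share a line or not.
HIT_PATTERN = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<pid>[\d.]+),\s*Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[0-9.eE+-]+)"
)


def parse_hits(text: str) -> List[Hit]:
    """Extract every homologue entry found in the given text."""
    return [
        Hit(
            organism=m["organism"].strip(),
            gi=m["gi"],
            length=int(m["length"]),
            percent_identity=float(m["pid"]),
            blast_score=int(m["score"]),
            evalue=float(m["evalue"]),
        )
        for m in HIT_PATTERN.finditer(text)
    ]


# Two entries copied from the record.
sample = (
    "Organism=Homo sapiens, GI66932990, Length=349, "
    "Percent_Identity=30.6590257879656, Blast_Score=174, Evalue=2e-43, "
    "Organism=Saccharomyces cerevisiae, GI6323760, Length=442, "
    "Percent_Identity=30.9954751131222, Blast_Score=182, Evalue=8e-47,"
)
best = max(parse_hits(sample), key=lambda hit: hit.blast_score)
print(best.organism, best.blast_score)
```

Ranking by `blast_score` (or by ascending `evalue`) then gives a quick view of the closest homologues among the listed organisms.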
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR018109
- InterPro: IPR001645
- InterPro: IPR004101
- InterPro: IPR013221 [H]
Pfam domain/function: PF02875 Mur_ligase_C; PF08245 Mur_ligase_M [H]
EC number: 6.3.2.17 [H]
Molecular weight: Translated: 50203; Mature: 50072
Theoretical pI: Translated: 9.08; Mature: 9.08
Prosite motif: PS01011 FOLYLPOLYGLU_SYNT_1 ; PS01012 FOLYLPOLYGLU_SYNT_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.2 %Cys (Translated Protein)
1.6 %Met (Translated Protein)
1.8 %Cys+Met (Translated Protein)
0.2 %Cys (Mature Protein)
1.4 %Met (Mature Protein)
1.6 %Cys+Met (Mature Protein)
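The Cys/Met percentages are simple residue frequencies, and the reported values are consistent with 1 Cys and 7 Met in the 443-residue translated sequence (6 Met once the initiator Met is removed). A minimal sketch of the calculation, using a hypothetical helper name and rounding to one decimal place as the record appears to do:

```python
from typing import Dict


def cys_met_percent(protein: str) -> Dict[str, float]:
    """Percentage of Cys (C), Met (M), and both combined in a protein sequence."""
    residues = "".join(protein.split()).upper()
    if not residues:
        raise ValueError("empty sequence")
    cys = residues.count("C")
    met = residues.count("M")
    total = len(residues)
    return {
        "Cys": round(100.0 * cys / total, 1),
        "Met": round(100.0 * met / total, 1),
        "Cys+Met": round(100.0 * (cys + met) / total, 1),
    }


# Synthetic stand-ins with the same Cys/Met counts and lengths as the
# translated (443) and mature (442) sequences; Ala is used only as filler.
translated_like = "M" * 7 + "C" + "A" * 435
mature_like = translated_like[1:]
print(cys_met_percent(translated_like))
print(cys_met_percent(mature_like))
```

Running the same function on the actual Translated and Mature sequences should give the figures listed in this field.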
Secondary structure:
>Translated Secondary Structure
MTVMEYREALEYLYSSRPYGKIKYGLFRIEELMERLGNPQESYPTIHITGTNGKGSVATI
CCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCHHCCEEEEECCCCCCHHHHH
LKGVLEAHGLHVGMNISPHIVSFRERIQLDNRYITEEEVCETLKEILPAIETMDKKGPEY
HHHHHHHCCEEEECCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCC
APSFFEVVTAMAFHFFRKKKVDVAVIEVGLGGRYDATNIIKKPLVSVITTVSLDHKNILG
CHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCHHHCC
KNEEKIAIEKAGIIKEGVPVVSGVTRPSIRYTIEEKAGEKNAPSYFLWKDFQVETKELKI
CCCCEEEEECCCCHHCCCCHHCCCCCCCEEEEEHHHCCCCCCCCEEEEECCEEEEEEEEE
NENVYNYSGEETYPNLVLSLNGTHQARNLAVAMKTLEIAFNGLKKKIDNAKLRDSLKKTS
ECCEECCCCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCC
WPGRFEVLTHKNRKIILDGAHNIDGAYALRNSLEIYFPGQKLDIIFGSLDDKDYESVISI
CCCEEEEEEECCCEEEEECCCCCCHHHHHHCCEEEEECCCEEEEEEECCCCHHHHHHHHH
LAPISGKVVVTKVPSHRSINPERVREIWKVYHGNVEFITEPDRAFEKFFNSTQNTLLITG
HCCCCCCEEEEECCCCCCCCHHHHHHHHHHHCCCEEEEECCHHHHHHHHCCCCCEEEEEH
SLYLVSYLRNLIVEGVGDIDKRR
HHHHHHHHHHHHHHCCCCCCCCC
>Mature Secondary Structure
TVMEYREALEYLYSSRPYGKIKYGLFRIEELMERLGNPQESYPTIHITGTNGKGSVATI
CHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCHHCCEEEEECCCCCCHHHHH
LKGVLEAHGLHVGMNISPHIVSFRERIQLDNRYITEEEVCETLKEILPAIETMDKKGPEY
HHHHHHHCCEEEECCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCC
APSFFEVVTAMAFHFFRKKKVDVAVIEVGLGGRYDATNIIKKPLVSVITTVSLDHKNILG
CHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCHHHCC
KNEEKIAIEKAGIIKEGVPVVSGVTRPSIRYTIEEKAGEKNAPSYFLWKDFQVETKELKI
CCCCEEEEECCCCHHCCCCHHCCCCCCCEEEEEHHHCCCCCCCCEEEEECCEEEEEEEEE
NENVYNYSGEETYPNLVLSLNGTHQARNLAVAMKTLEIAFNGLKKKIDNAKLRDSLKKTS
ECCEECCCCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCC
WPGRFEVLTHKNRKIILDGAHNIDGAYALRNSLEIYFPGQKLDIIFGSLDDKDYESVISI
CCCEEEEEEECCCEEEEECCCCCCHHHHHHCCEEEEECCCEEEEEEECCCCHHHHHHHHH
LAPISGKVVVTKVPSHRSINPERVREIWKVYHGNVEFITEPDRAFEKFFNSTQNTLLITG
HCCCCCCEEEEECCCCCCCCHHHHHHHHHHHCCCEEEEECCHHHHHHHHCCCCCEEEEEH
SLYLVSYLRNLIVEGVGDIDKRR
HHHHHHHHHHHHHHCCCCCCCCC
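The secondary structure prediction pairs each block of residues with a block of per-residue state letters. The sketch below summarizes such a prediction, assuming the common three-state convention (H = helix, E = strand, C = coil). The record itself does not define the letters, so that reading is an assumption, and both function names are hypothetical.

```python
from collections import Counter
from typing import Dict, Iterable, Tuple


def ss_composition(states: str) -> Dict[str, float]:
    """Percentage of each assumed state (H, E, C) in a state string."""
    letters = "".join(states.split()).upper()
    if not letters:
        raise ValueError("empty state string")
    counts = Counter(letters)
    return {s: round(100.0 * counts[s] / len(letters), 1) for s in "HEC"}


def lines_are_aligned(pairs: Iterable[Tuple[str, str]]) -> bool:
    """Check that every residue block has a state block of equal length."""
    return all(len(residues) == len(states) for residues, states in pairs)


# Toy input; real use would pass the residue/state blocks from the record.
print(ss_composition("HHHHCCEE"))
print(lines_are_aligned([("ACDE", "HHCC"), ("FG", "EE")]))
```

A mixed helix and strand composition would be in line with the "Alpha Beta" structure class reported for this protein, though the exact proportions depend on the prediction method.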
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 8419299; 9384377; 2553669 [H]