Definition Akkermansia muciniphila ATCC BAA-835, complete genome.
Accession NC_010655
Length 2,664,102

The map label for this gene is folC [H]

Identifier: 187735419

GI number: 187735419

Start: 1096830

End: 1098044

Strand: Reverse

Name: folC [H]

Synonym: Amuc_0918

Alternate gene names: 187735419

Gene position: 1098044-1096830 (Counterclockwise)

Preceding gene: 187735420

Following gene: 187735418

Centisome position: 41.22
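The coordinate fields above are related by simple arithmetic, sketched below. The centisome formula (coordinate as a percentage of genome length) is an assumption, not something the record states. It reproduces the listed value when the higher coordinate, 1098044, is used; on the reverse strand that is where the gene begins. The function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 2_664_102   # Length
START = 1_096_830           # Start
END = 1_098_044             # End

def gene_length(start, end):
    """Length in bases of a feature given inclusive 1-based coordinates."""
    return end - start + 1

def centisome(position, genome_length):
    """Position expressed as a percentage of the genome length (assumed formula)."""
    return round(100.0 * position / genome_length, 2)

length = gene_length(START, END)
centi = centisome(END, GENOME_LENGTH)
print(length, centi)
```

Using the lower coordinate (1096830) instead gives 41.17, so the listed 41.22 appears to be measured from the gene's start on the reverse strand.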

GC content: 61.56

Gene sequence:

>1215_bases
ATGAATGTATCCGCCGCCCTTGACTGGCTCTTTTCCACCCAATTTTTCGGGATCAAGCTGGGGCTTGATAATACCAGGAA
GCTGCTGGCGGCAGCCGGGGCGGACCGGATAAACGCCACCGTGGTCCACGTGGCCGGCACCAATGGGAAAGGCTCCACCT
GCGCCATGATTGAGGCCCTGGCCAGGGCGCAGGGTTATGTCACGGGCCTGTTCACTTCCCCCCATCTGGTAGATTTTTCT
GAACGCATCCGCGTCAATGGGGACATGATCACTCCGGATGCCCTGGCAGAAGAAATTTCCTTCCTGAAACAGCTGGCGGA
AGGCTGGGAACAGCCGCCCACCTTTTTCGAGCTGGCTCTGGCCGTCGCCCTGCGCCACTTCCGGAAAAACAGCGTGAACT
TCATCATTCTGGAAACAGGGCTGGGCGGAAGGCTGGACGCCACTAACGCCGTGCCCAAGGATATCGCCGTACTGGCCCCC
ATCGGCCTGGACCACCAGCAGTATCTGGGAGATACCCTGGAAGAGATCGCTGCGGAAAAAGCGGCCATCATCGCCCCCGG
AAAACCGTCCGTAACTGCCGTGCAGCACCCGGGTGTCATGGCCGTCATCGAACGGACGGCGCAAAACCGCCGTTCTCCCC
TCACGATTGCCCGGGCGGACCGGAAAACGCCCATCCCATCCCTGCCAGGAGCCCATCAGCGGGAAAACGCAGCCCTGGCG
CTGGAAACCATGAGGCGGCTCCATTCCCTCCCCTCCCCATCCGAAGCCGCCGCAGTCCTGGCCAAGGTACAGTGGCCCGG
ACGGTTCGAACGCCTTGAGACGCCCCCCCTGGTGTTGGACGGCGCTCATAACGAACATGCGGCGCGCGTGCTCGCGTCCA
CGTGGAAAGAGGAATTCCCGGGACGGAAGGCGGCACTGGTTTTTGCCGCATCCGCAGACAAGCACATCCGGGAAATGATT
CCGGTTTTGCGGGAAATTACCGGAGAATGGCATCTGGTTCCCTGCACTTCCCCCCGCATCATGCCGGCGGAAGAAATGGC
TGCCCTGCTGGGCGAACAGGAAACCGGCCCCGTTTTCATCCATTCCTCGCTTCCCGATGGTCTGCAGGCGGCACTGGCCT
CCCCGCTCCCGGTCCTGGCGGCAGGTTCCCTGTTCCTTCTGGGAGATTTGAAAGCTCTGCTCCGCCATGCGGAAAAACGC
AGCACCGCCCAATAA
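
The GC content field reported above is presumably the percentage of G and C bases in this gene sequence. A minimal sketch of that calculation follows, assuming a plain count over A/C/G/T with no ambiguity codes; `gc_content` is a hypothetical helper, and the sample is only the first 20 bases of the sequence, not the full gene.

```python
def gc_content(seq):
    """Percentage of G and C bases in a nucleotide string, rounded to 2 decimals."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return round(100.0 * gc / len(seq), 2)

# First 20 bases of the gene sequence above (11 of 20 are G or C).
sample = "ATGAATGTATCCGCCGCCCT"
print(gc_content(sample))
```

To check the record's figure, pass the full 1215-base sequence with line breaks removed.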

Upstream 100 bases:

>100_bases
TAGCTTGTTCGTGTTTGCCGCGTCCGTTCCCGGCGGCATTTTACAGTCCCCACGCCATACCGGAGTTGAACGGGAGTTTT
CCCGTGGTAAAATACCGCCC

Downstream 100 bases:

>100_bases
CCCCTACCGTCCTTTTTGCTTTTCCCCGTGATTTCCGACGACATAACCCTTTCCGCCCTCCGGCCCGGAGACCTGCGCCC
CGAACTGCTGGCCCCCGCCG

Product: FolC bifunctional protein

Products: NA

Alternate protein names: Folylpoly-gamma-glutamate synthetase; FPGS; Tetrahydrofolate synthase; Tetrahydrofolylpolyglutamate synthase [H]

Number of amino acids: Translated: 404; Mature: 404

Protein sequence:

>404_residues
MNVSAALDWLFSTQFFGIKLGLDNTRKLLAAAGADRINATVVHVAGTNGKGSTCAMIEALARAQGYVTGLFTSPHLVDFS
ERIRVNGDMITPDALAEEISFLKQLAEGWEQPPTFFELALAVALRHFRKNSVNFIILETGLGGRLDATNAVPKDIAVLAP
IGLDHQQYLGDTLEEIAAEKAAIIAPGKPSVTAVQHPGVMAVIERTAQNRRSPLTIARADRKTPIPSLPGAHQRENAALA
LETMRRLHSLPSPSEAAAVLAKVQWPGRFERLETPPLVLDGAHNEHAARVLASTWKEEFPGRKAALVFAASADKHIREMI
PVLREITGEWHLVPCTSPRIMPAEEMAALLGEQETGPVFIHSSLPDGLQAALASPLPVLAAGSLFLLGDLKALLRHAEKR
STAQ
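
The 1215-base gene sequence and the 404-residue protein are consistent: 1215 bases make 405 codons, the last of which (TAA) is a stop codon. The sketch below illustrates this with the first ten and last five codons of the gene. It assumes the standard codon assignments and that the listed gene sequence is already in coding orientation; `translate` is a hypothetical helper, not part of any tool named in the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

first_30 = "ATGAATGTATCCGCCGCCCTTGACTGGCTC"   # start of the gene sequence
last_15 = "AGCACCGCCCAATAA"                  # end of the gene sequence
print(translate(first_30), translate(last_15))
```

The two fragments translate to the first ten residues (MNVSAALDWL) and the last four residues (STAQ) of the protein sequence above.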

Sequences:

>Translated_404_residues
MNVSAALDWLFSTQFFGIKLGLDNTRKLLAAAGADRINATVVHVAGTNGKGSTCAMIEALARAQGYVTGLFTSPHLVDFS
ERIRVNGDMITPDALAEEISFLKQLAEGWEQPPTFFELALAVALRHFRKNSVNFIILETGLGGRLDATNAVPKDIAVLAP
IGLDHQQYLGDTLEEIAAEKAAIIAPGKPSVTAVQHPGVMAVIERTAQNRRSPLTIARADRKTPIPSLPGAHQRENAALA
LETMRRLHSLPSPSEAAAVLAKVQWPGRFERLETPPLVLDGAHNEHAARVLASTWKEEFPGRKAALVFAASADKHIREMI
PVLREITGEWHLVPCTSPRIMPAEEMAALLGEQETGPVFIHSSLPDGLQAALASPLPVLAAGSLFLLGDLKALLRHAEKR
STAQ
>Mature_404_residues
MNVSAALDWLFSTQFFGIKLGLDNTRKLLAAAGADRINATVVHVAGTNGKGSTCAMIEALARAQGYVTGLFTSPHLVDFS
ERIRVNGDMITPDALAEEISFLKQLAEGWEQPPTFFELALAVALRHFRKNSVNFIILETGLGGRLDATNAVPKDIAVLAP
IGLDHQQYLGDTLEEIAAEKAAIIAPGKPSVTAVQHPGVMAVIERTAQNRRSPLTIARADRKTPIPSLPGAHQRENAALA
LETMRRLHSLPSPSEAAAVLAKVQWPGRFERLETPPLVLDGAHNEHAARVLASTWKEEFPGRKAALVFAASADKHIREMI
PVLREITGEWHLVPCTSPRIMPAEEMAALLGEQETGPVFIHSSLPDGLQAALASPLPVLAAGSLFLLGDLKALLRHAEKR
STAQ

Specific function: Conversion of folates to polyglutamate derivatives. It prefers 5,10-methylenetetrahydrofolate, rather than 10-formyltetrahydrofolate, as the folate substrate [H]

COG id: COG0285

COG function: function code H; Folylpolyglutamate synthase

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the folylpolyglutamate synthase family [H]

Homologues:

Organism=Homo sapiens, GI66932990, Length=360, Percent_Identity=32.2222222222222, Blast_Score=157, Evalue=2e-38,
Organism=Homo sapiens, GI66932984, Length=360, Percent_Identity=32.2222222222222, Blast_Score=157, Evalue=2e-38,
Organism=Escherichia coli, GI1788654, Length=357, Percent_Identity=32.7731092436975, Blast_Score=154, Evalue=8e-39,
Organism=Caenorhabditis elegans, GI17553150, Length=355, Percent_Identity=30.4225352112676, Blast_Score=143, Evalue=1e-34,
Organism=Caenorhabditis elegans, GI71984923, Length=355, Percent_Identity=30.4225352112676, Blast_Score=143, Evalue=2e-34,
Organism=Caenorhabditis elegans, GI17553148, Length=355, Percent_Identity=30.4225352112676, Blast_Score=143, Evalue=2e-34,
Organism=Saccharomyces cerevisiae, GI6323760, Length=417, Percent_Identity=30.2158273381295, Blast_Score=166, Evalue=6e-42,
Organism=Saccharomyces cerevisiae, GI6324815, Length=324, Percent_Identity=31.1728395061728, Blast_Score=123, Evalue=7e-29,
Organism=Drosophila melanogaster, GI24641571, Length=286, Percent_Identity=32.5174825174825, Blast_Score=135, Evalue=4e-32,
Organism=Drosophila melanogaster, GI24581568, Length=169, Percent_Identity=31.3609467455621, Blast_Score=92, Evalue=5e-19,
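
Each homologue line above is a comma-separated list of key=value fields, plus a bare GI identifier. A minimal parsing sketch follows, assuming that every line has exactly this layout; `parse_homologue` is a hypothetical helper, and rounding the identity to one decimal is a presentation choice, not part of the record.

```python
def parse_homologue(line):
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in line.strip().rstrip(",").split(", "):
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]          # bare GI identifier, no '=' sign
    # Convert the numeric fields from text.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = round(float(record["Percent_Identity"]), 1)
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record

line = ("Organism=Escherichia coli, GI1788654, Length=357, "
        "Percent_Identity=32.7731092436975, Blast_Score=154, Evalue=8e-39,")
hit = parse_homologue(line)
print(hit["Organism"], hit["Percent_Identity"], hit["Evalue"])
```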

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR018109
- InterPro:   IPR001645
- InterPro:   IPR004101
- InterPro:   IPR013221 [H]

Pfam domain/function: PF02875 Mur_ligase_C; PF08245 Mur_ligase_M [H]

EC number: 6.3.2.17 [H]

Molecular weight: Translated: 43494; Mature: 43494

Theoretical pI: Translated: 7.01; Mature: 7.01

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.5 %Cys     (Translated Protein)
2.0 %Met     (Translated Protein)
2.5 %Cys+Met (Translated Protein)
0.5 %Cys     (Mature Protein)
2.0 %Met     (Mature Protein)
2.5 %Cys+Met (Mature Protein)
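
The percentages above are consistent with 2 Cys and 8 Met residues in the 404-residue protein. The sketch below shows the arithmetic, assuming each percentage is a residue count divided by the sequence length and rounded to one decimal. `residue_percent` is a hypothetical helper, and the input is a synthetic stand-in with the same composition, not the real sequence.

```python
def residue_percent(protein, residues):
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = protein.upper()
    hits = sum(1 for aa in protein if aa in residues)
    return round(100.0 * hits / len(protein), 1)

# Synthetic placeholder: 2 Cys, 8 Met, 394 Ala (404 residues in total).
# Replace with the real protein sequence to check the record directly.
placeholder = "C" * 2 + "M" * 8 + "A" * 394

cys = residue_percent(placeholder, "C")
met = residue_percent(placeholder, "M")
both = residue_percent(placeholder, "CM")
print(cys, met, both)
```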

Secondary structure:

>Translated Secondary Structure
MNVSAALDWLFSTQFFGIKLGLDNTRKLLAAAGADRINATVVHVAGTNGKGSTCAMIEAL
CCHHHHHHHHHHCEEEEEEECCCHHHHHHHHCCCCCCCEEEEEEECCCCCCCHHHHHHHH
ARAQGYVTGLFTSPHLVDFSERIRVNGDMITPDALAEEISFLKQLAEGWEQPPTFFELAL
HHHCCCEEEEECCCCEECCHHHEEECCCEECHHHHHHHHHHHHHHHHHHCCCCHHHHHHH
AVALRHFRKNSVNFIILETGLGGRLDATNAVPKDIAVLAPIGLDHQQYLGDTLEEIAAEK
HHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHCC
AAIIAPGKPSVTAVQHPGVMAVIERTAQNRRSPLTIARADRKTPIPSLPGAHQRENAALA
EEEECCCCCCEEEECCCCHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCCCCCHHHH
LETMRRLHSLPSPSEAAAVLAKVQWPGRFERLETPPLVLDGAHNEHAARVLASTWKEEFP
HHHHHHHHCCCCCHHHHHHHHHCCCCCCHHCCCCCCEEEECCCCHHHHHHHHHHHHHHCC
GRKAALVFAASADKHIREMIPVLREITGEWHLVPCTSPRIMPAEEMAALLGEQETGPVFI
CCEEEEEEECCCHHHHHHHHHHHHHHCCCEEEEECCCCCCCCHHHHHHHHCCCCCCCEEE
HSSLPDGLQAALASPLPVLAAGSLFLLGDLKALLRHAEKRSTAQ
ECCCCHHHHHHHCCCCCHHHCCCCHHHHHHHHHHHHHHHHCCCC
>Mature Secondary Structure
MNVSAALDWLFSTQFFGIKLGLDNTRKLLAAAGADRINATVVHVAGTNGKGSTCAMIEAL
CCHHHHHHHHHHCEEEEEEECCCHHHHHHHHCCCCCCCEEEEEEECCCCCCCHHHHHHHH
ARAQGYVTGLFTSPHLVDFSERIRVNGDMITPDALAEEISFLKQLAEGWEQPPTFFELAL
HHHCCCEEEEECCCCEECCHHHEEECCCEECHHHHHHHHHHHHHHHHHHCCCCHHHHHHH
AVALRHFRKNSVNFIILETGLGGRLDATNAVPKDIAVLAPIGLDHQQYLGDTLEEIAAEK
HHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHCC
AAIIAPGKPSVTAVQHPGVMAVIERTAQNRRSPLTIARADRKTPIPSLPGAHQRENAALA
EEEECCCCCCEEEECCCCHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCCCCCHHHH
LETMRRLHSLPSPSEAAAVLAKVQWPGRFERLETPPLVLDGAHNEHAARVLASTWKEEFP
HHHHHHHHCCCCCHHHHHHHHHCCCCCCHHCCCCCCEEEECCCCHHHHHHHHHHHHHHCC
GRKAALVFAASADKHIREMIPVLREITGEWHLVPCTSPRIMPAEEMAALLGEQETGPVFI
CCEEEEEEECCCHHHHHHHHHHHHHHCCCEEEEECCCCCCCCHHHHHHHHCCCCCCCEEE
HSSLPDGLQAALASPLPVLAAGSLFLLGDLKALLRHAEKRSTAQ
ECCCCHHHHHHHCCCCCHHHCCCCHHHHHHHHHHHHHHHHCCCC
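
In the secondary structure blocks above, each line of residues is followed by a line of per-residue states. Reading H, E and C as helix, strand and coil is the usual convention and is assumed here, since the record does not define the letters. The sketch below summarises a state string as percentages, using only the first 12 positions of the prediction as input; `state_percentages` is a hypothetical helper.

```python
from collections import Counter

def state_percentages(states):
    """Percentage of H, E and C states in a prediction string."""
    counts = Counter(states.upper())
    total = len(states)
    return {s: round(100.0 * counts.get(s, 0) / total, 1) for s in "HEC"}

residues = "MNVSAALDWLFS"   # first 12 residues of the protein
states = "CCHHHHHHHHHH"     # matching first 12 predicted states
summary = state_percentages(states)
print(summary)
```

Concatenating all state lines of one block and passing them to the same function would give the overall helix/strand/coil mix behind the "Alpha Beta" structure class.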

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8419299; 9384377; 2553669 [H]