Definition Prochlorococcus marinus str. MIT 9313 chromosome, complete genome.
Accession NC_005071
Length 2,410,873

The map label for this gene is folE [H]

Identifier: 33863495

GI number: 33863495

Start: 1314144

End: 1314911

Strand: Reverse

Name: folE [H]

Synonym: PMT1227

Alternate gene names: 33863495

Gene position: 1314911-1314144 (Counterclockwise)

Preceding gene: 33863496

Following gene: 33863493

Centisome position: 54.54
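
The coordinate fields above can be cross-checked with a few lines of arithmetic. This is a minimal sketch, not the database's own method: the function names are hypothetical, and it assumes that Start and End are 1-based inclusive coordinates and that the centisome position is taken at the 5' end of the gene (the End coordinate for a reverse-strand gene), which is the choice that reproduces the listed value.

```python
# Values copied from the record above.
GENOME_LENGTH = 2_410_873          # Length field
START, END = 1_314_144, 1_314_911  # Start and End fields

def gene_length(start, end):
    """Length in bases of a feature with 1-based inclusive coordinates."""
    return end - start + 1

def centisome(position, genome_length):
    """Position expressed as a percentage of the genome length (assumed definition)."""
    return round(100 * position / genome_length, 2)

print(gene_length(START, END))        # 768, matching the ">768_bases" header
print(centisome(END, GENOME_LENGTH))  # 54.54, matching the Centisome position field
```

Using the Start coordinate instead gives a slightly lower value, so the reverse-strand gene appears to be anchored at its End coordinate.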

GC content: 46.35

Gene sequence:

>768_bases
ATGACTTCCATCATCCCAGCATCATCGAAAGGCCTTGGCAATGACAAAAATGCTTCGCCTGACAATGGTCAAATTGCCAA
TACAAGCAAGATTTCCGAGCGTATACGCGCACGGTTTGTTGCGGGAGGTGTTTCCTTTCTCGCTAACGATAATATTTCGG
AATACATTCAACCAGGCGAGTTGCAAGAGCTTGAAGTAGAAGTTGCCGATCGAGTACGAGAACTACTTCAAAGCCTGTTG
ATTGATATTCATAATGACCACAACACAGAAGAGACAGCCGAGCGAGTTGCTCGCATGTATCTCAACGAAGTTTTTAAGGG
CCGTTATCAGAAACAGCCCAAGATTACAAGTTTTCCTAACGTTAAGCAGCTTGATGAGATTTATACTGTTGGACCAATCA
CGATCCGTTCGGCCTGCTCCCACCACTTTGTGCCGATCATGGGTAACTGCTGGATTGGGATCAAGCCAGGGACCCGTGTG
ATTGGACTCTCTAAGTTTGCAAGAGTTGCCGATTGGGTATTTTCAAGACCTCATATTCAGGAAGAGGCCGTAATGATCTT
GGCTGATGAAATTGAGCGGCTATGTGAGCCTCAGGGGCTTGGAATTATTGTCAAGGCAGAGCATTATTGCATGAAATTAC
GTGGCGTAATGGAACCCCAGTGCACCATGGTGAACTCAGTTGTGCGAGGTGTTTTCAGGCATGACTCTAGCCTTAAACAG
GAATTCTTTGAGCTCGTTCGTCAGCAGGAGGCCATGCTTGCAACATAA
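
The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched as follows. The helper names are hypothetical, and the codon table is the standard genetic code built from the usual NCBI ordering; it is an assumption that this is the table the record was generated with. The two fragments are the first 33 bases and the last 48 bases of the sequence above.

```python
# Standard genetic code, first/second/third base each in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def gc_content(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    return 100 * (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    seq = seq.upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

head = "ATGACTTCCATCATCCCAGCATCATCGAAAGGC"                 # first 11 codons
tail = "GAATTCTTTGAGCTCGTTCGTCAGCAGGAGGCCATGCTTGCAACATAA"  # last 16 codons
print(translate(head))  # MTSIIPASSKG
print(translate(tail))  # EFFELVRQQEAMLAT
```

Passing the full 768-base sequence to `gc_content` and `translate` should give values comparable to the GC content and protein sequence fields of this record.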

Upstream 100 bases:

>100_bases
CTGCCGCTGCCCTTCTTCACTTGGCCCAACAGCCACCATCACAGGTGATTGAAGATCTAACTCTGATGCCAGCCACTGGC
GCCTTTTGAACCAGCCTTTA

Downstream 100 bases:

>100_bases
AAGATCTCTCTAGATCTTGCCAAGAAAGAACATTTTGGCGATGGCCATGTCTGATTAACATGTGTGTATCCACTTCGCTT
GTTACTAGCCAAGTGGATGA

Product: GTP cyclohydrolase I

Products: NA

Alternate protein names: GTP cyclohydrolase I; GTP-CH-I [H]

Number of amino acids: Translated: 255; Mature: 254

Protein sequence:

>255_residues
MTSIIPASSKGLGNDKNASPDNGQIANTSKISERIRARFVAGGVSFLANDNISEYIQPGELQELEVEVADRVRELLQSLL
IDIHNDHNTEETAERVARMYLNEVFKGRYQKQPKITSFPNVKQLDEIYTVGPITIRSACSHHFVPIMGNCWIGIKPGTRV
IGLSKFARVADWVFSRPHIQEEAVMILADEIERLCEPQGLGIIVKAEHYCMKLRGVMEPQCTMVNSVVRGVFRHDSSLKQ
EFFELVRQQEAMLAT

Sequences:

>Translated_255_residues
MTSIIPASSKGLGNDKNASPDNGQIANTSKISERIRARFVAGGVSFLANDNISEYIQPGELQELEVEVADRVRELLQSLL
IDIHNDHNTEETAERVARMYLNEVFKGRYQKQPKITSFPNVKQLDEIYTVGPITIRSACSHHFVPIMGNCWIGIKPGTRV
IGLSKFARVADWVFSRPHIQEEAVMILADEIERLCEPQGLGIIVKAEHYCMKLRGVMEPQCTMVNSVVRGVFRHDSSLKQ
EFFELVRQQEAMLAT
>Mature_254_residues
TSIIPASSKGLGNDKNASPDNGQIANTSKISERIRARFVAGGVSFLANDNISEYIQPGELQELEVEVADRVRELLQSLLI
DIHNDHNTEETAERVARMYLNEVFKGRYQKQPKITSFPNVKQLDEIYTVGPITIRSACSHHFVPIMGNCWIGIKPGTRVI
GLSKFARVADWVFSRPHIQEEAVMILADEIERLCEPQGLGIIVKAEHYCMKLRGVMEPQCTMVNSVVRGVFRHDSSLKQE
FFELVRQQEAMLAT

Specific function: Tetrahydrofolate biosynthesis; first step. [C]

COG id: COG0302

COG function: function code H; GTP cyclohydrolase I

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the GTP cyclohydrolase I family [H]

Homologues:

Organism=Homo sapiens, GI66932968, Length=123, Percent_Identity=39.02, Blast_Score=97, Evalue=2e-20
Organism=Homo sapiens, GI4503949, Length=123, Percent_Identity=39.02, Blast_Score=97, Evalue=2e-20
Organism=Escherichia coli, GI1788476, Length=183, Percent_Identity=33.33, Blast_Score=116, Evalue=1e-27
Organism=Caenorhabditis elegans, GI17560486, Length=125, Percent_Identity=35.2, Blast_Score=98, Evalue=5e-21
Organism=Saccharomyces cerevisiae, GI6321706, Length=182, Percent_Identity=31.32, Blast_Score=88, Evalue=1e-18
Organism=Drosophila melanogaster, GI24656782, Length=125, Percent_Identity=35.2, Blast_Score=83, Evalue=1e-16
Organism=Drosophila melanogaster, GI24656796, Length=125, Percent_Identity=35.2, Blast_Score=83, Evalue=2e-16
Organism=Drosophila melanogaster, GI24656791, Length=125, Percent_Identity=35.2, Blast_Score=83, Evalue=2e-16
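
Each homologue line is a comma-separated list of key=value fields plus a bare GI identifier, so it can be turned into a record with a small parser. The sketch below is illustrative only: `parse_homologue` is a hypothetical helper, and treating Length as the aligned length (so that Length times Percent_Identity gives the number of identical positions) is an assumption, not something the record states.

```python
def parse_homologue(line):
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    # Tolerate surrounding whitespace and an optional trailing comma.
    for field in line.strip().rstrip(",").split(", "):
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]
    # Convert the numeric fields.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record

line = ("Organism=Caenorhabditis elegans, GI17560486, Length=125, "
        "Percent_Identity=35.2, Blast_Score=98, Evalue=5e-21")
hit = parse_homologue(line)
# Identical positions, assuming Length is the alignment length.
identical = round(hit["Length"] * hit["Percent_Identity"] / 100)
print(hit["Organism"], hit["GI"], identical)  # Caenorhabditis elegans 17560486 44
```

Sorting the parsed records by `Evalue` puts the Escherichia coli hit (1e-27) first.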

Paralogues:

None

Copy number: 420 molecules/cell in growth phase, minimal media (based on E. coli) [C]

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001474
- InterPro:   IPR020602
- InterPro:   IPR018234 [H]

Pfam domain/function: PF01227 GTP_cyclohydroI [H]

EC number: 3.5.4.16 [H]

Molecular weight: Translated: 28712; Mature: 28581
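
The 131 Da gap between the translated and mature weights is what removal of the N-terminal methionine would produce. A minimal sketch of an average-mass calculation is given below; the residue masses are standard average values, `molecular_weight` is a hypothetical helper, and whether the record used exactly these masses or this rounding is an assumption.

```python
# Average residue masses in Da (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.0153  # one water is added back for the free N- and C-termini

def molecular_weight(protein):
    """Average molecular weight of an unmodified linear peptide, in Da."""
    return sum(RESIDUE_MASS[aa] for aa in protein) + WATER

translated = "MTSIIPASSKG"   # first 11 residues of the translated protein
mature = translated[1:]      # same peptide without the initiator Met
print(round(molecular_weight(translated) - molecular_weight(mature), 2))  # 131.19
```

The difference is independent of the rest of the chain, so the same 131 Da offset applies to the full 255- and 254-residue sequences.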

Theoretical pI: Translated: 6.63; Mature: 6.63

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

2.0 %Cys     (Translated Protein)
3.1 %Met     (Translated Protein)
5.1 %Cys+Met (Translated Protein)
2.0 %Cys     (Mature Protein)
2.8 %Met     (Mature Protein)
4.7 %Cys+Met (Mature Protein)
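
The percentages above follow directly from residue counts in the translated and mature sequences. The sketch below recomputes them; `percent` is a hypothetical helper, and rounding to one decimal place is an assumption about how the record was formatted.

```python
# Translated protein sequence copied from the record (255 residues).
TRANSLATED = (
    "MTSIIPASSKGLGNDKNASPDNGQIANTSKISERIRARFVAGGVSFLANDNISEYIQPGELQELEVEVADRVRELLQSLL"
    "IDIHNDHNTEETAERVARMYLNEVFKGRYQKQPKITSFPNVKQLDEIYTVGPITIRSACSHHFVPIMGNCWIGIKPGTRV"
    "IGLSKFARVADWVFSRPHIQEEAVMILADEIERLCEPQGLGIIVKAEHYCMKLRGVMEPQCTMVNSVVRGVFRHDSSLKQ"
    "EFFELVRQQEAMLAT"
)
MATURE = TRANSLATED[1:]  # initiator Met removed (254 residues)

def percent(protein, residues):
    """Percentage of positions occupied by any of the given residues."""
    count = sum(protein.count(r) for r in residues)
    return round(100 * count / len(protein), 1)

for label, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(label, percent(seq, "C"), percent(seq, "M"), percent(seq, "CM"))
# Translated 2.0 3.1 5.1
# Mature 2.0 2.8 4.7
```

The translated protein contains 5 Cys and 8 Met; the mature protein loses one Met, which accounts for the drop from 3.1 to 2.8 %Met.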

Secondary structure:

>Translated Secondary Structure
MTSIIPASSKGLGNDKNASPDNGQIANTSKISERIRARFVAGGVSFLANDNISEYIQPGE
CCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCHHHHCCCCC
LQELEVEVADRVRELLQSLLIDIHNDHNTEETAERVARMYLNEVFKGRYQKQPKITSFPN
HHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCC
VKQLDEIYTVGPITIRSACSHHFVPIMGNCWIGIKPGTRVIGLSKFARVADWVFSRPHIQ
HHHHHHHHHCCCHHHHHHHCCCCEEEECCEEEEECCCCEEEHHHHHHHHHHHHHCCCCCH
EEAVMILADEIERLCEPQGLGIIVKAEHYCMKLRGVMEPQCTMVNSVVRGVFRHDSSLKQ
HHHHHHHHHHHHHHCCCCCCEEEEECHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHH
EFFELVRQQEAMLAT
HHHHHHHHHHHHCCC
>Mature Secondary Structure 
TSIIPASSKGLGNDKNASPDNGQIANTSKISERIRARFVAGGVSFLANDNISEYIQPGE
CCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCHHHHCCCCC
LQELEVEVADRVRELLQSLLIDIHNDHNTEETAERVARMYLNEVFKGRYQKQPKITSFPN
HHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCC
VKQLDEIYTVGPITIRSACSHHFVPIMGNCWIGIKPGTRVIGLSKFARVADWVFSRPHIQ
HHHHHHHHHCCCHHHHHHHCCCCEEEECCEEEEECCCCEEEHHHHHHHHHHHHHCCCCCH
EEAVMILADEIERLCEPQGLGIIVKAEHYCMKLRGVMEPQCTMVNSVVRGVFRHDSSLKQ
HHHHHHHHHHHHHHCCCCCCEEEEECHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHH
EFFELVRQQEAMLAT
HHHHHHHHHHHHCCC
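
The prediction strings above can be summarised as a list of segments and a per-state composition. This is a sketch under assumptions: the record does not define the letters, so H, E, and C are read here in the conventional way (helix, strand, coil), and `segments` and `composition` are hypothetical helpers demonstrated on a short toy string rather than on the record's data.

```python
from itertools import groupby

def segments(states):
    """Return (state, start, end) runs with 1-based inclusive positions."""
    runs, position = [], 1
    for state, group in groupby(states):
        length = len(list(group))
        runs.append((state, position, position + length - 1))
        position += length
    return runs

def composition(states):
    """Percentage of positions in each state, rounded to one decimal place."""
    return {s: round(100 * states.count(s) / len(states), 1)
            for s in sorted(set(states))}

toy = "CCHHHHEEC"  # toy prediction string, not taken from the record
print(segments(toy))     # [('C', 1, 2), ('H', 3, 6), ('E', 7, 8), ('C', 9, 9)]
print(composition(toy))  # {'C': 33.3, 'E': 22.2, 'H': 44.4}
```

To apply this to the record, join the state lines (the lines made only of C, H, and E) of one block into a single string before calling the functions.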

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: NA