Definition: Escherichia coli 55989, complete genome.
Accession: NC_011748
Length: 5,154,862 bp

The map label for this gene is hisF

Identifier: 218695651

GI number: 218695651

Start: 2326871

End: 2327647

Strand: Direct

Name: hisF

Synonym: EC55989_2284

Alternate gene names: 218695651

Gene position: 2326871-2327647 (Clockwise)

Preceding gene: 218695650

Following gene: 218695652

Centisome position: 45.14
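
The coordinates above are enough to reproduce the sequence length and the centisome position. The following is a minimal sketch, assuming inclusive 1-based coordinates and assuming that the centisome value is simply the start coordinate expressed as a percentage of the genome length (the record does not state its formula; the function names are illustrative):

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with inclusive 1-based coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


length = gene_length(2326871, 2327647)
codons = length // 3                      # count includes the terminal stop codon
amino_acids = codons - 1
position = round(centisome(2326871, 5154862), 2)
print(length, amino_acids, position)
```

Under these assumptions the results agree with the record: 777 bases, 258 amino acids, and a centisome position of 45.14.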

GC content: 51.74%
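
The GC content can be recomputed from any nucleotide sequence. The sketch below assumes the field is the plain percentage of G and C bases over the whole gene; `gc_content` is an illustrative helper, not part of the record's tooling:

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (case-insensitive)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return 100.0 * gc / len(seq)


# First 15 bases of the gene sequence in this record.
print(round(gc_content("ATGCTGGCAAAACGT"), 2))
```

If that assumption holds, a value of 51.74 over 777 bases corresponds to 402 G or C bases.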

Gene sequence:

>777_bases
ATGCTGGCAAAACGTATAATTCCGTGTCTCGACGTTCGTGATGGTCAGGTGGTGAAAGGCGTACAGTTTCGCAACCATGA
AATCATTGGCGATATCGTGCCGCTGGCAAAACGCTACGCTGAAGAAGGTGCAGACGAACTGGTGTTCTACGATATCACCG
CTTCCAGCGATGGCCGTGTGGTAGATAAAAGCTGGGTATCTCGCGTGGCGGAAGTGATCGACATTCCGTTTTGTGTGGCG
GGTGGGATTAAATCTCTGGAAGATGCCGCGAAAATTCTTTCCTTTGGCGCGGATAAAATTTCCATAAACTCCCCTGCGCT
GGCGGACCCGACGTTAATTACTCGCCTGGCCGATCGCTTTGGCGTGCAGTGTATTGTAGTCGGTATTGATACCTGGTACG
ACGGCGAAACCGGTAAATATCATGTGAATCAATATACCGGCGATGAAAGCCGCACCCGCGTCACTCAGTGGGAAACACTC
GACTGGGTACAGGAAGTGCAAAAACGCGGTGCCGGAGAAATCGTCCTCAACATGATGAATCAGGACGGCGTGCGTAACGG
TTACGACCTCGAACAACTGAAAAAAGTGCGTGAAGTTTGCCACGTCCCGCTAATTGCCTCCGGTGGCGCGGGCACCATGG
AACACTTCCTCGAAGCCTTCCGCGATGCCGACGTTGACGGCGCGCTGGCTGCTTCCGTATTCCACAAACAAATAATCAAT
ATTGGTGAATTAAAAGCGTACCTGGCAACACAGGGCGTGGAGATCAGGATATGTTAA

Upstream 100 bases:

>100_bases
ACATTGATGATGTGGCGGCCCTGCGTGGTACTGGCGTGCGCGGCGTAATAGTTGGTCGGGCATTACTGGAAGGTAAATTC
ACCGTGAAGGAGGCCATCGC

Downstream 100 bases:

>100_bases
CAGAACAACAACGTCGCGAACTGGACTGGGAAAAAACCGACGGACTGATGCCGGTGATTGTGCAACACGCGGTATCCGGC
GAAGTGTTAATGCTGGGCTA

Product: imidazole glycerol phosphate synthase subunit HisF

Products: NA

Alternate protein names: IGP synthase cyclase subunit; IGP synthase subunit hisF; ImGP synthase subunit hisF; IGPS subunit hisF [H]

Number of amino acids: Translated: 258; Mature: 258

Protein sequence:

>258_residues
MLAKRIIPCLDVRDGQVVKGVQFRNHEIIGDIVPLAKRYAEEGADELVFYDITASSDGRVVDKSWVSRVAEVIDIPFCVA
GGIKSLEDAAKILSFGADKISINSPALADPTLITRLADRFGVQCIVVGIDTWYDGETGKYHVNQYTGDESRTRVTQWETL
DWVQEVQKRGAGEIVLNMMNQDGVRNGYDLEQLKKVREVCHVPLIASGGAGTMEHFLEAFRDADVDGALAASVFHKQIIN
IGELKAYLATQGVEIRIC
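
The protein sequence follows from the gene sequence by codon-wise translation. This is a minimal sketch, assuming the standard codon assignments (NCBI translation table 1, whose assignments the bacterial table 11 shares) and an in-frame input; the names are illustrative:

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Last line of the gene sequence above (bases 721-777, which start in frame).
tail = "ATTGGTGAATTAAAAGCGTACCTGGCAACACAGGGCGTGGAGATCAGGATATGTTAA"
print(translate(tail))  # IGELKAYLATQGVEIRIC
```

The translated tail matches the final 18 residues of the protein sequence, and the TAA stop codon accounts for the difference between 259 codons and 258 amino acids.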

Sequences:

>Translated_258_residues
MLAKRIIPCLDVRDGQVVKGVQFRNHEIIGDIVPLAKRYAEEGADELVFYDITASSDGRVVDKSWVSRVAEVIDIPFCVA
GGIKSLEDAAKILSFGADKISINSPALADPTLITRLADRFGVQCIVVGIDTWYDGETGKYHVNQYTGDESRTRVTQWETL
DWVQEVQKRGAGEIVLNMMNQDGVRNGYDLEQLKKVREVCHVPLIASGGAGTMEHFLEAFRDADVDGALAASVFHKQIIN
IGELKAYLATQGVEIRIC
>Mature_258_residues
MLAKRIIPCLDVRDGQVVKGVQFRNHEIIGDIVPLAKRYAEEGADELVFYDITASSDGRVVDKSWVSRVAEVIDIPFCVA
GGIKSLEDAAKILSFGADKISINSPALADPTLITRLADRFGVQCIVVGIDTWYDGETGKYHVNQYTGDESRTRVTQWETL
DWVQEVQKRGAGEIVLNMMNQDGVRNGYDLEQLKKVREVCHVPLIASGGAGTMEHFLEAFRDADVDGALAASVFHKQIIN
IGELKAYLATQGVEIRIC

Specific function: IGPS catalyzes the conversion of PRFAR and glutamine to IGP, AICAR and glutamate. The hisF subunit catalyzes the cyclization reaction that produces IGP and AICAR from PRFAR, using the ammonia provided by the hisH subunit [H]

COG id: COG0107

COG function: function code E; Imidazoleglycerol-phosphate synthase

Gene ontology:

Cell location: Cytoplasm [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the hisA/hisF family [H]

Homologues:

Organism=Escherichia coli, GI1788336, Length=258, Percent_Identity=99.6124031007752, Blast_Score=527, Evalue=1e-151
Organism=Escherichia coli, GI87082028, Length=231, Percent_Identity=25.974025974026, Blast_Score=66, Evalue=2e-12
Organism=Saccharomyces cerevisiae, GI6319725, Length=318, Percent_Identity=31.7610062893082, Blast_Score=134, Evalue=1e-32
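
The homologue lines use a comma-separated `key=value` layout in which the GI field carries no `=`. A minimal parsing sketch, assuming that layout holds for every line; `parse_homologue` is a hypothetical helper, not part of any published tool:

```python
def parse_homologue(line: str) -> dict:
    """Split one 'key=value, ...' homologue line into a dict with typed values."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:
            continue                      # tolerate a trailing comma
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]      # the GI field has no '=' in this record
    # Convert the numeric fields; everything else stays a string.
    for key, cast in (("Length", int), ("Blast_Score", int),
                      ("Percent_Identity", float), ("Evalue", float)):
        if key in record:
            record[key] = cast(record[key])
    return record


hit = parse_homologue(
    "Organism=Saccharomyces cerevisiae, GI6319725, Length=318, "
    "Percent_Identity=31.7610062893082, Blast_Score=134, Evalue=1e-32,"
)
print(hit["Organism"], hit["GI"], hit["Evalue"])
```

Keeping the GI as a string avoids treating an identifier as a quantity, while the numeric fields can be filtered directly, for example by E-value.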

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR013785
- InterPro:   IPR006062
- InterPro:   IPR004651
- InterPro:   IPR011060 [H]

Pfam domain/function: PF00977 His_biosynth [H]

EC number: 4.1.3.-

Molecular weight: Translated: 28441; Mature: 28441

Theoretical pI: Translated: 4.81; Mature: 4.81

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.9 %Cys     (Translated Protein)
1.6 %Met     (Translated Protein)
3.5 %Cys+Met (Translated Protein)
1.9 %Cys     (Mature Protein)
1.6 %Met     (Mature Protein)
3.5 %Cys+Met (Mature Protein)
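
The Cys/Met percentages can be checked directly against the protein sequence. A minimal sketch, assuming each value is the residue count divided by the total length and rounded to one decimal place; `residue_percent` is an illustrative name:

```python
# Protein sequence copied from this record (258 residues).
PROTEIN = (
    "MLAKRIIPCLDVRDGQVVKGVQFRNHEIIGDIVPLAKRYAEEGADELVFYDITASSDGRVVDKSWVSRVAEVIDIPFCVA"
    "GGIKSLEDAAKILSFGADKISINSPALADPTLITRLADRFGVQCIVVGIDTWYDGETGKYHVNQYTGDESRTRVTQWETL"
    "DWVQEVQKRGAGEIVLNMMNQDGVRNGYDLEQLKKVREVCHVPLIASGGAGTMEHFLEAFRDADVDGALAASVFHKQIIN"
    "IGELKAYLATQGVEIRIC"
)


def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    return 100.0 * sum(seq.count(r) for r in residues) / len(seq)


cys = round(residue_percent(PROTEIN, "C"), 1)
met = round(residue_percent(PROTEIN, "M"), 1)
both = round(residue_percent(PROTEIN, "CM"), 1)
print(cys, met, both)  # 1.9 1.6 3.5
```

The sequence contains 5 cysteines and 4 methionines in 258 residues, which reproduces the listed values.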

Secondary structure:

>Translated Secondary Structure
MLAKRIIPCLDVRDGQVVKGVQFRNHEIIGDIVPLAKRYAEEGADELVFYDITASSDGRV
CCHHHCCCCCCCCCCCEEECEEECCCHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCE
VDKSWVSRVAEVIDIPFCVAGGIKSLEDAAKILSFGADKISINSPALADPTLITRLADRF
ECHHHHHHHHHHHCCCHHHHCCHHHHHHHHHHHHCCCCEEECCCCCCCCHHHHHHHHHHH
GVQCIVVGIDTWYDGETGKYHVNQYTGDESRTRVTQWETLDWVQEVQKRGAGEIVLNMMN
CCEEEEEEEECCCCCCCCCEEEEEECCCCHHHEHHHHHHHHHHHHHHHCCCCCEEHHHHC
QDGVRNGYDLEQLKKVREVCHVPLIASGGAGTMEHFLEAFRDADVDGALAASVFHKQIIN
CCCCCCCCCHHHHHHHHHHHCCCEEECCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH
IGELKAYLATQGVEIRIC
HHHHHHHHHCCCEEEEEC
>Mature Secondary Structure
MLAKRIIPCLDVRDGQVVKGVQFRNHEIIGDIVPLAKRYAEEGADELVFYDITASSDGRV
CCHHHCCCCCCCCCCCEEECEEECCCHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCE
VDKSWVSRVAEVIDIPFCVAGGIKSLEDAAKILSFGADKISINSPALADPTLITRLADRF
ECHHHHHHHHHHHCCCHHHHCCHHHHHHHHHHHHCCCCEEECCCCCCCCHHHHHHHHHHH
GVQCIVVGIDTWYDGETGKYHVNQYTGDESRTRVTQWETLDWVQEVQKRGAGEIVLNMMN
CCEEEEEEEECCCCCCCCCEEEEEECCCCHHHEHHHHHHHHHHHHHHHCCCCCEEHHHHC
QDGVRNGYDLEQLKKVREVCHVPLIASGGAGTMEHFLEAFRDADVDGALAASVFHKQIIN
CCCCCCCCCHHHHHHHHHHHCCCEEECCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH
IGELKAYLATQGVEIRIC
HHHHHHHHHCCCEEEEEC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: Lyases; Carbon-Nitrogen Lyases; Amidine-Lyases [C]

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: NA