Definition Escherichia coli HS, complete genome.
Accession NC_009800
Length 4,643,538

The map label for this gene is hisF [H]

Identifier: 157161518

GI number: 157161518

Start: 2141699

End: 2142475

Strand: Direct

Name: hisF [H]

Synonym: EcHS_A2164

Alternate gene names: 157161518

Gene position: 2141699-2142475 (Clockwise)

Preceding gene: 157161517

Following gene: 157161519

Centisome position: 46.12
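
The coordinate fields above are related by simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the centisome position is the gene start expressed as a percentage of the genome length; the record does not state either definition, but both assumptions reproduce its values. The function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 4_643_538  # Length
GENE_START = 2_141_699     # Start
GENE_END = 2_142_475       # End


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


length = gene_length(GENE_START, GENE_END)
codons = length // 3

print(length)      # 777
print(codons - 1)  # 258 residues; the final codon is the stop
print(round(centisome(GENE_START, GENOME_LENGTH), 2))  # 46.12
```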

GC content: 51.48
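
The GC content is presumably the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows; `gc_content` is a hypothetical helper, not part of any database tooling. The reported 51.48 is consistent with 400 G or C bases out of 777.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


print(gc_content("ATGC"))            # 50.0
print(round(100.0 * 400 / 777, 2))   # 51.48, matching the record
```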

Gene sequence:

>777_bases
ATGCTGGCAAAACGCATAATCCCATGTCTCGACGTTCGTGATGGTCAGGTGGTGAAAGGCGTACAGTTTCGCAACCATGA
AATCATTGGCGATATTGTACCGCTGGCAAAACGCTACGCTGAAGAAGGTGCAGACGAACTGGTGTTCTACGATATCACCG
CTTCCAGCGATGGCCGTGTGGTAGATAAAAGCTGGGTATCTCGCGTGGCGGAAGTGATCGACATTCCGTTCTGTGTGGCG
GGTGGGATTAAGTCTCTGGAAGATGCCGCGAAAATTCTTTCCTTTGGCGCGGATAAAATTTCCATCAACTCTCCTGCGCT
GGCGGACCCGACGTTAATTACTCGCCTGGCGGATCGCTTTGGCGTGCAGTGTATTGTGGTCGGTATTGATACCTGGTACG
ACGCTGAAACCGGTAAATATCATGTGAATCAATATACCGGCGATGAAAGCCGCACCCGCGTCACTCAGTGGGAAACGCTC
GACTGGGTAGAGGAAGTGCAAAAACGCGGTGCCGGAGAAATCGTCCTCAACATGATGAATCAGGACGGCGTGCGTAACGG
TTACGACCTAGAACAACTGAAAAAAGTGCATGAAGTTTGCCACGTCCCGCTGATTGCCTCCGGTGGCGCGGGCACCATGG
AACACTTCCTCGAAGCCTTCCGCGATGCCGACGTTGACGGTGCGCTGGCAGCTTCTGTATTCCACAAACAAATAATCAAT
ATTAGTGAATTAAAAGCGTACCTGGCAACACAGGGCGTGGAGATCAGGATATGTTAA
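
The protein sequence in this record can be recovered from the gene sequence with the standard genetic code. The sketch below builds the codon table from the usual TCAG ordering and translates two in-frame excerpts of the gene (the first 60 bases and the last 24 bases); `translate`, `GENE_HEAD`, and `GENE_TAIL` are names introduced here for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Excerpts copied from the gene sequence above.
GENE_HEAD = "ATGCTGGCAAAACGCATAATCCCATGTCTCGACGTTCGTGATGGTCAGGTGGTGAAAGGC"
GENE_TAIL = "GGCGTGGAGATCAGGATATGTTAA"

print(translate(GENE_HEAD))  # MLAKRIIPCLDVRDGQVVKG
print(translate(GENE_TAIL))  # GVEIRIC
```

The 777 bases form 259 codons; the last one (TAA) is the stop, which leaves 258 residues.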

Upstream 100 bases:

>100_bases
ACATTGATGATGTCGCGGCCCTGCGTGGTACTGGCGTGCGCGGCGTAATAGTTGGTCGTGCATTACTGGAAGGTAAATTC
ACCGTGAAGGAGGCCATCGC

Downstream 100 bases:

>100_bases
CAGAACAACAACGTCACGAACTGGACTGGGAAAAAACCGACGGACTGATGCCGGTGATTGTACAACACGCGGTATCTGGC
GAAGTGTTAATGCTGGGCTA

Product: imidazole glycerol phosphate synthase subunit HisF

Products: NA

Alternate protein names: IGP synthase cyclase subunit; IGP synthase subunit hisF; ImGP synthase subunit hisF; IGPS subunit hisF [H]

Number of amino acids: Translated: 258; Mature: 258

Protein sequence:

>258_residues
MLAKRIIPCLDVRDGQVVKGVQFRNHEIIGDIVPLAKRYAEEGADELVFYDITASSDGRVVDKSWVSRVAEVIDIPFCVA
GGIKSLEDAAKILSFGADKISINSPALADPTLITRLADRFGVQCIVVGIDTWYDAETGKYHVNQYTGDESRTRVTQWETL
DWVEEVQKRGAGEIVLNMMNQDGVRNGYDLEQLKKVHEVCHVPLIASGGAGTMEHFLEAFRDADVDGALAASVFHKQIIN
ISELKAYLATQGVEIRIC

Sequences:

>Translated_258_residues
MLAKRIIPCLDVRDGQVVKGVQFRNHEIIGDIVPLAKRYAEEGADELVFYDITASSDGRVVDKSWVSRVAEVIDIPFCVA
GGIKSLEDAAKILSFGADKISINSPALADPTLITRLADRFGVQCIVVGIDTWYDAETGKYHVNQYTGDESRTRVTQWETL
DWVEEVQKRGAGEIVLNMMNQDGVRNGYDLEQLKKVHEVCHVPLIASGGAGTMEHFLEAFRDADVDGALAASVFHKQIIN
ISELKAYLATQGVEIRIC
>Mature_258_residues
MLAKRIIPCLDVRDGQVVKGVQFRNHEIIGDIVPLAKRYAEEGADELVFYDITASSDGRVVDKSWVSRVAEVIDIPFCVA
GGIKSLEDAAKILSFGADKISINSPALADPTLITRLADRFGVQCIVVGIDTWYDAETGKYHVNQYTGDESRTRVTQWETL
DWVEEVQKRGAGEIVLNMMNQDGVRNGYDLEQLKKVHEVCHVPLIASGGAGTMEHFLEAFRDADVDGALAASVFHKQIIN
ISELKAYLATQGVEIRIC

Specific function: IGPS catalyzes the conversion of PRFAR and glutamine to IGP, AICAR and glutamate. The hisF subunit catalyzes the cyclization reaction that produces IGP and AICAR from PRFAR, using the ammonia provided by the hisH subunit [H]

COG id: COG0107

COG function: function code E; Imidazoleglycerol-phosphate synthase

Gene ontology:

Cell location: Cytoplasm [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the hisA/hisF family [H]

Homologues:

Organism=Escherichia coli, GI1788336, Length=258, Percent_Identity=98.8372093023256, Blast_Score=523, Evalue=1e-150,
Organism=Escherichia coli, GI87082028, Length=245, Percent_Identity=26.1224489795918, Blast_Score=68, Evalue=7e-13,
Organism=Saccharomyces cerevisiae, GI6319725, Length=318, Percent_Identity=31.7610062893082, Blast_Score=133, Evalue=3e-32,
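
The homologue lines use a comma-separated `key=value` layout, with the GI number given as a bare `GI…` token. A minimal sketch of parsing them into typed records follows; `parse_homologue` is a hypothetical helper, and the field names are taken from the lines themselves. Multiplying the percent identity by the alignment length recovers the number of identical residues.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'key=value, key=value,' homologue line into a dict."""
    record = {}
    for field in line.strip().rstrip(",").split(", "):
        key, sep, value = field.partition("=")
        if not sep and key.startswith("GI"):
            record["GI"] = key[2:]  # bare token such as 'GI1788336'
        else:
            record[key.strip()] = value.strip()
    # Convert the numeric fields to numbers.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


LINES = [
    "Organism=Escherichia coli, GI1788336, Length=258, Percent_Identity=98.8372093023256, Blast_Score=523, Evalue=1e-150,",
    "Organism=Escherichia coli, GI87082028, Length=245, Percent_Identity=26.1224489795918, Blast_Score=68, Evalue=7e-13,",
    "Organism=Saccharomyces cerevisiae, GI6319725, Length=318, Percent_Identity=31.7610062893082, Blast_Score=133, Evalue=3e-32,",
]

hits = [parse_homologue(line) for line in LINES]
identical = [round(h["Percent_Identity"] / 100 * h["Length"]) for h in hits]
best = max(hits, key=lambda h: h["Blast_Score"])

print(identical)                      # [255, 64, 101]
print(best["Organism"], best["GI"])   # Escherichia coli 1788336
```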

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR013785
- InterPro:   IPR006062
- InterPro:   IPR004651
- InterPro:   IPR011060 [H]

Pfam domain/function: PF00977 His_biosynth [H]

EC number: 4.1.3.-

Molecular weight: Translated: 28467; Mature: 28467

Theoretical pI: Translated: 4.74; Mature: 4.74

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.9 %Cys     (Translated Protein)
1.6 %Met     (Translated Protein)
3.5 %Cys+Met (Translated Protein)
1.9 %Cys     (Mature Protein)
1.6 %Met     (Mature Protein)
3.5 %Cys+Met (Mature Protein)
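
The percentages above follow directly from residue counts in the 258-residue protein. The sketch below recomputes them from the protein sequence given in this record; `residue_percent` is a hypothetical helper.

```python
# Protein sequence copied from the record above (258 residues).
PROTEIN = (
    "MLAKRIIPCLDVRDGQVVKGVQFRNHEIIGDIVPLAKRYAEEGADELVFYDITASSDGRVVDKSWVSRVAEVIDIPFCVA"
    "GGIKSLEDAAKILSFGADKISINSPALADPTLITRLADRFGVQCIVVGIDTWYDAETGKYHVNQYTGDESRTRVTQWETL"
    "DWVEEVQKRGAGEIVLNMMNQDGVRNGYDLEQLKKVHEVCHVPLIASGGAGTMEHFLEAFRDADVDGALAASVFHKQIIN"
    "ISELKAYLATQGVEIRIC"
)


def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    hits = sum(1 for aa in seq if aa in residues)
    return 100.0 * hits / len(seq)


print(len(PROTEIN))                             # 258
print(round(residue_percent(PROTEIN, "C"), 1))  # 1.9  (5 Cys)
print(round(residue_percent(PROTEIN, "M"), 1))  # 1.6  (4 Met)
print(round(residue_percent(PROTEIN, "CM"), 1)) # 3.5
```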

Secondary structure:

>Translated Secondary Structure
MLAKRIIPCLDVRDGQVVKGVQFRNHEIIGDIVPLAKRYAEEGADELVFYDITASSDGRV
CCHHHCCCCCCCCCCCEEECEEECCCHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCE
VDKSWVSRVAEVIDIPFCVAGGIKSLEDAAKILSFGADKISINSPALADPTLITRLADRF
ECHHHHHHHHHHHCCCHHHHCCHHHHHHHHHHHHCCCCEEECCCCCCCCHHHHHHHHHHH
GVQCIVVGIDTWYDAETGKYHVNQYTGDESRTRVTQWETLDWVEEVQKRGAGEIVLNMMN
CCEEEEEEECCCCCCCCCCEEEEEECCCCHHHEEHHHHHHHHHHHHHHCCCCCEEHHHHC
QDGVRNGYDLEQLKKVHEVCHVPLIASGGAGTMEHFLEAFRDADVDGALAASVFHKQIIN
CCCCCCCCCHHHHHHHHHHHCCCEEECCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHC
ISELKAYLATQGVEIRIC
HHHHHHHHHHCCCEEEEC
>Mature Secondary Structure
MLAKRIIPCLDVRDGQVVKGVQFRNHEIIGDIVPLAKRYAEEGADELVFYDITASSDGRV
CCHHHCCCCCCCCCCCEEECEEECCCHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCE
VDKSWVSRVAEVIDIPFCVAGGIKSLEDAAKILSFGADKISINSPALADPTLITRLADRF
ECHHHHHHHHHHHCCCHHHHCCHHHHHHHHHHHHCCCCEEECCCCCCCCHHHHHHHHHHH
GVQCIVVGIDTWYDAETGKYHVNQYTGDESRTRVTQWETLDWVEEVQKRGAGEIVLNMMN
CCEEEEEEECCCCCCCCCCEEEEEECCCCHHHEEHHHHHHHHHHHHHHCCCCCEEHHHHC
QDGVRNGYDLEQLKKVHEVCHVPLIASGGAGTMEHFLEAFRDADVDGALAASVFHKQIIN
CCCCCCCCCHHHHHHHHHHHCCCEEECCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHC
ISELKAYLATQGVEIRIC
HHHHHHHHHHCCCEEEEC
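
The secondary structure blocks interleave sequence lines with per-residue state lines. Assuming the usual three-state convention (H for helix, E for strand, C for coil), which the record does not spell out, the composition of a state string can be summarized as in the sketch below. `ss_composition` is a hypothetical helper, and the example input is the first state line from the block above.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of each secondary-structure state in a state string."""
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {state: 100.0 * n / len(states) for state, n in counts.items()}


# First state line of the predicted structure, copied from the record.
FIRST_LINE = "CCHHHCCCCCCCCCCCEEECEEECCCHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCE"

for state, percent in sorted(ss_composition(FIRST_LINE).items()):
    print(state, round(percent, 1))
```

Concatenating all five state lines before calling the function would give the composition for the whole protein.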

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: Lyases; Carbon-Nitrogen Lyases; Amidine-Lyases [C]

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: NA