Definition | Escherichia coli HS, complete genome. |
---|---|
Accession | NC_009800 |
Length | 4,643,538 |
The map label for this gene is hisF [H]
Identifier: 157161518
GI number: 157161518
Start: 2141699
End: 2142475
Strand: Direct
Name: hisF [H]
Synonym: EcHS_A2164
Alternate gene names: 157161518
Gene position: 2141699-2142475 (Clockwise)
Preceding gene: 157161517
Following gene: 157161519
Centisome position: 46.12
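The coordinate fields above can be cross-checked against each other. The following is a minimal sketch, assuming that the centisome position is the gene start expressed as a percentage of the genome length (the record does not state the formula, and the function names are illustrative only):

```python
GENOME_LENGTH = 4_643_538          # "Length" field of NC_009800
START, END = 2_141_699, 2_142_475  # "Start" and "End" fields (1-based, inclusive)

def gene_length(start: int, end: int) -> int:
    """Length in bases of a 1-based, inclusive interval."""
    return end - start + 1

def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length

print(gene_length(START, END))                    # 777
print(round(centisome(START, GENOME_LENGTH), 2))  # 46.12
```

A length of 777 bases corresponds to 259 codons, that is, 258 amino acids plus one stop codon.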
GC content: 51.48
Gene sequence:
>777_bases
ATGCTGGCAAAACGCATAATCCCATGTCTCGACGTTCGTGATGGTCAGGTGGTGAAAGGCGTACAGTTTCGCAACCATGA
AATCATTGGCGATATTGTACCGCTGGCAAAACGCTACGCTGAAGAAGGTGCAGACGAACTGGTGTTCTACGATATCACCG
CTTCCAGCGATGGCCGTGTGGTAGATAAAAGCTGGGTATCTCGCGTGGCGGAAGTGATCGACATTCCGTTCTGTGTGGCG
GGTGGGATTAAGTCTCTGGAAGATGCCGCGAAAATTCTTTCCTTTGGCGCGGATAAAATTTCCATCAACTCTCCTGCGCT
GGCGGACCCGACGTTAATTACTCGCCTGGCGGATCGCTTTGGCGTGCAGTGTATTGTGGTCGGTATTGATACCTGGTACG
ACGCTGAAACCGGTAAATATCATGTGAATCAATATACCGGCGATGAAAGCCGCACCCGCGTCACTCAGTGGGAAACGCTC
GACTGGGTAGAGGAAGTGCAAAAACGCGGTGCCGGAGAAATCGTCCTCAACATGATGAATCAGGACGGCGTGCGTAACGG
TTACGACCTAGAACAACTGAAAAAAGTGCATGAAGTTTGCCACGTCCCGCTGATTGCCTCCGGTGGCGCGGGCACCATGG
AACACTTCCTCGAAGCCTTCCGCGATGCCGACGTTGACGGTGCGCTGGCAGCTTCTGTATTCCACAAACAAATAATCAAT
ATTAGTGAATTAAAAGCGTACCTGGCAACACAGGGCGTGGAGATCAGGATATGTTAA
Upstream 100 bases:
>100_bases
ACATTGATGATGTCGCGGCCCTGCGTGGTACTGGCGTGCGCGGCGTAATAGTTGGTCGTGCATTACTGGAAGGTAAATTC
ACCGTGAAGGAGGCCATCGC
Downstream 100 bases:
>100_bases
CAGAACAACAACGTCACGAACTGGACTGGGAAAAAACCGACGGACTGATGCCGGTGATTGTACAACACGCGGTATCTGGC
GAAGTGTTAATGCTGGGCTA
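The GC content field can be recomputed from any of the nucleotide sequences above. This is a minimal sketch, assuming GC content is simply the percentage of G and C among all bases; the record does not say how its value was derived, and `gc_content` is an illustrative name:

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    # Remove spaces and line breaks left over from wrapped sequence text.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(bases.count(b) for b in "GC")
    return 100.0 * gc / len(bases)

# First 12 bases of the gene sequence, with a wrap-style space in the middle.
print(round(gc_content("ATGCTG GCAAAA"), 2))  # 41.67
```

Passing the full 777-base gene sequence to the same function should give a value comparable to the GC content field.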
Product: imidazole glycerol phosphate synthase subunit HisF
Products: NA
Alternate protein names: IGP synthase cyclase subunit; IGP synthase subunit hisF; ImGP synthase subunit hisF; IGPS subunit hisF [H]
Number of amino acids: Translated: 258; Mature: 258
Protein sequence:
>258_residues
MLAKRIIPCLDVRDGQVVKGVQFRNHEIIGDIVPLAKRYAEEGADELVFYDITASSDGRVVDKSWVSRVAEVIDIPFCVA
GGIKSLEDAAKILSFGADKISINSPALADPTLITRLADRFGVQCIVVGIDTWYDAETGKYHVNQYTGDESRTRVTQWETL
DWVEEVQKRGAGEIVLNMMNQDGVRNGYDLEQLKKVHEVCHVPLIASGGAGTMEHFLEAFRDADVDGALAASVFHKQIIN
ISELKAYLATQGVEIRIC
Sequences:
>Translated_258_residues
MLAKRIIPCLDVRDGQVVKGVQFRNHEIIGDIVPLAKRYAEEGADELVFYDITASSDGRVVDKSWVSRVAEVIDIPFCVA
GGIKSLEDAAKILSFGADKISINSPALADPTLITRLADRFGVQCIVVGIDTWYDAETGKYHVNQYTGDESRTRVTQWETL
DWVEEVQKRGAGEIVLNMMNQDGVRNGYDLEQLKKVHEVCHVPLIASGGAGTMEHFLEAFRDADVDGALAASVFHKQIIN
ISELKAYLATQGVEIRIC
>Mature_258_residues
MLAKRIIPCLDVRDGQVVKGVQFRNHEIIGDIVPLAKRYAEEGADELVFYDITASSDGRVVDKSWVSRVAEVIDIPFCVA
GGIKSLEDAAKILSFGADKISINSPALADPTLITRLADRFGVQCIVVGIDTWYDAETGKYHVNQYTGDESRTRVTQWETL
DWVEEVQKRGAGEIVLNMMNQDGVRNGYDLEQLKKVHEVCHVPLIASGGAGTMEHFLEAFRDADVDGALAASVFHKQIIN
ISELKAYLATQGVEIRIC
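The translated protein can be reproduced from the gene sequence. The sketch below assumes the standard genetic code (NCBI translation table 1) and a reading frame starting at the first base; `translate` is an illustrative helper, not part of any tool used to build this record:

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # tolerate wrapped sequence text
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGCTGGCAAAACGC"))     # MLAKR  (first five codons of the gene)
print(translate("GAGATCAGGATATGTTAA"))  # EIRIC  (last six codons, ending in TAA)
```

The two fragments match the start (MLAKR) and end (EIRIC) of the 258-residue protein listed above.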
Specific function: IGPS catalyzes the conversion of PRFAR and glutamine to IGP, AICAR and glutamate. The hisF subunit catalyzes the cyclization activity that produces IGP and AICAR from PRFAR using the ammonia provided by the hisH subunit [H]
COG id: COG0107
COG function: function code E; Imidazoleglycerol-phosphate synthase
Gene ontology:
Cell location: Cytoplasm [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the hisA/hisF family [H]
Homologues:
Organism=Escherichia coli, GI1788336, Length=258, Percent_Identity=98.8372093023256, Blast_Score=523, Evalue=1e-150
Organism=Escherichia coli, GI87082028, Length=245, Percent_Identity=26.1224489795918, Blast_Score=68, Evalue=7e-13
Organism=Saccharomyces cerevisiae, GI6319725, Length=318, Percent_Identity=31.7610062893082, Blast_Score=133, Evalue=3e-32
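The homologue entries follow a regular `key=value` pattern, so they can be parsed into structured records. This is a minimal sketch; the `Hit` type and `parse_hits` function are hypothetical names, and the regular expression assumes every entry carries the same six fields in the same order as the entries above:

```python
import re
from typing import NamedTuple

class Hit(NamedTuple):
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float

# One match per homologue entry; works whether entries share a line or not.
HIT_RE = re.compile(
    r"Organism=([^,]+), GI(\d+), Length=(\d+), "
    r"Percent_Identity=([\d.]+), Blast_Score=(\d+), Evalue=([\d.eE+-]+)"
)

def parse_hits(text: str) -> list[Hit]:
    """Extract all homologue entries found in the given text."""
    return [
        Hit(org.strip(), gi, int(length), float(pid), int(score), float(evalue))
        for org, gi, length, pid, score, evalue in HIT_RE.findall(text)
    ]

line = (
    "Organism=Escherichia coli, GI1788336, Length=258, "
    "Percent_Identity=98.8372093023256, Blast_Score=523, Evalue=1e-150, "
    "Organism=Saccharomyces cerevisiae, GI6319725, Length=318, "
    "Percent_Identity=31.7610062893082, Blast_Score=133, Evalue=3e-32,"
)
for hit in parse_hits(line):
    print(hit.organism, hit.gi, round(hit.percent_identity, 1))
```

Keeping the GI as a string avoids treating an identifier as a number; the numeric fields are converted so that hits can be sorted or filtered by score or E-value.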
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR013785
- InterPro: IPR006062
- InterPro: IPR004651
- InterPro: IPR011060 [H]
Pfam domain/function: PF00977 His_biosynth [H]
EC number: 4.1.3.-
Molecular weight: Translated: 28467; Mature: 28467
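A molecular weight of this kind is usually estimated by summing residue masses and adding one water for the free termini. The sketch below assumes standard average residue masses; the record does not state which mass table or rounding it used, so a recomputed value may differ slightly from the one listed. `molecular_weight` is an illustrative name:

```python
# Average residue masses in daltons (free amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # added once for the N-terminal H and C-terminal OH

def molecular_weight(protein: str) -> float:
    """Approximate average molecular weight of an unmodified peptide chain."""
    seq = "".join(protein.split()).upper()
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER

# First four residues of the protein sequence in this record.
print(round(molecular_weight("MLAK"), 2))  # 461.62
```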
Theoretical pI: Translated: 4.74; Mature: 4.74
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.9 %Cys (Translated Protein)
1.6 %Met (Translated Protein)
3.5 %Cys+Met (Translated Protein)
1.9 %Cys (Mature Protein)
1.6 %Met (Mature Protein)
3.5 %Cys+Met (Mature Protein)
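These percentages are residue counts divided by the protein length. A minimal sketch, with `residue_percent` as an illustrative helper name:

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    hits = sum(seq.count(r) for r in residues)
    return 100.0 * hits / len(seq)

# First 20 residues of the protein in this record: one Met and one Cys.
fragment = "MLAKRIIPCLDVRDGQVVKG"
print(residue_percent(fragment, "C"))   # 5.0
print(residue_percent(fragment, "M"))   # 5.0
print(residue_percent(fragment, "CM"))  # 10.0
```

For the full 258-residue sequence, 5 Cys and 4 Met residues give 1.9 %, 1.6 % and 3.5 % after rounding to one decimal place, in line with the values listed.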
Secondary structure:
>Translated Secondary Structure
MLAKRIIPCLDVRDGQVVKGVQFRNHEIIGDIVPLAKRYAEEGADELVFYDITASSDGRV
CCHHHCCCCCCCCCCCEEECEEECCCHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCE
VDKSWVSRVAEVIDIPFCVAGGIKSLEDAAKILSFGADKISINSPALADPTLITRLADRF
ECHHHHHHHHHHHCCCHHHHCCHHHHHHHHHHHHCCCCEEECCCCCCCCHHHHHHHHHHH
GVQCIVVGIDTWYDAETGKYHVNQYTGDESRTRVTQWETLDWVEEVQKRGAGEIVLNMMN
CCEEEEEEECCCCCCCCCCEEEEEECCCCHHHEEHHHHHHHHHHHHHHCCCCCEEHHHHC
QDGVRNGYDLEQLKKVHEVCHVPLIASGGAGTMEHFLEAFRDADVDGALAASVFHKQIIN
CCCCCCCCCHHHHHHHHHHHCCCEEECCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHC
ISELKAYLATQGVEIRIC
HHHHHHHHHHCCCEEEEC
>Mature Secondary Structure
MLAKRIIPCLDVRDGQVVKGVQFRNHEIIGDIVPLAKRYAEEGADELVFYDITASSDGRV
CCHHHCCCCCCCCCCCEEECEEECCCHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCE
VDKSWVSRVAEVIDIPFCVAGGIKSLEDAAKILSFGADKISINSPALADPTLITRLADRF
ECHHHHHHHHHHHCCCHHHHCCHHHHHHHHHHHHCCCCEEECCCCCCCCHHHHHHHHHHH
GVQCIVVGIDTWYDAETGKYHVNQYTGDESRTRVTQWETLDWVEEVQKRGAGEIVLNMMN
CCEEEEEEECCCCCCCCCCEEEEEECCCCHHHEEHHHHHHHHHHHHHHCCCCCEEHHHHC
QDGVRNGYDLEQLKKVHEVCHVPLIASGGAGTMEHFLEAFRDADVDGALAASVFHKQIIN
CCCCCCCCCHHHHHHHHHHHCCCEEECCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHC
ISELKAYLATQGVEIRIC
HHHHHHHHHHCCCEEEEC
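The secondary structure string pairs one state letter with each residue, so its composition can be summarised by counting letters. The sketch below assumes the common convention H = helix, E = strand, C = coil, which the record itself does not spell out; `ss_composition` is an illustrative name:

```python
from collections import Counter

def ss_composition(ss: str) -> dict[str, float]:
    """Percentage of each state letter in a secondary structure string."""
    counts = Counter("".join(ss.split()).upper())
    total = sum(counts.values())
    if total == 0:
        raise ValueError("empty structure string")
    return {state: round(100.0 * n / total, 1) for state, n in sorted(counts.items())}

# State string for the last 18 residues (ISELKAYLATQGVEIRIC).
print(ss_composition("HHHHHHHHHHCCCEEEEC"))
# {'C': 22.2, 'E': 22.2, 'H': 55.6}
```

Applied to the whole state string, the same function gives the overall helix, strand and coil fractions, which can be compared with the "Alpha Beta" structure class.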
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: Lyases; Carbon-Nitrogen Lyases; Amidine-Lyases [C]
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: NA