Definition | Escherichia coli HS, complete genome. |
---|---|
Accession | NC_009800 |
Length | 4,643,538 |
The map label for this gene is entC
Identifier: 157160089
GI number: 157160089
Start: 660966
End: 662141
Strand: Direct
Name: entC
Synonym: EcHS_A0644
Alternate gene names: 157160089
Gene position: 660966-662141 (Clockwise)
Preceding gene: 157160087
Following gene: 157160090
Centisome position: 14.23
GC content: 56.46
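The derived fields above can be cross-checked from the coordinates alone. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the centisome position is the start coordinate expressed as a percentage of the genome length. Neither convention is stated in the record, but both reproduce the listed values. The variable names are illustrative.

```python
# Values copied from the record above.
GENOME_LENGTH = 4_643_538
START, END = 660_966, 662_141

# Assumption: coordinates are 1-based and inclusive.
gene_length = END - START + 1

# Assumption: centisome position = start coordinate as a percentage
# of the total genome length, rounded to two decimals.
centisome = round(100 * START / GENOME_LENGTH, 2)

print(gene_length)  # 1176
print(centisome)    # 14.23
```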
Gene sequence:
>1176_bases
ATGGATACGTCACTGGCTGAGGAAGTACAGCAGACCATGGCAACACTTGCGCCCAATCGCTTTTTCTTTATGTCGCCGTA
CCGCAGTTTTACGACGTCAGGATGTTTCGCCCGCTTCGATGAACCGGCTGTGAACGGGGATTCGCCCGACAGTCCCTTCC
AGCAAAAACTCGCCGCGCTGTTTGCCGATGCCAAAGCGCAGGGCATCAAAAATCCGGTGATGGTCGGGGCGATTCCCTTC
GATCCACGTCAGCCTTCGTCGCTGTATATTCCTGAATCCTGGCAGTCGTTCTCCCGTCAGGAAAAACAAGCTTCCGCACG
CCGTTTCACCCGCAGCCAGTCGCTGAATGTGGTGGAACGCCAGGCAATTCCGGAGCAAACCACGTTTGAACAGATGGTTG
CCCGCGCCGCCGCACTTACCGCCACGCCGCAGGTCGACAAAGTGGTGTTGTCACGGTTGATTGATATCACCACTGACGCC
GCCATTGATAGTGGCGTATTGCTGGAACGGTTGATTGCGCAAAACCCGGTTAGTTACAACTTCCATGTTCCGCTGGCTGA
TGGTGGCGTCCTGCTGGGGGCCAGCCCGGAACTGCTGCTACGTAAAGACGGCGAGCGTTTTAGCTCCATTCCGTTAGCCG
GTTCCGCGCGTCGTCAGCCGGATGAAGTGCTCGATCGCGAAGCAGGTAATCGTCTGCTGGCGTCAGAAAAAGATCGCCAT
GAACATGAACTGGTGACTCAGGCGATGAAAGAGGTACTGCGCGAACGCAGTAGTGAGTTACACGTTCCTTCTTCTCCACA
GCTGATCACCACGCCGACGCTGTGGCATCTCGCAACTCCCTTTGAAGGTAAAGCGAATTCGCAAGAAAACGCACTGACTC
TGGCCTGTCTGCTGCATCCGACCCCCGCGCTGAGCGGTTTCCCGCATCAGGCCGCGACCCAGGTTATTGCTGAACTGGAA
CCGTTCGACCGCGAACTGTTTGGCGGCATTGTGGGTTGGTGTGACAGCGAAGGTAACGGCGAATGGGTGGTGACCATCCG
CTGCGCGAAGCTGCGGGAAAATCAGGTGCGTCTGTTTGCCGGAGCGGGGATTGTGCCTGCGTCGTCACCGTTGGGTGAGT
GGCGCGAAACAGGCGTCAAACTTTCTACCATGTTGAACGTTTTTGGATTGCATTAA
Upstream 100 bases:
>100_bases
ACCTCAAGAGTTGACATAGTGCGCGTTTGCTTTTAGGTTAGCGACCGAAAATATAAATGATAATCATTATTAAAGCCTTT
ATCATTTTGTGGAGGATGAT
Downstream 100 bases:
>100_bases
GGAGCGAGGATGAGCATTCCATTCACCCGCTGGCCGGAAGAGTTTGCCCGTCGCTATCGGGAAAAAGGCTACTGGCAGGA
TTTGCCGCTGACCGACATTC
Product: isochorismate synthase
Products: NA
Alternate protein names: Isochorismate mutase
Number of amino acids: Translated: 391; Mature: 391
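The residue count follows from the gene length. A minimal sketch of the arithmetic, assuming the final codon is a stop codon that is not translated (the gene sequence above begins with ATG and ends with TAA); the variable names are illustrative:

```python
GENE_LENGTH = 1176                      # from the ">1176_bases" header
START_CODON, STOP_CODON = "ATG", "TAA"  # first and last three bases of the gene

# A complete coding sequence should be a whole number of codons.
codons, remainder = divmod(GENE_LENGTH, 3)

# Assumption: the terminal TAA stop codon contributes no residue.
residues = codons - 1

print(codons, remainder, residues)  # 392 0 391
```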
Protein sequence:
>391_residues
MDTSLAEEVQQTMATLAPNRFFFMSPYRSFTTSGCFARFDEPAVNGDSPDSPFQQKLAALFADAKAQGIKNPVMVGAIPF
DPRQPSSLYIPESWQSFSRQEKQASARRFTRSQSLNVVERQAIPEQTTFEQMVARAAALTATPQVDKVVLSRLIDITTDA
AIDSGVLLERLIAQNPVSYNFHVPLADGGVLLGASPELLLRKDGERFSSIPLAGSARRQPDEVLDREAGNRLLASEKDRH
EHELVTQAMKEVLRERSSELHVPSSPQLITTPTLWHLATPFEGKANSQENALTLACLLHPTPALSGFPHQAATQVIAELE
PFDRELFGGIVGWCDSEGNGEWVVTIRCAKLRENQVRLFAGAGIVPASSPLGEWRETGVKLSTMLNVFGLH
Sequences:
>Translated_391_residues
MDTSLAEEVQQTMATLAPNRFFFMSPYRSFTTSGCFARFDEPAVNGDSPDSPFQQKLAALFADAKAQGIKNPVMVGAIPF
DPRQPSSLYIPESWQSFSRQEKQASARRFTRSQSLNVVERQAIPEQTTFEQMVARAAALTATPQVDKVVLSRLIDITTDA
AIDSGVLLERLIAQNPVSYNFHVPLADGGVLLGASPELLLRKDGERFSSIPLAGSARRQPDEVLDREAGNRLLASEKDRH
EHELVTQAMKEVLRERSSELHVPSSPQLITTPTLWHLATPFEGKANSQENALTLACLLHPTPALSGFPHQAATQVIAELE
PFDRELFGGIVGWCDSEGNGEWVVTIRCAKLRENQVRLFAGAGIVPASSPLGEWRETGVKLSTMLNVFGLH
>Mature_391_residues
MDTSLAEEVQQTMATLAPNRFFFMSPYRSFTTSGCFARFDEPAVNGDSPDSPFQQKLAALFADAKAQGIKNPVMVGAIPF
DPRQPSSLYIPESWQSFSRQEKQASARRFTRSQSLNVVERQAIPEQTTFEQMVARAAALTATPQVDKVVLSRLIDITTDA
AIDSGVLLERLIAQNPVSYNFHVPLADGGVLLGASPELLLRKDGERFSSIPLAGSARRQPDEVLDREAGNRLLASEKDRH
EHELVTQAMKEVLRERSSELHVPSSPQLITTPTLWHLATPFEGKANSQENALTLACLLHPTPALSGFPHQAATQVIAELE
PFDRELFGGIVGWCDSEGNGEWVVTIRCAKLRENQVRLFAGAGIVPASSPLGEWRETGVKLSTMLNVFGLH
Specific function: Siderophore biosynthesis; enterobactin biosynthesis. [C]
COG id: COG1169
COG function: function code HQ; Isochorismate synthase
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the isochorismate synthase family
Homologues:
Organism=Escherichia coli, GI1786809, Length=391, Percent_Identity=100, Blast_Score=800, Evalue=0.0
Organism=Escherichia coli, GI87082077, Length=276, Percent_Identity=31.5217391304348, Blast_Score=127, Evalue=1e-30
Organism=Escherichia coli, GI1788114, Length=200, Percent_Identity=28, Blast_Score=76, Evalue=4e-15
Organism=Escherichia coli, GI1787518, Length=227, Percent_Identity=29.0748898678414, Blast_Score=64, Evalue=1e-11
Organism=Saccharomyces cerevisiae, GI6320935, Length=212, Percent_Identity=28.3018867924528, Blast_Score=72, Evalue=1e-13
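The homologue entries follow a fixed field order, so they can be parsed into records for sorting or filtering. The sketch below assumes that field order holds for every entry. The `parse_homologues` function and its dictionary keys are hypothetical names, not part of any database tool.

```python
import re

# One homologue entry: Organism, GI number, then four numeric fields.
# Whitespace (including line breaks) between fields is tolerated.
HOMOLOGUE = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*"
    r"Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<identity>[\d.]+),\s*"
    r"Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)

def parse_homologues(text: str) -> list[dict]:
    """Return one dict per homologue entry found in `text`."""
    records = []
    for m in HOMOLOGUE.finditer(text):
        records.append({
            "organism": m["organism"].strip(),
            "gi": int(m["gi"]),
            "length": int(m["length"]),
            "identity": float(m["identity"]),
            "score": int(m["score"]),
            "evalue": float(m["evalue"]),
        })
    return records

# Two entries copied from the record above.
sample = (
    "Organism=Escherichia coli, GI1786809, Length=391, Percent_Identity=100, "
    "Blast_Score=800, Evalue=0.0, "
    "Organism=Saccharomyces cerevisiae, GI6320935, Length=212, "
    "Percent_Identity=28.3018867924528, Blast_Score=72, Evalue=1e-13,"
)
hits = parse_homologues(sample)
print(len(hits), hits[0]["gi"], hits[1]["organism"])
```

Filtering by `identity` or `evalue` on the resulting list is then a one-line comprehension.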
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): ENTC_ECO57 (P0AEJ3)
Other databases:
- EMBL: AE005174
- EMBL: BA000007
- PIR: D85558
- PIR: H90707
- RefSeq: NP_286320.1
- RefSeq: NP_308659.1
- ProteinModelPortal: P0AEJ3
- SMR: P0AEJ3
- EnsemblBacteria: EBESCT00000027757
- EnsemblBacteria: EBESCT00000057215
- GeneID: 916991
- GeneID: 957607
- GenomeReviews: AE005174_GR
- GenomeReviews: BA000007_GR
- KEGG: ece:Z0735
- KEGG: ecs:ECs0632
- GeneTree: EBGT00050000009519
- HOGENOM: HBG405459
- OMA: FASGNLK
- ProtClustDB: PRK15016
- BioCyc: ECOL83334:ECS0632-MONOMER
- InterPro: IPR005801
- InterPro: IPR015890
- InterPro: IPR004561
- Gene3D: G3DSA:3.60.120.10
- PANTHER: PTHR11236:SF3
- PANTHER: PTHR11236
- TIGRFAMs: TIGR00543
Pfam domain/function: PF00425 Chorismate_bind; SSF56322 TRPE_1_chor_bd
EC number: 5.4.4.2
Molecular weight: Translated: 42932; Mature: 42932
Theoretical pI: Translated: 5.46; Mature: 5.46
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.0 %Cys (Translated Protein)
1.8 %Met (Translated Protein)
2.8 %Cys+Met (Translated Protein)
1.0 %Cys (Mature Protein)
1.8 %Met (Mature Protein)
2.8 %Cys+Met (Mature Protein)
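These percentages can be reproduced by counting residues in the protein sequence. A minimal sketch, assuming each percentage is the residue count divided by the sequence length and rounded to one decimal; `residue_percent` is a hypothetical helper name. The translated and mature sequences are identical in this record, so one calculation covers both.

```python
# Protein sequence copied from the record above (391 residues).
PROTEIN = (
    "MDTSLAEEVQQTMATLAPNRFFFMSPYRSFTTSGCFARFDEPAVNGDSPDSPFQQKLAALFADAKAQGIKNPVMVGAIPF"
    "DPRQPSSLYIPESWQSFSRQEKQASARRFTRSQSLNVVERQAIPEQTTFEQMVARAAALTATPQVDKVVLSRLIDITTDA"
    "AIDSGVLLERLIAQNPVSYNFHVPLADGGVLLGASPELLLRKDGERFSSIPLAGSARRQPDEVLDREAGNRLLASEKDRH"
    "EHELVTQAMKEVLRERSSELHVPSSPQLITTPTLWHLATPFEGKANSQENALTLACLLHPTPALSGFPHQAATQVIAELE"
    "PFDRELFGGIVGWCDSEGNGEWVVTIRCAKLRENQVRLFAGAGIVPASSPLGEWRETGVKLSTMLNVFGLH"
)

def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions in `sequence` holding any letter in `residues`."""
    hits = sum(sequence.count(r) for r in residues)
    return round(100 * hits / len(sequence), 1)

print(residue_percent(PROTEIN, "C"))   # 1.0
print(residue_percent(PROTEIN, "M"))   # 1.8
print(residue_percent(PROTEIN, "CM"))  # 2.8
```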
Secondary structure:
>Translated Secondary Structure
MDTSLAEEVQQTMATLAPNRFFFMSPYRSFTTSGCFARFDEPAVNGDSPDSPFQQKLAAL
CCCHHHHHHHHHHHHHCCCCEEEECCHHHHHCCCCHHHCCCCCCCCCCCCCHHHHHHHHH
FADAKAQGIKNPVMVGAIPFDPRQPSSLYIPESWQSFSRQEKQASARRFTRSQSLNVVER
HHHHHHHCCCCCEEEEEECCCCCCCCCEECCHHHHHHHHHHHHHHHHHHHHHCCCCHHHH
QAIPEQTTFEQMVARAAALTATPQVDKVVLSRLIDITTDAAIDSGVLLERLIAQNPVSYN
HCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHCCCCCEE
FHVPLADGGVLLGASPELLLRKDGERFSSIPLAGSARRQPDEVLDREAGNRLLASEKDRH
EEEEECCCCEEECCCCCEEEECCCCHHHCCCCCCCCCCCCHHHHHHHHCCCCCCCCCHHH
EHELVTQAMKEVLRERSSELHVPSSPQLITTPTLWHLATPFEGKANSQENALTLACLLHP
HHHHHHHHHHHHHHHHHCCCCCCCCCCEEECCCHHEECCCCCCCCCCCCCEEEEEEEECC
TPALSGFPHQAATQVIAELEPFDRELFGGIVGWCDSEGNGEWVVTIRCAKLRENQVRLFA
CCCCCCCCHHHHHHHHHHCCHHHHHHHHHHHEEECCCCCCCEEEEEEEECCCCCCEEEEE
GAGIVPASSPLGEWRETGVKLSTMLNVFGLH
CCCCCCCCCCHHHHHHCCCHHHHHHHHHCCC
>Mature Secondary Structure
MDTSLAEEVQQTMATLAPNRFFFMSPYRSFTTSGCFARFDEPAVNGDSPDSPFQQKLAAL
CCCHHHHHHHHHHHHHCCCCEEEECCHHHHHCCCCHHHCCCCCCCCCCCCCHHHHHHHHH
FADAKAQGIKNPVMVGAIPFDPRQPSSLYIPESWQSFSRQEKQASARRFTRSQSLNVVER
HHHHHHHCCCCCEEEEEECCCCCCCCCEECCHHHHHHHHHHHHHHHHHHHHHCCCCHHHH
QAIPEQTTFEQMVARAAALTATPQVDKVVLSRLIDITTDAAIDSGVLLERLIAQNPVSYN
HCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHCCCCCEE
FHVPLADGGVLLGASPELLLRKDGERFSSIPLAGSARRQPDEVLDREAGNRLLASEKDRH
EEEEECCCCEEECCCCCEEEECCCCHHHCCCCCCCCCCCCHHHHHHHHCCCCCCCCCHHH
EHELVTQAMKEVLRERSSELHVPSSPQLITTPTLWHLATPFEGKANSQENALTLACLLHP
HHHHHHHHHHHHHHHHHCCCCCCCCCCEEECCCHHEECCCCCCCCCCCCCEEEEEEEECC
TPALSGFPHQAATQVIAELEPFDRELFGGIVGWCDSEGNGEWVVTIRCAKLRENQVRLFA
CCCCCCCCHHHHHHHHHHCCHHHHHHHHHHHEEECCCCCCCEEEEEEEECCCCCCEEEEE
GAGIVPASSPLGEWRETGVKLSTMLNVFGLH
CCCCCCCCCCHHHHHHCCCHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796