| Definition | Moorella thermoacetica ATCC 39073, complete genome. |
|---|---|
| Accession | NC_007644 |
| Length | 2,628,784 |
The map label for this gene is aroC
Identifier: 83590400
GI number: 83590400
Start: 1600074
End: 1601225
Strand: Reverse
Name: aroC
Synonym: Moth_1557
Alternate gene names: 83590400
Gene position: 1601225-1600074 (Counterclockwise)
Preceding gene: 83590401
Following gene: 83590399
Centisome position: 60.91
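The coordinate fields above can be cross-checked with simple arithmetic. The sketch below assumes inclusive 1-based coordinates and assumes that the centisome position is the gene's first listed position (1601225, the 5' end on the reverse strand) expressed as a percentage of the genome length. That convention is inferred from the numbers in this record, not documented here, and the function names are illustrative.

```python
# Values copied from this record.
GENOME_LENGTH = 2_628_784
START, END = 1_600_074, 1_601_225


def gene_length(start: int, end: int) -> int:
    """Length of a feature given inclusive 1-based coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, rounded to 2 decimals.

    Assumption: the record uses the gene's 5' end (END for a reverse-strand
    gene) as the reference position.
    """
    return round(100.0 * position / genome_length, 2)


print(gene_length(START, END))         # 1152
print(centisome(END, GENOME_LENGTH))   # 60.91
```

Using the Start coordinate instead gives a slightly different value, which is why the 5' end is assumed here.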
GC content: 67.45
Gene sequence:
>1152_bases ATGCTGCGCTATCTGACTGCCGGGGAATCCCACGGCCGGGGCCTGAGCGTCATTGTCGAAGGGCTGCCGGCCGGGGTGCC CCTGACAGATGGAGACATAAATACCTGGCTGACCCGCCGCCAGGGGGGATATGGCCGCGGCGGCCGCATGGCCATTGAAC GGGATCAGGCCGAGATCCTGGCCGGGGTGCGCGGTGGCCTGACCCTGGGCAGTCCTATAGCCCTCTTCATCGCCAACCGG GACTGGGAGAACTGGCAGGAGATCATGGCCCCGGGACCGGAAGCCAGGGCCGCCCGGGTAGTAACCCGACCGCGGCCGGG ACATGCCGACCTGGCGGGAGGATTGAAATACCACCAGGCGGACCTGCGCAATATCCTGGAGCGGGCCAGCGCCCGGGAAA CGGCGGCCAGGGTGGCCGCCGGGGCGGTGGCGGCAGTGCTGCTCAAAGAATTAGCCATTGAACTGGCCTTCCACGTCGTG CGGATCGGGCCCGTAGAGGTCCGGGAGCAGGTGGATTGGGAGGCCGCCTGCCGGGCCGTCGAGTCACCGGTCTACTGCGC TGACCCGGAAGCGGGCCGGGCCATGGTGGCGGCCATTGAAGAAGCACGACAGCAGGGAGATACCCTGGGAGGGGTAGTCG AGGTCCTGGCCCGGGGCGTTCCTGCCGGCCTGGGCAGTCATGTCCACTGGGACCGGCGCCTGGACGGCCGCCTGGCCCAG GCGCTCATGAGCATCCCGGCCATCAAAGGAGTTGAGATCGGCGCCGGCTTCAGGGTGGCTGCCCTGCCGGGGAGCCGGGC CCACGACGCCATTGCCTACCGCAAGGGGCAGGGCTTCTATCATCCTACCAACCGGGCCGGCGGCCTGGAAGGCGGCCTGA CCAATGGCGAAACCCTCGTCCTGCGAGCAGCCATGAAACCCATTCCCACCCTAATGCACCCACTGCCCAGCGTCGATTTG GTCACCAAACAACCGGCGACAGCCAGCATCGAGCGTTCCGATGTCTGCGCCGTTCCGGCGGCGGCGGTGGTAGCCGCGGC AGCGGTGGCCTGGGTCCTGGCCGGGGCTATACTGGAACAATTCGGGGGCGACTATTTACCGGTCATCCAGGAACGCCTGG CTGCCTACCGGCAGTACCTGCAGGAAATTTGA
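The GC content field is the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows, shown on a short toy sequence rather than the full gene; `gc_content` is an illustrative helper, and the record's own rounding convention is assumed to be two decimals.

```python
def gc_content(seq: str) -> float:
    """Percentage of G/C bases in a nucleotide sequence.

    Whitespace is stripped first, since the record stores the sequence
    in space-separated chunks.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return round(100.0 * gc / len(cleaned), 2)


# Toy input in the same chunked layout as the record.
print(gc_content("GGCC AT\nGC"))  # 75.0
```

Pasting the 1152-base sequence from this record into `gc_content` is one way to check the reported 67.45.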
Upstream 100 bases:
>100_bases CCCGGTAGCAGTCATGGACAGGGTCCTCCGGGAGGCCATGGGGGCGAGTTCAGGCGGGCCTGCTGCCGGCCGGTGAAAGC TTGAGAAAGGATGGATACAT
Downstream 100 bases:
>100_bases GACTGAATTCAATTGAATTGACATTTCTTGCTAACAGAAGGGGTGAGGAAGGATGCCGGGGAACATCGTCCTGATCGGTT TTATGGGCAGCGGTAAGACG
Product: chorismate synthase
Products: NA
Alternate protein names: 5-enolpyruvylshikimate-3-phosphate phospholyase
Number of amino acids: Translated: 383; Mature: 383
Protein sequence:
>383_residues MLRYLTAGESHGRGLSVIVEGLPAGVPLTDGDINTWLTRRQGGYGRGGRMAIERDQAEILAGVRGGLTLGSPIALFIANR DWENWQEIMAPGPEARAARVVTRPRPGHADLAGGLKYHQADLRNILERASARETAARVAAGAVAAVLLKELAIELAFHVV RIGPVEVREQVDWEAACRAVESPVYCADPEAGRAMVAAIEEARQQGDTLGGVVEVLARGVPAGLGSHVHWDRRLDGRLAQ ALMSIPAIKGVEIGAGFRVAALPGSRAHDAIAYRKGQGFYHPTNRAGGLEGGLTNGETLVLRAAMKPIPTLMHPLPSVDL VTKQPATASIERSDVCAVPAAAVVAAAAVAWVLAGAILEQFGGDYLPVIQERLAAYRQYLQEI
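The protein length follows from the gene length: 1152 bases make 384 codons, and the last codon (TGA) is a stop, leaving 383 residues. A small sketch of that check, with an illustrative function name:

```python
def expected_residues(n_bases: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by a coding sequence of n_bases.

    Assumes the sequence is in frame from the start codon and, when
    has_stop is True, that the final codon is a stop codon.
    """
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = n_bases // 3
    return codons - 1 if has_stop else codons


print(expected_residues(1152))  # 383
```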
Sequences:
>Translated_383_residues MLRYLTAGESHGRGLSVIVEGLPAGVPLTDGDINTWLTRRQGGYGRGGRMAIERDQAEILAGVRGGLTLGSPIALFIANR DWENWQEIMAPGPEARAARVVTRPRPGHADLAGGLKYHQADLRNILERASARETAARVAAGAVAAVLLKELAIELAFHVV RIGPVEVREQVDWEAACRAVESPVYCADPEAGRAMVAAIEEARQQGDTLGGVVEVLARGVPAGLGSHVHWDRRLDGRLAQ ALMSIPAIKGVEIGAGFRVAALPGSRAHDAIAYRKGQGFYHPTNRAGGLEGGLTNGETLVLRAAMKPIPTLMHPLPSVDL VTKQPATASIERSDVCAVPAAAVVAAAAVAWVLAGAILEQFGGDYLPVIQERLAAYRQYLQEI
>Mature_383_residues MLRYLTAGESHGRGLSVIVEGLPAGVPLTDGDINTWLTRRQGGYGRGGRMAIERDQAEILAGVRGGLTLGSPIALFIANR DWENWQEIMAPGPEARAARVVTRPRPGHADLAGGLKYHQADLRNILERASARETAARVAAGAVAAVLLKELAIELAFHVV RIGPVEVREQVDWEAACRAVESPVYCADPEAGRAMVAAIEEARQQGDTLGGVVEVLARGVPAGLGSHVHWDRRLDGRLAQ ALMSIPAIKGVEIGAGFRVAALPGSRAHDAIAYRKGQGFYHPTNRAGGLEGGLTNGETLVLRAAMKPIPTLMHPLPSVDL VTKQPATASIERSDVCAVPAAAVVAAAAVAWVLAGAILEQFGGDYLPVIQERLAAYRQYLQEI
Specific function: Aromatic amino acids biosynthesis; shikimate pathway; seventh step. [C]
COG id: COG0082
COG function: function code E; Chorismate synthase
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the chorismate synthase family
Homologues:
Organism=Escherichia coli, GI1788669, Length=363, Percent_Identity=38.2920110192837, Blast_Score=184, Evalue=7e-48
Organism=Saccharomyces cerevisiae, GI6321290, Length=377, Percent_Identity=34.7480106100796, Blast_Score=191, Evalue=1e-49
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): AROC_MOOTA (Q2RI73)
Other databases:
- EMBL: CP000232
- RefSeq: YP_430409.1
- HSSP: P63611
- ProteinModelPortal: Q2RI73
- SMR: Q2RI73
- STRING: Q2RI73
- GeneID: 3832190
- GenomeReviews: CP000232_GR
- KEGG: mta:Moth_1557
- NMPDR: fig|264732.9.peg.1561
- eggNOG: COG0082
- HOGENOM: HBG292336
- OMA: GSEAHDE
- ProtClustDB: PRK05382
- BioCyc: MTHE264732:MOTH_1557-MONOMER
- HAMAP: MF_00300_B
- InterPro: IPR000453
- InterPro: IPR020541
- PANTHER: PTHR21085
- PIRSF: PIRSF001456
- TIGRFAMs: TIGR00033
Pfam domain/function: PF01264 Chorismate_synt; SSF103263 Chorismate_synth
EC number: 4.2.3.5
Molecular weight: Translated: 40608; Mature: 40608
Theoretical pI: Translated: 7.30; Mature: 7.30
Prosite motif: PS00787 CHORISMATE_SYNTHASE_1; PS00788 CHORISMATE_SYNTHASE_2; PS00789 CHORISMATE_SYNTHASE_3
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.8 %Cys (Translated Protein)
1.8 %Met (Translated Protein)
2.6 %Cys+Met (Translated Protein)
0.8 %Cys (Mature Protein)
1.8 %Met (Mature Protein)
2.6 %Cys+Met (Mature Protein)
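These percentages are residue counts divided by the protein length. With 383 residues, the reported 0.8 %, 1.8 % and 2.6 % correspond to 3 Cys, 7 Met and 10 Cys+Met. The sketch below shows the calculation on a toy sequence; `residue_percent` is an illustrative helper, and rounding to one decimal is assumed from the values shown.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the given one-letter residues in a protein sequence."""
    cleaned = "".join(protein.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    count = sum(cleaned.count(r) for r in residues.upper())
    return round(100.0 * count / len(cleaned), 1)


# Toy 10-residue sequence: one Met, one Cys, eight Ala.
toy = "MC" + "A" * 8
print(residue_percent(toy, "C"))    # 10.0
print(residue_percent(toy, "CM"))   # 20.0

# The same arithmetic with the counts implied by this record.
print(round(100.0 * 3 / 383, 1))    # 0.8
print(round(100.0 * 7 / 383, 1))    # 1.8
print(round(100.0 * 10 / 383, 1))   # 2.6
```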
Secondary structure:
>Translated Secondary Structure
MLRYLTAGESHGRGLSVIVEGLPAGVPLTDGDINTWLTRRQGGYGRGGRMAIERDQAEIL
CCEEEECCCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHCCCCCCCCCCEEEEHHHHHHH
AGVRGGLTLGSPIALFIANRDWENWQEIMAPGPEARAARVVTRPRPGHADLAGGLKYHQA
HHHCCCCCCCCCEEEEEECCCHHHHHHHHCCCCCHHHHHEEECCCCCCCHHHCCHHHHHH
DLRNILERASARETAARVAAGAVAAVLLKELAIELAFHVVRIGPVEVREQVDWEAACRAV
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHCCHHHHHHHH
ESPVYCADPEAGRAMVAAIEEARQQGDTLGGVVEVLARGVPAGLGSHVHWDRRLDGRLAQ
CCCCEECCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHH
ALMSIPAIKGVEIGAGFRVAALPGSRAHDAIAYRKGQGFYHPTNRAGGLEGGLTNGETLV
HHHHCCCCCCEEECCCEEEEECCCCCCHHHHHEECCCCCCCCCCCCCCCCCCCCCCCEEE
LRAAMKPIPTLMHPLPSVDLVTKQPATASIERSDVCAVPAAAVVAAAAVAWVLAGAILEQ
EHHHHHHHHHHHCCCCCCCEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH
FGGDYLPVIQERLAAYRQYLQEI
HCCCCHHHHHHHHHHHHHHHHHC
>Mature Secondary Structure
MLRYLTAGESHGRGLSVIVEGLPAGVPLTDGDINTWLTRRQGGYGRGGRMAIERDQAEIL
CCEEEECCCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHCCCCCCCCCCEEEEHHHHHHH
AGVRGGLTLGSPIALFIANRDWENWQEIMAPGPEARAARVVTRPRPGHADLAGGLKYHQA
HHHCCCCCCCCCEEEEEECCCHHHHHHHHCCCCCHHHHHEEECCCCCCCHHHCCHHHHHH
DLRNILERASARETAARVAAGAVAAVLLKELAIELAFHVVRIGPVEVREQVDWEAACRAV
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHCCHHHHHHHH
ESPVYCADPEAGRAMVAAIEEARQQGDTLGGVVEVLARGVPAGLGSHVHWDRRLDGRLAQ
CCCCEECCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHH
ALMSIPAIKGVEIGAGFRVAALPGSRAHDAIAYRKGQGFYHPTNRAGGLEGGLTNGETLV
HHHHCCCCCCEEECCCEEEEECCCCCCHHHHHEECCCCCCCCCCCCCCCCCCCCCCCEEE
LRAAMKPIPTLMHPLPSVDLVTKQPATASIERSDVCAVPAAAVVAAAAVAWVLAGAILEQ
EHHHHHHHHHHHCCCCCCCEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH
FGGDYLPVIQERLAAYRQYLQEI
HCCCCHHHHHHHHHHHHHHHHHC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA