Definition: Akkermansia muciniphila ATCC BAA-835, complete genome.
Accession: NC_010655
Length: 2,664,102 bp

The map label for this gene is aroC

Identifier: 187736380

GI number: 187736380

Start: 2303821

End: 2304906

Strand: Direct

Name: aroC

Synonym: Amuc_1897

Alternate gene names: 187736380

Gene position: 2303821-2304906 (Clockwise)

Preceding gene: 187736379

Following gene: 187736381

Centisome position: 86.48
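
The coordinates above are consistent with simple arithmetic: the gene spans End - Start + 1 bases, and the centisome position appears to be the start coordinate expressed as a percentage of the genome length. A minimal Python sketch, assuming that definition of centisome position (the function and constant names are illustrative and not part of the record):

```python
GENOME_LENGTH = 2_664_102          # Length field of NC_010655
START, END = 2_303_821, 2_304_906  # Start and End fields for aroC


def gene_length(start: int, end: int) -> int:
    """Number of bases in an inclusive, 1-based coordinate range."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of genome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


print(gene_length(START, END))          # 1086
print(centisome(START, GENOME_LENGTH))  # 86.48
```

The 1086 bases correspond to 362 codons, that is, 361 amino acids plus the stop codon, which agrees with the translated length given in the record.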

GC content: 62.71%

Gene sequence:

>1086_bases
ATGTCCAGCAGTTTTGGTCAGGTGTTCAGAATTTCTACCTGGGGTGAATCCCATGGGACTGGGGTAGGCGTGGTGATTGA
TGGTTGCCCGTCCCTCGTCCCGGTGACGGAAGAAGACATTCAGCGGGAGCTGGACCGGCGCAGGCCGGGGCAGAGCGACA
TCGTAACCCCCCGCAGGGAGGAAGACCGCGCGGAAATCCTTTCCGGAGTGCTGGACGGCAAAACCCTGGGAACGCCTATC
GCCATCAGTGTCCGGAACAAGGACCACCGCTCTTCCGCCTATGACGAGATGGCCAGAACGTACCGGCCCTCCCACGCGGA
CTATACATACGACGCTAAATACGGCATTCGCGCCTGGGCGGGCGGGGGCCGGGCCTCCGCACGGGAAACCATCGGCCGCG
TCGCAGCCGGAGCGGTGGCCAGGGCCGTGCTGAAGCAGGCTTTCCCCGATATGGAGGTCGTGGCCTGGGTGGATCAGGTT
CACCATGTGAAAGCTTCCGTGGACTGGGGAGCCGTGACGGCCTCTGCCATTGAGAGCAACATCGTCCGTACGGCGGACCC
CTCCGCTGCGGAAGCCATGATCGCTGCCATCAAGGAAGCTCGTGACTCCGGAAACTCCTTGGGCGGCGTGGTCAAATGCG
TGGTGCGCGGCTGCCCTCCCGGACTGGGTGATCCGGTTTTTGACAAGCTGGACGCTACGCTTGCCCACGCCATGATGAGC
ATTCCCGCCACCAAGGCTTTCGCCGTGGGTTCCGGTTTTGAAGCGGCGGACATGACCGGCTTGGAACATAATGACCCTTT
TTACATGCAGGGCTGCCGGGTGCGTACTACCACCAACCACTCCGGCGGTATTCAGGGCGGCATCTCCAACGGAGAGGACA
TTCTGATGCGCATCGGCTTCAAGCCTACGGCCACCTTGATGATTGACCAGCAGACGGTCAACAGGGACGGGGAGGATGCC
CGGCTCAAGGGCAGGGGACGGCATGATGCCTGCGTACTGCCGCGCGCCGTGCCCATTGTGGAGGCCATGGCCTGGCTCTG
CCTGTGCGACCACTACCTGCGCCAACGCTGCCAGAGGGCTCTGTAA
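
The GC content field of this record is presumably the percentage of G and C bases in the gene sequence above. A minimal Python sketch of that calculation, assuming FASTA-style input with an optional ">" header line and wrapped sequence lines (the function name `gc_percent` and the toy record are illustrative only):

```python
def gc_percent(fasta_text: str) -> float:
    """Percentage of G and C among the bases of a FASTA-style record."""
    lines = [ln.strip() for ln in fasta_text.splitlines()]
    # Drop header lines (starting with ">") and join the wrapped sequence lines.
    seq = "".join(ln for ln in lines if ln and not ln.startswith(">")).upper()
    if not seq:
        raise ValueError("no sequence data found")
    gc = sum(1 for base in seq if base in "GC")
    return round(100.0 * gc / len(seq), 2)


# Toy input, not the aroC sequence: 6 of the 8 bases are G or C.
example = ">toy_record\nATGC\nGGCC\n"
print(gc_percent(example))  # 75.0
```

Pasting the ">1086_bases" block into `gc_percent` should give a value comparable to the one reported in the record, provided the database used the same definition.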

Upstream 100 bases:

>100_bases
CTTTCAGCACCCCCGTCCTTTGAGGACAGGGGTGTTTTTTATGTCCCGGTGCGGATGCACACGGCTTGACCGGGTGAACG
CCTCCGTCTAGAATACCGCC

Downstream 100 bases:

>100_bases
CAACCGGTTTCACGGCTTCTTTTCCCCTCATTCATGGATTCCTCCCTCACATTTTACCTGGTTCTGGGCGCATTGCTGCT
GGGTTTCATTGTACTCGGCA

Product: chorismate synthase

Products: NA

Alternate protein names: 5-enolpyruvylshikimate-3-phosphate phospholyase

Number of amino acids: Translated: 361; Mature: 360

Protein sequence:

>361_residues
MSSSFGQVFRISTWGESHGTGVGVVIDGCPSLVPVTEEDIQRELDRRRPGQSDIVTPRREEDRAEILSGVLDGKTLGTPI
AISVRNKDHRSSAYDEMARTYRPSHADYTYDAKYGIRAWAGGGRASARETIGRVAAGAVARAVLKQAFPDMEVVAWVDQV
HHVKASVDWGAVTASAIESNIVRTADPSAAEAMIAAIKEARDSGNSLGGVVKCVVRGCPPGLGDPVFDKLDATLAHAMMS
IPATKAFAVGSGFEAADMTGLEHNDPFYMQGCRVRTTTNHSGGIQGGISNGEDILMRIGFKPTATLMIDQQTVNRDGEDA
RLKGRGRHDACVLPRAVPIVEAMAWLCLCDHYLRQRCQRAL
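
The protein sequence above can be related back to the gene sequence by codon-by-codon translation. The sketch below is a hedged illustration, not the method used by the database: it builds the standard codon table from the usual TCAG-ordered amino acid string and translates until the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical, and alternative start codons are not handled.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated with bases in T, C, A, G order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # remove line wrapping and whitespace
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGTCCAGCAGTTTTGGTCAGGTGTTCAGA"))  # MSSSFGQVFR
print(translate("GCTCTGTAA"))                       # AL
```

The two fragments reproduce the first ten residues and the last two residues of the 361-residue protein listed in the record.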

Sequences:

>Translated_361_residues
MSSSFGQVFRISTWGESHGTGVGVVIDGCPSLVPVTEEDIQRELDRRRPGQSDIVTPRREEDRAEILSGVLDGKTLGTPI
AISVRNKDHRSSAYDEMARTYRPSHADYTYDAKYGIRAWAGGGRASARETIGRVAAGAVARAVLKQAFPDMEVVAWVDQV
HHVKASVDWGAVTASAIESNIVRTADPSAAEAMIAAIKEARDSGNSLGGVVKCVVRGCPPGLGDPVFDKLDATLAHAMMS
IPATKAFAVGSGFEAADMTGLEHNDPFYMQGCRVRTTTNHSGGIQGGISNGEDILMRIGFKPTATLMIDQQTVNRDGEDA
RLKGRGRHDACVLPRAVPIVEAMAWLCLCDHYLRQRCQRAL
>Mature_360_residues
SSSFGQVFRISTWGESHGTGVGVVIDGCPSLVPVTEEDIQRELDRRRPGQSDIVTPRREEDRAEILSGVLDGKTLGTPIA
ISVRNKDHRSSAYDEMARTYRPSHADYTYDAKYGIRAWAGGGRASARETIGRVAAGAVARAVLKQAFPDMEVVAWVDQVH
HVKASVDWGAVTASAIESNIVRTADPSAAEAMIAAIKEARDSGNSLGGVVKCVVRGCPPGLGDPVFDKLDATLAHAMMSI
PATKAFAVGSGFEAADMTGLEHNDPFYMQGCRVRTTTNHSGGIQGGISNGEDILMRIGFKPTATLMIDQQTVNRDGEDAR
LKGRGRHDACVLPRAVPIVEAMAWLCLCDHYLRQRCQRAL

Specific function: Aromatic amino acid biosynthesis; shikimate pathway; seventh step. [C]

COG id: COG0082

COG function: function code E; Chorismate synthase

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the chorismate synthase family

Homologues:

Organism=Escherichia coli, GI1788669, Length=359, Percent_Identity=44.5682451253482, Blast_Score=298, Evalue=3e-82
Organism=Saccharomyces cerevisiae, GI6321290, Length=368, Percent_Identity=51.0869565217391, Blast_Score=375, Evalue=1e-105
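
Each homologue entry is a comma-separated list of key=value fields, with the GI number given as a bare "GI" token. A small Python sketch for reading one such line into a dictionary follows; the function name `parse_homologue` and the choice of numeric types are assumptions made for illustration.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=...' homologue line."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:
            continue  # tolerate a trailing comma
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare GI token, e.g. GI1788669
    # Convert the numeric fields where present.
    for key in ("Length", "Blast_Score"):
        if key in record:
            record[key] = int(record[key])
    for key in ("Percent_Identity", "Evalue"):
        if key in record:
            record[key] = float(record[key])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1788669, Length=359, "
    "Percent_Identity=44.5682451253482, Blast_Score=298, Evalue=3e-82,"
)
print(hit["Organism"], round(hit["Percent_Identity"], 2))  # Escherichia coli 44.57
```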

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): AROC_AKKM8 (B2UNJ5)

Other databases:

- EMBL:   CP001071
- RefSeq:   YP_001878492.1
- ProteinModelPortal:   B2UNJ5
- SMR:   B2UNJ5
- GeneID:   6273687
- GenomeReviews:   CP001071_GR
- KEGG:   amu:Amuc_1897
- HOGENOM:   HBG292336
- OMA:   TATIGKE
- ProtClustDB:   PRK05382
- HAMAP:   MF_00300_B
- InterPro:   IPR000453
- InterPro:   IPR020541
- PANTHER:   PTHR21085
- PIRSF:   PIRSF001456
- TIGRFAMs:   TIGR00033

Pfam domain/function: PF01264 Chorismate_synt; SSF103263 Chorismate_synth

EC number: 4.2.3.5

Molecular weight (Da): Translated: 38860; Mature: 38729

Theoretical pI: Translated: 7.04; Mature: 7.04

Prosite motif: PS00787 CHORISMATE_SYNTHASE_1; PS00788 CHORISMATE_SYNTHASE_2; PS00789 CHORISMATE_SYNTHASE_3

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

2.2 %Cys     (Translated Protein)
3.0 %Met     (Translated Protein)
5.3 %Cys+Met (Translated Protein)
2.2 %Cys     (Mature Protein)
2.8 %Met     (Mature Protein)
5.0 %Cys+Met (Mature Protein)
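
The percentages above follow from counting C and M residues in the translated and mature sequences. A minimal Python sketch, assuming the mature protein is the translated protein with only the initiator Met removed (as the two sequence listings in this record suggest); the function name `cys_met_percent` is illustrative:

```python
# Translated protein sequence as listed in this record (361 residues).
TRANSLATED = (
    "MSSSFGQVFRISTWGESHGTGVGVVIDGCPSLVPVTEEDIQRELDRRRPGQSDIVTPRREEDRAEILSGVLDGKTLGTPI"
    "AISVRNKDHRSSAYDEMARTYRPSHADYTYDAKYGIRAWAGGGRASARETIGRVAAGAVARAVLKQAFPDMEVVAWVDQV"
    "HHVKASVDWGAVTASAIESNIVRTADPSAAEAMIAAIKEARDSGNSLGGVVKCVVRGCPPGLGDPVFDKLDATLAHAMMS"
    "IPATKAFAVGSGFEAADMTGLEHNDPFYMQGCRVRTTTNHSGGIQGGISNGEDILMRIGFKPTATLMIDQQTVNRDGEDA"
    "RLKGRGRHDACVLPRAVPIVEAMAWLCLCDHYLRQRCQRAL"
)
MATURE = TRANSLATED[1:]  # assumption: only the N-terminal Met is cleaved


def cys_met_percent(seq: str) -> tuple:
    """Return (%Cys, %Met, %Cys+Met), each rounded to one decimal place."""
    cys = seq.count("C")
    met = seq.count("M")
    return tuple(round(100.0 * n / len(seq), 1) for n in (cys, met, cys + met))


print(cys_met_percent(TRANSLATED))  # (2.2, 3.0, 5.3)
print(cys_met_percent(MATURE))      # (2.2, 2.8, 5.0)
```

The translated sequence contains 8 Cys and 11 Met residues out of 361; removing the initiator Met leaves 10 Met out of 360, which accounts for the lower Met and Cys+Met values for the mature protein.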

Secondary structure:

>Translated Secondary Structure
MSSSFGQVFRISTWGESHGTGVGVVIDGCPSLVPVTEEDIQRELDRRRPGQSDIVTPRRE
CCCCCCCEEEEEECCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCC
EDRAEILSGVLDGKTLGTPIAISVRNKDHRSSAYDEMARTYRPSHADYTYDAKYGIRAWA
HHHHHHHHHHHCCCCCCCCEEEEECCCCHHHHHHHHHHHHCCCCCCCEEEECCCCCEEEC
GGGRASARETIGRVAAGAVARAVLKQAFPDMEVVAWVDQVHHVKASVDWGAVTASAIESN
CCCCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHC
IVRTADPSAAEAMIAAIKEARDSGNSLGGVVKCVVRGCPPGLGDPVFDKLDATLAHAMMS
CEECCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHH
IPATKAFAVGSGFEAADMTGLEHNDPFYMQGCRVRTTTNHSGGIQGGISNGEDILMRIGF
CCCHHHEEECCCCCHHHCCCCCCCCCCEECCCEEEEECCCCCCCCCCCCCCCEEEEEECC
KPTATLMIDQQTVNRDGEDARLKGRGRHDACVLPRAVPIVEAMAWLCLCDHYLRQRCQRA
CCCEEEEEEHHHHCCCCCCCEECCCCCCCCEECCCHHHHHHHHHHHHHHHHHHHHHHHHC
L
C
>Mature Secondary Structure 
SSSFGQVFRISTWGESHGTGVGVVIDGCPSLVPVTEEDIQRELDRRRPGQSDIVTPRRE
CCCCCCEEEEEECCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCC
EDRAEILSGVLDGKTLGTPIAISVRNKDHRSSAYDEMARTYRPSHADYTYDAKYGIRAWA
HHHHHHHHHHHCCCCCCCCEEEEECCCCHHHHHHHHHHHHCCCCCCCEEEECCCCCEEEC
GGGRASARETIGRVAAGAVARAVLKQAFPDMEVVAWVDQVHHVKASVDWGAVTASAIESN
CCCCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHC
IVRTADPSAAEAMIAAIKEARDSGNSLGGVVKCVVRGCPPGLGDPVFDKLDATLAHAMMS
CEECCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHH
IPATKAFAVGSGFEAADMTGLEHNDPFYMQGCRVRTTTNHSGGIQGGISNGEDILMRIGF
CCCHHHEEECCCCCHHHCCCCCCCCCCEECCCEEEEECCCCCCCCCCCCCCCEEEEEECC
KPTATLMIDQQTVNRDGEDARLKGRGRHDACVLPRAVPIVEAMAWLCLCDHYLRQRCQRA
CCCEEEEEEHHHHCCCCCCCEECCCCCCCCEECCCHHHHHHHHHHHHHHHHHHHHHHHHC
L
C

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: NA