Definition Xanthomonas oryzae pv. oryzae MAFF 311018, complete genome.
Accession NC_007705
Length 4,940,217

The map label for this gene is trpE [H]

Identifier: 84625481

GI number: 84625481

Start: 4328487

End: 4329962

Strand: Reverse

Name: trpE [H]

Synonym: XOO_3824

Alternate gene names: 84625481

Gene position: 4329962-4328487 (Counterclockwise)

Preceding gene: 84625482

Following gene: 84625480

Centisome position: 87.65
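
The coordinate fields above can be cross-checked against one another. The following is a minimal sketch, assuming (the record does not state this) that the centisome position is the gene's start coordinate expressed as a percentage of the genome length, and that for a reverse-strand gene the start is the higher coordinate (4329962). The function names are hypothetical.

```python
GENOME_LENGTH = 4_940_217            # "Length" field of NC_007705
START, END = 4_328_487, 4_329_962    # "Start" / "End" fields (reverse strand)


def gene_length(start: int, end: int) -> int:
    """Inclusive length, in bases, of a feature spanning start..end."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, rounded to 2 decimals."""
    return round(100.0 * position / genome_length, 2)


length = gene_length(START, END)
codons = length // 3                 # includes the stop codon

# On the reverse strand the gene begins at the higher coordinate (END).
print(length, codons, centisome(END, GENOME_LENGTH))
```

Under these assumptions the length comes out at 1476 bases (492 codons, i.e. 491 residues plus a stop codon) and the centisome at 87.65, in agreement with the record.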

GC content: 64.43

Gene sequence:

>1476_bases
TTGATCACTGCAGAGCAGTTCCAGCGCCAGGCCGCTGAAGGTCATACCCGTATTCCCGTTGTCCGCGAAGTGCTGTCCGA
CCTGGACACGCCGCTGTCGGTCTATCTGAAGCTCGCCGACGGCGCTTACACCTATCTGTTCGAATCGGTCGAAGGCGGTG
AGCGCTTCGGGCGCTATTCCATCATTGGCCTGCCGGCGCGCCGCGTGTACAGCTTTCGCGGTCACACGCTGGAGGTCAGC
GAGCACGGCGAAGTGGTCGACACCCGCGACGTCGCCGACCCGCTAGCCGAAGTGGACGCACTGCGCGCCGAGCATTCGGT
GCCGCAACTCGATAGCCTGCCCGGCTTCACCGGCGGGCTTGTCGGCTGGTTCGGTTTCGAGTGCATCCAGTACATCGAAC
CTCGCCTGGGCAGCGGCGACAAAGCTGACGAACTTGGCACGCCCGACATTCTGCTGATGCTCTCCGAAGAGCTGGCGGTG
TTCGACAACCTCAAGGGCCGCTTGTACCTGATCGTGCATGCCGACCCGCGTCAGCCGCAGGCCTATGTGCGTGCCAATCG
GCGTCTGGACGAACTGGCGCACCGCCTGCGTCAGGGCGGCGCCGGCTACCCGCAGGCGCAGATTTCCGATGCGATCGACG
AATCGGATTTCCACTCCTCGTTTACCCGCGAGCAGTACCACGCGGTGGTGCGCAAGGCGCAGGAATACGTACGCGCCGGC
GACATCTTCCAGGTGGTGCCATCGCAACGGTTACGCGTGCCGTTCCGTGCGCGCCCGGTGGATGTGTATCGCGCCTTGCG
TGCGTTGAATCCCTCGCCGTACATGTATTTTCTCGACGTCGGCGGCACCCAGGTGGTCGGCTCGTCACCGGAAATCCTGG
CGCGTCTGCGCGATGGCGTGGTCACCGTGCGGCCCATCGCCGGCACCCGCCCGCGTGGCGCAACGCCAGAGCTGGACAAG
GCGCTGGAAGAAGAATTGCTGGCCGACCCGAAAGAGCGCGCCGAGCACGTGATGCTGATCGACCTGGGCCGCAACGATGT
CGGCCGCGTGGCCGAACCGGGCACGGTCAAGGTGGGCGAGCAGTTCGTGATCGAACGCTATAGCCATGTCATGCATATCG
TCAGCGAAGTGACCGGCACGCTGAAAGCGGGTTTGAACTACAGCGATGTATTGCGCGCCACGTTCCCGGCCGGCACCGTC
AGCGGCGCGCCGAAAATCCGCGCGCTGGAAATCATTCGTGAGCTGGAACCGGTCAAGCGCAACGTCTACTCCGGTGCGGT
CGGCTACATCGGCTGGCACGGCGATGCCGACACCGCCATCGCCATCCGCACCGCTGTGATCCAGGACGGGTATCTGTATG
TGCAGGCCGGTGGCGGCGTGGTCTACGACTCCGACCCCGACCTGGAATGGCAGGAAACCATGAACAAGGGGCGCGCGCTG
TTTCGCGCAGTTGCCCAGGCGGCAAAGGGCTTGTGA
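
The GC content field is presumably the percentage of G and C bases in the gene sequence above. A minimal sketch of that calculation, assuming a simple count over the coding sequence with no special handling of ambiguous bases (`gc_content` is a hypothetical helper, not part of the source database):

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence, to 2 decimals."""
    seq = "".join(seq.split()).upper()   # drop line breaks from FASTA-style text
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# Usage: paste the sequence lines (without the ">1476_bases" header).
print(gc_content("TTGATCACTGCAGAGCAGTTCCAGCGCCAG"))
```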

Upstream 100 bases:

>100_bases
CTGACGACTCCACGCTCCTTCTCTTCCGCTCGCCGCTGGTGGCGATGGTGGCCCGTTGCCGTCGTCCGCCCCGCCACAGA
GTCCCGGAAGGAAAGTCGTC

Downstream 100 bases:

>100_bases
TGGACGGGCAGGGCAGCGCCGCGCGCATCCGCGTTTGCTGCAGGCATTGGCGCAGGCCAGTGCCGAGCAGTGCGCAGGCT
ACGGCACTGATCGGCAAACA

Product: anthranilate synthase component I

Products: NA

Alternate protein names: Anthranilate synthase component I [H]

Number of amino acids: Translated: 491; Mature: 491

Protein sequence:

>491_residues
MITAEQFQRQAAEGHTRIPVVREVLSDLDTPLSVYLKLADGAYTYLFESVEGGERFGRYSIIGLPARRVYSFRGHTLEVS
EHGEVVDTRDVADPLAEVDALRAEHSVPQLDSLPGFTGGLVGWFGFECIQYIEPRLGSGDKADELGTPDILLMLSEELAV
FDNLKGRLYLIVHADPRQPQAYVRANRRLDELAHRLRQGGAGYPQAQISDAIDESDFHSSFTREQYHAVVRKAQEYVRAG
DIFQVVPSQRLRVPFRARPVDVYRALRALNPSPYMYFLDVGGTQVVGSSPEILARLRDGVVTVRPIAGTRPRGATPELDK
ALEEELLADPKERAEHVMLIDLGRNDVGRVAEPGTVKVGEQFVIERYSHVMHIVSEVTGTLKAGLNYSDVLRATFPAGTV
SGAPKIRALEIIRELEPVKRNVYSGAVGYIGWHGDADTAIAIRTAVIQDGYLYVQAGGGVVYDSDPDLEWQETMNKGRAL
FRAVAQAAKGL
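
The protein sequence above follows from the gene sequence by ordinary codon translation. The sketch below assumes the standard codon assignments and treats an ATG, GTG or TTG initiator codon as methionine, which is the usual bacterial convention; the gene here begins with TTG. The names `CODON_TABLE` and `translate` are illustrative only.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
START_CODONS = {"ATG", "GTG", "TTG"}   # assumed bacterial initiator codons


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        if pos == 0 and codon in START_CODONS:
            residue = "M"              # initiator codon is read as Met
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("TTGATCACTGCAGAGCAGTTCCAGCGCCAG"))  # MITAEQFQRQ
```

The first ten residues produced this way match the start of the listed protein (MITAEQFQRQ).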

Sequences:

>Translated_491_residues
MITAEQFQRQAAEGHTRIPVVREVLSDLDTPLSVYLKLADGAYTYLFESVEGGERFGRYSIIGLPARRVYSFRGHTLEVS
EHGEVVDTRDVADPLAEVDALRAEHSVPQLDSLPGFTGGLVGWFGFECIQYIEPRLGSGDKADELGTPDILLMLSEELAV
FDNLKGRLYLIVHADPRQPQAYVRANRRLDELAHRLRQGGAGYPQAQISDAIDESDFHSSFTREQYHAVVRKAQEYVRAG
DIFQVVPSQRLRVPFRARPVDVYRALRALNPSPYMYFLDVGGTQVVGSSPEILARLRDGVVTVRPIAGTRPRGATPELDK
ALEEELLADPKERAEHVMLIDLGRNDVGRVAEPGTVKVGEQFVIERYSHVMHIVSEVTGTLKAGLNYSDVLRATFPAGTV
SGAPKIRALEIIRELEPVKRNVYSGAVGYIGWHGDADTAIAIRTAVIQDGYLYVQAGGGVVYDSDPDLEWQETMNKGRAL
FRAVAQAAKGL
>Mature_491_residues
MITAEQFQRQAAEGHTRIPVVREVLSDLDTPLSVYLKLADGAYTYLFESVEGGERFGRYSIIGLPARRVYSFRGHTLEVS
EHGEVVDTRDVADPLAEVDALRAEHSVPQLDSLPGFTGGLVGWFGFECIQYIEPRLGSGDKADELGTPDILLMLSEELAV
FDNLKGRLYLIVHADPRQPQAYVRANRRLDELAHRLRQGGAGYPQAQISDAIDESDFHSSFTREQYHAVVRKAQEYVRAG
DIFQVVPSQRLRVPFRARPVDVYRALRALNPSPYMYFLDVGGTQVVGSSPEILARLRDGVVTVRPIAGTRPRGATPELDK
ALEEELLADPKERAEHVMLIDLGRNDVGRVAEPGTVKVGEQFVIERYSHVMHIVSEVTGTLKAGLNYSDVLRATFPAGTV
SGAPKIRALEIIRELEPVKRNVYSGAVGYIGWHGDADTAIAIRTAVIQDGYLYVQAGGGVVYDSDPDLEWQETMNKGRAL
FRAVAQAAKGL

Specific function: Tryptophan biosynthesis; first step. [C]

COG id: COG0147

COG function: function code EH; Anthranilate/para-aminobenzoate synthases component I

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the anthranilate synthase component I family [H]

Homologues:

Organism=Escherichia coli, GI1787518, Length=394, Percent_Identity=38.8324873096447, Blast_Score=248, Evalue=5e-67
Organism=Escherichia coli, GI1788114, Length=474, Percent_Identity=32.7004219409283, Blast_Score=239, Evalue=3e-64
Organism=Escherichia coli, GI87082077, Length=280, Percent_Identity=28.2142857142857, Blast_Score=92, Evalue=9e-20
Organism=Escherichia coli, GI1786809, Length=209, Percent_Identity=27.7511961722488, Blast_Score=82, Evalue=9e-17
Organism=Saccharomyces cerevisiae, GI6320935, Length=499, Percent_Identity=38.0761523046092, Blast_Score=318, Evalue=1e-87
Organism=Saccharomyces cerevisiae, GI6324361, Length=392, Percent_Identity=26.7857142857143, Blast_Score=117, Evalue=4e-27
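
Each homologue line is a comma-separated list of `key=value` fields, except for the GI number, which is written as a bare `GI…` token. A minimal parsing sketch follows; `parse_homologue` and the output key names are hypothetical, and the field layout is assumed to be exactly as shown in the lines above.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=...' line into typed fields."""
    raw = {}
    for part in line.strip().rstrip(",").split(","):
        part = part.strip()
        if "=" in part:
            key, value = part.split("=", 1)
            raw[key] = value
        elif part.startswith("GI"):
            raw["GI"] = part[2:]       # bare GI token carries no '='
    return {
        "organism": raw["Organism"],
        "gi": raw["GI"],
        "length": int(raw["Length"]),
        "percent_identity": float(raw["Percent_Identity"]),
        "blast_score": int(raw["Blast_Score"]),
        "evalue": float(raw["Evalue"]),
    }


hit = parse_homologue(
    "Organism=Escherichia coli, GI1787518, Length=394, "
    "Percent_Identity=38.8324873096447, Blast_Score=248, Evalue=5e-67,"
)
print(hit["organism"], hit["gi"], round(hit["percent_identity"], 2))
```

Parsed this way, the hits can be sorted by `blast_score` or filtered by `evalue` without further string handling.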

Paralogues:

None

Copy number: 700 molecules/cell in glucose minimal media [C]

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR005801
- InterPro:   IPR019999
- InterPro:   IPR006805
- InterPro:   IPR005256
- InterPro:   IPR015890 [H]

Pfam domain/function: PF04715 Anth_synt_I_N; PF00425 Chorismate_bind [H]

EC number: 4.1.3.27 [H]

Molecular weight: Translated: 54224; Mature: 54224

Theoretical pI: Translated: 5.51; Mature: 5.51

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.2 %Cys     (Translated Protein)
1.2 %Met     (Translated Protein)
1.4 %Cys+Met (Translated Protein)
0.2 %Cys     (Mature Protein)
1.2 %Met     (Mature Protein)
1.4 %Cys+Met (Mature Protein)
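
The Cys/Met percentages above are simple residue frequencies. A minimal sketch, assuming they are computed as the count of C and M residues over the protein length and rounded to one decimal (`cys_met_content` is a hypothetical helper):

```python
from collections import Counter


def cys_met_content(protein: str) -> dict:
    """Percent Cys, Met and Cys+Met in a protein sequence, to 1 decimal."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    counts = Counter(protein)
    total = len(protein)
    cys, met = counts["C"], counts["M"]
    return {
        "Cys": round(100.0 * cys / total, 1),
        "Met": round(100.0 * met / total, 1),
        "Cys+Met": round(100.0 * (cys + met) / total, 1),
    }


# Toy input; pass the 491-residue sequence from this record for real use.
print(cys_met_content("MCAA"))
```

For this record, one Cys and six Met among 491 residues give 0.2 %, 1.2 % and 1.4 % respectively, matching the listed values.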

Secondary structure:

>Translated Secondary Structure
MITAEQFQRQAAEGHTRIPVVREVLSDLDTPLSVYLKLADGAYTYLFESVEGGERFGRYS
CCCHHHHHHHHHCCCCCCHHHHHHHHHCCCCHHEEEECCCCCEEHHHHHHCCHHHCCCEE
IIGLPARRVYSFRGHTLEVSEHGEVVDTRDVADPLAEVDALRAEHSVPQLDSLPGFTGGL
EEECCHHHHHHCCCCEEEECCCCCEECHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHH
VGWFGFECIQYIEPRLGSGDKADELGTPDILLMLSEELAVFDNLKGRLYLIVHADPRQPQ
HHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHCCCCEEEEEEECCCCCCH
AYVRANRRLDELAHRLRQGGAGYPQAQISDAIDESDFHSSFTREQYHAVVRKAQEYVRAG
HHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHC
DIFQVVPSQRLRVPFRARPVDVYRALRALNPSPYMYFLDVGGTQVVGSSPEILARLRDGV
CHHEECCCCCCCCCCCCCCHHHHHHHHHCCCCCEEEEEECCCCEEECCCHHHHHHHHCCC
VTVRPIAGTRPRGATPELDKALEEELLADPKERAEHVMLIDLGRNDVGRVAEPGTVKVGE
EEEEECCCCCCCCCCCHHHHHHHHHHHCCCHHHCCEEEEEECCCCCCCCCCCCCCEEHHH
QFVIERYSHVMHIVSEVTGTLKAGLNYSDVLRATFPAGTVSGAPKIRALEIIRELEPVKR
HHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHH
NVYSGAVGYIGWHGDADTAIAIRTAVIQDGYLYVQAGGGVVYDSDPDLEWQETMNKGRAL
HHHCCCCEEEEECCCCCCCCEEEHEEEECCEEEEEECCEEEECCCCCCCHHHHHHHHHHH
FRAVAQAAKGL
HHHHHHHHCCC
>Mature Secondary Structure
MITAEQFQRQAAEGHTRIPVVREVLSDLDTPLSVYLKLADGAYTYLFESVEGGERFGRYS
CCCHHHHHHHHHCCCCCCHHHHHHHHHCCCCHHEEEECCCCCEEHHHHHHCCHHHCCCEE
IIGLPARRVYSFRGHTLEVSEHGEVVDTRDVADPLAEVDALRAEHSVPQLDSLPGFTGGL
EEECCHHHHHHCCCCEEEECCCCCEECHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHH
VGWFGFECIQYIEPRLGSGDKADELGTPDILLMLSEELAVFDNLKGRLYLIVHADPRQPQ
HHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHCCCCEEEEEEECCCCCCH
AYVRANRRLDELAHRLRQGGAGYPQAQISDAIDESDFHSSFTREQYHAVVRKAQEYVRAG
HHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHC
DIFQVVPSQRLRVPFRARPVDVYRALRALNPSPYMYFLDVGGTQVVGSSPEILARLRDGV
CHHEECCCCCCCCCCCCCCHHHHHHHHHCCCCCEEEEEECCCCEEECCCHHHHHHHHCCC
VTVRPIAGTRPRGATPELDKALEEELLADPKERAEHVMLIDLGRNDVGRVAEPGTVKVGE
EEEEECCCCCCCCCCCHHHHHHHHHHHCCCHHHCCEEEEEECCCCCCCCCCCCCCEEHHH
QFVIERYSHVMHIVSEVTGTLKAGLNYSDVLRATFPAGTVSGAPKIRALEIIRELEPVKR
HHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHH
NVYSGAVGYIGWHGDADTAIAIRTAVIQDGYLYVQAGGGVVYDSDPDLEWQETMNKGRAL
HHHCCCCEEEEECCCCCCCCEEEHEEEECCEEEEEECCEEEECCCCCCCHHHHHHHHHHH
FRAVAQAAKGL
HHHHHHHHCCC
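
The secondary-structure strings above use one letter per residue; the letters are assumed here to mean H = helix, E = strand and C = coil, which the record does not spell out. Under that assumption, the share of each state can be summarised with a short sketch like the one below (`ss_fractions` is a hypothetical helper):

```python
from collections import Counter


def ss_fractions(ss: str) -> dict:
    """Percentage of H, E and C states in a secondary-structure string."""
    ss = "".join(ss.split()).upper()
    if not ss:
        raise ValueError("empty structure string")
    counts = Counter(ss)
    return {state: round(100.0 * counts[state] / len(ss), 1) for state in "HEC"}


# Final structure line of this record (11 residues).
print(ss_fractions("HHHHHHHHCCC"))
```

Concatenating all structure lines (skipping the interleaved sequence lines) before calling the function gives the whole-protein composition.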

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 1987141 [H]