Definition Escherichia coli HS, complete genome.
Accession NC_009800
Length 4,643,538

The map label for this gene is ygeA [H]

Identifier: 157162293

GI number: 157162293

Start: 2998264

End: 2998956

Strand: Reverse

Name: ygeA [H]

Synonym: EcHS_A2987

Alternate gene names: 157162293

Gene position: 2998956-2998264 (Counterclockwise)

Preceding gene: 157162294

Following gene: 157162291

Centisome position: 64.58

GC content: 50.65
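
The coordinate fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not the database's documented method: it assumes the gene length is End - Start + 1 and that the centisome position is 100 x (first base of the gene) / (genome length), where the first base of a reverse-strand gene is taken to be the higher coordinate. The function names are hypothetical.

```python
# Values copied from the record above.
GENOME_LENGTH = 4_643_538
START, END = 2_998_264, 2_998_956
STRAND = "Reverse"


def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive coordinate range."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the genome length."""
    return 100.0 * position / genome_length


# Assumption: for a reverse-strand gene the first transcribed base is the
# higher coordinate (End); for a forward-strand gene it is Start.
first_base = END if STRAND == "Reverse" else START

length = gene_length(START, END)
position = centisome(first_base, GENOME_LENGTH)

print(f"Gene length: {length} bases")
print(f"Centisome position: {position:.2f}")
```

Under these assumptions the sketch gives 693 bases and 64.58, in agreement with the sequence header and the Centisome position field. Using the Start coordinate instead gives 64.57, which suggests the record measures from the higher coordinate for this gene.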

Gene sequence:

>693_bases
ATGAAAACAATTGGTTTGCTGGGAGGAATGAGCTGGGAATCCACCATTCCTTACTATCGTTTGATAAATGAAGGCATTAA
ACAGCGGCTTGGTGGGCTTCACTCTGCGCAAGTGCTGCTACATAGCGTCGATTTTCATGAAATAGAAGAGTGCCAGCGTC
GCGGGGAATGGGATAAAACCGGGGACATTCTGGCTGAGGCGGCGCTTGGCTTACAGCGGGCGGGCGCAGAAGGTATTGTG
TTATGTACCAATACGATGCATAAAGTGGCGGATGCCATTGAGTCACGTTGCACTCTGCCTTTCTTACACATTGCGGATGC
CACCGGACGTGCAATTACCGGGGCCGGAATGACTCGTGTGGCGCTGCTGGGTACGCGTTACACCATGGAACAGGATTTTT
ATCGCGGGCGGCTGACGGAACAATTTTCCATCAATTGTCTTATTCCTGAAGCGGATGAACGGGCGAAAATTAATCAGATT
ATTTTTGAAGAACTGTGTCTGGGGCAATTTACCGAAGCGTCACGCGCTTATTATGCGCAAGTGATTGCTCGCCTTGCAGA
ACAGGGCGCACAGGGCGTCATTTTTGGCTGCACAGAAATTGGTTTACTGGTGCCAGAAGAGCGCAGTGTTTTGCCTGTGT
TTGATACCGCGGCGATCCATGCCGAGGATGCTGTCGCTTTTATGCTGTCGTAG

Upstream 100 bases:

>100_bases
GGCCCTTTTTTCGTTAATAGAGATTGGGCACTTGGCCGTTGAGGCGTTTGTCTCGTTCCTTATTCAGCCTTGTTGCGGTA
ACACACATCAGGAGAGAGGA

Downstream 100 bases:

>100_bases
CTGACGACAAAATAGCGTCAAGAGAAGTGACCAGTTTCGGTAACCCCGCTTGTAAATGCCCACTAAACGCCTGAACCAGC
GCTGATGACGGGCGGTGCAG

Product: putative racemase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 230; Mature: 230

Protein sequence:

>230_residues
MKTIGLLGGMSWESTIPYYRLINEGIKQRLGGLHSAQVLLHSVDFHEIEECQRRGEWDKTGDILAEAALGLQRAGAEGIV
LCTNTMHKVADAIESRCTLPFLHIADATGRAITGAGMTRVALLGTRYTMEQDFYRGRLTEQFSINCLIPEADERAKINQI
IFEELCLGQFTEASRAYYAQVIARLAEQGAQGVIFGCTEIGLLVPEERSVLPVFDTAAIHAEDAVAFMLS

Sequences:

>Translated_230_residues
MKTIGLLGGMSWESTIPYYRLINEGIKQRLGGLHSAQVLLHSVDFHEIEECQRRGEWDKTGDILAEAALGLQRAGAEGIV
LCTNTMHKVADAIESRCTLPFLHIADATGRAITGAGMTRVALLGTRYTMEQDFYRGRLTEQFSINCLIPEADERAKINQI
IFEELCLGQFTEASRAYYAQVIARLAEQGAQGVIFGCTEIGLLVPEERSVLPVFDTAAIHAEDAVAFMLS
>Mature_230_residues
MKTIGLLGGMSWESTIPYYRLINEGIKQRLGGLHSAQVLLHSVDFHEIEECQRRGEWDKTGDILAEAALGLQRAGAEGIV
LCTNTMHKVADAIESRCTLPFLHIADATGRAITGAGMTRVALLGTRYTMEQDFYRGRLTEQFSINCLIPEADERAKINQI
IFEELCLGQFTEASRAYYAQVIARLAEQGAQGVIFGCTEIGLLVPEERSVLPVFDTAAIHAEDAVAFMLS

Specific function: Unknown

COG id: COG1794

COG function: function code M; Aspartate racemase

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the aspartate/glutamate racemases family [H]

Homologues:

Organism=Escherichia coli, GI1789205, Length=230, Percent_Identity=99.5652173913043, Blast_Score=468, Evalue=1e-133

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR015942
- InterPro:   IPR001920
- InterPro:   IPR018187
- InterPro:   IPR004380 [H]

Pfam domain/function: PF01177 Asp_Glu_race [H]

EC number: NA

Molecular weight: Translated: 25308; Mature: 25308

Theoretical pI: Translated: 4.85; Mature: 4.85

Prosite motif: PS00923 ASP_GLU_RACEMASE_1; PS00924 ASP_GLU_RACEMASE_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

2.6 %Cys     (Translated Protein)
2.6 %Met     (Translated Protein)
5.2 %Cys+Met (Translated Protein)
2.6 %Cys     (Mature Protein)
2.6 %Met     (Mature Protein)
5.2 %Cys+Met (Mature Protein)
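
The Cys/Met percentages can be recomputed directly from the protein sequence listed in this record. This is a minimal sketch that assumes each percentage is simply the residue count divided by the sequence length, rounded to one decimal place; `residue_percent` is a hypothetical helper, not part of any database tooling.

```python
# Protein sequence copied from the ">230_residues" entry in this record.
PROTEIN = (
    "MKTIGLLGGMSWESTIPYYRLINEGIKQRLGGLHSAQVLLHSVDFHEIEECQRRGEWDKTGDILAEAALGLQRAGAEGIV"
    "LCTNTMHKVADAIESRCTLPFLHIADATGRAITGAGMTRVALLGTRYTMEQDFYRGRLTEQFSINCLIPEADERAKINQI"
    "IFEELCLGQFTEASRAYYAQVIARLAEQGAQGVIFGCTEIGLLVPEERSVLPVFDTAAIHAEDAVAFMLS"
)


def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions in `sequence` occupied by any of `residues`."""
    count = sum(sequence.count(residue) for residue in residues)
    return 100.0 * count / len(sequence)


print(f"{residue_percent(PROTEIN, 'C'):.1f} %Cys")
print(f"{residue_percent(PROTEIN, 'M'):.1f} %Met")
print(f"{residue_percent(PROTEIN, 'CM'):.1f} %Cys+Met")
```

The translated and mature sequences are identical here, so a single calculation covers both sets of values.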

Secondary structure:

>Translated Secondary Structure
MKTIGLLGGMSWESTIPYYRLINEGIKQRLGGLHSAQVLLHSVDFHEIEECQRRGEWDKT
CCCEECCCCCCCHHCCHHHHHHHHHHHHHHCCCHHHHHHHHHCCHHHHHHHHHCCCCCCH
GDILAEAALGLQRAGAEGIVLCTNTMHKVADAIESRCTLPFLHIADATGRAITGAGMTRV
HHHHHHHHHHHHHCCCCEEEEECHHHHHHHHHHHHCCCCCEEEECCCCCCEEECCCHHHH
ALLGTRYTMEQDFYRGRLTEQFSINCLIPEADERAKINQIIFEELCLGQFTEASRAYYAQ
HHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
VIARLAEQGAQGVIFGCTEIGLLVPEERSVLPVFDTAAIHAEDAVAFMLS
HHHHHHHCCCCEEEEEHHHCCEECCCCCCEEEEECCHHHHHCCCEEEECC
>Mature Secondary Structure
MKTIGLLGGMSWESTIPYYRLINEGIKQRLGGLHSAQVLLHSVDFHEIEECQRRGEWDKT
CCCEECCCCCCCHHCCHHHHHHHHHHHHHHCCCHHHHHHHHHCCHHHHHHHHHCCCCCCH
GDILAEAALGLQRAGAEGIVLCTNTMHKVADAIESRCTLPFLHIADATGRAITGAGMTRV
HHHHHHHHHHHHHCCCCEEEEECHHHHHHHHHHHHCCCCCEEEECCCCCCEEECCCHHHH
ALLGTRYTMEQDFYRGRLTEQFSINCLIPEADERAKINQIIFEELCLGQFTEASRAYYAQ
HHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
VIARLAEQGAQGVIFGCTEIGLLVPEERSVLPVFDTAAIHAEDAVAFMLS
HHHHHHHCCCCEEEEEHHHCCEECCCCCCEEEEECCHHHHHCCCEEEECC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 6350602; 2836407; 9278503 [H]