Definition: Escherichia coli HS, complete genome
Accession: NC_009800
Length: 4,643,538 bp

The map label for this gene is ynjA

Identifier: 157161215

GI number: 157161215

Start: 1853597

End: 1854145

Strand: Direct

Name: ynjA

Synonym: EcHS_A1837

Alternate gene names: 157161215

Gene position: 1853597-1854145 (Clockwise)

Preceding gene: 157161214

Following gene: 157161216

Centisome position: 39.92
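
The centisome value is consistent with the gene's start coordinate expressed as a percentage of the genome length. A minimal sketch, assuming that definition (the function name is illustrative and not part of the record):

```python
GENOME_LENGTH = 4_643_538  # length of NC_009800, from the record header
GENE_START = 1_853_597     # start coordinate of ynjA


def centisome_position(start: int, genome_length: int) -> float:
    """Return the start coordinate as a percentage of the genome length."""
    return 100.0 * start / genome_length


print(f"{centisome_position(GENE_START, GENOME_LENGTH):.2f}")  # prints 39.92
```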

GC content: 56.28

Gene sequence:

>549_bases
ATGGGTTTACCGCCGCTTAGCAAAATTCCTTTAATTTTACGTCCACAGGCGTGGCTGCATCGTCGCCATTACGGCGAGGT
GCTAAGCCCCATTCGCTGGTGGGGGCGGATCCCGTTTATCTTTTATCTGGTGTCGATGTTTGTTGGCTGGCTGGAGCGCA
AACGCTCACCGCTCGATCCGGTAGTACGATCGCTTGTCAGCGCGCGCATTGCGCAAATGTGCCTGTGTGAGTTTTGTGTG
GATATCACCAGTATGAAAGTCGCCGAGCGCACCGGCAGCAGCGATAAACTGCTGGCAGTGGCTGACTGGCGGCAAAGCCC
GCTCTTTAGCGATGAAGAACGGCTGGCGCTGGAGTACGCCGAAGCCGCAAGCGTAACGCCGCCAACGGTCGATGATGCCC
TGCGTACCCGACTGGCTGCGCATTTTGACGCTCAGGCGCTCACCGAACTGACGGCATTGATCGGCCTGCAAAATCTGTCA
GCCCGTTTTAATTCTGCCATGGACATTCCCGCTCAGGGGCTGTGCCGTATTCCTGAAAAACGTTCTTAA
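
The GC content reported above can be checked against the gene sequence by counting G and C bases. A minimal sketch, assuming GC content is defined as the percentage of G plus C over all bases (the helper name is hypothetical, and the input shown is a toy fragment rather than the full gene):

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Remove whitespace so FASTA-style wrapped lines can be pasted in directly.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Toy input: the first 12 bases of the gene. Substitute the full 549-base
# sequence to compare the result with the value in the record.
print(round(gc_content("ATGGGTTTACCG"), 2))  # prints 50.0
```

For a 549-base sequence, a reported value of 56.28 would correspond to 309 G or C bases, since 309 / 549 is approximately 0.5628.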

Upstream 100 bases:

>100_bases
CTTTCTGGTTTGTCACCGGACTGTTTATTCTGTTTGCCCTGACCGTGGTGATTTTTATGGCGAAGAAAATATGGCTTGAA
CGCCAGAAGAGGAATGCCTG

Downstream 100 bases:

>100_bases
GGAGAGATGATGCGCCATTGTGGGTGGTTGCTGGGATTGTTATCGCTGTTTTCTCTGGCAACACATGCCAGTGACTGGCA
AGAAATTAAAAATGAGGCCA

Product: carboxymuconolactone decarboxylase family protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 182; Mature: 181

Protein sequence:

>182_residues
MGLPPLSKIPLILRPQAWLHRRHYGEVLSPIRWWGRIPFIFYLVSMFVGWLERKRSPLDPVVRSLVSARIAQMCLCEFCV
DITSMKVAERTGSSDKLLAVADWRQSPLFSDEERLALEYAEAASVTPPTVDDALRTRLAAHFDAQALTELTALIGLQNLS
ARFNSAMDIPAQGLCRIPEKRS

Sequences:

>Mature_181_residues
GLPPLSKIPLILRPQAWLHRRHYGEVLSPIRWWGRIPFIFYLVSMFVGWLERKRSPLDPVVRSLVSARIAQMCLCEFCVD
ITSMKVAERTGSSDKLLAVADWRQSPLFSDEERLALEYAEAASVTPPTVDDALRTRLAAHFDAQALTELTALIGLQNLSA
RFNSAMDIPAQGLCRIPEKRS

Specific function: Unknown

COG id: COG2128

COG function: function code S; Uncharacterized conserved protein

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: To M.tuberculosis Rv2313c

Homologues:

Organism=Escherichia coli, GI1788050, Length=182, Percent_Identity=100, Blast_Score=370, Evalue=1e-104

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): YNJA_ECOLI (P76222)

Other databases:

- EMBL:   U00096
- EMBL:   AP009048
- PIR:   A64935
- RefSeq:   AP_002372.1
- RefSeq:   NP_416267.1
- ProteinModelPortal:   P76222
- SMR:   P76222
- STRING:   P76222
- EnsemblBacteria:   EBESCT00000000680
- EnsemblBacteria:   EBESCT00000018290
- GeneID:   946270
- GenomeReviews:   AP009048_GR
- GenomeReviews:   U00096_GR
- KEGG:   ecj:JW1742
- KEGG:   eco:b1753
- EchoBASE:   EB3759
- EcoGene:   EG14003
- eggNOG:   COG2128
- GeneTree:   EBGT00050000011629
- HOGENOM:   HBG659726
- OMA:   HIPRLLN
- ProtClustDB:   CLSK880182
- BioCyc:   EcoCyc:G6948-MONOMER
- Genevestigator:   P76222
- InterPro:   IPR003779

Pfam domain/function: PF02627 CMD

EC number: NA

Molecular weight: Translated: 20533; Mature: 20402

Theoretical pI: Translated: 8.65; Mature: 8.65

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

2.2 %Cys     (Translated Protein)
2.7 %Met     (Translated Protein)
4.9 %Cys+Met (Translated Protein)
2.2 %Cys     (Mature Protein)
2.2 %Met     (Mature Protein)
4.4 %Cys+Met (Mature Protein)
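
The percentages above follow directly from residue counts in the two protein sequences. A short sketch that reproduces them, taking the mature form to be the translated sequence without its initiator Met, as the sequences in this record show (the function name is illustrative):

```python
TRANSLATED = (
    "MGLPPLSKIPLILRPQAWLHRRHYGEVLSPIRWWGRIPFIFYLVSMFVGWLERKRSPLDPVVRSLVSARIAQMCLCEFCV"
    "DITSMKVAERTGSSDKLLAVADWRQSPLFSDEERLALEYAEAASVTPPTVDDALRTRLAAHFDAQALTELTALIGLQNLS"
    "ARFNSAMDIPAQGLCRIPEKRS"
)
MATURE = TRANSLATED[1:]  # mature form: initiator Met removed


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any residue in `residues`."""
    return 100.0 * sum(protein.count(r) for r in residues) / len(protein)


for label, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
    cys = residue_percent(seq, "C")
    met = residue_percent(seq, "M")
    both = residue_percent(seq, "CM")
    print(f"{label}: {cys:.1f} %Cys, {met:.1f} %Met, {both:.1f} %Cys+Met")
```

The translated sequence contains 4 Cys and 5 Met in 182 residues; removing the initiator Met leaves 4 Cys and 4 Met in 181 residues, which accounts for the lower Met figure in the mature protein.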

Secondary structure:

>Translated Secondary Structure
MGLPPLSKIPLILRPQAWLHRRHYGEVLSPIRWWGRIPFIFYLVSMFVGWLERKRSPLDP
CCCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCHHH
VVRSLVSARIAQMCLCEFCVDITSMKVAERTGSSDKLLAVADWRQSPLFSDEERLALEYA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCHHHHHHHHH
EAASVTPPTVDDALRTRLAAHFDAQALTELTALIGLQNLSARFNSAMDIPAQGLCRIPEK
HHCCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHCCCCC
RS
CC
>Mature Secondary Structure 
GLPPLSKIPLILRPQAWLHRRHYGEVLSPIRWWGRIPFIFYLVSMFVGWLERKRSPLDP
CCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCHHH
VVRSLVSARIAQMCLCEFCVDITSMKVAERTGSSDKLLAVADWRQSPLFSDEERLALEYA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCHHHHHHHHH
EAASVTPPTVDDALRTRLAAHFDAQALTELTALIGLQNLSARFNSAMDIPAQGLCRIPEK
HHCCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHCCCCC
RS
CC

PDB accession: NA

Resolution: NA

Structure class: Alpha

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: PubMed 9278503