Definition Escherichia coli HS, complete genome.
Accession NC_009800
Length 4,643,538

Click here to switch to the map view.

The map label for this gene is yehK

Identifier: 157161600

GI number: 157161600

Start: 2246320

End: 2246637

Strand: Direct

Name: yehK

Synonym: EcHS_A2252

Alternate gene names: 157161600

Gene position: 2246320-2246637 (Clockwise)

Preceding gene: 157161599

Following gene: 157161601

Centisome position: 48.38

GC content: 38.68

Gene sequence:

>318_bases
ATGATCGTGCAAAAAGAGCTGGTTGCTATTTACGATTATGAGGTCCCTGTACCTGAAGATCCGTTTTCCTTCAGACTTGA
GATCCATAAATGCTCTGAATTATTTACAGGTTCCGTCTATCGACTGGAGCGATTCCGGCTACGTCCAACATTTCATCAAC
GTGATCGAGAAGATGCTGACCCGCTAATAAATGATGCGTTGATTTATATAAGAGATGAGTGTATTGATGAGCGGAAATTA
CGAGGTGAATCACCTGAAACTGTAATAGCAATTTTTAATCGTGAACTACAGAATATATTCAACCAAGAAATAGAATAA

Upstream 100 bases:

>100_bases
GAGTGAGGCGTTAAGCGCACCTGACGTCATTTTCCATTAAAACACAGCGGGCAGTGATGCGACTGCCCGTTATCTACACG
ACTTACCAGCGGGGAAAGCG

Downstream 100 bases:

>100_bases
TATACTCTAAATAATTCAAATTGGTCCGATCCGGCGCAACGTCCCAATGGCCTGGATTATAAATCTCATTATCTTAATTG
CAACGGGGTCCAGCCGTGGT

Product: hypothetical protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 105; Mature: 105

Protein sequence:

>105_residues
MIVQKELVAIYDYEVPVPEDPFSFRLEIHKCSELFTGSVYRLERFRLRPTFHQRDREDADPLINDALIYIRDECIDERKL
RGESPETVIAIFNRELQNIFNQEIE

Sequences:

>Translated_105_residues
MIVQKELVAIYDYEVPVPEDPFSFRLEIHKCSELFTGSVYRLERFRLRPTFHQRDREDADPLINDALIYIRDECIDERKL
RGESPETVIAIFNRELQNIFNQEIE
>Mature_105_residues
MIVQKELVAIYDYEVPVPEDPFSFRLEIHKCSELFTGSVYRLERFRLRPTFHQRDREDADPLINDALIYIRDECIDERKL
RGESPETVIAIFNRELQNIFNQEIE

Specific function: Unknown

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasm [C]

Metaboloic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

Organism=Escherichia coli, GI87082048, Length=105, Percent_Identity=100, Blast_Score=212, Evalue=3e-57,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): YEHK_ECOLI (P33347)

Other databases:

- EMBL:   U00007
- EMBL:   U00096
- EMBL:   AP009048
- RefSeq:   AP_002714.1
- RefSeq:   YP_588459.1
- ProteinModelPortal:   P33347
- STRING:   P33347
- EnsemblBacteria:   EBESCT00000004253
- EnsemblBacteria:   EBESCT00000015442
- GeneID:   4056035
- GenomeReviews:   AP009048_GR
- GenomeReviews:   U00096_GR
- KEGG:   ecj:JW2106
- KEGG:   eco:b4541
- EchoBASE:   EB1937
- EcoGene:   EG11997
- GeneTree:   EBGT00050000012020
- HOGENOM:   HBG469293
- OMA:   RDEFIDE
- ProtClustDB:   CLSK880296
- BioCyc:   EcoCyc:MONOMER0-2680
- Genevestigator:   P33347

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 12603; Mature: 12603

Theoretical pI: Translated: 4.47; Mature: 4.47

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.9 %Cys     (Translated Protein)
1.0 %Met     (Translated Protein)
2.9 %Cys+Met (Translated Protein)
1.9 %Cys     (Mature Protein)
1.0 %Met     (Mature Protein)
2.9 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MIVQKELVAIYDYEVPVPEDPFSFRLEIHKCSELFTGSVYRLERFRLRPTFHQRDREDAD
CCCCCCEEEEECCCCCCCCCCCEEEEEHHHHHHHHCCCHHHHHHHHCCCCHHCCCCCCCC
PLINDALIYIRDECIDERKLRGESPETVIAIFNRELQNIFNQEIE
HHHHHHHEEEHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCC
>Mature Secondary Structure
MIVQKELVAIYDYEVPVPEDPFSFRLEIHKCSELFTGSVYRLERFRLRPTFHQRDREDAD
CCCCCCEEEEECCCCCCCCCCCEEEEEHHHHHHHHCCCHHHHHHHHCCCCHHCCCCCCCC
PLINDALIYIRDECIDERKLRGESPETVIAIFNRELQNIFNQEIE
HHHHHHHEEEHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 9278503