Definition Escherichia coli HS, complete genome.
Accession NC_009800
Length 4,643,538

Click here to switch to the map view.

The map label for this gene is ycfJ

Identifier: 157160637

GI number: 157160637

Start: 1231304

End: 1231843

Strand: Direct

Name: ycfJ

Synonym: EcHS_A1233

Alternate gene names: 157160637

Gene position: 1231304-1231843 (Clockwise)

Preceding gene: 157160636

Following gene: 157160639

Centisome position: 26.52

GC content: 53.33

Gene sequence:

>540_bases
GTGAATAAATCAATGTTGGCGGGTATCGGGATTGGTGTCGCAGCTGCGCTGGGCGTAGCGGCAGTGGCCAGTCTGAACGT
GTTTGAACGTGGCCCGCAATACGCTCAGGTTGTTTCTGCAACCCCAATCAAGGAAACGGTTAAAACACCGCGTCAGGAGT
GTCGCAACGTCACAGTGACCCATCGTCGACCGGTGCAGGATGAAAATCGCATTACCGGGTCGGTGCTCGGCGCTGTTGCT
GGCGGCGTGATAGGGCATCAGTTTGGCGGTGGTCGCGGTAAAGATGTCGCCACTGTTGTGGGGGCGCTGGGTGGTGGATA
TGCCGGTAACCAGATCCAGGGCTCTCTCCAGGAAAGCGATACTTACACGACTACGCAACAGCGTTGTAAAACGGTGTATG
ACAAGTCAGAAAAAATGCTCGGTTATGATGTGACCTATAAGATTGGCGATCAGCAGGGCAAAATCCGCATGGACCGCGAT
CCGGGTACGCAGATCCCGCTAGATAGTAATGGGCAACTGATTTTGAATAACAAAGTATAA

Upstream 100 bases:

>100_bases
AGTATTTCCAGCCATTCCCGCGCTTTTCATCTTCTGTCTGATAGCTGCTTTTCTCCTTCGCTTGCATGATTGGCATAACT
GCAAAGAAGGAGGTGTTCCC

Downstream 100 bases:

>100_bases
CAAGGCTGTACTCTGCAATTTGGCCCCTCATTCGCTCAGGCTGAGGGGCTTTTTTTGCGACTTATTTCACCAGTTCGGGC
CATAAACGCAAAGTCGTTCC

Product: hypothetical protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 179; Mature: 179

Protein sequence:

>179_residues
MNKSMLAGIGIGVAAALGVAAVASLNVFERGPQYAQVVSATPIKETVKTPRQECRNVTVTHRRPVQDENRITGSVLGAVA
GGVIGHQFGGGRGKDVATVVGALGGGYAGNQIQGSLQESDTYTTTQQRCKTVYDKSEKMLGYDVTYKIGDQQGKIRMDRD
PGTQIPLDSNGQLILNNKV

Sequences:

>Translated_179_residues
MNKSMLAGIGIGVAAALGVAAVASLNVFERGPQYAQVVSATPIKETVKTPRQECRNVTVTHRRPVQDENRITGSVLGAVA
GGVIGHQFGGGRGKDVATVVGALGGGYAGNQIQGSLQESDTYTTTQQRCKTVYDKSEKMLGYDVTYKIGDQQGKIRMDRD
PGTQIPLDSNGQLILNNKV
>Mature_179_residues
MNKSMLAGIGIGVAAALGVAAVASLNVFERGPQYAQVVSATPIKETVKTPRQECRNVTVTHRRPVQDENRITGSVLGAVA
GGVIGHQFGGGRGKDVATVVGALGGGYAGNQIQGSLQESDTYTTTQQRCKTVYDKSEKMLGYDVTYKIGDQQGKIRMDRD
PGTQIPLDSNGQLILNNKV

Specific function: Unknown

COG id: COG3134

COG function: function code S; Predicted outer membrane lipoprotein

Gene ontology:

Cell location: Membrane; Single-pass membrane protein (Potential)

Metaboloic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: To Rickettsia 17 kDa surface antigen

Homologues:

Organism=Escherichia coli, GI1787353, Length=179, Percent_Identity=100, Blast_Score=362, Evalue=1e-102,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): YCFJ_ECOL6 (P0AB36)

Other databases:

- EMBL:   AE014075
- RefSeq:   NP_753293.1
- ProteinModelPortal:   P0AB36
- EnsemblBacteria:   EBESCT00000040292
- GeneID:   1035083
- GenomeReviews:   AE014075_GR
- KEGG:   ecc:c1383
- GeneTree:   EBGT00050000008864
- HOGENOM:   HBG751820
- OMA:   PREVCKD
- ProtClustDB:   PRK11280
- InterPro:   IPR008816

Pfam domain/function: PF05433 Rick_17kDa_Anti

EC number: NA

Molecular weight: Translated: 18921; Mature: 18921

Theoretical pI: Translated: 9.64; Mature: 9.64

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

HASH(0x11c3870c)-;

Cys/Met content:

1.1 %Cys     (Translated Protein)
2.2 %Met     (Translated Protein)
3.4 %Cys+Met (Translated Protein)
1.1 %Cys     (Mature Protein)
2.2 %Met     (Mature Protein)
3.4 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MNKSMLAGIGIGVAAALGVAAVASLNVFERGPQYAQVVSATPIKETVKTPRQECRNVTVT
CCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHCCCCHHHHHHHHHHHHCCCEEE
HRRPVQDENRITGSVLGAVAGGVIGHQFGGGRGKDVATVVGALGGGYAGNQIQGSLQESD
CCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCHHCCC
TYTTTQQRCKTVYDKSEKMLGYDVTYKIGDQQGKIRMDRDPGTQIPLDSNGQLILNNKV
CCHHHHHHHHHHHHCCCCEECEEEEEEECCCCCCEEECCCCCCCCCCCCCCCEEECCCH
>Mature Secondary Structure
MNKSMLAGIGIGVAAALGVAAVASLNVFERGPQYAQVVSATPIKETVKTPRQECRNVTVT
CCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHCCCCHHHHHHHHHHHHCCCEEE
HRRPVQDENRITGSVLGAVAGGVIGHQFGGGRGKDVATVVGALGGGYAGNQIQGSLQESD
CCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCHHCCC
TYTTTQQRCKTVYDKSEKMLGYDVTYKIGDQQGKIRMDRDPGTQIPLDSNGQLILNNKV
CCHHHHHHHHHHHHCCCCEECEEEEEEECCCCCCEEECCCCCCCCCCCCCCCEEECCCH

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 7.0

TargetDB status: NA

Availability: NA

References: 12471157