Definition | Escherichia coli HS, complete genome. |
---|---|
Accession | NC_009800 |
Length | 4,643,538 |
Click here to switch to the map view.
The map label for this gene is ycbK
Identifier: 157160447
GI number: 157160447
Start: 1043413
End: 1043961
Strand: Direct
Name: ycbK
Synonym: EcHS_A1033
Alternate gene names: 157160447
Gene position: 1043413-1043961 (Clockwise)
Preceding gene: 157160446
Following gene: 157160448
Centisome position: 22.47
GC content: 49.91
Gene sequence:
>549_bases ATGGACAAATTCGACGCTAATCGCCGCAAATTGCTGGCGCTTGGTGGCGTTGCACTCGGTGCCGCCATCCTGCCGACCCC TGCGTTTGCAACACTCTCTACCCCACGCCCGCGCATTTTGACACTCAATAATCTTCATACCGGAGAGTCAATCAAAGCGG AGTTTTTCGATGGCAGAGGCTATATTCAGGAAGAATTGGCAAAACTTAACCATTTTTTCCGCGATTACCGCGCGAACAAA ATAAAGTCCATCGACCCAGGATTATTCGACCAGTTGTATCGCCTGCAAGGGTTGTTAGGCACGCGCAAACCGGTGCAACT CATTTCCGGTTATCGTTCTATTGATACCAACAATGAACTACGCGCCCGCAGCCGTGGAGTAGCGAAGAAAAGCTATCACA CTAAAGGCCAGGCGATGGATTTCCATATTGAAGGTATCGCGTTAAGCAATATTCGCAAAGCCGCGTTATCTATGCGCGCA GGTGGTGTAGGATATTACCCACGTAGTAACTTTGTGCATATTGATACCGGGCCAGCACGGCACTGGTAG
Upstream 100 bases:
>100_bases GGTTGAGTCATCTTGACGTCTGCTTTACGGGCGGTTAAGGTGCCTCTTGTGCGCCAGAAGTGCATATAAACGATAACATT GACCTGTAGACTTGATTATC
Downstream 100 bases:
>100_bases CAATCGCTTAACGAAACAGGGGCAGTATGAACTATCGTATTATTCCGGTCACCGCATTCTCCCAGAACTGTTCATTAATC TGGTGTGAACAAACCCGTCT
Product: Tat pathway signal sequence domain-containing protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 182; Mature: 182
Protein sequence:
>182_residues MDKFDANRRKLLALGGVALGAAILPTPAFATLSTPRPRILTLNNLHTGESIKAEFFDGRGYIQEELAKLNHFFRDYRANK IKSIDPGLFDQLYRLQGLLGTRKPVQLISGYRSIDTNNELRARSRGVAKKSYHTKGQAMDFHIEGIALSNIRKAALSMRA GGVGYYPRSNFVHIDTGPARHW
Sequences:
>Translated_182_residues MDKFDANRRKLLALGGVALGAAILPTPAFATLSTPRPRILTLNNLHTGESIKAEFFDGRGYIQEELAKLNHFFRDYRANK IKSIDPGLFDQLYRLQGLLGTRKPVQLISGYRSIDTNNELRARSRGVAKKSYHTKGQAMDFHIEGIALSNIRKAALSMRA GGVGYYPRSNFVHIDTGPARHW >Mature_182_residues MDKFDANRRKLLALGGVALGAAILPTPAFATLSTPRPRILTLNNLHTGESIKAEFFDGRGYIQEELAKLNHFFRDYRANK IKSIDPGLFDQLYRLQGLLGTRKPVQLISGYRSIDTNNELRARSRGVAKKSYHTKGQAMDFHIEGIALSNIRKAALSMRA GGVGYYPRSNFVHIDTGPARHW
Specific function: Unknown
COG id: COG3108
COG function: function code S; Uncharacterized protein conserved in bacteria
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: To H.influenzae HI_1666
Homologues:
Organism=Escherichia coli, GI1787157, Length=182, Percent_Identity=100, Blast_Score=373, Evalue=1e-105,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): YCBK_ECO57 (P0AB08)
Other databases:
- EMBL: AE005174 - EMBL: BA000007 - PIR: A90755 - RefSeq: NP_286801.1 - RefSeq: NP_309036.1 - ProteinModelPortal: P0AB08 - SMR: P0AB08 - EnsemblBacteria: EBESCT00000023949 - EnsemblBacteria: EBESCT00000060151 - GeneID: 917754 - GeneID: 958899 - GenomeReviews: AE005174_GR - GenomeReviews: BA000007_GR - KEGG: ece:Z1273 - KEGG: ecs:ECs1009 - GeneTree: EBGT00050000011500 - HOGENOM: HBG613277 - OMA: VAKHSYH - ProtClustDB: CLSK879849 - BioCyc: ECOL83334:ECS1009-MONOMER - InterPro: IPR010275 - InterPro: IPR009045 - InterPro: IPR006311 - Gene3D: G3DSA:3.30.1380.10 - TIGRFAMs: TIGR01409
Pfam domain/function: PF05951 Peptidase_M15_2; SSF55166 Hedgehog_sig_N
EC number: NA
Molecular weight: Translated: 20355; Mature: 20355
Theoretical pI: Translated: 10.84; Mature: 10.84
Prosite motif: PS51318 TAT
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.0 %Cys (Translated Protein) 1.6 %Met (Translated Protein) 1.6 %Cys+Met (Translated Protein) 0.0 %Cys (Mature Protein) 1.6 %Met (Mature Protein) 1.6 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MDKFDANRRKLLALGGVALGAAILPTPAFATLSTPRPRILTLNNLHTGESIKAEFFDGRG CCCCCCCCCEEEEECCHHHHHHHCCCCCEEECCCCCCEEEEECCCCCCCCCEEEEECCCC YIQEELAKLNHFFRDYRANKIKSIDPGLFDQLYRLQGLLGTRKPVQLISGYRSIDTNNEL HHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHHHCHHCCCCCCHH RARSRGVAKKSYHTKGQAMDFHIEGIALSNIRKAALSMRAGGVGYYPRSNFVHIDTGPAR HHHHCCCHHHHCCCCCCEEEEEECCEEHHHHHHHHHHHHCCCCCCCCCCCEEEEECCCCC HW CC >Mature Secondary Structure MDKFDANRRKLLALGGVALGAAILPTPAFATLSTPRPRILTLNNLHTGESIKAEFFDGRG CCCCCCCCCEEEEECCHHHHHHHCCCCCEEECCCCCCEEEEECCCCCCCCCEEEEECCCC YIQEELAKLNHFFRDYRANKIKSIDPGLFDQLYRLQGLLGTRKPVQLISGYRSIDTNNEL HHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHHHCHHCCCCCCHH RARSRGVAKKSYHTKGQAMDFHIEGIALSNIRKAALSMRAGGVGYYPRSNFVHIDTGPAR HHHHCCCHHHHCCCCCCEEEEEECCEEHHHHHHHHHHHHCCCCCCCCCCCEEEEECCCCC HW CC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796