Definition | Shigella flexneri 2a str. 2457T, complete genome. |
---|---|
Accession | NC_004741 |
Length | 4,599,354 |
Click here to switch to the map view.
The map label for this gene is yddW
Identifier: 30063218
GI number: 30063218
Start: 1811207
End: 1812526
Strand: Direct
Name: yddW
Synonym: S1869
Alternate gene names: 30063218
Gene position: 1811207-1812526 (Clockwise)
Preceding gene: 30063217
Following gene: 30063219
Centisome position: 39.38
GC content: 51.82
Gene sequence:
>1320_bases ATGGATATCTGCTCCCGAAACAAGAAATTAACGATTAGAAGACCAGCGATACTGGTTGCATTGGCACTTTTACTGTGTAG TTGTAAAAGCACGCCTCCAGAGTCCATGGTGACACCACCAGCAGGTTCAAAGCCACCAGCCACGACGCAACAATCGTCAC AACCGATGCGTGGCATCTGGCTGGCCACGGTTTCTCGCCTCGACTGGCCACCGGTTTCCTCGGTTAACATTAGTAATCCC ACCAGCCGAGCCCGTGTACAACAACAGGCGATGATCGACAAACTGGATCATCTCCAACGTCTCGGCATAAACACGGTCTT TTTCCAGGTCAAGCCGGACGGTACCGCCCTGTGGCCATCGAAAATTTTGCCGTGGTCCGATCTTATGACCGGTAAGATTG GTGAAAATCCGGGTTACGATCCGCTGCAATTCATGCTCGACGAAGCCCACAAGCGTGGGATGAAAGTACACGCCTGGTTT AACCCCTATCGCGTATCGGTTAATACGAAGCCCGGTACTATCAGGGAACTGAATAGCACTCTGTCTCAACAACCGGCGAG CGTCTATGTGCAACACCGTGACTGGATCAGAACGTCTGGCGATCGCTTTGTCCTCGACCCGGGCATCCCTGAGGTTCAGG ACTGGATCACATCAATAGTCGCTGAAGTGGTTTCCCGCTATCCGGTAGATGGCGTGCAGTTTGACGACTATTTCTATACT GAATCACCGGGTTCACGGCTAAATGATAACGAAACGTACCGTAAATACGGAGGCGCATTTGCGTCAAAAGCAGACTGGCG GCGCAACAATACTCAGCAGTTAATTGCAAAGGTATCACACACCATTAAAAGCATTAAGCCGGGAGTCGAATTTGGCGTTA GCCCGGCAGGCGTGTGGCGTAACCGATCACACGATCCGCTCGGTTCCGATACCCGAGGCGCGGCAGCCTATGACGAATCC TACGCAGACACCCGTCGATGGGTGGAACAAGGATTGCTGGATTACATTGCTCCCCAAATTTACTGGCCGTTCTCACGGAG TGCCGCGCGTTATGACGTGTTGGCAAAATGGTGGGCGGATGTCGTTAAACCAACCAGGACCCGCCTGTATATCGGTATCG CCTTCTATAAAGTGGGTGAACCTTCAAAGATAGAGCCAGACTGGATGATTAACGGCGGCGTACCGGAACTGAAAAAGCAG CTCGATCTTAACGATGCGGTACCAGAAATTAGCGGCACCATCTTGTTCCGTGAGGACTATCTGAATAAACCGCAGACTCA ACAAGCGGTCAGCTATCTGCAAAGTCGTTGGGGCAGTTAA
Upstream 100 bases:
>100_bases CGATAATTCATCGCTCCCTTTTTCGTGCTTGCTGTCGTATTGACTTAACCGGGATAAGTACGAGAATGAGCGCACATCTG TTTACCGGAAACCAGCACAT
Downstream 100 bases:
>100_bases ACTACCATGCTTACTTTGTAACAAGCCGGTGCTTTACCCACCGGCTTGTTACACCTTGTGAAATATCCCACTAGTAATCA TGCTTACATAAGTCAAATTA
Product: hypothetical protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 439; Mature: 439
Protein sequence:
>439_residues MDICSRNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNP TSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWF NPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYT ESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDES YADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQ LDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRWGS
Sequences:
>Translated_439_residues MDICSRNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNP TSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWF NPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYT ESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDES YADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQ LDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRWGS >Mature_439_residues MDICSRNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNP TSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWF NPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYT ESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDES YADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQ LDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRWGS
Specific function: Unknown
COG id: COG1649
COG function: function code S; Uncharacterized protein conserved in bacteria
Gene ontology:
Cell location: Cell membrane; Lipid-anchor (Potential)
Metaboloic importance: Non Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the UPF0748 family
Homologues:
Organism=Escherichia coli, GI1787767, Length=439, Percent_Identity=100, Blast_Score=905, Evalue=0.0,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): YDDW_ECO57 (P64427)
Other databases:
- EMBL: AE005174 - EMBL: BA000007 - PIR: H90890 - RefSeq: NP_287664.1 - RefSeq: NP_310123.1 - ProteinModelPortal: P64427 - EnsemblBacteria: EBESCT00000024320 - EnsemblBacteria: EBESCT00000055997 - GeneID: 917296 - GeneID: 960677 - GenomeReviews: AE005174_GR - GenomeReviews: BA000007_GR - KEGG: ece:Z2217 - KEGG: ecs:ECs2096 - GeneTree: EBGT00050000008957 - HOGENOM: HBG364398 - OMA: WRNIADD - ProtClustDB: CLSK880041 - BioCyc: ECOL83334:ECS2096-MONOMER - InterPro: IPR003790 - InterPro: IPR017853 - InterPro: IPR013781 - Gene3D: G3DSA:3.20.20.80
Pfam domain/function: PF02638 DUF187; SSF51445 Glyco_hydro_cat
EC number: NA
Molecular weight: Translated: 49575; Mature: 49575
Theoretical pI: Translated: 9.58; Mature: 9.58
Prosite motif: PS51257 PROKAR_LIPOPROTEIN; PS00013 PROKAR_LIPOPROTEIN
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.7 %Cys (Translated Protein) 1.8 %Met (Translated Protein) 2.5 %Cys+Met (Translated Protein) 0.7 %Cys (Mature Protein) 1.8 %Met (Mature Protein) 2.5 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MDICSRNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIW CCCCCCCCEEEEECHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHH LATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPS HHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCEECCC KILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNST CCCCHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCEEEEEECEEEEEEECCCCHHHHHHHH LSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYT HHCCCHHEEEEHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHHHHHCCCCCEEECCEEEE ESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWR CCCCCCCCCCHHHHHHCCCCCCCCHHHCCCHHHHHHHHHHHHHHHCCCHHHCCCCHHHCC NRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWAD CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHEECCCCCCCHHHHHHHHHHHH VVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDY HHCCCCEEEEEEEEEEECCCCCCCCCCEEECCCCHHHHHHCCCCCCCCCCCCEEEEEHHH LNKPQTQQAVSYLQSRWGS CCCCHHHHHHHHHHHHCCC >Mature Secondary Structure MDICSRNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIW CCCCCCCCEEEEECHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHH LATVSRLDWPPVSSVNISNPTSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPS HHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCEECCC KILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWFNPYRVSVNTKPGTIRELNST CCCCHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCEEEEEECEEEEEEECCCCHHHHHHHH LSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYT HHCCCHHEEEEHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHHHHHCCCCCEEECCEEEE ESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWR CCCCCCCCCCHHHHHHCCCCCCCCHHHCCCHHHHHHHHHHHHHHHCCCHHHCCCCHHHCC NRSHDPLGSDTRGAAAYDESYADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWAD CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHEECCCCCCCHHHHHHHHHHHH VVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQLDLNDAVPEISGTILFREDY HHCCCCEEEEEEEEEEECCCCCCCCCCEEECCCCHHHHHHCCCCCCCCCCCCEEEEEHHH LNKPQTQQAVSYLQSRWGS CCCCHHHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796