Definition Escherichia fergusonii ATCC 35469 chromosome, complete genome.
Accession NC_011740
Length 4,588,711

The map label for this gene is ydiU

Identifier: 218548721

GI number: 218548721

Start: 1392186

End: 1393628

Strand: Direct

Name: ydiU

Synonym: EFER_1358

Alternate gene names: 218548721

Gene position: 1392186-1393628 (Clockwise)

Preceding gene: 218548720

Following gene: 218548725

Centisome position: 30.34
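
The coordinate fields above can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive, and that the centisome position is the gene start expressed as a percentage of the chromosome length. The record does not state either convention, but both reproduce its values. The function names are illustrative.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the chromosome length (assumed definition)."""
    return 100.0 * start / genome_length


# Values taken from the Start, End and Length fields of this record.
length = gene_length(1392186, 1393628)
position = round(centisome(1392186, 4588711), 2)
print(length, position)  # 1443 30.34
```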

GC content: 49.62
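
A GC content figure of this kind is normally the percentage of G and C bases in the gene sequence. The following is a minimal sketch of that calculation, assuming the sequence is supplied as a FASTA-style block like the one in this record (a ">" header followed by wrapped lines). The helper names are hypothetical and the example input is a toy sequence, not this gene.

```python
def fasta_body(text: str) -> str:
    """Join the wrapped sequence lines of one FASTA-style record, skipping the header."""
    lines = (line.strip() for line in text.splitlines())
    return "".join(line for line in lines if line and not line.startswith(">"))


def gc_content(seq: str) -> float:
    """Percentage of G and C among all characters of the sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# Toy record: 6 of the 8 bases are G or C.
record = ">8_bases\nATGC\nGGCC\n"
print(gc_content(fasta_body(record)))  # 75.0
```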

Gene sequence:

>1443_bases
ATGACCCTGTCTTTTACCACGCGCTGGCGTGACGAATTACCAGCAACCTGGACCGCTCTTAACCCGACGCCATTGCATAA
TGCGCGGCTTATCTGGCATAACGCGGAACTGGCGCACGAACTGGCGATCCCACAATCCCTTTTTGCTGATAACAAAGGCG
CTGGTGTCTGGGGCGGTGAGGCATTACTTCCCGGAATGTCACCATTGGCGCAAGTCTACAGTGGGCATCAGTTTGGTGTC
TGGGCCGGGCAGTTAGGCGATGGCCGAGGGATTTTGTTAGGCGAACAATTATTAGCAGATGGCACAACCATGGACTGGCA
TCTGAAAGGTGCGGGGCTAACGCCTTATTCACGTATGGGCGATGGGCGTGCAGTGCTGCGTTCGACCATCCGCGAAAGCC
TGGCGAGTGAAGCGATGCATTATTTGGGTATTCCGGGCACCCGCTCCCTGGCTATTGTCACCAGCGATACCCCGGTTTAC
CGTGAGACCACTGAGACGGGAGCGATGCTGATGCGTTTGGCGCAAAGCCATATGCGCTTCGGTCATTTTGAGCATTTCTA
CTATCGGCGTGATATTGAGAAAGTACAACTTTTGGCGGATTTCGCTATTCGTCACTACTGGCCTCACTTGCAAGAAGAAC
AGGATAAATATGCGATCTGGTTTCGTGATGTCGTCGCGCGGACGGCTTCACTTATTGCTGGCTGGCAAACGGTGGGATTT
GCGCATGGTGTTATGAATACTGACAACATGTCGATAATGGGTCTGACGCTTGATTACGGGCCATTTGGTTTTCTTGATGA
TTATAACCCGCAATTTATTTGTAACCATTCTGACCATCAGGGGCGCTATAGTTTTGATAATCAACCAGCGGTAGCGTTGT
GGAATTTACAAAGGCTGGCCCAAACCTTGTCACCTTTTATTGCTGTTAATGCGTTAAATGATGCCCTGGACAGCTACAAG
CAGGTGTTATTAGCCGTGTATGGTAAACGGATGCGCCAGAAACTGGGGTTCTATACAGAACAAAACAATGACAACGATTT
ATTAAATGAACTGTTTGCCTTAATGGCACGTGAAGGTAGCGATTATACCCGCACATTCCGGATGCTAAGCCAGACTGAGC
AGAACAGTGCGTCATCGCCGTTACGTGATGAATTTATTGATCGTGCAGCATTTGATAGCTGGTTTAGCCGTTATCGTGCA
CGGATACAAACAGAGCAGGTTACGGATGATGAGCGTCAGCTACAGATGAAAAGCGTCAATCCAGCGGTTGTGTTACGTAA
CTGGCTGGCCCAAAGAGCAATCAACGATGCACAGAAAGGAGATATGGAGGAACTGCATCGATTGCATGACGTATTGCGTA
ATCCCTTCAACGATCGTGATGATGATTACTCCCGCCGTCCTCCTGAATGGGGTAAACGGCTGGAAGTCAGTTGTTCGAGC
TAA

Upstream 100 bases:

>100_bases
TCGCGACGAAGCACGTTACCACGTTGGTACAGTAGTAATCCTCCTGTTTGTGGGTATTTAAGAGAGCATTTCCCCTCTAC
ACTATCAGACAGGAGGAGCT

Downstream 100 bases:

>100_bases
ACTACTTTGTTAGCAATAACTTACCTGCCTGCGTTTTACGCAGCAGGTATTCCTGACCGTCATGATCGATAATGACTTTA
CCTTCCGGGCCCAACAACGT

Product: hypothetical protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 480; Mature: 479
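
The two residue counts follow from the gene length. The 1443-base sequence ends in a TAA stop codon, and the mature sequence listed in this record is the translated one without its initial methionine. A small sketch of that arithmetic follows; the function name is illustrative, and the removal of exactly one N-terminal residue is taken from the sequences shown here rather than from any general rule.

```python
def protein_lengths(cds_bases: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a CDS that includes its stop codon."""
    if cds_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    translated = cds_bases // 3 - 1  # drop the stop codon
    mature = translated - 1          # drop the initiator Met (as in this record)
    return translated, mature


print(protein_lengths(1443))  # (480, 479)
```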

Protein sequence:

>480_residues
MTLSFTTRWRDELPATWTALNPTPLHNARLIWHNAELAHELAIPQSLFADNKGAGVWGGEALLPGMSPLAQVYSGHQFGV
WAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRSTIRESLASEAMHYLGIPGTRSLAIVTSDTPVY
RETTETGAMLMRLAQSHMRFGHFEHFYYRRDIEKVQLLADFAIRHYWPHLQEEQDKYAIWFRDVVARTASLIAGWQTVGF
AHGVMNTDNMSIMGLTLDYGPFGFLDDYNPQFICNHSDHQGRYSFDNQPAVALWNLQRLAQTLSPFIAVNALNDALDSYK
QVLLAVYGKRMRQKLGFYTEQNNDNDLLNELFALMAREGSDYTRTFRMLSQTEQNSASSPLRDEFIDRAAFDSWFSRYRA
RIQTEQVTDDERQLQMKSVNPAVVLRNWLAQRAINDAQKGDMEELHRLHDVLRNPFNDRDDDYSRRPPEWGKRLEVSCSS

Sequences:

>Translated_480_residues
MTLSFTTRWRDELPATWTALNPTPLHNARLIWHNAELAHELAIPQSLFADNKGAGVWGGEALLPGMSPLAQVYSGHQFGV
WAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRSTIRESLASEAMHYLGIPGTRSLAIVTSDTPVY
RETTETGAMLMRLAQSHMRFGHFEHFYYRRDIEKVQLLADFAIRHYWPHLQEEQDKYAIWFRDVVARTASLIAGWQTVGF
AHGVMNTDNMSIMGLTLDYGPFGFLDDYNPQFICNHSDHQGRYSFDNQPAVALWNLQRLAQTLSPFIAVNALNDALDSYK
QVLLAVYGKRMRQKLGFYTEQNNDNDLLNELFALMAREGSDYTRTFRMLSQTEQNSASSPLRDEFIDRAAFDSWFSRYRA
RIQTEQVTDDERQLQMKSVNPAVVLRNWLAQRAINDAQKGDMEELHRLHDVLRNPFNDRDDDYSRRPPEWGKRLEVSCSS
>Mature_479_residues
TLSFTTRWRDELPATWTALNPTPLHNARLIWHNAELAHELAIPQSLFADNKGAGVWGGEALLPGMSPLAQVYSGHQFGVW
AGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRSTIRESLASEAMHYLGIPGTRSLAIVTSDTPVYR
ETTETGAMLMRLAQSHMRFGHFEHFYYRRDIEKVQLLADFAIRHYWPHLQEEQDKYAIWFRDVVARTASLIAGWQTVGFA
HGVMNTDNMSIMGLTLDYGPFGFLDDYNPQFICNHSDHQGRYSFDNQPAVALWNLQRLAQTLSPFIAVNALNDALDSYKQ
VLLAVYGKRMRQKLGFYTEQNNDNDLLNELFALMAREGSDYTRTFRMLSQTEQNSASSPLRDEFIDRAAFDSWFSRYRAR
IQTEQVTDDERQLQMKSVNPAVVLRNWLAQRAINDAQKGDMEELHRLHDVLRNPFNDRDDDYSRRPPEWGKRLEVSCSS

Specific function: Unknown

COG id: COG0397

COG function: function code S; Uncharacterized conserved protein

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the UPF0061 (SELO) family

Homologues:

Organism=Homo sapiens, GI32880229, Length=404, Percent_Identity=41.8316831683168, Blast_Score=290, Evalue=2e-78
Organism=Escherichia coli, GI1787999, Length=480, Percent_Identity=83.5416666666667, Blast_Score=842, Evalue=0.0
Organism=Saccharomyces cerevisiae, GI6325034, Length=388, Percent_Identity=35.5670103092784, Blast_Score=220, Evalue=4e-58
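
Each homologue entry is a comma-separated list of key=value pairs, with the GI number given as a bare token. The sketch below shows one way such a line might be parsed into a dictionary. The function name and the choice of numeric types are assumptions made for illustration, and a trailing comma is tolerated.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=...' line into a dict."""
    converters = {
        "Length": int,
        "Percent_Identity": float,
        "Blast_Score": int,
        "Evalue": float,
    }
    fields = {}
    for token in line.strip().rstrip(",").split(","):
        token = token.strip()
        if not token:
            continue
        if "=" in token:
            key, value = token.split("=", 1)
            fields[key] = converters.get(key, str)(value)
        elif token.startswith("GI"):
            # The GI number appears without a key; keep it as a string.
            fields["GI"] = token[2:]
    return fields


hit = parse_homologue(
    "Organism=Homo sapiens, GI32880229, Length=404, "
    "Percent_Identity=41.8316831683168, Blast_Score=290, Evalue=2e-78,"
)
print(hit["Organism"], hit["GI"], hit["Length"], hit["Evalue"])
```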

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): YDIU_ESCF3 (B7LQ82)

Other databases:

- EMBL:   CU928158
- RefSeq:   YP_002382512.1
- EnsemblBacteria:   EBESCT00000121720
- GeneID:   7120344
- GenomeReviews:   CU928158_GR
- KEGG:   efe:EFER_1358
- GeneTree:   EBGT00050000010410
- HOGENOM:   HBG683993
- ProtClustDB:   PRK00029
- BioCyc:   EFER585054:EFER_1358-MONOMER
- HAMAP:   MF_00692
- InterPro:   IPR003846

Pfam domain/function: PF02696 UPF0061

EC number: NA

Molecular weight: Translated: 54803; Mature: 54672

Theoretical pI: Translated: 6.31; Mature: 6.31

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.4 %Cys     (Translated Protein)
3.3 %Met     (Translated Protein)
3.8 %Cys+Met (Translated Protein)
0.4 %Cys     (Mature Protein)
3.1 %Met     (Mature Protein)
3.5 %Cys+Met (Mature Protein)
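
Percentages like these can be obtained by counting residues in the one-letter protein sequence and dividing by its length. A minimal sketch follows, assuming rounding to one decimal place as displayed above. The function name is hypothetical and the example uses a toy peptide rather than this protein.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of any of the given one-letter residues."""
    protein = protein.upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in set(residues.upper()))
    return round(100.0 * count / len(protein), 1)


toy = "MACA"  # toy peptide: one Met and one Cys in four residues
print(residue_percent(toy, "C"))   # 25.0
print(residue_percent(toy, "M"))   # 25.0
print(residue_percent(toy, "CM"))  # 50.0
```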

Secondary structure:

>Translated Secondary Structure
MTLSFTTRWRDELPATWTALNPTPLHNARLIWHNAELAHELAIPQSLFADNKGAGVWGGE
CCCCCCCCHHHCCCCEEEECCCCCCCCCEEEEECHHHHHHHHCCHHHHCCCCCCCCCCCC
ALLPGMSPLAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMG
HHCCCCHHHHHHHCCCCCCEEECCCCCCCEEEECHHHHHCCCCEEEEEECCCCCCHHHCC
DGRAVLRSTIRESLASEAMHYLGIPGTRSLAIVTSDTPVYRETTETGAMLMRLAQSHMRF
CCHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHH
GHFEHFYYRRDIEKVQLLADFAIRHYWPHLQEEQDKYAIWFRDVVARTASLIAGWQTVGF
CCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH
AHGVMNTDNMSIMGLTLDYGPFGFLDDYNPQFICNHSDHQGRYSFDNQPAVALWNLQRLA
HHCCCCCCCCEEEEEEECCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCHHHHHHHHHH
QTLSPFIAVNALNDALDSYKQVLLAVYGKRMRQKLGFYTEQNNDNDLLNELFALMAREGS
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEECCCCCHHHHHHHHHHHHHCCC
DYTRTFRMLSQTEQNSASSPLRDEFIDRAAFDSWFSRYRARIQTEQVTDDERQLQMKSVN
HHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCC
PAVVLRNWLAQRAINDAQKGDMEELHRLHDVLRNPFNDRDDDYSRRPPEWGKRLEVSCSS
HHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCEEEEEECC
>Mature Secondary Structure 
TLSFTTRWRDELPATWTALNPTPLHNARLIWHNAELAHELAIPQSLFADNKGAGVWGGE
CCCCCCCHHHCCCCEEEECCCCCCCCCEEEEECHHHHHHHHCCHHHHCCCCCCCCCCCC
ALLPGMSPLAQVYSGHQFGVWAGQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMG
HHCCCCHHHHHHHCCCCCCEEECCCCCCCEEEECHHHHHCCCCEEEEEECCCCCCHHHCC
DGRAVLRSTIRESLASEAMHYLGIPGTRSLAIVTSDTPVYRETTETGAMLMRLAQSHMRF
CCHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHH
GHFEHFYYRRDIEKVQLLADFAIRHYWPHLQEEQDKYAIWFRDVVARTASLIAGWQTVGF
CCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH
AHGVMNTDNMSIMGLTLDYGPFGFLDDYNPQFICNHSDHQGRYSFDNQPAVALWNLQRLA
HHCCCCCCCCEEEEEEECCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCHHHHHHHHHH
QTLSPFIAVNALNDALDSYKQVLLAVYGKRMRQKLGFYTEQNNDNDLLNELFALMAREGS
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEECCCCCHHHHHHHHHHHHHCCC
DYTRTFRMLSQTEQNSASSPLRDEFIDRAAFDSWFSRYRARIQTEQVTDDERQLQMKSVN
HHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCC
PAVVLRNWLAQRAINDAQKGDMEELHRLHDVLRNPFNDRDDDYSRRPPEWGKRLEVSCSS
HHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCEEEEEECC
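
The prediction strings above pair each residue with a one-letter state. Assuming the common three-state notation (H for helix, E for strand, C for coil), which the record itself does not define, the composition of a prediction could be summarized as in this sketch. The function name is illustrative and the example string is a toy input.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of H, E and C states in a secondary structure string."""
    if not states:
        raise ValueError("empty prediction string")
    counts = Counter(states.upper())
    total = len(states)
    return {s: round(100.0 * counts.get(s, 0) / total, 1) for s in "HEC"}


print(ss_composition("HHEC"))  # {'H': 50.0, 'E': 25.0, 'C': 25.0}
```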

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA