Definition Escherichia coli 55989, complete genome.
Accession NC_011748
Length 5,154,862

The map label for this gene is yagU

Identifier: 218693750

GI number: 218693750

Start: 317383

End: 317997

Strand: Direct

Name: yagU

Synonym: EC55989_0292

Alternate gene names: 218693750

Gene position: 317383-317997 (Clockwise)

Preceding gene: 218693744

Following gene: 218693757

Centisome position: 6.16
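
The gene length and centisome figure can be checked from the coordinates above. This is a minimal sketch, assuming 1-based inclusive coordinates and assuming the centisome is the start position expressed as a percentage of the genome length; the record does not state its formula, and the function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(317383, 317997))           # 615
print(round(centisome(317383, 5154862), 2))  # 6.16
```

Both values agree with the 615-base gene sequence and the centisome position listed in this record.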

GC content: 45.2%

Gene sequence:

>615_bases
ATGAATATATTTGAACAAACTCCACCGAACCGCAGACGTTATGGTCTTGCTGCATTCATTGGGCTGATTGCTGGCGTTGT
TTCCGCATTCGTGAAGTGGGGGGCTGAAGTTCCATTGCCGCCACGTAGCCCGGTGGATATGTTTAATGCAGCGTGTGGCC
CGGAATCATTAATCAGGGCTGCAGGCCAAATTGATTGCTCGCGTAATTTTCTCAATCCACCGTATATTTTTCTTCGAGAC
TGGTTGGGGCTGACAGATCCCAATGCGGCTGTTTATACCTTTGCCGGGCATGTCTTTAACTGGGTTGGTGTTACGCACAT
TATCTTTTCGATAGTGTTTGCTGTCGGTTATTGTGTGGTCGCTGAAGTATTTCCAAAAATTAAACTCTGGCAGGGCTTAC
TGGCAGGTGCTTTAGCCCAACTTTTTGTTCATATGATTTCATTCCCTCTCATGGGACTGACGCCACCTCTGTTTGATCTC
CCGTGGTATGAGAATGTTTCTGAAATTTTTGGACATTTAGTCTGGTTCTGGTCTATTGAAATTATTCGCAGAGATTTACG
AAACAGAATTACTCATGAGCCAGACCCTGAGATCCCTTTAGGCTCAAACAGATAA
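
A GC content figure like the one reported for this gene can be recomputed from a sequence block such as the one above. The sketch below assumes GC content means the percentage of G and C bases over the full sequence length; `gc_content` is a hypothetical helper name, not part of any tool behind this record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    # Drop line breaks and spaces so a multi-line FASTA body can be pasted in.
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return 100.0 * gc / len(seq)


# First 15 bases of the gene sequence, used only as a short illustration.
print(round(gc_content("ATGAATATATTTGAA"), 1))  # 13.3
```

Passing the complete 615-base sequence in the same way gives the value for the whole gene.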

Upstream 100 bases:

>100_bases
AACCATCTTGTTATACAAAACAATACAGTTCTTTACATTTGCCTTGTTTTATGAATACTCCTGAAGAGGTGTATAACATA
ATGGTACAAGCAGGGTAGAT

Downstream 100 bases:

>100_bases
TGCATTGAATGATAAAAATGGCGCAAATACAGCGCCATTTTTATAGGTTAAAAACATTGCTTTTTATATTCTGATTCAGA
TAGTCAGTGAGTATATCGCG

Product: conserved hypothetical protein; putative inner membrane protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 204; Mature: 204

Protein sequence:

>204_residues
MNIFEQTPPNRRRYGLAAFIGLIAGVVSAFVKWGAEVPLPPRSPVDMFNAACGPESLIRAAGQIDCSRNFLNPPYIFLRD
WLGLTDPNAAVYTFAGHVFNWVGVTHIIFSIVFAVGYCVVAEVFPKIKLWQGLLAGALAQLFVHMISFPLMGLTPPLFDL
PWYENVSEIFGHLVWFWSIEIIRRDLRNRITHEPDPEIPLGSNR
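
The protein sequence corresponds to the gene sequence read codon by codon: 615 bases make 205 codons, the last of which is a stop, leaving 204 residues. The sketch below assumes the standard codon assignments and stops at the first stop codon; `translate` is an illustrative helper, not the tool used to build this record.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGAATATATTTGAACAAACTCCACCG"))  # MNIFEQTPP (first 9 codons)
print(translate("GGCTCAAACAGATAA"))              # GSNR (last 5 codons, incl. stop)
```

The two fragments are the first and last codons of the gene sequence in this record, and the output matches the start and end of the protein sequence.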

Sequences:

>Translated_204_residues
MNIFEQTPPNRRRYGLAAFIGLIAGVVSAFVKWGAEVPLPPRSPVDMFNAACGPESLIRAAGQIDCSRNFLNPPYIFLRD
WLGLTDPNAAVYTFAGHVFNWVGVTHIIFSIVFAVGYCVVAEVFPKIKLWQGLLAGALAQLFVHMISFPLMGLTPPLFDL
PWYENVSEIFGHLVWFWSIEIIRRDLRNRITHEPDPEIPLGSNR
>Mature_204_residues
MNIFEQTPPNRRRYGLAAFIGLIAGVVSAFVKWGAEVPLPPRSPVDMFNAACGPESLIRAAGQIDCSRNFLNPPYIFLRD
WLGLTDPNAAVYTFAGHVFNWVGVTHIIFSIVFAVGYCVVAEVFPKIKLWQGLLAGALAQLFVHMISFPLMGLTPPLFDL
PWYENVSEIFGHLVWFWSIEIIRRDLRNRITHEPDPEIPLGSNR

Specific function: Unknown

COG id: COG3477

COG function: function code S; Predicted periplasmic/secreted protein

Gene ontology:

Cell location: Cell inner membrane; Multi-pass membrane protein

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

Organism=Escherichia coli, GI1786481, Length=204, Percent_Identity=100, Blast_Score=416, Evalue=1e-118
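
A homologue entry in this comma-separated key=value layout can be read into a dictionary for further processing. This is a rough sketch under the assumption that every entry follows the layout of the line above, including the bare `GI…` token without an equals sign; `parse_homologue` is a hypothetical helper.

```python
def parse_homologue(line: str) -> dict:
    """Split one homologue entry into a field dictionary (values kept as strings)."""
    fields = {}
    for token in line.split(","):
        token = token.strip()
        if not token:
            continue  # tolerate a trailing comma
        if "=" in token:
            key, value = token.split("=", 1)
            fields[key] = value
        elif token.startswith("GI"):
            fields["GI"] = token[2:]  # bare token such as GI1786481
    return fields


hit = parse_homologue(
    "Organism=Escherichia coli, GI1786481, Length=204, "
    "Percent_Identity=100, Blast_Score=416, Evalue=1e-118,"
)
print(hit["Organism"], hit["GI"], int(hit["Length"]), float(hit["Evalue"]))
```

Values are left as strings so the caller decides which fields to convert to numbers.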

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): YAGU_ECO57 (P0AAA2)

Other databases:

- EMBL:   AE005174
- EMBL:   BA000007
- PIR:   E90668
- PIR:   H85518
- RefSeq:   NP_286004.1
- RefSeq:   NP_308344.1
- ProteinModelPortal:   P0AAA2
- EnsemblBacteria:   EBESCT00000025735
- EnsemblBacteria:   EBESCT00000055696
- GeneID:   914416
- GeneID:   957141
- GenomeReviews:   AE005174_GR
- GenomeReviews:   BA000007_GR
- KEGG:   ece:Z0353
- KEGG:   ecs:ECs0317
- GeneTree:   EBGT00050000011002
- HOGENOM:   HBG515007
- OMA:   SSFVKWG
- ProtClustDB:   CLSK879613
- BioCyc:   ECOL83334:ECS0317-MONOMER
- InterPro:   IPR009898

Pfam domain/function: PF07274 DUF1440

EC number: NA

Molecular weight (Da): Translated: 22967; Mature: 22967

Theoretical pI: Translated: 7.04; Mature: 7.04

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

Three entries listed; coordinates unavailable

Cys/Met content:

1.5 %Cys     (Translated Protein)
2.0 %Met     (Translated Protein)
3.4 %Cys+Met (Translated Protein)
1.5 %Cys     (Mature Protein)
2.0 %Met     (Mature Protein)
3.4 %Cys+Met (Mature Protein)
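
The Cys/Met percentages can be reproduced by counting residues in the 204-residue protein sequence from this record. This is a minimal sketch, assuming each percentage is the residue count divided by the sequence length; `residue_percent` is an illustrative name.

```python
# Protein sequence as given in this record (translated and mature forms are identical).
PROTEIN = (
    "MNIFEQTPPNRRRYGLAAFIGLIAGVVSAFVKWGAEVPLPPRSPVDMFNAACGPESLIRAAGQIDCSRNFLNPPYIFLRD"
    "WLGLTDPNAAVYTFAGHVFNWVGVTHIIFSIVFAVGYCVVAEVFPKIKLWQGLLAGALAQLFVHMISFPLMGLTPPLFDL"
    "PWYENVSEIFGHLVWFWSIEIIRRDLRNRITHEPDPEIPLGSNR"
)


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the sequence made up of any of the given one-letter residues."""
    count = sum(protein.count(r) for r in residues)
    return 100.0 * count / len(protein)


print(round(residue_percent(PROTEIN, "C"), 1))   # 1.5  (3 of 204)
print(round(residue_percent(PROTEIN, "M"), 1))   # 2.0  (4 of 204)
print(round(residue_percent(PROTEIN, "CM"), 1))  # 3.4  (7 of 204)
```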

Secondary structure:

>Translated Secondary Structure
MNIFEQTPPNRRRYGLAAFIGLIAGVVSAFVKWGAEVPLPPRSPVDMFNAACGPESLIRA
CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHCCCHHHHHHH
AGQIDCSRNFLNPPYIFLRDWLGLTDPNAAVYTFAGHVFNWVGVTHIIFSIVFAVGYCVV
HHCCCCCCCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
AEVFPKIKLWQGLLAGALAQLFVHMISFPLMGLTPPLFDLPWYENVSEIFGHLVWFWSIE
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHH
IIRRDLRNRITHEPDPEIPLGSNR
HHHHHHHHHCCCCCCCCCCCCCCH
>Mature Secondary Structure
MNIFEQTPPNRRRYGLAAFIGLIAGVVSAFVKWGAEVPLPPRSPVDMFNAACGPESLIRA
CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHCCCHHHHHHH
AGQIDCSRNFLNPPYIFLRDWLGLTDPNAAVYTFAGHVFNWVGVTHIIFSIVFAVGYCVV
HHCCCCCCCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
AEVFPKIKLWQGLLAGALAQLFVHMISFPLMGLTPPLFDLPWYENVSEIFGHLVWFWSIE
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHH
IIRRDLRNRITHEPDPEIPLGSNR
HHHHHHHHHCCCCCCCCCCCCCCH
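
The prediction rows above pair each residue with a state letter. Assuming H marks helix and C marks coil, which is the usual convention but is not stated in this record, the share of each state can be tallied as in the sketch below; `state_fractions` is a hypothetical helper.

```python
from collections import Counter


def state_fractions(prediction: str) -> dict:
    """Fraction of positions assigned to each secondary-structure state letter."""
    prediction = "".join(prediction.split())  # join rows pasted on separate lines
    if not prediction:
        raise ValueError("empty prediction string")
    counts = Counter(prediction)
    return {state: n / len(prediction) for state, n in counts.items()}


# Final row of the prediction above, used as a short illustration.
fractions = state_fractions("HHHHHHHHHCCCCCCCCCCCCCCH")
print({state: round(value, 2) for state, value in fractions.items()})
```

Joining all four prediction rows and passing them in one call gives the fractions for the whole protein.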

PDB accession: NA

Resolution: NA

Structure class: Alpha

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 7.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796