Definition Escherichia coli HS, complete genome.
Accession NC_009800
Length 4,643,538

Click here to switch to the map view.

The map label for this gene is yqiI [H]

Identifier: 157162522

GI number: 157162522

Start: 3234130

End: 3235194

Strand: Direct

Name: yqiI [H]

Synonym: EcHS_A3224

Alternate gene names: 157162522

Gene position: 3234130-3235194 (Clockwise)

Preceding gene: 157162521

Following gene: 157162524

Centisome position: 69.65

GC content: 40.66

Gene sequence:

>1065_bases
ATGCGCTACTTATTAACTGTAATTGCATTTATTATGGGTTTTAGTGCGTTACCTGTATGGGCGATGAACTGCTATGCTGA
ACATGAAGGGGGGAATACCGTAGTCATAGGCTACGTACCAAGAATTGCTATCCCCAGCGATGGTAAAAAAGGTGATAAAA
TCTGGCAAAGCAGTGAATATTTTATGAATGTCTTCTGTAATAATGCACTACCCGCCCCATCACCAGGAGAAGAATACCCA
TCTGCATGGGTAAATATAATGATGTTGTTAGCATCAGGTCAGGACTTTTATAATCAAAACTCATATACTCTTGGTGTAAC
CTATAATGGAGTCGATTATGATTCAACCTCCACACGAGCTATCGCAGCACCAGAGTGCATTGATGTTAAAGGCGCAGGGA
TATACAGTAATACCTATAAAAACCCCGCTGTCTGTAGCGGTGGTCCTGAACCTCAATTGTCAGTAACTTTTCCAGTACGT
GTACAGCTGTATATTAAGCTGGCTAAAAATGCTAATAAAGTAAATAAAAAACTTGTATTACCTGACGAATATATCGCCCT
AGAGTTTAAAGGTATGAGTGGCACAGGAACTATAGACACAGACAAAAATTTGACCTTCAGAATTCGTGGGTTAAATAATA
TTCATGTCCTTGACTGCTTTGTTAATGTTGATCTGGAACCAGCTGATGGCGTTGTCGACTTTGGTAAAATAAATTCCCGA
ACAATTAAAAATACCAGCGTGAGTGAGACGTTTAGCGTAGTCATGACCAAAGATCCGGGTGCGGCCTGTACTGAGCAGTT
TAATATTTTAGGGAGTTTTTTCACTACGGATATTTTGAGTGATTATAGCCATCTGGATATAGGTAATGGTCTGCTATTGA
AGATATTTCATAACGATGGAACAGCAACGGAATTTAACCGCTTCTCACAATTTGCTTCTTTTTCATCGTCTAGTGCGCCT
TCGGTCACCGCACCATTCAGGGCAGAACTGAGTGCGAACCCGGCAGAAACGGTTGTTGAGGGACCGTTTAGTAAAGACGT
AATCCTGAAAATCACCTATAACTAG

Upstream 100 bases:

>100_bases
GGTTATATCAATGATTTTGGTGGCTTAAGTTTTTATGAAATAAACTGCCCAACAGTGAATAATTCTTGTAATGTTTCCGT
AGCCAAACGAGATAAATAAA

Downstream 100 bases:

>100_bases
TATCTAATACAAACACTAAAACGGGCCATCAGGCCCGTTATATCAGTGCTCTAACTCCAGCTCTCTTGCCTGCATCATCT
GTTGACAATACCAGGCATAA

Product: hypothetical protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 354; Mature: 354

Protein sequence:

>354_residues
MRYLLTVIAFIMGFSALPVWAMNCYAEHEGGNTVVIGYVPRIAIPSDGKKGDKIWQSSEYFMNVFCNNALPAPSPGEEYP
SAWVNIMMLLASGQDFYNQNSYTLGVTYNGVDYDSTSTRAIAAPECIDVKGAGIYSNTYKNPAVCSGGPEPQLSVTFPVR
VQLYIKLAKNANKVNKKLVLPDEYIALEFKGMSGTGTIDTDKNLTFRIRGLNNIHVLDCFVNVDLEPADGVVDFGKINSR
TIKNTSVSETFSVVMTKDPGAACTEQFNILGSFFTTDILSDYSHLDIGNGLLLKIFHNDGTATEFNRFSQFASFSSSSAP
SVTAPFRAELSANPAETVVEGPFSKDVILKITYN

Sequences:

>Translated_354_residues
MRYLLTVIAFIMGFSALPVWAMNCYAEHEGGNTVVIGYVPRIAIPSDGKKGDKIWQSSEYFMNVFCNNALPAPSPGEEYP
SAWVNIMMLLASGQDFYNQNSYTLGVTYNGVDYDSTSTRAIAAPECIDVKGAGIYSNTYKNPAVCSGGPEPQLSVTFPVR
VQLYIKLAKNANKVNKKLVLPDEYIALEFKGMSGTGTIDTDKNLTFRIRGLNNIHVLDCFVNVDLEPADGVVDFGKINSR
TIKNTSVSETFSVVMTKDPGAACTEQFNILGSFFTTDILSDYSHLDIGNGLLLKIFHNDGTATEFNRFSQFASFSSSSAP
SVTAPFRAELSANPAETVVEGPFSKDVILKITYN
>Mature_354_residues
MRYLLTVIAFIMGFSALPVWAMNCYAEHEGGNTVVIGYVPRIAIPSDGKKGDKIWQSSEYFMNVFCNNALPAPSPGEEYP
SAWVNIMMLLASGQDFYNQNSYTLGVTYNGVDYDSTSTRAIAAPECIDVKGAGIYSNTYKNPAVCSGGPEPQLSVTFPVR
VQLYIKLAKNANKVNKKLVLPDEYIALEFKGMSGTGTIDTDKNLTFRIRGLNNIHVLDCFVNVDLEPADGVVDFGKINSR
TIKNTSVSETFSVVMTKDPGAACTEQFNILGSFFTTDILSDYSHLDIGNGLLLKIFHNDGTATEFNRFSQFASFSSSSAP
SVTAPFRAELSANPAETVVEGPFSKDVILKITYN

Specific function: May be involved in a fimbrial system chaperoned by yqiH and exported by yqiG [H]

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasm [C]

Metaboloic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: To E.coli ybgO [H]

Homologues:

Organism=Escherichia coli, GI1789427, Length=354, Percent_Identity=93.2203389830508, Blast_Score=655, Evalue=0.0,
Organism=Escherichia coli, GI87081777, Length=343, Percent_Identity=35.8600583090379, Blast_Score=185, Evalue=3e-48,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR008966
- InterPro:   IPR000259 [H]

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 38655; Mature: 38655

Theoretical pI: Translated: 4.88; Mature: 4.88

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.7 %Cys     (Translated Protein)
2.3 %Met     (Translated Protein)
4.0 %Cys+Met (Translated Protein)
1.7 %Cys     (Mature Protein)
2.3 %Met     (Mature Protein)
4.0 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MRYLLTVIAFIMGFSALPVWAMNCYAEHEGGNTVVIGYVPRIAIPSDGKKGDKIWQSSEY
CHHHHHHHHHHHCCCCCCCEEEEEEEEECCCCEEEEEECCEEEECCCCCCCCHHHCCCHH
FMNVFCNNALPAPSPGEEYPSAWVNIMMLLASGQDFYNQNSYTLGVTYNGVDYDSTSTRA
EEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCHHHCCCCEEEEEEECCCCCCCCCCEE
IAAPECIDVKGAGIYSNTYKNPAVCSGGPEPQLSVTFPVRVQLYIKLAKNANKVNKKLVL
EECCCEEECCCCCEECCCCCCCCEECCCCCCCEEEEEEEEEEEEEEECCCCCCCCCEEEC
PDEYIALEFKGMSGTGTIDTDKNLTFRIRGLNNIHVLDCFVNVDLEPADGVVDFGKINSR
CCCEEEEEEECCCCCCCEECCCCEEEEEECCCCEEEEEEEEEECCCCCCCEEECCCCCCE
TIKNTSVSETFSVVMTKDPGAACTEQFNILGSFFTTDILSDYSHLDIGNGLLLKIFHNDG
EECCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHCCCCCEECCCCEEEEEEECCC
TATEFNRFSQFASFSSSSAPSVTAPFRAELSANPAETVVEGPFSKDVILKITYN
CCHHHHHHHHHHHCCCCCCCCEECCEEEECCCCCHHHHCCCCCCCCEEEEEEEC
>Mature Secondary Structure
MRYLLTVIAFIMGFSALPVWAMNCYAEHEGGNTVVIGYVPRIAIPSDGKKGDKIWQSSEY
CHHHHHHHHHHHCCCCCCCEEEEEEEEECCCCEEEEEECCEEEECCCCCCCCHHHCCCHH
FMNVFCNNALPAPSPGEEYPSAWVNIMMLLASGQDFYNQNSYTLGVTYNGVDYDSTSTRA
EEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCHHHCCCCEEEEEEECCCCCCCCCCEE
IAAPECIDVKGAGIYSNTYKNPAVCSGGPEPQLSVTFPVRVQLYIKLAKNANKVNKKLVL
EECCCEEECCCCCEECCCCCCCCEECCCCCCCEEEEEEEEEEEEEEECCCCCCCCCEEEC
PDEYIALEFKGMSGTGTIDTDKNLTFRIRGLNNIHVLDCFVNVDLEPADGVVDFGKINSR
CCCEEEEEEECCCCCCCEECCCCEEEEEECCCCEEEEEEEEEECCCCCCCEEECCCCCCE
TIKNTSVSETFSVVMTKDPGAACTEQFNILGSFFTTDILSDYSHLDIGNGLLLKIFHNDG
EECCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHCCCCCEECCCCEEEEEEECCC
TATEFNRFSQFASFSSSSAPSVTAPFRAELSANPAETVVEGPFSKDVILKITYN
CCHHHHHHHHHHHCCCCCCCCEECCEEEECCCCCHHHHCCCCCCCCEEEEEEEC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 9278503 [H]