Definition | Escherichia coli HS, complete genome. |
---|---|
Accession | NC_009800 |
Length | 4,643,538 |
The map label for this gene is yraI [H]
Identifier: 157162628
GI number: 157162628
Start: 3339015
End: 3339710
Strand: Direct
Name: yraI [H]
Synonym: EcHS_A3335
Alternate gene names: 157162628
Gene position: 3339015-3339710 (Clockwise)
Preceding gene: 157162627
Following gene: 157162629
Centisome position: 71.91
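The coordinate-derived fields above can be cross-checked with a little arithmetic. The sketch below assumes that coordinates are 1-based and inclusive, and that the centisome position is the gene start expressed as a percentage of the genome length; the record states neither convention, and the function names are illustrative only.

```python
GENOME_LENGTH = 4_643_538  # "Length" field of NC_009800
GENE_START = 3_339_015     # "Start" field
GENE_END = 3_339_710       # "End" field


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome_position(start: int, genome_length: int) -> float:
    """Gene start as a percentage of genome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


if __name__ == "__main__":
    print(gene_length(GENE_START, GENE_END))
    print(centisome_position(GENE_START, GENOME_LENGTH))
```

Under these assumptions the results agree with the 696-base gene sequence and the listed centisome position of 71.91.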
GC content: 43.82
Gene sequence:
>696_bases ATGTCAAAACGAACATTCGCGGTGATATTAACCTTGTTGTGTAGCTTCTGTATTGGCCAGGCGCTTGCAGGAGGAATCGT TTTACAGCGAACGCGAGTGATCTATGATGCCAGCCGCAAAGAGGCTGCGTTACCTGTCGCAAACAAAGGCGCAGAAACGC CTTATTTACTGCAATCATGGGTAGATAATATAGATGGTAAAAGCCGTGCCCCATTTATTATAACCCCACCGCTATTTCGT CTTGAGGCTGGCGATGACTCATCACTGCGAATTATTAAAACAGCTGATAACCTGCCTGAAAATAAAGAGTCGCTGTTCTA CATTAATGTTCGTGCCATTCCAGCAAAGAAAAAATCAGATGATGTTAATGCTAACGAGTTGACGCTGGTATTTAAAACAC GGATCAAAATGTTTTATCGCCCCGCACACCTGAAGGGACGGGTAAACGATGCGTGGAAATCACTGGAATTTAAACGTAGT GATCATTCACTCAATATATATAACCCAACTGAATATTACGTCGTATTTGCCGGATTGGCAGTCGATAAAATCGATCTCAC AAGCAAAATTGAATATATCGCGCCCGGAGAACATAAACAGTTACCACTTCCTGCATCTGGCGGAAAGAACGTGAAATGGG CTGCGATCAATGATTATGGCGGCAGTTCCGGGACAGAAACTCGTCCACTGCAATAA
Upstream 100 bases:
>100_bases GTAACGCTGGATTACCGTTAATACGTTACGGCGTTATCTGACCTGTCAGATAACGCCCTTTTCCTTCCTCTTTCTCGTTG TATCAGGTTGAAAAATGACT
Downstream 100 bases:
>100_bases CAAATATAAAAAACACAGGTCATCAGGGAATGCCACAACGACACCACCAGGGACATAAACGCACACCGAAACAGTTGGCG CTCATTATCAAACGCTGTTT
Product: pili assembly chaperone protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 231; Mature: 230
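The two residue counts follow from the gene length. This is a minimal sketch, assuming the 696-base coding sequence ends in a single stop codon (the gene sequence above ends in TAA) and that the mature form differs only by removal of the initiator methionine, as the Mature_230_residues sequence in this record suggests. The function names are illustrative.

```python
def translated_length(cds_bases: int) -> int:
    """Residues encoded by a coding sequence that includes one stop codon."""
    if cds_bases % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return cds_bases // 3 - 1  # drop the stop codon


def mature_length(translated: int) -> int:
    """Residues left after removing the initiator methionine (assumed)."""
    return translated - 1


if __name__ == "__main__":
    n_translated = translated_length(696)
    print(n_translated, mature_length(n_translated))
```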
Protein sequence:
>231_residues MSKRTFAVILTLLCSFCIGQALAGGIVLQRTRVIYDASRKEAALPVANKGAETPYLLQSWVDNIDGKSRAPFIITPPLFR LEAGDDSSLRIIKTADNLPENKESLFYINVRAIPAKKKSDDVNANELTLVFKTRIKMFYRPAHLKGRVNDAWKSLEFKRS DHSLNIYNPTEYYVVFAGLAVDKIDLTSKIEYIAPGEHKQLPLPASGGKNVKWAAINDYGGSSGTETRPLQ
Sequences:
>Translated_231_residues MSKRTFAVILTLLCSFCIGQALAGGIVLQRTRVIYDASRKEAALPVANKGAETPYLLQSWVDNIDGKSRAPFIITPPLFR LEAGDDSSLRIIKTADNLPENKESLFYINVRAIPAKKKSDDVNANELTLVFKTRIKMFYRPAHLKGRVNDAWKSLEFKRS DHSLNIYNPTEYYVVFAGLAVDKIDLTSKIEYIAPGEHKQLPLPASGGKNVKWAAINDYGGSSGTETRPLQ
>Mature_230_residues SKRTFAVILTLLCSFCIGQALAGGIVLQRTRVIYDASRKEAALPVANKGAETPYLLQSWVDNIDGKSRAPFIITPPLFRL EAGDDSSLRIIKTADNLPENKESLFYINVRAIPAKKKSDDVNANELTLVFKTRIKMFYRPAHLKGRVNDAWKSLEFKRSD HSLNIYNPTEYYVVFAGLAVDKIDLTSKIEYIAPGEHKQLPLPASGGKNVKWAAINDYGGSSGTETRPLQ
Specific function: Could be required for the biogenesis of the putative yraH fimbria [H]
COG id: NA
COG function: NA
Gene ontology:
Cell location: Periplasm [H]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 Ig-like (immunoglobulin-like) domain [H]
Homologues:
Organism=Escherichia coli, GI1789532, Length=231, Percent_Identity=99.5670995670996, Blast_Score=474, Evalue=1e-135
Organism=Escherichia coli, GI1790771, Length=217, Percent_Identity=39.63133640553, Blast_Score=156, Evalue=1e-39
Organism=Escherichia coli, GI1786743, Length=220, Percent_Identity=35, Blast_Score=140, Evalue=9e-35
Organism=Escherichia coli, GI1787171, Length=219, Percent_Identity=34.2465753424658, Blast_Score=124, Evalue=6e-30
Organism=Escherichia coli, GI1786333, Length=238, Percent_Identity=34.0336134453782, Blast_Score=117, Evalue=9e-28
Organism=Escherichia coli, GI1788428, Length=217, Percent_Identity=31.3364055299539, Blast_Score=108, Evalue=3e-25
Organism=Escherichia coli, GI1786936, Length=199, Percent_Identity=32.1608040201005, Blast_Score=100, Evalue=9e-23
Organism=Escherichia coli, GI1788677, Length=159, Percent_Identity=33.3333333333333, Blast_Score=97, Evalue=9e-22
Organism=Escherichia coli, GI87082203, Length=224, Percent_Identity=29.0178571428571, Blast_Score=96, Evalue=2e-21
Organism=Escherichia coli, GI87081806, Length=234, Percent_Identity=27.7777777777778, Blast_Score=84, Evalue=7e-18
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR008962
- InterPro: IPR001829
- InterPro: IPR016148
- InterPro: IPR018046
- InterPro: IPR016147 [H]
Pfam domain/function: PF02753 Pili_assembly_C; PF00345 Pili_assembly_N [H]
EC number: NA
Molecular weight: Translated: 25689; Mature: 25558
Theoretical pI: Translated: 9.84; Mature: 9.84
Prosite motif: PS00635 PILI_CHAPERONE
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.9 %Cys (Translated Protein)
0.9 %Met (Translated Protein)
1.7 %Cys+Met (Translated Protein)
0.9 %Cys (Mature Protein)
0.4 %Met (Mature Protein)
1.3 %Cys+Met (Mature Protein)
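The Cys/Met percentages can be reproduced from the protein sequence given earlier in this record. The sketch below assumes each value is a simple residue count divided by sequence length, rounded to one decimal place, and that the mature protein is the translated sequence without its first residue; the helper name `residue_percent` is hypothetical.

```python
# Translated protein sequence copied from this record (231 residues).
TRANSLATED = (
    "MSKRTFAVILTLLCSFCIGQALAGGIVLQRTRVIYDASRKEAALPVANKGAETPYLLQSWVDNIDGKSRAPFIITPPLFR"
    "LEAGDDSSLRIIKTADNLPENKESLFYINVRAIPAKKKSDDVNANELTLVFKTRIKMFYRPAHLKGRVNDAWKSLEFKRS"
    "DHSLNIYNPTEYYVVFAGLAVDKIDLTSKIEYIAPGEHKQLPLPASGGKNVKWAAINDYGGSSGTETRPLQ"
)
# Mature form: initiator methionine removed (matches Mature_230_residues).
MATURE = TRANSLATED[1:]


def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions in `sequence` occupied by any of `residues`."""
    count = sum(sequence.count(r) for r in residues)
    return round(100.0 * count / len(sequence), 1)


if __name__ == "__main__":
    for label, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
        print(label,
              residue_percent(seq, "C"),
              residue_percent(seq, "M"),
              residue_percent(seq, "CM"))
```

Computing the combined Cys+Met value from the raw counts, rather than adding the two rounded percentages, is why the translated protein shows 1.7 and not 1.8.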
Secondary structure:
>Translated Secondary Structure
MSKRTFAVILTLLCSFCIGQALAGGIVLQRTRVIYDASRKEAALPVANKGAETPYLLQSW
CCCHHHHHHHHHHHHHHHHHHHHCCEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHH
VDNIDGKSRAPFIITPPLFRLEAGDDSSLRIIKTADNLPENKESLFYINVRAIPAKKKSD
HHCCCCCCCCCEEECCCCEEEECCCCCCEEEEEECCCCCCCCCCEEEEEEEEECCCCCCC
DVNANELTLVFKTRIKMFYRPAHLKGRVNDAWKSLEFKRSDHSLNIYNPTEYYVVFAGLA
CCCCCEEEEEEEEEHHHEECCCCCCCCCCHHHHCCEEECCCCEEEEECCCEEEEEEECCE
VDKIDLTSKIEYIAPGEHKQLPLPASGGKNVKWAAINDYGGSSGTETRPLQ
EEEEECCCCEEEECCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCC
>Mature Secondary Structure
SKRTFAVILTLLCSFCIGQALAGGIVLQRTRVIYDASRKEAALPVANKGAETPYLLQSW
CCHHHHHHHHHHHHHHHHHHHHCCEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHH
VDNIDGKSRAPFIITPPLFRLEAGDDSSLRIIKTADNLPENKESLFYINVRAIPAKKKSD
HHCCCCCCCCCEEECCCCEEEECCCCCCEEEEEECCCCCCCCCCEEEEEEEEECCCCCCC
DVNANELTLVFKTRIKMFYRPAHLKGRVNDAWKSLEFKRSDHSLNIYNPTEYYVVFAGLA
CCCCCEEEEEEEEEHHHEECCCCCCCCCCHHHHCCEEECCCCEEEEECCCEEEEEEECCE
VDKIDLTSKIEYIAPGEHKQLPLPASGGKNVKWAAINDYGGSSGTETRPLQ
EEEEECCCCEEEECCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 9278503 [H]