Definition | Escherichia coli 55989, complete genome. |
---|---|
Accession | NC_011748 |
Length | 5,154,862 |
Click here to switch to the map view.
The map label for this gene is yhhI
Identifier: 218693702
GI number: 218693702
Start: 271022
End: 272083
Strand: Direct
Name: yhhI
Synonym: EC55989_0242
Alternate gene names: 218693702
Gene position: 271022-272083 (Clockwise)
Preceding gene: 218693701
Following gene: 218693704
Centisome position: 5.26
GC content: 42.28
Gene sequence:
>1062_bases TTGTCAGGCATCCTACTATTGACTATTTTTGCCGTTATTTCTGGTGCAGAAAGTTGGGAAGATATAGAGGATTTCGGGGA AACACATCTCGATTTCTTGAAGCAATATGGTGATTTTGAAAATGGTATTCCTGTTCACGATACTATTGCCAGAGTTGTAT CCTGTATCAGTCCTGCAAAATTTCATGAGTGCTTTATTAACTGGATGCGTGACTGCCATTCTTCAGATGATAAAGACGTC ATCGCAATTGATGGAAAAACGCTCCGGCACTCTTATGACAAGAGTCGCCGCAGGGGAGCGATTCATGTCATTAGTGCGTT CTCAACAATGCACAGTCTGGTCATCGGACAGATCAGGACGGATGAGAAATCTAATGAGATTACAGCTATCCCAGAACTTC TTAACATGCTGGATATTAAAGGAAAAATCATCACAACTGATGCGATGGGTTGCCAGAAAGATATTGCAGAGAAGATACAA AAACAGGGAGGTGATTATTTATTCGCGGTAAAAGGAAACCAGGGGCGGCTAAATAAAGCCTTTGAGGAAAAATTTCCGCT GAAAGAATTAAATAATCCAGAGCATGACAGTTACGCAATGAGTGAAAAGAGTCACGGCAGAGAAGAAATCCGTCTTCATA TTGTTTGCGATGTCCCTGATGAACTTATTGATTTCACGTTTGAATGGAAAGGACTGAAGAAATTATGCGTGGCAGTCTCC TTTCGGTCAATAATAGCAGAACAAAAGAAAGAGCCAGAAATGACGGTCAGATATTATATCAGTTCTGCTGATTTAACCGC AGAAAAGTTCGCCACAGCAATCCGAAACCACTGGCACGTGGAGAATAAGCTGCACTGGCGTCTGGACGTGGTAATGAATG AAGACGACTGCAAAATAAGAAGAGGAAACGCCGCAGAATTATTTTCAGGGATACGGCACATCGCTATTAATATTTTGACG AATGATAAGGTATTCAAGGCAGGGTTAAGACGTAAGATGCGAAAAGCAGCTATGGACAGAAACTATCTGGCGTCAGTCCT TGCGGAGAGCGGGCTTTCGTAG
Upstream 100 bases:
>100_bases CTCCTTACATAAATAAGGTGAACAAATGGAACTTAAAAAATTGATGGAACATATTTCTATTATTCCCGATTACAGATAAG CCTGGAAAGTAGAGCATAAA
Downstream 100 bases:
>100_bases TCTTACCCCGACTCTCCCCCAGCCTTAAACACAACCCCCACTCACCGCAACCTAAACTCATCCGCATCCTGCCATGCCGG AAACTTTTCTCTATATTCCC
Product: putative transposase
Products: NA
Alternate protein names: H repeat-associated protein in rhsC-phrB intergenic region; ORF-H2 [H]
Number of amino acids: Translated: 353; Mature: 352
Protein sequence:
>353_residues MSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDV IAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIRTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVS FRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILT NDKVFKAGLRRKMRKAAMDRNYLASVLAESGLS
Sequences:
>Translated_353_residues MSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDV IAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIRTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ KQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVS FRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILT NDKVFKAGLRRKMRKAAMDRNYLASVLAESGLS >Mature_352_residues SGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVI AIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIRTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQK QGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSF RSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTN DKVFKAGLRRKMRKAAMDRNYLASVLAESGLS
Specific function: Unknown
COG id: COG5433
COG function: function code L; Transposase
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the transposase 11 family [H]
Homologues:
Organism=Escherichia coli, GI1789896, Length=353, Percent_Identity=97.7337110481586, Blast_Score=713, Evalue=0.0, Organism=Escherichia coli, GI1787733, Length=353, Percent_Identity=97.1671388101983, Blast_Score=706, Evalue=0.0, Organism=Escherichia coli, GI1786924, Length=228, Percent_Identity=92.1052631578947, Blast_Score=434, Evalue=1e-123,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR002559 [H]
Pfam domain/function: PF01609 Transposase_11 [H]
EC number: NA
Molecular weight: Translated: 40175; Mature: 40044
Theoretical pI: Translated: 6.88; Mature: 6.88
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
2.0 %Cys (Translated Protein) 2.8 %Met (Translated Protein) 4.8 %Cys+Met (Translated Protein) 2.0 %Cys (Mature Protein) 2.6 %Met (Mature Protein) 4.5 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MSGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAK CCCHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCHHH FHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIRT HHHHHHHHHHHHCCCCCCCEEEECCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCC DEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKA CCCCCCCHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCHHH FEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVS HHHCCCHHHCCCCCCCCHHHHHHHCCCEEEEEEEEECCCHHHHCCCCCHHHHHHHHHHHH FRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIR HHHHHHHHCCCCCEEEEEEEECCCCCHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCEEE RGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAESGLS CCCHHHHHHHHHHHEEEEECCCHHHHHHHHHHHHHHHHCHHHHHHHHHHCCCC >Mature Secondary Structure SGILLLTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAK CCHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCHHH FHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIRT HHHHHHHHHHHHCCCCCCCEEEECCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCC DEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKA CCCCCCCHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCHHH FEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVS HHHCCCHHHCCCCCCCCHHHHHHHCCCEEEEEEEEECCCHHHHCCCCCHHHHHHHHHHHH FRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIR HHHHHHHHCCCCCEEEEEEEECCCCCHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCEEE RGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAESGLS CCCHHHHHHHHHHHEEEEECCCHHHHHHHHHHHHHHHHCHHHHHHHHHHCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 8387990; 8905232; 9278503 [H]