Definition | Escherichia coli HS, complete genome. |
---|---|
Accession | NC_009800 |
Length | 4,643,538 |
The map label for this gene is fimE
Identifier: 157163757
GI number: 157163757
Start: 4547040
End: 4547636
Strand: Direct
Name: fimE
Synonym: EcHS_A4539
Alternate gene names: 157163757
Gene position: 4547040-4547636 (Clockwise)
Preceding gene: 157163756
Following gene: 157163758
Centisome position: 97.92
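The 597-base length and the centisome value can be reproduced from the coordinates above. This is a minimal sketch, assuming 1-based inclusive coordinates and assuming the centisome position is the gene start expressed as a percentage of the genome length (the record does not state its formula; the function names are illustrative only):

```python
GENOME_LENGTH = 4_643_538          # "Length" field of NC_009800
START, END = 4_547_040, 4_547_636  # "Start" and "End" fields for fimE


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of genome length (assumed definition)."""
    return round(100 * start / genome_length, 2)


print(gene_length(START, END))          # 597
print(centisome(START, GENOME_LENGTH))  # 97.92
```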
GC content: 46.9
Gene sequence:
>597_bases
GTGAGTAAACGTCGTTATCTTACCGGTAAAGAAGTTCAGGCCATGATGCAGGCGGTTTGTTACGGGGCAACGGGAGCCAG
AGATTATTGTCTTATTCTGTTGGCATATCGGCATGGGATGCGTATTAGTGAACTGCTTGATCTGCATTATCAGGACCTTG
ACCTTAATGAAGGTAGAATAAATATTCGCCGACTGAAGAACGGATTTTCTACCGTTCACCCGTTACGTTTTGATGAGCGT
GAAGCCGTGGAACGATGGACCCAGGAACGTGCTAACTGGAAAGGCGCTGACCGGACTGACGCTATATTTATTTCTCGCCG
CGGGAGTCGGCTTTCTCGCCAGCAGGCCTATCGCATTATTCGCGATGCCGGTATTGAAGCTGGAACCGTAACGCAGACTC
ATCCTCATATGTTAAGGCATGCTTGCGGTTATGAACTGGCGGAGCGTGGTGCCGATACCCGTTTAATTCAGGATTATCTC
GGGCATCGAAATATTCGCCATACTGTGCGTTATACCGCCAGTAATGCTGCTCGTTTTGCCGGATTATGGGAAAGAAATAA
TCTCATAAACGAAAAATTAAAAAGAGAAGAAGTTTAA
Upstream 100 bases:
>100_bases
CCGTGTGTGGTTATCTTTTTATCTATTGGGCTAATTTTGACCGATTGAGGTTTCCTATAGGTATTCATTCAAATATATCT
CAGTTAGGAGTACTACTATT
Downstream 100 bases:
>100_bases
TTTAACTTATTGATAATAAAGTTAAAAAGCAAATAAATACAAGACAATTGGGGCCAAACTGTCCATATCATAAATAAGTT
ACGTATTTTTTCTCAAGCAT
Product: tyrosine recombinase
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 198; Mature: 197
Protein sequence:
>198_residues
MSKRRYLTGKEVQAMMQAVCYGATGARDYCLILLAYRHGMRISELLDLHYQDLDLNEGRINIRRLKNGFSTVHPLRFDER
EAVERWTQERANWKGADRTDAIFISRRGSRLSRQQAYRIIRDAGIEAGTVTQTHPHMLRHACGYELAERGADTRLIQDYL
GHRNIRHTVRYTASNAARFAGLWERNNLINEKLKREEV
Sequences:
>Translated_198_residues
MSKRRYLTGKEVQAMMQAVCYGATGARDYCLILLAYRHGMRISELLDLHYQDLDLNEGRINIRRLKNGFSTVHPLRFDER
EAVERWTQERANWKGADRTDAIFISRRGSRLSRQQAYRIIRDAGIEAGTVTQTHPHMLRHACGYELAERGADTRLIQDYL
GHRNIRHTVRYTASNAARFAGLWERNNLINEKLKREEV
>Mature_197_residues
SKRRYLTGKEVQAMMQAVCYGATGARDYCLILLAYRHGMRISELLDLHYQDLDLNEGRINIRRLKNGFSTVHPLRFDERE
AVERWTQERANWKGADRTDAIFISRRGSRLSRQQAYRIIRDAGIEAGTVTQTHPHMLRHACGYELAERGADTRLIQDYLG
HRNIRHTVRYTASNAARFAGLWERNNLINEKLKREEV
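The gene sequence begins with GTG rather than ATG, yet the translated protein begins with M. The sketch below illustrates how the two can be reconciled, assuming the standard bacterial genetic code (NCBI translation table 11), in which an initiator GTG is read as methionine. It translates only the first 30 bases of the gene sequence listed above. The helper name `translate` is illustrative, and it assumes its input starts at the initiation codon:

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence that starts at its initiation codon."""
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        residue = CODON_TABLE[dna[index:index + 3]]
        if residue == "*":  # stop codon ends translation
            break
        # The initiation codon (GTG here) is read as Met, whatever it encodes.
        protein.append("M" if index == 0 else residue)
    return "".join(protein)


first_30_bases = "GTGAGTAAACGTCGTTATCTTACCGGTAAA"
print(translate(first_30_bases))      # MSKRRYLTGK
print(translate(first_30_bases)[1:])  # SKRRYLTGK (initiator Met removed)
```

The second line of output mirrors the relationship between the translated (198-residue) and mature (197-residue) sequences, which differ only by the N-terminal methionine.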
Specific function: FimE is one of the two regulatory proteins that control the phase variation of type 1 fimbriae in E. coli. These proteins mediate the periodic inversion of a 300 bp DNA segment that harbors the promoter for the fimbrial structural gene, fimA. FimE switches f
COG id: COG0582
COG function: function code L; Integrase
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the 'phage' integrase family
Homologues:
Organism=Escherichia coli, GI1790768, Length=198, Percent_Identity=100, Blast_Score=408, Evalue=1e-115
Organism=Escherichia coli, GI1790767, Length=183, Percent_Identity=52.4590163934426, Blast_Score=198, Evalue=2e-52
Organism=Escherichia coli, GI1790244, Length=167, Percent_Identity=29.940119760479, Blast_Score=75, Evalue=3e-15
Organism=Escherichia coli, GI1789261, Length=184, Percent_Identity=27.7173913043478, Blast_Score=67, Evalue=7e-13
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): FIME_ECOL6 (P0ADH8)
Other databases:
- EMBL: AE014075
- RefSeq: NP_757240.1
- ProteinModelPortal: P0ADH8
- SMR: P0ADH8
- EnsemblBacteria: EBESCT00000043640
- GeneID: 1037249
- GenomeReviews: AE014075_GR
- KEGG: ecc:c5392
- GeneTree: EBGT00050000008629
- HOGENOM: HBG287305
- OMA: TDALFIS
- ProtClustDB: PRK09871
- GO: GO:0006350
- InterPro: IPR011010
- InterPro: IPR013762
- InterPro: IPR002104
- Gene3D: G3DSA:1.10.443.10
Pfam domain/function: PF00589 Phage_integrase; SSF56349 DNA_brk_join_enz
EC number: NA
Molecular weight: Translated: 23117; Mature: 22985
Theoretical pI: Translated: 10.16; Mature: 10.16
Prosite motif: NA
Important sites: ACT_SITE 171-171
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.5 %Cys (Translated Protein)
2.5 %Met (Translated Protein)
4.0 %Cys+Met (Translated Protein)
1.5 %Cys (Mature Protein)
2.0 %Met (Mature Protein)
3.6 %Cys+Met (Mature Protein)
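These percentages can be recomputed from the protein sequence listed earlier. This is a minimal sketch, assuming each value is the residue count divided by the sequence length and rounded to one decimal place, and that the mature protein is the translated sequence without its N-terminal methionine (the helper name `percent` is illustrative):

```python
# Translated protein sequence, copied from the ">198_residues" record.
TRANSLATED = (
    "MSKRRYLTGKEVQAMMQAVCYGATGARDYCLILLAYRHGMRISELLDLHYQDLDLNEGRINIRRLKNGFSTVHPLRFDER"
    "EAVERWTQERANWKGADRTDAIFISRRGSRLSRQQAYRIIRDAGIEAGTVTQTHPHMLRHACGYELAERGADTRLIQDYL"
    "GHRNIRHTVRYTASNAARFAGLWERNNLINEKLKREEV"
)
MATURE = TRANSLATED[1:]  # assumed: initiator Met removed


def percent(sequence: str, residues: str) -> float:
    """Percentage of positions in `sequence` occupied by any of `residues`."""
    count = sum(sequence.count(residue) for residue in residues)
    return round(100 * count / len(sequence), 1)


for label, sequence in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(label, percent(sequence, "C"), percent(sequence, "M"), percent(sequence, "CM"))
# Translated 1.5 2.5 4.0
# Mature 1.5 2.0 3.6
```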
Secondary structure:
>Translated Secondary Structure
MSKRRYLTGKEVQAMMQAVCYGATGARDYCLILLAYRHGMRISELLDLHYQDLDLNEGRI
CCCCCCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCCCCC
NIRRLKNGFSTVHPLRFDEREAVERWTQERANWKGADRTDAIFISRRGSRLSRQQAYRII
HHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCHHHHHHHHHHH
RDAGIEAGTVTQTHPHMLRHACGYELAERGADTRLIQDYLGHRNIRHTVRYTASNAARFA
HHCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCHHHHHHHHHCCHHHHH
GLWERNNLINEKLKREEV
HHHHHHHHHHHHHHHCCC
>Mature Secondary Structure
SKRRYLTGKEVQAMMQAVCYGATGARDYCLILLAYRHGMRISELLDLHYQDLDLNEGRI
CCCCCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCCCCC
NIRRLKNGFSTVHPLRFDEREAVERWTQERANWKGADRTDAIFISRRGSRLSRQQAYRII
HHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCHHHHHHHHHHH
RDAGIEAGTVTQTHPHMLRHACGYELAERGADTRLIQDYLGHRNIRHTVRYTASNAARFA
HHCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCHHHHHHHHHCCHHHHH
GLWERNNLINEKLKREEV
HHHHHHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: DNA [C]
Specific reaction: Protein + DNA = Protein-DNA [C]
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 12471157