| Definition | Escherichia coli HS, complete genome. |
|---|---|
| Accession | NC_009800 |
| Length | 4,643,538 |
The map label for this gene is fliF
Identifier: 157161409
GI number: 157161409
Start: 2032969
End: 2034627
Strand: Direct
Name: fliF
Synonym: EcHS_A2038
Alternate gene names: 157161409
Gene position: 2032969-2034627 (Clockwise)
Preceding gene: 157161404
Following gene: 157161410
Centisome position: 43.78
GC content: 54.91
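The Start, End, and Centisome position fields above can be cross-checked with a short calculation. This is a minimal sketch: it assumes the coordinates are 1-based and inclusive, and that the centisome position is the start coordinate expressed as a percentage of the genome length. The record states neither convention, and the function names are illustrative.

```python
GENOME_LENGTH = 4_643_538  # Length field of NC_009800
START = 2_032_969          # Start field
END = 2_034_627            # End field


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the genome (assumed definition)."""
    return round(100 * start / genome_length, 2)


print(gene_length(START, END))          # 1659
print(centisome(START, GENOME_LENGTH))  # 43.78
```

Under these assumptions the computed length agrees with the 1659-base gene sequence and the computed percentage agrees with the Centisome position field.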
Gene sequence:
>1659_bases ATGAATGCGACTGCAGCCCAGACAAAATCTCTTGAGTGGCTTAATCGCCTGCGTGCGAATCCGAAAATTCCATTGATTGT TGCCGGTTCCGCGGCAGTGGCGGTCATGGTCGCACTGATCCTGTGGGCGAAAGCCCCCGACTACCGCACATTATTCAGCA ATCTTTCCGATCAGGATGGTGGCGCAATTGTCAGCCAACTGACGCAAATGAATATTCCTTACCGCTTCAGCGAAGCCAGC GGCGCTATTGAAGTTCCGGCAGATAAAGTTCACGAGCTGCGTCTGCGCCTGGCGCAACAAGGTTTGCCCAAAGGCGGCGC GGTCGGTTTCGAACTACTCGATCAGGAAAAGTTTGGTATCAGCCAGTTCAGCGAACAGGTGAATTATCAGCGGGCGCTGG AAGGCGAGCTTTCTCGTACCATCGAAACTATCGGCCCGGTAAAAGGGGCGCGCGTACATCTGGCAATGCCGAAACCGTCT TTATTCGTCCGTGAACAAAAATCCCCTTCTGCATCGGTGACGGTAAATCTGTTACCCGGCCGCGCACTCGATGAAGGGCA AATTAGCGCCATTGTGCATCTGGTTTCCAGCGCCGTTGCTGGTCTGCCGCCGGGAAACGTCACGCTGGTGGATCAGGGCG GACATCTGTTAACCCAGTCCAATACCAGCGGGCGCGATCTTAATGATGCTCAGTTGAAATATGCCAGCGATGTCGAAGGC CGTATTCAGCGGCGTATTGAAGCAATCCTGTCGCCTATTGTTGGTAACGGTAATATTCACGCCCAGGTCACGGCGCAGCT GGACTTCGCCAGTAAAGAACAAACGGAAGAACAGTATCGCCCTAACGGTGATGAATCTCATGCGGCGCTTCGTTCACGCC AGCTTAATGAGAGCGAGCAAAGCGGTTCCGGTTATCCGGGCGGCGTACCGGGGGCGTTGTCGAATCAACCGGCACCTGCG AATAACGCGCCAATCAGCACGCCTCCGGCAAATCAAAATAACCGCCAGCAGCAGGCGAGCACCACCAGCAATAGCGGGCC GCGTAGCACACAGCGGAATGAAACCAGTAACTACGAAGTCGATCGCACCATTCGTCATACCAAAATGAACGTGGGCGATG TGCAACGTCTGTCAGTCGCGGTAGTGGTGAATTACAAAACCTTGCCAGACGGTAAACCATTGCCTCTCAGCAACGAACAG ATGAAGCAAATTGAAGATCTGACCCGCGAGGCGATGGGCTTTTCTGAAAAACGCGGCGACTCGCTCAATGTCGTTAACTC GCCGTTCAATAGCAGTGACGAAAGCGGCGGAGAACTGCCGTTCTGGCAACAGCAAGCGTTTATCGATCAGTTGCTGGCTG CCGGTCGCTGGTTGCTGGTACTGCTGGTGGCGTGGCTGCTGTGGCGGAAAGCGGTACGCCCGCAGTTAACACGTCGCGCC GAGGCGATGAAAGCTGTACAGCAACAGGCGCAGGCCCGTGAGGAAGTGGAAGATGCGGTGGAAGTCCGCCTGAGCAAAGA CGAACAACTCCAACAACGACGCGCTAACCAACGTCTGGGGGCAGAAGTCATGAGCCAGCGTATCCGTGAAATGTCTGATA ACGATCCGCGCGTGGTGGCGCTGGTCATTCGCCAGTGGATAAATAACGATCATGAGTAA
Upstream 100 bases:
>100_bases CAAATAATGGCAGCGTCAATTTTTCGAGTTTGCTGACCCGGGAGTGTGTCTTGTTCCACTTTGCCAATAACGCCGTCCAT AATCAGCCACGAGGTGCGCG
Downstream 100 bases:
>100_bases CCTGACAGGCACCGATAAAAGCGTCATCCTGCTGATGACCATTGGCGAAGACCGGGCGGCAGAGGTGTTCAAGCACCTCT CCCAGCGCGAAGTGCAAACC
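A GC content value can be recomputed from any of the sequences in this record. The following is a minimal sketch, assuming that GC content means the percentage of G and C bases and that the spaces inside the sequence strings are line-wrap residue to be ignored. `gc_content` is an illustrative name, and the input is the last 20 bases of the upstream fragment.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return round(100 * gc / len(bases), 2)


# Last 20 bases of the upstream fragment in this record.
print(gc_content("AATCAGCCACGAGGTGCGCG"))  # 65.0
```

Applying the same function to the full gene sequence would be the way to compare against the GC content field.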
Product: flagellar MS-ring protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 552; Mature: 552
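The residue count follows from the gene length: 1659 bases form 553 codons, and the final codon (TAA) is a stop. A minimal sketch of that check is below. It assumes the gene sequence is a complete open reading frame that includes its stop codon; `protein_length` is an illustrative name, and the short input is the first three codons of the gene followed by its stop codon, not the full sequence.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons of the standard genetic code


def protein_length(orf: str) -> int:
    """Number of residues encoded by an ORF that includes its stop codon."""
    bases = "".join(orf.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    if bases[-3:] not in STOP_CODONS:
        raise ValueError("ORF does not end in a stop codon")
    return len(bases) // 3 - 1  # drop the stop codon


# Shortened ORF: ATG AAT GCG + TAA
print(protein_length("ATGAATGCGTAA"))  # 3

# The same arithmetic for the 1659-base gene.
print(1659 // 3 - 1)  # 552
```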
Protein sequence:
>552_residues MNATAAQTKSLEWLNRLRANPKIPLIVAGSAAVAVMVALILWAKAPDYRTLFSNLSDQDGGAIVSQLTQMNIPYRFSEAS GAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRALEGELSRTIETIGPVKGARVHLAMPKPS LFVREQKSPSASVTVNLLPGRALDEGQISAIVHLVSSAVAGLPPGNVTLVDQGGHLLTQSNTSGRDLNDAQLKYASDVEG RIQRRIEAILSPIVGNGNIHAQVTAQLDFASKEQTEEQYRPNGDESHAALRSRQLNESEQSGSGYPGGVPGALSNQPAPA NNAPISTPPANQNNRQQQASTTSNSGPRSTQRNETSNYEVDRTIRHTKMNVGDVQRLSVAVVVNYKTLPDGKPLPLSNEQ MKQIEDLTREAMGFSEKRGDSLNVVNSPFNSSDESGGELPFWQQQAFIDQLLAAGRWLLVLLVAWLLWRKAVRPQLTRRA EAMKAVQQQAQAREEVEDAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPRVVALVIRQWINNDHE
Specific function: The M ring may be actively involved in energy transduction
COG id: COG1766
COG function: function code NU; Flagellar biosynthesis/type III secretory pathway lipoprotein
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein. Bacterial flagellum basal body
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the fliF family
Homologues:
Organism=Escherichia coli, GI1788248, Length=552, Percent_Identity=100, Blast_Score=1120, Evalue=0.0
Paralogues:
None
Copy number: 10-20 (rich media) [C]
Swissprot (AC and ID): FLIF_ECOLI (P25798)
Other databases:
- EMBL: D89826
- EMBL: U00096
- EMBL: AP009048
- EMBL: M84992
- EMBL: L13243
- PIR: G64957
- RefSeq: AP_002550.1
- RefSeq: NP_416448.1
- ProteinModelPortal: P25798
- DIP: DIP-401N
- IntAct: P25798
- STRING: P25798
- EnsemblBacteria: EBESCT00000000800
- EnsemblBacteria: EBESCT00000000801
- EnsemblBacteria: EBESCT00000016882
- GeneID: 946448
- GenomeReviews: AP009048_GR
- GenomeReviews: U00096_GR
- KEGG: ecj:JW1922
- KEGG: eco:b1938
- EchoBASE: EB1323
- EcoGene: EG11347
- eggNOG: COG1766
- GeneTree: EBGT00050000011524
- HOGENOM: HBG661951
- OMA: FMEATRY
- ProtClustDB: PRK06007
- BioCyc: EcoCyc:FLIF-FLAGELLAR-MS-RING
- Genevestigator: P25798
- InterPro: IPR013556
- InterPro: IPR000067
- InterPro: IPR006182
- PRINTS: PR01009
- TIGRFAMs: TIGR00206
Pfam domain/function: PF01514 YscJ_FliF; PF08345 YscJ_FliF_C
EC number: NA
Molecular weight: Translated: 60590; Mature: 60590
Theoretical pI: Translated: 6.99; Mature: 6.99
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
Two regions listed; coordinates not available
Cys/Met content:
0.0 %Cys (Translated Protein)
1.8 %Met (Translated Protein)
1.8 %Cys+Met (Translated Protein)
0.0 %Cys (Mature Protein)
1.8 %Met (Mature Protein)
1.8 %Cys+Met (Mature Protein)
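Percentages of this kind can be derived by counting one-letter residue codes. The sketch below assumes each value is the count of C or M residues divided by the protein length, rounded to one decimal place; the record does not state the rounding rule. `cys_met_percent` is an illustrative name and the input is a toy peptide, not the fliF sequence.

```python
def cys_met_percent(protein: str) -> dict:
    """Percent Cys, Met, and Cys+Met in a one-letter protein sequence."""
    residues = "".join(protein.split()).upper()
    if not residues:
        raise ValueError("empty sequence")
    n = len(residues)
    cys = residues.count("C")
    met = residues.count("M")
    return {
        "Cys": round(100 * cys / n, 1),
        "Met": round(100 * met / n, 1),
        "Cys+Met": round(100 * (cys + met) / n, 1),
    }


# Toy peptide: 1 Cys and 2 Met in 5 residues.
print(cys_met_percent("MKCAM"))  # {'Cys': 20.0, 'Met': 40.0, 'Cys+Met': 60.0}
```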
Secondary structure:
>Translated Secondary Structure (the Mature Secondary Structure is identical)
MNATAAQTKSLEWLNRLRANPKIPLIVAGSAAVAVMVALILWAKAPDYRTLFSNLSDQDG
CCCCHHHHHHHHHHHHHHCCCCCEEEEECHHHHHHHHHHHHHHCCCCHHHHHHCCCCCCC
GAIVSQLTQMNIPYRFSEASGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGI
HHHHHHHHHCCCCEEECCCCCCEECCHHHHHHHHHHHHHCCCCCCCCCHHHHHCCHHCCH
SQFSEQVNYQRALEGELSRTIETIGPVKGARVHLAMPKPSLFVREQKSPSASVTVNLLPG
HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCEEEECCCCCCCEEEEEECCC
RALDEGQISAIVHLVSSAVAGLPPGNVTLVDQGGHLLTQSNTSGRDLNDAQLKYASDVEG
CCCCCHHHHHHHHHHHHHHHCCCCCCEEEEECCCEEEEECCCCCCCCCHHHHHHHHHHHH
RIQRRIEAILSPIVGNGNIHAQVTAQLDFASKEQTEEQYRPNGDESHAALRSRQLNESEQ
HHHHHHHHHHHHHCCCCCEEEEEEEEECCCCHHHHHHHHCCCCCHHHHHHHHHHCCCHHH
SGSGYPGGVPGALSNQPAPANNAPISTPPANQNNRQQQASTTSNSGPRSTQRNETSNYEV
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCH
DRTIRHTKMNVGDVQRLSVAVVVNYKTLPDGKPLPLSNEQMKQIEDLTREAMGFSEKRGD
HHHHHHHHCCCCCHHEEEEEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCHHHCCC
SLNVVNSPFNSSDESGGELPFWQQQAFIDQLLAAGRWLLVLLVAWLLWRKAVRPQLTRRA
CCEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHH
EAMKAVQQQAQAREEVEDAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPRVVA
HHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHH
LVIRQWINNDHE
HHHHHHHCCCCH
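A per-state tally of a secondary structure string gives a quick summary of the prediction. The sketch below assumes the usual three-state convention (H for helix, E for strand, C for coil), which the record does not define. `ss_composition` is an illustrative name, and the input is the state string for the final 12-residue segment (LVIRQWINNDHE).

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Count H, E, and C states in a secondary structure string."""
    cleaned = "".join(states.split()).upper()
    counts = Counter(cleaned)
    unknown = set(counts) - set("HEC")
    if unknown:
        raise ValueError(f"unexpected state codes: {sorted(unknown)}")
    return {state: counts.get(state, 0) for state in "HEC"}


# State string for the final 12-residue segment of the record.
print(ss_composition("HHHHHHHCCCCH"))  # {'H': 8, 'E': 0, 'C': 4}
```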
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 9097040; 9278503; 1551848; 8224881