Definition Escherichia coli HS, complete genome.
Accession NC_009800
Length 4,643,538

The map label for this gene is fliF

Identifier: 157161409

GI number: 157161409

Start: 2032969

End: 2034627

Strand: Direct

Name: fliF

Synonym: EcHS_A2038

Alternate gene names: 157161409

Gene position: 2032969-2034627 (Clockwise)

Preceding gene: 157161404

Following gene: 157161410

Centisome position: 43.78
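
The position fields above are related by simple arithmetic. The sketch below assumes inclusive 1-based coordinates, and it assumes the centisome position is the start coordinate expressed as a percentage of the genome length. Both assumptions are consistent with the values in this record but are not a documented definition. The function names are illustrative.

```python
GENOME_LENGTH = 4_643_538            # Length field of NC_009800
START, END = 2_032_969, 2_034_627    # Start and End fields for fliF

def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming inclusive 1-based coordinates."""
    return end - start + 1

def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of genome length (assumed definition)."""
    return round(100 * start / genome_length, 2)

print(gene_length(START, END))          # 1659
print(centisome(START, GENOME_LENGTH))  # 43.78
```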

GC content: 54.91

Gene sequence:

>1659_bases
ATGAATGCGACTGCAGCCCAGACAAAATCTCTTGAGTGGCTTAATCGCCTGCGTGCGAATCCGAAAATTCCATTGATTGT
TGCCGGTTCCGCGGCAGTGGCGGTCATGGTCGCACTGATCCTGTGGGCGAAAGCCCCCGACTACCGCACATTATTCAGCA
ATCTTTCCGATCAGGATGGTGGCGCAATTGTCAGCCAACTGACGCAAATGAATATTCCTTACCGCTTCAGCGAAGCCAGC
GGCGCTATTGAAGTTCCGGCAGATAAAGTTCACGAGCTGCGTCTGCGCCTGGCGCAACAAGGTTTGCCCAAAGGCGGCGC
GGTCGGTTTCGAACTACTCGATCAGGAAAAGTTTGGTATCAGCCAGTTCAGCGAACAGGTGAATTATCAGCGGGCGCTGG
AAGGCGAGCTTTCTCGTACCATCGAAACTATCGGCCCGGTAAAAGGGGCGCGCGTACATCTGGCAATGCCGAAACCGTCT
TTATTCGTCCGTGAACAAAAATCCCCTTCTGCATCGGTGACGGTAAATCTGTTACCCGGCCGCGCACTCGATGAAGGGCA
AATTAGCGCCATTGTGCATCTGGTTTCCAGCGCCGTTGCTGGTCTGCCGCCGGGAAACGTCACGCTGGTGGATCAGGGCG
GACATCTGTTAACCCAGTCCAATACCAGCGGGCGCGATCTTAATGATGCTCAGTTGAAATATGCCAGCGATGTCGAAGGC
CGTATTCAGCGGCGTATTGAAGCAATCCTGTCGCCTATTGTTGGTAACGGTAATATTCACGCCCAGGTCACGGCGCAGCT
GGACTTCGCCAGTAAAGAACAAACGGAAGAACAGTATCGCCCTAACGGTGATGAATCTCATGCGGCGCTTCGTTCACGCC
AGCTTAATGAGAGCGAGCAAAGCGGTTCCGGTTATCCGGGCGGCGTACCGGGGGCGTTGTCGAATCAACCGGCACCTGCG
AATAACGCGCCAATCAGCACGCCTCCGGCAAATCAAAATAACCGCCAGCAGCAGGCGAGCACCACCAGCAATAGCGGGCC
GCGTAGCACACAGCGGAATGAAACCAGTAACTACGAAGTCGATCGCACCATTCGTCATACCAAAATGAACGTGGGCGATG
TGCAACGTCTGTCAGTCGCGGTAGTGGTGAATTACAAAACCTTGCCAGACGGTAAACCATTGCCTCTCAGCAACGAACAG
ATGAAGCAAATTGAAGATCTGACCCGCGAGGCGATGGGCTTTTCTGAAAAACGCGGCGACTCGCTCAATGTCGTTAACTC
GCCGTTCAATAGCAGTGACGAAAGCGGCGGAGAACTGCCGTTCTGGCAACAGCAAGCGTTTATCGATCAGTTGCTGGCTG
CCGGTCGCTGGTTGCTGGTACTGCTGGTGGCGTGGCTGCTGTGGCGGAAAGCGGTACGCCCGCAGTTAACACGTCGCGCC
GAGGCGATGAAAGCTGTACAGCAACAGGCGCAGGCCCGTGAGGAAGTGGAAGATGCGGTGGAAGTCCGCCTGAGCAAAGA
CGAACAACTCCAACAACGACGCGCTAACCAACGTCTGGGGGCAGAAGTCATGAGCCAGCGTATCCGTGAAATGTCTGATA
ACGATCCGCGCGTGGTGGCGCTGGTCATTCGCCAGTGGATAAATAACGATCATGAGTAA
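
A GC content value such as the one reported above can be recomputed from a sequence. The sketch below assumes GC content means the percentage of G and C bases over the full sequence length. The function name `gc_content` is hypothetical, and the demo input is only the first 30 bases of the gene sequence, not the whole gene.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace and line breaks are ignored."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return round(100 * gc / len(seq), 2)

# First 30 bases of the gene sequence, used only as a short demo input.
fragment = "ATGAATGCGACTGCAGCCCAGACAAAATCT"
print(gc_content(fragment))
```

Pasting the full 1659-base sequence (line breaks included) into `gc_content` should allow the reported figure to be checked.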

Upstream 100 bases:

>100_bases
CAAATAATGGCAGCGTCAATTTTTCGAGTTTGCTGACCCGGGAGTGTGTCTTGTTCCACTTTGCCAATAACGCCGTCCAT
AATCAGCCACGAGGTGCGCG

Downstream 100 bases:

>100_bases
CCTGACAGGCACCGATAAAAGCGTCATCCTGCTGATGACCATTGGCGAAGACCGGGCGGCAGAGGTGTTCAAGCACCTCT
CCCAGCGCGAAGTGCAAACC

Product: flagellar MS-ring protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 552; Mature: 552

Protein sequence:

>552_residues
MNATAAQTKSLEWLNRLRANPKIPLIVAGSAAVAVMVALILWAKAPDYRTLFSNLSDQDGGAIVSQLTQMNIPYRFSEAS
GAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRALEGELSRTIETIGPVKGARVHLAMPKPS
LFVREQKSPSASVTVNLLPGRALDEGQISAIVHLVSSAVAGLPPGNVTLVDQGGHLLTQSNTSGRDLNDAQLKYASDVEG
RIQRRIEAILSPIVGNGNIHAQVTAQLDFASKEQTEEQYRPNGDESHAALRSRQLNESEQSGSGYPGGVPGALSNQPAPA
NNAPISTPPANQNNRQQQASTTSNSGPRSTQRNETSNYEVDRTIRHTKMNVGDVQRLSVAVVVNYKTLPDGKPLPLSNEQ
MKQIEDLTREAMGFSEKRGDSLNVVNSPFNSSDESGGELPFWQQQAFIDQLLAAGRWLLVLLVAWLLWRKAVRPQLTRRA
EAMKAVQQQAQAREEVEDAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPRVVALVIRQWINNDHE
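
The protein sequence follows from the gene sequence by codon translation: 1659 bases give 553 codons, and dropping the terminal stop codon leaves 552 residues. The sketch below assumes the standard genetic code and a hypothetical helper named `translate`. It is an illustration, not the pipeline that produced this record.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGAATGCGACTGCAGCCCAG"))  # first 7 codons of the gene: MNATAAQ
print(translate("GATCATGAGTAA"))           # last 4 codons of the gene:  DHE
print(1659 // 3 - 1)                       # 552
```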

Sequences:

>Translated_552_residues
MNATAAQTKSLEWLNRLRANPKIPLIVAGSAAVAVMVALILWAKAPDYRTLFSNLSDQDGGAIVSQLTQMNIPYRFSEAS
GAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRALEGELSRTIETIGPVKGARVHLAMPKPS
LFVREQKSPSASVTVNLLPGRALDEGQISAIVHLVSSAVAGLPPGNVTLVDQGGHLLTQSNTSGRDLNDAQLKYASDVEG
RIQRRIEAILSPIVGNGNIHAQVTAQLDFASKEQTEEQYRPNGDESHAALRSRQLNESEQSGSGYPGGVPGALSNQPAPA
NNAPISTPPANQNNRQQQASTTSNSGPRSTQRNETSNYEVDRTIRHTKMNVGDVQRLSVAVVVNYKTLPDGKPLPLSNEQ
MKQIEDLTREAMGFSEKRGDSLNVVNSPFNSSDESGGELPFWQQQAFIDQLLAAGRWLLVLLVAWLLWRKAVRPQLTRRA
EAMKAVQQQAQAREEVEDAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPRVVALVIRQWINNDHE
>Mature_552_residues
MNATAAQTKSLEWLNRLRANPKIPLIVAGSAAVAVMVALILWAKAPDYRTLFSNLSDQDGGAIVSQLTQMNIPYRFSEAS
GAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRALEGELSRTIETIGPVKGARVHLAMPKPS
LFVREQKSPSASVTVNLLPGRALDEGQISAIVHLVSSAVAGLPPGNVTLVDQGGHLLTQSNTSGRDLNDAQLKYASDVEG
RIQRRIEAILSPIVGNGNIHAQVTAQLDFASKEQTEEQYRPNGDESHAALRSRQLNESEQSGSGYPGGVPGALSNQPAPA
NNAPISTPPANQNNRQQQASTTSNSGPRSTQRNETSNYEVDRTIRHTKMNVGDVQRLSVAVVVNYKTLPDGKPLPLSNEQ
MKQIEDLTREAMGFSEKRGDSLNVVNSPFNSSDESGGELPFWQQQAFIDQLLAAGRWLLVLLVAWLLWRKAVRPQLTRRA
EAMKAVQQQAQAREEVEDAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPRVVALVIRQWINNDHE

Specific function: The M ring may be actively involved in energy transduction

COG id: COG1766

COG function: function code NU; Flagellar biosynthesis/type III secretory pathway lipoprotein

Gene ontology:

Cell location: Cell inner membrane; Multi-pass membrane protein. Bacterial flagellum basal body

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the fliF family

Homologues:

Organism=Escherichia coli, GI1788248, Length=552, Percent_Identity=100, Blast_Score=1120, Evalue=0.0
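
A homologue entry like the one above is a comma-separated list of `key=value` pairs with one bare identifier. The sketch below shows one way to read it into a dictionary. The function name `parse_hit` and the `ID` key for the bare identifier are assumptions made for illustration.

```python
def parse_hit(line: str) -> dict[str, str]:
    """Split a homologue entry into fields; a trailing comma is tolerated."""
    fields: dict[str, str] = {}
    for part in line.split(","):
        part = part.strip()
        if not part:
            continue
        if "=" in part:
            key, value = part.split("=", 1)
            fields[key.strip()] = value.strip()
        else:
            # A bare token such as the GI identifier has no key of its own.
            fields.setdefault("ID", part)
    return fields

hit = parse_hit("Organism=Escherichia coli, GI1788248, Length=552, "
                "Percent_Identity=100, Blast_Score=1120, Evalue=0.0")
print(hit["Organism"], hit["ID"], hit["Percent_Identity"])
```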

Paralogues:

None

Copy number: 10-20 (rich media) [C]

Swissprot (AC and ID): FLIF_ECOLI (P25798)

Other databases:

- EMBL:   D89826
- EMBL:   U00096
- EMBL:   AP009048
- EMBL:   M84992
- EMBL:   L13243
- PIR:   G64957
- RefSeq:   AP_002550.1
- RefSeq:   NP_416448.1
- ProteinModelPortal:   P25798
- DIP:   DIP-401N
- IntAct:   P25798
- STRING:   P25798
- EnsemblBacteria:   EBESCT00000000800
- EnsemblBacteria:   EBESCT00000000801
- EnsemblBacteria:   EBESCT00000016882
- GeneID:   946448
- GenomeReviews:   AP009048_GR
- GenomeReviews:   U00096_GR
- KEGG:   ecj:JW1922
- KEGG:   eco:b1938
- EchoBASE:   EB1323
- EcoGene:   EG11347
- eggNOG:   COG1766
- GeneTree:   EBGT00050000011524
- HOGENOM:   HBG661951
- OMA:   FMEATRY
- ProtClustDB:   PRK06007
- BioCyc:   EcoCyc:FLIF-FLAGELLAR-MS-RING
- Genevestigator:   P25798
- InterPro:   IPR013556
- InterPro:   IPR000067
- InterPro:   IPR006182
- PRINTS:   PR01009
- TIGRFAMs:   TIGR00206

Pfam domain/function: PF01514 YscJ_FliF; PF08345 YscJ_FliF_C

EC number: NA

Molecular weight: Translated: 60590; Mature: 60590
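
A molecular weight of this kind is commonly estimated by summing average residue masses and adding one water molecule for the free termini. The sketch below assumes that method and uses standard average residue masses. Whether the figure in this record was computed the same way is not stated, so small differences are possible.

```python
# Average residue masses in daltons (amino acid minus one water).
AVG_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # added once for the N- and C-terminal H and OH

def molecular_weight(protein: str) -> float:
    """Average molecular weight of an unmodified polypeptide chain."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    return sum(AVG_RESIDUE_MASS[aa] for aa in protein) + WATER

print(round(molecular_weight("G"), 2))  # free glycine: 75.07
```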

Theoretical pI: Translated: 6.99; Mature: 6.99

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

2 regions predicted; coordinates unavailable

Cys/Met content:

0.0 %Cys     (Translated Protein)
1.8 %Met     (Translated Protein)
1.8 %Cys+Met (Translated Protein)
0.0 %Cys     (Mature Protein)
1.8 %Met     (Mature Protein)
1.8 %Cys+Met (Mature Protein)
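
The percentages above are residue counts divided by protein length. The sketch below assumes that definition and rounds to one decimal place as in this record. The function name `residue_percent` is hypothetical, and the demo input is only the first 20 residues of the protein.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the sequence made up of any of the given residues."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in residues)
    return round(100 * count / len(protein), 1)

fragment = "MNATAAQTKSLEWLNRLRAN"  # first 20 residues of the protein
print(residue_percent(fragment, "C"))   # 0.0
print(residue_percent(fragment, "M"))   # 5.0
print(residue_percent(fragment, "CM"))  # 5.0
```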

Secondary structure:

>Translated Secondary Structure
MNATAAQTKSLEWLNRLRANPKIPLIVAGSAAVAVMVALILWAKAPDYRTLFSNLSDQDG
CCCCHHHHHHHHHHHHHHCCCCCEEEEECHHHHHHHHHHHHHHCCCCHHHHHHCCCCCCC
GAIVSQLTQMNIPYRFSEASGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGI
HHHHHHHHHCCCCEEECCCCCCEECCHHHHHHHHHHHHHCCCCCCCCCHHHHHCCHHCCH
SQFSEQVNYQRALEGELSRTIETIGPVKGARVHLAMPKPSLFVREQKSPSASVTVNLLPG
HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCEEEECCCCCCCEEEEEECCC
RALDEGQISAIVHLVSSAVAGLPPGNVTLVDQGGHLLTQSNTSGRDLNDAQLKYASDVEG
CCCCCHHHHHHHHHHHHHHHCCCCCCEEEEECCCEEEEECCCCCCCCCHHHHHHHHHHHH
RIQRRIEAILSPIVGNGNIHAQVTAQLDFASKEQTEEQYRPNGDESHAALRSRQLNESEQ
HHHHHHHHHHHHHCCCCCEEEEEEEEECCCCHHHHHHHHCCCCCHHHHHHHHHHCCCHHH
SGSGYPGGVPGALSNQPAPANNAPISTPPANQNNRQQQASTTSNSGPRSTQRNETSNYEV
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCH
DRTIRHTKMNVGDVQRLSVAVVVNYKTLPDGKPLPLSNEQMKQIEDLTREAMGFSEKRGD
HHHHHHHHCCCCCHHEEEEEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCHHHCCC
SLNVVNSPFNSSDESGGELPFWQQQAFIDQLLAAGRWLLVLLVAWLLWRKAVRPQLTRRA
CCEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHH
EAMKAVQQQAQAREEVEDAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPRVVA
HHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHH
LVIRQWINNDHE
HHHHHHHCCCCH
>Mature Secondary Structure
MNATAAQTKSLEWLNRLRANPKIPLIVAGSAAVAVMVALILWAKAPDYRTLFSNLSDQDG
CCCCHHHHHHHHHHHHHHCCCCCEEEEECHHHHHHHHHHHHHHCCCCHHHHHHCCCCCCC
GAIVSQLTQMNIPYRFSEASGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGI
HHHHHHHHHCCCCEEECCCCCCEECCHHHHHHHHHHHHHCCCCCCCCCHHHHHCCHHCCH
SQFSEQVNYQRALEGELSRTIETIGPVKGARVHLAMPKPSLFVREQKSPSASVTVNLLPG
HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCEEEECCCCCCCEEEEEECCC
RALDEGQISAIVHLVSSAVAGLPPGNVTLVDQGGHLLTQSNTSGRDLNDAQLKYASDVEG
CCCCCHHHHHHHHHHHHHHHCCCCCCEEEEECCCEEEEECCCCCCCCCHHHHHHHHHHHH
RIQRRIEAILSPIVGNGNIHAQVTAQLDFASKEQTEEQYRPNGDESHAALRSRQLNESEQ
HHHHHHHHHHHHHCCCCCEEEEEEEEECCCCHHHHHHHHCCCCCHHHHHHHHHHCCCHHH
SGSGYPGGVPGALSNQPAPANNAPISTPPANQNNRQQQASTTSNSGPRSTQRNETSNYEV
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCH
DRTIRHTKMNVGDVQRLSVAVVVNYKTLPDGKPLPLSNEQMKQIEDLTREAMGFSEKRGD
HHHHHHHHCCCCCHHEEEEEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCHHHCCC
SLNVVNSPFNSSDESGGELPFWQQQAFIDQLLAAGRWLLVLLVAWLLWRKAVRPQLTRRA
CCEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHH
EAMKAVQQQAQAREEVEDAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPRVVA
HHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHH
LVIRQWINNDHE
HHHHHHHCCCCH
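
The prediction strings above can be summarised as the share of residues in each state. The sketch below assumes the usual three-state notation (H for helix, E for strand, C for coil), which this record does not define explicitly. The function name `ss_composition` is illustrative.

```python
from collections import Counter

def ss_composition(states: str) -> dict[str, float]:
    """Percentage of positions in each of the states H, E and C."""
    states = "".join(states.split()).upper()
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {s: round(100 * counts[s] / len(states), 1) for s in "HEC"}

# Final 12 positions of the prediction (residues LVIRQWINNDHE).
print(ss_composition("HHHHHHHCCCCH"))  # {'H': 66.7, 'E': 0.0, 'C': 33.3}
```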

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 9097040; 9278503; 1551848; 8224881