Definition | Helicobacter pylori 26695, complete genome. |
---|---|
Accession | NC_000915 |
Length | 1,667,867 |
The map label for this gene is flgK
Identifier: 15645733
GI number: 15645733
Start: 1184620
End: 1186440
Strand: Reverse
Name: flgK
Synonym: HP1119
Alternate gene names: 15645733
Gene position: 1186440-1184620 (Counterclockwise)
Preceding gene: 15645734
Following gene: 15645732
Centisome position: 71.14
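The coordinate fields above are tied together by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming that the centisome value is taken from the first transcribed base of the gene (the End coordinate for a reverse-strand gene). That convention is inferred from the numbers in this record, not stated by the database; the function names are hypothetical.

```python
# Values copied from this record.
GENOME_LENGTH = 1_667_867   # Length of NC_000915
START, END = 1_184_620, 1_186_440
STRAND = "Reverse"


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, end: int, strand: str, genome_length: int) -> float:
    """Position as a percentage of the genome length.

    Assumption: the reference point is the first transcribed base,
    i.e. `end` for a reverse-strand gene and `start` otherwise.
    """
    first_base = end if strand == "Reverse" else start
    return 100.0 * first_base / genome_length


length = gene_length(START, END)
position = centisome(START, END, STRAND, GENOME_LENGTH)
print(length, round(position, 2))
```

Under these assumptions the sketch reproduces the 1821-base gene length and the centisome position of 71.14 listed in this record.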
GC content: 39.54
Gene sequence:
>1821_bases ATGGGCGGAATCTTATCTTCACTCAACACTTCTTACACCGGCCTTCAAGCCCATCAGAGCATGGTGGATGTTACCGGGAA TAATATTTCTAACGCTAGCGATGAATTTTATAGCCGCCAGCGCGTGATTGCAAAGCCCCAAGCGGCCTATATGTATGGCA CTAAAAACGTGAATATGGGCGTGGATGTGGAAGCCATTGAAAGGGTGCATGATGAGTTTGTTTTTGCTCGTTACACGAAA GCTAATTACGAAAACACTTATTACGATACAGAATTTTCGCATTTAAAAGAAGCGAGCGCGTATTTTCCGGACATTGATGA AGCGAGCCTTTTTACGGATTTGCAAGATTATTTTAATTCATGGAAAGAATTGTCTAAAAACGCCAAAGACTCCGCTCAAA AACAGGCTCTCGCTCAAAAAACAGAAGCTTTAACGCACAACATTAAAGACACCAGAGAGAGGTTAACGACCTTACAGCAC AAGGCGAGTGAAGAATTAAAAAGCGTCATTAAAGAAGTCAATAGCTTGGGTTCTCAAATCGCTGAGATTAACAAACGCAT TAAAGAAGTGGAAAACAACAAGAGTTTAAAGCATGCGAACGAATTAAGGGATAAGCGAGATGAATTGGAATTCCATTTGC GAGAGCTTTTAGGGGGGAATGTTTTTAAAAGCAGCATTAAGACTCATTCGCTCACCGATAAAGACTCAGCGGATTTTGAT GAGAGCTATAACCTTAATATCGGGCATGGGTTCAATATCATTGATGGCTCTATTTTCCATCCTTTAGTGGTTAAAGAATC CGAAAATAAAGGGGGTTTGAACCAGGTTTATTTTCAAAGCGATGATTTTAAGGTTACTAATATTACTGACAAGCTCAATC AGGGAAGAGTGGGGGCGTTATTGAATGTGTATAATGACGGCTCTAACGGGACTTTAAAGGGCAAATTACAAGATTATATT GATTTGTTGGATTCTTTTGCTAAGGGTTTGATAGAATCCACTAATGCGATTTACGCTCAAAGCGCGAGTCATTATATTGA GGGCGAGCCGGTGGAGTTTAATAGCGATGAAGCCTTTAAAGACACTAACTACAATATCAAAAACGGCTCGTTTGACTTAA TCGCTTACAACACCGATGGTAAAGAAATCGCTAGAAAAACCATTGCTATCACGCCCATTACAACCATGAACGATATTATC CAAGCCATTAACGCTAACACTGATGACAATCAGGACAATAACACCGAAAACGATTTTGATGATTATTTCACAGCGGGCTT TAACAATGAGACTAAAAAGTTTGTTATCCAGCCTAAAAACGCTTCGCAAGGGTTGTTTGTCTCTATGAAAGATAACGGCA CGAATTTTATGGGAGCGTTAAAACTCAACCCTTTTTTTCAAGGCGATGACGCTTCTAATATCAGCTTGAATAAGGAATAC AAAAAAGAGCCTACCACTATCCGCCCATGGCTTGCTCCCATTAATGGGAATTTTGATGTGGCGAACATGATGCAGCAATT GCAATACGATAGCGTGGATTTTTATAACGATAAGTTTGACATTAAACCAATGAAAATCAGCGAGTTTTATCAATTTTTAA CCGGTAAAATCAACACGGACGCTGAAAAATCCGGGCGTATTTTGGACACTAAAAAGAGCATGTTAGAAACCATTAAAAAA GAGCAACTCTCTATTTCGCAAGTGAGCGTGGATGAAGAAATGGTGAATTTGATCAAGTTTCAAAGCGGCTATGCGGCTAA CGCTAAAGTCATTACCGCTATTGATCGGATGATAGACACTTTATTGGGGATTAAACAATAA
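The GC content field is the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows; the function name and the toy input are illustrative assumptions, and whitespace is stripped because the sequence in this record is wrapped with spaces.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Remove the spaces/newlines used to wrap the sequence and normalise case.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Toy input (the first 12 bases of the gene), not the full sequence.
toy = "ATGGGC GGAATC"
print(round(gc_content(toy), 2))
```

Passing the full 1821-base sequence to the same function should give a value comparable to the GC content reported for this gene.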
Upstream 100 bases:
>100_bases TTTTATTCTTCGCTCATCCAACAAATCATTCCCCATGACACTTGCGATTATAAAGGCTCTAGGCATGTGGGGAGTCATTT TTTAAGAGTGCAGGCGTAAA
Downstream 100 bases:
>100_bases GTTTTCTACCCATAGCGTTTTGATCAAATAAGCCTTATTTAACTTATTTTTTAAACTCTATCTATTTTAAAACTCATTTT TGAGCCTTTTTTATAGCTAG
Product: flagellar hook-associated protein FlgK
Products: NA
Alternate protein names: HAP1 [H]
Number of amino acids: Translated: 606; Mature: 605
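The translated and mature lengths follow from the gene length. A minimal sketch, assuming the last codon is a stop codon and that the mature form differs from the translated form only by removal of the initiator methionine (consistent with the sequences in this record, where the mature protein starts at the second residue). The Met residue mass is the standard average value; the variable names are hypothetical.

```python
GENE_BASES = 1821            # from the gene sequence header
MET_RESIDUE_MASS = 131.19    # average residue mass of Met in Da (standard value)

# The coding sequence must be a whole number of codons.
assert GENE_BASES % 3 == 0
codons = GENE_BASES // 3

# One codon is the stop codon, so it does not encode a residue.
translated_length = codons - 1

# Assumption: only the initiator Met is removed in the mature protein.
mature_length = translated_length - 1

# The molecular weights listed in this record should differ by about one Met residue.
weight_difference = 68351 - 68220

print(codons, translated_length, mature_length, weight_difference)
```

The 131 Da difference between the two listed molecular weights matches the mass of a single methionine residue, which supports the assumption about how the mature form is derived.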
Protein sequence:
>606_residues MGGILSSLNTSYTGLQAHQSMVDVTGNNISNASDEFYSRQRVIAKPQAAYMYGTKNVNMGVDVEAIERVHDEFVFARYTK ANYENTYYDTEFSHLKEASAYFPDIDEASLFTDLQDYFNSWKELSKNAKDSAQKQALAQKTEALTHNIKDTRERLTTLQH KASEELKSVIKEVNSLGSQIAEINKRIKEVENNKSLKHANELRDKRDELEFHLRELLGGNVFKSSIKTHSLTDKDSADFD ESYNLNIGHGFNIIDGSIFHPLVVKESENKGGLNQVYFQSDDFKVTNITDKLNQGRVGALLNVYNDGSNGTLKGKLQDYI DLLDSFAKGLIESTNAIYAQSASHYIEGEPVEFNSDEAFKDTNYNIKNGSFDLIAYNTDGKEIARKTIAITPITTMNDII QAINANTDDNQDNNTENDFDDYFTAGFNNETKKFVIQPKNASQGLFVSMKDNGTNFMGALKLNPFFQGDDASNISLNKEY KKEPTTIRPWLAPINGNFDVANMMQQLQYDSVDFYNDKFDIKPMKISEFYQFLTGKINTDAEKSGRILDTKKSMLETIKK EQLSISQVSVDEEMVNLIKFQSGYAANAKVITAIDRMIDTLLGIKQ
Sequences:
>Translated_606_residues MGGILSSLNTSYTGLQAHQSMVDVTGNNISNASDEFYSRQRVIAKPQAAYMYGTKNVNMGVDVEAIERVHDEFVFARYTK ANYENTYYDTEFSHLKEASAYFPDIDEASLFTDLQDYFNSWKELSKNAKDSAQKQALAQKTEALTHNIKDTRERLTTLQH KASEELKSVIKEVNSLGSQIAEINKRIKEVENNKSLKHANELRDKRDELEFHLRELLGGNVFKSSIKTHSLTDKDSADFD ESYNLNIGHGFNIIDGSIFHPLVVKESENKGGLNQVYFQSDDFKVTNITDKLNQGRVGALLNVYNDGSNGTLKGKLQDYI DLLDSFAKGLIESTNAIYAQSASHYIEGEPVEFNSDEAFKDTNYNIKNGSFDLIAYNTDGKEIARKTIAITPITTMNDII QAINANTDDNQDNNTENDFDDYFTAGFNNETKKFVIQPKNASQGLFVSMKDNGTNFMGALKLNPFFQGDDASNISLNKEY KKEPTTIRPWLAPINGNFDVANMMQQLQYDSVDFYNDKFDIKPMKISEFYQFLTGKINTDAEKSGRILDTKKSMLETIKK EQLSISQVSVDEEMVNLIKFQSGYAANAKVITAIDRMIDTLLGIKQ >Mature_605_residues GGILSSLNTSYTGLQAHQSMVDVTGNNISNASDEFYSRQRVIAKPQAAYMYGTKNVNMGVDVEAIERVHDEFVFARYTKA NYENTYYDTEFSHLKEASAYFPDIDEASLFTDLQDYFNSWKELSKNAKDSAQKQALAQKTEALTHNIKDTRERLTTLQHK ASEELKSVIKEVNSLGSQIAEINKRIKEVENNKSLKHANELRDKRDELEFHLRELLGGNVFKSSIKTHSLTDKDSADFDE SYNLNIGHGFNIIDGSIFHPLVVKESENKGGLNQVYFQSDDFKVTNITDKLNQGRVGALLNVYNDGSNGTLKGKLQDYID LLDSFAKGLIESTNAIYAQSASHYIEGEPVEFNSDEAFKDTNYNIKNGSFDLIAYNTDGKEIARKTIAITPITTMNDIIQ AINANTDDNQDNNTENDFDDYFTAGFNNETKKFVIQPKNASQGLFVSMKDNGTNFMGALKLNPFFQGDDASNISLNKEYK KEPTTIRPWLAPINGNFDVANMMQQLQYDSVDFYNDKFDIKPMKISEFYQFLTGKINTDAEKSGRILDTKKSMLETIKKE QLSISQVSVDEEMVNLIKFQSGYAANAKVITAIDRMIDTLLGIKQ
Specific function: Unknown
COG id: COG1256
COG function: function code N; Flagellar hook-associated protein
Gene ontology:
Cell location: Secreted. Bacterial flagellum (By similarity) [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the flagella basal body rod proteins family [H]
Homologues:
Organism=Escherichia coli, GI1787323, Length=299, Percent_Identity=25.752508361204, Blast_Score=99, Evalue=8e-22
Paralogues:
None
Copy number: 10-20 (rich media) [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR010930
- InterPro: IPR001444
- InterPro: IPR019776
- InterPro: IPR002371 [H]
Pfam domain/function: PF06429 DUF1078; PF00460 Flg_bb_rod [H]
EC number: NA
Molecular weight: Translated: 68351; Mature: 68220
Theoretical pI: Translated: 4.84; Mature: 4.84
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.0 %Cys (Translated Protein)
2.1 %Met (Translated Protein)
2.1 %Cys+Met (Translated Protein)
0.0 %Cys (Mature Protein)
2.0 %Met (Mature Protein)
2.0 %Cys+Met (Mature Protein)
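The Cys/Met percentages are residue counts divided by the protein length. A minimal sketch of that calculation, assuming one-letter amino acid codes and rounding to one decimal place as in this record; the function name and the toy sequence are illustrative, not part of the database.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    # Strip the whitespace used to wrap sequences and normalise case.
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    count = sum(seq.count(residue) for residue in residues.upper())
    return round(100.0 * count / len(seq), 1)


# Toy 10-residue sequence, not the FlgK protein.
toy = "MGGILCSLNM"
print(residue_percent(toy, "C"))    # Cys
print(residue_percent(toy, "M"))    # Met
print(residue_percent(toy, "CM"))   # Cys+Met
```

Running the function on the translated and mature sequences separately would account for the small difference in %Met between the two forms, since the mature form lacks the initiator methionine.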
Secondary structure:
>Translated Secondary Structure MGGILSSLNTSYTGLQAHQSMVDVTGNNISNASDEFYSRQRVIAKPQAAYMYGTKNVNMG CCCCHHHCCCCCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCEEEEEECCCCCCC VDVEAIERVHDEFVFARYTKANYENTYYDTEFSHLKEASAYFPDIDEASLFTDLQDYFNS CCHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHH WKELSKNAKDSAQKQALAQKTEALTHNIKDTRERLTTLQHKASEELKSVIKEVNSLGSQI HHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH AEINKRIKEVENNKSLKHANELRDKRDELEFHLRELLGGNVFKSSIKTHSLTDKDSADFD HHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCCCCCCCCCCC ESYNLNIGHGFNIIDGSIFHPLVVKESENKGGLNQVYFQSDDFKVTNITDKLNQGRVGAL CCCCEECCCCCCCCCCCCCCCEEEEECCCCCCCCEEEEECCCEEEEEHHHHHCCCCCEEE LNVYNDGSNGTLKGKLQDYIDLLDSFAKGLIESTNAIYAQSASHYIEGEPVEFNSDEAFK EEEEECCCCCEEECHHHHHHHHHHHHHHHHHHCCCHHEECCCCCCCCCCCCCCCCCCCCC DTNYNIKNGSFDLIAYNTDGKEIARKTIAITPITTMNDIIQAINANTDDNQDNNTENDFD CCCCEECCCCEEEEEECCCHHHHHHHEEEEECCHHHHHHHHHHCCCCCCCCCCCCCCCHH DYFTAGFNNETKKFVIQPKNASQGLFVSMKDNGTNFMGALKLNPFFQGDDASNISLNKEY HHHHCCCCCCCCEEEEECCCCCCCEEEEEECCCCCEEEEEEECCEECCCCCCCCCCCHHH KKEPTTIRPWLAPINGNFDVANMMQQLQYDSVDFYNDKFDIKPMKISEFYQFLTGKINTD CCCCCEECCEEECCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCC AEKSGRILDTKKSMLETIKKEQLSISQVSVDEEMVNLIKFQSGYAANAKVITAIDRMIDT HHHCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHH LLGIKQ HHCCCC >Mature Secondary Structure GGILSSLNTSYTGLQAHQSMVDVTGNNISNASDEFYSRQRVIAKPQAAYMYGTKNVNMG CCCHHHCCCCCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCEEEEEECCCCCCC VDVEAIERVHDEFVFARYTKANYENTYYDTEFSHLKEASAYFPDIDEASLFTDLQDYFNS CCHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHH WKELSKNAKDSAQKQALAQKTEALTHNIKDTRERLTTLQHKASEELKSVIKEVNSLGSQI HHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH AEINKRIKEVENNKSLKHANELRDKRDELEFHLRELLGGNVFKSSIKTHSLTDKDSADFD HHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCCCCCCCCCCC ESYNLNIGHGFNIIDGSIFHPLVVKESENKGGLNQVYFQSDDFKVTNITDKLNQGRVGAL CCCCEECCCCCCCCCCCCCCCEEEEECCCCCCCCEEEEECCCEEEEEHHHHHCCCCCEEE LNVYNDGSNGTLKGKLQDYIDLLDSFAKGLIESTNAIYAQSASHYIEGEPVEFNSDEAFK 
EEEEECCCCCEEECHHHHHHHHHHHHHHHHHHCCCHHEECCCCCCCCCCCCCCCCCCCCC DTNYNIKNGSFDLIAYNTDGKEIARKTIAITPITTMNDIIQAINANTDDNQDNNTENDFD CCCCEECCCCEEEEEECCCHHHHHHHEEEEECCHHHHHHHHHHCCCCCCCCCCCCCCCHH DYFTAGFNNETKKFVIQPKNASQGLFVSMKDNGTNFMGALKLNPFFQGDDASNISLNKEY HHHHCCCCCCCCEEEEECCCCCCCEEEEEECCCCCEEEEEEECCEECCCCCCCCCCCHHH KKEPTTIRPWLAPINGNFDVANMMQQLQYDSVDFYNDKFDIKPMKISEFYQFLTGKINTD CCCCCEECCEEECCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCC AEKSGRILDTKKSMLETIKKEQLSISQVSVDEEMVNLIKFQSGYAANAKVITAIDRMIDT HHHCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHH LLGIKQ HHCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9384377; 8045879 [H]