Definition Francisella tularensis subsp. holarctica LVS chromosome, complete genome.
Accession NC_007880
Length 1,895,994

The map label for this gene is tapB [H]

Identifier: 89256188

GI number: 89256188

Start: 808481

End: 810262

Strand: Reverse

Name: tapB [H]

Synonym: FTL_0828

Alternate gene names: 89256188

Gene position: 810262-808481 (Counterclockwise)
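
The coordinates above can be checked against the 1782-base gene sequence. The sketch below assumes 1-based, inclusive coordinates and that a reverse-strand gene is read as the reverse complement of the reference strand; both assumptions are consistent with this record but are not stated by it. The helper names `gene_length`, `reverse_complement` and `extract_gene` are illustrative only.

```python
# Sketch only: helper names are hypothetical, not part of the database.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return abs(end - start) + 1


def reverse_complement(seq: str) -> str:
    """Complement each base (A<->T, C<->G), then reverse the string."""
    return seq.translate(COMPLEMENT)[::-1]


def extract_gene(genome: str, start: int, end: int, strand: str) -> str:
    """Slice a gene out of a genome string and orient it 5'->3'."""
    lo, hi = sorted((start, end))
    region = genome[lo - 1:hi]  # convert 1-based inclusive to a Python slice
    if strand.lower() == "reverse":
        return reverse_complement(region)
    return region


print(gene_length(808481, 810262))  # 1782
```

With the full NC_007880 sequence loaded into a string, `extract_gene(genome, 808481, 810262, "Reverse")` would be expected to return the gene sequence listed in this record.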

Preceding gene: 89256190

Following gene: 89256187

Centisome position: 42.74
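
The centisome value appears to be the gene's first base (810262 for this reverse-strand gene) expressed as a percentage of the 1,895,994-base chromosome. That reading is inferred from the numbers in this record, not stated by it; the function name `centisome` is illustrative.

```python
# Sketch only: assumes centisome = 100 * position / genome length.
GENOME_LENGTH = 1_895_994  # Length field of NC_007880 in this record


def centisome(position: int, genome_length: int = GENOME_LENGTH) -> float:
    """Position on the chromosome as a percentage of its total length."""
    return round(100.0 * position / genome_length, 2)


print(centisome(810262))  # 42.74, matches the record
print(centisome(808481))  # 42.64, the other end of the gene
```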

GC content: 33.89
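
GC content is conventionally the percentage of G and C bases in the sequence. For a 1782-base gene, 33.89 would correspond to roughly 604 G or C bases. A minimal sketch, assuming that definition; the function name `gc_percent` is illustrative.

```python
# Sketch only: assumes GC content = 100 * (G + C) / sequence length.
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    seq = seq.upper()
    if not seq:
        return 0.0
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# First 30 bases of the gene sequence in this record.
print(gc_percent("ATGATTGAAAATCCACATATCTGCAAAAAA"))  # 26.67
```

Passing the complete 1782-base sequence would be the way to check the 33.89 figure.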

Gene sequence:

>1782_bases
ATGATTGAAAATCCACATATCTGCAAAAAACTTGCTTCGCTATTACTTCATCGTAAGCTTATCACAAAGCAACAGCTTGA
AGAAATAAGTCACGATAAGTCTTTAGCAAAACAAGACTTTTTAGAATACTTGATTGAAAATGAGATCGTTGATAATAAAT
CTTTTATGATAGAGTCTGCAACATTACTACGTTTGCAATATATTGACTTAACTACCATAAATGTAAAATTTTTACCTCAA
GAATACTTTGACATTGATTTTTGTCGCGAAAATCACTGTCTAGCACTATTTACACGCAAAAGGACTATTCATGTAGCAAT
AGCTAACCCTATTGAAAGTCAACAAATTCTTAAAAAACTTCGCAGTAGATATGATTGTGCATTCATCCCTGTAGTTGCTG
AACTAAATCATCTTAAAGAAAAAATAGAAGAATATGAACAATATCTCAAAGATGAGGGGCAAAGAGAGGATGACGCAAAG
CTTAATGAAAGAATTGATGCAGGATCAGAGCTTGATATTGACTTTGTTGAAGAAGGTGAGCGTGAAGGCGATGCAATAAT
AGGTAGTGGCGAAGATGAAGAAGCTCCTATTATTCGCTTTATCAATAATACTATAGTAGATGCCATCCAAAAAGGTGCTT
CGGATATACATTTTGAACCATATGAAAAAAACTTTCGCATTCGCTATAGGATAGACGGTCTTCTAATTGAGACTTTTAAT
ACTACTAAGAAAAATTTAGCTCCTAAGGTTATTTCTAGACTTAAAATTATGTCAAGCTTAGATATTGCTGAAAAGAGGAT
TCCTCAGGATGGTAAATTTAAAATTTCTCTTTCACGTGAAAAAGCCATCGACTTTCGTGTAAGTACATGTCCTATTAGTT
TCGGTGAAAAAGTTGTACTGCGTATTATTGATTCAAGCTCGACACAAATACCAATAGAACAATTAGGTTTTTCAGAATCA
CAAAAAGAAACCTATCTTAAATATATTCAACAACCTCAAGGCATGGTACTAGTAACAGGACCAACAGGTTCTGGTAAAAC
TGTTACTTTATATACAGGTATTAATATTCTAAATAAGCCCGAAAAAAATATTTCAACCGCCGAAGACCCTGTTGAGCTAA
TTGTAAAAGGTATTAACCAAGTAAGTGTTAATAATAAACAAGGACTTACTTTCGCAGCAGCACTTAAGTCTTTCTTACGT
CAAGATCCAGATATAATCATGGTCGGGGAGATTAGAGATATCGAAACAGGTTCAATTGCAATCAAAGCTTCTCAAACAGG
TCACTTAGTTATGTCAACATTACATACAAACAGTGCTCCAGAGACATTAAACAGACTTGTGGATATGGGCTTGCCTCGAT
ACAATATTGCTACATCTGTAACATTAATTATCGCTCAACGTCTAATTCGAAAGCTATGTCCTAAGTGTAAATTACCAGAT
ACTGATACAGAATTTTCACTTCTTGTCGAAAATAGTGGTTTAAATGATGAGATATTAGCAAAAACATTTGGAACGACTCT
AGATAAAGTAAAAAATGCTAAAATTTACAAAGCTAATCCTAAAGGTTGCCCTAGATGTTTTAAGGGTTACAAAGGCAGAA
TTGGTTTATATGAAGTTATGCCAGTATCAAGGCAAATATCAAGAATGATATTAGAAGATAAAAACACTATGGAAATTGCG
ATACAAGCTCAAAAAGAAGGAATCGCTACTGTCAGACAATCAGCTCTAGTCAGGGTTGCTGAAGATCTAACATCAATGGA
AGAAGTATACCGTGTAAGTTAA

Upstream 100 bases:

>100_bases
CTAAGCTATTTTTGATTAATTTATCTTAAATATTTTGTTTTTTAATAATTATCTTAATCATGCTATAATTTTGCTACTAT
TTGAAGTTTATTTTATATAG

Downstream 100 bases:

>100_bases
AGGGAATAAAAAATATGCTATTTGGCAAAAAAGATAAAAATAAAAGAACTATTACATCATGGAATTATAAAGCTAAGCTA
AAAAGTGGTAAAAAGACTAA

Product: Type IV pili nucleotide binding protein, ABC transporter, ATP-binding protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 593; Mature: 593
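
The residue count follows from the gene length: 1782 bases make 594 codons, and the last codon (TAA) is a stop, leaving 593 amino acids. The sketch below translates a short stretch with the standard codon assignments, which the bacterial table shares for all sense codons; the names `CODON_TABLE` and `translate` are illustrative.

```python
# Sketch only: a minimal translator using the standard codon assignments.
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Codons are generated in TCAG order so they line up with AMINO.
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGATTGAAAATCCACATATCTGCAAAAAA"))  # MIENPHICKK
print(1782 // 3 - 1)  # 593 residues once the stop codon is excluded
```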

Protein sequence:

>593_residues
MIENPHICKKLASLLLHRKLITKQQLEEISHDKSLAKQDFLEYLIENEIVDNKSFMIESATLLRLQYIDLTTINVKFLPQ
EYFDIDFCRENHCLALFTRKRTIHVAIANPIESQQILKKLRSRYDCAFIPVVAELNHLKEKIEEYEQYLKDEGQREDDAK
LNERIDAGSELDIDFVEEGEREGDAIIGSGEDEEAPIIRFINNTIVDAIQKGASDIHFEPYEKNFRIRYRIDGLLIETFN
TTKKNLAPKVISRLKIMSSLDIAEKRIPQDGKFKISLSREKAIDFRVSTCPISFGEKVVLRIIDSSSTQIPIEQLGFSES
QKETYLKYIQQPQGMVLVTGPTGSGKTVTLYTGINILNKPEKNISTAEDPVELIVKGINQVSVNNKQGLTFAAALKSFLR
QDPDIIMVGEIRDIETGSIAIKASQTGHLVMSTLHTNSAPETLNRLVDMGLPRYNIATSVTLIIAQRLIRKLCPKCKLPD
TDTEFSLLVENSGLNDEILAKTFGTTLDKVKNAKIYKANPKGCPRCFKGYKGRIGLYEVMPVSRQISRMILEDKNTMEIA
IQAQKEGIATVRQSALVRVAEDLTSMEEVYRVS

Sequences:

>Translated_593_residues
MIENPHICKKLASLLLHRKLITKQQLEEISHDKSLAKQDFLEYLIENEIVDNKSFMIESATLLRLQYIDLTTINVKFLPQ
EYFDIDFCRENHCLALFTRKRTIHVAIANPIESQQILKKLRSRYDCAFIPVVAELNHLKEKIEEYEQYLKDEGQREDDAK
LNERIDAGSELDIDFVEEGEREGDAIIGSGEDEEAPIIRFINNTIVDAIQKGASDIHFEPYEKNFRIRYRIDGLLIETFN
TTKKNLAPKVISRLKIMSSLDIAEKRIPQDGKFKISLSREKAIDFRVSTCPISFGEKVVLRIIDSSSTQIPIEQLGFSES
QKETYLKYIQQPQGMVLVTGPTGSGKTVTLYTGINILNKPEKNISTAEDPVELIVKGINQVSVNNKQGLTFAAALKSFLR
QDPDIIMVGEIRDIETGSIAIKASQTGHLVMSTLHTNSAPETLNRLVDMGLPRYNIATSVTLIIAQRLIRKLCPKCKLPD
TDTEFSLLVENSGLNDEILAKTFGTTLDKVKNAKIYKANPKGCPRCFKGYKGRIGLYEVMPVSRQISRMILEDKNTMEIA
IQAQKEGIATVRQSALVRVAEDLTSMEEVYRVS
>Mature_593_residues
MIENPHICKKLASLLLHRKLITKQQLEEISHDKSLAKQDFLEYLIENEIVDNKSFMIESATLLRLQYIDLTTINVKFLPQ
EYFDIDFCRENHCLALFTRKRTIHVAIANPIESQQILKKLRSRYDCAFIPVVAELNHLKEKIEEYEQYLKDEGQREDDAK
LNERIDAGSELDIDFVEEGEREGDAIIGSGEDEEAPIIRFINNTIVDAIQKGASDIHFEPYEKNFRIRYRIDGLLIETFN
TTKKNLAPKVISRLKIMSSLDIAEKRIPQDGKFKISLSREKAIDFRVSTCPISFGEKVVLRIIDSSSTQIPIEQLGFSES
QKETYLKYIQQPQGMVLVTGPTGSGKTVTLYTGINILNKPEKNISTAEDPVELIVKGINQVSVNNKQGLTFAAALKSFLR
QDPDIIMVGEIRDIETGSIAIKASQTGHLVMSTLHTNSAPETLNRLVDMGLPRYNIATSVTLIIAQRLIRKLCPKCKLPD
TDTEFSLLVENSGLNDEILAKTFGTTLDKVKNAKIYKANPKGCPRCFKGYKGRIGLYEVMPVSRQISRMILEDKNTMEIA
IQAQKEGIATVRQSALVRVAEDLTSMEEVYRVS

Specific function: Involved in the translocation of the type IV pilin [H]

COG id: COG2804

COG function: function code NU; Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB

Gene ontology:

Cell location: Cytoplasm (Potential) [H]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the GSP E family [H]

Homologues:

Organism=Escherichia coli, GI1789723, Length=408, Percent_Identity=44.1176470588235, Blast_Score=339, Evalue=4e-94
Organism=Escherichia coli, GI1786296, Length=402, Percent_Identity=41.2935323383085, Blast_Score=331, Evalue=7e-92
Organism=Escherichia coli, GI87082188, Length=274, Percent_Identity=32.4817518248175, Blast_Score=120, Evalue=2e-28
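
Each homologue entry is a comma-separated list of `key=value` fields plus a bare GI identifier. The long Percent_Identity decimals look like a ratio of identical positions to the stated Length (44.1176...% of 408 is 180), though the record does not say so. A minimal parsing sketch; the function name `parse_hit` and the `GI` dictionary key are illustrative.

```python
# Sketch only: parses one homologue line from this record into a dict.
def parse_hit(line: str) -> dict:
    """Split 'key=value, GI..., key=value' into typed fields."""
    fields = [f.strip() for f in line.strip().rstrip(",").split(",") if f.strip()]
    hit = {}
    for field in fields:
        if "=" in field:
            key, value = field.split("=", 1)
            hit[key] = value
        elif field.startswith("GI"):
            hit["GI"] = field[2:]  # bare identifier without a key
    # Convert the numeric fields; a missing field is simply skipped.
    for key, cast in (("Length", int), ("Percent_Identity", float),
                      ("Blast_Score", int), ("Evalue", float)):
        if key in hit:
            hit[key] = cast(hit[key])
    return hit


line = ("Organism=Escherichia coli, GI1789723, Length=408, "
        "Percent_Identity=44.1176470588235, Blast_Score=339, Evalue=4e-94,")
hit = parse_hit(line)
print(hit["Organism"], hit["GI"], hit["Blast_Score"])
print(round(hit["Length"] * hit["Percent_Identity"] / 100))  # 180
```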

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR013374
- InterPro:   IPR007831
- InterPro:   IPR001482 [H]

Pfam domain/function: PF00437 GSPII_E; PF05157 GSPII_E_N [H]

EC number: NA

Molecular weight: Translated: 67092; Mature: 67092

Theoretical pI: Translated: 6.61; Mature: 6.61

Prosite motif: PS00124 FBPASE; PS00211 ABC_TRANSPORTER_1; PS00662 T2SP_E

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.5 %Cys     (Translated Protein)
1.9 %Met     (Translated Protein)
3.4 %Cys+Met (Translated Protein)
1.5 %Cys     (Mature Protein)
1.9 %Met     (Mature Protein)
3.4 %Cys+Met (Mature Protein)
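
These percentages are residue counts divided by the 593-residue protein length. A hand tally of the sequence in this record gives 9 Cys and 11 Met, which reproduces the values above; the tally is worth re-checking programmatically. A minimal sketch, with the illustrative function name `residue_percent`:

```python
# Sketch only: percentage of selected residues in a protein string.
def residue_percent(protein: str, residues: str) -> float:
    """Share of the given one-letter residues, as a percentage."""
    protein = protein.upper()
    count = sum(protein.count(r) for r in residues.upper())
    return round(100.0 * count / len(protein), 1)


# Hand-tallied counts from the 593-residue sequence (assumed, see above).
print(round(100 * 9 / 593, 1))         # 1.5 %Cys
print(round(100 * 11 / 593, 1))        # 1.9 %Met
print(round(100 * (9 + 11) / 593, 1))  # 3.4 %Cys+Met
```

Calling `residue_percent(sequence, "C")`, `residue_percent(sequence, "M")` and `residue_percent(sequence, "CM")` on the full protein string would recompute the three figures directly.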

Secondary structure:

>Translated Secondary Structure
MIENPHICKKLASLLLHRKLITKQQLEEISHDKSLAKQDFLEYLIENEIVDNKSFMIESA
CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHEEEEE
TLLRLQYIDLTTINVKFLPQEYFDIDFCRENHCLALFTRKRTIHVAIANPIESQQILKKL
EEEEEEEEEEEEEEEEECCHHHCCCEEECCCCEEEEEEECEEEEEEECCCCCHHHHHHHH
RSRYDCAFIPVVAELNHLKEKIEEYEQYLKDEGQREDDAKLNERIDAGSELDIDFVEEGE
HHHCCEEEHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHCCCCCCCCCHHHHCCC
REGDAIIGSGEDEEAPIIRFINNTIVDAIQKGASDIHFEPYEKNFRIRYRIDGLLIETFN
CCCCEEECCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEEEEEEECC
TTKKNLAPKVISRLKIMSSLDIAEKRIPQDGKFKISLSREKAIDFRVSTCPISFGEKVVL
CCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCEEEEEEECCCCCCCEEEE
RIIDSSSTQIPIEQLGFSESQKETYLKYIQQPQGMVLVTGPTGSGKTVTLYTGINILNKP
EEECCCCCCCCHHHCCCCCHHHHHHHHHHCCCCCEEEEECCCCCCCEEEEEECCCCCCCC
EKNISTAEDPVELIVKGINQVSVNNKQGLTFAAALKSFLRQDPDIIMVGEIRDIETGSIA
CCCCCCCCHHHHHHHHCCHHEECCCCCCCCHHHHHHHHHHCCCCEEEEECCEECCCCCEE
IKASQTGHLVMSTLHTNSAPETLNRLVDMGLPRYNIATSVTLIIAQRLIRKLCPKCKLPD
EEECCCCCEEEEECCCCCCHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC
TDTEFSLLVENSGLNDEILAKTFGTTLDKVKNAKIYKANPKGCPRCFKGYKGRIGLYEVM
CCCCEEEEEECCCCCHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHCCCCCCCCEEEEC
PVSRQISRMILEDKNTMEIAIQAQKEGIATVRQSALVRVAEDLTSMEEVYRVS
CHHHHHHHHHHCCCCCEEEEEEECHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
>Mature Secondary Structure
MIENPHICKKLASLLLHRKLITKQQLEEISHDKSLAKQDFLEYLIENEIVDNKSFMIESA
CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHEEEEE
TLLRLQYIDLTTINVKFLPQEYFDIDFCRENHCLALFTRKRTIHVAIANPIESQQILKKL
EEEEEEEEEEEEEEEEECCHHHCCCEEECCCCEEEEEEECEEEEEEECCCCCHHHHHHHH
RSRYDCAFIPVVAELNHLKEKIEEYEQYLKDEGQREDDAKLNERIDAGSELDIDFVEEGE
HHHCCEEEHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHCCCCCCCCCHHHHCCC
REGDAIIGSGEDEEAPIIRFINNTIVDAIQKGASDIHFEPYEKNFRIRYRIDGLLIETFN
CCCCEEECCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEEEEEEECC
TTKKNLAPKVISRLKIMSSLDIAEKRIPQDGKFKISLSREKAIDFRVSTCPISFGEKVVL
CCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCEEEEEEECCCCCCCEEEE
RIIDSSSTQIPIEQLGFSESQKETYLKYIQQPQGMVLVTGPTGSGKTVTLYTGINILNKP
EEECCCCCCCCHHHCCCCCHHHHHHHHHHCCCCCEEEEECCCCCCCEEEEEECCCCCCCC
EKNISTAEDPVELIVKGINQVSVNNKQGLTFAAALKSFLRQDPDIIMVGEIRDIETGSIA
CCCCCCCCHHHHHHHHCCHHEECCCCCCCCHHHHHHHHHHCCCCEEEEECCEECCCCCEE
IKASQTGHLVMSTLHTNSAPETLNRLVDMGLPRYNIATSVTLIIAQRLIRKLCPKCKLPD
EEECCCCCEEEEECCCCCCHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC
TDTEFSLLVENSGLNDEILAKTFGTTLDKVKNAKIYKANPKGCPRCFKGYKGRIGLYEVM
CCCCEEEEEECCCCCHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHCCCCCCCCEEEEC
PVSRQISRMILEDKNTMEIAIQAQKEGIATVRQSALVRVAEDLTSMEEVYRVS
CHHHHHHHHHHCCCCCEEEEEEECHHHHHHHHHHHHHHHHHHHHHHHHHHCCC
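
The secondary structure blocks interleave residue rows with prediction rows of equal length. The sketch below pairs them up and summarises the predicted states, assuming the conventional reading of H as helix, E as strand and C as coil, which the record does not define. The function names and the toy input are illustrative.

```python
# Sketch only: de-interleave a residue/state block and summarise states.
from collections import Counter


def split_interleaved(lines):
    """Join residue rows (even index) and state rows (odd index)."""
    residues = "".join(lines[0::2])
    states = "".join(lines[1::2])
    if len(residues) != len(states):
        raise ValueError("residue and state rows differ in length")
    return residues, states


def state_percentages(states):
    """Percentage of H, E and C characters in a state string."""
    counts = Counter(states)
    total = len(states)
    return {s: round(100.0 * counts.get(s, 0) / total, 1) for s in "HEC"}


# Toy block in the same layout as the record (not real data).
toy = ["MIENPH", "CCCCHH", "ICKK", "HHEE"]
residues, states = split_interleaved(toy)
print(residues, states)
print(state_percentages(states))  # {'H': 40.0, 'E': 20.0, 'C': 40.0}
```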

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8820654 [H]