Definition Bacillus licheniformis ATCC 14580, complete genome.
Accession NC_006322
Length 4,222,645

The map label for this gene is tagF [H]

Identifier: 52787497

GI number: 52787497

Start: 3640629

End: 3642764

Strand: Reverse

Name: tagF [H]

Synonym: BLi03817

Alternate gene names: 52787497

Gene position: 3642764-3640629 (Counterclockwise)
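
Note: the Start and End values can be cross-checked against the sequence length and the protein length reported in this record. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the last codon is a stop codon; the function names are illustrative and not part of any database tool.

```python
# Coordinates copied from this record (assumed 1-based, inclusive).
START = 3640629
END = 3642764


def gene_length(start: int, end: int) -> int:
    """Number of bases covered by a feature with inclusive coordinates."""
    return abs(end - start) + 1


def residue_count(length_in_bases: int) -> int:
    """Amino acids encoded, assuming the final codon is a stop codon."""
    if length_in_bases % 3 != 0:
        raise ValueError("length is not a whole number of codons")
    return length_in_bases // 3 - 1


length = gene_length(START, END)
print(length)                 # 2136
print(residue_count(length))  # 711
```

Both values agree with the 2136-base gene sequence and the 711-residue protein listed in this record.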

Preceding gene: 52787498

Following gene: 52787496

Centisome position: 86.27
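
Note: the centisome position is the gene's location expressed as a percentage of the genome length. The sketch below assumes the record uses the first base of the gene on the reverse strand (3642764) as the reference coordinate; this is inferred from the arithmetic and is not stated in the record. The function name is illustrative.

```python
GENOME_LENGTH = 4222645  # "Length" field of this record
GENE_START = 3642764     # first base of the gene on the reverse strand (assumed reference point)


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the total genome length."""
    if not 0 < position <= genome_length:
        raise ValueError("position lies outside the genome")
    return 100.0 * position / genome_length


print(round(centisome(GENE_START, GENOME_LENGTH), 2))  # 86.27
```

Using the lower coordinate (3640629) instead gives 86.22, which does not match the reported value.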

GC content: 39.98
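
Note: GC content is the percentage of G and C bases in the gene sequence. A minimal sketch of the calculation is given here, assuming the sequence contains only unambiguous bases; the function name is illustrative, and the example uses a toy sequence rather than the full gene.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    # Drop whitespace so multi-line FASTA bodies can be pasted in directly.
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


print(gc_content("ATGC"))  # 50.0
```

Applied to the full 2136-base gene sequence, the function should return a value comparable to the GC content field above.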

Gene sequence:

>2136_bases
ATGGATAGTGCAATTCCGATAGAATTTCAGCTGCGAGAAATAAAAATTAACGACGCTCGCCTCTATTTTGTGTTCGAAAG
TTCTATCTCTCTTAATCATGTCGGTATTTTGGCAGTGAACCGCAATTCGAAAAAAGAGATGCGTGTAACTTGTAACAAAA
TAAGCGATTCAGGGCAGTTAAAAGCAGCGGAGATATCTTTAAATAACCTTTCTAACCTGATAATAGACGAAACGGTCATT
GATTTTTATGTGCTATATGAAAATAATGAAAAAAATCAATGTAAAAAGAGGATTTACACAGCCTCAAAACCGATTGAGCT
GTATTGGCATCAGGATCGAGTCAAACAGTTCATTTATTTGCCGTATACGACCCGAAAAGGCCACTTCGCTCTTGATGTTT
CACGGAGAAAAGCAGTAGCGGAACCGGACAGCATTAAGTTAAGCTCCGAAGGAAAACTAATGATCAGAGGCTATACTTTT
CTGCTCGATGCGGAAGAATGTTCGATAATAAAAAGCAGAAAGCTGGTTTTAAAGGAAATTAGCCAAAACGATAAAGAAAG
GGTTTTCAATTTCCCTTTAAAAGGCGTTCAGCGCGTCGATTTGTCAGCTGATTCTTCTGATCATGCCGTTGGATTTGAGG
CGGAAATCGATCTGAAAAAGCTGTATGAGGAGAACAGGATGCCTGCTTTCTTTAAGTTTTATATAGAATACGTAGGTGAA
GATGCTGCAACAGGGGAGGAAATCGCTATAAAAAGCAGGGCATTCAAGCTTATAGATGATATAGAAAGTTTTTCAATTGT
TAACACAAAAAAAGGACCGGCTCGTTTTCACATTTATCCGCAGAAGCGAAAACGGGGTATGCGTCTAAGAATGAATGACT
ATACGATCAAAACCCGTGCGGTTTATTTTGCCAAAGGAAAAGGAAAAAGGCTGATGTCAGTCATACGCAAAGGCAAGAAT
AAAGCAAAGAAAAAGATTTCGGGCATGGTGAAGAGAGCATACTATTTTACTTTTGGACTGGCCGGCAAACTCCCTGTTAA
AAAGAAAACCGTCATTTTTGAAAGCTTTGCCGGCAAGCAGTACAGCTGCAATCCGAGAGCAATTTACGAATATATGAAGG
AACATCATCCGGAATACAACTTAATATGGAGTGTAAATCCGAGTTATACAGAAATTTTTGAAGAAAAAAATGTCCCTTAT
ATTCACCGATTCACGCTGAAGTGGCTGTTCGCCATGGCAAGGGCCGAATATTGGGTTGTAAACAGCCGGCTTCCTTTGTG
GATCCCAAAACCAAAGCACACGACATATGTGCAGACATGGCACGGGACTCCGCTGAAAAGGCTTGCCGTCGATATGGAAG
AAGTGCACATGCCGGGCACCAACACTGAGAAGTACAAGCAAAATTTCACTAAAGAAGCATCGAAGTGGGACTATTTAATC
TCCCCGAATCGCTATTCAACCGAAATCTTTGCCCGCGCTTTTCAATTTAACAAAACAATGATTGAGTCCGGCTATCCGAG
GAATGACTTTCTGTATACGGACAACCGCCCGGAAACGATGAAGGCCATAAAGAGAAAAATGAATATCCCCGAAGATAAAA
AGGTCATTTTATATGCACCGACATGGCGGGACGATCAGTTCTATAAAAAAGGAAAATACAAATTCGACCTTGATTTGAAT
TTGGAAAAGCTGCGAGAGGAGATCGGCGATAATTATGTCATCGTTTTAAGAATGCACTATTTAGTCGCGGAAAACTTTGA
CCTATCGCCGTACAAAGGATTCGCATATGATTTCTCATCATACGAAGATATTCGCGAACTTTATATGGTATCTGATTTGT
TAATTACAGATTACTCTTCAGTATTCTTTGATTTCGCAAATTTGAAGCGTCCGATGATTTTCTTCGTTCCAGACATTGAA
ACCTATCGTGACAAGCTGCGCGGCTTTTATTTTGACTTTGAACAAGAAGCACCAGGACCGTTGGTCAAAACGACTGAGGA
AGTAATTGAGAAAATCAAGGAAACAGAGACATCTGACTATCGTCTGCCTGACATTTTTGAGCCTTTTTATGAAAAGTTTT
GTTACCTGGAGACGGGCAATTCAACTGAAAAAGTTGTAAAAACGGTATTTAAATAA

Upstream 100 bases:

>100_bases
AAAAATAAGAAGTGCTTTTTGTTTCGTATACGAATCATTTTAATATAATAAGTATGAACGGCTTTGCTTTTGAATAAAAA
AAGAGGAGTAGACTGATTTT

Downstream 100 bases:

>100_bases
ATACAAAAAAGCCATTGCTTTATGTAATGGCTTTTTTGTAAAATATCTTCTGATAACAATAAAGAAAGAAGGCGAAGTTT
TCAGAAGGAAAAATTTAAGC

Product: TagF

Products: NA

Alternate protein names: CGPTase; Major teichoic acid biosynthesis protein F; Polyglycerol phosphate polymerase [H]

Number of amino acids: Translated: 711; Mature: 711

Protein sequence:

>711_residues
MDSAIPIEFQLREIKINDARLYFVFESSISLNHVGILAVNRNSKKEMRVTCNKISDSGQLKAAEISLNNLSNLIIDETVI
DFYVLYENNEKNQCKKRIYTASKPIELYWHQDRVKQFIYLPYTTRKGHFALDVSRRKAVAEPDSIKLSSEGKLMIRGYTF
LLDAEECSIIKSRKLVLKEISQNDKERVFNFPLKGVQRVDLSADSSDHAVGFEAEIDLKKLYEENRMPAFFKFYIEYVGE
DAATGEEIAIKSRAFKLIDDIESFSIVNTKKGPARFHIYPQKRKRGMRLRMNDYTIKTRAVYFAKGKGKRLMSVIRKGKN
KAKKKISGMVKRAYYFTFGLAGKLPVKKKTVIFESFAGKQYSCNPRAIYEYMKEHHPEYNLIWSVNPSYTEIFEEKNVPY
IHRFTLKWLFAMARAEYWVVNSRLPLWIPKPKHTTYVQTWHGTPLKRLAVDMEEVHMPGTNTEKYKQNFTKEASKWDYLI
SPNRYSTEIFARAFQFNKTMIESGYPRNDFLYTDNRPETMKAIKRKMNIPEDKKVILYAPTWRDDQFYKKGKYKFDLDLN
LEKLREEIGDNYVIVLRMHYLVAENFDLSPYKGFAYDFSSYEDIRELYMVSDLLITDYSSVFFDFANLKRPMIFFVPDIE
TYRDKLRGFYFDFEQEAPGPLVKTTEEVIEKIKETETSDYRLPDIFEPFYEKFCYLETGNSTEKVVKTVFK

Sequences:

>Translated_711_residues
MDSAIPIEFQLREIKINDARLYFVFESSISLNHVGILAVNRNSKKEMRVTCNKISDSGQLKAAEISLNNLSNLIIDETVI
DFYVLYENNEKNQCKKRIYTASKPIELYWHQDRVKQFIYLPYTTRKGHFALDVSRRKAVAEPDSIKLSSEGKLMIRGYTF
LLDAEECSIIKSRKLVLKEISQNDKERVFNFPLKGVQRVDLSADSSDHAVGFEAEIDLKKLYEENRMPAFFKFYIEYVGE
DAATGEEIAIKSRAFKLIDDIESFSIVNTKKGPARFHIYPQKRKRGMRLRMNDYTIKTRAVYFAKGKGKRLMSVIRKGKN
KAKKKISGMVKRAYYFTFGLAGKLPVKKKTVIFESFAGKQYSCNPRAIYEYMKEHHPEYNLIWSVNPSYTEIFEEKNVPY
IHRFTLKWLFAMARAEYWVVNSRLPLWIPKPKHTTYVQTWHGTPLKRLAVDMEEVHMPGTNTEKYKQNFTKEASKWDYLI
SPNRYSTEIFARAFQFNKTMIESGYPRNDFLYTDNRPETMKAIKRKMNIPEDKKVILYAPTWRDDQFYKKGKYKFDLDLN
LEKLREEIGDNYVIVLRMHYLVAENFDLSPYKGFAYDFSSYEDIRELYMVSDLLITDYSSVFFDFANLKRPMIFFVPDIE
TYRDKLRGFYFDFEQEAPGPLVKTTEEVIEKIKETETSDYRLPDIFEPFYEKFCYLETGNSTEKVVKTVFK
>Mature_711_residues
MDSAIPIEFQLREIKINDARLYFVFESSISLNHVGILAVNRNSKKEMRVTCNKISDSGQLKAAEISLNNLSNLIIDETVI
DFYVLYENNEKNQCKKRIYTASKPIELYWHQDRVKQFIYLPYTTRKGHFALDVSRRKAVAEPDSIKLSSEGKLMIRGYTF
LLDAEECSIIKSRKLVLKEISQNDKERVFNFPLKGVQRVDLSADSSDHAVGFEAEIDLKKLYEENRMPAFFKFYIEYVGE
DAATGEEIAIKSRAFKLIDDIESFSIVNTKKGPARFHIYPQKRKRGMRLRMNDYTIKTRAVYFAKGKGKRLMSVIRKGKN
KAKKKISGMVKRAYYFTFGLAGKLPVKKKTVIFESFAGKQYSCNPRAIYEYMKEHHPEYNLIWSVNPSYTEIFEEKNVPY
IHRFTLKWLFAMARAEYWVVNSRLPLWIPKPKHTTYVQTWHGTPLKRLAVDMEEVHMPGTNTEKYKQNFTKEASKWDYLI
SPNRYSTEIFARAFQFNKTMIESGYPRNDFLYTDNRPETMKAIKRKMNIPEDKKVILYAPTWRDDQFYKKGKYKFDLDLN
LEKLREEIGDNYVIVLRMHYLVAENFDLSPYKGFAYDFSSYEDIRELYMVSDLLITDYSSVFFDFANLKRPMIFFVPDIE
TYRDKLRGFYFDFEQEAPGPLVKTTEEVIEKIKETETSDYRLPDIFEPFYEKFCYLETGNSTEKVVKTVFK

Specific function: Is responsible for the polymerization of the main chain of the major teichoic acid by sequential transfer of glycerol phosphate units from CDP-glycerol to the disaccharide linkage unit. Synthesizes polymers of approximately 35 glycerol phosphate units in

COG id: COG1887

COG function: function code M; Putative glycosyl/glycerophosphate transferases involved in teichoic acid biosynthesis TagF/TagB/EpsJ/RodC

Gene ontology:

Cell location: Cell membrane; Peripheral membrane protein [H]

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the CDP-glycerol glycerophosphotransferase family [H]

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro: IPR007554 [H]

Pfam domain/function: PF04464 Glyphos_transf [H]

EC number: 2.7.8.12 [H]

Molecular weight: Translated: 83847; Mature: 83847

Theoretical pI: Translated: 9.58; Mature: 9.58

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.7 %Cys     (Translated Protein)
2.5 %Met     (Translated Protein)
3.2 %Cys+Met (Translated Protein)
0.7 %Cys     (Mature Protein)
2.5 %Met     (Mature Protein)
3.2 %Cys+Met (Mature Protein)
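
Note: the Cys/Met percentages are residue counts divided by the protein length. A minimal sketch is given here, assuming one-letter amino acid codes; the function name is illustrative, and the example uses only the first ten residues of the protein sequence in this record.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    # Drop whitespace so multi-line sequence blocks can be pasted in directly.
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    wanted = set(residues.upper())
    count = sum(1 for aa in seq if aa in wanted)
    return 100.0 * count / len(seq)


first_ten = "MDSAIPIEFQ"  # first ten residues of the protein sequence in this record
print(residue_percent(first_ten, "M"))   # 10.0
print(residue_percent(first_ten, "C"))   # 0.0
print(residue_percent(first_ten, "CM"))  # 10.0
```

Passing the full 711-residue sequence with "C", "M" and "CM" should give values comparable to the three percentages listed above.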

Secondary structure:

>Translated Secondary Structure
MDSAIPIEFQLREIKINDARLYFVFESSISLNHVGILAVNRNSKKEMRVTCNKISDSGQL
CCCCCCEEEEEEEEEECCEEEEEEEECCCCCCEEEEEEECCCCCCCEEEEEEECCCCCCE
KAAEISLNNLSNLIIDETVIDFYVLYENNEKNQCKKRIYTASKPIELYWHQDRVKQFIYL
EEEEEEHHHHHHHHHHHEEEEEEEEEECCCHHHHHHHHHCCCCCEEEEEECCCCCEEEEE
PYTTRKGHFALDVSRRKAVAEPDSIKLSSEGKLMIRGYTFLLDAEECSIIKSRKLVLKEI
EEECCCCCEEEEEHHCCCCCCCCCEEECCCCEEEEEEEEEEEECHHHHHHHHHHHHHHHH
SQNDKERVFNFPLKGVQRVDLSADSSDHAVGFEAEIDLKKLYEENRMPAFFKFYIEYVGE
CCCCHHHEEECCCCCEEEEECCCCCCCCEEEEEECCHHHHHHHHCCCCHHHHHHHHHHCC
DAATGEEIAIKSRAFKLIDDIESFSIVNTKKGPARFHIYPQKRKRGMRLRMNDYTIKTRA
CCCCCCCEEEHHHHHHHHHHHHHCEEEECCCCCEEEEEEEHHHCCCCEEEECCEEEEEEE
VYFAKGKGKRLMSVIRKGKNKAKKKISGMVKRAYYFTFGLAGKLPVKKKTVIFESFAGKQ
EEEECCCCHHHHHHHHHCHHHHHHHHHHHHHHHHEEEEECCCCCCCCCEEEEEECCCCCC
YSCNPRAIYEYMKEHHPEYNLIWSVNPSYTEIFEEKNVPYIHRFTLKWLFAMARAEYWVV
CCCCHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHCCCCEEEHHHHHHHHHHHHCEEEEE
NSRLPLWIPKPKHTTYVQTWHGTPLKRLAVDMEEVHMPGTNTEKYKQNFTKEASKWDYLI
ECCCCEEECCCCCCEEEEEECCCCHHHHHCCHHHHCCCCCCHHHHHHHHHHHHHCCCEEE
SPNRYSTEIFARAFQFNKTMIESGYPRNDFLYTDNRPETMKAIKRKMNIPEDKKVILYAP
CCCCCCHHHHHHHHHHCHHHHHCCCCCCCEEEECCCCHHHHHHHHHCCCCCCCEEEEECC
TWRDDQFYKKGKYKFDLDLNLEKLREEIGDNYVIVLRMHYLVAENFDLSPYKGFAYDFSS
CCCCCHHHHCCCEEEEECCCHHHHHHHHCCCEEEEEEEEEHHHCCCCCCCCCCCEECCCC
YEDIRELYMVSDLLITDYSSVFFDFANLKRPMIFFVPDIETYRDKLRGFYFDFEQEAPGP
HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHCCEEEEECCCCCCC
LVKTTEEVIEKIKETETSDYRLPDIFEPFYEKFCYLETGNSTEKVVKTVFK
HHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHEEEEECCCCHHHHHHHHCC
>Mature Secondary Structure
MDSAIPIEFQLREIKINDARLYFVFESSISLNHVGILAVNRNSKKEMRVTCNKISDSGQL
CCCCCCEEEEEEEEEECCEEEEEEEECCCCCCEEEEEEECCCCCCCEEEEEEECCCCCCE
KAAEISLNNLSNLIIDETVIDFYVLYENNEKNQCKKRIYTASKPIELYWHQDRVKQFIYL
EEEEEEHHHHHHHHHHHEEEEEEEEEECCCHHHHHHHHHCCCCCEEEEEECCCCCEEEEE
PYTTRKGHFALDVSRRKAVAEPDSIKLSSEGKLMIRGYTFLLDAEECSIIKSRKLVLKEI
EEECCCCCEEEEEHHCCCCCCCCCEEECCCCEEEEEEEEEEEECHHHHHHHHHHHHHHHH
SQNDKERVFNFPLKGVQRVDLSADSSDHAVGFEAEIDLKKLYEENRMPAFFKFYIEYVGE
CCCCHHHEEECCCCCEEEEECCCCCCCCEEEEEECCHHHHHHHHCCCCHHHHHHHHHHCC
DAATGEEIAIKSRAFKLIDDIESFSIVNTKKGPARFHIYPQKRKRGMRLRMNDYTIKTRA
CCCCCCCEEEHHHHHHHHHHHHHCEEEECCCCCEEEEEEEHHHCCCCEEEECCEEEEEEE
VYFAKGKGKRLMSVIRKGKNKAKKKISGMVKRAYYFTFGLAGKLPVKKKTVIFESFAGKQ
EEEECCCCHHHHHHHHHCHHHHHHHHHHHHHHHHEEEEECCCCCCCCCEEEEEECCCCCC
YSCNPRAIYEYMKEHHPEYNLIWSVNPSYTEIFEEKNVPYIHRFTLKWLFAMARAEYWVV
CCCCHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHCCCCEEEHHHHHHHHHHHHCEEEEE
NSRLPLWIPKPKHTTYVQTWHGTPLKRLAVDMEEVHMPGTNTEKYKQNFTKEASKWDYLI
ECCCCEEECCCCCCEEEEEECCCCHHHHHCCHHHHCCCCCCHHHHHHHHHHHHHCCCEEE
SPNRYSTEIFARAFQFNKTMIESGYPRNDFLYTDNRPETMKAIKRKMNIPEDKKVILYAP
CCCCCCHHHHHHHHHHCHHHHHCCCCCCCEEEECCCCHHHHHHHHHCCCCCCCEEEEECC
TWRDDQFYKKGKYKFDLDLNLEKLREEIGDNYVIVLRMHYLVAENFDLSPYKGFAYDFSS
CCCCCHHHHCCCEEEEECCCHHHHHHHHCCCEEEEEEEEEHHHCCCCCCCCCCCEECCCC
YEDIRELYMVSDLLITDYSSVFFDFANLKRPMIFFVPDIETYRDKLRGFYFDFEQEAPGP
HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHCCEEEEECCCCCCC
LVKTTEEVIEKIKETETSDYRLPDIFEPFYEKFCYLETGNSTEKVVKTVFK
HHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHEEEEECCCCHHHHHHHHCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 2507871; 9384377; 1309530; 12637499 [H]