Definition Burkholderia thailandensis E264 chromosome I, complete sequence.
Accession NC_007651
Length 3,809,201

The map label for this gene is 83719786

Identifier: 83719786

GI number: 83719786

Start: 1979933

End: 1981615

Strand: Direct

Name: 83719786

Synonym: BTH_I1766

Alternate gene names: NA

Gene position: 1979933-1981615 (Clockwise)

Preceding gene: 83721221

Following gene: 83718703

Centisome position: 51.98
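
The centisome position appears to be the gene start expressed as a percentage of the chromosome length. That definition is an assumption inferred from the values in this record, not a documented one. A minimal Python sketch (the function names are hypothetical) that reproduces it, along with the gene length, from the Start, End, and Length fields:

```python
def centisome(start: int, chromosome_length: int) -> float:
    """Gene start as a percentage of chromosome length (assumed definition)."""
    return round(100.0 * start / chromosome_length, 2)


def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based, inclusive coordinates."""
    return end - start + 1


# Values taken from this record.
print(centisome(1979933, 3809201))    # 51.98
print(gene_length(1979933, 1981615))  # 1683
```

The inclusive-coordinate length matches the 1683-base gene sequence header in this record.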

GC content: 68.39
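
GC content is conventionally the percentage of G and C bases in the sequence. A minimal Python sketch, assuming the value above was computed over the gene sequence alone and rounded to two decimals (the function name is hypothetical):

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and line breaks."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# First ten bases of the gene sequence in this record: 8 of 10 are G or C.
print(gc_content("GTGACGCGGC"))  # 80.0
```

Passing the full sequence block (without its FASTA header line) should give the figure for the whole gene.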

Gene sequence:

>1683_bases
GTGACGCGGCACGGCGCGAAAGATTGTTCTGTAATAGACTGTAACGGTAAGAAATCAGGCGAAATACGCGACGATTCGAC
TCGAATATTTCACTGGCTACAAAAATCCACAGAAACCCTTACGTTTTGGACAGCATGCGTCCTGCAAAGCGCTTACGCTA
ACAATGGGGCAACTGCGCGGAAATTGGCCGTATCTGCGTGGATCTGCCGTTCAGAAACAGTGCTTTTCAACACGCGCGGC
AAGCGTTGCCGGCTTGTCGCTGCTAAAATTCGCGGCCTACACACCAATCGCGCTCACGGATTCCGTTCTAGTTCTCCCTG
CGCTTGCCGCGAAAAGCCGCACGAGGAGCCTCGGGATAGCGGCCGCCAGGCCGGTTCCGGTGCGTTGCGCGTCGTTGAAA
GTGCAAGCAAGTCAGAAAGGAGTTGGGATCGTGTTCGAAATCGTCCCCGCCGAAGCATCGCGCTCCCGTTTCGCCCGTCG
CTCGATCGTCCGGGGGCGCGCGGCATGACGGAATCGCTTCCCATCGTCGGTTGGGCGCTGCTCGCGCTCGTCTGCGCGTC
GTGCGGCTATGCGGTGCTCGCCGCGTTCGCGCCCGCGCCGCGCGTGCCCCGCACGGCCGCGCGCGACGGCTTCGAGCCCG
TCAGCGTGCTCAAGCCGCTGTGCGGCTCGGAGCCGCATCTGTATGAAAATCTCGCGACCTTCTGCGAGCAGCGGCATCCG
CGCTACCAGCTGCTGTTCGGCGTCGCGTCGGCCGCCGATCCGGCCGCAGCCGTCGTGCGCCGGCTGCAGGCCGACTACCC
CGATTGCGACATCGAGCTCGTGATCGACGCGCGCGTGTACGGCTCGAACCTGAAGGTCAGCAATCTCGTCAATCTCGCCG
AGCGCGCGCGCCACGGCCGCATCGTGATCGCCGACAGCGACATCGCGGTCGAGCCCGACTATCTGACGCGCGTGACGGCG
CCGCTCGCCGATCCGTCGGTCGGCGTCGTCACTTGCCTGTATCATGCGCGCAGCGTCGGCGGCTTCTGGACGCGGATCGG
CGCGCAGTTCGTCGATGCGTGGTTCGCGCCGTCGGTCCGGATCACCCATCTCGGCGGGTCGAGCCGTTTCGGGTTCGGCG
CGACGCTCGCGTTGACGCGCGCGACGCTCGACGCGATCGGCGGCTTCAAGGCGCTGAAGGACGAGCTCGCGGACGACTAC
TGGCTCGCCGAGCTGCCGCGCCGCCTCGGGCGGCGCACGGTGCTCTCCGAGGTGAACGTGGCGACGGACGTCGCGGAGCC
GTCGTTCGCGCCGCTGTGGCTGCGCGAGACGCGCTGGCTGCGCACGATCCGCTCGCTGAATCCGGCGGGGTTCGCCTTCC
TGTTCATCACGTTCACCGCGCCGTGGCTCGCGATCGGCGCGGCGCTCGCGGCGTGGCTCGGCCTCGCGTCGGCCGCGGGC
GCGACGGCCGCGTGGGCGGCCGCGATCGGCGCGTTCGCGCGGCTCGCGCTGCACGCGCGCGGCGCGGCCGGATGGCGCGC
GTTCTGGCGCGACTTGCCGCTCGTGCCGGTGCGCGACGCGCTGCTCGCGCTTGAATGGCTCGCCGCCGCGTTCGGCACGC
AGGTCGTGTGGCGCGGCGCGCGGATGACGGTGGTCGGCGGCGATGCGCGCGCGACGGTCGTCGAAGCGGGCGACGGGCGC
TGA

Upstream 100 bases:

>100_bases
GGATGCAACAGACGGCGTCGATTGTGCCAAAGCGTTCCGGATCGCGTCGGAGGCCGCAAAGATATCCCTGAAGGCGACAG
GGACGAAAAAGGGCGTCAAC

Downstream 100 bases:

>100_bases
CAGGCGCCGGACACACATACGGGATGCGCGGGCGGCGGCGGGCCGCCCGCGGTTTGTCGAAACATTTGTACTGGAACGAA
TTGATATCTATGCAGCAGGC

Product: glycosyl transferase, group 2 family protein

Products: NA

Alternate protein names: Glycosyltransferase; Glycosyl Transferase Family Protein; Glucosyltransferase; Glycosyl Transferase Group 2 Family; Glycosyl Transferase; Cell Wall Biosynthesis Glycosyltransferase; Glycosyl Transferase Group 2 Family Protein; Acyl-CoA Dehydrogenase-Like; Acyl-CoA Dehydrogenase-Like Protein; Glycosyl Transferase Protein; Glycosyl Transferase Homolog; Cell Wall Biosynthesis Glycosyltransferase-Like Protein; Putative Glycosyltransferase; Family 2 Glycosyl Transferase

Number of amino acids: Translated: 560; Mature: 559

Protein sequence:

>560_residues
MTRHGAKDCSVIDCNGKKSGEIRDDSTRIFHWLQKSTETLTFWTACVLQSAYANNGATARKLAVSAWICRSETVLFNTRG
KRCRLVAAKIRGLHTNRAHGFRSSSPCACREKPHEEPRDSGRQAGSGALRVVESASKSERSWDRVRNRPRRSIALPFRPS
LDRPGARGMTESLPIVGWALLALVCASCGYAVLAAFAPAPRVPRTAARDGFEPVSVLKPLCGSEPHLYENLATFCEQRHP
RYQLLFGVASAADPAAAVVRRLQADYPDCDIELVIDARVYGSNLKVSNLVNLAERARHGRIVIADSDIAVEPDYLTRVTA
PLADPSVGVVTCLYHARSVGGFWTRIGAQFVDAWFAPSVRITHLGGSSRFGFGATLALTRATLDAIGGFKALKDELADDY
WLAELPRRLGRRTVLSEVNVATDVAEPSFAPLWLRETRWLRTIRSLNPAGFAFLFITFTAPWLAIGAALAAWLGLASAAG
ATAAWAAAIGAFARLALHARGAAGWRAFWRDLPLVPVRDALLALEWLAAAFGTQVVWRGARMTVVGGDARATVVEAGDGR

Sequences:

>Translated_560_residues
MTRHGAKDCSVIDCNGKKSGEIRDDSTRIFHWLQKSTETLTFWTACVLQSAYANNGATARKLAVSAWICRSETVLFNTRG
KRCRLVAAKIRGLHTNRAHGFRSSSPCACREKPHEEPRDSGRQAGSGALRVVESASKSERSWDRVRNRPRRSIALPFRPS
LDRPGARGMTESLPIVGWALLALVCASCGYAVLAAFAPAPRVPRTAARDGFEPVSVLKPLCGSEPHLYENLATFCEQRHP
RYQLLFGVASAADPAAAVVRRLQADYPDCDIELVIDARVYGSNLKVSNLVNLAERARHGRIVIADSDIAVEPDYLTRVTA
PLADPSVGVVTCLYHARSVGGFWTRIGAQFVDAWFAPSVRITHLGGSSRFGFGATLALTRATLDAIGGFKALKDELADDY
WLAELPRRLGRRTVLSEVNVATDVAEPSFAPLWLRETRWLRTIRSLNPAGFAFLFITFTAPWLAIGAALAAWLGLASAAG
ATAAWAAAIGAFARLALHARGAAGWRAFWRDLPLVPVRDALLALEWLAAAFGTQVVWRGARMTVVGGDARATVVEAGDGR
>Mature_559_residues
TRHGAKDCSVIDCNGKKSGEIRDDSTRIFHWLQKSTETLTFWTACVLQSAYANNGATARKLAVSAWICRSETVLFNTRGK
RCRLVAAKIRGLHTNRAHGFRSSSPCACREKPHEEPRDSGRQAGSGALRVVESASKSERSWDRVRNRPRRSIALPFRPSL
DRPGARGMTESLPIVGWALLALVCASCGYAVLAAFAPAPRVPRTAARDGFEPVSVLKPLCGSEPHLYENLATFCEQRHPR
YQLLFGVASAADPAAAVVRRLQADYPDCDIELVIDARVYGSNLKVSNLVNLAERARHGRIVIADSDIAVEPDYLTRVTAP
LADPSVGVVTCLYHARSVGGFWTRIGAQFVDAWFAPSVRITHLGGSSRFGFGATLALTRATLDAIGGFKALKDELADDYW
LAELPRRLGRRTVLSEVNVATDVAEPSFAPLWLRETRWLRTIRSLNPAGFAFLFITFTAPWLAIGAALAAWLGLASAAGA
TAAWAAAIGAFARLALHARGAAGWRAFWRDLPLVPVRDALLALEWLAAAFGTQVVWRGARMTVVGGDARATVVEAGDGR
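
The translated and mature sequences above differ only by the leading methionine. A minimal Python sketch of how the two could be derived from the gene sequence, assuming the standard codon assignments, an alternative start codon (GTG here) read as Met, and a "mature" form that is simply the translation with the initiator Met removed. These assumptions are inferred from this record; the function names are hypothetical:

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence up to the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    if protein:
        protein[0] = "M"  # initiator codon is read as Met (assumption)
    return "".join(protein)


def mature(protein: str) -> str:
    """Drop the initiator Met (assumed meaning of 'mature' in this record)."""
    return protein[1:] if protein.startswith("M") else protein


cds_start = "GTGACGCGGCACGGCGCGAAAGAT"  # first 24 bases of the gene sequence
print(translate(cds_start))          # MTRHGAKD
print(mature(translate(cds_start)))  # TRHGAKD
```

With 1683 bases, the gene holds 561 codons; the last is the TGA stop, which leaves 560 translated residues and 559 after removing the initiator Met.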

Specific function: Unknown

COG id: COG1215

COG function: function code M; Glycosyltransferases, probably involved in cell wall biogenesis

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

Organism=Homo sapiens, GI4507811, Length=240, Percent_Identity=31.25, Blast_Score=103, Evalue=3e-22
Organism=Caenorhabditis elegans, GI32566973, Length=283, Percent_Identity=27.56, Blast_Score=97, Evalue=3e-20
Organism=Caenorhabditis elegans, GI25147526, Length=239, Percent_Identity=26.78, Blast_Score=92, Evalue=6e-19
Organism=Caenorhabditis elegans, GI25149580, Length=305, Percent_Identity=26.89, Blast_Score=90, Evalue=3e-18
Organism=Caenorhabditis elegans, GI25149577, Length=305, Percent_Identity=26.89, Blast_Score=90, Evalue=3e-18
Organism=Caenorhabditis elegans, GI25149574, Length=305, Percent_Identity=26.89, Blast_Score=90, Evalue=3e-18
Organism=Caenorhabditis elegans, GI32564728, Length=271, Percent_Identity=27.31, Blast_Score=89, Evalue=7e-18
Organism=Drosophila melanogaster, GI24657569, Length=208, Percent_Identity=30.29, Blast_Score=91, Evalue=2e-18
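
Each homologue line is a comma-separated list of `key=value` fields plus a bare GI identifier. A minimal Python sketch of a parser for this layout; the function name and the choice of numeric types are assumptions, not part of the source database:

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict with typed numeric fields."""
    record = {}
    # A trailing comma, if present, is dropped before splitting.
    for field in line.strip().rstrip(",").split(", "):
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # keep the identifier as a string
    for key in ("Length", "Blast_Score"):
        record[key] = int(record[key])
    for key in ("Percent_Identity", "Evalue"):
        record[key] = float(record[key])
    return record


hit = parse_homologue(
    "Organism=Homo sapiens, GI4507811, Length=240, "
    "Percent_Identity=31.25, Blast_Score=103, Evalue=3e-22,"
)
print(hit["Organism"], hit["GI"], hit["Evalue"])
```

Parsed records can then be sorted by `Evalue` or filtered by `Percent_Identity` as needed.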

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 60808; Mature: 60677

Theoretical pI: Translated: 10.14; Mature: 10.14

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

2.3 %Cys     (Translated Protein)
0.5 %Met     (Translated Protein)
2.9 %Cys+Met (Translated Protein)
2.3 %Cys     (Mature Protein)
0.4 %Met     (Mature Protein)
2.7 %Cys+Met (Mature Protein)
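
These percentages are residue counts divided by protein length. A minimal Python sketch, assuming one-decimal rounding as shown above (the function name is hypothetical):

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    count = sum(seq.count(r) for r in residues)
    return round(100.0 * count / len(seq), 1)


# First 20 residues of the translated protein: 2 Cys and 1 Met.
fragment = "MTRHGAKDCSVIDCNGKKSG"
print(residue_percent(fragment, "C"))   # 10.0
print(residue_percent(fragment, "M"))   # 5.0
print(residue_percent(fragment, "CM"))  # 15.0
```

Applying the same function to the full translated and mature sequences should give the corresponding rows of the table above.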

Secondary structure:

>Translated Secondary Structure
MTRHGAKDCSVIDCNGKKSGEIRDDSTRIFHWLQKSTETLTFWTACVLQSAYANNGATAR
CCCCCCCCCCEEECCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHH
KLAVSAWICRSETVLFNTRGKRCRLVAAKIRGLHTNRAHGFRSSSPCACREKPHEEPRDS
HHHHHHHHCCCCEEEEECCCCEEEEEHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHH
GRQAGSGALRVVESASKSERSWDRVRNRPRRSIALPFRPSLDRPGARGMTESLPIVGWAL
HCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCCCCCCCCCCCCCHHHHHHH
LALVCASCGYAVLAAFAPAPRVPRTAARDGFEPVSVLKPLCGSEPHLYENLATFCEQRHP
HHHHHHHCCHHHHHHHCCCCCCCCHHHHCCCCHHHHHHHHHCCCCHHHHHHHHHHHHCCC
RYQLLFGVASAADPAAAVVRRLQADYPDCDIELVIDARVYGSNLKVSNLVNLAERARHGR
CEEEEEEHHCCCCHHHHHHHHHHCCCCCCCEEEEEEEEEECCCCCHHHHHHHHHHHCCCE
IVIADSDIAVEPDYLTRVTAPLADPSVGVVTCLYHARSVGGFWTRIGAQFVDAWFAPSVR
EEEECCCCEECCHHHHHHHCCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCEE
ITHLGGSSRFGFGATLALTRATLDAIGGFKALKDELADDYWLAELPRRLGRRTVLSEVNV
EEECCCCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHH
ATDVAEPSFAPLWLRETRWLRTIRSLNPAGFAFLFITFTAPWLAIGAALAAWLGLASAAG
HHHCCCCCCCCHHHHHHHHHHHHHHCCCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHCC
ATAAWAAAIGAFARLALHARGAAGWRAFWRDLPLVPVRDALLALEWLAAAFGTQVVWRGA
HHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCHHHCCCC
RMTVVGGDARATVVEAGDGR
EEEEECCCCEEEEEECCCCC
>Mature Secondary Structure 
TRHGAKDCSVIDCNGKKSGEIRDDSTRIFHWLQKSTETLTFWTACVLQSAYANNGATAR
CCCCCCCCCEEECCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHH
KLAVSAWICRSETVLFNTRGKRCRLVAAKIRGLHTNRAHGFRSSSPCACREKPHEEPRDS
HHHHHHHHCCCCEEEEECCCCEEEEEHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHH
GRQAGSGALRVVESASKSERSWDRVRNRPRRSIALPFRPSLDRPGARGMTESLPIVGWAL
HCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCCCCCCCCCCCCCHHHHHHH
LALVCASCGYAVLAAFAPAPRVPRTAARDGFEPVSVLKPLCGSEPHLYENLATFCEQRHP
HHHHHHHCCHHHHHHHCCCCCCCCHHHHCCCCHHHHHHHHHCCCCHHHHHHHHHHHHCCC
RYQLLFGVASAADPAAAVVRRLQADYPDCDIELVIDARVYGSNLKVSNLVNLAERARHGR
CEEEEEEHHCCCCHHHHHHHHHHCCCCCCCEEEEEEEEEECCCCCHHHHHHHHHHHCCCE
IVIADSDIAVEPDYLTRVTAPLADPSVGVVTCLYHARSVGGFWTRIGAQFVDAWFAPSVR
EEEECCCCEECCHHHHHHHCCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCEE
ITHLGGSSRFGFGATLALTRATLDAIGGFKALKDELADDYWLAELPRRLGRRTVLSEVNV
EEECCCCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHH
ATDVAEPSFAPLWLRETRWLRTIRSLNPAGFAFLFITFTAPWLAIGAALAAWLGLASAAG
HHHCCCCCCCCHHHHHHHHHHHHHHCCCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHCC
ATAAWAAAIGAFARLALHARGAAGWRAFWRDLPLVPVRDALLALEWLAAAFGTQVVWRGA
HHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCHHHCCCC
RMTVVGGDARATVVEAGDGR
EEEEECCCCEEEEEECCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA