Definition | Burkholderia thailandensis E264 chromosome I, complete sequence. |
---|---|
Accession | NC_007651 |
Length | 3,809,201 |
The map label for this gene is 83719786
Identifier: 83719786
GI number: 83719786
Start: 1979933
End: 1981615
Strand: Direct
Name: 83719786
Synonym: BTH_I1766
Alternate gene names: NA
Gene position: 1979933-1981615 (Clockwise)
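The coordinates above can be checked against the reported sequence length of 1683 bases. The following is a minimal sketch that assumes the positions are 1-based and inclusive; the function name `gene_length` is illustrative and not part of the source database.

```python
def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


length_bp = gene_length(1979933, 1981615)
codons = length_bp // 3  # number of codons, counting the terminal stop codon
print(length_bp, codons)  # prints: 1683 561
```

The 561 codons correspond to the 560 translated residues plus the stop codon.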
Preceding gene: 83721221
Following gene: 83718703
Centisome position: 51.98
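The centisome value appears to express the gene start as a percentage of the chromosome length. The sketch below rests on that assumption (the record does not state the formula), and `centisome` is a hypothetical helper name.

```python
def centisome(start: int, chromosome_length: int) -> float:
    """Position of a coordinate as a percentage of the chromosome length."""
    if chromosome_length <= 0:
        raise ValueError("chromosome_length must be positive")
    return 100.0 * start / chromosome_length


# Start coordinate and chromosome length are taken from this record.
position = round(centisome(1979933, 3809201), 2)
print(position)  # prints: 51.98
```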
GC content: 68.39
Gene sequence:
>1683_bases GTGACGCGGCACGGCGCGAAAGATTGTTCTGTAATAGACTGTAACGGTAAGAAATCAGGCGAAATACGCGACGATTCGAC TCGAATATTTCACTGGCTACAAAAATCCACAGAAACCCTTACGTTTTGGACAGCATGCGTCCTGCAAAGCGCTTACGCTA ACAATGGGGCAACTGCGCGGAAATTGGCCGTATCTGCGTGGATCTGCCGTTCAGAAACAGTGCTTTTCAACACGCGCGGC AAGCGTTGCCGGCTTGTCGCTGCTAAAATTCGCGGCCTACACACCAATCGCGCTCACGGATTCCGTTCTAGTTCTCCCTG CGCTTGCCGCGAAAAGCCGCACGAGGAGCCTCGGGATAGCGGCCGCCAGGCCGGTTCCGGTGCGTTGCGCGTCGTTGAAA GTGCAAGCAAGTCAGAAAGGAGTTGGGATCGTGTTCGAAATCGTCCCCGCCGAAGCATCGCGCTCCCGTTTCGCCCGTCG CTCGATCGTCCGGGGGCGCGCGGCATGACGGAATCGCTTCCCATCGTCGGTTGGGCGCTGCTCGCGCTCGTCTGCGCGTC GTGCGGCTATGCGGTGCTCGCCGCGTTCGCGCCCGCGCCGCGCGTGCCCCGCACGGCCGCGCGCGACGGCTTCGAGCCCG TCAGCGTGCTCAAGCCGCTGTGCGGCTCGGAGCCGCATCTGTATGAAAATCTCGCGACCTTCTGCGAGCAGCGGCATCCG CGCTACCAGCTGCTGTTCGGCGTCGCGTCGGCCGCCGATCCGGCCGCAGCCGTCGTGCGCCGGCTGCAGGCCGACTACCC CGATTGCGACATCGAGCTCGTGATCGACGCGCGCGTGTACGGCTCGAACCTGAAGGTCAGCAATCTCGTCAATCTCGCCG AGCGCGCGCGCCACGGCCGCATCGTGATCGCCGACAGCGACATCGCGGTCGAGCCCGACTATCTGACGCGCGTGACGGCG CCGCTCGCCGATCCGTCGGTCGGCGTCGTCACTTGCCTGTATCATGCGCGCAGCGTCGGCGGCTTCTGGACGCGGATCGG CGCGCAGTTCGTCGATGCGTGGTTCGCGCCGTCGGTCCGGATCACCCATCTCGGCGGGTCGAGCCGTTTCGGGTTCGGCG CGACGCTCGCGTTGACGCGCGCGACGCTCGACGCGATCGGCGGCTTCAAGGCGCTGAAGGACGAGCTCGCGGACGACTAC TGGCTCGCCGAGCTGCCGCGCCGCCTCGGGCGGCGCACGGTGCTCTCCGAGGTGAACGTGGCGACGGACGTCGCGGAGCC GTCGTTCGCGCCGCTGTGGCTGCGCGAGACGCGCTGGCTGCGCACGATCCGCTCGCTGAATCCGGCGGGGTTCGCCTTCC TGTTCATCACGTTCACCGCGCCGTGGCTCGCGATCGGCGCGGCGCTCGCGGCGTGGCTCGGCCTCGCGTCGGCCGCGGGC GCGACGGCCGCGTGGGCGGCCGCGATCGGCGCGTTCGCGCGGCTCGCGCTGCACGCGCGCGGCGCGGCCGGATGGCGCGC GTTCTGGCGCGACTTGCCGCTCGTGCCGGTGCGCGACGCGCTGCTCGCGCTTGAATGGCTCGCCGCCGCGTTCGGCACGC AGGTCGTGTGGCGCGGCGCGCGGATGACGGTGGTCGGCGGCGATGCGCGCGCGACGGTCGTCGAAGCGGGCGACGGGCGC TGA
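GC content can be recomputed from the gene sequence above. This is a small sketch, assuming the value is the percentage of G and C bases over the full sequence; `gc_percent` is an illustrative name, and whitespace is stripped because the sequence is stored in space-separated chunks.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = "".join(sequence.split()).upper()  # drop spaces and line breaks
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Toy input only; pass the full 1683-base sequence to check the reported value.
print(round(gc_percent("GGCC AT"), 2))  # prints: 66.67
```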
Upstream 100 bases:
>100_bases GGATGCAACAGACGGCGTCGATTGTGCCAAAGCGTTCCGGATCGCGTCGGAGGCCGCAAAGATATCCCTGAAGGCGACAG GGACGAAAAAGGGCGTCAAC
Downstream 100 bases:
>100_bases CAGGCGCCGGACACACATACGGGATGCGCGGGCGGCGGCGGGCCGCCCGCGGTTTGTCGAAACATTTGTACTGGAACGAA TTGATATCTATGCAGCAGGC
Product: syl transferase, group 2 family protein
Products: NA
Alternate protein names: Glycosyltransferase; Glycosyl Transferase Family Protein; Glucosyltransferase; Glycosyl Transferase Group 2 Family; Glycosyl Transferase; Cell Wall Biosynthesis Glycosyltransferase; Glycosyl Transferase Group 2 Family Protein; Acyl-CoA Dehydrogenase-Like; Acyl-CoA Dehydrogenase-Like Protein; Glycosyl Transferase Protein; Glycosyl Tranferase Homolog; Cell Wall Biosynthesis Glycosyltransferase-Like Protein; Syl Transferase Group 2 Family Protein; Putatiave Glycosyltransferase; Family 2 Glycosyl Transferase
Number of amino acids: Translated: 560; Mature: 559
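In this record the mature sequence is the translated sequence without its first residue, which is consistent with removal of the initiator methionine. A minimal sketch of that relationship follows, assuming no other processing takes place; `mature_from_translated` is a hypothetical helper.

```python
def mature_from_translated(protein: str) -> str:
    """Drop a leading initiator methionine, if present."""
    seq = "".join(protein.split()).upper()
    return seq[1:] if seq.startswith("M") else seq


# First 20 residues of the translated protein from this record.
prefix = "MTRHGAKDCSVIDCNGKKSG"
print(mature_from_translated(prefix))  # prints: TRHGAKDCSVIDCNGKKSG
```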
Protein sequence:
>560_residues MTRHGAKDCSVIDCNGKKSGEIRDDSTRIFHWLQKSTETLTFWTACVLQSAYANNGATARKLAVSAWICRSETVLFNTRG KRCRLVAAKIRGLHTNRAHGFRSSSPCACREKPHEEPRDSGRQAGSGALRVVESASKSERSWDRVRNRPRRSIALPFRPS LDRPGARGMTESLPIVGWALLALVCASCGYAVLAAFAPAPRVPRTAARDGFEPVSVLKPLCGSEPHLYENLATFCEQRHP RYQLLFGVASAADPAAAVVRRLQADYPDCDIELVIDARVYGSNLKVSNLVNLAERARHGRIVIADSDIAVEPDYLTRVTA PLADPSVGVVTCLYHARSVGGFWTRIGAQFVDAWFAPSVRITHLGGSSRFGFGATLALTRATLDAIGGFKALKDELADDY WLAELPRRLGRRTVLSEVNVATDVAEPSFAPLWLRETRWLRTIRSLNPAGFAFLFITFTAPWLAIGAALAAWLGLASAAG ATAAWAAAIGAFARLALHARGAAGWRAFWRDLPLVPVRDALLALEWLAAAFGTQVVWRGARMTVVGGDARATVVEAGDGR
Sequences:
>Translated_560_residues MTRHGAKDCSVIDCNGKKSGEIRDDSTRIFHWLQKSTETLTFWTACVLQSAYANNGATARKLAVSAWICRSETVLFNTRG KRCRLVAAKIRGLHTNRAHGFRSSSPCACREKPHEEPRDSGRQAGSGALRVVESASKSERSWDRVRNRPRRSIALPFRPS LDRPGARGMTESLPIVGWALLALVCASCGYAVLAAFAPAPRVPRTAARDGFEPVSVLKPLCGSEPHLYENLATFCEQRHP RYQLLFGVASAADPAAAVVRRLQADYPDCDIELVIDARVYGSNLKVSNLVNLAERARHGRIVIADSDIAVEPDYLTRVTA PLADPSVGVVTCLYHARSVGGFWTRIGAQFVDAWFAPSVRITHLGGSSRFGFGATLALTRATLDAIGGFKALKDELADDY WLAELPRRLGRRTVLSEVNVATDVAEPSFAPLWLRETRWLRTIRSLNPAGFAFLFITFTAPWLAIGAALAAWLGLASAAG ATAAWAAAIGAFARLALHARGAAGWRAFWRDLPLVPVRDALLALEWLAAAFGTQVVWRGARMTVVGGDARATVVEAGDGR
>Mature_559_residues TRHGAKDCSVIDCNGKKSGEIRDDSTRIFHWLQKSTETLTFWTACVLQSAYANNGATARKLAVSAWICRSETVLFNTRGK RCRLVAAKIRGLHTNRAHGFRSSSPCACREKPHEEPRDSGRQAGSGALRVVESASKSERSWDRVRNRPRRSIALPFRPSL DRPGARGMTESLPIVGWALLALVCASCGYAVLAAFAPAPRVPRTAARDGFEPVSVLKPLCGSEPHLYENLATFCEQRHPR YQLLFGVASAADPAAAVVRRLQADYPDCDIELVIDARVYGSNLKVSNLVNLAERARHGRIVIADSDIAVEPDYLTRVTAP LADPSVGVVTCLYHARSVGGFWTRIGAQFVDAWFAPSVRITHLGGSSRFGFGATLALTRATLDAIGGFKALKDELADDYW LAELPRRLGRRTVLSEVNVATDVAEPSFAPLWLRETRWLRTIRSLNPAGFAFLFITFTAPWLAIGAALAAWLGLASAAGA TAAWAAAIGAFARLALHARGAAGWRAFWRDLPLVPVRDALLALEWLAAAFGTQVVWRGARMTVVGGDARATVVEAGDGR
Specific function: Unknown
COG id: COG1215
COG function: function code M; Glycosyltransferases, probably involved in cell wall biogenesis
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
Organism=Homo sapiens, GI4507811, Length=240, Percent_Identity=31.25, Blast_Score=103, Evalue=3e-22
Organism=Caenorhabditis elegans, GI32566973, Length=283, Percent_Identity=27.5618374558304, Blast_Score=97, Evalue=3e-20
Organism=Caenorhabditis elegans, GI25147526, Length=239, Percent_Identity=26.7782426778243, Blast_Score=92, Evalue=6e-19
Organism=Caenorhabditis elegans, GI25149580, Length=305, Percent_Identity=26.8852459016393, Blast_Score=90, Evalue=3e-18
Organism=Caenorhabditis elegans, GI25149577, Length=305, Percent_Identity=26.8852459016393, Blast_Score=90, Evalue=3e-18
Organism=Caenorhabditis elegans, GI25149574, Length=305, Percent_Identity=26.8852459016393, Blast_Score=90, Evalue=3e-18
Organism=Caenorhabditis elegans, GI32564728, Length=271, Percent_Identity=27.3062730627306, Blast_Score=89, Evalue=7e-18
Organism=Drosophila melanogaster, GI24657569, Length=208, Percent_Identity=30.2884615384615, Blast_Score=91, Evalue=2e-18
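The homologue entries follow a fixed `Key=value` layout, so they can be read into structured records. The sketch below is one possible parser, not a tool provided by the database; the `Hit` type, `HIT_PATTERN` and `parse_hits` are illustrative names, and the pattern assumes the field order shown in this record.

```python
import re
from typing import List, NamedTuple


class Hit(NamedTuple):
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


# Assumes the field order used in this record's homologue entries.
HIT_PATTERN = re.compile(
    r"Organism=(?P<organism>[^,]+), GI(?P<gi>\d+), Length=(?P<length>\d+), "
    r"Percent_Identity=(?P<pid>[\d.]+), Blast_Score=(?P<score>\d+), "
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_hits(text: str) -> List[Hit]:
    """Extract every homologue entry found in the given text."""
    return [
        Hit(m["organism"], m["gi"], int(m["length"]),
            float(m["pid"]), int(m["score"]), float(m["evalue"]))
        for m in HIT_PATTERN.finditer(text)
    ]


sample = (
    "Organism=Homo sapiens, GI4507811, Length=240, Percent_Identity=31.25, "
    "Blast_Score=103, Evalue=3e-22, "
    "Organism=Drosophila melanogaster, GI24657569, Length=208, "
    "Percent_Identity=30.2884615384615, Blast_Score=91, Evalue=2e-18,"
)
hits = parse_hits(sample)
best = max(hits, key=lambda h: h.blast_score)
print(best.organism, best.blast_score)  # prints: Homo sapiens 103
```

Because the pattern matches entry by entry, it works whether the entries are separated by commas or by line breaks.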
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
NA
Pfam domain/function: NA
EC number: NA
Molecular weight: Translated: 60808; Mature: 60677
Theoretical pI: Translated: 10.14; Mature: 10.14
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
2.3% Cys (Translated Protein)
0.5% Met (Translated Protein)
2.9% Cys+Met (Translated Protein)
2.3% Cys (Mature Protein)
0.4% Met (Mature Protein)
2.7% Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure
MTRHGAKDCSVIDCNGKKSGEIRDDSTRIFHWLQKSTETLTFWTACVLQSAYANNGATAR
CCCCCCCCCCEEECCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHH
KLAVSAWICRSETVLFNTRGKRCRLVAAKIRGLHTNRAHGFRSSSPCACREKPHEEPRDS
HHHHHHHHCCCCEEEEECCCCEEEEEHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHH
GRQAGSGALRVVESASKSERSWDRVRNRPRRSIALPFRPSLDRPGARGMTESLPIVGWAL
HCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCCCCCCCCCCCCCHHHHHHH
LALVCASCGYAVLAAFAPAPRVPRTAARDGFEPVSVLKPLCGSEPHLYENLATFCEQRHP
HHHHHHHCCHHHHHHHCCCCCCCCHHHHCCCCHHHHHHHHHCCCCHHHHHHHHHHHHCCC
RYQLLFGVASAADPAAAVVRRLQADYPDCDIELVIDARVYGSNLKVSNLVNLAERARHGR
CEEEEEEHHCCCCHHHHHHHHHHCCCCCCCEEEEEEEEEECCCCCHHHHHHHHHHHCCCE
IVIADSDIAVEPDYLTRVTAPLADPSVGVVTCLYHARSVGGFWTRIGAQFVDAWFAPSVR
EEEECCCCEECCHHHHHHHCCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCEE
ITHLGGSSRFGFGATLALTRATLDAIGGFKALKDELADDYWLAELPRRLGRRTVLSEVNV
EEECCCCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHH
ATDVAEPSFAPLWLRETRWLRTIRSLNPAGFAFLFITFTAPWLAIGAALAAWLGLASAAG
HHHCCCCCCCCHHHHHHHHHHHHHHCCCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHCC
ATAAWAAAIGAFARLALHARGAAGWRAFWRDLPLVPVRDALLALEWLAAAFGTQVVWRGA
HHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCHHHCCCC
RMTVVGGDARATVVEAGDGR
EEEEECCCCEEEEEECCCCC
>Mature Secondary Structure
TRHGAKDCSVIDCNGKKSGEIRDDSTRIFHWLQKSTETLTFWTACVLQSAYANNGATAR
CCCCCCCCCEEECCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHH
KLAVSAWICRSETVLFNTRGKRCRLVAAKIRGLHTNRAHGFRSSSPCACREKPHEEPRDS
HHHHHHHHCCCCEEEEECCCCEEEEEHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHH
GRQAGSGALRVVESASKSERSWDRVRNRPRRSIALPFRPSLDRPGARGMTESLPIVGWAL
HCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCCCCCCCCCCCCCHHHHHHH
LALVCASCGYAVLAAFAPAPRVPRTAARDGFEPVSVLKPLCGSEPHLYENLATFCEQRHP
HHHHHHHCCHHHHHHHCCCCCCCCHHHHCCCCHHHHHHHHHCCCCHHHHHHHHHHHHCCC
RYQLLFGVASAADPAAAVVRRLQADYPDCDIELVIDARVYGSNLKVSNLVNLAERARHGR
CEEEEEEHHCCCCHHHHHHHHHHCCCCCCCEEEEEEEEEECCCCCHHHHHHHHHHHCCCE
IVIADSDIAVEPDYLTRVTAPLADPSVGVVTCLYHARSVGGFWTRIGAQFVDAWFAPSVR
EEEECCCCEECCHHHHHHHCCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCEE
ITHLGGSSRFGFGATLALTRATLDAIGGFKALKDELADDYWLAELPRRLGRRTVLSEVNV
EEECCCCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHH
ATDVAEPSFAPLWLRETRWLRTIRSLNPAGFAFLFITFTAPWLAIGAALAAWLGLASAAG
HHHCCCCCCCCHHHHHHHHHHHHHHCCCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHCC
ATAAWAAAIGAFARLALHARGAAGWRAFWRDLPLVPVRDALLALEWLAAAFGTQVVWRGA
HHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCHHHCCCC
RMTVVGGDARATVVEAGDGR
EEEEECCCCEEEEEECCCCC
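The per-residue state strings in the secondary structure prediction can be summarised as contiguous segments. The sketch below assumes the common convention that H is helix, E is strand and C is coil, which the record itself does not define; `segments` is an illustrative helper, and positions are reported as 1-based and inclusive.

```python
from itertools import groupby
from typing import List, Tuple


def segments(states: str) -> List[Tuple[str, int, int]]:
    """Run-length encode a state string into (state, start, end) tuples."""
    result = []
    position = 1
    for state, run in groupby(states):
        run_length = len(list(run))
        result.append((state, position, position + run_length - 1))
        position += run_length
    return result


# Toy state string; concatenate the state lines of the record for real use.
print(segments("CCEEEHH"))  # prints: [('C', 1, 2), ('E', 3, 5), ('H', 6, 7)]
```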
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA