Definition Burkholderia glumae BGR1 chromosome 1, complete sequence.
Accession NC_012724
Length 3,906,507 bp

The map label for this gene is yfbI [C]

Identifier: 238028167

GI number: 238028167

Start: 2981425

End: 2983290

Strand: Direct

Name: yfbI [C]

Synonym: bglu_1g26190

Alternate gene names: 238028167

Gene position: 2981425-2983290 (Clockwise)

Preceding gene: 238028166

Following gene: 238028179

Centisome position: 76.32
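
The gene length and centisome position can be rechecked from the coordinates above. This is a minimal sketch, assuming 1-based inclusive coordinates and assuming the centisome position is the start coordinate expressed as a percentage of the chromosome length; the record states neither convention, and the function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, chromosome_length: int) -> float:
    """Start coordinate as a percentage of chromosome length (assumed definition)."""
    return round(100.0 * start / chromosome_length, 2)


# Values taken from this record.
START, END, CHROMOSOME_LENGTH = 2981425, 2983290, 3906507

print(gene_length(START, END))              # 1866, matching the ">1866_bases" header
print(centisome(START, CHROMOSOME_LENGTH))  # 76.32, matching the centisome field
```

Under these assumptions both derived fields agree with the values listed in the record.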

GC content: 71.7
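
A GC content figure of this kind is usually the share of G and C bases in the sequence. The sketch below assumes that definition and a one-decimal percentage; `gc_percent` is a hypothetical helper, not part of any tool named in this record.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks (as in FASTA-style blocks) are ignored.
    """
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 1)


# First 12 bases of the gene sequence in this record: 8 of 12 are G or C.
print(gc_percent("ATGCAGGGACCC"))  # 66.7
```

Passing the full 1,866-base gene sequence to the same function would give the gene-level value to compare against the field above.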

Gene sequence:

>1866_bases
ATGCAGGGACCCGCAACTCCCGGCGGTGCGCGCTCGTCCGCACCCGCCCCCTTTCGTCATACCAAAGGCTCGGAGCGCCC
ACCCGCCGGCAAGACAAGCGCCACCACCCAGGCCGCAATGCCGCTCGCCGACCACGCCTGGCCGCGCGTGCAGGCAGCCG
CATTCGCGCTGACCGGGCTGCGCGTCTGGCTGCTGGTCGCCACGGTGGCCTGCGCGTATCTGCTGCCCGGCATCCTCGGC
CACGACCCTTGGAAGCAGGACGAGACCTACACGTTCGGCATCATCCAGCACATGCTCGACACCGGCGACCTGATCGTGCC
GACCAACGCCGGCCAGCCGTTCCTCGAGAAGCCGCCGCTCTACGATTGGACCGCGGCGGCGCTGGCCAAGCTGTTCGGCC
GCTACCTGCCGCTGCACGACGCGGCGCGGCTGGCGAGCGCGCTGTTCGCTGCGCTCGCGTTCGGCTTCATGGCCCGCGCC
GCGCGCATTGCGAGCGGCGCGCGGCGCTGGTTTGATCTCGACGTGATCGGCCCGGTTGCGCTCTGCGCGGGCACGCTGGT
GGTGGTCAAGCACGTCCACGACATGATGACCGACGTGGCGCTGATGGCCGGTACCGCGATCGCGTTCTCGTCGCTGCTCG
AACTCGTGACGGCCCGCTGCGGCGGGCGCCCCGCGGGCCGCTTCGCGGCGGCCGGCTTCGGCGCGGGGGTCGGCATCGCG
CTGATGACCAAGGGCCTGTTCGTGCCACTGGTGTTCGGCGCGACGCTCGCGGCACTGCTCGTGCTGTATCCGGCCTGCCG
CAGCCGTGCGTTCTTCCGCTCGCTAGGCGTGGCCGCGCTGGTGTTCGCGCCGTTCGCGCTGATCTGGCCGATCGCGCTGC
TGCTGCGCTCCGAGACGCTGTTCATGACTTGGTTCTGGGACAACAACGTCGGCCGCTTCTTCGGCTTTTCGGTGCCGCGT
CTCGGCGCCGAGAACGATAAGCCGCTGTTCATCTGGCGCGCGCTGCTCACGGTCGGCTTCCCGGTCGGCCCGCTCGCGGC
CTGGGCGCTCGCCGCAGGCCGCTGGCGTGCCTGGCGCGTGCCGGCCGTCGGGCTGCCCGCGCTGTTCGCGGGCATCGGCA
TGATCGTGCTGCAGATGGCCGCCACCTCGCGCCAGCTCTACATCCTGCCGTTCATCGCGCCGCTCGCGCTGTTGGCGGCG
GACGCCGTGGCCCGCCTGCCGACGCGGGTCGCGCGCGGCTGGGACTACCTGAGCCGCGTGCTGTTCGGCGCGGCGGCCGG
CCTCGCCTGGTACATCTGGTCGATCGTCACCTCGTACCACGCGTCGCGCGAGCCGATTCGCTGGCTCGGCCGCTGGCTGC
CGCTCGACTGGACCGCGCCGCTCGAGTTGCCGTTCACGCTCGCCGCGGTGGCACTGACGATCGGCTGGCTGTGGCTGCTG
CCGTCGCTGCGCACGACCGGCAAATGGCGCGGCGCGCTATCGTGGGCGACCGGCGTGCTGCTGGTGTGGGGGCTGATCTT
CACGCTGCTGCTGCCGTGGCTCGACGCCGCCAAGAGCTATCGCTCGGTGTTCGCGAGCCTCGACCGGCAGCTCAAGCCGC
AGTGGAGCGAAGGCGATTGCATGGCGAGCCTCGGGCTCGGCGAGTCGGAAGCGCCGATGCTCTATTACTTCACCGGCATC
CAGCACCGGCCGGTCGCCAACCCCGCCGACACGGCCTGCACCTGGCTGATCGTGCAGGGGCTGCGCGCGGTCACGCCGGA
GCCCGGCGCCGAGTGGCAGCCGTTCTGGTCGGGCGCGCGGCCCGGCGACGACAAGGAGCTGCTGCGCGTCTACGTGCGCA
CCCCGCAAGGCGGCGCCAAGCCTTGA

Upstream 100 bases:

>100_bases
AGGGCCATCGTGGTATCCGCCGCGTTTCGCCTCCCCACGGCAATCCTCACCAACCGGCCGAAGTCGCTGGCTGGGCAACC
GATTTCTGAGACGTTCTTCG

Downstream 100 bases:

>100_bases
GCCAAGCGGCGCGAGCGCCGCGCCGCCGCACGCCTCAACCCGGTCGATTGCGCATCCAGTCGGCCGTATCGTAGAACGAA
TGCATGAGCCGCTCGCGCAG

Product: glycosyltransferase

Products: NA

Alternate protein names: Glycosyl Transferase Family Protein; Glycosyltransferase; 4-Amino-4-Deoxy-L-Arabinose Transferase; Inner Membrane Protein

Number of amino acids: Translated: 621; Mature: 621
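
The residue count follows from the gene length: 1,866 bases are 622 codons, that is 621 residues plus one stop codon. The sketch below translates a DNA string using the standard codon assignments, which is an assumption since the record does not name a translation table; `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Whitespace is ignored; ambiguous bases such as N raise KeyError.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six and last four codons of the gene sequence in this record.
print(translate("ATGCAGGGACCCGCAACT"))  # MQGPAT
print(translate("GCCAAGCCTTGA"))        # AKP (TGA is the stop codon)
```

The two fragments reproduce the first six and last three residues of the protein sequence listed in this record.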

Protein sequence:

>621_residues
MQGPATPGGARSSAPAPFRHTKGSERPPAGKTSATTQAAMPLADHAWPRVQAAAFALTGLRVWLLVATVACAYLLPGILG
HDPWKQDETYTFGIIQHMLDTGDLIVPTNAGQPFLEKPPLYDWTAAALAKLFGRYLPLHDAARLASALFAALAFGFMARA
ARIASGARRWFDLDVIGPVALCAGTLVVVKHVHDMMTDVALMAGTAIAFSSLLELVTARCGGRPAGRFAAAGFGAGVGIA
LMTKGLFVPLVFGATLAALLVLYPACRSRAFFRSLGVAALVFAPFALIWPIALLLRSETLFMTWFWDNNVGRFFGFSVPR
LGAENDKPLFIWRALLTVGFPVGPLAAWALAAGRWRAWRVPAVGLPALFAGIGMIVLQMAATSRQLYILPFIAPLALLAA
DAVARLPTRVARGWDYLSRVLFGAAAGLAWYIWSIVTSYHASREPIRWLGRWLPLDWTAPLELPFTLAAVALTIGWLWLL
PSLRTTGKWRGALSWATGVLLVWGLIFTLLLPWLDAAKSYRSVFASLDRQLKPQWSEGDCMASLGLGESEAPMLYYFTGI
QHRPVANPADTACTWLIVQGLRAVTPEPGAEWQPFWSGARPGDDKELLRVYVRTPQGGAKP

Sequences:

>Translated_621_residues
MQGPATPGGARSSAPAPFRHTKGSERPPAGKTSATTQAAMPLADHAWPRVQAAAFALTGLRVWLLVATVACAYLLPGILG
HDPWKQDETYTFGIIQHMLDTGDLIVPTNAGQPFLEKPPLYDWTAAALAKLFGRYLPLHDAARLASALFAALAFGFMARA
ARIASGARRWFDLDVIGPVALCAGTLVVVKHVHDMMTDVALMAGTAIAFSSLLELVTARCGGRPAGRFAAAGFGAGVGIA
LMTKGLFVPLVFGATLAALLVLYPACRSRAFFRSLGVAALVFAPFALIWPIALLLRSETLFMTWFWDNNVGRFFGFSVPR
LGAENDKPLFIWRALLTVGFPVGPLAAWALAAGRWRAWRVPAVGLPALFAGIGMIVLQMAATSRQLYILPFIAPLALLAA
DAVARLPTRVARGWDYLSRVLFGAAAGLAWYIWSIVTSYHASREPIRWLGRWLPLDWTAPLELPFTLAAVALTIGWLWLL
PSLRTTGKWRGALSWATGVLLVWGLIFTLLLPWLDAAKSYRSVFASLDRQLKPQWSEGDCMASLGLGESEAPMLYYFTGI
QHRPVANPADTACTWLIVQGLRAVTPEPGAEWQPFWSGARPGDDKELLRVYVRTPQGGAKP
>Mature_621_residues
MQGPATPGGARSSAPAPFRHTKGSERPPAGKTSATTQAAMPLADHAWPRVQAAAFALTGLRVWLLVATVACAYLLPGILG
HDPWKQDETYTFGIIQHMLDTGDLIVPTNAGQPFLEKPPLYDWTAAALAKLFGRYLPLHDAARLASALFAALAFGFMARA
ARIASGARRWFDLDVIGPVALCAGTLVVVKHVHDMMTDVALMAGTAIAFSSLLELVTARCGGRPAGRFAAAGFGAGVGIA
LMTKGLFVPLVFGATLAALLVLYPACRSRAFFRSLGVAALVFAPFALIWPIALLLRSETLFMTWFWDNNVGRFFGFSVPR
LGAENDKPLFIWRALLTVGFPVGPLAAWALAAGRWRAWRVPAVGLPALFAGIGMIVLQMAATSRQLYILPFIAPLALLAA
DAVARLPTRVARGWDYLSRVLFGAAAGLAWYIWSIVTSYHASREPIRWLGRWLPLDWTAPLELPFTLAAVALTIGWLWLL
PSLRTTGKWRGALSWATGVLLVWGLIFTLLLPWLDAAKSYRSVFASLDRQLKPQWSEGDCMASLGLGESEAPMLYYFTGI
QHRPVANPADTACTWLIVQGLRAVTPEPGAEWQPFWSGARPGDDKELLRVYVRTPQGGAKP

Specific function: Unknown

COG id: COG1807

COG function: function code M; 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family

Gene ontology:

Cell location: Integral membrane protein (Potential) [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 67247; Mature: 67247

Theoretical pI: Translated: 10.28; Mature: 10.28

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.0 %Cys     (Translated Protein)
2.1 %Met     (Translated Protein)
3.1 %Cys+Met (Translated Protein)
1.0 %Cys     (Mature Protein)
2.1 %Met     (Mature Protein)
3.1 %Cys+Met (Mature Protein)
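
Percentages like these can be recomputed by counting residues in the protein sequence. This is a minimal sketch, assuming each value is the residue count divided by the sequence length and rounded to one decimal; `residue_percent` is a hypothetical helper.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of any of the given one-letter residues."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in residues.upper())
    return round(100.0 * count / len(protein), 1)


# Toy 10-residue peptide with one Cys and one Met.
peptide = "MCAAAAAAAA"
print(residue_percent(peptide, "C"))   # 10.0
print(residue_percent(peptide, "M"))   # 10.0
print(residue_percent(peptide, "CM"))  # 20.0
```

Applied to the 621-residue sequence in this record, the same function gives the values to compare with the table above.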

Secondary structure:

>Translated Secondary Structure
MQGPATPGGARSSAPAPFRHTKGSERPPAGKTSATTQAAMPLADHAWPRVQAAAFALTGL
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCHHCCCHHHHHHHHHHHHH
RVWLLVATVACAYLLPGILGHDPWKQDETYTFGIIQHMLDTGDLIVPTNAGQPFLEKPPL
HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHCCCCEEEECCCCCCCCCCCCC
YDWTAAALAKLFGRYLPLHDAARLASALFAALAFGFMARAARIASGARRWFDLDVIGPVA
CCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEEHHHHHHH
LCAGTLVVVKHVHDMMTDVALMAGTAIAFSSLLELVTARCGGRPAGRFAAAGFGAGVGIA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCHHHHH
LMTKGLFVPLVFGATLAALLVLYPACRSRAFFRSLGVAALVFAPFALIWPIALLLRSETL
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEE
FMTWFWDNNVGRFFGFSVPRLGAENDKPLFIWRALLTVGFPVGPLAAWALAAGRWRAWRV
EEEEEEECCCCHHHCCCCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHHCCCCEEEC
PAVGLPALFAGIGMIVLQMAATSRQLYILPFIAPLALLAADAVARLPTRVARGWDYLSRV
CCCCHHHHHHHHHHHHHHHHHCCCCEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
LFGAAAGLAWYIWSIVTSYHASREPIRWLGRWLPLDWTAPLELPFTLAAVALTIGWLWLL
HHHHHHHHHHHHHHHHHHHHCCCCHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHH
PSLRTTGKWRGALSWATGVLLVWGLIFTLLLPWLDAAKSYRSVFASLDRQLKPQWSEGDC
HHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCE
MASLGLGESEAPMLYYFTGIQHRPVANPADTACTWLIVQGLRAVTPEPGAEWQPFWSGAR
EEECCCCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC
PGDDKELLRVYVRTPQGGAKP
CCCHHHHHHHHHCCCCCCCCC
>Mature Secondary Structure
MQGPATPGGARSSAPAPFRHTKGSERPPAGKTSATTQAAMPLADHAWPRVQAAAFALTGL
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCHHCCCHHHHHHHHHHHHH
RVWLLVATVACAYLLPGILGHDPWKQDETYTFGIIQHMLDTGDLIVPTNAGQPFLEKPPL
HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHCCCCEEEECCCCCCCCCCCCC
YDWTAAALAKLFGRYLPLHDAARLASALFAALAFGFMARAARIASGARRWFDLDVIGPVA
CCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEEHHHHHHH
LCAGTLVVVKHVHDMMTDVALMAGTAIAFSSLLELVTARCGGRPAGRFAAAGFGAGVGIA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCHHHHH
LMTKGLFVPLVFGATLAALLVLYPACRSRAFFRSLGVAALVFAPFALIWPIALLLRSETL
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEE
FMTWFWDNNVGRFFGFSVPRLGAENDKPLFIWRALLTVGFPVGPLAAWALAAGRWRAWRV
EEEEEEECCCCHHHCCCCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHHCCCCEEEC
PAVGLPALFAGIGMIVLQMAATSRQLYILPFIAPLALLAADAVARLPTRVARGWDYLSRV
CCCCHHHHHHHHHHHHHHHHHCCCCEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
LFGAAAGLAWYIWSIVTSYHASREPIRWLGRWLPLDWTAPLELPFTLAAVALTIGWLWLL
HHHHHHHHHHHHHHHHHHHHCCCCHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHH
PSLRTTGKWRGALSWATGVLLVWGLIFTLLLPWLDAAKSYRSVFASLDRQLKPQWSEGDC
HHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCE
MASLGLGESEAPMLYYFTGIQHRPVANPADTACTWLIVQGLRAVTPEPGAEWQPFWSGAR
EEECCCCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC
PGDDKELLRVYVRTPQGGAKP
CCCHHHHHHHHHCCCCCCCCC
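
The state lines above can be summarized as per-state percentages. The sketch assumes the common three-state convention (H = helix, E = strand, C = coil), which the record does not define, and `state_percentages` is an illustrative name.

```python
from collections import Counter


def state_percentages(states: str) -> dict:
    """Percentage of positions in each predicted state (H, E, C).

    The input is the concatenation of the state lines only, not the
    residue lines; whitespace is ignored.
    """
    states = "".join(states.split()).upper()
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {s: round(100.0 * counts[s] / len(states), 1) for s in "HEC"}


# Toy 10-position prediction: 4 helix, 2 strand, 4 coil.
print(state_percentages("CCHHHHEECC"))  # {'H': 40.0, 'E': 20.0, 'C': 40.0}
```

To summarize this record, concatenate every second line of one secondary structure block (the lines made up of H, E and C) and pass the result to the function.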

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: NA