Definition Agrobacterium tumefaciens str. C58 chromosome circular, complete sequence.
Accession NC_003062
Length 2,841,580

The map label for this gene is wecG [H]

Identifier: 159185180

GI number: 159185180

Start: 2348039

End: 2348839

Strand: Reverse

Name: wecG [H]

Synonym: Atu2375

Alternate gene names: 159185180

Gene position: 2348839-2348039 (Counterclockwise)

Preceding gene: 159185181

Following gene: 159185179

Centisome position: 82.66
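
The coordinate fields above are mutually consistent, and the following minimal sketch shows one way to check them. It assumes that the centisome position is the gene's first transcribed base expressed as a percentage of the chromosome length; the record does not state this, but the assumption reproduces the listed value, whereas using the Start coordinate gives 82.63. The function names are illustrative and not part of any database API.

```python
GENOME_LENGTH = 2_841_580          # "Length" field of NC_003062
START, END = 2_348_039, 2_348_839  # "Start" and "End" fields
STRAND = "Reverse"                 # "Strand" field


def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive, 1-based coordinate range."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the chromosome length."""
    return 100.0 * position / genome_length


# Assumption: on the reverse strand the gene begins at the higher coordinate.
first_base = END if STRAND == "Reverse" else START
length = gene_length(START, END)
position = round(centisome(first_base, GENOME_LENGTH), 2)
print(length, position)
```

Under these assumptions the sketch gives the 801-base length of the gene sequence and the listed centisome position of 82.66.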

GC content: 60.67

Gene sequence:

>801_bases
ATGGCGGATGCGGAAGGGCCGGGCCAGGACTGGCCGGTTATTCCTCTTGCGGCACTGCCCATCACTGACGCGACCATCGA
TGAAACGGCGCGGGATTTCATCCTGCGTGCGACGACAGAGCGCGTGGCCGGTGCAAGACCGTTTTATTCGACCTCGGCGA
ATGGCCAGGTCATTGCGCTCTGCCATCACGACCGGGAATTCGACGCCATGCTGCGGCAGGCGGACCAGATTCACGCCGAT
GGCATGTCGCTGGTTATCTTCTCCCGCAAATTCTGCCGGCAGGCGCTGAGGGAACGGGTCGCGACGACGGACCTCGTTCA
TGCGGTGGCCAAGCGCGCCGAAGAAACGGGCAGTCGGTTTTATTTCCTCGGCGGTTCGGAAGAGGTGAACCGCGCGGCGG
TCGAAGAAATGCAGAGGCTTTATCCGCGTCTGGTATTTTCGGGACGGCGCAACGGTTATTTCAGCCGGGCCGAGGAAGAC
GCGGTCCTTGCCGATATCACTACTTCGAAAACGGATATTCTCTGGGTCGGCTTTGGTATTCCGCTGGAGCAGCGTTTCGT
TTCGCGCAATCTCGACAGGCTTTCCGGCATCGCCGTGATCAAGACCTGCGGCGGCCTTTTCGACTTCCTCGCCGGCCGGA
ACAGCCGCGCGCCGCAATGGATGCAGGATATGGGGCTGGAATGGCTCTACCGGGCGATGCTTGAGCCGAAGCGTCTGGGA
AAACGGTATCTGCTGACCAATCCGATCGCGATATATTCGCTGCTGAAGTATCGCTTCGGGGCGTCGGGTCAGGGCCGGTA
A
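
The length and GC content fields can be recomputed from a FASTA-style block like the one above. The sketch below is a hedged illustration: the helper names are hypothetical, and it is demonstrated on the first 12 bases of the gene only. If the reported 60.67 is a percentage of all 801 bases, it would correspond to 486 G or C bases.

```python
def read_fasta_sequence(text: str) -> str:
    """Join the sequence lines of one FASTA-style record, skipping the '>' header."""
    lines = [line.strip() for line in text.splitlines()]
    return "".join(line for line in lines if line and not line.startswith(">")).upper()


def gc_content(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Demonstration on the first 12 bases of the gene sequence only.
example = """>12_bases
ATGGCGGATGCG
"""
seq = read_fasta_sequence(example)
print(len(seq), round(gc_content(seq), 2))
```

Pasting the whole `>801_bases` block into `example` applies the same calculation to the full gene.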

Upstream 100 bases:

>100_bases
CTTCGTAGTGTTTGAAGCCTTATACCCTAAGCATAAACGTGAATATAACAATGAAACGATGCCTATCTTGGACACCGGAC
AGATCATGACAGGGAGAGAG

Downstream 100 bases:

>100_bases
AGCTGCGGCAAGCCTCTTCCGCGGCAAGCATATCGCATTGCGTCAATTACATCAGGTTTCCCTAACGTATTGAGCCAAAG
GGCTTTGCCGCTGAAATTCG

Product: UDP-hexose transferase

Products: C55-PP-GlcNAc-ManNAcA; UDP [C]

Alternate protein names: UDP-ManNAcA transferase [H]

Number of amino acids: Translated: 266; Mature: 265

Protein sequence:

>266_residues
MADAEGPGQDWPVIPLAALPITDATIDETARDFILRATTERVAGARPFYSTSANGQVIALCHHDREFDAMLRQADQIHAD
GMSLVIFSRKFCRQALRERVATTDLVHAVAKRAEETGSRFYFLGGSEEVNRAAVEEMQRLYPRLVFSGRRNGYFSRAEED
AVLADITTSKTDILWVGFGIPLEQRFVSRNLDRLSGIAVIKTCGGLFDFLAGRNSRAPQWMQDMGLEWLYRAMLEPKRLG
KRYLLTNPIAIYSLLKYRFGASGQGR
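
The protein sequence follows from the gene sequence by codon-wise translation: 801 bases make 267 codons, and dropping the terminal stop codon leaves 266 residues. The sketch below assumes the standard genetic code and a sequence already given in the coding direction, as it is in this record. It is an illustration, not the tool that produced the annotation.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (an assumption;
# the record does not name the translation table that was used).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCGGATGCGGAAGGGCCG"))  # first 7 codons of the gene: MADAEGP
print(translate("GGCCGGTAA"))              # last 3 codons, ending in the stop: GR
```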

Sequences:

>Translated_266_residues
MADAEGPGQDWPVIPLAALPITDATIDETARDFILRATTERVAGARPFYSTSANGQVIALCHHDREFDAMLRQADQIHAD
GMSLVIFSRKFCRQALRERVATTDLVHAVAKRAEETGSRFYFLGGSEEVNRAAVEEMQRLYPRLVFSGRRNGYFSRAEED
AVLADITTSKTDILWVGFGIPLEQRFVSRNLDRLSGIAVIKTCGGLFDFLAGRNSRAPQWMQDMGLEWLYRAMLEPKRLG
KRYLLTNPIAIYSLLKYRFGASGQGR
>Mature_265_residues
ADAEGPGQDWPVIPLAALPITDATIDETARDFILRATTERVAGARPFYSTSANGQVIALCHHDREFDAMLRQADQIHADG
MSLVIFSRKFCRQALRERVATTDLVHAVAKRAEETGSRFYFLGGSEEVNRAAVEEMQRLYPRLVFSGRRNGYFSRAEEDA
VLADITTSKTDILWVGFGIPLEQRFVSRNLDRLSGIAVIKTCGGLFDFLAGRNSRAPQWMQDMGLEWLYRAMLEPKRLGK
RYLLTNPIAIYSLLKYRFGASGQGR

Specific function: Catalyzes the synthesis of Und-PP-GlcNAc-ManNAcA (Lipid II), the second lipid-linked intermediate involved in ECA synthesis [H]

COG id: COG1922

COG function: function code M; Teichoic acid biosynthesis proteins

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the glycosyltransferase 26 family [H]

Homologues:

Organism=Escherichia coli, GI2367289, Length=186, Percent_Identity=31.1827956989247, Blast_Score=87, Evalue=1e-18,
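
A homologue line in this comma-separated form can be split into fields as sketched below. The parser is hypothetical and handles only the layout seen here, including the bare `GI…` token and the trailing comma. Reading `Length` as the alignment length is an assumption; under it, the percent identity corresponds to 58 identical positions out of 186.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'key=value, ...' homologue line into a dictionary."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:
            continue  # tolerate the trailing comma
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare token such as GI2367289
    for key in ("Length", "Blast_Score"):
        if key in record:
            record[key] = int(record[key])
    for key in ("Percent_Identity", "Evalue"):
        if key in record:
            record[key] = float(record[key])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI2367289, Length=186, "
    "Percent_Identity=31.1827956989247, Blast_Score=87, Evalue=1e-18,"
)
# Assumption: Length is the alignment length used for the identity percentage.
identical = round(hit["Percent_Identity"] * hit["Length"] / 100)
print(hit["Organism"], hit["GI"], identical)
```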

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR023085
- InterPro:   IPR004629 [H]

Pfam domain/function: PF03808 Glyco_tran_WecB [H]

EC number: 2.4.1.- [C]

Molecular weight: Translated: 29900; Mature: 29768
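
The two molecular weights differ by 132, which is close to the mass of one methionine residue (about 131.2 Da) and consistent with the mature form lacking the initiator Met. The sketch below estimates a molecular weight from average residue masses plus one water. The mass table and the function name are assumptions; the record does not state how its values were computed, so small differences from the listed figures are to be expected.

```python
# Average residue masses in daltons (assumed values, amino acid minus water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water for the free N- and C-termini


def molecular_weight(protein: str) -> float:
    """Estimated average molecular weight of an unmodified peptide chain."""
    return sum(RESIDUE_MASS[aa] for aa in protein) + WATER


# Small demonstration: the dipeptide Gly-Gly.
print(round(molecular_weight("GG"), 2))
```

Calling `molecular_weight` on the translated and mature sequences gives estimates that can be compared with the listed 29900 and 29768.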

Theoretical pI: Translated: 8.79; Mature: 8.79

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.1 %Cys     (Translated Protein)
2.6 %Met     (Translated Protein)
3.8 %Cys+Met (Translated Protein)
1.1 %Cys     (Mature Protein)
2.3 %Met     (Mature Protein)
3.4 %Cys+Met (Mature Protein)
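
These percentages follow directly from residue counts in the two sequences: 3 Cys and 7 Met in the 266-residue translated protein, and 3 Cys and 6 Met in the 265-residue mature protein. The sketch below recomputes them, taking the mature form to be the translated sequence without its first residue, which matches the sequences listed in this record. The helper name is illustrative.

```python
TRANSLATED = (
    "MADAEGPGQDWPVIPLAALPITDATIDETARDFILRATTERVAGARPFYSTSANGQVIALCHHDREFDAMLRQADQIHAD"
    "GMSLVIFSRKFCRQALRERVATTDLVHAVAKRAEETGSRFYFLGGSEEVNRAAVEEMQRLYPRLVFSGRRNGYFSRAEED"
    "AVLADITTSKTDILWVGFGIPLEQRFVSRNLDRLSGIAVIKTCGGLFDFLAGRNSRAPQWMQDMGLEWLYRAMLEPKRLG"
    "KRYLLTNPIAIYSLLKYRFGASGQGR"
)
MATURE = TRANSLATED[1:]  # mature form: initiator Met removed


def percent(seq: str, residues: str) -> float:
    """Percentage of seq made up of the given one-letter residues, to 1 decimal."""
    count = sum(seq.count(residue) for residue in residues)
    return round(100.0 * count / len(seq), 1)


for label, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(label, len(seq), percent(seq, "C"), percent(seq, "M"), percent(seq, "CM"))
```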

Secondary structure:

>Translated Secondary Structure
MADAEGPGQDWPVIPLAALPITDATIDETARDFILRATTERVAGARPFYSTSANGQVIAL
CCCCCCCCCCCCCCEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEE
CHHDREFDAMLRQADQIHADGMSLVIFSRKFCRQALRERVATTDLVHAVAKRAEETGSRF
EECCCHHHHHHHHHHHHHCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEE
YFLGGSEEVNRAAVEEMQRLYPRLVFSGRRNGYFSRAEEDAVLADITTSKTDILWVGFGI
EEECCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEECCCCCEEEEEECCC
PLEQRFVSRNLDRLSGIAVIKTCGGLFDFLAGRNSRAPQWMQDMGLEWLYRAMLEPKRLG
CHHHHHHHHCHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHCHHHHHHHHCCHHHCC
KRYLLTNPIAIYSLLKYRFGASGQGR
CEEECCCHHHHHHHHHHHCCCCCCCC
>Mature Secondary Structure 
ADAEGPGQDWPVIPLAALPITDATIDETARDFILRATTERVAGARPFYSTSANGQVIAL
CCCCCCCCCCCCCEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEE
CHHDREFDAMLRQADQIHADGMSLVIFSRKFCRQALRERVATTDLVHAVAKRAEETGSRF
EECCCHHHHHHHHHHHHHCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEE
YFLGGSEEVNRAAVEEMQRLYPRLVFSGRRNGYFSRAEEDAVLADITTSKTDILWVGFGI
EEECCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEECCCCCEEEEEECCC
PLEQRFVSRNLDRLSGIAVIKTCGGLFDFLAGRNSRAPQWMQDMGLEWLYRAMLEPKRLG
CHHHHHHHHCHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHCHHHHHHHHCCHHHCC
KRYLLTNPIAIYSLLKYRFGASGQGR
CEEECCCHHHHHHHHHHHCCCCCCCC
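
Each prediction line above pairs with the sequence line before it, one state letter per residue. A rough summary of such a string can be computed as sketched below. The sketch assumes the conventional three-state reading (H = helix, E = strand, C = coil), which the record does not define, and the function name is hypothetical. It is demonstrated on a short made-up string rather than on the prediction itself.

```python
from collections import Counter

# Assumed meaning of the state letters; the record does not define them.
STATE_NAMES = {"H": "helix", "E": "strand", "C": "coil"}


def state_composition(states: str) -> dict:
    """Percentage of residues in each predicted state, to 1 decimal place."""
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    unknown = set(counts) - set(STATE_NAMES)
    if unknown:
        raise ValueError(f"unexpected state letters: {sorted(unknown)}")
    return {
        name: round(100.0 * counts[code] / len(states), 1)
        for code, name in STATE_NAMES.items()
    }


# Made-up 8-residue example, not taken from the record.
print(state_composition("CCHHHHEE"))
```

Concatenating the state lines of one block, without the sequence lines, gives the input for the full protein.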

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: UDP-N-acetyl-D-mannosaminuronic-acid; undecaprenyl-N-acetyl-alpha-D-glucosaminyl-pyrophosphate [C]

Specific reaction: UDP-N-acetyl-D-mannosaminuronic-acid + undecaprenyl-N-acetyl-alpha-D-glucosaminyl-pyrophosphate = C55-PP-GlcNAc-ManNAcA + UDP [C]

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 14528314 [H]