Definition: Gluconacetobacter diazotrophicus PAl 5 chromosome, complete genome.
Accession: NC_010125
Length: 3,944,163

The map label for this gene is epsH [H]

Identifier: 162146742

GI number: 162146742

Start: 933346

End: 936207

Strand: Reverse

Name: epsH [H]

Synonym: GDI_0920

Alternate gene names: 162146742

Gene position: 936207-933346 (Counterclockwise)

Preceding gene: 162146746

Following gene: 162146740

Centisome position: 23.74
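
The coordinate fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and the centisome formula (100 x position / genome length, evaluated at the first base of the gene, which for this reverse-strand gene is the End coordinate) is an assumption that happens to reproduce the listed value, not a documented definition.

```python
GENOME_LENGTH = 3944163          # "Length" field of NC_010125
START, END = 933346, 936207      # "Start" and "End" fields of this record


def gene_length(start: int, end: int) -> int:
    """Inclusive length of a feature, in bases."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length (assumed formula)."""
    return 100.0 * position / genome_length


print(gene_length(START, END))                    # 2862
print(round(centisome(END, GENOME_LENGTH), 2))    # 23.74
```

The computed length agrees with the 2862-base gene sequence listed in this record.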

GC content: 71.38

Gene sequence:

>2862_bases
ATGACGCGTGACCTCCTCCTGGCAGAATGGACGACCGACATCCCGATGGACGCTGACGAACGGCTTTCGCCCTGCCCGCT
GCCCGAAGGGCTTCGCCTGATGGGCCGCGACGCGCTCGATCCCTATTTCCTGGTGCCGCCCGAACTGGACCCCGATCCCG
CCATCAACGCGGTGCTGCCGTTCCTGGGCTGGATCGTCACGCTGGCCCGGCCCCGCCGCATCGGCCTGATGCCGGCCCGG
AAGGCGATCGCGGCCCTGATGGCGGACGTGGCGCACCGGATGCGCCTGCCGGCGGACATCCGCGCCCTGCCGTTCCAGCC
GCCGCCTGCCGGATTCGACCTGCTATGGCTGGATCTTCCGCCGGCCAGCACGCCGGATGCCGCCCTGGCACCGACGCCCG
ACCAGATGCTCCAGCTGCTGGGCCAGGGGGGCATTGTCGTGCTGCACGGCCTGGATAGCGGCGGCTGGGACGACCTTTCC
ATGGCGACCCTGAATCTGGGGCGCGGACTGGGCGTGCTGGTAGGGGGCGCGTGTCGCGGGGGCTCCGTGGCCAGCCTGTG
CGCGATGCTGAATCGTTCCGACGACGGCACGGCAGCCAACCTGGCCGCCCGCTTCGCCGCGATCGGCGCGCATTGGGCCG
CCCGCCGCGCCCTGGCCGACACGCAGGCGGAACTGGACCGGACCCGGCAGGCGCTCAGTCATCTGCGGCTGGATGCCATG
GAGATGCGGCTGGCCCTGAACCATCAGGATGCCGCCGGGCAGGATGCGAACCGGCAGGGGTCGTCCCCCCCGGCCGCGCC
GCCCGTCCCCGTCACGCCGGCCGCACCGCCGCCGAAGGGACCATCCCGGTGGCGTCGCCTTGCCCGCAGGCTGATCCGGG
GGCCTGCTACCCCTGCCCCGGCAAGCGCCGACCGCACGATCCGCACGGTCCTGTTCGTATCCGGCGAACCGGGCACCCCC
GGCACGACCTACCGCGTCACGCGCAACGCCGCCGCCTGCGCCGCCGCCGGATACGCGACCCGGTGCAGGGACTGCGCGGC
GGTCGGGCCGGACGACATCGCATGGGCCGACATGGTCGTGCTGTGGCGCGTGGAATATAGCGGCCATGTCGACACCTTGC
TGGGCCTGGCCCGGGCGCGCGGCGCCGTGCTGGCCTTCGATGCCGATGACATCGTGTTCGAACCCGCCCTGGCGCGCACC
GACCTGATCGACGGAATCCGCGTCTGTCCGGCCCCCGTGGCGCGGATCGAACGGATGTATGCCGACATGCAGCGCACCAT
GCGCCAGTGCGACCTCGGCCTGGCCACCACCGATACGCTGGCCGACTGGATGCGCCCCTTCCTGAAGCTGACGCTGGTGC
TGCCGAACACCTTCGATGACGCGACGCTGCAGCGTGCACGCCACGCCGTCCGCCGGCGGGCGCTGGCCGCGCCCGACGCG
GCGGATGACGTCGTGCGGATAGGCTATGCCACCGGATCGCGCACCCACCAGCGCGACTTCGCCCGTGCCCTGCCCGGCCT
GCTGCGGGTCATGGACCGACGGGCGCAGGTGCGCCTGGTCCTGTTCCGCGAACCCGGCGGAGGGCGCCCCCTGCTGCTGA
TCGAGGAATTTCCCGACCTGCACGCGCGGTCGGCGCAGATCGAATGGCGCGACATGGTGACGCTGGACGCGCTGCCGGAC
GAACTGGCGCGGCTGGACATCTCGATTGCCCCGTTGGAGGACGGCAATCCGTTCTGCGAGGCCAAGAGCGAACTGAAATT
CTTCGAGGCCGCGCTGGCCGGCGTCTGTACCGTCGCCTCGCCCACCGGGCCGTTTCGCGCCGCCATCCGGCCGGGCGTGA
CCGGCCTGCTGGCGGACGGTGCGGCGGAATGGGAAAGCGCGCTGCTGCGTCTGGTGGACGACCCCGCCCTGCGCCGCCGC
ATGGCGCGCGACGTGCTGCACACGGTGCTGTGGGAATACGGGCCCCAGCGACAGGCCGCCCTGCTGGGGCCGGCCATCGC
CGGGCTGGGCGATGCGCGGGCAGCGGCACGGGTTGGCGCCACCGTCCTGGCACGCGGCGCCTTCCGCGTCCGCGCCATTC
CCCGGATCCCCGACAGCACGGTCCTGTTCACCCAGGACCATCTGCAGGACGCCGCCGTCACGGTCGTCGTGACCGCGTAT
AATTATGCCGGCCACGTCATCGAGGCCCTGGACTCCGTCCGCCGCCAGACGCTCGACCCGCTGGACCTGATCGTGGTCGA
TGATGCCTCGACCGACGATACTCCGTCGCTGCTGACGGGCTGGGCGGCCCGGCATGGCGCACGGTTCAACCGGCTGCTGA
TCCTGCGCGCCCGGCGCAATGCCGGGCTGGGCGGCGCGCGCAATATCGGCATGGCGGCGGCCGAAACCCCCTATGTCCTG
CAACTGGACGCCGACAACCGCCTGCTGCCCGATGCCTGCGCCCGCCTGCTGGCCGCCATCGCGGCGGAAAGAGCGGGCTA
TGCCTATCCCCTGATCCGCCAGTTCGGGCGCGAGGCCAGCGTGATGGGCGATACCCCGTTCCATCCCGGGCGACTGGTCG
GCGGCAATACCATCGACGCCATGGCGCTGGTGGCCAAATGGGCTTGGGCCGCCGCCGGCGGCTATTACGTGCGGCGCGAC
GCCATGGGGTGGGAGGATTACGACCTGTGGTGCACCCTGGCAGAACTGGGCATCGCCGGTACCCAGGTGCCCGAAATCCT
GGCCGAATACCGCGTGCATGACACGGCCATGACCGACACGCTGACCGAACGGCCGCACCACAAGGACGCGGTAGTCACGC
TGCTGCGAGACCGCCATCCCTGGATTCGCCTGACGGCCCCCGAGGCACGTGCGCGTTCATGA
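
A GC content figure such as the one listed above can be recomputed from a nucleotide sequence. The following is a minimal sketch, assuming the field is the percentage of G and C bases over the full gene sequence; the function name is hypothetical and the record's own value is not re-derived here. The example call uses only the first 12 bases of the gene.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (whitespace ignored)."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 12 bases of the gene sequence: 7 of 12 are G or C.
print(round(gc_content("ATGACGCGTGAC"), 2))  # 58.33
```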

Upstream 100 bases:

>100_bases
CTCACCCGCTGCCCGGCTGCCCGTCCGGCCGGAACGACAGCGCCATGAAAGCGGGGAGATCTTCCCAAAATATTAATATC
CCCGCTGCCAGGATCACAAA

Downstream 100 bases:

>100_bases
GCAGGCCGACGGGCGGGATCAGGGGACGTTTCTTTCCAACTCTTCCAGCCATAGCCTGGCATTGCCGTCGGAGGGCGCGC
GCCAGTCGCCACGCGGCGAG

Product: glycosyl transferase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 953; Mature: 952
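
The two residue counts follow from the gene length. The sketch below assumes that the final codon is a stop codon (the gene sequence ends in TGA) and that the mature form differs from the translated form only by loss of the initiator methionine, as the listed sequences suggest. The function name is hypothetical.

```python
from typing import Tuple


def residue_counts(n_bases: int) -> Tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = n_bases // 3
    translated = codons - 1    # the last codon is the stop codon
    mature = translated - 1    # assumed removal of the initiator Met
    return translated, mature


print(residue_counts(2862))  # (953, 952)
```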

Protein sequence:

>953_residues
MTRDLLLAEWTTDIPMDADERLSPCPLPEGLRLMGRDALDPYFLVPPELDPDPAINAVLPFLGWIVTLARPRRIGLMPAR
KAIAALMADVAHRMRLPADIRALPFQPPPAGFDLLWLDLPPASTPDAALAPTPDQMLQLLGQGGIVVLHGLDSGGWDDLS
MATLNLGRGLGVLVGGACRGGSVASLCAMLNRSDDGTAANLAARFAAIGAHWAARRALADTQAELDRTRQALSHLRLDAM
EMRLALNHQDAAGQDANRQGSSPPAAPPVPVTPAAPPPKGPSRWRRLARRLIRGPATPAPASADRTIRTVLFVSGEPGTP
GTTYRVTRNAAACAAAGYATRCRDCAAVGPDDIAWADMVVLWRVEYSGHVDTLLGLARARGAVLAFDADDIVFEPALART
DLIDGIRVCPAPVARIERMYADMQRTMRQCDLGLATTDTLADWMRPFLKLTLVLPNTFDDATLQRARHAVRRRALAAPDA
ADDVVRIGYATGSRTHQRDFARALPGLLRVMDRRAQVRLVLFREPGGGRPLLLIEEFPDLHARSAQIEWRDMVTLDALPD
ELARLDISIAPLEDGNPFCEAKSELKFFEAALAGVCTVASPTGPFRAAIRPGVTGLLADGAAEWESALLRLVDDPALRRR
MARDVLHTVLWEYGPQRQAALLGPAIAGLGDARAAARVGATVLARGAFRVRAIPRIPDSTVLFTQDHLQDAAVTVVVTAY
NYAGHVIEALDSVRRQTLDPLDLIVVDDASTDDTPSLLTGWAARHGARFNRLLILRARRNAGLGGARNIGMAAAETPYVL
QLDADNRLLPDACARLLAAIAAERAGYAYPLIRQFGREASVMGDTPFHPGRLVGGNTIDAMALVAKWAWAAAGGYYVRRD
AMGWEDYDLWCTLAELGIAGTQVPEILAEYRVHDTAMTDTLTERPHHKDAVVTLLRDRHPWIRLTAPEARARS

Sequences:

>Mature_952_residues
TRDLLLAEWTTDIPMDADERLSPCPLPEGLRLMGRDALDPYFLVPPELDPDPAINAVLPFLGWIVTLARPRRIGLMPARK
AIAALMADVAHRMRLPADIRALPFQPPPAGFDLLWLDLPPASTPDAALAPTPDQMLQLLGQGGIVVLHGLDSGGWDDLSM
ATLNLGRGLGVLVGGACRGGSVASLCAMLNRSDDGTAANLAARFAAIGAHWAARRALADTQAELDRTRQALSHLRLDAME
MRLALNHQDAAGQDANRQGSSPPAAPPVPVTPAAPPPKGPSRWRRLARRLIRGPATPAPASADRTIRTVLFVSGEPGTPG
TTYRVTRNAAACAAAGYATRCRDCAAVGPDDIAWADMVVLWRVEYSGHVDTLLGLARARGAVLAFDADDIVFEPALARTD
LIDGIRVCPAPVARIERMYADMQRTMRQCDLGLATTDTLADWMRPFLKLTLVLPNTFDDATLQRARHAVRRRALAAPDAA
DDVVRIGYATGSRTHQRDFARALPGLLRVMDRRAQVRLVLFREPGGGRPLLLIEEFPDLHARSAQIEWRDMVTLDALPDE
LARLDISIAPLEDGNPFCEAKSELKFFEAALAGVCTVASPTGPFRAAIRPGVTGLLADGAAEWESALLRLVDDPALRRRM
ARDVLHTVLWEYGPQRQAALLGPAIAGLGDARAAARVGATVLARGAFRVRAIPRIPDSTVLFTQDHLQDAAVTVVVTAYN
YAGHVIEALDSVRRQTLDPLDLIVVDDASTDDTPSLLTGWAARHGARFNRLLILRARRNAGLGGARNIGMAAAETPYVLQ
LDADNRLLPDACARLLAAIAAERAGYAYPLIRQFGREASVMGDTPFHPGRLVGGNTIDAMALVAKWAWAAAGGYYVRRDA
MGWEDYDLWCTLAELGIAGTQVPEILAEYRVHDTAMTDTLTERPHHKDAVVTLLRDRHPWIRLTAPEARARS

Specific function: May be involved in the production of the exopolysaccharide (EPS) component of the extracellular matrix during biofilm formation. EPS is responsible for the adhesion of chains of cells into bundles. Required for biofilm maintenance [H]

COG id: COG0463

COG function: function code M; Glycosyltransferases involved in cell wall biogenesis

Gene ontology:

Cell location: Integral Membrane Protein [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the glycosyltransferase 2 family [H]

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001173 [H]

Pfam domain/function: PF00535 Glycos_transf_2 [H]

EC number: NA

Molecular weight: Translated: 103128; Mature: 102996

Theoretical pI: Translated: 7.00; Mature: 7.00

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.3 %Cys     (Translated Protein)
2.5 %Met     (Translated Protein)
3.8 %Cys+Met (Translated Protein)
1.3 %Cys     (Mature Protein)
2.4 %Met     (Mature Protein)
3.7 %Cys+Met (Mature Protein)
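
Percentages like those above can be obtained by counting residues. This is a minimal sketch, assuming each value is the residue count divided by the sequence length and rounded to one decimal place; the function name is hypothetical, and the example uses only the first ten residues of the translated protein, not the full sequence.

```python
def cys_met_content(protein: str) -> dict:
    """Percent Cys, Met, and Cys+Met in a one-letter protein sequence."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    cys = 100.0 * seq.count("C") / len(seq)
    met = 100.0 * seq.count("M") / len(seq)
    return {
        "Cys": round(cys, 1),
        "Met": round(met, 1),
        "Cys+Met": round(cys + met, 1),
    }


# First ten residues of the translated protein: one Met, no Cys.
print(cys_met_content("MTRDLLLAEW"))
# {'Cys': 0.0, 'Met': 10.0, 'Cys+Met': 10.0}
```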

Secondary structure:

>Translated Secondary Structure
MTRDLLLAEWTTDIPMDADERLSPCPLPEGLRLMGRDALDPYFLVPPELDPDPAINAVLP
CCCCEEEEECCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCEEEECCCCCCCHHHHHHHH
FLGWIVTLARPRRIGLMPARKAIAALMADVAHRMRLPADIRALPFQPPPAGFDLLWLDLP
HHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCEECCCCCCCCCCEEEEEECC
PASTPDAALAPTPDQMLQLLGQGGIVVLHGLDSGGWDDLSMATLNLGRGLGVLVGGACRG
CCCCCCCCCCCCHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHCCCCEEEEECCCCC
GSVASLCAMLNRSDDGTAANLAARFAAIGAHWAARRALADTQAELDRTRQALSHLRLDAM
CHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
EMRLALNHQDAAGQDANRQGSSPPAAPPVPVTPAAPPPKGPSRWRRLARRLIRGPATPAP
HHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCC
ASADRTIRTVLFVSGEPGTPGTTYRVTRNAAACAAAGYATRCRDCAAVGPDDIAWADMVV
CCCCCEEEEEEEEECCCCCCCCEEEEECCCHHHHHHCHHHHHHHHHCCCCCCHHHHCEEE
LWRVEYSGHVDTLLGLARARGAVLAFDADDIVFEPALARTDLIDGIRVCPAPVARIERMY
EEEEECCCCHHHHHHHHHHCCCEEEECCCCCEECCHHHHHHHHCCCCCCCHHHHHHHHHH
ADMQRTMRQCDLGLATTDTLADWMRPFLKLTLVLPNTFDDATLQRARHAVRRRALAAPDA
HHHHHHHHHHCCCCCCHHHHHHHHHHHHEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCC
ADDVVRIGYATGSRTHQRDFARALPGLLRVMDRRAQVRLVLFREPGGGRPLLLIEEFPDL
CCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCCCEEEEEECCCCC
HARSAQIEWRDMVTLDALPDELARLDISIAPLEDGNPFCEAKSELKFFEAALAGVCTVAS
CCCCCCEEHHHEEEECCCHHHHHHCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHEECC
PTGPFRAAIRPGVTGLLADGAAEWESALLRLVDDPALRRRMARDVLHTVLWEYGPQRQAA
CCCCCHHHHCCCCCEEHHCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCCCHHH
LLGPAIAGLGDARAAARVGATVLARGAFRVRAIPRIPDSTVLFTQDHLQDAAVTVVVTAY
HHHHHHHCCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCEEEEEHHHCCCCEEEEEEEEH
NYAGHVIEALDSVRRQTLDPLDLIVVDDASTDDTPSLLTGWAARHGARFNRLLILRARRN
HHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCHHHHHHHHHHCCCCCCEEEEEEECCC
AGLGGARNIGMAAAETPYVLQLDADNRLLPDACARLLAAIAAERAGYAYPLIRQFGREAS
CCCCCCCCCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCHHHHHCCCCCC
VMGDTPFHPGRLVGGNTIDAMALVAKWAWAAAGGYYVRRDAMGWEDYDLWCTLAELGIAG
CCCCCCCCCCEEECCCHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHCCCC
TQVPEILAEYRVHDTAMTDTLTERPHHKDAVVTLLRDRHPWIRLTAPEARARS
CHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHCCCCEEEEECCHHCCCC
>Mature Secondary Structure
TRDLLLAEWTTDIPMDADERLSPCPLPEGLRLMGRDALDPYFLVPPELDPDPAINAVLP
CCCEEEEECCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCEEEECCCCCCCHHHHHHHH
FLGWIVTLARPRRIGLMPARKAIAALMADVAHRMRLPADIRALPFQPPPAGFDLLWLDLP
HHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCEECCCCCCCCCCEEEEEECC
PASTPDAALAPTPDQMLQLLGQGGIVVLHGLDSGGWDDLSMATLNLGRGLGVLVGGACRG
CCCCCCCCCCCCHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHCCCCEEEEECCCCC
GSVASLCAMLNRSDDGTAANLAARFAAIGAHWAARRALADTQAELDRTRQALSHLRLDAM
CHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
EMRLALNHQDAAGQDANRQGSSPPAAPPVPVTPAAPPPKGPSRWRRLARRLIRGPATPAP
HHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCC
ASADRTIRTVLFVSGEPGTPGTTYRVTRNAAACAAAGYATRCRDCAAVGPDDIAWADMVV
CCCCCEEEEEEEEECCCCCCCCEEEEECCCHHHHHHCHHHHHHHHHCCCCCCHHHHCEEE
LWRVEYSGHVDTLLGLARARGAVLAFDADDIVFEPALARTDLIDGIRVCPAPVARIERMY
EEEEECCCCHHHHHHHHHHCCCEEEECCCCCEECCHHHHHHHHCCCCCCCHHHHHHHHHH
ADMQRTMRQCDLGLATTDTLADWMRPFLKLTLVLPNTFDDATLQRARHAVRRRALAAPDA
HHHHHHHHHHCCCCCCHHHHHHHHHHHHEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCC
ADDVVRIGYATGSRTHQRDFARALPGLLRVMDRRAQVRLVLFREPGGGRPLLLIEEFPDL
CCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCCCEEEEEECCCCC
HARSAQIEWRDMVTLDALPDELARLDISIAPLEDGNPFCEAKSELKFFEAALAGVCTVAS
CCCCCCEEHHHEEEECCCHHHHHHCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHEECC
PTGPFRAAIRPGVTGLLADGAAEWESALLRLVDDPALRRRMARDVLHTVLWEYGPQRQAA
CCCCCHHHHCCCCCEEHHCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCCCHHH
LLGPAIAGLGDARAAARVGATVLARGAFRVRAIPRIPDSTVLFTQDHLQDAAVTVVVTAY
HHHHHHHCCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCEEEEEHHHCCCCEEEEEEEEH
NYAGHVIEALDSVRRQTLDPLDLIVVDDASTDDTPSLLTGWAARHGARFNRLLILRARRN
HHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCHHHHHHHHHHCCCCCCEEEEEEECCC
AGLGGARNIGMAAAETPYVLQLDADNRLLPDACARLLAAIAAERAGYAYPLIRQFGREAS
CCCCCCCCCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCHHHHHCCCCCC
VMGDTPFHPGRLVGGNTIDAMALVAKWAWAAAGGYYVRRDAMGWEDYDLWCTLAELGIAG
CCCCCCCCCCEEECCCHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHCCCC
TQVPEILAEYRVHDTAMTDTLTERPHHKDAVVTLLRDRHPWIRLTAPEARARS
CHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHCCCCEEEEECCHHCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 8969506; 9384377 [H]