Definition: Francisella tularensis subsp. holarctica LVS chromosome, complete genome.
Accession: NC_007880
Length: 1,895,994 bp

The map label for this gene is epsH [H]

Identifier: 89256716

GI number: 89256716

Start: 1353205

End: 1354212

Strand: Reverse

Name: epsH [H]

Synonym: FTL_1423

Alternate gene names: 89256716

Gene position: 1354212-1353205 (Counterclockwise)

Preceding gene: 89256717

Following gene: 89256715

Centisome position: 71.42
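
The coordinate fields above are related by simple arithmetic, sketched below. The sketch assumes 1-based inclusive coordinates and assumes that the centisome position is the first base of the gene on its own strand (the End coordinate for a reverse-strand gene) expressed as a percentage of the chromosome length. That convention is not stated in this record; it is used here only because it reproduces the listed values. The function names are illustrative.

```python
# Values copied from this record.
GENOME_LENGTH = 1_895_994
START, END = 1_353_205, 1_354_212
STRAND = "Reverse"


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, end: int, strand: str, genome_length: int) -> float:
    """Position of the gene's first base as a percentage of the chromosome.

    Assumption: for a reverse-strand gene the first base is the End coordinate.
    """
    first_base = end if strand == "Reverse" else start
    return 100.0 * first_base / genome_length


length_nt = gene_length(START, END)
# One codon per residue plus a terminal stop codon.
protein_length = length_nt // 3 - 1
position = centisome(START, END, STRAND, GENOME_LENGTH)

print(length_nt, protein_length, f"{position:.2f}")  # prints: 1008 335 71.42
```

Under these assumptions the 1008-base gene corresponds to 336 codons, that is 335 residues plus the stop codon, in agreement with the sequence and amino acid count given in this record.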

GC content (%): 27.78

Gene sequence:

>1008_bases
ATGTACAATCTTAATTATAAGCAGCTAATATCTATAATCATACCAATATACAATACTCAACAATATCTTAGTAGATGTTT
AGAATCTGTTATTAATCAAACATATAAAAATTTAGAAATTATACTTATAAATGATGGGTCAACTGATAATAGTCTATCAA
TTTGTCAAAAATATAAATCTAAAGATAGTCGGATCGTTTTGCTAAATCAGCAAAACTCTGGGCAGGCATTAGCTAGAAAT
AATGCTTTAGATATAGCAAAAGGTGATTATATAGCATTTATAGATAGTGATGATTGGGTTAGTTTAGACTATATTCAAGC
ATTATATAATCATGTTTTTAGCTATTCAGCAGATATCGCTATATCAGCAATGGTTGGCTGTAATAAGCAAATAAAAGCTG
CTGAAATTATTCCTAGTAATATAAACATCTTTGATAACAATGACGCTATAATTAAGGCTTTTCTATCAAAACAGTTATCA
TCAATGGCATGCGGAAGTTTAATTAAGCGTAAACTTCTAGACAAACAACGATTTAGAAATTTTATAGCATATGAAGATTT
AGATTTTTTTTACAAAATTTACTCTCAGGCTCAAATCATAGTCAAAGACAATAATGTGCGTTATTTTTATTATCAGCGTG
ATGATGGTATTATGGGGGCTAATAGACTGAATTTCTCTTTACAGCATTTGCAAGCATTGAAGAGTGTTACAACATATTAT
GAAAGATTTTTTATTGAAAAATATCCTCAGCTTGGTAGTTTAATTTATATGAATATTTTAAGACATCTTGTAGATAATTT
TTCAGAAGCAGCGGTGCGAAAAAATGTTTTAGCTAAAGATGTTTTATTACTTTATAAAACTTTATACCTAAATTGTATAA
GTAGGGGATTTAAGCTTAGCTTACCATATAAGTTATTTTTCTACTTTCCTAATCTGGTTGCAAAATGTTATTTGAAAGCG
AAAAAGATTAAACTCTCACTCAAAGAAAAAAGAATTACTAAAAGTTAA
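
The GC content field can be recomputed from a gene sequence such as the one above. The following is a minimal sketch, assuming the value is the share of G and C among all A, C, G and T bases; `gc_content` is an illustrative helper, not part of any tool named in this record. By arithmetic, the listed 27.78 for a 1008-base gene corresponds to 280 G or C bases.

```python
def gc_content(seq: str) -> float:
    """Return the G+C percentage of a nucleotide sequence.

    Case-insensitive; line breaks and any non-ACGT characters are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Toy input, not taken from the record: 2 of 4 bases are G or C.
print(f"{gc_content('ATGC'):.2f}")  # prints: 50.00

# Consistency check on the listed value: 280 of 1008 bases.
print(f"{100.0 * 280 / 1008:.2f}")  # prints: 27.78
```

To check the record, paste the FASTA body (without the `>` header line) into `gc_content`; wrapped lines can be passed as one string because newlines are skipped.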

Upstream 100 bases:

>100_bases
ACTAAAAGCATCGAAACTTTTCGCTATAAAATAAAAAGGAATTTTATAAAATATTTTATGTTTTTCTTCTTATTTAAAGA
TTAATACTCTTAAGGTAAAT

Downstream 100 bases:

>100_bases
TTATGTCAAAATTATTAATAGATACGCGCTGGCAAGGCAAACATGGAATAGGTAGATATGCTTGTGAAATAATTAAATAT
TTACCACAGAATTTTATTAC

Product: glycosyl transferase family protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 335; Mature: 335

Protein sequence:

>335_residues
MYNLNYKQLISIIIPIYNTQQYLSRCLESVINQTYKNLEIILINDGSTDNSLSICQKYKSKDSRIVLLNQQNSGQALARN
NALDIAKGDYIAFIDSDDWVSLDYIQALYNHVFSYSADIAISAMVGCNKQIKAAEIIPSNINIFDNNDAIIKAFLSKQLS
SMACGSLIKRKLLDKQRFRNFIAYEDLDFFYKIYSQAQIIVKDNNVRYFYYQRDDGIMGANRLNFSLQHLQALKSVTTYY
ERFFIEKYPQLGSLIYMNILRHLVDNFSEAAVRKNVLAKDVLLLYKTLYLNCISRGFKLSLPYKLFFYFPNLVAKCYLKA
KKIKLSLKEKRITKS

Sequences:

>Translated_335_residues
MYNLNYKQLISIIIPIYNTQQYLSRCLESVINQTYKNLEIILINDGSTDNSLSICQKYKSKDSRIVLLNQQNSGQALARN
NALDIAKGDYIAFIDSDDWVSLDYIQALYNHVFSYSADIAISAMVGCNKQIKAAEIIPSNINIFDNNDAIIKAFLSKQLS
SMACGSLIKRKLLDKQRFRNFIAYEDLDFFYKIYSQAQIIVKDNNVRYFYYQRDDGIMGANRLNFSLQHLQALKSVTTYY
ERFFIEKYPQLGSLIYMNILRHLVDNFSEAAVRKNVLAKDVLLLYKTLYLNCISRGFKLSLPYKLFFYFPNLVAKCYLKA
KKIKLSLKEKRITKS
>Mature_335_residues
MYNLNYKQLISIIIPIYNTQQYLSRCLESVINQTYKNLEIILINDGSTDNSLSICQKYKSKDSRIVLLNQQNSGQALARN
NALDIAKGDYIAFIDSDDWVSLDYIQALYNHVFSYSADIAISAMVGCNKQIKAAEIIPSNINIFDNNDAIIKAFLSKQLS
SMACGSLIKRKLLDKQRFRNFIAYEDLDFFYKIYSQAQIIVKDNNVRYFYYQRDDGIMGANRLNFSLQHLQALKSVTTYY
ERFFIEKYPQLGSLIYMNILRHLVDNFSEAAVRKNVLAKDVLLLYKTLYLNCISRGFKLSLPYKLFFYFPNLVAKCYLKA
KKIKLSLKEKRITKS

Specific function: May be involved in the production of the exopolysaccharide (EPS) component of the extracellular matrix during biofilm formation. EPS is responsible for the adhesion of chains of cells into bundles. Required for biofilm maintenance [H]

COG id: COG0463

COG function: function code M; Glycosyltransferases involved in cell wall biogenesis

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the glycosyltransferase 2 family [H]

Homologues:

Organism=Escherichia coli, GI1790044, Length=185, Percent_Identity=31.3513513513514, Blast_Score=104, Evalue=7e-24,
Organism=Escherichia coli, GI1788372, Length=127, Percent_Identity=39.3700787401575, Blast_Score=74, Evalue=1e-14,
Organism=Escherichia coli, GI1787259, Length=89, Percent_Identity=33.7078651685393, Blast_Score=65, Evalue=8e-12,
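
Homologue lines in the format above can be read into structured records with a few lines of code. This is a sketch only: `parse_homologue` is a hypothetical helper, and the last step assumes that `Length` is the alignment length, which this record does not state.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in line.strip().split(","):
        field = field.strip()
        if not field:  # tolerate the trailing comma
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key.strip()] = value.strip()
        elif field.startswith("GI"):
            record["GI"] = field[2:]
    for key in ("Length", "Blast_Score"):
        record[key] = int(record[key])
    for key in ("Percent_Identity", "Evalue"):
        record[key] = float(record[key])
    return record


LINE = ("Organism=Escherichia coli, GI1790044, Length=185, "
        "Percent_Identity=31.3513513513514, Blast_Score=104, Evalue=7e-24,")
hit = parse_homologue(LINE)

# Assumption: Length is the alignment length, so this is the identical-residue count.
identical = round(hit["Percent_Identity"] * hit["Length"] / 100)
print(hit["GI"], hit["Length"], identical)  # prints: 1790044 185 58
```

Keeping the GI as a string avoids treating an identifier as a number.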

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro: IPR001173 [H]

Pfam domain/function: PF00535 Glycos_transf_2 [H]

EC number: 2.-.-.- [C]

Molecular weight (Da): Translated: 38895; Mature: 38895

Theoretical pI: Translated: 9.70; Mature: 9.70

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.8 %Cys     (Translated Protein)
1.5 %Met     (Translated Protein)
3.3 %Cys+Met (Translated Protein)
1.8 %Cys     (Mature Protein)
1.5 %Met     (Mature Protein)
3.3 %Cys+Met (Mature Protein)
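
The Cys/Met percentages above can be reproduced by counting residues in the 335-residue protein sequence from this record. This is a minimal sketch; `residue_percent` is an illustrative helper name. Because the translated and mature sequences are identical here, one calculation covers both.

```python
# Protein sequence copied from this record (335 residues).
PROTEIN = (
    "MYNLNYKQLISIIIPIYNTQQYLSRCLESVINQTYKNLEIILINDGSTDNSLSICQKYKSKDSRIVLLNQQNSGQALARN"
    "NALDIAKGDYIAFIDSDDWVSLDYIQALYNHVFSYSADIAISAMVGCNKQIKAAEIIPSNINIFDNNDAIIKAFLSKQLS"
    "SMACGSLIKRKLLDKQRFRNFIAYEDLDFFYKIYSQAQIIVKDNNVRYFYYQRDDGIMGANRLNFSLQHLQALKSVTTYY"
    "ERFFIEKYPQLGSLIYMNILRHLVDNFSEAAVRKNVLAKDVLLLYKTLYLNCISRGFKLSLPYKLFFYFPNLVAKCYLKA"
    "KKIKLSLKEKRITKS"
)


def residue_percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    return 100.0 * sum(seq.count(r) for r in residues) / len(seq)


for label, residues in (("Cys", "C"), ("Met", "M"), ("Cys+Met", "CM")):
    print(f"{label}: {residue_percent(PROTEIN, residues):.1f}%")
# prints:
# Cys: 1.8%
# Met: 1.5%
# Cys+Met: 3.3%
```

The counts behind these figures are 6 cysteines and 5 methionines, including the initiator methionine.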

Secondary structure:

>Translated Secondary Structure
MYNLNYKQLISIIIPIYNTQQYLSRCLESVINQTYKNLEIILINDGSTDNSLSICQKYKS
CCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCEEEEEEECCCCCCHHHHHHHHHC
KDSRIVLLNQQNSGQALARNNALDIAKGDYIAFIDSDDWVSLDYIQALYNHVFSYSADIA
CCCEEEEEECCCCCCHHHCCCCCEEECCCEEEEECCCCCEEHHHHHHHHHHHHHCCCCHH
ISAMVGCNKQIKAAEIIPSNINIFDNNDAIIKAFLSKQLSSMACGSLIKRKLLDKQRFRN
EEHHHCCCCCCCHHEECCCCCEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
FIAYEDLDFFYKIYSQAQIIVKDNNVRYFYYQRDDGIMGANRLNFSLQHLQALKSVTTYY
HHHHHHHHHHHHHHCCCEEEEEECCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHH
ERFFIEKYPQLGSLIYMNILRHLVDNFSEAAVRKNVLAKDVLLLYKTLYLNCISRGFKLS
HHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEE
LPYKLFFYFPNLVAKCYLKAKKIKLSLKEKRITKS
CCHHHHHHHHHHHHHHHHHHHHHEEEHHHHCCCCC
>Mature Secondary Structure
MYNLNYKQLISIIIPIYNTQQYLSRCLESVINQTYKNLEIILINDGSTDNSLSICQKYKS
CCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCEEEEEEECCCCCCHHHHHHHHHC
KDSRIVLLNQQNSGQALARNNALDIAKGDYIAFIDSDDWVSLDYIQALYNHVFSYSADIA
CCCEEEEEECCCCCCHHHCCCCCEEECCCEEEEECCCCCEEHHHHHHHHHHHHHCCCCHH
ISAMVGCNKQIKAAEIIPSNINIFDNNDAIIKAFLSKQLSSMACGSLIKRKLLDKQRFRN
EEHHHCCCCCCCHHEECCCCCEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
FIAYEDLDFFYKIYSQAQIIVKDNNVRYFYYQRDDGIMGANRLNFSLQHLQALKSVTTYY
HHHHHHHHHHHHHHCCCEEEEEECCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHH
ERFFIEKYPQLGSLIYMNILRHLVDNFSEAAVRKNVLAKDVLLLYKTLYLNCISRGFKLS
HHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEE
LPYKLFFYFPNLVAKCYLKAKKIKLSLKEKRITKS
CCHHHHHHHHHHHHHHHHHHHHHEEEHHHHCCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 8969506; 9384377 [H]