| Definition | Chlorobaculum parvum NCIB 8327 chromosome, complete genome. |
|---|---|
| Accession | NC_011027 |
| Length | 2,289,249 |
The map label for this gene is epsE [H]
Identifier: 193213453
GI number: 193213453
Start: 1987613
End: 1989130
Strand: Direct
Name: epsE [H]
Synonym: Cpar_1814
Alternate gene names: 193213453
Gene position: 1987613-1989130 (Clockwise)
Preceding gene: 193213451
Following gene: 193213455
Centisome position: 86.82
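The coordinate fields above are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch: it assumes coordinates are 1-based and inclusive, and that the centisome position is the start coordinate expressed as a percentage of the genome length (an assumption that happens to reproduce the listed value). The function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 2_289_249  # "Length" of NC_011027
GENE_START = 1_987_613     # "Start"
GENE_END = 1_989_130       # "End"


def gene_length(start, end):
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start, genome_length):
    """Start position as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


length_bp = gene_length(GENE_START, GENE_END)
codons = length_bp // 3   # includes the terminal stop codon
residues = codons - 1     # stop codon does not encode a residue

print(length_bp, codons, residues)
print(f"{centisome(GENE_START, GENOME_LENGTH):.2f}")
```

Under these assumptions the gene spans 1518 bases, which corresponds to 506 codons, that is 505 residues plus a stop codon.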
GC content: 47.3
Gene sequence:
>1518_bases ATGAACCGTCTTTCCGGAATCATCAGACAGCAGGCAAAACATGCCGAGCGAAAAGTTTCGCGCGTTAAATGGCAGCTTCG TTATGCCTTGGTAAAGAAGGCTCTGAAGAAAAGACCTGAGCATCCCGCGCTTCTGAACAAGCTTGCAAAGATACTTGTAA AGCTGAAAGATGAACAAGAGTTGCTTCGAGCGACAGAACGTTATGCCATGAATGGATTCAGGCATGACGACCCGCAAACT GCCCAAGCGCTCTTTCGCCTTGCGCATTCAAGAGCGGCTTCATTCTGGGATAATGGCAGTTATGAGGAGGCTGTAGCGCT GATTAACAGCGTCATGGAGTGGAATACACCTAATAGCAGTATGCTGCATCAATCGCTTTACTGGCAGTTATGTATAGCAA AAACAGAAAACAAGAAGAATGTCATCGAGAAACGAGTACTTGACCGGTATGTTGAAATCGAAAAGCAGCATTTACCGCAC GACTTTGTAAAAGCTCACATGGCAACCGAGGCGATGATGCGTCTCGGCCTTGCTGATCAGGCTAAAAAGACCCTCCTGAA GCACGAAGACAAGCCAATGGCGCTTTTGGCCCTGTCCAATACCAGCATTGATAATGACGAGCTGTGGCTCGAGTATGTCA ACCGTTTCTTCAGTTCTCGAAAGCTGGAACCGCTCTCCCTATTTGAGGGTGTCAAGCCACGCTTTCTCCGCCTTGAAGGA GCCCCGGCTACCTCATACAGCGGTGGCCCGAAAATCAGCGTCATCATGACCGCATTCAACGTCGAAAACTATATTGAAAC AGCAATACGCTCTGTACTGGCTCAATCATGGAGTAATTTCGAACTGATCGTGGTTGATGACTGCAGCACAGACGCTACTC GCCGTATCGTAAAACAATTGATGGAACATGACTCAAGAATCAAGCTTATAGAAAACCAAACAAATTGCGGCACCTATATC AACCGGAACAAGGCTTATGACCTGGCGACAGGCGAATATGTAACCTGCCATGATTCTGATGACTGGGCACACCCGAAAAA GCTCGAATATCAAGTTACGGCGCTACTTAAAAATCCTGACGCTGTTTCAAGCTCGAGCCACTGGGTGCGAATGTATGAAA ACGGCCAGTTCATTTTTTATACCCTTGGCACTTTTGCCCAATACAATGCGAACTCGCTGATGTTCAAAATCGACAGGGTA AAACCGGTGCTTGGTTACTGGGACTCAGTACGAATAAGCGCTGACAGCGAATTCATCAGGAGGCTGCACGCCGTATTTGG CAAAGAGCGTGAAATCATCCTCGATGACGTTCTTATGTTTGGTCTTAAACGTCCGGAAAGCCTTACAACAGCATCAGGAA GCGGTCACAGGCCTAACGGCATCTCTCCCTTGCGCAAAAGATATTATGACAGTTATACCAACTGGCATCAAACCATTGAC AAGAATTCGGCATACATGGCATTTCCTCTGAAAAGTCGGCCATTCGAAGCACCTGAAGAGATACTGGCTAAAGGCTGA
Upstream 100 bases:
>100_bases AAAAGCACTGTACTGGCGAACAAATATTGAACTCTCTTACAAAAAAGACCATATTTATAAAGTAAAGCCCCCCTACCCCG CGAACATATTGAAAAGTTCC
Downstream 100 bases:
>100_bases AGGTTCCTACCCGAAAAAACATCTCATTCCAGAAACCCGTCAGCGGAAATTGTGAACACGCCCGCCTTGTTAGCCGAGAT TGCTCTGACCTCCCAAGCGT
Product: family 2 glycosyl transferase
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 505; Mature: 505
Protein sequence:
>505_residues MNRLSGIIRQQAKHAERKVSRVKWQLRYALVKKALKKRPEHPALLNKLAKILVKLKDEQELLRATERYAMNGFRHDDPQT AQALFRLAHSRAASFWDNGSYEEAVALINSVMEWNTPNSSMLHQSLYWQLCIAKTENKKNVIEKRVLDRYVEIEKQHLPH DFVKAHMATEAMMRLGLADQAKKTLLKHEDKPMALLALSNTSIDNDELWLEYVNRFFSSRKLEPLSLFEGVKPRFLRLEG APATSYSGGPKISVIMTAFNVENYIETAIRSVLAQSWSNFELIVVDDCSTDATRRIVKQLMEHDSRIKLIENQTNCGTYI NRNKAYDLATGEYVTCHDSDDWAHPKKLEYQVTALLKNPDAVSSSSHWVRMYENGQFIFYTLGTFAQYNANSLMFKIDRV KPVLGYWDSVRISADSEFIRRLHAVFGKEREIILDDVLMFGLKRPESLTTASGSGHRPNGISPLRKRYYDSYTNWHQTID KNSAYMAFPLKSRPFEAPEEILAKG
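The relationship between the gene sequence and the protein sequence can be spot-checked by translating the ends of the coding region. The sketch below assumes the standard codon assignments and uses only the first 21 and last 12 bases of the gene sequence listed above; `translate` is a hypothetical helper, not part of any tool used to build this record.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (assumed here).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


first = translate("ATGAACCGTCTTTCCGGAATC")  # first 21 bases of the gene
last = translate("GCTAAAGGCTGA")            # last 12 bases of the gene
print(first, last)
```

The first fragment should match the start of the protein sequence and the second its last three residues followed by the stop codon.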
Sequences:
>Translated_505_residues MNRLSGIIRQQAKHAERKVSRVKWQLRYALVKKALKKRPEHPALLNKLAKILVKLKDEQELLRATERYAMNGFRHDDPQT AQALFRLAHSRAASFWDNGSYEEAVALINSVMEWNTPNSSMLHQSLYWQLCIAKTENKKNVIEKRVLDRYVEIEKQHLPH DFVKAHMATEAMMRLGLADQAKKTLLKHEDKPMALLALSNTSIDNDELWLEYVNRFFSSRKLEPLSLFEGVKPRFLRLEG APATSYSGGPKISVIMTAFNVENYIETAIRSVLAQSWSNFELIVVDDCSTDATRRIVKQLMEHDSRIKLIENQTNCGTYI NRNKAYDLATGEYVTCHDSDDWAHPKKLEYQVTALLKNPDAVSSSSHWVRMYENGQFIFYTLGTFAQYNANSLMFKIDRV KPVLGYWDSVRISADSEFIRRLHAVFGKEREIILDDVLMFGLKRPESLTTASGSGHRPNGISPLRKRYYDSYTNWHQTID KNSAYMAFPLKSRPFEAPEEILAKG
>Mature_505_residues MNRLSGIIRQQAKHAERKVSRVKWQLRYALVKKALKKRPEHPALLNKLAKILVKLKDEQELLRATERYAMNGFRHDDPQT AQALFRLAHSRAASFWDNGSYEEAVALINSVMEWNTPNSSMLHQSLYWQLCIAKTENKKNVIEKRVLDRYVEIEKQHLPH DFVKAHMATEAMMRLGLADQAKKTLLKHEDKPMALLALSNTSIDNDELWLEYVNRFFSSRKLEPLSLFEGVKPRFLRLEG APATSYSGGPKISVIMTAFNVENYIETAIRSVLAQSWSNFELIVVDDCSTDATRRIVKQLMEHDSRIKLIENQTNCGTYI NRNKAYDLATGEYVTCHDSDDWAHPKKLEYQVTALLKNPDAVSSSSHWVRMYENGQFIFYTLGTFAQYNANSLMFKIDRV KPVLGYWDSVRISADSEFIRRLHAVFGKEREIILDDVLMFGLKRPESLTTASGSGHRPNGISPLRKRYYDSYTNWHQTID KNSAYMAFPLKSRPFEAPEEILAKG
Specific function: May be involved in the production of the exopolysaccharide (EPS) component of the extracellular matrix during biofilm formation. EPS is responsible for the adhesion of chains of cells into bundles. Required for biofilm maintenance [H]
COG id: COG0463
COG function: function code M; Glycosyltransferases involved in cell wall biogenesis
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the glycosyltransferase 2 family [H]
Homologues:
Organism=Escherichia coli, GI1788372, Length=101, Percent_Identity=43.5643564356436, Blast_Score=74, Evalue=2e-14
Organism=Escherichia coli, GI1790044, Length=120, Percent_Identity=32.5, Blast_Score=73, Evalue=3e-14
Organism=Escherichia coli, GI1787259, Length=121, Percent_Identity=34.7107438016529, Blast_Score=71, Evalue=1e-13
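The homologue entries follow a regular `key=value` layout, so they can be turned into structured records with a regular expression. The following is a minimal sketch, assuming the field order shown above is fixed; `parse_hits` and the dictionary keys are hypothetical names chosen for illustration.

```python
import re

# Homologue entries copied from the record above.
RECORD = (
    "Organism=Escherichia coli, GI1788372, Length=101, "
    "Percent_Identity=43.5643564356436, Blast_Score=74, Evalue=2e-14, "
    "Organism=Escherichia coli, GI1790044, Length=120, "
    "Percent_Identity=32.5, Blast_Score=73, Evalue=3e-14, "
    "Organism=Escherichia coli, GI1787259, Length=121, "
    "Percent_Identity=34.7107438016529, Blast_Score=71, Evalue=1e-13,"
)

# One hit = six comma-separated fields in a fixed order (assumed).
HIT_PATTERN = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*"
    r"(?P<gi>GI\d+),\s*"
    r"Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<identity>[\d.]+),\s*"
    r"Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_hits(text):
    """Return one dict per homologue entry found in the text."""
    hits = []
    for match in HIT_PATTERN.finditer(text):
        hits.append({
            "organism": match["organism"].strip(),
            "gi": match["gi"],
            "length": int(match["length"]),
            "identity": float(match["identity"]),
            "score": int(match["score"]),
            "evalue": float(match["evalue"]),
        })
    return hits


hits = parse_hits(RECORD)
best = min(hits, key=lambda hit: hit["evalue"])
print(len(hits), best["gi"], best["evalue"])
```

Because the separators are matched with `\s*`, the same pattern works whether the entries sit on one line or on separate lines.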
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001173 [H]
Pfam domain/function: PF00535 Glycos_transf_2 [H]
EC number: NA
Molecular weight: Translated: 58215; Mature: 58215
Theoretical pI: Translated: 9.62; Mature: 9.62
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.8 %Cys (Translated Protein)
2.8 %Met (Translated Protein)
3.6 %Cys+Met (Translated Protein)
0.8 %Cys (Mature Protein)
2.8 %Met (Mature Protein)
3.6 %Cys+Met (Mature Protein)
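The Cys/Met percentages are simple residue frequencies over the protein length. A minimal sketch of that calculation is given below; `residue_percent` is a hypothetical helper, and the ten-residue peptide is a made-up toy input rather than data from this record.

```python
def residue_percent(sequence, residue):
    """Percentage of positions in `sequence` occupied by `residue`."""
    seq = "".join(sequence.split()).upper()  # tolerate spaces and line breaks
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * seq.count(residue.upper()) / len(seq)


# Toy peptide for illustration only (one Met, one Cys, eight Ala).
toy = "MCAAAAAAAA"
cys = residue_percent(toy, "C")
met = residue_percent(toy, "M")
print(round(cys, 1), round(met, 1), round(cys + met, 1))
```

Applying the same function to the 505-residue protein sequence listed in this record is expected to give figures comparable to those reported above, up to rounding.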
Secondary structure:
>Translated Secondary Structure
MNRLSGIIRQQAKHAERKVSRVKWQLRYALVKKALKKRPEHPALLNKLAKILVKLKDEQE
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCHHH
LLRATERYAMNGFRHDDPQTAQALFRLAHSRAASFWDNGSYEEAVALINSVMEWNTPNSS
HHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCHH
MLHQSLYWQLCIAKTENKKNVIEKRVLDRYVEIEKQHLPHDFVKAHMATEAMMRLGLADQ
HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCHHH
AKKTLLKHEDKPMALLALSNTSIDNDELWLEYVNRFFSSRKLEPLSLFEGVKPRFLRLEG
HHHHHHHCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHCCCCCHHHHHCCCCCEEEEEEC
APATSYSGGPKISVIMTAFNVENYIETAIRSVLAQSWSNFELIVVDDCSTDATRRIVKQL
CCCCCCCCCCEEEEEEEEHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHH
MEHDSRIKLIENQTNCGTYINRNKAYDLATGEYVTCHDSDDWAHPKKLEYQVTALLKNPD
HCCCCCEEEEECCCCCCCEECCCCCEEECCCCEEEEECCCCCCCCCHHHEEEEEEECCCC
AVSSSSHWVRMYENGQFIFYTLGTFAQYNANSLMFKIDRVKPVLGYWDSVRISADSEFIR
CCCCCCCEEEEEECCCEEEEEECCHHHCCCCEEEEEEECCCHHHCCCCCEEECCCHHHHH
RLHAVFGKEREIILDDVLMFGLKRPESLTTASGSGHRPNGISPLRKRYYDSYTNWHQTID
HHHHHHCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHC
KNSAYMAFPLKSRPFEAPEEILAKG
CCCCEEEEECCCCCCCCCHHHHCCC
>Mature Secondary Structure
MNRLSGIIRQQAKHAERKVSRVKWQLRYALVKKALKKRPEHPALLNKLAKILVKLKDEQE
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCHHH
LLRATERYAMNGFRHDDPQTAQALFRLAHSRAASFWDNGSYEEAVALINSVMEWNTPNSS
HHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCHH
MLHQSLYWQLCIAKTENKKNVIEKRVLDRYVEIEKQHLPHDFVKAHMATEAMMRLGLADQ
HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCHHH
AKKTLLKHEDKPMALLALSNTSIDNDELWLEYVNRFFSSRKLEPLSLFEGVKPRFLRLEG
HHHHHHHCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHCCCCCHHHHHCCCCCEEEEEEC
APATSYSGGPKISVIMTAFNVENYIETAIRSVLAQSWSNFELIVVDDCSTDATRRIVKQL
CCCCCCCCCCEEEEEEEEHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHH
MEHDSRIKLIENQTNCGTYINRNKAYDLATGEYVTCHDSDDWAHPKKLEYQVTALLKNPD
HCCCCCEEEEECCCCCCCEECCCCCEEECCCCEEEEECCCCCCCCCHHHEEEEEEECCCC
AVSSSSHWVRMYENGQFIFYTLGTFAQYNANSLMFKIDRVKPVLGYWDSVRISADSEFIR
CCCCCCCEEEEEECCCEEEEEECCHHHCCCCEEEEEEECCCHHHCCCCCEEECCCHHHHH
RLHAVFGKEREIILDDVLMFGLKRPESLTTASGSGHRPNGISPLRKRYYDSYTNWHQTID
HHHHHHCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHC
KNSAYMAFPLKSRPFEAPEEILAKG
CCCCEEEEECCCCCCCCCHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 8969506; 9384377 [H]