Definition | Acidovorax citrulli AAC00-1 chromosome, complete genome. |
---|---|
Accession | NC_008752 |
Length | 5,352,772 |
The map label for this gene is epsD [H]
Identifier: 120609897
GI number: 120609897
Start: 1332893
End: 1334014
Strand: Reverse
Name: epsD [H]
Synonym: Aave_1210
Alternate gene names: 120609897
Gene position: 1334014-1332893 (Counterclockwise)
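The coordinates above are consistent with the sequence lengths reported further down in this record. A minimal sketch of that arithmetic, assuming 1-based inclusive coordinates and a final stop codon that is not translated (the function names are illustrative, not part of the database):

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based, inclusive coordinates."""
    return end - start + 1


def residue_count(n_bases: int) -> int:
    """Encoded amino acids, assuming the last codon is a stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("length is not a whole number of codons")
    return n_bases // 3 - 1


length = gene_length(1332893, 1334014)
print(length)                 # 1122
print(residue_count(length))  # 373
```

Because the gene lies on the reverse strand, its 5' end is the larger coordinate (1334014), which is why the position is written as 1334014-1332893.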
Preceding gene: 120609898
Following gene: 120609896
Centisome position: 24.92
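The centisome value can be reproduced as the gene's position expressed as a percentage of the chromosome length. The sketch below assumes the database uses the gene's 5' coordinate, which for this reverse-strand gene is the End value; that assumption matches the reported 24.92, whereas the Start value would not.

```python
def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, rounded to 2 decimals."""
    return round(100.0 * position / genome_length, 2)


GENOME_LENGTH = 5_352_772  # chromosome length from this record

print(centisome(1_334_014, GENOME_LENGTH))  # 24.92 (5' end on the reverse strand)
print(centisome(1_332_893, GENOME_LENGTH))  # 24.9  (does not match the record)
```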
GC content: 70.41
Gene sequence:
>1122_bases GTGAAAGTCCTGCAGGTCATCACCAAAGGCGAGACAGGGGGAGCGCAGAGCCATGTGCGCACACTGTGCGCCGCGCTCTC CACGCAGGGGGTGGCAGTGCAGGCCGCCATCGGCGGAGCCGGCCCGCTTTCTCCCTTGGGCAGGTATCTGGAAACCCTGG GCCTGCATGTGCATGGCATTCCCTTGCTGGACAACGGCTTGCGACCGGTCCGCCTCCTGCGGGCCACGCGTGATCTGCTC CGCATCGTGCGAGCGGAGTCTCCGGACCTGCTGCACGCCCACAGCGCCATGGCGGGCGTCGCAGCGCGTATCGCCGGGGC ATTGTCGGGTGTTCCCGTGATCTACACCGTGCACGGCTTCGCATTCAAGGCCGGCAATGCCTGGCCGCGCCGGGCGGTGG CCTGGACCACCGAGGCGCTGCTGGCCCCGCTGACCAGCCGCATGGTCTGCGTCTCGGAACACGAGCGTGCCCTTGCGCGC AGCCTGCCGCTGCCGAGCTGGCGCCTGGAGACCGTCGTGAACGGCGTTGACGACATCGCCGTTCCCGCCGATGCCACGGC ATGGAAGCGCGAAGGCACTTCGGTCGCCATGGTCGCCCGCATGGCACCGCCGAAGCGGCACGACCTGCTGCTGCATGCCC TGGCCCGGCTCAGGGACACGACCGGACAGCATGGCTCCGCCACGCTCCTGGGGGACGGCCCCGACAGGCCGGCCCACGAA GCGCTGGCAGCCCGCCTCGGCCTGCAGTCGTCGGTCGATTTCAAGGGGGACGTGGACGACGTACCCGCGCAGCTCGCACG CCACGGGATCTTCGTGCTCATGTCGGACCACGAAGGACTGCCCGTTTCGCTCATCGAGGCGATGCGTGCGGGCATGGCGA TCGTGGCCAGCGATCTGCCTGGCGTGCGCGAGCTGCTGCCCGAACCCGGGCACGCCCTGCTCGTGCCCGCGGACCCGGAA GCCCTCGCCGCTGCACTGCGACAATTGATGGAGTCCCCCGCGCTGCGCGCCCGGCTCGGCGCTGCCGCACGGCAGCGCTA CGAACGGCACTACACCGCCGAGCGCATGGGGTCTGCGGTGCGCGACGTCTATGCCGCCGCCCTGCGGCGTTCCCGCTCCT GA
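GC content is conventionally the percentage of G and C bases in a sequence. The helper below is a minimal sketch under that definition; it strips the spaces that separate the 80-base blocks in this record. Whether the database computes its 70.41 figure in exactly this way is an assumption.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# First 20 bases of the gene sequence, split by a space as in the record.
print(round(gc_content("GTGAAAGTCC TGCAGGTCAT"), 2))  # 50.0
```

Passing the full 1122-base gene sequence to `gc_content` gives the value to compare against the GC content field.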
Upstream 100 bases:
>100_bases CTGCTGCGGCTCGAACTCGTGGGCGACGTGCTGGACCGCACGGGTATCGCGCCATGAGCAGGCTCCGCGCCATTCGGCCA GCGCCCGGCATGCGTACGCC
Downstream 100 bases:
>100_bases ATGGCATGAACAGCATTCCCCAACGCGAACTTGGCTCTTCCACGGCGCACCGGCAAGGCGCGCTGCTGCGCTGGTCGCTG GCGGGAACGGTGACGCTGCT
Product: group 1 glycosyl transferase
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 373; Mature: 373
Protein sequence:
>373_residues MKVLQVITKGETGGAQSHVRTLCAALSTQGVAVQAAIGGAGPLSPLGRYLETLGLHVHGIPLLDNGLRPVRLLRATRDLL RIVRAESPDLLHAHSAMAGVAARIAGALSGVPVIYTVHGFAFKAGNAWPRRAVAWTTEALLAPLTSRMVCVSEHERALAR SLPLPSWRLETVVNGVDDIAVPADATAWKREGTSVAMVARMAPPKRHDLLLHALARLRDTTGQHGSATLLGDGPDRPAHE ALAARLGLQSSVDFKGDVDDVPAQLARHGIFVLMSDHEGLPVSLIEAMRAGMAIVASDLPGVRELLPEPGHALLVPADPE ALAAALRQLMESPALRARLGAAARQRYERHYTAERMGSAVRDVYAAALRRSRS
Sequences:
>Translated_373_residues MKVLQVITKGETGGAQSHVRTLCAALSTQGVAVQAAIGGAGPLSPLGRYLETLGLHVHGIPLLDNGLRPVRLLRATRDLL RIVRAESPDLLHAHSAMAGVAARIAGALSGVPVIYTVHGFAFKAGNAWPRRAVAWTTEALLAPLTSRMVCVSEHERALAR SLPLPSWRLETVVNGVDDIAVPADATAWKREGTSVAMVARMAPPKRHDLLLHALARLRDTTGQHGSATLLGDGPDRPAHE ALAARLGLQSSVDFKGDVDDVPAQLARHGIFVLMSDHEGLPVSLIEAMRAGMAIVASDLPGVRELLPEPGHALLVPADPE ALAAALRQLMESPALRARLGAAARQRYERHYTAERMGSAVRDVYAAALRRSRS
>Mature_373_residues MKVLQVITKGETGGAQSHVRTLCAALSTQGVAVQAAIGGAGPLSPLGRYLETLGLHVHGIPLLDNGLRPVRLLRATRDLL RIVRAESPDLLHAHSAMAGVAARIAGALSGVPVIYTVHGFAFKAGNAWPRRAVAWTTEALLAPLTSRMVCVSEHERALAR SLPLPSWRLETVVNGVDDIAVPADATAWKREGTSVAMVARMAPPKRHDLLLHALARLRDTTGQHGSATLLGDGPDRPAHE ALAARLGLQSSVDFKGDVDDVPAQLARHGIFVLMSDHEGLPVSLIEAMRAGMAIVASDLPGVRELLPEPGHALLVPADPE ALAAALRQLMESPALRARLGAAARQRYERHYTAERMGSAVRDVYAAALRRSRS
Specific function: May be involved in the production of the exopolysaccharide (EPS) component of the extracellular matrix during biofilm formation. EPS is responsible for the adhesion of chains of cells into bundles. Required for biofilm maintenance [H]
COG id: COG0438
COG function: function code M; Glycosyltransferase
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: Belongs to the glycosyltransferase 1 family [H]
Homologues:
Organism=Caenorhabditis elegans, GI17532709, Length=325, Percent_Identity=24.9230769230769, Blast_Score=74, Evalue=1e-13
Organism=Drosophila melanogaster, GI24654571, Length=324, Percent_Identity=26.5432098765432, Blast_Score=73, Evalue=3e-13
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001296 [H]
Pfam domain/function: PF00534 Glycos_transf_1 [H]
EC number: NA
Molecular weight: Translated: 39645; Mature: 39645
Theoretical pI: Translated: 10.18; Mature: 10.18
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.5 %Cys (Translated Protein)
2.7 %Met (Translated Protein)
3.2 %Cys+Met (Translated Protein)
0.5 %Cys (Mature Protein)
2.7 %Met (Mature Protein)
3.2 %Cys+Met (Mature Protein)
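The Cys/Met percentages are residue counts divided by the protein length. A minimal sketch follows, assuming one-decimal rounding; the counts of 2 Cys and 10 Met used at the end come from a manual tally of the 373-residue sequence in this record, not from the database itself.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    count = sum(seq.count(r) for r in residues)
    return round(100.0 * count / len(seq), 1)


# First 10 residues of the protein: one Met, no Cys.
print(residue_percent("MKVLQVITKG", "M"))  # 10.0
print(residue_percent("MKVLQVITKG", "C"))  # 0.0

# Manual tally for the full protein (assumption): 2 Cys, 10 Met, 373 residues.
print(round(100 * 2 / 373, 1))   # 0.5
print(round(100 * 10 / 373, 1))  # 2.7
print(round(100 * 12 / 373, 1))  # 3.2
```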
Secondary structure:
>Translated Secondary Structure
MKVLQVITKGETGGAQSHVRTLCAALSTQGVAVQAAIGGAGPLSPLGRYLETLGLHVHGI
CCHHHEEECCCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHHHHHCCEEECC
PLLDNGLRPVRLLRATRDLLRIVRAESPDLLHAHSAMAGVAARIAGALSGVPVIYTVHGF
CHHCCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEECE
AFKAGNAWPRRAVAWTTEALLAPLTSRMVCVSEHERALARSLPLPSWRLETVVNGVDDIA
EEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCHHCC
VPADATAWKREGTSVAMVARMAPPKRHDLLLHALARLRDTTGQHGSATLLGDGPDRPAHE
CCCCCHHHHCCCCCEEEHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEECCCCCCHHHH
ALAARLGLQSSVDFKGDVDDVPAQLARHGIFVLMSDHEGLPVSLIEAMRAGMAIVASDLP
HHHHHHCCCCCCCCCCCHHHHHHHHHHCCEEEEEECCCCCCHHHHHHHHHCHHEEHHCCC
GVRELLPEPGHALLVPADPEALAAALRQLMESPALRARLGAAARQRYERHYTAERMGSAV
CHHHHCCCCCCEEEEECCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
RDVYAAALRRSRS
HHHHHHHHHHCCC
>Mature Secondary Structure
MKVLQVITKGETGGAQSHVRTLCAALSTQGVAVQAAIGGAGPLSPLGRYLETLGLHVHGI
CCHHHEEECCCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHHHHHCCEEECC
PLLDNGLRPVRLLRATRDLLRIVRAESPDLLHAHSAMAGVAARIAGALSGVPVIYTVHGF
CHHCCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEECE
AFKAGNAWPRRAVAWTTEALLAPLTSRMVCVSEHERALARSLPLPSWRLETVVNGVDDIA
EEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCHHCC
VPADATAWKREGTSVAMVARMAPPKRHDLLLHALARLRDTTGQHGSATLLGDGPDRPAHE
CCCCCHHHHCCCCCEEEHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEECCCCCCHHHH
ALAARLGLQSSVDFKGDVDDVPAQLARHGIFVLMSDHEGLPVSLIEAMRAGMAIVASDLP
HHHHHHCCCCCCCCCCCHHHHHHHHHHCCEEEEEECCCCCCHHHHHHHHHCHHEEHHCCC
GVRELLPEPGHALLVPADPEALAAALRQLMESPALRARLGAAARQRYERHYTAERMGSAV
CHHHHCCCCCCEEEEECCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
RDVYAAALRRSRS
HHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 8969506; 9384377 [H]