| Definition | Xanthomonas campestris pv. vesicatoria str. 85-10 chromosome, complete genome. |
|---|---|
| Accession | NC_007508 |
| Length | 5,178,466 |
The map label for this gene is egl4 [H]
Identifier: 78048260
GI number: 78048260
Start: 3063572
End: 3065332
Strand: Reverse
Name: egl4 [H]
Synonym: XCV2704
Alternate gene names: 78048260
Gene position: 3065332-3063572 (Counterclockwise)
Preceding gene: 78048261
Following gene: 78048258
Centisome position: 59.19
GC content: 65.25
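The coordinate fields above are related by simple arithmetic, sketched below. The function names are hypothetical. The centisome formula (position as a percentage of the chromosome length) is an assumption, and so is the choice of the first coordinate listed under "Gene position" (3065332, the 5' end of a reverse-strand gene) as the reference point; that combination reproduces the reported 59.19.

```python
# Minimal sketch: derive gene length and centisome position from the record.
# Assumption: centisome = 100 * position / chromosome length, measured at the
# first coordinate given under "Gene position".

GENOME_LENGTH = 5_178_466          # "Length" field of the record
START, END = 3_063_572, 3_065_332  # "Start" and "End" fields


def gene_length(start: int, end: int) -> int:
    """Length of an inclusive coordinate range, in bases."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the chromosome length."""
    return round(100.0 * position / genome_length, 2)


print(gene_length(START, END))        # 1761
print(centisome(END, GENOME_LENGTH))  # 59.19
```

The computed length agrees with the 1761 bases given in the gene sequence header.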
Gene sequence:
>1761_bases ATGACTATCTTCAAAACTCTGCTCACCAGCCTGATGATCGTTTCGCCCGTGCTCGCCTGCGCTGCAGAGACAACCGGCAG GCAGGAGGCCGTGGCGGCCAGCGACGCCATCCACGTCAACCAGCTGGGCTATCTGCCGGGCTCGGCAAAAATGGCGATCG TCGGGCTGCCGCCGCAGGGCACCGCGCAGCGTTCGGACCGCTTCACCGTCGAAGATGCACAGGGACGCAGCGTGTTGCAG GGAACCCTGCAGCCGGCCGCGATGTGGTCGCCGGCCGGACAGCAGGCGCGGGTGGCGGATTTTTCCGGGCTGCGTGCGGT CGGAACCTATCGGTTGAAGGTCGACGGGCTGCCGGCATCGGAGACCTTCGCCATCAACGCGCATGCCTATCAGCCCCTGG TCGATGCAGCGTTGAAGGCGTTCTACTTCAACCGTGCCAGCACCGCGCTGCCGGCACGCTACGCCGGGCGCTATGCGCGC GCGGCCGGGCACCCGGATACGCAGGTGCGCATCCATCCGTCCGCAGCGTCTGCAACGCGTCCGACCGATAGCGTGATCAG CGCGCCCAAGGGCTGGTACGACGCGGGCGACTACAACAAGTACATCGTCAATTCCGGCATCAGCACCTACACCTTGCTGG CGGCCTACGAGCAGTATCCGGCGTTCTTCAGGTCCAAGCCGCTGACCATCCCCGACGATGCGCCCGGCGTGCCCGGCATC CTGCAGGAGGCCTGGTGGAATCTGGACTGGATGCTCGCCATGCAGGACCCGGCCGATGGCGGCGTGTACCACAAGCTCAC CGACAAGCAGTTCGATGGCTTGGTGATGCCGGATCAGGCCACGCAGCAACGCTACGTGGTGATGAAGACCAATGCCGCGA CCTTGGACTTCGCTGCGGTGATGGCCGTCGCCAGTCGCGTATATGCGCCATTTGAAAAGCAGTATCCGGGGCTGTCGGCG CGCATGTTGAAAGCCTCGCGGGCTGCGTGGGCGTGGGCGAAGCAGCATCCGGACGTGATCTACCAACAGCCCAGCGACGT ACGCACCGGCGGTTACGACGATGCACACCTGGATGACGAATTCGCCTGGGCGGCGGCCGAGTTGTACATCGCCACCGGCG AGGATGGGTTCTACGACGCGATGATCGCGCGCAACGTGCCGGCCACCGTGCCGTCGTGGGGCAATGTCGGTGGGCTGGCC TGGATGTCGTTGGCCGCGCATCGCGACCGACTCACGCCGCATGCCGATCGCGCGCGGATCGCACGGGAGATCACCGGACT GGCACAACAGCTGGCCGACACCTGGCAATCATCGGCATGGCGGCTGGCGATGAACGACAGCGATTTCGTGTGGGGCAGCA ATGCAGGGGTGCTCAACCGCGCGATGATGCTGCTGCAGGGCTACCGGTTGACGCAGCAGCGGCAATTTCTGGATGCGGCG CAATCGCAGCTGGACTACATCCTGGGCCGCAATCCGCTGGGACTGTCGTTCGTGACCGGGATAGGCAAACGCACCCCGAT GCATATCCATCACCGCCCCTCCGAGGCCGATGGGATCGAAGCGCCGGTGCCGGGTTGGCTGGTCGGCGGCCCGCAGCCGG GGCAGCAGGATGCCAAGGAGTGCAAGGTGCCGTACCCCTCCAAACGGCCGGCGTTGTCGTACCTGGACAACTTCTGCAGC TACTCCACCAACGAGGTGGCGATCAACTGGAATGCGCCGCTGGTGTATGTGAGCGCGGCGATCGAGGCGTCGACGCGCTA A
Upstream 100 bases:
>100_bases ATCCGCGCAAGGGGCGCATTGCTGCAGCGATGCGGATTGACGCATCGGCTATGCCGGCATTGGACTTGTCGCTGCAAGAT AATGCGGCGACCATCGTGGC
Downstream 100 bases:
>100_bases AGCGTCGCTATCAAGCCGATGCCGCGGCTGTTCCTTTTCCCGTCGGGAGAAGGTGGCGCGCAGCGCCGGATGAGGGTACG GCGTCGGCAGGAATACTGAT
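The sequence fields can be checked against the "GC content" and "Strand" fields with a few lines of code. This is a sketch under stated assumptions: the function names are hypothetical, GC content is taken to be the percentage of G and C bases, and the gene sequence is assumed to be printed 5' to 3' on the coding strand (it begins with ATG and ends with TAA), so its reverse complement would correspond to the reference strand.

```python
# Minimal sketch: GC percentage and reverse complement of a DNA string.
# Whitespace is stripped first because the record prints sequences in
# space-separated chunks.

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def clean(seq: str) -> str:
    """Remove whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases, rounded to two decimals."""
    seq = clean(seq)
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return round(100.0 * gc / len(seq), 2)


def reverse_complement(seq: str) -> str:
    """Reverse complement, e.g. to map a reverse-strand gene to the reference strand."""
    return clean(seq).translate(_COMPLEMENT)[::-1]


# Usage on the first bases of the gene sequence in this record.
print(reverse_complement("ATGACT"))  # AGTCAT
```

Passing the full 1761-base string from the "Gene sequence" field to `gc_content` should give a value comparable to the reported 65.25, provided the record uses the same definition.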
Product: cellulase precursor
Products: NA
Alternate protein names: EGD; Cellulase D; Endo-1,4-beta-glucanase [H]
Number of amino acids: Translated: 586; Mature: 585
Protein sequence:
>586_residues MTIFKTLLTSLMIVSPVLACAAETTGRQEAVAASDAIHVNQLGYLPGSAKMAIVGLPPQGTAQRSDRFTVEDAQGRSVLQ GTLQPAAMWSPAGQQARVADFSGLRAVGTYRLKVDGLPASETFAINAHAYQPLVDAALKAFYFNRASTALPARYAGRYAR AAGHPDTQVRIHPSAASATRPTDSVISAPKGWYDAGDYNKYIVNSGISTYTLLAAYEQYPAFFRSKPLTIPDDAPGVPGI LQEAWWNLDWMLAMQDPADGGVYHKLTDKQFDGLVMPDQATQQRYVVMKTNAATLDFAAVMAVASRVYAPFEKQYPGLSA RMLKASRAAWAWAKQHPDVIYQQPSDVRTGGYDDAHLDDEFAWAAAELYIATGEDGFYDAMIARNVPATVPSWGNVGGLA WMSLAAHRDRLTPHADRARIAREITGLAQQLADTWQSSAWRLAMNDSDFVWGSNAGVLNRAMMLLQGYRLTQQRQFLDAA QSQLDYILGRNPLGLSFVTGIGKRTPMHIHHRPSEADGIEAPVPGWLVGGPQPGQQDAKECKVPYPSKRPALSYLDNFCS YSTNEVAINWNAPLVYVSAAIEASTR
Sequences:
>Translated_586_residues MTIFKTLLTSLMIVSPVLACAAETTGRQEAVAASDAIHVNQLGYLPGSAKMAIVGLPPQGTAQRSDRFTVEDAQGRSVLQ GTLQPAAMWSPAGQQARVADFSGLRAVGTYRLKVDGLPASETFAINAHAYQPLVDAALKAFYFNRASTALPARYAGRYAR AAGHPDTQVRIHPSAASATRPTDSVISAPKGWYDAGDYNKYIVNSGISTYTLLAAYEQYPAFFRSKPLTIPDDAPGVPGI LQEAWWNLDWMLAMQDPADGGVYHKLTDKQFDGLVMPDQATQQRYVVMKTNAATLDFAAVMAVASRVYAPFEKQYPGLSA RMLKASRAAWAWAKQHPDVIYQQPSDVRTGGYDDAHLDDEFAWAAAELYIATGEDGFYDAMIARNVPATVPSWGNVGGLA WMSLAAHRDRLTPHADRARIAREITGLAQQLADTWQSSAWRLAMNDSDFVWGSNAGVLNRAMMLLQGYRLTQQRQFLDAA QSQLDYILGRNPLGLSFVTGIGKRTPMHIHHRPSEADGIEAPVPGWLVGGPQPGQQDAKECKVPYPSKRPALSYLDNFCS YSTNEVAINWNAPLVYVSAAIEASTR >Mature_585_residues TIFKTLLTSLMIVSPVLACAAETTGRQEAVAASDAIHVNQLGYLPGSAKMAIVGLPPQGTAQRSDRFTVEDAQGRSVLQG TLQPAAMWSPAGQQARVADFSGLRAVGTYRLKVDGLPASETFAINAHAYQPLVDAALKAFYFNRASTALPARYAGRYARA AGHPDTQVRIHPSAASATRPTDSVISAPKGWYDAGDYNKYIVNSGISTYTLLAAYEQYPAFFRSKPLTIPDDAPGVPGIL QEAWWNLDWMLAMQDPADGGVYHKLTDKQFDGLVMPDQATQQRYVVMKTNAATLDFAAVMAVASRVYAPFEKQYPGLSAR MLKASRAAWAWAKQHPDVIYQQPSDVRTGGYDDAHLDDEFAWAAAELYIATGEDGFYDAMIARNVPATVPSWGNVGGLAW MSLAAHRDRLTPHADRARIAREITGLAQQLADTWQSSAWRLAMNDSDFVWGSNAGVLNRAMMLLQGYRLTQQRQFLDAAQ SQLDYILGRNPLGLSFVTGIGKRTPMHIHHRPSEADGIEAPVPGWLVGGPQPGQQDAKECKVPYPSKRPALSYLDNFCSY STNEVAINWNAPLVYVSAAIEASTR
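The relationship between the 1761-base gene sequence and the 586-residue translated protein can be sketched as follows. The code uses the standard genetic code; the record does not state which translation table was used, so that is an assumption, and the function name is hypothetical. The mature sequence in this record matches the translated one with the initial methionine removed.

```python
# Minimal sketch: translate a coding sequence with the standard genetic code.
# The codon table is built from the conventional TCAG ordering.

BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate from the first base up to (not including) the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks unknown codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First six codons of the gene sequence in this record.
print(translate("ATGACTATCTTCAAAACT"))  # MTIFKT

# 1761 bases are 587 codons; dropping the stop codon leaves 586 residues,
# and dropping the initial methionine as well leaves 585.
codons = 1761 // 3
print(codons - 1, codons - 2)  # 586 585
```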
Specific function: This enzyme catalyzes the endohydrolysis of 1,4-beta-glucosidic linkages in cellulose, lichenin and cereal beta-D-glucans [H]
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: Belongs to the glycosyl hydrolase 9 (cellulase E) family [H]
Homologues:
None
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR008928
- InterPro: IPR012341
- InterPro: IPR016134
- InterPro: IPR002105
- InterPro: IPR018242
- InterPro: IPR001701
- InterPro: IPR018221
- InterPro: IPR004197
- InterPro: IPR013783
- InterPro: IPR014756 [H]
Pfam domain/function: PF02927 CelD_N; PF00404 Dockerin_1; PF00759 Glyco_hydro_9 [H]
EC number: 3.2.1.4 [H]
Molecular weight: Translated: 63880; Mature: 63748
Theoretical pI: Translated: 7.16; Mature: 7.16
Prosite motif: PS00592 GLYCOSYL_HYDROL_F9_1; PS00698 GLYCOSYL_HYDROL_F9_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.5 %Cys (Translated Protein)
2.7 %Met (Translated Protein)
3.2 %Cys+Met (Translated Protein)
0.5 %Cys (Mature Protein)
2.6 %Met (Mature Protein)
3.1 %Cys+Met (Mature Protein)
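The Cys/Met percentages can be recomputed from a protein string as shown in this sketch. The function name is hypothetical, and rounding to one decimal place is an assumption based on how the values are printed. As a rough consistency check, 0.5 % of 586 residues corresponds to about three cysteines.

```python
# Minimal sketch: percentage of selected residues in a protein sequence.
# Whitespace is removed first because the record prints sequences in chunks.


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given one-letter residues."""
    protein = "".join(protein.split()).upper()
    if not protein:
        return 0.0
    count = sum(protein.count(r) for r in residues.upper())
    return round(100.0 * count / len(protein), 1)


# Usage on the first 60 residues of the translated protein in this record;
# pass the full sequence to compare with the values listed above.
fragment = "MTIFKTLLTSLMIVSPVLACAAETTGRQEAVAASDAIHVNQLGYLPGSAKMAIVGLPPQG"
print(residue_percent(fragment, "C"))
print(residue_percent(fragment, "M"))
print(residue_percent(fragment, "CM"))
```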
Secondary structure:
>Translated Secondary Structure
MTIFKTLLTSLMIVSPVLACAAETTGRQEAVAASDAIHVNQLGYLPGSAKMAIVGLPPQG
CCHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHCCCEEEEHCCCCCCCCCEEEEECCCCC
TAQRSDRFTVEDAQGRSVLQGTLQPAAMWSPAGQQARVADFSGLRAVGTYRLKVDGLPAS
CCCCCCCEEEECCCCCHHHHHCCCCHHHCCCCCCCCEECCCCCCEEEEEEEEEECCCCCC
ETFAINAHAYQPLVDAALKAFYFNRASTALPARYAGRYARAAGHPDTQVRIHPSAASATR
CEEEEECHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHCCCCCCEEEECCCCCCCCC
PTDSVISAPKGWYDAGDYNKYIVNSGISTYTLLAAYEQYPAFFRSKPLTIPDDAPGVPGI
CCHHHHCCCCCCCCCCCCCCEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHH
LQEAWWNLDWMLAMQDPADGGVYHKLTDKQFDGLVMPDQATQQRYVVMKTNAATLDFAAV
HHHHHCCCEEEEEECCCCCCCEEEEECCCCCCCEECCCCCCCCEEEEEECCCCHHHHHHH
MAVASRVYAPFEKQYPGLSARMLKASRAAWAWAKQHPDVIYQQPSDVRTGGYDDAHLDDE
HHHHHHHHCCHHHHCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCH
FAWAAAELYIATGEDGFYDAMIARNVPATVPSWGNVGGLAWMSLAAHRDRLTPHADRARI
HHEEEEEEEEECCCCCCHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHHH
AREITGLAQQLADTWQSSAWRLAMNDSDFVWGSNAGVLNRAMMLLQGYRLTQQRQFLDAA
HHHHHHHHHHHHHHHHCCCEEEEECCCCEEECCCCHHHHHHHHHHHCCHHHHHHHHHHHH
QSQLDYILGRNPLGLSFVTGIGKRTPMHIHHRPSEADGIEAPVPGWLVGGPQPGQQDAKE
HHHHHHHHCCCCCCHHHHHCCCCCCCEEEECCCCCCCCCCCCCCCEECCCCCCCHHHHHH
CKVPYPSKRPALSYLDNFCSYSTNEVAINWNAPLVYVSAAIEASTR
CCCCCCCCCCHHHHHHHHHCCCCCEEEEECCCCEEEEEEEECCCCC

>Mature Secondary Structure
TIFKTLLTSLMIVSPVLACAAETTGRQEAVAASDAIHVNQLGYLPGSAKMAIVGLPPQG
CHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHCCCEEEEHCCCCCCCCCEEEEECCCCC
TAQRSDRFTVEDAQGRSVLQGTLQPAAMWSPAGQQARVADFSGLRAVGTYRLKVDGLPAS
CCCCCCCEEEECCCCCHHHHHCCCCHHHCCCCCCCCEECCCCCCEEEEEEEEEECCCCCC
ETFAINAHAYQPLVDAALKAFYFNRASTALPARYAGRYARAAGHPDTQVRIHPSAASATR
CEEEEECHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHCCCCCCEEEECCCCCCCCC
PTDSVISAPKGWYDAGDYNKYIVNSGISTYTLLAAYEQYPAFFRSKPLTIPDDAPGVPGI
CCHHHHCCCCCCCCCCCCCCEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHH
LQEAWWNLDWMLAMQDPADGGVYHKLTDKQFDGLVMPDQATQQRYVVMKTNAATLDFAAV
HHHHHCCCEEEEEECCCCCCCEEEEECCCCCCCEECCCCCCCCEEEEEECCCCHHHHHHH
MAVASRVYAPFEKQYPGLSARMLKASRAAWAWAKQHPDVIYQQPSDVRTGGYDDAHLDDE
HHHHHHHHCCHHHHCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCH
FAWAAAELYIATGEDGFYDAMIARNVPATVPSWGNVGGLAWMSLAAHRDRLTPHADRARI
HHEEEEEEEEECCCCCCHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHHH
AREITGLAQQLADTWQSSAWRLAMNDSDFVWGSNAGVLNRAMMLLQGYRLTQQRQFLDAA
HHHHHHHHHHHHHHHHCCCEEEEECCCCEEECCCCHHHHHHHHHHHCCHHHHHHHHHHHH
QSQLDYILGRNPLGLSFVTGIGKRTPMHIHHRPSEADGIEAPVPGWLVGGPQPGQQDAKE
HHHHHHHHCCCCCCHHHHHCCCCCCCEEEECCCCCCCCCCCCCCCEECCCCCCCHHHHHH
CKVPYPSKRPALSYLDNFCSYSTNEVAINWNAPLVYVSAAIEASTR
CCCCCCCCCCHHHHHHHHHCCCCCEEEEECCCCEEEEEEEECCCCC
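The predicted secondary structure can be summarized as the share of each state, which gives a quick way to relate the prediction to a structure class. This is a sketch only: the function name is hypothetical, and reading H as helix, E as strand and C as coil is an assumption, since the record does not define the letters.

```python
# Minimal sketch: percentage of each predicted state in a secondary-structure
# string (assumed alphabet: H = helix, E = strand, C = coil).
from collections import Counter


def state_composition(prediction: str) -> dict:
    """Map each state letter to its percentage of the prediction string."""
    states = "".join(prediction.split()).upper()
    if not states:
        return {}
    counts = Counter(states)
    return {s: round(100.0 * n / len(states), 1) for s, n in sorted(counts.items())}


# Usage on the first prediction line of the translated protein; concatenate
# all prediction lines to summarize the whole protein.
line = "CCHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHCCCEEEEHCCCCCCCCCEEEEECCCCC"
print(state_composition(line))
```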
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 3024110 [H]