3.1. Identification of genes encoding different cellulases in Scenedesmus LWG002611
In the present work, we performed an analysis of Scendesmus quadricauda LWG002611 draft genome sequence, and eleven endoglucanases, β-glucosidases and exoglucanases gene sequences have been detected by protein folding homology analysis by Phyre2 and similarity study withMonoraphidium neglectum, a closely related species which has been taken a reference sequence for draft genome analysis.
These sequences have shown 82.26-98.38% similarity with the reference sequences of M. neglectum (Table 1).
According to the amino acid sequence analysis by Pfam two GH9 (Scequ2611|3068 and Scequ2611|4665), one GH5 (Scequ2611|2009), three GH1 (Scequ2611|3544, Scequ2611|9833 and Scequ2611|10006), one GH10 (Scequ2611|547), and four undefined GH family short proteins (Scequ2611|8404, Scequ2611|9353, Scequ2611|13370 and Scequ2611|13657) were detected (Table 1, Supplementary file 1)