Mixed samples
We tested NGSpeciesID’s performance on mixed samples in silico by
combining 300 reads of each of the seven barcodes from Maestri et al.
(2019). For this analysis, we set the cluster abundance ratio to 5%
(--abundance_ratio 0.05). We recovered seven consensus sequences
corresponding to the seven DNA barcodes, ranging from 99.3% to 100%
similarity to the corresponding Sanger sequence (Table 2). In four out
of the seven cases, we recovered the same percentage similarity to the
Sanger sequence in the mixed analysis as in the respective
single-barcode processing. In the remaining three cases, accuracy was
slightly lower, with two or four base-pair differences.
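
The in silico mixing described above can be sketched as follows. This is a minimal illustration, not the script used in the study: the function names (read_fastq, mix_barcodes, barcode_fraction), the random seed, and the assumption of uncompressed four-line FASTQ records are all hypothetical choices made for the example.

```python
import random
from pathlib import Path


def read_fastq(path):
    """Return a list of 4-line FASTQ records from an uncompressed file.

    Assumes each record occupies exactly four lines (no wrapped sequences).
    """
    lines = Path(path).read_text().splitlines()
    usable = len(lines) - len(lines) % 4  # ignore a trailing partial record
    return [lines[i:i + 4] for i in range(0, usable, 4)]


def mix_barcodes(fastq_paths, out_path, reads_per_barcode=300, seed=0):
    """Sample a fixed number of reads per barcode and write one mixed FASTQ.

    Returns the total number of reads written.
    """
    rng = random.Random(seed)  # fixed seed so the mixture is reproducible
    mixed = []
    for path in fastq_paths:
        records = read_fastq(path)
        if len(records) < reads_per_barcode:
            raise ValueError(f"{path} has only {len(records)} reads")
        mixed.extend(rng.sample(records, reads_per_barcode))
    rng.shuffle(mixed)  # interleave barcodes so input order carries no signal
    with open(out_path, "w") as handle:
        for record in mixed:
            handle.write("\n".join(record) + "\n")
    return len(mixed)


def barcode_fraction(n_barcodes, reads_per_barcode=300):
    """Fraction of the mixed read pool contributed by each barcode."""
    return reads_per_barcode / (n_barcodes * reads_per_barcode)
```

With seven barcodes at 300 reads each, the mixed pool holds 2,100 reads and each barcode contributes 1/7 (about 14.3%) of them. Assuming the abundance ratio is interpreted as a fraction of the total reads, every barcode lies well above the 5% threshold, which is consistent with all seven consensus sequences being recovered.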