Acknowledgements
The authors thank the Director, CSIR-NIO, for encouraging this work. BS,
PC, UVN and MG received the financial assistance from the Council of
Scientific & Industrial Research, Government of India, under projects
OLP2005 & MLP1802. MS is also funded by the Engineering and Physical
Sciences Research Council, UK, and Imperial College London
(EP/N509486/1: 1,979,819). We thank our funders. We also thank the
High-performance computing facility ”Pravah” to carry out the
bioinformatics and Machine Learning work. This is NIO’s contribution
No___. The authors declare that they have no conflict of interest.
All the copepod associated microbiome sequence datasets were downloaded
from the NCBI SRA database. The information on their NCBI BioProject
numbers (accession numbers), species names, 16S rDNA regions, sequencing
platforms used along with the corresponding references can be found in
the supplementary file Table 1. All of these data are free to download
from the NCBI database (https://www.ncbi.nlm.nih.gov/sra). In the next
step, we vetted these crude files for quality reads. In this study, only
this quality reads were used for our meta-analysis. These datasets in
both sequence format, as well as biom format, is submitted in the
Figshare database and is free to download from the following link
(https://doi.org/10.6084/m9.figshare.c.5086811.v1).
This dataset was used for taxonomy, machine-learning, Picrust2 and ANCOM
analysis. The dataset enclosing the results for the analyses mentioned
above was also deposited in the Figshare repository and is free to
download from the following link
(https://doi.org/10.6084/m9.figshare.c.5087183.v3).
Anyone can download and reuse this data for their analysis.