Acknowledgements
The authors thank the Director, CSIR-NIO, for encouraging this work. BS, PC, UVN and MG received the financial assistance from the Council of Scientific & Industrial Research, Government of India, under projects OLP2005 & MLP1802. MS is also funded by the Engineering and Physical Sciences Research Council, UK, and Imperial College London (EP/N509486/1: 1,979,819). We thank our funders. We also thank the High-performance computing facility ”Pravah” to carry out the bioinformatics and Machine Learning work. This is NIO’s contribution No___. The authors declare that they have no conflict of interest. All the copepod associated microbiome sequence datasets were downloaded from the NCBI SRA database. The information on their NCBI BioProject numbers (accession numbers), species names, 16S rDNA regions, sequencing platforms used along with the corresponding references can be found in the supplementary file Table 1. All of these data are free to download from the NCBI database (https://www.ncbi.nlm.nih.gov/sra). In the next step, we vetted these crude files for quality reads. In this study, only this quality reads were used for our meta-analysis. These datasets in both sequence format, as well as biom format, is submitted in the Figshare database and is free to download from the following link (https://doi.org/10.6084/m9.figshare.c.5086811.v1). This dataset was used for taxonomy, machine-learning, Picrust2 and ANCOM analysis. The dataset enclosing the results for the analyses mentioned above was also deposited in the Figshare repository and is free to download from the following link (https://doi.org/10.6084/m9.figshare.c.5087183.v3). Anyone can download and reuse this data for their analysis.