Evaluating kinship estimation methods for reduced-representation SNP data in non-model species
  • Eilish McMaster,
  • Patricia Lu-Irving,
  • Marlien van der Merwe,
  • Simon Ho,
  • Maurizio Rossetto
Eilish McMaster
The University of Sydney School of Life and Environmental Sciences

Corresponding Author: [email protected]

Patricia Lu-Irving
Botanic Gardens of Sydney
Marlien van der Merwe
Royal Botanic Gardens and Domain Trust
Simon Ho
University of Sydney
Maurizio Rossetto
The Royal Botanic Gardens Sydney

Abstract

Kinship estimation is widely used in ecological and evolutionary research, particularly in studies of human genealogy and genome-wide associations. In conservation, restoration, agriculture, and forestry, identifying relationships between individuals can be crucial for successful population management and can provide insight into inheritance patterns. Kinship estimation methods are typically designed for large datasets with hundreds of thousands of single-nucleotide polymorphisms. However, studies of non-model species often use much smaller datasets obtained using reduced-representation sequencing. To evaluate the performance of kinship estimation methods under these circumstances, we applied six algorithms to datasets from six non-model Australian flowering plant species (Acacia terminalis, Acacia suaveolens, Banksia serrata, Banksia aemula, Hakea sericea, and Hakea teretifolia), encompassing 3,390 individuals and 369 families. Our results show that kinship methods perform differently on reduced-representation sequence data than in prior evaluations. PC-Relate, RelateAdmix, and Goudet’s beta dosage exhibited limited precision; KING Homo and KING Robust demonstrated high precision with limited sensitivity; and PLINK displayed variable sensitivity and precision. The sensitivity and precision of the methods were affected in various ways by filtering parameters; each method showed its best performance under different thresholds for minor allele frequency and locus missingness. We also present a case study that illustrates a practical application of the methods, demonstrating how estimates of kinship can inform management of seed production areas of the broadleaf hopbush (Dodonaea viscosa). Based on our findings, we offer specific recommendations for using kinship estimation methods in studies of reduced-representation sequence data from non-model species.

05 Mar 2024: Submitted to Molecular Ecology Resources
06 Mar 2024: Assigned to Editor
06 Mar 2024: Submission Checks Completed
06 Mar 2024: Review(s) Completed, Editorial Evaluation Pending
08 Mar 2024: Reviewer(s) Assigned