FIGURE LEGENDS
Fig.1 The phylogenetic relationships among six nematodes and
genomic characteristics and synteny of B. schroederi.(a) Genomic characterization of B. schroederi genome.
The figure shows the gene number, repeat content, GC content, sequencing
coverage and scaffolds from the center to the edge. (b) Synteny
of B. schroederi with P. univalens and T. canis at
the gene level. Different colors represent different synteny blocks.(c) Upset plot showing the intersection of gene family
expansions in nematodes. Each row represents a nematode. Black circles
and vertical lines between the rows represent the intersection of
expanded families between species. The barplot indicates the total gene
family count in each intersection. (d) Time-calibrated maximum
likelihood phylogenetic tree of six nematodes. The estimated divergent
times are shown at the bifurcations. The numbers below the nodes
represent the number of gene families significantly expanded,
maintained, and contracted, respectively.
Fig.2 The expansion and contraction of roundworm gene families.(a) Significant increases and decreases in roundworm gene
families. The solid circle and the solid triangle represent the top KEGG
pathways that are enriched in the expanded gene families of Ascariasis
compared with C. elegans or M. hapla , respectively. The
open circle and the open triangle represent the top KEGG pathways that
are enriched in the contracted gene families of Ascariasis compared withC. elegans or M. hapla , respectively. (b) GO
function enrichment and gene copy number of the significantly expanded
gene families in roundworms; (c) The proportion of GO
functional genes in the gene family with significant expansion (or
contraction) in roundworms compared to the total number of expansion (or
contraction) genes. The red asterisk represents the p value of
statistical Sidak’s multiple comparisons tests of expansion and
contraction of genes comparing with C. elegans or M. hapla(One asterisk represent 10-1).
Fig.3 Expansion and contraction ofB. schroederi gene
families compared with three roundworms (A. suum P.
univalens and T. canis). (a) Enrichment of the KEGG pathway in
some significantly expanded gene families of B. schroederi . The
proportion represents the ratio of the number of expanded genes located
in the pathway (target genes) to all genes in the pathway (background
genes). (b) REVIGO clusters of significantly over-represented
GO items for significantly expanded gene families in B.
schroederi . The position of the bubbles is based on semantic similarity
of GO terms. (c) Enrichment of KEGG pathways in B.
schroederi ’s unique gene families. (d) Heatmap showing the
gene families of B. schroederi that are significantly expanded or
contracted (P < 0.01). The x axis represents the four
roundworms of Ascariasis, whereas the y axis represents the families.
Fig.4 Life history of B. schroederi and the effect of
actin gene on muscle contraction. (a) Life history of B.
schroederi . L1 and L2 represent in vitro developmental stages, and L2
larvae enter the host body after developing into the infective stage. L3
and L4 represent the stage of internal organ migration of the larva.
Stage L5 larvae return to the small intestine and develop into adult
worms through sexual maturation. (b) Schematic diagram of
anatomical cross-section of B. schroederi ; (c) Multiple
signaling pathways are involved in actin polymerization, and genes in
red are positively selected genes (PSGs). It shows significant expansion
of three key gene families involved in actin polymerization.
Fig.5 Demographic history of theB.
schroederi reconstructed from the reference and population resequencing
genomes. (a) The red and purple line represent the estimated effective
population size of B. schroederi and host, respectively. The 100
gray curves of B. schroederi and host represent the PSMC
estimates for 100 sequences randomly resampled from the original
sequence. Generation time (g) of e and giant panda were
0.17 and 12 years, respectively. The neutral mutation rate per
generation (µ ) of B. schroederi and giant panda were 0.9 ×
10−8 and 1.3 × 10−8, respectively.
The black line shows the MAR of Chinese loess. (b) Longitudinal
change of the effective population size of the B. schroederipopulations. The effective population sizes (Ne ) were estimated
using the MSMC2 method. QLI:
Qinling population; SC: Sichuan population.
Fig. 6 Population structure and relationships of Sichuan (SC) in
comparison to Qinling (QLI) population. (a) The geographic distribution
of the sampling locations for QLI and SC populations. (b) PCA
analysis of two populations; (c) A maximum likelihood (ML)
phylogenetic tree with 100 bootstrap tests constructed using
whole-genome SNPs information. We used P. univale as the
outgroup; (d) Population structure of SC and QLI populations (K
from 2 to 5). The y axis quantifies the proportion of the individual’s
genome from inferred ancestral population, and x axis shows the
different individuals.
Fig.7 Analysis of natural selection in captive populations. (a)Genomic regions with selection sweep signals in captive (SC) and wild
(QLI) B. schroederi population. Distribution of ln ratio
(θπ, wild(QLI)/θπ, captive(SC)) andFst of 50 kb windows with 10 kb steps. Red dots represent windows
fulfilling the selected regions requirement (corresponding to Z test P
< 0.005, where Fst >0.21 and ln ratio
>0.34). (b) Plot of iHS showing loci under
positive selection of captive (SC) population. SNPs with
|iHS|≥iHSm (3.89, top 1%) were shown
above the dashed horizontal line. Nucleotide diversity aroundglc-1 , nrf-6 , ABC transporter ced7 and bre-4loci using 10-kb sliding windows were displayed above the genes. The
decay of haplotype homozygosity around a focal marker were displayed on
the right side of the figure. The furcation structures represent the
complete information contained in the concept of extended shared
haplotypes EHH(Pardis C. Sabeti et al., 2002). The root (focal marker)
is indicated by a vertical dashed line. The thickness of the lines
corresponds to the number of scaffolds sharing a haplotype. (c)XP-EHH from each SNP core showing the same nucleotide between the
subject and the comparison target, also transformed to P values
and plotted in logarithmic scale.
Fig.8 The position of known and potential drug target genes onsuperscaffolds. Different
colors indicate different known drugs, and black indicates potential
drug targets. The chemical structural formulas of 23 known drugs are
drawn. The circles following the potential drug targets represent the
six criteria, with a red solid circle indicating match condition and a
hollow circle indicating mismatch condition. the six criteria were: (1)
Similarity with ChEMBL known drug targets having a highly conserved
alignment (>80%); (2) Lack of human homologues; (3)
Related to lethal, L3 arrest, flaccid, molt defect or sterile phenotype;
(4) A predicted metabolic
chokepoint; (5) A predicted excretory/secretory protein (EP); (6) The
protein has a structure in the PDBe. Potential drug target proteins
encoding genes on each superscaffold and corresponding scores are marked
(black).