LCSB R³
Responsible and Reproducible Research

Comprehensive reanalysis for CNVs in ES data from unsolved rare disease cases results in new diagnoses#

Authors#

Solve-RD Consortium, Patrick May

Abstract#

We report the results of a comprehensive copy number variant (CNV) reanalysis of 9171 exome sequencing datasets from 5757 families affected by a rare disease (RD). The data reanalysed was extremely heterogeneous, having been generated using 28 different enrichment kits by 42 different research groups across Europe partnering in the Solve-RD project. Each research group had previously undertaken their own analysis of the data but failed to identify disease-causing variants. We applied three CNV calling algorithms to maximise sensitivity, and rare CNVs overlapping genes of interest, provided by four partner European Reference Networks, were taken forward for interpretation by clinical experts. This reanalysis has resulted in a molecular diagnosis being provided to 51 families in this sample, with ClinCNV performing the best of the three algorithms. We also identified partially explanatory pathogenic CNVs in a further 34 individuals. This work illustrates the value of reanalysing ES cold cases for CNVs.

Code availability#

All the software tools used in this paper are open-source and freely available online at https://github.com/imgag/ClinCNV (ClinCNV 1.16.6), https://github.com/vplagnol/ExomeDepth (ExomeDepth 1.1.15), https://conifer.sourceforge.net/ (CoNIFER 0.2.2), https://github.com/lgmgeo/AnnotSV (AnnotSV v.3.0.7). Genome-Phenome Analysis Platform used for the metadata collection is available on https://platform.rd-connect.eu/.

Data availability#

All raw and processed data files are deposited at the EGA (Datasets EGAD00001009767, EGAD00001009768, EGAD00001009769, and EGAD00001009770, under Solve-RD study EGAS00001003851) and can be made available upon approval by the Data Access Committee (EGAC00001001319). The family (FAM) and participant (P) identifiers used in this manuscript are pseudonymized and known only to the researchers involved In Solve-RD.