Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Critical assessment of variant prioritization methods for rare disease diagnosis within the Rare Genomes Project

View ORCID ProfileSarah L. Stenton, Melanie O’Leary, View ORCID ProfileGabrielle Lemire, View ORCID ProfileGrace E. VanNoy, View ORCID ProfileStephanie DiTroia, Vijay S. Ganesh, View ORCID ProfileEmily Groopman, View ORCID ProfileEmily O’Heir, Brian Mangilog, Ikeoluwa Osei-Owusu, View ORCID ProfileLynn S. Pais, View ORCID ProfileJillian Serrano, View ORCID ProfileMoriel Singer-Berk, View ORCID ProfileBen Weisburd, Michael Wilson, Christina Austin-Tse, View ORCID ProfileMarwa Abdelhakim, View ORCID ProfileAzza Althagafi, View ORCID ProfileGiulia Babbi, View ORCID ProfileRiccardo Bellazzi, View ORCID ProfileSamuele Bovo, Maria Giulia Carta, View ORCID ProfileRita Casadio, Pieter-Jan Coenen, View ORCID ProfileFederica De Paoli, View ORCID ProfileMatteo Floris, View ORCID ProfileManavalan Gajapathy, View ORCID ProfileRobert Hoehndorf, View ORCID ProfileJulius O.B. Jacobsen, Thomas Joseph, Akash Kamandula, View ORCID ProfilePanagiotis Katsonis, Cyrielle Kint, View ORCID ProfileOlivier Lichtarge, View ORCID ProfileIvan Limongelli, View ORCID ProfileYulan Lu, View ORCID ProfilePaolo Magni, View ORCID ProfileTarun Karthik Kumar Mamidi, View ORCID ProfilePier Luigi Martelli, Marta Mulargia, View ORCID ProfileGiovanna Nicora, Keith Nykamp, Vikas Pejaver, Yisu Peng, Thi Hong Cam Pham, Maurizio S. Podda, Aditya Rao, View ORCID ProfileEttore Rizzo, Vangala G Saipradeep, View ORCID ProfileCastrense Savojardo, Peter Schols, View ORCID ProfileYang Shen, Naveen Sivadasan, View ORCID ProfileDamian Smedley, Dorian Soru, Rajgopal Srinivasan, View ORCID ProfileYuanfei Sun, Uma Sunderam, Wuwei Tan, Naina Tiwari, View ORCID ProfileXiao Wang, View ORCID ProfileYaqiong Wang, Amanda Williams, View ORCID ProfileElizabeth A. Worthey, Rujie Yin, Yuning You, View ORCID ProfileDaniel Zeiberg, View ORCID ProfileSusanna Zucca, View ORCID ProfileConstantina Bakolitsa, View ORCID ProfileSteven E. Brenner, View ORCID ProfileStephanie M Fullerton, View ORCID ProfilePredrag Radivojac, View ORCID ProfileHeidi L. Rehm, View ORCID ProfileAnne O’Donnell-Luria
doi: https://doi.org/10.1101/2023.08.02.23293212
Sarah L. Stenton
1Division of Genetics and Genomics, Boston Children’s Hospital, Harvard Medical School, Boston, MA, USA
2Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
3Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Sarah L. Stenton
Melanie O’Leary
2Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Gabrielle Lemire
1Division of Genetics and Genomics, Boston Children’s Hospital, Harvard Medical School, Boston, MA, USA
2Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Gabrielle Lemire
Grace E. VanNoy
2Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Grace E. VanNoy
Stephanie DiTroia
1Division of Genetics and Genomics, Boston Children’s Hospital, Harvard Medical School, Boston, MA, USA
2Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Stephanie DiTroia
Vijay S. Ganesh
1Division of Genetics and Genomics, Boston Children’s Hospital, Harvard Medical School, Boston, MA, USA
2Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
4Department of Neurology, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Emily Groopman
1Division of Genetics and Genomics, Boston Children’s Hospital, Harvard Medical School, Boston, MA, USA
2Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Emily Groopman
Emily O’Heir
1Division of Genetics and Genomics, Boston Children’s Hospital, Harvard Medical School, Boston, MA, USA
2Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Emily O’Heir
Brian Mangilog
2Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ikeoluwa Osei-Owusu
2Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Lynn S. Pais
1Division of Genetics and Genomics, Boston Children’s Hospital, Harvard Medical School, Boston, MA, USA
2Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Lynn S. Pais
Jillian Serrano
1Division of Genetics and Genomics, Boston Children’s Hospital, Harvard Medical School, Boston, MA, USA
2Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Jillian Serrano
Moriel Singer-Berk
2Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Moriel Singer-Berk
Ben Weisburd
2Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Ben Weisburd
Michael Wilson
2Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Christina Austin-Tse
2Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
3Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Marwa Abdelhakim
5Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Marwa Abdelhakim
Azza Althagafi
5Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
6Computer, Electrical and Mathematical Sciences & Engineering Division (CEMSE), Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
7Computer Science Department, College of Computers and Information Technology, Taif University, Taif, Saudi Arabia
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Azza Althagafi
Giulia Babbi
8Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Giulia Babbi
Riccardo Bellazzi
9enGenome Srl, Pavia, Italy
10Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Pavia, Italy
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Riccardo Bellazzi
Samuele Bovo
11Department of Agricultural and Food Sciences, University of Bologna, Bologna, Italy
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Samuele Bovo
Maria Giulia Carta
10Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Pavia, Italy
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Rita Casadio
8Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Rita Casadio
Pieter-Jan Coenen
12Invitae, San Francisco, California, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Federica De Paoli
9enGenome Srl, Pavia, Italy
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Federica De Paoli
Matteo Floris
13Department of Biomedical Sciences, University of Sassari, Sassari, Italy
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Matteo Floris
Manavalan Gajapathy
14Center for Computational Genomics and Data Science, The University of Alabama at Birmingham, Birmingham, AL, USA
15Department of Genetics, Heersink School of Medicine, The University of Alabama at Birmingham, Birmingham, AL, USA
16Hugh Kaul Precision Medicine Institute, The University of Alabama at Birmingham, Birmingham, AL, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Manavalan Gajapathy
Robert Hoehndorf
5Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
6Computer, Electrical and Mathematical Sciences & Engineering Division (CEMSE), Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Robert Hoehndorf
Julius O.B. Jacobsen
17William Harvey Research Institute, Barts & The London School of Medicine and Dentistry, Queen Mary University of London, Charterhouse Square, London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Julius O.B. Jacobsen
Thomas Joseph
18TCS Research, Tata Consultancy Services (TCS) Ltd, Deccan Park, Madhapur, Hyderabad, India
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Akash Kamandula
19Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Panagiotis Katsonis
20Department of Molecular & Human Genetics, Baylor College of Medicine, Houston, TX, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Panagiotis Katsonis
Cyrielle Kint
12Invitae, San Francisco, California, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Olivier Lichtarge
20Department of Molecular & Human Genetics, Baylor College of Medicine, Houston, TX, USA
21Structural and Computational Biology & Molecular Biophysics Program, Baylor College of Medicine, Houston, TX, USA
22Computational and Integrative Biomedical Research Center, Baylor College of Medicine, Houston, TX, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Olivier Lichtarge
Ivan Limongelli
9enGenome Srl, Pavia, Italy
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Ivan Limongelli
Yulan Lu
23Center for molecular medicine, Pediatric Research Institute, Children’s Hospital of Fudan University, Shanghai, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Yulan Lu
Paolo Magni
9enGenome Srl, Pavia, Italy
10Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Pavia, Italy
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Paolo Magni
Tarun Karthik Kumar Mamidi
14Center for Computational Genomics and Data Science, The University of Alabama at Birmingham, Birmingham, AL, USA
15Department of Genetics, Heersink School of Medicine, The University of Alabama at Birmingham, Birmingham, AL, USA
16Hugh Kaul Precision Medicine Institute, The University of Alabama at Birmingham, Birmingham, AL, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Tarun Karthik Kumar Mamidi
Pier Luigi Martelli
8Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Pier Luigi Martelli
Marta Mulargia
13Department of Biomedical Sciences, University of Sassari, Sassari, Italy
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Giovanna Nicora
9enGenome Srl, Pavia, Italy
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Giovanna Nicora
Keith Nykamp
12Invitae, San Francisco, California, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Vikas Pejaver
24Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY, USA
25Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yisu Peng
19Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Thi Hong Cam Pham
26Anatomy and Surgical Training Department, University of Medicine and Pharmacy, Hue University, Vietnam
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Maurizio S. Podda
13Department of Biomedical Sciences, University of Sassari, Sassari, Italy
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Aditya Rao
18TCS Research, Tata Consultancy Services (TCS) Ltd, Deccan Park, Madhapur, Hyderabad, India
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ettore Rizzo
9enGenome Srl, Pavia, Italy
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Ettore Rizzo
Vangala G Saipradeep
18TCS Research, Tata Consultancy Services (TCS) Ltd, Deccan Park, Madhapur, Hyderabad, India
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Castrense Savojardo
8Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Castrense Savojardo
Peter Schols
12Invitae, San Francisco, California, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yang Shen
27Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX, USA
28Department of Computer Science and Engineering, Texas A&M University, College Station, TX, USA
29Institute of Biosciences and Technology and Department of Translational Medical Sciences, College of Medicine, Texas A&M University, Houston, Texas, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Yang Shen
Naveen Sivadasan
18TCS Research, Tata Consultancy Services (TCS) Ltd, Deccan Park, Madhapur, Hyderabad, India
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Damian Smedley
17William Harvey Research Institute, Barts & The London School of Medicine and Dentistry, Queen Mary University of London, Charterhouse Square, London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Damian Smedley
Dorian Soru
30Independent consultant
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Rajgopal Srinivasan
18TCS Research, Tata Consultancy Services (TCS) Ltd, Deccan Park, Madhapur, Hyderabad, India
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yuanfei Sun
27Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Yuanfei Sun
Uma Sunderam
18TCS Research, Tata Consultancy Services (TCS) Ltd, Deccan Park, Madhapur, Hyderabad, India
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Wuwei Tan
27Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Naina Tiwari
18TCS Research, Tata Consultancy Services (TCS) Ltd, Deccan Park, Madhapur, Hyderabad, India
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Xiao Wang
23Center for molecular medicine, Pediatric Research Institute, Children’s Hospital of Fudan University, Shanghai, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Xiao Wang
Yaqiong Wang
23Center for molecular medicine, Pediatric Research Institute, Children’s Hospital of Fudan University, Shanghai, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Yaqiong Wang
Amanda Williams
20Department of Molecular & Human Genetics, Baylor College of Medicine, Houston, TX, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Elizabeth A. Worthey
14Center for Computational Genomics and Data Science, The University of Alabama at Birmingham, Birmingham, AL, USA
15Department of Genetics, Heersink School of Medicine, The University of Alabama at Birmingham, Birmingham, AL, USA
16Hugh Kaul Precision Medicine Institute, The University of Alabama at Birmingham, Birmingham, AL, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Elizabeth A. Worthey
Rujie Yin
27Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yuning You
27Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Daniel Zeiberg
19Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Daniel Zeiberg
Susanna Zucca
9enGenome Srl, Pavia, Italy
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Susanna Zucca
Constantina Bakolitsa
31Department of Plant and Microbial Biology and Center for Computational Biology, University of California, Berkeley, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Constantina Bakolitsa
Steven E. Brenner
31Department of Plant and Microbial Biology and Center for Computational Biology, University of California, Berkeley, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Steven E. Brenner
Stephanie M Fullerton
32Department of Bioethics & Humanities, University of Washington School of Medicine, Seattle, WA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Stephanie M Fullerton
Predrag Radivojac
19Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Predrag Radivojac
Heidi L. Rehm
2Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
3Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Heidi L. Rehm
Anne O’Donnell-Luria
1Division of Genetics and Genomics, Boston Children’s Hospital, Harvard Medical School, Boston, MA, USA
2Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
3Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Anne O’Donnell-Luria
  • For correspondence: odonnell{at}broadinstitute.org
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

ABSTRACT

Background A major obstacle faced by rare disease families is obtaining a genetic diagnosis. The average “diagnostic odyssey” lasts over five years, and causal variants are identified in under 50%. The Rare Genomes Project (RGP) is a direct-to-participant research study on the utility of genome sequencing (GS) for diagnosis and gene discovery. Families are consented for sharing of sequence and phenotype data with researchers, allowing development of a Critical Assessment of Genome Interpretation (CAGI) community challenge, placing variant prioritization models head-to-head in a real-life clinical diagnostic setting.

Methods Predictors were provided a dataset of phenotype terms and variant calls from GS of 175 RGP individuals (65 families), including 35 solved training set families, with causal variants specified, and 30 test set families (14 solved, 16 unsolved). The challenge tasked teams with identifying the causal variants in as many test set families as possible. Ranked variant predictions were submitted with estimated probability of causal relationship (EPCR) values. Model performance was determined by two metrics, a weighted score based on rank position of true positive causal variants and maximum F-measure, based on precision and recall of causal variants across EPCR thresholds.

Results Sixteen teams submitted predictions from 52 models, some with manual review incorporated. Top performing teams recalled the causal variants in up to 13 of 14 solved families by prioritizing high quality variant calls that were rare, predicted deleterious, segregating correctly, and consistent with reported phenotype. In unsolved families, newly discovered diagnostic variants were returned to two families following confirmatory RNA sequencing, and two prioritized novel disease gene candidates were entered into Matchmaker Exchange. In one example, RNA sequencing demonstrated aberrant splicing due to a deep intronic indel in ASNS, identified in trans with a frameshift variant, in an unsolved proband with phenotype overlap with asparagine synthetase deficiency.

Conclusions By objective assessment of variant predictions, we provide insights into current state-of-the-art algorithms and platforms for genome sequencing analysis for rare disease diagnosis and explore areas for future optimization. Identification of diagnostic variants in unsolved families promotes synergy between researchers with clinical and computational expertise as a means of advancing the field of clinical genome interpretation.

Competing Interest Statement

Authors S.Z., I.L., E.R., P.M., and R.B., own shares of enGenome srl. Authors F.D.P. and G.N. are employees of enGenome srl. Authors T.J., R.S., S.G.V., N.S., A.R., U.S., N.T., are employees of TCS Ltd. Authors P.J.C., C.K., K.N., and P.S. are employees of Invitae Ltd. H.L.R. receives support from Illumina and Microsoft for rare disease gene discovery and diagnosis. A.O’D-L. is a member of the scientific advisory board for Congenica Inc and the Simons Foundation SPARK for Autism study and co-chairs the clinical advisory board for CAGI. S.E.B receives support at UC Berkeley from a research agreement from TCS. All other authors report no competing interests.

Funding Statement

S.L.S. is supported by a fellowship from the Manton Center for Orphan Disease Research at Boston Children’s Hospital. G.L. was supported by Fonds de recherche en sante du Quebec. V.S.G. was supported by the Mass General Brigham Training Program in Precision and Genomic Medicine (NHGRI T32 HG10464). Data and diagnoses were provided by Broad Institute of MIT and Harvard Center for Mendelian Genomics with funding to H.L.R. and A.O’D-L., by the National Human Genome Research Institute (NHGRI) grants UM1HG008900, U01HG011755, and R01HG009141 and by the Chan Zuckerberg Initiative through an advised fund of the Silicon Valley Community Foundation grant 2020-224274. This study was also supported by the NHGRI CAGI grant U24 HG007346 (to S.E.B. and P.R.), National Institute of Child Health and Human Development grant 1R01HD103805‐01, and National Institute of General Medical Sciences R35GM124952, along with funding from King Abdullah University of Science and Technology (KAUST) Office of Sponsored Research (OSR) grants URF/1/4355-01-01, URF/1/4675-01-01, FCC/1/1976-34-01.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

The Rare Genomes Project study is approved by the Mass General Brigham Institutional Review Board (IRB) protocol 2016P001422. Written informed consent for the publication of clinical details was obtained from the participants or legal guardians.

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

  • LIST OF ABBREVIATIONS

    ACMG/AMP
    American College of Medical Genetics and Genomics and the Association for Molecular Pathology
    AD
    autosomal dominant
    AFR
    African/African American
    AMR
    Admixed American
    AR
    autosomal recessive
    ASJ
    Ashkenazi Jewish
    CAGI
    Critical Assessment of Genome Interpretation
    CSF
    cerebrospinal fluid
    DM
    disease mutation
    EPCR
    estimated probability of causal relationship
    F-max
    maximum F-measure
    HPO
    Human Phenotype Ontology
    IGV
    Integrative Genome Viewer
    indel
    small insertion/deletion
    LP
    likely pathogenic
    NFE
    Non-Finnish European
    P
    pathogenic
    PHS
    Pitt-Hopkins syndrome
    RGP
    Rare Genomes Project
    SAS
    South Asian
    SE
    standard error
    SNV
    single nucleotide variant
    SV
    structural variant
    VCF
    variant call file
    VEP
    Variant Effect Predictor
    VUS
    variant of uncertain significance
    XLR
    X-linked recessive
  • Copyright 
    The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
    Back to top
    PreviousNext
    Posted August 04, 2023.
    Download PDF

    Supplementary Material

    Email

    Thank you for your interest in spreading the word about medRxiv.

    NOTE: Your email address is requested solely to identify you as the sender of this article.

    Enter multiple addresses on separate lines or separate them with commas.
    Critical assessment of variant prioritization methods for rare disease diagnosis within the Rare Genomes Project
    (Your Name) has forwarded a page to you from medRxiv
    (Your Name) thought you would like to see this page from the medRxiv website.
    CAPTCHA
    This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
    Share
    Critical assessment of variant prioritization methods for rare disease diagnosis within the Rare Genomes Project
    Sarah L. Stenton, Melanie O’Leary, Gabrielle Lemire, Grace E. VanNoy, Stephanie DiTroia, Vijay S. Ganesh, Emily Groopman, Emily O’Heir, Brian Mangilog, Ikeoluwa Osei-Owusu, Lynn S. Pais, Jillian Serrano, Moriel Singer-Berk, Ben Weisburd, Michael Wilson, Christina Austin-Tse, Marwa Abdelhakim, Azza Althagafi, Giulia Babbi, Riccardo Bellazzi, Samuele Bovo, Maria Giulia Carta, Rita Casadio, Pieter-Jan Coenen, Federica De Paoli, Matteo Floris, Manavalan Gajapathy, Robert Hoehndorf, Julius O.B. Jacobsen, Thomas Joseph, Akash Kamandula, Panagiotis Katsonis, Cyrielle Kint, Olivier Lichtarge, Ivan Limongelli, Yulan Lu, Paolo Magni, Tarun Karthik Kumar Mamidi, Pier Luigi Martelli, Marta Mulargia, Giovanna Nicora, Keith Nykamp, Vikas Pejaver, Yisu Peng, Thi Hong Cam Pham, Maurizio S. Podda, Aditya Rao, Ettore Rizzo, Vangala G Saipradeep, Castrense Savojardo, Peter Schols, Yang Shen, Naveen Sivadasan, Damian Smedley, Dorian Soru, Rajgopal Srinivasan, Yuanfei Sun, Uma Sunderam, Wuwei Tan, Naina Tiwari, Xiao Wang, Yaqiong Wang, Amanda Williams, Elizabeth A. Worthey, Rujie Yin, Yuning You, Daniel Zeiberg, Susanna Zucca, Constantina Bakolitsa, Steven E. Brenner, Stephanie M Fullerton, Predrag Radivojac, Heidi L. Rehm, Anne O’Donnell-Luria
    medRxiv 2023.08.02.23293212; doi: https://doi.org/10.1101/2023.08.02.23293212
    Twitter logo Facebook logo LinkedIn logo Mendeley logo
    Citation Tools
    Critical assessment of variant prioritization methods for rare disease diagnosis within the Rare Genomes Project
    Sarah L. Stenton, Melanie O’Leary, Gabrielle Lemire, Grace E. VanNoy, Stephanie DiTroia, Vijay S. Ganesh, Emily Groopman, Emily O’Heir, Brian Mangilog, Ikeoluwa Osei-Owusu, Lynn S. Pais, Jillian Serrano, Moriel Singer-Berk, Ben Weisburd, Michael Wilson, Christina Austin-Tse, Marwa Abdelhakim, Azza Althagafi, Giulia Babbi, Riccardo Bellazzi, Samuele Bovo, Maria Giulia Carta, Rita Casadio, Pieter-Jan Coenen, Federica De Paoli, Matteo Floris, Manavalan Gajapathy, Robert Hoehndorf, Julius O.B. Jacobsen, Thomas Joseph, Akash Kamandula, Panagiotis Katsonis, Cyrielle Kint, Olivier Lichtarge, Ivan Limongelli, Yulan Lu, Paolo Magni, Tarun Karthik Kumar Mamidi, Pier Luigi Martelli, Marta Mulargia, Giovanna Nicora, Keith Nykamp, Vikas Pejaver, Yisu Peng, Thi Hong Cam Pham, Maurizio S. Podda, Aditya Rao, Ettore Rizzo, Vangala G Saipradeep, Castrense Savojardo, Peter Schols, Yang Shen, Naveen Sivadasan, Damian Smedley, Dorian Soru, Rajgopal Srinivasan, Yuanfei Sun, Uma Sunderam, Wuwei Tan, Naina Tiwari, Xiao Wang, Yaqiong Wang, Amanda Williams, Elizabeth A. Worthey, Rujie Yin, Yuning You, Daniel Zeiberg, Susanna Zucca, Constantina Bakolitsa, Steven E. Brenner, Stephanie M Fullerton, Predrag Radivojac, Heidi L. Rehm, Anne O’Donnell-Luria
    medRxiv 2023.08.02.23293212; doi: https://doi.org/10.1101/2023.08.02.23293212

    Citation Manager Formats

    • BibTeX
    • Bookends
    • EasyBib
    • EndNote (tagged)
    • EndNote 8 (xml)
    • Medlars
    • Mendeley
    • Papers
    • RefWorks Tagged
    • Ref Manager
    • RIS
    • Zotero
    • Tweet Widget
    • Facebook Like
    • Google Plus One

    Subject Area

    • Genetic and Genomic Medicine
    Subject Areas
    All Articles
    • Addiction Medicine (509)
    • Allergy and Immunology (826)
    • Anesthesia (274)
    • Cardiovascular Medicine (4040)
    • Dentistry and Oral Medicine (407)
    • Dermatology (349)
    • Emergency Medicine (566)
    • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (1396)
    • Epidemiology (14548)
    • Forensic Medicine (28)
    • Gastroenterology (1033)
    • Genetic and Genomic Medicine (6082)
    • Geriatric Medicine (605)
    • Health Economics (920)
    • Health Informatics (4043)
    • Health Policy (1293)
    • Health Systems and Quality Improvement (1456)
    • Hematology (504)
    • HIV/AIDS (1166)
    • Infectious Diseases (except HIV/AIDS) (15477)
    • Intensive Care and Critical Care Medicine (1032)
    • Medical Education (564)
    • Medical Ethics (140)
    • Nephrology (618)
    • Neurology (5989)
    • Nursing (318)
    • Nutrition (905)
    • Obstetrics and Gynecology (1052)
    • Occupational and Environmental Health (909)
    • Oncology (3057)
    • Ophthalmology (875)
    • Orthopedics (330)
    • Otolaryngology (398)
    • Pain Medicine (393)
    • Palliative Medicine (116)
    • Pathology (618)
    • Pediatrics (1585)
    • Pharmacology and Therapeutics (643)
    • Primary Care Research (663)
    • Psychiatry and Clinical Psychology (4997)
    • Public and Global Health (8637)
    • Radiology and Imaging (2016)
    • Rehabilitation Medicine and Physical Therapy (1269)
    • Respiratory Medicine (1134)
    • Rheumatology (548)
    • Sexual and Reproductive Health (644)
    • Sports Medicine (488)
    • Surgery (660)
    • Toxicology (87)
    • Transplantation (270)
    • Urology (239)