{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,7]],"date-time":"2026-06-07T17:58:50Z","timestamp":1780855130968,"version":"3.54.1"},"reference-count":41,"publisher":"Oxford University Press (OUP)","issue":"6","funder":[{"name":"Intramural Research Program of the National Institutes of Health"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2026,6,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>The accurate and sensitive identification of de novo variants, which are unique to an individual and not found in the parents\u2019 germlines, is critical for understanding the genetic basis of rare diseases, developmental disorders, and evolutionary processes. Existing de novo variant detection pipelines often lack the flexibility to handle multiple variant types, struggle with speed and reproducibility across computational environments, demand extensive manual configuration, or require bioinformatics expertise for downstream curation and analysis, limiting their scalability and usability for large genomic studies. Accordingly, there is a pressing need to better address these challenges.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>We introduce TriosCompass, an open-source Snakemake workflow that addresses these challenges by providing a modular, accelerated, and environmentally-configurable end-to-end solution for comprehensive de novo variant discovery. It integrates state-of-the-art tools into a reproducible framework, empowering researchers to discover novel genetic insights with greater efficiency and reliability.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability<\/jats:title>\n                    <jats:p>TriosCompass is implemented as a Snakemake workflow and is freely available at https:\/\/github.com\/NCI-CGR\/TriosCompass_v2 or on Zenodo (10.5281\/zenodo.17981062).<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Supplementary information<\/jats:title>\n                    <jats:p>Supplementary data is available on GitHub at https:\/\/github.com\/NCI-CGR\/TriosCompass_v2\/tree\/manuscript\/report_dashboards. Supplementary methods on DeepTrio benchmark runs can be viewed at: https:\/\/github.com\/NCI-CGR\/TriosCompass_v2\/blob\/manuscript\/TriosCompass_Supp_Methods_deeptrio_benchmark.md<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btag319","type":"journal-article","created":{"date-parts":[[2026,5,16]],"date-time":"2026-05-16T11:44:36Z","timestamp":1778931876000},"source":"Crossref","is-referenced-by-count":0,"title":["TriosCompass: a snakemake workflow for integrated detection of SNVs, indels, STRs, and structural de novo variants in parent\u2013child trios"],"prefix":"10.1093","volume":"42","author":[{"given":"Wei","family":"Zhu","sequence":"first","affiliation":[{"name":"Division of Cancer Epidemiology and Genetics, National Cancer Institute , 9609 Medical Center Dr , Rockville, MD 20850,","place":["United States"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Meredith","family":"Yeager","sequence":"additional","affiliation":[{"name":"Division of Cancer Epidemiology and Genetics, National Cancer Institute , 9609 Medical Center Dr , Rockville, MD 20850,","place":["United States"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Eric T","family":"Dawson","sequence":"additional","affiliation":[{"name":"NVIDIA Corporation , 2788 San Tomas Expy , Santa Clara, CA 95051,","place":["United States"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Komal","family":"Jain","sequence":"additional","affiliation":[{"name":"Division of Cancer Epidemiology and Genetics, National Cancer Institute , 9609 Medical Center Dr , Rockville, MD 20850,","place":["United States"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Belynda","family":"Hicks","sequence":"additional","affiliation":[{"name":"Division of Cancer Epidemiology and Genetics, National Cancer Institute , 9609 Medical Center Dr , Rockville, MD 20850,","place":["United States"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2234-0631","authenticated-orcid":false,"given":"Michael","family":"Dean","sequence":"additional","affiliation":[{"name":"Division of Cancer Epidemiology and Genetics, National Cancer Institute , 9609 Medical Center Dr , Rockville, MD 20850,","place":["United States"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Stephen J","family":"Chanock","sequence":"additional","affiliation":[{"name":"Division of Cancer Epidemiology and Genetics, National Cancer Institute , 9609 Medical Center Dr , Rockville, MD 20850,","place":["United States"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9850-3797","authenticated-orcid":false,"given":"Wendy S W","family":"Wong","sequence":"additional","affiliation":[{"name":"Division of Cancer Epidemiology and Genetics, National Cancer Institute , 9609 Medical Center Dr , Rockville, MD 20850,","place":["United States"]}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"286","published-online":{"date-parts":[[2026,5,20]]},"reference":[{"key":"2026060713162006600_btag319-B1","doi-asserted-by":"crossref","first-page":"68","DOI":"10.1038\/nature15393","article-title":"A global reference for human genetic variation","volume":"526","author":"1000 Genomes Project Consortium","year":"2015","journal-title":"Nature"},{"key":"2026060713162006600_btag319-B2","doi-asserted-by":"crossref","first-page":"2050","DOI":"10.1101\/gr.222109.117","article-title":"GRIDSS: sensitive and specific genomic rearrangement detection using positional de Bruijn graph assembly","volume":"27","author":"Cameron","year":"2017","journal-title":"Genome Res"},{"key":"2026060713162006600_btag319-B3","doi-asserted-by":"crossref","first-page":"i884","DOI":"10.1093\/bioinformatics\/bty560","article-title":"fastp: an ultra-fast all-in-one FASTQ preprocessor","volume":"34","author":"Chen","year":"2018","journal-title":"Bioinformatics"},{"key":"2026060713162006600_btag319-B4","doi-asserted-by":"crossref","first-page":"1220","DOI":"10.1093\/bioinformatics\/btv710","article-title":"Manta: rapid detection of structural variants and indels for germline and cancer sequencing applications","volume":"32","author":"Chen","year":"2016","journal-title":"Bioinformatics"},{"key":"2026060713162006600_btag319-B5","author":"Clara Parabricks (4.0.1-1)"},{"key":"2026060713162006600_btag319-B6","doi-asserted-by":"crossref","first-page":"444","DOI":"10.1038\/s41586-020-2287-8","article-title":"A structural variation reference for medical and population genetics","volume":"581","author":"Collins","year":"2020","journal-title":"Nature"},{"key":"2026060713162006600_btag319-B7","doi-asserted-by":"crossref","first-page":"712","DOI":"10.1038\/ng.862","article-title":"Variation in genome-wide mutation rates within and between human families","volume":"43","author":"Conrad","year":"2011","journal-title":"Nat Genet"},{"key":"2026060713162006600_btag319-B8","doi-asserted-by":"crossref","first-page":"5402","DOI":"10.1038\/s41467-019-13341-9","article-title":"GraphTyper2 enables population-scale genotyping of structural variation using pangenome graphs","volume":"10","author":"Eggertsson","year":"2019","journal-title":"Nat Commun"},{"key":"2026060713162006600_btag319-B9","doi-asserted-by":"crossref","first-page":"271","DOI":"10.1186\/s13059-022-02840-6","article-title":"Truvari: refined structural variant comparison preserves allelic diversity","volume":"23","author":"English","year":"2022","journal-title":"Genome Biol"},{"key":"2026060713162006600_btag319-B10","doi-asserted-by":"crossref","first-page":"3047","DOI":"10.1093\/bioinformatics\/btw354","article-title":"MultiQC: summarize analysis results for multiple tools and samples in a single report","volume":"32","author":"Ewels","year":"2016","journal-title":"Bioinformatics"},{"key":"2026060713162006600_btag319-B11","doi-asserted-by":"crossref","DOI":"10.1093\/genetics\/iyae013","article-title":"Effects of parental age and polymer composition on short tandem repeat de novo mutation rates","volume":"226","author":"Goldberg","year":"2024","journal-title":"Genetics"},{"key":"2026060713162006600_btag319-B12","doi-asserted-by":"crossref","first-page":"935","DOI":"10.1038\/ng.3597","article-title":"Parent-of-origin-specific signatures of de novo mutations","volume":"48","author":"Goldmann","year":"2016","journal-title":"Nat Genet"},{"key":"2026060713162006600_btag319-B13","doi-asserted-by":"crossref","first-page":"9","DOI":"10.1016\/j.gde.2017.01.012","article-title":"A genomic view of short tandem repeats","volume":"44","author":"Gymrek","year":"2017","journal-title":"Curr Opin Genet Dev"},{"key":"2026060713162006600_btag319-B14","doi-asserted-by":"crossref","first-page":"732","DOI":"10.1038\/s41586-022-04965-x","article-title":"The sequences of 150,119 genomes in the UK biobank","volume":"607","author":"Halldorsson","year":"2022","journal-title":"Nature"},{"key":"2026060713162006600_btag319-B15","doi-asserted-by":"crossref","first-page":"14061","DOI":"10.1038\/ncomms14061","article-title":"Transient structural variations have strong effects on quantitative traits and reproductive isolation in fission yeast","volume":"8","author":"Jeffares","year":"2017","journal-title":"Nat Commun"},{"key":"2026060713162006600_btag319-B16","doi-asserted-by":"crossref","first-page":"1674","DOI":"10.1038\/s41588-018-0259-9","article-title":"Multiple transmissions of de novo mutations in families","volume":"50","author":"J\u00f3nsson","year":"2018","journal-title":"Nat Genet"},{"key":"2026060713162006600_btag319-B17","doi-asserted-by":"crossref","first-page":"519","DOI":"10.1038\/nature24018","article-title":"Parental influence on human germline de novo mutations in 1,548 trios from Iceland","volume":"549","author":"J\u00f3nsson","year":"2017","journal-title":"Nature"},{"key":"2026060713162006600_btag319-B18","doi-asserted-by":"crossref","first-page":"e97","DOI":"10.1093\/nar\/gkac511","article-title":"DeNovoCNN: a deep learning approach to de novo variant calling in next generation sequencing data","volume":"50","author":"Khazeeva","year":"2022","journal-title":"Nucleic Acids Res"},{"key":"2026060713162006600_btag319-B19","author":"Kolesnikov","year":"2021"},{"key":"2026060713162006600_btag319-B20","doi-asserted-by":"crossref","first-page":"471","DOI":"10.1038\/nature11396","article-title":"Rate of de novo mutations and the importance of father\/\u2019s age to disease risk","volume":"488","author":"Kong","year":"2012","journal-title":"Nature"},{"key":"2026060713162006600_btag319-B21","doi-asserted-by":"crossref","first-page":"e1004572","DOI":"10.1371\/journal.pcbi.1004572","article-title":"Wham: identifying structural variants of biological consequence","volume":"11","author":"Kronenberg","year":"2015","journal-title":"PLoS Comput Biol"},{"key":"2026060713162006600_btag319-B22","doi-asserted-by":"crossref","first-page":"R84","DOI":"10.1186\/gb-2014-15-6-r84","article-title":"LUMPY: a probabilistic framework for structural variant discovery","volume":"15","author":"Layer","year":"2014","journal-title":"Genome Biol"},{"key":"2026060713162006600_btag319-B23","author":"Martin","year":"2016"},{"key":"2026060713162006600_btag319-B24","doi-asserted-by":"crossref","first-page":"1297","DOI":"10.1101\/gr.107524.110","article-title":"The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data","volume":"20","author":"McKenna","year":"2010","journal-title":"Genome Res"},{"key":"2026060713162006600_btag319-B25","doi-asserted-by":"crossref","first-page":"246","DOI":"10.1038\/s41586-020-03078-7","article-title":"Patterns of de novo tandem repeat mutations and their role in autism","volume":"589","author":"Mitra","year":"2021","journal-title":"Nature"},{"key":"2026060713162006600_btag319-B26","doi-asserted-by":"crossref","first-page":"33","DOI":"10.12688\/f1000research.29032.2","article-title":"Sustainable data analysis with snakemake","volume":"10","author":"M\u00f6lder","year":"2021","journal-title":"F1000Res"},{"key":"2026060713162006600_btag319-B27","doi-asserted-by":"crossref","first-page":"btad775","DOI":"10.1093\/bioinformatics\/btad775","article-title":"HAT: de novo variant calling for highly accurate short-read and long-read sequencing data","volume":"40","author":"Ng","year":"2024","journal-title":"Bioinformatics"},{"key":"2026060713162006600_btag319-B28","doi-asserted-by":"crossref","first-page":"3066","DOI":"10.1038\/s41588-025-02418-5","article-title":"Passenger mutations link cellular origin and transcriptional identity in human lung adenocarcinomas","volume":"57","author":"Panja","year":"2025","journal-title":"Nat Genet"},{"key":"2026060713162006600_btag319-B29","author":"Pedersen"},{"key":"2026060713162006600_btag319-B30","doi-asserted-by":"crossref","first-page":"60","DOI":"10.1038\/s41525-021-00227-3","article-title":"Effective variant filtering and expected candidate variant yield in studies of rare human disease","volume":"6","author":"Pedersen","year":"2021","journal-title":"NPJ Genom Med"},{"key":"2026060713162006600_btag319-B31","doi-asserted-by":"crossref","first-page":"983","DOI":"10.1038\/nbt.4235","article-title":"A universal SNP and small-indel variant caller using deep neural networks","volume":"36","author":"Poplin","year":"2018","journal-title":"Nat Biotechnol"},{"key":"2026060713162006600_btag319-B32","doi-asserted-by":"crossref","first-page":"427","DOI":"10.1038\/s41586-025-08922-2","article-title":"Human de novo mutation rates from a four-generation pedigree reference","volume":"643","author":"Porubsky","year":"2025","journal-title":"Nature"},{"key":"2026060713162006600_btag319-B33","doi-asserted-by":"crossref","first-page":"i333","DOI":"10.1093\/bioinformatics\/bts378","article-title":"DELLY: structural variant discovery by integrated paired-end and split-read analysis","volume":"28","author":"Rausch","year":"2012","journal-title":"Bioinformatics"},{"key":"2026060713162006600_btag319-B34","doi-asserted-by":"publisher","first-page":"24","DOI":"10.1038\/nbt.1754","article-title":"Integrative genomics viewer","volume":"29","author":"Robinson","year":"2011","journal-title":"Nat Biotechnol"},{"key":"2026060713162006600_btag319-B35","doi-asserted-by":"crossref","first-page":"253","DOI":"10.1186\/s13059-022-02818-4","article-title":"The mutational dynamics of short tandem repeats in large, multigenerational families","volume":"23","author":"Steely","year":"2022","journal-title":"Genome Biol"},{"key":"2026060713162006600_btag319-B36","doi-asserted-by":"crossref","first-page":"2032","DOI":"10.1093\/bioinformatics\/btv098","article-title":"Sambamba: fast processing of NGS alignment formats","volume":"31","author":"Tarasov","year":"2015","journal-title":"Bioinformatics"},{"key":"2026060713162006600_btag319-B37","doi-asserted-by":"crossref","first-page":"1894","DOI":"10.1101\/gr.177774.114","article-title":"The landscape of human STR variation","volume":"24","author":"Willems","year":"2014","journal-title":"Genome Res"},{"key":"2026060713162006600_btag319-B38","doi-asserted-by":"crossref","first-page":"590","DOI":"10.1038\/nmeth.4267","article-title":"Genome-wide profiling of heritable and de novo STR variations","volume":"14","author":"Willems","year":"2017","journal-title":"Nat Methods"},{"key":"2026060713162006600_btag319-B39","doi-asserted-by":"crossref","first-page":"10486","DOI":"10.1038\/ncomms10486","article-title":"New observations on maternal age effect on germline de novo mutations","volume":"7","author":"Wong","year":"2016","journal-title":"Nat Commun"},{"key":"2026060713162006600_btag319-B40","author":"Yun","year":"2020"},{"key":"2026060713162006600_btag319-B41","volume-title":"GigaScience","author":"Zarate","year":"2020"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btag319\/68349698\/btag319.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/42\/6\/btag319\/68349698\/btag319.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/42\/6\/btag319\/68349698\/btag319.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,6,7]],"date-time":"2026-06-07T17:16:33Z","timestamp":1780852593000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btag319\/8688632"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,5,20]]},"references-count":41,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2026,6,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btag319","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2026,6]]},"published":{"date-parts":[[2026,5,20]]},"article-number":"btag319"}}