{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,10]],"date-time":"2026-02-10T20:15:25Z","timestamp":1770754525082,"version":"3.50.0"},"reference-count":35,"publisher":"Oxford University Press (OUP)","issue":"11","license":[{"start":{"date-parts":[[2025,10,31]],"date-time":"2025-10-31T00:00:00Z","timestamp":1761868800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Broad Institute Schmidt Fellowship and the National Human Genome Research Institute of the National Institutes of Health","award":["R01HG012467"],"award-info":[{"award-number":["R01HG012467"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025,11,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Summary<\/jats:title>\n                    <jats:p>Structural variants (SVs) are key drivers of genetic variation and disease in the genome. Their discovery remains challenging, however, in large part due to the scarcity of validated SV callsets and comprehensive benchmarks, which are essential for method development and evaluation. The growing number of data-driven learning-based approaches for SV discovery, in particular, requires large, diverse, and well-balanced training datasets to achieve reliable performance. To address this need, SV simulation has served as a key tool for assessing method performance and training SV models. However, existing SV simulators only support a fixed and limited set of SV classes and do not provide fine-grained control over the placement of SVs within specific contexts of the genome. Here we present insilicoSV, a versatile framework for SV simulation, which models SVs using a simple and flexible grammar, allowing users to easily define standard and custom arbitrary genome rearrangements, as well as encode genome placement constraints. This design allows insilicoSV to naturally support new and bespoke SV types, such as the complex rearrangements of cancer genomes. In addition to grammar-based modeling, insilicoSV provides built-in support for 26 predefined SV types, placement of user-provided SVs, small variant simulation, streamlined workflows for the simulation of genome evolution and genome mixtures, read simulation, alignment, and visualization. These features enable the creation of comprehensive genomic datasets for a variety of downstream applications, such as in-depth benchmarking of alignment and variant calling methods, as well as training of data-driven learning-based approaches for SV detection.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>insilicoSV is available under the MIT license at https:\/\/github.com\/PopicLab\/insilicoSV and https:\/\/doi.org\/10.5281\/zenodo.17402009.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaf594","type":"journal-article","created":{"date-parts":[[2025,10,28]],"date-time":"2025-10-28T12:51:11Z","timestamp":1761655871000},"source":"Crossref","is-referenced-by-count":1,"title":["insilicoSV: a flexible grammar-based framework for structural variant simulation and placement"],"prefix":"10.1093","volume":"41","author":[{"given":"Enzo","family":"Battistella","sequence":"first","affiliation":[{"name":"Broad Clinical Labs , Cambridge, MA 02142,","place":["United States"]},{"name":"Broad Institute of MIT and Harvard , Cambridge, MA 02142,","place":["United States"]}]},{"given":"Nick","family":"Jiang","sequence":"additional","affiliation":[{"name":"Broad Institute of MIT and Harvard , Cambridge, MA 02142,","place":["United States"]},{"name":"University of California, Berkeley , Berkeley, CA 94720,","place":["United States"]}]},{"given":"Chris","family":"Rohlicek","sequence":"additional","affiliation":[{"name":"Broad Institute of MIT and Harvard , Cambridge, MA 02142,","place":["United States"]}]},{"given":"Ilya","family":"Shlyakhter","sequence":"additional","affiliation":[{"name":"Broad Institute of MIT and Harvard , Cambridge, MA 02142,","place":["United States"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3181-5432","authenticated-orcid":false,"given":"Victoria","family":"Popic","sequence":"additional","affiliation":[{"name":"Broad Clinical Labs , Cambridge, MA 02142,","place":["United States"]},{"name":"Broad Institute of MIT and Harvard , Cambridge, MA 02142,","place":["United States"]}]}],"member":"286","published-online":{"date-parts":[[2025,10,31]]},"reference":[{"key":"2025111103172893800_btaf594-B1","doi-asserted-by":"crossref","first-page":"3240","DOI":"10.1038\/s41467-019-11146-4","article-title":"Comprehensive evaluation and characterisation of short read general-purpose structural variant calling software","volume":"10","author":"Cameron","year":"2019","journal-title":"Nat Commun"},{"key":"2025111103172893800_btaf594-B2","article-title":"Dwgsim: whole genome simulator for next-generation sequencing","author":"Homer","year":"2010"},{"key":"2025111103172893800_btaf594-B3","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btaf332","volume-title":"Bioinformatics","author":"Porubsky","year":"2025"},{"key":"2025111103172893800_btaf594-B4","doi-asserted-by":"crossref","first-page":"1143","DOI":"10.1038\/s41592-023-01932-w","article-title":"A survey of algorithms for the detection of genomic structural variants from long-read sequencing data","volume":"20","author":"Ahsan","year":"2023","journal-title":"Nat Methods"},{"key":"2025111103172893800_btaf594-B5","doi-asserted-by":"crossref","first-page":"1679","DOI":"10.1093\/bioinformatics\/btt198","article-title":"RSVSim: an R\/bioconductor package for the simulation of structural variations","volume":"29","author":"Bartenhagen","year":"2013","journal-title":"Bioinformatics"},{"key":"2025111103172893800_btaf594-B6","doi-asserted-by":"crossref","first-page":"161","DOI":"10.1186\/s13059-021-02380-5","article-title":"Samplot: a platform for structural variant visual validation and automated filtering","volume":"22","author":"Belyeu","year":"2021","journal-title":"Genome Biol"},{"key":"2025111103172893800_btaf594-B7","doi-asserted-by":"crossref","first-page":"779","DOI":"10.1038\/s41588-021-00865-4","article-title":"Long-read sequencing of 3,622 icelanders provides insight into the role of structural variants in human diseases and other traits","volume":"53","author":"Beyter","year":"2021","journal-title":"Nat Genet"},{"key":"2025111103172893800_btaf594-B8","doi-asserted-by":"crossref","first-page":"1267","DOI":"10.1093\/bioinformatics\/btz719","article-title":"VISOR: a versatile haplotype-aware structural variant simulator for short- and long-read sequencing","volume":"36","author":"Bolognini","year":"2020","journal-title":"Bioinformatics"},{"key":"2025111103172893800_btaf594-B9","doi-asserted-by":"crossref","first-page":"1220","DOI":"10.1093\/bioinformatics\/btv710","article-title":"Manta: rapid detection of structural variants and indels for germline and cancer sequencing applications","volume":"32","author":"Chen","year":"2016","journal-title":"Bioinformatics"},{"key":"2025111103172893800_btaf594-B10","doi-asserted-by":"crossref","first-page":"444","DOI":"10.1038\/s41586-020-2287-8","article-title":"A structural variation reference for medical and population genetics","volume":"581","author":"Collins","year":"2020","journal-title":"Nature"},{"key":"2025111103172893800_btaf594-B11","doi-asserted-by":"crossref","first-page":"342","DOI":"10.1186\/s13059-021-02551-4","article-title":"A benchmark of structural variation detection by long reads through a realistic simulated model","volume":"22","author":"Dierckxsens","year":"2021","journal-title":"Genome Biol"},{"key":"2025111103172893800_btaf594-B12","doi-asserted-by":"crossref","first-page":"9029","DOI":"10.1038\/s41467-024-53260-y","article-title":"The giab genomic stratifications resource for human reference genomes","volume":"15","author":"Dwarshuis","year":"2024","journal-title":"Nat Commun"},{"key":"2025111103172893800_btaf594-B13","doi-asserted-by":"crossref","first-page":"271","DOI":"10.1186\/s13059-022-02840-6","article-title":"Truvari: refined structural variant comparison preserves allelic diversity","volume":"23","author":"English","year":"2022","journal-title":"Genome Biol"},{"key":"2025111103172893800_btaf594-B14","doi-asserted-by":"crossref","first-page":"315","DOI":"10.1016\/S0092-8674(02)00828-0","article-title":"Genomic deletions created upon LINE-1 retrotransposition","volume":"110","author":"Gilbert","year":"2002","journal-title":"Cell"},{"key":"2025111103172893800_btaf594-B15","doi-asserted-by":"crossref","first-page":"4061","DOI":"10.1093\/hmg\/ddv146","article-title":"Alu-mediated diverse and complex pathogenic copy-number variants within human chromosome 17 at p13.3","volume":"24","author":"Gu","year":"2015","journal-title":"Hum Mol Genet"},{"key":"2025111103172893800_btaf594-B16","doi-asserted-by":"crossref","first-page":"2907","DOI":"10.1093\/bioinformatics\/btz041","article-title":"Svim: structural variant identification using mapped long reads","volume":"35","author":"Heller","year":"2019","journal-title":"Bioinformatics"},{"key":"2025111103172893800_btaf594-B17","doi-asserted-by":"crossref","first-page":"14061","DOI":"10.1038\/ncomms14061","article-title":"Transient structural variations have strong effects on quantitative traits and reproductive isolation in fission yeast","volume":"8","author":"Jeffares","year":"2017","journal-title":"Nat Commun"},{"key":"2025111103172893800_btaf594-B18","doi-asserted-by":"crossref","first-page":"3094","DOI":"10.1093\/bioinformatics\/bty191","article-title":"Minimap2: pairwise alignment for nucleotide sequences","volume":"34","author":"Li","year":"2018","journal-title":"Bioinformatics"},{"key":"2025111103172893800_btaf594-B19","doi-asserted-by":"crossref","first-page":"112","DOI":"10.1038\/s41586-019-1913-9","article-title":"Patterns of somatic structural variation in human cancer genomes","volume":"578","author":"Li","year":"2020","journal-title":"Nature"},{"key":"2025111103172893800_btaf594-B20","doi-asserted-by":"crossref","first-page":"1230","DOI":"10.1038\/s41592-022-01609-w","article-title":"Svision: a deep learning approach to resolve complex structural variants","volume":"19","author":"Lin","year":"2022","journal-title":"Nat Methods"},{"key":"2025111103172893800_btaf594-B21","doi-asserted-by":"crossref","first-page":"426","DOI":"10.3389\/fgene.2019.00426","article-title":"Long-read sequencing emerging in medical genetics","volume":"10","author":"Mantere","year":"2019","journal-title":"Front Genet"},{"key":"2025111103172893800_btaf594-B22","doi-asserted-by":"crossref","first-page":"159","DOI":"10.1038\/gim.2017.86","article-title":"Long-read genome sequencing identifies causal structural variation in a mendelian disease","volume":"20","author":"Merker","year":"2018","journal-title":"Genet Med"},{"key":"2025111103172893800_btaf594-B23","doi-asserted-by":"crossref","first-page":"183","DOI":"10.1016\/j.tig.2007.02.006","article-title":"Which transposable elements are active in the human genome?","volume":"23","author":"Mills","year":"2007","journal-title":"Trends Genet"},{"key":"2025111103172893800_btaf594-B24","doi-asserted-by":"crossref","first-page":"1469","DOI":"10.1093\/bioinformatics\/btu828","article-title":"VarSim: a high-fidelity simulation and validation framework for high-throughput genome sequencing with cancer applications","volume":"31","author":"Mu","year":"2015","journal-title":"Bioinformatics"},{"key":"2025111103172893800_btaf594-B25","doi-asserted-by":"crossref","first-page":"36","DOI":"10.1089\/152791600319259","article-title":"Repeatmasker","volume":"1","author":"Nishimura","year":"2000","journal-title":"Biotech Softw Internet Rep"},{"key":"2025111103172893800_btaf594-B26","doi-asserted-by":"crossref","first-page":"2493","DOI":"10.1093\/nar\/gkac128","article-title":"Characterizing mobile element insertions in 5675 genomes","volume":"50","author":"Niu","year":"2022","journal-title":"Nucleic Acids Res"},{"key":"2025111103172893800_btaf594-B27","doi-asserted-by":"crossref","first-page":"lqac092","DOI":"10.1093\/nargab\/lqac092","article-title":"Pbsim3: a simulator for all types of pacbio and ont long reads","volume":"4","author":"Ono","year":"2022","journal-title":"NAR Genom Bioinform"},{"key":"2025111103172893800_btaf594-B28","doi-asserted-by":"crossref","first-page":"559","DOI":"10.1038\/s41592-023-01799-x","article-title":"Cue: a deep-learning framework for structural variant discovery and genotyping","volume":"20","author":"Popic","year":"2023","journal-title":"Nat Methods"},{"key":"2025111103172893800_btaf594-B29","doi-asserted-by":"crossref","first-page":"i333","DOI":"10.1093\/bioinformatics\/bts378","article-title":"Delly: structural variant discovery by integrated paired-end and split-read analysis","volume":"28","author":"Rausch","year":"2012","journal-title":"Bioinformatics"},{"key":"2025111103172893800_btaf594-B30","doi-asserted-by":"crossref","first-page":"95","DOI":"10.1186\/s13073-018-0606-6","article-title":"Complex structural variants in mendelian disorders: identification and breakpoint resolution using short-and long-read genome sequencing","volume":"10","author":"Sanchis-Juan","year":"2018","journal-title":"Genome Med"},{"key":"2025111103172893800_btaf594-B31","doi-asserted-by":"crossref","first-page":"1571","DOI":"10.1038\/s41587-023-02024-y","article-title":"Detection of mosaic and population-level structural variants with sniffles2","volume":"42","author":"Smolka","year":"2024","journal-title":"Nat Biotechnol"},{"key":"2025111103172893800_btaf594-B32","doi-asserted-by":"crossref","first-page":"178","DOI":"10.1093\/bib\/bbs017","article-title":"Integrative genomics viewer (igv): high-performance genomics data visualization and exploration","volume":"14","author":"Thorvaldsd\u00f3ttir","year":"2013","journal-title":"Brief Bioinform"},{"key":"2025111103172893800_btaf594-B33","doi-asserted-by":"crossref","first-page":"125","DOI":"10.1038\/nrg3373","article-title":"Phenotypic impact of genomic structural variation: insights from and for human disease","volume":"14","author":"Weischenfeldt","year":"2013","journal-title":"Nat Rev Genet"},{"key":"2025111103172893800_btaf594-B34","doi-asserted-by":"crossref","first-page":"giy081","DOI":"10.1093\/gigascience\/giy081","article-title":"SVEngine: an efficient and versatile simulator of genome structural variations with features of cancer clonal evolution","volume":"7","author":"Xia","year":"2018","journal-title":"Gigascience"},{"key":"2025111103172893800_btaf594-B35","doi-asserted-by":"crossref","first-page":"1347","DOI":"10.1038\/s41587-020-0538-8","article-title":"A robust benchmark for detection of germline large deletions and insertions","volume":"38","author":"Zook","year":"2020","journal-title":"Nat Biotechnol"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btaf594\/65050239\/btaf594.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/11\/btaf594\/65050239\/btaf594.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/11\/btaf594\/65050239\/btaf594.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,11,11]],"date-time":"2025-11-11T08:17:35Z","timestamp":1762849055000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btaf594\/8307527"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,10,31]]},"references-count":35,"journal-issue":{"issue":"11","published-print":{"date-parts":[[2025,11,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaf594","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2025,11]]},"published":{"date-parts":[[2025,10,31]]},"article-number":"btaf594"}}