{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,31]],"date-time":"2026-01-31T07:12:29Z","timestamp":1769843549100,"version":"3.49.0"},"reference-count":44,"publisher":"Oxford University Press (OUP)","issue":"4","license":[{"start":{"date-parts":[[2024,7,9]],"date-time":"2024-07-09T00:00:00Z","timestamp":1720483200000},"content-version":"vor","delay-in-days":47,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100012166","name":"National Key R&D Program of China","doi-asserted-by":"publisher","award":["2022YFF1202101"],"award-info":[{"award-number":["2022YFF1202101"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62225109"],"award-info":[{"award-number":["62225109"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62172129"],"award-info":[{"award-number":["62172129"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024,5,23]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Structural variation (SV) is an important form of genomic variation that influences gene function and expression by altering the structure of the genome. Although long-read data have been proven to better characterize SVs, SVs detected from noisy long-read data still include a considerable portion of false-positive calls. To accurately detect SVs in long-read data, we present SVDF, a method that employs a learning-based noise filtering strategy and an SV signature-adaptive clustering algorithm, for effectively reducing the likelihood of false-positive events. Benchmarking results from multiple orthogonal experiments demonstrate that, across different sequencing platforms and depths, SVDF achieves higher calling accuracy for each sample compared to several existing general SV calling tools. We believe that, with its meticulous and sensitive SV detection capability, SVDF can bring new opportunities and advancements to cutting-edge genomic research.<\/jats:p>","DOI":"10.1093\/bib\/bbae336","type":"journal-article","created":{"date-parts":[[2024,7,9]],"date-time":"2024-07-09T13:59:43Z","timestamp":1720533583000},"source":"Crossref","is-referenced-by-count":4,"title":["SVDF: enhancing structural variation detect from long-read sequencing via automatic filtering strategies"],"prefix":"10.1093","volume":"25","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-9505-4049","authenticated-orcid":false,"given":"Heng","family":"Hu","sequence":"first","affiliation":[{"name":"College of Life Sciences, Northeast Forestry University , Harbin 150000 , China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0009-0870-6693","authenticated-orcid":false,"given":"Runtian","family":"Gao","sequence":"additional","affiliation":[{"name":"College of Life Sciences, Northeast Forestry University , Harbin 150000 , China"}]},{"given":"Wentao","family":"Gao","sequence":"additional","affiliation":[{"name":"College of Life Sciences, Northeast Forestry University , Harbin 150000 , China"}]},{"given":"Bo","family":"Gao","sequence":"additional","affiliation":[{"name":"Department of Radiology, The Second Affiliated Hospital of Harbin Medical University , Harbin 150000 , China"}]},{"given":"Zhongjun","family":"Jiang","sequence":"additional","affiliation":[{"name":"College of Life Sciences, Northeast Forestry University , Harbin 150000 , China"}]},{"given":"Murong","family":"Zhou","sequence":"additional","affiliation":[{"name":"College of Life Sciences, Northeast Forestry University , Harbin 150000 , China"}]},{"given":"Guohua","family":"Wang","sequence":"additional","affiliation":[{"name":"College of Computer and Control Engineering, Northeast Forestry University , Harbin 150000 , China"},{"name":"State Key Laboratory of Tree Genetics and Breeding , Harbin 150000 , China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0673-8503","authenticated-orcid":false,"given":"Tao","family":"Jiang","sequence":"additional","affiliation":[{"name":"School of Computer Science and Technology, Harbin Institute of Technology , Harbin 150000 , China"}]}],"member":"286","published-online":{"date-parts":[[2024,7,9]]},"reference":[{"key":"2024070913582095900_ref1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13059-019-1828-7","article-title":"Structural variant calling: the long and the short of it","volume":"20","author":"Mahmoud","year":"2019","journal-title":"Genome Biol"},{"key":"2024070913582095900_ref2","doi-asserted-by":"crossref","first-page":"24","DOI":"10.1038\/ng1718","article-title":"APP locus duplication causes autosomal dominant early-onset Alzheimer disease with cerebral amyloid angiopathy","volume":"38","author":"Rovelet-Lecrux","year":"2006","journal-title":"Nat Genet"},{"key":"2024070913582095900_ref3","doi-asserted-by":"crossref","first-page":"928","DOI":"10.1038\/35057149","article-title":"A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms","volume":"409","author":"The International SNP Map Working Group","year":"2001","journal-title":"Nature"},{"key":"2024070913582095900_ref4","doi-asserted-by":"crossref","first-page":"e58048","DOI":"10.1371\/journal.pone.0058048","article-title":"Rare genomic structural variants in complex disease: lessons from the replication of associations with obesity","volume":"8","author":"Walters","year":"2013","journal-title":"PloS One"},{"key":"2024070913582095900_ref5","doi-asserted-by":"crossref","first-page":"112","DOI":"10.1038\/s41586-019-1913-9","article-title":"Patterns of somatic structural variation in human cancer genomes","volume":"578","author":"Li","year":"2020","journal-title":"Nature"},{"key":"2024070913582095900_ref6","doi-asserted-by":"crossref","first-page":"145","DOI":"10.1016\/j.cell.2020.05.021","article-title":"Major impacts of widespread structural variation on gene expression and crop improvement in tomato","volume":"182","author":"Alonge","year":"2020","journal-title":"Cell"},{"key":"2024070913582095900_ref7","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1038\/s41586-018-0063-9","article-title":"Genomic variation in 3,010 diverse accessions of Asian cultivated rice","volume":"557","author":"Wang","year":"2018","journal-title":"Nature"},{"key":"2024070913582095900_ref8","doi-asserted-by":"crossref","first-page":"3403","DOI":"10.1038\/s41467-020-17195-4","article-title":"Discovery and population genomics of structural variation in a songbird genus","volume":"11","author":"Weissensteiner","year":"2020","journal-title":"Nat Commun"},{"key":"2024070913582095900_ref9","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/gb-2013-14-6-405","article-title":"The advantages of SMRT sequencing","volume":"14","author":"Roberts","year":"2013","journal-title":"Genome Biol"},{"key":"2024070913582095900_ref10","first-page":"1","article-title":"The Oxford Nanopore MinION: delivery of nanopore sequencing to the genomics community","volume":"17","author":"Jain","year":"2016","journal-title":"Genome Biol"},{"key":"2024070913582095900_ref11","doi-asserted-by":"crossref","first-page":"572","DOI":"10.1038\/s41576-021-00367-3","article-title":"Towards population-scale long-read sequencing","volume":"22","author":"De Coster","year":"2021","journal-title":"Nat Rev Genet"},{"key":"2024070913582095900_ref12","doi-asserted-by":"crossref","first-page":"779","DOI":"10.1038\/s41588-021-00865-4","article-title":"Long-read sequencing of 3,622 Icelanders provides insight into the role of structural variants in human diseases and other traits","volume":"53","author":"Beyter","year":"2021","journal-title":"Nat Genet"},{"key":"2024070913582095900_ref13","doi-asserted-by":"crossref","first-page":"eabf7117","DOI":"10.1126\/science.abf7117","article-title":"Haplotype-resolved diverse human genomes and integrated analysis of structural variation","volume":"372","author":"Ebert","year":"2021","journal-title":"Science"},{"key":"2024070913582095900_ref14","author":"Biosciences"},{"key":"2024070913582095900_ref15","doi-asserted-by":"crossref","first-page":"2907","DOI":"10.1093\/bioinformatics\/btz041","article-title":"SVIM: structural variant identification using mapped long reads","volume":"35","author":"Heller","year":"2019","journal-title":"Bioinformatics"},{"key":"2024070913582095900_ref16","doi-asserted-by":"crossref","first-page":"461","DOI":"10.1038\/s41592-018-0001-7","article-title":"Accurate detection of complex structural variations using single-molecule sequencing","volume":"15","author":"Sedlazeck","year":"2018","journal-title":"Nat Methods"},{"key":"2024070913582095900_ref17","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13059-020-02107-y","article-title":"Long-read-based human genomic structural variation detection with cuteSV","volume":"21","author":"Jiang","year":"2020","journal-title":"Genome Biol"},{"key":"2024070913582095900_ref18","doi-asserted-by":"crossref","first-page":"343","DOI":"10.1109\/TNB.2019.2908438","article-title":"Long-read based novel sequence insertion detection with rCANID","volume":"18","author":"Jiang","year":"2019","journal-title":"IEEE Trans Nanobioscience"},{"key":"2024070913582095900_ref19","doi-asserted-by":"crossref","first-page":"bbae049","DOI":"10.1093\/bib\/bbae049","article-title":"Kled: an ultra-fast and sensitive structural variant detection tool for long-read sequencing data","volume":"25","author":"Zhang","year":"2024","journal-title":"Brief Bioinform"},{"key":"2024070913582095900_ref20","doi-asserted-by":"crossref","first-page":"1230","DOI":"10.1038\/s41592-022-01609-w","article-title":"SVision: a deep learning approach to resolve complex structural variants","volume":"19","author":"Lin","year":"2022","journal-title":"Nat Methods"},{"key":"2024070913582095900_ref21","doi-asserted-by":"crossref","first-page":"559","DOI":"10.1038\/s41592-023-01799-x","article-title":"Cue: a deep-learning framework for structural variant discovery and genotyping","volume":"20","author":"Popic","year":"2023","journal-title":"Nat Methods"},{"key":"2024070913582095900_ref22","doi-asserted-by":"crossref","first-page":"491","DOI":"10.1038\/s41592-023-01767-5","article-title":"Facilitating genome structural variation analysis","volume":"20","author":"Sikic","year":"2023","journal-title":"Nat Methods"},{"key":"2024070913582095900_ref23","doi-asserted-by":"crossref","first-page":"bbac195","DOI":"10.1093\/bib\/bbac195","article-title":"MAMnet: detecting and genotyping deletions and insertions based on long reads and a deep learning approach","volume":"23","author":"Ding","year":"2022","journal-title":"Brief Bioinform"},{"key":"2024070913582095900_ref24","doi-asserted-by":"crossref","first-page":"80","DOI":"10.1186\/s12859-023-05216-0","article-title":"INSnet: a method for detecting insertions based on deep learning network","volume":"24","author":"Gao","year":"2023","journal-title":"BMC bioinformatics"},{"key":"2024070913582095900_ref25","doi-asserted-by":"crossref","first-page":"4568","DOI":"10.1093\/bioinformatics\/btaa527","article-title":"SVJedi: genotyping structural variations with long reads","volume":"36","author":"Lecompte","year":"2020","journal-title":"Bioinformatics"},{"key":"2024070913582095900_ref26","doi-asserted-by":"crossref","first-page":"14061","DOI":"10.1038\/ncomms14061","article-title":"Transient structural variations have strong effects on quantitative traits and reproductive isolation in fission yeast","volume":"8","author":"Jeffares","year":"2017","journal-title":"Nat Commun"},{"key":"2024070913582095900_ref27","doi-asserted-by":"crossref","first-page":"589","DOI":"10.1093\/bioinformatics\/btaa835","article-title":"PBSIM2: a simulator for long-read sequencers with a novel generative model of quality scores","volume":"37","author":"Ono","year":"2021","journal-title":"Bioinformatics"},{"key":"2024070913582095900_ref28","first-page":"1","article-title":"Detection of mosaic and population-level structural variants with Sniffles2","author":"Smolka","year":"2024","journal-title":"Nat Biotechnol"},{"key":"2024070913582095900_ref29","article-title":"Regenotyping structural variants through an accurate force-calling method","author":"Jiang","year":"2022","journal-title":"bioRxiv"},{"key":"2024070913582095900_ref30","doi-asserted-by":"crossref","first-page":"1347","DOI":"10.1038\/s41587-020-0538-8","article-title":"A robust benchmark for detection of germline large deletions and insertions","volume":"38","author":"Zook","year":"2020","journal-title":"Nat Biotechnol"},{"key":"2024070913582095900_ref31","doi-asserted-by":"crossref","first-page":"271","DOI":"10.1186\/s13059-022-02840-6","article-title":"Truvari: refined structural variant comparison preserves allelic diversity","volume":"23","author":"English","year":"2022","journal-title":"Genome Biol"},{"key":"2024070913582095900_ref32","doi-asserted-by":"crossref","first-page":"672","DOI":"10.1038\/s41587-021-01158-1","article-title":"Curated variation benchmarks for challenging medically relevant autosomal genes","volume":"40","author":"Wagner","year":"2022","journal-title":"Nat Biotechnol"},{"key":"2024070913582095900_ref33","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1038\/s41587-020-0711-0","article-title":"Chromosome-scale, haplotype-resolved assembly of human genomes","volume":"39","author":"Garg","year":"2021","journal-title":"Nat Biotechnol"},{"key":"2024070913582095900_ref34","doi-asserted-by":"crossref","first-page":"1044","DOI":"10.1038\/s41587-020-0503-6","article-title":"Nanopore sequencing and the Shasta toolkit enable efficient de novo assembly of eleven human genomes","volume":"38","author":"Shafin","year":"2020","journal-title":"Nat Biotechnol"},{"key":"2024070913582095900_ref35","doi-asserted-by":"crossref","first-page":"1155","DOI":"10.1038\/s41587-019-0217-9","article-title":"Accurate circular consensus long-read sequencing improves variant detection and assembly of a human genome","volume":"37","author":"Wenger","year":"2019","journal-title":"Nat Biotechnol"},{"key":"2024070913582095900_ref36","doi-asserted-by":"crossref","first-page":"595","DOI":"10.1038\/s41592-018-0054-7","article-title":"A synthetic-diploid benchmark for accurate variant-calling evaluation","volume":"15","author":"Li","year":"2018","journal-title":"Nat Methods"},{"key":"2024070913582095900_ref37","doi-asserted-by":"crossref","first-page":"1282","DOI":"10.1016\/j.cell.2019.02.012","article-title":"Characterizing mutational signatures in human cancer cell lines reveals episodic APOBEC mutagenesis","volume":"176","author":"Petljak","year":"2019","journal-title":"Cell"},{"key":"2024070913582095900_ref38","first-page":"67","article-title":"The genomic complexity of primary human prostate cancer","volume":"2011","author":"Mf","year":"2011","journal-title":"Nature"},{"key":"2024070913582095900_ref39","doi-asserted-by":"crossref","first-page":"331","DOI":"10.1038\/s41588-019-0576-7","article-title":"Comprehensive analysis of chromothripsis in 2,658 human cancers using whole-genome sequencing","volume":"52","author":"Cort\u00e9s-Ciriano","year":"2020","journal-title":"Nat Genet"},{"key":"2024070913582095900_ref40","doi-asserted-by":"crossref","first-page":"666","DOI":"10.1016\/j.cell.2013.03.021","article-title":"Punctuated evolution of prostate cancer genomes","volume":"153","author":"Baca","year":"2013","journal-title":"Cell"},{"key":"2024070913582095900_ref41","doi-asserted-by":"crossref","first-page":"255","DOI":"10.1186\/s13059-022-02816-6","article-title":"Structural variant analysis of a cancer reference cell line sample using multiple sequencing technologies","volume":"23","author":"Talsania","year":"2022","journal-title":"Genome Biol"},{"key":"2024070913582095900_ref42","article-title":"Severus: accurate detection and characterization of somatic structural variation in tumor genomes using long reads","author":"Keskus","year":"2024","journal-title":"medRxiv"},{"key":"2024070913582095900_ref43","doi-asserted-by":"crossref","first-page":"550","DOI":"10.1038\/s41592-022-01674-1","article-title":"SVDSS: structural variation discovery in hard-to-call genomic regions using sample-specific strings from accurate long reads","volume":"20","author":"Denti","year":"2023","journal-title":"Nat Methods"},{"key":"2024070913582095900_ref44","doi-asserted-by":"crossref","first-page":"283","DOI":"10.1038\/s41467-023-35996-1","article-title":"Deciphering the exact breakpoints of structural variations using long sequencing reads with DeBreak","volume":"14","author":"Chen","year":"2023","journal-title":"Nat Commun"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/25\/4\/bbae336\/58482682\/bbae336.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/25\/4\/bbae336\/58482682\/bbae336.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,7,9]],"date-time":"2024-07-09T14:00:42Z","timestamp":1720533642000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbae336\/7709769"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,5,23]]},"references-count":44,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2024,5,23]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbae336","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2024,7]]},"published":{"date-parts":[[2024,5,23]]},"article-number":"bbae336"}}