{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,19]],"date-time":"2026-02-19T03:56:16Z","timestamp":1771473376639,"version":"3.50.1"},"reference-count":38,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2023,5,23]],"date-time":"2023-05-23T00:00:00Z","timestamp":1684800000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,5,23]],"date-time":"2023-05-23T00:00:00Z","timestamp":1684800000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n                <jats:title>Background<\/jats:title>\n                <jats:p>Structural variations (SVs) refer to variations in an organism\u2019s chromosome structure that exceed a length of 50 base pairs. They play a significant role in genetic diseases and evolutionary mechanisms. While long-read sequencing technology has led to the development of numerous SV caller methods, their performance results have been suboptimal. Researchers have observed that current SV callers often miss true SVs and generate many false SVs, especially in repetitive regions and areas with multi-allelic SVs. These errors are due to the messy alignments of long-read data, which are affected by their high error rate. Therefore, there is a need for a more accurate SV caller method.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Result<\/jats:title>\n                <jats:p>We propose a new method-SVcnn, a more accurate deep learning-based method for detecting SVs by using long-read sequencing data. We run SVcnn and other SV callers in three real datasets and find that SVcnn improves the F1-score by 2\u20138% compared with the second-best method when the read depth is greater than 5\u00d7. More importantly, SVcnn has better performance for detecting multi-allelic SVs.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Conclusions<\/jats:title>\n                <jats:p>SVcnn is an accurate deep learning-based method to detect SVs. The program is available at <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"https:\/\/github.com\/nwpuzhengyan\/SVcnn\">https:\/\/github.com\/nwpuzhengyan\/SVcnn<\/jats:ext-link>.<\/jats:p>\n              <\/jats:sec>","DOI":"10.1186\/s12859-023-05324-x","type":"journal-article","created":{"date-parts":[[2023,5,23]],"date-time":"2023-05-23T15:04:21Z","timestamp":1684854261000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":11,"title":["SVcnn: an accurate deep learning-based method for detecting structural variation based on long-read data"],"prefix":"10.1186","volume":"24","author":[{"given":"Yan","family":"Zheng","sequence":"first","affiliation":[]},{"given":"Xuequn","family":"Shang","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,5,23]]},"reference":[{"issue":"7571","key":"5324_CR1","doi-asserted-by":"publisher","first-page":"75","DOI":"10.1038\/nature15394","volume":"526","author":"PH Sudmant","year":"2015","unstructured":"Sudmant PH, Rausch T, Gardner EJ, Handsaker RE, Abyzov A, Huddleston J, Zhang Y, Ye K, Jun G, Hsi-Yang Fritz M. An integrated map of structural variation in 2504 human genomes. Nature. 2015;526(7571):75\u201381.","journal-title":"Nature"},{"issue":"9","key":"5324_CR2","doi-asserted-by":"publisher","first-page":"530","DOI":"10.1016\/j.tig.2016.07.002","volume":"32","author":"G Macintyre","year":"2016","unstructured":"Macintyre G, Ylstra B, Brenton JD. Sequencing structural variants in cancer for precision therapeutics. Trends Genet. 2016;32(9):530\u201342.","journal-title":"Trends Genet"},{"issue":"2","key":"5324_CR3","doi-asserted-by":"publisher","first-page":"125","DOI":"10.1038\/nrg3373","volume":"14","author":"J Weischenfeldt","year":"2013","unstructured":"Weischenfeldt J, Symmons O, Spitz F, Korbel JO. Phenotypic impact of genomic structural variation: insights from and for human disease. Nature Rev Genet. 2013;14(2):125\u201338.","journal-title":"Nature Rev Genet"},{"issue":"1","key":"5324_CR4","doi-asserted-by":"publisher","first-page":"24","DOI":"10.1038\/ng1718","volume":"38","author":"A Rovelet-Lecrux","year":"2006","unstructured":"Rovelet-Lecrux A, Hannequin D, Raux G, Meur NL, Laquerri\u00e8re A, Vital A, Dumanchin C, Feuillette S, Brice A, Vercelletto M. App locus duplication causes autosomal dominant early-onset alzheimer disease with cerebral amyloid angiopathy. Nature Genet. 2006;38(1):24\u20136.","journal-title":"Nature Genet"},{"issue":"18","key":"5324_CR5","doi-asserted-by":"publisher","first-page":"4712","DOI":"10.1111\/mec.14134","volume":"26","author":"S Dennenmoser","year":"2017","unstructured":"Dennenmoser S, Sedlazeck FJ, Iwaszkiewicz E, Li X-Y, Altm\u00fcller J, Nolte AW. Copy number increases of transposable elements and protein-coding genes in an invasive fish of hybrid origin. Mol Ecol. 2017;26(18):4712\u201324.","journal-title":"Mol Ecol"},{"issue":"5","key":"5324_CR6","doi-asserted-by":"publisher","first-page":"419","DOI":"10.1002\/em.21943","volume":"56","author":"JR Lupski","year":"2015","unstructured":"Lupski JR. Structural variation mutagenesis of the human genome: impact on disease and evolution. Environ Mol Mutagen. 2015;56(5):419\u201336.","journal-title":"Environ Mol Mutagen"},{"issue":"5","key":"5324_CR7","doi-asserted-by":"publisher","first-page":"692","DOI":"10.1038\/ng.3834","volume":"49","author":"C Chiang","year":"2017","unstructured":"Chiang C, Scott AJ, Davis JR, Tsang EK, Li X, Kim Y, Hadzic T, Damani FN, Ganel L, Montgomery SB. The impact of structural variation on human gene expression. Nature Genet. 2017;49(5):692\u20139.","journal-title":"Nature Genet"},{"issue":"3","key":"5324_CR8","doi-asserted-by":"publisher","first-page":"568","DOI":"10.1101\/gr.142646.112","volume":"23","author":"T Zichner","year":"2013","unstructured":"Zichner T, Garfield DA, Rausch T, St\u00fctz AM, Cannav\u00f3 E, Braun M, Furlong EE, Korbel JO. Impact of genomic structural variation in drosophila melanogaster based on population-scale sequencing. Genome Res. 2013;23(3):568\u201379.","journal-title":"Genome Res"},{"issue":"19","key":"5324_CR9","doi-asserted-by":"publisher","first-page":"10879","DOI":"10.1093\/nar\/gkab904","volume":"49","author":"D-X Liu","year":"2021","unstructured":"Liu D-X, Rajaby R, Wei L-L, Zhang L, Yang Z-Q, Yang Q-Y, Sung W-K. Calling large indels in 1047 arabidopsis with indelensembler. Nucleic Acids Res. 2021;49(19):10879\u201394.","journal-title":"Nucleic Acids Res"},{"key":"5324_CR10","doi-asserted-by":"publisher","first-page":"733","DOI":"10.1007\/s00122-018-3233-0","volume":"132","author":"I Gabur","year":"2019","unstructured":"Gabur I, Chawla HS, Snowdon RJ, Parkin IA. Connecting genome structural variation with complex traits in crop plants. Theor Appl Genet. 2019;132:733\u201350.","journal-title":"Theor Appl Genet"},{"issue":"18","key":"5324_CR11","doi-asserted-by":"publisher","first-page":"333","DOI":"10.1093\/bioinformatics\/bts378","volume":"28","author":"T Rausch","year":"2012","unstructured":"Rausch T, Zichner T, Schlattl A, St\u00fctz AM, Benes V, Korbel JO. Delly: structural variant discovery by integrated paired-end and split-read analysis. Bioinformatics. 2012;28(18):333\u20139.","journal-title":"Bioinformatics"},{"issue":"6","key":"5324_CR12","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/gb-2014-15-6-r84","volume":"15","author":"RM Layer","year":"2014","unstructured":"Layer RM, Chiang C, Quinlan AR, Hall IM. Lumpy: a probabilistic framework for structural variant discovery. Genome Biol. 2014;15(6):1\u201319.","journal-title":"Genome Biol"},{"issue":"21","key":"5324_CR13","doi-asserted-by":"publisher","first-page":"2865","DOI":"10.1093\/bioinformatics\/btp394","volume":"25","author":"K Ye","year":"2009","unstructured":"Ye K, Schulz MH, Long Q, Apweiler R, Ning Z. Pindel: a pattern growth approach to detect break points of large deletions and medium sized insertions from paired-end short reads. Bioinformatics. 2009;25(21):2865\u201371.","journal-title":"Bioinformatics"},{"issue":"8","key":"5324_CR14","doi-asserted-by":"publisher","first-page":"1220","DOI":"10.1093\/bioinformatics\/btv710","volume":"32","author":"X Chen","year":"2016","unstructured":"Chen X, Schulz-Trieglaff O, Shaw R, Barnes B, Schlesinger F, K\u00e4llberg M, Cox AJ, Kruglyak S, Saunders CT. Manta: rapid detection of structural variants and indels for germline and cancer sequencing applications. Bioinformatics. 2016;32(8):1220\u20132.","journal-title":"Bioinformatics"},{"issue":"24","key":"5324_CR15","doi-asserted-by":"publisher","first-page":"3484","DOI":"10.1093\/bioinformatics\/btu431","volume":"30","author":"K Trappe","year":"2014","unstructured":"Trappe K, Emde A-K, Ehrlich H-C, Reinert K. Gustaf: Detecting and correctly classifying svs in the ngs twilight zone. Bioinformatics. 2014;30(24):3484\u201390.","journal-title":"Bioinformatics"},{"issue":"11","key":"5324_CR16","doi-asserted-by":"publisher","first-page":"1497","DOI":"10.1093\/bioinformatics\/btz261","volume":"37","author":"R Rajaby","year":"2021","unstructured":"Rajaby R, Sung W-K. Survindel: improving cnv calling from high-throughput sequencing data through statistical testing. Bioinformatics. 2021;37(11):1497\u2013505.","journal-title":"Bioinformatics"},{"issue":"6","key":"5324_CR17","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/gb-2013-14-6-405","volume":"14","author":"RJ Roberts","year":"2013","unstructured":"Roberts RJ, Carneiro MO, Schatz MC. The advantages of smrt sequencing. Genome Biol. 2013;14(6):1\u20134.","journal-title":"Genome Biol"},{"issue":"1","key":"5324_CR18","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13059-015-0866-z","volume":"17","author":"M Jain","year":"2016","unstructured":"Jain M, Olsen HE, Paten B, Akeson M. The oxford nanopore minion: delivery of nanopore sequencing to the genomics community. Genome Biol. 2016;17(1):1\u201311.","journal-title":"Genome Biol"},{"key":"5324_CR19","doi-asserted-by":"crossref","unstructured":"Chen Y, Wang A, Barkley C, Zhao X, Gao M, Edmonds M, Chong Z.: Debreak: deciphering the exact breakpoints of structural variations using long sequencing reads. 2022.","DOI":"10.21203\/rs.3.rs-1261915\/v1"},{"issue":"1","key":"5324_CR20","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13059-020-02107-y","volume":"21","author":"T Jiang","year":"2020","unstructured":"Jiang T, Liu Y, Jiang Y, Li J, Gao Y, Cui Z, Liu Y, Liu B, Wang Y. Long-read-based human genomic structural variation detection with cutesv. Genome Biol. 2020;21(1):1\u201324.","journal-title":"Genome Biol"},{"issue":"6","key":"5324_CR21","doi-asserted-by":"publisher","first-page":"461","DOI":"10.1038\/s41592-018-0001-7","volume":"15","author":"FJ Sedlazeck","year":"2018","unstructured":"Sedlazeck FJ, Rescheneder P, Smolka M, Fang H, Nattestad M, Von Haeseler A, Schatz MC. Accurate detection of complex structural variations using single-molecule sequencing. Nature Methods. 2018;15(6):461\u20138.","journal-title":"Nature Methods"},{"issue":"1","key":"5324_CR22","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41467-017-01343-4","volume":"8","author":"M Cretu Stancu","year":"2017","unstructured":"Cretu Stancu M, Van Roosmalen MJ, Renkens I, Nieboer MM, Middelkamp S, De Ligt J, Pregno G, Giachino D, Mandrile G, Espejo Valle-Inclan J. Mapping and phasing of structural variation in patient genomes using nanopore sequencing. Nature Commun. 2017;8(1):1\u201313.","journal-title":"Nature Commun"},{"issue":"6","key":"5324_CR23","doi-asserted-by":"publisher","first-page":"455","DOI":"10.1038\/s41592-018-0002-6","volume":"15","author":"L Gong","year":"2018","unstructured":"Gong L, Wong C-H, Cheng W-C, Tjong H, Menghi F, Ngan CY, Liu ET, Wei C-L. Picky comprehensively detects high-resolution structural variants in nanopore long reads. Nature Methods. 2018;15(6):455\u201360.","journal-title":"Nature Methods"},{"issue":"17","key":"5324_CR24","doi-asserted-by":"publisher","first-page":"2907","DOI":"10.1093\/bioinformatics\/btz041","volume":"35","author":"D Heller","year":"2019","unstructured":"Heller D, Vingron M. Svim: structural variant identification using mapped long reads. Bioinformatics. 2019;35(17):2907\u201315.","journal-title":"Bioinformatics"},{"issue":"1","key":"5324_CR25","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/1471-2105-15-180","volume":"15","author":"AC English","year":"2014","unstructured":"English AC, Salerno WJ, Reid JG. Pbhoney: identifying genomic variants via long-read discordance and interrupted mapping. BMC Bioinf. 2014;15(1):1\u20137.","journal-title":"BMC Bioinf"},{"issue":"10","key":"5324_CR26","doi-asserted-by":"publisher","first-page":"1230","DOI":"10.1038\/s41592-022-01609-w","volume":"19","author":"J Lin","year":"2022","unstructured":"Lin J, Wang S, Audano PA, Meng D, Flores JI, Kosters W, Yang X, Jia P, Marschall T, Beck CR. Svision: a deep learning approach to resolve complex structural variants. Nature Methods. 2022;19(10):1230\u20133.","journal-title":"Nature Methods"},{"issue":"9","key":"5324_CR27","doi-asserted-by":"publisher","first-page":"418","DOI":"10.1016\/j.tig.2014.07.001","volume":"30","author":"EL Van Dijk","year":"2014","unstructured":"Van Dijk EL, Auger H, Jaszczyszyn Y, Thermes C. Ten years of next-generation sequencing technology. Trends Genet. 2014;30(9):418\u201326.","journal-title":"Trends Genet"},{"key":"5324_CR28","doi-asserted-by":"publisher","first-page":"426","DOI":"10.3389\/fgene.2019.00426","volume":"10","author":"T Mantere","year":"2019","unstructured":"Mantere T, Kersten S, Hoischen A. Long-read sequencing emerging in medical genetics. Front Genet. 2019;10:426.","journal-title":"Front Genet"},{"issue":"4","key":"5324_CR29","doi-asserted-by":"publisher","first-page":"370","DOI":"10.1093\/bib\/bbaa370","volume":"22","author":"Y Liu","year":"2021","unstructured":"Liu Y, Huang Y, Wang G, Wang Y. A deep learning approach for filtering structural variants in short read sequencing data. Brief Bioinf. 2021;22(4):370.","journal-title":"Brief Bioinf"},{"issue":"5","key":"5324_CR30","doi-asserted-by":"publisher","first-page":"381","DOI":"10.1093\/bib\/bbaa381","volume":"22","author":"JT Glessner","year":"2021","unstructured":"Glessner JT, Hou X, Zhong C, Zhang J, Khan M, Brand F, Krawitz P, Sleiman PM, Hakonarson H, Wei Z. Deepcnv: a deep learning approach for authenticating copy number variations. Brief Bioinf. 2021;22(5):381.","journal-title":"Brief Bioinf"},{"issue":"1","key":"5324_CR31","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s12859-019-3299-y","volume":"20","author":"L Cai","year":"2019","unstructured":"Cai L, Wu Y, Gao J. Deepsv: accurate calling of genomic deletions from high-throughput sequencing data using deep convolutional neural network. BMC Bioinf. 2019;20(1):1\u201317.","journal-title":"BMC Bioinf"},{"issue":"21","key":"5324_CR32","doi-asserted-by":"publisher","first-page":"123","DOI":"10.1093\/nar\/gkac788","volume":"50","author":"R Tan","year":"2022","unstructured":"Tan R, Shen Y. Accurate in silico confirmation of rare copy number variant calls from exome sequencing data using transfer learning. Nucleic Acids Res. 2022;50(21):123\u2013123.","journal-title":"Nucleic Acids Res"},{"key":"5324_CR33","doi-asserted-by":"crossref","unstructured":"Popic V, Rohlicek C, Cunial F, Hajirasouliha I, Meleshko D, Garimella K, Maheshwari A.: Cue: a deep-learning framework for structural variant discovery and genotyping. Nature Methods. 2023;1\u201310","DOI":"10.1101\/2022.04.30.490167"},{"key":"5324_CR34","doi-asserted-by":"crossref","unstructured":"Smolka M, Paulin LF, Grochowski CM, Mahmoud M, Behera S, Gandhi M, Hong K, Pehlivan D, Scholz SW, Carvalho CM, et al.: Comprehensive structural variant detection: from mosaic to population-level. Biorxiv. 2022;2022\u201304","DOI":"10.1101\/2022.04.04.487055"},{"issue":"1","key":"5324_CR35","doi-asserted-by":"publisher","first-page":"399","DOI":"10.1038\/s41597-020-00743-4","volume":"7","author":"T Hon","year":"2020","unstructured":"Hon T, Mars K, Young G, Tsai Y-C, Karalius JW, Landolin JM, Maurer N, Kudrna D, Hardigan MA, Steiner CC. Highly accurate long-read hifi sequencing data for five complex genomes. Sci Data. 2020;7(1):399.","journal-title":"Sci Data"},{"key":"5324_CR36","doi-asserted-by":"crossref","unstructured":"Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, Subgroup G.P.D.P. The sequence alignment\/map format and samtools. Bioinformatics 2009;25(16), 2078\u20132079","DOI":"10.1093\/bioinformatics\/btp352"},{"issue":"18","key":"5324_CR37","doi-asserted-by":"publisher","first-page":"3094","DOI":"10.1093\/bioinformatics\/bty191","volume":"34","author":"H Li","year":"2018","unstructured":"Li H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics. 2018;34(18):3094\u2013100.","journal-title":"Bioinformatics"},{"issue":"4","key":"5324_CR38","doi-asserted-by":"publisher","first-page":"87","DOI":"10.5539\/jel.v2n4p87","volume":"2","author":"P-HN Wu","year":"2013","unstructured":"Wu P-HN, Marek MW. Helping second language literature learners overcome e-learning difficulties: Let-net team teaching with online peer interaction. J Educ Learn. 2013;2(4):87\u2013101.","journal-title":"J Educ Learn"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-023-05324-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-023-05324-x\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-023-05324-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,23]],"date-time":"2023-05-23T15:04:29Z","timestamp":1684854269000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-023-05324-x"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,5,23]]},"references-count":38,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,12]]}},"alternative-id":["5324"],"URL":"https:\/\/doi.org\/10.1186\/s12859-023-05324-x","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,5,23]]},"assertion":[{"value":"22 February 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"6 May 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"23 May 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"213"}}