{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,11]],"date-time":"2026-03-11T02:47:28Z","timestamp":1773197248054,"version":"3.50.1"},"reference-count":46,"publisher":"Oxford University Press (OUP)","issue":"6","license":[{"start":{"date-parts":[[2021,12,24]],"date-time":"2021-12-24T00:00:00Z","timestamp":1640304000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"King Abdullah University of Science and Technology (KAUST) Office of Sponsored Research","award":["URF\/1\/3790-01-01"],"award-info":[{"award-number":["URF\/1\/3790-01-01"]}]},{"name":"King Abdullah University of Science and Technology (KAUST) Office of Sponsored Research","award":["URF\/1\/4355-01-01"],"award-info":[{"award-number":["URF\/1\/4355-01-01"]}]},{"name":"King Abdullah University of Science and Technology (KAUST) Office of Sponsored Research","award":["FCC\/1\/1976-08-01"],"award-info":[{"award-number":["FCC\/1\/1976-08-01"]}]},{"name":"King Abdullah University of Science and Technology (KAUST) Office of Sponsored Research","award":["FCC\/1\/1976-08-08"],"award-info":[{"award-number":["FCC\/1\/1976-08-08"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,3,4]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Structural genomic variants account for much of human variability and are involved in several diseases. Structural variants are complex and may affect coding regions of multiple genes, or affect the functions of genomic regions in different ways from single nucleotide variants. Interpreting the phenotypic consequences of structural variants relies on information about gene functions, haploinsufficiency or triplosensitivity and other genomic features. Phenotype-based methods to identifying variants that are involved in genetic diseases combine molecular features with prior knowledge about the phenotypic consequences of altering gene functions. While phenotype-based methods have been applied successfully to single nucleotide variants as well as short insertions and deletions, the complexity of structural variants makes it more challenging to link them to phenotypes. Furthermore, structural variants can affect a large number of coding regions, and phenotype information may not be available for all of them.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>We developed DeepSVP, a computational method to prioritize structural variants involved in genetic diseases by combining genomic and gene functions information. We incorporate phenotypes linked to genes, functions of gene products, gene expression in individual cell types and anatomical sites of expression, and systematically relate them to their phenotypic consequences through ontologies and machine learning. DeepSVP significantly improves the success rate of finding causative variants in several benchmarks and can identify novel pathogenic structural variants in consanguineous families.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>https:\/\/github.com\/bio-ontology-research-group\/DeepSVP.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Supplementary information<\/jats:title>\n                    <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btab859","type":"journal-article","created":{"date-parts":[[2021,12,21]],"date-time":"2021-12-21T07:17:53Z","timestamp":1640071073000},"page":"1677-1684","source":"Crossref","is-referenced-by-count":11,"title":["DeepSVP: integration of genotype and phenotype for structural variant prioritization using deep learning"],"prefix":"10.1093","volume":"38","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6084-8706","authenticated-orcid":false,"given":"Azza","family":"Althagafi","sequence":"first","affiliation":[{"name":"Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences & Engineering Division (CEMSE), King Abdullah University of Science and Technology (KAUST) , Thuwal 23955-6900, Saudi Arabia"},{"name":"Computer Science Department, College of Computers and Information Technology, Taif University , Taif, Saudi Arabia"}]},{"given":"Lamia","family":"Alsubaie","sequence":"additional","affiliation":[{"name":"Department of Pathology and Laboratory Medicine, King Abdulaziz Medical City (KAMC) , Riyadh, Saudi Arabia"},{"name":"Center for Genetics and Inherited Diseases, Taibah University, Almadinah Almunwarah, Saudi Arabia"}]},{"given":"Nagarajan","family":"Kathiresan","sequence":"additional","affiliation":[{"name":"Supercomputing Core Lab, KAUST , Thuwal, Saudi Arabia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4727-045X","authenticated-orcid":false,"given":"Katsuhiko","family":"Mineta","sequence":"additional","affiliation":[{"name":"Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences & Engineering Division (CEMSE), King Abdullah University of Science and Technology (KAUST) , Thuwal 23955-6900, Saudi Arabia"}]},{"given":"Taghrid","family":"Aloraini","sequence":"additional","affiliation":[{"name":"Department of Pathology and Laboratory Medicine, King Abdulaziz Medical City (KAMC) , Riyadh, Saudi Arabia"},{"name":"King Saud bin Abdulaziz University for Health Sciences, King Abdullah International Medical Research Centre, Ministry of National Guard-Health Affairs (MNG-HA), Riyadh, Saudi Arabia"}]},{"given":"Fuad","family":"Al Mutairi","sequence":"additional","affiliation":[{"name":"Genetics & Precision Medicine Department, King Abdulaziz Medical City, Ministry of National Guard-Health Affairs (MNG-HA), Riyadh, Saudi Arabia"},{"name":"King Saud bin Abdulaziz University for Health Sciences, King Abdullah International Medical Research Centre, Ministry of National Guard-Health Affairs (MNG-HA), Riyadh, Saudi Arabia"}]},{"given":"Majid","family":"Alfadhel","sequence":"additional","affiliation":[{"name":"Genetics & Precision Medicine Department, King Abdulaziz Medical City, Ministry of National Guard-Health Affairs (MNG-HA), Riyadh, Saudi Arabia"},{"name":"King Saud bin Abdulaziz University for Health Sciences, King Abdullah International Medical Research Centre, Ministry of National Guard-Health Affairs (MNG-HA), Riyadh, Saudi Arabia"}]},{"given":"Takashi","family":"Gojobori","sequence":"additional","affiliation":[{"name":"KCBRC, Biological and Environmental Science and Engineering Division (BESE), KAUST, Thuwal, Saudi Arabia"}]},{"given":"Ahmad","family":"Alfares","sequence":"additional","affiliation":[{"name":"Department of Pathology and Laboratory Medicine, King Abdulaziz Medical City (KAMC) , Riyadh, Saudi Arabia"},{"name":"King Saud bin Abdulaziz University for Health Sciences, King Abdullah International Medical Research Centre, Ministry of National Guard-Health Affairs (MNG-HA), Riyadh, Saudi Arabia"},{"name":"Department of Pediatrics, College of Medicine, Qassim University, Qassim, Saudi Arabia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8149-5890","authenticated-orcid":false,"given":"Robert","family":"Hoehndorf","sequence":"additional","affiliation":[{"name":"Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences & Engineering Division (CEMSE), King Abdullah University of Science and Technology (KAUST) , Thuwal 23955-6900, Saudi Arabia"}]}],"member":"286","published-online":{"date-parts":[[2021,12,24]]},"reference":[{"key":"2023020108573250300_btab859-B1","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1038\/nature11632","article-title":"An integrated map of genetic variation from 1,092 human genomes","volume":"491","year":"2012","journal-title":"Nature"},{"key":"2023020108573250300_btab859-B2","doi-asserted-by":"crossref","first-page":"103","DOI":"10.1186\/s12920-020-00743-8","article-title":"What is the right sequencing approach? Solo VS extended family analysis in consanguineous populations","volume":"13","author":"Alfares","year":"2020","journal-title":"BMC Med. Genomics"},{"key":"2023020108573250300_btab859-B3","doi-asserted-by":"crossref","first-page":"564","DOI":"10.1002\/humu.21466","article-title":"A new face and new challenges for Online Mendelian Inheritance in Man (OMIM\u00ae)","volume":"32","author":"Amberger","year":"2011","journal-title":"Hum. Mutat"},{"key":"2023020108573250300_btab859-B4","doi-asserted-by":"crossref","first-page":"D801","DOI":"10.1093\/nar\/gky1056","article-title":"Mouse Genome Database (MGD) 2019","volume":"47","author":"Bult","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2023020108573250300_btab859-B5","doi-asserted-by":"crossref","first-page":"853","DOI":"10.1093\/bioinformatics\/btaa879","article-title":"Predicting candidate genes from phenotypes, functions and anatomical site of expression","volume":"37","author":"Chen","year":"2020","journal-title":"Bioinformatics"},{"key":"2023020108573250300_btab859-B6","doi-asserted-by":"crossref","first-page":"367","DOI":"10.1038\/s41586-018-0590-4","article-title":"Single-cell transcriptomics of 20 mouse organs creates a tabula muris","volume":"562","author":"Consortium","year":"2018","journal-title":"Nature"},{"key":"2023020108573250300_btab859-B7","doi-asserted-by":"crossref","first-page":"2087","DOI":"10.1093\/bioinformatics\/bty028","article-title":"PhenoRank: reducing study bias in gene prioritization through simulation","volume":"34","author":"Cornish","year":"2018","journal-title":"Bioinformatics"},{"key":"2023020108573250300_btab859-B8","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13326-016-0088-7","article-title":"The cell ontology 2016: enhanced content, modularization, and ontology interoperability","volume":"7","author":"Diehl","year":"2016","journal-title":"J. Biomed. Seman"},{"key":"2023020108573250300_btab859-B9","first-page":"358","article-title":"Phenotypic overlap in the contribution of individual genes to CNV pathogenicity revealed by cross-species computational analysis of single-gene mutations in humans, mice and zebrafish","volume":"6","author":"Doelken","year":"2013","journal-title":"Dis. Models Mech"},{"key":"2023020108573250300_btab859-B10","doi-asserted-by":"crossref","first-page":"64","DOI":"10.1056\/NEJMra1809315","article-title":"Genetic variation, comparative genomics, and the diagnosis of disease","volume":"381","author":"Eichler","year":"2019","journal-title":"N. Engl. J. Med"},{"key":"2023020108573250300_btab859-B11","doi-asserted-by":"crossref","first-page":"599","DOI":"10.1038\/nrg.2017.52","article-title":"Settling the score: variant prioritization and mendelian disease","volume":"18","author":"Eilbeck","year":"2017","journal-title":"Nat. Rev. Genet"},{"key":"2023020108573250300_btab859-B12","doi-asserted-by":"crossref","first-page":"524","DOI":"10.1016\/j.ajhg.2009.03.010","article-title":"Decipher: database of chromosomal imbalance and phenotype in humans using Ensembl resources","volume":"84","author":"Firth","year":"2009","journal-title":"Am. J. Hum. Genet"},{"key":"2023020108573250300_btab859-B13","doi-asserted-by":"crossref","first-page":"1083","DOI":"10.1093\/bioinformatics\/btw789","article-title":"SVScore: an impact prediction tool for structural variation","volume":"33","author":"Ganel","year":"2017","journal-title":"Bioinformatics"},{"key":"2023020108573250300_btab859-B14","doi-asserted-by":"crossref","first-page":"D330","DOI":"10.1093\/nar\/gky1055","article-title":"The gene ontology resource: 20 years and still going strong","volume":"47","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2023020108573250300_btab859-B15","doi-asserted-by":"crossref","first-page":"3572","DOI":"10.1093\/bioinformatics\/bty304","article-title":"AnnotSV: an integrated tool for structural variations annotation","volume":"34","author":"Geoffroy","year":"2018","journal-title":"Bioinformatics"},{"key":"2023020108573250300_btab859-B16","doi-asserted-by":"crossref","first-page":"1129","DOI":"10.1016\/S0895-4356(03)00177-X","article-title":"The diagnostic odds ratio: a single indicator of test performance","volume":"56","author":"Glas","year":"2003","journal-title":"J. Clin. Epidemiol"},{"key":"2023020108573250300_btab859-B17"},{"key":"2023020108573250300_btab859-B18","doi-asserted-by":"crossref","first-page":"648","DOI":"10.1126\/science.1262110","article-title":"The genotype-tissue expression (GTEx) pilot analysis: multitissue gene regulation in humans","volume":"348","year":"2015","journal-title":"Science"},{"key":"2023020108573250300_btab859-B19","doi-asserted-by":"crossref","first-page":"e1000752","DOI":"10.1371\/journal.pcbi.1000752","article-title":"Accurate distinction of pathogenic from benign CNVs in mental retardation","volume":"6","author":"Hehir-Kwa","year":"2010","journal-title":"PLoS Comput. Biol"},{"key":"2023020108573250300_btab859-B20","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1038\/nature06862","article-title":"Mapping and sequencing of structural variation from eight human genomes","volume":"453","author":"Kidd","year":"2008","journal-title":"Nature"},{"key":"2023020108573250300_btab859-B21","doi-asserted-by":"crossref","first-page":"1141","DOI":"10.1172\/JCI94999","article-title":"Severe peri-ictal respiratory dysfunction is common in Dravet syndrome","volume":"128","author":"Kim","year":"2018","journal-title":"J. Clin. Invest"},{"key":"2023020108573250300_btab859-B22","author":"Kleinert","year":"2021"},{"key":"2023020108573250300_btab859-B23","doi-asserted-by":"crossref","first-page":"766","DOI":"10.1136\/jmedgenet-2014-102633","article-title":"Clinical interpretation of CNVs with cross-species phenotype data","volume":"51","author":"K\u00f6hler","year":"2014","journal-title":"J. Med. Genet"},{"key":"2023020108573250300_btab859-B24","doi-asserted-by":"crossref","first-page":"D1018","DOI":"10.1093\/nar\/gky1105","article-title":"Expansion of the human phenotype ontology (HPO) knowledge base and resources","volume":"47","author":"K\u00f6hler","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2023020108573250300_btab859-B25","doi-asserted-by":"crossref","first-page":"e1008453","DOI":"10.1371\/journal.pcbi.1008453","article-title":"DeepPheno: predicting single gene loss-of-function phenotypes using an ontology-aware hierarchical classifier","volume":"16","author":"Kulmanov","year":"2020","journal-title":"PLoS Comput. Biol"},{"key":"2023020108573250300_btab859-B26","doi-asserted-by":"crossref","first-page":"bbaa199","DOI":"10.1093\/bib\/bbaa199","article-title":"Semantic similarity and machine learning with ontologies","volume":"22","author":"Kulmanov","year":"2020","journal-title":"Brief. Bioinform"},{"key":"2023020108573250300_btab859-B27","doi-asserted-by":"crossref","first-page":"469","DOI":"10.1038\/nature13127","article-title":"Guidelines for investigating causality of sequence variants in human disease","volume":"508","author":"MacArthur","year":"2014","journal-title":"Nature"},{"key":"2023020108573250300_btab859-B28","author":"Mikolov","year":"2013"},{"key":"2023020108573250300_btab859-B29","doi-asserted-by":"crossref","first-page":"163","DOI":"10.1111\/gbb.12099","article-title":"Mapping genetic modifiers of survival in a mouse model of Dravet syndrome","volume":"13","author":"Miller","year":"2014","journal-title":"Genes Brain Behav"},{"key":"2023020108573250300_btab859-B30","doi-asserted-by":"crossref","first-page":"R5","DOI":"10.1186\/gb-2012-13-1-r5","article-title":"UBERON: an integrative multi-species anatomy ontology","volume":"13","author":"Mungall","year":"2012","journal-title":"Genome Biol"},{"key":"2023020108573250300_btab859-B31","doi-asserted-by":"crossref","first-page":"e66","DOI":"10.1111\/j.1528-1167.2011.03139.x","article-title":"Refractory neonatal epilepsy with a de novo duplication of chromosome 2q24.2q24.3","volume":"52","author":"Okumura","year":"2011","journal-title":"Epilepsia"},{"key":"2023020108573250300_btab859-B32","doi-asserted-by":"crossref","first-page":"368","DOI":"10.1038\/nature09146","article-title":"Functional impact of global rare copy number variation in autism spectrum disorders","volume":"466","author":"Pinto","year":"2010","journal-title":"Nature"},{"key":"2023020108573250300_btab859-B33","doi-asserted-by":"crossref","first-page":"245","DOI":"10.1038\/s41436-019-0686-8","article-title":"Technical standards for the interpretation and reporting of constitutional copy-number variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics (ACMG) and the Clinical Genome Resource (ClinGen)","volume":"22","author":"Riggs","year":"2020","journal-title":"Genet. Med"},{"key":"2023020108573250300_btab859-B34","doi-asserted-by":"crossref","first-page":"e1001273","DOI":"10.1371\/journal.pgen.1001273","article-title":"Proteins encoded in genomic regions associated with immune-mediated disease physically interact and suggest underlying biology","volume":"7","author":"Rossin","year":"2011","journal-title":"PLoS Genet"},{"key":"2023020108573250300_btab859-B35","doi-asserted-by":"crossref","first-page":"95","DOI":"10.1186\/s13073-018-0606-6","article-title":"Complex structural variants in mendelian disorders: identification and breakpoint resolution using short- and long-read genome sequencing","volume":"10","author":"Sanchis-Juan","year":"2018","journal-title":"Genome Med"},{"key":"2023020108573250300_btab859-B36","author":"Sharo","year":"2020"},{"key":"2023020108573250300_btab859-B37","doi-asserted-by":"crossref","first-page":"D704","DOI":"10.1093\/nar\/gkz997","article-title":"The Monarch Initiative in 2019: an integrative data and analytic platform connecting phenotypes to genotypes across species","volume":"48","author":"Shefchek","year":"2020","journal-title":"Nucleic Acids Res"},{"key":"2023020108573250300_btab859-B38","doi-asserted-by":"crossref","first-page":"2128","DOI":"10.1111\/j.1528-1167.2012.03676.x","article-title":"Duplication of the sodium channel gene cluster on 2q24 in children with early onset epilepsy","volume":"53","author":"Simonetti","year":"2012","journal-title":"Epilepsia"},{"key":"2023020108573250300_btab859-B39","doi-asserted-by":"crossref","first-page":"2133","DOI":"10.1093\/bioinformatics\/bty933","article-title":"OPA2Vec: combining formal and informal content of biomedical ontologies to improve similarity-based prediction","volume":"35","author":"Smaili","year":"2019","journal-title":"Bioinformatics"},{"key":"2023020108573250300_btab859-B40","doi-asserted-by":"crossref","first-page":"bat025","DOI":"10.1093\/database\/bat025","article-title":"PhenoDigm: analyzing curated annotations to associate animal models with human diseases","volume":"2013","author":"Smedley","year":"2013","journal-title":"Database"},{"key":"2023020108573250300_btab859-B41","doi-asserted-by":"crossref","first-page":"3215","DOI":"10.1093\/bioinformatics\/btu508","article-title":"Walking the interactome for candidate prioritization in exome sequencing studies of Mendelian diseases","volume":"30","author":"Smedley","year":"2014","journal-title":"Bioinformatics"},{"key":"2023020108573250300_btab859-B42","doi-asserted-by":"crossref","first-page":"2004","DOI":"10.1038\/nprot.2015.124","article-title":"Next-generation diagnostics and disease-gene discovery with the exomiser","volume":"10","author":"Smedley","year":"2015","journal-title":"Nat. Protoc"},{"key":"2023020108573250300_btab859-B43","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1038\/nature15394","article-title":"An integrated map of structural variation in 2,504 human genomes","volume":"526","author":"Sudmant","year":"2015","journal-title":"Nature"},{"key":"2023020108573250300_btab859-B44","doi-asserted-by":"crossref","first-page":"D506","DOI":"10.1093\/nar\/gky1049","article-title":"UniProt: a worldwide hub of protein knowledge","volume":"47","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2023020108573250300_btab859-B45","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13073-021-00945-4","article-title":"X-CNV: genome-wide prediction of the pathogenicity of copy number variations","volume":"13","author":"Zhang","year":"2021","journal-title":"Genome Med"},{"key":"2023020108573250300_btab859-B46","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13059-019-1835-8","article-title":"The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens","volume":"20","author":"Zhou","year":"2019","journal-title":"Genome Biol"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btab859\/42113886\/btab859.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/38\/6\/1677\/49008910\/btab859.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/38\/6\/1677\/49008910\/btab859.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,14]],"date-time":"2023-11-14T13:20:04Z","timestamp":1699968004000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/38\/6\/1677\/6482742"}},"subtitle":[],"editor":[{"given":"Zhiyong","family":"Lu","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2021,12,24]]},"references-count":46,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2022,3,4]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btab859","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2021.01.28.428557","asserted-by":"object"}]},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2022,3,15]]},"published":{"date-parts":[[2021,12,24]]}}}