{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,10]],"date-time":"2026-03-10T01:13:21Z","timestamp":1773105201479,"version":"3.50.1"},"reference-count":114,"publisher":"Oxford University Press (OUP)","license":[{"start":{"date-parts":[[2023,10,25]],"date-time":"2023-10-25T00:00:00Z","timestamp":1698192000000},"content-version":"vor","delay-in-days":301,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100008530","name":"FEDER","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100008530","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,12,28]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Background<\/jats:title>\n                    <jats:p>The adoption of whole-genome sequencing in genetic screens has facilitated the detection of genetic variation in the intronic regions of genes, far from annotated splice sites. However, selecting an appropriate computational tool to discriminate functionally relevant genetic variants from those with no effect is challenging, particularly for deep intronic regions where independent benchmarks are scarce.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>In this study, we have provided an overview of the computational methods available and the extent to which they can be used to analyze deep intronic variation. We leveraged diverse datasets to extensively evaluate tool performance across different intronic regions, distinguishing between variants that are expected to disrupt splicing through different molecular mechanisms. Notably, we compared the performance of SpliceAI, a widely used sequence-based deep learning model, with that of more recent methods that extend its original implementation. We observed considerable differences in tool performance depending on the region considered, with variants generating cryptic splice sites being better predicted than those that potentially affect splicing regulatory elements. Finally, we devised a novel quantitative assessment of tool interpretability and found that tools providing mechanistic explanations of their predictions are often correct with respect to the ground - information, but the use of these tools results in decreased predictive power when compared to black box methods.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Conclusions<\/jats:title>\n                    <jats:p>Our findings translate into practical recommendations for tool usage and provide a reference framework for applying prediction tools in deep intronic regions, enabling more informed decision-making by practitioners.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/gigascience\/giad085","type":"journal-article","created":{"date-parts":[[2023,9,20]],"date-time":"2023-09-20T17:26:11Z","timestamp":1695230771000},"source":"Crossref","is-referenced-by-count":22,"title":["Computational prediction of human deep intronic variation"],"prefix":"10.1093","volume":"12","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3892-7640","authenticated-orcid":false,"given":"Pedro","family":"Barbosa","sequence":"first","affiliation":[{"name":"LASIGE, Departamento de Inform\u00e1tica, Faculdade de Ci\u00eancias, Universidade de Lisboa, 1749-016, , Lisboa ,","place":["Portugal"]},{"name":"Instituto de Medicina Molecular Jo\u00e3o Lobo Antunes, Faculdade de Medicina, Universidade de Lisboa, 1649-028 , Lisboa ,","place":["Portugal"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8367-2317","authenticated-orcid":false,"given":"Rosina","family":"Savisaar","sequence":"additional","affiliation":[{"name":"Mondego Science, 3030-480 , Coimbra ,","place":["Portugal"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3402-7143","authenticated-orcid":false,"given":"Maria","family":"Carmo-Fonseca","sequence":"additional","affiliation":[{"name":"Instituto de Medicina Molecular Jo\u00e3o Lobo Antunes, Faculdade de Medicina, Universidade de Lisboa, 1649-028 , Lisboa ,","place":["Portugal"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0879-4015","authenticated-orcid":false,"given":"Alcides","family":"Fonseca","sequence":"additional","affiliation":[{"name":"LASIGE, Departamento de Inform\u00e1tica, Faculdade de Ci\u00eancias, Universidade de Lisboa, 1749-016, , Lisboa ,","place":["Portugal"]}]}],"member":"286","published-online":{"date-parts":[[2023,10,25]]},"reference":[{"issue":"5","key":"2024111706372193300_bib1","doi-asserted-by":"crossref","first-page":"284","DOI":"10.1186\/1479-7364-4-5-284","article-title":"Functional intronic polymorphisms: Buried treasure awaiting discovery within our genes","volume":"4","author":"Cooper","year":"2010","journal-title":"Hum Genom"},{"issue":"7809","key":"2024111706372193300_bib2","doi-asserted-by":"crossref","first-page":"434","DOI":"10.1038\/s41586-020-2308-7","article-title":"The mutational constraint spectrum quantified from variation in 141,456 humans","volume":"581","author":"Karczewski","year":"2020","journal-title":"Nature"},{"issue":"7845","key":"2024111706372193300_bib3","doi-asserted-by":"crossref","first-page":"290","DOI":"10.1038\/s41586-021-03205-y","article-title":"Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program","volume":"590","author":"Taliun","year":"2021","journal-title":"Nature"},{"issue":"10","key":"2024111706372193300_bib4","doi-asserted-by":"crossref","first-page":"599","DOI":"10.1038\/nrg.2017.52","article-title":"Settling the score: variant prioritization and Mendelian disease","volume":"18","author":"Eilbeck","year":"2017","journal-title":"Nat Rev Genet"},{"key":"2024111706372193300_bib5","doi-asserted-by":"crossref","first-page":"689892","DOI":"10.3389\/fgene.2021.689892","article-title":"Splicing in the diagnosis of rare disease: advances and challenges","volume":"12","author":"Lord","year":"2021","journal-title":"Front Genet"},{"issue":"4","key":"2024111706372193300_bib6","doi-asserted-by":"crossref","first-page":"701","DOI":"10.1016\/j.cell.2009.02.009","article-title":"The spliceosome: design principles of a dynamic RNP machine","volume":"136","author":"Wahl","year":"2009","journal-title":"Cell"},{"issue":"2","key":"2024111706372193300_bib7","doi-asserted-by":"crossref","first-page":"152","DOI":"10.1002\/path.2649","article-title":"The pathobiology of splicing","volume":"220","author":"Ward","year":"2010","journal-title":"J Pathol"},{"issue":"10","key":"2024111706372193300_bib8","doi-asserted-by":"crossref","first-page":"749","DOI":"10.1038\/nrg2164","article-title":"Splicing in disease: disruption of the splicing code and the decoding machinery","volume":"8","author":"Wang","year":"2007","journal-title":"Nat Rev Genet"},{"issue":"27","key":"2024111706372193300_bib9","doi-asserted-by":"crossref","first-page":"11093","DOI":"10.1073\/pnas.1101135108","article-title":"Using positional distribution to identify splicing elements and predict pre-mRNA processing defects in human genes","volume":"108","author":"Lim","year":"2011","journal-title":"Proc Natl Acad Sci"},{"issue":"3","key":"2024111706372193300_bib10","doi-asserted-by":"crossref","first-page":"535","DOI":"10.1016\/j.cell.2018.12.015","article-title":"Predicting splicing from primary sequence with deep learning","volume":"176","author":"Jaganathan","year":"2019","journal-title":"Cell"},{"issue":"2","key":"2024111706372193300_bib11","doi-asserted-by":"crossref","first-page":"112","DOI":"10.1038\/s41573-019-0042-3","article-title":"Targeting mRNA processing as an anticancer strategy","volume":"19","author":"Desterro","year":"2020","journal-title":"Nat Rev Drug Discov"},{"issue":"3","key":"2024111706372193300_bib12","doi-asserted-by":"crossref","first-page":"253","DOI":"10.1007\/s13353-018-0444-7","article-title":"Splicing mutations in human genetic disorders: examples, detection, and confirmation","volume":"59","author":"Anna","year":"2018","journal-title":"J Appl Genet"},{"issue":"2","key":"2024111706372193300_bib13","doi-asserted-by":"crossref","first-page":"329","DOI":"10.1016\/j.molcel.2019.09.017","article-title":"Alternative splicing regulatory networks: functions, mechanisms, and evolution","volume":"76","author":"Ule","year":"2019","journal-title":"Mol Cell"},{"issue":"7","key":"2024111706372193300_bib14","doi-asserted-by":"crossref","first-page":"407","DOI":"10.1038\/nrg.2016.46","article-title":"Lessons from non-canonical splicing","volume":"17","author":"Sibley","year":"2016","journal-title":"Nat Rev Genet"},{"issue":"D1","key":"2024111706372193300_bib15","doi-asserted-by":"crossref","first-page":"D1062","DOI":"10.1093\/nar\/gkx1153","article-title":"ClinVar: improving access to variant interpretations and supporting evidence","volume":"46","author":"Landrum","year":"2018","journal-title":"Nucleic Acids Res"},{"issue":"10","key":"2024111706372193300_bib16","doi-asserted-by":"crossref","first-page":"1197","DOI":"10.1007\/s00439-020-02199-3","article-title":"The human gene mutation database (HGMD\u00ae): optimizing its use in a clinical diagnostic or research setting","volume":"139","author":"Stenson","year":"2020","journal-title":"Hum Genet"},{"issue":"2","key":"2024111706372193300_bib17","doi-asserted-by":"crossref","first-page":"159","DOI":"10.1101\/gr.238444.118","article-title":"Pathogenicity and selective constraint on variation near splice sites","volume":"29","author":"Lord","year":"2019","journal-title":"Genome Res"},{"issue":"1","key":"2024111706372193300_bib18","doi-asserted-by":"crossref","first-page":"79","DOI":"10.1186\/s13073-022-01087-x","article-title":"A systematic analysis of splicing variants identifies new diagnoses in the 100,000 Genomes Project","volume":"14","author":"Blakes","year":"2022","journal-title":"Genome Med"},{"issue":"1","key":"2024111706372193300_bib19","doi-asserted-by":"crossref","first-page":"73","DOI":"10.1186\/s13073-022-01073-3","article-title":"Recommendations for clinical interpretation of variants found in non-coding regions of the genome","volume":"14","author":"Ellingford","year":"2022","journal-title":"Genome Med"},{"issue":"9","key":"2024111706372193300_bib20","doi-asserted-by":"crossref","first-page":"1093","DOI":"10.1007\/s00439-017-1809-4","article-title":"Deep intronic mutations and human disease","volume":"136","author":"Vaz-Drago","year":"2017","journal-title":"Hum Genet"},{"key":"2024111706372193300_bib21","doi-asserted-by":"crossref","first-page":"943044","DOI":"10.3389\/fgene.2022.943044","article-title":"Analysis of pathogenic pseudoexons reveals novel mechanisms driving cryptic splicing","volume":"12","author":"Keegan","year":"2022","journal-title":"Front Genet"},{"issue":"7616","key":"2024111706372193300_bib22","doi-asserted-by":"crossref","first-page":"285","DOI":"10.1038\/nature19057","article-title":"Analysis of protein-coding genetic variation in 60,706 humans","volume":"536","author":"Lek","year":"2016","journal-title":"Nature"},{"issue":"7414","key":"2024111706372193300_bib23","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1038\/nature11247","article-title":"An integrated encyclopedia of DNA elements in the human genome","volume":"489","author":"Dunham","year":"2012","journal-title":"Nature"},{"issue":"7","key":"2024111706372193300_bib24","doi-asserted-by":"crossref","first-page":"389","DOI":"10.1038\/s41576-019-0122-6","article-title":"Deep learning: new computational modelling techniques for genomics","volume":"20","author":"Eraslan","year":"2019","journal-title":"Nat Rev Genet"},{"issue":"1","key":"2024111706372193300_bib25","doi-asserted-by":"crossref","first-page":"482","DOI":"10.1186\/s12859-022-05041-x","article-title":"Combining genetic constraint with predictions of alternative splicing to prioritize deleterious splicing in rare disease studies","volume":"23","author":"Cormier","year":"2022","journal-title":"BMC Bioinformatics"},{"key":"2024111706372193300_bib26","volume-title":"PDIVAS:\u00a0Pathogenicity\u00a0predictor\u00a0for\u00a0deep-intronic\u00a0variants\u00a0causing\u00a0aberrant\u00a0splicing","author":"Kurosawa","year":"2023"},{"issue":"5","key":"2024111706372193300_bib27","doi-asserted-by":"crossref","first-page":"861","DOI":"10.1038\/s41588-023-01373-3","article-title":"Aberrant splicing prediction across human tissues","volume":"55","author":"Wagner","year":"2023","journal-title":"Nat Genet"},{"issue":"1","key":"2024111706372193300_bib28","doi-asserted-by":"crossref","first-page":"103","DOI":"10.1186\/s13059-022-02664-4","article-title":"Predicting RNA splicing from DNA sequence using Pangolin","volume":"23","author":"Zeng","year":"2022","journal-title":"Genome Biol"},{"issue":"6","key":"2024111706372193300_bib29","doi-asserted-by":"crossref","first-page":"e0269159","DOI":"10.1371\/journal.pone.0269159","article-title":"CI-SpliceAI\u2014improving machine learning predictions of disease causing splicing variants using curated alternative splice sites","volume":"17","author":"Strauch","year":"2022","journal-title":"PLoS One"},{"issue":"D1","key":"2024111706372193300_bib30","doi-asserted-by":"crossref","first-page":"D766","DOI":"10.1093\/nar\/gky955","article-title":"GENCODE reference annotation for the human and mouse genomes","volume":"47","author":"Frankish","year":"2019","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"2024111706372193300_bib31","doi-asserted-by":"crossref","first-page":"125","DOI":"10.1186\/s13059-019-1738-8","article-title":"Essential guidelines for computational method benchmarking","volume":"20","author":"Weber","year":"2019","journal-title":"Genome Biol"},{"issue":"1","key":"2024111706372193300_bib32","doi-asserted-by":"crossref","first-page":"152","DOI":"10.1186\/s13059-021-02365-4","article-title":"On the optimistic performance evaluation of newly introduced bioinformatic methods","volume":"22","author":"Buchka","year":"2021","journal-title":"Genome Biol"},{"issue":"1","key":"2024111706372193300_bib33","doi-asserted-by":"crossref","first-page":"86","DOI":"10.1186\/s12864-020-6484-5","article-title":"Assessment of branch point prediction tools to predict physiological branch points and their alteration by variants","volume":"21","author":"Leman","year":"2020","journal-title":"BMC Genomics"},{"issue":"10","key":"2024111706372193300_bib34","doi-asserted-by":"crossref","first-page":"1811","DOI":"10.1002\/humu.24091","article-title":"Large-scale comparative evaluation of user-friendly tools for predicting variant-induced alterations of splicing regulatory elements","volume":"41","author":"Tubeuf","year":"2020","journal-title":"Hum Mutat"},{"issue":"13","key":"2024111706372193300_bib35","doi-asserted-by":"crossref","first-page":"3341","DOI":"10.3390\/cancers13133341","article-title":"Role of splicing regulatory elements and in silico tools usage in the identification of deep intronic splicing variants in hereditary breast\/ovarian cancer genes","volume":"13","author":"Moles-Fern\u00e1ndez","year":"2021","journal-title":"Cancers"},{"issue":"7","key":"2024111706372193300_bib36","doi-asserted-by":"crossref","first-page":"799","DOI":"10.1002\/humu.24212","article-title":"Benchmarking deep learning splice prediction tools using functional splice assays","volume":"42","author":"Riepe","year":"2021","journal-title":"Hum Mutat"},{"issue":"1","key":"2024111706372193300_bib37","doi-asserted-by":"crossref","first-page":"20607","DOI":"10.1038\/s41598-021-99747-2","article-title":"Comparison of in silico strategies to prioritize rare genomic variants impacting RNA splicing for the diagnosis of genomic disorders","volume":"11","author":"Rowlands","year":"2021","journal-title":"Sci Rep"},{"issue":"9","key":"2024111706372193300_bib38","doi-asserted-by":"crossref","first-page":"1308","DOI":"10.3390\/genes12091308","article-title":"Performance evaluation of spliceai for the prediction of splicing of NF1 variants","volume":"12","author":"Ha","year":"2021","journal-title":"Genes"},{"issue":"16","key":"2024111706372193300_bib39","doi-asserted-by":"crossref","first-page":"9115","DOI":"10.1093\/nar\/gkac686","article-title":"Performance evaluation of differential splicing analysis methods and splicing analytics platform construction","volume":"50","author":"Li","year":"2022","journal-title":"Nucleic Acids Res"},{"issue":"12","key":"2024111706372193300_bib40","doi-asserted-by":"crossref","first-page":"2308","DOI":"10.1002\/humu.24491","article-title":"SPiP: splicing prediction pipeline, a machine learning tool for massive detection of exonic and intronic variant effects on mRNA splicing","volume":"43","author":"Leman","year":"2022","journal-title":"Hum Mutat"},{"issue":"1","key":"2024111706372193300_bib41","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1186\/s13073-020-00775-w","article-title":"CAPICE: a computational method for consequence-agnostic pathogenicity interpretation of clinical exome variations","volume":"12","author":"Li","year":"2020","journal-title":"Genome Med"},{"issue":"8","key":"2024111706372193300_bib42","doi-asserted-by":"crossref","first-page":"1034","DOI":"10.1101\/gr.3715005","article-title":"Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes","volume":"15","author":"Siepel","year":"2005","journal-title":"Genome Res"},{"issue":"15","key":"2024111706372193300_bib43","doi-asserted-by":"crossref","first-page":"7793","DOI":"10.1093\/nar\/gky678","article-title":"Performance evaluation of pathogenicity-computation methods for missense variants","volume":"46","author":"Li","year":"2018","journal-title":"Nucleic Acids Res"},{"key":"2024111706372193300_bib44","first-page":"190","article-title":"New methods for detecting lineage-specific selection","volume-title":"Research in Computational Molecular Biology. Lecture Notes in Computer Science","author":"Siepel","year":"2006"},{"issue":"8","key":"2024111706372193300_bib45","doi-asserted-by":"crossref","first-page":"2125","DOI":"10.1093\/hmg\/ddu733","article-title":"Comparison and integration of deleteriousness prediction methods for nonsynonymous SNVs in whole exome sequencing studies","volume":"24","author":"Dong","year":"2015","journal-title":"Hum Mol Genet"},{"issue":"12","key":"2024111706372193300_bib46","doi-asserted-by":"crossref","first-page":"i54","DOI":"10.1093\/bioinformatics\/btp190","article-title":"Identifying novel constrained elements by exploiting biased substitution patterns","volume":"25","author":"Garber","year":"2009","journal-title":"Bioinformatics"},{"issue":"12","key":"2024111706372193300_bib47","doi-asserted-by":"crossref","first-page":"e1001025","DOI":"10.1371\/journal.pcbi.1001025","article-title":"Identifying a high fraction of the human genome to be under selective constraint using GERP++","volume":"6","author":"Davydov","year":"2010","journal-title":"PLoS Comput Biol"},{"issue":"10","key":"2024111706372193300_bib48","doi-asserted-by":"crossref","DOI":"10.1093\/bioinformatics\/btv009","article-title":"An integrative approach to predicting the functional effects of non-coding and coding sequence variation","volume":"31","author":"Shihab","year":"2015","journal-title":"Bioinformatics"},{"issue":"3","key":"2024111706372193300_bib49","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1002\/humu.22932","article-title":"dbNSFP v3.0: a one-stop database of functional predictions and annotations for human non-synonymous and splice site SNVs","volume":"37","author":"Liu","year":"2016","journal-title":"Hum Mutat"},{"issue":"7571","key":"2024111706372193300_bib50","doi-asserted-by":"crossref","first-page":"68","DOI":"10.1038\/nature15393","article-title":"A global reference for human genetic variation","volume":"526","author":"1000 Genomes Project Consortium","year":"2015","journal-title":"Nature"},{"issue":"2","key":"2024111706372193300_bib51","doi-asserted-by":"crossref","first-page":"214","DOI":"10.1038\/ng.3477","article-title":"A spectral approach integrating functional genomic annotations for coding and noncoding variants","volume":"48","author":"Ionita-Laza","year":"2016","journal-title":"Nat Genet"},{"issue":"4","key":"2024111706372193300_bib52","doi-asserted-by":"crossref","first-page":"755","DOI":"10.1038\/s41588-019-0348-4","article-title":"S-CAP extends pathogenicity prediction to genetic variants that affect RNA splicing","volume":"51","author":"Jagadeesh","year":"2019","journal-title":"Nat Genet"},{"issue":"3","key":"2024111706372193300_bib53","doi-asserted-by":"crossref","first-page":"595","DOI":"10.1016\/j.ajhg.2016.07.005","article-title":"A whole-genome analysis framework for effective identification of pathogenic regulatory variants in Mendelian disease","volume":"99","author":"Smedley","year":"2016","journal-title":"Am J Hum Genet"},{"issue":"4","key":"2024111706372193300_bib54","doi-asserted-by":"crossref","first-page":"618","DOI":"10.1038\/ng.3810","article-title":"Fast, scalable prediction of deleterious noncoding variants from functional and population genomic data","volume":"49","author":"Huang","year":"2017","journal-title":"Nat Genet"},{"issue":"12","key":"2024111706372193300_bib55","doi-asserted-by":"crossref","first-page":"2230","DOI":"10.1002\/humu.23896","article-title":"Dutch genome diagnostic laboratories accelerated and improved variant interpretation and increased accuracy by sharing data","volume":"40","author":"Fokkema","year":"2019","journal-title":"Hum Mutat"},{"issue":"1","key":"2024111706372193300_bib56","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1186\/s13073-021-00835-9","article-title":"CADD-splice\u2014improving genome-wide variant effect prediction using deep learning-derived splice scores","volume":"13","author":"Rentzsch","year":"2021","journal-title":"Genome Med"},{"issue":"2-3","key":"2024111706372193300_bib57","doi-asserted-by":"crossref","first-page":"377","DOI":"10.1089\/1066527041410418","article-title":"Maximum entropy modeling of short sequence motifs with applications to RNA splicing signals","volume":"11","author":"Yeo","year":"2004","journal-title":"J Comput Biol"},{"issue":"13","key":"2024111706372193300_bib58","doi-asserted-by":"crossref","first-page":"2315","DOI":"10.1093\/bioinformatics\/bty960","article-title":"A plugin for the ensembl variant effect predictor that uses maxentscan to predict variant spliceogenicity","volume":"35","author":"Shamsani","year":"2019","journal-title":"Bioinformatics"},{"issue":"22","key":"2024111706372193300_bib59","doi-asserted-by":"crossref","first-page":"13534","DOI":"10.1093\/nar\/gku1206","article-title":"In silico prediction of splice-altering single nucleotide variants in the human genome","volume":"42","author":"Jian","year":"2014","journal-title":"Nucleic Acids Res"},{"issue":"Database issue","key":"2024111706372193300_bib60","doi-asserted-by":"crossref","first-page":"D1055","DOI":"10.1093\/nar\/gkr1171","article-title":"SpliceDisease database: linking RNA splicing and disease","volume":";40","author":"Wang","year":"2012","journal-title":"Nucleic Acids Res"},{"issue":"6218","key":"2024111706372193300_bib61","doi-asserted-by":"crossref","DOI":"10.1126\/science.1254806","article-title":"The human splicing code reveals new insights into the genetic determinants of disease","volume":"347","author":"Xiong","year":"2015","journal-title":"Science"},{"issue":"3","key":"2024111706372193300_bib62","doi-asserted-by":"crossref","first-page":"698","DOI":"10.1016\/j.cell.2015.09.054","article-title":"Learning the sequence determinants of alternative splicing from millions of random sequences","volume":"163","author":"Rosenberg","year":"2015","journal-title":"Cell"},{"issue":"1","key":"2024111706372193300_bib63","doi-asserted-by":"crossref","first-page":"236","DOI":"10.1038\/s41467-017-00141-2","article-title":"Annotating pathogenic non-coding variants in genic regions","volume":"8","author":"Gelfman","year":"2017","journal-title":"Nat Commun"},{"issue":"6","key":"2024111706372193300_bib64","doi-asserted-by":"crossref","first-page":"592","DOI":"10.1038\/s41587-019-0140-0","article-title":"The Kipoi repository accelerates community exchange and reuse of predictive models for genomics","volume":"37","author":"Avsec","year":"2019","journal-title":"Nat Biotechnol"},{"issue":"6","key":"2024111706372193300_bib65","doi-asserted-by":"crossref","first-page":"580","DOI":"10.1038\/ng.2653","article-title":"The genotype-tissue expression (GTEx) Project","volume":"45","author":"Lonsdale","year":"2013","journal-title":"Nat Genet"},{"issue":"1","key":"2024111706372193300_bib66","doi-asserted-by":"crossref","first-page":"48","DOI":"10.1186\/s13059-019-1653-z","article-title":"MMSplice: modular modeling improves the predictions of genetic variant effects on splicing","volume":"20","author":"Cheng","year":"2019","journal-title":"Genome Biol"},{"issue":"9","key":"2024111706372193300_bib67","doi-asserted-by":"crossref","first-page":"1564","DOI":"10.1016\/j.ajhg.2021.06.014","article-title":"Interpretable prioritization of splice variants in diagnostic next-generation sequencing","volume":"108","author":"Danis","year":"2021","journal-title":"Am J Hum Genet"},{"issue":"5","key":"2024111706372193300_bib68","doi-asserted-by":"crossref","first-page":"bbac334","DOI":"10.1093\/bib\/bbac334","article-title":"Performance evaluation of computational methods for splice-disrupting variants and improving the performance using the machine learning-based framework","volume":"23","author":"Liu","year":"2022","journal-title":"Brief Bioinform"},{"issue":"1","key":"2024111706372193300_bib69","doi-asserted-by":"crossref","first-page":"308","DOI":"10.1093\/nar\/29.1.308","article-title":"dbSNP: the NCBI database of genetic variation","volume":"29","author":"Sherry","year":"2001","journal-title":"Nucleic Acids Res"},{"issue":"13","key":"2024111706372193300_bib70","doi-asserted-by":"crossref","first-page":"3568","DOI":"10.1093\/nar\/gkg616","article-title":"ESEfinder: a web resource to identify exonic splicing enhancers","volume":"31","author":"Cartegni","year":"2003","journal-title":"Nucleic Acids Res"},{"issue":"8","key":"2024111706372193300_bib71","doi-asserted-by":"crossref","first-page":"1360","DOI":"10.1101\/gr.119628.110","article-title":"Quantitative evaluation of all hexamers as exonic splicing elements","volume":"21","author":"Ke","year":"2011","journal-title":"Genome Res"},{"issue":"16","key":"2024111706372193300_bib72","doi-asserted-by":"crossref","first-page":"10681","DOI":"10.1093\/nar\/gku736","article-title":"Genomic HEXploring allows landscaping of novel potential splicing regulatory elements","volume":"42","author":"Erkelenz","year":"2014","journal-title":"Nucleic Acids Res"},{"key":"2024111706372193300_bib73","doi-asserted-by":"crossref","first-page":"701076","DOI":"10.3389\/fgene.2021.701076","article-title":"IntSplice2: prediction of the splicing effects of intronic single-nucleotide variants using LightGBM modeling","volume":"12","author":"Takeda","year":"2021","journal-title":"Front Genet"},{"issue":"11","key":"2024111706372193300_bib74","doi-asserted-by":"crossref","first-page":"e1001016","DOI":"10.1371\/journal.pcbi.1001016","article-title":"Genome-wide association between branch point properties and alternative splicing","volume":"6","author":"Corvelo","year":"2010","journal-title":"PLoS Comput Biol"},{"issue":"20","key":"2024111706372193300_bib75","doi-asserted-by":"crossref","first-page":"3166","DOI":"10.1093\/bioinformatics\/btx401","article-title":"BPP: a sequence-based algorithm for branch point prediction","volume":"33","author":"Zhang","year":"2017","journal-title":"Bioinformatics"},{"issue":"12","key":"2024111706372193300_bib76","doi-asserted-by":"crossref","first-page":"1647","DOI":"10.1261\/rna.066290.118","article-title":"A sequence-based, deep learning model accurately predicts RNA splicing branchpoints","volume":"24","author":"Paggi","year":"2018","journal-title":"RNA"},{"issue":"44","key":"2024111706372193300_bib77","doi-asserted-by":"crossref","first-page":"e2211194119","DOI":"10.1073\/pnas.2211194119","article-title":"Genome-wide detection of human variants that disrupt intronic branchpoints","volume":"119","author":"Zhang","year":"2022","journal-title":"Proc Natl Acad Sci"},{"issue":"24","key":"2024111706372193300_bib78","doi-asserted-by":"crossref","first-page":"4180","DOI":"10.1093\/bioinformatics\/bty497","article-title":"SpliceRover: interpretable convolutional neural networks for improved splice site prediction","volume":"34","author":"Zuallaert","year":"2018","journal-title":"Bioinformatics"},{"issue":"9","key":"2024111706372193300_bib79","doi-asserted-by":"crossref","first-page":"1261","DOI":"10.1002\/humu.23794","article-title":"Predicting the impact of single nucleotide variants on splicing via sequence-based deep neural networks and genomic features","volume":"40","author":"Naito","year":"2019","journal-title":"Hum Mutat"},{"issue":"6","key":"2024111706372193300_bib80","doi-asserted-by":"crossref","first-page":"848","DOI":"10.1038\/ng.3837","article-title":"Pathogenic variants that alter protein code often disrupt splicing","volume":"49","author":"Soemedi","year":"2017","journal-title":"Nat Genet"},{"issue":"1","key":"2024111706372193300_bib81","doi-asserted-by":"crossref","first-page":"561","DOI":"10.1186\/s12859-021-04471-3","article-title":"Spliceator: multi-species splice site prediction using convolutional neural networks","volume":"22","author":"Scalzitti","year":"2021","journal-title":"BMC Bioinformatics"},{"key":"2024111706372193300_bib82","article-title":"Preparing input for multiple splicing predictors","author":"Barbosa","year":"2023"},{"issue":"5","key":"2024111706372193300_bib83","doi-asserted-by":"crossref","first-page":"513","DOI":"10.1002\/humu.22768","article-title":"The evaluation of tools used to predict the impact of missense variants is hindered by two types of circularity","volume":"36","author":"Grimm","year":"2015","journal-title":"Hum Mutat"},{"issue":"7","key":"2024111706372193300_bib84","doi-asserted-by":"crossref","first-page":"1347","DOI":"10.1038\/s41388-020-01614-3","article-title":"Comprehensive characterisation of intronic mis-splicing mutations in human cancers","volume":"40","author":"Jung","year":"2021","journal-title":"Oncogene"},{"issue":"2","key":"2024111706372193300_bib85","doi-asserted-by":"crossref","first-page":"103","DOI":"10.1002\/humu.24306","article-title":"Pseudoexon activation in disease by non-splice site deep intronic sequence variation\u2014wild type pseudoexons constitute high-risk sites in the human genome","volume":"43","author":"Petersen","year":"2022","journal-title":"Hum Mutat"},{"issue":"1","key":"2024111706372193300_bib86","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1186\/s13059-018-1437-x","article-title":"Vex-Seq: high-throughput identification of the impact of genetic variation on pre-mRNA splicing efficiency","volume":"19","author":"Adamson","year":"2018","journal-title":"Genome Biol"},{"issue":"1","key":"2024111706372193300_bib87","doi-asserted-by":"crossref","first-page":"183","DOI":"10.1016\/j.molcel.2018.10.037","article-title":"A multiplexed assay for exon recognition reveals that an unappreciated fraction of rare genetic variants cause large-effect splicing disruptions","volume":"73","author":"Cheung","year":"2019","journal-title":"Mol cell"},{"key":"2024111706372193300_bib88","author":"SpliceAI Lookup API"},{"issue":"2","key":"2024111706372193300_bib89","doi-asserted-by":"crossref","first-page":"324","DOI":"10.1038\/s41588-022-01293-8","article-title":"SpliceVault predicts the precise nature of variant-associated mis-splicing","volume":"55","author":"Dawes","year":"2023","journal-title":"Nat Genet"},{"issue":"D1","key":"2024111706372193300_bib90","doi-asserted-by":"crossref","first-page":"D1207","DOI":"10.1093\/nar\/gkaa1043","article-title":"The human phenotype ontology in 2021","volume":"49","author":"K\u00f6hler","year":"2021","journal-title":"Nucleic Acids Res"},{"issue":"5","key":"2024111706372193300_bib91","doi-asserted-by":"crossref","first-page":"405","DOI":"10.1038\/gim.2015.30","article-title":"Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology","volume":"17","author":"Richards","year":"2015","journal-title":"Genet Med"},{"issue":"7","key":"2024111706372193300_bib92","doi-asserted-by":"crossref","first-page":"1269","DOI":"10.1038\/s41436-020-0781-x","article-title":"Alternative transcripts in variant interpretation: the potential for missed diagnoses and misdiagnoses","volume":"22","author":"Schoch","year":"2020","journal-title":"Genet Med"},{"issue":"10","key":"2024111706372193300_bib93","doi-asserted-by":"crossref","first-page":"1705","DOI":"10.1002\/humu.24074","article-title":"Variant effect on splicing regulatory elements, branchpoint usage, and pseudoexonization: strategies to enhance bioinformatic prediction using hereditary cancer genes as exemplars","volume":"41","author":"Canson","year":"2020","journal-title":"Hum Mutat"},{"issue":"8","key":"2024111706372193300_bib94","doi-asserted-by":"crossref","first-page":"1668","DOI":"10.3390\/ijms18081668","article-title":"Mutations of pre-mRNA splicing regulatory elements: are predictions moving forward to clinical diagnostics?","volume":"18","author":"Grodeck\u00e1","year":"2017","journal-title":"Int J Mol Sci"},{"issue":"3","key":"2024111706372193300_bib95","doi-asserted-by":"crossref","first-page":"185","DOI":"10.1038\/s41576-020-00302-y","article-title":"RNA-binding proteins in human genetic disease","volume":"22","author":"Gebauer","year":"2021","journal-title":"Nat Rev Genet"},{"issue":"141","key":"2024111706372193300_bib96","doi-asserted-by":"crossref","first-page":"20170387","DOI":"10.1098\/rsif.2017.0387","article-title":"Opportunities and obstacles for deep learning in biology and medicine","volume":"15","author":"Ching","year":"2018","journal-title":"J R Soc Int"},{"key":"2024111706372193300_bib97","doi-asserted-by":"crossref","first-page":"125","DOI":"10.1038\/s41576-022-00532-2","article-title":"Obtaining genetics insights from deep learning via explainable artificial intelligence","volume":"24","author":"Novakovsky","year":"2023","journal-title":"Nat Rev Genet"},{"issue":"7","key":"2024111706372193300_bib98","doi-asserted-by":"crossref","first-page":"1181","DOI":"10.1038\/s41436-020-0780-y","article-title":"Mapping RNA splicing variations in clinically-accessible and non-accessible tissues to facilitate Mendelian disease diagnosis using RNA-seq","volume":"22","author":"Aicher","year":"2020","journal-title":"Genet Med"},{"key":"2024111706372193300_bib99","doi-asserted-by":"crossref","DOI":"10.1186\/s13059-023-03144-z","article-title":"Benchmarking splice variant prediction algorithms using massively parallel splicing assays","author":"Smith","year":"2023"},{"key":"2024111706372193300_bib100","doi-asserted-by":"crossref","first-page":"7","DOI":"10.1186\/s40246-023-00451-1","article-title":"SpliceAI-visual: a free online tool to improve SpliceAI splicing variant interpretation","volume":"17","author":"de\u00a0Sainte\u00a0Agathe","year":"2023","journal-title":"Hum Genom"},{"key":"2024111706372193300_bib101","author":"MobiDetails"},{"key":"2024111706372193300_bib102","author":"CI-SpliceAI Online Service"},{"key":"2024111706372193300_bib103","article-title":"HuggingFace\u2019s transformers: state-of-the-art natural language processing","author":"Wolf"},{"issue":"10","key":"2024111706372193300_bib104","doi-asserted-by":"crossref","first-page":"1196","DOI":"10.1038\/s41592-021-01252-x","article-title":"Effective gene expression prediction from sequence by integrating long-range interactions","volume":"18","author":"Avsec","year":"2021","journal-title":"Nat Methods"},{"key":"2024111706372193300_bib105","first-page":"29287","article-title":"Language models enable zero-shot prediction of the effects of mutations on protein function","author":"Meier","year":"2021"},{"issue":"1","key":"2024111706372193300_bib106","doi-asserted-by":"crossref","first-page":"122","DOI":"10.1186\/s13059-016-0974-4","article-title":"The ensembl variant effect predictor","volume":"17","author":"McLaren","year":"2016","journal-title":"Genome Biol"},{"issue":"1","key":"2024111706372193300_bib107","doi-asserted-by":"crossref","first-page":"114","DOI":"10.1093\/bioinformatics\/btx547","article-title":"Snaptron: querying splicing patterns across tens of thousands of RNA-seq samples","volume":"34","author":"Wilks","year":"2018","journal-title":"Bioinformatics"},{"issue":"1","key":"2024111706372193300_bib108","doi-asserted-by":"crossref","first-page":"103","DOI":"10.1186\/s13073-020-00803-9","article-title":"dbNSFP v4: a comprehensive database of transcript-specific functional predictions and annotations for human nonsynonymous and splice-site SNVs","volume":"12","author":"Liu","year":"2020","journal-title":"Genome Med"},{"issue":"6","key":"2024111706372193300_bib109","doi-asserted-by":"crossref","first-page":"996","DOI":"10.1101\/gr.229102","article-title":"The human genome browser at UCSC","volume":"12","author":"Kent","year":"2002","journal-title":"Genome Res"},{"issue":"1","key":"2024111706372193300_bib110","doi-asserted-by":"crossref","first-page":"118","DOI":"10.1186\/s13059-016-0973-5","article-title":"Vcfanno: fast, flexible annotation of genetic variants","volume":"17","author":"Pedersen","year":"2016","journal-title":"Genome Biol"},{"key":"2024111706372193300_bib111","doi-asserted-by":"crossref","DOI":"10.3389\/fcvm.2022.975478","article-title":"Clinical significance of genetic variation in hypertrophic cardiomyopathy: comparison of computational tools to prioritize missense variants","volume":"9","author":"Barbosa","year":"2022","journal-title":"Front Cardiovasc Med"},{"key":"2024111706372193300_bib112","author":"SpliceVault Portal"},{"key":"2024111706372193300_bib113","article-title":"GitHub Repository for \u201cComputational Prediction of Human Deep Intronic Variation.\u201d","author":"Barbosa","year":"2023"},{"key":"2024111706372193300_bib114","doi-asserted-by":"crossref","unstructured":"Barbosa P, Savisaar R, Carmo-Fonseca M, et al. \u00a0Supporting data for \u201cComputational Prediction of Human Deep Intronic Variation.\u201d. GigaScience Database. 2023. 10.5524\/102423.","DOI":"10.1101\/2023.02.17.528928"}],"container-title":["GigaScience"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/gigascience\/article-pdf\/doi\/10.1093\/gigascience\/giad085\/60707190\/giad085.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/gigascience\/article-pdf\/doi\/10.1093\/gigascience\/giad085\/60707190\/giad085.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,11,17]],"date-time":"2024-11-17T01:40:09Z","timestamp":1731807609000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/gigascience\/article\/doi\/10.1093\/gigascience\/giad085\/7329463"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,12,28]]},"references-count":114,"URL":"https:\/\/doi.org\/10.1093\/gigascience\/giad085","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2023.02.17.528928","asserted-by":"object"}]},"ISSN":["2047-217X"],"issn-type":[{"value":"2047-217X","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2023]]},"published":{"date-parts":[[2022,12,28]]},"article-number":"giad085"}}