{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,10]],"date-time":"2026-02-10T08:36:13Z","timestamp":1770712573100,"version":"3.49.0"},"reference-count":44,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2023,5,1]],"date-time":"2023-05-01T00:00:00Z","timestamp":1682899200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,5,1]],"date-time":"2023-05-01T00:00:00Z","timestamp":1682899200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100009595","name":"Service Public de Wallonie","doi-asserted-by":"publisher","award":["2010235-ARIAC"],"award-info":[{"award-number":["2010235-ARIAC"]}],"id":[{"id":"10.13039\/501100009595","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100009595","name":"Service Public de Wallonie","doi-asserted-by":"publisher","award":["2010235-ARIAC"],"award-info":[{"award-number":["2010235-ARIAC"]}],"id":[{"id":"10.13039\/501100009595","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100009595","name":"Service Public de Wallonie","doi-asserted-by":"publisher","award":["2010235-ARIAC"],"award-info":[{"award-number":["2010235-ARIAC"]}],"id":[{"id":"10.13039\/501100009595","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004744","name":"Innoviris","doi-asserted-by":"publisher","award":["2020 RDIR 55b"],"award-info":[{"award-number":["2020 RDIR 55b"]}],"id":[{"id":"10.13039\/501100004744","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004744","name":"Innoviris","doi-asserted-by":"publisher","award":["2020 RDIR 55b"],"award-info":[{"award-number":["2020 RDIR 55b"]}],"id":[{"id":"10.13039\/501100004744","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004744","name":"Innoviris","doi-asserted-by":"publisher","award":["2020 RDIR 55b"],"award-info":[{"award-number":["2020 RDIR 55b"]}],"id":[{"id":"10.13039\/501100004744","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100002661","name":"Fonds De La Recherche Scientifique - FNRS","doi-asserted-by":"publisher","award":["40008622"],"award-info":[{"award-number":["40008622"]}],"id":[{"id":"10.13039\/501100002661","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100002661","name":"Fonds De La Recherche Scientifique - FNRS","doi-asserted-by":"publisher","award":["35276964"],"award-info":[{"award-number":["35276964"]}],"id":[{"id":"10.13039\/501100002661","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100002661","name":"Fonds De La Recherche Scientifique - FNRS","doi-asserted-by":"publisher","award":["40005602"],"award-info":[{"award-number":["40005602"]}],"id":[{"id":"10.13039\/501100002661","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100008530","name":"European Regional Development Fund","doi-asserted-by":"publisher","award":["27.002.53.01.4524"],"award-info":[{"award-number":["27.002.53.01.4524"]}],"id":[{"id":"10.13039\/501100008530","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100008530","name":"European Regional Development Fund","doi-asserted-by":"publisher","award":["27.002.53.01.4524"],"award-info":[{"award-number":["27.002.53.01.4524"]}],"id":[{"id":"10.13039\/501100008530","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100008530","name":"European Regional Development Fund","doi-asserted-by":"publisher","award":["27.002.53.01.4524"],"award-info":[{"award-number":["27.002.53.01.4524"]}],"id":[{"id":"10.13039\/501100008530","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100003130","name":"Fonds Wetenschappelijk Onderzoek","doi-asserted-by":"publisher","award":["I002819N"],"award-info":[{"award-number":["I002819N"]}],"id":[{"id":"10.13039\/501100003130","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100003130","name":"Fonds Wetenschappelijk Onderzoek","doi-asserted-by":"publisher","award":["I002819N"],"award-info":[{"award-number":["I002819N"]}],"id":[{"id":"10.13039\/501100003130","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n                <jats:title>Background<\/jats:title>\n                <jats:p>The prediction of potentially pathogenic variant combinations in patients remains a key task in the field of medical genetics for the understanding and detection of oligogenic\/multilocus diseases. Models tailored towards such cases can help shorten the gap of missing diagnoses and can aid researchers in dealing with the high complexity of the derived data. The predictor VarCoPP (Variant Combinations Pathogenicity Predictor) that was published in 2019 and identified potentially pathogenic variant combinations in gene pairs (bilocus variant combinations), was the first important step in this direction. Despite its usefulness and applicability, several issues still remained that hindered a better performance, such as its False Positive (FP) rate, the quality of its training set and its complex architecture.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Results<\/jats:title>\n                <jats:p>We present VarCoPP2.0: the successor of VarCoPP that is a simplified, faster and more accurate predictive model identifying potentially pathogenic bilocus variant combinations. Results from cross-validation and on independent data sets reveal that VarCoPP2.0 has improved in terms of both sensitivity (95% in cross-validation and 98% during testing) and specificity (5% FP rate). At the same time, its running time shows a significant 150-fold decrease due to the selection of a simpler Balanced Random Forest model. Its positive training set now consists of variant combinations that are more confidently linked with evidence of pathogenicity, based on the confidence scores present in OLIDA, the Oligogenic Diseases Database (<jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"https:\/\/olida.ibsquare.be\">https:\/\/olida.ibsquare.be<\/jats:ext-link>). The improvement of its performance is also attributed to a more careful selection of up-to-date features identified via an original wrapper method. We show that the combination of different variant and gene pair features together is important for predictions, highlighting the usefulness of integrating biological information at different levels.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Conclusions<\/jats:title>\n                <jats:p>Through its improved performance and faster execution time, VarCoPP2.0 enables a more accurate analysis of larger data sets linked to oligogenic diseases. Users can access the ORVAL platform (<jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"https:\/\/orval.ibsquare.be\">https:\/\/orval.ibsquare.be<\/jats:ext-link>) to apply VarCoPP2.0 on their data.<\/jats:p>\n              <\/jats:sec>","DOI":"10.1186\/s12859-023-05291-3","type":"journal-article","created":{"date-parts":[[2023,5,1]],"date-time":"2023-05-01T12:02:24Z","timestamp":1682942544000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":14,"title":["Faster and more accurate pathogenic combination predictions with VarCoPP2.0"],"prefix":"10.1186","volume":"24","author":[{"given":"Nassim","family":"Versbraegen","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Barbara","family":"Gravel","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Charlotte","family":"Nachtegael","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alexandre","family":"Renaux","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Emma","family":"Verkinderen","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ann","family":"Now\u00e9","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tom","family":"Lenaerts","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sofia","family":"Papadimitriou","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2023,5,1]]},"reference":[{"key":"5291_CR1","doi-asserted-by":"publisher","DOI":"10.3390\/genes11030239","author":"KMTH Rahit","year":"2020","unstructured":"Rahit KMTH, Tarailo-Graovac M. Genetic modifiers and rare mendelian disease. Genes. 2020. https:\/\/doi.org\/10.3390\/genes11030239.","journal-title":"Genes"},{"issue":"6","key":"5291_CR2","doi-asserted-by":"publisher","first-page":"779","DOI":"10.1038\/nrg910","volume":"3","author":"JL Badano","year":"2022","unstructured":"Badano JL, Katsanis N. Beyond Mendel: an evolving view of human genetic disease transmission. Nat Rev Genet. 2022;3(6):779\u201389. https:\/\/doi.org\/10.1038\/nrg910.","journal-title":"Nat Rev Genet"},{"key":"5291_CR3","doi-asserted-by":"publisher","unstructured":"Robinson JF, Katsanis N. Oligogenic disease. 2010;243\u201362. Chap. 7. https:\/\/doi.org\/10.1007\/978-3-540-37654-5.","DOI":"10.1007\/978-3-540-37654-5"},{"key":"5291_CR4","doi-asserted-by":"crossref","unstructured":"Okazaki A, Ott J. Machine learning approaches to explore digenic inheritance. Trends Genet. 2022.","DOI":"10.1016\/j.tig.2022.04.009"},{"key":"5291_CR5","doi-asserted-by":"crossref","unstructured":"Ott J, Park T. Overview of frequent pattern mining. Genom Inform. 2022;20(4).","DOI":"10.5808\/gi.22074"},{"key":"5291_CR6","doi-asserted-by":"publisher","DOI":"10.3389\/fgene.2015.00285","author":"C Niel","year":"2015","unstructured":"Niel C, Sinoquet C, Dina C, Rocheleau G. A survey about methods dedicated to epistasis detection. Front Genet. 2015. https:\/\/doi.org\/10.3389\/fgene.2015.00285.","journal-title":"Front Genet"},{"issue":"D1","key":"5291_CR7","doi-asserted-by":"publisher","first-page":"900","DOI":"10.1093\/nar\/gkv1068","volume":"44","author":"AM Gazzo","year":"2016","unstructured":"Gazzo AM, Daneels D, Cilia E, Bonduelle M, Abramowicz M, Van Dooren S, Smits G, Lenaerts T. DIDA: a curated and annotated digenic diseases database. Nucleic Acids Res. 2016;44(D1):900\u20137.","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"5291_CR8","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41598-018-32876-3","volume":"8","author":"I Boudellioua","year":"2018","unstructured":"Boudellioua I, Kulmanov M, Schofield PN, Gkoutos GV, Hoehndorf R. OligoPVP: phenotype-driven analysis of individual genomic information to prioritize oligogenic disease variants. Sci Rep. 2018;8(1):1\u20138.","journal-title":"Sci Rep"},{"issue":"15","key":"5291_CR9","doi-asserted-by":"publisher","first-page":"140","DOI":"10.1093\/nar\/gkx557","volume":"45","author":"A Gazzo","year":"2017","unstructured":"Gazzo A, Raimondi D, Daneels D, Moreau Y, Smits G, Van Dooren S, Lenaerts T. Understanding mutational effects in digenic diseases. Nucleic Acids Res. 2017;45(15):140\u2013140.","journal-title":"Nucleic Acids Res"},{"key":"5291_CR10","doi-asserted-by":"publisher","DOI":"10.1016\/j.artmed.2019.06.006","volume":"99","author":"N Versbraegen","year":"2019","unstructured":"Versbraegen N, Fouch\u00e9 A, Nachtegael C, Papadimitriou S, Gazzo A, Smits G, Lenaerts T. Using game theory and decision decomposition to effectively discern and characterise bi-locus diseases. Artif Intell Med. 2019;99: 101690.","journal-title":"Artif Intell Med"},{"issue":"24","key":"5291_CR11","doi-asserted-by":"publisher","first-page":"11878","DOI":"10.1073\/pnas.1815601116","volume":"116","author":"S Papadimitriou","year":"2019","unstructured":"Papadimitriou S, Gazzo A, Versbraegen N, Nachtegael C, Aerts J, Moreau Y, Van Dooren S, Now\u00e9 A, Smits G, Lenaerts T. Predicting disease-causing variant combinations. Proc Natl Acad Sci. 2019;116(24):11878\u201387.","journal-title":"Proc Natl Acad Sci"},{"issue":"7571","key":"5291_CR12","doi-asserted-by":"publisher","first-page":"68","DOI":"10.1038\/nature15393","volume":"526","author":"A Auton","year":"2015","unstructured":"Auton A, Brooks LD, Durbin RM, Garrison EP, Kang HM, Korbel JO, Marchini JL, McCarthy S, McVean GA, Abecasis GR. A global reference for human genetic variation. Nature. 2015;526(7571):68\u201374. https:\/\/doi.org\/10.1038\/nature15393.","journal-title":"Nature"},{"issue":"W1","key":"5291_CR13","doi-asserted-by":"publisher","first-page":"93","DOI":"10.1093\/nar\/gkz437","volume":"47","author":"A Renaux","year":"2019","unstructured":"Renaux A, Papadimitriou S, Versbraegen N, Nachtegael C, Boutry S, Now\u00e9 A, Smits G, Lenaerts T. ORVAL: a novel platform for the prediction and exploration of disease-causing oligogenic variant combinations. Nucleic Acids Res. 2019;47(W1):93\u20138.","journal-title":"Nucleic Acids Res"},{"issue":"4","key":"5291_CR14","doi-asserted-by":"publisher","first-page":"656","DOI":"10.1111\/cen.14381","volume":"94","author":"M Laan","year":"2021","unstructured":"Laan M, Kasak L, Timinskas K, Grigorova M, Venclovas \u010c, Renaux A, Lenaerts T, Punab M. Nr5a1 c. 991\u20131g$$>$$c splice-site variant causes familial 46, xy partial gonadal dysgenesis with incomplete penetrance. Clin Endocrinol. 2021;94(4):656\u201366.","journal-title":"Clin Endocrinol"},{"key":"5291_CR15","doi-asserted-by":"publisher","DOI":"10.3389\/fgene.2021.664963","volume":"12","author":"H Dallali","year":"2021","unstructured":"Dallali H, Kheriji N, Kammoun W, Mrad M, Soltani M, Trabelsi H, Hamdi W, Bahlous A, Ben Ahmed M, Mahjoub F, et al. Multiallelic rare variants in BBS genes support an oligogenic ciliopathy in a non-obese juvenile-onset syndromic diabetic patient: a case report. Front Genet. 2021;12: 664963.","journal-title":"Front Genet"},{"key":"5291_CR16","doi-asserted-by":"crossref","unstructured":"Costantini A, Valta H, Suomi A-M, M\u00e4kitie O, Taylan F. Oligogenic inheritance of monoallelic TRIP11, FKBP10, NEK1, TBX5, and NBAS variants leading to a phenotype similar to odontochondrodysplasia. Front Genet. 2021;714.","DOI":"10.3389\/fgene.2021.680838"},{"key":"5291_CR17","doi-asserted-by":"crossref","unstructured":"Mkaouar R, Abdallah LCB, Naouali C, Lahbib S, Turki Z, Elouej S, Bouyacoub Y, Somai M, Mcelreavey K, Bashamboo A, et al. Oligogenic inheritance underlying incomplete penetrance of prokr2 mutations in hypogonadotropic hypogonadism. Front Genet. 2021;12.","DOI":"10.3389\/fgene.2021.665174"},{"issue":"10","key":"5291_CR18","doi-asserted-by":"publisher","first-page":"1946","DOI":"10.1016\/j.ajhg.2021.08.010","volume":"108","author":"S Mukherjee","year":"2021","unstructured":"Mukherjee S, Cogan JD, Newman JH, Phillips JA III, Hamid R, Network UD, Meiler J, Capra JA. Identifying digenic disease genes via machine learning in the undiagnosed diseases network. Am J Hum Genet. 2021;108(10):1946\u201363.","journal-title":"Am J Hum Genet"},{"key":"5291_CR19","doi-asserted-by":"publisher","first-page":"3639","DOI":"10.1016\/j.csbj.2022.07.011","volume":"20","author":"Y Yuan","year":"2022","unstructured":"Yuan Y, Zhang L, Long Q, Jiang H, Li M. An accurate prediction model of digenic interaction for estimating pathogenic gene pairs of human diseases. Comput Struct Biotechnol J. 2022;20:3639\u201352.","journal-title":"Comput Struct Biotechnol J"},{"issue":"5","key":"5291_CR20","doi-asserted-by":"publisher","first-page":"1623","DOI":"10.1016\/j.patcog.2014.11.014","volume":"48","author":"Z Sun","year":"2015","unstructured":"Sun Z, Song Q, Zhu X, Sun H, Xu B, Zhou Y. A novel ensemble method for classifying imbalanced data. Pattern Recognit. 2015;48(5):1623\u201337.","journal-title":"Pattern Recognit"},{"key":"5291_CR21","doi-asserted-by":"crossref","unstructured":"Nachtegael C, Gravel B, Dillen A, Smits, G, Now\u00e9 A, Papadimitriou S, Lenaerts T. Scaling up oligogenic diseases research with OLIDA: the oligogenic diseases database. Database 2022;2022.","DOI":"10.1093\/database\/baac023"},{"issue":"10","key":"5291_CR22","doi-asserted-by":"publisher","first-page":"1122","DOI":"10.1038\/s41592-021-01205-4","volume":"18","author":"I Walsh","year":"2021","unstructured":"Walsh I, Fishman D, Garcia-Gasulla D, Titma T, Pollastri G, Harrow J, Psomopoulos FE, Tosatto SC. DOME: recommendations for supervised machine learning validation in biology. Nat Methods. 2021;18(10):1122\u20137.","journal-title":"Nat Methods"},{"issue":"D1","key":"5291_CR23","doi-asserted-by":"publisher","first-page":"886","DOI":"10.1093\/nar\/gky1016","volume":"47","author":"P Rentzsch","year":"2018","unstructured":"Rentzsch P, Witten D, Cooper GM, Shendure J, Kircher M. CADD: predicting the deleteriousness of variants throughout the human genome. Nucleic Acids Res. 2018;47(D1):886\u201394. https:\/\/doi.org\/10.1093\/nar\/gky1016.","journal-title":"Nucleic Acids Res"},{"issue":"12","key":"5291_CR24","doi-asserted-by":"publisher","first-page":"1751","DOI":"10.1093\/BIOINFORMATICS\/BTX028","volume":"33","author":"HA Shihab","year":"2017","unstructured":"Shihab HA, Rogers MF, Campbell C, Gaunt TR. HIPred: an integrative approach to predicting haploinsufficient genes. Bioinformatics. 2017;33(12):1751. https:\/\/doi.org\/10.1093\/BIOINFORMATICS\/BTX028.","journal-title":"Bioinformatics"},{"issue":"12","key":"5291_CR25","doi-asserted-by":"publisher","first-page":"496","DOI":"10.1016\/s0169-5347(00)01994-7","volume":"15","author":"Z Yang","year":"2000","unstructured":"Yang Z, Bielawski JP. Statistical methods for detecting molecular adaptation. Trends Ecol Evol. 2000;15(12):496\u2013503. https:\/\/doi.org\/10.1016\/s0169-5347(00)01994-7.","journal-title":"Trends Ecol Evol"},{"issue":"20","key":"5291_CR26","doi-asserted-by":"publisher","first-page":"3065","DOI":"10.1093\/BIOINFORMATICS\/BTW381","volume":"32","author":"JS Hsu","year":"2016","unstructured":"Hsu JS, Kwan JSH, Pan Z, Garcia-Barcelo MM, Sham PC, Li M. Inheritance-mode specific pathogenicity prioritization (ISPP) for human protein coding genes. Bioinformatics. 2016;32(20):3065\u201371. https:\/\/doi.org\/10.1093\/BIOINFORMATICS\/BTW381.","journal-title":"Bioinformatics"},{"issue":"1","key":"5291_CR27","doi-asserted-by":"publisher","first-page":"256","DOI":"10.1186\/1471-2164-15-256","volume":"15","author":"Y Itan","year":"2014","unstructured":"Itan Y, Mazel M, Mazel B, Abhyankar A, Nitschke P, Quintana-Murci L, Boisson-Dupuis S, Boisson B, Abel L, Zhang S-Y, Casanova J-L. HGCS: an online tool for prioritizing disease-causing gene variants by biological distance. BMC Genom. 2014;15(1):256. https:\/\/doi.org\/10.1186\/1471-2164-15-256.","journal-title":"BMC Genom"},{"issue":"D1","key":"5291_CR28","doi-asserted-by":"publisher","first-page":"55","DOI":"10.1093\/nar\/gky1155","volume":"47","author":"T Obayashi","year":"2019","unstructured":"Obayashi T, Kagaya Y, Aoki Y, Tadaka S, Kinoshita K. COXPRESdb v7: a gene coexpression database for 11 animal species supported by 23 coexpression platforms for technical evaluation and evolutionary inference. Nucleic Acids Res. 2019;47(D1):55\u201362. https:\/\/doi.org\/10.1093\/nar\/gky1155.","journal-title":"Nucleic Acids Res"},{"issue":"5","key":"5291_CR29","doi-asserted-by":"publisher","first-page":"4","DOI":"10.1186\/1471-2105-9-S5-S4","volume":"9","author":"C Pesquita","year":"2008","unstructured":"Pesquita C, Faria D, Bastos H, Ferreira AEN, Falc\u00e3o AO, Couto FM. Metrics for GO based protein semantic similarity: a systematic evaluation. BMC Bioinform. 2008;9(5):4. https:\/\/doi.org\/10.1186\/1471-2105-9-S5-S4.","journal-title":"BMC Bioinform"},{"issue":"8","key":"5291_CR30","doi-asserted-by":"publisher","first-page":"690","DOI":"10.1038\/nmeth.2561","volume":"10","author":"A Calderone","year":"2013","unstructured":"Calderone A, Castagnoli L, Cesareni G. mentha: a resource for browsing integrated protein-interaction networks. Nat Methods. 2013;10(8):690\u20131. https:\/\/doi.org\/10.1038\/nmeth.2561.","journal-title":"Nat Methods"},{"issue":"D1","key":"5291_CR31","doi-asserted-by":"publisher","first-page":"595","DOI":"10.1093\/nar\/gkx994","volume":"46","author":"S Lee","year":"2018","unstructured":"Lee S, Zhang C, Arif M, Liu Z, Benfeitas R, Bidkhori G, Deshmukh S, Al Shobky M, Lovric A, Boren J, Nielsen J, Uhlen M, Mardinoglu A. TCSBN: a database of tissue and cancer specific biological networks. Nucleic Acids Res. 2018;46(D1):595\u2013600. https:\/\/doi.org\/10.1093\/nar\/gkx994.","journal-title":"Nucleic Acids Res"},{"issue":"D1","key":"5291_CR32","doi-asserted-by":"publisher","first-page":"605","DOI":"10.1093\/nar\/gkaa1074","volume":"49","author":"D Szklarczyk","year":"2021","unstructured":"Szklarczyk D, Gable AL, Nastou KC, Lyon D, Kirsch R, Pyysalo S, Doncheva NT, Legeay M, Fang T, Bork P, Jensen LJ, von Mering C. The STRING database in 2021: customizable protein-protein networks, and functional characterization of user-uploaded gene\/measurement sets. Nucleic Acids Res. 2021;49(D1):605\u201312. https:\/\/doi.org\/10.1093\/nar\/gkaa1074.","journal-title":"Nucleic Acids Res"},{"issue":"D1","key":"5291_CR33","doi-asserted-by":"publisher","first-page":"687","DOI":"10.1093\/nar\/gkab1028","volume":"50","author":"M Gillespie","year":"2022","unstructured":"Gillespie M, Jassal B, Stephan R, Milacic M, Rothfels K, Senff-Ribeiro A, Griss J, Sevilla C, Matthews L, Gong C, Deng C, Varusai T, Ragueneau E, Haider Y, May B, Shamovsky V, Weiser J, Brunson T, Sanati N, Beckman L, Shao X, Fabregat A, Sidiropoulos K, Murillo J, Viteri G, Cook J, Shorser S, Bader G, Demir E, Sander C, Haw R, Wu G, Stein L, Hermjakob H, D\u2019Eustachio P. The reactome pathway knowledgebase 2022. Nucleic Acids Res. 2022;50(D1):687\u201392. https:\/\/doi.org\/10.1093\/nar\/gkab1028.","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"5291_CR34","doi-asserted-by":"publisher","first-page":"25","DOI":"10.1038\/75556","volume":"25","author":"M Ashburner","year":"2000","unstructured":"Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000;25(1):25\u20139. https:\/\/doi.org\/10.1038\/75556.","journal-title":"Nat Genet"},{"key":"5291_CR35","doi-asserted-by":"publisher","unstructured":"Gene Ontology Consortium. The Gene Ontology resource: enriching a GOld mine. Nucleic Acids Res. 2021;49(D1):325\u201334. https:\/\/doi.org\/10.1093\/nar\/gkaa1113.","DOI":"10.1093\/nar\/gkaa1113"},{"issue":"D1","key":"5291_CR36","doi-asserted-by":"publisher","first-page":"344","DOI":"10.1093\/nar\/gkaa977","volume":"49","author":"M Blum","year":"2021","unstructured":"Blum M, Chang H-Y, Chuguransky S, Grego T, Kandasaamy S, Mitchell A, Nuka G, Paysan-Lafosse T, Qureshi M, Raj S, Richardson L, Salazar GA, Williams L, Bork P, Bridge A, Gough J, Haft DH, Letunic I, Marchler-Bauer A, Mi H, Natale DA, Necci M, Orengo CA, Pandurangan AP, Rivoire C, Sigrist CJA, Sillitoe I, Thanki N, Thomas PD, Tosatto SCE, Wu CH, Bateman A, Finn RD. The InterPro protein families and domains database: 20 years on. Nucleic Acids Res. 2021;49(D1):344\u201354. https:\/\/doi.org\/10.1093\/nar\/gkaa977.","journal-title":"Nucleic Acids Res"},{"issue":"D1","key":"5291_CR37","doi-asserted-by":"publisher","first-page":"559","DOI":"10.1093\/nar\/gky973","volume":"47","author":"M Giurgiu","year":"2019","unstructured":"Giurgiu M, Reinhard J, Brauner B, Dunger-Kaltenbach I, Fobo G, Frishman G, Montrone C, Ruepp A. CORUM: the comprehensive resource of mammalian protein complexes-2019. Nucleic Acids Res. 2019;47(D1):559\u201363. https:\/\/doi.org\/10.1093\/nar\/gky973.","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"5291_CR38","doi-asserted-by":"publisher","first-page":"269","DOI":"10.1007\/BF01386390","volume":"1","author":"EW Dijkstra","year":"1959","unstructured":"Dijkstra EW. A note on two problems in connexion with graphs. Numer Math. 1959;1(1):269\u201371. https:\/\/doi.org\/10.1007\/BF01386390.","journal-title":"Numer. Math."},{"issue":"8","key":"5291_CR39","doi-asserted-by":"publisher","first-page":"1003709","DOI":"10.1371\/JOURNAL.PGEN.1003709","volume":"9","author":"S Petrovski","year":"2013","unstructured":"Petrovski S, Wang Q, Heinzen EL, Allen AS, Goldstein DB. Genic intolerance to functional variation and the interpretation of personal genomes. PLoS Genet. 2013;9(8):1003709. https:\/\/doi.org\/10.1371\/JOURNAL.PGEN.1003709.","journal-title":"PLoS Genet"},{"issue":"7","key":"5291_CR40","first-page":"13","volume":"1","author":"AG Karegowda","year":"2010","unstructured":"Karegowda AG, Jayaram M, Manjunath A. Feature subset selection problem using wrapper approach in supervised learning. Int J Comput Appl. 2010;1(7):13\u20137.","journal-title":"Int J Comput Appl"},{"key":"5291_CR41","doi-asserted-by":"publisher","unstructured":"Breiman L. Random forests. J Mach Learn. 2001;45(1):5\u201332. https:\/\/doi.org\/10.1017\/CBO9781107415324.004. arXiv:1011.1669v3","DOI":"10.1017\/CBO9781107415324.004"},{"key":"5291_CR42","unstructured":"Chen C, Liaw A, Breiman L, et al. Using random forest to learn imbalanced data. Technical report 1-12 2004."},{"issue":"17","key":"5291_CR43","first-page":"1","volume":"18","author":"G Lema\u00eetre","year":"2017","unstructured":"Lema\u00eetre G, Nogueira F, Aridas CK. Imbalanced-learn: a python toolbox to tackle the curse of imbalanced datasets in machine learning. J Mach Learn Res. 2017;18(17):1\u20135.","journal-title":"J Mach Learn Res"},{"issue":"1","key":"5291_CR44","doi-asserted-by":"publisher","DOI":"10.1016\/j.xhgg.2022.100165","volume":"4","author":"S Papadimitriou","year":"2023","unstructured":"Papadimitriou S, Gravel B, Nachtegael C, De Baere E, Loeys B, Vikkula M, Smits G, Lenaerts T. Toward reporting standards for the pathogenicity of variant combinations involved in multilocus\/oligogenic diseases. Hum Genet Genom Adv. 2023;4(1): 100165. https:\/\/doi.org\/10.1016\/j.xhgg.2022.100165.","journal-title":"Hum Genet Genom Adv"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-023-05291-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-023-05291-3\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-023-05291-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,1]],"date-time":"2023-05-01T12:03:40Z","timestamp":1682942620000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-023-05291-3"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,5,1]]},"references-count":44,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,12]]}},"alternative-id":["5291"],"URL":"https:\/\/doi.org\/10.1186\/s12859-023-05291-3","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,5,1]]},"assertion":[{"value":"6 December 2022","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"14 April 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"1 May 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"179"}}