{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T06:17:45Z","timestamp":1772173065367,"version":"3.50.1"},"update-to":[{"DOI":"10.1371\/journal.pcbi.1010180","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2022,6,10]],"date-time":"2022-06-10T00:00:00Z","timestamp":1654819200000}}],"reference-count":41,"publisher":"Public Library of Science (PLoS)","issue":"5","license":[{"start":{"date-parts":[[2022,5,31]],"date-time":"2022-05-31T00:00:00Z","timestamp":1653955200000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100003329","name":"Ministerio de Econom\u00eda y Competitividad","doi-asserted-by":"publisher","award":["PID2019-110344RB-I00"],"award-info":[{"award-number":["PID2019-110344RB-I00"]}],"id":[{"id":"10.13039\/501100003329","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100003086","name":"Eusko Jaurlaritza","doi-asserted-by":"crossref","award":["PIBA_2020_01_0055"],"award-info":[{"award-number":["PIBA_2020_01_0055"]}],"id":[{"id":"10.13039\/501100003086","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100004587","name":"Instituto de Salud Carlos III","doi-asserted-by":"publisher","award":["PI16\/02024, PI17\/00701"],"award-info":[{"award-number":["PI16\/02024, PI17\/00701"]}],"id":[{"id":"10.13039\/501100004587","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100014139","name":"CIBERONC","doi-asserted-by":"crossref","award":["CB16\/12\/00489"],"award-info":[{"award-number":["CB16\/12\/00489"]}],"id":[{"id":"10.13039\/501100014139","id-type":"DOI","asserted-by":"crossref"}]},{"name":"ERANET program ERAPerMed","award":["MEET-AML"],"award-info":[{"award-number":["MEET-AML"]}]},{"DOI":"10.13039\/501100003329","name":"Ministerio de Econom\u00eda y Competitividad","doi-asserted-by":"publisher","award":["Explora RTHALMY"],"award-info":[{"award-number":["Explora RTHALMY"]}],"id":[{"id":"10.13039\/501100003329","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100005004","name":"Ekonomiaren Garapen eta Lehiakortasun Saila, Eusko Jaurlaritza","doi-asserted-by":"publisher","award":["KK-2020\/00008"],"award-info":[{"award-number":["KK-2020\/00008"]}],"id":[{"id":"10.13039\/501100005004","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Cancer Research UK and AECC under the Accelerator Award Programme","award":["C355\/A26819"],"award-info":[{"award-number":["C355\/A26819"]}]},{"DOI":"10.13039\/100008054","name":"Fundaci\u00f3n Ram\u00f3n Areces","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100008054","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004587","name":"Instituto de Salud Carlos III","doi-asserted-by":"publisher","award":["FI17\/00297"],"award-info":[{"award-number":["FI17\/00297"]}],"id":[{"id":"10.13039\/501100004587","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100003086","name":"Eusko Jaurlaritza","doi-asserted-by":"publisher","award":["PRE_2018.2.0297"],"award-info":[{"award-number":["PRE_2018.2.0297"]}],"id":[{"id":"10.13039\/501100003086","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["www.ploscompbiol.org"],"crossmark-restriction":false},"short-container-title":["PLoS Comput Biol"],"abstract":"<jats:p>With the frenetic growth of high-dimensional datasets in different biomedical domains, there is an urgent need to develop predictive methods able to deal with this complexity. Feature selection is a relevant strategy in machine learning to address this challenge. We introduce a novel feature selection algorithm for linear regression called BOSO (Bilevel Optimization Selector Operator). We conducted a benchmark of BOSO with key algorithms in the literature, finding a superior accuracy for feature selection in high-dimensional datasets. Proof-of-concept of BOSO for predicting drug sensitivity in cancer is presented. A detailed analysis is carried out for methotrexate, a well-studied drug targeting cancer metabolism.<\/jats:p>","DOI":"10.1371\/journal.pcbi.1010180","type":"journal-article","created":{"date-parts":[[2022,5,31]],"date-time":"2022-05-31T13:39:47Z","timestamp":1654004387000},"page":"e1010180","update-policy":"https:\/\/doi.org\/10.1371\/journal.pcbi.corrections_policy","source":"Crossref","is-referenced-by-count":5,"title":["BOSO: A novel feature selection algorithm for linear regression with high-dimensional data"],"prefix":"10.1371","volume":"18","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-3769-5419","authenticated-orcid":true,"given":"Luis V.","family":"Valc\u00e1rcel","sequence":"first","affiliation":[]},{"given":"Edurne","family":"San Jos\u00e9-En\u00e9riz","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8401-4087","authenticated-orcid":true,"given":"Xabier","family":"Cendoya","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3274-2450","authenticated-orcid":true,"given":"\u00c1ngel","family":"Rubio","sequence":"additional","affiliation":[]},{"given":"Xabier","family":"Agirre","sequence":"additional","affiliation":[]},{"given":"Felipe","family":"Pr\u00f3sper","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1155-3105","authenticated-orcid":true,"given":"Francisco J.","family":"Planes","sequence":"additional","affiliation":[]}],"member":"340","published-online":{"date-parts":[[2022,5,31]]},"reference":[{"key":"pcbi.1010180.ref001","doi-asserted-by":"crossref","first-page":"S16","DOI":"10.1038\/527S16a","article-title":"Perspective: Sustaining the big-data ecosystem","volume":"527","author":"PE Bourne","year":"2015","journal-title":"Nature"},{"key":"pcbi.1010180.ref002","doi-asserted-by":"crossref","first-page":"161","DOI":"10.1016\/j.copbio.2019.03.004","article-title":"Big data analytics for personalized medicine","volume":"58","author":"D Cirillo","year":"2019","journal-title":"Curr Opin Biotechnol"},{"key":"pcbi.1010180.ref003","doi-asserted-by":"crossref","first-page":"406","DOI":"10.1038\/nbt.3790","article-title":"Discovering and linking public omics data sets using the Omics Discovery Index","volume":"35","author":"Y Perez-Riverol","year":"2017","journal-title":"Nat Biotechnol"},{"key":"pcbi.1010180.ref004","doi-asserted-by":"crossref","first-page":"1754","DOI":"10.1093\/bioinformatics\/btv037","article-title":"Bayesian feature selection for high-dimensional linear regression via the Ising approximation with applications to genomics","volume":"31","author":"CK Fisher","year":"2015","journal-title":"Bioinformatics"},{"key":"pcbi.1010180.ref005","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/srep36076","article-title":"HASE: Framework for efficient high-dimensional association analyses.","volume":"6","author":"G V. Roshchupkin","year":"2016","journal-title":"Sci Rep."},{"key":"pcbi.1010180.ref006","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1371\/journal.pcbi.1005752","article-title":"mixOmics: An R package for \u2018omics feature selection and multiple data integration","volume":"13","author":"F Rohart","year":"2017","journal-title":"PLoS Comput Biol"},{"key":"pcbi.1010180.ref007","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12859-020-3400-6","article-title":"GARS: Genetic Algorithm for the identification of a Robust Subset of features in high-dimensional datasets","volume":"21","author":"M Chiesa","year":"2020","journal-title":"BMC Bioinformatics"},{"key":"pcbi.1010180.ref008","article-title":"Structured sparsity regularization for analyzing high-dimensional omics data","author":"S. Vinga","year":"2020","journal-title":"Brief Bioinform"},{"key":"pcbi.1010180.ref009","doi-asserted-by":"crossref","first-page":"2323","DOI":"10.1126\/science.290.5500.2323","article-title":"Nonlinear dimensionality reduction by locally linear embedding","volume":"290","author":"ST Roweis","year":"2000","journal-title":"Science (80-)."},{"key":"pcbi.1010180.ref010","doi-asserted-by":"crossref","first-page":"2507","DOI":"10.1093\/bioinformatics\/btm344","article-title":"A review of feature selection techniques in bioinformatics","volume":"23","author":"Y Saeys","year":"2007","journal-title":"bioinformatics"},{"key":"pcbi.1010180.ref011","doi-asserted-by":"crossref","first-page":"70","DOI":"10.1016\/j.neucom.2017.11.077","article-title":"Feature selection in machine learning: A new perspective","volume":"300","author":"J Cai","year":"2018","journal-title":"Neurocomputing"},{"key":"pcbi.1010180.ref012","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1111\/j.2517-6161.1996.tb02080.x","article-title":"Regression shrinkage and selection via the lasso","volume":"58","author":"R. Tibshirani","year":"1996","journal-title":"J R Stat Soc Ser B"},{"key":"pcbi.1010180.ref013","doi-asserted-by":"crossref","first-page":"1091","DOI":"10.1038\/ng.3367","article-title":"A gene-based association method for mapping traits using reference transcriptome data","volume":"47","author":"ER Gamazon","year":"2015","journal-title":"Nat Genet"},{"key":"pcbi.1010180.ref014","first-page":"1","article-title":"Architecture of gene regulatory networks controlling flower development in Arabidopsis thaliana","volume":"9","author":"D Chen","year":"2018","journal-title":"Nat Commun"},{"key":"pcbi.1010180.ref015","doi-asserted-by":"crossref","first-page":"526","DOI":"10.1038\/s41586-018-0623-z","article-title":"Functional genomic landscape of acute myeloid leukaemia","volume":"562","author":"JW Tyner","year":"2018","journal-title":"Nature"},{"key":"pcbi.1010180.ref016","doi-asserted-by":"crossref","first-page":"1217","DOI":"10.1038\/s41587-019-0233-9","article-title":"Blood metabolome predicts gut microbiome \u03b1-diversity in humans","volume":"37","author":"T Wilmanski","year":"2019","journal-title":"Nat Biotechnol"},{"key":"pcbi.1010180.ref017","article-title":"Extended comparisons of best subset selection, forward stepwise selection, and the lasso.","author":"T Hastie","year":"2017"},{"key":"pcbi.1010180.ref018","first-page":"813","article-title":"Best subset selection via a modern optimization lens","author":"D Bertsimas","year":"2016","journal-title":"Ann Stat."},{"key":"pcbi.1010180.ref019","doi-asserted-by":"crossref","first-page":"374","DOI":"10.1016\/j.csda.2006.12.019","article-title":"Relaxed Lasso.","volume":"52","author":"N. Meinshausen","year":"2007","journal-title":"Comput Stat Data Anal"},{"key":"pcbi.1010180.ref020","doi-asserted-by":"crossref","first-page":"1161","DOI":"10.1016\/j.chembiol.2017.08.028","article-title":"Targeting Metabolism for Cancer Therapy.","volume":"24","author":"A Luengo","year":"2017","journal-title":"Cell Chem Biol"},{"key":"pcbi.1010180.ref021","doi-asserted-by":"crossref","first-page":"716","DOI":"10.1109\/TAC.1974.1100705","article-title":"A new look at the statistical model identification","volume":"19","author":"H. Akaike","year":"1974","journal-title":"IEEE Trans Automat Contr"},{"key":"pcbi.1010180.ref022","doi-asserted-by":"crossref","first-page":"461","DOI":"10.1214\/aos\/1176344136","article-title":"Estimating the dimension of a model","volume":"6","author":"G Schwarz","year":"1978","journal-title":"Ann Stat."},{"key":"pcbi.1010180.ref023","doi-asserted-by":"crossref","first-page":"759","DOI":"10.1093\/biomet\/asn034","article-title":"Extended Bayesian information criteria for model selection with large model spaces","volume":"95","author":"J Chen","year":"2008","journal-title":"Biometrika"},{"key":"pcbi.1010180.ref024","doi-asserted-by":"crossref","first-page":"37","DOI":"10.1038\/nrc2294","article-title":"The properties of high-dimensional data spaces: Implications for exploring gene and protein expression data","volume":"8","author":"R Clarke","year":"2008","journal-title":"Nat Rev Cancer"},{"key":"pcbi.1010180.ref025","first-page":"623","article-title":"A comparative analysis of optimization solvers","volume":"20","author":"R Anand","year":"2017","journal-title":"J Stat Manag Syst"},{"key":"pcbi.1010180.ref026","article-title":"Stepwise regression\u2014a backward and forward look","author":"MA Efroymson","year":"1966","journal-title":"Florham Park New Jersey"},{"key":"pcbi.1010180.ref027","doi-asserted-by":"crossref","DOI":"10.1002\/9781118625590","volume-title":"Applied regression analysis","author":"NR Draper","year":"1998"},{"key":"pcbi.1010180.ref028","doi-asserted-by":"crossref","first-page":"955","DOI":"10.1093\/nar\/gks1111","article-title":"Genomics of Drug Sensitivity in Cancer (GDSC): A resource for therapeutic biomarker discovery in cancer cells.","volume":"41","author":"W Yang","year":"2013","journal-title":"Nucleic Acids Res"},{"key":"pcbi.1010180.ref029","article-title":"Next-generation characterization of the Cancer Cell Line Encyclopedia","author":"M Ghandi","year":"2019","journal-title":"Nature"},{"key":"pcbi.1010180.ref030","first-page":"369","article-title":"Ion channels and transporters in the development of drug resistance in cancer cells","author":"EK Hoffmann","year":"2014","journal-title":"Philos Trans R Soc B Biol Sci"},{"key":"pcbi.1010180.ref031","doi-asserted-by":"crossref","first-page":"193","DOI":"10.1016\/j.devcel.2016.12.013","article-title":"Deciphering the Fringe-Mediated Notch Code: Identification of Activating and Inhibiting Sites Allowing Discrimination between Ligands","volume":"40","author":"S Kakuda","year":"2017","journal-title":"Dev Cell"},{"key":"pcbi.1010180.ref032","doi-asserted-by":"crossref","first-page":"258","DOI":"10.1016\/j.bbcan.2010.06.001","article-title":"Targeting Notch signaling pathway to overcome drug resistance for cancer therapy","volume":"1806","author":"Z Wang","year":"2010","journal-title":"Biochim Biophys Acta\u2014Rev Cancer"},{"key":"pcbi.1010180.ref033","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/gm83","article-title":"Networking of differentially expressed genes in human cancer cells resistant to methotrexate","volume":"1","author":"E Selga","year":"2009","journal-title":"Genome Med"},{"key":"pcbi.1010180.ref034","doi-asserted-by":"crossref","first-page":"87","DOI":"10.1016\/j.canlet.2018.08.018","article-title":"CUEDC1 is a primary target of ER\u03b1 essential for the growth of breast cancer cells","volume":"436","author":"R Lopes","year":"2018","journal-title":"Cancer Lett"},{"key":"pcbi.1010180.ref035","first-page":"2014","article-title":"Estrogen-related receptor alpha confers methotrexate resistance via attenuation of reactive oxygen species production and P53 mediated apoptosis in osteosarcoma cells","author":"P Chen","year":"2014","journal-title":"Biomed Res Int"},{"key":"pcbi.1010180.ref036","first-page":"1","article-title":"Elitist Binary Wolf Search Algorithm for Heuristic Feature Selection in High-Dimensional Bioinformatics Datasets.","volume":"7","author":"J Li","year":"2017","journal-title":"Sci Rep"},{"key":"pcbi.1010180.ref037","doi-asserted-by":"crossref","first-page":"525","DOI":"10.1016\/j.patrec.2008.11.012","article-title":"Different metaheuristic strategies to solve the feature selection problem","volume":"30","author":"SC Yusta","year":"2009","journal-title":"Pattern Recognit Lett"},{"key":"pcbi.1010180.ref038","volume-title":"The elements of statistical learning","author":"J Friedman","year":"2001"},{"key":"pcbi.1010180.ref039","doi-asserted-by":"crossref","first-page":"1","DOI":"10.18637\/jss.v033.i01","article-title":"Regularization Paths for Generalized Linear Models via Coordinate Descent","volume":"33","author":"J Friedman","year":"2010","journal-title":"J Stat Softw"},{"key":"pcbi.1010180.ref040","doi-asserted-by":"crossref","first-page":"545","DOI":"10.1007\/s10589-016-9847-8","article-title":"On handling indicator constraints in mixed integer programming","volume":"65","author":"P Belotti","year":"2016","journal-title":"Comput Optim Appl"},{"key":"pcbi.1010180.ref041","doi-asserted-by":"crossref","first-page":"564","DOI":"10.1016\/j.cell.2017.06.010","article-title":"Defining a Cancer Dependency Map","volume":"170","author":"A Tsherniak","year":"2017","journal-title":"Cell"}],"updated-by":[{"DOI":"10.1371\/journal.pcbi.1010180","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2022,6,10]],"date-time":"2022-06-10T00:00:00Z","timestamp":1654819200000}}],"container-title":["PLOS Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1010180","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,9,26]],"date-time":"2024-09-26T01:19:06Z","timestamp":1727313546000},"score":1,"resource":{"primary":{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1010180"}},"subtitle":[],"editor":[{"given":"Sergei L.","family":"Kosakovsky Pond","sequence":"first","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2022,5,31]]},"references-count":41,"journal-issue":{"issue":"5","published-online":{"date-parts":[[2022,5,31]]}},"URL":"https:\/\/doi.org\/10.1371\/journal.pcbi.1010180","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2020.11.18.388579","asserted-by":"object"}]},"ISSN":["1553-7358"],"issn-type":[{"value":"1553-7358","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,5,31]]}}}