{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,13]],"date-time":"2026-02-13T15:34:43Z","timestamp":1770996883358,"version":"3.50.1"},"reference-count":29,"publisher":"Springer Science and Business Media LLC","issue":"3","license":[{"start":{"date-parts":[[2019,11,15]],"date-time":"2019-11-15T00:00:00Z","timestamp":1573776000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2019,11,15]],"date-time":"2019-11-15T00:00:00Z","timestamp":1573776000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"Department of Epidemiology and Biostatistics, Amsterdam UMC, VU University Amsterdam"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Adv Data Anal Classif"],"published-print":{"date-parts":[[2020,9]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>This paper introduces the paired lasso: a generalisation of the lasso for paired covariate settings. Our aim is to predict a single response from two high-dimensional covariate sets. We assume a one-to-one correspondence between the covariate sets, with each covariate in one set forming a pair with a covariate in the other set. Paired covariates arise, for example, when two transformations of the same data are available. It is often unknown which of the two covariate sets leads to better predictions, or whether the two covariate sets complement each other. The paired lasso addresses this problem by weighting the covariates to improve the selection from the covariate sets and the covariate pairs. It thereby combines information from both covariate sets and accounts for the paired structure. 
We tested the paired lasso on more than 2000 classification problems with experimental genomics data, and found that for estimating sparse but predictive models, the paired lasso outperforms the standard and the adaptive lasso. The R package is available from <jats:sc>cran<\/jats:sc>.<\/jats:p>","DOI":"10.1007\/s11634-019-00375-6","type":"journal-article","created":{"date-parts":[[2019,11,15]],"date-time":"2019-11-15T15:03:04Z","timestamp":1573830184000},"page":"571-588","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["Sparse classification with paired covariates"],"prefix":"10.1007","volume":"14","author":[{"given":"Armin","family":"Rauschenberger","sequence":"first","affiliation":[]},{"given":"Iuliana","family":"Cioc\u0103nea-Teodorescu","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0134-8482","authenticated-orcid":false,"given":"Marianne A.","family":"Jonker","sequence":"additional","affiliation":[]},{"given":"Ren\u00e9e X.","family":"Menezes","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4780-8472","authenticated-orcid":false,"given":"Mark A.","family":"van de Wiel","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2019,11,15]]},"reference":[{"issue":"17","key":"375_CR1","doi-asserted-by":"publisher","first-page":"i413","DOI":"10.1093\/bioinformatics\/btw449","volume":"32","author":"N Aben","year":"2016","unstructured":"Aben N, Vis DJ, Michaut M, Wessels LF (2016) TANDEM: a two-stage approach to maximize interpretability of drug response models based on multiple molecular data types. Bioinformatics 32(17):i413\u2013i420. 
https:\/\/doi.org\/10.1093\/bioinformatics\/btw449","journal-title":"Bioinformatics"},{"issue":"1","key":"375_CR2","doi-asserted-by":"publisher","first-page":"39","DOI":"10.2202\/1544-6115.1703","volume":"10","author":"LC Bergersen","year":"2011","unstructured":"Bergersen LC, Glad IK, Lyng H (2011) Weighted lasso with data integration. Stat Appl Genet Mol Biol 10(1):39. https:\/\/doi.org\/10.2202\/1544-6115.1703","journal-title":"Stat Appl Genet Mol Biol"},{"key":"375_CR3","doi-asserted-by":"publisher","first-page":"7691937","DOI":"10.1155\/2017\/7691937","volume":"2017","author":"AL Boulesteix","year":"2017","unstructured":"Boulesteix AL, De Bin R, Jiang X, Fuchs M (2017) IPF-LASSO: Integrative $$L_1$$-penalized regression with penalty factors for prediction based on multi-omics data. Comput Math Methods Med 2017:7691937. https:\/\/doi.org\/10.1155\/2017\/7691937 (ipflasso)","journal-title":"Comput Math Methods Med"},{"key":"375_CR4","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-20192-9","volume-title":"Statistics for high-dimensional data: methods, theory and applications","author":"P B\u00fchlmann","year":"2011","unstructured":"B\u00fchlmann P, van de Geer S (2011) Statistics for high-dimensional data: methods, theory and applications. Springer, Berlin. https:\/\/doi.org\/10.1007\/978-3-642-20192-9"},{"issue":"2","key":"375_CR5","doi-asserted-by":"publisher","first-page":"4220","DOI":"10.1214\/17-EJS1317","volume":"11","author":"F Campbell","year":"2017","unstructured":"Campbell F, Allen GI (2017) Within group variable selection through the exclusive lasso. Electron J Stat 11(2):4220\u20134257. 
https:\/\/doi.org\/10.1214\/17-EJS1317","journal-title":"Electron J Stat"},{"issue":"8","key":"375_CR6","doi-asserted-by":"publisher","first-page":"e71","DOI":"10.1093\/nar\/gkv1507","volume":"44","author":"A Colaprico","year":"2016","unstructured":"Colaprico A, Silva TC, Olsen C, Garofano L, Cava C, Garolini D, Sabedot TS, Malta TM, Pagnotta SM, Castiglioni I et al (2016) TCGAbiolinks: an R\/Bioconductor package for integrative analysis of TCGA data. Nucleic Acids Res 44(8):e71. https:\/\/doi.org\/10.1093\/nar\/gkv1507","journal-title":"Nucleic Acids Res"},{"key":"375_CR7","first-page":"313","volume-title":"Advances in neural information processing systems 16","author":"C Cortes","year":"2004","unstructured":"Cortes C, Mohri M (2004) AUC optimization vs. error rate minimization. In: Thrun S, Saul LK, Sch\u00f6lkopf B (eds) Advances in neural information processing systems 16. MIT Press, Cambridge, pp 313\u2013320"},{"key":"375_CR8","doi-asserted-by":"publisher","unstructured":"Dey KK, Stephens M (2018) CorShrink: empirical Bayes shrinkage estimation of correlations, with applications. bioRxiv https:\/\/doi.org\/10.1101\/368316","DOI":"10.1101\/368316"},{"issue":"5","key":"375_CR9","doi-asserted-by":"publisher","first-page":"849","DOI":"10.1111\/j.1467-9868.2008.00674.x","volume":"70","author":"J Fan","year":"2008","unstructured":"Fan J, Lv J (2008) Sure independence screening for ultrahigh dimensional feature space. J R Stat Soc Ser B (Stat Methodol) 70(5):849\u2013911. https:\/\/doi.org\/10.1111\/j.1467-9868.2008.00674.x","journal-title":"J R Stat Soc Ser B (Stat Methodol)"},{"key":"375_CR10","doi-asserted-by":"publisher","DOI":"10.18637\/jss.v033.i01","author":"J Friedman","year":"2010","unstructured":"Friedman J, Hastie T, Tibshirani R (2010) Regularization paths for generalized linear models via coordinate descent. J Stat Softw. 
https:\/\/doi.org\/10.18637\/jss.v033.i01 (glmnet)","journal-title":"J Stat Softw"},{"issue":"1","key":"375_CR11","doi-asserted-by":"publisher","first-page":"488","DOI":"10.1186\/1471-2105-12-488","volume":"12","author":"S Gade","year":"2011","unstructured":"Gade S, Porzelius C, F\u00e4lth M, Brase JC, Wuttig D, Kuner R, Binder H, S\u00fcltmann H, Bei\u00dfbarth T (2011) Graph based fusion of miRNA and mRNA expression data improves clinical outcome prediction in prostate cancer. BMC Bioinform 12(1):488. https:\/\/doi.org\/10.1186\/1471-2105-12-488","journal-title":"BMC Bioinform"},{"issue":"4","key":"375_CR12","first-page":"1603","volume":"18","author":"J Huang","year":"2008","unstructured":"Huang J, Ma S, Zhang CH (2008) Adaptive lasso for sparse high-dimensional regression models. Stat Sin 18(4):1603\u20131618","journal-title":"Stat Sin"},{"key":"375_CR13","doi-asserted-by":"publisher","first-page":"20567","DOI":"10.1038\/srep20567","volume":"6","author":"X Huang","year":"2016","unstructured":"Huang X, Stern DF, Zhao H (2016) Transcriptional profiles from paired normal samples offer complementary information on cancer patient survival-evidence from TCGA pan-cancer data. Sci Rep 6:20567. https:\/\/doi.org\/10.1038\/srep20567","journal-title":"Sci Rep"},{"issue":"2","key":"375_CR14","doi-asserted-by":"publisher","first-page":"364","DOI":"10.1093\/biostatistics\/kxv049","volume":"17","author":"S Reid","year":"2016","unstructured":"Reid S, Tibshirani R (2016) Sparse regression and marginal testing using cluster prototypes. Biostatistics 17(2):364\u2013376. https:\/\/doi.org\/10.1093\/biostatistics\/kxv049","journal-title":"Biostatistics"},{"issue":"3","key":"375_CR15","doi-asserted-by":"publisher","first-page":"R25","DOI":"10.1186\/gb-2010-11-3-r25","volume":"11","author":"MD Robinson","year":"2010","unstructured":"Robinson MD, Oshlack A (2010) A scaling normalization method for differential expression analysis of RNA-seq data. Genome Biol 11(3):R25. 
https:\/\/doi.org\/10.1186\/gb-2010-11-3-r25 (edgeR)","journal-title":"Genome Biol"},{"key":"375_CR16","doi-asserted-by":"publisher","first-page":"259","DOI":"10.1007\/978-3-319-45809-0_14","volume-title":"Statistical analysis of proteomics, metabolomics, and lipidomics data using mass spectrometry","author":"M Rodr\u00edguez-Girondo","year":"2017","unstructured":"Rodr\u00edguez-Girondo M, Kakourou A, Salo P, Perola M, Mesker WE, Tollenaar RA, Houwing-Duistermaat J, Mertens BJ (2017) On the combination of omics data for prediction of binary outcomes. In: Datta S, Mertens BJ (eds) Statistical analysis of proteomics, metabolomics, and lipidomics data using mass spectrometry. Springer, Cham, pp 259\u2013275. https:\/\/doi.org\/10.1007\/978-3-319-45809-0_14"},{"issue":"4","key":"375_CR17","doi-asserted-by":"publisher","first-page":"555","DOI":"10.1093\/bioinformatics\/18.4.555","volume":"18","author":"I Shmulevich","year":"2002","unstructured":"Shmulevich I, Zhang W (2002) Binary analysis and optimization-based normalization of gene expression data. Bioinformatics 18(4):555\u2013565. https:\/\/doi.org\/10.1093\/bioinformatics\/18.4.555","journal-title":"Bioinformatics"},{"issue":"6","key":"375_CR18","doi-asserted-by":"publisher","first-page":"2973","DOI":"10.1093\/nar\/gkx082","volume":"45","author":"AG Telonis","year":"2017","unstructured":"Telonis AG, Magee R, Loher P, Chervoneva I, Londin E, Rigoutsos I (2017) Knowledge about the presence or absence of miRNA isoforms (isomiRs) can successfully discriminate amongst 32 TCGA cancer types. Nucleic Acids Res 45(6):2973\u20132985. 
https:\/\/doi.org\/10.1093\/nar\/gkx082","journal-title":"Nucleic Acids Res"},{"issue":"4","key":"375_CR19","doi-asserted-by":"publisher","first-page":"685","DOI":"10.1002\/bimj.201500234","volume":"59","author":"N Tern\u00e8s","year":"2017","unstructured":"Tern\u00e8s N, Rotolo F, Heinze G, Michiels S (2017) Identification of biomarker-by-treatment interactions in randomized clinical trials with survival outcomes and high-dimensional spaces. Biom J 59(4):685\u2013701. https:\/\/doi.org\/10.1002\/bimj.201500234","journal-title":"Biom J"},{"issue":"1","key":"375_CR20","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1111\/j.2517-6161.1996.tb02080.x","volume":"58","author":"R Tibshirani","year":"1996","unstructured":"Tibshirani R (1996) Regression shrinkage and selection via the lasso. J R Stat Soc Ser B (Methodol) 58(1):267\u2013288","journal-title":"J R Stat Soc Ser B (Methodol)"},{"issue":"1","key":"375_CR21","doi-asserted-by":"publisher","first-page":"91","DOI":"10.1111\/j.1467-9868.2005.00490.x","volume":"67","author":"R Tibshirani","year":"2005","unstructured":"Tibshirani R, Saunders M, Rosset S, Zhu J, Knight K (2005) Sparsity and smoothness via the fused lasso. J R Stat Soc Ser B (Stat Methodol) 67(1):91\u2013108. https:\/\/doi.org\/10.1111\/j.1467-9868.2005.00490.x","journal-title":"J R Stat Soc Ser B (Stat Methodol)"},{"issue":"3","key":"375_CR22","doi-asserted-by":"publisher","first-page":"368","DOI":"10.1002\/sim.6732","volume":"35","author":"MA van de Wiel","year":"2016","unstructured":"van de Wiel MA, Lien TG, Verlaat W, van Wieringen WN, Wilting SM (2016) Better prediction by use of co-data: adaptive group-regularized ridge regression. Stat Med 35(3):368\u2013381. 
https:\/\/doi.org\/10.1002\/sim.6732 (GRridge)","journal-title":"Stat Med"},{"issue":"1","key":"375_CR23","doi-asserted-by":"publisher","first-page":"25","DOI":"10.2202\/1544-6115.1309","volume":"6","author":"MJ van der Laan","year":"2007","unstructured":"van der Laan MJ, Polley EC, Hubbard AE (2007) Super learner. Stat Appl Genet Mol Biol 6(1):25. https:\/\/doi.org\/10.2202\/1544-6115.1309","journal-title":"Stat Appl Genet Mol Biol"},{"issue":"5","key":"375_CR24","doi-asserted-by":"publisher","first-page":"1590","DOI":"10.1016\/j.csda.2008.05.021","volume":"53","author":"WN van Wieringen","year":"2009","unstructured":"van Wieringen WN, Kun D, Hampel R, Boulesteix AL (2009) Survival prediction using gene expression data: a review and comparison. Comput Stat Data Anal 53(5):1590\u20131603. https:\/\/doi.org\/10.1016\/j.csda.2008.05.021","journal-title":"Comput Stat Data Anal"},{"key":"375_CR25","doi-asserted-by":"publisher","DOI":"10.1002\/0470011815.b2a15181","volume-title":"Encyclopedia of biostatistics","author":"PH Westfall","year":"2005","unstructured":"Westfall PH (2005) Combining $$P$$ values. In: Armitage P, Colton T (eds) Encyclopedia of biostatistics. Wiley, Hoboken. https:\/\/doi.org\/10.1002\/0470011815.b2a15181"},{"issue":"1","key":"375_CR26","doi-asserted-by":"publisher","first-page":"49","DOI":"10.1111\/j.1467-9868.2005.00532.x","volume":"68","author":"M Yuan","year":"2006","unstructured":"Yuan M, Lin Y (2006) Model selection and estimation in regression with grouped variables. J R Stat Soc Ser B (Stat Methodol) 68(1):49\u201367. https:\/\/doi.org\/10.1111\/j.1467-9868.2005.00532.x","journal-title":"J R Stat Soc Ser B (Stat Methodol)"},{"issue":"476","key":"375_CR27","doi-asserted-by":"publisher","first-page":"1418","DOI":"10.1198\/016214506000000735","volume":"101","author":"H Zou","year":"2006","unstructured":"Zou H (2006) The adaptive lasso and its oracle properties. J Am Stat Assoc 101(476):1418\u20131429. 
https:\/\/doi.org\/10.1198\/016214506000000735","journal-title":"J Am Stat Assoc"},{"issue":"2","key":"375_CR28","doi-asserted-by":"publisher","first-page":"301","DOI":"10.1111\/j.1467-9868.2005.00503.x","volume":"67","author":"H Zou","year":"2005","unstructured":"Zou H, Hastie T (2005) Regularization and variable selection via the elastic net. J R Stat Soc Ser B (Stat Methodol) 67(2):301\u2013320. https:\/\/doi.org\/10.1111\/j.1467-9868.2005.00503.x","journal-title":"J R Stat Soc Ser B (Stat Methodol)"},{"issue":"1","key":"375_CR29","doi-asserted-by":"publisher","first-page":"e85150","DOI":"10.1371\/journal.pone.0085150","volume":"9","author":"I Zwiener","year":"2014","unstructured":"Zwiener I, Frisch B, Binder H (2014) Transforming RNA-Seq data to improve the performance of prognostic gene signatures. PLoS ONE 9(1):e85150. https:\/\/doi.org\/10.1371\/journal.pone.0085150","journal-title":"PLoS ONE"}],"container-title":["Advances in Data Analysis and Classification"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s11634-019-00375-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s11634-019-00375-6\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s11634-019-00375-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,7,27]],"date-time":"2024-07-27T00:49:13Z","timestamp":1722041353000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s11634-019-00375-6"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,11,15]]},"references-count":29,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2020,9]]}},"alternative-id":["375"],"URL":"https:\/\/doi.org\/10.1007\/
s11634-019-00375-6","relation":{},"ISSN":["1862-5347","1862-5355"],"issn-type":[{"value":"1862-5347","type":"print"},{"value":"1862-5355","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,11,15]]},"assertion":[{"value":"22 October 2018","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"12 July 2019","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"12 October 2019","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"15 November 2019","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Compliance with ethical standards"}},{"value":"The authors declare that they have no potential conflicts of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}},{"value":"The R package <tt>palasso<\/tt> contains a vignette for reproducing all results.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Reproducibility"}},{"value":"The R package <tt>palasso<\/tt> runs on any operating system equipped with R-3.5.0 or later. It is available from <scp>cran<\/scp> under a free software license:.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Software"}}]}}