{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,6]],"date-time":"2026-03-06T17:50:29Z","timestamp":1772819429807,"version":"3.50.1"},"reference-count":12,"publisher":"Oxford University Press (OUP)","issue":"10","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2010,5,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Until now, much of the focus in cancer has been on biomarker discovery and generating lists of univariately significant genes, as well as epidemiological and clinical measures. These approaches, although significant on their own, are not effective for elucidating the synergistic qualities of the numerous components in complex diseases. These components do not act one at a time, but rather in concert with numerous others. A compelling need exists to develop analytically sound and computationally advanced methods that elucidate a more biologically meaningful understanding of the mechanisms of cancer initiation and progression by taking these interactions into account.<\/jats:p>\n               <jats:p>Results: We propose a novel algorithm, partDSA, for prediction when several variables jointly affect the outcome. In such settings, piecewise constant estimation provides an intuitive approach by elucidating interactions and correlation patterns in addition to main effects. As well as generating \u2018and\u2019 statements similar to previously described methods, partDSA explores and chooses the best among all possible \u2018or\u2019 statements. The immediate benefit of partDSA is the ability to build a parsimonious model with \u2018and\u2019 and \u2018or\u2019 conjunctions that account for the observed biological phenomena. Importantly, partDSA is capable of handling categorical and continuous explanatory variables and outcomes. We evaluate the effectiveness of partDSA in comparison to several adaptive algorithms in simulations; additionally, we perform several data analyses with publicly available data and introduce the implementation of partDSA as an R package.<\/jats:p>\n               <jats:p>Availability: \u00a0http:\/\/cran.r-project.org\/web\/packages\/partDSA\/index.html<\/jats:p>\n               <jats:p>Contact: \u00a0annette.molinaro@yale.edu<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btq142","type":"journal-article","created":{"date-parts":[[2010,4,8]],"date-time":"2010-04-08T00:47:20Z","timestamp":1270687640000},"page":"1357-1363","source":"Crossref","is-referenced-by-count":38,"title":["<i>partDSA<\/i>: deletion\/substitution\/addition algorithm for partitioning the covariate space in prediction"],"prefix":"10.1093","volume":"26","author":[{"given":"Annette M.","family":"Molinaro","sequence":"first","affiliation":[{"name":"1 Division of Biostatistics, Yale University Schools of Public Health and Medicine, 60 College St., New Haven, CT 06519 and 2 Division of Biostatistics, University of California, Berkeley, Earl Warren Hall #7360, Berkeley, CA 94720-7360, USA"}]},{"given":"Karen","family":"Lostritto","sequence":"additional","affiliation":[{"name":"1 Division of Biostatistics, Yale University Schools of Public Health and Medicine, 60 College St., New Haven, CT 06519 and 2 Division of Biostatistics, University of California, Berkeley, Earl Warren Hall #7360, Berkeley, CA 94720-7360, USA"}]},{"given":"Mark","family":"van der Laan","sequence":"additional","affiliation":[{"name":"1 Division of Biostatistics, Yale University Schools of Public Health and Medicine, 60 College St., New Haven, CT 06519 and 2 Division of Biostatistics, University of California, Berkeley, Earl Warren Hall #7360, Berkeley, CA 94720-7360, USA"}]}],"member":"286","published-online":{"date-parts":[[2010,4,7]]},"reference":[{"key":"2023012507503006500_B1","volume-title":"Classification and Regression Trees.","author":"Breiman","year":"1984"},{"key":"2023012507503006500_B2","first-page":"1","article-title":"Multivariate adaptive regression splines","volume":"19","author":"Friedman","year":"1991","journal-title":"Ann. Stat."},{"key":"2023012507503006500_B3","article-title":"Applied nonparametric regression","volume-title":"Number 17 in Econometric Society Monographs.","author":"H\u00e4rdle","year":"1989"},{"key":"2023012507503006500_B4","doi-asserted-by":"crossref","first-page":"626","DOI":"10.1002\/gepi.2001.21.s1.s626","article-title":"Sequence analysis using logic regression","volume":"S1","author":"Kooperberg","year":"2001","journal-title":"Genet. Epidemiol."},{"key":"2023012507503006500_B5","first-page":"219","article-title":"Statistical bioinformatics: a guide for life and biomedical science researchers","volume-title":"Statistical resampling for large screening data analysis such as classical resampling, Bootstrapping, Markov chain Monte Carlo, and statistical simulation and validation strategies.","author":"Molinaro","year":"2010"},{"key":"2023012507503006500_B6","doi-asserted-by":"crossref","first-page":"154","DOI":"10.1016\/j.jmva.2004.02.003","article-title":"Tree-based multivariate regression and density estimation based on right-censored data","volume":"90","author":"Molinaro","year":"2004","journal-title":"J. Multivar. Anal."},{"key":"2023012507503006500_B7","doi-asserted-by":"crossref","first-page":"3301","DOI":"10.1093\/bioinformatics\/bti499","article-title":"Prediction error estimation: a comparison of resampling methods","volume":"21","author":"Molinaro","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012507503006500_B8","author":"Molinaro","year":"2009","journal-title":"partDSA: partitioning using deletion, substitution, and addition moves."},{"key":"2023012507503006500_B9","first-page":"313","article-title":"Meeting the challenges of functional genomics: from the laboratory to the clinic","volume":"5","author":"Quackenbush","year":"2004","journal-title":"Preclinica2"},{"key":"2023012507503006500_B10","doi-asserted-by":"crossref","first-page":"1937","DOI":"10.1056\/NEJMoa012914","article-title":"The use of molecular profiling to predict survival after chemotherapy for diffuse large-B-cell lymphoma","volume":"346","author":"Rosenwald","year":"2002","journal-title":"N. Engl. J. Med."},{"key":"2023012507503006500_B11","doi-asserted-by":"crossref","first-page":"474","DOI":"10.1198\/1061860032238","article-title":"Logic regression","volume":"12","author":"Ruczinski","year":"2003","journal-title":"J. Comput. Graph. Stat."},{"key":"2023012507503006500_B12","article-title":"Unified cross-validation methods for selection among estimators: finite sample results, asymptotic optimality, and applications","volume-title":"Technical Report 130.","author":"van der Laan","year":"2003"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/10\/1357\/48850929\/bioinformatics_26_10_1357.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/10\/1357\/48850929\/bioinformatics_26_10_1357.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T07:50:46Z","timestamp":1674633046000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/26\/10\/1357\/193817"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,4,7]]},"references-count":12,"journal-issue":{"issue":"10","published-print":{"date-parts":[[2010,5,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btq142","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2010,5,15]]},"published":{"date-parts":[[2010,4,7]]}}}