{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,21]],"date-time":"2025-02-21T10:52:43Z","timestamp":1740135163468,"version":"3.37.3"},"reference-count":14,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2016,9,2]],"date-time":"2016-09-02T00:00:00Z","timestamp":1472774400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2016,9,2]],"date-time":"2016-09-02T00:00:00Z","timestamp":1472774400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100000265","name":"Medical Research Council","doi-asserted-by":"publisher","award":["MR\/K006665\/1"],"award-info":[{"award-number":["MR\/K006665\/1"]}],"id":[{"id":"10.13039\/501100000265","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Background<\/jats:title><jats:p>It is useful to incorporate biological knowledge on the role of genetic determinants in predicting an outcome. It is, however, not always feasible to fully elicit this information when the number of determinants is large. We present an approach to overcome this difficulty. First, using half of the available data, a shortlist of potentially interesting determinants are generated. Second, binary indications of biological importance are elicited for this much smaller number of determinants. Third, an analysis is carried out on this shortlist using the second half of the data.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>We show through simulations that, compared with adaptive lasso, this approach leads to models containing more biologically relevant variables, while the prediction mean squared error (PMSE) is comparable or even reduced. We also apply our approach to bone mineral density data, and again final models contain more biologically relevant variables and have reduced PMSEs.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusion<\/jats:title><jats:p>Our method leads to comparable or improved predictive performance, and models with greater face validity and interpretability with feasible incorporation of biological knowledge into predictive models.<\/jats:p><\/jats:sec>","DOI":"10.1186\/s12859-016-1210-7","type":"journal-article","created":{"date-parts":[[2016,9,2]],"date-time":"2016-09-02T11:41:21Z","timestamp":1472816481000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["Tilting the lasso by knowledge-based post-processing"],"prefix":"10.1186","volume":"17","author":[{"given":"Kukatharmini","family":"Tharmaratnam","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5351-9960","authenticated-orcid":false,"given":"Matthew","family":"Sperrin","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Thomas","family":"Jaki","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sjur","family":"Reppe","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Arnoldo","family":"Frigessi","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2016,9,2]]},"reference":[{"key":"1210_CR1","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1111\/j.2517-6161.1996.tb02080.x","volume":"58","author":"R Tibshirani","year":"1996","unstructured":"Tibshirani R. Regression shrinkage and selection via the lasso. J R Stat Soc. 1996; 58:267\u201388.","journal-title":"J R Stat Soc"},{"key":"1210_CR2","first-page":"2541","volume":"7","author":"P Zhao","year":"2006","unstructured":"Zhao P, Yu B. On model selection consistency of lasso. J Mach Learn Res. 2006; 7:2541\u20132563.","journal-title":"J Mach Learn Res"},{"issue":"476","key":"1210_CR3","doi-asserted-by":"publisher","first-page":"1418","DOI":"10.1198\/016214506000000735","volume":"101","author":"H Zou","year":"2006","unstructured":"Zou H. The adaptive lasso and its oracle properties. J Am Stat Assoc. 2006; 101(476):1418\u20131429.","journal-title":"J Am Stat Assoc"},{"key":"1210_CR4","doi-asserted-by":"publisher","first-page":"169","DOI":"10.1214\/07-EJS008","volume":"1","author":"F Bunea","year":"2007","unstructured":"Bunea F, Tsybakov A, Wegkamp M. Sparsity oracle inequalities for the lasso. Electron J Stat. 2007; 1:169\u201394.","journal-title":"Electron J Stat"},{"issue":"2","key":"1210_CR5","doi-asserted-by":"publisher","first-page":"301","DOI":"10.1111\/j.1467-9868.2005.00503.x","volume":"67","author":"H Zou","year":"2005","unstructured":"Zou H, Hastie T. Regularization and variable selection via the elastic net. J R Stat Soc Ser B Stat Methodol. 2005; 67(2):301\u201320.","journal-title":"J R Stat Soc Ser B Stat Methodol"},{"key":"1210_CR6","doi-asserted-by":"publisher","first-page":"417","DOI":"10.1111\/j.1467-9868.2010.00740.x","volume":"72","author":"N Meinshausen","year":"2010","unstructured":"Meinshausen N, B\u00fchlmann P. Stability selection. J R Stat Soc Ser B Stat Methodol. 2010; 72:417\u201373.","journal-title":"J R Stat Soc Ser B Stat Methodol"},{"key":"1210_CR7","doi-asserted-by":"crossref","unstructured":"Lim C, Yu B. Estimation Stability With Cross-Validation (ESCV). Journal of Computational and Graphical Statistics 25.2. 2016:464\u2013492.","DOI":"10.1080\/10618600.2015.1020159"},{"issue":"5A","key":"1210_CR8","doi-asserted-by":"publisher","first-page":"2178","DOI":"10.1214\/08-AOS646","volume":"35","author":"L Wasserman","year":"2009","unstructured":"Wasserman L, Roeder K. High dimensional variable selection. Ann Stat. 2009; 35(5A):2178\u20132201.","journal-title":"Ann Stat"},{"key":"1210_CR9","doi-asserted-by":"publisher","first-page":"2","DOI":"10.1038\/ncb2641","volume":"15","author":"PAJ Muller","year":"2013","unstructured":"Muller PAJ, Vousden KH. p53 mutation in cancer. Nat Cell Biol. 2013; 15:2\u20138.","journal-title":"Nat Cell Biol"},{"issue":"482","key":"1210_CR10","doi-asserted-by":"publisher","first-page":"681","DOI":"10.1198\/016214508000000337","volume":"103","author":"T Park","year":"2008","unstructured":"Park T, Casella G. The bayesian lasso. J Am Stat Assoc. 2008; 103(482):681\u20136.","journal-title":"J Am Stat Assoc"},{"issue":"1","key":"1210_CR11","doi-asserted-by":"crossref","first-page":"1","DOI":"10.2202\/1544-6115.1703","volume":"10","author":"LC Bergersen","year":"2011","unstructured":"Bergersen LC, Glad IK, Lyng H. Weighted lasso with data integration. Stat Appl Genet Mol Biol. 2011; 10(1):1\u201329.","journal-title":"Stat Appl Genet Mol Biol"},{"issue":"1","key":"1210_CR12","doi-asserted-by":"publisher","first-page":"1","DOI":"10.18637\/jss.v033.i01","volume":"33","author":"J Friedman","year":"2010","unstructured":"Friedman J, Hastie T, Tibshirani R. Regularization paths for generalized linear models via coordinate descent. J Stat Softw. 2010; 33(1):1\u201322.","journal-title":"J Stat Softw"},{"key":"1210_CR13","doi-asserted-by":"publisher","first-page":"488","DOI":"10.1186\/1471-2105-9-488","volume":"9","author":"R Braun","year":"2008","unstructured":"Braun R, Cope L, Parmigiani G. Identifying differential correlation in gene\/pathway combinations. BMC Bioinforma. 2008; 9:488.","journal-title":"BMC Bioinforma"},{"issue":"3","key":"1210_CR14","doi-asserted-by":"publisher","first-page":"604","DOI":"10.1016\/j.bone.2009.11.007","volume":"46","author":"S Reppe","year":"2010","unstructured":"Reppe S, Refvem H, Gautvik VT, Olstad OK, H\u00f8vring PI, Reinholt FP, Holden M, Frigessi A, Jemtland R, Gautvik KM. Eight genes are highly associated with BMD variation in postmenopausal caucasian women. Bone. 2010; 46(3):604\u201312. doi:10.1016\/j.bone.2009.11.007.","journal-title":"Bone"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-016-1210-7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-016-1210-7\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-016-1210-7","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-016-1210-7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,6,19]],"date-time":"2024-06-19T05:04:07Z","timestamp":1718773447000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-016-1210-7"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,9,2]]},"references-count":14,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2016,12]]}},"alternative-id":["1210"],"URL":"https:\/\/doi.org\/10.1186\/s12859-016-1210-7","relation":{},"ISSN":["1471-2105"],"issn-type":[{"type":"electronic","value":"1471-2105"}],"subject":[],"published":{"date-parts":[[2016,9,2]]},"assertion":[{"value":"7 November 2015","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"25 August 2016","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"2 September 2016","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"344"}}