{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,22]],"date-time":"2026-01-22T08:12:19Z","timestamp":1769069539499,"version":"3.49.0"},"reference-count":47,"publisher":"Springer Science and Business Media LLC","issue":"6","license":[{"start":{"date-parts":[[2022,9,30]],"date-time":"2022-09-30T00:00:00Z","timestamp":1664496000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,9,30]],"date-time":"2022-09-30T00:00:00Z","timestamp":1664496000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100000086","name":"Directorate for Mathematical and Physical Sciences","doi-asserted-by":"publisher","award":["1712554"],"award-info":[{"award-number":["1712554"]}],"id":[{"id":"10.13039\/100000086","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Data Min Knowl Disc"],"published-print":{"date-parts":[[2022,11]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Varying coefficient models are a flexible extension of generic parametric models whose coefficients are functions of a set of effect-modifying covariates instead of fitted constants. They are capable of achieving higher model complexity while preserving the structure of the underlying parametric models, hence generating interpretable predictions. In this paper we study the use of gradient boosted decision trees as those coefficient-deciding functions in varying coefficient models with linearly structured outputs. In contrast to the traditional choices of splines or kernel smoothers, boosted trees are more flexible since they require no structural assumptions in the effect modifier space. We introduce our proposed method from the perspective of a localized version of gradient descent, prove its theoretical consistency under mild assumptions commonly adapted by decision tree research, and empirically demonstrate that the proposed tree boosted varying coefficient models achieve high performance qualified by their training speed, prediction accuracy and intelligibility as compared to several benchmark algorithms.<\/jats:p>","DOI":"10.1007\/s10618-022-00863-y","type":"journal-article","created":{"date-parts":[[2022,9,30]],"date-time":"2022-09-30T16:02:54Z","timestamp":1664553774000},"page":"2237-2271","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":14,"title":["Decision tree boosted varying coefficient models"],"prefix":"10.1007","volume":"36","author":[{"given":"Yichen","family":"Zhou","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2648-1167","authenticated-orcid":false,"given":"Giles","family":"Hooker","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2022,9,30]]},"reference":[{"key":"863_CR1","doi-asserted-by":"crossref","unstructured":"Basu S, Kumbier K, Brown JB, Yu B (2018) Iterative random forests to discover predictive and stable high-order interactions. In: Proceedings of the National Academy of Sciences, p 201711236","DOI":"10.1101\/222299"},{"key":"863_CR2","first-page":"1","volume":"29","author":"M Berger","year":"2017","unstructured":"Berger M, Tutz G, Schmid M (2017) Tree-structured modelling of varying coefficients. Stat Comput 29:1\u201313","journal-title":"Stat Comput"},{"key":"863_CR3","doi-asserted-by":"publisher","DOI":"10.1201\/9781315139470","volume-title":"Classification and regression trees","author":"L Breiman","year":"2017","unstructured":"Breiman L, Friedman JH, Olshen RA, Stone CJ (2017) Classification and regression trees. Routledge"},{"issue":"6","key":"863_CR4","first-page":"1","volume":"80","author":"RA Buergin","year":"2017","unstructured":"Buergin RA, Ritschard G (2017) Coefficient-wise tree-based varying coefficient regression with vcrpart. J Stat Softw 80(6):1\u201333","journal-title":"J Stat Softw"},{"key":"863_CR5","unstructured":"B\u00fchlmann PL (2002) Consistency for l2 boosting and matching pursuit with trees and tree-type basis functions. In: Research report\/seminar f\u00fcr Statistik, Eidgen\u00f6ssische Technische Hochschule (ETH), Seminar f\u00fcr Statistik, Eidgen\u00f6ssische Technische Hochschule (ETH), vol 109"},{"issue":"4","key":"863_CR6","first-page":"477","volume":"22","author":"P B\u00fchlmann","year":"2007","unstructured":"B\u00fchlmann P, Hothorn T et al (2007) Boosting algorithms: regularization, prediction and model fitting. Stat Sci 22(4):477\u2013505","journal-title":"Stat Sci"},{"key":"863_CR7","doi-asserted-by":"publisher","first-page":"28","DOI":"10.1016\/j.enbuild.2015.11.071","volume":"112","author":"LM Candanedo","year":"2016","unstructured":"Candanedo LM, Feldheim V (2016) Accurate occupancy detection of an office room from light, temperature, humidity and CO2 measurements using statistical learning models. Energy Build 112:28\u201339","journal-title":"Energy Build"},{"issue":"4","key":"863_CR8","doi-asserted-by":"publisher","first-page":"826","DOI":"10.1198\/106186004X13064","volume":"13","author":"KY Chan","year":"2004","unstructured":"Chan KY, Loh WY (2004) Lotus: an algorithm for building accurate and comprehensible logistic regression trees. J Comput Graph Stat 13(4):826\u2013852","journal-title":"J Comput Graph Stat"},{"key":"863_CR9","unstructured":"Chaudhuri P, Huang MC, Loh WY, Yao R (1994) Piecewise-polynomial regression trees. Stat Sin 143\u2013167"},{"issue":"2","key":"863_CR10","doi-asserted-by":"publisher","first-page":"515","DOI":"10.1214\/21-BA1259","volume":"17","author":"HA Chipman","year":"2022","unstructured":"Chipman HA, George EI, McCulloch RE, Shively TS (2022) mbart: Multidimensional monotone bart. Bayesian Anal 17(2):515\u2013544","journal-title":"Bayesian Anal"},{"key":"863_CR11","unstructured":"Cortes C, Mohri M, Storcheus D (2019) Regularized gradient boosting. In: Wallach H, Larochelle H, Beygelzimer A, d\u2019 Alch\u00e9-Buc F, Fox E, Garnett R (eds) Advances in neural information processing systems, vol 32. Curran Associates, Inc., pp 5449\u20135458. http:\/\/papers.nips.cc\/paper\/8784-regularized-gradient-boosting.pdf"},{"key":"863_CR12","unstructured":"Cotter A, Gupta M, Jiang H, Louidor E, Muller J, Narayan T, Wang S, Zhu T (2019) Shape constraints for set functions. In: International conference on machine learning, pp 1388\u20131396"},{"issue":"6","key":"863_CR13","doi-asserted-by":"publisher","first-page":"1031","DOI":"10.3150\/bj\/1137421639","volume":"11","author":"J Fan","year":"2005","unstructured":"Fan J, Huang T et al (2005) Profile likelihood inferences on semiparametric varying-coefficient partially linear models. Bernoulli 11(6):1031\u20131057","journal-title":"Bernoulli"},{"issue":"5","key":"863_CR14","doi-asserted-by":"publisher","first-page":"1491","DOI":"10.1214\/aos\/1017939139","volume":"27","author":"J Fan","year":"1999","unstructured":"Fan J, Zhang W et al (1999) Statistical estimation in varying coefficient models. Ann Stat 27(5):1491\u20131518","journal-title":"Ann Stat"},{"issue":"2\u20133","key":"863_CR15","doi-asserted-by":"publisher","first-page":"113","DOI":"10.1007\/s13748-013-0040-3","volume":"2","author":"H Fanaee-T","year":"2014","unstructured":"Fanaee-T H, Gama J (2014) Event labeling combining ensemble detectors and background knowledge. Progress Artif Intell 2(2\u20133):113\u2013127","journal-title":"Progress Artif Intell"},{"key":"863_CR16","doi-asserted-by":"crossref","unstructured":"Fernandes K, Vinagre P, Cortez P (2015) A proactive intelligent decision support system for predicting the popularity of online news. In: Portuguese conference on artificial intelligence. Springer, pp 535\u2013546","DOI":"10.1007\/978-3-319-23485-4_53"},{"issue":"2","key":"863_CR17","doi-asserted-by":"publisher","first-page":"503","DOI":"10.1080\/10618600.2020.1831930","volume":"30","author":"R Friedberg","year":"2020","unstructured":"Friedberg R, Tibshirani J, Athey S, Wager S (2020) Local linear forests. J Comput Graph Stat 30(2):503\u2013517","journal-title":"J Comput Graph Stat"},{"key":"863_CR18","doi-asserted-by":"crossref","unstructured":"Friedman JH (2001) Greedy function approximation: a gradient boosting machine. Ann Stat 1189\u20131232","DOI":"10.1214\/aos\/1013203451"},{"issue":"4","key":"863_CR19","doi-asserted-by":"publisher","first-page":"367","DOI":"10.1016\/S0167-9473(01)00065-2","volume":"38","author":"JH Friedman","year":"2002","unstructured":"Friedman JH (2002) Stochastic gradient boosting. Comput Stat Data Anal 38(4):367\u2013378","journal-title":"Comput Stat Data Anal"},{"issue":"3","key":"863_CR20","doi-asserted-by":"publisher","first-page":"219","DOI":"10.1023\/B:MACH.0000027782.67192.13","volume":"55","author":"J Gama","year":"2004","unstructured":"Gama J (2004) Functional trees. Mach Learn 55(3):219\u2013250","journal-title":"Mach Learn"},{"key":"863_CR21","volume-title":"Partially linear models","author":"W H\u00e4rdle","year":"2012","unstructured":"H\u00e4rdle W, Liang H, Gao J (2012) Partially linear models. Springer"},{"key":"863_CR22","doi-asserted-by":"crossref","unstructured":"Hastie T, Tibshirani R (1993) Varying-coefficient models. J Roy Stat Soc Ser B (Methodological) 757\u2013796","DOI":"10.1111\/j.2517-6161.1993.tb01939.x"},{"key":"863_CR23","unstructured":"Hothorn T, B\u00fchlmann P, Kneib T, Schmid M, Hofner B (2013) mboost: model-based boosting, 2012, pp 2\u20131. http:\/\/CRAN R-projectorg\/package=mboostRpackageversion"},{"key":"863_CR24","unstructured":"Kaggle (2018) Housing price in Beijing. https:\/\/www.kaggle.com\/ruiqurm\/lianjia\/home"},{"issue":"2182","key":"863_CR25","first-page":"20150257","volume":"471","author":"X Liang","year":"2015","unstructured":"Liang X, Zou T, Guo B, Li S, Zhang H, Zhang S, Huang H, Chen SX (2015) Assessing Beijing\u2019s pm 2.5 pollution: severity, weather impact, apec and winter heating. Proc Roy Soc A Math Phys Eng Sci 471(2182):20150257","journal-title":"Proc Roy Soc A Math Phys Eng Sci"},{"key":"863_CR26","doi-asserted-by":"crossref","unstructured":"Lou Y, Caruana R, Gehrke J (2012) Intelligible models for classification and regression. In: Proceedings of the 18th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, pp 150\u2013158","DOI":"10.1145\/2339530.2339556"},{"key":"863_CR27","doi-asserted-by":"crossref","unstructured":"Lou Y, Caruana R, Gehrke J, Hooker G (2013) Accurate intelligible models with pairwise interactions. In: Proceedings of the 19th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, pp 623\u2013631","DOI":"10.1145\/2487575.2487579"},{"key":"863_CR28","doi-asserted-by":"crossref","unstructured":"Mallat S, Zhang Z (1993) Matching pursuit with time-frequency dictionaries. Tech. rep. Courant Institute of Mathematical Sciences, New York, United States","DOI":"10.1109\/78.258082"},{"key":"863_CR29","unstructured":"Melis DA, Jaakkola T (2018) Towards robust interpretability with self-explaining neural networks. In: Advances in neural information processing systems, pp 7786\u20137795"},{"issue":"1","key":"863_CR30","first-page":"841","volume":"17","author":"L Mentch","year":"2016","unstructured":"Mentch L, Hooker G (2016) Quantifying uncertainty in random forests via confidence intervals and hypothesis tests. J Mach Learn Res 17(1):841\u2013881","journal-title":"J Mach Learn Res"},{"key":"863_CR31","doi-asserted-by":"publisher","first-page":"22","DOI":"10.1016\/j.dss.2014.03.001","volume":"62","author":"S Moro","year":"2014","unstructured":"Moro S, Cortez P, Rita P (2014) A data-driven approach to predict the success of bank telemarketing. Decis Support Syst 62:22\u201331","journal-title":"Decis Support Syst"},{"issue":"1","key":"863_CR32","doi-asserted-by":"publisher","first-page":"36","DOI":"10.1111\/insr.12029","volume":"83","author":"BU Park","year":"2015","unstructured":"Park BU, Mammen E, Lee YK, Lee ER (2015) Varying coefficient regression models: a review and new developments. Int Stat Rev 83(1):36\u201364","journal-title":"Int Stat Rev"},{"issue":"1","key":"863_CR33","doi-asserted-by":"publisher","first-page":"27","DOI":"10.1631\/FITEE.1700808","volume":"19","author":"Z Qs","year":"2018","unstructured":"Qs Z, Zhu SC (2018) Visual interpretability for deep learning: a survey. Front Inf Technol Electron Eng 19(1):27\u201339","journal-title":"Front Inf Technol Electron Eng"},{"key":"863_CR34","unstructured":"Rashmi K, Gilad-Bachrach R (2015) Dart: dropouts meet multiple additive regression trees. In: International conference on artificial intelligence and statistics, pp 489\u2013497"},{"key":"863_CR35","doi-asserted-by":"crossref","unstructured":"Ribeiro MT, Singh S, Guestrin C (2016) Why should i trust you?: Explaining the predictions of any classifier. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. ACM, pp 1135\u20131144","DOI":"10.1145\/2939672.2939778"},{"key":"863_CR36","unstructured":"Rogozhnikov A, Likhomanenko T (2017) Infiniteboost: building infinite ensembles with gradient descent. arXiv preprint arXiv:1706.01109"},{"issue":"3","key":"863_CR37","doi-asserted-by":"publisher","first-page":"1485","DOI":"10.1109\/TIT.2016.2514489","volume":"62","author":"E Scornet","year":"2016","unstructured":"Scornet E (2016) Random forests and kernel methods. IEEE Trans Inf Theory 62(3):1485\u20131500","journal-title":"IEEE Trans Inf Theory"},{"key":"863_CR38","unstructured":"Sundararajan M, Taly A, Yan Q (2017) Axiomatic attribution for deep networks. In: International conference on machine learning. PMLR, pp 3319\u20133328"},{"key":"863_CR39","doi-asserted-by":"crossref","unstructured":"Tan S, Caruana R, Hooker G, Lou Y (2018) Distill-and-compare: auditing black-box models using transparent model distillation. In: Proceedings of the 2018 AAAI\/ACM conference on AI, Ethics, and Society, pp 303\u2013310","DOI":"10.1145\/3278721.3278725"},{"key":"863_CR40","doi-asserted-by":"publisher","first-page":"560","DOI":"10.1016\/j.enbuild.2012.03.003","volume":"49","author":"A Tsanas","year":"2012","unstructured":"Tsanas A, Xifara A (2012) Accurate quantitative estimation of energy performance of residential buildings using statistical machine learning tools. Energy Build 49:560\u2013567","journal-title":"Energy Build"},{"key":"863_CR41","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4757-2545-2","volume-title":"Weak convergence and empirical processes with applications to statistics","author":"AW van der Vaart","year":"1996","unstructured":"van der Vaart AW, Wellner JA (1996) Weak convergence and empirical processes with applications to statistics. Springer"},{"issue":"523","key":"863_CR42","doi-asserted-by":"publisher","first-page":"1228","DOI":"10.1080\/01621459.2017.1319839","volume":"113","author":"S Wager","year":"2018","unstructured":"Wager S, Athey S (2018) Estimation and inference of heterogeneous treatment effects using random forests. J Am Stat Assoc 113(523):1228\u20131242","journal-title":"J Am Stat Assoc"},{"issue":"2","key":"863_CR43","doi-asserted-by":"publisher","first-page":"361","DOI":"10.1080\/10618600.2013.778777","volume":"23","author":"JC Wang","year":"2014","unstructured":"Wang JC, Hastie T (2014) Boosted varying-coefficient regression models for product demand prediction. J Comput Graph Stat 23(2):361\u2013382","journal-title":"J Comput Graph Stat"},{"key":"863_CR44","unstructured":"You S, Ding D, Canini K, Pfeifer J, Gupta M (2017) Deep lattice networks and partial monotonic functions. In: Advances in neural information processing systems, pp 2981\u20132989"},{"issue":"2","key":"863_CR45","doi-asserted-by":"publisher","first-page":"492","DOI":"10.1198\/106186008X319331","volume":"17","author":"A Zeileis","year":"2008","unstructured":"Zeileis A, Hothorn T, Hornik K (2008) Model-based recursive partitioning. J Comput Graph Stat 17(2):492\u2013514","journal-title":"J Comput Graph Stat"},{"key":"863_CR46","unstructured":"Zheng X, Chen SX (2019) Partitioning structure learning for segmented linear regression trees. In: Advances in neural information processing systems, pp 2219\u20132228"},{"issue":"183","key":"863_CR47","first-page":"1","volume":"23","author":"Y Zhou","year":"2022","unstructured":"Zhou Y, Hooker G (2022) Boulevard: regularized stochastic gradient boosted trees and their limiting distribution. J Mach Learn Res 23(183):1\u201344","journal-title":"J Mach Learn Res"}],"container-title":["Data Mining and Knowledge Discovery"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10618-022-00863-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10618-022-00863-y\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10618-022-00863-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,11,16]],"date-time":"2022-11-16T12:12:17Z","timestamp":1668600737000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10618-022-00863-y"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,9,30]]},"references-count":47,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2022,11]]}},"alternative-id":["863"],"URL":"https:\/\/doi.org\/10.1007\/s10618-022-00863-y","relation":{},"ISSN":["1384-5810","1573-756X"],"issn-type":[{"value":"1384-5810","type":"print"},{"value":"1573-756X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,9,30]]},"assertion":[{"value":"25 August 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 August 2022","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"30 September 2022","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}