{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,25]],"date-time":"2026-06-25T16:08:35Z","timestamp":1782403715120,"version":"3.54.5"},"reference-count":82,"publisher":"Institute of Mathematical Statistics","issue":"2","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Ann. Statist."],"published-print":{"date-parts":[[2019,4,1]]},"DOI":"10.1214\/18-aos1709","type":"journal-article","created":{"date-parts":[[2019,1,11]],"date-time":"2019-01-11T09:01:41Z","timestamp":1547197301000},"source":"Crossref","is-referenced-by-count":1318,"title":["Generalized random forests"],"prefix":"10.1214","volume":"47","author":[{"given":"Susan","family":"Athey","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Julie","family":"Tibshirani","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Stefan","family":"Wager","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"108","reference":[{"key":"1","doi-asserted-by":"publisher","unstructured":"Abadie, A. (2003). Semiparametric instrumental variable estimation of treatment response models. <i>J. Econometrics<\/i> <b>113<\/b> 231\u2013263.","DOI":"10.1016\/S0304-4076(02)00201-4"},{"key":"16","unstructured":"Breiman, L. (2001). Random forests. <i>Mach. Learn.<\/i> <b>45<\/b> 5\u201332."},{"key":"27","unstructured":"Fan, J. and Gijbels, I. (1996). <i>Local Polynomial Modelling and Its Applications. Monographs on Statistics and Applied Probability<\/i> <b>66<\/b>. Chapman &amp; Hall, London."},{"key":"63","doi-asserted-by":"publisher","unstructured":"Rosenbaum, P. R. and Rubin, D. B. (1983). The central role of the propensity score in observational studies for causal effects. <i>Biometrika<\/i> <b>70<\/b> 41\u201355.","DOI":"10.1093\/biomet\/70.1.41"},{"key":"72","unstructured":"van der Vaart, A. W. (1998). <i>Asymptotic Statistics. Cambridge Series in Statistical and Probabilistic Mathematics<\/i> <b>3<\/b>. Cambridge Univ. Press, Cambridge."},{"key":"15","doi-asserted-by":"crossref","unstructured":"Breiman, L. (1996). Bagging predictors. <i>Mach. Learn.<\/i> <b>24<\/b> 123\u2013140.","DOI":"10.1007\/BF00058655"},{"key":"20","doi-asserted-by":"crossref","unstructured":"Chipman, H. A., George, E. I. and McCulloch, R. E. (2010). BART: Bayesian additive regression trees. <i>Ann. Appl. Stat.<\/i> <b>4<\/b> 266\u2013298.","DOI":"10.1214\/09-AOAS285"},{"key":"82","doi-asserted-by":"publisher","unstructured":"Zhu, R., Zeng, D. and Kosorok, M. R. (2015). Reinforcement learning trees. <i>J. Amer. Statist. Assoc.<\/i> <b>110<\/b> 1770\u20131784.","DOI":"10.1080\/01621459.2015.1036994"},{"key":"28","doi-asserted-by":"publisher","unstructured":"Friedman, J. H. (2001). Greedy function approximation: A gradient boosting machine. <i>Ann. Statist.<\/i> <b>29<\/b> 1189\u20131232.","DOI":"10.1214\/aos\/1013203451"},{"key":"9","doi-asserted-by":"publisher","unstructured":"Belloni, A., Chen, D., Chernozhukov, V. and Hansen, C. (2012). Sparse models and methods for optimal instruments with an application to eminent domain. <i>Econometrica<\/i> <b>80<\/b> 2369\u20132429.","DOI":"10.3982\/ECTA9626"},{"key":"3","doi-asserted-by":"publisher","unstructured":"Andrews, D. W. K. (1993). Tests for parameter instability and structural change with unknown change point. <i>Econometrica<\/i> <b>61<\/b> 821\u2013856.","DOI":"10.2307\/2951764"},{"key":"36","doi-asserted-by":"crossref","unstructured":"Hill, J. L. (2011). Bayesian nonparametric modeling for causal inference. <i>J. Comput. Graph. Statist.<\/i> <b>20<\/b> 217\u2013240.","DOI":"10.1198\/jcgs.2010.08162"},{"key":"14","doi-asserted-by":"publisher","unstructured":"Biau, G. and Scornet, E. (2016). A random forest guided tour. <i>TEST<\/i> <b>25<\/b> 197\u2013227.","DOI":"10.1007\/s11749-016-0481-7"},{"key":"41","unstructured":"Hothorn, T., Lausen, B., Benner, A. and Radespiel-Tr\u00f6ger, M. (2004). Bagging survival trees. <i>Stat. Med.<\/i> <b>23<\/b> 77\u201391."},{"key":"48","doi-asserted-by":"publisher","unstructured":"Lin, Y. and Jeon, Y. (2006). Random forests and adaptive nearest neighbors. <i>J. Amer. Statist. Assoc.<\/i> <b>101<\/b> 578\u2013590.","DOI":"10.1198\/016214505000001230"},{"key":"51","unstructured":"Meinshausen, N. (2006). Quantile regression forests. <i>J. Mach. Learn. Res.<\/i> <b>7<\/b> 983\u2013999."},{"key":"80","doi-asserted-by":"publisher","unstructured":"Zeileis, A. and Hornik, K. (2007). Generalized $M$-fluctuation tests for parameter instability. <i>Stat. Neerl.<\/i> <b>61<\/b> 488\u2013508.","DOI":"10.1111\/j.1467-9574.2007.00371.x"},{"key":"81","doi-asserted-by":"crossref","unstructured":"Zeileis, A., Hothorn, T. and Hornik, K. (2008). Model-based recursive partitioning. <i>J. Comput. Graph. Statist.<\/i> <b>17<\/b> 492\u2013514.","DOI":"10.1198\/106186008X319331"},{"key":"25","doi-asserted-by":"publisher","unstructured":"Efron, B. and Stein, C. (1981). The jackknife estimate of variance. <i>Ann. Statist.<\/i> <b>9<\/b> 586\u2013596.","DOI":"10.1214\/aos\/1176345462"},{"key":"30","doi-asserted-by":"publisher","unstructured":"Geurts, P., Ernst, D. and Wehenkel, L. (2006). Extremely randomized trees. <i>Mach. Learn.<\/i> <b>63<\/b> 3\u201342.","DOI":"10.1007\/s10994-006-6226-1"},{"key":"19","doi-asserted-by":"crossref","unstructured":"Chernozhukov, V., Chetverikov, D., Demirer, M., Duflo, E., Hansen, C., Newey, W. and Robins, J. (2018). Double\/debiased machine learning for treatment and structural parameters. <i>Econom. J.<\/i> <b>21<\/b> C1\u2013C68.","DOI":"10.1111\/ectj.12097"},{"key":"75","unstructured":"Wager, S., Hastie, T. and Efron, B. (2014). Confidence intervals for random forests: The jackknife and the infinitesimal jackknife. <i>J. Mach. Learn. Res.<\/i> <b>15<\/b> 1625\u20131651."},{"key":"65","doi-asserted-by":"publisher","unstructured":"Scornet, E., Biau, G. and Vert, J.-P. (2015). Consistency of random forests. <i>Ann. Statist.<\/i> <b>43<\/b> 1716\u20131741.","DOI":"10.1214\/15-AOS1321"},{"key":"76","unstructured":"Wager, S. and Walther, G. (2015). Adaptive concentration of regression trees, with application to random forests. ArXiv preprint. Available at <a href=\"arXiv:1503.06388\">arXiv:1503.06388<\/a>."},{"key":"53","doi-asserted-by":"publisher","unstructured":"Molinaro, A. M., Dudoit, S. and van der Laan, M. J. (2004). Tree-based multivariate regression and density estimation with right-censored data. <i>J. Multivariate Anal.<\/i> <b>90<\/b> 154\u2013177.","DOI":"10.1016\/j.jmva.2004.02.003"},{"key":"2","doi-asserted-by":"crossref","unstructured":"Amit, Y. and Geman, D. (1997). Shape quantization and recognition with randomized trees. <i>Neural Comput.<\/i> <b>9<\/b> 1545\u20131588.","DOI":"10.1162\/neco.1997.9.7.1545"},{"key":"13","unstructured":"Biau, G., Devroye, L. and Lugosi, G. (2008). Consistency of random forests and other averaging classifiers. <i>J. Mach. Learn. Res.<\/i> <b>9<\/b> 2015\u20132033."},{"key":"18","doi-asserted-by":"publisher","unstructured":"B\u00fchlmann, P. and Yu, B. (2002). Analyzing bagging. <i>Ann. Statist.<\/i> <b>30<\/b> 927\u2013961.","DOI":"10.1214\/aos\/1031689014"},{"key":"32","doi-asserted-by":"publisher","unstructured":"Hampel, F. R. (1974). The influence curve and its role in robust estimation. <i>J. Amer. Statist. Assoc.<\/i> <b>69<\/b> 383\u2013393.","DOI":"10.1080\/01621459.1974.10482962"},{"key":"71","doi-asserted-by":"publisher","unstructured":"Tibshirani, R. and Hastie, T. (1987). Local likelihood estimation. <i>J. Amer. Statist. Assoc.<\/i> <b>82<\/b> 559\u2013567.","DOI":"10.1080\/01621459.1987.10478466"},{"key":"56","doi-asserted-by":"publisher","unstructured":"Newey, W. K. and Powell, J. L. (2003). Instrumental variable estimation of nonparametric models. <i>Econometrica<\/i> <b>71<\/b> 1565\u20131578.","DOI":"10.1111\/1468-0262.00459"},{"key":"64","doi-asserted-by":"publisher","unstructured":"Schick, A. (1986). On asymptotically efficient estimation in semiparametric models. <i>Ann. Statist.<\/i> <b>14<\/b> 1139\u20131151.","DOI":"10.1214\/aos\/1176350055"},{"key":"67","doi-asserted-by":"publisher","unstructured":"Staniswalis, J. G. (1989). The kernel estimate of a regression function in likelihood-based models. <i>J. Amer. Statist. Assoc.<\/i> <b>84<\/b> 276\u2013283.","DOI":"10.2307\/2289874"},{"key":"21","doi-asserted-by":"publisher","unstructured":"Darolles, S., Fan, Y., Florens, J. P. and Renault, E. (2011). Nonparametric instrumental regression. <i>Econometrica<\/i> <b>79<\/b> 1541\u20131565.","DOI":"10.3982\/ECTA6539"},{"key":"54","doi-asserted-by":"crossref","unstructured":"Newey, W. K. (1994a). Kernel estimation of partial means and a general variance estimator. <i>Econometric Theory<\/i> <b>10<\/b> 233\u2013253.","DOI":"10.1017\/S0266466600008409"},{"key":"55","doi-asserted-by":"publisher","unstructured":"Newey, W. K. (1994b). The asymptotic variance of semiparametric estimators. <i>Econometrica<\/i> <b>62<\/b> 1349\u20131382.","DOI":"10.2307\/2951752"},{"key":"12","doi-asserted-by":"publisher","unstructured":"Biau, G. and Devroye, L. (2010). On the layered nearest neighbour estimate, the bagged nearest neighbour estimate and the random forest method in regression and classification. <i>J. Multivariate Anal.<\/i> <b>101<\/b> 2499\u20132518.","DOI":"10.1016\/j.jmva.2010.06.019"},{"key":"68","doi-asserted-by":"publisher","unstructured":"Stone, C. J. (1977). Consistent nonparametric regression. <i>Ann. Statist.<\/i> <b>5<\/b> 595\u2013645.","DOI":"10.1214\/aos\/1176343886"},{"key":"58","doi-asserted-by":"publisher","unstructured":"Nyblom, J. (1989). Testing for the constancy of parameters over time. <i>J. Amer. Statist. Assoc.<\/i> <b>84<\/b> 223\u2013230.","DOI":"10.1080\/01621459.1989.10478759"},{"key":"11","unstructured":"Biau, G. (2012). Analysis of a random forests model. <i>J. Mach. Learn. Res.<\/i> <b>13<\/b> 1063\u20131095."},{"key":"43","doi-asserted-by":"crossref","unstructured":"Ishwaran, H. and Kogalur, U. B. (2010). Consistency of random survival forests. <i>Statist. Probab. Lett.<\/i> <b>80<\/b> 1056\u20131064.","DOI":"10.1016\/j.spl.2010.02.020"},{"key":"62","doi-asserted-by":"crossref","unstructured":"Robinson, P. M. (1988). Root-$N$-consistent semiparametric regression. <i>Econometrica<\/i> <b>56<\/b> 931\u2013954.","DOI":"10.2307\/1912705"},{"key":"40","doi-asserted-by":"crossref","unstructured":"Honor\u00e9, B. E. and Kyriazidou, E. (2000). Panel data discrete choice models with lagged dependent variables. <i>Econometrica<\/i> <b>68<\/b> 839\u2013874.","DOI":"10.1111\/1468-0262.00139"},{"key":"23","unstructured":"Dietterich, T. G. (2000). An experimental comparison of three methods for constructing ensembles of decision trees: Bagging, boosting, and randomization. <i>Mach. Learn.<\/i> <b>40<\/b> 139\u2013157."},{"key":"38","unstructured":"Ho, T. K. (1998). The random subspace method for constructing decision forests. <i>IEEE Trans. Pattern Anal. Mach. Intell.<\/i> <b>20<\/b> 832\u2013844."},{"key":"66","doi-asserted-by":"publisher","unstructured":"Sexton, J. and Laake, P. (2009). Standard errors for bagged and random forest estimators. <i>Comput. Statist. Data Anal.<\/i> <b>53<\/b> 801\u2013811.","DOI":"10.1016\/j.csda.2008.08.007"},{"key":"6","unstructured":"Arlot, S. and Genuer, R. (2014). Analysis of purely random forests bias. ArXiv preprint. Available at <a href=\"arXiv:1407.3939\">arXiv:1407.3939<\/a>."},{"key":"7","doi-asserted-by":"publisher","unstructured":"Athey, S. and Imbens, G. (2016). Recursive partitioning for heterogeneous causal effects. <i>Proc. Natl. Acad. Sci. USA<\/i> <b>113<\/b> 7353\u20137360.","DOI":"10.1073\/pnas.1510489113"},{"key":"8","doi-asserted-by":"crossref","unstructured":"Athey, S., Tibshirani, J. and Wager, S. (2018). Supplement to \u201cGeneralized random forests.\u201d <a href=\"DOI:10.1214\/18-AOS1709SUPP\">DOI:10.1214\/18-AOS1709SUPP<\/a>.","DOI":"10.1214\/18-AOS1709SUPP"},{"key":"24","doi-asserted-by":"crossref","unstructured":"Efron, B. (1982). <i>The Jackknife<\/i>, <i>the Bootstrap and Other Resampling Plans. CBMS-NSF Regional Conference Series in Applied Mathematics<\/i> <b>38<\/b>. Society for Industrial and Applied Mathematics (SIAM), Philadelphia, PA.","DOI":"10.1137\/1.9781611970319"},{"key":"26","doi-asserted-by":"publisher","unstructured":"Fan, J., Farmen, M. and Gijbels, I. (1998). Local maximum likelihood estimation and inference. <i>J. R. Stat. Soc. Ser. B. Stat. Methodol.<\/i> <b>60<\/b> 591\u2013608.","DOI":"10.1111\/1467-9868.00142"},{"key":"31","unstructured":"Gordon, L. and Olshen, R. A. (1985). Tree-structured survival analysis. <i>Cancer Treat. Rep.<\/i> <b>69<\/b> 1065\u20131069."},{"key":"33","doi-asserted-by":"crossref","unstructured":"Hansen, B. E. (1992). Testing for parameter instability in linear models. <i>J. Policy Model.<\/i> <b>14<\/b> 517\u2013533.","DOI":"10.1016\/0161-8938(92)90019-9"},{"key":"37","doi-asserted-by":"publisher","unstructured":"Hjort, N. L. and Koning, A. (2002). Tests for constancy of model parameters over time. <i>J. Nonparametr. Stat.<\/i> <b>14<\/b> 113\u2013132.","DOI":"10.1080\/10485250211394"},{"key":"39","doi-asserted-by":"publisher","unstructured":"Hoeffding, W. (1948). A class of statistics with asymptotically normal distribution. <i>Ann. Math. Stat.<\/i> <b>19<\/b> 293\u2013325.","DOI":"10.1214\/aoms\/1177730196"},{"key":"42","doi-asserted-by":"publisher","unstructured":"Imbens, G. W. and Angrist, J. D. (1994). Identification and estimation of local average treatment effects. <i>Econometrica<\/i> <b>62<\/b> 467\u2013475.","DOI":"10.2307\/2951620"},{"key":"47","doi-asserted-by":"publisher","unstructured":"Lewbel, A. (2007). A local generalized method of moments estimator. <i>Econom. Lett.<\/i> <b>94<\/b> 124\u2013128.","DOI":"10.1016\/j.econlet.2006.08.011"},{"key":"50","unstructured":"Mallows, C. L. (1973). Some comments on Cp. <i>Technometrics<\/i> <b>15<\/b> 661\u2013675."},{"key":"52","unstructured":"Mentch, L. and Hooker, G. (2016). Quantifying uncertainty in random forests via confidence intervals and hypothesis tests. <i>J. Mach. Learn. Res.<\/i> <b>17<\/b> 26."},{"key":"57","unstructured":"Neyman, J. (1979). $C(\\alpha)$ tests and their use. <i>Sankhya<\/i>, <i>Ser. A<\/i> <b>41<\/b> 1\u201321."},{"key":"59","doi-asserted-by":"publisher","unstructured":"Ploberger, W. and Kr\u00e4mer, W. (1992). The CUSUM test with OLS residuals. <i>Econometrica<\/i> <b>60<\/b> 271\u2013285.","DOI":"10.2307\/2951597"},{"key":"60","doi-asserted-by":"crossref","unstructured":"Poterba, J. M., Venti, S. F. and Wise, D. A. (1996). How retirement saving programs increase saving. <i>J. Electron. Publ.<\/i> <b>10<\/b> 91\u2013112.","DOI":"10.1257\/jep.10.4.91"},{"key":"61","unstructured":"Robins, J. M. and Ritov, Y. (1997). Toward a curse of dimensionality appropriate (CODA) asymptotic theory for semi-parametric models. <i>Stat. Med.<\/i> <b>16<\/b>."},{"key":"69","doi-asserted-by":"crossref","unstructured":"Su, L., Murtazashvili, I. and Ullah, A. (2013). Local linear GMM estimation of functional coefficient IV models with an application to estimating the rate of return to schooling. <i>J. Bus. Econom. Statist.<\/i> <b>31<\/b> 184\u2013207.","DOI":"10.1080\/07350015.2012.754314"},{"key":"70","doi-asserted-by":"crossref","unstructured":"Su, X., Tsai, C.-L., Wang, H., Nickerson, D. M. and Li, B. (2009). Subgroup analysis via recursive partitioning. <i>J. Mach. Learn. Res.<\/i> <b>10<\/b> 141\u2013158.","DOI":"10.2139\/ssrn.1341380"},{"key":"73","doi-asserted-by":"crossref","unstructured":"Varian, H. R. (2014). Big data: New tricks for econometrics. <i>J. Electron. Publ.<\/i> <b>28<\/b> 3\u201327.","DOI":"10.1257\/jep.28.2.3"},{"key":"74","doi-asserted-by":"publisher","unstructured":"Wager, S. and Athey, S. (2018). Estimation and inference of heterogeneous treatment effects using random forests. <i>J. Amer. Statist. Assoc.<\/i> <b>113<\/b> 1228\u20131242.","DOI":"10.1080\/01621459.2017.1319839"},{"key":"78","doi-asserted-by":"crossref","unstructured":"Wright, M. N. and Ziegler, A. (2017). ranger: A fast implementation of random forests for high dimensional data in C${+}{+}$ and R. <i>J. Stat. Softw.<\/i> <b>77<\/b> 1\u201317.","DOI":"10.18637\/jss.v077.i01"},{"key":"79","doi-asserted-by":"publisher","unstructured":"Zeileis, A. (2005). A unified approach to structural change tests based on ML scores, $F$ statistics, and OLS residuals. <i>Econometric Rev.<\/i> <b>24<\/b> 445\u2013466.","DOI":"10.1080\/07474930500406053"},{"key":"77","unstructured":"Wooldridge, J. M. (2010). <i>Econometric Analysis of Cross Section and Panel Data<\/i>, 2nd ed. MIT Press, Cambridge, MA."},{"key":"29","unstructured":"Gelman, A., Carlin, J. B., Stern, H. S. and Rubin, D. B. (2004). <i>Bayesian Data Analysis<\/i>, 2nd ed. Chapman &amp; Hall\/CRC, Boca Raton, FL."},{"key":"17","unstructured":"Breiman, L., Friedman, J. H., Olshen, R. A. and Stone, C. J. (1984). <i>Classification and Regression Trees<\/i>. Wadsworth Advanced Books and Software, Belmont, CA."},{"key":"49","doi-asserted-by":"crossref","unstructured":"Loader, C. (1999). <i>Local Regression and Likelihood<\/i>. Springer, New York.","DOI":"10.1007\/b98858"},{"key":"35","unstructured":"Hastie, T., Tibshirani, R. and Friedman, J. (2009). <i>The Elements of Statistical Learning<\/i>, 2nd ed. Springer, New York."},{"key":"4","unstructured":"Angrist, J. D. (1990). Lifetime earnings and the Vietnam era draft lottery: Evidence from social security administrative records. <i>AER<\/i> 313\u2013336."},{"key":"5","unstructured":"Angrist, J. D. and Evans, W. N. (1998). Children and their parents\u2019 labor supply: Evidence from exogenous variation in family size. <i>AER<\/i> 450\u2013477."},{"key":"10","doi-asserted-by":"crossref","unstructured":"Beygelzimer, A. and Langford, J. (2009). The offset tree for learning with partial labels. In <i>Proceedings of KDD<\/i> 129\u2013138. ACM.","DOI":"10.1145\/1557019.1557040"},{"key":"22","unstructured":"Denil, M., Matheson, D. and De Freitas, N. (2014). Narrowing the Gap: Random forests in theory and in practice. In <i>Proceedings of ICML<\/i> 665\u2013673."},{"key":"34","unstructured":"Hartford, J., Lewis, G., Leyton-Brown, K. and Taddy, M. (2017). Deep IV: A flexible approach for counterfactual prediction. In <i>Proceedings of ICML<\/i> 1414\u20131423."},{"key":"44","unstructured":"Kallus, N. (2017). Recursive Partitioning for Personalization using Observational Data. In <i>Proceedings of ICML<\/i>. 1789\u20131798."},{"key":"45","unstructured":"Kleiber, C. and Zeileis, A. (2008). <i>Applied Econometrics with R<\/i>. Springer Science &amp; Business Media."},{"key":"46","doi-asserted-by":"crossref","unstructured":"LeBlanc, M. and Crowley, J. (1992). Relative risk trees for censored survival data. <i>Biometrics<\/i> 411\u2013425.","DOI":"10.2307\/2532300"}],"container-title":["The Annals of Statistics"],"original-title":[],"link":[{"URL":"https:\/\/projecteuclid.org\/download\/pdfview_1\/euclid.aos\/1547197251","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,9,10]],"date-time":"2022-09-10T02:54:34Z","timestamp":1662778474000},"score":1,"resource":{"primary":{"URL":"https:\/\/projecteuclid.org\/journals\/annals-of-statistics\/volume-47\/issue-2\/Generalized-random-forests\/10.1214\/18-AOS1709.full"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,4,1]]},"references-count":82,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2019,4,1]]}},"URL":"https:\/\/doi.org\/10.1214\/18-aos1709","relation":{},"ISSN":["0090-5364"],"issn-type":[{"value":"0090-5364","type":"print"}],"subject":[],"published":{"date-parts":[[2019,4,1]]}}}