{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,22]],"date-time":"2026-02-22T19:26:25Z","timestamp":1771788385149,"version":"3.50.1"},"reference-count":48,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2007,8,1]],"date-time":"2007-08-01T00:00:00Z","timestamp":1185926400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Knowl. Discov. Data"],"published-print":{"date-parts":[[2007,8]]},"abstract":"<jats:p>Prediction errors from a linear model tend to be larger when extrapolation is involved, particularly when the model is wrong. This article considers the problem of extrapolation and interpolation errors when a linear model tree is used for prediction. It proposes several ways to curtail the size of the errors, and uses a large collection of real datasets to demonstrate that the solutions are effective in reducing the average mean squared prediction error. The article also provides a proof that, if a linear model is correct, the proposed solutions have no undesirable effects as the training sample size tends to infinity.<\/jats:p>","DOI":"10.1145\/1267066.1267067","type":"journal-article","created":{"date-parts":[[2007,9,14]],"date-time":"2007-09-14T13:44:55Z","timestamp":1189777495000},"page":"6","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":18,"title":["Extrapolation errors in linear model trees"],"prefix":"10.1145","volume":"1","author":[{"given":"Wei-Yin","family":"Loh","sequence":"first","affiliation":[{"name":"University of Wisconsin, Madison, WI"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chien-Wei","family":"Chen","sequence":"additional","affiliation":[{"name":"University of Wisconsin, Madison, WI"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wei","family":"Zheng","sequence":"additional","affiliation":[{"name":"University of Wisconsin, Madison, WI"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2007,8]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1099-1255(199907\/08)14:4<403::AID-JAE520>3.0.CO;2-4"},{"key":"e_1_2_1_2_1","volume-title":"Statistical Analysis: A Computer Oriented Approach","author":"Afifi A.","year":"1979","unstructured":"Afifi , A. and Azen , S . 1979 . Statistical Analysis: A Computer Oriented Approach , 2 nd ed. Academic Press , New York . Afifi, A. and Azen, S. 1979. Statistical Analysis: A Computer Oriented Approach, 2nd ed. Academic Press, New York.","edition":"2"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1002\/0471725153"},{"key":"e_1_2_1_4_1","volume-title":"The Practice of Econometrics","author":"Berndt E. R.","unstructured":"Berndt , E. R. 1991. The Practice of Econometrics . Addison-Wesley , New York . Berndt, E. R. 1991. The Practice of Econometrics. Addison-Wesley, New York."},{"key":"e_1_2_1_5_1","unstructured":"Blake C. and Merz C. 1998. UCI Repository of Machine Learning Databases http:\/\/www.ics.uci.edu\/~mlearn\/MLRepository.html.  Blake C. and Merz C. 1998. UCI Repository of Machine Learning Databases http:\/\/www.ics.uci.edu\/~mlearn\/MLRepository.html."},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1002\/1099-1255(200005\/06)15:3<275::AID-JAE560>3.0.CO;2-Q"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1010933404324"},{"key":"e_1_2_1_8_1","first-page":"580","article-title":"Estimating optimal transformations for multiple regression and correlation","volume":"83","author":"Breiman L.","year":"1988","unstructured":"Breiman , L. and Friedman , J. 1988 . Estimating optimal transformations for multiple regression and correlation . J. Amer. Stat. Assoc. 83 , 580 -- 597 . Breiman, L. and Friedman, J. 1988. Estimating optimal transformations for multiple regression and correlation. J. Amer. Stat. Assoc. 83, 580--597.","journal-title":"J. Amer. Stat. Assoc."},{"key":"e_1_2_1_9_1","unstructured":"Breiman L. Friedman J. H. Olshen R. A. and Stone C. J. 1984. Classification and Regression Trees. Wadsworth Belmont CA.  Breiman L. Friedman J. H. Olshen R. A. and Stone C. J. 1984. Classification and Regression Trees. Wadsworth Belmont CA."},{"key":"e_1_2_1_10_1","volume-title":"Practical Data Analysis: Case Studies in Business Statistics","volume":"3","author":"Bryant P. G.","unstructured":"Bryant , P. G. and Smith , M. A . 1996 . Practical Data Analysis: Case Studies in Business Statistics , vol. 3 . Irwin\/McGraw Hill, New York. Bryant, P. G. and Smith, M. A. 1996. Practical Data Analysis: Case Studies in Business Statistics, vol. 3. Irwin\/McGraw Hill, New York."},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1002\/jae.635"},{"key":"e_1_2_1_12_1","article-title":"Pricing the C's of diamond stones","author":"Chu S.","year":"2001","unstructured":"Chu , S. 2001 . Pricing the C's of diamond stones . J. Stat. Educat. 9. http:\/\/www.amstat.org\/publications\/jse. Chu, S. 2001. Pricing the C's of diamond stones. J. Stat. Educat. 9. http:\/\/www.amstat.org\/publications\/jse.","journal-title":"J. Stat. Educat. 9. http:\/\/www.amstat.org\/publications\/jse."},{"key":"e_1_2_1_13_1","doi-asserted-by":"crossref","DOI":"10.1080\/10691898.2000.12131296","article-title":"Career records for all modern position players eligible for the Major League Baseball Hall of Fame","author":"Cochran J. J.","year":"2000","unstructured":"Cochran , J. J. 2000 . Career records for all modern position players eligible for the Major League Baseball Hall of Fame . J. Stat. Educat. 8. http:\/\/www.amstat.org\/publications\/jse. Cochran, J. J. 2000. Career records for all modern position players eligible for the Major League Baseball Hall of Fame. J. Stat. Educat. 8. http:\/\/www.amstat.org\/publications\/jse.","journal-title":"J. Stat. Educat. 8. http:\/\/www.amstat.org\/publications\/jse."},{"key":"e_1_2_1_14_1","doi-asserted-by":"crossref","DOI":"10.1080\/10691898.2002.11910670","article-title":"Data management, exploratory data analysis, and regression analysis with 1969--2000 Major League Baseball Attendance","author":"Cochran J. J.","year":"2002","unstructured":"Cochran , J. J. 2002 . Data management, exploratory data analysis, and regression analysis with 1969--2000 Major League Baseball Attendance . J. Stat. Educat. 10. http:\/\/www.amstat.org\/publications\/jse. Cochran, J. J. 2002. Data management, exploratory data analysis, and regression analysis with 1969--2000 Major League Baseball Attendance. J. Stat. Educat. 10. http:\/\/www.amstat.org\/publications\/jse.","journal-title":"J. Stat. Educat. 10. http:\/\/www.amstat.org\/publications\/jse."},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1002\/9780470316931"},{"key":"e_1_2_1_16_1","doi-asserted-by":"crossref","unstructured":"Cook D. and Weisberg S. 1994. An Introduction to Regression Graphics. Wiley New York.  Cook D. and Weisberg S. 1994. An Introduction to Regression Graphics. Wiley New York.","DOI":"10.1002\/9780470316863"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1099-1255(199705)12:3<313::AID-JAE440>3.0.CO;2-G"},{"key":"e_1_2_1_18_1","unstructured":"Denman N. and Gregory D. 1998. Analysis of sugar cane yields in the Mulgrave area for the 1997 sugar cane season. Tech. rep. MS305 Data Analysis Project Department of Mathematics University of Queensland Queensland Australia.  Denman N. and Gregory D. 1998. Analysis of sugar cane yields in the Mulgrave area for the 1997 sugar cane season. Tech. rep. MS305 Data Analysis Project Department of Mathematics University of Queensland Queensland Australia."},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1099-1255(199803\/04)13:2<145::AID-JAE467>3.0.CO;2-9"},{"key":"e_1_2_1_20_1","first-page":"257","article-title":"Bayesian modelling of catch in a north-west Atlantic fishery","volume":"51","author":"Fernandez C.","year":"2002","unstructured":"Fernandez , C. , Ley , E. , and Steel , M. F. J. 2002 . Bayesian modelling of catch in a north-west Atlantic fishery . Appl. Stat. 51 , 257 -- 280 . Fernandez, C., Ley, E., and Steel, M. F. J. 2002. Bayesian modelling of catch in a north-west Atlantic fishery. Appl. Stat. 51, 257--280.","journal-title":"Appl. Stat."},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1214\/aos\/1176347963"},{"key":"e_1_2_1_22_1","volume-title":"R. E., Tatham, R. L.","author":"Hair J. F.","year":"1998","unstructured":"Hair , J. F. anderson , R. E., Tatham, R. L. , and Black, W. C. 1998 . Multivariate Data Analysis. Prentice Hall , Englewood Cliffs, NJ. Hair, J. F. anderson, R. E., Tatham, R. L., and Black, W. C. 1998. Multivariate Data Analysis. Prentice Hall, Englewood Cliffs, NJ."},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1080\/03461238.1983.10408691"},{"key":"e_1_2_1_24_1","volume-title":"Robust Statistics: The Approach Based on Influence Functions","author":"Hampel F. R.","year":"1986","unstructured":"Hampel , F. R. , Ronchetti , E. M. , Rousseeuw , P. J. , and Stahel , W. A . 1986 . Robust Statistics: The Approach Based on Influence Functions . Wiley , New York . Hampel, F. R., Ronchetti, E. M., Rousseeuw, P. J., and Stahel, W. A. 1986. Robust Statistics: The Approach Based on Influence Functions. Wiley, New York."},{"key":"e_1_2_1_25_1","volume-title":"Logistic Regression, and Survival Analysis","author":"Harrell Jr., F. E.","unstructured":"Harrell , Jr., F. E. 2001. Regression Modeling Strategies: With Applications to Linear Models , Logistic Regression, and Survival Analysis . Springer-Verlag , New York . Harrell, Jr., F. E. 2001. Regression Modeling Strategies: With Applications to Linear Models, Logistic Regression, and Survival Analysis. Springer-Verlag, New York."},{"key":"e_1_2_1_26_1","unstructured":"Hastie T. and Tibshirani R. 1990. Generalized Additive Models. CRC Press.  Hastie T. and Tibshirani R. 1990. Generalized Additive Models. CRC Press."},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1099-1255(200001\/02)15:1<1::AID-JAE551>3.0.CO;2-Y"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1002\/jae.596"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1080\/07408170600897502"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.75.7.3034"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1002\/jae.656"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1099-1255(199909\/10)14:5<527::AID-JAE528>3.0.CO;2-X"},{"key":"e_1_2_1_33_1","first-page":"361","article-title":"Regression trees with unbiased variable selection and interaction detection","volume":"12","author":"Loh W.-Y.","year":"2002","unstructured":"Loh , W.-Y. 2002 . Regression trees with unbiased variable selection and interaction detection . Stat. Sinica 12 , 361 -- 386 . Loh, W.-Y. 2002. Regression trees with unbiased variable selection and interaction detection. Stat. Sinica 12, 361--386.","journal-title":"Stat. Sinica"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1099-1255(199909\/10)14:5<511::AID-JAE529>3.0.CO;2-C"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1002\/jae.572"},{"key":"e_1_2_1_36_1","unstructured":"Neter J. Kutner M. H. Nachtsheim C. J. and Wasserman W. 1996. Applied Linear Statistical Models 4th ed. Irwin.  Neter J. Kutner M. H. Nachtsheim C. J. and Wasserman W. 1996. Applied Linear Statistical Models 4th ed. Irwin."},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1099-1255(1998090)13:5<543::AID-JAE507>3.0.CO;2-J"},{"key":"e_1_2_1_38_1","doi-asserted-by":"crossref","unstructured":"Onoyama K. Ohsumi N. Mitsumochi N. and Kishihara T. 1998. Data analysis of deer-train collisions in eastern Hokkaido Japan. In Data Science Classification and Related Methods (Tokyo Japan) C. Hayashi N. Ohsumi K. Yajima Y. Tanaka H.-H. Bock and Y. Baba Eds. Springer-Verlag New York 746--751.  Onoyama K. Ohsumi N. Mitsumochi N. and Kishihara T. 1998. Data analysis of deer-train collisions in eastern Hokkaido Japan. In Data Science Classification and Related Methods (Tokyo Japan) C. Hayashi N. Ohsumi K. Yajima Y. Tanaka H.-H. Bock and Y. Baba Eds. Springer-Verlag New York 746--751.","DOI":"10.1007\/978-4-431-65950-1_81"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0167-7152(96)00140-X"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1249\/00005768-198504000-00037"},{"key":"e_1_2_1_41_1","volume-title":"Proceedings of the Australian Joint Conference on Artificial Intelligence","author":"Quinlan J. R.","year":"1992","unstructured":"Quinlan , J. R. 1992 . Learning with continuous classes . In Proceedings of the Australian Joint Conference on Artificial Intelligence ( Singapore), World Scientific, 343--348. Quinlan, J. R. 1992. Learning with continuous classes. In Proceedings of the Australian Joint Conference on Artificial Intelligence (Singapore), World Scientific, 343--348."},{"key":"e_1_2_1_42_1","volume-title":"R: A Language and Environment for Statistical Computing","author":"Development Core Team","year":"2005","unstructured":"R Development Core Team . 2005 . R: A Language and Environment for Statistical Computing . R Foundation for Statistical Computing, (Vienna, Austria). ISBN 3-900051-07-0. R Development Core Team. 2005. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, (Vienna, Austria). ISBN 3-900051-07-0."},{"key":"e_1_2_1_43_1","volume-title":"Applied Regression Analysis: A Research Tool. Wadsworth &amp","author":"Rawlings J. O.","unstructured":"Rawlings , J. O. 1988. Applied Regression Analysis: A Research Tool. Wadsworth &amp ; Brooks\/Cole Advanced Books & amp; Software. Rawlings, J. O. 1988. Applied Regression Analysis: A Research Tool. Wadsworth &amp; Brooks\/Cole Advanced Books &amp; Software."},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1099-1255(1998090)13:5<481::AID-JAE509>3.0.CO;2-I"},{"key":"e_1_2_1_45_1","volume-title":"Smoothing Methods in Statistics","author":"Simonoff J.","unstructured":"Simonoff , J. 1996. Smoothing Methods in Statistics . Springer-Verlag , New York . Simonoff, J. 1996. Smoothing Methods in Statistics. Springer-Verlag, New York."},{"key":"e_1_2_1_47_1","volume-title":"Proceedings of the Poster Papers of the European Conference on Machine Learning (Prague).","author":"Wang Y.","unstructured":"Wang , Y. and Witten , I . 1997. Inducing model trees for continuous classes . In Proceedings of the Poster Papers of the European Conference on Machine Learning (Prague). Wang, Y. and Witten, I. 1997. Inducing model trees for continuous classes. In Proceedings of the Poster Papers of the European Conference on Machine Learning (Prague)."},{"key":"e_1_2_1_48_1","first-page":"383","article-title":"Rule-based machine learning methods for functional prediction","volume":"3","author":"Weiss S.","year":"1995","unstructured":"Weiss , S. and Indurkhya , N. 1995 . Rule-based machine learning methods for functional prediction . J. Artif. Int. Res. 3 , 383 -- 403 . Weiss, S. and Indurkhya, N. 1995. Rule-based machine learning methods for functional prediction. J. Artif. Int. Res. 3, 383--403.","journal-title":"J. Artif. Int. Res."},{"key":"e_1_2_1_49_1","volume-title":"Data Mining: Practical Machine Learning Tools and Techniques with JAVA Implementations","author":"Witten I.","year":"2005","unstructured":"Witten , I. and Frank , E . 2005 . Data Mining: Practical Machine Learning Tools and Techniques with JAVA Implementations , 2 nd ed. Morgan Kaufmann , San Fransico, CA . http:\/\/www.cs.waikato.ac.nz\/ml\/weka. Witten, I. and Frank, E. 2005. Data Mining: Practical Machine Learning Tools and Techniques with JAVA Implementations, 2nd ed. Morgan Kaufmann, San Fransico, CA. http:\/\/www.cs.waikato.ac.nz\/ml\/weka.","edition":"2"}],"container-title":["ACM Transactions on Knowledge Discovery from Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1267066.1267067","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1267066.1267067","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T14:52:14Z","timestamp":1750258334000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1267066.1267067"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,8]]},"references-count":48,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2007,8]]}},"alternative-id":["10.1145\/1267066.1267067"],"URL":"https:\/\/doi.org\/10.1145\/1267066.1267067","relation":{},"ISSN":["1556-4681","1556-472X"],"issn-type":[{"value":"1556-4681","type":"print"},{"value":"1556-472X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2007,8]]},"assertion":[{"value":"2007-08-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}