{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,4,17]],"date-time":"2025-04-17T16:09:55Z","timestamp":1744906195213},"reference-count":28,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2012,5,31]],"date-time":"2012-05-31T00:00:00Z","timestamp":1338422400000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Mach Learn"],"published-print":{"date-parts":[[2013,1]]},"DOI":"10.1007\/s10994-012-5298-3","type":"journal-article","created":{"date-parts":[[2012,5,30]],"date-time":"2012-05-30T17:52:31Z","timestamp":1338400351000},"page":"29-57","source":"Crossref","is-referenced-by-count":5,"title":["Density estimation with minimization of U-divergence"],"prefix":"10.1007","volume":"90","author":[{"given":"Kanta","family":"Naito","sequence":"first","affiliation":[]},{"given":"Shinto","family":"Eguchi","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2012,5,31]]},"reference":[{"key":"5298_CR1","doi-asserted-by":"crossref","first-page":"549","DOI":"10.1093\/biomet\/85.3.549","volume":"85","author":"A. Basu","year":"1998","unstructured":"Basu, A., Harris, I. R., Hjort, N. L., & Jones, M. C. (1998). Robust and efficient estimation by minimising a density power divergence. Biometrika, 85, 549\u2013559.","journal-title":"Biometrika"},{"key":"5298_CR2","volume-title":"Pattern recognition and machine learning","author":"C. M. Bishop","year":"2006","unstructured":"Bishop, C. M. (2006). Pattern recognition and machine learning. Berlin: Springer."},{"key":"5298_CR3","doi-asserted-by":"crossref","first-page":"324","DOI":"10.1198\/016214503000125","volume":"98","author":"P. B\u00fchlman","year":"2003","unstructured":"B\u00fchlman, P., & Yu, B. (2003). Boosting with L 2 loss: regression and classification. Journal of the American Statistical Association, 98, 324\u2013339.","journal-title":"Journal of the American Statistical Association"},{"key":"5298_CR4","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4612-0711-5","volume-title":"A probabilistic theory of pattern recognition","author":"L. Devroye","year":"1996","unstructured":"Devroye, L., Gy\u00f6rfi, L., & Lugosi, G. (1996). A probabilistic theory of pattern recognition. Berlin: Springer."},{"key":"5298_CR5","doi-asserted-by":"crossref","first-page":"226","DOI":"10.1093\/biomet\/91.1.226","volume":"91","author":"M. Marzio Di","year":"2004","unstructured":"Di Marzio, M., & Taylor, C. C. (2004). Boosting kernel density estimates: a bias reduction technique? Biometrika, 91, 226\u2013233.","journal-title":"Biometrika"},{"key":"5298_CR6","unstructured":"Duong, T. (2010). Reference manual for the package ks. http:\/\/cran.r-project.org\/ ."},{"key":"5298_CR7","doi-asserted-by":"crossref","first-page":"17","DOI":"10.1080\/10485250306039","volume":"15","author":"T. Duong","year":"2003","unstructured":"Duong, T., & Hazelton, M. L. (2003). Plug-in bandwidth matrices for bivariate kernel density estimation. Journal of Nonparametric Statistics, 15, 17\u201330.","journal-title":"Journal of Nonparametric Statistics"},{"key":"5298_CR8","doi-asserted-by":"crossref","first-page":"417","DOI":"10.1016\/j.jmva.2004.04.004","volume":"93","author":"T. Duong","year":"2005","unstructured":"Duong, T., & Hazelton, M. L. (2005a). Convergence rates for unconstrained bandwidth matrix selectors in multivariate kernal density estimation. Journal of Multivariate Analysis, 93, 417\u2013433.","journal-title":"Journal of Multivariate Analysis"},{"key":"5298_CR9","doi-asserted-by":"crossref","first-page":"485","DOI":"10.1111\/j.1467-9469.2005.00445.x","volume":"38","author":"T. Duong","year":"2005","unstructured":"Duong, T., & Hazelton, M. L. (2005b). Cross-validation bandwidth matrices for multivariate kernel density estimation. Scandinavian Journal of Statistics, 38, 485\u2013506.","journal-title":"Scandinavian Journal of Statistics"},{"key":"5298_CR10","first-page":"309","volume-title":"Information theory and statistical learning","author":"S. Eguchi","year":"2008","unstructured":"Eguchi, S. (2008). Information divergence geometry and the application to statistical machine learning. In F.\u00a0Emmert-Streib & M. Dehmer (Eds.), Information theory and statistical learning (pp. 309\u2013332). Berlin: Springer."},{"key":"5298_CR11","doi-asserted-by":"crossref","first-page":"599","DOI":"10.1080\/01621459.1984.10478086","volume":"79","author":"J. H. Friedman","year":"1984","unstructured":"Friedman, J. H., Stuetzle, W., & Schroeder, A. (1984). Projection pursuit density estimation. Journal of the American Statistical Association, 79, 599\u2013608.","journal-title":"Journal of the American Statistical Association"},{"key":"5298_CR12","doi-asserted-by":"crossref","first-page":"2053","DOI":"10.1016\/j.jmva.2008.02.004","volume":"99","author":"H. Fujisawa","year":"2008","unstructured":"Fujisawa, H., & Eguchi, S. (2008). Robust parameter estimation with a small bias against heavy contamination. Journal of Multivariate Analysis, 99, 2053\u20132081.","journal-title":"Journal of Multivariate Analysis"},{"key":"5298_CR13","doi-asserted-by":"crossref","first-page":"1253","DOI":"10.1109\/TPAMI.2003.1233899","volume":"25","author":"M. Girolami","year":"2003","unstructured":"Girolami, M., & He, C. (2003). Probability density estimation from optimally condensed data samples. IEEE Transactions on Pattern Analysis and Machine Intelligence, 25, 1253\u20131264.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"5298_CR14","doi-asserted-by":"crossref","DOI":"10.1007\/978-0-387-84858-7","volume-title":"The elements of statistical learning: data mining, inference and prediction","author":"T. Hastie","year":"2009","unstructured":"Hastie, T., Tibishirani, R., & Friedman, J. H. (2009). The elements of statistical learning: data mining, inference and prediction (2nd ed.). New York: Springer.","edition":"2"},{"key":"5298_CR15","first-page":"453","volume":"5","author":"I. Higuchi","year":"2004","unstructured":"Higuchi, I., & Eguchi, S. (2004). Robust principal component analysis with adaptive selection for tuning parameters. Journal of Machine Learning Research, 5, 453\u2013471.","journal-title":"Journal of Machine Learning Research"},{"key":"5298_CR16","doi-asserted-by":"crossref","first-page":"401","DOI":"10.1080\/01621459.1996.10476701","volume":"91","author":"M. C. Jones","year":"1996","unstructured":"Jones, M. C., Marron, J. S., & Sheather, S. J. (1996). A brief survey of bandwidth selection for density estimation. Journal of the American Statistical Association, 91, 401\u2013407.","journal-title":"Journal of the American Statistical Association"},{"key":"5298_CR17","doi-asserted-by":"crossref","first-page":"169","DOI":"10.1007\/s10994-006-5000-8","volume":"67","author":"J. Klemel\u00e4","year":"2007","unstructured":"Klemel\u00e4, J. (2007). Density estimation with stagewise optimization of the empirical risk. Machine Learning, 67, 169\u2013195.","journal-title":"Machine Learning"},{"key":"5298_CR18","doi-asserted-by":"crossref","first-page":"482","DOI":"10.1214\/09-AOS726","volume":"38","author":"J. Klemel\u00e4","year":"2010","unstructured":"Klemel\u00e4, J., & Mammen, E. (2010). Empirical risk minimization in inverse problems. Annals of Statistics, 38, 482\u2013511.","journal-title":"Annals of Statistics"},{"key":"5298_CR19","doi-asserted-by":"crossref","first-page":"1859","DOI":"10.1162\/089976602760128045","volume":"14","author":"M. Minami","year":"2002","unstructured":"Minami, M., & Eguchi, S. (2002). Robust blind source separation by beta-divergence. Neural Computation, 14, 1859\u20131886.","journal-title":"Neural Computation"},{"key":"5298_CR20","doi-asserted-by":"crossref","first-page":"1437","DOI":"10.1162\/089976604323057452","volume":"16","author":"N. Murata","year":"2004","unstructured":"Murata, N., Takenouchi, T., Kanamori, T., & Eguchi, S. (2004). Information geometry of U-boost and Bregman divergence. Neural Computation, 16, 1437\u20131481.","journal-title":"Neural Computation"},{"key":"5298_CR21","doi-asserted-by":"crossref","first-page":"379","DOI":"10.1016\/S0167-9473(01)00066-4","volume":"38","author":"G. Ridgeway","year":"2002","unstructured":"Ridgeway, G. (2002). Looking for lumps: boosting and bagging for density estimation. Computational Statistics & Data Analysis, 38, 379\u2013392.","journal-title":"Computational Statistics & Data Analysis"},{"key":"5298_CR22","volume-title":"Convex analysis","author":"R. T. Rockafeller","year":"1996","unstructured":"Rockafeller, R. T. (1996). Convex analysis. Princeton: Princeton University Press."},{"key":"5298_CR23","volume-title":"Proceedings of the 16th international conference on neural information processing systems (NIPS)","author":"S. Rosset","year":"2002","unstructured":"Rosset, S., & Segal, E. (2002). Boosting density estimation. In Proceedings of the 16th international conference on neural information processing systems (NIPS)."},{"key":"5298_CR24","first-page":"197","volume":"5","author":"R. E. Schapire","year":"1990","unstructured":"Schapire, R. E. (1990). The strength of weak learnability. Machine Learning, 5, 197\u2013227.","journal-title":"Machine Learning"},{"key":"5298_CR25","doi-asserted-by":"crossref","first-page":"274","DOI":"10.1198\/004017001316975880","volume":"43","author":"D. W. Scott","year":"2001","unstructured":"Scott, D. W. (2001). Parametric statistical modeling by minimum integrated square error. Technometrics, 43, 274\u2013285.","journal-title":"Technometrics"},{"key":"5298_CR26","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4612-4026-6","volume-title":"Smoothing methods in statistics","author":"J. S. Simonoff","year":"1996","unstructured":"Simonoff, J. S. (1996). Smoothing methods in statistics. Berlin: Springer."},{"key":"5298_CR27","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4899-3324-9","volume-title":"Density estimation for statistics and data analysis","author":"B. W. Silverman","year":"1986","unstructured":"Silverman, B. W. (1986). Density estimation for statistics and data analysis. London: Chapman and Hall."},{"key":"5298_CR28","doi-asserted-by":"crossref","first-page":"520","DOI":"10.1080\/01621459.1993.10476303","volume":"88","author":"M. P. Wand","year":"1993","unstructured":"Wand, M. P., & Jones, M. C. (1993). Comparison of smoothing parameterizations in bivariate kernel density estimation. Journal of the American Statistical Association, 88, 520\u2013528.","journal-title":"Journal of the American Statistical Association"}],"container-title":["Machine Learning"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10994-012-5298-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s10994-012-5298-3\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10994-012-5298-3","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,6,29]],"date-time":"2019-06-29T07:37:10Z","timestamp":1561793830000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s10994-012-5298-3"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,5,31]]},"references-count":28,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2013,1]]}},"alternative-id":["5298"],"URL":"https:\/\/doi.org\/10.1007\/s10994-012-5298-3","relation":{},"ISSN":["0885-6125","1573-0565"],"issn-type":[{"value":"0885-6125","type":"print"},{"value":"1573-0565","type":"electronic"}],"subject":[],"published":{"date-parts":[[2012,5,31]]}}}