{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,6]],"date-time":"2026-01-06T02:17:30Z","timestamp":1767665850099,"version":"3.37.3"},"reference-count":39,"publisher":"Springer Science and Business Media LLC","issue":"6","license":[{"start":{"date-parts":[[2021,5,4]],"date-time":"2021-05-04T00:00:00Z","timestamp":1620086400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,5,4]],"date-time":"2021-05-04T00:00:00Z","timestamp":1620086400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100012687","name":"Universit\u00e4t Kassel","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100012687","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Mach Learn"],"published-print":{"date-parts":[[2021,6]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Gathering labeled data to train well-performing machine learning models is one of the critical challenges in many applications. Active learning aims at reducing the labeling costs by an efficient and effective allocation of costly labeling resources. In this article, we propose a decision-theoretic selection strategy that (1) directly optimizes the gain in misclassification error, and (2) uses a Bayesian approach by introducing a conjugate prior distribution to determine the class posterior to deal with uncertainties. By reformulating existing selection strategies within our proposed model, we can explain which aspects are not covered in current state-of-the-art and why this leads to the superior performance of our approach. Extensive experiments on a large variety of datasets and different kernels validate our claims.<\/jats:p>","DOI":"10.1007\/s10994-021-05986-9","type":"journal-article","created":{"date-parts":[[2021,5,4]],"date-time":"2021-05-04T21:07:50Z","timestamp":1620162470000},"page":"1199-1231","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":14,"title":["Toward optimal probabilistic active learning using a Bayesian approach"],"prefix":"10.1007","volume":"110","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7870-6033","authenticated-orcid":false,"given":"Daniel","family":"Kottke","sequence":"first","affiliation":[]},{"given":"Marek","family":"Herde","sequence":"additional","affiliation":[]},{"given":"Christoph","family":"Sandrock","sequence":"additional","affiliation":[]},{"given":"Denis","family":"Huseljic","sequence":"additional","affiliation":[]},{"given":"Georg","family":"Krempl","sequence":"additional","affiliation":[]},{"given":"Bernhard","family":"Sick","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2021,5,4]]},"reference":[{"key":"5986_CR1","first-page":"255","volume":"5","author":"Y Baram","year":"2004","unstructured":"Baram, Y., Yaniv, R. E., & Luz, K. (2004). Online choice of active learning algorithms. Journal of Machine Learning Research, 5, 255\u2013291.","journal-title":"Journal of Machine Learning Research"},{"key":"5986_CR2","doi-asserted-by":"crossref","unstructured":"Beyer, C., Krempl, G., & Lemaire, V. (2015). How to select information that matters: A comparative study on active learning strategies for classification. In Proceedings of the 15th international conference on knowledge technologies and data-driven business, association for computing machinery, i-KNOW \u201915, New York, NY, USA.","DOI":"10.1145\/2809563.2809594"},{"key":"5986_CR3","volume-title":"Pattern recognition and machine learning","author":"CM Bishop","year":"2006","unstructured":"Bishop, C. M. (2006). Pattern recognition and machine learning. Springer."},{"key":"5986_CR4","doi-asserted-by":"crossref","unstructured":"Bondu, A., Lemaire, V., & Boull\u00e9, M. (2010). Exploration vs. exploitation in active learning: A Bayesian approach. In International joint conference on neural networks (IJCNN) (pp. 1\u20137). IEEE.","DOI":"10.1109\/IJCNN.2010.5596815"},{"key":"5986_CR5","unstructured":"Brinker, K. (2003). Incorporating diversity in active learning with support vector machines. In Proceedings of the 20th international conference on machine learning (ICML) (pp. 59\u201366)."},{"key":"5986_CR6","doi-asserted-by":"publisher","first-page":"13","DOI":"10.1016\/j.ins.2018.04.063","volume":"456","author":"A Calma","year":"2018","unstructured":"Calma, A., Reitmaier, T., & Sick, B. (2018). Semi-supervised active learning for support vector machines: A novel approach that exploits structure information in data. Information Sciences, 456, 13\u201333.","journal-title":"Information Sciences"},{"key":"5986_CR7","unstructured":"Chapelle, O. (2005). Active learning for parzen window classifier. In Proceedings of the 10th international workshop on artificial intelligence and statistics (AISTATS) (Vol. 5, pp. 49\u201356)."},{"key":"5986_CR8","doi-asserted-by":"crossref","unstructured":"Chaudhuri, A., Kakde, D., Sadek, C., Gonzalez, L., & Kong, S. (2017). The mean and median criteria for kernel bandwidth selection for support vector data description. In International conference on data mining workshops (ICDMW) (pp. 842\u2013849). IEEE.","DOI":"10.1109\/ICDMW.2017.116"},{"key":"5986_CR9","unstructured":"Cuong, N. V., Lee, W. S., & Ye, N. (2014). Near-optimal adaptive pool-based active learning with general loss. In Proceedings of the 30th conference on uncertainty in artificial intelligence (UAI) (pp. 122\u2013131)."},{"key":"5986_CR10","doi-asserted-by":"crossref","unstructured":"Dasgupta, S. (2009). The two faces of active learning. In International conference on discovery science (pp. 35\u201335). Springer.","DOI":"10.1007\/978-3-642-04747-3_5"},{"key":"5986_CR11","doi-asserted-by":"crossref","unstructured":"Donmez, P., Carbonell, J. G., & Bennett, P. N. (2007). Dual strategy active learning. In Proceedings of the European conference on machine learning (ECML) (pp. 116\u2013127). Springer.","DOI":"10.1007\/978-3-540-74958-5_14"},{"key":"5986_CR12","unstructured":"Golovin, D., & Krause, A. (2010). Adaptive submodularity: A new approach to active learning and stochastic optimization. In Proceedings of the 23rd conference on algorithmic learning theory (ALT) (pp. 333\u2013345)."},{"key":"5986_CR13","unstructured":"Guillory, A., & Bilmes, J. (2010). Interactive submodular set cover. In Proceedings of the 27th International conference on machine learning (ICML)."},{"key":"5986_CR14","doi-asserted-by":"publisher","first-page":"840","DOI":"10.1016\/j.dib.2018.03.109","volume":"18","author":"J Hern\u00e1ndez-Gonz\u00e1lez","year":"2018","unstructured":"Hern\u00e1ndez-Gonz\u00e1lez, J., Rodriguez, D., Inza, I., Harrison, R., & Lozano, J. A. (2018). Two datasets of defect reports labeled by a crowd of annotators of unknown reliability. Data in Brief, 18, 840\u2013845.","journal-title":"Data in Brief"},{"key":"5986_CR15","unstructured":"Houlsby, N., Husz\u00e1r, F., Ghahramani, Z., & Lengyel, M. (2011). Bayesian active learning for classification and preference learning. arXiv:1112.5745 [stat.ML]."},{"key":"5986_CR16","doi-asserted-by":"crossref","unstructured":"Huang, K., & Lin, H. (2016). A novel uncertainty sampling algorithm for cost-sensitive multiclass active learning. In Proceedings of the 16th international conference on data mining (ICDM) (pp. 925\u2013930). IEEE.","DOI":"10.1109\/ICDM.2016.0114"},{"key":"5986_CR17","doi-asserted-by":"publisher","first-page":"79","DOI":"10.1016\/j.artint.2013.10.003","volume":"206","author":"F Hutter","year":"2014","unstructured":"Hutter, F., Xu, L., Hoos, H. H., & Leyton-Brown, K. (2014). Algorithm runtime prediction: Methods & evaluation. Artificial Intelligence, 206, 79\u2013111.","journal-title":"Artificial Intelligence"},{"key":"5986_CR18","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9780511921803","volume-title":"Evaluating learning algorithms: A classification perspective","author":"N Japkowicz","year":"2011","unstructured":"Japkowicz, N., & Shah, M. (2011). Evaluating learning algorithms: A classification perspective. Cambridge University Press."},{"key":"5986_CR19","unstructured":"Konyushkova, K., Sznitman, R., & Fua, P. (2018). Discovering general purpose active learning strategies. arXiv:1810.04114v2 [cs.LG]."},{"key":"5986_CR20","unstructured":"Kottke, D., Krempl, G., Lang, D., Teschner, J., & Spiliopoulou, M. (2016). Multi-class probabilistic active learning. In Proceedings of the European conference on artificial intelligence (ECAI) (pp. 586\u2013594). IOS Press."},{"key":"5986_CR21","doi-asserted-by":"crossref","unstructured":"Kottke, D., Herde, M., Minh, T. P., Benz, A., Mergard, P., Roghman, A., Sandrock, C., & Sick, B. (2021). scikit-activeml: A library and toolbox for active learning algorithms. Preprints, 2021030194.","DOI":"10.20944\/preprints202103.0194.v1"},{"issue":"2\u20133","key":"5986_CR22","doi-asserted-by":"publisher","first-page":"449","DOI":"10.1007\/s10994-015-5504-1","volume":"100","author":"G Krempl","year":"2015","unstructured":"Krempl, G., Kottke, D., & Lemaire, V. (2015). Optimised probabilistic active learning (OPAL). Machine Learning, 100(2\u20133), 449\u2013476.","journal-title":"Machine Learning"},{"key":"5986_CR23","doi-asserted-by":"crossref","unstructured":"Lewis, D. D., & Gale, W. A. (1994). A sequential algorithm for training text classifiers. In Proceedings of the 17th annual international conference on research and development in information retrieval (SIGIR) (pp. 3\u201312). Springer.","DOI":"10.1007\/978-1-4471-2099-5_1"},{"key":"5986_CR24","unstructured":"Murphy, K. P. (2006). Binomial and multinomial distributions. Technical report, University of British Columbia."},{"key":"5986_CR25","doi-asserted-by":"crossref","unstructured":"Nguyen, H. T., & Smeulders, A. (2004). Active learning using pre-clustering. In Proceedings of the 21st international conference on machine learning (ICML) (pp. 79\u201386). ACM Press.","DOI":"10.1145\/1015330.1015349"},{"key":"5986_CR26","doi-asserted-by":"crossref","unstructured":"Osugi, T., Kim, D., & Scott, S. (2005). Balancing exploration and exploitation: A new algorithm for active machine learning. In Proceedings of the 5th international conference on data mining (ICDM) (pp. 330\u2013337). IEEE.","DOI":"10.1109\/ICDM.2005.33"},{"key":"5986_CR27","unstructured":"Roy, N., & McCallum, A. (2001). Toward optimal active learning through Monte Carlo estimation of error reduction. In Proceedings of the 18th international conference on machine learning (ICML) (pp. 441\u2013448)."},{"key":"5986_CR28","unstructured":"Settles, B. (2009). Active learning literature survey. Technical report, University of Wisconsin-Madison Department of Computer Sciences."},{"key":"5986_CR29","volume-title":"Active learning. No.\u00a018 in Synthesis lectures on artificial intelligence and machine learning","author":"B Settles","year":"2012","unstructured":"Settles, B. (2012). Active learning. No.\u00a018 in Synthesis lectures on artificial intelligence and machine learning. Morgan and Claypool Publishers."},{"key":"5986_CR30","doi-asserted-by":"crossref","unstructured":"Seung, H. S., Opper, M., & Sompolinsky, H. (1992). Query by committee. In Proceedings of the 5th annual workshop on computational learning theory (COLT) (pp. 287\u2013294). ACM.","DOI":"10.1145\/130385.130417"},{"key":"5986_CR31","doi-asserted-by":"crossref","unstructured":"Shi, S., Liu, Y., Huang, Y., Zhu, S., & Liu, Y. (2008). Active learning for kNN based on bagging features. In Proceedings of the 4th international conference on natural computation (pp. 61\u201364), Jinan, China.","DOI":"10.1109\/ICNC.2008.868"},{"key":"5986_CR32","unstructured":"Thrun, S. B., & M\u00f6ller, K. (1992). Active exploration in dynamic environments. In Advances in neural information processing systems (pp. 531\u2013538)."},{"issue":"2","key":"5986_CR33","doi-asserted-by":"publisher","first-page":"49","DOI":"10.1145\/2641190.2641198","volume":"15","author":"J Vanschoren","year":"2013","unstructured":"Vanschoren, J., van Rijn, J. N., Bischl, B., & Torgo, L. (2013). Openml: Networked science in machine learning. SIGKDD Explorations, 15(2), 49\u201360.","journal-title":"SIGKDD Explorations"},{"key":"5986_CR34","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4757-2440-0","volume-title":"The nature of statistical learning theory","author":"VN Vapnik","year":"1995","unstructured":"Vapnik, V. N. (1995). The nature of statistical learning theory. Springer."},{"key":"5986_CR35","unstructured":"Wei, K., Iyer, R., & Bilmes, J. (2015). Submodularity in data subset selection and active learning. In Proceedings of the 32rd international conference on machine learning (ICML) (pp. 1954\u20131963)."},{"issue":"1","key":"5986_CR36","doi-asserted-by":"publisher","first-page":"103","DOI":"10.1007\/s00138-015-0731-9","volume":"27","author":"E Weigl","year":"2015","unstructured":"Weigl, E., Heidl, W., Lughofer, E., Radauer, T., & Eitzinger, C. (2015). On improving performance of surface inspection systems by online active learning and flexible classifier updates. Machine Vision and Applications, 27(1), 103\u2013127.","journal-title":"Machine Vision and Applications"},{"key":"5986_CR37","doi-asserted-by":"crossref","unstructured":"Xu, Z., Akella, R., & Zhang, Y. (2007). Incorporating diversity and density in active learning for relevance feedback. In Proceedings of the European conference on information retrieval (ECIR) (pp. 246\u2013257). Springer.","DOI":"10.1007\/978-3-540-71496-5_24"},{"key":"5986_CR38","doi-asserted-by":"crossref","unstructured":"Zoller, T., & Buhmann, J. M. (2000). Active learning for hierarchical pairwise data clustering. In Proceedings 15th international conference on pattern recognition (ICPR) (pp. 186\u2013189). IEEE.","DOI":"10.1109\/ICPR.2000.906044"},{"issue":"1","key":"5986_CR39","doi-asserted-by":"publisher","first-page":"27","DOI":"10.1109\/TNNLS.2012.2236570","volume":"25","author":"I \u017dliobait\u0117","year":"2014","unstructured":"\u017dliobait\u0117, I., Bifet, A., Pfahringer, B., & Holmes, G. (2014). Active learning with drifting streaming data. Transactions on Neural Networks and Learning Systems, 25(1), 27\u201339.","journal-title":"Transactions on Neural Networks and Learning Systems"}],"container-title":["Machine Learning"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10994-021-05986-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10994-021-05986-9\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10994-021-05986-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,6,18]],"date-time":"2021-06-18T15:11:52Z","timestamp":1624029112000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10994-021-05986-9"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,5,4]]},"references-count":39,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2021,6]]}},"alternative-id":["5986"],"URL":"https:\/\/doi.org\/10.1007\/s10994-021-05986-9","relation":{},"ISSN":["0885-6125","1573-0565"],"issn-type":[{"type":"print","value":"0885-6125"},{"type":"electronic","value":"1573-0565"}],"subject":[],"published":{"date-parts":[[2021,5,4]]},"assertion":[{"value":"20 November 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"8 March 2021","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"12 April 2021","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"4 May 2021","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}