{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,9]],"date-time":"2026-05-09T23:08:46Z","timestamp":1778368126235,"version":"3.51.4"},"publisher-location":"Cham","reference-count":27,"publisher":"Springer Nature Switzerland","isbn-type":[{"value":"9783032191045","type":"print"},{"value":"9783032191052","type":"electronic"}],"license":[{"start":{"date-parts":[[2026,1,1]],"date-time":"2026-01-01T00:00:00Z","timestamp":1767225600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T00:00:00Z","timestamp":1777593600000},"content-version":"vor","delay-in-days":120,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2026]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>The labeling process for supervised learning is costly and time-consuming, and is often impractical to scale due to real-world constraints. Active learning (AL) addresses this challenge by strategically selecting representative and informative data points to reduce labeling efforts. This paper focuses on an AL scenario in which only a very limited number of labels can be acquired. We propose an algorithm operating in two phases: (1) an exploration phase that prioritizes representative and diverse data points using density-driven criteria, and (2) an exploitation phase that combines predictive uncertainty with density weighting to select informative samples from densely populated regions. This enhances both representativeness and informativeness. Our results demonstrate significant improvements in model quality compared to other algorithms typically employed for this scenario, across various scenarios involving imbalanced data in classification tasks and skewness in regression tasks. Through this work, we aim to provide a new algorithm for this scenario and investigate general principles for AL. While most AL studies focus on either classification or regression, our work applies the algorithms to both. Therefore, we can analyze the differences between classification and regression problems and their effects on AL strategies. Furthermore, we explore different categories of AL criteria and their effectiveness in the low-budget regime. These results also provide insight into the cold-start problem, which involves selecting an initial labeled set and is faced by many model-based AL methods.<\/jats:p>","DOI":"10.1007\/978-3-032-19105-2_1","type":"book-chapter","created":{"date-parts":[[2026,5,9]],"date-time":"2026-05-09T22:12:31Z","timestamp":1778364751000},"page":"5-21","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Low Query Budget Active Learning for\u00a0Classification and\u00a0Regression"],"prefix":"10.1007","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-8362-5369","authenticated-orcid":false,"given":"Bjarne","family":"Jaster","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4204-4506","authenticated-orcid":false,"given":"Alaa","family":"Tharwat","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0000-2391-8330","authenticated-orcid":false,"given":"Eiram Mahera","family":"Sheikh","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0002-9374-0720","authenticated-orcid":false,"given":"Martin","family":"Kohlhase","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3300-2048","authenticated-orcid":false,"given":"Wolfram","family":"Schenck","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2026,5,1]]},"reference":[{"key":"1_CR1","doi-asserted-by":"publisher","first-page":"36","DOI":"10.1145\/1964897.1964906","volume":"12","author":"J Attenberg","year":"2011","unstructured":"Attenberg, J., Provost, F.: Inactive learning? Difficulties employing active learning in practice. ACM SIGKDD Explor. Newsl. 12, 36\u201341 (2011). https:\/\/doi.org\/10.1145\/1964897.1964906","journal-title":"ACM SIGKDD Explor. Newsl."},{"issue":"8","key":"1_CR2","doi-asserted-by":"publisher","first-page":"1053","DOI":"10.1093\/aob\/mcn050","volume":"101","author":"T Fourcaud","year":"2008","unstructured":"Fourcaud, T., Zhang, X., Stokes, A., Lambers, H., K\u00f6rner, C.: Plant growth modelling and applications: the increasing importance of plant architecture in growth models. Ann. Botany 101(8), 1053\u20131063 (2008). https:\/\/doi.org\/10.1093\/aob\/mcn050","journal-title":"Ann. Botany"},{"key":"1_CR3","doi-asserted-by":"publisher","unstructured":"Hacohen, G., Dekel, A., Weinshall, D.: Active learning on a budget: opposite strategies suit high and low budgets. In: Proceedings of the 39th International Conference on Machine Learning, vol.\u00a0162, pp. 8175\u20138195. PMLR (2022). https:\/\/doi.org\/10.48550\/arXiv.2202.02794","DOI":"10.48550\/arXiv.2202.02794"},{"issue":"5","key":"1_CR4","doi-asserted-by":"publisher","first-page":"1169","DOI":"10.3233\/IDA-205393","volume":"25","author":"D He","year":"2021","unstructured":"He, D., Yu, H., Wang, G., Li, J.: A two-stage clustering-based cold-start method for active learning. Intell. Data Anal. 25(5), 1169\u20131185 (2021)","journal-title":"Intell. Data Anal."},{"key":"1_CR5","doi-asserted-by":"publisher","unstructured":"Jaster, B., Kohlhase, M.: Active learning for regression problems with ensemble methods. In: Proceedings - 33. Workshop Computational Intelligence, pp. 9\u201329. Karlsruher Institut f\u00fcr Technologie (KIT) (2023). https:\/\/doi.org\/10.5445\/KSP\/1000162754","DOI":"10.5445\/KSP\/1000162754"},{"key":"1_CR6","doi-asserted-by":"publisher","unstructured":"Jose, A., de\u00a0Mendon\u00e7a, J.P.A., Devijver, E., Jakse, N., Monbet, V., Poloni, R.: Regression tree-based active learning. Data Min. Knowl. Discov. (2023). https:\/\/doi.org\/10.1007\/s10618-023-00951-7","DOI":"10.1007\/s10618-023-00951-7"},{"key":"1_CR7","unstructured":"Kelly, M., Longjohn, R., Nottingham, K.: The UCI machine learning repository. https:\/\/archive.ics.uci.edu"},{"key":"1_CR8","doi-asserted-by":"publisher","first-page":"913","DOI":"10.1007\/s11390-020-9487-4","volume":"35","author":"P Kumar","year":"2020","unstructured":"Kumar, P., Gupta, A.: Active learning query strategies for classification, regression, and clustering: a survey. J. Comput. Sci. Technol. 35, 913\u2013945 (2020). https:\/\/doi.org\/10.1007\/s11390-020-9487-4","journal-title":"J. Comput. Sci. Technol."},{"key":"1_CR9","doi-asserted-by":"publisher","unstructured":"Lewis, D.D., Gale, W.A.: A Sequential Algorithm for Training Text Classifiers, pp. 3\u201312. Springer, London (1994). https:\/\/doi.org\/10.1007\/978-1-4471-2099-5_1","DOI":"10.1007\/978-1-4471-2099-5_1"},{"key":"1_CR10","doi-asserted-by":"publisher","unstructured":"Liu, Z., Jiang, X., Luo, H., Fang, W., Liu, J., Wu, D.: Pool-based unsupervised active learning for regression using iterative representativeness-diversity maximization (IRDM). Pattern Recogn. Lett. 142, 11\u201319 (2021). https:\/\/doi.org\/10.1016\/j.patrec.2020.11.019","DOI":"10.1016\/j.patrec.2020.11.019"},{"key":"1_CR11","unstructured":"MacQueen, J.: Some methods for classification and analysis of multivariate observations. In: Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, Volume 1: Statistics, vol.\u00a05, pp. 281\u2013298. University of California Press (1967)"},{"key":"1_CR12","doi-asserted-by":"publisher","unstructured":"Rezazadeh, F., Abrishambaf, A., D\u00fcrrbaum, A., Zimmermann, G., Kroll, A.: Holistic modeling of ultra-high performance concrete production process: Synergizing mix design, fresh concrete properties, and curing conditions. In: Proceedings - 33. Workshop Computational Intelligence, pp. 215\u2013238. Karlsruher Institut f\u00fcr Technologie (KIT) (2023). https:\/\/doi.org\/10.5445\/KSP\/1000162754","DOI":"10.5445\/KSP\/1000162754"},{"key":"1_CR13","doi-asserted-by":"crossref","unstructured":"Riis, C., Antunes, F., Boe, F., Carlos, H., Azevedo, L., Pereira, F.C.: Bayesian active learning with fully Bayesian gaussian processes. In: Advances in Neural Information Processing Systems, pp. 12141\u201312153. Curran Associates, Inc. (2022)","DOI":"10.52202\/068431-0882"},{"key":"1_CR14","unstructured":"Roy, N., McCallum, A.: Toward optimal active learning through sampling estimation of error reduction. In: ICML, vol.\u00a01, p.\u00a05. Citeseer (2001)"},{"key":"1_CR15","doi-asserted-by":"publisher","unstructured":"Sch\u00f6ne, M., Jaster, B., B\u00fcltemeier, J., K\u00f6sters, J., Holst, C.A., Kohlhase, M.: Pool-based active learning with decision trees: incorporate the tree structure to explore and exploit. In: 2025 IEEE Symposium on Trustworthy, Explainable and Responsible Computational Intelligence (CITREx), pp.\u00a01\u20139. IEEE (2025). https:\/\/doi.org\/10.1109\/CITREx64975.2025.10974940","DOI":"10.1109\/CITREx64975.2025.10974940"},{"key":"1_CR16","unstructured":"Settles, B.: Active learning literature survey. Computer Sciences Technical Report\u00a01648, University of Wisconsin-Madison Department of Computer Sciences (2009)"},{"key":"1_CR17","doi-asserted-by":"crossref","unstructured":"Seung, H.S., Opper, M., Sompolinsky, H.: Query by committee. In: Proceedings of the Fifth Annual Workshop on Computational Learning Theory, pp. 287\u2013294 (1992)","DOI":"10.1145\/130385.130417"},{"key":"1_CR18","doi-asserted-by":"publisher","first-page":"164","DOI":"10.1007\/s10618-016-0460-3","volume":"31","author":"M Sharma","year":"2017","unstructured":"Sharma, M., Bilgic, M.: Evidence-based uncertainty sampling for active learning. Data Min. Knowl. Disc. 31, 164\u2013202 (2017)","journal-title":"Data Min. Knowl. Disc."},{"key":"1_CR19","unstructured":"Shui, C., Zhou, F., Gagn\u00e9, C., Wang, B.: Deep active learning: unified and principled method for query and training. In: Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, pp. 1308\u20131318. PMLR (2020). https:\/\/proceedings.mlr.press\/v108\/shui20a.html"},{"issue":"3","key":"1_CR20","doi-asserted-by":"publisher","first-page":"197","DOI":"10.1504\/IJAPR.2016.079733","volume":"3","author":"A Tharwat","year":"2016","unstructured":"Tharwat, A.: Principal component analysis-a tutorial. Int. J. Appl. Pattern Recogn. 3(3), 197\u2013240 (2016). https:\/\/doi.org\/10.1504\/IJAPR.2016.079733","journal-title":"Int. J. Appl. Pattern Recogn."},{"key":"1_CR21","doi-asserted-by":"publisher","unstructured":"Tharwat, A., Schenck, W.: Balancing exploration and exploitation: a novel active learner for imbalanced data. Knowl.-Based Syst. 210, 106500 (2020). https:\/\/doi.org\/10.1016\/j.knosys.2020.106500","DOI":"10.1016\/j.knosys.2020.106500"},{"key":"1_CR22","doi-asserted-by":"publisher","unstructured":"Tharwat, A., Schenck, W.: A survey on active learning: State-of-the-art, practical challenges and research directions. Mathematics 11, 820 (2023). https:\/\/doi.org\/10.3390\/math11040820","DOI":"10.3390\/math11040820"},{"key":"1_CR23","doi-asserted-by":"publisher","unstructured":"Tharwat, A., Schenck, W.: Using methods from dimensionality reduction for active learning with low query budget. IEEE Trans. Knowl. Data Eng. 36(8), 4317\u20134330 (2024). https:\/\/doi.org\/10.1109\/TKDE.2024.3365189","DOI":"10.1109\/TKDE.2024.3365189"},{"key":"1_CR24","doi-asserted-by":"publisher","first-page":"305","DOI":"10.1016\/j.eswa.2017.05.046","volume":"85","author":"M Wang","year":"2017","unstructured":"Wang, M., Min, F., Zhang, Z.H., Wu, Y.X.: Active learning through density clustering. Expert Syst. Appl. 85, 305\u2013317 (2017)","journal-title":"Expert Syst. Appl."},{"key":"1_CR25","doi-asserted-by":"publisher","first-page":"90","DOI":"10.1016\/j.ins.2018.09.060","volume":"474","author":"D Wu","year":"2019","unstructured":"Wu, D., Lin, C.T., Huang, J.: Active learning for regression using greedy sampling. Inf. Sci. 474, 90\u2013105 (2019). https:\/\/doi.org\/10.1016\/j.ins.2018.09.060","journal-title":"Inf. Sci."},{"key":"1_CR26","doi-asserted-by":"crossref","unstructured":"Yu, K., Bi, J., Tresp, V.: Active learning via transductive experimental design. In: Proceedings of the 23rd International Conference on Machine Learning, pp. 1081\u20131088 (2006)","DOI":"10.1145\/1143844.1143980"},{"key":"1_CR27","unstructured":"Zhao, Z., Jiang, Y., Chen, Y.: Direct acquisition optimization for low-budget active learning. arXiv preprint arXiv:2402.06045 (2024)"}],"container-title":["Communications in Computer and Information Science","Machine Learning and Principles and Practice of Knowledge Discovery in Databases"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/978-3-032-19105-2_1","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,5,9]],"date-time":"2026-05-09T22:12:34Z","timestamp":1778364754000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/978-3-032-19105-2_1"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026]]},"ISBN":["9783032191045","9783032191052"],"references-count":27,"URL":"https:\/\/doi.org\/10.1007\/978-3-032-19105-2_1","relation":{},"ISSN":["1865-0929","1865-0937"],"issn-type":[{"value":"1865-0929","type":"print"},{"value":"1865-0937","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026]]},"assertion":[{"value":"1 May 2026","order":1,"name":"first_online","label":"First Online","group":{"name":"ChapterHistory","label":"Chapter History"}},{"value":"The authors have no competing interests to declare that are relevant to the content of this article.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Disclosure of Interests"}},{"value":"ECML PKDD","order":1,"name":"conference_acronym","label":"Conference Acronym","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"Joint European Conference on Machine Learning and Knowledge Discovery in Databases","order":2,"name":"conference_name","label":"Conference Name","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"Porto","order":3,"name":"conference_city","label":"Conference City","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"Portugal","order":4,"name":"conference_country","label":"Conference Country","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"2025","order":5,"name":"conference_year","label":"Conference Year","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"15 September 2025","order":7,"name":"conference_start_date","label":"Conference Start Date","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"19 September 2025","order":8,"name":"conference_end_date","label":"Conference End Date","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"ecml2025","order":10,"name":"conference_id","label":"Conference ID","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"https:\/\/ecmlpkdd.org\/2025\/","order":11,"name":"conference_url","label":"Conference URL","group":{"name":"ConferenceInfo","label":"Conference Information"}}]}}