{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,22]],"date-time":"2025-12-22T22:13:14Z","timestamp":1766441594011,"version":"3.40.3"},"publisher-location":"Cham","reference-count":33,"publisher":"Springer Nature Switzerland","isbn-type":[{"type":"print","value":"9783031789762"},{"type":"electronic","value":"9783031789779"}],"license":[{"start":{"date-parts":[[2025,1,1]],"date-time":"2025-01-01T00:00:00Z","timestamp":1735689600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,1,28]],"date-time":"2025-01-28T00:00:00Z","timestamp":1738022400000},"content-version":"vor","delay-in-days":27,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>The escalating climate crisis demands urgent action to mitigate the environmental impact of energy-intensive technologies, including Artificial Intelligence (AI). Lowering AI\u2019s environmental impact requires adopting energy-efficient approaches for training Deep Neural Networks (DNNs). One such approach is to use Dataset Pruning (DP) methods to reduce the number of training instances, and thus the total energy consumed. Numerous DP methods have been proposed in the literature (e.g., GraNd and Craig), with the ultimate aim of speeding up model training. On the other hand, Active Learning (AL) approaches, originally conceived to repeatedly select the best data to be labeled by a human expert (from a large collection of unlabeled data), can be exploited as well to train a model on a relatively small subset of (informative) examples. 
However, despite allowing for reducing the total amount of training data, most DP methods and pure AL-based schemes entail costly computations that may strongly limit their energy saving potential. In this work, we empirically study the effectiveness of DP and AL methods in curbing energy consumption in DNN training, and propose a novel approach to DNN learning, named <jats:italic>Play it straight<\/jats:italic>, which efficiently combines data selection methods and AL-like incremental training. <jats:italic>Play it straight<\/jats:italic> is shown to outperform traditional DP and AL approaches, achieving a better trade-off between accuracy and energy efficiency.<\/jats:p>","DOI":"10.1007\/978-3-031-78977-9_5","type":"book-chapter","created":{"date-parts":[[2025,1,27]],"date-time":"2025-01-27T10:13:25Z","timestamp":1737972805000},"page":"69-85","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["Play it Straight: An Intelligent Data Pruning Technique for\u00a0Green-AI"],"prefix":"10.1007","author":[{"ORCID":"https:\/\/orcid.org\/0009-0007-5224-0910","authenticated-orcid":false,"given":"Francesco","family":"Scala","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4164-940X","authenticated-orcid":false,"given":"Sergio","family":"Flesca","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4513-0362","authenticated-orcid":false,"given":"Luigi","family":"Pontieri","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2025,1,28]]},"reference":[{"key":"5_CR1","unstructured":"Ash, J.T., Goel, S., Krishnamurthy, A., Kakade, S.M.: Gone fishing: neural active learning with fisher embeddings. 
In: Ranzato, M., Beygelzimer, A., Dauphin, Y.N., Liang, P., Vaughan, J.W. (eds.) Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, 6\u201314 December 2021, virtual, pp. 8927\u20138939 (2021)"},{"key":"5_CR2","unstructured":"Ayed, F., Hayou, S.: Data pruning and neural scaling laws: fundamental limitations of score-based algorithms (2023)"},{"key":"5_CR3","unstructured":"Chang, H.-S., Learned-Miller, E.G., McCallum, A.: Active bias: training more accurate neural networks by emphasizing high variance samples. In: Neural Information Processing Systems (2017)"},{"key":"5_CR4","unstructured":"Courty, B., et al.: mlco2\/codecarbon: v2.4.1 (2024)"},{"issue":"10","key":"5_CR5","doi-asserted-by":"publisher","first-page":"2191","DOI":"10.1016\/j.joule.2023.09.004","volume":"7","author":"A de Vries","year":"2023","unstructured":"de Vries, A.: The growing energy footprint of artificial intelligence. Joule 7(10), 2191\u20132194 (2023)","journal-title":"Joule"},{"key":"5_CR6","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2022.116936","volume":"200","author":"S Flesca","year":"2022","unstructured":"Flesca, S., Scala, F., Vocaturo, E., Zumpano, F.: On forecasting non-renewable energy production with uncertainty quantification: a case study of the Italian energy market. Expert Syst. Appl. 200, 116936 (2022)","journal-title":"Expert Syst. Appl."},{"key":"5_CR7","doi-asserted-by":"publisher","first-page":"75","DOI":"10.1016\/j.jpdc.2019.07.007","volume":"134","author":"E Garcia-Martin","year":"2019","unstructured":"Garcia-Martin, E., Rodrigues, C.F., Riley, G., Grahn, H.: Estimation of energy consumption in machine learning. J. Parallel Distrib. Comput. 134, 75\u201388 (2019)","journal-title":"J. Parallel Distrib. Comput."},{"key":"5_CR8","doi-asserted-by":"crossref","unstructured":"Guo, C., Zhao, B., Bai, Y.: Deepcore: a comprehensive library for coreset selection in deep learning. 
In: Database and Expert Systems Applications: 33rd International Conference, DEXA 2022, Vienna, Austria, 22\u201324 August 2022, Proceedings, Part I, pp. 181\u2013195. Springer, Heidelberg (2022)","DOI":"10.1007\/978-3-031-12423-5_14"},{"key":"5_CR9","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition, pp. 770\u2013778 (2016)","DOI":"10.1109\/CVPR.2016.90"},{"key":"5_CR10","unstructured":"Iyer, R., Khargoankar, N., Bilmes, J., Asanani, H.: Submodular combinatorial information measures with applications in machine learning. In: Feldman, V., Ligett, K., Sabato, S. (eds.) Proceedings of the 32nd International Conference on Algorithmic Learning Theory. Proceedings of Machine Learning Research, vol. 132, pp. 722\u2013754. PMLR, 16\u201319 Mar 2021 (2021)"},{"key":"5_CR11","unstructured":"Katharopoulos, A., Fleuret, F.: Not all samples are created equal: deep learning with importance sampling. In: Dy, J., Krause, A. (eds.) Proceedings of the 35th International Conference on Machine Learning. Proceedings of Machine Learning Research, vol. 80, pp. 2525\u20132534. PMLR, 10\u201315 July 2018 (2018)"},{"key":"5_CR12","unstructured":"Krizhevsky, A., Nair, V., Hinton, G.: CIFAR-10 (Canadian institute for advanced research)"},{"key":"5_CR13","unstructured":"Krizhevsky, A., Nair, V., Hinton, G.: CIFAR-100 (Canadian institute for advanced research)"},{"key":"5_CR14","unstructured":"Loo, N., Hasani, R., Amini, A., Rus, D.: Efficient dataset distillation using random feature approximation. In: Oh, A.H., Agarwal, A., Belgrave, D., Cho, K. (eds.) Advances in Neural Information Processing Systems (2022)"},{"key":"5_CR15","unstructured":"Mirzasoleiman, B., Bilmes, J., Leskovec, J.: Coresets for data-efficient training of machine learning models. In: III, H.D., Singh, A. (eds.) 
Proceedings of the 37th International Conference on Machine Learning. Proceedings of Machine Learning Research, vol. 119, pp. 6950\u20136960. PMLR, 13\u201318 July 2020 (2020)"},{"key":"5_CR16","unstructured":"Okanovic, P., et al.: Repeated random sampling for minimizing the time-to-accuracy of learning. In: The Twelfth International Conference on Learning Representations (2024)"},{"key":"5_CR17","unstructured":"Park, D., Papailiopoulos, D., Lee, K.: Active learning is a strong baseline for data subset selection. In: Has it Trained Yet? NeurIPS 2022 Workshop (2022)"},{"key":"5_CR18","unstructured":"Patterson, D.A., et al.: Carbon emissions and large neural network training. arXiv, abs\/2104.10350 (2021)"},{"key":"5_CR19","unstructured":"Paul, M., Ganguli, S., Dziugaite, G.K.: Deep learning on a data diet: finding important examples early in training. In: Beygelzimer, A., Dauphin, Y., Liang, P., Vaughan, J.W. (eds.) Advances in Neural Information Processing Systems (2021)"},{"key":"5_CR20","doi-asserted-by":"crossref","unstructured":"Quercia, A., Morrison, A., Scharr, H., Assent, I.: SGD biased towards early important samples for efficient training. In: 2023 IEEE International Conference on Data Mining (ICDM), pp. 1289\u20131294 (2023)","DOI":"10.1109\/ICDM58522.2023.00163"},{"key":"5_CR21","unstructured":"Ruder, S.: An overview of gradient descent optimization algorithms (2017)"},{"key":"5_CR22","unstructured":"Sachdeva, N., McAuley, J.J.: Data distillation: a survey. CoRR, abs\/2301.04272 (2023)"},{"key":"5_CR23","doi-asserted-by":"crossref","unstructured":"Salehi, S., Schmeink, A.: Is active learning green? An empirical study. In: 2023 IEEE International Conference on Big Data (BigData), pp. 3823\u20133829, Los Alamitos, CA, USA. IEEE Computer Society (2023)","DOI":"10.1109\/BigData59044.2023.10386411"},{"key":"5_CR24","unstructured":"Scala, F., Flesca, S., Pontieri, L.: Data filtering for a sustainable model training. 
In: Proceedings of the 32nd Symposium of Advanced Database Systems, Villasimius, Italy, June 23rd to 26th, 2024. CEUR Workshop Proceedings, vol. 3741, pp. 205\u2013216. CEUR-WS.org (2024)"},{"key":"5_CR25","unstructured":"Schaul, T., Quan, J., Antonoglou, I., Silver, D.: Prioritized experience replay (2016)"},{"key":"5_CR26","unstructured":"Sener, O., Savarese, S.: Active learning for convolutional neural networks: a core-set approach. arXiv preprint arXiv:1708.00489 (2017)"},{"key":"5_CR27","unstructured":"Settles, B.: Active learning literature survey. Technical report, University of Wisconsin-Madison Department of Computer Sciences (2009)"},{"key":"5_CR28","doi-asserted-by":"crossref","unstructured":"Strubell, E., Ganesh, A., McCallum, A.: Energy and policy considerations for deep learning in NLP. In: Korhonen, A., Traum, D.R., M\u00e0rquez, L. (eds.) Proceedings of the 57th Conference of the Association for Computational Linguistics, ACL 2019, Florence, Italy, 28 July\u20132 August 2019, Volume 1: Long Papers, pp. 3645\u20133650. Association for Computational Linguistics (2019)","DOI":"10.18653\/v1\/P19-1355"},{"key":"5_CR29","doi-asserted-by":"crossref","unstructured":"Wang, K., et al.: Cafe learning to condense dataset by aligning features. In: Proceedings of the 2022 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 12186\u201312195, United States (2022)","DOI":"10.1109\/CVPR52688.2022.01188"},{"key":"5_CR30","unstructured":"Xu, J., Zhou, W., Fu, Z., Zhou, H., Li, L.: A survey on green deep learning. arXiv, abs\/2111.05193 (2021)"},{"key":"5_CR31","unstructured":"Yang, Z., Yang, H., Majumder, S., Cardoso, J., Gallego, G.: Data pruning can do more: a comprehensive data pruning approach for object re-identification. Trans. Mach. Learn. Res. 
(2024)"},{"issue":"01","key":"5_CR32","doi-asserted-by":"publisher","first-page":"150","DOI":"10.1109\/TPAMI.2023.3323376","volume":"46","author":"R Yu","year":"2024","unstructured":"Yu, R., Liu, S., Wang, X.: Dataset distillation: a comprehensive review. IEEE Trans. Pattern Anal. Mach. Intell. 46(01), 150\u2013170 (2024)","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"5_CR33","unstructured":"Zhao, B., Mopuri, K.R., Bilen, H.: Dataset condensation with gradient matching. In: International Conference on Learning Representations (2021)"}],"container-title":["Lecture Notes in Computer Science","Discovery Science"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/978-3-031-78977-9_5","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,1,27]],"date-time":"2025-01-27T10:13:47Z","timestamp":1737972827000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/978-3-031-78977-9_5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025]]},"ISBN":["9783031789762","9783031789779"],"references-count":33,"URL":"https:\/\/doi.org\/10.1007\/978-3-031-78977-9_5","relation":{},"ISSN":["0302-9743","1611-3349"],"issn-type":[{"type":"print","value":"0302-9743"},{"type":"electronic","value":"1611-3349"}],"subject":[],"published":{"date-parts":[[2025]]},"assertion":[{"value":"28 January 2025","order":1,"name":"first_online","label":"First Online","group":{"name":"ChapterHistory","label":"Chapter History"}},{"value":"DS","order":1,"name":"conference_acronym","label":"Conference Acronym","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"International Conference on Discovery Science","order":2,"name":"conference_name","label":"Conference Name","group":{"name":"ConferenceInfo","label":"Conference 
Information"}},{"value":"Pisa","order":3,"name":"conference_city","label":"Conference City","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"Italy","order":4,"name":"conference_country","label":"Conference Country","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"2024","order":5,"name":"conference_year","label":"Conference Year","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"14 October 2024","order":7,"name":"conference_start_date","label":"Conference Start Date","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"16 October 2024","order":8,"name":"conference_end_date","label":"Conference End Date","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"27","order":9,"name":"conference_number","label":"Conference Number","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"dis2024","order":10,"name":"conference_id","label":"Conference ID","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"http:\/\/ds2024.isti.cnr.it\/index.html","order":11,"name":"conference_url","label":"Conference URL","group":{"name":"ConferenceInfo","label":"Conference Information"}}]}}