{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T10:05:58Z","timestamp":1772186758884,"version":"3.50.1"},"reference-count":46,"publisher":"MDPI AG","issue":"13","license":[{"start":{"date-parts":[[2023,7,1]],"date-time":"2023-07-01T00:00:00Z","timestamp":1688169600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Programme Erasmus+, Knowledge Alliances","award":["621639-EPP-1-2020-1-IT-EPPKA2-KA"],"award-info":[{"award-number":["621639-EPP-1-2020-1-IT-EPPKA2-KA"]}]},{"name":"Programme Erasmus+, Knowledge Alliances","award":["PLANET4"],"award-info":[{"award-number":["PLANET4"]}]},{"name":"Programme Erasmus+, Knowledge Alliances","award":["DM MUR 1061\/2022"],"award-info":[{"award-number":["DM MUR 1061\/2022"]}]},{"name":"Ministry of University and Research (MUR)","award":["621639-EPP-1-2020-1-IT-EPPKA2-KA"],"award-info":[{"award-number":["621639-EPP-1-2020-1-IT-EPPKA2-KA"]}]},{"name":"Ministry of University and Research (MUR)","award":["PLANET4"],"award-info":[{"award-number":["PLANET4"]}]},{"name":"Ministry of University and Research (MUR)","award":["DM MUR 1061\/2022"],"award-info":[{"award-number":["DM MUR 1061\/2022"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>Small and medium-sized enterprises (SMEs) often encounter practical challenges and limitations when extracting valuable insights from the data of retrofitted or brownfield equipment. The existing literature fails to reflect the full reality and potential of data-driven analysis in current SME environments. In this paper, we provide an anonymized dataset obtained from two medium-sized companies leveraging a non-invasive and scalable data-collection procedure. The dataset comprises mainly power consumption machine data collected over a period of 7 months and 1 year from two medium-sized companies. Using this dataset, we demonstrate how machine learning (ML) techniques can enable SMEs to extract useful information even in the short term, even from a small variety of data types. We develop several ML models to address various tasks, such as power consumption forecasting, item classification, next machine state prediction, and item production count forecasting. By providing this anonymized dataset and showcasing its application through various ML use cases, our paper aims to provide practical insights for SMEs seeking to leverage ML techniques with their limited data resources. The findings contribute to a better understanding of how ML can be effectively utilized in extracting actionable insights from limited datasets, offering valuable implications for SMEs in practical settings.<\/jats:p>","DOI":"10.3390\/s23136078","type":"journal-article","created":{"date-parts":[[2023,7,3]],"date-time":"2023-07-03T00:53:16Z","timestamp":1688345596000},"page":"6078","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":8,"title":["Data-Driven Insights through Industrial Retrofitting: An Anonymized Dataset with Machine Learning Use Cases"],"prefix":"10.3390","volume":"23","author":[{"given":"Daniele","family":"Atzeni","sequence":"first","affiliation":[{"name":"Department of Computer Science, University of Pisa, 56126 Pisa, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Reshawn","family":"Ramjattan","sequence":"additional","affiliation":[{"name":"Department of Computer Science, University of Pisa, 56126 Pisa, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7208-6865","authenticated-orcid":false,"given":"Roberto","family":"Figli\u00e8","sequence":"additional","affiliation":[{"name":"Department of Computer Science, University of Pisa, 56126 Pisa, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Giacomo","family":"Baldi","sequence":"additional","affiliation":[{"name":"Zerynth, 56124 Pisa, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Daniele","family":"Mazzei","sequence":"additional","affiliation":[{"name":"Department of Computer Science, University of Pisa, 56126 Pisa, Italy"},{"name":"Zerynth, 56124 Pisa, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2023,7,1]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"1740015","DOI":"10.1142\/S1363919617400151","article-title":"Sustainable industrial value creation: Benefits and challenges of industry 4.0","volume":"21","author":"Kiel","year":"2017","journal-title":"Int. J. Innov. Manag."},{"key":"ref_2","unstructured":"Dassisti, M., Panetto, H., Lezoche, M., Merla, P., Semeraro, C., Giovannini, A., and Chimienti, M. (2017, January 12\u201315). Industry 4.0 paradigm: The viewpoint of the small and medium enterprises. Proceedings of the 7th International Conference on Information Society and Technology, ICIST 2017, Kopaonik, Serbia."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Pa\u015bko, \u0141., M\u0105dziel, M., Stadnicka, D., Dec, G., Carreras-Coch, A., Sol\u00e9-Beteta, X., Pappa, L., Stylios, C., Mazzei, D., and Atzeni, D. (2022). Plan and Develop Advanced Knowledge and Skills for Future Industrial Employees in the Field of Artificial Intelligence, Internet of Things and Edge Computing. Sustainability, 14.","DOI":"10.3390\/su14063312"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Dec, G., Stadnicka, D., Pa\u015bko, \u0141., M\u0105dziel, M., Figli\u00e8, R., Mazzei, D., Tyrovolas, M., Stylios, C., Navarro, J., and Sol\u00e9-Beteta, X. (2022). Role of academics in transferring knowledge and skills on artificial intelligence, internet of things and edge computing. Sensors, 22.","DOI":"10.3390\/s22072496"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"972","DOI":"10.1016\/j.promfg.2017.09.093","article-title":"Towards Industry 4.0: An overview of European strategic roadmaps","volume":"13","author":"Santos","year":"2017","journal-title":"Procedia Manuf."},{"key":"ref_6","unstructured":"Calenda, C. (2017). Piano Nazionale Industria 4.0, Ministero Dello Sviluppo Economico."},{"key":"ref_7","first-page":"1033","article-title":"Education and computer skill enhancing for Vietnam laborers under industry 4.0 and evfta agreement","volume":"20","author":"Huy","year":"2021","journal-title":"Ilkogr. Online"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"106193","DOI":"10.1016\/j.cie.2019.106193","article-title":"Cyber-physical production systems retrofitting in context of industry 4.0","volume":"139","author":"Lins","year":"2020","journal-title":"Comput. Ind. Eng."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Lins, T., Oliveira, R.A.R., Correia, L.H., and Silva, J.S. (2018, January 5\u20138). Industry 4.0 Retrofitting. Proceedings of the 2018 VIII Brazilian Symposium on Computing Systems Engineering (SBESC), Salvador, Brazil.","DOI":"10.1109\/SBESC.2018.00011"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"501","DOI":"10.1007\/s10845-008-0145-x","article-title":"Data mining in manufacturing: A review based on the kind of knowledge","volume":"20","author":"Choudhary","year":"2009","journal-title":"J. Intell. Manuf."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Strau\u00df, P., Schmitz, M., W\u00f6stmann, R., and Deuse, J. (2018, January 10\u201313). Enabling of predictive maintenance in the brownfield through low-cost sensors, an iiot-architecture and machine learning. Proceedings of the 2018 IEEE International Conference on Big Data (Big Data), Seattle, WA, USA.","DOI":"10.1109\/BigData.2018.8622076"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1016\/j.mfglet.2019.08.001","article-title":"Retrofitting old CNC turning with an accelerometer at a remote location towards Industry 4.0","volume":"21","author":"Herwan","year":"2019","journal-title":"Manuf. Lett."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.mfglet.2018.11.001","article-title":"Tool wear monitoring of a retrofitted CNC milling machine using artificial neural networks","volume":"19","author":"Hesser","year":"2019","journal-title":"Manuf. Lett."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"493","DOI":"10.1007\/s10845-021-01856-2","article-title":"Transformation of a rolling mill aggregate to a cyber physical production system: From sensor retrofitting to machine learning","volume":"33","author":"Ralph","year":"2022","journal-title":"J. Intell. Manuf."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"5499","DOI":"10.1109\/IECON.2019.8927799","article-title":"IoT energy retrofit and the connection of legacy machines inside the industry 4.0 concept","volume":"Volume 1","author":"Lima","year":"2019","journal-title":"Proceedings of the IECON 2019\u201445th Annual Conference of the IEEE Industrial Electronics Society"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Pereira, M.S., and Lima, F. (2018, January 21\u201323). A machine learning approach applied to energy prediction in job shop environments. Proceedings of the IECON 2018\u201444th Annual Conference of the IEEE Industrial Electronics Society, Washington, DC, USA.","DOI":"10.1109\/IECON.2018.8592763"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Selvaraj, V., and Min, S. (2023). Real-time fault identification system for a retrofitted ultra-precision CNC machine from equipment\u2019s power consumption data: A case study of an implementation. Int. J. Precis. Eng. Manuf.-Green Technol., 1\u201317.","DOI":"10.1007\/s40684-022-00497-x"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Silva, B., Sousa, J., and Alenya, G. (2021, January 18\u201320). Data acquisition and monitoring system for legacy injection machines. Proceedings of the 2021 IEEE International Conference on Computational Intelligence and Virtual Environments for Measurement Systems and Applications (CIVEMSA), Hong Kong, China.","DOI":"10.1109\/CIVEMSA52099.2021.9493675"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Polenta, A., Tomassini, S., Falcionelli, N., Contardo, P., Dragoni, A.F., and Sernani, P. (2022). A Comparison of Machine Learning Techniques for the Quality Classification of Molded Products. Information, 13.","DOI":"10.3390\/info13060272"},{"key":"ref_20","unstructured":"Bubeck, S., Chandrasekaran, V., Eldan, R., Gehrke, J., Horvitz, E., Kamar, E., Lee, P., Lee, Y.T., Li, Y., and Lundberg, S. (2023). Sparks of artificial general intelligence: Early experiments with gpt-4. arXiv."},{"key":"ref_21","unstructured":"Yang, L., Zhang, Z., Song, Y., Hong, S., Xu, R., Zhao, Y., Shao, Y., Zhang, W., Cui, B., and Yang, M.H. (2022). Diffusion models: A comprehensive survey of methods and applications. arXiv."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"583","DOI":"10.1038\/s41586-021-03819-2","article-title":"Highly accurate protein structure prediction with AlphaFold","volume":"596","author":"Jumper","year":"2021","journal-title":"Nature"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Atzeni, D., Bacciu, D., Mazzei, D., and Prencipe, G. (2022). A Systematic Review of Wi-Fi and Machine Learning Integration with Topic Modeling Techniques. Sensors, 22.","DOI":"10.3390\/s22134925"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Paolanti, M., Romeo, L., Felicetti, A., Mancini, A., Frontoni, E., and Loncarski, J. (2018, January 2\u20134). Machine learning approach for predictive maintenance in industry 4.0. Proceedings of the 2018 14th IEEE\/ASME International Conference on Mechatronic and Embedded Systems and Applications (MESA), Oulu, Finland.","DOI":"10.1109\/MESA.2018.8449150"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"40","DOI":"10.1016\/j.procs.2021.01.127","article-title":"Machine Learning and statistics: A Study for assessing innovative demand forecasting models","volume":"180","author":"Moroff","year":"2021","journal-title":"Procedia Comput. Sci."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"106749","DOI":"10.1016\/j.cie.2020.106749","article-title":"Deep reinforcement learning based AGVs real-time scheduling with mixed rule for flexible shop floor in industry 4.0","volume":"149","author":"Hu","year":"2020","journal-title":"Comput. Ind. Eng."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"4773","DOI":"10.1080\/00207543.2021.1956675","article-title":"Machine learning in manufacturing and industry 4.0 applications","volume":"59","author":"Rai","year":"2021","journal-title":"Int. J. Prod. Res."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Mazzei, D., and Ramjattan, R. (2022). Machine Learning for Industry 4.0: A Systematic Review Using Deep Learning-Based Topic Modelling. Sensors, 22.","DOI":"10.3390\/s22228641"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Gade, K., Geyik, S.C., Kenthapadi, K., Mithal, V., and Taly, A. (2019, January 4\u20138). Explainable AI in industry. Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Anchorage, AK, USA.","DOI":"10.1145\/3292500.3332281"},{"key":"ref_30","unstructured":"Bengio, Y., Courville, A.C., and Vincent, P. (2012). Unsupervised feature learning and deep learning: A review and new perspectives. arXiv."},{"key":"ref_31","unstructured":"Zhu, X.J. (2005). Semi-Supervised Learning Literature Survey, University of Wisconsin."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"216","DOI":"10.1016\/j.neucom.2019.12.130","article-title":"Graph-based semi-supervised learning: A review","volume":"408","author":"Chong","year":"2020","journal-title":"Neurocomputing"},{"key":"ref_33","unstructured":"Ouali, Y., Hudelot, C., and Tami, M. (2020). An overview of deep semi-supervised learning. arXiv."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Babich, L., Svalov, D., Smirnov, A., and Babich, M. (2019, January 25\u201326). Industrial power consumption forecasting methods comparison. Proceedings of the 2019 Ural Symposium on Biomedical Engineering, Radioelectronics and Information Technology (USBEREIT), Yekaterinburg, Russia.","DOI":"10.1109\/USBEREIT.2019.8736640"},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"203","DOI":"10.1016\/j.susoc.2022.01.008","article-title":"Understanding the adoption of Industry 4.0 technologies in improving environmental sustainability","volume":"3","author":"Javaid","year":"2022","journal-title":"Sustain. Oper. Comput."},{"key":"ref_36","unstructured":"Raschka, S. (2018). Model evaluation, model selection, and algorithm selection in machine learning. arXiv."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Mach. Learn."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","article-title":"Long short-term memory","volume":"9","author":"Hochreiter","year":"1997","journal-title":"Neural Comput."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"102282","DOI":"10.1016\/j.ijinfomgt.2020.102282","article-title":"Forecasting and Anomaly Detection approaches using LSTM and LSTM Autoencoder techniques with the applications in supply chain management","volume":"57","author":"Nguyen","year":"2021","journal-title":"Int. J. Inf. Manag."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"24","DOI":"10.1016\/j.isprsjprs.2016.01.011","article-title":"Random forest in remote sensing: A review of applications and future directions","volume":"114","author":"Belgiu","year":"2016","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"108073","DOI":"10.1016\/j.ijepes.2022.108073","article-title":"Applications of random forest in multivariable response surface for short-term load forecasting","volume":"139","author":"Fan","year":"2022","journal-title":"Int. J. Electr. Power Energy Syst."},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Mohammed, R., Rawashdeh, J., and Abdullah, M. (2020, January 7\u20139). Machine learning with oversampling and undersampling techniques: Overview study and experimental results. Proceedings of the 2020 11th International Conference on Information and Communication Systems (ICICS), Irbid, Jordan.","DOI":"10.1109\/ICICS49469.2020.239556"},{"key":"ref_43","first-page":"577","article-title":"Constrained k-means clustering with background knowledge","volume":"1","author":"Wagstaff","year":"2001","journal-title":"ICML"},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"193","DOI":"10.1007\/BF01908075","article-title":"Comparing partitions","volume":"2","author":"Hubert","year":"1985","journal-title":"J. Classif."},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"228","DOI":"10.1037\/a0027127","article-title":"Model selection and psychological theory: A discussion of the differences between the Akaike information criterion (AIC) and the Bayesian information criterion (BIC)","volume":"17","author":"Vrieze","year":"2012","journal-title":"Psychol. Methods"},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"345","DOI":"10.1007\/BF02294361","article-title":"Model selection and Akaike\u2019s information criterion (AIC): The general theory and its analytical extensions","volume":"52","author":"Bozdogan","year":"1987","journal-title":"Psychometrika"}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/13\/6078\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T20:04:35Z","timestamp":1760126675000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/13\/6078"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,7,1]]},"references-count":46,"journal-issue":{"issue":"13","published-online":{"date-parts":[[2023,7]]}},"alternative-id":["s23136078"],"URL":"https:\/\/doi.org\/10.3390\/s23136078","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,7,1]]}}}