{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,1]],"date-time":"2025-12-01T15:46:00Z","timestamp":1764603960893,"version":"build-2065373602"},"reference-count":32,"publisher":"MDPI AG","issue":"6","license":[{"start":{"date-parts":[[2023,6,17]],"date-time":"2023-06-17T00:00:00Z","timestamp":1686960000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Aw\u00e9 Group"},{"name":"Elev\u00e9o"},{"name":"MDPI Information"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Information"],"abstract":"<jats:p>The huge amount of data produced by the Internet of Things need to be validated and curated to be prepared for the selection of relevant data in order to prototype models, train them, and serve the model. On the other side, blockchains and open data are also important data sources that need to be integrated into the proposed integrative models. It is difficult to find a sufficiently versatile and agnostic architecture based on the main machine learning frameworks that facilitate model development and allow continuous training to continuously improve them from the data streams. The paper describes the conceptualization, implementation, and testing of a new architecture that proposes a use case agnostic processing chain. The proposed architecture is mainly built around the Apache Submarine, an unified Machine Learning platform that facilitates the training and deployment of algorithms. Here, Internet of Things data are collected and formatted at the edge level. They are then processed and validated at the fog level. On the other hand, open data and blockchain data via Blockchain Access Layer are directly processed at the cloud level. Finally, the data are preprocessed to feed scalable machine learning algorithms.<\/jats:p>","DOI":"10.3390\/info14060345","type":"journal-article","created":{"date-parts":[[2023,6,19]],"date-time":"2023-06-19T02:29:19Z","timestamp":1687141759000},"page":"345","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["Towards a Unified Architecture Powering Scalable Learning Models with IoT Data Streams, Blockchain, and Open Data"],"prefix":"10.3390","volume":"14","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-4711-2694","authenticated-orcid":false,"given":"Olivier","family":"Debauche","sequence":"first","affiliation":[{"name":"Elev\u00e9o, R&D Service, Innovation Department, Aw\u00e9 Group, 5590 Ciney, Belgium"},{"name":"Gembloux Agro-Bio Tech, Terra, University of Li\u00e8ge, 5030 Gembloux, Belgium"},{"name":"Faculty of Engineering, ILIA Unit, University of Mons, 7000 Mons, Belgium"},{"name":"Gembloux Agro-Bio Tech, Modeling and Development, University of Li\u00e8ge, 5030 Gembloux, Belgium"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2404-7300","authenticated-orcid":false,"given":"Jean Bertin","family":"Nkamla Penka","sequence":"additional","affiliation":[{"name":"Faculty of Engineering, ILIA Unit, University of Mons, 7000 Mons, Belgium"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2342-495X","authenticated-orcid":false,"given":"Moad","family":"Hani","sequence":"additional","affiliation":[{"name":"Faculty of Engineering, ILIA Unit, University of Mons, 7000 Mons, Belgium"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3226-7001","authenticated-orcid":false,"given":"Adriano","family":"Guttadauria","sequence":"additional","affiliation":[{"name":"Faculty of Engineering, ILIA Unit, University of Mons, 7000 Mons, Belgium"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9582-7988","authenticated-orcid":false,"given":"Rachida","family":"Ait Abdelouahid","sequence":"additional","affiliation":[{"name":"Faculty of Sciences Ben M\u2019sik, Hassan II University\u2014Casablanca, Casablanca P.O. Box 7955, Morocco"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4980-7062","authenticated-orcid":false,"given":"Kaouther","family":"Gasmi","sequence":"additional","affiliation":[{"name":"National Engineering School of Tunis, Tunis El Manar University, 1080 Tunis, Belgium"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1002-9718","authenticated-orcid":false,"given":"Ouafae","family":"Ben Hardouz","sequence":"additional","affiliation":[{"name":"Faculty of Sciences Ben M\u2019sik, Hassan II University\u2014Casablanca, Casablanca P.O. Box 7955, Morocco"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8724-5363","authenticated-orcid":false,"given":"Fr\u00e9d\u00e9ric","family":"Lebeau","sequence":"additional","affiliation":[{"name":"Gembloux Agro-Bio Tech, Terra, University of Li\u00e8ge, 5030 Gembloux, Belgium"},{"name":"Gembloux Agro-Bio Tech, Digital Energy & Agriculture Lab, University of Li\u00e8ge, 5030 Gembloux, Belgium"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6974-4313","authenticated-orcid":false,"given":"J\u00e9r\u00f4me","family":"Bindelle","sequence":"additional","affiliation":[{"name":"Gembloux Agro-Bio Tech, Terra, University of Li\u00e8ge, 5030 Gembloux, Belgium"},{"name":"Gembloux Agro-Bio Tech, Animal Production Engineering and Nutrition, University of Li\u00e8ge, 5030 Gembloux, Belgium"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9883-9047","authenticated-orcid":false,"given":"H\u00e9l\u00e8ne","family":"Soyeurt","sequence":"additional","affiliation":[{"name":"Gembloux Agro-Bio Tech, Terra, University of Li\u00e8ge, 5030 Gembloux, Belgium"},{"name":"Gembloux Agro-Bio Tech, Modeling and Development, University of Li\u00e8ge, 5030 Gembloux, Belgium"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5981-5509","authenticated-orcid":false,"given":"Nicolas","family":"Gengler","sequence":"additional","affiliation":[{"name":"Gembloux Agro-Bio Tech, Terra, University of Li\u00e8ge, 5030 Gembloux, Belgium"},{"name":"Gembloux Agro-Bio Tech, Animal Production and Nutrition Engineering, University of Li\u00e8ge, 5030 Gembloux, Belgium"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3990-3621","authenticated-orcid":false,"given":"Pierre","family":"Manneback","sequence":"additional","affiliation":[{"name":"Faculty of Engineering, ILIA Unit, University of Mons, 7000 Mons, Belgium"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4020-7327","authenticated-orcid":false,"given":"Mohammed","family":"Benjelloun","sequence":"additional","affiliation":[{"name":"Faculty of Engineering, ILIA Unit, University of Mons, 7000 Mons, Belgium"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4602-3868","authenticated-orcid":false,"given":"Carlo","family":"Bertozzi","sequence":"additional","affiliation":[{"name":"Elev\u00e9o, R&D Service, Innovation Department, Aw\u00e9 Group, 5590 Ciney, Belgium"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2023,6,17]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"51","DOI":"10.1016\/j.future.2019.05.056","article-title":"A Relevance-based approach for Big Data Exploration","volume":"101","author":"Bagozi","year":"2019","journal-title":"Future Gener. Comput. Syst."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"491","DOI":"10.1016\/j.ijinfomgt.2019.07.003","article-title":"Autonomic machine learning platform","volume":"49","author":"Lee","year":"2019","journal-title":"Int. J. Inf. Manag."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Baylor, D., Breck, E., Cheng, H.-T., Fiedel, N., Foo, C.Y., Haque, Z., Haykal, S., Ispir, M., Jain, V., and Koc, L. (2017, January 13). Tfx: A tensorflow-based production-scale machine learning platform. Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD\u201917), Halifax, NS, Canada.","DOI":"10.1145\/3097983.3098021"},{"key":"ref_4","first-page":"39","article-title":"Accelerating the machine learning lifecycle with MLflow","volume":"41","author":"Zaharia","year":"2018","journal-title":"IEEE Data Eng. Bull."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"136","DOI":"10.1016\/j.procs.2020.10.021","article-title":"Toward a big data knowledge-base management system for precision livestock farming","volume":"177","author":"Roukh","year":"2020","journal-title":"Procedia Comput. Sci."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"100378","DOI":"10.1016\/j.iot.2021.100378","article-title":"Data management and internet of things: A methodological review in smart farming","volume":"14","author":"Debauche","year":"2021","journal-title":"Internet Things"},{"key":"ref_7","first-page":"7494","article-title":"Cloud and distributed architectures for data management in agriculture 4.0: Review and future trends","volume":"34","author":"Debauche","year":"2022","journal-title":"J. King Saud-Univ.-Comput. Inf. Sci."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"59","DOI":"10.5383\/JUSPN.17.02.002","article-title":"An Optimized Kappa Architecture for IoT Data Management in Smart Farming","volume":"17","author":"Mahmoudi","year":"2022","journal-title":"Int. J. Ubiquitous Syst. Pervasive Netw."},{"key":"ref_9","unstructured":"Chen, T., Li, M., Li, Y., Lin, M., Wang, N., Wang, M., Xiao, T., Xu, B., Zhang, C., and Zhang, Z. (2015). Mxnet: A flexible and efficient machine learning library for heterogeneous distributed systems. arXiv."},{"key":"ref_10","unstructured":"Wallach, H., Larochelle, H., Beygelzimer, A., d\u2019Alch\u00e9-Buc, F., Fox, E., and Garnett, R. (2019). Advances in Neural Information Processing Systems, Curran Associates, Inc."},{"key":"ref_11","unstructured":"Sergeev, A., and Del Balso, M. (2018). Horovod: Fast and easy distributed deep learning in TensorFlow. arXiv."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Chen, K.-H., Su, H.-P., Chuang, W.-C., Hsiao, H.-C., Tan, W., Tang, Z., Liu, X., Liang, Y., Lo, W.-C., and Ji, W. (2022, January 5\u20138). Apache submarine: A unified machine learning platform made simple. Proceedings of the 2nd European Workshop on Machine Learning and Systems (EuroMLSys\u201922), Rennes, France.","DOI":"10.1145\/3517207.3526984"},{"key":"ref_13","unstructured":"Aggarwal, C.C., Philip, S.Y., Han, J., and Wang, J. (2003, January 12\u201313). A framework for clustering evolving data streams. Proceedings of the 29th International Conference on Very Large Databases (VLDB), Berlin, Germany."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Sayed, D., Rady, S., and Aref, M. (2020, January 7\u20139). Enhancing CluStream algorithm for CLUSTERING big data streaming over sliding window. Proceedings of the 12th International Conference on Electrical Engineering (ICEENG), Cairo, Egypt.","DOI":"10.1109\/ICEENG45378.2020.9171705"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Ahsani, S., Sanati, M.Y., and Mansoorizadeh, M. (2021, January 28\u201329). Improvement of CluStream algorithm using sliding window for the clustering of data streams. Proceedings of the 11th International Conference on Computer Engineering and Knowledge (ICCKE), Mashhad, Iran.","DOI":"10.1109\/ICCKE54056.2021.9721505"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"973","DOI":"10.1007\/s11634-018-0316-3","article-title":"Equi-Clustream: A framework for clustering time evolving mixed data","volume":"12","author":"Sangam","year":"2018","journal-title":"Adv. Data Anal. Classif."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Wang, X., and Sun, Q. (2017, January 9\u201310). Research on Clustream Algorithm Based on Spark. Proceedings of the 10th International Symposium on Computational Intelligence and Design (ISCID), Hangzhou, China.","DOI":"10.1109\/ISCID.2017.111"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Kumar, A., Singh, A., and Singh, R. (2017, January 4\u20137). An efficient hybrid-clustream algorithm for stream mining. Proceedings of the 13th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS), Jaipur, India.","DOI":"10.1109\/SITIS.2017.77"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Grua, E.M., Hoogendoorn, M., Malavolta, I., Lago, P., and Eiben, A.E. (2019, January 14\u201317). Clustream-GT: Online clustering for personalization in the health domain. Proceedings of the IEEE\/WIC\/ACM International Conference on Web Intelligence (WI\u201919), Thessaloniki, Greece.","DOI":"10.1145\/3350546.3352529"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"1571","DOI":"10.1109\/JIOT.2017.2712672","article-title":"Predictive Analytics for Complex IoT Data Streams","volume":"4","author":"Akbar","year":"2017","journal-title":"IEEE Internet Things J."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Calabrese, M., Cimmino, M., Fiume, F., Manfrin, M., Romeo, L., Ceccacci, S., Paolanti, M., Toscano, G., Ciandrini, G., and Carrotta, A. (2020). SOPHIA: An Event-Based IoT and Machine Learning Architecture for Predictive Maintenance in Industry 4.0. Information, 11.","DOI":"10.3390\/info11040202"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Machorro-Cano, I., Alor-Hern\u00e1ndez, G., Paredes-Valverde, M.A., Rodr\u00edguez-Mazahua, L., S\u00e1nchez-Cervantes, J.L., and Olmedo-Aguirre, J.O. (2020). HEMS-IoT: A Big Data and Machine Learning-Based Smart Home System for Energy Saving. Energies, 13.","DOI":"10.3390\/en13051097"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Rashid, R.A., Chin, L., Sarijari, M.A., Sudirman, R., and Ide, T. (2019, January 2\u20135). Machine Learning for Smart Energy Monitoring of Home Appliances Using IoT. Proceedings of the 2019 Eleventh International Conference on Ubiquitous and Future Networks (ICUFN), Zagreb, Croatia.","DOI":"10.1109\/ICUFN.2019.8806026"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Elsisi, M., and Tran, M.-Q. (2021). Development of an IoT Architecture Based on a Deep Neural Network against Cyber Attacks for Automated Guided Vehicles. Sensors, 21.","DOI":"10.3390\/s21248467"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"8874988","DOI":"10.1155\/2021\/8874988","article-title":"Smart Nursing Homes: Self-Management Architecture Based on IoT and Machine Learning for Rural Areas","volume":"2021","author":"Rojo","year":"2021","journal-title":"Wirel. Commun. Mob. Comput."},{"key":"ref_26","first-page":"6138637","article-title":"Waste Management System Using IoT-Based Machine Learning in University","volume":"2020","author":"Phuc","year":"2020","journal-title":"Wirel. Commun. Mob. Comput."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"2078","DOI":"10.3390\/info13090423","article-title":"RAMi: A New Real-Time Internet of Medical Things Architecture for Elderly Patient Monitoring","volume":"13","author":"Debauche","year":"2022","journal-title":"Information"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"2901","DOI":"10.14778\/3415478.3415504","article-title":"Apache IoTDB: Time-series database for internet of things","volume":"13","author":"Wang","year":"2020","journal-title":"Proc. VLDB Endow."},{"key":"ref_29","unstructured":"Abadi, M., Agarwal, A., Barham, P., Brevdo, E., Chen, Z., Citro, C., Corrado, G.S., Davis, A., Dean, J., and Devin, M. (2015). TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems. arXiv."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"78","DOI":"10.1016\/j.procs.2020.10.014","article-title":"Big data processing architecture for smart farming","volume":"177","author":"Roukh","year":"2020","journal-title":"Procedia Comput. Sci."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Roukh, A., Fote, F.N., Mahmoudi, S.A., and Mahmoudi, S. (2020, January 7\u20139). Wallesmart: Cloud platform for smart farming. Proceedings of the 32nd International Conference on Scientific and Statistical Database Management, Vienna, Austria.","DOI":"10.1145\/3400903.3401690"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Kurtzer, G.M., Sochat, V., and Bauer, M.W. (2017). Singularity: Scientific containers for mobility of compute. PLoS ONE, 12.","DOI":"10.1371\/journal.pone.0177459"}],"container-title":["Information"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2078-2489\/14\/6\/345\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T19:55:34Z","timestamp":1760126134000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2078-2489\/14\/6\/345"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,6,17]]},"references-count":32,"journal-issue":{"issue":"6","published-online":{"date-parts":[[2023,6]]}},"alternative-id":["info14060345"],"URL":"https:\/\/doi.org\/10.3390\/info14060345","relation":{},"ISSN":["2078-2489"],"issn-type":[{"type":"electronic","value":"2078-2489"}],"subject":[],"published":{"date-parts":[[2023,6,17]]}}}