{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,23]],"date-time":"2026-04-23T10:58:47Z","timestamp":1776941927442,"version":"3.51.4"},"reference-count":42,"publisher":"MDPI AG","issue":"4","license":[{"start":{"date-parts":[[2026,4,18]],"date-time":"2026-04-18T00:00:00Z","timestamp":1776470400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["MAKE"],"abstract":"<jats:p>As modern computers became increasingly more popular and larger amounts of digital data were available, different methodologies were proposed to extract information from data. CRISP-DM methodology quickly spread and is currently one of the most popular approaches used for data analysis. However, it has some shortcomings, such as being too general or business-centered. Different authors have proposed variations more suitable to specific fields in order to overcome those limitations. The present paper reviews CRISP-DM, some variations and similar methodologies, and proposes a Methodology for Industrial Data Analysis (MIDA)\u2014a methodology conceived and improved over time, based on previous experience in industrial engineering processes. MIDA consists of eight steps and partially overlaps with CRISP-DM. It has been successfully applied in several previous projects.<\/jats:p>","DOI":"10.3390\/make8040108","type":"journal-article","created":{"date-parts":[[2026,4,20]],"date-time":"2026-04-20T07:15:08Z","timestamp":1776669308000},"page":"108","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["MIDA\u2014Method for Industrial Data Analysis Based on CRISP-DM"],"prefix":"10.3390","volume":"8","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-4313-7966","authenticated-orcid":false,"given":"Mateus","family":"Mendes","sequence":"first","affiliation":[{"name":"Coimbra Institute of Engineering, Polytechnic University of Coimbra, Rua Pedro Nunes, 3030-199 Coimbra, Portugal"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9694-8079","authenticated-orcid":false,"given":"Torres","family":"Farinha","sequence":"additional","affiliation":[{"name":"RCM2+ Research Centre for Asset Management and Systems Engineering, Rua Pedro Nunes, 3030-199 Coimbra, Portugal"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2026,4,18]]},"reference":[{"key":"ref_1","unstructured":"Chapman, P., Clinton, J., Kerber, R., Khabaza, T., Reinartz, T., Shearer, C., and Wirth, R. (1999). CRISP-DM 1.0, IBM. Available online: https:\/\/public.dhe.ibm.com\/software\/analytics\/spss\/documentation\/modeler\/14.2\/es\/CRISP-DM.pdf."},{"key":"ref_2","first-page":"68","article-title":"Knowledge discovery in real databases: A report on the IJCAI-89 workshop","volume":"11","year":"1990","journal-title":"AI Mag."},{"key":"ref_3","first-page":"37","article-title":"From data mining to knowledge discovery in databases","volume":"17","author":"Fayyad","year":"1996","journal-title":"AI Mag."},{"key":"ref_4","unstructured":"SAS Institute (2003). Data Mining and the Case for Sampling, SAS Institute. Available online: https:\/\/www.scribd.com\/document\/339713338\/SAS-SEMMA."},{"key":"ref_5","unstructured":"Wirth, R., and Hipp, J. (2000, January 11\u201313). CRISP-DM: Towards a standard process model for data mining. Proceedings of the 4th International Conference on the Practical Applications of Knowledge Discovery and Data Mining, Manchester, UK. Available online: http:\/\/www.cs.unibo.it\/~danilo.montesi\/CBD\/Beatriz\/10.1.1.198.5133.pdf."},{"key":"ref_6","first-page":"182","article-title":"KDD, SEMMA and CRISP-DM: A parallel overview","volume":"Volume 8","author":"Azevedo","year":"2008","journal-title":"IADIS European Conf. Data Mining"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Shameti, K., and Cico, B. (2024). Comparison of methodological approaches: CRISP-DM vs. OSEMN methodology using linear regression and statistical analysis. Balkan Conference in Informatics, Springer. Available online: https:\/\/link.springer.com\/chapter\/10.1007\/978-3-031-84093-7_4.","DOI":"10.1007\/978-3-031-84093-7_4"},{"key":"ref_8","first-page":"47","article-title":"OSEMN process for working over data acquired by iot devices mounted in beehives","volume":"7","author":"Dineva","year":"2018","journal-title":"Curr. Trends Nat. Sci."},{"key":"ref_9","first-page":"107","article-title":"OSEMN approach for real time data analysis","volume":"10","author":"Kumari","year":"2020","journal-title":"Int. J. Eng. Manag. Res."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"3048","DOI":"10.1109\/TKDE.2019.2962680","article-title":"CRISP-DM twenty years later: From data mining processes to data science trajectories","volume":"33","author":"Ferri","year":"2021","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"526","DOI":"10.1016\/j.procs.2021.01.199","article-title":"A systematic literature review on applying CRISP-DM process model","volume":"181","author":"Kruse","year":"2021","journal-title":"Procedia Comput. Sci."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"28","DOI":"10.5753\/reviews.2024.3757","article-title":"The evolution of CRISP-DM for data science: Methods, processes and frameworks","volume":"4","author":"Shimaoka","year":"2024","journal-title":"SBC Comput. Rev."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Zavaleta-S\u00e1nchez, E., Dom\u00ednguez-S\u00e1nchez, G., Irene Loeza-Mej\u00eda, C., and S\u00e1nchez-DelaCruz, E. (2024). Comparative study of KDD and CRISP-DM methodologies for phishing identification. International Congress on Information and Communication Technology, Springer. Available online: https:\/\/link.springer.com\/chapter\/10.1007\/978-981-97-3559-4_25.","DOI":"10.1007\/978-981-97-3559-4_25"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"598","DOI":"10.25046\/aj020376","article-title":"A comparative between CRISP-DM and SEMMA through the construction of a MODIS repository for studies of land use and cover change","volume":"2","author":"Pantoja","year":"2017","journal-title":"Adv. Sci. Technol. Eng. Syst. J."},{"key":"ref_15","unstructured":"Rosander, A. (2018). Evaluating Frameworks for Implementing Machine Learning in Signal Processing. [Master\u2019s Thesis, Kth Skolan f\u00f6r Electroteknik Och Datavetenskap]. Available online: https:\/\/www.diva-portal.org\/smash\/get\/diva2:1250897\/FULLTEXT01.pdf."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1007\/s11740-024-01298-8","article-title":"Implementation of an intelligent process monitoring system for screw presses using the CRISP-DM standard","volume":"19","author":"Doede","year":"2025","journal-title":"Prod. Eng."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"612","DOI":"10.1111\/dsji.12222","article-title":"Applying the CRISP-DM framework for teaching business analytics","volume":"18","author":"Jaggia","year":"2020","journal-title":"Decis. Sci. J. Innov. Educ."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"512","DOI":"10.1016\/j.procs.2021.12.278","article-title":"Predictive models assessment based on CRISP-DM methodology for students performance in colombia\u2014Saber 11 test","volume":"198","author":"Solano","year":"2022","journal-title":"Procedia Comput. Sci."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Purbasari, A., Rinawan, F.R., Zulianto, A., Susanti, A.I., and Komara, H. (2021, January 29\u201330). CRISP-DM for data quality improvement to support machine learning of stunting prediction in infants and toddlers. Proceedings of the 2021 8th International Conference on Advanced Informatics: Concepts, Theory and Applications (ICAICTA), Bandung, Indonesia. Available online: https:\/\/www.researchgate.net\/profile\/Ayi-Purbasari\/publication\/357107554_CRISP-DM_for_Data_Quality_Improvement_to_Support_Machine_Learning_of_Stunting_Prediction_in_Infants_and_Toddlers\/links\/64df7dac1351f5785b7305fc\/CRISP-DM-for-Data-Quality-Improvement-to-Support-Machine-Learning-of-Stunting-Prediction-in-Infants-and-Toddlers.pdf.","DOI":"10.1109\/ICAICTA53211.2021.9640294"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Swaroop, A., Polkowski, Z., Correia, S.D., and Virdee, B. (2023). Crop prediction using machine learning with CRISP-DM approach. Proceedings of Data Analytics and Management, Springer Nature. Available online: https:\/\/link.springer.com\/chapter\/10.1007\/978-981-99-6550-2_31.","DOI":"10.1007\/978-981-99-6553-3"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Bosnjak, Z., Grljevic, O., and Bosnjak, S. (2009). CRISP-DM as a framework for discovering knowledge in small and medium sized enterprises\u2019 data. Proceedings of the 2009 5th International Symposium on Applied Computational Intelligence and Informatics, IEEE. Available online: https:\/\/ieeexplore.ieee.org\/abstract\/document\/5136302.","DOI":"10.1109\/SACI.2009.5136302"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"2705","DOI":"10.1017\/pds.2023.271","article-title":"Modelling the design of models: An example using CRISP-DM","volume":"3","author":"Kannengiesser","year":"2023","journal-title":"Proc. Des. Soc."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Saltz, J.S. (2021). CRISP-DM for data science: Strengths, weaknesses and potential next steps. Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), IEEE. Available online: https:\/\/ieeexplore.ieee.org\/abstract\/document\/9671634.","DOI":"10.1109\/BigData52589.2021.9671634"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"102013","DOI":"10.1016\/j.datak.2022.102013","article-title":"Applying the CRISP-DM data mining process in the financial services industry: Elicitation of adaptation requirements","volume":"139","author":"Plotnikova","year":"2022","journal-title":"Data Knowl. Eng."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"403","DOI":"10.1016\/j.procir.2019.02.106","article-title":"DMME: Data mining methodology for engineering applications\u2014A holistic extension to the CRISP-DM model","volume":"79","author":"Huber","year":"2019","journal-title":"Procedia Cirp"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"99","DOI":"10.70594\/brain\/11.2Sup1\/97","article-title":"Adapting CRISP-DM for social sciences","volume":"11","author":"Cazacu","year":"2021","journal-title":"BRAIN Broad Res. Artif. Intell. Neurosci."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Venter, J., de Waal, A., and Willers, C. (2007). Specializing CRISP-DM for evidence mining. IFIP International Conference on Digital Forensics, Springer. Available online: https:\/\/link.springer.com\/chapter\/10.1007\/978-0-387-73742-3_21.","DOI":"10.1007\/978-0-387-73742-3_21"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Nagashima, H., and Kato, Y. (2019). APREP-DM: A framework for automating the pre-processing of a sensor data analysis based on CRISP-DM. Proceedings of the 2019 IEEE International Conference on Pervasive Computing and Communications Workshops (PerCom Workshops), IEEE. Available online: https:\/\/ieeexplore.ieee.org\/abstract\/document\/8730785.","DOI":"10.1109\/PERCOMW.2019.8730785"},{"key":"ref_29","first-page":"20","article-title":"Adapting CRISP-DM for idea mining: A data mining process for generating ideas using a textual dataset","volume":"11","author":"Ayele","year":"2020","journal-title":"Int. J. Adv. Comput. Sci. Appl."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"2234","DOI":"10.1080\/09537287.2023.2234882","article-title":"Realising the promises of artificial intelligence in manufacturing by enhancing CRISP-DM","volume":"35","author":"Bokrantz","year":"2024","journal-title":"Prod. Plan. Control"},{"key":"ref_31","unstructured":"Pappas, I.O., Mikalef, P., Dwivedi, Y.K., Jaccheri, L., Krogstie, J., and Mymki, M. (2019). Exploring the relationship between data science and circular economy: An enhanced CRISP-DM process model. Digital Transformation for a Sustainable Society in the 21st Century, Springer International Publishing. Available online: https:\/\/link.springer.com\/chapter\/10.1007\/978-3-030-29374-1_15."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Acu\u00f1a-Cid, H.A., Ahumada-Tello, E., Ovalle-Osuna, \u00d3.O., Evans, R., Hern\u00e1ndez-R\u00edos, J.E., and Zambrano-Soto, M.A. (2025). CRISP-NET: Integration of the CRISP-DM model with network analysis. Mach. Learn. Knowl. Extr., 7.","DOI":"10.3390\/make7030101"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Catley, C., Smith, K., McGregor, C., and Tracy, M. (2009). Extending CRISP-DM to incorporate temporal data mining of multidimensional medical data streams: A neonatal intensive care unit case study. Proceedings of the 2009 22nd IEEE International Symposium on Computer-Based Medical Systems, IEEE. Available online: https:\/\/ieeexplore.ieee.org\/abstract\/document\/5255394.","DOI":"10.1109\/CBMS.2009.5255394"},{"key":"ref_34","unstructured":"Asamoah, D., and Sharda, R. (2026, January 31). Adapting CRISP-DM Process for Social Network Analytics: Application to Healthcare. Available online: https:\/\/aisel.aisnet.org\/amcis2015\/BizAnalytics\/GeneralPresentations\/33\/."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Ramos, J.L.C., Rodrigues, R.L., Silva, J.S., and de Oliveira, P.L.S. (2020). CRISP-EDM: Uma proposta de adapta\u00e7\u00e3o do modelo CRISP-DM para minera\u00e7\u00e3o de dados educacionais. Simp\u00f3sio Brasileiro de Inform\u00e1tica na Educa\u00e7\u00e3o (SBIE), SBC. Available online: https:\/\/sol.sbc.org.br\/index.php\/sbie\/article\/view\/12865\/12719.","DOI":"10.5753\/cbie.sbie.2020.1092"},{"key":"ref_36","first-page":"92","article-title":"CRISP data mining methodology extension for medical domain","volume":"3","author":"Niaksu","year":"2015","journal-title":"Balt. J. Mod. Comput."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Sch\u00e4fer, F., Zeiselmair, C., Becker, J., and Otten, H. (2018). Synthesizing CRISP-DM and quality management: A data mining approach for production processes. Proceedings of the 2018 IEEE International Conference on Technology Management, Operations and Decisions (ICTMOD), IEEE. Available online: https:\/\/ieeexplore.ieee.org\/abstract\/document\/8691266.","DOI":"10.1109\/ITMC.2018.8691266"},{"key":"ref_38","unstructured":"Jajal, P., Jiang, W., Tewari, A., Kocinare, E., Woo, J., Sarraf, A., Lu, Y., Thiruvathukal, G.K., and Davis, J.C. (2023). Analysis of failures and risks in deep learning model converters: A case study in the onnx ecosystem. arXiv."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"440","DOI":"10.17531\/ein.2020.3.6","article-title":"Predicting motor oil condition using artificial neural networks and principal component analysis","volume":"22","author":"Rodrigues","year":"2020","journal-title":"Eksploat. Niezawodn."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Cruz, S., Paulino, A., Duraes, J., and Mendes, M. (2021). Real-time quality control of heat sealed bottles using thermal images and artificial neural network. J. Imaging, 7.","DOI":"10.3390\/jimaging7020024"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Costa, J., Silva, R., Martins, G., Barreiros, J., and Mendes, M. (2025). Analysis of the state and fault detection of a plastic injection machine\u2014A machine learning-based approach. Algorithms, 18.","DOI":"10.3390\/a18080521"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Silva, H., Duque, V., Macedo, M., and Mendes, M. (2024). Aiding ICD-10 encoding of clinical health records using improved text cosine similarity and PLM-ICD. Algorithms, 17.","DOI":"10.3390\/a17040144"}],"container-title":["Machine Learning and Knowledge Extraction"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2504-4990\/8\/4\/108\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,23]],"date-time":"2026-04-23T10:33:51Z","timestamp":1776940431000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2504-4990\/8\/4\/108"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,4,18]]},"references-count":42,"journal-issue":{"issue":"4","published-online":{"date-parts":[[2026,4]]}},"alternative-id":["make8040108"],"URL":"https:\/\/doi.org\/10.3390\/make8040108","relation":{},"ISSN":["2504-4990"],"issn-type":[{"value":"2504-4990","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,4,18]]}}}