{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,19]],"date-time":"2025-10-19T16:40:51Z","timestamp":1760892051157,"version":"build-2065373602"},"publisher-location":"Cham","reference-count":21,"publisher":"Springer Nature Switzerland","isbn-type":[{"type":"print","value":"9783031822247"},{"type":"electronic","value":"9783031822254"}],"license":[{"start":{"date-parts":[[2025,1,1]],"date-time":"2025-01-01T00:00:00Z","timestamp":1735689600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,3,28]],"date-time":"2025-03-28T00:00:00Z","timestamp":1743120000000},"content-version":"vor","delay-in-days":86,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>While event log data quality is recognized as a crucial concern in process mining, the impact of event log errors on different types of process mining tasks has remained largely unexplored. This paper aims to fill such a gap by analyzing how various errors affect analysis results. In particular, we aim to assess whether and to what extent different types of errors that impact the quality of activity labels affect the performance of predictive process monitoring models, considering the three main tasks of next activity, outcome, and remaining time prediction, using publicly available and simulated event logs. The results of the experiments are used to extract preliminary insights into the design of data preparation pipelines for predictive process monitoring.<\/jats:p>","DOI":"10.1007\/978-3-031-82225-4_15","type":"book-chapter","created":{"date-parts":[[2025,3,30]],"date-time":"2025-03-30T03:02:32Z","timestamp":1743303752000},"page":"201-213","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["On the\u00a0Impact of\u00a0Low-Quality Activity Labels in\u00a0Predictive Process Monitoring"],"prefix":"10.1007","author":[{"given":"Marco","family":"Comuzzi","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sungkyu","family":"Kim","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jonghyeon","family":"Ko","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Musa","family":"Salamov","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Cinzia","family":"Cappiello","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Barbara","family":"Pernici","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2025,3,28]]},"reference":[{"key":"15_CR1","unstructured":"Berti-\u00c9quille, L.: Active reinforcement learning for data preparation: Learn2Clean with Human-In-The-Loop. In: CIDR 2020 Proceedings (2020). www.cidrdb.org"},{"key":"15_CR2","doi-asserted-by":"crossref","unstructured":"Bose, J.C., Mans, R.S., van der Aalst, W.M.P.: Wanna improve process mining results? It\u2019s high time we consider data quality issues seriously. In: IEEE Symposium on Computational Intelligence and Data Mining, pp. 127\u2013134 (2013)","DOI":"10.1109\/CIDM.2013.6597227"},{"key":"15_CR3","doi-asserted-by":"publisher","DOI":"10.1016\/j.is.2021.101874","volume":"103","author":"C Cappiello","year":"2022","unstructured":"Cappiello, C., Comuzzi, M., Plebani, P., Fim, M.: Assessing and improving measurability of process performance indicators based on quality of logs. Inf. Syst. 103, 101874 (2022)","journal-title":"Inf. Syst."},{"key":"15_CR4","doi-asserted-by":"crossref","unstructured":"Di\u00a0Francescomarino, C., Ghidini, C.: Predictive process monitoring. In: Process Mining Handbook, pp. 320\u2013346. Springer, Cham (2022)","DOI":"10.1007\/978-3-031-08848-3_10"},{"key":"15_CR5","doi-asserted-by":"publisher","DOI":"10.3389\/fdata.2022.850611","volume":"5","author":"L Ehrlinger","year":"2022","unstructured":"Ehrlinger, L., W\u00f6\u00df, W.: A survey of data quality measurement and monitoring tools. Front. Big Data 5, 850611 (2022)","journal-title":"Front. Big Data"},{"key":"15_CR6","doi-asserted-by":"crossref","unstructured":"Foroni, D., Lissandrini, M., Velegrakis, Y.: Estimating the extent of the effects of data quality through observations. In: ICDE 2021, pp. 1913\u20131918. IEEE (2021)","DOI":"10.1109\/ICDE51399.2021.00176"},{"issue":"10","key":"15_CR7","doi-asserted-by":"publisher","first-page":"3614","DOI":"10.1109\/TPAMI.2020.2981604","volume":"43","author":"C Geng","year":"2020","unstructured":"Geng, C., Huang, S.J., Chen, S.: Recent advances in open set recognition: a survey. IEEE Trans. Pattern Anal. Mach. Intell. 43(10), 3614\u20133631 (2020)","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"issue":"3","key":"15_CR8","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3506712","volume":"14","author":"IF Ilyas","year":"2022","unstructured":"Ilyas, I.F., Rekatsinas, T.: Machine learning and data cleaning: which serves the other? ACM J. Data Inf. Qual. (JDIQ) 14(3), 1\u201311 (2022)","journal-title":"ACM J. Data Inf. Qual. (JDIQ)"},{"key":"15_CR9","doi-asserted-by":"publisher","DOI":"10.1016\/j.dss.2021.113669","volume":"153","author":"J Kim","year":"2022","unstructured":"Kim, J., Comuzzi, M., Dumas, M., Maggi, F.M., Teinemaa, I.: Encoding resource experience for predictive process monitoring. Decis. Support Syst. 153, 113669 (2022)","journal-title":"Decis. Support Syst."},{"key":"15_CR10","doi-asserted-by":"publisher","DOI":"10.1016\/j.is.2021.101894","volume":"104","author":"J Ko","year":"2022","unstructured":"Ko, J., Comuzzi, M.: Keeping our rivers clean: information-theoretic online anomaly detection for streaming business process events. Inf. Syst. 104, 101894 (2022)","journal-title":"Inf. Syst."},{"key":"15_CR11","unstructured":"Mahdavi, M., Abedjan, Z.: Semi-supervised data cleaning with Raha and Baran. In: 11th Conference on Innovative Data Systems Research, CIDR 2021 (2021). www.cidrdb.org"},{"key":"15_CR12","doi-asserted-by":"crossref","unstructured":"Navarin, N., Vincenzi, B., Polato, M., Sperduti, A.: LSTM networks for data-aware remaining time prediction of business process instances. In: 2017 IEEE Symposium Series on Computational Intelligence (SSCI), pp.\u00a01\u20137. IEEE (2017)","DOI":"10.1109\/SSCI.2017.8285184"},{"issue":"1","key":"15_CR13","doi-asserted-by":"publisher","first-page":"59","DOI":"10.1108\/IJPSM-02-2023-0058","volume":"37","author":"B Pernici","year":"2024","unstructured":"Pernici, B., Bono, C.A., Piro, L., Del Treste, M., Vecchi, G.: Improving the analysis of the judiciary performance - the use of data mining techniques to assess the timeliness of civil trials. Int. J. Public Sect. Manag. 37(1), 59\u201376 (2024)","journal-title":"Int. J. Public Sect. Manag."},{"key":"15_CR14","doi-asserted-by":"crossref","unstructured":"Sadeghianasl, S., ter Hofstede, A.H., Wynn, M.T., Suriadi, S.: A contextual approach to detecting synonymous and polluted activity labels in process event logs. In: Proc. Intl. Conf. Cooperative Information Systems. pp. 76\u201394 (2019)","DOI":"10.1007\/978-3-030-33246-4_5"},{"key":"15_CR15","unstructured":"Sancricca, C., Cappiello, C.: Supporting the design of data preparation pipelines. In: Proc. SEBD 2022. CEUR Workshop Proceedings, vol.\u00a03194, pp. 149\u2013158. CEUR-WS.org (2022)"},{"key":"15_CR16","doi-asserted-by":"publisher","DOI":"10.1016\/j.is.2023.102246","volume":"118","author":"SJ Schmid","year":"2023","unstructured":"Schmid, S.J., Moder, L., Hofmann, P., R\u00f6glinger, M.: Everything at the proper time: Repairing identical timestamp errors in event logs with generative adversarial networks. Inf. Syst. 118, 102246 (2023)","journal-title":"Inf. Syst."},{"key":"15_CR17","doi-asserted-by":"publisher","first-page":"132","DOI":"10.1016\/j.is.2016.07.011","volume":"64","author":"S Suriadi","year":"2017","unstructured":"Suriadi, S., Andrews, R., ter Hofstede, A.H., Wynn, M.T.: Event log imperfection patterns for process mining: towards a systematic approach to cleaning event logs. Inf. Syst. 64, 132\u2013150 (2017)","journal-title":"Inf. Syst."},{"issue":"2","key":"15_CR18","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3301300","volume":"13","author":"I Teinemaa","year":"2019","unstructured":"Teinemaa, I., Dumas, M., Rosa, M.L., Maggi, F.M.: Outcome-oriented predictive process monitoring: review and benchmark. ACM Trans. Knowl. Discov. Data (TKDD) 13(2), 1\u201357 (2019)","journal-title":"ACM Trans. Knowl. Discov. Data (TKDD)"},{"key":"15_CR19","unstructured":"van Der Aalst, W., et\u00a0al.: Process mining manifesto. In: Business Process Management Workshops: BPM 2011 Workshops, pp. 169\u2013194. Springer (2012)"},{"key":"15_CR20","doi-asserted-by":"crossref","unstructured":"Van\u00a0Eck, M.L., Lu, X., Leemans, S.J., van der Aalst, W.M.: $${PM}^2$$: a process mining project methodology. In: Proceedings of CAiSE, pp. 297\u2013313. Springer (2015)","DOI":"10.1007\/978-3-319-19069-3_19"},{"key":"15_CR21","doi-asserted-by":"crossref","unstructured":"Weytjens, H., De\u00a0Weerdt, J.: Creating unbiased public benchmark datasets with data leakage prevention for predictive process monitoring. In: International Conference on Business Process Management, pp. 18\u201329. Springer (2021)","DOI":"10.1007\/978-3-030-94343-1_2"}],"container-title":["Lecture Notes in Business Information Processing","Process Mining Workshops"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/978-3-031-82225-4_15","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,19]],"date-time":"2025-10-19T16:05:48Z","timestamp":1760889948000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/978-3-031-82225-4_15"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025]]},"ISBN":["9783031822247","9783031822254"],"references-count":21,"URL":"https:\/\/doi.org\/10.1007\/978-3-031-82225-4_15","relation":{},"ISSN":["1865-1348","1865-1356"],"issn-type":[{"type":"print","value":"1865-1348"},{"type":"electronic","value":"1865-1356"}],"subject":[],"published":{"date-parts":[[2025]]},"assertion":[{"value":"28 March 2025","order":1,"name":"first_online","label":"First Online","group":{"name":"ChapterHistory","label":"Chapter History"}},{"value":"ICPM","order":1,"name":"conference_acronym","label":"Conference Acronym","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"International Conference on Process Mining","order":2,"name":"conference_name","label":"Conference Name","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"Lyngby","order":3,"name":"conference_city","label":"Conference City","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"Denmark","order":4,"name":"conference_country","label":"Conference Country","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"2024","order":5,"name":"conference_year","label":"Conference Year","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"14 October 2024","order":7,"name":"conference_start_date","label":"Conference Start Date","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"18 October 2024","order":8,"name":"conference_end_date","label":"Conference End Date","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"6","order":9,"name":"conference_number","label":"Conference Number","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"icpm2024","order":10,"name":"conference_id","label":"Conference ID","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"https:\/\/icpmconference.org\/2024\/","order":11,"name":"conference_url","label":"Conference URL","group":{"name":"ConferenceInfo","label":"Conference Information"}}]}}