{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,21]],"date-time":"2026-01-21T04:14:54Z","timestamp":1768968894285,"version":"3.49.0"},"publisher-location":"Cham","reference-count":34,"publisher":"Springer International Publishing","isbn-type":[{"value":"9783030985806","type":"print"},{"value":"9783030985813","type":"electronic"}],"license":[{"start":{"date-parts":[[2022,1,1]],"date-time":"2022-01-01T00:00:00Z","timestamp":1640995200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,3,24]],"date-time":"2022-03-24T00:00:00Z","timestamp":1648080000000},"content-version":"vor","delay-in-days":82,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Predictive process monitoring is a subfield of process mining that aims to estimate case or event features for running process instances. Such predictions are of significant interest to the process stakeholders. However, state-of-the-art methods for predictive monitoring require the training of complex machine learning models, which is often inefficient. This paper proposes an instance selection procedure that allows sampling training process instances for prediction models. We show that our sampling method allows for a significant increase of training speed for next activity prediction methods while maintaining reliable levels of prediction accuracy.\n<\/jats:p>","DOI":"10.1007\/978-3-030-98581-3_12","type":"book-chapter","created":{"date-parts":[[2022,3,23]],"date-time":"2022-03-23T18:03:23Z","timestamp":1648058603000},"page":"154-166","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":8,"title":["Event Log Sampling for\u00a0Predictive Monitoring"],"prefix":"10.1007","author":[{"given":"Mohammadreza","family":"Fani Sani","sequence":"first","affiliation":[]},{"given":"Mozhgan","family":"Vazifehdoostirani","sequence":"additional","affiliation":[]},{"given":"Gyunam","family":"Park","sequence":"additional","affiliation":[]},{"given":"Marco","family":"Pegoraro","sequence":"additional","affiliation":[]},{"given":"Sebastiaan J.","family":"van Zelst","sequence":"additional","affiliation":[]},{"given":"Wil M. P.","family":"van der Aalst","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2022,3,24]]},"reference":[{"issue":"2","key":"12_CR1","doi-asserted-by":"publisher","first-page":"450","DOI":"10.1016\/j.is.2010.09.001","volume":"36","author":"WMP van der Aalst","year":"2011","unstructured":"van der Aalst, W.M.P., Schonenberg, M., Song, M.: Time prediction based on process mining. Inf. Syst. 36(2), 450\u2013475 (2011). https:\/\/doi.org\/10.1016\/j.is.2010.09.001","journal-title":"Inf. Syst."},{"key":"12_CR2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-662-49851-4","volume-title":"Process Mining - Data Science in Action","author":"WMP van der Aalst","year":"2016","unstructured":"van der Aalst, W.M.P.: Process Mining - Data Science in Action, 2nd edn. Springer, Verlag (2016)","edition":"2"},{"key":"12_CR3","unstructured":"Bergstra, J., Bardenet, R., Bengio, Y., K\u00e9gl, B.: Algorithms for hyper-parameter optimization. In: Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011, Proceedings of a Meeting Held 12\u201314 December 2011, Granada, Spain. pp. 2546\u20132554 (2011)"},{"issue":"2","key":"12_CR4","first-page":"123","volume":"24","author":"L Breiman","year":"1996","unstructured":"Breiman, L.: Bagging predictors. Mach. Learn. 24(2), 123\u2013140 (1996)","journal-title":"Mach. Learn."},{"key":"12_CR5","series-title":"Lecture Notes in Computer Science","doi-asserted-by":"publisher","first-page":"90","DOI":"10.1007\/978-3-642-34156-4_10","volume-title":"Advances in Intelligent Data Analysis XI","author":"J Carmona","year":"2012","unstructured":"Carmona, J., Gavald\u00e0, R.: Online techniques for dealing with concept drift in process mining. In: Hollm\u00e9n, J., Klawonn, F., Tucker, A. (eds.) IDA 2012. LNCS, vol. 7619, pp. 90\u2013102. Springer, Heidelberg (2012). https:\/\/doi.org\/10.1007\/978-3-642-34156-4_10"},{"key":"12_CR6","doi-asserted-by":"crossref","unstructured":"Chen, T., Guestrin, C.: Xgboost: A scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, August 13\u201317, 2016, pp. 785\u2013794. ACM (2016)","DOI":"10.1145\/2939672.2939785"},{"key":"12_CR7","unstructured":"De Leoni, M., Mannhardt, F.: Road traffic fine management process. Eindhoven University of Technology, Dataset (2015)"},{"key":"12_CR8","doi-asserted-by":"publisher","first-page":"129","DOI":"10.1016\/j.dss.2017.04.003","volume":"100","author":"J Evermann","year":"2017","unstructured":"Evermann, J., Rehse, J., Fettke, P.: Predicting process behaviour using deep learning. Decis. Support Syst. 100, 129\u2013140 (2017)","journal-title":"Decis. Support Syst."},{"key":"12_CR9","series-title":"Lecture Notes in Computer Science","doi-asserted-by":"publisher","first-page":"234","DOI":"10.1007\/978-3-030-49435-3_15","volume-title":"Advanced Information Systems Engineering","author":"M Fani\u00a0Sani","year":"2020","unstructured":"Fani\u00a0Sani, M., van Zelst, S.J., van\u00a0der Aalst, W.M.P.: Conformance checking approximation using subset selection and edit distance. In: Dustdar, S., Yu, E., Salinesi, C., Rieu, D., Pant, V. (eds.) CAiSE 2020. LNCS, vol. 12127, pp. 234\u2013251. Springer, Cham (2020). https:\/\/doi.org\/10.1007\/978-3-030-49435-3_15"},{"issue":"6","key":"12_CR10","doi-asserted-by":"publisher","first-page":"1085","DOI":"10.1007\/s00607-021-00910-4","volume":"103","author":"M Fani\u00a0Sani","year":"2021","unstructured":"Fani\u00a0Sani, M., van\u00a0Zelst, S.J., van\u00a0der\u00a0Aalst, W.M.P.: The impact of biased sampling of event logs on the performance of process discovery. Computing 103(6), 1085\u20131104 (2021). https:\/\/doi.org\/10.1007\/s00607-021-00910-4","journal-title":"Computing"},{"key":"12_CR11","doi-asserted-by":"crossref","unstructured":"Garca, S., Luengo, J., Herrera, F.: Data Preprocessing in Data Mining. Springer, Cham (2014)","DOI":"10.1007\/978-3-319-10247-4"},{"key":"12_CR12","doi-asserted-by":"publisher","unstructured":"Breuker, D., Matzner, M., Delfmann, P., Becker, J.: Comprehensible predictive models for business processes. Mis Q. 40(4), 1009\u20131034. https:\/\/doi.org\/10.25300\/MISQ\/2016\/40.4.10","DOI":"10.25300\/MISQ\/2016\/40.4.10"},{"key":"12_CR13","unstructured":"Huang, Z., Xu, W., Yu, K.: Bidirectional LSTM-CRF models for sequence tagging. CoRR arXiv:1508.01991 (2015)"},{"key":"12_CR14","doi-asserted-by":"publisher","unstructured":"de Leoni, M., van der Aalst, W.M.P., Dees, M.: A general process mining framework for correlating, predicting and clustering dynamic behavior based on event logs. Inf. Syst. 56, 235\u2013257 (2016). https:\/\/doi.org\/10.1016\/j.is.2015.07.003","DOI":"10.1016\/j.is.2015.07.003"},{"issue":"6","key":"12_CR15","doi-asserted-by":"publisher","first-page":"962","DOI":"10.1109\/TSC.2017.2772256","volume":"11","author":"AE Marquez-Chamorro","year":"2017","unstructured":"Marquez-Chamorro, A.E., Resinas, M., Ruiz-Cortes, A.: Predictive monitoring of business processes: a survey. IEEE Trans. Services Comput. 11(6), 962\u2013977 (2017). https:\/\/doi.org\/10.1109\/TSC.2017.2772256","journal-title":"IEEE Trans. Services Comput."},{"key":"12_CR16","doi-asserted-by":"crossref","unstructured":"Navarin, N., Vincenzi, B., Polato, M., Sperduti, A.: LSTM networks for data-aware remaining time prediction of business process instances. In: 2017 IEEE Symposium Series on Computational Intelligence, SSCI 2017, Honolulu, HI, USA, November 27 - December 1, 2017, pp. 1\u20137. IEEE (2017)","DOI":"10.1109\/SSCI.2017.8285184"},{"key":"12_CR17","series-title":"Lecture Notes in Business Information Processing","doi-asserted-by":"publisher","first-page":"112","DOI":"10.1007\/978-3-030-72693-5_9","volume-title":"Process Mining Workshops","author":"A Nguyen","year":"2021","unstructured":"Nguyen, A., Chatterjee, S., Weinzierl, S., Schwinn, L., Matzner, M., Eskofier, B.: Time matters: time-aware LSTMs for predictive business process monitoring. In: Leemans, S., Leopold, H. (eds.) ICPM 2020. LNBIP, vol. 406, pp. 112\u2013123. Springer, Cham (2021). https:\/\/doi.org\/10.1007\/978-3-030-72693-5_9"},{"key":"12_CR18","doi-asserted-by":"crossref","unstructured":"Park, G., Song, M.: Predicting performances in business processes using deep neural networks. Decis. Support Syst. 129, 113191 (2020)","DOI":"10.1016\/j.dss.2019.113191"},{"key":"12_CR19","doi-asserted-by":"crossref","unstructured":"Pegoraro, M., Uysal, M.S., Georgi, D.B., van der Aalst, W.M.P.: Text-aware predictive monitoring of business processes. In: Abramowicz, W., Auer, S., Lewanska, E. (eds.) 24th International Conference on Business Information Systems, BIS 2021, Hannover, Germany, June 15\u201317, 2021. pp. 221\u2013232 (2021)","DOI":"10.52825\/bis.v1i.62"},{"issue":"9","key":"12_CR20","doi-asserted-by":"publisher","first-page":"1005","DOI":"10.1007\/s00607-018-0593-x","volume":"100","author":"M Polato","year":"2018","unstructured":"Polato, M., Sperduti, A., Burattin, A., Leoni, M.: Time and activity sequence prediction of business process instances. Computing 100(9), 1005\u20131031 (2018). https:\/\/doi.org\/10.1007\/s00607-018-0593-x","journal-title":"Computing"},{"key":"12_CR21","doi-asserted-by":"crossref","unstructured":"Pourghassemi, B., Zhang, C., Lee, J.H., Chandramowlishwaran, A.: On the limits of parallelizing convolutional neural networks on GPUS. In: SPAA 2020: 32nd ACM Symposium on Parallelism in Algorithms and Architectures, Virtual Event, USA, July 15\u201317, 2020. pp. 567\u2013569. ACM (2020)","DOI":"10.1145\/3350755.3400266"},{"key":"12_CR22","unstructured":"Powers, D.M.W.: Evaluation: from precision, recall and f-measure to roc, informedness, markedness and correlation. CoRR arXiv:2010.16061 (2020)"},{"key":"12_CR23","series-title":"Lecture Notes in Business Information Processing","doi-asserted-by":"publisher","first-page":"155","DOI":"10.1007\/978-3-030-66498-5_12","volume-title":"Business Process Management Workshops","author":"MS Qafari","year":"2020","unstructured":"Qafari, M.S., van der Aalst, W.: Root cause analysis in process mining using structural equation models. In: Del R\u00edo Ortega, A., Leopold, H., Santoro, F.M. (eds.) BPM 2020. LNBIP, vol. 397, pp. 155\u2013167. Springer, Cham (2020). https:\/\/doi.org\/10.1007\/978-3-030-66498-5_12"},{"key":"12_CR24","series-title":"Lecture Notes in Computer Science","doi-asserted-by":"publisher","first-page":"389","DOI":"10.1007\/978-3-642-45005-1_27","volume-title":"Service-Oriented Computing","author":"A Rogge-Solti","year":"2013","unstructured":"Rogge-Solti, A., Weske, M.: Prediction of remaining service execution time using stochastic petri nets with arbitrary firing delays. In: Basu, S., Pautasso, C., Zhang, L., Fu, X. (eds.) ICSOC 2013. LNCS, vol. 8274, pp. 389\u2013403. Springer, Heidelberg (2013). https:\/\/doi.org\/10.1007\/978-3-642-45005-1_27"},{"key":"12_CR25","series-title":"Lecture Notes in Computer Science","doi-asserted-by":"publisher","first-page":"306","DOI":"10.1007\/978-3-319-65000-5_18","volume-title":"Business Process Management","author":"A Senderovich","year":"2017","unstructured":"Senderovich, A., Di Francescomarino, C., Ghidini, C., Jorbina, K., Maggi, F.M.: Intra and inter-case features in predictive process monitoring: a tale of two dimensions. In: Carmona, J., Engels, G., Kumar, A. (eds.) BPM 2017. LNCS, vol. 10445, pp. 306\u2013323. Springer, Cham (2017). https:\/\/doi.org\/10.1007\/978-3-319-65000-5_18"},{"key":"12_CR26","series-title":"Lecture Notes in Computer Science","doi-asserted-by":"publisher","first-page":"477","DOI":"10.1007\/978-3-319-59536-8_30","volume-title":"Advanced Information Systems Engineering","author":"N Tax","year":"2017","unstructured":"Tax, N., Verenich, I., La Rosa, M., Dumas, M.: Predictive business process monitoring with LSTM neural networks. In: Dubois, E., Pohl, K. (eds.) CAiSE 2017. LNCS, vol. 10253, pp. 477\u2013492. Springer, Cham (2017). https:\/\/doi.org\/10.1007\/978-3-319-59536-8_30"},{"key":"12_CR27","series-title":"Lecture Notes in Computer Science","doi-asserted-by":"publisher","first-page":"401","DOI":"10.1007\/978-3-319-45348-4_23","volume-title":"Business Process Management","author":"I Teinemaa","year":"2016","unstructured":"Teinemaa, I., Dumas, M., Maggi, F.M., Di Francescomarino, C.: Predictive business process monitoring with structured and unstructured data. In: La Rosa, M., Loos, P., Pastor, O. (eds.) BPM 2016. LNCS, vol. 9850, pp. 401\u2013417. Springer, Cham (2016). https:\/\/doi.org\/10.1007\/978-3-319-45348-4_23"},{"issue":"2","key":"12_CR28","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3301300","volume":"13","author":"I Teinemaa","year":"2019","unstructured":"Teinemaa, I., Dumas, M., Rosa, M.L., Maggi, F.M.: Outcome-oriented predictive process monitoring: Review and benchmark. ACM Trans. Knowl. Discovery Data (TKDD) 13(2), 1\u201357 (2019)","journal-title":"ACM Trans. Knowl. Discovery Data (TKDD)"},{"key":"12_CR29","doi-asserted-by":"publisher","unstructured":"Van Dongen, B.F. (Boudewijn): BPI Challenge 2012 (2012). https:\/\/doi.org\/10.4121\/UUID:3926DB30-F712-4394-AEBC-75976070E91F","DOI":"10.4121\/UUID:3926DB30-F712-4394-AEBC-75976070E91F"},{"key":"12_CR30","unstructured":"Verbeek, E., Buijs, J.C.A.M., van Dongen, B.F., van der Aalst, W.M.P.: Prom 6: the process mining toolkit. In: Proceedings of the Business Process Management 2010 Demonstration Track, Hoboken, NJ, USA, September 14\u201316, 2010. vol. 615. CEUR-WS.org (2010)"},{"key":"12_CR31","unstructured":"Wang, T., Zhu, J.Y., Torralba, A., Efros, A.A.: Dataset distillation. arXiv preprint arXiv:1811.10959 (2020)"},{"issue":"3","key":"12_CR32","doi-asserted-by":"publisher","first-page":"257","DOI":"10.1023\/A:1007626913721","volume":"38","author":"DR Wilson","year":"2000","unstructured":"Wilson, D.R., Martinez, T.R.: Reduction techniques for instance-basedlearning algorithms. Mach. Learn. 38(3), 257\u2013286 (2000). https:\/\/doi.org\/10.1023\/A:1007626913721","journal-title":"Mach. Learn."},{"issue":"3","key":"12_CR33","doi-asserted-by":"publisher","first-page":"408","DOI":"10.1109\/TSMC.1972.4309137","volume":"2","author":"DL Wilson","year":"1972","unstructured":"Wilson, D.L.: Asymptotic properties of nearest neighbor rules using edited data. Syst., Man Cyber., IEEE Trans. 2(3), 408\u2013421 (1972). https:\/\/doi.org\/10.1109\/TSMC.1972.4309137","journal-title":"Syst., Man Cyber., IEEE Trans."},{"key":"12_CR34","doi-asserted-by":"publisher","first-page":"350","DOI":"10.1016\/j.neucom.2017.01.026","volume":"237","author":"L Zhou","year":"2017","unstructured":"Zhou, L., Pan, S., Wang, J., Vasilakos, A.V.: Machine learning on big data: opportunities and challenges. Neurocomputing 237, 350\u2013361 (2017). https:\/\/doi.org\/10.1016\/j.neucom.2017.01.026","journal-title":"Neurocomputing"}],"container-title":["Lecture Notes in Business Information Processing","Process Mining Workshops"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/978-3-030-98581-3_12","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,3,23]],"date-time":"2022-03-23T18:05:52Z","timestamp":1648058752000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/978-3-030-98581-3_12"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022]]},"ISBN":["9783030985806","9783030985813"],"references-count":34,"URL":"https:\/\/doi.org\/10.1007\/978-3-030-98581-3_12","relation":{},"ISSN":["1865-1348","1865-1356"],"issn-type":[{"value":"1865-1348","type":"print"},{"value":"1865-1356","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022]]},"assertion":[{"value":"24 March 2022","order":1,"name":"first_online","label":"First Online","group":{"name":"ChapterHistory","label":"Chapter History"}},{"value":"ICPM","order":1,"name":"conference_acronym","label":"Conference Acronym","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"International Conference on Process Mining","order":2,"name":"conference_name","label":"Conference Name","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"Eindhoven","order":3,"name":"conference_city","label":"Conference City","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"The Netherlands","order":4,"name":"conference_country","label":"Conference Country","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"2021","order":5,"name":"conference_year","label":"Conference Year","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"31 October 2021","order":7,"name":"conference_start_date","label":"Conference Start Date","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"4 November 2021","order":8,"name":"conference_end_date","label":"Conference End Date","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"3","order":9,"name":"conference_number","label":"Conference Number","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"icpm2021a","order":10,"name":"conference_id","label":"Conference ID","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"https:\/\/icpmconference.org\/2021\/category\/calls\/workshops\/","order":11,"name":"conference_url","label":"Conference URL","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"Single-blind","order":1,"name":"type","label":"Type","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"EasyChair","order":2,"name":"conference_management_system","label":"Conference Management System","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"65","order":3,"name":"number_of_submissions_sent_for_review","label":"Number of Submissions Sent for Review","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"28","order":4,"name":"number_of_full_papers_accepted","label":"Number of Full Papers Accepted","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"0","order":5,"name":"number_of_short_papers_accepted","label":"Number of Short Papers Accepted","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"43% - The value is computed by the equation \"Number of Full Papers Accepted \/ Number of Submissions Sent for Review * 100\" and then rounded to a whole number.","order":6,"name":"acceptance_rate_of_full_papers","label":"Acceptance Rate of Full Papers","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"3","order":7,"name":"average_number_of_reviews_per_paper","label":"Average Number of Reviews per Paper","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"2","order":8,"name":"average_number_of_papers_per_reviewer","label":"Average Number of Papers per Reviewer","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"Yes","order":9,"name":"external_reviewers_involved","label":"External Reviewers Involved","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"The proceedings also include one survey paper on the results of the XES 2.9 Workshop.","order":10,"name":"additional_info_on_review_process","label":"Additional Info on Review Process","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}}]}}