{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T01:26:20Z","timestamp":1760059580696,"version":"build-2065373602"},"reference-count":42,"publisher":"MDPI AG","issue":"7","license":[{"start":{"date-parts":[[2025,6,23]],"date-time":"2025-06-23T00:00:00Z","timestamp":1750636800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Thammasat University Research Fund","award":["TUFT034\/2568"],"award-info":[{"award-number":["TUFT034\/2568"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Data"],"abstract":"<jats:p>User churn in online games refers to players becoming inactive for an extended period. Even a small increase in churn can lead to significant revenue loss, making churn prediction crucial for sustaining long-term player engagement. Although user churn prediction has been extensively studied, most existing approaches either ignore non-login periods or treat all inactivity uniformly, overlooking key behavioral differences. This study addresses this gap by categorizing non-login periods into three types, as follows: inactivity due to new or dormant users, genuine loss of interest, and temporary inaccessibility caused by external factors. These periods are treated as either non-existent or missing data and imputed using techniques such as mean or mode substitution, linear interpolation, and multiple imputation by chained equations (MICE). MICE was selected due to its ability to impute missing values more robustly by considering multivariate relationships. A random forest (RF) classifier, chosen for its interpretability and robustness to incomplete data, serves as the primary prediction model. Additionally, classifier chains are used to capture label dependencies, and principal component analysis (PCA) is applied to reduce dimensionality and mitigate overfitting. Experiments on real-world MMORPG data show that our approach improves predictive accuracy, achieving a micro-averaged AUC of above 0.92 and a weighted F1 score exceeding 0.70. These findings suggest that our approach improves churn prediction and offers actionable insights for supporting personalized player retention strategies.<\/jats:p>","DOI":"10.3390\/data10070096","type":"journal-article","created":{"date-parts":[[2025,6,24]],"date-time":"2025-06-24T10:44:41Z","timestamp":1750761881000},"page":"96","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["A Data Imputation Strategy to Enhance Online Game Churn Prediction, Considering Non-Login Periods"],"prefix":"10.3390","volume":"10","author":[{"ORCID":"https:\/\/orcid.org\/0009-0002-7729-7888","authenticated-orcid":false,"given":"JaeHong","family":"Lee","sequence":"first","affiliation":[{"name":"School of Information, Computer, and Communication Technology, Sirindhorn International Institute of Technology, Thammasat University, Pathum Thani 12120, Thailand"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8496-4197","authenticated-orcid":false,"given":"Pavinee","family":"Rerkjirattikal","sequence":"additional","affiliation":[{"name":"Department of Technology and Operations Management, Faculty of Business Administration, Kasetsart University, Bangkok 10900, Thailand"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7424-8469","authenticated-orcid":false,"given":"SangGyu","family":"Nam","sequence":"additional","affiliation":[{"name":"School of Information, Computer, and Communication Technology, Sirindhorn International Institute of Technology, Thammasat University, Pathum Thani 12120, Thailand"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2025,6,23]]},"reference":[{"key":"ref_1","unstructured":"AstuteAnalytica India Pvt. Ltd (2025, May 04). Video Games Market Valuation Is Set to Skyrocket to Reach US $656.21 Billion by 2033. Available online: https:\/\/www.globenewswire.com\/news-release\/2025\/02\/06\/3021813\/0\/en\/Video-Games-Market-Valuation-is-Set-to-Skyrocket-to-Reach-US-656-21-Billion-by-2033-Astute-Analytica.html."},{"key":"ref_2","unstructured":"NCSOFT (2024, February 09). Dataset for 2018 Bigcontest Competition. Available online: https:\/\/danbi-ncsoft.github.io\/."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"103034","DOI":"10.1016\/j.omega.2024.103034","article-title":"Profit-driven weighted classifier with interpretable ability for customer churn prediction","volume":"125","author":"Jiang","year":"2024","journal-title":"Omega"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Suguna, R., Suriya Prakash, J., Aditya Pai, H., Mahesh, T.R., Vinoth Kumar, V., and Yimer, T.E. (2025). Mitigating class imbalance in churn prediction with ensemble methods and SMOTE. Sci. Rep., 15.","DOI":"10.1038\/s41598-025-01031-0"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Chang, V., Hall, K., Xu, Q.A., Amao, F.O., Ganatra, M.A., and Benson, V. (2024). Prediction of Customer Churn Behavior in the Telecommunication Industry Using Machine Learning Models. Algorithms, 17.","DOI":"10.3390\/a17060231"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Imani, M., and Arabnia, H.R. (2023). Hyperparameter Optimization and Combined Data Sampling Techniques in Machine Learning for Customer Churn Prediction: A Comparative Analysis. Technologies, 11.","DOI":"10.20944\/preprints202308.1478.v4"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Musta\u010d, K., Ba\u010di\u0107, K., Skorin-Kapov, L., and Su\u017enjevi\u0107, M. (2022). Predicting player churn of a free-to-play mobile video game using supervised machine learning. Appl. Sci., 12.","DOI":"10.3390\/app12062795"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Kim, S., Choi, D., Lee, E., and Rhee, W. (2017). Churn prediction of mobile and online casual games using play log data. PLoS ONE, 12.","DOI":"10.1371\/journal.pone.0180735"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"215","DOI":"10.1109\/TG.2018.2888863","article-title":"Game data mining competition on churn prediction and survival analysis using commercial game log data","volume":"11","author":"Lee","year":"2019","journal-title":"IEEE Trans. Games"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Park, K., Cha, M., Kwak, H., and Chen, K.T. (2017, January 3\u20137). Achievement and friends: Key factors of player retention vary across player levels in online multiplayer games. Proceedings of the 26th International Conference on World Wide Web Companion, Perth, Australia.","DOI":"10.1145\/3041021.3054176"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Kawale, J., Pal, A., and Srivastava, J. (2009, January 29\u201331). Churn prediction in MMORPGs: A social influence based approach. Proceedings of the 2009 International Conference on Computational Science and Engineering, Vancouver, BC, Canada.","DOI":"10.1109\/CSE.2009.80"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Borbora, Z., Srivastava, J., Hsu, K.W., and Williams, D. (2011, January 9\u201311). Churn prediction in MMORPGs using player motivation theories and an ensemble approach. Proceedings of the 2011 IEEE Third International Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third International Conference on Social Computing, Boston, MA, USA.","DOI":"10.1109\/PASSAT\/SocialCom.2011.122"},{"key":"ref_13","first-page":"19","article-title":"Factors affecting members\u2019 sense of belonging in virtual community","volume":"35","author":"Lee","year":"2010","journal-title":"J. Korean Oper. Res. Manag. Sci. Soc."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Hadiji, F., Sifa, R., Drachen, A., Thurau, C., Kersting, K., and Bauckhage, C. (2014, January 26\u201329). Predicting player churn in the wild. Proceedings of the 2014 IEEE Conference on Computational Intelligence and Games, Dortmund, Germany.","DOI":"10.1109\/CIG.2014.6932876"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Seif El-Nasr, M., Drachen, A., and Canossa, A. (2013). Gameplay metrics in game user research: Examples from the trenches. Game Analytics, Springer.","DOI":"10.1007\/978-1-4471-4769-5"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Runge, J., Gao, P., Garcin, F., and Faltings, B. (2014, January 26\u201329). Churn prediction for high-value players in casual social games. Proceedings of the 2014 IEEE Conference on Computational Intelligence and Games, Dortmund, Germany.","DOI":"10.1109\/CIG.2014.6932875"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Tamassia, M., Raffe, W., Sifa, R., Drachen, A., Zambetta, F., and Hitchens, M. (2016, January 20\u201323). Predicting player churn in destiny: A hidden Markov models approach to predicting player departure in a major online game. Proceedings of the 2016 IEEE Conference on Computational Intelligence and Games (CIG), Santorini, Greece.","DOI":"10.1109\/CIG.2016.7860431"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"326","DOI":"10.1016\/j.eswa.2017.04.056","article-title":"Early churn prediction with personalized targeting in mobile social games","volume":"83","year":"2017","journal-title":"Expert Syst. Appl."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Peri\u00e1 nez, A., Saas, A., Guitart, A., and Magne, C. (2016, January 17\u201319). Churn prediction in mobile social games: Towards a complete assessment using survival ensembles. Proceedings of the 2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA), Montreal, QC, Canada.","DOI":"10.1109\/DSAA.2016.84"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Bertens, P., Guitart, A., and Peri\u00e1\u00f1ez, \u00c1. (2017, January 22\u201325). Games and big data: A scalable multi-dimensional churn prediction model. Proceedings of the 2017 IEEE Conference on Computational Intelligence and Games (CIG), New York, NY, USA.","DOI":"10.1109\/CIG.2017.8080412"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"165","DOI":"10.1007\/s10100-022-00802-8","article-title":"Clustering mixed-type player behavior data for churn prediction in mobile games","volume":"31","author":"Pahor","year":"2023","journal-title":"Cent. Eur. J. Oper. Res."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"100522","DOI":"10.1016\/j.entcom.2022.100522","article-title":"Predicting subscription renewal using binary classification in World of Warcraft","volume":"44","author":"Hossain","year":"2023","journal-title":"Entertain. Comput."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Hoang, H.D., and Cam, N.T. (2024, January 27\u201328). Early churn prediction in freemium game mobile using Transformer-based architecture for tabular data. Proceedings of the 2024 IEEE 3rd World Conference on Applied Intelligence and Computing (AIC), Gwalior, India.","DOI":"10.1109\/AIC61668.2024.10730924"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Dontireddy, S.R., Pallem, R.A., Chindukuri, V., Gurram, S.H., and Pannerselvam, I.R. (2024, January 4\u20135). Enhancing Transparency: AI Applications for Detecting Cheating and Predicting Player Attrition in Online Gaming. Proceedings of the 2024 International Conference on Signal Processing, Computation, Electronics, Power and Telecommunication (IConSCEPT), Karaikal, India.","DOI":"10.1109\/IConSCEPT61884.2024.10627922"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"103","DOI":"10.59796\/jcst.V15N2.2025.103","article-title":"Predicting Player Churn in the Gaming Industry: A Machine Learning Framework for Enhanced Retention Strategies","volume":"15","author":"Mulla","year":"2025","journal-title":"J. Curr. Sci. Technol."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"252","DOI":"10.3390\/make1010016","article-title":"The Winning Solution to the IEEE CIG 2017 Game Data Mining Competition","volume":"1","author":"Guitart","year":"2019","journal-title":"Mach. Learn. Knowl. Extr."},{"key":"ref_27","first-page":"1","article-title":"Predicting Churn Rate of the Massively Multiplayer Online Role-Playing Game (MMORPG) Users by Analyzing Playing Behavior","volume":"8","author":"Sin","year":"2019","journal-title":"Int. J. Sci. Technol. Res."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"101971","DOI":"10.1109\/ACCESS.2024.3429559","article-title":"Prediction of Churning Game Users Based on Social Activity and Churn Graph Neural Networks","volume":"12","author":"Han","year":"2024","journal-title":"IEEE Access"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"255","DOI":"10.1109\/TCIAIG.2015.2401979","article-title":"Churn prediction in online games using players\u2019 login records: A frequency analysis approach","volume":"7","author":"Castro","year":"2015","journal-title":"IEEE Trans. Comput. Intell. AI Games"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Templ, M. (2023). Enhancing Precision in Large-Scale Data Analysis: An Innovative Robust Imputation Algorithm for Managing Outliers and Missing Values. Mathematics, 11.","DOI":"10.3390\/math11122729"},{"key":"ref_31","first-page":"9","article-title":"Missing data imputation: Focusing on single imputation","volume":"4","author":"Zhang","year":"2016","journal-title":"Ann. Transl. Med."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"40","DOI":"10.1002\/mpr.329","article-title":"Multiple imputation by chained equations: What is it and how does it work?","volume":"20","author":"Azur","year":"2011","journal-title":"Int. J. Methods Psychiatr. Res."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1080\/02626667.2021.2001471","article-title":"Multiple imputations by chained equations for recovering missing daily streamflow observations: A case study of Langat River basin in Malaysia","volume":"67","author":"Hamzah","year":"2022","journal-title":"Hydrol. Sci. J."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"100083","DOI":"10.1016\/j.acags.2022.100083","article-title":"Multivariate imputation via chained equations for elastic well log imputation and prediction","volume":"14","author":"Hallam","year":"2022","journal-title":"Appl. Comput. Geosci."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1109\/TG.2018.2871215","article-title":"Profit optimizing churn prediction for long-term loyal customers in online games","volume":"12","author":"Lee","year":"2018","journal-title":"IEEE Trans. Games"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Liao, H.Y., Chen, K.Y., Liu, D.R., and Chiu, Y.L. (2015, January 12\u201316). Customer churn prediction in virtual worlds. Proceedings of the 2015 IIAI 4th International Congress on Advanced Applied Informatics, Okayama, Japan.","DOI":"10.1109\/IIAI-AAI.2015.265"},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"51","DOI":"10.1016\/j.dss.2017.05.015","article-title":"User segmentation for retention management in online social games","volume":"101","author":"Fu","year":"2017","journal-title":"Decis. Support Syst."},{"key":"ref_38","first-page":"2825","article-title":"Scikit-learn: Machine learning in Python","volume":"12","author":"Pedregosa","year":"2011","journal-title":"J. Mach. Learn. Res."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"93","DOI":"10.1002\/wics.14","article-title":"Ridge regression","volume":"1","author":"McDonald","year":"2009","journal-title":"Wiley Interdiscip. Rev. Comput. Stat."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Cohen, J. (2013). Statistical Power Analysis for the Behavioral Sciences, Routledge.","DOI":"10.4324\/9780203771587"},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"171","DOI":"10.1023\/A:1010920819831","article-title":"A simple generalisation of the area under the ROC curve for multiple class classification problems","volume":"45","author":"Hand","year":"2001","journal-title":"Mach. Learn."},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"333","DOI":"10.1007\/s10994-011-5256-5","article-title":"Classifier chains for multi-label classification","volume":"85","author":"Read","year":"2011","journal-title":"Mach. Learn."}],"container-title":["Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2306-5729\/10\/7\/96\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,9]],"date-time":"2025-10-09T17:57:09Z","timestamp":1760032629000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2306-5729\/10\/7\/96"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,6,23]]},"references-count":42,"journal-issue":{"issue":"7","published-online":{"date-parts":[[2025,7]]}},"alternative-id":["data10070096"],"URL":"https:\/\/doi.org\/10.3390\/data10070096","relation":{},"ISSN":["2306-5729"],"issn-type":[{"type":"electronic","value":"2306-5729"}],"subject":[],"published":{"date-parts":[[2025,6,23]]}}}