{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,31]],"date-time":"2026-03-31T13:45:06Z","timestamp":1774964706830,"version":"3.50.1"},"reference-count":56,"publisher":"MDPI AG","issue":"4","license":[{"start":{"date-parts":[[2025,4,17]],"date-time":"2025-04-17T00:00:00Z","timestamp":1744848000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Symmetry"],"abstract":"<jats:p>Traditional marketing uplift models suffer from a fundamental limitation: they typically operate under static assumptions that fail to capture the temporal dynamics of customer responses to marketing interventions. This paper introduces a novel framework that combines causal forest algorithms with deep reinforcement learning to dynamically model marketing uplift effects. Our approach enables the real-time identification of heterogeneous treatment effects across customer segments while simultaneously optimizing intervention strategies through an adaptive learning mechanism. The key innovations of our framework include the following: (1) a counterfactual simulation environment that emulates diverse customer response patterns; (2) an adaptive reward mechanism that captures both immediate and long-term intervention outcomes; and (3) a dynamic policy optimization process that continually refines targeting strategies based on evolving customer behaviors. Empirical evaluations on both simulated and real-world marketing campaign data demonstrate that our approach significantly outperforms traditional static uplift models, achieving up to a 27% improvement in targeting efficiency and an 18% increase in the return on marketing investment. The framework leverages inherent symmetries in customer-intervention interactions, where balanced and symmetric reward structures ensure fair optimization across diverse customer segments. The proposed framework addresses the limitations of existing methods by effectively modeling the dynamic and heterogeneous nature of customer responses to marketing interventions, providing marketers with a powerful tool for implementing personalized and adaptive campaign strategies.<\/jats:p>","DOI":"10.3390\/sym17040610","type":"journal-article","created":{"date-parts":[[2025,4,17]],"date-time":"2025-04-17T20:50:46Z","timestamp":1744923046000},"page":"610","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["Dynamic Marketing Uplift Modeling: A Symmetry-Preserving Framework Integrating Causal Forests with Deep Reinforcement Learning for Personalized Intervention Strategies"],"prefix":"10.3390","volume":"17","author":[{"given":"Jiyuan","family":"Wang","sequence":"first","affiliation":[{"name":"The Fuqua School of Business, Duke University, Durham, NC 27708, USA"}]},{"given":"Yutong","family":"Tan","sequence":"additional","affiliation":[{"name":"School of Business, Wake Forest University, Winston-Salem, NC 27109, USA"}]},{"given":"Bingying","family":"Jiang","sequence":"additional","affiliation":[{"name":"School of Business, University of Wisconsin-Madison, Madison, WI 53706, USA"}]},{"given":"Bi","family":"Wu","sequence":"additional","affiliation":[{"name":"Anderson School of Management, University of California, Los Angeles, CA 90095, USA"}]},{"given":"Wenhe","family":"Liu","sequence":"additional","affiliation":[{"name":"School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA"}]}],"member":"1968","published-online":{"date-parts":[[2025,4,17]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"80","DOI":"10.1509\/jmr.16.0163","article-title":"Retention Futility: Targeting High-Risk Customers Might Be Ineffective","volume":"55","author":"Ascarza","year":"2018","journal-title":"J. Mark. Res."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"647","DOI":"10.1016\/j.ejor.2019.11.030","article-title":"Response Transformation and Profit Decomposition for Revenue Uplift Modeling","volume":"283","author":"Gubela","year":"2020","journal-title":"Eur. J. Oper. Res."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Chohlas-Wood, A., Coots, M., Zhu, H., Brunskill, E., and Goel, S. (2024). Learning to Be Fair: A Consequentialist Approach to Equitable Decision Making. Manag. Sci.","DOI":"10.1287\/mnsc.2022.00345"},{"key":"ref_4","first-page":"115","article-title":"Heterogeneous Treatment Effects and Optimal Targeting Policy Evaluation","volume":"22","author":"Hitsch","year":"2024","journal-title":"SSRN Electron. J."},{"key":"ref_5","unstructured":"Bhattacharya, R., Malinsky, D., and Shpitser, I. (2020). Causal inference under interference and network uncertainty. Uncertainty in Artificial Intelligence, Elsevier."},{"key":"ref_6","unstructured":"Guelman, L., Guill\u00e9n, M., and P\u00e9rez-Mar\u00edn, A.M. (2015). Optimal Personalized Treatment Rules for Marketing Interventions: A Review of Methods, a New Proposal, and an Insurance Case Study. UB Riskcenter Working Paper Series, Universitat de Barcelona. 2015\/06."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Goldenberg, D., Albert, J., Bernardi, L., and Estevez, P. (2020, January 22\u201326). Free lunch! retrospective uplift modeling for dynamic promotions recommendation within roi constraints. Proceedings of the 14th ACM Conference on Recommender Systems, Online.","DOI":"10.1145\/3383313.3412215"},{"key":"ref_8","unstructured":"Radcliffe, N.J., and Surry, P.D. (2011). Real-world uplift modelling with significance-based uplift trees. White Paper TR-2011-1, Stochastic Solutions."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"1228","DOI":"10.1080\/01621459.2017.1319839","article-title":"Estimation and Inference of Heterogeneous Treatment Effects using Random Forests","volume":"113","author":"Wager","year":"2018","journal-title":"J. Am. Stat. Assoc."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"1148","DOI":"10.1214\/18-AOS1709","article-title":"Generalized Random Forests","volume":"47","author":"Athey","year":"2019","journal-title":"Ann. Stat."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"484","DOI":"10.1038\/nature16961","article-title":"Mastering the Game of Go with Deep Neural Networks and Tree Search","volume":"529","author":"Silver","year":"2016","journal-title":"Nature"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"529","DOI":"10.1038\/nature14236","article-title":"Human-level Control Through Deep Reinforcement Learning","volume":"518","author":"Mnih","year":"2015","journal-title":"Nature"},{"key":"ref_13","unstructured":"Tkachenko, Y. (2015). Autonomous CRM Control via CLV Approximation with Deep Reinforcement Learning in Discrete and Continuous Action Space. arXiv."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"303","DOI":"10.1007\/s10115-011-0434-0","article-title":"Decision Trees for Uplift Modeling with Single and Multiple Treatments","volume":"32","author":"Rzepakowski","year":"2012","journal-title":"Knowl. Inf. Syst."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"He, B., Weng, Y., Tang, X., Cui, Z., Sun, Z., Chen, L., He, X., and Ma, C. (2024, January 25\u201329). Rankability-enhanced revenue uplift modeling framework for online marketing. Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Barcelona, Spain.","DOI":"10.1145\/3637528.3671516"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"78","DOI":"10.1145\/772862.772872","article-title":"The True Lift Model: A Novel Data Mining Approach to Response Modeling in Database Marketing","volume":"4","author":"Lo","year":"2002","journal-title":"ACM SIGKDD Explor. Newsl."},{"key":"ref_17","first-page":"14","article-title":"Using Control Groups to Target on Predicted Lift: Building and Assessing Uplift Models","volume":"8","author":"Radcliffe","year":"2007","journal-title":"Direct Mark. Anal. J."},{"key":"ref_18","unstructured":"Jaskowski, M., and Jaroszewicz, S. (26\u20131, January 26). Uplift Modeling for Clinical Trial Data. Proceedings of the ICML Workshop on Clinical Data Analysis, Edinburgh, Scotland."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Rzepakowski, P., and Jaroszewicz, S. (2010, January 13\u201317). Decision Trees for Uplift Modeling. Proceedings of the 10th IEEE International Conference on Data Mining, Sydney, Australia.","DOI":"10.1109\/ICDM.2010.62"},{"key":"ref_20","unstructured":"Guelman, L., Guill\u00e9n, M., and P\u00e9rez-Mar\u00edn, A.M. (June, January 30). Random Forests for Uplift Modeling: An Insurance Customer Retention Case. Proceedings of the International Conference on Modeling and Simulation in Engineering, Economics and Management, New Rochelle, NY, USA."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Zhao, Y., and Zeng, X. (2017, January 27\u201329). Uplift Modeling with Multiple Treatments and General Response Types. Proceedings of the 2017 SIAM International Conference on Data Mining, Houston, TX, USA.","DOI":"10.1137\/1.9781611974973.66"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"447","DOI":"10.1146\/annurev-statistics-022513-115553","article-title":"Dynamic treatment regimes","volume":"1","author":"Chakraborty","year":"2014","journal-title":"Annu. Rev. Stat. Its Appl."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"910","DOI":"10.1109\/TKDE.2024.3510734","article-title":"Sequential Causal Effect Estimation by Jointly Modeling the Unmeasured Confounders and Instrumental Variables","volume":"37","author":"Sun","year":"2024","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Zhang, X., Wang, K., Wang, Z., Du, B., Zhao, S., Wu, R., Shen, X., Lv, T., and Fan, C. (2024, January 25\u201329). Temporal Uplift Modeling for Online Marketing. Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Barcelona, Spain.","DOI":"10.1145\/3637528.3671560"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Bokelmann, B., and Lessmann, S. (2025). Heteroscedasticity-aware stratified sampling to improve uplift modeling. Eur. J. Oper. Res., in press.","DOI":"10.1016\/j.ejor.2025.02.030"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Alkire, L., Hesse, L., Raki, A., Boenigk, S., Kabadayi, S., Fisk, R.P., and Mora, A. (2025). From theory to practice: A collaborative approach to social impact measurement and communication. Eur. J. Mark., in press.","DOI":"10.1108\/EJM-04-2024-0321"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"134","DOI":"10.1093\/ectj\/utaa014","article-title":"Machine Learning Estimation of Heterogeneous Causal Effects: Empirical Monte Carlo Evidence","volume":"24","author":"Knaus","year":"2021","journal-title":"Econom. J."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"217","DOI":"10.1198\/jcgs.2010.08162","article-title":"Bayesian Nonparametric Modeling for Causal Inference","volume":"20","author":"Hill","year":"2011","journal-title":"J. Comput. Graph. Stat."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"4156","DOI":"10.1073\/pnas.1804597116","article-title":"Metalearners for Estimating Heterogeneous Treatment Effects Using Machine Learning","volume":"116","author":"Sekhon","year":"2019","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"299","DOI":"10.1093\/biomet\/asaa076","article-title":"Quasi-Oracle Estimation of Heterogeneous Treatment Effects","volume":"108","author":"Nie","year":"2021","journal-title":"Biometrika"},{"key":"ref_31","first-page":"1179","article-title":"Optimal Doubly Robust Estimation of Heterogeneous Causal Effects","volume":"48","author":"Kennedy","year":"2020","journal-title":"Ann. Stat."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Sun, Z., He, B., Ma, M., Tang, J., Wang, Y., Ma, C., and Liu, D. (2023, January 1\u20134). Robustness-enhanced uplift modeling with adversarial feature desensitization. Proceedings of the 2023 IEEE International Conference on Data Mining (ICDM), Shanghai, China.","DOI":"10.1109\/ICDM58522.2023.00169"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Chernozhukov, V., Demirer, M., Duflo, E., and Fern\u00e1ndez-Val, I. (2018). Generic Machine Learning Inference on Heterogeneous Treatment Effects in Randomized Experiments. National Bureau of Economic Research, Working Paper No. 24678, National Bureau of Economic Research.","DOI":"10.3386\/w24678"},{"key":"ref_34","unstructured":"Bica, I., Alaa, A.M., Jordon, J., and van der Schaar, M. (2020, January 30). Estimating Counterfactual Treatment Outcomes over Time through Adversarially Balanced Representations. Proceedings of the International Conference on Learning Representations, Addis Ababa, Ethiopia."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Sutton, R.S., and Barto, A.G. (1998). Reinforcement Learning: An Introduction, MIT Press.","DOI":"10.1109\/TNN.1998.712192"},{"key":"ref_36","first-page":"1","article-title":"Leveraging Reinforcement Learning and Natural Language Processing for AI-Driven Hyper-Personalized Marketing Strategies","volume":"10","author":"Rohit","year":"2021","journal-title":"Int. J. Innov."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"500","DOI":"10.1287\/mksc.2016.1023","article-title":"Customer Acquisition via Display Advertising Using Multi-Armed Bandit Experiments","volume":"36","author":"Schwartz","year":"2017","journal-title":"Mark. Sci."},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Theocharous, G., Thomas, P.S., and Ghavamzadeh, M. (2015, January 18\u201322). Ad recommendation systems for life-time value optimization. Proceedings of the 24th International Conference on World Wide Web, Florence, Italy.","DOI":"10.1145\/2740908.2741998"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Zhao, J., Qiu, G., Guan, Z., Zhao, W., and He, X. (2018, January 19\u201323). Deep Reinforcement Learning for Sponsored Search Real-time Bidding. Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, London, UK.","DOI":"10.1145\/3219819.3219918"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Chen, M., Beutel, A., Covington, P., Jain, S., Belletti, F., and Chi, E.H. (2019, January 11\u201315). Top-K Off-Policy Correction for a REINFORCE Recommender System. Proceedings of the 12th ACM International Conference on Web Search and Data Mining, Melbourne, Australia.","DOI":"10.1145\/3289600.3290999"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Cai, H., Ren, K., Zhang, W., Malialis, K., Wang, J., Yu, Y., and Guo, D. (2017, January 6\u201310). Real-time bidding by reinforcement learning in display advertising. Proceedings of the tenth ACM International Conference on Web Search and Data Mining, Cambridge, UK.","DOI":"10.1145\/3018661.3018702"},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"2419","DOI":"10.1007\/s10994-021-05961-4","article-title":"Challenges of Real-World Reinforcement Learning: Definitions, Benchmarks and Analysis","volume":"110","author":"Levine","year":"2021","journal-title":"Mach. Learn."},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Lattimore, T., and Szepesv\u00e1ri, C. (2020). Bandit Algorithms, Cambridge University Press.","DOI":"10.1017\/9781108571401"},{"key":"ref_44","first-page":"3207","article-title":"Counterfactual Reasoning and Learning Systems: The Example of Computational Advertising","volume":"14","author":"Bottou","year":"2013","journal-title":"J. Mach. Learn. Res."},{"key":"ref_45","unstructured":"Thomas, P.S., Theocharous, G., and Ghavamzadeh, M. (2016, January 20\u201322). Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning. Proceedings of the 33rd International Conference on Machine Learning, New York, NY, USA."},{"key":"ref_46","unstructured":"Deng, Z., Jiang, J., Long, G., and Zhang, C. (2023). Causal reinforcement learning: A survey. arXiv."},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"322","DOI":"10.1198\/016214504000001880","article-title":"Causal inference using potential outcomes: Design, modeling, decisions","volume":"100","author":"Rubin","year":"2005","journal-title":"J. Am. Stat. Assoc."},{"key":"ref_48","unstructured":"Puterman, M.L. (2014). Markov Decision Processes: Discrete Stochastic Dynamic Programming, John Wiley & Sons."},{"key":"ref_49","unstructured":"Diemert, E., Betlei, A., Renaudin, C., and Amini, M.-R. (2018). A large scale benchmark for uplift modeling. arXiv."},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Imbens, G.W., and Rubin, D.B. (2015). Causal Inference in Statistics, Social, and Biomedical Sciences, Cambridge University Press.","DOI":"10.1017\/CBO9781139025751"},{"key":"ref_51","unstructured":"Gutierrez, P., and G\u00e9rardy, J.Y. (2017, January 24\u201325). Causal inference and uplift modelling: A review of the literature. Proceedings of the International Conference on Predictive Applications and APIs, Boston, MA, USA."},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1089\/big.2017.0104","article-title":"A literature survey and experimental evaluation of the state-of-the-art in uplift modeling: A stepping stone toward the development of prescriptive analytics","volume":"6","author":"Devriendt","year":"2018","journal-title":"Big Data"},{"key":"ref_53","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1177\/1094670506293810","article-title":"Modeling customer lifetime value","volume":"9","author":"Gupta","year":"2006","journal-title":"J. Serv. Res."},{"key":"ref_54","doi-asserted-by":"crossref","first-page":"415","DOI":"10.1509\/jmkr.2005.42.4.415","article-title":"RFM and CLV: Using iso-value curves for customer base analysis","volume":"42","author":"Fader","year":"2005","journal-title":"J. Mark. Res."},{"key":"ref_55","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/2523813","article-title":"A survey on concept drift adaptation","volume":"46","author":"Gama","year":"2014","journal-title":"ACM Comput. Surv."},{"key":"ref_56","unstructured":"Battocchi, K., Dillon, E., Hei, M., Lewis, G., Oka, P., Oprescu, M., and Syrgkanis, V. (2025, April 01). EconML: A Python Package for ML-Based Heterogeneous Treatment Effects Estimation. Version 0.x. Available online: https:\/\/github.com\/py-why\/EconML."}],"container-title":["Symmetry"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2073-8994\/17\/4\/610\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,9]],"date-time":"2025-10-09T17:16:52Z","timestamp":1760030212000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2073-8994\/17\/4\/610"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,4,17]]},"references-count":56,"journal-issue":{"issue":"4","published-online":{"date-parts":[[2025,4]]}},"alternative-id":["sym17040610"],"URL":"https:\/\/doi.org\/10.3390\/sym17040610","relation":{},"ISSN":["2073-8994"],"issn-type":[{"value":"2073-8994","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,4,17]]}}}