{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,7,4]],"date-time":"2026-07-04T03:21:17Z","timestamp":1783135277530,"version":"3.54.6"},"reference-count":49,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2026,5,4]],"date-time":"2026-05-04T00:00:00Z","timestamp":1777852800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2026,5,4]],"date-time":"2026-05-04T00:00:00Z","timestamp":1777852800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"TU Wien"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Auton Agent Multi-Agent Syst"],"published-print":{"date-parts":[[2026,6]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>\n                    We present\n                    <jats:inline-formula>\n                      <jats:tex-math>$$\\varepsilon $$<\/jats:tex-math>\n                    <\/jats:inline-formula>\n                    , a general exploration strategy for reinforcement learning (RL) that encourages adherence to behavioral preferences while preserving the convergence guarantees of the underlying RL algorithm.\n                    <jats:inline-formula>\n                      <jats:tex-math>$$\\varepsilon $$<\/jats:tex-math>\n                    <\/jats:inline-formula>\n                    \u00a0maintains a dynamic collection of retrain areas\u2014regions of the state space where the agent previously violated a specified preference\u2014and mixes the standard uniform restart distribution with states from these areas, according to a decaying parameter\n                    <jats:inline-formula>\n                      <jats:tex-math>$$\\varepsilon $$<\/jats:tex-math>\n                    <\/jats:inline-formula>\n          
          . This mixed retraining thus focuses on enforcing the desired behaviors in the collected areas. We develop the theory for both policy and value-based methods, showing that: (i) in policy-based settings, our method retains monotonic improvement bounds; and (ii) in value-based settings,\n                    <jats:inline-formula>\n                      <jats:tex-math>$$\\varepsilon $$<\/jats:tex-math>\n                    <\/jats:inline-formula>\n                    \u00a0preserves convergence properties without additional assumptions. The approach is simple to integrate into existing RL algorithms and improves sample efficiency and behavioral adherence in the locomotion, power systems, and navigation tasks tested. These results establish\n                    <jats:inline-formula>\n                      <jats:tex-math>$$\\varepsilon $$<\/jats:tex-math>\n                    <\/jats:inline-formula>\n                    \u00a0as a lightweight, theoretically grounded mechanism for incorporating behavioral preferences into RL.\n                  <\/jats:p>","DOI":"10.1007\/s10458-026-09748-6","type":"journal-article","created":{"date-parts":[[2026,5,4]],"date-time":"2026-05-04T15:05:52Z","timestamp":1777907152000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["$$\\varepsilon $$-retraining reinforcement learning algorithms"],"prefix":"10.1007","volume":"40","author":[{"given":"Luca","family":"Marzari","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Changliu","family":"Liu","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Priya 
L.","family":"Donti","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Enrico","family":"Marchesini","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2026,5,4]]},"reference":[{"key":"9748_CR1","unstructured":"Amodei, D., Olah, C., Steinhardt, J., Christiano, P. F., Schulman, J., & Man\u00e9, D. (2016). Concrete problems in AI safety. arXiv preprint arXiv:1606.06565"},{"key":"9748_CR2","doi-asserted-by":"publisher","unstructured":"Marzari, L., Donti, P. L., Liu, C., & Marchesini, E. (2025). Improving policy optimization via $$\\epsilon $$-retrain. In Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2025 (pp. 1464\u20131472). https:\/\/doi.org\/10.5555\/3709347.3743780 . https:\/\/dl.acm.org\/doi\/10.5555\/3709347.3743780","DOI":"10.5555\/3709347.3743780"},{"key":"9748_CR3","unstructured":"Kakade, S., & Langford, J. (2002). Approximately optimal approximate reinforcement learning. In Proceedings of the Nineteenth International Conference on Machine Learning (ICML) (pp. 267\u2013274)."},{"key":"9748_CR4","unstructured":"Schulman, J., Levine, S., Abbeel, P., Jordan, M., & Moritz, P. (2015). Trust region policy optimization. In International Conference on Machine Learning (ICML) (pp. 1889\u20131897)."},{"key":"9748_CR5","unstructured":"Eysenbach, B., Gu, S., Ibarz, J., & Levine, S. (2018). Leave no trace: Learning to reset for safe and autonomous reinforcement learning. In 6th International Conference on Learning Representations, (ICLR). https:\/\/openreview.net\/forum?id=S1vuO-bCW"},{"key":"9748_CR6","doi-asserted-by":"publisher","first-page":"12951","DOI":"10.52202\/075280-0569","volume":"36","author":"Y Jiang","year":"2023","unstructured":"Jiang, Y., Kolter, J. Z., & Raileanu, R. (2023). On the importance of exploration for generalization in reinforcement learning. 
Advances in Neural Information Processing Systems, 36, 12951\u201312986.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"9748_CR7","unstructured":"Lagoudakis, M. G., & Parr, R. (2003). Reinforcement learning as classification: Leveraging modern classifiers. In Proceedings of the Twentieth International Conference on Machine Learning (ICML) (pp. 424\u2013431)."},{"key":"9748_CR8","first-page":"1754","volume":"26","author":"V Gabillon","year":"2013","unstructured":"Gabillon, V., Ghavamzadeh, M., & Scherrer, B. (2013). Approximate dynamic programming finally performs well in the game of tetris. Advances in Neural Information Processing Systems, 26, 1754\u20131762.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"9748_CR9","unstructured":"Marchesini, E., & Amato, C. (2023). Improving deep policy gradients with value function search. In The eleventh international conference on learning representations. https:\/\/openreview.net\/forum?id=6qZC7pfenQm"},{"key":"9748_CR10","doi-asserted-by":"publisher","first-page":"12334","DOI":"10.52202\/079017-0395","volume":"37","author":"Z Mhammedi","year":"2024","unstructured":"Mhammedi, Z., Foster, D. J., & Rakhlin, A. (2024). The power of resets in online reinforcement learning. Advances in Neural Information Processing Systems, 37, 12334\u201312407. https:\/\/doi.org\/10.52202\/079017-0395","journal-title":"Advances in Neural Information Processing Systems"},{"key":"9748_CR11","unstructured":"Altman, E. (1999). Constrained Markov decision processes. In CRC Press."},{"key":"9748_CR12","unstructured":"Schulman, J., Wolski, F., Dhariwal, P., Radford, A., & Klimov, O. (2017). Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347"},{"key":"9748_CR13","unstructured":"Stooke, A., Achiam, J., & Abbeel, P. (2020). Responsive safety in reinforcement learning by PID lagrangian methods. 
In Proceedings of the 37th International Conference on Machine Learning, (ICML) (pp. 9133\u20139143). http:\/\/proceedings.mlr.press\/v119\/stooke20a.html"},{"key":"9748_CR14","doi-asserted-by":"publisher","unstructured":"Zhang, L., Shen, L., Yang, L., Chen, S., Wang, X., Yuan, B., & Tao, D. (2022). Penalized proximal policy optimization for safe reinforcement learning. In Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, (IJCAI) (pp. 3744\u20133750). https:\/\/doi.org\/10.24963\/IJCAI.2022\/520","DOI":"10.24963\/IJCAI.2022\/520"},{"key":"9748_CR15","unstructured":"Achiam, J., Held, D., Tamar, A., & Abbeel, P. (2017). Constrained policy optimization. In Proceedings of the 34th International Conference on Machine Learning, (ICML) (pp. 22\u201331). http:\/\/proceedings.mlr.press\/v70\/achiam17a.html"},{"key":"9748_CR16","unstructured":"Sootla, A., Cowen-Rivers, A. I., Jafferjee, T., Wang, Z., Mguni, D. H., Wang, J., & Ammar, H. (2022). Saute RL: almost surely safe reinforcement learning using state augmentation. In International Conference on Machine Learning, (ICML) (pp. 20423\u201320443). https:\/\/proceedings.mlr.press\/v162\/sootla22a.html"},{"key":"9748_CR17","doi-asserted-by":"publisher","first-page":"279","DOI":"10.1007\/BF00992698","volume":"8","author":"CJCH Watkins","year":"1992","unstructured":"Watkins, C. J. C. H., & Dayan, P. (1992). Technical note Q-learning. Machine Learning, 8, 279\u2013292. https:\/\/doi.org\/10.1007\/BF00992698","journal-title":"Machine Learning"},{"issue":"4","key":"9748_CR18","doi-asserted-by":"publisher","first-page":"143","DOI":"10.1145\/3197517.3201311","volume":"37","author":"XB Peng","year":"2018","unstructured":"Peng, X. B., Abbeel, P., Levine, S., & Panne, M. (2018). Deepmimic: Example-guided deep reinforcement learning of physics-based character skills. 
ACM Transactions on Graphics, 37(4), 143\u2013114314.","journal-title":"ACM Transactions on Graphics"},{"issue":"7847","key":"9748_CR19","doi-asserted-by":"publisher","first-page":"580","DOI":"10.1038\/S41586-020-03157-9","volume":"590","author":"A Ecoffet","year":"2021","unstructured":"Ecoffet, A., Huizinga, J., Lehman, J., Stanley, K. O., & Clune, J. (2021). First return, then explore. Nature, 590(7847), 580\u2013586. https:\/\/doi.org\/10.1038\/S41586-020-03157-9","journal-title":"Nature"},{"key":"9748_CR20","doi-asserted-by":"publisher","unstructured":"Messikommer, N., Song, Y., & Scaramuzza, D. (2024). Contrastive initial state buffer for reinforcement learning. In IEEE International Conference on Robotics and Automation, (ICRA) (pp. 2866\u20132872). https:\/\/doi.org\/10.1109\/ICRA57147.2024.10610528","DOI":"10.1109\/ICRA57147.2024.10610528"},{"key":"9748_CR21","unstructured":"Ray, A., Achiam, J., & Amodei, D. (2019). Benchmarking safe exploration in deep reinforcement learning. In OpenAI Blog."},{"key":"9748_CR22","unstructured":"Roy, J., Girgis, R., Romoff, J., Bacon, P., & Pal, C. J. (2022). Direct behavior specification via constrained reinforcement learning. In International Conference on Machine Learning, (ICML) (pp. 18828\u201318843). https:\/\/proceedings.mlr.press\/v162\/roy22a.html"},{"key":"9748_CR23","doi-asserted-by":"publisher","unstructured":"Nocedal, J., & Wright, S. J. (2006). Numerical Optimization (2nd ed.). Springer, New York. https:\/\/doi.org\/10.1007\/978-0-387-40065-5","DOI":"10.1007\/978-0-387-40065-5"},{"key":"9748_CR24","doi-asserted-by":"publisher","unstructured":"Wei, T., Hu, H., Marzari, L., Yun, K. S., Niu, P., Luo, X., & Liu, C. (2025). Modelverification.jl: A comprehensive toolbox for formally verifying deep neural networks. In Computer Aided Verification - 37th International Conference, (CAV) (pp. 395\u2013408). 
https:\/\/doi.org\/10.1007\/978-3-031-98679-6_18","DOI":"10.1007\/978-3-031-98679-6_18"},{"issue":"3\u20134","key":"9748_CR25","doi-asserted-by":"publisher","first-page":"244","DOI":"10.1561\/2400000035","volume":"4","author":"C Liu","year":"2021","unstructured":"Liu, C., Arnon, T., Lazarus, C., Strong, C., Barrett, C., Kochenderfer, M. J., et al. (2021). Algorithms for verifying deep neural networks. Foundations and Trends in Optimization, 4(3\u20134), 244\u2013404.","journal-title":"Foundations and Trends in Optimization"},{"key":"9748_CR26","doi-asserted-by":"publisher","unstructured":"Marzari, L., Cicalese, F., & Farinelli, A. (2025). Probabilistically tightened linear relaxation-based perturbation analysis for neural network verification. Journal of Artificial Intelligence Research, 84. https:\/\/doi.org\/10.1613\/JAIR.1.20808","DOI":"10.1613\/JAIR.1.20808"},{"key":"9748_CR27","doi-asserted-by":"publisher","unstructured":"Marchesini, E., Marzari, L., Farinelli, A., & Amato, C. (2023). Safe deep reinforcement learning by verifying task-level properties. In Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, (AAMAS) (pp. 1466\u20131475). https:\/\/doi.org\/10.5555\/3545946.3598799","DOI":"10.5555\/3545946.3598799"},{"key":"9748_CR28","doi-asserted-by":"publisher","unstructured":"Marzari, L., Marchesini, E., & Farinelli, A. (2023). Online safety property collection and refinement for safe deep reinforcement learning in mapless navigation. In IEEE International Conference on Robotics and Automation, (ICRA) (pp. 7133\u20137139). https:\/\/doi.org\/10.1109\/ICRA48891.2023.10161312","DOI":"10.1109\/ICRA48891.2023.10161312"},{"issue":"1","key":"9748_CR29","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1145\/3770068","volume":"17","author":"L Marzari","year":"2025","unstructured":"Marzari, L., Cicalese, F., Farinelli, A., Amato, C., & Marchesini, E. (2025). 
Verifying online safety properties for safe deep reinforcement learning. ACM Transactions on Intelligent Systems and Technology, 17(1), 3\u20131327. https:\/\/doi.org\/10.1145\/3770068","journal-title":"ACM Transactions on Intelligent Systems and Technology"},{"key":"9748_CR30","unstructured":"Wang, S., Pei, K., Whitehouse, J., Yang, J., & Jana, S. (2018). Formal security analysis of neural networks using symbolic intervals. In 27th USENIX Security Symposium, (USENIX) (pp. 1599\u20131614). https:\/\/www.usenix.org\/conference\/usenixsecurity18\/presentation\/wang-shiqi"},{"key":"9748_CR31","doi-asserted-by":"publisher","unstructured":"Marzari, L., Corsi, D., Marchesini, E., Farinelli, A., & Cicalese, F. (2024). Enumerating safe regions in deep neural networks with provable probabilistic guarantees. In Thirty-Eighth AAAI conference on artificial intelligence (pp. 21387\u201321394). https:\/\/doi.org\/10.1609\/AAAI.V38I19.30134","DOI":"10.1609\/AAAI.V38I19.30134"},{"key":"9748_CR32","doi-asserted-by":"publisher","unstructured":"Marzari, L., Bicego, M., Cicalese, F., & Farinelli, A. (2026). On the probabilistic learnability of compact neural network preimage bounds. In Fortieth AAAI conference on artificial intelligence (pp. 35707\u201335714). https:\/\/doi.org\/10.1609\/AAAI.V40I42.40883","DOI":"10.1609\/AAAI.V40I42.40883"},{"issue":"10","key":"9748_CR33","doi-asserted-by":"publisher","first-page":"9630","DOI":"10.1109\/LRA.2025.3596431","volume":"10","author":"L Marzari","year":"2025","unstructured":"Marzari, L., Trotti, F., Marchesini, E., & Farinelli, A. (2025). Designing control barrier function via probabilistic enumeration for safe reinforcement learning navigation. IEEE Robotics and Automation Letters, 10(10), 9630\u20139637. https:\/\/doi.org\/10.1109\/LRA.2025.3596431","journal-title":"IEEE Robotics and Automation Letters"},{"key":"9748_CR34","doi-asserted-by":"publisher","unstructured":"Marzari, L., Corsi, D., Cicalese, F., & Farinelli, A. (2023). 
The #dnn-verification problem: Counting unsafe inputs for deep neural networks. In Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, (IJCAI) (pp. 217\u2013224). https:\/\/doi.org\/10.24963\/IJCAI.2023\/25","DOI":"10.24963\/IJCAI.2023\/25"},{"key":"9748_CR35","doi-asserted-by":"publisher","first-page":"1437","DOI":"10.5555\/2789272.2886795","volume":"16","author":"J Garc\u00eda","year":"2015","unstructured":"Garc\u00eda, J., & Fern\u00e1ndez, F. (2015). A comprehensive survey on safe reinforcement learning. Journal of Machine Learning Research, 16, 1437\u20131480. https:\/\/doi.org\/10.5555\/2789272.2886795","journal-title":"Journal of Machine Learning Research"},{"key":"9748_CR36","doi-asserted-by":"publisher","unstructured":"Moore, R. E., Kearfott, R. B., & Cloud, M. J. (2009). Introduction to Interval Analysis. SIAM, Philadelphia. https:\/\/doi.org\/10.1137\/1.9780898717716","DOI":"10.1137\/1.9780898717716"},{"key":"9748_CR37","doi-asserted-by":"publisher","first-page":"237","DOI":"10.1007\/978-3-030-26748-3_14","volume-title":"Modern Methods in Operator Theory and Harmonic Analysis","author":"DB Rokhlin","year":"2019","unstructured":"Rokhlin, D. B. (2019). Robbins-monro conditions for persistent exploration learning strategies. In A. Karapetyants, V. Kravchenko, & E. Liflyand (Eds.), Modern Methods in Operator Theory and Harmonic Analysis (pp. 237\u2013247). Cham: Springer."},{"issue":"301","key":"9748_CR38","doi-asserted-by":"publisher","first-page":"13","DOI":"10.1080\/01621459.1963.10500830","volume":"58","author":"W Hoeffding","year":"1963","unstructured":"Hoeffding, W. (1963). Probability inequalities for sums of bounded random variables. 
Journal of the American Statistical Association, 58(301), 13\u201330.","journal-title":"Journal of the American Statistical Association"},{"key":"9748_CR39","doi-asserted-by":"publisher","first-page":"2413","DOI":"10.5555\/1577069.1755867","volume":"10","author":"AL Strehl","year":"2009","unstructured":"Strehl, A. L., Li, L., & Littman, M. L. (2009). Reinforcement learning in finite MDPs: PAC analysis. Journal of Machine Learning Research, 10, 2413\u20132444. https:\/\/doi.org\/10.5555\/1577069.1755867","journal-title":"Journal of Machine Learning Research"},{"key":"9748_CR40","volume-title":"Probability and Computing: Randomization and Probabilistic Techniques in Algorithms and Data Analysis","author":"M Mitzenmacher","year":"2017","unstructured":"Mitzenmacher, M., & Upfal, E. (2017). Probability and Computing: Randomization and Probabilistic Techniques in Algorithms and Data Analysis (2nd ed.). USA: Cambridge University Press.","edition":"2"},{"key":"9748_CR41","first-page":"285","volume":"25","author":"J Ji","year":"2024","unstructured":"Ji, J., Zhou, J., Zhang, B., Dai, J., Pan, X., Sun, R., Huang, W., Geng, Y., Liu, M., & Yang, Y. (2024). Omnisafe: An infrastructure for accelerating safe reinforcement learning research. Journal of Machine Learning Research, 25, 285\u201312856.","journal-title":"Journal of Machine Learning Research"},{"key":"9748_CR42","doi-asserted-by":"publisher","unstructured":"Hasselt, H., Guez, A., & Silver, D. (2016). Deep reinforcement learning with double Q-learning. In Proceedings of the Thirtieth AAAI conference on artificial intelligence (pp. 2094\u20132100). https:\/\/doi.org\/10.1609\/AAAI.V30I1.10295","DOI":"10.1609\/AAAI.V30I1.10295"},{"key":"9748_CR43","doi-asserted-by":"publisher","first-page":"18964","DOI":"10.52202\/075280-0831","volume":"36","author":"J Ji","year":"2023","unstructured":"Ji, J., Zhang, B., Zhou, J., Pan, X., Huang, W., Sun, R., Geng, Y., Zhong, Y., Dai, J., & Yang, Y. (2023). 
Safety gymnasium: A unified safe reinforcement learning benchmark. Advances in Neural Information Processing Systems, 36, 18964\u201318993.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"9748_CR44","doi-asserted-by":"publisher","first-page":"100092","DOI":"10.1016\/j.egyai.2021.100092","volume":"5","author":"R Henry","year":"2021","unstructured":"Henry, R., & Ernst, D. (2021). Gym-ANM: Reinforcement learning environments for active network management tasks in electricity distribution systems. Energy and AI, 5, 100092. https:\/\/doi.org\/10.1016\/j.egyai.2021.100092","journal-title":"Energy and AI"},{"key":"9748_CR45","doi-asserted-by":"crossref","unstructured":"Castellini, A., Marchesini, E., Mazzi, G., & Farinelli, A. (2020). Explaining the influence of prior knowledge on POMCP policies. In Multi-Agent Systems and Agreement Technologies (pp. 261\u2013276).","DOI":"10.1007\/978-3-030-66412-1_17"},{"key":"9748_CR46","doi-asserted-by":"publisher","unstructured":"Marchesini, E., & Farinelli, A. (2021). Centralizing state-values in dueling networks for multi-robot reinforcement learning mapless navigation. In IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS) (pp. 4583\u20134588). https:\/\/doi.org\/10.1109\/IROS51168.2021.9636349","DOI":"10.1109\/IROS51168.2021.9636349"},{"key":"9748_CR47","unstructured":"Marchesini, E., Donnot, B., Crozier, C., Dytham, I., Merz, C., Schewe, L., Westerbeck, N., Wu, C., Marot, A., & Donti, P. L. (2025). RL2Grid: Benchmarking reinforcement learning in power grid operations. arXiv preprint arXiv:2503.23101"},{"key":"9748_CR48","unstructured":"Marchesini, E., Boguslawski, E., Leite, A., Amato, C., Dussartre, M., Schoenauer, M., Donnot, B., & Donti, P. L. (2026). MARL2Grid-TR: A multi-agent RL benchmark in power grid operations. In The fourteenth international conference on learning representations. 
https:\/\/openreview.net\/forum?id=mpAMH1OyMO"},{"key":"9748_CR49","first-page":"274","volume":"23","author":"S Huang","year":"2022","unstructured":"Huang, S., Dossa, R. F. J., Ye, C., Braga, J., Chakraborty, D., Mehta, K., & Ara\u00fajo, J. G. M. (2022). CleanRL: High-quality single-file implementations of deep reinforcement learning algorithms. Journal of Machine Learning Research, 23, 274\u2013127418.","journal-title":"Journal of Machine Learning Research"}],"container-title":["Autonomous Agents and Multi-Agent Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10458-026-09748-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10458-026-09748-6","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10458-026-09748-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,7,4]],"date-time":"2026-07-04T03:08:05Z","timestamp":1783134485000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10458-026-09748-6"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,5,4]]},"references-count":49,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2026,6]]}},"alternative-id":["9748"],"URL":"https:\/\/doi.org\/10.1007\/s10458-026-09748-6","relation":{},"ISSN":["1387-2532","1573-7454"],"issn-type":[{"value":"1387-2532","type":"print"},{"value":"1573-7454","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,5,4]]},"assertion":[{"value":"22 September 2025","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"16 April 
2026","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"4 May 2026","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflicts of Interest"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"Not applicable.","order":5,"name":"Ethics","group":{"name":"EthicsHeading","label":"Materials availability"}},{"value":"Code will be released in a public repository upon acceptance of the manuscript.","order":6,"name":"Ethics","group":{"name":"EthicsHeading","label":"Code availability"}},{"value":"The authors declare no competing interests.","order":7,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"24"}}