{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,13]],"date-time":"2026-04-13T12:20:07Z","timestamp":1776082807207,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":26,"publisher":"ACM","funder":[{"name":"The authors acknowledge the financial support by the Federal Ministry of Research, Technology and Space of Germany and by S\u00e4chsische Staatsministerium f\u00fcr Wissenschaft, Kultur und Tourismus in the programme Center of Excellence for AI-research \u201cCenter for Scalable Data Analytics and Artificial Intelligence Dresden\/Leipzig\u201d, project identification number: ScaDS.AI","award":["ScaDS.AI"],"award-info":[{"award-number":["ScaDS.AI"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2025,9,28]]},"DOI":"10.1145\/3746058.3758385","type":"proceedings-article","created":{"date-parts":[[2025,9,27]],"date-time":"2025-09-27T14:35:03Z","timestamp":1758983703000},"page":"1-3","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["Increasing Interaction Fidelity: Training Routines for Biomechanical Models in HCI"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0009-0003-5579-3036","authenticated-orcid":false,"given":"Micha\u0142 Patryk","family":"Miazga","sequence":"first","affiliation":[{"name":"ScaDS.AI, Leipzig University, Leipzig, Germany"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4437-2821","authenticated-orcid":false,"given":"Patrick","family":"Ebel","sequence":"additional","affiliation":[{"name":"ScaDS.AI, Leipzig University, Leipzig, Germany"}]}],"member":"320","published-online":{"date-parts":[[2025,9,27]]},"reference":[{"key":"e_1_3_3_2_2_2","unstructured":"Alex Braylan Mark Hollenbeck Elliot Meyerson and Risto Miikkulainen. 2000. Frame skip is a powerful parameter for learning to play atari. 
Space 1600 (2000) 1800."},{"key":"e_1_3_3_2_3_2","unstructured":"Vittorio Caggiano Huawei Wang Guillaume Durandau Massimo Sartori and Vikash Kumar. 2022. MyoSuite\u2013A contact-rich simulation suite for musculoskeletal motor control. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2205.13600 (2022)."},{"key":"e_1_3_3_2_4_2","doi-asserted-by":"crossref","unstructured":"Wenqian Chen Yaru Chen Yongxuan Wang and Rong Liu. 2023. Static standing balance with musculoskeletal models using ppo with reward shaping. Procedia Computer Science 226 (2023) 78\u201384.","DOI":"10.1016\/j.procs.2023.10.639"},{"key":"e_1_3_3_2_5_2","volume-title":"AAAI spring symposia","author":"Dewey Daniel","year":"2014","unstructured":"Daniel Dewey. 2014. Reinforcement Learning and the Reward Engineering Principle.. In AAAI spring symposia."},{"key":"e_1_3_3_2_6_2","doi-asserted-by":"crossref","unstructured":"Jonas Eschmann. 2021. Reward function design in reinforcement learning. Reinforcement learning algorithms: Analysis and Applications (2021) 25\u201333.","DOI":"10.1007\/978-3-030-41188-6_3"},{"key":"e_1_3_3_2_7_2","doi-asserted-by":"crossref","unstructured":"Florian Fischer Miroslav Bachinski Markus Klar Arthur Fleig and J\u00f6rg M\u00fcller. 2021. Reinforcement learning control of a biomechanical model of the upper extremity. Scientific Reports 11 1 (2021) 14445.","DOI":"10.1038\/s41598-021-93760-1"},{"key":"e_1_3_3_2_8_2","doi-asserted-by":"publisher","DOI":"10.1145\/3654777.3676452"},{"key":"e_1_3_3_2_9_2","unstructured":"Marek Grzes. 2017. Reward shaping in episodic reinforcement learning. (2017)."},{"key":"e_1_3_3_2_10_2","unstructured":"Yujing Hu Weixun Wang Hangtian Jia Yixiang Wang Yingfeng Chen Jianye Hao Feng Wu and Changjie Fan. 2020. Learning to utilize shaping rewards: A new approach of reward shaping. Advances in Neural Information Processing Systems 33 (2020) 15931\u201315941."},{"key":"e_1_3_3_2_11_2","unstructured":"Shengyi Huang and Santiago Onta\u00f1\u00f3n. 2020. 
A closer look at invalid action masking in policy gradient algorithms. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2006.14171 (2020)."},{"key":"e_1_3_3_2_12_2","doi-asserted-by":"publisher","DOI":"10.1145\/3526113.3545689"},{"key":"e_1_3_3_2_13_2","first-page":"277","volume-title":"International Conference on NeuroRehabilitation","author":"Ikkala Aleksi","year":"2020","unstructured":"Aleksi Ikkala and Perttu H\u00e4m\u00e4l\u00e4inen. 2020. Converting biomechanical models from opensim to mujoco. In International Conference on NeuroRehabilitation. Springer, 277\u2013281."},{"key":"e_1_3_3_2_14_2","doi-asserted-by":"publisher","DOI":"10.1145\/3640792.3675735"},{"key":"e_1_3_3_2_15_2","doi-asserted-by":"publisher","DOI":"10.1109\/IJCNN48605.2020.9207427"},{"key":"e_1_3_3_2_16_2","unstructured":"Micha\u0142\u00a0Patryk Miazga Daniel Abitz Matthias T\u00e4schner and Erhard Rahm. 2025. Automated Configuration of Schema Matching Tools: A Reinforcement Learning Approach. Datenbanksysteme f\u00fcr Business Technologie und Web (BTW 2025) (2025) 331."},{"key":"e_1_3_3_2_17_2","doi-asserted-by":"publisher","unstructured":"Roderick Murray-Smith Antti Oulasvirta Andrew Howes J\u00f6rg M\u00fcller Aleksi Ikkala Miroslav Bachinski Arthur Fleig Florian Fischer and Markus Klar. 2022. What simulation can do for HCI research. Interactions 29 6 (Nov. 2022) 48\u201353. 10.1145\/3564038","DOI":"10.1145\/3564038"},{"key":"e_1_3_3_2_18_2","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2017\/757"},{"key":"e_1_3_3_2_19_2","unstructured":"Sanmit Narvekar Bei Peng Matteo Leonetti Jivko Sinapov Matthew\u00a0E Taylor and Peter Stone. 2020. Curriculum learning for reinforcement learning domains: A framework and survey. 
Journal of Machine Learning Research 21 181 (2020) 1\u201350."},{"key":"e_1_3_3_2_20_2","doi-asserted-by":"crossref","unstructured":"Katharine Nowakowski Philippe Carvalho Jean-Baptiste Six Yann Maillet Anh\u00a0Tu Nguyen Ismail Seghiri Loick M\u2019pemba Theo Marcille Sy\u00a0Toan Ngo and Tien-Tuan Dao. 2021. Human locomotion with reinforcement learning using bioinspired reward reshaping strategies. Medical & biological engineering & computing 59 (2021) 243\u2013256.","DOI":"10.1007\/s11517-020-02309-3"},{"key":"e_1_3_3_2_21_2","unstructured":"John Schulman Filip Wolski Prafulla Dhariwal Alec Radford and Oleg Klimov. 2017. Proximal policy optimization algorithms. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/1707.06347 (2017)."},{"key":"e_1_3_3_2_22_2","doi-asserted-by":"publisher","DOI":"10.1145\/3706599.3719699"},{"key":"e_1_3_3_2_23_2","unstructured":"Roland Stolz Hanna Krasowski Jakob Thumm Michael Eichelbeck Philipp Gassert and Matthias Althoff. 2024. Excluding the irrelevant: Focusing reinforcement learning through continuous action masking. Advances in Neural Information Processing Systems 37 (2024) 95067\u201395094."},{"key":"e_1_3_3_2_24_2","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2012.6386109"},{"key":"e_1_3_3_2_25_2","doi-asserted-by":"publisher","unstructured":"Peter Vamplew Benjamin\u00a0J Smith Johan K\u00e4llstr\u00f6m Gabriel Ramos Roxana R\u0103dulescu Diederik\u00a0M Roijers Conor\u00a0F Hayes Fredrik Heintz Patrick Mannion Pieter\u00a0JK Libin and others. 2022. Scalar reward is not enough: A response to Silver Singh Precup and Sutton (2021). Autonomous Agents and Multi-Agent Systems 36 2 (2022) 41. 
10.1007\/s10458-022-09575-5. Publisher: Springer.","DOI":"10.1007\/s10458-022-09575-5"},{"key":"e_1_3_3_2_26_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA46639.2022.9811684"},{"key":"e_1_3_3_2_27_2","doi-asserted-by":"publisher","DOI":"10.1145\/1357054.1357306"}],"event":{"name":"UIST '25: The 38th Annual ACM Symposium on User Interface Software and Technology","location":"Busan, Republic of Korea","acronym":"UIST Adjunct '25","sponsor":["SIGCHI ACM Special Interest Group on Computer-Human Interaction","SIGGRAPH ACM Special Interest Group on Computer Graphics and Interactive Techniques"]},"container-title":["Adjunct Proceedings of the 38th Annual ACM Symposium on User Interface Software and Technology"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3746058.3758385","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,6]],"date-time":"2025-10-06T10:03:34Z","timestamp":1759745014000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3746058.3758385"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,9,27]]},"references-count":26,"alternative-id":["10.1145\/3746058.3758385","10.1145\/3746058"],"URL":"https:\/\/doi.org\/10.1145\/3746058.3758385","relation":{},"subject":[],"published":{"date-parts":[[2025,9,27]]},"assertion":[{"value":"2025-09-27","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}