{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,1]],"date-time":"2025-10-01T15:59:17Z","timestamp":1759334357577,"version":"build-2065373602"},"publisher-location":"New York, NY, USA","reference-count":44,"publisher":"ACM","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2025,9,16]]},"DOI":"10.1145\/3742886.3756746","type":"proceedings-article","created":{"date-parts":[[2025,9,30]],"date-time":"2025-09-30T14:55:41Z","timestamp":1759244141000},"page":"1-10","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Towards an AI-based Sign Language Video Editing Interface"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0009-0001-5396-1206","authenticated-orcid":false,"given":"Hossein","family":"Ranjbar","sequence":"first","affiliation":[{"name":"Department of Computational Linguistics and Phonetics, University of Zurich, Zurich, Switzerland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0002-0175-1263","authenticated-orcid":false,"given":"Lisa","family":"Arter","sequence":"additional","affiliation":[{"name":"Department of Computational Linguistics and Phonetics, University of Zurich, Zurich, Switzerland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0003-2932-229X","authenticated-orcid":false,"given":"Laura","family":"Setz","sequence":"additional","affiliation":[{"name":"Department of Computational Linguistics and Phonetics, University of Zurich, Zurich, Switzerland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1696-6921","authenticated-orcid":false,"given":"Alessia","family":"Battisti","sequence":"additional","affiliation":[{"name":"Department of Computational Linguistics and Phonetics, University of Zurich, Zurich, Switzerland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6511-5085","authenticated-orcid":false,"given":"Sarah","family":"Ebling","sequence":"additional","affiliation":[{"name":"Department of Computational Linguistics and Phonetics, University of Zurich, Zurich, Switzerland"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2025,9,30]]},"reference":[{"key":"e_1_3_3_3_2_2","doi-asserted-by":"crossref","unstructured":"Dhruv Agrawal Jakob Buhmann Dominik Borer Robert\u00a0W Sumner and Martin Guay. 2024. SKEL-Betweener: a Neural Motion Rig for Interactive Motion Authoring. ACM Transactions on Graphics (TOG) 43 6 (2024) 1\u201311.","DOI":"10.1145\/3687941"},{"key":"e_1_3_3_3_3_2","unstructured":"Andreas Blattmann Tim Dockhorn Sumith Kulal Daniel Mendelevitch Maciej Kilian Dominik Lorenz Yam Levi Zion English Vikram Voleti Adam Letts et\u00a0al. 2023. Stable video diffusion: Scaling latent video diffusion models to large datasets. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2311.15127 (2023)."},{"key":"e_1_3_3_3_4_2","unstructured":"John Brooke. 1995. SUS: A quick and dirty usability scale. Usability Eval. Ind. 189 (11 1995)."},{"key":"e_1_3_3_3_5_2","unstructured":"Zhe Cao Gines Hidalgo Tomas Simon Shih-En Wei and Yaser Sheikh. 2019. OpenPose: Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields. arxiv:https:\/\/arXiv.org\/abs\/1812.08008\u00a0[cs.CV] https:\/\/arxiv.org\/abs\/1812.08008"},{"key":"e_1_3_3_3_6_2","unstructured":"Di Chang Yichun Shi Quankai Gao Jessica Fu Hongyi Xu Guoxian Song Qing Yan Yizhe Zhu Xiao Yang and Mohammad Soleymani. 2023. Magicpose: Realistic human poses and facial expressions retargeting with identity-aware diffusion. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2311.12052 (2023)."},{"key":"e_1_3_3_3_7_2","first-page":"183","volume-title":"European Conference on Computer Vision","author":"Deng Yufan","year":"2024","unstructured":"Yufan Deng, Ruida Wang, Yuhao Zhang, Yu-Wing Tai, and Chi-Keung Tang. 2024. Dragvideo: Interactive drag-style video editing. In European Conference on Computer Vision. Springer, 183\u2013199."},{"key":"e_1_3_3_3_8_2","unstructured":"FaceFusion Contributors. 2024. FaceFusion. https:\/\/github.com\/facefusion\/facefusion Accessed: 2024."},{"key":"e_1_3_3_3_9_2","unstructured":"Sen Fang Chunyu Sui Yanghao Zhou Xuedong Zhang Hongbin Zhong Minyu Zhao Yapeng Tian and Chen Chen. 2023. SignDiff: Diffusion Models for American Sign Language Production. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2308.16082 (2023)."},{"key":"e_1_3_3_3_10_2","first-page":"378","volume-title":"European Conference on Computer Vision","author":"Feng Haiwen","year":"2024","unstructured":"Haiwen Feng, Zheng Ding, Zhihao Xia, Simon Niklaus, Victoria Abrevaya, Michael\u00a0J Black, and Xuaner Zhang. 2024. Explorative inbetweening of time and space. In European Conference on Computer Vision. Springer, 378\u2013395."},{"key":"e_1_3_3_3_11_2","unstructured":"Ivan Grishchenko and Valentin Bazarevsky. 2020. Mediapipe holistic\u2014simultaneous face hand and pose prediction on device. Google AI Blog. Dec (2020)."},{"key":"e_1_3_3_3_12_2","unstructured":"Jia Guo and Jiankang Deng. 2022. InsightFace: 2D and 3D Face Analysis Project. https:\/\/github.com\/deepinsight\/insightface."},{"key":"e_1_3_3_3_13_2","doi-asserted-by":"publisher","DOI":"10.1145\/3517428.3544883"},{"key":"e_1_3_3_3_14_2","first-page":"8153","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Hu Li","year":"2024","unstructured":"Li Hu. 2024. Animate anyone: Consistent and controllable image-to-video synthesis for character animation. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 8153\u20138163."},{"key":"e_1_3_3_3_15_2","unstructured":"Li Hu Xin Gao Peng Zhang Ke Sun Bang Zhang and Liefeng Bo. 2023. Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2311.17117 (2023)."},{"key":"e_1_3_3_3_16_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.01256"},{"key":"e_1_3_3_3_17_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00938"},{"key":"e_1_3_3_3_18_2","unstructured":"Zhenyu Jiang Yuqi Xie Jinhan Li Ye Yuan Yifeng Zhu and Yuke Zhu. 2024. Harmon: Whole-body motion generation of humanoid robots from language descriptions. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2410.12773 (2024)."},{"key":"e_1_3_3_3_19_2","first-page":"326","volume-title":"European Conference on Computer Vision","author":"Kim Jeongho","year":"2024","unstructured":"Jeongho Kim, Min-Jung Kim, Junsoo Lee, and Jaegul Choo. 2024. Tcan: Animating human images with temporally consistent pose guidance using diffusion models. In European Conference on Computer Vision. Springer, 326\u2013342."},{"key":"e_1_3_3_3_20_2","doi-asserted-by":"crossref","unstructured":"Kihong Kim Yunho Kim Seokju Cho Junyoung Seo Jisu Nam Kychul Lee Seungryong Kim and KwangHee Lee. 2025. Diffface: Diffusion-based face swapping with facial guidance. Pattern Recognition 163 (2025) 111451.","DOI":"10.1016\/j.patcog.2025.111451"},{"key":"e_1_3_3_3_21_2","first-page":"1","volume-title":"Innovations in Deaf Studies","author":"Kusters Annelies Maria\u00a0Jozef","year":"2017","unstructured":"Annelies Maria\u00a0Jozef Kusters, Dai O\u2019Brien, and Maartje De\u00a0Meulder. 2017. Innovations in Deaf Studies: Critically Mapping the Field. In Innovations in Deaf Studies, Annelies Kusters, Maartje De\u00a0Meulder, and Dai O\u2019Brien (Eds.). Oxford University Press, United Kingdom, 1\u201353."},{"key":"e_1_3_3_3_22_2","unstructured":"Eyal Molad Eliahu Horwitz Dani Valevski Alex\u00a0Rav Acha Yossi Matias Yael Pritch Yaniv Leviathan and Yedid Hoshen. 2023. Dreamix: Video diffusion models are general video editors. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2302.01329 (2023)."},{"key":"e_1_3_3_3_23_2","unstructured":"Chong Mou Mingdeng Cao Xintao Wang Zhaoyang Zhang Ying Shan and Jian Zhang. 2024. Revideo: Remake a video with motion and content control. Advances in Neural Information Processing Systems 37 (2024) 18481\u201318505."},{"key":"e_1_3_3_3_24_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.37"},{"key":"e_1_3_3_3_25_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00728"},{"key":"e_1_3_3_3_26_2","unstructured":"Bohao Peng Jian Wang Yuechen Zhang Wenbo Li Ming-Chang Yang and Jiaya Jia. 2024. ControlNeXt: Powerful and Efficient Control for Image and Video Generation. arxiv:https:\/\/arXiv.org\/abs\/2408.06070\u00a0[cs.CV] https:\/\/arxiv.org\/abs\/2408.06070"},{"key":"e_1_3_3_3_27_2","doi-asserted-by":"crossref","unstructured":"Yichen Peng Chunqi Zhao Haoran Xie Tsukasa Fukusato Kazunori Miyata and Takeo Igarashi. 2023. Dualmotion: Global-to-local casual motion design for character animations. IEICE TRANSACTIONS on Information and Systems 106 4 (2023) 459\u2013468.","DOI":"10.1587\/transinf.2022IIP0011"},{"key":"e_1_3_3_3_28_2","doi-asserted-by":"publisher","DOI":"10.1109\/FG57933.2023.10042505"},{"key":"e_1_3_3_3_29_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.01344"},{"key":"e_1_3_3_3_30_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52733.2024.00753"},{"key":"e_1_3_3_3_31_2","first-page":"1","volume-title":"2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)","author":"Uchida Tsubasa","year":"2023","unstructured":"Tsubasa Uchida, Naoki Nakatani, Taro Miyazaki, Hiroyuki Kaneko, and Masanori Sano. 2023. Motion Editing Tool for Reproducing Grammatical Elements of Japanese Sign Language Avatar Animation. In 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW). IEEE, 1\u20135."},{"key":"e_1_3_3_3_32_2","unstructured":"Harry Walsh Ben Saunders and Richard Bowden. 2024. Sign Stitching: A Novel Approach to Sign Language Production. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2405.07663 (2024)."},{"key":"e_1_3_3_3_33_2","unstructured":"Qilin Wang Zhengkai Jiang Chengming Xu Jiangning Zhang Yabiao Wang Xinyi Zhang Yun Cao Weijian Cao Chengjie Wang and Yanwei Fu. 2024. Vividpose: Advancing stable video diffusion for realistic human image animation. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2405.18156 (2024)."},{"key":"e_1_3_3_3_34_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52733.2024.00891"},{"key":"e_1_3_3_3_35_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.00905"},{"key":"e_1_3_3_3_36_2","unstructured":"Xiaojuan Wang Boyang Zhou Brian Curless Ira Kemelmacher-Shlizerman Aleksander Holynski and Steven\u00a0M Seitz. 2024. Generative inbetweening: Adapting image-to-video models for keyframe interpolation. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2408.15239 (2024)."},{"key":"e_1_3_3_3_37_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52733.2024.00147"},{"key":"e_1_3_3_3_38_2","unstructured":"Haiwei Xue Xiangyang Luo Zhanghao Hu Xin Zhang Xunzhi Xiang Yuqin Dai Jianzhuang Liu Zhensong Zhang Minglei Li Jian Yang et\u00a0al. 2024. Human motion video generation: A survey. Authorea Preprints (2024)."},{"key":"e_1_3_3_3_39_2","unstructured":"Jingyun Xue Hongfa Wang Qi Tian Yue Ma Andong Wang Zhiyuan Zhao Shaobo Min Wenzhe Zhao Kaihao Zhang Heung-Yeung Shum et\u00a0al. 2024. Follow-Your-Pose v2: Multiple-Condition Guided Character Image Animation for Stable Pose Control. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2406.03035 (2024)."},{"key":"e_1_3_3_3_40_2","unstructured":"Zhendong Yang Ailing Zeng Chun Yuan and Yu Li. 2023. Effective Whole-body Pose Estimation with Two-stages Distillation. arxiv:https:\/\/arXiv.org\/abs\/2307.15880\u00a0[cs.CV] https:\/\/arxiv.org\/abs\/2307.15880"},{"key":"e_1_3_3_3_41_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV51070.2023.00355"},{"key":"e_1_3_3_3_42_2","doi-asserted-by":"publisher","DOI":"10.1145\/3613904.3641927"},{"key":"e_1_3_3_3_43_2","unstructured":"Bingwen Zhu Fanyi Wang Tianyi Lu Peng Liu Jingwen Su Jinxiu Liu Yanhao Zhang Zuxuan Wu Yu-Gang Jiang and Guo-Jun Qi. 2024. PoseAnimate: Zero-shot high fidelity pose controllable character animation. arXiv e-prints (2024) arXiv\u20132404."},{"key":"e_1_3_3_3_44_2","unstructured":"Tianyi Zhu Dongwei Ren Qilong Wang Xiaohe Wu and Wangmeng Zuo. 2024. Generative Inbetweening through Frame-wise Conditions-Driven Video Generation. arXiv preprint arXiv:https:\/\/arXiv.org\/abs\/2412.11755 (2024)."},{"key":"e_1_3_3_3_45_2","unstructured":"Yixuan Zhu Wenliang Zhao Yansong Tang Yongming Rao Jie Zhou and Jiwen Lu. 2024. Stableswap: stable face swapping in a shared and controllable latent space. IEEE Transactions on Multimedia (2024)."}],"event":{"name":"IVA Adjunct '25: ACM International Conference on Intelligent Virtual Agents","sponsor":["SIGAI ACM Special Interest Group on Artificial Intelligence"],"location":"Berlin Germany","acronym":"IVA Adjunct '25"},"container-title":["Adjunct Proceedings of the 25th ACM International Conference on Intelligent Virtual Agents"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3742886.3756746","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,9,30]],"date-time":"2025-09-30T14:59:29Z","timestamp":1759244369000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3742886.3756746"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,9,16]]},"references-count":44,"alternative-id":["10.1145\/3742886.3756746","10.1145\/3742886"],"URL":"https:\/\/doi.org\/10.1145\/3742886.3756746","relation":{},"subject":[],"published":{"date-parts":[[2025,9,16]]},"assertion":[{"value":"2025-09-30","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}