{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:09:17Z","timestamp":1750219757569,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":23,"publisher":"ACM","license":[{"start":{"date-parts":[[2023,10,9]],"date-time":"2023-10-09T00:00:00Z","timestamp":1696809600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc-nd\/4.0\/"}],"funder":[{"name":"JST Moonshot R&D","award":["JPMJPS2011"],"award-info":[{"award-number":["JPMJPS2011"]}]},{"name":"CREST","award":["JPMJCR2015"],"award-info":[{"award-number":["JPMJCR2015"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,10,9]]},"DOI":"10.1145\/3577190.3614175","type":"proceedings-article","created":{"date-parts":[[2023,10,7]],"date-time":"2023-10-07T22:30:48Z","timestamp":1696717848000},"page":"292-300","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Frame-Level Event Representation Learning for Semantic-Level Generation and Editing of Avatar Motion"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0009-0000-0973-9993","authenticated-orcid":false,"given":"Ayaka","family":"Ideno","sequence":"first","affiliation":[{"name":"The University of Tokyo, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0000-8016-5144","authenticated-orcid":false,"given":"Takuhiro","family":"Kaneko","sequence":"additional","affiliation":[{"name":"NTT Corporation, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3712-3691","authenticated-orcid":false,"given":"Tatsuya","family":"Harada","sequence":"additional","affiliation":[{"name":"The University of Tokyo, Japan and RIKEN, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2023,10,9]]},"reference":[{"key":"e_1_3_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2018.8460608"},{"key":"e_1_3_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/3DV57658.2022.00053"},{"key":"e_1_3_2_2_3_1","volume-title":"Proceedings of Graphics Interface","author":"Barbi\u010d Jernej","year":"2004","unstructured":"Jernej Barbi\u010d , Alla Safonova , Jia-Yu Pan , Christos Faloutsos , Jessica\u00a0 K. Hodgins , and Nancy\u00a0 S. Pollard . 2004 . Segmenting Motion Capture Data into Distinct Behaviors . In Proceedings of Graphics Interface 2004. 185\u2013194. Jernej Barbi\u010d, Alla Safonova, Jia-Yu Pan, Christos Faloutsos, Jessica\u00a0K. Hodgins, and Nancy\u00a0S. Pollard. 2004. Segmenting Motion Capture Data into Distinct Behaviors. In Proceedings of Graphics Interface 2004. 185\u2013194."},{"key":"e_1_3_2_2_4_1","unstructured":"Silvia Chiappa and Jan Peters. 2010. Movement extraction by detecting dynamics switches and repetitions. In Advances in Neural Information Processing Systems Vol.\u00a023. https:\/\/proceedings.neurips.cc\/paper\/2010\/file\/704afe073992cbe4813cae2f7715336f-Paper.pdf  Silvia Chiappa and Jan Peters. 2010. Movement extraction by detecting dynamics switches and repetitions. In Advances in Neural Information Processing Systems Vol.\u00a023. https:\/\/proceedings.neurips.cc\/paper\/2010\/file\/704afe073992cbe4813cae2f7715336f-Paper.pdf"},{"key":"e_1_3_2_2_5_1","volume-title":"Proceedings of the 34th International Conference on Machine Learning -","volume":"903","author":"Cuturi Marco","year":"2017","unstructured":"Marco Cuturi and Mathieu Blondel . 2017 . Soft-DTW: A Differentiable Loss Function for Time-Series . In Proceedings of the 34th International Conference on Machine Learning - Volume 70(ICML\u201917). 894\u2013 903 . Marco Cuturi and Mathieu Blondel. 2017. Soft-DTW: A Differentiable Loss Function for Time-Series. In Proceedings of the 34th International Conference on Machine Learning - Volume 70(ICML\u201917). 894\u2013903."},{"key":"e_1_3_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.00143"},{"key":"e_1_3_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.00509"},{"key":"e_1_3_2_2_8_1","volume-title":"Proceedings, Part XXXV. 580\u2013597","author":"Guo Chuan","year":"2022","unstructured":"Chuan Guo , Xinxin Zuo , Sen Wang , and Li Cheng . 2022 . TM2T: Stochastic and Tokenized Modeling for the Reciprocal Generation of 3D Human Motions and Texts. In Computer Vision \u2013 ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23\u201327, 2022 , Proceedings, Part XXXV. 580\u2013597 . https:\/\/doi.org\/10.1007\/978-3-031-19833-5_34 10.1007\/978-3-031-19833-5_34 Chuan Guo, Xinxin Zuo, Sen Wang, and Li Cheng. 2022. TM2T: Stochastic and Tokenized Modeling for the Reciprocal Generation of 3D Human Motions and Texts. In Computer Vision \u2013 ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23\u201327, 2022, Proceedings, Part XXXV. 580\u2013597. https:\/\/doi.org\/10.1007\/978-3-031-19833-5_34"},{"key":"e_1_3_2_2_9_1","volume-title":"Long Short-Term Memory. Neural Computation 9, 8 (11","author":"Hochreiter Sepp","year":"1997","unstructured":"Sepp Hochreiter and J\u00fcrgen Schmidhuber . 1997. Long Short-Term Memory. Neural Computation 9, 8 (11 1997 ), 1735\u20131780. https:\/\/doi.org\/10.1162\/neco.1997.9.8.1735 10.1162\/neco.1997.9.8.1735 Sepp Hochreiter and J\u00fcrgen Schmidhuber. 1997. Long Short-Term Memory. Neural Computation 9, 8 (11 1997), 1735\u20131780. https:\/\/doi.org\/10.1162\/neco.1997.9.8.1735"},{"key":"e_1_3_2_2_10_1","volume-title":"A deep learning framework for character motion synthesis and editing. ACM Transactions on Graphics 35 (07","author":"Holden Daniel","year":"2016","unstructured":"Daniel Holden , Jun Saito , and Taku Komura . 2016. A deep learning framework for character motion synthesis and editing. ACM Transactions on Graphics 35 (07 2016 ), 1\u201311. https:\/\/doi.org\/10.1145\/2897824.2925975 10.1145\/2897824.2925975 Daniel Holden, Jun Saito, and Taku Komura. 2016. A deep learning framework for character motion synthesis and editing. ACM Transactions on Graphics 35 (07 2016), 1\u201311. https:\/\/doi.org\/10.1145\/2897824.2925975"},{"key":"e_1_3_2_2_11_1","volume-title":"Recognition and Abstraction of Humanoid Motions Based on Correlations and Associative Memory. In 2006 6th IEEE-RAS International Conference on Humanoid Robots. 1\u20136. https:\/\/doi.org\/10","author":"Kadone Hideki","year":"2006","unstructured":"Hideki Kadone and Yoshihiko Nakamura . 2006 . Segmentation, Memorization , Recognition and Abstraction of Humanoid Motions Based on Correlations and Associative Memory. In 2006 6th IEEE-RAS International Conference on Humanoid Robots. 1\u20136. https:\/\/doi.org\/10 .1109\/ICHR.2006.321355 10.1109\/ICHR.2006.321355 Hideki Kadone and Yoshihiko Nakamura. 2006. Segmentation, Memorization, Recognition and Abstraction of Humanoid Motions Based on Correlations and Associative Memory. In 2006 6th IEEE-RAS International Conference on Humanoid Robots. 1\u20136. https:\/\/doi.org\/10.1109\/ICHR.2006.321355"},{"key":"e_1_3_2_2_12_1","volume-title":"Kingma and Jimmy Ba","author":"P.","year":"2015","unstructured":"Diederik\u00a0 P. Kingma and Jimmy Ba . 2015 . Adam : A Method for Stochastic Optimization. In 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7-9, 2015, Conference Track Proceedings . http:\/\/arxiv.org\/abs\/1412.6980 Diederik\u00a0P. Kingma and Jimmy Ba. 2015. Adam: A Method for Stochastic Optimization. In 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7-9, 2015, Conference Track Proceedings. http:\/\/arxiv.org\/abs\/1412.6980"},{"volume-title":"Auto-Encoding Variational Bayes. In 2nd International Conference on Learning Representations, ICLR 2014, Banff, AB, Canada, April 14-16, 2014, Conference Track Proceedings. http:\/\/arxiv.org\/abs\/1312","author":"P.","key":"e_1_3_2_2_13_1","unstructured":"Diederik\u00a0 P. Kingma and Max Welling. 2014 . Auto-Encoding Variational Bayes. In 2nd International Conference on Learning Representations, ICLR 2014, Banff, AB, Canada, April 14-16, 2014, Conference Track Proceedings. http:\/\/arxiv.org\/abs\/1312 .6114 Diederik\u00a0P. Kingma and Max Welling. 2014. Auto-Encoding Variational Bayes. In 2nd International Conference on Learning Representations, ICLR 2014, Banff, AB, Canada, April 14-16, 2014, Conference Track Proceedings. http:\/\/arxiv.org\/abs\/1312.6114"},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/566570.566604"},{"key":"e_1_3_2_2_15_1","volume-title":"Proceedings of the Visually Grounded Interaction and Language Workshop at NeurIPS. http:\/\/www.cs.utexas.edu\/users\/ai-labpub-view.php?PubID=127730","author":"Lin S.","year":"2018","unstructured":"Angela\u00a0 S. Lin , Wu Lemeng , Corona Rodolfo , Tai Kevin , Huang Qixing , and Raymond\u00a0 J. Mooney . 2018 . Generating Animated Videos of Human Activities from Natural Language Descriptions . In Proceedings of the Visually Grounded Interaction and Language Workshop at NeurIPS. http:\/\/www.cs.utexas.edu\/users\/ai-labpub-view.php?PubID=127730 Angela\u00a0S. Lin, Wu Lemeng, Corona Rodolfo, Tai Kevin, Huang Qixing, and Raymond\u00a0J. Mooney. 2018. Generating Animated Videos of Human Activities from Natural Language Descriptions. In Proceedings of the Visually Grounded Interaction and Language Workshop at NeurIPS. http:\/\/www.cs.utexas.edu\/users\/ai-labpub-view.php?PubID=127730"},{"key":"e_1_3_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.00798"},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/D14-1162"},{"key":"e_1_3_2_2_18_1","volume-title":"Proceedings, Part XXII. 480\u2013497","author":"Petrovich Mathis","year":"2022","unstructured":"Mathis Petrovich , Michael\u00a0 J. Black , and G\u00fcl Varol . 2022 . TEMOS: Generating Diverse Human Motions from Textual Descriptions. In Computer Vision \u2013 ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23\u201327, 2022 , Proceedings, Part XXII. 480\u2013497 . https:\/\/doi.org\/10.1007\/978-3-031-20047-2_28 10.1007\/978-3-031-20047-2_28 Mathis Petrovich, Michael\u00a0J. Black, and G\u00fcl Varol. 2022. TEMOS: Generating Diverse Human Motions from Textual Descriptions. In Computer Vision \u2013 ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23\u201327, 2022, Proceedings, Part XXII. 480\u2013497. https:\/\/doi.org\/10.1007\/978-3-031-20047-2_28"},{"key":"e_1_3_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.robot.2018.07.006"},{"key":"e_1_3_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/SII.2011.6147455"},{"key":"e_1_3_2_2_21_1","unstructured":"Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan\u00a0N Gomez \u0141\u00a0ukasz Kaiser and Illia Polosukhin. 2017. Attention is All you Need. In Advances in Neural Information Processing Systems Vol.\u00a030. https:\/\/proceedings.neurips.cc\/paper\/2017\/file\/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf  Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan\u00a0N Gomez \u0141\u00a0ukasz Kaiser and Illia Polosukhin. 2017. Attention is All you Need. In Advances in Neural Information Processing Systems Vol.\u00a030. https:\/\/proceedings.neurips.cc\/paper\/2017\/file\/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf"},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/UR49135.2020.9144985"},{"key":"e_1_3_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2018.2852838"}],"event":{"name":"ICMI '23: INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION","sponsor":["SIGCHI ACM Special Interest Group on Computer-Human Interaction"],"location":"Paris France","acronym":"ICMI '23"},"container-title":["INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3577190.3614175","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3577190.3614175","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:37:02Z","timestamp":1750178222000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3577190.3614175"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,10,9]]},"references-count":23,"alternative-id":["10.1145\/3577190.3614175","10.1145\/3577190"],"URL":"https:\/\/doi.org\/10.1145\/3577190.3614175","relation":{},"subject":[],"published":{"date-parts":[[2023,10,9]]},"assertion":[{"value":"2023-10-09","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}