{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,12]],"date-time":"2026-02-12T15:13:27Z","timestamp":1770909207348,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":57,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,11,3]],"date-time":"2022-11-03T00:00:00Z","timestamp":1667433600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000001","name":"NSF (National Science Foundation)","doi-asserted-by":"publisher","award":["IIS-2005430"],"award-info":[{"award-number":["IIS-2005430"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,11,3]]},"DOI":"10.1145\/3561975.3562954","type":"proceedings-article","created":{"date-parts":[[2022,10,11]],"date-time":"2022-10-11T22:10:57Z","timestamp":1665526257000},"page":"1-10","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":4,"title":["S2M-Net: Speech Driven Three-party Conversational Motion Synthesis Networks"],"prefix":"10.1145","author":[{"given":"Aobo","family":"Jin","sequence":"first","affiliation":[{"name":"University of Houston - Victoria, USA"}]},{"given":"Qixin","family":"Deng","sequence":"additional","affiliation":[{"name":"University of Houston, USA"}]},{"given":"Zhigang","family":"Deng","sequence":"additional","affiliation":[{"name":"University of Houston, USA"}]}],"member":"320","published-online":{"date-parts":[[2022,11,3]]},"reference":[{"key":"e_1_3_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/3340555.3353725"},{"key":"e_1_3_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1111\/cgf.13946"},{"key":"e_1_3_2_2_3_1","volume-title":"Rigid head motion in expressive speech animation: Analysis and synthesis","author":"Busso Carlos","year":"2007","unstructured":"Carlos Busso , Zhigang Deng , Michael Grimm , Ulrich Neumann , and Shrikanth Narayanan . 2007. Rigid head motion in expressive speech animation: Analysis and synthesis . IEEE transactions on audio, speech, and language processing 15, 3( 2007 ), 1075\u20131086. Carlos Busso, Zhigang Deng, Michael Grimm, Ulrich Neumann, and Shrikanth Narayanan. 2007. Rigid head motion in expressive speech animation: Analysis and synthesis. IEEE transactions on audio, speech, and language processing 15, 3(2007), 1075\u20131086."},{"key":"e_1_3_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.5555\/1089870.1089884"},{"key":"e_1_3_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/192161.192272"},{"key":"e_1_3_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/383259.383315"},{"key":"e_1_3_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/1061347.1061355"},{"key":"e_1_3_2_2_8_1","volume-title":"Non-Verbal Behavior Generation for Virtual Characters in Group Conversations. In 2019 IEEE International Conference on Artificial Intelligence and Virtual Reality (AIVR). 41\u2013418","author":"de Coninck F.","unstructured":"F. de Coninck , Z. Yumak , G. Sandino , and R. Veltkamp . 2019 . Non-Verbal Behavior Generation for Virtual Characters in Group Conversations. In 2019 IEEE International Conference on Artificial Intelligence and Virtual Reality (AIVR). 41\u2013418 . F. de Coninck, Z. Yumak, G. Sandino, and R. Veltkamp. 2019. Non-Verbal Behavior Generation for Virtual Characters in Group Conversations. In 2019 IEEE International Conference on Artificial Intelligence and Virtual Reality (AIVR). 41\u2013418."},{"key":"e_1_3_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/MCG.2005.35"},{"key":"e_1_3_2_2_10_1","unstructured":"Emily\u00a0L Denton Soumith Chintala Rob Fergus 2015. Deep generative image models using a laplacian pyramid of adversarial networks. In Advances in neural information processing systems. 1486\u20131494.  Emily\u00a0L Denton Soumith Chintala Rob Fergus 2015. Deep generative image models using a laplacian pyramid of adversarial networks. In Advances in neural information processing systems. 1486\u20131494."},{"key":"e_1_3_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/3025453.3025644"},{"key":"e_1_3_2_2_12_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cag.2020.04.007"},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/2388676.2388680"},{"key":"e_1_3_2_2_14_1","volume-title":"Winter semester","author":"Gauthier Jon","year":"2014","unstructured":"Jon Gauthier . 2014. Conditional generative adversarial nets for convolutional face generation. Class Project for Stanford CS231N: Convolutional Neural Networks for Visual Recognition , Winter semester 2014 , 5 (2014), 2. Jon Gauthier. 2014. Conditional generative adversarial nets for convolutional face generation. Class Project for Stanford CS231N: Convolutional Neural Networks for Visual Recognition, Winter semester 2014, 5 (2014), 2."},{"key":"e_1_3_2_2_15_1","doi-asserted-by":"crossref","unstructured":"S. Ginosar A. Bar G. Kohavi C. Chan A. Owens and J. Malik. 2019. Learning Individual Styles of Conversational Gesture. In Computer Vision and Pattern Recognition (CVPR). IEEE.  S. Ginosar A. Bar G. Kohavi C. Chan A. Owens and J. Malik. 2019. Learning Individual Styles of Conversational Gesture. In Computer Vision and Pattern Recognition (CVPR). IEEE.","DOI":"10.1109\/CVPR.2019.00361"},{"key":"e_1_3_2_2_16_1","unstructured":"Ian Goodfellow Jean Pouget-Abadie Mehdi Mirza Bing Xu David Warde-Farley Sherjil Ozair Aaron Courville and Yoshua Bengio. 2014a. Generative adversarial nets. In Advances in neural information processing systems. 2672\u20132680.  Ian Goodfellow Jean Pouget-Abadie Mehdi Mirza Bing Xu David Warde-Farley Sherjil Ozair Aaron Courville and Yoshua Bengio. 2014a. Generative adversarial nets. In Advances in neural information processing systems. 2672\u20132680."},{"key":"e_1_3_2_2_17_1","volume-title":"Advances in Neural Information Processing Systems 27, Z.\u00a0Ghahramani, M.\u00a0Welling, C.\u00a0Cortes, N.\u00a0D.","author":"Goodfellow Ian","unstructured":"Ian Goodfellow , Jean Pouget-Abadie , Mehdi Mirza , Bing Xu , David Warde-Farley , Sherjil Ozair , Aaron Courville , and Yoshua Bengio . 2014b. Generative Adversarial Nets . In Advances in Neural Information Processing Systems 27, Z.\u00a0Ghahramani, M.\u00a0Welling, C.\u00a0Cortes, N.\u00a0D. Lawrence , and K.\u00a0Q. Weinberger (Eds.). Curran Associates, Inc ., 2672\u20132680. Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014b. Generative Adversarial Nets. In Advances in Neural Information Processing Systems 27, Z.\u00a0Ghahramani, M.\u00a0Welling, C.\u00a0Cortes, N.\u00a0D. Lawrence, and K.\u00a0Q. Weinberger (Eds.). Curran Associates, Inc., 2672\u20132680."},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/AFGR.2002.1004186"},{"key":"e_1_3_2_2_19_1","volume-title":"Intelligent Virtual Agents","author":"Gu Erdan","unstructured":"Erdan Gu and Norman Badler . 2006. Visual attention and eye gaze during multiparty conversations with distractions . In Intelligent Virtual Agents . Springer , 193\u2013204. Erdan Gu and Norman Badler. 2006. Visual attention and eye gaze during multiparty conversations with distractions. In Intelligent Virtual Agents. Springer, 193\u2013204."},{"key":"e_1_3_2_2_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.632"},{"key":"e_1_3_2_2_21_1","volume-title":"A Live Speech Driven Avatar-mediated Three-party Telepresence System: Design and Evaluation. PRESENCE: Virtual and Augmented Reality (06","author":"Jin Aobo","year":"2022","unstructured":"Aobo Jin , Qixin Deng , and Zhigang Deng . 2022. A Live Speech Driven Avatar-mediated Three-party Telepresence System: Design and Evaluation. PRESENCE: Virtual and Augmented Reality (06 2022 ), 1\u201343. https:\/\/doi.org\/10.1162\/pres_a_00358 arXiv:https:\/\/direct.mit.edu\/pvar\/article-pdf\/doi\/10.1162\/pres_a_00358\/2031159\/pres_a_00358.pdf 10.1162\/pres_a_00358 Aobo Jin, Qixin Deng, and Zhigang Deng. 2022. A Live Speech Driven Avatar-mediated Three-party Telepresence System: Design and Evaluation. PRESENCE: Virtual and Augmented Reality (06 2022), 1\u201343. https:\/\/doi.org\/10.1162\/pres_a_00358 arXiv:https:\/\/direct.mit.edu\/pvar\/article-pdf\/doi\/10.1162\/pres_a_00358\/2031159\/pres_a_00358.pdf"},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/3340250"},{"key":"e_1_3_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-02675-6_35"},{"key":"e_1_3_2_2_24_1","unstructured":"Levent Karacan Zeynep Akata Aykut Erdem and Erkut Erdem. 2016. Learning to generate images of outdoor scenes from attributes and semantic layouts. arXiv preprint arXiv:1612.00215(2016).  Levent Karacan Zeynep Akata Aykut Erdem and Erkut Erdem. 2016. Learning to generate images of outdoor scenes from attributes and semantic layouts. arXiv preprint arXiv:1612.00215(2016)."},{"key":"e_1_3_2_2_25_1","doi-asserted-by":"crossref","unstructured":"Alex Klein Zerrin Yumak Arjen Beij and A.\u00a0Frank van\u00a0der Stappen. 2019. Data-Driven Gaze Animation Using Recurrent Neural Networks. In Motion Interaction and Games(Newcastle upon Tyne United Kingdom) (MIG \u201919). Article 4 11\u00a0pages.  Alex Klein Zerrin Yumak Arjen Beij and A.\u00a0Frank van\u00a0der Stappen. 2019. Data-Driven Gaze Animation Using Recurrent Neural Networks. In Motion Interaction and Games(Newcastle upon Tyne United Kingdom) (MIG \u201919). Article 4 11\u00a0pages.","DOI":"10.1145\/3359566.3360054"},{"key":"e_1_3_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.5898\/JHRI.2.1.Kondo"},{"key":"e_1_3_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/3382507.3418815"},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2012.74"},{"key":"e_1_3_2_2_29_1","doi-asserted-by":"crossref","unstructured":"Sooha\u00a0Park Lee Jeremy\u00a0B Badler and Norman\u00a0I Badler. 2002. Eyes alive. In ACM Transactions on Graphics (TOG) Vol.\u00a021. ACM 637\u2013644.  Sooha\u00a0Park Lee Jeremy\u00a0B Badler and Norman\u00a0I Badler. 2002. Eyes alive. In ACM Transactions on Graphics (TOG) Vol.\u00a021. ACM 637\u2013644.","DOI":"10.1145\/566654.566629"},{"key":"e_1_3_2_2_30_1","doi-asserted-by":"crossref","unstructured":"Sergey Levine Philipp Kr\u00e4henb\u00fchl Sebastian Thrun and Vladlen Koltun. 2010. Gesture controllers. In ACM Transactions on Graphics (TOG) Vol.\u00a029. ACM 124.  Sergey Levine Philipp Kr\u00e4henb\u00fchl Sebastian Thrun and Vladlen Koltun. 2010. Gesture controllers. In ACM Transactions on Graphics (TOG) Vol.\u00a029. ACM 124.","DOI":"10.1145\/1833349.1778861"},{"key":"e_1_3_2_2_31_1","doi-asserted-by":"crossref","unstructured":"Sergey Levine Christian Theobalt and Vladlen Koltun. 2009. Real-time prosody-driven synthesis of body language. In ACM Transactions on Graphics (TOG) Vol.\u00a028. ACM 172.  Sergey Levine Christian Theobalt and Vladlen Koltun. 2009. Real-time prosody-driven synthesis of body language. In ACM Transactions on Graphics (TOG) Vol.\u00a028. ACM 172.","DOI":"10.1145\/1661412.1618518"},{"key":"e_1_3_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.01022"},{"key":"e_1_3_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52688.2022.01021"},{"key":"e_1_3_2_2_34_1","volume-title":"Virtual Reality Conference, 2009. VR 2009. IEEE. IEEE, 143\u2013150","author":"Ma Xiaohan","year":"2009","unstructured":"Xiaohan Ma and Zhigang Deng . 2009 . Natural eye motion synthesis by modeling gaze-head coupling . In Virtual Reality Conference, 2009. VR 2009. IEEE. IEEE, 143\u2013150 . Xiaohan Ma and Zhigang Deng. 2009. Natural eye motion synthesis by modeling gaze-head coupling. In Virtual Reality Conference, 2009. VR 2009. IEEE. IEEE, 143\u2013150."},{"key":"e_1_3_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/2485895.2485900"},{"key":"e_1_3_2_2_36_1","unstructured":"Michael Mathieu Camille Couprie and Yann LeCun. 2015. Deep multi-scale video prediction beyond mean square error. arXiv preprint arXiv:1511.05440(2015).  Michael Mathieu Camille Couprie and Yann LeCun. 2015. Deep multi-scale video prediction beyond mean square error. arXiv preprint arXiv:1511.05440(2015)."},{"key":"e_1_3_2_2_37_1","volume-title":"AAAI Fall Symposium: Dialog with Robots.","author":"Matsuyama Yoichi","year":"2010","unstructured":"Yoichi Matsuyama , Hikaru Taniyama , Shinya Fujie , and Tetsunori Kobayashi . 2010 . Framework of Communication Activation Robot Participating in Multiparty Conversation .. In AAAI Fall Symposium: Dialog with Robots. Yoichi Matsuyama, Hikaru Taniyama, Shinya Fujie, and Tetsunori Kobayashi. 2010. Framework of Communication Activation Robot Participating in Multiparty Conversation.. In AAAI Fall Symposium: Dialog with Robots."},{"key":"e_1_3_2_2_38_1","unstructured":"Mehdi Mirza and Simon Osindero. 2014. Conditional generative adversarial nets. arXiv preprint arXiv:1411.1784(2014).  Mehdi Mirza and Simon Osindero. 2014. Conditional generative adversarial nets. arXiv preprint arXiv:1411.1784(2014)."},{"key":"e_1_3_2_2_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/1514095.1514109"},{"key":"e_1_3_2_2_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/1088463.1088497"},{"key":"e_1_3_2_2_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.278"},{"key":"e_1_3_2_2_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/965400.965484"},{"key":"e_1_3_2_2_43_1","unstructured":"Scott Reed Zeynep Akata Xinchen Yan Lajanugen Logeswaran Bernt Schiele and Honglak Lee. 2016b. Generative adversarial text to image synthesis. arXiv preprint arXiv:1605.05396(2016).  Scott Reed Zeynep Akata Xinchen Yan Lajanugen Logeswaran Bernt Schiele and Honglak Lee. 2016b. Generative adversarial text to image synthesis. arXiv preprint arXiv:1605.05396(2016)."},{"key":"e_1_3_2_2_44_1","unstructured":"Scott\u00a0E Reed Zeynep Akata Santosh Mohan Samuel Tenka Bernt Schiele and Honglak Lee. 2016a. Learning what and where to draw. In Advances in Neural Information Processing Systems. 217\u2013225.  Scott\u00a0E Reed Zeynep Akata Santosh Mohan Samuel Tenka Bernt Schiele and Honglak Lee. 2016a. Learning what and where to draw. In Advances in Neural Information Processing Systems. 217\u2013225."},{"key":"e_1_3_2_2_45_1","unstructured":"Kerstin Ruhland Sean Andrist Jeremy Badler Christopher Peters Norman Badler Michael Gleicher Bilge Mutlu and Rachel Mcdonnell. 2014. Look me in the eyes: A survey of eye and gaze animation for virtual agents and artificial systems. In Eurographics State-of-the-Art Report. 69\u201391.  Kerstin Ruhland Sean Andrist Jeremy Badler Christopher Peters Norman Badler Michael Gleicher Bilge Mutlu and Rachel Mcdonnell. 2014. Look me in the eyes: A survey of eye and gaze animation for virtual agents and artificial systems. In Eurographics State-of-the-Art Report. 69\u201391."},{"key":"e_1_3_2_2_46_1","doi-asserted-by":"publisher","DOI":"10.1145\/1015706.1015753"},{"key":"e_1_3_2_2_47_1","volume-title":"Gerrit van\u00a0der Veer, and Harro Vons","author":"Vertegaal Roel","year":"2000","unstructured":"Roel Vertegaal , Gerrit van\u00a0der Veer, and Harro Vons . 2000 . Effects of gaze on multiparty mediated communication. In Graphics Interface . 95\u2013102. Roel Vertegaal, Gerrit van\u00a0der Veer, and Harro Vons. 2000. Effects of gaze on multiparty mediated communication. In Graphics Interface. 95\u2013102."},{"key":"e_1_3_2_2_48_1","volume-title":"Computer Graphics Forum, Vol.\u00a023","author":"Vinayagamoorthy Vinoba","unstructured":"Vinoba Vinayagamoorthy , Maia Garau , Anthony Steed , and Mel Slater . 2004. An eye gaze model for dyadic interaction in an immersive virtual environment: Practice and experience . In Computer Graphics Forum, Vol.\u00a023 . Wiley Online Library , 1\u201311. Vinoba Vinayagamoorthy, Maia Garau, Anthony Steed, and Mel Slater. 2004. An eye gaze model for dyadic interaction in an immersive virtual environment: Practice and experience. In Computer Graphics Forum, Vol.\u00a023. Wiley Online Library, 1\u201311."},{"key":"e_1_3_2_2_49_1","doi-asserted-by":"publisher","DOI":"10.1145\/2897824.2925947"},{"key":"e_1_3_2_2_50_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46493-0_20"},{"key":"e_1_3_2_2_51_1","volume-title":"Pragmatics of human communication: A study of interactional patterns, pathologies and paradoxes","author":"Watzlawick Paul","unstructured":"Paul Watzlawick , Janet\u00a0Beavin Bavelas , and Don\u00a0 D Jackson . 2011. Pragmatics of human communication: A study of interactional patterns, pathologies and paradoxes . WW Norton & Company . Paul Watzlawick, Janet\u00a0Beavin Bavelas, and Don\u00a0D Jackson. 2011. Pragmatics of human communication: A study of interactional patterns, pathologies and paradoxes. WW Norton & Company."},{"key":"e_1_3_2_2_52_1","doi-asserted-by":"publisher","DOI":"10.1111\/cgf.14114"},{"key":"e_1_3_2_2_53_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46484-8_31"},{"key":"e_1_3_2_2_54_1","doi-asserted-by":"publisher","DOI":"10.1145\/3414685.3417838"},{"key":"e_1_3_2_2_55_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2019.8793720"},{"key":"e_1_3_2_2_56_1","unstructured":"Yi Yu and Simon Canales. 2019. Conditional LSTM-GAN for Melody Generation from Lyrics. arXiv preprint arXiv:1908.05551(2019).  Yi Yu and Simon Canales. 2019. Conditional LSTM-GAN for Melody Generation from Lyrics. arXiv preprint arXiv:1908.05551(2019)."},{"key":"e_1_3_2_2_57_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.244"}],"event":{"name":"MIG '22: ACM SIGGRAPH Conference on Motion, Interaction and Games","location":"Guanajuato Mexico","acronym":"MIG '22","sponsor":["SIGGRAPH ACM Special Interest Group on Computer Graphics and Interactive Techniques"]},"container-title":["Proceedings of the 15th ACM SIGGRAPH Conference on Motion, Interaction and Games"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3561975.3562954","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3561975.3562954","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3561975.3562954","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T17:49:07Z","timestamp":1750182547000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3561975.3562954"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,11,3]]},"references-count":57,"alternative-id":["10.1145\/3561975.3562954","10.1145\/3561975"],"URL":"https:\/\/doi.org\/10.1145\/3561975.3562954","relation":{},"subject":[],"published":{"date-parts":[[2022,11,3]]},"assertion":[{"value":"2022-11-03","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}