{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,23]],"date-time":"2025-08-23T05:25:05Z","timestamp":1755926705388,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":14,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,10,10]],"date-time":"2022-10-10T00:00:00Z","timestamp":1665360000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Science and Technology Innovation Program for Distinguished Young Scholars of Shandong Province Higher Education Institutions","award":["2021KJ036"],"award-info":[{"award-number":["2021KJ036"]}]},{"name":"Major Basic Research Project of Natural Science Foundation of Shandong Province","award":["ZR2021ZD15"],"award-info":[{"award-number":["ZR2021ZD15"]}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62006142 and U1936203"],"award-info":[{"award-number":["62006142 and U1936203"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Shandong Provincial Natural Science Foundation for Distinguished Young Scholars","award":["ZR2021JQ26"],"award-info":[{"award-number":["ZR2021JQ26"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,10,10]]},"DOI":"10.1145\/3503161.3551569","type":"proceedings-article","created":{"date-parts":[[2022,10,10]],"date-time":"2022-10-10T15:43:12Z","timestamp":1665416592000},"page":"7013-7015","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["A Baseline for ViCo Conversational Head Generation Challenge"],"prefix":"10.1145","author":[{"given":"Meng","family":"Liu","sequence":"first","affiliation":[{"name":"Shandong Jianzhu University, Jinan, China"}]},{"given":"Shuyan","family":"Zhai","sequence":"additional","affiliation":[{"name":"Shandong University, Tsingtao, China"}]},{"given":"Yongqiang","family":"Li","sequence":"additional","affiliation":[{"name":"Shandong University, Tsingtao, China"}]},{"given":"Weili","family":"Guan","sequence":"additional","affiliation":[{"name":"Monash University, Melbourne, Australia"}]},{"given":"Liqiang","family":"Nie","sequence":"additional","affiliation":[{"name":"Harbin Institute of Technology (Shenzhen), Shenzhen, China"}]}],"member":"320","published-online":{"date-parts":[[2022,10,10]]},"reference":[{"key":"e_1_3_2_2_1_1","article-title":"Deep audio-visual speech recognition","author":"Afouras Triantafyllos","year":"2018","unstructured":"Triantafyllos Afouras , Joon Son Chung , Andrew Senior , Oriol Vinyals , and Andrew Zisserman . 2018 . Deep audio-visual speech recognition . IEEE Transactions on Pattern Analysis and Machine Intelligence, 1--13. Triantafyllos Afouras, Joon Son Chung, Andrew Senior, Oriol Vinyals, and Andrew Zisserman. 2018. Deep audio-visual speech recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1--13.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence, 1--13."},{"key":"e_1_3_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/311535.311556"},{"key":"e_1_3_2_2_3_1","first-page":"1","article-title":"A no reference image blur detection using cumulative probability blur detection (cpbd) metric","volume":"1","author":"Bohr P","year":"2013","unstructured":"P Bohr , Rupali Gargote , Rupali Vhorkate , RU Yawle , and VK Bairagi . 2013 . A no reference image blur detection using cumulative probability blur detection (cpbd) metric . International Journal of Science and Modern Engineering , 1 , 1 -- 5 . P Bohr, Rupali Gargote, Rupali Vhorkate, RU Yawle, and VK Bairagi. 2013. A no reference image blur detection using cumulative probability blur detection (cpbd) metric. International Journal of Science and Modern Engineering, 1, 1--5.","journal-title":"International Journal of Science and Modern Engineering"},{"key":"e_1_3_2_2_4_1","doi-asserted-by":"crossref","unstructured":"Joon Son Chung Arsha Nagrani and Andrew Zisserman. 2018. Voxceleb2: deep speaker recognition. arXiv preprint arXiv:1806.05622 1--6.  Joon Son Chung Arsha Nagrani and Andrew Zisserman. 2018. Voxceleb2: deep speaker recognition. arXiv preprint arXiv:1806.05622 1--6.","DOI":"10.21437\/Interspeech.2018-1929"},{"key":"e_1_3_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW.2019.00038"},{"volume-title":"Deep learning (adaptive computation and machine learning series)","author":"Goodfellow Ian","key":"e_1_3_2_2_6_1","unstructured":"Ian Goodfellow , Yoshua Bengio , and Aaron Courville . 2017. Deep learning (adaptive computation and machine learning series) . Cambridge Massachusetts , 321--359. Ian Goodfellow, Yoshua Bengio, and Aaron Courville. 2017. Deep learning (adaptive computation and machine learning series). Cambridge Massachusetts, 321--359."},{"key":"e_1_3_2_2_7_1","volume-title":"Proceedings of the International Conference on Neural Information Processing Systems, 6629--6640","author":"Heusel Martin","year":"2017","unstructured":"Martin Heusel , Hubert Ramsauer , Thomas Unterthiner , Bernhard Nessler , and Sepp Hochreiter . 2017 . Gans trained by a two time-scale update rule converge to a local nash equilibrium . In Proceedings of the International Conference on Neural Information Processing Systems, 6629--6640 . Martin Heusel, Hubert Ramsauer, Thomas Unterthiner, Bernhard Nessler, and Sepp Hochreiter. 2017. Gans trained by a two time-scale update rule converge to a local nash equilibrium. In Proceedings of the International Conference on Neural Information Processing Systems, 6629--6640."},{"key":"e_1_3_2_2_8_1","volume-title":"Joon Son Chung, and Andrew Zisserman","author":"Nagrani Arsha","year":"2017","unstructured":"Arsha Nagrani , Joon Son Chung, and Andrew Zisserman . 2017 . Voxceleb : a large-scale speaker identification dataset. arXiv preprint arXiv:1706.08612, 1--6. Arsha Nagrani, Joon Son Chung, and Andrew Zisserman. 2017. Voxceleb: a large-scale speaker identification dataset. arXiv preprint arXiv:1706.08612, 1--6."},{"key":"e_1_3_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/3394171.3413532"},{"key":"e_1_3_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.01350"},{"key":"e_1_3_2_2_11_1","volume-title":"Proceedings of the International Conference on Neural Information Processing Systems, 7137--7147","author":"Siarohin Aliaksandr","year":"2019","unstructured":"Aliaksandr Siarohin , St\u00e9phane Lathuili\u00e8re , Sergey Tulyakov , Elisa Ricci , and Nicu Sebe . 2019 . First order motion model for image animation . In Proceedings of the International Conference on Neural Information Processing Systems, 7137--7147 . Aliaksandr Siarohin, St\u00e9phane Lathuili\u00e8re, Sergey Tulyakov, Elisa Ricci, and Nicu Sebe. 2019. First order motion model for image animation. In Proceedings of the International Conference on Neural Information Processing Systems, 7137--7147."},{"key":"e_1_3_2_2_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/3072959.3073640"},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2003.819861"},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3414685.3417774","article-title":"Makelttalk: speaker-aware talking-head animation","volume":"39","author":"Zhou Yang","year":"2020","unstructured":"Yang Zhou , Xintong Han , Eli Shechtman , Jose Echevarria , Evangelos Kalogerakis , and Dingzeyu Li . 2020 . Makelttalk: speaker-aware talking-head animation . ACM Transactions on Graphics , 39 , 6, 1 -- 15 . Yang Zhou, Xintong Han, Eli Shechtman, Jose Echevarria, Evangelos Kalogerakis, and Dingzeyu Li. 2020. Makelttalk: speaker-aware talking-head animation. ACM Transactions on Graphics, 39, 6, 1--15.","journal-title":"ACM Transactions on Graphics"}],"event":{"name":"MM '22: The 30th ACM International Conference on Multimedia","sponsor":["SIGMM ACM Special Interest Group on Multimedia"],"location":"Lisboa Portugal","acronym":"MM '22"},"container-title":["Proceedings of the 30th ACM International Conference on Multimedia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3503161.3551569","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3503161.3551569","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T17:49:18Z","timestamp":1750182558000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3503161.3551569"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,10,10]]},"references-count":14,"alternative-id":["10.1145\/3503161.3551569","10.1145\/3503161"],"URL":"https:\/\/doi.org\/10.1145\/3503161.3551569","relation":{},"subject":[],"published":{"date-parts":[[2022,10,10]]},"assertion":[{"value":"2022-10-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}