{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:23:39Z","timestamp":1750220619105,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":35,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,10,12]],"date-time":"2020-10-12T00:00:00Z","timestamp":1602460800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"National Science Foundation","award":["1816148"],"award-info":[{"award-number":["1816148"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,10,12]]},"DOI":"10.1145\/3394171.3413754","type":"proceedings-article","created":{"date-parts":[[2020,10,12]],"date-time":"2020-10-12T13:10:44Z","timestamp":1602508244000},"page":"2308-2316","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":4,"title":["Rotationally-Consistent Novel View Synthesis for Humans"],"prefix":"10.1145","author":[{"given":"Youngjoong","family":"Kwon","sequence":"first","affiliation":[{"name":"University of North Carolina at Chapel Hill, Chapel Hill, NC, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Stefano","family":"Petrangeli","sequence":"additional","affiliation":[{"name":"Adobe, San Jose, CA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Dahun","family":"Kim","sequence":"additional","affiliation":[{"name":"Korea Advanced Institute of Science and Technology, Daejeon, South Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Haoliang","family":"Wang","sequence":"additional","affiliation":[{"name":"Adobe Research, San Jose, CA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Henry","family":"Fuchs","sequence":"additional","affiliation":[{"name":"University of North Carolina at Chapel Hill, Chapel Hill, NC, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Viswanathan","family":"Swaminathan","sequence":"additional","affiliation":[{"name":"Adobe Research, San Jose, CA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2020,10,12]]},"reference":[{"key":"e_1_3_2_2_1_1","volume-title":"et almbox","author":"Chang Angel X","year":"2015","unstructured":"Angel X Chang , Thomas Funkhouser , Leonidas Guibas , Pat Hanrahan , Qixing Huang , Zimo Li , Silvio Savarese , Manolis Savva , Shuran Song , Hao Su , et almbox . 2015 . Shapenet : An information-rich 3d model repository. arXiv preprint arXiv:1512.03012 (2015). Angel X Chang, Thomas Funkhouser, Leonidas Guibas, Pat Hanrahan, Qixing Huang, Zimo Li, Silvio Savarese, Manolis Savva, Shuran Song, Hao Su, et almbox. 2015. Shapenet: An information-rich 3d model repository. arXiv preprint arXiv:1512.03012 (2015)."},{"key":"e_1_3_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46484-8_38"},{"key":"e_1_3_2_2_3_1","volume-title":"Frederic Besse, Fabio Viola, Ari S Morcos, Marta Garnelo, Avraham Ruderman, Andrei A Rusu, Ivo Danihelka, Karol Gregor, et almbox.","author":"Ali Eslami SM","year":"2018","unstructured":"SM Ali Eslami , Danilo Jimenez Rezende , Frederic Besse, Fabio Viola, Ari S Morcos, Marta Garnelo, Avraham Ruderman, Andrei A Rusu, Ivo Danihelka, Karol Gregor, et almbox. 2018 . Neural scene representation and rendering. Science , Vol. 360 , 6394 (2018), 1204--1210. SM Ali Eslami, Danilo Jimenez Rezende, Frederic Besse, Fabio Viola, Ari S Morcos, Marta Garnelo, Avraham Ruderman, Andrei A Rusu, Ivo Danihelka, Karol Gregor, et almbox. 2018. Neural scene representation and rendering. Science, Vol. 360, 6394 (2018), 1204--1210."},{"key":"e_1_3_2_2_4_1","unstructured":"Adobe Fuse. [n.d.]. https:\/\/www.adobe.com\/products\/fuse.html.  Adobe Fuse. [n.d.]. https:\/\/www.adobe.com\/products\/fuse.html."},{"key":"e_1_3_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46466-4_29"},{"key":"e_1_3_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.179"},{"key":"e_1_3_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2013.248"},{"key":"e_1_3_2_2_8_1","volume-title":"et almbox","author":"Jaderberg Max","year":"2015","unstructured":"Max Jaderberg , Karen Simonyan , Andrew Zisserman , et almbox . 2015 . Spatial transformer networks. In Advances in neural information processing systems. 2017--2025. Max Jaderberg, Karen Simonyan, Andrew Zisserman, et almbox. 2015. Spatial transformer networks. In Advances in neural information processing systems. 2017--2025."},{"key":"e_1_3_2_2_9_1","unstructured":"Abhishek Kar Christian H\"ane and Jitendra Malik. 2017. Learning a multi-view stereo machine. In Advances in neural information processing systems. 365--376.  Abhishek Kar Christian H\"ane and Jitendra Malik. 2017. Learning a multi-view stereo machine. In Advances in neural information processing systems. 365--376."},{"key":"e_1_3_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58548-8_23"},{"key":"e_1_3_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01267-0_11"},{"key":"e_1_3_2_2_12_1","volume-title":"Neural Volumes: Learning Dynamic Renderable Volumes from Images. arXiv preprint arXiv:1906.07751","author":"Lombardi Stephen","year":"2019","unstructured":"Stephen Lombardi , Tomas Simon , Jason Saragih , Gabriel Schwartz , Andreas Lehrmann , and Yaser Sheikh . 2019 . Neural Volumes: Learning Dynamic Renderable Volumes from Images. arXiv preprint arXiv:1906.07751 (2019). Stephen Lombardi, Tomas Simon, Jason Saragih, Gabriel Schwartz, Andreas Lehrmann, and Yaser Sheikh. 2019. Neural Volumes: Learning Dynamic Renderable Volumes from Images. arXiv preprint arXiv:1906.07751 (2019)."},{"key":"e_1_3_2_2_13_1","unstructured":"Adobe Mixamo. [n.d.]. https:\/\/www.mixamo.com.  Adobe Mixamo. [n.d.]. https:\/\/www.mixamo.com."},{"key":"e_1_3_2_2_14_1","volume-title":"Transformable Bottleneck Networks. In The IEEE International Conference on Computer Vision (ICCV).","author":"Olszewski Kyle","year":"2019","unstructured":"Kyle Olszewski , Sergey Tulyakov , Oliver Woodford , Hao Li , and Linjie Luo . 2019 . Transformable Bottleneck Networks. In The IEEE International Conference on Computer Vision (ICCV). Kyle Olszewski, Sergey Tulyakov, Oliver Woodford, Hao Li, and Linjie Luo. 2019. Transformable Bottleneck Networks. In The IEEE International Conference on Computer Vision (ICCV)."},{"key":"e_1_3_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.82"},{"key":"e_1_3_2_2_16_1","volume-title":"Deepsdf: Learning continuous signed distance functions for shape representation. arXiv preprint arXiv:1901.05103","author":"Park Jeong Joon","year":"2019","unstructured":"Jeong Joon Park , Peter Florence , Julian Straub , Richard Newcombe , and Steven Lovegrove . 2019 . Deepsdf: Learning continuous signed distance functions for shape representation. arXiv preprint arXiv:1901.05103 (2019). Jeong Joon Park, Peter Florence, Julian Straub, Richard Newcombe, and Steven Lovegrove. 2019. Deepsdf: Learning continuous signed distance functions for shape representation. arXiv preprint arXiv:1901.05103 (2019)."},{"key":"e_1_3_2_2_17_1","unstructured":"Adam Paszke Sam Gross Soumith Chintala Gregory Chanan Edward Yang Zachary DeVito Zeming Lin Alban Desmaison Luca Antiga and Adam Lerer. 2017. Automatic differentiation in pytorch. (2017).  Adam Paszke Sam Gross Soumith Chintala Gregory Chanan Edward Yang Zachary DeVito Zeming Lin Alban Desmaison Luca Antiga and Adam Lerer. 2017. Automatic differentiation in pytorch. (2017)."},{"key":"e_1_3_2_2_18_1","volume-title":"3DPeople: Modeling the Geometry of Dressed Humans. arXiv preprint arXiv:1904.04571","author":"Pumarola Albert","year":"2019","unstructured":"Albert Pumarola , Jordi Sanchez , Gary Choi , Alberto Sanfeliu , and Francesc Moreno-Noguer . 2019. 3DPeople: Modeling the Geometry of Dressed Humans. arXiv preprint arXiv:1904.04571 ( 2019 ). Albert Pumarola, Jordi Sanchez, Gary Choi, Alberto Sanfeliu, and Francesc Moreno-Noguer. 2019. 3DPeople: Modeling the Geometry of Dressed Humans. arXiv preprint arXiv:1904.04571 (2019)."},{"key":"e_1_3_2_2_19_1","volume-title":"Shakir Mohamed, Peter Battaglia, Max Jaderberg, and Nicolas Heess.","author":"Rezende Danilo Jimenez","year":"2016","unstructured":"Danilo Jimenez Rezende , SM Ali Eslami , Shakir Mohamed, Peter Battaglia, Max Jaderberg, and Nicolas Heess. 2016 . Unsupervised learning of 3d structure from images. In Advances in Neural Information Processing Systems . 4996--5004. Danilo Jimenez Rezende, SM Ali Eslami, Shakir Mohamed, Peter Battaglia, Max Jaderberg, and Nicolas Heess. 2016. Unsupervised learning of 3d structure from images. In Advances in Neural Information Processing Systems. 4996--5004."},{"key":"e_1_3_2_2_20_1","volume-title":"Learning to generate images with perceptual similarity metrics. arXiv preprint arXiv:1511.06409","author":"Ridgeway Karl","year":"2015","unstructured":"Karl Ridgeway , Jake Snell , Brett Roads , Richard S Zemel , and Michael C Mozer . 2015. Learning to generate images with perceptual similarity metrics. arXiv preprint arXiv:1511.06409 ( 2015 ). Karl Ridgeway, Jake Snell, Brett Roads, Richard S Zemel, and Michael C Mozer. 2015. Learning to generate images with perceptual similarity metrics. arXiv preprint arXiv:1511.06409 (2015)."},{"key":"e_1_3_2_2_21_1","volume-title":"PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization. arXiv preprint arXiv:1905.05172","author":"Saito Shunsuke","year":"2019","unstructured":"Shunsuke Saito , Zeng Huang , Ryota Natsume , Shigeo Morishima , Angjoo Kanazawa , and Hao Li. 2019. PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization. arXiv preprint arXiv:1905.05172 ( 2019 ). Shunsuke Saito, Zeng Huang, Ryota Natsume, Shigeo Morishima, Angjoo Kanazawa, and Hao Li. 2019. PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization. arXiv preprint arXiv:1905.05172 (2019)."},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00249"},{"key":"e_1_3_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00254"},{"key":"e_1_3_2_2_24_1","unstructured":"Vincent Sitzmann Michael Zollh\u00f6fer and Gordon Wetzstein. 2019 b. Scene representation networks: Continuous 3D-structure-aware neural scene representations. In Advances in Neural Information Processing Systems. 1119--1130.  Vincent Sitzmann Michael Zollh\u00f6fer and Gordon Wetzstein. 2019 b. Scene representation networks: Continuous 3D-structure-aware neural scene representations. In Advances in Neural Information Processing Systems. 1119--1130."},{"key":"e_1_3_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01219-9_10"},{"key":"e_1_3_2_2_26_1","volume-title":"Single-view to multi-view: Reconstructing unseen views with a convolutional network. arXiv preprint arXiv:1511.06702","author":"Tatarchenko Maxim","year":"2015","unstructured":"Maxim Tatarchenko , Alexey Dosovitskiy , and Thomas Brox . 2015. Single-view to multi-view: Reconstructing unseen views with a convolutional network. arXiv preprint arXiv:1511.06702 , Vol. 6 ( 2015 ). Maxim Tatarchenko, Alexey Dosovitskiy, and Thomas Brox. 2015. Single-view to multi-view: Reconstructing unseen views with a convolutional network. arXiv preprint arXiv:1511.06702, Vol. 6 (2015)."},{"key":"e_1_3_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.30"},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.492"},{"key":"e_1_3_2_2_29_1","volume-title":"et almbox","author":"Wang Zhou","year":"2004","unstructured":"Zhou Wang , Alan C Bovik , Hamid R Sheikh , Eero P Simoncelli , et almbox . 2004 . Image quality assessment: from error visibility to structural similarity. IEEE transactions on image processing, Vol. 13 , 4 (2004), 600--612. Zhou Wang, Alan C Bovik, Hamid R Sheikh, Eero P Simoncelli, et almbox. 2004. Image quality assessment: from error visibility to structural similarity. IEEE transactions on image processing, Vol. 13, 4 (2004), 600--612."},{"key":"e_1_3_2_2_30_1","unstructured":"Jiajun Wu Yifan Wang Tianfan Xue Xingyuan Sun Bill Freeman and Josh Tenenbaum. 2017. Marrnet: 3d shape reconstruction via 2.5 d sketches. In Advances in neural information processing systems. 540--550.  Jiajun Wu Yifan Wang Tianfan Xue Xingyuan Sun Bill Freeman and Josh Tenenbaum. 2017. Marrnet: 3d shape reconstruction via 2.5 d sketches. In Advances in neural information processing systems. 540--550."},{"key":"e_1_3_2_2_31_1","unstructured":"Jiajun Wu Chengkai Zhang Tianfan Xue Bill Freeman and Josh Tenenbaum. 2016. Learning a probabilistic latent space of object shapes via 3d generative-adversarial modeling. In Advances in neural information processing systems. 82--90.  Jiajun Wu Chengkai Zhang Tianfan Xue Bill Freeman and Josh Tenenbaum. 2016. Learning a probabilistic latent space of object shapes via 3d generative-adversarial modeling. In Advances in neural information processing systems. 82--90."},{"key":"e_1_3_2_2_32_1","unstructured":"Xinchen Yan Jimei Yang Ersin Yumer Yijie Guo and Honglak Lee. 2016. Perspective transformer nets: Learning single-view 3d object reconstruction without 3d supervision. In Advances in Neural Information Processing Systems. 1696--1704.  Xinchen Yan Jimei Yang Ersin Yumer Yijie Guo and Honglak Lee. 2016. Perspective transformer nets: Learning single-view 3d object reconstruction without 3d supervision. In Advances in Neural Information Processing Systems. 1696--1704."},{"key":"e_1_3_2_2_33_1","unstructured":"Jimei Yang Scott E Reed Ming-Hsuan Yang and Honglak Lee. 2015. Weakly-supervised disentangling with recurrent transformations for 3d view synthesis. In Advances in Neural Information Processing Systems. 1099--1107.  Jimei Yang Scott E Reed Ming-Hsuan Yang and Honglak Lee. 2015. Weakly-supervised disentangling with recurrent transformations for 3d view synthesis. In Advances in Neural Information Processing Systems. 1099--1107."},{"key":"e_1_3_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46493-0_18"},{"key":"e_1_3_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00468"}],"event":{"name":"MM '20: The 28th ACM International Conference on Multimedia","sponsor":["SIGMM ACM Special Interest Group on Multimedia"],"location":"Seattle WA USA","acronym":"MM '20"},"container-title":["Proceedings of the 28th ACM International Conference on Multimedia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3394171.3413754","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3394171.3413754","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:01:16Z","timestamp":1750197676000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3394171.3413754"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,10,12]]},"references-count":35,"alternative-id":["10.1145\/3394171.3413754","10.1145\/3394171"],"URL":"https:\/\/doi.org\/10.1145\/3394171.3413754","relation":{},"subject":[],"published":{"date-parts":[[2020,10,12]]},"assertion":[{"value":"2020-10-12","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}