{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T16:48:52Z","timestamp":1777654132274,"version":"3.51.4"},"reference-count":78,"publisher":"Association for Computing Machinery (ACM)","issue":"12","license":[{"start":{"date-parts":[[2024,11,18]],"date-time":"2024-11-18T00:00:00Z","timestamp":1731888000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["62402018, 62277001, 62201017, 62272014"],"award-info":[{"award-number":["62402018, 62277001, 62201017, 62272014"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Beijing Natural Science Foundation","award":["L233026"],"award-info":[{"award-number":["L233026"]}]},{"name":"R&D Program of Beijing Municipal Education Commission","award":["KM202410011017"],"award-info":[{"award-number":["KM202410011017"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2024,12,31]]},"abstract":"<jats:p>Although deep networks-based 3D reconstruction methods can recover the 3D geometry given few inputs, they may produce unfaithful reconstruction when predicting occluded parts of 3D objects. To address the issue, we propose Detail-Enhanced Generative Adversarial Network (DEGAN) which consists of Encoder\u2013Decoder-Based Generator (EDGen) and Voxel-Point Embedding Network-Based Discriminator (VPDis) for 3D reconstruction from a monocular depth image of an object. Firstly, EDGen decodes the features from the 2.5D voxel grid representation of an input depth image and generates the 3D occupancy grid under GAN losses and a sampling point loss. The sampling loss can improve the accuracy of predicted points with high uncertainty. VPDis helps reconstruct the details under voxel and point adversarial losses, respectively. Experimental results show that DEGAN not only outperforms several state-of-the-art methods on both public ModelNet and ShapeNet datasets but also predicts more reliable occluded\/missing parts of 3D objects.<\/jats:p>","DOI":"10.1145\/3690826","type":"journal-article","created":{"date-parts":[[2024,8,30]],"date-time":"2024-08-30T16:20:05Z","timestamp":1725034805000},"page":"1-17","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":8,"title":["DEGAN: Detail-Enhanced Generative Adversarial Network for Monocular Depth-Based 3D Reconstruction"],"prefix":"10.1145","volume":"20","author":[{"ORCID":"https:\/\/orcid.org\/0009-0002-6165-6436","authenticated-orcid":false,"given":"Caixia","family":"Liu","sequence":"first","affiliation":[{"name":"Beijing Key Laboratory of Big Data Technology for Food Safety, School of Computer and Artificial Intelligence, Beijing Technology and Business University, Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0003-1296-0869","authenticated-orcid":false,"given":"Yali","family":"Chen","sequence":"additional","affiliation":[{"name":"Beijing Key Laboratory of Big Data Technology for Food Safety, School of Computer and Artificial Intelligence, Beijing Technology and Business University, Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0008-5742-8506","authenticated-orcid":false,"given":"Minhong","family":"Zhu","sequence":"additional","affiliation":[{"name":"Beijing Key Laboratory of Big Data Technology for Food Safety, School of Computer and Artificial Intelligence, Beijing Technology and Business University, Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0005-4774-2495","authenticated-orcid":false,"given":"Chenhui","family":"Hao","sequence":"additional","affiliation":[{"name":"Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4861-0513","authenticated-orcid":false,"given":"Haisheng","family":"Li","sequence":"additional","affiliation":[{"name":"Beijing Key Laboratory of Big Data Technology for Food Safety, School of Computer and Artificial Intelligence, Beijing Technology and Business University, Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0657-6217","authenticated-orcid":false,"given":"Xiaochuan","family":"Wang","sequence":"additional","affiliation":[{"name":"Beijing Key Laboratory of Big Data Technology for Food Safety, School of Computer and Artificial Intelligence, Beijing Technology and Business University, Beijing, China"}]}],"member":"320","published-online":{"date-parts":[[2024,11,18]]},"reference":[{"key":"e_1_3_1_2_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2018.2868195"},{"key":"e_1_3_1_3_2","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2018.2844087"},{"key":"e_1_3_1_4_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCSVT.2016.2596118"},{"key":"e_1_3_1_5_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2007.4408933"},{"key":"e_1_3_1_6_2","first-page":"564","volume-title":"ECCV","author":"Furukawa Yasutaka","year":"2006","unstructured":"Yasutaka Furukawa and Jean Ponce. 2006. Carved visual hulls for image-based modeling. In ECCV, 564\u2013577."},{"key":"e_1_3_1_7_2","first-page":"1912","volume-title":"CVPR","author":"Wu Zhirong","year":"2015","unstructured":"Zhirong Wu, Shuran Song, Aditya Khosla, Fisher Yu, Linguang Zhang, Xiaoou Tang, and Jianxiong Xiao. 2015. 3D ShapeNets: A deep representation for volumetric shapes. In CVPR, 1912\u20131920."},{"key":"e_1_3_1_8_2","first-page":"263","volume-title":"3DV","author":"Gwak JunYoung","year":"2017","unstructured":"JunYoung Gwak, Christopher B. Choy, Manmohan Chandraker, Animesh Garg, and Silvio Savarese. 2017. Weakly supervised 3D reconstruction with adversarial constraint. In 3DV, 263\u2013272."},{"key":"e_1_3_1_9_2","first-page":"365","volume-title":"ECCV","author":"Wu Jiajun","year":"2016","unstructured":"Jiajun Wu, Tianfan Xue, Joseph J. Lim, Yuandong Tian, Joshua B. Tenenbaum, Antonio Torralba, and William T. Freeman. 2016. Single image 3D interpreter network. In ECCV, 365\u2013382."},{"key":"e_1_3_1_10_2","doi-asserted-by":"publisher","DOI":"10.1109\/ISMAR.2011.6092378"},{"key":"e_1_3_1_11_2","doi-asserted-by":"publisher","DOI":"10.1145\/2508363.2508374"},{"key":"e_1_3_1_12_2","first-page":"3264","volume-title":"ICCV","author":"Steinbrucker Frank","year":"2013","unstructured":"Frank Steinbrucker, Christian Kerl, and Daniel Cremers. 2013. Large-scale multi-resolution surface reconstruction from RGB-D sequences. In ICCV, 3264\u20133271."},{"key":"e_1_3_1_13_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46484-8_38"},{"key":"e_1_3_1_14_2","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2016.2612761"},{"issue":"7","key":"e_1_3_1_15_2","first-page":"1681","article-title":"Light field reconstruction using convolutional network on EPI and extended applications","volume":"41","author":"Wu Gaochang","year":"2018","unstructured":"Gaochang Wu, Yebin Liu, Lu Fang, Qionghai Dai, and Tianyou Chai. 2018. Light field reconstruction using convolutional network on EPI and extended applications. IEEE Trans. Pattern Anal. Mach. Intell. 41, 7 (2018), 1681\u20131694.","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"e_1_3_1_16_2","unstructured":"Lingjing Wang and Yi Fang. 2017. Unsupervised 3D reconstruction from a single image via adversarial learning. arXiv:1711.09312."},{"issue":"518","key":"e_1_3_1_17_2","first-page":"679","article-title":"3D object reconstruction from a single depth view with adversarial learning","volume":"112","author":"Yang Bo","year":"2017","unstructured":"Bo Yang, Hongkai Wen, Sen Wang, Ronald Clark, Andrew Markham, and Niki Trigoni. 2017. 3D object reconstruction from a single depth view with adversarial learning. ICCV Workshop 112, 518 (2017), 679\u2013688.","journal-title":"ICCV Workshop"},{"key":"e_1_3_1_18_2","first-page":"2463","volume-title":"CVPR","author":"Fan Haoqiang","year":"2017","unstructured":"Haoqiang Fan, Hao Su, and Leonidas J. Guibas Guibas. 2017. A point set generation network for 3D object reconstruction from a single image. In CVPR, 2463\u20132471."},{"key":"e_1_3_1_19_2","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2016.2613824"},{"key":"e_1_3_1_20_2","first-page":"87","article-title":"Improved adversarial systems for 3D object generation and reconstruction","author":"Smith Edward","year":"2017","unstructured":"Edward Smith and David Meger. 2017. Improved adversarial systems for 3D object generation and reconstruction. In CoRL, 87\u201396.","journal-title":"CoRL"},{"key":"e_1_3_1_21_2","first-page":"82","volume-title":"NIPS","author":"Wu Jiajun","year":"2016","unstructured":"Jiajun Wu, Chengkai Zhang, Tianfan Xue, Bill Freeman, and Josh Tenenbaum. 2016. Learning a probabilistic latent space of object shapes via 3D generative-adversarial modeling. In NIPS, 82\u201390."},{"issue":"2","key":"e_1_3_1_22_2","doi-asserted-by":"crossref","first-page":"697","DOI":"10.2991\/ijcis.d.190617.001","article-title":"3D model generation and reconstruction using conditional generative adversarial network","volume":"12","author":"Li Haisheng","year":"2019","unstructured":"Haisheng Li, Yanping Zheng, Xiaoqun Wu, and Qiang Cai. 2019. 3D model generation and reconstruction using conditional generative adversarial network. Int. J. Comput. Intell. 12, 2 (2019), 697\u2013705.","journal-title":"Int. J. Comput. Intell."},{"key":"e_1_3_1_23_2","first-page":"16102","volume-title":"CVPR","author":"Chan Eric R.","year":"2022","unstructured":"Eric R. Chan, Connor Z. Lin, Matthew A. Chan, Koki Nagano, Boxiao Pan, Shalini De Mello, Orazio Gallo, Leonidas J. Guibas, Jonathan Tremblay, Sameh Khamis, Tero Karras, and Gordon Wetzstein. 2022. Efficient geometry-aware 3D generative adversarial networks. In CVPR, 16102\u201316112."},{"key":"e_1_3_1_24_2","first-page":"365","volume-title":"ACCV","author":"Pontes Jhony K.","year":"2018","unstructured":"Jhony K. Pontes, Chen Kong, Sridha Sridharan, Simon Lucey, Anders Eriksson, and Clinton Fookes. 2018. Image2Mesh: A learning framework for single image 3D reconstruction. In ACCV, 365\u2013381."},{"key":"e_1_3_1_25_2","doi-asserted-by":"publisher","DOI":"10.1111\/cgf.12573"},{"key":"e_1_3_1_26_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46466-4_29"},{"key":"e_1_3_1_27_2","first-page":"236","volume-title":"ECCV","author":"Sharma Abhishek","year":"2016","unstructured":"Abhishek Sharma, Oliver Grau, and Mario Fritz. 2016. VCONV-DAE: Deep volumetric shape learning without object labels. In ECCV, 236\u2013250."},{"key":"e_1_3_1_28_2","first-page":"7114","volume-title":"AAAI","author":"Lin Chen-Hsuan","year":"2018","unstructured":"Chen-Hsuan Lin, Chen Kong, and Simon Lucey. 2018. Learning efficient point cloud generation for dense 3D object reconstruction. In AAAI, 7114\u20137121."},{"key":"e_1_3_1_29_2","first-page":"2442","volume-title":"IROS","author":"Varley Jacob","year":"2017","unstructured":"Jacob Varley, Chad DeChant, Adam Richardson, Joaqu\u00edn Ruales, and Peter K. Allen. 2017. Shape completion enabled robotic grasping. In IROS, 2442\u20132447."},{"key":"e_1_3_1_30_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2023.3268305"},{"key":"e_1_3_1_31_2","doi-asserted-by":"publisher","DOI":"10.1111\/cgf.13240"},{"key":"e_1_3_1_32_2","first-page":"4996","article-title":"Unsupervised learning of 3D structure from images","author":"Rezende Danilo Jimenez","year":"2016","unstructured":"Danilo Jimenez Rezende, S. M. Ali Eslami, Shakir Mohamed, Peter Battaglia, Max Jaderberg, and Nicolas Heess. 2016. Unsupervised learning of 3D structure from images. In NIPS, 4996\u20135004.","journal-title":"NIPS"},{"key":"e_1_3_1_33_2","first-page":"3793","volume-title":"CVPR","author":"Xin Wen","year":"2022","unstructured":"Wen Xin, Zhou Junsheng, Liu YuShen, Su Hua, Dong Zhen, and Han Zhizhong. 2022. 3D shape reconstruction from 2D images with disentangled attribute flow. In CVPR, 3793\u20133803."},{"key":"e_1_3_1_34_2","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2022.3196334"},{"key":"e_1_3_1_35_2","doi-asserted-by":"crossref","unstructured":"Xiaoxiao Long YuanChen Guo Cheng Lin Yuan Liu Zhiyang Dou Lingjie Liu Yuexin Ma SongHai Zhang Marc Habermann Christian Theobalt and Wenping Wang. 2023. Wonder3D: Single image to 3D using cross-domain diffusion. arXiv:2310.15008.","DOI":"10.1109\/CVPR52733.2024.00951"},{"key":"e_1_3_1_36_2","first-page":"1","volume-title":"CVPR","author":"Anciukevicius Titas","year":"2023","unstructured":"Titas Anciukevicius, Zexiang Xu, Matthew Fisher, Paul Henderson, Hakan Bilen, Niloy J. Mitra, and Paul Guerrero. 2023. RenderDiffusion: Image diffusion for 3D reconstruction, inpainting and generation. In CVPR, 1\u201315."},{"key":"e_1_3_1_37_2","first-page":"1","volume-title":"ICCV","author":"Chen Hansheng","year":"2023","unstructured":"Hansheng Chen, Jiatao Gu, Anpei Chen, Wei Tian, Zhuowen Tu, Lingjie Liu, and Hao Su. 2023. Single-stage diffusion NeRF: A unified approach to 3D generation and reconstruction. In ICCV, 1\u201319."},{"key":"e_1_3_1_38_2","unstructured":"Jiatao Gu Qingzhe Gao Shuangfei Zhai Baoquan Chen Lingjie Liu and Josh M. Susskind. 2023. Learning controllable 3D diffusion models from single-view images. arXiv:2304.06700."},{"key":"e_1_3_1_39_2","first-page":"12588","volume-title":"CVPR","author":"Zhou Zhizhuo","year":"2023","unstructured":"Zhizhuo Zhou and Shubham Tulsiani. 2023. SparseFusion: Distilling view-conditioned diffusion for 3D reconstruction. In CVPR, 12588\u201312597."},{"key":"e_1_3_1_40_2","first-page":"67","volume-title":"ICANN","author":"Liu Caixia","year":"2021","unstructured":"Caixia Liu, Dehui Kong, Shaofan Wang, Jinghua Li, and Baocai Yin. 2021. Latent feature-aware and local structure-preserving network for 3D completion from a single depth view. In ICANN, 67\u201379."},{"key":"e_1_3_1_41_2","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2020.3017924"},{"issue":"4","key":"e_1_3_1_42_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3506733","article-title":"A spatial relationship preserving adversarial network for 3D reconstruction from a single depth view","volume":"18","author":"Liu Caixia","year":"2022","unstructured":"Caixia Liu, Dehui Kong, Shaofan Wang, Jinghua Li, and Baocai Yin. 2022. A spatial relationship preserving adversarial network for 3D reconstruction from a single depth view. ACM Trans. Multim. Comput. Commun. Appl. 18, 4 (2022), 1\u201322.","journal-title":"ACM Trans. Multim. Comput. Commun. Appl."},{"key":"e_1_3_1_43_2","first-page":"1","volume-title":"ICCV","author":"Cui Ruikai","year":"2023","unstructured":"Ruikai Cui, Shi Qiu, Saeed Anwar, Jiawei Liu, Chaoyue Xing, Jing Zhang, and Nick Barnes. 2023. P2C: Self-supervised point cloud completion from single partial clouds. In ICCV, 1\u201310."},{"key":"e_1_3_1_44_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01267-0_23"},{"key":"e_1_3_1_45_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01219-9_43"},{"key":"e_1_3_1_46_2","first-page":"55","volume-title":"ECCV","author":"Wang Nanyang","year":"2018","unstructured":"Nanyang Wang, Yinda Zhang, Zhuwen Li, Yanwei Fu, Wei Liu, and Yu-Gang Jiang. 2018. Pixel2Mesh: Generating 3D mesh models from single RGB images. In ECCV, Vol. 11, 55\u201371."},{"key":"e_1_3_1_47_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2023.02.041"},{"key":"e_1_3_1_48_2","first-page":"6545","volume-title":"CVPR","author":"Dai Angela","year":"2017","unstructured":"Angela Dai, Charles Ruizhongtai Qi, and Matthias Nie\u00dfner. 2017. Shape completion using 3D-encoder-predictor CNNs and shape synthesis. In CVPR, 6545\u20136554."},{"key":"e_1_3_1_49_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00025"},{"key":"e_1_3_1_50_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00609"},{"key":"e_1_3_1_51_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00459"},{"key":"e_1_3_1_52_2","first-page":"490","volume-title":"NIPS","author":"Xu Qiangeng","year":"2019","unstructured":"Qiangeng Xu, Weiyue Wang, Duygu Ceylan, Radom\u00edr Mech, and Ulrich Neumann. 2019. DISN: Deep implicit surface network for high-quality single-view 3D reconstruction. In NIPS, 490\u2013500."},{"key":"e_1_3_1_53_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2020.10.097"},{"key":"e_1_3_1_54_2","first-page":"94","volume-title":"CVPR","author":"Wang Zhen","year":"2023","unstructured":"Zhen Wang, Shijie Zhou, Jeong Joon Park, Despoina Paschalidou, Suya You, Gordon Wetzstein, Leonidas Guibas, and Achuta Kadambi. 2023. ALTO: Alternating latent topologies for implicit 3D reconstruction. In CVPR, 94\u2013103."},{"key":"e_1_3_1_55_2","first-page":"26744","volume-title":"NIPS","author":"Qi Zekun","year":"2023","unstructured":"Zekun Qi, Muzhou Yu, Runpei Dong, and Kaisheng Ma. 2023. VPP: Efficient conditional 3D generation via voxel-point progressive representation. In NIPS, 26744\u201326763."},{"key":"e_1_3_1_56_2","first-page":"728","volume-title":"3DV","author":"Yuan Wentao","year":"2018","unstructured":"Wentao Yuan, Tejas Khot, David Held, Christoph Mertz, and Martial Hebert. 2018. PCN: Point completion network. In 3DV, 728\u2013737."},{"key":"e_1_3_1_57_2","first-page":"313","volume-title":"ECCV","volume":"8","author":"Speciale Pablo","year":"2016","unstructured":"Pablo Speciale, Martin R. Oswald, Andrea Cohen, and Marc Pollefeys. 2016. A symmetry prior for convex variational 3D reconstruction. In ECCV, Vol. 8, 313\u2013328."},{"key":"e_1_3_1_58_2","doi-asserted-by":"publisher","DOI":"10.1145\/3177756"},{"key":"e_1_3_1_59_2","first-page":"53","volume-title":"IJCV","author":"Yang Bo","year":"2020","unstructured":"Bo Yang, Sen Wang, Andrew Markham, and Niki Trigoni. 2020. Robust attentional aggregation of deep feature sets for multi-view 3D reconstruction. In IJCV, 53\u201373."},{"key":"e_1_3_1_60_2","first-page":"9264","volume-title":"ICCV","author":"Liu Ruoshi","year":"2023","unstructured":"Ruoshi Liu, Rundi Wu, Basile Van Hoorick, Pavel Tokmakov, Sergey Zakharov, and Carl Vondrick. 2023. Zero-1-to-3: Zero-shot one image to 3D object. In ICCV, 9264\u20139275."},{"key":"e_1_3_1_61_2","first-page":"1","volume-title":"ICLR","author":"Hong Yicong","year":"2023","unstructured":"Yicong Hong, Kai Zhang, Jiuxiang Gu, Sai Bi, Yang Zhou, Difan Liu, Feng Liu, Kalyan Sunkavalli, Trung Bui, and Hao Tan. 2023. LRM: Large reconstruction model for single image to 3D. In ICLR, 1\u201325."},{"key":"e_1_3_1_62_2","first-page":"2902","volume-title":"ICCV Workshops","author":"Wei Yao","year":"2023","unstructured":"Yao Wei, George Vosselman, and Michael Ying Yang. 2023. BuilDiff: 3D building shape generation using single-image conditional point cloud diffusion models. In ICCV Workshops, 2902\u20132911."},{"key":"e_1_3_1_63_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52729.2023.01242"},{"key":"e_1_3_1_64_2","first-page":"1248","volume-title":"CVPR","author":"Jiang Yue","year":"2020","unstructured":"Yue Jiang, Dantong Ji, Zhizhong Han, and Matthias Zwicker. 2020. SDFDiff: Differentiable rendering of signed distance fields for 3D shape optimization. In CVPR, 1248\u20131258."},{"key":"e_1_3_1_65_2","first-page":"1","article-title":"Colorful 3D reconstruction at high resolution using multi-view representation","volume":"85","author":"Zheng Yanping","year":"2022","unstructured":"Yanping Zheng, Guang Zeng, Haisheng Li, Qiang Cai, and Junping Du. 2022. Colorful 3D reconstruction at high resolution using multi-view representation. J. Vis. Commun. Image Represent. 85, 103486 (2022), 1\u201310.","journal-title":"J. Vis. Commun. Image Represent."},{"key":"e_1_3_1_66_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00744"},{"key":"e_1_3_1_67_2","first-page":"5603","volume-title":"CVPR","author":"Kong Chen","year":"2017","unstructured":"Chen Kong, Chenhsuan Lin, and Simon Lucey. 2017. Using locally corresponding CAD models for dense 3D reconstructions from a single image. In CVPR, 5603\u20135611."},{"key":"e_1_3_1_68_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00030"},{"key":"e_1_3_1_69_2","first-page":"2690","volume-title":"ICCV","author":"Xie Haozhe","year":"2019","unstructured":"Haozhe Xie, Hongxun Yao, Xiaoshuai Sun, Shangchen Zhou, and Shengping Zhang. 2019. Pix2Vox: Context-aware 3D reconstruction from single and multi-view images. In ICCV, 2690\u20132698."},{"key":"e_1_3_1_70_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"e_1_3_1_71_2","first-page":"75","volume-title":"WACV","author":"Xiang Yu","year":"2014","unstructured":"Yu Xiang, Roozbeh Mottaghi, and Silvio Savarese. 2014. Beyond PASCAL: A benchmark for 3D object detection in the wild. In WACV, 75\u201382."},{"key":"e_1_3_1_72_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2013.372"},{"key":"e_1_3_1_73_2","first-page":"2553","volume-title":"ICRA","author":"Downs Laura","year":"2022","unstructured":"Laura Downs, Anthony Francis, Nate Koenig, Brandon Kinman, Ryan Hickman, Krista Reymann, Thomas B. McHugh, and Vincent Vanhoucke. 2022. Google scanned objects: A high-quality dataset of 3D scanned household items. In ICRA, 2553\u20132560."},{"key":"e_1_3_1_74_2","first-page":"10377","volume-title":"ICCV","author":"Selvaraju Pratheba","year":"2021","unstructured":"Pratheba Selvaraju, Mohamed Nabail, Marios Loizou, Maria Maslioukova, Melinos Averkiou, Andreas Andreou, Siddhartha Chaudhuri, and Evangelos Kalogerakis. 2021. BuildingNet: Learning to label 3D buildings. In ICCV, 10377\u201310387."},{"key":"e_1_3_1_75_2","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2016.2614483"},{"key":"e_1_3_1_76_2","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2019.2934799"},{"key":"e_1_3_1_77_2","first-page":"1","volume-title":"ICLR","author":"Kingma Diederik P.","year":"2015","unstructured":"Diederik P. Kingma and Jimmy Ba. 2015. Adam: A method for stochastic optimization. In ICLR, 1\u201315."},{"key":"e_1_3_1_78_2","first-page":"383","volume-title":"CVPR","author":"Tchapmi Lyne P.","year":"2019","unstructured":"Lyne P. Tchapmi, Vineet Kosaraju, Hamid Rezatofighi, Ian D. Reid, and Silvio Savarese. 2019. TopNet: Structural point cloud decoder. In CVPR, 383\u2013392."},{"key":"e_1_3_1_79_2","first-page":"787","volume-title":"CVPR","author":"Wang Xiaogang","year":"2020","unstructured":"Xiaogang Wang, Marcelo H. Ang, and Gim Hee Lee. 2020. Cascaded refinement network for point cloud completion. In CVPR, 787\u2013796."}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3690826","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3690826","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T00:58:06Z","timestamp":1750294686000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3690826"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,11,18]]},"references-count":78,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2024,12,31]]}},"alternative-id":["10.1145\/3690826"],"URL":"https:\/\/doi.org\/10.1145\/3690826","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"value":"1551-6857","type":"print"},{"value":"1551-6865","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,11,18]]},"assertion":[{"value":"2023-12-18","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-08-15","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-11-18","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}