{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,4]],"date-time":"2026-03-04T01:59:15Z","timestamp":1772589555793,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":81,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,10,10]],"date-time":"2022-10-10T00:00:00Z","timestamp":1665360000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/"}],"funder":[{"name":"Shanghai Biren Technology Co., Ltd."},{"name":"Ministry of Science and Technology of China","award":["2021ZD0109901"],"award-info":[{"award-number":["2021ZD0109901"]}]},{"name":"Natural Science Foundation of China (NSFC)","award":["62125106, 61860206003, 62088102, 62171256"],"award-info":[{"award-number":["62125106, 61860206003, 62088102, 62171256"]}]},{"name":"Shenzhen Key Laboratory of next generation interactive media innovative technology","award":["ZDSYS20210623092001004"],"award-info":[{"award-number":["ZDSYS20210623092001004"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,10,10]]},"DOI":"10.1145\/3503161.3547920","type":"proceedings-article","created":{"date-parts":[[2022,10,10]],"date-time":"2022-10-10T15:43:01Z","timestamp":1665416581000},"page":"6113-6124","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":4,"title":["ParseMVS: Learning Primitive-aware Surface Representations for Sparse Multi-view Stereopsis"],"prefix":"10.1145","author":[{"given":"Haiyang","family":"Ying","sequence":"first","affiliation":[{"name":"Tsinghua University &amp; Tsinghua Shenzhen International Graduate School, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jinzhi","family":"Zhang","sequence":"additional","affiliation":[{"name":"Tsinghua University &amp; Tsinghua Shenzhen International Graduate School, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yuzhe","family":"Chen","sequence":"additional","affiliation":[{"name":"Tsinghua University &amp; Tsinghua Shenzhen International Graduate School, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zheng","family":"Cao","sequence":"additional","affiliation":[{"name":"BirenTech Research, Shanghai, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jing","family":"Xiao","sequence":"additional","affiliation":[{"name":"Pingan Group, Shenzhen, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ruqi","family":"Huang","sequence":"additional","affiliation":[{"name":"Tsinghua Shenzhen International Graduate School, Shenzhen, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Lu","family":"Fang","sequence":"additional","affiliation":[{"name":"Tsinghua University, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2022,10,10]]},"reference":[{"key":"e_1_3_2_2_1_1","volume-title":"George Vogiatzis, Engin Tola, and Anders Bjorholm Dahl.","author":"Aan\u00e6s Henrik","year":"2016","unstructured":"Henrik Aan\u00e6s , Rasmus Ramsb\u00f8l Jensen , George Vogiatzis, Engin Tola, and Anders Bjorholm Dahl. 2016 . Large-Scale Data for Multiple-View Stereopsis. International Journal of Computer Vision ( 2016), 1--16. Henrik Aan\u00e6s, Rasmus Ramsb\u00f8l Jensen, George Vogiatzis, Engin Tola, and Anders Bjorholm Dahl. 2016. Large-Scale Data for Multiple-View Stereopsis. International Journal of Computer Vision (2016), 1--16."},{"key":"e_1_3_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-88682-2_58"},{"key":"e_1_3_2_2_3_1","volume-title":"Eddy Ilg, Tanner Schmidt, Julian Straub, Steven Lovegrove, and Richard Newcombe.","author":"Chabra Rohan","year":"2020","unstructured":"Rohan Chabra , Jan Eric Lenssen , Eddy Ilg, Tanner Schmidt, Julian Straub, Steven Lovegrove, and Richard Newcombe. 2020 . Deep Local Shapes: Learning Local SDF Priors for Detailed 3D Reconstruction . arXiv preprint arXiv:2003.10983 (2020). Rohan Chabra, Jan Eric Lenssen, Eddy Ilg, Tanner Schmidt, Julian Straub, Steven Lovegrove, and Richard Newcombe. 2020. Deep Local Shapes: Learning Local SDF Priors for Detailed 3D Reconstruction. arXiv preprint arXiv:2003.10983 (2020)."},{"key":"e_1_3_2_2_4_1","unstructured":"Rui Chen Songfang Han Jing Xu etal 2020. Visibility-Aware Point-Based Multi-View Stereo Network. IEEE Transactions on Pattern Analysis and Machine Intelligence (2020).  Rui Chen Songfang Han Jing Xu et al. 2020. Visibility-Aware Point-Based Multi-View Stereo Network. IEEE Transactions on Pattern Analysis and Machine Intelligence (2020)."},{"key":"e_1_3_2_2_5_1","volume-title":"Point-Based Multi-View Stereo Network. In IEEE Conference on Computer Vision and Pattern Recognition.","author":"Chen Rui","year":"2019","unstructured":"Rui Chen , Songfang Han , Jing Xu , and Hao Su . 2019 . Point-Based Multi-View Stereo Network. In IEEE Conference on Computer Vision and Pattern Recognition. Rui Chen, Songfang Han, Jing Xu, and Hao Su. 2019. Point-Based Multi-View Stereo Network. In IEEE Conference on Computer Vision and Pattern Recognition."},{"key":"e_1_3_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00260"},{"key":"e_1_3_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/237170.237269"},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2009.5206867"},{"key":"e_1_3_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2009.161"},{"key":"e_1_3_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.106"},{"key":"e_1_3_2_2_11_1","volume-title":"IEEE Conference on Computer Vision and Pattern Recognition. 5479--5487","author":"Galliani S.","unstructured":"S. Galliani and K. Schindler . 2016. Just Look at the Image: Viewpoint-Specific Surface Normal Prediction for Improved Multi-View Reconstruction . In IEEE Conference on Computer Vision and Pattern Recognition. 5479--5487 . S. Galliani and K. Schindler. 2016. Just Look at the Image: Viewpoint-Specific Surface Normal Prediction for Improved Multi-View Reconstruction. In IEEE Conference on Computer Vision and Pattern Recognition. 5479--5487."},{"key":"e_1_3_2_2_12_1","volume-title":"Piecewise planar and non-planar stereo for urban scene reconstruction. In 2010 IEEE computer society conference on computer vision and pattern recognition","author":"Gallup David","unstructured":"David Gallup , Jan-Michael Frahm , and Marc Pollefeys . 2010. Piecewise planar and non-planar stereo for urban scene reconstruction. In 2010 IEEE computer society conference on computer vision and pattern recognition . IEEE , 1418--1425. David Gallup, Jan-Michael Frahm, and Marc Pollefeys. 2010. Piecewise planar and non-planar stereo for urban scene reconstruction. In 2010 IEEE computer society conference on computer vision and pattern recognition. IEEE, 1418--1425."},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00491"},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"crossref","unstructured":"Michael Goesele Noah Snavely Brian Curless Hugues Hoppe and Steven Seitz. 2007. Multi-View Stereo for Community Photo Collections. 1--8. https:\/\/doi.org\/10.1109\/ICCV.2007.4408933  Michael Goesele Noah Snavely Brian Curless Hugues Hoppe and Steven Seitz. 2007. Multi-View Stereo for Community Photo Collections. 1--8. https:\/\/doi.org\/10.1109\/ICCV.2007.4408933","DOI":"10.1109\/ICCV.2007.4408933"},{"key":"e_1_3_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00030"},{"key":"e_1_3_2_2_16_1","volume-title":"Cascade Cost","author":"Gu Xiaodong","year":"2019","unstructured":"Xiaodong Gu , Zhiwen Fan , Siyu Zhu , Zuozhuo Dai , Feitong Tan , and Ping Tan . 2019. Cascade Cost Volume for High-Resolution Multi-View Stereo and Stereo Matching. arXiv preprint arXiv: 1912 .06378 ( 2019 ). Xiaodong Gu, Zhiwen Fan, Siyu Zhu, Zuozhuo Dai, Feitong Tan, and Ping Tan. 2019. Cascade Cost Volume for High-Resolution Multi-View Stereo and Stereo Matching. arXiv preprint arXiv:1912.06378 (2019)."},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"crossref","unstructured":"Xufeng Han Thomas Leung Yangqing Jia Rahul Sukthankar and Alexander. C. Berg. 2015. MatchNet: Unifying Feature and Metric Learning for Patch-Based Matching. In CVPR.  Xufeng Han Thomas Leung Yangqing Jia Rahul Sukthankar and Alexander. C. Berg. 2015. MatchNet: Unifying Feature and Metric Learning for Patch-Based Matching. In CVPR.","DOI":"10.1109\/CVPR.2015.7298948"},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/3394171.3413889"},{"key":"e_1_3_2_2_19_1","volume-title":"Proceedings of the 29th ACM International Conference on Multimedia. 2353--2361","author":"He Qian","year":"2021","unstructured":"Qian He , Desen Zhou , Bo Wan , and Xuming He . 2021 . Single Image 3D Object Estimation with Primitive Graph Networks . In Proceedings of the 29th ACM International Conference on Multimedia. 2353--2361 . Qian He, Desen Zhou, Bo Wan, and Xuming He. 2021. Single Image 3D Object Estimation with Primitive Graph Networks. In Proceedings of the 29th ACM International Conference on Multimedia. 2353--2361."},{"key":"e_1_3_2_2_20_1","volume-title":"International Journal of Computer Vision","author":"Henderson Paul","year":"2019","unstructured":"Paul Henderson and Vittorio Ferrari . 2019. Learning single-image 3D reconstruction by generative modelling of shape, pose and shading . International Journal of Computer Vision ( 2019 ), 1--20. Paul Henderson and Vittorio Ferrari. 2019. Learning single-image 3D reconstruction by generative modelling of shape, pose and shading. International Journal of Computer Vision (2019), 1--20."},{"key":"e_1_3_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.01506"},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"crossref","unstructured":"Michal Jancosek and Tomas Pajdla. 2011a. Multi-View Reconstruction Preserving Weakly-Supported Surfaces. 3121 -- 3128. https:\/\/doi.org\/10.1109\/CVPR.2011.5995693  Michal Jancosek and Tomas Pajdla. 2011a. Multi-View Reconstruction Preserving Weakly-Supported Surfaces. 3121 -- 3128. https:\/\/doi.org\/10.1109\/CVPR.2011.5995693","DOI":"10.1109\/CVPR.2011.5995693"},{"key":"e_1_3_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2011.5995693"},{"key":"e_1_3_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.253"},{"key":"e_1_3_2_2_25_1","volume-title":"SurfaceNet: An End-to-end 3D Neural Network for Very Sparse Multi-view Stereopsis","author":"Ji Mengqi","year":"2020","unstructured":"Mengqi Ji , Jinzhi Zhang , Qionghai Dai , and Lu Fang . 2020. SurfaceNet: An End-to-end 3D Neural Network for Very Sparse Multi-view Stereopsis . IEEE Transactions on Pattern Analysis and Machine Intelligence ( 2020 ). Mengqi Ji, Jinzhi Zhang, Qionghai Dai, and Lu Fang. 2020. SurfaceNet: An End-to-end 3D Neural Network for Very Sparse Multi-view Stereopsis. IEEE Transactions on Pattern Analysis and Machine Intelligence (2020)."},{"key":"e_1_3_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00604"},{"key":"e_1_3_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00744"},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01267-0_23"},{"key":"e_1_3_2_2_29_1","volume-title":"Learning a multi-view stereo machine. Advances in neural information processing systems","author":"Kar Abhishek","year":"2017","unstructured":"Abhishek Kar , Christian H\"ane, and Jitendra Malik . 2017. Learning a multi-view stereo machine. Advances in neural information processing systems , Vol. 30 ( 2017 ). Abhishek Kar, Christian H\"ane, and Jitendra Malik. 2017. Learning a multi-view stereo machine. Advances in neural information processing systems , Vol. 30 (2017)."},{"key":"e_1_3_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00411"},{"key":"e_1_3_2_2_31_1","volume-title":"End-to-End Learning of Geometry and Context for Deep Stereo Regression. CoRR","author":"Kendall Alex","year":"2017","unstructured":"Alex Kendall , Hayk Martirosyan , Saumitro Dasgupta , Peter Henry , Ryan Kennedy , Abraham Bachrach , and Adam Bry . 2017. End-to-End Learning of Geometry and Context for Deep Stereo Regression. CoRR , Vol. abs\/ 1703 .04309 ( 2017 ). Alex Kendall, Hayk Martirosyan, Saumitro Dasgupta, Peter Henry, Ryan Kennedy, Abraham Bachrach, and Adam Bry. 2017. End-to-End Learning of Geometry and Context for Deep Stereo Regression. CoRR , Vol. abs\/1703.04309 (2017)."},{"key":"e_1_3_2_2_32_1","volume-title":"Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980","author":"Kingma Diederik P","year":"2014","unstructured":"Diederik P Kingma and Jimmy Ba . 2014 . Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014). Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)."},{"key":"e_1_3_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/3072959.3073599"},{"key":"e_1_3_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/3072959.3073599"},{"key":"e_1_3_2_2_35_1","doi-asserted-by":"crossref","unstructured":"Patrick Kn\u00f6belreiter Christian Reinbacher Alexander Shekhovtsov and Thomas Pock. 2017. End-to-End Training of Hybrid CNN-CRF Models for Stereo. In 2017 Computer Vision and Pattern Recognition (CVPR).  Patrick Kn\u00f6belreiter Christian Reinbacher Alexander Shekhovtsov and Thomas Pock. 2017. End-to-End Training of Hybrid CNN-CRF Models for Stereo. In 2017 Computer Vision and Pattern Recognition (CVPR).","DOI":"10.1109\/CVPR.2017.159"},{"key":"e_1_3_2_2_36_1","volume-title":"IEEE International Conference on Computer Vision","volume":"1","author":"Kutulakos K. N.","unstructured":"K. N. Kutulakos and S. M. Seitz . 1999. A theory of shape by space carving . In IEEE International Conference on Computer Vision , Vol. 1 . 307--314. K. N. Kutulakos and S. M. Seitz. 1999. A theory of shape by space carving. In IEEE International Conference on Computer Vision, Vol. 1. 307--314."},{"key":"e_1_3_2_2_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.420"},{"key":"e_1_3_2_2_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2007.383293"},{"key":"e_1_3_2_2_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2005.44"},{"key":"e_1_3_2_2_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00276"},{"key":"e_1_3_2_2_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/1964921.1964947"},{"key":"e_1_3_2_2_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00106"},{"key":"e_1_3_2_2_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00458"},{"key":"e_1_3_2_2_44_1","volume-title":"Tat-Seng Chua, and Christian Theobalt.","author":"Liu Lingjie","year":"2020","unstructured":"Lingjie Liu , Jiatao Gu , Kyaw Zaw Lin , Tat-Seng Chua, and Christian Theobalt. 2020 . Neural sparse voxel fields. Advances in Neural Information Processing Systems , Vol. 33 (2020). Lingjie Liu, Jiatao Gu, Kyaw Zaw Lin, Tat-Seng Chua, and Christian Theobalt. 2020. Neural sparse voxel fields. Advances in Neural Information Processing Systems , Vol. 33 (2020)."},{"key":"e_1_3_2_2_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00780"},{"key":"e_1_3_2_2_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/34.910883"},{"key":"e_1_3_2_2_47_1","volume-title":"Jonathan T Barron, Alexey Dosovitskiy, and Daniel Duckworth.","author":"Martin-Brualla Ricardo","year":"2020","unstructured":"Ricardo Martin-Brualla , Noha Radwan , Mehdi SM Sajjadi , Jonathan T Barron, Alexey Dosovitskiy, and Daniel Duckworth. 2020 . NeRF in the Wild : Neural Radiance Fields for Unconstrained Photo Collections . arXiv preprint arXiv:2008.02268 (2020). Ricardo Martin-Brualla, Noha Radwan, Mehdi SM Sajjadi, Jonathan T Barron, Alexey Dosovitskiy, and Daniel Duckworth. 2020. NeRF in the Wild: Neural Radiance Fields for Unconstrained Photo Collections. arXiv preprint arXiv:2008.02268 (2020)."},{"key":"e_1_3_2_2_48_1","volume-title":"Real-Time Visibility-Based Fusion of Depth Maps. In IEEE International Conference on Computer Vision. 1--8.","author":"Merrell Paul","year":"2007","unstructured":"Paul Merrell , Amir Akbarzadeh , Liang Wang , Philippos Mordohai , Jan-Michael Frahm , Ruigang Yang , David Nist\u00e9 r, and Marc Pollefeys . 2007 . Real-Time Visibility-Based Fusion of Depth Maps. In IEEE International Conference on Computer Vision. 1--8. Paul Merrell, Amir Akbarzadeh, Liang Wang, Philippos Mordohai, Jan-Michael Frahm, Ruigang Yang, David Nist\u00e9 r, and Marc Pollefeys. 2007. Real-Time Visibility-Based Fusion of Depth Maps. In IEEE International Conference on Computer Vision. 1--8."},{"key":"e_1_3_2_2_49_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00459"},{"key":"e_1_3_2_2_50_1","volume-title":"Nerf: Representing scenes as neural radiance fields for view synthesis. arXiv preprint arXiv:2003.08934","author":"Mildenhall Ben","year":"2020","unstructured":"Ben Mildenhall , Pratul P Srinivasan , Matthew Tancik , Jonathan T Barron , Ravi Ramamoorthi , and Ren Ng . 2020 . Nerf: Representing scenes as neural radiance fields for view synthesis. arXiv preprint arXiv:2003.08934 (2020). Ben Mildenhall, Pratul P Srinivasan, Matthew Tancik, Jonathan T Barron, Ravi Ramamoorthi, and Ren Ng. 2020. Nerf: Representing scenes as neural radiance fields for view synthesis. arXiv preprint arXiv:2003.08934 (2020)."},{"key":"e_1_3_2_2_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICPR.2014.181"},{"key":"e_1_3_2_2_52_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00356"},{"key":"e_1_3_2_2_53_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00463"},{"key":"e_1_3_2_2_54_1","volume-title":"Computer Graphics Forum","author":"Oesau Sven","unstructured":"Sven Oesau , Florent Lafarge , and Pierre Alliez . 2016. Planar shape detection and regularization in tandem . In Computer Graphics Forum , Vol. 35 . Wiley Online Library , 203--215. Sven Oesau, Florent Lafarge, and Pierre Alliez. 2016. Planar shape detection and regularization in tandem. In Computer Graphics Forum, Vol. 35. Wiley Online Library, 203--215."},{"key":"e_1_3_2_2_55_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00025"},{"key":"e_1_3_2_2_56_1","volume-title":"Frank Van Den Heuvel, and George Vosselmann","author":"Rabbani Tahir","year":"2006","unstructured":"Tahir Rabbani , Frank Van Den Heuvel, and George Vosselmann . 2006 . Segmentation of point clouds using smoothness constraint. International archives of photogrammetry, remote sensing and spatial information sciences, Vol. 36 , 5 (2006), 248--253. Tahir Rabbani, Frank Van Den Heuvel, and George Vosselmann. 2006. Segmentation of point clouds using smoothness constraint. International archives of photogrammetry, remote sensing and spatial information sciences, Vol. 36, 5 (2006), 248--253."},{"key":"e_1_3_2_2_57_1","doi-asserted-by":"publisher","DOI":"10.1109\/3DV.2017.00017"},{"key":"e_1_3_2_2_58_1","volume-title":"Computer graphics forum","author":"Schnabel Ruwen","unstructured":"Ruwen Schnabel , Roland Wahl , and Reinhard Klein . 2007. Efficient RANSAC for point-cloud shape detection . In Computer graphics forum , Vol. 26 . Wiley Online Library , 214--226. Ruwen Schnabel, Roland Wahl, and Reinhard Klein. 2007. Efficient RANSAC for point-cloud shape detection. In Computer graphics forum, Vol. 26. Wiley Online Library, 214--226."},{"key":"e_1_3_2_2_59_1","volume-title":"Pixelwise View Selection for Unstructured Multi-View Stereo. In European Conference on Computer Vision (ECCV).","author":"Sch\u00f6nberger Johannes Lutz","year":"2016","unstructured":"Johannes Lutz Sch\u00f6nberger , Enliang Zheng , Marc Pollefeys , and Jan-Michael Frahm . 2016 . Pixelwise View Selection for Unstructured Multi-View Stereo. In European Conference on Computer Vision (ECCV). Johannes Lutz Sch\u00f6nberger, Enliang Zheng, Marc Pollefeys, and Jan-Michael Frahm. 2016. Pixelwise View Selection for Unstructured Multi-View Stereo. In European Conference on Computer Vision (ECCV)."},{"key":"e_1_3_2_2_60_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.272"},{"key":"e_1_3_2_2_61_1","doi-asserted-by":"crossref","unstructured":"Akihito Seki and Marc Pollefeys. 2017. SGM-Nets: Semi-Global Matching With Neural Networks. In CVPR.  Akihito Seki and Marc Pollefeys. 2017. SGM-Nets: Semi-Global Matching With Neural Networks. In CVPR.","DOI":"10.1109\/CVPR.2017.703"},{"key":"e_1_3_2_2_62_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.91"},{"key":"e_1_3_2_2_63_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2009.5459417"},{"key":"e_1_3_2_2_64_1","unstructured":"Vincent Sitzmann Michael Zollh\u00f6fer and Gordon Wetzstein. 2019. Scene representation networks: Continuous 3d-structure-aware neural scene representations. In Advances in Neural Information Processing Systems. 1121--1132.  Vincent Sitzmann Michael Zollh\u00f6fer and Gordon Wetzstein. 2019. Scene representation networks: Continuous 3d-structure-aware neural scene representations. In Advances in Neural Information Processing Systems. 1121--1132."},{"key":"e_1_3_2_2_65_1","volume-title":"Efficient large-scale multi-view stereo for ultra high-resolution image sets. Machine Vision and Applications","author":"Tola Engin","year":"2012","unstructured":"Engin Tola , Christoph Strecha , and Pascal Fua . 2012. Efficient large-scale multi-view stereo for ultra high-resolution image sets. Machine Vision and Applications ( 2012 ), 1--18. Engin Tola, Christoph Strecha, and Pascal Fua. 2012. Efficient large-scale multi-view stereo for ultra high-resolution image sets. Machine Vision and Applications (2012), 1--18."},{"key":"e_1_3_2_2_66_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.01397"},{"key":"e_1_3_2_2_67_1","doi-asserted-by":"publisher","DOI":"10.1145\/3123266.3123340"},{"key":"e_1_3_2_2_68_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01252-6_4"},{"key":"e_1_3_2_2_69_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.00545"},{"key":"e_1_3_2_2_70_1","unstructured":"Qingshan Xu and Wenbing Tao. 2020. Planar Prior Assisted PatchMatch Multi-View Stereo. In AAAI.  Qingshan Xu and Wenbing Tao. 2020. Planar Prior Assisted PatchMatch Multi-View Stereo. In AAAI."},{"key":"e_1_3_2_2_71_1","volume-title":"Point-NeRF: Point-based Neural Radiance Fields. arXiv preprint arXiv:2201.08845","author":"Xu Qiangeng","year":"2022","unstructured":"Qiangeng Xu , Zexiang Xu , Julien Philip , Sai Bi , Zhixin Shu , Kalyan Sunkavalli , and Ulrich Neumann . 2022. Point-NeRF: Point-based Neural Radiance Fields. arXiv preprint arXiv:2201.08845 ( 2022 ). Qiangeng Xu, Zexiang Xu, Julien Philip, Sai Bi, Zhixin Shu, Kalyan Sunkavalli, and Ulrich Neumann. 2022. Point-NeRF: Point-based Neural Radiance Fields. arXiv preprint arXiv:2201.08845 (2022)."},{"key":"e_1_3_2_2_72_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01237-3_47"},{"key":"e_1_3_2_2_73_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00567"},{"key":"e_1_3_2_2_74_1","volume-title":"Advances in Neural Information Processing Systems","volume":"33","author":"Yariv Lior","year":"2020","unstructured":"Lior Yariv , Yoni Kasten , Dror Moran , Meirav Galun , Matan Atzmon , Basri Ronen , and Yaron Lipman . 2020 . Multiview Neural Surface Reconstruction by Disentangling Geometry and Appearance . Advances in Neural Information Processing Systems , Vol. 33 (2020). Lior Yariv, Yoni Kasten, Dror Moran, Meirav Galun, Matan Atzmon, Basri Ronen, and Yaron Lipman. 2020. Multiview Neural Surface Reconstruction by Disentangling Geometry and Appearance. Advances in Neural Information Processing Systems , Vol. 33 (2020)."},{"key":"e_1_3_2_2_75_1","volume-title":"Pyramid Multi-view Stereo Net with Self-adaptive View Aggregation. arxiv preprint arXiv:1912.03001","author":"Yi Hongwei","year":"2019","unstructured":"Hongwei Yi , Zizhuang Wei , Mingyu Ding , Runze Zhang , Yisong Chen , Guoping Wang , and Yu-Wing Tai . 2019. Pyramid Multi-view Stereo Net with Self-adaptive View Aggregation. arxiv preprint arXiv:1912.03001 ( 2019 ). Hongwei Yi, Zizhuang Wei, Mingyu Ding, Runze Zhang, Yisong Chen, Guoping Wang, and Yu-Wing Tai. 2019. Pyramid Multi-view Stereo Net with Self-adaptive View Aggregation. arxiv preprint arXiv:1912.03001 (2019)."},{"key":"e_1_3_2_2_76_1","volume-title":"CAPRI-Net: Learning Compact CAD Shapes with Adaptive Primitive Assembly. arXiv preprint arXiv:2104.05652","author":"Yu Fenggen","year":"2021","unstructured":"Fenggen Yu , Zhiqin Chen , Manyi Li , Aditya Sanghi , Hooman Shayani , Ali Mahdavi-Amiri , and Hao Zhang . 2021. CAPRI-Net: Learning Compact CAD Shapes with Adaptive Primitive Assembly. arXiv preprint arXiv:2104.05652 ( 2021 ). Fenggen Yu, Zhiqin Chen, Manyi Li, Aditya Sanghi, Hooman Shayani, Ali Mahdavi-Amiri, and Hao Zhang. 2021. CAPRI-Net: Learning Compact CAD Shapes with Adaptive Primitive Assembly. arXiv preprint arXiv:2104.05652 (2021)."},{"key":"e_1_3_2_2_77_1","volume-title":"Fast and High Quality Fusion of Depth Maps. (01","author":"Zach Christopher","year":"2008","unstructured":"Christopher Zach . 2008. Fast and High Quality Fusion of Depth Maps. (01 2008 ). Christopher Zach. 2008. Fast and High Quality Fusion of Depth Maps. (01 2008)."},{"key":"e_1_3_2_2_78_1","volume-title":"SurRF: Unsupervised Multi-view Stereopsis by Learning Surface Radiance Field","author":"Zhang Jinzhi","year":"2021","unstructured":"Jinzhi Zhang , Mengqi Ji , Guangyu Wang , Xue Zhiwei , Shengjin Wang , and Lu Fang . 2021b. SurRF: Unsupervised Multi-view Stereopsis by Learning Surface Radiance Field . IEEE Transactions on Pattern Analysis and Machine Intelligence ( 2021 ). Jinzhi Zhang, Mengqi Ji, Guangyu Wang, Xue Zhiwei, Shengjin Wang, and Lu Fang. 2021b. SurRF: Unsupervised Multi-view Stereopsis by Learning Surface Radiance Field. IEEE Transactions on Pattern Analysis and Machine Intelligence (2021)."},{"key":"e_1_3_2_2_79_1","first-page":"1","article-title":"GigaMVS: A Benchmark for Ultra-large-scale Gigapixel-level 3D Reconstruction","volume":"01","author":"Zhang Jianing","year":"2021","unstructured":"Jianing Zhang , Jinzhi Zhang , Shi Mao , Mengqi Ji , Guangyu Wang , Zequn Chen , Tian Zhang , Xiaoyun Yuan , Qionghai Dai , and Lu Fang . 2021 c. GigaMVS: A Benchmark for Ultra-large-scale Gigapixel-level 3D Reconstruction . IEEE Transactions on Pattern Analysis & Machine Intelligence 01 (2021), 1 -- 1 . Jianing Zhang, Jinzhi Zhang, Shi Mao, Mengqi Ji, Guangyu Wang, Zequn Chen, Tian Zhang, Xiaoyun Yuan, Qionghai Dai, and Lu Fang. 2021c. GigaMVS: A Benchmark for Ultra-large-scale Gigapixel-level 3D Reconstruction. IEEE Transactions on Pattern Analysis & Machine Intelligence 01 (2021), 1--1.","journal-title":"IEEE Transactions on Pattern Analysis & Machine Intelligence"},{"key":"e_1_3_2_2_80_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.01563"},{"key":"e_1_3_2_2_81_1","doi-asserted-by":"publisher","DOI":"10.1145\/3240508.3240700"}],"event":{"name":"MM '22: The 30th ACM International Conference on Multimedia","location":"Lisboa Portugal","acronym":"MM '22","sponsor":["SIGMM ACM Special Interest Group on Multimedia"]},"container-title":["Proceedings of the 30th ACM International Conference on Multimedia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3503161.3547920","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3503161.3547920","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:00:31Z","timestamp":1750186831000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3503161.3547920"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,10,10]]},"references-count":81,"alternative-id":["10.1145\/3503161.3547920","10.1145\/3503161"],"URL":"https:\/\/doi.org\/10.1145\/3503161.3547920","relation":{},"subject":[],"published":{"date-parts":[[2022,10,10]]},"assertion":[{"value":"2022-10-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}