{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,13]],"date-time":"2026-03-13T15:02:44Z","timestamp":1773414164380,"version":"3.50.1"},"reference-count":63,"publisher":"Springer Science and Business Media LLC","issue":"6","license":[{"start":{"date-parts":[[2023,6,7]],"date-time":"2023-06-07T00:00:00Z","timestamp":1686096000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,6,7]],"date-time":"2023-06-07T00:00:00Z","timestamp":1686096000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100012492","name":"Youth Innovation Promotion Association","doi-asserted-by":"publisher","award":["E03307020D"],"award-info":[{"award-number":["E03307020D"]}],"id":[{"id":"10.13039\/501100012492","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Complex Intell. Syst."],"published-print":{"date-parts":[[2023,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Deep learning has recently been proven to deliver excellent performance in multi-view stereo (MVS). However, it is difficult for deep learning-based MVS approaches to balance their efficiency and effectiveness. Towards this end, we propose the DSC-MVSNet, a novel coarse-to-fine and end-to-end framework for more efficient and more accurate depth estimation in MVS. In particular, we propose an attention aware 3D UNet-shape network, which first uses the depthwise separable convolutions for cost volume regularization. This mechanism enables effective aggregation of information and significantly reduces the model parameters and computation by transforming the ordinary convolution on cost volume as depthwise convolution and pointwise convolution. Besides, a 3D-Attention module is proposed to alleviate the feature mismatching problem in cost volume regularization and aggregate the important information of cost volume in three dimensions (i.e. channel, space, and depth). Moreover, we propose an efficient Feature Transfer Module to upsample the low-resolution\u00a0(LR) depth map to a high-resolution\u00a0(HR) depth map to achieve higher accuracy. With extensive experiments on two benchmark datasets, i.e. DTU and Tanks &amp; Temples, we demonstrate that the parameters of our model are significantly reduced to <jats:inline-formula><jats:alternatives><jats:tex-math>$$25\\%$$<\/jats:tex-math><mml:math xmlns:mml=\"http:\/\/www.w3.org\/1998\/Math\/MathML\">\n                  <mml:mrow>\n                    <mml:mn>25<\/mml:mn>\n                    <mml:mo>%<\/mml:mo>\n                  <\/mml:mrow>\n                <\/mml:math><\/jats:alternatives><\/jats:inline-formula> of the state-of-the-art model MVSNet. Besides, our method outperforms or maintains on par accuracy with the state-of-the-art models. Our source code is available at <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"https:\/\/github.com\/zs670980918\/DSC-MVSNet\">https:\/\/github.com\/zs670980918\/DSC-MVSNet<\/jats:ext-link>.<\/jats:p>","DOI":"10.1007\/s40747-023-01106-3","type":"journal-article","created":{"date-parts":[[2023,6,7]],"date-time":"2023-06-07T04:34:31Z","timestamp":1686112471000},"page":"6953-6969","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":26,"title":["DSC-MVSNet: attention aware cost volume regularization based on depthwise separable convolution for multi-view stereo"],"prefix":"10.1007","volume":"9","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7312-6892","authenticated-orcid":false,"given":"Song","family":"Zhang","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3494-3686","authenticated-orcid":false,"given":"Zhiwei","family":"Wei","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1425-4162","authenticated-orcid":false,"given":"Wenjia","family":"Xu","sequence":"additional","affiliation":[]},{"given":"Lili","family":"Zhang","sequence":"additional","affiliation":[]},{"given":"Yang","family":"Wang","sequence":"additional","affiliation":[]},{"given":"Xin","family":"Zhou","sequence":"additional","affiliation":[]},{"given":"Junyi","family":"Liu","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,6,7]]},"reference":[{"key":"1106_CR1","doi-asserted-by":"publisher","first-page":"153","DOI":"10.1007\/s11263-016-0902-9","volume":"120","author":"H Aan\u00e6s","year":"2016","unstructured":"Aan\u00e6s H, Jensen RR, Vogiatzis G, Tola E, Dahl AB (2016) Large-scale data for multiple-view stereopsis. Int J Comput Vision 120:153\u2013168","journal-title":"Int J Comput Vision"},{"key":"1106_CR2","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1561\/0600000052","volume":"9","author":"Y Furukawa","year":"2015","unstructured":"Furukawa Y, Hern\u00e1ndez C (2015) Multi-view stereo: a tutorial. Found Trends Comput Graph Vis 9:1\u2013148","journal-title":"Found Trends Comput Graph Vis"},{"issue":"11","key":"1106_CR3","doi-asserted-by":"publisher","first-page":"1611","DOI":"10.1109\/TCSVT.2012.2202185","volume":"22","author":"H Kim","year":"2012","unstructured":"Kim H, Guillemaut J-Y, Takai T, Sarim M, Hilton A (2012) Outdoor dynamic 3-d scene reconstruction. IEEE Trans Circuits Syst Video Technol 22(11):1611\u20131622. https:\/\/doi.org\/10.1109\/TCSVT.2012.2202185","journal-title":"IEEE Trans Circuits Syst Video Technol"},{"issue":"6","key":"1106_CR4","doi-asserted-by":"publisher","first-page":"929","DOI":"10.1109\/TCSVT.2013.2290575","volume":"24","author":"G-T Michailidis","year":"2014","unstructured":"Michailidis G-T, Pajarola R, Andreadis I (2014) High performance stereo system for dense 3-d reconstruction. IEEE Trans Circuits Syst Video Technol 24(6):929\u2013941. https:\/\/doi.org\/10.1109\/TCSVT.2013.2290575","journal-title":"IEEE Trans Circuits Syst Video Technol"},{"key":"1106_CR5","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2022.108395","volume":"242","author":"Q Yu","year":"2022","unstructured":"Yu Q, Yang C, Wei H (2022) Part-wise atlasnet for 3d point cloud reconstruction from a single image. Knowl-Based Syst 242:108395","journal-title":"Knowl-Based Syst"},{"key":"1106_CR6","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2021.107057","volume":"223","author":"P Wang","year":"2021","unstructured":"Wang P, Liu L, Zhang H, Wang T (2021) Cgnet: a cascaded generative network for dense point cloud reconstruction from a single image. Knowl-Based Syst 223:107057","journal-title":"Knowl-Based Syst"},{"key":"1106_CR7","unstructured":"Seitz SM (2002) Photorealistic scene reconstruction by voxel coloring. In: Proceedings., 1997 IEEE Computer Society Conference On Computer Vision and Pattern Recognition, 1997"},{"key":"1106_CR8","doi-asserted-by":"publisher","first-page":"1362","DOI":"10.1109\/TPAMI.2009.161","volume":"32","author":"Y Furukawa","year":"2010","unstructured":"Furukawa Y, Ponce J (2010) Accurate, dense, and robust multiview stereopsis. IEEE Trans Pattern Anal Mach Intell 32:1362\u20131376","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"issue":"4","key":"1106_CR9","doi-asserted-by":"publisher","first-page":"78","DOI":"10.1145\/3072959.3073599","volume":"36","author":"A Knapitsch","year":"2017","unstructured":"Knapitsch A, Park J, Zhou Q, Koltun V (2017) Tanks and temples: benchmarking large-scale scene reconstruction. ACM Trans Graph 36(4):78\u201317813","journal-title":"ACM Trans Graph"},{"key":"1106_CR10","doi-asserted-by":"crossref","unstructured":"Yao Y, Luo Z, Li S, Fang T, Quan L (2018) Mvsnet: depth inference for unstructured multi-view stereo. arXiv:1804.02505","DOI":"10.1007\/978-3-030-01237-3_47"},{"key":"1106_CR11","doi-asserted-by":"crossref","unstructured":"Luo K, Guan T, Ju L, Huang H, Luo Y (2019) P-mvsnet: learning patch-wise matching confidence aggregation for multi-view stereo. In: 2019 IEEE\/CVF International Conference on Computer Vision (ICCV), pp 10451\u201310460","DOI":"10.1109\/ICCV.2019.01055"},{"key":"1106_CR12","doi-asserted-by":"crossref","unstructured":"Gu X, Fan Z, Zhu S, Dai Z, Tan F, Tan P (2020) Cascade cost volume for high-resolution multi-view stereo and stereo matching. In: 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp 2492\u20132501","DOI":"10.1109\/CVPR42600.2020.00257"},{"key":"1106_CR13","doi-asserted-by":"crossref","unstructured":"Yu Z, Gao S (2020) Fast-mvsnet: sparse-to-dense multi-view stereo with learned propagation and Gauss\u2013Newton refinement. In: 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp 1946\u20131955","DOI":"10.1109\/CVPR42600.2020.00202"},{"key":"1106_CR14","doi-asserted-by":"crossref","unstructured":"Cheng S, Xu Z, Zhu S, Li Z, Li EL, Ramamoorthi R, Su H (2020) Deep stereo using adaptive thin volume representation with uncertainty awareness. In: 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp 2521\u20132531","DOI":"10.1109\/CVPR42600.2020.00260"},{"key":"1106_CR15","doi-asserted-by":"crossref","unstructured":"Yao Y, Luo Z, Li S, Shen T, Fang T, Quan L (2019) Recurrent mvsnet for high-resolution multi-view stereo depth inference. In: 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp 5520\u20135529","DOI":"10.1109\/CVPR.2019.00567"},{"key":"1106_CR16","doi-asserted-by":"crossref","unstructured":"Yan J, Wei Z, Yi H, Ding M, Zhang R, Chen Y, Wang G, Tai Y-W (2020) Dense hybrid recurrent multi-view stereo net with dynamic consistency checking. arXiv:2007.10872","DOI":"10.1007\/978-3-030-58548-8_39"},{"key":"1106_CR17","doi-asserted-by":"crossref","unstructured":"Mikolov T, Karafi\u00e1t M, Burget L, Cernock\u00fd JH, Khudanpur S (2010) Recurrent neural network based language model. In: INTERSPEECH","DOI":"10.21437\/Interspeech.2010-343"},{"key":"1106_CR18","doi-asserted-by":"publisher","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","volume":"9","author":"S Hochreiter","year":"1997","unstructured":"Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9:1735\u20131780","journal-title":"Neural Comput"},{"key":"1106_CR19","unstructured":"Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. CoRR arXiv:1409.1556"},{"key":"1106_CR20","unstructured":"Howard AG, Zhu M, Chen B, Kalenichenko D, Wang W, Weyand T, Andreetto M, Adam H (2017)Mobilenets: efficient convolutional neural networks for mobile vision applications. arXiv:1704.04861"},{"key":"1106_CR21","doi-asserted-by":"crossref","unstructured":"Sandler M, Howard AG, Zhu M, Zhmoginov A, Chen L-C (2018) Mobilenetv2: inverted residuals and linear bottlenecks. In: 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp 4510\u20134520","DOI":"10.1109\/CVPR.2018.00474"},{"key":"1106_CR22","doi-asserted-by":"crossref","unstructured":"Sch\u00f6nberger JL, Zheng E, Frahm J-M, Pollefeys M (2016) Pixelwise view selection for unstructured multi-view stereo. In: ECCV","DOI":"10.1007\/978-3-319-46487-9_31"},{"key":"1106_CR23","doi-asserted-by":"crossref","unstructured":"Galliani S, Lasinger K, Schindler K (2015) Massively parallel multiview stereopsis by surface normal diffusion. In: 2015 IEEE International Conference on Computer Vision (ICCV), pp 873\u2013881","DOI":"10.1109\/ICCV.2015.106"},{"key":"1106_CR24","doi-asserted-by":"crossref","unstructured":"Goesele M, Snavely N, Curless B, Hoppe H, Seitz SM (2007) Multi-view stereo for community photo collections. In: 2007 IEEE 11th International Conference on Computer Vision, pp 1\u20138","DOI":"10.1109\/ICCV.2007.4408933"},{"key":"1106_CR25","doi-asserted-by":"publisher","first-page":"903","DOI":"10.1007\/s00138-011-0346-8","volume":"23","author":"E Tola","year":"2011","unstructured":"Tola E, Strecha C, Fua P (2011) Efficient large-scale multi-view stereo for ultra high-resolution image sets. Mach Vis Appl 23:903\u2013920","journal-title":"Mach Vis Appl"},{"key":"1106_CR26","doi-asserted-by":"crossref","unstructured":"Ji M, Gall J, Zheng H, Liu Y, Fang L (2017) Surfacenet: an end-to-end 3d neural network for multiview stereopsis. In: 2017 IEEE International Conference on Computer Vision (ICCV), pp 2326\u20132334","DOI":"10.1109\/ICCV.2017.253"},{"key":"1106_CR27","doi-asserted-by":"publisher","first-page":"199","DOI":"10.1023\/A:1008191222954","volume":"38","author":"KN Kutulakos","year":"2004","unstructured":"Kutulakos KN, Seitz SM (2004) A theory of shape by space carving. Int J Comput Vis 38:199\u2013218","journal-title":"Int J Comput Vis"},{"key":"1106_CR28","unstructured":"Kar A, H\u00e4ne C, Malik J (2017) Learning a multi-view stereo machine. In: NIPS"},{"key":"1106_CR29","doi-asserted-by":"publisher","first-page":"418","DOI":"10.1109\/TPAMI.2005.44","volume":"27","author":"M Lhuillier","year":"2005","unstructured":"Lhuillier M, Quan L (2005) A quasi-dense approach to surface reconstruction from uncalibrated images. IEEE Trans Pattern Anal Mach Intell 27:418\u2013433","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"issue":"7","key":"1106_CR30","doi-asserted-by":"publisher","first-page":"1540","DOI":"10.1109\/TCSVT.2017.2682887","volume":"28","author":"H Rezaee Kaviani","year":"2018","unstructured":"Rezaee Kaviani H, Shirani S (2018) An adaptive patch-based reconstruction scheme for view synthesis by disparity estimation using optical flow. IEEE Trans Circuits Syst Video Technol 28(7):1540\u20131552. https:\/\/doi.org\/10.1109\/TCSVT.2017.2682887","journal-title":"IEEE Trans Circuits Syst Video Technol"},{"issue":"3","key":"1106_CR31","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2021.106968","volume":"221","author":"Y Wang","year":"2021","unstructured":"Wang Y, Luo K, Chen Z, Ju L, Guan T (2021) Deepfusion: a simple way to improve traditional multi-view stereo methods using deep learning. Knowl-Based Syst 221(3):106968","journal-title":"Knowl-Based Syst"},{"key":"1106_CR32","doi-asserted-by":"crossref","unstructured":"Chen R, Han S, Xu J, Su H (2019) Point-based multi-view stereo network. In: 2019 IEEE\/CVF International Conference on Computer Vision (ICCV), pp 1538\u20131547","DOI":"10.1109\/ICCV.2019.00162"},{"key":"1106_CR33","doi-asserted-by":"crossref","unstructured":"He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 770\u2013778","DOI":"10.1109\/CVPR.2016.90"},{"key":"1106_CR34","doi-asserted-by":"crossref","unstructured":"Girshick RB (2015) Fast r-cnn. In: 2015 IEEE International Conference on Computer Vision (ICCV), pp 1440\u20131448","DOI":"10.1109\/ICCV.2015.169"},{"key":"1106_CR35","doi-asserted-by":"publisher","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","volume":"39","author":"S Ren","year":"2015","unstructured":"Ren S, He K, Girshick RB, Sun J (2015) Faster r-cnn: towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell 39:1137\u20131149","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"1106_CR36","doi-asserted-by":"publisher","first-page":"834","DOI":"10.1109\/TPAMI.2017.2699184","volume":"40","author":"L-C Chen","year":"2018","unstructured":"Chen L-C, Papandreou G, Kokkinos I, Murphy KP, Yuille AL (2018) Deeplab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE Trans Pattern Anal Mach Intell 40:834\u2013848","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"1106_CR37","doi-asserted-by":"publisher","first-page":"640","DOI":"10.1109\/TPAMI.2016.2572683","volume":"39","author":"E Shelhamer","year":"2017","unstructured":"Shelhamer E, Long J, Darrell T (2017) Fully convolutional networks for semantic segmentation. IEEE Trans Pattern Anal Mach Intell 39:640\u2013651","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"1106_CR38","unstructured":"Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser L, Polosukhin I (2017) Attention is all you need. In: Guyon I, von Luxburg U, Bengio S, Wallach HM, Fergus R, Vishwanathan SVN, Garnett R (eds) Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, December 4\u20139, 2017, Long Beach, CA, USA, pp 5998\u20136008"},{"key":"1106_CR39","doi-asserted-by":"crossref","unstructured":"Dai T, Cai J, Zhang Y, Xia ST, Zhang L (2019) Second-order attention network for single image super-resolution. In: 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","DOI":"10.1109\/CVPR.2019.01132"},{"key":"1106_CR40","doi-asserted-by":"crossref","unstructured":"Zhang H, Dana K, Shi J, Zhang Z, Wang X, Tyagi A, Agrawal A (2018) Context encoding for semantic segmentation. IEEE","DOI":"10.1109\/CVPR.2018.00747"},{"key":"1106_CR41","doi-asserted-by":"crossref","unstructured":"Yu C, Wang J, Peng C, Gao C, Yu G, Sang N (2018) Learning a discriminative feature network for semantic segmentation. In: 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","DOI":"10.1109\/CVPR.2018.00199"},{"key":"1106_CR42","doi-asserted-by":"crossref","unstructured":"Li Y, Chen X, Zhu Z, Xie L, Wang X (2019) Attention-guided unified network for panoptic segmentation. In: 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","DOI":"10.1109\/CVPR.2019.00719"},{"key":"1106_CR43","doi-asserted-by":"crossref","unstructured":"FeI W, Jiang M, Chen Q, Yang S, Tang X (2017) Residual attention network for image classification. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)","DOI":"10.1109\/CVPR.2017.683"},{"key":"1106_CR44","unstructured":"Jie H, Li S, Gang S, Albanie S (2017) Squeeze-and-excitation networks. IEEE Trans Pattern Anal Mach Intell, p 99"},{"key":"1106_CR45","doi-asserted-by":"crossref","unstructured":"Woo S, Park J, Lee J-Y, Kweon I-S (2018) Cbam: convolutional block attention module. In: ECCV","DOI":"10.1007\/978-3-030-01234-2_1"},{"key":"1106_CR46","unstructured":"Yang Q (2012) A non-local cost aggregation method for stereo matching. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)"},{"key":"1106_CR47","doi-asserted-by":"publisher","first-page":"1153","DOI":"10.1109\/TASSP.1981.1163711","volume":"29","author":"R Keys","year":"1981","unstructured":"Keys R (1981) Cubic convolution interpolation for digital image processing. IEEE Trans Acoust Speech Signal Process 29:1153\u20131160","journal-title":"IEEE Trans Acoust Speech Signal Process"},{"key":"1106_CR48","doi-asserted-by":"crossref","unstructured":"Shi W, Caballero J, Husz\u00e1r F, Totz J, Wang Z (2016) Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network. IEEE","DOI":"10.1109\/CVPR.2016.207"},{"key":"1106_CR49","doi-asserted-by":"crossref","unstructured":"Yang J, Mao W, \u00c1lvarez JM, Liu M (2020) Cost volume pyramid based depth inference for multi-view stereo. In: 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp 4876\u20134885","DOI":"10.1109\/CVPR42600.2020.00493"},{"key":"1106_CR50","unstructured":"Ioffe S, Szegedy C (2015) Batch normalization: accelerating deep network training by reducing internal covariate shift. ArXiv arXiv:1502.03167"},{"key":"1106_CR51","doi-asserted-by":"crossref","unstructured":"Jensen RR, Dahl A, Vogiatzis G, Tola E, Aan\u00e6s H (2014) Large scale multi-view stereopsis evaluation. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp 406\u2013413","DOI":"10.1109\/CVPR.2014.59"},{"key":"1106_CR52","doi-asserted-by":"publisher","first-page":"29","DOI":"10.1145\/2487228.2487237","volume":"32","author":"MM Kazhdan","year":"2013","unstructured":"Kazhdan MM, Hoppe H (2013) Screened Poisson surface reconstruction. ACM Trans Graph 32:29\u201312913","journal-title":"ACM Trans Graph"},{"key":"1106_CR53","doi-asserted-by":"crossref","unstructured":"Yi H, Wei Z, Ding M, Zhang R, Chen Y, Wang G, Tai Y-W (2020) Pyramid multi-view stereo net with self-adaptive view aggregation. arXiv:1912.03001","DOI":"10.1007\/978-3-030-58545-7_44"},{"key":"1106_CR54","doi-asserted-by":"publisher","first-page":"7261","DOI":"10.1109\/TIP.2020.3000611","volume":"29","author":"P-H Chen","year":"2020","unstructured":"Chen P-H, Yang H-C, Chen K-W, Chen Y-S (2020) Mvsnet++: learning depth-based attention pyramid features for multi-view stereo. IEEE Trans Image Process 29:7261\u20137273","journal-title":"IEEE Trans Image Process"},{"key":"1106_CR55","doi-asserted-by":"crossref","unstructured":"Campbell NDF, Vogiatzis G, Hern\u00e1ndez C, Cipolla R (2008) Using multiple hypotheses to improve depth-maps for multi-view stereo. In: ECCV","DOI":"10.1007\/978-3-540-88682-2_58"},{"key":"1106_CR56","doi-asserted-by":"crossref","unstructured":"Xue Y, Chen J, Wan W, Huang Y, Yu C, Li T, Bao J (2019) Mvscrf: learning multi-view stereo with conditional random fields. In: 2019 IEEE\/CVF International Conference on Computer Vision (ICCV), pp 4311\u20134320","DOI":"10.1109\/ICCV.2019.00441"},{"key":"1106_CR57","unstructured":"Zhang J, Yao Y, Li S, Luo Z, Fang T (2020) Visibility-aware multi-view stereo network. In: 31st British Machine Vision Conference 2020, BMVC 2020, Virtual Event, UK, September 7\u201310, 2020. BMVA Press"},{"key":"1106_CR58","doi-asserted-by":"crossref","unstructured":"Wei Z, Zhu Q, Min C, Chen Y, Wang G (2021) Aa-rmvsnet: Adaptive aggregation recurrent multi-view stereo network. arXiv:2108.03824","DOI":"10.1109\/ICCV48922.2021.00613"},{"key":"1106_CR59","doi-asserted-by":"crossref","unstructured":"Wang F, Galliani S, Vogel C, Speciale P, Pollefeys M (2021) Patchmatchnet: learned multi-view patchmatch stereo. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp 14194\u201314203","DOI":"10.1109\/CVPR46437.2021.01397"},{"key":"1106_CR60","unstructured":"Pix4d: https:\/\/pix4d.com\/"},{"key":"1106_CR61","unstructured":"Moulon P, Monasse RMP (2014) Openmvg. An open multiple view geometry library. https:\/\/github.com\/openMVG\/openMVG"},{"key":"1106_CR62","unstructured":"Cernea D (2015) Openmvs: open multiple view stereovision. https:\/\/github.com\/cdcseacave\/openMVS"},{"key":"1106_CR63","doi-asserted-by":"crossref","unstructured":"Xu Q, Tao W (2019) Learning inverse depth regression for multi-view stereo with correlation cost volume","DOI":"10.1609\/aaai.v34i07.6939"}],"container-title":["Complex &amp; Intelligent Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-023-01106-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s40747-023-01106-3\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-023-01106-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,10,27]],"date-time":"2023-10-27T19:23:20Z","timestamp":1698434600000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s40747-023-01106-3"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,6,7]]},"references-count":63,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2023,12]]}},"alternative-id":["1106"],"URL":"https:\/\/doi.org\/10.1007\/s40747-023-01106-3","relation":{},"ISSN":["2199-4536","2198-6053"],"issn-type":[{"value":"2199-4536","type":"print"},{"value":"2198-6053","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,6,7]]},"assertion":[{"value":"15 November 2022","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"8 May 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"7 June 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}