{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,19]],"date-time":"2026-03-19T14:36:55Z","timestamp":1773931015434,"version":"3.50.1"},"reference-count":62,"publisher":"MDPI AG","issue":"17","license":[{"start":{"date-parts":[[2023,8,31]],"date-time":"2023-08-31T00:00:00Z","timestamp":1693440000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Hubei Provincial Key R&amp;D Program Projects","award":["2022BAA035"],"award-info":[{"award-number":["2022BAA035"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Automatic reconstruction of surfaces from satellite imagery is a hot topic in computer vision and photogrammetry. State-of-the-art reconstruction methods typically produce 2.5D elevation data. In contrast, we propose a one-stage method directly generating a 3D mesh model from multi-view satellite imagery. We introduce a novel Sat-Mesh approach for satellite implicit surface reconstruction: We represent the scene as a continuous signed distance function (SDF) and leverage a volume rendering framework to learn the SDF values. To address the challenges posed by lighting variations and inconsistent appearances in satellite imagery, we incorporate a latent vector in the network architecture to encode image appearances. Furthermore, we introduce a multi-view stereo constraint to enhance surface quality. This constraint minimizes the similarity between image patches to optimize the position and orientation of the SDF surface. Experimental results demonstrate that our method achieves superior visual quality and quantitative accuracy in generating mesh models. Moreover, our approach can learn seasonal variations in satellite imagery, resulting in texture mesh models with different and consistent seasonal appearances.<\/jats:p>","DOI":"10.3390\/rs15174297","type":"journal-article","created":{"date-parts":[[2023,8,31]],"date-time":"2023-08-31T11:41:18Z","timestamp":1693482078000},"page":"4297","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":14,"title":["Sat-Mesh: Learning Neural Implicit Surfaces for Multi-View Satellite Reconstruction"],"prefix":"10.3390","volume":"15","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5527-6299","authenticated-orcid":false,"given":"Yingjie","family":"Qu","sequence":"first","affiliation":[{"name":"School of Geodesy and Geomatics, Wuhan University, Wuhan 430079, China"}]},{"given":"Fei","family":"Deng","sequence":"additional","affiliation":[{"name":"School of Geodesy and Geomatics, Wuhan University, Wuhan 430079, China"},{"name":"Wuhan Tianjihang Information Technology Co., Ltd., Wuhan 430010, China"}]}],"member":"1968","published-online":{"date-parts":[[2023,8,31]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Zhao, Q., Yu, L., Du, Z., Peng, D., Hao, P., Zhang, Y., and Gong, P. (2022). An overview of the applications of earth observation satellite data: Impacts and future trends. Remote Sens., 14.","DOI":"10.3390\/rs14081863"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"537","DOI":"10.1029\/2018EA000409","article-title":"The Ames Stereo Pipeline: NASA\u2019s open source software for deriving and processing terrain data","volume":"5","author":"Beyer","year":"2018","journal-title":"Earth Space Sci."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"14","DOI":"10.1186\/s40965-017-0027-2","article-title":"MicMac\u2014A free, open-source solution for photogrammetry","volume":"2","author":"Rupnik","year":"2017","journal-title":"Open Geospat. Data Softw. Stand."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"77","DOI":"10.5194\/isprs-annals-III-1-77-2016","article-title":"Rpc stereo processor (rsp)\u2014A software package for digital surface model and orthophoto generation from satellite stereo imagery","volume":"3","author":"Qin","year":"2016","journal-title":"ISPRS Ann. Photogramm. Remote Sens. Spat. Inf. Sci."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Facciolo, G., De Franchis, C., and Meinhardt-Llopis, E. (2017, January 21\u201326). Automatic 3D reconstruction from multi-date satellite images. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, Honolulu, HI, USA.","DOI":"10.1109\/CVPRW.2017.198"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"328","DOI":"10.1109\/TPAMI.2007.1166","article-title":"Stereo processing by semiglobal matching and mutual information","volume":"30","author":"Hirschmuller","year":"2007","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"52","DOI":"10.1016\/j.isprsjprs.2020.05.001","article-title":"Photometric multi-view mesh refinement for high-resolution satellite images","volume":"166","author":"Rothermel","year":"2020","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Bullinger, S., Bodensteiner, C., and Arens, M. (2021). 3D Surface Reconstruction From Multi-Date Satellite Images. arXiv.","DOI":"10.5194\/isprs-archives-XLIII-B2-2021-313-2021"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"3005","DOI":"10.1080\/01431161.2023.2214275","article-title":"GEMVS: A novel approach for automatic 3D reconstruction from uncalibrated multi-view Google Earth images using multi-view stereo and projective to metric 3D homography transformation","volume":"44","author":"Park","year":"2023","journal-title":"Int. J. Remote Sens."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"99","DOI":"10.1145\/3503250","article-title":"Nerf: Representing scenes as neural radiance fields for view synthesis","volume":"65","author":"Mildenhall","year":"2021","journal-title":"Commun. ACM"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Martin-Brualla, R., Radwan, N., Sajjadi, M.S., Barron, J.T., Dosovitskiy, A., and Duckworth, D. (2021, January 20\u201325). Nerf in the wild: Neural radiance fields for unconstrained photo collections. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.00713"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Barron, J.T., Mildenhall, B., Verbin, D., Srinivasan, P.P., and Hedman, P. (2022, January 18\u201324). Mip-nerf 360: Unbounded anti-aliased neural radiance fields. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.00539"},{"key":"ref_13","first-page":"1","article-title":"Instant neural graphics primitives with a multiresolution hash encoding","volume":"41","author":"Evans","year":"2022","journal-title":"ACM Trans. Graph."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Derksen, D., and Izzo, D. (2021, January 19\u201325). Shadow neural radiance fields for multi-view satellite photogrammetry. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPRW53098.2021.00126"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Mar\u00ed, R., Facciolo, G., and Ehret, T. (2022, January 19\u201320). Sat-nerf: Learning multi-view satellite photogrammetry with transient objects and shadow modeling using rpc cameras. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA.","DOI":"10.1109\/CVPRW56347.2022.00137"},{"key":"ref_16","first-page":"3403","article-title":"Geo-neus: Geometry-consistent neural implicit surfaces learning for multi-view reconstruction","volume":"35","author":"Fu","year":"2022","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Sun, J., Chen, X., Wang, Q., Li, Z., Averbuch-Elor, H., Zhou, X., and Snavely, N. (2022, January 7\u201311). Neural 3d reconstruction in the wild. Proceedings of the ACM SIGGRAPH 2022 Conference Proceedings, Vancouver, BC, Canada.","DOI":"10.1145\/3528233.3530718"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Wang, Y., Skorokhodov, I., and Wonka, P. (2023, January 17\u201324). PET-NeuS: Positional Encoding Tri-Planes for Neural Surfaces. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada.","DOI":"10.1109\/CVPR52729.2023.01212"},{"key":"ref_19","unstructured":"Wang, P., Liu, L., Liu, Y., Theobalt, C., Komura, T., and Wang, W. (2021). Neus: Learning neural implicit surfaces by volume rendering for multi-view reconstruction. arXiv."},{"key":"ref_20","unstructured":"Bojanowski, P., Joulin, A., Lopez-Paz, D., and Szlam, A. (2017). Optimizing the latent space of generative networks. arXiv."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Zhang, K., Snavely, N., and Sun, J. (2019, January 27\u201328). Leveraging vision reconstruction pipelines for satellite imagery. Proceedings of the IEEE\/CVF International Conference on Computer Vision Workshops, Seoul, Republic of Korea.","DOI":"10.1109\/ICCVW.2019.00269"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"165","DOI":"10.1145\/964965.808594","article-title":"Ray tracing volume densities","volume":"18","author":"Kajiya","year":"1984","journal-title":"ACM SIGGRAPH Comput. Graph."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Hartley, R., and Zisserman, A. (2003). Multiple View Geometry in Computer Vision, Cambridge University Press.","DOI":"10.1017\/CBO9780511811685"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Zabih, R., and Woodfill, J. (1994, January 2\u20136). Non-parametric local transforms for computing visual correspondence. Proceedings of the Computer Vision\u2014ECCV\u201994: Third European Conference on Computer Vision, Stockholm, Sweden.","DOI":"10.1007\/BFb0028345"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Facciolo, G., De Franchis, C., and Meinhardt, E. (2015, January 7\u201310). MGM: A significantly more global matching for stereovision. Proceedings of the BMVC 2015, Swansea, UK.","DOI":"10.5244\/C.29.90"},{"key":"ref_26","unstructured":"Rothermel, M., Wenzel, K., Fritsch, D., and Haala, N. (2012, January 4\u20135). SURE: Photogrammetric surface reconstruction from imagery. Proceedings of the LC3D Workshop, Berlin, Germany."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"67","DOI":"10.5194\/isprs-archives-XLII-2-W13-67-2019","article-title":"FOSS4G DATE for DSM generation: Sensitivity analysis of the semi-global block matching parameters","volume":"42","author":"Lastilla","year":"2019","journal-title":"Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"G\u00f3mez, A., Randall, G., Facciolo, G., and von Gioi, R.G. (2022, January 3\u20138). An experimental comparison of multi-view stereo approaches on satellite images. Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision, Waikoloa, HI, USA.","DOI":"10.1109\/WACV51458.2022.00078"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"He, S., Zhou, R., Li, S., Jiang, S., and Jiang, W. (2021). Disparity estimation of high-resolution remote sensing images with dual-scale matching network. Remote Sens., 13.","DOI":"10.3390\/rs13245050"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"501","DOI":"10.5201\/ipol.2022.435","article-title":"Disparity Estimation Networks for Aerial and High-Resolution Satellite Images: A Review","volume":"12","author":"Ehret","year":"2022","journal-title":"Image Process. Line"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Zhang, F., Prisacariu, V., Yang, R., and Torr, P.H. (2019, January 15\u201320). Ga-net: Guided aggregation net for end-to-end stereo matching. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00027"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Chang, J.-R., and Chen, Y.-S. (2018, January 18\u201323). Pyramid stereo matching network. Proceedings of the IEEE conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00567"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Yang, G., Manela, J., Happold, M., and Ramanan, D. (2019, January 15\u201320). Hierarchical deep stereo matching on high-resolution images. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00566"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Mar\u00ed, R., Facciolo, G., and Ehret, T. (2023, January 17\u201324). Multi-Date Earth Observation NeRF: The Detail Is in the Shadows. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada.","DOI":"10.1109\/CVPRW59228.2023.00197"},{"key":"ref_35","first-page":"25018","article-title":"Monosdf: Exploring monocular geometric cues for neural implicit surface reconstruction","volume":"35","author":"Yu","year":"2022","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Li, Z., M\u00fcller, T., Evans, A., Taylor, R.H., Unberath, M., Liu, M.-Y., and Lin, C.-H. (2023, January 17\u201324). Neuralangelo: High-Fidelity Neural Surface Reconstruction. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada.","DOI":"10.1109\/CVPR52729.2023.00817"},{"key":"ref_37","unstructured":"Wang, Y., Han, Q., Habermann, M., Daniilidis, K., Theobalt, C., and Liu, L. (2022). Neus2: Fast learning of neural implicit surfaces for multi-view reconstruction. arXiv."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"105","DOI":"10.1007\/s41095-021-0223-y","article-title":"Learning conditional photometric stereo with high-resolution features","volume":"8","author":"Ju","year":"2022","journal-title":"Comput. Vis. Media"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Chen, G., Han, K., and Wong, K.-Y.K. (2018, January 8\u201314). PS-FCN: A flexible learning framework for photometric stereo. Proceedings of the European conference on computer vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01240-3_1"},{"key":"ref_40","first-page":"10306","article-title":"Gps-net: Graph-based photometric stereo network","volume":"33","author":"Yao","year":"2020","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Lv, B., Liu, J., Wang, P., and Yasir, M. (2022). DSM Generation from Multi-View High-Resolution Satellite Images Based on the Photometric Mesh Refinement Method. Remote Sens., 14.","DOI":"10.3390\/rs14246259"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Qu, Y., Yan, Q., Yang, J., Xiao, T., and Deng, F. (2022). Total Differential Photometric Mesh Refinement with Self-Adapted Mesh Denoising. Photonics, 10.","DOI":"10.3390\/photonics10010020"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Schonberger, J.L., and Frahm, J.-M. (2016, January 27\u201330). Structure-from-motion revisited. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.445"},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Sch\u00f6nberger, J.L., Zheng, E., Frahm, J.-M., and Pollefeys, M. (2016, January 11\u201314). Pixelwise view selection for unstructured multi-view stereo. Proceedings of the Computer Vision\u2013ECCV 2016: 14th European Conference, Amsterdam, The Netherlands. Part III 14.","DOI":"10.1007\/978-3-319-46487-9_31"},{"key":"ref_45","first-page":"233","article-title":"Sequential Cycle Consistency Inference for Eliminating Incorrect Relative Orientations in Structure from Motion","volume":"89","author":"Xiao","year":"2021","journal-title":"PFG\u2013J. Photogramm. Remote Sens. Geoinf. Sci."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1561\/0600000052","article-title":"Multi-view stereo: A tutorial","volume":"Volume 9","author":"Furukawa","year":"2015","journal-title":"Foundations and Trends\u00ae in Computer Graphics and Vision"},{"key":"ref_47","unstructured":"Romanoni, A., and Matteucci, M. (November, January 27). Tapa-mvs: Textureless-aware patchmatch multi-view stereo. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea."},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Xu, Q., and Tao, W. (2019, January 15\u201320). Multi-Scale Geometric Consistency Guided Multi-View Stereo. Proceedings of the Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00563"},{"key":"ref_49","unstructured":"Lorensen, W.E., and Cline, H.E. (1998). Seminal Graphics: Pioneering Efforts That Shaped the Field, Association for Computing Machinery."},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Furukawa, Y., and Ponce, J. (2007, January 18\u201323). Accurate, dense, and robust multi-view stereopsis (pmvs). Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Minneapolis, MN, USA.","DOI":"10.1109\/CVPR.2007.383246"},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Park, J.J., Florence, P., Straub, J., Newcombe, R., and Lovegrove, S. (2019, January 15\u201320). Deepsdf: Learning continuous signed distance functions for shape representation. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00025"},{"key":"ref_52","unstructured":"Salimans, T., and Kingma, D.P. (2016). Weight normalization: A simple reparameterization to accelerate training of deep neural networks. arXiv."},{"key":"ref_53","unstructured":"Gropp, A., Yariv, L., Haim, N., Atzmon, M., and Lipman, Y. (2020). Implicit geometric regularization for learning shapes. arXiv."},{"key":"ref_54","unstructured":"Zhou, Q.-Y., Park, J., and Koltun, V. (2018). Open3D: A modern library for 3D data processing. arXiv."},{"key":"ref_55","unstructured":"Kingma, D.P., and Ba, J. (2014). Adam: A method for stochastic optimization. arXiv."},{"key":"ref_56","unstructured":"Paszke, A., Gross, S., Massa, F., Lerer, A., Bradbury, J., Chanan, G., Killeen, T., Lin, Z., Gimelshein, N., and Antiga, L. (2019). Pytorch: An imperative style, high-performance deep learning library. arXiv."},{"key":"ref_57","doi-asserted-by":"crossref","unstructured":"Bosch, M., Foster, K., Christie, G., Wang, S., Hager, G.D., and Brown, M. (2019, January 7\u201311). Semantic stereo for incidental satellite images. Proceedings of the 2019 IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI, USA.","DOI":"10.1109\/WACV.2019.00167"},{"key":"ref_58","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1109\/MGRS.2019.2949679","article-title":"2019 ieee grss data fusion contest: Large-scale semantic 3d reconstruction","volume":"7","author":"Yokoya","year":"2019","journal-title":"IEEE Geosci. Remote Sens. Mag. (GRSM)"},{"key":"ref_59","doi-asserted-by":"crossref","first-page":"100","DOI":"10.1007\/s11263-010-0408-9","article-title":"Gradient flows for optimizing triangular mesh-based surfaces: Applications to 3d reconstruction problems dealing with visibility","volume":"95","author":"Delaunoy","year":"2011","journal-title":"Int. J. Comput. Vis."},{"key":"ref_60","doi-asserted-by":"crossref","unstructured":"Bosch, M., Kurtz, Z., Hagstrom, S., and Brown, M. (2016, January 18\u201320). A multiple view stereo benchmark for satellite imagery. Proceedings of the 2016 IEEE Applied Imagery Pattern Recognition Workshop (AIPR), Washington, DC, USA.","DOI":"10.1109\/AIPR.2016.8010543"},{"key":"ref_61","doi-asserted-by":"crossref","unstructured":"Waechter, M., Moehrle, N., and Goesele, M. (2014, January 6\u201312). Let there be color! Large-scale texturing of 3D reconstructions. Proceedings of the Computer Vision\u2013ECCV 2014: 13th European Conference, Zurich, Switzerland. Part V 13.","DOI":"10.1007\/978-3-319-10602-1_54"},{"key":"ref_62","unstructured":"G\u00f3mez, A., Randall, G., Facciolo, G., and von Gioi, R.G. (November, January 27). Improving the Pair Selection and the Model Fusion Steps of Satellite Multi-View Stereo Pipelines. Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision, Seoul, Republic of Korea."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/17\/4297\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T20:44:03Z","timestamp":1760129043000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/17\/4297"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,8,31]]},"references-count":62,"journal-issue":{"issue":"17","published-online":{"date-parts":[[2023,9]]}},"alternative-id":["rs15174297"],"URL":"https:\/\/doi.org\/10.3390\/rs15174297","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,8,31]]}}}