{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,21]],"date-time":"2025-10-21T15:54:49Z","timestamp":1761062089752,"version":"build-2065373602"},"reference-count":29,"publisher":"MDPI AG","issue":"2","license":[{"start":{"date-parts":[[2025,1,24]],"date-time":"2025-01-24T00:00:00Z","timestamp":1737676800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"National Natural Science Foundation of China","award":["42171435","ZR2021MD006","SDYJG19115","Z2021014","SDYKC2022151"],"award-info":[{"award-number":["42171435","ZR2021MD006","SDYJG19115","Z2021014","SDYKC2022151"]}]},{"name":"Shandong Provincial Natural Science Foundation","award":["42171435","ZR2021MD006","SDYJG19115","Z2021014","SDYKC2022151"],"award-info":[{"award-number":["42171435","ZR2021MD006","SDYJG19115","Z2021014","SDYKC2022151"]}]},{"name":"Postgraduate Education and Teaching Reform Foundation of Shandong Province","award":["42171435","ZR2021MD006","SDYJG19115","Z2021014","SDYKC2022151"],"award-info":[{"award-number":["42171435","ZR2021MD006","SDYJG19115","Z2021014","SDYKC2022151"]}]},{"name":"Undergraduate Education and Teaching Reform Foundation of Shandong Province","award":["42171435","ZR2021MD006","SDYJG19115","Z2021014","SDYKC2022151"],"award-info":[{"award-number":["42171435","ZR2021MD006","SDYJG19115","Z2021014","SDYKC2022151"]}]},{"name":"high-quality graduate course of Shandong Province","award":["42171435","ZR2021MD006","SDYJG19115","Z2021014","SDYKC2022151"],"award-info":[{"award-number":["42171435","ZR2021MD006","SDYJG19115","Z2021014","SDYKC2022151"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["IJGI"],"abstract":"<jats:p>Aiming to address the issue that existing multi-view stereo reconstruction methods have insufficient adaptability to the repetitive and weak textures in multi-view images, this paper proposes a three-dimensional (3D) reconstruction algorithm based on Feature Enhancement and Weight Optimization MVSNet (Abbreviated as FEWO-MVSNet). To obtain accurate and detailed global and local features, we first develop an adaptive feature enhancement approach to obtain multi-scale information from the images. Second, we introduce an attention mechanism and a spatial feature capture module to enable high-sensitivity detection for weak texture features. Third, based on the 3D convolutional neural network, the fine depth map for multi-view images can be predicted and the complete 3D model is subsequently reconstructed. Last, we evaluated the proposed FEWO-MVSNet through training and testing on the DTU, BlendedMVS, and Tanks and Temples datasets. The results demonstrate significant superiorities of our method for 3D reconstruction from multi-view images, with our method ranking first in accuracy and second in completeness when compared to the existing representative methods.<\/jats:p>","DOI":"10.3390\/ijgi14020043","type":"journal-article","created":{"date-parts":[[2025,1,24]],"date-time":"2025-01-24T03:26:31Z","timestamp":1737689191000},"page":"43","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Multi-View Three-Dimensional Reconstruction Based on Feature Enhancement and Weight Optimization Network"],"prefix":"10.3390","volume":"14","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6305-4514","authenticated-orcid":false,"given":"Guobiao","family":"Yao","sequence":"first","affiliation":[{"name":"School of Surveying and Geo-Informatics, Shandong Jianzhu University, Jinan 250101, China"}]},{"given":"Ziheng","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Surveying and Geo-Informatics, Shandong Jianzhu University, Jinan 250101, China"}]},{"given":"Guozhong","family":"Wei","sequence":"additional","affiliation":[{"name":"Shandong Provincial Institute of Land Surveying and Mapping, Jinan 250101, China"}]},{"given":"Fengqi","family":"Zhu","sequence":"additional","affiliation":[{"name":"Shandong Provincial Institute of Land Surveying and Mapping, Jinan 250101, China"}]},{"given":"Qingqing","family":"Fu","sequence":"additional","affiliation":[{"name":"School of Surveying and Geo-Informatics, Shandong Jianzhu University, Jinan 250101, China"}]},{"given":"Qian","family":"Yu","sequence":"additional","affiliation":[{"name":"Shandong Provincial Institute of Land Surveying and Mapping, Jinan 250101, China"}]},{"given":"Min","family":"Wei","sequence":"additional","affiliation":[{"name":"Shandong Zhengyuan Aerial Remote Sensing Technology Co., Ltd., Jinan 250101, China"}]}],"member":"1968","published-online":{"date-parts":[[2025,1,24]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Luo, H., Zhang, J., Liu, X., Zhang, L., and Liu, J. (2024). Large-Scale 3D Reconstruction from Multi-View Imagery: A Comprehensive Review. Remote Sens., 16.","DOI":"10.3390\/rs16050773"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Dong, Y., Song, J., Fan, D., Ji, S., and Lei, R. (2023). Joint Deep Learning and Information Propagation for Fast 3D City Modeling. ISPRS Int. J. Geo-Inf., 12.","DOI":"10.3390\/ijgi12040150"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"112032","DOI":"10.1016\/j.asoc.2024.112032","article-title":"Inverse distance weight-assisted particle swarm optimized indoor localization","volume":"164","author":"Bi","year":"2024","journal-title":"Appl. Soft Comput."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Gao, X., Yang, R., Chen, X., Tan, J., Liu, Y., Wang, Z., Tan, J., and Liu, H. (2024). A New Framework for Generating Indoor 3D Digital Models from Point Clouds. Remote Sens., 16.","DOI":"10.3390\/rs16183462"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Galliani, S., Lasinger, K., and Schindler, K. (2015, January 7\u201313). Massively parallel multiview stereopsis by surface normal diffusion. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.106"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Schonberger, J.L., and Frahm, J.-M. (2016, January 27\u201330). Structure-from-motion revisited. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.445"},{"key":"ref_7","unstructured":"Han, X., Leung, T., Jia, Y., Sukthankar, R., and Berg, A.C. (2015, January 7\u201312). Matchnet: Unifying feature and metric learning for patch-based matching. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Yao, Y., Luo, Z., Li, S., Fang, T., and Quan, L. (2018, January 8\u201314). Mvsnet: Depth inference for unstructured multi-view stereo. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01237-3_47"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Yao, Y., Luo, Z., Li, S., Shen, T., Fang, T., and Quan, L. (2019, January 15\u201320). Recurrent mvsnet for high-resolution multi-view stereo depth inference. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00567"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Gu, X., Fan, Z., Zhu, S., Dai, Z., Tan, F., and Tan, P. (2020, January 14\u201319). Cascade cost volume for high-resolution multi-view stereo and stereo matching. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00257"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Wang, F., Galliani, S., Vogel, C., Speciale, P., and Pollefeys, M. (2021, January 19\u201325). Patchmatchnet: Learned multi-view patchmatch stereo. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Virtual.","DOI":"10.1109\/CVPR46437.2021.01397"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"1034","DOI":"10.4218\/etrij.2021-0305","article-title":"ASPPMVSNet: A high-receptive-field multiview stereo network for dense three-dimensional reconstruction","volume":"44","author":"Saeed","year":"2022","journal-title":"ETRI J."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Li, Y., Li, W., Zhao, Z., and Fan, J. (2022). DRI-MVSNet: A depth residual inference network for multi-view stereo images. PLoS ONE, 17.","DOI":"10.1371\/journal.pone.0264721"},{"key":"ref_14","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, \u0141., and Polosukhin, I. (2017). Attention is all you need. Adv. Neural Inf. Process. Syst., 30."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Wang, X., Zhu, Z., Huang, G., Qin, F., Ye, Y., He, Y., Chi, X., and Wang, X. (2022, January 23\u201327). Mvster: Epipolar transformer for efficient multi-view stereo. Proceedings of the European Conference on Computer Vision, Tel Aviv, Israel.","DOI":"10.1007\/978-3-031-19821-2_33"},{"key":"ref_16","unstructured":"Zhu, J., Peng, B., Li, W., Shen, H., Zhang, Z., and Lei, J. (2021). Multi-view stereo with transformer. arXiv."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Zhu, X., Hu, H., Lin, S., and Dai, J. (2019, January 15\u201320). Deformable convnets v2: More deformable, better results. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00953"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Ding, Y., Yuan, W., Zhu, Q., Zhang, H., Liu, X., Wang, Y., and Liu, X. (2022, January 18\u201324). Transmvsnet: Global context-aware multi-view stereo network with transformers. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.00839"},{"key":"ref_19","unstructured":"Liu, S., Huang, D., and Wang, Y. (2019). Learning spatial fusion for single-shot object detection. arXiv."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Hu, J., Shen, L., and Sun, G. (2018, January 18\u201322). Squeeze-and-excitation networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00745"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Teng, J., Sun, H., Liu, P., and Jiang, S. (2024). An Improved TransMVSNet Algorithm for Three-Dimensional Reconstruction in the Unmanned Aerial Vehicle Remote Sensing Domain. Sensors, 24.","DOI":"10.3390\/s24072064"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Wang, G., Wang, K., and Lin, L. (2019, January 15\u201320). Adaptively connected neural networks. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00188"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Kendall, A., Martirosyan, H., Dasgupta, S., Henry, P., Kennedy, R., Bachrach, A., and Bry, A. (2017, January 22\u201329). End-to-end learning of geometry and context for deep stereo regression. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.17"},{"key":"ref_24","unstructured":"Ross, T.-Y., and Doll\u00e1r, G. (2017, January 21\u201326). Focal loss for dense object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Girshick, R. (2015). Fast r-cnn. arXiv.","DOI":"10.1109\/ICCV.2015.169"},{"key":"ref_26","unstructured":"Jia, X., De Brabandere, B., Tuytelaars, T., and Gool, L.V. (2016). Dynamic filter networks. Adv. Neural Inf. Process. Syst., 29."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"153","DOI":"10.1007\/s11263-016-0902-9","article-title":"Large-scale data for multiple-view stereopsis","volume":"120","author":"Jensen","year":"2016","journal-title":"Int. J. Comput. Vis."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Yao, Y., Luo, Z., Li, S., Zhang, J., Ren, Y., Zhou, L., Fang, T., and Quan, L. (2020, January 14\u201319). Blendedmvs: A large-scale dataset for generalized multi-view stereo networks. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00186"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3072959.3073599","article-title":"Tanks and temples: Benchmarking large-scale scene reconstruction","volume":"36","author":"Knapitsch","year":"2017","journal-title":"ACM Trans. Graph. (ToG)"}],"container-title":["ISPRS International Journal of Geo-Information"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2220-9964\/14\/2\/43\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,8]],"date-time":"2025-10-08T10:35:09Z","timestamp":1759919709000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2220-9964\/14\/2\/43"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,1,24]]},"references-count":29,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2025,2]]}},"alternative-id":["ijgi14020043"],"URL":"https:\/\/doi.org\/10.3390\/ijgi14020043","relation":{},"ISSN":["2220-9964"],"issn-type":[{"type":"electronic","value":"2220-9964"}],"subject":[],"published":{"date-parts":[[2025,1,24]]}}}