{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,19]],"date-time":"2026-05-19T01:05:04Z","timestamp":1779152704718,"version":"3.51.4"},"reference-count":36,"publisher":"MDPI AG","issue":"18","license":[{"start":{"date-parts":[[2021,9,20]],"date-time":"2021-09-20T00:00:00Z","timestamp":1632096000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>It is of great significance to apply the object detection methods to automatically detect boulders from planetary images and analyze their distribution. This contributes to the selection of candidate landing sites and the understanding of the geological processes. This paper improves the state-of-the-art object detection method of YOLOv5 with attention mechanism and designs a pyramid based approach to detect boulders from planetary images. A new feature fusion layer has been designed to capture more shallow features of the small boulders. The attention modules implemented by combining the convolutional block attention module (CBAM) and efficient channel attention network (ECA-Net) are also added into YOLOv5 to highlight the information that contribute to boulder detection. Based on the Pascal Visual Object Classes 2007 (VOC2007) dataset which is widely used for object detection evaluations and the boulder dataset that we constructed from the images of Bennu asteroid, the evaluation results have shown that the improvements have increased the performance of YOLOv5 by 3.4% in precision. With the improved YOLOv5 detection method, the pyramid based approach extracts several layers of images with different resolutions from the large planetary images and detects boulders of different scales from different layers. We have also applied the proposed approach to detect the boulders on Bennu asteroid. The distribution of the boulders on Bennu asteroid has been analyzed and presented.<\/jats:p>","DOI":"10.3390\/rs13183776","type":"journal-article","created":{"date-parts":[[2021,9,21]],"date-time":"2021-09-21T22:35:20Z","timestamp":1632263720000},"page":"3776","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":128,"title":["Improving YOLOv5 with Attention Mechanism for Detecting Boulders from Planetary Images"],"prefix":"10.3390","volume":"13","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7312-6534","authenticated-orcid":false,"given":"Linlin","family":"Zhu","sequence":"first","affiliation":[{"name":"School of Computer and Information Engineering, Henan University, Kaifeng 475001, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xun","family":"Geng","sequence":"additional","affiliation":[{"name":"College of Geography and Environmental Science, Henan University, Kaifeng 475001, China"},{"name":"Henan Industrial Technology Academy of Spatio-Temporal Big Data, Henan University, Zhengzhou 450000, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zheng","family":"Li","sequence":"additional","affiliation":[{"name":"School of Computer and Information Engineering, Henan University, Kaifeng 475001, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chun","family":"Liu","sequence":"additional","affiliation":[{"name":"School of Computer and Information Engineering, Henan University, Kaifeng 475001, China"},{"name":"Henan Industrial Technology Academy of Spatio-Temporal Big Data, Henan University, Zhengzhou 450000, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2021,9,20]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Wang, H., Jiang, J., and Zhang, G. (2018). CraterIDNet: An end-to-end fully convolutional neural network for crater detection and identification in remotely sensed planetary images. Remote Sens., 10.","DOI":"10.3390\/rs10071067"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"e2019EA001005","DOI":"10.1029\/2019EA001005","article-title":"Deriving surface ages on Mars using automated crater counting","volume":"7","author":"Benedix","year":"2020","journal-title":"Earth Space Sci."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Ding, M., Cao, Y.F., and Wu, Q.X. (2008, January 18\u201320). Autonomous craters detection from planetary image. Proceedings of the 2008 3rd International Conference on Innovative Computing Information and Control, Dalian, China.","DOI":"10.1109\/ICICIC.2008.181"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"99","DOI":"10.1016\/j.pss.2018.11.008","article-title":"Abundance and size-frequency distribution of boulders in Linn\u00e9 crater\u2019s ejecta (Moon)","volume":"165","author":"Pajola","year":"2019","journal-title":"Planet. Space Sci."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"242","DOI":"10.1038\/s41561-019-0326-6","article-title":"Craters, boulders and regolith of (101955) Bennu indicative of an old and dynamic surface","volume":"12","author":"Walsh","year":"2019","journal-title":"Nat. Geosci."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"e2020JE006475","DOI":"10.1029\/2020JE006475","article-title":"Global patterns of recent mass movement on asteroid (101955) Bennu","volume":"125","author":"Jawin","year":"2020","journal-title":"J. Geophys. Res. Planets"},{"key":"ref_7","unstructured":"GitHub (2021, March 01). YOLOV5-Master. Available online: https:\/\/github.com\/ultralytics\/yolov5.git\/."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Woo, S., Park, J., Lee, J.Y., and Kweon, I.S. (2018, January 6). CBAM: Convolutional block attention module. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_1"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Wang, Q., Wu, B., Zhu, P., Li, P., Zuo, W., and Hu, Q. (2020, January 14). ECA-Net: Efficient channel attention for deep convolutional neural networks. Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01155"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"850","DOI":"10.1109\/ICPR.2006.479","article-title":"Efficient non-maximum suppression","volume":"Volume 3","author":"Neubeck","year":"2006","journal-title":"Proceedings of the 18th International Conference on Pattern Recognition (ICPR\u201906)"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Girshick, R., Donahue, J., Darrell, T., and Malik, J. (2014, January 23\u201328). Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.81"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"1904","DOI":"10.1109\/TPAMI.2015.2389824","article-title":"Spatial pyramid pooling in deep convolutional networks for visual recognition","volume":"37","author":"He","year":"2015","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Girshick, R. (2015, January 7\u201313). Fast r-cnn. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.169"},{"key":"ref_14","first-page":"91","article-title":"Faster r-cnn: Towards real-time object detection with region proposal networks","volume":"28","author":"Ren","year":"2015","journal-title":"ANIPS"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Liu, Y., and Liu, Q. (2017, January 11). Convolutional neural networks with large-margin softmax loss function for cognitive load recognition. Proceedings of the 2017 36th Chinese Control Conference (CCC), Dalian, China.","DOI":"10.23919\/ChiCC.2017.8027991"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y., and Berg, A.C. (2016, January 11\u201314). SSD: Single shot multibox detector. Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_17","unstructured":"Fu, C.Y., Liu, W., Ranga, A., Tyagi, A., and Berg, A.C. (2017). Dssd: Deconvolutional single shot detector. arXiv."},{"key":"ref_18","first-page":"1080613","article-title":"Extend the shallow part of single shot multibox detector via convolutional neural network","volume":"Volume 10806","author":"Zheng","year":"2018","journal-title":"Proceedings of the Tenth International Conference on Digital Image Processing (ICDIP 2018)"},{"key":"ref_19","unstructured":"Cui, L., Ma, R., Lv, P., Jiang, X., Gao, Z., Zhou, B., and Xu, M. (2018). MDSSD: Multi-scale deconvolutional single shot detector for small objects. arXiv."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Redmon, J., Divvala, S., and Girshick, R. (2016, January 12). You only look once: Unified, real-time object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.91"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Redmon, J., and Farhadi, A. (2017, January 21\u201326). YOLO9000: Better, Faster, Stronger. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Hawaii, HI, USA.","DOI":"10.1109\/CVPR.2017.690"},{"key":"ref_22","unstructured":"Redmon, J., and Farhadi, A. (2018). Yolov3: An incremental improvement. arXiv."},{"key":"ref_23","unstructured":"Bochkovskiy, A., Wang, C.Y., and Liao, H.Y.M. (2020). Yolov4: Optimal speed and accuracy of object detection. arXiv."},{"key":"ref_24","unstructured":"Simonyan, K., and Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. arXiv."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"3111","DOI":"10.1109\/TMM.2018.2818020","article-title":"Arbitrary-oriented scene text detection via rotation proposals","volume":"20","author":"Ma","year":"2018","journal-title":"IEEE Trans. Multimed."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., and Ren, S. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Zheng, Z., Wang, P., Liu, W., Li, J., Ye, R., and Ren, D. (2020, January 7\u201312). Distance-IoU loss: Faster and better learning for bounding box regression. Proceedings of the AAAI Conference on Artificial Intelligence, New York, NY, USA.","DOI":"10.1609\/aaai.v34i07.6999"},{"key":"ref_28","unstructured":"Wang, W., Xie, E., Song, X., Zang, Y., Wang, W., Lu, T., Yu, G., and Shen, C. (November, January 27). Efficient and accurate arbitrary-shaped text detection with pixel aggregation network. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Korea."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Zeiler, M.D., Taylor, G.W., and Fergus, R. (2011, January 6\u201313). Adaptive deconvolutional networks for mid and high level feature learning. Proceedings of the 2011 International Conference on Computer Vision, Barcelona, Spain.","DOI":"10.1109\/ICCV.2011.6126474"},{"key":"ref_30","unstructured":"Mastromichalakis, S. (2020). ALReLU: A different approach on Leaky ReLU activation function to improve Neural Networks Performance. arXiv."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Kim, D., Park, S., Kang, D., and Paik, J. (2019, January 8\u201311). Improved center and scale prediction-based pedestrian detection using convolutional block. Proceedings of the 2019 IEEE 9th International Conference on Consumer Electronics (ICCE-Berlin), Berlin, Germany.","DOI":"10.1109\/ICCE-Berlin47944.2019.8966154"},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"303","DOI":"10.1007\/s11263-009-0275-4","article-title":"The pascal visual object classes (voc) challenge","volume":"88","author":"Everingham","year":"2010","journal-title":"Int. J. Comput. Vis."},{"key":"ref_33","unstructured":"(2021, April 01). 101955 Bennu, Available online: https:\/\/solarsystem.nasa.gov\/asteroids-comets-and-meteors\/asteroids\/101955-bennu\/in-depth\/."},{"key":"ref_34","unstructured":"(2020, July 01). OSIRIS-REx Mission. Available online: https:\/\/www.asteroidmission.org\/bennu_global_mosaic\/."},{"key":"ref_35","unstructured":"Kingma, D.P., and Ba, J. (2014). Adam: A method for stochastic optimization. arXiv."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Jiang, B., Luo, R., Mao, J., Xiao, T., and Jiang, Y. (2018, January 8\u201314). Acquisition of localization confidence for accurate object detection. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01264-9_48"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/18\/3776\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T07:02:31Z","timestamp":1760166151000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/18\/3776"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,9,20]]},"references-count":36,"journal-issue":{"issue":"18","published-online":{"date-parts":[[2021,9]]}},"alternative-id":["rs13183776"],"URL":"https:\/\/doi.org\/10.3390\/rs13183776","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,9,20]]}}}