{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,4]],"date-time":"2026-04-04T19:56:13Z","timestamp":1775332573975,"version":"3.50.1"},"reference-count":30,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2024,7,18]],"date-time":"2024-07-18T00:00:00Z","timestamp":1721260800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,7,18]],"date-time":"2024-07-18T00:00:00Z","timestamp":1721260800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100012154","name":"Graduate Research and Innovation Projects of Jiangsu Province","doi-asserted-by":"publisher","award":["SJCX23_0320"],"award-info":[{"award-number":["SJCX23_0320"]}],"id":[{"id":"10.13039\/501100012154","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Vis. Intell."],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>A forest fire is a natural disaster characterized by rapid spread, difficulty in extinguishing, and widespread destruction, which requires an efficient response. Existing detection methods fail to balance global and local fire features, resulting in the false detection of small or hidden fires. In this paper, we propose a novel detection technique based on an improved YOLO v5 model to enhance the visual representation of forest fires and retain more information about global interactions. We add a plug-and-play global attention mechanism to improve the efficiency of neck and backbone feature extraction of the YOLO v5 model. Then, a re-parameterized convolutional module is designed, and a decoupled detection head is used to accelerate the convergence speed. Finally, a weighted bi-directional feature pyramid network (BiFPN) is introduced to merge feature information for local information processing. In the evaluation, we use the complete intersection over union (CIoU) loss function to optimize the multi-task loss for different kinds of forest fires. Experiments show that the precision, recall, and mean average precision are increased by 4.2%, 3.8%, and 4.6%, respectively, compared with the classic YOLO v5 model. In particular, the mAP@0.5:0.95 is 2.2% higher than the other detection methods, while meeting the requirements of real-time detection.<\/jats:p>","DOI":"10.1007\/s44267-024-00053-y","type":"journal-article","created":{"date-parts":[[2024,7,18]],"date-time":"2024-07-18T11:01:37Z","timestamp":1721300497000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":25,"title":["Efficient forest fire detection based on an improved YOLO model"],"prefix":"10.1007","volume":"2","author":[{"given":"Lei","family":"Cao","sequence":"first","affiliation":[]},{"given":"Zirui","family":"Shen","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9017-1510","authenticated-orcid":false,"given":"Sheng","family":"Xu","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2024,7,18]]},"reference":[{"issue":"12","key":"53_CR1","doi-asserted-by":"publisher","DOI":"10.3390\/rs10121992","volume":"10","author":"Z. Xie","year":"2018","unstructured":"Xie, Z., Song, W., Ba, R., Li, X., & Xia, L. (2018). A spatiotemporal contextual model for forest fire detection using Himawari-8 satellite data. Remote Sensing, 10(12), 1992.","journal-title":"Remote Sensing"},{"issue":"22","key":"53_CR2","doi-asserted-by":"publisher","DOI":"10.3390\/s20226442","volume":"20","author":"P. Barmpoutis","year":"2020","unstructured":"Barmpoutis, P., Papaioannou, P., Dimitropoulos, K., & Grammalidis, N. (2020). A review on early forest fire detection systems using optical remote sensing. Sensors, 20(22), 6442.","journal-title":"Sensors"},{"issue":"2","key":"53_CR3","doi-asserted-by":"publisher","first-page":"559","DOI":"10.1007\/s10694-020-01056-z","volume":"57","author":"F. Abid","year":"2021","unstructured":"Abid, F. (2021). A survey of machine learning algorithms based forest fires prediction and detection systems. Fire Technology, 57(2), 559\u2013590.","journal-title":"Fire Technology"},{"issue":"6","key":"53_CR4","doi-asserted-by":"publisher","first-page":"1103","DOI":"10.1007\/s00138-011-0369-1","volume":"23","author":"Y. H. Habiboglu","year":"2012","unstructured":"Habiboglu, Y. H., G\u00fcnay, O., & \u00c7etin, A. E. (2012). Covariance matrix-based fire and flame detection method in video. Machine Vision and Applications, 23(6), 1103\u20131113.","journal-title":"Machine Vision and Applications"},{"key":"53_CR5","first-page":"1","volume-title":"Proceedings of the 20th signal processing and communications applications","author":"F. Erden","year":"2012","unstructured":"Erden, F., T\u00f6reyin, B. U., Soyer, E. B., Inac, I., G\u00fcnay, O., K\u00f6se, K., et al. (2012). Wavelet based flame detection using differential PIR sensors. In Proceedings of the 20th signal processing and communications applications (pp. 1\u20134). Piscataway: IEEE."},{"issue":"7553","key":"53_CR6","doi-asserted-by":"publisher","first-page":"436","DOI":"10.1038\/nature14539","volume":"521","author":"Y. LeCun","year":"2015","unstructured":"LeCun, Y., Bengio, Y., & Hinton, G. (2015). Deep learning. Nature, 521(7553), 436\u2013444.","journal-title":"Nature"},{"issue":"3","key":"53_CR7","doi-asserted-by":"publisher","first-page":"257","DOI":"10.1109\/JPROC.2023.3238524","volume":"111","author":"Z. Zou","year":"2023","unstructured":"Zou, Z., Chen, K., Shi, Z., Guo, Y., & Ye, J. (2023). Object detection in 20 years: a survey. Proceedings of the IEEE, 111(3), 257\u2013276.","journal-title":"Proceedings of the IEEE"},{"key":"53_CR8","first-page":"580","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition","author":"R. B. Girshick","year":"2014","unstructured":"Girshick, R. B., Donahue, J., Darrell, T., & Malik, J. (2014). Rich feature hierarchies for accurate object detection and semantic segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 580\u2013587). Piscataway: IEEE."},{"key":"53_CR9","first-page":"1440","volume-title":"Proceedings of the IEEE international conference on computer vision","author":"R. B. Girshick","year":"2015","unstructured":"Girshick, R. B. (2015). Fast R-CNN. In Proceedings of the IEEE international conference on computer vision (pp. 1440\u20131448). Cham: Springer."},{"key":"53_CR10","first-page":"91","volume-title":"Proceedings of the 29th international conference on neural information processing systems","author":"S. Ren","year":"2015","unstructured":"Ren, S., He, K., Girshick, R. B., & Sun, J. (2015). Faster R-CNN: towards real-time object detection with region proposal networks. In C. Cortes, N. D. Lawrence, D. D. Lee, et al. (Eds.), Proceedings of the 29th international conference on neural information processing systems (pp. 91\u201399). Red Hook: Curran Associates."},{"issue":"17","key":"53_CR11","doi-asserted-by":"publisher","first-page":"5861","DOI":"10.1007\/s00500-018-3324-5","volume":"22","author":"Y. Song","year":"2018","unstructured":"Song, Y., & Fu, Z. (2018). Uncertain multivariable regression model. Soft Computing, 22(17), 5861\u20135866.","journal-title":"Soft Computing"},{"key":"53_CR12","first-page":"21","volume-title":"Proceedings of the 14th European conference on computer vision","author":"W. Liu","year":"2016","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S. E., Fu, C.-Y., et al. (2016). SSD: single shot multibox detector. In B. Leibe, J. Matas, N. Sebe, et al. (Eds.), Proceedings of the 14th European conference on computer vision (pp. 21\u201337). Cham: Springer."},{"key":"53_CR13","first-page":"1","volume-title":"Proceedings of the British machine vision conference","author":"J. Jeong","year":"2017","unstructured":"Jeong, J., Park, H., & Kwak, N. (2017). Enhancement of SSD by concatenating feature maps for object detection. In Proceedings of the British machine vision conference (pp. 1\u201312). Swansea: BMVA Press."},{"key":"53_CR14","unstructured":"Zhou, X., Wang, D., & Kr\u00e4henb\u00fchl, P. (2019). Objects as points. arXiv preprint. arXiv:1904.07850."},{"key":"53_CR15","first-page":"779","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition","author":"J. Redmon","year":"2016","unstructured":"Redmon, J., Divvala, S. K., Girshick, R. B., & Farhadi, A (2016). You only look once: unified, real-time object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 779\u2013788). Piscataway: IEEE."},{"issue":"2","key":"53_CR16","doi-asserted-by":"publisher","DOI":"10.3390\/f12020217","volume":"12","author":"R. Xu","year":"2021","unstructured":"Xu, R., Lin, H., Lu, K., Cao, L., & Liu, Y. (2021). A forest fire detection system based on ensemble learning. Forests, 12(2), 217.","journal-title":"Forests"},{"key":"53_CR17","first-page":"10781","volume-title":"Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition","author":"M. Tan","year":"2020","unstructured":"Tan, M., Pang, R., & Le, Q. V. (2020). Efficientdet: scalable and efficient object detection. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 10781\u201310790). Piscataway: IEEE."},{"issue":"4","key":"53_CR18","doi-asserted-by":"publisher","first-page":"1680","DOI":"10.3390\/make5040083","volume":"5","author":"J. R. Terven","year":"2023","unstructured":"Terven, J. R., C\u00f3rdova Esparza, D. M., & Romero-Gonz\u00e1lez, J.-A. (2023). A comprehensive review of yolo architectures in computer vision: from yolov1 to yolov8 and yolo-nas. Machine Learning and Knowledge Extraction, 5(4), 1680\u20131716.","journal-title":"Machine Learning and Knowledge Extraction"},{"key":"53_CR19","first-page":"1571","volume-title":"Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition workshops","author":"C.-Y. Wang","year":"2020","unstructured":"Wang, C.-Y., Liao, H.-Y. M., Wu, Y.-H., Chen, P.-Y., Hsieh, J.-W., & Yeh, I.-H. (2020). CSPNet: a new backbone that can enhance learning capability of CNN. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition workshops (pp. 1571\u20131580). Piscataway: IEEE."},{"key":"53_CR20","first-page":"936","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition","author":"T.-Y. Lin","year":"2017","unstructured":"Lin, T.-Y., Doll\u00e1r, P., Girshick, R. B., He, K., Hariharan, B., & Belongie, S. J. (2017). Feature pyramid networks for object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 936\u2013944). Piscataway: IEEE."},{"key":"53_CR21","first-page":"8759","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition","author":"S. Liu","year":"2018","unstructured":"Liu, S., Qi, L., Qin, H., Shi, J., & Jia, J. (2018). Path aggregation network for instance segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 8759\u20138768). Piscataway: IEEE."},{"key":"53_CR22","unstructured":"Liu, Y., Shao, Z., & Hoffmann, N. (2021). Global attention mechanism: retain information to enhance channel-spatial interactions. arXiv preprint. arXiv:2112.05561."},{"key":"53_CR23","first-page":"13733","volume-title":"Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition","author":"X. Ding","year":"2021","unstructured":"Ding, X., Zhang, X., Ma, N., Han, J., Ding, G., & Sun, J. (2021). RepVGG: making vgg-style convnets great again. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 13733\u201313742). Piscataway: IEEE."},{"key":"53_CR24","unstructured":"Ge, Z., Liu, S., Wang, F., Li, Z., & Sun, J. (2021). YOLOX: exceeding YOLO series in 2021. arXiv preprint. arXiv:2107.08430."},{"key":"53_CR25","doi-asserted-by":"publisher","first-page":"95","DOI":"10.1109\/SIBGRAPI.2015.19","volume-title":"Proceedings of the 28th SIBGRAPI conference on graphics, patterns and image","author":"D. Y. T. Chino","year":"2015","unstructured":"Chino, D. Y. T., Avalhais, L. P. S., & Rodrigues, J. F. R. Jr (2015). BoWFire: detection of fire in still images by integrating pixel color and texture analysis. In Proceedings of the 28th SIBGRAPI conference on graphics, patterns and image (pp.\u00a095\u2013102). Piscataway: IEEE."},{"key":"53_CR26","unstructured":"Redmon, J., & Farhadi, A. (2018). Yolov3: an incremental improvement. arXiv preprint. arXiv:1804.02767."},{"key":"53_CR27","first-page":"13029","volume-title":"Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition","author":"C.-Y. Wang","year":"2021","unstructured":"Wang, C.-Y., Bochkovskiy, A., & Liao, H.-Y. M. (2021). Scaled-yolov4: scaling cross stage partial network. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 13029\u201313030). Piscataway: IEEE."},{"key":"53_CR28","unstructured":"Wang, C.-Y., Yeh, I. H., & Liao, H.-Y. M. (2021). You only learn one representation: unified network for multiple tasks. arXiv preprint. arXiv:2105.04206."},{"key":"53_CR29","unstructured":"Yu, G., Chang, Q., Lv, W., Xu, C., Cui, C., Ji, W., et\u00a0al. (2021). PP-PicoDet: a better real-time object detector on mobile devices. arXiv preprint. arXiv:2111.00902."},{"key":"53_CR30","first-page":"7464","volume-title":"Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition","author":"C.-Y. Wang","year":"2023","unstructured":"Wang, C.-Y., Bochkovskiy, A., & Liao, H.-Y. M. (2023). YOLOv7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 7464\u20137475). Piscataway: IEEE."}],"container-title":["Visual Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s44267-024-00053-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s44267-024-00053-y\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s44267-024-00053-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,7,18]],"date-time":"2024-07-18T11:25:19Z","timestamp":1721301919000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s44267-024-00053-y"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,7,18]]},"references-count":30,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2024,12]]}},"alternative-id":["53"],"URL":"https:\/\/doi.org\/10.1007\/s44267-024-00053-y","relation":{},"ISSN":["2731-9008"],"issn-type":[{"value":"2731-9008","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,7,18]]},"assertion":[{"value":"15 August 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"20 June 2024","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"26 June 2024","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 July 2024","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"20"}}