{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,24]],"date-time":"2025-11-24T12:47:38Z","timestamp":1763988458755,"version":"3.45.0"},"reference-count":41,"publisher":"National Library of Serbia","issue":"4","license":[{"start":{"date-parts":[[2025,1,1]],"date-time":"2025-01-01T00:00:00Z","timestamp":1735689600000},"content-version":"unspecified","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc-nd\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["ComSIS","COMPUT SCI INF SYST","COMPUT SCI INFORM SY","COMPUTER SCI INFORM","COMSIS J"],"published-print":{"date-parts":[[2025]]},"abstract":"<jats:p>Fire detection is critical in applications such as fire management and building safety, but dispersion and blurring of flame and smoke boundaries can present challenges. Multiple upsampling and downsampling operations can blur the localisation signals, thus reducing accuracy and efficiency. To address this problem, we propose the AMMF(Attention Mechanisms and Multiscale Features) detection model, which integrates an attention mechanism and multi-scale feature fusion to improve accuracy and real-time performance. The model incorporates a dynamic sparse attention mechanism in the backbone network to enhance feature capture and restructures the neck network using CepBlock and MPFusion modules for better feature fusion. MDPIoU loss and Slideloss are then utilised to reduce the bounding box regression error and address the sample imbalance problem respectively. In addition, parameters are shared by merging 3?3 convolutional branches, which optimises the detection head and improves computational efficiency. The experimental results show that AMMF-Detection can significantly improve the detection speed and accuracy on the public dataset.<\/jats:p>","DOI":"10.2298\/csis241225059z","type":"journal-article","created":{"date-parts":[[2025,8,26]],"date-time":"2025-08-26T09:47:23Z","timestamp":1756201643000},"page":"1509-1532","source":"Crossref","is-referenced-by-count":0,"title":["Fire detection models based on attention mechanisms and multiscale features"],"prefix":"10.2298","volume":"22","author":[{"given":"Shunxiang","family":"Zhang","sequence":"first","affiliation":[{"name":"School of Computer Science and Engineering, Anhui University of Science Technology, Huainan, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Meng","family":"Chen","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Anhui University of Science Technology, Huainan, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kuan-Ching","family":"Li","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Information Engineering (CSIE), Providence University, Taizhong, Taiwan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hua","family":"Wen","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Anhui University of Science Technology, Huainan, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Liang","family":"Sun","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Anhui University of Science Technology, Huainan, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1078","reference":[{"key":"ref1","doi-asserted-by":"crossref","unstructured":"Y. Wang, Y. Han, Z. Tang, et al. A Fast Video Fire Detection of Irregular Burning Feature in Fire-Flame Using in Indoor Fire Sensing Robots. IEEE Transactions on Instrumentation and Measurement, vol. 71, pp. 1-14, 2022.","DOI":"10.1109\/TIM.2022.3212986"},{"key":"ref2","doi-asserted-by":"crossref","unstructured":"L. Zhang, C. Lu, H. Xu, et al. MMFNet: Forest fire smoke detection using multiscale convergence coordinated pyramid network with mixed attention and fast-robust NMS. IEEE Internet of Things Journal, 2023.","DOI":"10.1109\/JIOT.2023.3277511"},{"key":"ref3","doi-asserted-by":"crossref","unstructured":"M. Mueller, P. Karasev, I. Kolesov, et al. Optical flow estimation for flame detection in videos. IEEE Transactions on Image Processing, vol. 22, no. 7, pp. 2786-2797, 2013.","DOI":"10.1109\/TIP.2013.2258353"},{"key":"ref4","doi-asserted-by":"crossref","unstructured":"W. Liu, D. Anguelov, D. Erhan, et al. SSD: Single shot multibox detector. In Computer Vision- ECCV, Springer, pp. 21-37, 2016.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref5","doi-asserted-by":"crossref","unstructured":"J. Redmon, S. Divvala, R. Girshick, et al. You only look once: Unified, real-time object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 779-788, 2016.","DOI":"10.1109\/CVPR.2016.91"},{"key":"ref6","doi-asserted-by":"crossref","unstructured":"J. Redmon and A. Farhadi. YOLO9000: Better, faster, stronger. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 7263-7271, 2017.","DOI":"10.1109\/CVPR.2017.690"},{"key":"ref7","doi-asserted-by":"crossref","unstructured":"C. Y. Wang, I. H. Yeh, H. Y. Mark Liao.Yolov9: Learning what you want to learn using programmable gradient information.In European Conference on Computer Vision. Springer, Cham, 2025, pp. 1-21.","DOI":"10.1007\/978-3-031-72751-1_1"},{"key":"ref8","unstructured":"A.Wang, H. Chen, L. Liu, et al. Yolov10: Real-time end-to-end object detection.arXiv preprint arXiv:2405.14458, 2024."},{"key":"ref9","unstructured":"W. Lv, S. Xu, Y. Zhao, et al. DETRs beat YOLOs on real-time object detection. arXiv preprint arXiv:2304.08069, 2023."},{"key":"ref10","doi-asserted-by":"crossref","unstructured":"T. H. Chen, P. H. Wu, Y. C. Chiou. An early fire-detection method based on image processing. In Proceedings of the 2004 International Conference on Image Processing (ICIP), vol. 3, pp. 1707-1710, IEEE, 2004.","DOI":"10.1109\/ICIP.2004.1421401"},{"key":"ref11","unstructured":"N. I. binti Zaidi, N. A. A. binti Lokman, M. R. bin Daud, et al. Fire recognition using RGB and YCbCr color space. ARPN Journal of Engineering and Applied Sciences, vol. 10, no. 21, pp. 9786-9790, 2015."},{"key":"ref12","unstructured":"V. Vipin. Image processing based forest fire detection. International Journal of Emerging Technology and Advanced Engineering, vol. 2, no. 2, pp. 87-95, 2012."},{"key":"ref13","doi-asserted-by":"crossref","unstructured":"K. Dimitropoulos, P. Barmpoutis, N. Grammalidis. Spatio-temporal flame modeling and dynamic texture analysis for automatic video-based fire detection. IEEE Transactions on Circuits and Systems for Video Technology, vol. 25, no. 2, pp. 339-351, 2014.","DOI":"10.1109\/TCSVT.2014.2339592"},{"key":"ref14","doi-asserted-by":"crossref","unstructured":"W. Ye, J. Zhao, S. Wang, et al. Dynamic texture based smoke detection using Surfacelet transform and HMT model. Fire Safety Journal, vol. 73, pp. 91-101, 2015.","DOI":"10.1016\/j.firesaf.2015.03.001"},{"key":"ref15","doi-asserted-by":"crossref","unstructured":"Y. Chunyu, F. Jun,W. Jinjun, et al. Video fire smoke detection using motion and color features. Fire Technology, vol. 46, pp. 651-663, 2010.","DOI":"10.1007\/s10694-009-0110-z"},{"key":"ref16","doi-asserted-by":"crossref","unstructured":"Z. Li, L. S. Mihaylova, O. Isupova, et al. Autonomous flame detection in videos with a Dirichlet process Gaussian mixture color model. IEEE Transactions on Industrial Informatics, vol. 14, no. 3, pp. 1146-1154, 2017.","DOI":"10.1109\/TII.2017.2768530"},{"key":"ref17","unstructured":"S. Han, H. Mao,W. J. Dally. Deep compression: Compressing deep neural networks with pruning, trained quantization and Huffman coding. arXiv preprint arXiv:1510.00149, 2015."},{"key":"ref18","unstructured":"M. Courbariaux, Y. Bengio, J. P. David. BinaryConnect: Training deep neural networks with binary weights during propagations. In Advances in Neural Information Processing Systems (NeurIPS), vol. 28, 2015."},{"key":"ref19","unstructured":"M. Courbariaux, I. Hubara, D. Soudry, et al. Binarized neural networks: Training deep neural networks with weights and activations constrained to +1 or -1. arXiv preprint arXiv:1602.02830, 2016."},{"key":"ref20","unstructured":"I. Hubara, M. Courbariaux, D. Soudry, et al. Quantized neural networks: Training neural networks with low precision weights and activations. Journal of Machine Learning Research, vol. 18, no. 187, pp. 1-30, 2018."},{"key":"ref21","doi-asserted-by":"crossref","unstructured":"J. Yim, D. Joo, J. Bae, et al. A gift from knowledge distillation: Fast optimization, network minimization and transfer learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 4133-4141, 2017.","DOI":"10.1109\/CVPR.2017.754"},{"key":"ref22","doi-asserted-by":"crossref","unstructured":"C. Szegedy, W. Liu, Y. Jia, et al. Going deeper with convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1-9, 2015.","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"ref23","unstructured":"A. G. Howard, M. Zhu, B. Chen, et al. MobileNets: Efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04825, 2017."},{"key":"ref24","doi-asserted-by":"crossref","unstructured":"A. Howard, M. Sandler, G. Chu, et al. Searching for MobileNetV3. In Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV), pp. 1314-1324, 2019.","DOI":"10.1109\/ICCV.2019.00140"},{"key":"ref25","doi-asserted-by":"crossref","unstructured":"K. He, X. Zhang, S. Ren, et al. Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 37, no. 9, pp. 1904-1916, 2015.","DOI":"10.1109\/TPAMI.2015.2389824"},{"key":"ref26","unstructured":"J. Redmon and A. Farhadi. YOLOv3: An incremental improvement. arXiv preprint arXiv:1804.02767, 2018."},{"key":"ref27","unstructured":"A. Bochkovskiy, C. Y. Wang, and H. Y. M. Liao. YOLOv4: Optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934, 2020."},{"key":"ref28","doi-asserted-by":"crossref","unstructured":"R. Girshick, J. Donahue, T. Darrell, et al. Rich feature hierarchies for accurate object detection and semantic segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 580-587, 2014.","DOI":"10.1109\/CVPR.2014.81"},{"key":"ref29","doi-asserted-by":"crossref","unstructured":"R. Girshick. Fast R-CNN. In Proceedings of the IEEE international conference on computer vision, pages 1440-1448, 2015.","DOI":"10.1109\/ICCV.2015.169"},{"key":"ref30","doi-asserted-by":"crossref","unstructured":"S. Ren, K. He, R. Girshick, et al. Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39(6):1137-1149, 2016.","DOI":"10.1109\/TPAMI.2016.2577031"},{"key":"ref31","doi-asserted-by":"crossref","unstructured":"K. He, G. Gkioxari, P. Doll\u00e1r, et al. Mask R-CNN. In Proceedings of the IEEE international conference on computer vision, pages 2961-2969, 2017.","DOI":"10.1109\/ICCV.2017.322"},{"key":"ref32","doi-asserted-by":"crossref","unstructured":"J. Miao, G. Zhao, Y. Gao, et al. Fire detection algorithm based on improved YOLOv5. In 2021 International Conference on Control, Automation and Information Sciences (ICCAIS), IEEE, pages 776-781, 2021.","DOI":"10.1109\/ICCAIS52680.2021.9624619"},{"key":"ref33","doi-asserted-by":"crossref","unstructured":"M. Luo, J. Huang, X. Sun, et al. Small Target Forest Fire Recognition Method based on Deep Learning. In 2023 IEEE 3rd International Conference on Information Technology, Big Data and Artificial Intelligence (ICIBA), IEEE, volume 3, pages 593-597, 2023.","DOI":"10.1109\/ICIBA56860.2023.10165608"},{"key":"ref34","doi-asserted-by":"crossref","unstructured":"P. Li and W. Zhao. Image fire detection algorithms based on convolutional neural networks. Case Studies in Thermal Engineering, 19:100625, 2020.","DOI":"10.1016\/j.csite.2020.100625"},{"key":"ref35","doi-asserted-by":"crossref","unstructured":"S. Li, Q. Yan, and P. Liu. An efficient fire detection method based on multiscale feature extraction, implicit deep supervision and channel attention mechanism. IEEE Transactions on Image Processing, 29:8467-8477, 2020.","DOI":"10.1109\/TIP.2020.3016431"},{"key":"ref36","doi-asserted-by":"crossref","unstructured":"S. Majid, F. Alenezi, S. Masood, M. Ahmad, E. S. G\u00fcnd\u00fcz, and K. Polat. Attention based CNN model for fire detection and localization in real-world images. Expert Systems with Applications, 189:116114, 2022.","DOI":"10.1016\/j.eswa.2021.116114"},{"key":"ref37","doi-asserted-by":"crossref","unstructured":"J. Pincott, P.W. Tien, S.Wei, and J. K. Calautit. Indoor fire detection utilizing computer visionbased strategies. Journal of Building Engineering, 61:105154, 2022.","DOI":"10.1016\/j.jobe.2022.105154"},{"key":"ref38","doi-asserted-by":"crossref","unstructured":"L. Zhu, X. Wang, Z. Ke, et al. dynamic sparse attention: Vision transformer with bi-level routing attention. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pages 10323-10333, 2023.","DOI":"10.1109\/CVPR52729.2023.00995"},{"key":"ref39","unstructured":"C. Li, L. Li, Y. Geng, et al. YOLOv6 v3.0: A full-scale reloading. arXiv preprint arXiv:2301.05586, 2023."},{"key":"ref40","unstructured":"Z. Yu, H. Huang, W. Chen, et al. YOLO-Facev2: A scale and occlusion aware face detector. arXiv preprint arXiv:2208.02019, 2022."},{"key":"ref41","unstructured":"M. Siliang and X. Yong. MPDIoU: A loss for efficient and accurate bounding box regression. arXiv preprint arXiv:2307.07662, 2023."}],"container-title":["Computer Science and Information Systems"],"original-title":[],"language":"en","deposited":{"date-parts":[[2025,11,24]],"date-time":"2025-11-24T12:33:55Z","timestamp":1763987635000},"score":1,"resource":{"primary":{"URL":"https:\/\/doiserbia.nb.rs\/Article.aspx?ID=1820-02142500059Z"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025]]},"references-count":41,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2025]]}},"URL":"https:\/\/doi.org\/10.2298\/csis241225059z","relation":{},"ISSN":["1820-0214","2406-1018"],"issn-type":[{"type":"print","value":"1820-0214"},{"type":"electronic","value":"2406-1018"}],"subject":[],"published":{"date-parts":[[2025]]}}}