{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,13]],"date-time":"2025-11-13T05:35:47Z","timestamp":1763012147226,"version":"3.45.0"},"reference-count":37,"publisher":"MDPI AG","issue":"11","license":[{"start":{"date-parts":[[2025,11,11]],"date-time":"2025-11-11T00:00:00Z","timestamp":1762819200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100006407","name":"Natural Science Foundation of Henan","doi-asserted-by":"crossref","award":["252300421063"],"award-info":[{"award-number":["252300421063"]}],"id":[{"id":"10.13039\/501100006407","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Key Research Project of Henan Province Universities","award":["24ZX005"],"award-info":[{"award-number":["24ZX005"]}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62076223"],"award-info":[{"award-number":["62076223"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Information"],"abstract":"<jats:p>The You Only Look Once (YOLO) series of models, particularly the recently introduced YOLOv12 model, have demonstrated significant potential in achieving accurate and rapid recognition of electric power operation violations, due to their comprehensive advantages in detection accuracy and real-time inference. However, the current YOLO models still have three limitations: (1) the absence of a dedicated feature extraction for multi-scale objects, resulting in suboptimal detection capabilities for objects with varying sizes; (2) naive integration of spatial and channel attentions, which restricts the enhancement of feature discriminability and consequently impairs the detection performance for challenging objects in complex backgrounds; and (3) weak representation capability in low-level features, leading to insufficient accuracy for small-sized objects. To address these limitations, a novel YOLO model named DFA-YOLO is proposed, a real-time object detection model with YOLOv12n as its baseline, which makes three key contributions. Firstly, a dynamic weighted multi-scale convolution (DWMConv) module is proposed to address the first limitation, which employs lightweight multi-scale convolution followed by learnable weighted fusion to enhance feature representation for multi-scale objects. Secondly, a full-dimensional attention (FDA) module is proposed to address the second limitation, which gives a unified attention computation scheme that effectively integrates attention across height, width, and channel dimensions, thereby improving feature discriminability. Thirdly, a set of auxiliary detection heads (Aux-Heads) are introduced to address the third limitation and inserted into the backbone network to strengthen the training effect of labels on the low-level feature extraction module. The ablation studies on the EPOVR-v1.0 dataset demonstrate the validity of the proposed DWMConv module, FDA module, Aux-Heads, and their synergistic integration. Relative to the baseline model, DFA-YOLO achieves significant improvements in mAP@0.5 and mAP@0.5\u20130.95, by 3.15% and 4.13%, respectively, meanwhile reducing parameters and GFLOPS by 0.06M and 0.06, respectively, and increasing FPS by 3.52. Comprehensive quantitative comparisons with nine official YOLO models, including YOLOv13n, confirm that DFA-YOLO achieves superior performance in both detection precision and real-time inference, further validating the effectiveness of the DFA-YOLO model.<\/jats:p>","DOI":"10.3390\/info16110974","type":"journal-article","created":{"date-parts":[[2025,11,11]],"date-time":"2025-11-11T12:44:55Z","timestamp":1762865095000},"page":"974","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["DFA-YOLO: A Novel YOLO Model for Electric Power Operation Violation Recognition"],"prefix":"10.3390","volume":"16","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-4328-6411","authenticated-orcid":false,"given":"Xiaoliang","family":"Qian","sequence":"first","affiliation":[{"name":"College of Electrical and Information Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xinyu","family":"Ding","sequence":"additional","affiliation":[{"name":"College of Electrical and Information Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Pengfei","family":"Wang","sequence":"additional","affiliation":[{"name":"College of Electrical and Information Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jungang","family":"Guo","sequence":"additional","affiliation":[{"name":"Zhengzhou Fengjia Technology Co., Ltd., Zhengzhou 450009, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3925-998X","authenticated-orcid":false,"given":"Hu","family":"Chen","sequence":"additional","affiliation":[{"name":"College of Electrical and Information Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8770-3862","authenticated-orcid":false,"given":"Wei","family":"Wang","sequence":"additional","affiliation":[{"name":"College of Electrical and Information Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Peixu","family":"Xing","sequence":"additional","affiliation":[{"name":"College of Electrical and Information Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2025,11,11]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Meng, L., He, D., Ban, G., Xi, G., Li, A., and Zhu, X. (2025). Active Hard Sample Learning for Violation Action Recognition in Power Grid Operation. Information, 16.","DOI":"10.3390\/info16010067"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"319","DOI":"10.1007\/s10462-024-10978-x","article-title":"A systematic review of computer vision-based personal protective equipment compliance in industry practice: Advancements, challenges and future directions","volume":"57","author":"Vukicevic","year":"2024","journal-title":"Artif. Intell. Rev."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"5637","DOI":"10.1007\/s40747-023-01028-0","article-title":"A high-performance framework for personal protective equipment detection on the offshore drilling platform","volume":"9","author":"Ji","year":"2023","journal-title":"Complex Intell. Syst."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Girshick, R., Donahue, J., Darrell, T., and Malik, J. (2014, January 23\u201328). Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.81"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","article-title":"Faster R-CNN: Towards real-time object detection with region proposal networks","volume":"39","author":"Ren","year":"2016","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"5605209","DOI":"10.1109\/TGRS.2023.3256373","article-title":"Building a Bridge of Bounding Box Regression Between Oriented and Horizontal Object Detection in Remote Sensing Images","volume":"61","author":"Qian","year":"2023","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_7","first-page":"5628411","article-title":"Incorporating Multiscale Context and Task-consistent Focal Loss into Oriented Object Detection","volume":"11","author":"Qian","year":"2025","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_8","first-page":"5630414","article-title":"IPS-YOLO: Iterative Pseudo-fully Supervised Training of YOLO for Weakly Supervised Object Detection in Remote Sensing Images","volume":"14","author":"Qian","year":"2025","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (2016, January 27\u201330). You only look once: Unified, real-time object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.91"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Redmon, J., and Farhadi, A. (2017, January 21\u201326). YOLO9000: Better, faster, stronger. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.690"},{"key":"ref_11","first-page":"1","article-title":"Yolov3: An incremental improvement","volume":"Volume 1804","author":"Farhadi","year":"2018","journal-title":"Computer Vision and Pattern Recognition"},{"key":"ref_12","unstructured":"Bochkovskiy, A., Wang, C.Y., and Liao, H.Y.M. (2020). Yolov4: Optimal speed and accuracy of object detection. arXiv."},{"key":"ref_13","unstructured":"Ultralytics (2022, November 22). YOLOv5 (v7.0). Available online: https:\/\/github.com\/ultralytics\/yolov5."},{"key":"ref_14","unstructured":"Li, C., Li, L., Jiang, H., Weng, K., Geng, Y., Li, L., Ke, Z., Li, Q., Cheng, M., and Nie, W. (2022). YOLOv6: A single-stage object detection framework for industrial applications. arXiv."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Wang, C.Y., Bochkovskiy, A., and Liao, H.Y.M. (2023, January 17\u201324). YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada.","DOI":"10.1109\/CVPR52729.2023.00721"},{"key":"ref_16","unstructured":"Jocher, G. (2023, January 10). YOLOv8. Available online: https:\/\/github.com\/ultralytics\/ultralytics."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Wang, C.Y., Yeh, I.H., and Liao, H.Y.M. (2024). Yolov9: Learning what you want to learn using programmable gradient information. European Conference on Computer Vision, Springer Nature.","DOI":"10.1007\/978-3-031-72751-1_1"},{"key":"ref_18","unstructured":"Wang, A., Chen, H., Liu, L., Chen, K., Lin, Z., Han, J., and Ding, G. (2024). Yolov10: Real-time end-to-end object detection. arXiv."},{"key":"ref_19","unstructured":"Khanam, R., and Hussain, M. (2024). YOLOv11: An overview of the key architectural enhancements. arXiv."},{"key":"ref_20","unstructured":"Lei, M., Li, S., Wu, Y., Hu, H., Zhou, Y., Zheng, X., and Gao, Y. (2025). YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual Perception. arXiv."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Wu, C., Cai, C., Xiao, F., Wang, J., Guo, Y., and Ma, L. (2025). YOLO-LSM: A Lightweight UAV Target Detection Algorithm Based on Shallow and Multiscale Information Learning. Information, 16.","DOI":"10.3390\/info16050393"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Wang, Y., Cao, M., Yang, Q., Zhang, Y., and Wang, Z. (2025). YOLO-SSFA: A Lightweight Real-Time Infrared Detection Method for Small Targets. Information, 16.","DOI":"10.3390\/info16070618"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Scapinello Aquino, L., Rodrigues Agottani, L.F., Seman, L.O., Cocco Mariani, V., Coelho, L.D.S., and Gonz\u00e1lez, G.V. (2015). Fault Detection in Power Distribution Systems Using Sensor Data and Hybrid YOLO with Adaptive Context Refinement. Appl. Sci., 15.","DOI":"10.3390\/app15169186"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Ji, Y., Ma, T., Shen, H., Feng, H., Zhang, Z., Li, D., and He, Y. (2025). Transmission Line Defect Detection Algorithm Based on Improved YOLOv12. Electronics, 14.","DOI":"10.3390\/electronics14122432"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Qian, X., Li, Y., Ding, X., Luo, L., Guo, J., Wang, W., and Xing, P. (2025). A Real-Time DAO-YOLO Model for Electric Power Operation Violation Recognition. Appl. Sci., 15.","DOI":"10.3390\/app15084492"},{"key":"ref_26","unstructured":"Tian, Y., Ye, Q., and Doermann, D. (2025). Yolov12: Attention-centric real-time object detectors. arXiv."},{"key":"ref_27","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., and Polosukhin, I. (2017, January 4\u20139). Attention is all you need. Proceedings of the 31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA."},{"key":"ref_28","unstructured":"Howard, A.G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., and Adam, H. (2017). Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv."},{"key":"ref_29","unstructured":"Ioffe, S., and Szegedy, C. (2015, January 6\u201311). Batch normalization: Accelerating deep network training by reducing internal covariate shift. Proceedings of the 32nd International Conference on Machine Learning, Lille, France."},{"key":"ref_30","unstructured":"Ramachandran, P., Zoph, B., and Le, Q.V. (2017). Searching for activation functions. arXiv."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"533","DOI":"10.1038\/323533a0","article-title":"Learning representations by back-propagating errors","volume":"323","author":"Rumelhart","year":"1986","journal-title":"Nature"},{"key":"ref_32","unstructured":"Tianchi Platform, Guangdong Power Information Technology Co., Ltd. (2024, July 23). Guangdong Power Grid Smart Field Operation Challenge, Track 3: High-Altitude Operation and Safety Belt Wearing Dataset. Dataset., Available online: https:\/\/tianchi.aliyun.com\/specials\/promotion\/gzgrid."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., and Zitnick, C.L. (2014). Microsoft COCO: Common objects in context. European Conference on Computer Vision, Springer International Publishing.","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Wang, Q., Wu, B., Zhu, P., Li, P., Zuo, W., and Hu, Q. (2020, January 14\u201319). ECA-Net: Efficient channel attention for deep convolutional neural networks. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01155"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Woo, S., Park, J., Lee, J.Y., and Kweon, I.S. (2018, January 8\u201314). CBAM: Convolutional block attention module. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_1"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Hu, J., Shen, L., and Sun, G. (2018, January 18\u201323). Squeeze-and-excitation networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00745"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Akyon, F.C., Altinuc, S.O., and Temizel, A. (2022, January 16\u201319). Slicing aided hyper inference and fine-tuning for small object detection. Proceedings of the 2022 IEEE International Conference on Image Processing, Bordeaux, France.","DOI":"10.1109\/ICIP46576.2022.9897990"}],"container-title":["Information"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2078-2489\/16\/11\/974\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,11,13]],"date-time":"2025-11-13T05:32:25Z","timestamp":1763011945000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2078-2489\/16\/11\/974"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,11,11]]},"references-count":37,"journal-issue":{"issue":"11","published-online":{"date-parts":[[2025,11]]}},"alternative-id":["info16110974"],"URL":"https:\/\/doi.org\/10.3390\/info16110974","relation":{},"ISSN":["2078-2489"],"issn-type":[{"type":"electronic","value":"2078-2489"}],"subject":[],"published":{"date-parts":[[2025,11,11]]}}}