{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,21]],"date-time":"2025-12-21T06:24:41Z","timestamp":1766298281208,"version":"build-2065373602"},"reference-count":39,"publisher":"MDPI AG","issue":"11","license":[{"start":{"date-parts":[[2023,5,24]],"date-time":"2023-05-24T00:00:00Z","timestamp":1684886400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Special Program for Cultivating Science and Technology Innovation Ability among College Students and Middle School Students in Hebei Province","award":["22E50017D","22567637H","G2021203010","F2021203038"],"award-info":[{"award-number":["22E50017D","22567637H","G2021203010","F2021203038"]}]},{"name":"Innovation Capability Improvement Plan Project of Hebei Province","award":["22E50017D","22567637H","G2021203010","F2021203038"],"award-info":[{"award-number":["22E50017D","22567637H","G2021203010","F2021203038"]}]},{"name":"Hebei Natural Science Foundation","award":["22E50017D","22567637H","G2021203010","F2021203038"],"award-info":[{"award-number":["22E50017D","22567637H","G2021203010","F2021203038"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>The counting of surgical instruments is an important task to ensure surgical safety and patient health. However, due to the uncertainty of manual operations, there is a risk of missing or miscounting instruments. Applying computer vision technology to the instrument counting process can not only improve efficiency, but also reduce medical disputes and promote the development of medical informatization. However, during the counting process, surgical instruments may be densely arranged or obstruct each other, and they may be affected by different lighting environments, all of which can affect the accuracy of instrument recognition. In addition, similar instruments may have only minor differences in appearance and shape, which increases the difficulty of identification. To address these issues, this paper improves the YOLOv7x object detection algorithm and applies it to the surgical instrument detection task. First, the RepLK Block module is introduced into the YOLOv7x backbone network, which can increase the effective receptive field and guide the network to learn more shape features. Second, the ODConv structure is introduced into the neck module of the network, which can significantly enhance the feature extraction ability of the basic convolution operation of the CNN and capture more rich contextual information. At the same time, we created the OSI26 data set, which contains 452 images and 26 surgical instruments, for model training and evaluation. The experimental results show that our improved algorithm exhibits higher accuracy and robustness in surgical instrument detection tasks, with F1, AP, AP50, and AP75 reaching 94.7%, 91.5%, 99.1%, and 98.2%, respectively, which are 4.6%, 3.1%, 3.6%, and 3.9% higher than the baseline. Compared to other mainstream object detection algorithms, our method has significant advantages. These results demonstrate that our method can more accurately identify surgical instruments, thereby improving surgical safety and patient health.<\/jats:p>","DOI":"10.3390\/s23115037","type":"journal-article","created":{"date-parts":[[2023,5,25]],"date-time":"2023-05-25T02:30:06Z","timestamp":1684981806000},"page":"5037","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":23,"title":["Surgical Instrument Detection Algorithm Based on Improved YOLOv7x"],"prefix":"10.3390","volume":"23","author":[{"given":"Boping","family":"Ran","sequence":"first","affiliation":[{"name":"School of Information Science and Engineering, Yanshan University, Qinhuangdao 066000, China"}]},{"given":"Bo","family":"Huang","sequence":"additional","affiliation":[{"name":"School of Information Science and Engineering, Yanshan University, Qinhuangdao 066000, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2015-7000","authenticated-orcid":false,"given":"Shunpan","family":"Liang","sequence":"additional","affiliation":[{"name":"School of Information Science and Engineering, Yanshan University, Qinhuangdao 066000, China"}]},{"given":"Yulei","family":"Hou","sequence":"additional","affiliation":[{"name":"School of Mechanical Engineering, Yanshan University, Qinhuangdao 066000, China"}]}],"member":"1968","published-online":{"date-parts":[[2023,5,24]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1038\/s41746-020-00376-2","article-title":"Deep learning-enabled medical computer vision","volume":"4","author":"Esteva","year":"2021","journal-title":"NPJ Digit. Med."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"1001","DOI":"10.1007\/s40747-022-00815-5","article-title":"Deep learning based brain tumor segmentation: A survey","volume":"9","author":"Liu","year":"2020","journal-title":"Complex Intell. Syst."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"1026","DOI":"10.1038\/s43018-022-00436-4","article-title":"Artificial intelligence in histopathology: Enhancing cancer research and clinical oncology","volume":"3","author":"Shmatko","year":"2022","journal-title":"Nat. Cancer"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"4","DOI":"10.4103\/jpi.jpi_59_18","article-title":"Automated Computational Detection, Quantitation, and Mapping of Mitosis in Whole-Slide Images for Clinically Actionable Surgical Pathology Decision Support","volume":"10","author":"Puri","year":"2019","journal-title":"J. Pathol. Inform."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"1115","DOI":"10.1002\/mp.13978","article-title":"Synthetic CT Generation from CBCT images via Deep Learning","volume":"47","author":"Chen","year":"2019","journal-title":"Med. Phys."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Ma, H. (2021, January 24\u201326). Automatic positioning system of medical service robot based on binocular vision. Proceedings of the 2021 3rd International Symposium on Robotics & Intelligent Manufacturing Technology (ISRIMT), Changzhou, China.","DOI":"10.1109\/ISRIMT53730.2021.9597049"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"61","DOI":"10.1016\/j.media.2016.10.004","article-title":"Efficient multi-scale 3D CNN with fully connected CRF for accurate brain lesion segmentation","volume":"36","author":"Kamnitsas","year":"2016","journal-title":"Med. Image Anal."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"2092","DOI":"10.1109\/TMI.2019.2893944","article-title":"Attention Residual Learning for Skin Lesion Classification","volume":"38","author":"Zhang","year":"2019","journal-title":"IEEE Trans. Med. Imaging"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Hooshangnejad, H., Chen, Q., Feng, X., Zhang, R., and Ding, K. (2023). deepPERFECT: Novel Deep Learning CT Synthesis Method for Expeditious Pancreatic Cancer Radiotherapy. arXiv.","DOI":"10.3390\/cancers15113061"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Sharghi, A., Haugerud, H., Oh, D., and Mohareri, O. (2020). Automatic Operating Room Surgical Activity Recognition for Robot-Assisted Surgery. arXiv.","DOI":"10.1007\/978-3-030-59716-0_37"},{"key":"ref_11","first-page":"249","article-title":"Hardy-Fairbanks, Unintentionally Retained Foreign Objects: A Descriptive Study of 308 Sentinel Events and Contributing Factors","volume":"45","author":"Steelman","year":"2019","journal-title":"Jt. Comm. J. Qual. Patient Saf."},{"key":"ref_12","first-page":"9","article-title":"The patient, case, individual and environmental factors that impact on the surgical count process: An integrative review","volume":"32","author":"Warwick","year":"2019","journal-title":"J. Perioper. Nurs."},{"key":"ref_13","unstructured":"Hua, R.F., and Tie, Q. (, 2014). Application of optimized device placement method in nasal septum device inventory. Proceedings of the 2014 Henan Provincial Hospital Disinfection Supply Center (Room) Standardization Construction and Management Academic Conference, Henan, China."},{"key":"ref_14","first-page":"1835","article-title":"Improving the counting method of surgical instruments and articles to improve the safety of patients\u2019 operation","volume":"20","author":"Huang","year":"2007","journal-title":"J. Nurse Educ."},{"key":"ref_15","first-page":"116","article-title":"Analysis of the application effect of instrument atlas in improving the correct rate of instrument handover in operating room and supply room","volume":"125","author":"Wu","year":"2022","journal-title":"Famous Dr."},{"key":"ref_16","first-page":"55","article-title":"Application of Ultra-High Frequency Electronic Radio Frequency Identification Technology in Automatic Inventory of Surgical Instruments","volume":"35","author":"Ying","year":"2022","journal-title":"Med. Equip."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Lee, J.-D., Chien, J.-C., Hsu, Y.-T., and Wu, C.-T. (2021). Automatic Surgical Instrument Recognition A Case of Comparison Study between the Faster R-CNN, Mask R-CNN, and Single-Shot Multi-Box Detectors. Appl. Sci., 11.","DOI":"10.3390\/app11178097"},{"key":"ref_18","unstructured":"Wang, C.Y., Bochkovskiy, A., and Liao, H.Y. (2022). YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. arXiv."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Wang, S., Raju, A., and Huang, J. (2017, January 18\u201321). Deep learning based multi-label classification for surgical tool presence detection in laparoscopic videos. Proceedings of the 2017 IEEE 14th International Symposium on Biomedical Imaging (ISBI 2017), Melbourne, VIC, Australia.","DOI":"10.1109\/ISBI.2017.7950597"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Wang, Y., Sun, Q., Sun, G., Gu, L., and Liu, Z. (2021, January 3\u20135). Object Detection of Surgical Instruments Based on YOLOv4. Proceedings of the 2021 6th IEEE International Conference on Advanced Robotics and Mechatronics (ICARM), Chongqing, China.","DOI":"10.1109\/ICARM52023.2021.9536075"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Zhou, Y., and Liu, Z. (2022, January 8\u201312). Detection of Surgical Instruments Based on YOLOv5. Proceedings of the 2022 IEEE International Conference on Manipulation, Manufacturing and Measurement on the Nanoscale (3M-NANO), Tianjin, China.","DOI":"10.1109\/3M-NANO56083.2022.9941507"},{"key":"ref_22","first-page":"1123","article-title":"Real-time surgical tool detection in computer-aided surgery based on enhanced feature-fusion convolutional neural network","volume":"9","author":"Liu","year":"2022","journal-title":"J. Comput. Des. Eng."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Jin, A., Yeung, S., Jopling, J., Krause, J., Azagury, D., Milstein, A., and Fei-Fei, L. (2018, January 12\u201315). Tool detection and operative skill assessment in surgical videos using region-based convolutional neural networks. Proceedings of the 2018 IEEE winter conference on applications of computer vision (WACV), Lake Tahoe, NV, USA.","DOI":"10.1109\/WACV.2018.00081"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Kurmann, T., Marquez Neila, P., Du, X., Fua, P., Stoyanov, D., Wolf, S., and Sznitman, R. (2017, January 11\u201313). Simultaneous recognition and pose estimation of instruments in minimally invasive surgery. Proceedings of the Medical Image Computing and Computer-Assisted Intervention\u2014MICCAI 2017: 20th International Conference, Quebec City, QC, Canada. Proceedings, Part II 20.","DOI":"10.1007\/978-3-319-66185-8_57"},{"key":"ref_25","first-page":"51","article-title":"A method for counting surgical instruments based on improved template matching","volume":"28","author":"Wang","year":"2022","journal-title":"Mechatronics"},{"key":"ref_26","unstructured":"Lu, K. (2021). Research on Image Detection Methods of Surgical Instruments Based on Deep Learning. [Master\u2019s Thesis, Tianjin University of Technology]."},{"key":"ref_27","unstructured":"Zhang, W.K. (2021). Research on Surgical Instrument Recognition Based on Fine-Grained Image Classification. [Master\u2019s Thesis, Dalian University of Technology]."},{"key":"ref_28","unstructured":"Liang, P.K. (2022). Research on Image Recognition and Sorting of Surgical Instruments Based on Deep Learning. [Master\u2019s Thesis, Yanshan University]."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s40537-019-0197-0","article-title":"A survey on Image Data Augmentation for Deep Learning","volume":"6","author":"Shorten","year":"2019","journal-title":"J. Big Data"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Zhang, H., Cisse, M., Dauphin, Y.N., and Lopez-Paz, D. (2017). mixup: Beyond Empirical Risk Minimization. arXiv.","DOI":"10.1007\/978-1-4899-7687-1_79"},{"key":"ref_31","unstructured":"Bochkovskiy, A., Wang, C.Y., and Liao, H.Y. (2020). YOLOv4: Optimal Speed and Accuracy of Object Detection. arXiv."},{"key":"ref_32","first-page":"41","article-title":"Small target detection based on improved YOLOv7","volume":"49","author":"Qi","year":"2023","journal-title":"Comput. Eng."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Liu, S., Qi, L., Qin, H., Shi, J., and Jia, J. (2018, January 18\u201323). Path Aggregation Network for Instance Segmentation. Proceedings of the 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00913"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Ding, X., Zhang, X., Han, J., Ding, G., and Sun, J. (2022, January 18\u201324). Scaling Up Your Kernels to 31 \u00d7 31: Revisiting Large Kernel Design in CNNs. Proceedings of the 2022 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.01166"},{"key":"ref_35","unstructured":"Li, C., Zhou, A., and Yao, A. (2022). Omni-Dimensional Dynamic Convolution. arXiv."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Chen, Y., Dai, X., Liu, M., Chen, D., Yuan, L., and Liu, Z. (2020, January 13\u201319). Dynamic Convolution: Attention Over Convolution Kernels. Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01104"},{"key":"ref_37","unstructured":"Yang, B., Bender, G., Le, Q.V., and Ngiam, J. (2019, January 8\u201314). CondConv: Conditionally Parameterized Convolutions for Efficient Inference. Proceedings of the Advances in Neural Information Processing Systems, Vancouver, BC, Canada."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"1345","DOI":"10.1109\/TKDE.2009.191","article-title":"A Survey on Transfer Learning","volume":"22","author":"Pan","year":"2010","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"ref_39","first-page":"15","article-title":"Pavement Disease Detection Model Based on Improved YOLOv5","volume":"49","author":"Wang","year":"2013","journal-title":"Comput. Eng."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/11\/5037\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T19:41:10Z","timestamp":1760125270000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/11\/5037"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,5,24]]},"references-count":39,"journal-issue":{"issue":"11","published-online":{"date-parts":[[2023,6]]}},"alternative-id":["s23115037"],"URL":"https:\/\/doi.org\/10.3390\/s23115037","relation":{},"ISSN":["1424-8220"],"issn-type":[{"type":"electronic","value":"1424-8220"}],"subject":[],"published":{"date-parts":[[2023,5,24]]}}}