{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,16]],"date-time":"2026-03-16T20:35:54Z","timestamp":1773693354440,"version":"3.50.1"},"reference-count":35,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2021,10,27]],"date-time":"2021-10-27T00:00:00Z","timestamp":1635292800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Hubei Provincial Development and Reform Commission Program"},{"name":"Ankon Technologies (Wuhan) Co., Ltd."}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Internet Things"],"published-print":{"date-parts":[[2022,2,28]]},"abstract":"<jats:p>\n            Wireless capsule endoscopy is a modern non-invasive Internet of Medical Imaging Things that has been increasingly used in gastrointestinal tract examination. With about one gigabyte image data generated for a patient in each examination, automatic lesion detection is highly desirable to improve the efficiency of the diagnosis process and mitigate human errors. Despite many approaches for lesion detection have been proposed, they mainly focus on large lesions and are not directly applicable to tiny lesions due to the limitations of feature representation. As bleeding lesions are a common symptom in most serious gastrointestinal diseases, detecting tiny bleeding lesions is extremely important for early diagnosis of those diseases, which is highly relevant to the survival, treatment, and expenses of patients. In this article, a method is proposed to extract and fuse multi-scale deep features for detecting and locating both large and tiny lesions. A feature extracting network is first used as our backbone network to extract the basic features from wireless capsule endoscopy images, and then at each layer multiple regions could be identified as potential lesions. As a result, the features maps of those potential lesions are obtained at each level and fused in a top-down manner to the fully connected layer for producing final detection results. Our proposed method has been evaluated on a clinical dataset that contains 20,000 wireless capsule endoscopy images with clinical annotation. Experimental results demonstrate that our method can achieve 98.9% prediction accuracy and 93.5%\n            <jats:inline-formula content-type=\"math\/tex\">\n              <jats:inline-graphic xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" content-type=\"gif\" xlink:href=\"3477540-inline1.gif\"\/>\n            <\/jats:inline-formula>\n            score, which has a significant performance improvement of up to 31.69% and 22.12% in terms of recall rate and\n            <jats:inline-formula content-type=\"math\/tex\">\n              <jats:inline-graphic xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" content-type=\"gif\" xlink:href=\"3477540-inline2.gif\"\/>\n            <\/jats:inline-formula>\n            score, respectively, when compared to the state-of-the-art approaches for both large and tiny bleeding lesions. Moreover, our model also has the highest AP and the best medical diagnosis performance compared to state-of-the-art multi-scale models.\n          <\/jats:p>","DOI":"10.1145\/3477540","type":"journal-article","created":{"date-parts":[[2021,10,27]],"date-time":"2021-10-27T17:26:03Z","timestamp":1635355563000},"page":"1-19","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":6,"title":["Multi-scale Features Fusion for the Detection of Tiny Bleeding in Wireless Capsule Endoscopy Images"],"prefix":"10.1145","volume":"3","author":[{"given":"Feng","family":"Lu","sequence":"first","affiliation":[{"name":"Huazhong University of Science and Technology, Wu Han, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4731-3226","authenticated-orcid":false,"given":"Wei","family":"Li","sequence":"additional","affiliation":[{"name":"The University of Sydney, Darlington, NSW, Australia"}]},{"given":"Song","family":"Lin","sequence":"additional","affiliation":[{"name":"Huazhong University of Science and Technology, Wu Han, China"}]},{"given":"Chengwangli","family":"Peng","sequence":"additional","affiliation":[{"name":"Huazhong University of Science and Technology, Wu Han, China"}]},{"given":"Zhiyong","family":"Wang","sequence":"additional","affiliation":[{"name":"The University of Sydney, Darlington, NSW, Australia"}]},{"given":"Bin","family":"Qian","sequence":"additional","affiliation":[{"name":"Newcastle University, Newcastle upon Tyne, United Kingdom"}]},{"given":"Rajiv","family":"Ranjan","sequence":"additional","affiliation":[{"name":"Newcastle University, Newcastle upon Tyne, United Kingdom"}]},{"given":"Hai","family":"Jin","sequence":"additional","affiliation":[{"name":"Huazhong University of Science and Technology, Wu Han, China"}]},{"given":"Albert Y.","family":"Zomaya","sequence":"additional","affiliation":[{"name":"The University of Sydney, Darlington, NSW, Australia"}]}],"member":"320","published-online":{"date-parts":[[2021,10,27]]},"reference":[{"key":"e_1_3_1_2_2","first-page":"354","volume-title":"European Conference on Computer Vision","author":"Cai Zhaowei","year":"2016","unstructured":"Zhaowei Cai, Quanfu Fan, Rogerio S. Feris, and Nuno Vasconcelos. 2016. A unified multi-scale deep convolutional neural network for fast object detection. In European Conference on Computer Vision. Springer, 354\u2013370."},{"key":"e_1_3_1_3_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2021.3057172"},{"key":"e_1_3_1_4_2","doi-asserted-by":"publisher","DOI":"10.1109\/RBME.2011.2171182"},{"key":"e_1_3_1_5_2","first-page":"1746","volume-title":"IEEE International Conference on Information and Automation","author":"Cui Lei","year":"2010","unstructured":"Lei Cui, Chao Hu, Yuexian Zou, and Max Q.-H. Meng. 2010. Bleeding detection in wireless capsule endoscopy images by support vector classifier. In IEEE International Conference on Information and Automation. IEEE, 1746\u20131751."},{"key":"e_1_3_1_6_2","doi-asserted-by":"publisher","DOI":"10.1038\/nature21056"},{"key":"e_1_3_1_7_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2009.167"},{"key":"e_1_3_1_8_2","first-page":"1","volume-title":"IEEE International Midwest Symposium on Circuits and Systems","author":"Fu Yanan","year":"2011","unstructured":"Yanan Fu, Mrinal Mandal, and Gencheng Guo. 2011. Bleeding region detection in WCE images based on color features and neural network. In IEEE International Midwest Symposium on Circuits and Systems. IEEE, 1\u20134."},{"key":"e_1_3_1_9_2","first-page":"000256","volume-title":"IEEE International Symposium on Signal Processing and Information Technology","author":"Ghosh Tonmoy","year":"2014","unstructured":"Tonmoy Ghosh, S. K. Bashar, Shaikh Anowarul Fattah, Celia Shahnaz, and Khan A. Wahid. 2014. A feature extraction scheme from region of interest of wireless capsule endoscopy images for automatic bleeding detection. In IEEE International Symposium on Signal Processing and Information Technology. IEEE, 000256\u2013000260."},{"key":"e_1_3_1_10_2","first-page":"1","volume-title":"International Conference on Informatics, Electronics & Vision","author":"Ghosh Tonmoy","year":"2014","unstructured":"Tonmoy Ghosh, Syed Khairul Bashar, Md Samiul Alam, Khan Wahid, and Shaikh Anowarul Fattah. 2014. A statistical feature based novel method to detect bleeding in wireless capsule endoscopy images. In International Conference on Informatics, Electronics & Vision. IEEE, 1\u20134."},{"key":"e_1_3_1_11_2","first-page":"1","volume-title":"IEEE Region 10 Conference","author":"Ghosh T.","year":"2015","unstructured":"T. Ghosh, S. A. Fattah, C. Shahnaz, A. K. Kundu, and M. N. Rizve. 2015. Block based histogram feature extraction method for bleeding detection in wireless capsule endoscopy. In IEEE Region 10 Conference. IEEE, 1\u20134."},{"key":"e_1_3_1_12_2","first-page":"12595","volume-title":"IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Guo Chaoxu","year":"2020","unstructured":"Chaoxu Guo, Bin Fan, Qian Zhang, Shiming Xiang, and Chunhong Pan. 2020. AugFPN: Improving multi-scale feature learning for object detection. In IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 12595\u201312604."},{"key":"e_1_3_1_13_2","first-page":"770","volume-title":"IEEE Conference on Computer Vision and Pattern Recognition","author":"He Kaiming","year":"2016","unstructured":"Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 770\u2013778."},{"key":"e_1_3_1_14_2","article-title":"Statistical Communique of National Health Commission","author":"China National health commission of the People\u2019s Republic of","year":"2017","unstructured":"National health commission of the People\u2019s Republic of China. 2017. Statistical Communique of National Health Commission. Retrieved from http:\/\/www.nhc.gov.cn\/guihuaxxs\/s10748\/201708\/d82fa7141696407abb4ef764f3edf095.shtml.","journal-title":"Retrieved from http:\/\/www.nhc.gov.cn\/guihuaxxs\/s10748\/201708\/d82fa7141696407abb4ef764f3edf095.shtml"},{"key":"e_1_3_1_15_2","first-page":"951","volume-title":"IEEE Conference on Computer Vision and Pattern Recognition","author":"Hu Peiyun","year":"2017","unstructured":"Peiyun Hu and Deva Ramanan. 2017. Finding tiny faces. In IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 951\u2013959."},{"key":"e_1_3_1_16_2","doi-asserted-by":"publisher","DOI":"10.1038\/nrgastro.2015.13"},{"key":"e_1_3_1_17_2","first-page":"639","volume-title":"Annual International Conference of the IEEE Engineering in Medicine and Biology Society","author":"Jia Xiao","year":"2016","unstructured":"Xiao Jia and Max Q.-H. Meng. 2016. A deep convolutional neural network for bleeding detection in wireless capsule endoscopy images. In Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE, 639\u2013642."},{"key":"e_1_3_1_18_2","first-page":"3154","volume-title":"Annual International Conference of the IEEE Engineering in Medicine and Biology Society","author":"Jia Xiao","year":"2017","unstructured":"Xiao Jia and Max Q.-H. Meng. 2017. Gastrointestinal bleeding detection in wireless capsule endoscopy images using handcrafted and CNN features. In Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE, 3154\u20133157."},{"key":"e_1_3_1_19_2","article-title":"Zoom out-and-in network with recursive training for object proposal","author":"Li Hongyang","year":"2017","unstructured":"Hongyang Li, Yu Liu, Wanli Ouyang, and Xiaogang Wang. 2017. Zoom out-and-in network with recursive training for object proposal. arXiv preprint arXiv:1702.05711 (2017).","journal-title":"arXiv preprint arXiv:1702.05711"},{"key":"e_1_3_1_20_2","first-page":"1951","article-title":"Perceptual generative adversarial networks for small object detection","author":"Li Jianan","year":"2017","unstructured":"Jianan Li, Xiaodan Liang, Yunchao Wei, Tingfa Xu, Jiashi Feng, and Shuicheng Yan. 2017. Perceptual generative adversarial networks for small object detection. In IEEE Conference on Computer Vision and Pattern Recognition. 1951\u20131959.","journal-title":"IEEE Conference on Computer Vision and Pattern Recognition"},{"issue":"3","key":"e_1_3_1_21_2","first-page":"661","article-title":"Object detection based on deep learning for urine sediment examination","volume":"38","author":"Liang Yixiong","year":"2018","unstructured":"Yixiong Liang, Zhihong Tang, Meng Yan, and Jianfeng Liu. 2018. Object detection based on deep learning for urine sediment examination. Biocyber. Biomed. Eng. 38, 3 (2018), 661\u2013670.","journal-title":"Biocyber. Biomed. Eng."},{"key":"e_1_3_1_22_2","first-page":"2117","volume-title":"IEEE Conference on Computer Vision and Pattern Recognition","author":"Lin Tsung-Yi","year":"2017","unstructured":"Tsung-Yi Lin, Piotr Doll\u00e1r, Ross Girshick, Kaiming He, Bharath Hariharan, and Serge Belongie. 2017. Feature pyramid networks for object detection. In IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 2117\u20132125."},{"key":"e_1_3_1_23_2","first-page":"21","volume-title":"European Conference on Computer Vision","author":"Liu Wei","year":"2016","unstructured":"Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, and Alexander C. Berg. 2016. SSD: Single shot multibox detector. In European Conference on Computer Vision. Springer, 21\u201337."},{"key":"e_1_3_1_24_2","doi-asserted-by":"publisher","DOI":"10.5555\/3045390.3045445"},{"key":"e_1_3_1_25_2","first-page":"3431","volume-title":"IEEE Conference on Computer Vision and Pattern Recognition","author":"Long Jonathan","year":"2015","unstructured":"Jonathan Long, Evan Shelhamer, and Trevor Darrell. 2015. Fully convolutional networks for semantic segmentation. In IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 3431\u20133440."},{"key":"e_1_3_1_26_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.178"},{"key":"e_1_3_1_27_2","first-page":"76","volume-title":"IEEE Conference on Cybernetics and Intelligent Systems","author":"Poh Chee Khun","year":"2010","unstructured":"Chee Khun Poh, That Mon Htwe, Liyuan Li, Weijia Shen, Jiang Liu, Joo Hwee Lim, Kap Luk Chan, and Ping Chun Tan. 2010. Multi-level local feature classification for bleeding detection in wireless capsule endoscopy images. In IEEE Conference on Cybernetics and Intelligent Systems. IEEE, 76\u201381."},{"key":"e_1_3_1_28_2","doi-asserted-by":"publisher","DOI":"10.5555\/2969239.2969250"},{"key":"e_1_3_1_29_2","doi-asserted-by":"publisher","DOI":"10.1109\/MC.2017.9"},{"key":"e_1_3_1_30_2","article-title":"Very deep convolutional networks for large-scale image recognition","volume":"1409","author":"Simonyan Karen","year":"2014","unstructured":"Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. CoRR abs\/1409.1556 (2014).","journal-title":"CoRR"},{"key":"e_1_3_1_31_2","first-page":"2700","volume-title":"IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Tang Chang","year":"2019","unstructured":"Chang Tang, Xinzhong Zhu, Xinwang Liu, Lizhe Wang, and Albert Zomaya. 2019. DeFusionNet: Defocus blur detection via recurrently fusing and refining multi-scale deep features. In IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 2700\u20132709."},{"key":"e_1_3_1_32_2","doi-asserted-by":"publisher","DOI":"10.1109\/JBHI.2015.2399502"},{"key":"e_1_3_1_33_2","first-page":"4203","volume-title":"IEEE Conference on Computer Vision and Pattern Recognition","author":"Zhang Shifeng","year":"2018","unstructured":"Shifeng Zhang, Longyin Wen, Xiao Bian, Zhen Lei, and Stan Z. Li. 2018. Single-shot refinement neural network for object detection. In IEEE Conference on Computer Vision and Pattern Recognition. 4203\u20134212."},{"key":"e_1_3_1_34_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.33019259"},{"key":"e_1_3_1_35_2","first-page":"372","volume-title":"International Congress on Image and Signal Processing","author":"Zhu Rongsheng","year":"2015","unstructured":"Rongsheng Zhu, Rong Zhang, and Dixiu Xue. 2015. Lesion detection of endoscopy images based on convolutional neural network features. In International Congress on Image and Signal Processing. IEEE, 372\u2013376."},{"key":"e_1_3_1_36_2","first-page":"1274","volume-title":"IEEE International Conference on Digital Signal Processing","author":"Zou Yuexian","year":"2015","unstructured":"Yuexian Zou, Lei Li, Yi Wang, Jiasheng Yu, Yi Li, and W. J. Deng. 2015. Classifying digestive organs in wireless capsule endoscopy images based on deep convolutional neural network. In IEEE International Conference on Digital Signal Processing. IEEE, 1274\u20131278."}],"container-title":["ACM Transactions on Internet of Things"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3477540","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3477540","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T18:10:37Z","timestamp":1750183837000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3477540"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,10,27]]},"references-count":35,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2022,2,28]]}},"alternative-id":["10.1145\/3477540"],"URL":"https:\/\/doi.org\/10.1145\/3477540","relation":{},"ISSN":["2691-1914","2577-6207"],"issn-type":[{"value":"2691-1914","type":"print"},{"value":"2577-6207","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,10,27]]},"assertion":[{"value":"2020-05-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-07-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-10-27","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}