{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,17]],"date-time":"2026-04-17T14:37:08Z","timestamp":1776436628750,"version":"3.51.2"},"reference-count":71,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T00:00:00Z","timestamp":1674604800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["91846201 and 72101079"],"award-info":[{"award-number":["91846201 and 72101079"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100012226","name":"Fundamental Research Funds for the Central Universities","doi-asserted-by":"crossref","award":["JZ2021HGPA0060"],"award-info":[{"award-number":["JZ2021HGPA0060"]}],"id":[{"id":"10.13039\/501100012226","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Shanghai Data Exchange Cooperative Program","award":["W2021JSZX0052"],"award-info":[{"award-number":["W2021JSZX0052"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Manage. Inf. Syst."],"published-print":{"date-parts":[[2023,6,30]]},"abstract":"<jats:p>Medical image annotation aims to automatically describe the content of medical images. It helps doctors to understand the content of medical images and make better informed decisions like diagnoses. Existing methods mainly follow the approach for natural images and fail to emphasize the object abnormalities, which is the essence of medical images annotation. In light of this, we propose to transform the medical image annotation to a multi-label classification problem, where object abnormalities are focused directly. However, extant multi-label classification studies rely on arduous feature engineering, or do not solve label correlation issues well in medical images. To solve these problems, we propose a novel deep learning model where a frequent pattern mining component and an adversarial-based denoising autoencoder component are introduced. Extensive experiments are conducted on a real retinal image dataset to evaluate the performance of the proposed model. Results indicate that the proposed model significantly outperforms image captioning baselines and multi-label classification baselines.<\/jats:p>","DOI":"10.1145\/3561653","type":"journal-article","created":{"date-parts":[[2022,9,15]],"date-time":"2022-09-15T09:53:50Z","timestamp":1663235630000},"page":"1-21","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":25,"title":["A Multi-Label Classification with an Adversarial-Based Denoising Autoencoder for Medical Image Annotation"],"prefix":"10.1145","volume":"14","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-0260-7589","authenticated-orcid":false,"given":"Yidong","family":"Chai","sequence":"first","affiliation":[{"name":"School of Management of Hefei University of Technology, Key Laboratory of Process Optimization and Intelligence Decision Making, Minister of Education, Hefei, Anhui, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4902-1078","authenticated-orcid":false,"given":"Hongyan","family":"Liu","sequence":"additional","affiliation":[{"name":"Research Center for Contemporary Management, School of Economics and Management, Tsinghua University, Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2039-7055","authenticated-orcid":false,"given":"Jie","family":"Xu","sequence":"additional","affiliation":[{"name":"Beijing Institute of Ophthalmology, Beijing Tongren Hospital, Capital Medical University, Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4513-805X","authenticated-orcid":false,"given":"Sagar","family":"Samtani","sequence":"additional","affiliation":[{"name":"Kelley School of Business, Indiana University, Bloomington, Indiana, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0886-3647","authenticated-orcid":false,"given":"Yuanchun","family":"Jiang","sequence":"additional","affiliation":[{"name":"School of Management of Hefei University of Technology, Key Laboratory of Process Optimization and Intelligence Decision Making, Minister of Education, Hefei, Anhui, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9237-0708","authenticated-orcid":false,"given":"Haoxin","family":"Liu","sequence":"additional","affiliation":[{"name":"School of Management of Hefei University of Technology, Key Laboratory of Process Optimization and Intelligence Decision Making, Minister of Education, Hefei, Anhui, China"}]}],"member":"320","published-online":{"date-parts":[[2023,1,25]]},"reference":[{"key":"e_1_3_1_2_1","doi-asserted-by":"publisher","DOI":"10.25300\/MISQ\/2020\/14644"},{"key":"e_1_3_1_3_1","first-page":"113120V","volume-title":"Medical Imaging 2020: Physics of Medical Imaging","author":"Bhadra Sayantan","year":"2020","unstructured":"Sayantan Bhadra, Weimin Zhou, and Mark A. Anastasio. 2020. Medical image reconstruction with image-adaptive priors learned by use of generative adversarial networks. In Medical Imaging 2020: Physics of Medical Imaging, Vol. 11312. International Society for Optics and Photonics, 113120V."},{"issue":"9","key":"e_1_3_1_4_1","doi-asserted-by":"crossref","first-page":"1757","DOI":"10.1016\/j.patcog.2004.03.009","article-title":"Learning multi-label scene classification","volume":"37","author":"Boutell Matthew R.","year":"2004","unstructured":"Matthew R. Boutell, Jiebo Luo, Xipeng Shen, and Christopher M. Brown. 2004. Learning multi-label scene classification. Pattern Recognition 37, 9 (2004), 1757\u20131771.","journal-title":"Pattern Recognition"},{"key":"e_1_3_1_5_1","first-page":"10486","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","volume":"34","author":"Cao Bing","year":"2020","unstructured":"Bing Cao, Han Zhang, Nannan Wang, Xinbo Gao, and Dinggang Shen. 2020. Auto-GAN: Self-supervised collaborative learning for medical image synthesis. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34. 10486\u201310493."},{"key":"e_1_3_1_6_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2020.102454"},{"key":"e_1_3_1_7_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2018.07.043"},{"key":"e_1_3_1_8_1","article-title":"Multi-label chest X-ray image classification via semantic similarity graph embedding","author":"Chen Bingzhi","year":"2021","unstructured":"Bingzhi Chen, Zheng Zhang, Yingjian Li, Guangming Lu, and David Zhang. 2021. Multi-label chest X-ray image classification via semantic similarity graph embedding. IEEE Transactions on Circuits and Systems for Video Technology (2021).","journal-title":"IEEE Transactions on Circuits and Systems for Video Technology"},{"key":"e_1_3_1_9_1","doi-asserted-by":"crossref","first-page":"101811","DOI":"10.1016\/j.media.2020.101811","article-title":"Deep hiearchical multi-label classification applied to chest X-ray abnormality taxonomies","volume":"66","author":"Chen Haomin","year":"2020","unstructured":"Haomin Chen, Shun Miao, Daguang Xu, Gregory D. Hager, and Adam P. Harrison. 2020a. Deep hiearchical multi-label classification applied to chest X-ray abnormality taxonomies. Medical Image Analysis 66 (2020), 101811.","journal-title":"Medical Image Analysis"},{"key":"e_1_3_1_10_1","first-page":"616","volume-title":"International Conference on Medical Image Computing and Computer-Assisted Intervention","author":"Chen Wenting","year":"2020","unstructured":"Wenting Chen, Shuang Yu, Junde Wu, Kai Ma, Cheng Bian, Chunyan Chu, Linlin Shen, and Yefeng Zheng. 2020b. TR-GAN: Topology ranking GAN with triplet loss for retinal artery\/vein classification. In International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer, 616\u2013625."},{"issue":"1","key":"e_1_3_1_11_1","doi-asserted-by":"crossref","first-page":"138","DOI":"10.1007\/s11682-018-9846-8","article-title":"Robust multi-label transfer feature learning for early diagnosis of Alzheimer\u2019s disease","volume":"13","author":"Cheng Bo","year":"2019","unstructured":"Bo Cheng, Mingxia Liu, Daoqiang Zhang, Dinggang Shen, and Alzheimer\u2019s Disease Neuroimaging Initiative. 2019. Robust multi-label transfer feature learning for early diagnosis of Alzheimer\u2019s disease. Brain Imaging and Behavior 13, 1 (2019), 138\u2013153.","journal-title":"Brain Imaging and Behavior"},{"key":"e_1_3_1_12_1","article-title":"A thorough review on recent deep learning methodologies for image captioning","author":"Elhagry Ahmed","year":"2021","unstructured":"Ahmed Elhagry and Karima Kadaoui. 2021. A thorough review on recent deep learning methodologies for image captioning. arXiv preprint arXiv:2107.13114 (2021).","journal-title":"arXiv preprint arXiv:2107.13114"},{"key":"e_1_3_1_13_1","doi-asserted-by":"crossref","first-page":"321","DOI":"10.1016\/j.neucom.2018.09.013","article-title":"GAN-based synthetic medical image augmentation for increased CNN performance in liver lesion classification","volume":"321","author":"Frid-Adar Maayan","year":"2018","unstructured":"Maayan Frid-Adar, Idit Diamant, Eyal Klang, Michal Amitai, Jacob Goldberger, and Hayit Greenspan. 2018. GAN-based synthetic medical image augmentation for increased CNN performance in liver lesion classification. Neurocomputing 321 (2018), 321\u2013331.","journal-title":"Neurocomputing"},{"issue":"7","key":"e_1_3_1_14_1","doi-asserted-by":"crossref","first-page":"962","DOI":"10.1016\/j.ophtha.2017.02.008","article-title":"Automated identification of diabetic retinopathy using deep learning","volume":"124","author":"Gargeya Rishab","year":"2017","unstructured":"Rishab Gargeya and Theodore Leng. 2017. Automated identification of diabetic retinopathy using deep learning. Ophthalmology 124, 7 (2017), 962\u2013969.","journal-title":"Ophthalmology"},{"key":"e_1_3_1_15_1","doi-asserted-by":"crossref","first-page":"22","DOI":"10.1007\/978-3-540-24775-3_5","volume-title":"Pacific-Asia Conference on Knowledge Discovery and Data Mining","author":"Godbole Shantanu","year":"2004","unstructured":"Shantanu Godbole and Sunita Sarawagi. 2004. Discriminative methods for multi-labeled classification. In Pacific-Asia Conference on Knowledge Discovery and Data Mining. Springer, 22\u201330."},{"key":"e_1_3_1_16_1","article-title":"Deep convolutional ranking for multilabel image annotation","author":"Gong Yunchao","year":"2014","unstructured":"Yunchao Gong, Yangqing Jia, Thomas K. Leung, Alexander Toshev, and Sergey Ioffe. 2014. Deep convolutional ranking for multilabel image annotation. 2nd International Conference on Learning Representations, ICLR 2014 - Conference Track Proceedings (2014).","journal-title":"2nd International Conference on Learning Representations, ICLR 2014 - Conference Track Proceedings"},{"key":"e_1_3_1_17_1","volume-title":"Deep Learning","author":"Goodfellow Ian","year":"2016","unstructured":"Ian Goodfellow, Yoshua Bengio, Aaron Courville, and Yoshua Bengio. 2016. Deep Learning. Vol. 1. MIT Press Cambridge."},{"key":"e_1_3_1_18_1","first-page":"2672","volume-title":"Advances in Neural Information Processing Systems","author":"Goodfellow Ian","year":"2014","unstructured":"Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative adversarial nets. In Advances in Neural Information Processing Systems. 2672\u20132680."},{"key":"e_1_3_1_19_1","volume-title":"26th Americas Conference on Information Systems, AMCIS 2020","author":"Gu Yang","year":"2020","unstructured":"Yang Gu and Gondy Leroy. 2020. Large-scale analysis of free-text data for mental health surveillance with topic modelling. In 26th Americas Conference on Information Systems, AMCIS 2020. Association for Information Systems."},{"key":"e_1_3_1_20_1","article-title":"A review on generative adversarial networks: Algorithms, theory, and applications","author":"Gui Jie","year":"2021","unstructured":"Jie Gui, Zhenan Sun, Yonggang Wen, Dacheng Tao, and Jieping Ye. 2021. A review on generative adversarial networks: Algorithms, theory, and applications. IEEE Transactions on Knowledge and Data Engineering (2021).","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"issue":"22","key":"e_1_3_1_21_1","doi-asserted-by":"crossref","first-page":"2402","DOI":"10.1001\/jama.2016.17216","article-title":"Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs","volume":"316","author":"Gulshan Varun","year":"2016","unstructured":"Varun Gulshan, Lily Peng, Marc Coram, Martin C. Stumpe, Derek Wu, Arunachalam Narayanaswamy, Subhashini Venugopalan, Kasumi Widner, Tom Madams, Jorge Cuadros, et\u00a0al. 2016. Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. JAMA 316, 22 (2016), 2402\u20132410.","journal-title":"JAMA"},{"issue":"9","key":"e_1_3_1_22_1","doi-asserted-by":"crossref","first-page":"987","DOI":"10.1001\/jamaophthalmol.2019.2004","article-title":"Performance of a deep-learning algorithm vs manual grading for detecting diabetic retinopathy in India","volume":"137","author":"Gulshan Varun","year":"2019","unstructured":"Varun Gulshan, Renu P. Rajan, Kasumi Widner, Derek Wu, Peter Wubbels, Tyler Rhodes, Kira Whitehouse, Marc Coram, Greg Corrado, Kim Ramasamy, et\u00a0al. 2019. Performance of a deep-learning algorithm vs manual grading for detecting diabetic retinopathy in India. JAMA Ophthalmology 137, 9 (2019), 987\u2013993.","journal-title":"JAMA Ophthalmology"},{"key":"e_1_3_1_23_1","first-page":"10327","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Guo Longteng","year":"2020","unstructured":"Longteng Guo, Jing Liu, Xinxin Zhu, Peng Yao, Shichen Lu, and Hanqing Lu. 2020. Normalized and geometry-aware self-attention network for image captioning. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 10327\u201310336."},{"issue":"7","key":"e_1_3_1_24_1","doi-asserted-by":"crossref","first-page":"581","DOI":"10.1016\/j.compmedimag.2013.09.005","article-title":"Automatic extraction of retinal features from colour retinal images for glaucoma diagnosis: A review","volume":"37","author":"Haleem Muhammad Salman","year":"2013","unstructured":"Muhammad Salman Haleem, Liangxiu Han, Jano van Hemert, and Baihua Li. 2013. Automatic extraction of retinal features from colour retinal images for glaucoma diagnosis: A review. Computerized Medical Imaging 37, 7\u20138 (2013), 581\u2013596.","journal-title":"Computerized Medical Imaging"},{"issue":"1","key":"e_1_3_1_25_1","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1023\/B:DAMI.0000005258.31418.83","article-title":"Mining frequent patterns without candidate generation: A frequent-pattern tree approach","volume":"8","author":"Han Jiawei","year":"2004","unstructured":"Jiawei Han, Jian Pei, Yiwen Yin, and Runying Mao. 2004. Mining frequent patterns without candidate generation: A frequent-pattern tree approach. Data Mining and Knowledge Discovery 8, 1 (2004), 53\u201387.","journal-title":"Data Mining and Knowledge Discovery"},{"key":"e_1_3_1_26_1","first-page":"4188","article-title":"Framing image description as a ranking task: Data, models and evaluation metrics","volume":"2015","author":"Hodosh Micah","year":"2015","unstructured":"Micah Hodosh, Peter Young, and Julia Hockenmaier. 2015. Framing image description as a ranking task: Data, models and evaluation metrics. IJCAI International Joint Conference on Artificial Intelligence 2015-, IJCAI (2015), 4188\u20134192.","journal-title":"IJCAI International Joint Conference on Artificial Intelligence"},{"key":"e_1_3_1_27_1","first-page":"293","volume-title":"International Conference on Medical Image Computing and Computer-Assisted Intervention","author":"Hou Benjamin","year":"2021","unstructured":"Benjamin Hou, Georgios Kaissis, Ronald M. Summers, and Bernhard Kainz. 2021a. RATCHET: Medical transformer for chest x-ray diagnosis and reporting. In International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer, 293\u2013303."},{"key":"e_1_3_1_28_1","doi-asserted-by":"crossref","first-page":"15720","DOI":"10.1109\/ACCESS.2021.3052794","article-title":"Multi-label learning with visual-semantic embedded knowledge graph for diagnosis of radiology imaging","volume":"9","author":"Hou Daibing","year":"2021","unstructured":"Daibing Hou, Zijian Zhao, and Sanyuan Hu. 2021b. Multi-label learning with visual-semantic embedded knowledge graph for diagnosis of radiology imaging. IEEE Access 9 (2021), 15720\u201315730.","journal-title":"IEEE Access"},{"key":"e_1_3_1_29_1","doi-asserted-by":"crossref","first-page":"3762","DOI":"10.1109\/ICIP42928.2021.9506803","volume-title":"2021 IEEE International Conference on Image Processing (ICIP\u201921)","author":"Huang Jia-Hong","year":"2021","unstructured":"Jia-Hong Huang, Ting-Wei Wu, Chao-Han Huck Yang, and Marcel Worring. 2021. Deep context-encoding network for retinal image captioning. In 2021 IEEE International Conference on Image Processing (ICIP\u201921). IEEE, 3762\u20133766."},{"key":"e_1_3_1_30_1","article-title":"Multi-label learning for crop leaf diseases recognition and severity estimation based on convolutional neural networks","author":"Ji Miaomiao","year":"2020","unstructured":"Miaomiao Ji, Keke Zhang, Qiufeng Wu, and Zhao Deng. 2020. Multi-label learning for crop leaf diseases recognition and severity estimation based on convolutional neural networks. Soft Computing (2020).","journal-title":"Soft Computing"},{"key":"e_1_3_1_31_1","doi-asserted-by":"crossref","first-page":"1450","DOI":"10.1109\/IJCNN.2008.4633988","volume-title":"2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence)","author":"Jiang Aiwen","year":"2008","unstructured":"Aiwen Jiang, Chunheng Wang, and Yuanping Zhu. 2008. Calibrated rank-SVM for multi-label image categorization. In 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence). IEEE, 1450\u20131455."},{"key":"e_1_3_1_32_1","doi-asserted-by":"crossref","DOI":"10.1002\/ima.22554","article-title":"Three-class classification of brain magnetic resonance images using average-pooling convolutional neural network","author":"Kakarla Jagadeesh","year":"2021","unstructured":"Jagadeesh Kakarla, Bala Venkateswarlu Isunuri, Krishna Sai Doppalapudi, and Karthik Satya Raghuram Bylapudi. 2021. Three-class classification of brain magnetic resonance images using average-pooling convolutional neural network. International Journal of Imaging Systems and Technology (2021).","journal-title":"International Journal of Imaging Systems and Technology"},{"issue":"3","key":"e_1_3_1_33_1","first-page":"271","article-title":"Covid-19 disease diagnosis using smart deep learning techniques","volume":"24","author":"Kavitha M.","year":"2021","unstructured":"M. Kavitha, T. Jayasankar, P. Maheswara Venkatesh, G. Mani, C. Bharatiraja, Bhekisipho Twala, et\u00a0al. 2021. Covid-19 disease diagnosis using smart deep learning techniques. Journal of Applied Science and Engineering 24, 3 (2021), 271\u2013277.","journal-title":"Journal of Applied Science and Engineering"},{"key":"e_1_3_1_34_1","doi-asserted-by":"crossref","first-page":"546","DOI":"10.1007\/978-3-319-93000-8_62","volume-title":"International Conference Image Analysis and Recognition","author":"Kumar Pulkit","year":"2018","unstructured":"Pulkit Kumar, Monika Grewal, and Muktabh Mayank Srivastava. 2018. Boosted cascaded convnets for multilabel classification of thoracic diseases in chest radiographs. In International Conference Image Analysis and Recognition. Springer, 546\u2013552."},{"issue":"7553","key":"e_1_3_1_35_1","doi-asserted-by":"crossref","first-page":"436","DOI":"10.1038\/nature14539","article-title":"Deep learning","volume":"521","author":"LeCun Yann","year":"2015","unstructured":"Yann LeCun, Yoshua Bengio, and Geoffrey Hinton. 2015. Deep learning. Nature 521, 7553 (2015), 436\u2013444.","journal-title":"Nature"},{"issue":"1","key":"e_1_3_1_36_1","doi-asserted-by":"crossref","first-page":"102146","DOI":"10.1016\/j.ipm.2019.102146","article-title":"Deep adversarial model for musculoskeletal quality evaluation","volume":"57","author":"Li Shenglong","year":"2020","unstructured":"Shenglong Li. 2020. Deep adversarial model for musculoskeletal quality evaluation. Information Processing & Management 57, 1 (2020), 102146.","journal-title":"Information Processing & Management"},{"issue":"3","key":"e_1_3_1_37_1","doi-asserted-by":"crossref","first-page":"811","DOI":"10.3390\/s20030811","article-title":"Attention based CNN-ConvLSTM for pedestrian attribute recognition","volume":"20","author":"Li Yang","year":"2020","unstructured":"Yang Li, Huahu Xu, Minjie Bian, and Junsheng Xiao. 2020. Attention based CNN-ConvLSTM for pedestrian attribute recognition. Sensors 20, 3 (2020), 811.","journal-title":"Sensors"},{"key":"e_1_3_1_38_1","article-title":"The emerging trends of multi-label learning","author":"Liu Weiwei","year":"2021","unstructured":"Weiwei Liu, Haobo Wang, Xiaobo Shen, and Ivor Tsang.2021. The emerging trends of multi-label learning. IEEE Transactions on Pattern Analysis and Machine Intelligence (2021).","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"e_1_3_1_39_1","doi-asserted-by":"publisher","DOI":"10.25300\/MISQ\/2020\/15107"},{"issue":"8","key":"e_1_3_1_40_1","first-page":"1517","article-title":"Generative adversarial active learning for unsupervised outlier detection","volume":"32","author":"Liu Yezheng","year":"2019","unstructured":"Yezheng Liu, Zhe Li, Chong Zhou, Yuanchun Jiang, Jianshan Sun, Meng Wang, and Xiangnan He. 2019. Generative adversarial active learning for unsupervised outlier detection. IEEE Transactions on Knowledge and Data Engineering 32, 8 (2019), 1517\u20131528.","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"key":"e_1_3_1_41_1","first-page":"2286","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","volume":"35","author":"Luo Yunpeng","year":"2021","unstructured":"Yunpeng Luo, Jiayi Ji, Xiaoshuai Sun, Liujuan Cao, Yongjian Wu, Feiyue Huang, Chia-Wen Lin, and Rongrong Ji. 2021. Dual-level collaborative transformer for image captioning. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 35. 2286\u20132293."},{"key":"e_1_3_1_42_1","doi-asserted-by":"crossref","first-page":"30","DOI":"10.1016\/j.compmedimag.2018.10.005","article-title":"Image super-resolution using progressive generative adversarial networks for medical image analysis","volume":"71","author":"Mahapatra Dwarikanath","year":"2019","unstructured":"Dwarikanath Mahapatra, Behzad Bozorgtabar, and Rahil Garnavi. 2019. Image super-resolution using progressive generative adversarial networks for medical image analysis. Computerized Medical Imaging and Graphics 71 (2019), 30\u201339.","journal-title":"Computerized Medical Imaging and Graphics"},{"key":"e_1_3_1_43_1","first-page":"1717","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Oquab Maxime","year":"2014","unstructured":"Maxime Oquab, Leon Bottou, Ivan Laptev, and Josef Sivic. 2014. Learning and transferring mid-level image representations using convolutional neural networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1717\u20131724."},{"key":"e_1_3_1_44_1","first-page":"139","article-title":"Automatic Myanmar image captioning using CNN and LSTM-based language model","author":"Pa San","year":"2020","unstructured":"San Pa, Pa Aung, Win Pa Pa, and Tin Lay Nwe. 2020. Automatic Myanmar image captioning using CNN and LSTM-based language model. Proceedings of the 1st Joint SLTU and CCURL WorkshopMay (2020), 139\u2013143.","journal-title":"Proceedings of the 1st Joint SLTU and CCURL Workshop"},{"key":"e_1_3_1_45_1","doi-asserted-by":"crossref","first-page":"12","DOI":"10.1016\/j.dss.2017.05.012","article-title":"A data analytics approach to building a clinical decision support system for diabetic retinopathy: Developing and deploying a model ensemble","volume":"101","author":"Piri Saeed","year":"2017","unstructured":"Saeed Piri, Dursun Delen, Tieming Liu, and Hamed M. Zolbanin. 2017. A data analytics approach to building a clinical decision support system for diabetic retinopathy: Developing and deploying a model ensemble. Decision Support Systems 101 (2017), 12\u201327.","journal-title":"Decision Support Systems"},{"key":"e_1_3_1_46_1","first-page":"1","volume-title":"2018 International Joint Conference on Neural Networks (IJCNN\u201918)","author":"Prellberg Jonas","year":"2018","unstructured":"Jonas Prellberg and Oliver Kramer. 2018. Multi-label classification of surgical tools with convolutional neural networks. In 2018 International Joint Conference on Neural Networks (IJCNN\u201918). IEEE, 1\u20138."},{"key":"e_1_3_1_47_1","doi-asserted-by":"crossref","first-page":"165","DOI":"10.1016\/j.media.2019.05.001","article-title":"Denoising of 3D magnetic resonance images using a residual encoder\u2013decoder Wasserstein generative adversarial network","volume":"55","author":"Ran Maosong","year":"2019","unstructured":"Maosong Ran, Jinrong Hu, Yang Chen, Hu Chen, Huaiqiang Sun, Jiliu Zhou, and Yi Zhang. 2019. Denoising of 3D magnetic resonance images using a residual encoder\u2013decoder Wasserstein generative adversarial network. Medical Image Analysis 55 (2019), 165\u2013180.","journal-title":"Medical Image Analysis"},{"key":"e_1_3_1_48_1","doi-asserted-by":"crossref","first-page":"995","DOI":"10.1109\/ICDM.2008.74","volume-title":"2008 Eighth IEEE International Conference on Data Mining","author":"Read Jesse","year":"2008","unstructured":"Jesse Read, Bernhard Pfahringer, and Geoff Holmes. 2008. Multi-label classification using ensembles of pruned sets. In 2008 Eighth IEEE International Conference on Data Mining. IEEE, 995\u20131000."},{"key":"e_1_3_1_49_1","doi-asserted-by":"crossref","first-page":"254","DOI":"10.1007\/978-3-642-04174-7_17","volume-title":"Joint European Conference on Machine Learning and Knowledge Discovery in Databases","author":"Read Jesse","year":"2009","unstructured":"Jesse Read, Bernhard Pfahringer, Geoff Holmes, and Eibe Frank. 2009. Classifier chains for multi-label classification. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases. Springer, 254\u2013269."},{"key":"e_1_3_1_50_1","doi-asserted-by":"crossref","DOI":"10.25300\/MISQ\/2022\/15392","article-title":"Linking exploits from the dark web to known vulnerabilities for proactive cyber threat intelligence: An attention-based deep structured semantic model","author":"Samtani Sagar","year":"2022","unstructured":"Sagar Samtani, Yidong Chai, and Hsinchun Chen. 2022. Linking exploits from the dark web to known vulnerabilities for proactive cyber threat intelligence: An attention-based deep structured semantic model. MIS Quarterly (2022).","journal-title":"MIS Quarterly"},{"key":"e_1_3_1_51_1","doi-asserted-by":"crossref","DOI":"10.1109\/CVPR.2016.274","article-title":"Learning to read chest X-rays: Recurrent neural cascade model for automated image annotation","author":"Shin Hoo Chang","year":"2016","unstructured":"Hoo Chang Shin, Kirk Roberts, Le Lu, Dina Demner-Fushman, Jianhua Yao, and Ronald M. Summers. 2016. Learning to read chest X-rays: Recurrent neural cascade model for automated image annotation. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR\u201916).","journal-title":"2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR\u201916)"},{"key":"e_1_3_1_52_1","doi-asserted-by":"crossref","first-page":"227","DOI":"10.1109\/ICBK50248.2020.00041","volume-title":"2020 IEEE International Conference on Knowledge Graph (ICKG\u201920)","author":"Sun Yi","year":"2020","unstructured":"Yi Sun, Peisen Yuan, and Yuming Sun. 2020. MM-GAN: 3D MRI data augmentation for medical image segmentation via generative adversarial networks. In 2020 IEEE International Conference on Knowledge Graph (ICKG\u201920). IEEE, 227\u2013234."},{"key":"e_1_3_1_53_1","article-title":"Image caption generator","author":"Ann Liya","year":"2021","unstructured":"Liya Ann Sunny, Sara Susan Joseph, Sonu Sara Geogy, K. S. Sreelakshmi, and Abin T. Abraham.2021. Image caption generator. International Journal of Recent Advances in Multidisciplinary Topics 2, no. 4 (2021): 42-44 (2021).","journal-title":"International Journal of Recent Advances in Multidisciplinary Topics 2, no. 4 (2021): 42-44"},{"issue":"6","key":"e_1_3_1_54_1","doi-asserted-by":"crossref","first-page":"e0179790","DOI":"10.1371\/journal.pone.0179790","article-title":"Applying artificial intelligence to disease staging: Deep learning for improved staging of diabetic retinopathy","volume":"12","author":"Takahashi Hidenori","year":"2017","unstructured":"Hidenori Takahashi, Hironobu Tampo, Yusuke Arai, Yuji Inoue, and Hidetoshi Kawashima. 2017. Applying artificial intelligence to disease staging: Deep learning for improved staging of diabetic retinopathy. PloS One 12, 6 (2017), e0179790.","journal-title":"PloS One"},{"key":"e_1_3_1_55_1","doi-asserted-by":"crossref","first-page":"107965","DOI":"10.1016\/j.patcog.2021.107965","article-title":"A review of methods for imbalanced multi-label classification","author":"Tarekegn Adane","year":"2021","unstructured":"Adane Tarekegn, Mario Giacobini, and Krzysztof Michalak. 2021. A review of methods for imbalanced multi-label classification. Pattern Recognition (2021), 107965.","journal-title":"Pattern Recognition"},{"issue":"22","key":"e_1_3_1_56_1","doi-asserted-by":"crossref","first-page":"2211","DOI":"10.1001\/jama.2017.18152","article-title":"Development and validation of a deep learning system for diabetic retinopathy and related eye diseases using retinal images from multiethnic populations with diabetes","volume":"318","author":"Ting Daniel Shu Wei","year":"2017","unstructured":"Daniel Shu Wei Ting, Carol Yim-Lui Cheung, Gilbert Lim, Gavin Siew Wei Tan, Nguyen D. Quang, Alfred Gan, Haslina Hamzah, Renata Garcia-Franco, Ian Yew San Yeo, Shu Yen Lee, et\u00a0al. 2017. Development and validation of a deep learning system for diabetic retinopathy and related eye diseases using retinal images from multiethnic populations with diabetes. JAMA 318, 22 (2017), 2211\u20132223.","journal-title":"JAMA"},{"key":"e_1_3_1_57_1","doi-asserted-by":"crossref","first-page":"3847","DOI":"10.1109\/ICASSP.2019.8682549","volume-title":"ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP\u201919)","author":"Tsai Che-Ping","year":"2019","unstructured":"Che-Ping Tsai and Hung-Yi Lee. 2019. Adversarial learning of label dependency: A novel framework for multi-class classification. In ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP\u201919). IEEE, 3847\u20133851."},{"issue":"7","key":"e_1_3_1_58_1","doi-asserted-by":"crossref","first-page":"1079","DOI":"10.1109\/TKDE.2010.164","article-title":"Random k-labelsets for multilabel classification","volume":"23","author":"Tsoumakas Grigorios","year":"2010","unstructured":"Grigorios Tsoumakas, Ioannis Katakis, and Ioannis Vlahavas. 2010. Random k-labelsets for multilabel classification. IEEE Transactions on Knowledge and Data Engineering 23, 7 (2010), 1079\u20131089.","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"key":"e_1_3_1_59_1","first-page":"2285","article-title":"CNN-RNN: A unified framework for multi-label image classification","volume":"2016","author":"Wang Jiang","year":"2016","unstructured":"Jiang Wang, Yi Yang, Junhua Mao, Zhiheng Huang, Chang Huang, and Wei Xu. 2016. CNN-RNN: A unified framework for multi-label image classification. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition 2016- (2016), 2285\u20132294.","journal-title":"Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition"},{"key":"e_1_3_1_60_1","article-title":"CNN + CNN: Convolutional decoders for image captioning","author":"Wang Qingzhong","year":"2018","unstructured":"Qingzhong Wang and Antoni B. Chan. 2018. CNN + CNN: Convolutional decoders for image captioning. arXiv preprint arXiv:1805.09019 (2018).","journal-title":"arXiv preprint arXiv:1805.09019"},{"issue":"1","key":"e_1_3_1_61_1","first-page":"1","article-title":"DSSAE: Deep stacked sparse autoencoder analytical model for COVID-19 diagnosis by fractional Fourier entropy","volume":"13","author":"Wang Shui-Hua","year":"2021","unstructured":"Shui-Hua Wang, Xin Zhang, and Yu-Dong Zhang. 2021. DSSAE: Deep stacked sparse autoencoder analytical model for COVID-19 diagnosis by fractional Fourier entropy. ACM Transactions on Management Information System (TMIS) 13, 1 (2021), 1\u201320.","journal-title":"ACM Transactions on Management Information System (TMIS)"},{"key":"e_1_3_1_62_1","first-page":"98","article-title":"Conditional infilling GANs for data augmentation in mammogram classification","volume":"11040","author":"Wu Eric","year":"2018","unstructured":"Eric Wu, Kevin Wu, David Cox, and William Lotter. 2018. Conditional infilling GANs for data augmentation in mammogram classification. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 11040 LNCS (2018), 98\u2013106.","journal-title":"Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)"},{"key":"e_1_3_1_63_1","article-title":"Understanding medication nonadherence from social media: A sentiment-enriched deep learning approach","author":"Xie Jiaheng","year":"2022","unstructured":"Jiaheng Xie, Xiao Liu, Daniel Dajun Zeng, and Xiao Fang. 2022. Understanding medication nonadherence from social media: A sentiment-enriched deep learning approach. MISQ Quarterly (2022).","journal-title":"MISQ Quarterly"},{"key":"e_1_3_1_64_1","first-page":"73","volume-title":"CLEF2019 Working Notes. CEUR Workshop Proceedings, (CEUR-WS. org), ISSN","author":"Xu Jing","year":"2019","unstructured":"Jing Xu, Wei Liu, Chao Liu, Yu Wang, Ying Chi, Xuansong Xie, and Xiansheng Hua. 2019. Concept detection based on multi-label classification and image captioning approach-DAMO at ImageCLEF 2019. In CLEF2019 Working Notes. CEUR Workshop Proceedings, (CEUR-WS. org), ISSN. 73\u20131613."},{"issue":"3","key":"e_1_3_1_65_1","first-page":"383","article-title":"SeGAN: Adversarial network with multi-scale l1 loss for medical image segmentation","volume":"16","author":"Xue Yuan","year":"2018","unstructured":"Yuan Xue, Tao Xu, Han Zhang, L. Rodney Long, and Xiaolei Huang. 2018. SeGAN: Adversarial network with multi-scale l1 loss for medical image segmentation. Neuroinformatics 16, 3\u20134 (2018), 383\u2013392.","journal-title":"Neuroinformatics"},{"key":"e_1_3_1_66_1","article-title":"Deconfounded image captioning: A causal retrospect","author":"Yang Xu","year":"2021","unstructured":"Xu Yang, Hanwang Zhang, and Jianfei Cai. 2021. Deconfounded image captioning: A causal retrospect. IEEE Transactions on Pattern Analysis and Machine Intelligence (2021).","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"e_1_3_1_67_1","volume-title":"2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR\u201916)","author":"You Quanzeng","year":"2016","unstructured":"Quanzeng You, Hailin Jin, Zhaowen Wang, Fang Chen, and Jiebo Luo. 2016. Image captioning with semantic attention. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR\u201916)."},{"key":"e_1_3_1_68_1","article-title":"Wearable sensor-based chronic condition severity assessment: An adversarial attention-based deep multisource multitask learning approach","author":"Yu Shuo","year":"2022","unstructured":"Shuo Yu, Yidong Chai, Hsinchun Chen, Scott J. Sherman, and Randall A. Brown. 2022. Wearable sensor-based chronic condition severity assessment: An adversarial attention-based deep multisource multitask learning approach. MISQ Quarterly (2022).","journal-title":"MISQ Quarterly"},{"key":"e_1_3_1_69_1","doi-asserted-by":"crossref","first-page":"132","DOI":"10.1016\/j.neucom.2018.11.114","article-title":"Deep learning for ultrasound image caption generation based on object detection","volume":"392","author":"Zeng Xianhua","year":"2020","unstructured":"Xianhua Zeng, Li Wen, Banggui Liu, and Xiaojun Qi. 2020. Deep learning for ultrasound image caption generation based on object detection. Neurocomputing 392 (2020), 132\u2013141.","journal-title":"Neurocomputing"},{"key":"e_1_3_1_70_1","first-page":"3549","article-title":"MDNet: A semantically and visually interpretable medical image diagnosis network","author":"Zhang Zizhao","year":"2017","unstructured":"Zizhao Zhang, Yuanpu Xie, Fuyong Xing, Mason McGough, and Lin Yang. 2017. MDNet: A semantically and visually interpretable medical image diagnosis network. Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017 (2017), 3549\u20133557.","journal-title":"Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017"},{"key":"e_1_3_1_71_1","doi-asserted-by":"crossref","first-page":"224","DOI":"10.1016\/j.imavis.2016.07.004","article-title":"Multi-label convolutional neural network based pedestrian attribute classification","volume":"58","author":"Zhu Jianqing","year":"2017","unstructured":"Jianqing Zhu, Shengcai Liao, Zhen Lei, and Stan Z. Li. 2017. Multi-label convolutional neural network based pedestrian attribute classification. Image and Vision Computing 58 (2017), 224\u2013229.","journal-title":"Image and Vision Computing"},{"issue":"4","key":"e_1_3_1_72_1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3466690","article-title":"AIDCOV: An interpretable artificial intelligence model for detection of COVID-19 from chest radiography images","volume":"12","author":"Zokaeinikoo Maryam","year":"2021","unstructured":"Maryam Zokaeinikoo, Pooyan Kazemian, Prasenjit Mitra, and Soundar Kumara. 2021. AIDCOV: An interpretable artificial intelligence model for detection of COVID-19 from chest radiography images. ACM Transactions on Management Information System (TMIS) 12, 4 (2021), 1\u201320.","journal-title":"ACM Transactions on Management Information System (TMIS)"}],"container-title":["ACM Transactions on Management Information Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3561653","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3561653","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:02:24Z","timestamp":1750186944000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3561653"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,1,25]]},"references-count":71,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2023,6,30]]}},"alternative-id":["10.1145\/3561653"],"URL":"https:\/\/doi.org\/10.1145\/3561653","relation":{},"ISSN":["2158-656X","2158-6578"],"issn-type":[{"value":"2158-656X","type":"print"},{"value":"2158-6578","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,1,25]]},"assertion":[{"value":"2022-01-10","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-09-03","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-01-25","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}