{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,30]],"date-time":"2026-01-30T05:40:34Z","timestamp":1769751634506,"version":"3.49.0"},"reference-count":45,"publisher":"Springer Science and Business Media LLC","issue":"8","license":[{"start":{"date-parts":[[2024,4,25]],"date-time":"2024-04-25T00:00:00Z","timestamp":1714003200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,4,25]],"date-time":"2024-04-25T00:00:00Z","timestamp":1714003200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100007689","name":"Universidade de Aveiro","doi-asserted-by":"crossref","id":[{"id":"10.13039\/100007689","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Knowl Inf Syst"],"published-print":{"date-parts":[[2024,8]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Lexical answer type prediction is integral to biomedical question\u2013answering systems. LAT prediction aims to predict the expected answer\u2019s semantic type of a factoid or list-type biomedical question. It also aids in the answer processing stage of a QA system to assign a high score to the most relevant answers. Although considerable research efforts exist for LAT prediction in diverse domains, it remains a challenging biomedical problem. LAT prediction for the biomedical field is a multi-label classification problem, as one biomedical question might have more than one expected answer type. Achieving high performance on this task is challenging as biomedical questions have limited lexical features. One biomedical question must be assigned multiple labels given these limited lexical features. In this paper, we develop a novel feature set (lexical, noun concepts, verb concepts, protein\u2013protein interactions, and biomedical entities) from these lexical features. Using ensemble learning with bagging, we use the label power set transformation technique to classify multi-label. We evaluate the integrity of our proposed methodology on the publicly available multi-label biomedical questions dataset (MLBioMedLAT) and compare it with twelve state-of-the-art multi-label classification algorithms. Our proposed method attains a micro-F1 score of 77%, outperforming the baseline model by 25.5%.<\/jats:p>","DOI":"10.1007\/s10115-024-02113-7","type":"journal-article","created":{"date-parts":[[2024,4,25]],"date-time":"2024-04-25T15:04:36Z","timestamp":1714057476000},"page":"5003-5019","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["Semantic features analysis for biomedical lexical answer type prediction using ensemble learning approach"],"prefix":"10.1007","volume":"66","author":[{"given":"Fiza Gulzar","family":"Hussain","sequence":"first","affiliation":[]},{"given":"Muhammad","family":"Wasim","sequence":"additional","affiliation":[]},{"given":"Sehrish Munawar","family":"Cheema","sequence":"additional","affiliation":[]},{"given":"Ivan Miguel","family":"Pires","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2024,4,25]]},"reference":[{"key":"2113_CR1","doi-asserted-by":"publisher","first-page":"45","DOI":"10.1007\/978-3-030-58721-5_2","volume-title":"Biomedical informatics: computer applications in health care and biomedicine","author":"EH Shortliffe","year":"2021","unstructured":"Shortliffe EH, Chiang MF (2021) Biomedical data: their acquisition, storage, and use. Biomedical informatics: computer applications in health care and biomedicine. Springer, Cham, pp 45\u201375"},{"issue":"2","key":"2113_CR2","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3490238","volume":"55","author":"Q Jin","year":"2022","unstructured":"Jin Q, Yuan Z, Xiong G, Yu Q, Ying H, Tan C, Chen M, Huang S, Liu X, Yu S (2022) Biomedical question answering: a survey of approaches and challenges. ACM Comput Surv (CSUR) 55(2):1\u201336","journal-title":"ACM Comput Surv (CSUR)"},{"key":"2113_CR3","doi-asserted-by":"publisher","first-page":"2","DOI":"10.1017\/S0269888921000138","volume":"37","author":"C Antoniou","year":"2022","unstructured":"Antoniou C, Bassiliades N (2022) A survey on semantic question answering systems. Knowl Eng Rev 37:2","journal-title":"Knowl Eng Rev"},{"key":"2113_CR4","doi-asserted-by":"crossref","unstructured":"Li X, Roth D (2002) Learning question classifiers. In: COLING 2002: the 19th international conference on computational Linguistics","DOI":"10.3115\/1072228.1072378"},{"key":"2113_CR5","unstructured":"Neves M, Kraus M (2016) Biomedlat corpus: annotation of the lexical answer type for biomedical questions. In: Proceedings of the open knowledge base and question answering workshop (OKBQA 2016), pp 49\u201358"},{"key":"2113_CR6","doi-asserted-by":"publisher","DOI":"10.1016\/j.jbi.2019.103143","volume":"93","author":"M Wasim","year":"2019","unstructured":"Wasim M, Asim MN, Khan MUG, Mahmood W (2019) Multi-label biomedical question classification for lexical answer type prediction. J Biomed Inform 93:103143","journal-title":"J Biomed Inform"},{"issue":"5","key":"2113_CR7","doi-asserted-by":"publisher","first-page":"93","DOI":"10.1007\/s10664-021-09976-2","volume":"26","author":"M Izadi","year":"2021","unstructured":"Izadi M, Heydarnoori A, Gousios G (2021) Topic recommendation for software repositories using multi-label classification algorithms. Empir Softw Eng 26(5):93","journal-title":"Empir Softw Eng"},{"issue":"8","key":"2113_CR8","first-page":"6354","volume":"34","author":"P Prajapati","year":"2022","unstructured":"Prajapati P, Thakkar A (2022) Performance improvement of extreme multi-label classification using k-way tree construction with parallel clustering algorithm. J King Saud Univ Comput Inf Sci 34(8):6354\u20136364","journal-title":"J King Saud Univ Comput Inf Sci"},{"key":"2113_CR9","doi-asserted-by":"publisher","first-page":"453","DOI":"10.1016\/j.ins.2022.05.057","volume":"606","author":"JA Kumar","year":"2022","unstructured":"Kumar JA, Trueman TE, Cambria E (2022) Gender-based multi-aspect sentiment detection using multilabel learning. Inf Sci 606:453\u2013468","journal-title":"Inf Sci"},{"key":"2113_CR10","doi-asserted-by":"crossref","unstructured":"Shi W, Li F, Li J, Fei H, Ji D (2022) Effective token graph modeling using a novel labeling strategy for structured sentiment analysis. In: Proceedings of the 60th annual meeting of the association for computational Linguistics, Vol. 1. Long Papers, pp 4232\u20134241","DOI":"10.18653\/v1\/2022.acl-long.291"},{"issue":"4","key":"2113_CR11","doi-asserted-by":"publisher","first-page":"5203","DOI":"10.1007\/s11227-021-04087-7","volume":"78","author":"PK Jain","year":"2022","unstructured":"Jain PK, Pamula R, Yekun EA (2022) A multi-label ensemble predicting model to service recommendation from social media contents. J Supercomput 78(4):5203\u20135220","journal-title":"J Supercomput"},{"issue":"9","key":"2113_CR12","doi-asserted-by":"publisher","first-page":"436","DOI":"10.3390\/axioms11090436","volume":"11","author":"E Deniz","year":"2022","unstructured":"Deniz E, Erbay H, Co\u015far M (2022) Multi-label classification of e-commerce customer reviews via machine learning. Axioms 11(9):436","journal-title":"Axioms"},{"issue":"2","key":"2113_CR13","doi-asserted-by":"publisher","first-page":"966","DOI":"10.1007\/s10489-020-01838-6","volume":"51","author":"Z Chen","year":"2021","unstructured":"Chen Z, Ren J (2021) Multi-label text classification with latent word-wise label information. Appl Intell 51(2):966\u2013979","journal-title":"Appl Intell"},{"key":"2113_CR14","doi-asserted-by":"crossref","unstructured":"Javeed A (2023) Hawk: an industrial-strength multi-label document classifier. arXiv preprint arXiv:2301.06057","DOI":"10.21203\/rs.3.rs-3235545\/v1"},{"issue":"8","key":"2113_CR15","doi-asserted-by":"publisher","first-page":"1516","DOI":"10.5829\/IJE.2022.35.08B.08","volume":"35","author":"V Balamurugan","year":"2022","unstructured":"Balamurugan V, Vedanarayanan V, Sahaya Anselin Nisha A, Narmadha R, Amirthalakshmi T (2022) Multi-label text categorization using error-correcting output coding with weighted probability. Int J Eng 35(8):1516\u20131523","journal-title":"Int J Eng"},{"key":"2113_CR16","doi-asserted-by":"publisher","first-page":"263","DOI":"10.1016\/j.ins.2019.02.021","volume":"485","author":"J Lee","year":"2019","unstructured":"Lee J, Yu I, Park J, Kim D-W (2019) Memetic feature selection for multilabel text categorization using label frequency difference. Inf Sci 485:263\u2013280","journal-title":"Inf Sci"},{"key":"2113_CR17","doi-asserted-by":"publisher","first-page":"213","DOI":"10.1007\/978-981-16-1740-9_19","volume-title":"Soft computing theories and applications proceedings of SoCTA","author":"V Vaissnave","year":"2022","unstructured":"Vaissnave V, Deepalakshmi P (2022) A keyword-based multi-label text categorization in the Indian legal domain using bi-lstm. Soft computing theories and applications proceedings of SoCTA. Springer, Cham, pp 213\u2013227"},{"key":"2113_CR18","doi-asserted-by":"publisher","unstructured":"Ma Q, Yuan C, Zhou W, Hu S (2021) Label-specific dual graph neural network for multi-label text classification. In: Zong C, Xia F, Li W, Navigli R. (eds.) Proceedings of the 59th annual meeting of the association for computational Linguistics and the 11th international joint conference on natural language processing, Vol. 1. Long Papers, pp 3855\u20133864. Association for Computational Linguistics, Onlinehttps:\/\/doi.org\/10.18653\/v1\/2021.acl-long.298.https:\/\/aclanthology.org\/2021.acl-long.298","DOI":"10.18653\/v1\/2021.acl-long.298."},{"key":"2113_CR19","doi-asserted-by":"publisher","first-page":"121","DOI":"10.1016\/j.neucom.2023.01.018","volume":"526","author":"T Pu","year":"2023","unstructured":"Pu T, Sun M, Wu H, Chen T, Tian L, Lin L (2023) Semantic representation and dependency learning for multi-label image recognition. Neurocomputing 526:121\u2013130","journal-title":"Neurocomputing"},{"key":"2113_CR20","doi-asserted-by":"crossref","unstructured":"Abdel-Khalek S, Algarni M, Mansour RF, Gupta D, Ilayaraja M (2021) Quantum neural network-based multilabel image classification in high-resolution unmanned aerial vehicle imagery. Soft Comput 1\u201312","DOI":"10.1007\/s00500-021-06460-3"},{"key":"2113_CR21","doi-asserted-by":"publisher","first-page":"1696","DOI":"10.1109\/TMM.2020.3002185","volume":"23","author":"J Xu","year":"2020","unstructured":"Xu J, Tian H, Wang Z, Wang Y, Kang W, Chen F (2020) Joint input and output space learning for multi-label image classification. IEEE Trans Multimedia 23:1696\u20131707","journal-title":"IEEE Trans Multimedia"},{"key":"2113_CR22","volume":"10","author":"S Coulibaly","year":"2022","unstructured":"Coulibaly S, Kamsu-Foguem B, Kamissoko D, Traore D (2022) Deep convolution neural network sharing for the multi-label images classification. Mach Learn Appl 10:100422","journal-title":"Mach Learn Appl"},{"key":"2113_CR23","doi-asserted-by":"publisher","first-page":"14","DOI":"10.1016\/j.neucom.2022.03.057","volume":"491","author":"J Liang","year":"2022","unstructured":"Liang J, Xu F, Yu S (2022) A multi-scale semantic attention representation for multi-label image recognition with graph networks. Neurocomputing 491:14\u201323","journal-title":"Neurocomputing"},{"key":"2113_CR24","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2022.117215","volume":"203","author":"J Bogatinovski","year":"2022","unstructured":"Bogatinovski J, Todorovski L, D\u017eeroski S, Kocev D (2022) Comprehensive comparative study of multi-label classification methods. Expert Syst Appl 203:117215","journal-title":"Expert Syst Appl"},{"issue":"4","key":"2113_CR25","doi-asserted-by":"publisher","first-page":"463","DOI":"10.1017\/pan.2021.15","volume":"30","author":"A Erlich","year":"2022","unstructured":"Erlich A, Dantas SG, Bagozzi BE, Berliner D, Palmer-Rubin B (2022) Multi-label prediction for political text-as-data. Polit Anal 30(4):463\u2013480","journal-title":"Polit Anal"},{"key":"2113_CR26","doi-asserted-by":"crossref","unstructured":"Peng K, Rong W, Li C, Hu J, Xiong Z (2020) Weight aware feature enriched biomedical lexical answer type prediction. In: Neural information processing: 27th international conference, ICONIP 2020, Bangkok, Thailand, 23\u201327 Nov 2020, Proceedings, Part III 27. Springer, pp 63\u201375","DOI":"10.1007\/978-3-030-63836-8_6"},{"key":"2113_CR27","doi-asserted-by":"publisher","DOI":"10.1155\/2015\/910423","volume":"2015","author":"AW Muzaffar","year":"2015","unstructured":"Muzaffar AW, Azam F, Qamar U (2015) A relation extraction framework for biomedical text using hybrid feature set. Comput Math Methods Med 2015:910423","journal-title":"Comput Math Methods Med"},{"key":"2113_CR28","doi-asserted-by":"crossref","unstructured":"Ahmed M, Islam J, Samee MR, Mercer RE (2019) Identifying protein-protein interaction using tree lstm and structured attention. In: 2019 IEEE 13th international conference on semantic computing (ICSC). IEEE, pp 224\u2013231","DOI":"10.1109\/ICOSC.2019.8665584"},{"issue":"1","key":"2113_CR29","doi-asserted-by":"publisher","first-page":"945","DOI":"10.1007\/s11042-022-13211-5","volume":"82","author":"S Kumar","year":"2023","unstructured":"Kumar S, Kumar N, Dev A, Naorem S (2023) Movie genre classification using binary relevance, label powerset, and machine learning classifiers. Multimedia Tools Appl 82(1):945\u2013968","journal-title":"Multimedia Tools Appl"},{"key":"2113_CR30","doi-asserted-by":"publisher","DOI":"10.1016\/j.techfore.2022.122271","volume":"188","author":"A Huang","year":"2023","unstructured":"Huang A, Xu R, Chen Y, Guo M (2023) Research on multi-label user classification of social media based on ml-knn algorithm. Technol Forecasting Soc Change 188:122271","journal-title":"Technol Forecasting Soc Change"},{"key":"2113_CR31","doi-asserted-by":"publisher","first-page":"056","DOI":"10.1093\/database\/baac056","volume":"2022","author":"S-J Lin","year":"2022","unstructured":"Lin S-J, Yeh W-C, Chiu Y-W, Chang Y-C, Hsu M-H, Chen Y-S, Hsu W-L (2022) A bert-based ensemble learning approach for the biocreative vii challenges: full-text chemical identification and multi-label classification in pubmed articles. Database 2022:056","journal-title":"Database"},{"key":"2113_CR32","doi-asserted-by":"crossref","unstructured":"Yang Z, Wang S, Rawat BPS, Mitra A, Yu H (2022) Knowledge injected prompt based fine-tuning for multi-label few-shot icd coding. In: Proceedings of the conference on empirical methods in natural language processing. Conference on empirical methods in natural language processing, vol. 2022. NIH Public Access, p 1767","DOI":"10.18653\/v1\/2022.findings-emnlp.127"},{"issue":"5","key":"2113_CR33","doi-asserted-by":"publisher","first-page":"2584","DOI":"10.1109\/TCBB.2022.3173562","volume":"19","author":"Q Chen","year":"2022","unstructured":"Chen Q, Du J, Allot A, Lu Z (2022) Litmc-bert: transformer-based multi-label classification of biomedical literature with an application on covid-19 literature curation. IEEE\/ACM Trans Comput Biol Bioinform 19(5):2584\u20132595","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"key":"2113_CR34","doi-asserted-by":"crossref","unstructured":"Ozmen M, Zhang H, Wang P, Coates M (2022) Multi-relation message passing for multi-label text classification. In: ICASSP 2022-2022 IEEE international conference on acoustics, speech and signal processing (ICASSP). IEEE, pp 3583\u20133587","DOI":"10.1109\/ICASSP43922.2022.9747225"},{"key":"2113_CR35","doi-asserted-by":"crossref","unstructured":"Roy S, Chakraborty S, Mandal A, Balde G, Sharma P, Natarajan A, Khosla M, Sural S, Ganguly N(2021) Knowledge-aware neural networks for medical forum question classification. In: Proceedings of the 30th acm international conference on information & knowledge management, pp 3398\u20133402","DOI":"10.1145\/3459637.3482128"},{"issue":"3","key":"2113_CR36","doi-asserted-by":"publisher","first-page":"069","DOI":"10.1093\/jamiaopen\/ooaa069","volume":"4","author":"R Stemerman","year":"2021","unstructured":"Stemerman R, Arguello J, Brice J, Krishnamurthy A, Houston M, Kitzmiller R (2021) Identification of social determinants of health using multi-label classification of electronic health record clinical notes. JAMIA Open 4(3):069","journal-title":"JAMIA Open"},{"key":"2113_CR37","doi-asserted-by":"crossref","unstructured":"Yang W, Li J, Fukumoto F, Ye Y (2020) Hscnn: a hybrid-siamese convolutional neural network for extremely imbalanced multi-label text classification. In: Proceedings of the 2020 conference on empirical methods in natural language processing (EMNLP), pp 6716\u20136722","DOI":"10.18653\/v1\/2020.emnlp-main.545"},{"key":"2113_CR38","doi-asserted-by":"crossref","unstructured":"Chalkidis I, Fergadiotis E, Malakasiotis P, Androutsopoulos I (2019) Large-scale multi-label text classification on eu legislation. In: Proceedings of the 57th annual meeting of the association for computational Linguistics, pp 6314\u20136322","DOI":"10.18653\/v1\/P19-1636"},{"key":"2113_CR39","doi-asserted-by":"crossref","unstructured":"Aly R, Remus S, Biemann C (2019) Hierarchical multi-label classification of text with capsule networks. In: Proceedings of the 57th annual meeting of the association for computational Linguistics: student research workshop, pp 323\u2013330","DOI":"10.18653\/v1\/P19-2045"},{"key":"2113_CR40","doi-asserted-by":"crossref","unstructured":"Pal A, Selvakumar M, Sankarasubbu M (2020) Multi-label text classification using attention-based graph neural network. arXiv preprint arXiv:2003.11644","DOI":"10.5220\/0008940304940505"},{"issue":"11","key":"2113_CR41","doi-asserted-by":"publisher","first-page":"1279","DOI":"10.1093\/jamia\/ocz085","volume":"26","author":"J Du","year":"2019","unstructured":"Du J, Chen Q, Peng Y, Xiang Y, Tao C, Lu Z (2019) Ml-net: multi-label classification of biomedical texts with deep neural networks. J Am Med Inform Assoc 26(11):1279\u20131285","journal-title":"J Am Med Inform Assoc"},{"issue":"1","key":"2113_CR42","doi-asserted-by":"publisher","first-page":"44892","DOI":"10.2196\/44892","volume":"11","author":"Y Zhang","year":"2023","unstructured":"Zhang Y, Li X, Liu Y, Li A, Yang X, Tang X (2023) A multilabel text classifier of cancer literature at the publication level: methods study of medical text classification. JMIR Med Inform 11(1):44892","journal-title":"JMIR Med Inform"},{"key":"2113_CR43","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2021.115905","volume":"187","author":"Y Ma","year":"2022","unstructured":"Ma Y, Liu X, Zhao L, Liang Y, Zhang P, Jin B (2022) Hybrid embedding-based text representation for hierarchical multi-label text classification. Expert Syst Appl 187:115905","journal-title":"Expert Syst Appl"},{"issue":"2","key":"2113_CR44","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2020.102441","volume":"58","author":"R Wang","year":"2021","unstructured":"Wang R, Ridley R, Qu W, Dai X (2021) A novel reasoning mechanism for multi-label text classification. Inf Process Manag 58(2):102441","journal-title":"Inf Process Manag"},{"key":"2113_CR45","doi-asserted-by":"crossref","unstructured":"Nentidis A, Bougiatiotis K, Krithara A, Paliouras G, Kakadiaris I (2017) Results of the fifth edition of the bioasq challenge. In: BioNLP 2017, pp 48\u201357","DOI":"10.18653\/v1\/W17-2306"}],"container-title":["Knowledge and Information Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10115-024-02113-7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10115-024-02113-7\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10115-024-02113-7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,7,18]],"date-time":"2024-07-18T11:08:15Z","timestamp":1721300895000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10115-024-02113-7"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,4,25]]},"references-count":45,"journal-issue":{"issue":"8","published-print":{"date-parts":[[2024,8]]}},"alternative-id":["2113"],"URL":"https:\/\/doi.org\/10.1007\/s10115-024-02113-7","relation":{},"ISSN":["0219-1377","0219-3116"],"issn-type":[{"value":"0219-1377","type":"print"},{"value":"0219-3116","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,4,25]]},"assertion":[{"value":"6 April 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"29 January 2024","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"27 March 2024","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"25 April 2024","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}