{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,10]],"date-time":"2026-04-10T07:42:22Z","timestamp":1775806942566,"version":"3.50.1"},"reference-count":61,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2021,7,18]],"date-time":"2021-07-18T00:00:00Z","timestamp":1626566400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000006","name":"Office of Naval Research","doi-asserted-by":"publisher","award":["N00014-20-1-2332"],"award-info":[{"award-number":["N00014-20-1-2332"]}],"id":[{"id":"10.13039\/100000006","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000185","name":"DARPA","doi-asserted-by":"crossref","award":["W911NF2020006"],"award-info":[{"award-number":["W911NF2020006"]}],"id":[{"id":"10.13039\/100000185","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Comput. Healthcare"],"published-print":{"date-parts":[[2021,10,31]]},"abstract":"<jats:p>In this work, we formulated the named entity recognition (NER) task as a multi-answer knowledge guided question-answer task (KGQA) and showed that the knowledge guidance helps to achieve state-of-the-art results for 11 of 18 biomedical NER datasets. We prepended five different knowledge contexts\u2014entity types, questions, definitions, and examples\u2014to the input text and trained and tested BERT-based neural models on such input sequences from a combined dataset of the 18 different datasets. This novel formulation of the task (a) improved named entity recognition and illustrated the impact of different knowledge contexts, (b) reduced system confusion by limiting prediction to a single entity-class for each input token (i.e.,<jats:italic>B<\/jats:italic>,<jats:italic>I<\/jats:italic>,<jats:italic>O<\/jats:italic>only) compared to multiple entity-classes in traditional NER (i.e.,<jats:italic>B<\/jats:italic><jats:sub><jats:italic>entity<\/jats:italic><\/jats:sub>1,<jats:italic>B<\/jats:italic><jats:sub><jats:italic>entity<\/jats:italic><\/jats:sub>2,<jats:italic>I<\/jats:italic><jats:sub><jats:italic>entity<\/jats:italic><\/jats:sub>1,<jats:italic>I<\/jats:italic>,<jats:italic>O<\/jats:italic>), (c) made detection of nested entities easier, and (d) enabled the models to jointly learn NER-specific features from a large number of datasets. We performed extensive experiments of this KGQA formulation on the biomedical datasets, and through the experiments, we showed when knowledge improved named entity recognition. We analyzed the effect of the task formulation, the impact of the different knowledge contexts, the multi-task aspect of the generic format, and the generalization ability of KGQA. We also probed the model to better understand the key contributors for these improvements.<\/jats:p>","DOI":"10.1145\/3465221","type":"journal-article","created":{"date-parts":[[2021,7,18]],"date-time":"2021-07-18T16:05:05Z","timestamp":1626624305000},"page":"1-24","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":20,"title":["Biomedical Named Entity Recognition via Knowledge Guidance and Question Answering"],"prefix":"10.1145","volume":"2","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5634-410X","authenticated-orcid":false,"given":"Pratyay","family":"Banerjee","sequence":"first","affiliation":[{"name":"School of Computing, Informatics, and Decision Systems Engineering, Arizona State University, U.S.A"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1278-3252","authenticated-orcid":false,"given":"Kuntal Kumar","family":"Pal","sequence":"additional","affiliation":[{"name":"School of Computing, Informatics, and Decision Systems Engineering, Arizona State University, U.S.A"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3418-8924","authenticated-orcid":false,"given":"Murthy","family":"Devarakonda","sequence":"additional","affiliation":[{"name":"School of Computing, Informatics, and Decision Systems Engineering, Arizona State University, U.S.A"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7549-723X","authenticated-orcid":false,"given":"Chitta","family":"Baral","sequence":"additional","affiliation":[{"name":"School of Computing, Informatics, and Decision Systems Engineering, Arizona State University, U.S.A"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2021,7,18]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine (BIBM\u201917)","author":"Abacha Asma Ben"},{"key":"e_1_2_1_2_1","volume-title":"Proceedings of the 1st Workshop on Natural Language Processing for Medical Conversations. Association for Computational Linguistics, 31\u201340","author":"Amith Muhammad","year":"2020"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-13-161"},{"key":"e_1_2_1_4_1","first-page":"19","volume-title":"Proceedings of the Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP\u201919)","author":"Beltagy Iz","year":"2019"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/S17-2093"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/gkh061"},{"key":"e_1_2_1_7_1","volume-title":"Proceedings of the 6th Workshop on Very Large Corpora.","author":"Borthwick Andrew","year":"1998"},{"key":"e_1_2_1_8_1","volume-title":"Proceedings of the AMIA Summits on Translational Science. 592","author":"Chowdhuri Sanchari","year":"2019"},{"key":"e_1_2_1_9_1","volume-title":"Proceedings of the NIPS Workshop on Advances in Structured Learning for Text and Speech Processing","volume":"2005","author":"Ciaramita Massimiliano","year":"2005"},{"key":"e_1_2_1_10_1","volume-title":"Artificial Intelligence Methods and Tools for Systems Biology","author":"Bretonnel Cohen K."},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1186\/s12859-017-1776-8"},{"key":"e_1_2_1_12_1","first-page":"19","volume-title":"Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics, 4171\u20134186","author":"Devlin Jacob","year":"2019"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jbi.2013.12.006"},{"key":"e_1_2_1_14_1","volume-title":"Question answering is a format","author":"Gardner Matt","year":"1909"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-11-85"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btz504"},{"key":"e_1_2_1_17_1","volume-title":"Proceedings of the Conference on Empirical Methods in Natural Language Processing. 643\u2013653","author":"He Luheng","year":"2015"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N18-1079"},{"key":"e_1_2_1_19_1","volume-title":"Proceedings of the Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL\u201907)","author":"Kazama Jun\u2019ichi","year":"2007"},{"key":"e_1_2_1_20_1","volume-title":"Proceedings of the BioNLP Workshop Companion","volume":"9","author":"Kim Jin-Dong","year":"2009"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btg1023"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.5555\/1567594.1567610"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1186\/1758-2946-7-S1-S1"},{"key":"e_1_2_1_24_1","first-page":"17","volume-title":"Proceedings of the Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 785\u2013794","author":"Lai Guokun","year":"2017"},{"key":"e_1_2_1_25_1","doi-asserted-by":"crossref","first-page":"1234","DOI":"10.1093\/bioinformatics\/btz682","article-title":"BioBERT: A pre-trained biomedical language representation model for biomedical text mining","volume":"36","author":"Lee Jinhyuk","year":"2020","journal-title":"Bioinformatics"},{"key":"e_1_2_1_26_1","volume-title":"Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 5849\u20135859","author":"Li Xiaoya","year":"2020"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1511"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1197\/jamia.M2085"},{"key":"e_1_2_1_29_1","first-page":"19","volume-title":"Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 5301\u20135307","author":"Liu Tianyu","year":"2019"},{"key":"e_1_2_1_30_1","volume-title":"Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 3036\u20133046","author":"Luan Yi","year":"2019"},{"key":"e_1_2_1_31_1","volume-title":"Caiming Xiong, and Richard Socher.","author":"McCann Bryan","year":"2018"},{"key":"e_1_2_1_32_1","volume-title":"Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP\u201918)","author":"Mihaylov Todor","year":"2018"},{"key":"e_1_2_1_33_1","volume-title":"Proceedings of the BioNLP Shared Task Workshop. 1\u20137.","author":"N\u00e9dellec Claire","year":"2013"},{"key":"e_1_2_1_34_1","volume-title":"Proceedings of the BioNLP Shared Task Workshop. 1\u20136.","author":"Ohta Tomoko","year":"2011"},{"key":"e_1_2_1_35_1","volume-title":"Advances in Neural Information Processing Systems 32","author":"Paszke Adam"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btt580"},{"key":"e_1_2_1_37_1","volume-title":"Proceedings of the BioNLP Workshop. 114\u2013123","author":"Pyysalo Sampo","year":"2011"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D16-1264"},{"key":"e_1_2_1_39_1","volume-title":"Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP\u201919)","author":"Sap Maarten","year":"2019"},{"key":"e_1_2_1_40_1","volume-title":"Proceedings of the AMIA Summits on Translational Science. 561","author":"Savery Max E.","year":"2020"},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00334"},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1093\/jamia\/ocz096"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1186\/gb-2008-9-s2-s2"},{"key":"e_1_2_1_44_1","doi-asserted-by":"crossref","first-page":"158","DOI":"10.1186\/s12938-018-0573-6","article-title":"Comparison of named entity recognition methodologies in biomedical documents","volume":"17","author":"Song Hye-Jeong","year":"2018","journal-title":"Biomed. Eng. Online"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1527"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1136\/amiajnl-2013-001628"},{"key":"e_1_2_1_47_1","first-page":"19","volume-title":"Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics, 4149\u20134158","author":"Talmor Alon","year":"2019"},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1136\/amiajnl-2011-000784"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1136\/jamia.2010.003947"},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1136\/amiajnl-2011-000203"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1186\/s12859-019-3000-5"},{"key":"e_1_2_1_52_1","volume-title":"Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine (BIBM\u201919)","author":"Wang Xuan","year":"2019"},{"key":"e_1_2_1_53_1","doi-asserted-by":"crossref","first-page":"1745","DOI":"10.1093\/bioinformatics\/bty869","article-title":"Cross-type biomedical named entity recognition with deep multi-task learning","volume":"35","author":"Wang Xuan","year":"2018","journal-title":"Bioinformatics"},{"key":"e_1_2_1_54_1","volume-title":"Proceedings of the 5th Biocreative Challenge Evaluation Workshop","volume":"14","author":"Wei Chih-Hsuan","year":"2015"},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1080\/01621459.1927.10502953"},{"key":"e_1_2_1_56_1","doi-asserted-by":"crossref","unstructured":"Thomas Wolf Lysandre Debut Victor Sanh Julien Chaumond Clement Delangue Anthony Moi Pierric Cistac Tim Rault R\u2019emi Louf Morgan Funtowicz and Jamie Brew. 2019. HuggingFace\u2019s transformers: State-of-the-art natural language processing. Retrieved from https:\/\/abs\/1910.03771. Thomas Wolf Lysandre Debut Victor Sanh Julien Chaumond Clement Delangue Anthony Moi Pierric Cistac Tim Rault R\u2019emi Louf Morgan Funtowicz and Jamie Brew. 2019. HuggingFace\u2019s transformers: State-of-the-art natural language processing. Retrieved from https:\/\/abs\/1910.03771.","DOI":"10.18653\/v1\/2020.emnlp-demos.6"},{"key":"e_1_2_1_57_1","doi-asserted-by":"crossref","first-page":"457","DOI":"10.1093\/jamia\/ocz200","article-title":"Deep learning in clinical natural language processing: A methodical review","volume":"27","author":"Wu Stephen","year":"2020","journal-title":"J. Amer. Med. Info. Assoc."},{"key":"e_1_2_1_58_1","volume-title":"Proceedings of the 27th International Conference on Computational Linguistics. Association for Computational Linguistics","author":"Yadav Vikas","year":"2018"},{"key":"e_1_2_1_59_1","volume-title":"Proceedings of the 7th Joint Conference on Lexical and Computational Semantics. 167\u2013172","author":"Yadav Vikas","year":"2018"},{"key":"e_1_2_1_60_1","doi-asserted-by":"crossref","first-page":"249","DOI":"10.1186\/s12859-019-2813-6","article-title":"Collabonet: Collaboration of deep neural networks for biomedical named entity recognition","volume":"20","author":"Yoon Wonjin","year":"2019","journal-title":"BMC Bioinform."},{"key":"e_1_2_1_61_1","volume-title":"Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 6470\u20136476","author":"Yu Juntao","year":"2020"}],"container-title":["ACM Transactions on Computing for Healthcare"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3465221","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3465221","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3465221","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:17:11Z","timestamp":1750191431000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3465221"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,7,18]]},"references-count":61,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2021,10,31]]}},"alternative-id":["10.1145\/3465221"],"URL":"https:\/\/doi.org\/10.1145\/3465221","relation":{},"ISSN":["2691-1957","2637-8051"],"issn-type":[{"value":"2691-1957","type":"print"},{"value":"2637-8051","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,7,18]]},"assertion":[{"value":"2020-09-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-05-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-07-18","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}