{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,1]],"date-time":"2026-02-01T21:11:33Z","timestamp":1769980293375,"version":"3.49.0"},"reference-count":40,"publisher":"Springer Science and Business Media LLC","issue":"30","license":[{"start":{"date-parts":[[2023,8,6]],"date-time":"2023-08-06T00:00:00Z","timestamp":1691280000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,8,6]],"date-time":"2023-08-06T00:00:00Z","timestamp":1691280000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Neural Comput &amp; Applic"],"published-print":{"date-parts":[[2023,10]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Named entity recognition (NER) is a fundamental task for natural language processing, which aims to detect mentions of real-world entities from text and classifying them into predefined types. Recently, research on overlapped and discontinuous named entity recognition has received increasing attention. However, we note that few studies have considered both overlapped and discontinuous entities. In this paper, we proposed a novel sequence-to-sequence model that is capable of recognizing both overlapped and discontinuous entities based on machine reading comprehension. The model utilizes machine reading comprehension formulation to encode significant inferior information about the entity category. Then input sequence passes through a question-answering model to predict the mention relevance of the given source sentences to the query. Finally, we incorporate the mention relevance into the BART-based generation model. We conducted experiments on three type of NER datasets to show the generality of our model. The experimental results demonstrate that our model beats almost all the current top-performing baselines achieves a vast amount of performance boost over current SOTA models on overlapped and discontinuous NER datasets.<\/jats:p>","DOI":"10.1007\/s00521-023-08820-6","type":"journal-article","created":{"date-parts":[[2023,8,6]],"date-time":"2023-08-06T14:01:21Z","timestamp":1691330481000},"page":"22223-22234","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["Improving unified named entity recognition by incorporating mention relevance"],"prefix":"10.1007","volume":"35","author":[{"given":"Lijun","family":"Ji","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Danfeng","family":"Yan","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhuoran","family":"Cheng","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yan","family":"Song","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2023,8,6]]},"reference":[{"key":"8820_CR1","unstructured":"Sang EFTK, Meulder FD (2003) Introduction to the conll-2003 shared task: language-independent named entity recognition. In: Proceedings of the seventh conference on natural language learning. https:\/\/aclanthology.org\/W03-0419\/"},{"key":"8820_CR2","doi-asserted-by":"crossref","unstructured":"Alex B, Haddow B, Grover C (2007) Recognising nested named entities in biomedical text. In: Biological translational and clinical language processing BioNLP@ACL, pp 65\u201372. https:\/\/aclanthology.org\/W07-1009\/","DOI":"10.3115\/1572392.1572404"},{"key":"8820_CR3","unstructured":"Huang Z, Xu W, Yu K (2015) Bidirectional LSTM-CRF models for sequence tagging. CoRR arXiv:abs\/1508.01991"},{"key":"8820_CR4","doi-asserted-by":"publisher","first-page":"357","DOI":"10.1162\/tacl_a_00104","volume":"4","author":"JPC Chiu","year":"2016","unstructured":"Chiu JPC, Nichols E (2016) Named entity recognition with bidirectional lstm-cnns. Trans Assoc Comput Linguist 4:357\u2013370","journal-title":"Trans Assoc Comput Linguist"},{"key":"8820_CR5","doi-asserted-by":"crossref","unstructured":"Ma X, Hovy EH (2016) End-to-end sequence labeling via bi-directional lstm-cnns-crf. In: Proceedings of the 54th annual meeting of the association for computational linguistics. https:\/\/doi.org\/10.18653\/v1\/p16-1101","DOI":"10.18653\/v1\/P16-1101"},{"key":"8820_CR6","doi-asserted-by":"publisher","unstructured":"Ju M, Miwa M, Ananiadou S (2018) A neural layered model for nested named entity recognition. In: Proceedings of the 2018 conference of the North American chapter of the association for computational linguistics: human language technologies, vol 1 (Long Papers), pp 1446\u20131459. https:\/\/doi.org\/10.18653\/v1\/N18-1131.https:\/\/aclanthology.org\/N18-1131","DOI":"10.18653\/v1\/N18-1131."},{"key":"8820_CR7","doi-asserted-by":"crossref","unstructured":"Strakov\u00e1 J, Straka M, Haji\u010d J (2019) Neural architectures for nested ner through linearization. arXiv preprint arXiv:1908.06926","DOI":"10.18653\/v1\/P19-1527"},{"key":"8820_CR8","doi-asserted-by":"crossref","unstructured":"Wang J, Shou L, Chen K, Chen G (2020) Pyramid: a layered model for nested named entity recognition. In: Proceedings of the 58th annual meeting of the association for computational linguistics, pp 5918\u20135928","DOI":"10.18653\/v1\/2020.acl-main.525"},{"key":"8820_CR9","doi-asserted-by":"crossref","unstructured":"Lu W, Roth D (2015) Joint mention extraction and classification with mention hypergraphs. In: Conference on empirical methods in natural language processing","DOI":"10.18653\/v1\/D15-1102"},{"key":"8820_CR10","doi-asserted-by":"publisher","unstructured":"Luan Y, Wadden D, He L, Shah A, Ostendorf M, Hajishirzi H (2019) A general framework for information extraction using dynamic span graphs. In: Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, pp 3036\u20133046. https:\/\/doi.org\/10.18653\/v1\/n19-1308","DOI":"10.18653\/v1\/n19-1308"},{"key":"8820_CR11","doi-asserted-by":"publisher","unstructured":"Wang B, Lu W (2019) Combining spans into entities: a neural two-stage approach for recognizing discontiguous entities. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing, pp 6215\u20136223. https:\/\/doi.org\/10.18653\/v1\/D19-1644","DOI":"10.18653\/v1\/D19-1644"},{"key":"8820_CR12","doi-asserted-by":"publisher","unstructured":"Yu J, Bohnet B, Poesio M (2020) Named entity recognition as dependency parsing. In: Proceedings of the 58th annual meeting of the association for computational linguistics, ACL 2020, Online, 5\u201310 July 2020, pp 6470\u20136476. https:\/\/doi.org\/10.18653\/v1\/2020.acl-main.577","DOI":"10.18653\/v1\/2020.acl-main.577"},{"key":"8820_CR13","doi-asserted-by":"publisher","unstructured":"Li F, Lin Z, Zhang M, Ji D (2021) A span-based model for joint overlapped and discontinuous named entity recognition. In: Proceedings of the 59th annual meeting of the association for computational linguistics and the 11th international joint conference on natural language processing, pp 4814\u20134828. https:\/\/doi.org\/10.18653\/v1\/2021.acl-long.372","DOI":"10.18653\/v1\/2021.acl-long.372"},{"key":"8820_CR14","doi-asserted-by":"publisher","unstructured":"Li X, Feng J, Meng Y, Han Q, Wu F, Li J (2020) A unified MRC framework for named entity recognition. In: Proceedings of the 58th annual meeting of the association for computational linguistics, pp 5849\u20135859. https:\/\/doi.org\/10.18653\/v1\/2020.acl-main.519","DOI":"10.18653\/v1\/2020.acl-main.519"},{"key":"8820_CR15","doi-asserted-by":"publisher","unstructured":"Li X, Sun X, Meng Y, Liang J, Wu F, Li J (2020) Dice loss for data-imbalanced NLP tasks. In: Proceedings of the 58th annual meeting of the association for computational linguistics, pp 465\u2013476. https:\/\/doi.org\/10.18653\/v1\/2020.acl-main.45","DOI":"10.18653\/v1\/2020.acl-main.45"},{"key":"8820_CR16","doi-asserted-by":"crossref","unstructured":"Phan TH, Do P (2021) Ner2ques: combining named entity recognition and sequence to sequence to automatically generating vietnamese questions. Neural Comput Appl 1\u201320","DOI":"10.1007\/s00521-021-06477-7"},{"key":"8820_CR17","doi-asserted-by":"publisher","unstructured":"Cui L, Wu Y, Liu J, Yang S, Zhang Y (2021) Template-based named entity recognition using BART. In: Findings of the association for computational linguistics: ACL\/IJCNLP 2021, vol ACL\/IJCNLP 2021, pp 1835\u20131845. https:\/\/doi.org\/10.18653\/v1\/2021.findings-acl.161","DOI":"10.18653\/v1\/2021.findings-acl.161"},{"key":"8820_CR18","doi-asserted-by":"publisher","unstructured":"Yan H, Gui T, Dai J, Guo Q, Zhang Z, Qiu X (2021) A unified generative framework for various NER subtasks. In: Proceedings of the 59th annual meeting of the association for computational linguistics and the 11th international joint conference on natural language processing. https:\/\/doi.org\/10.18653\/v1\/2021.acl-long.451","DOI":"10.18653\/v1\/2021.acl-long.451"},{"key":"8820_CR19","doi-asserted-by":"publisher","unstructured":"Su D, Yu T, Fung P (2021) Improve query focused abstractive summarization by incorporating answer relevance. In: Findings of the association for computationa linguistics: ACL\/IJCNLP 2021, pp 3124\u20133131. https:\/\/doi.org\/10.18653\/v1\/2021.findings-acl.275","DOI":"10.18653\/v1\/2021.findings-acl.275"},{"key":"8820_CR20","doi-asserted-by":"publisher","unstructured":"Su D, Xu Y, Winata G.I, Xu P, Kim H, Liu Z, Fung P (2019) Generalizing question answering system with pre-trained language model fine-tuning. In: Proceedings of the 2nd workshop on machine reading for question answering, pp 203\u2013211. https:\/\/doi.org\/10.18653\/v1\/D19-5827","DOI":"10.18653\/v1\/D19-5827"},{"key":"8820_CR21","doi-asserted-by":"crossref","unstructured":"Liu L, Shang J, Ren X, Xu FF, Gui H, Peng J, Han J (2018) Empower sequence labeling with task-aware neural language model. In: Proceedings of the thirty-second AAAI conference on artificial intelligence, pp 5253\u20135260. AAAI Press. https:\/\/www.aaai.org\/ocs\/index.php\/AAAI\/AAAI18\/paper\/view\/17123","DOI":"10.1609\/aaai.v32i1.12006"},{"key":"8820_CR22","unstructured":"Sapci AOB, Tastan \u00d6, Yeniterzi R (2021) Focusing on possible named entities in active named entity label acquisition. CoRR arXiv:abs\/2111.03837"},{"key":"8820_CR23","doi-asserted-by":"crossref","unstructured":"Zhou G, Su J (2002) Named entity recognition using an hmm-based chunk tagger. In: Proceedings of the 40th annual meeting of the association for computational linguistics, pp 473\u2013480. https:\/\/aclanthology.org\/P02-1060\/","DOI":"10.3115\/1073083.1073163"},{"key":"8820_CR24","doi-asserted-by":"publisher","unstructured":"Devlin J, Chang M, Lee K, Toutanova K (2019) BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, pp 4171\u20134186. https:\/\/doi.org\/10.18653\/v1\/n19-1423","DOI":"10.18653\/v1\/n19-1423"},{"key":"8820_CR25","doi-asserted-by":"publisher","unstructured":"Peters ME, Neumann M, Iyyer M, Gardner M, Clark C, Lee K, Zettlemoyer L (2018) Deep contextualized word representations. In: Proceedings of the 2018 conference of the North American chapter of the association for computational linguistics: human language technologies, pp 2227\u20132237. https:\/\/doi.org\/10.18653\/v1\/n18-1202","DOI":"10.18653\/v1\/n18-1202"},{"key":"8820_CR26","doi-asserted-by":"crossref","unstructured":"Kim J, Ohta T, Tateisi Y, Tsujii J (2003) GENIA corpus\u2014a semantically annotated corpus for bio-textmining. In: Proceedings of the eleventh international conference on intelligent systems for molecular biology, pp 180\u2013182. http:\/\/bioinformatics.oupjournals.org\/cgi\/content\/abstract\/19\/suppl_1\/i180?etoc","DOI":"10.1093\/bioinformatics\/btg1023"},{"key":"8820_CR27","doi-asserted-by":"publisher","unstructured":"Strakov\u00e1 J, Straka M, Hajic J (2019) Neural architectures for nested NER through linearization. In: Proceedings of the 57th conference of the association for computational linguistics, pp 5326\u20135331. https:\/\/doi.org\/10.18653\/v1\/p19-1527","DOI":"10.18653\/v1\/p19-1527"},{"key":"8820_CR28","doi-asserted-by":"publisher","unstructured":"Gillick D, Brunk C, Vinyals O, Subramanya A (2016) Multilingual language processing from bytes. In: NAACL HLT 2016, The 2016 conference of the North American chapter of the association for computational linguistics: human language technologies, pp 1296\u20131306. https:\/\/doi.org\/10.18653\/v1\/n16-1155","DOI":"10.18653\/v1\/n16-1155"},{"key":"8820_CR29","unstructured":"Chen X, Li L, Deng S, Tan C, Xu C, Huang F, Si L, Chen H, Zhang N (2022) Lightner: a lightweight tuning paradigm for low-resource ner via pluggable prompting. In: Proceedings of the 29th international conference on computational linguistics, pp 2374\u20132387"},{"key":"8820_CR30","doi-asserted-by":"crossref","unstructured":"Fei H, Ji D, Li B, Liu Y, Ren Y, Li F (2021) Rethinking boundaries: end-to-end recognition of discontinuous mentions with pointer networks. In: Proceedings of the AAAI conference on artificial intelligence, vol 35, pp 12785\u201312793","DOI":"10.1609\/aaai.v35i14.17513"},{"key":"8820_CR31","doi-asserted-by":"publisher","unstructured":"Dai X, Karimi S, Hachey B, Paris C (2020) An effective transition-based model for discontinuous NER. In: Proceedings of the 58th annual meeting of the association for computational linguistics, pp. 5860\u20135870. https:\/\/doi.org\/10.18653\/v1\/2020.acl-main.520","DOI":"10.18653\/v1\/2020.acl-main.520"},{"key":"8820_CR32","doi-asserted-by":"publisher","unstructured":"Wang B, Lu W (2018) Neural segmental hypergraphs for overlapping mention recognition. In: Riloff E, Chiang D, Hockenmaier J, Tsujii J (eds) Proceedings of the 2018 conference on empirical methods in natural language processing, Brussels, Belgium, October 31\u2013November 4, 2018, pp. 204\u2013214. Association for Computational Linguistics. https:\/\/doi.org\/10.18653\/v1\/d18-1019","DOI":"10.18653\/v1\/d18-1019"},{"key":"8820_CR33","unstructured":"Li J, Fei H, Liu J, Wu S, Zhang M, Teng C, Ji D, Li F (2021) Unified named entity recognition as word-word relation classification. CoRR arXiv:abs\/2112.10070"},{"key":"8820_CR34","unstructured":"Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser L, Polosukhin I (2017) Attention is all you need. In: Advances in neural information processing systems 30: annual conference on neural information processing systems 2017, pp 5998\u20136008. https:\/\/proceedings.neurips.cc\/paper\/2017\/hash\/3f5ee243547dee91fbd053c1c4a845aa-Abstract.html"},{"key":"8820_CR35","unstructured":"Bahdanau D, Cho K, Bengio Y (2015) Neural machine translation by jointly learning to align and translate. In: 3rd International conference on learning representations. arxiv: http:\/\/arxiv.org\/abs\/1409.0473"},{"key":"8820_CR36","doi-asserted-by":"publisher","unstructured":"See A, Liu PJ, Manning CD (2017) Get to the point: summarization with pointer-generator networks. In: Proceedings of the 55th annual meeting of the association for computational linguistics, pp 1073\u20131083. https:\/\/doi.org\/10.18653\/v1\/P17-1099","DOI":"10.18653\/v1\/P17-1099"},{"key":"8820_CR37","doi-asserted-by":"publisher","unstructured":"Katiyar A, Cardie C (2018) Nested named entity recognition revisited. In: Proceedings of the 2018 conference of the North American chapter of the association for computational linguistics: human language technologies, pp 861\u2013871. https:\/\/doi.org\/10.18653\/v1\/n18-1079","DOI":"10.18653\/v1\/n18-1079"},{"key":"8820_CR38","doi-asserted-by":"publisher","unstructured":"Yu J, Bohnet B, Poesio M (2020) Named entity recognition as dependency parsing. In: Proceedings of the 58th annual meeting of the association for computational linguistics, pp 6470\u20136476. https:\/\/doi.org\/10.18653\/v1\/2020.acl-main.577","DOI":"10.18653\/v1\/2020.acl-main.577"},{"key":"8820_CR39","unstructured":"Loshchilov I, Hutter F (2019) Decoupled weight decay regularization. In: 7th International conference on learning representations. https:\/\/openreview.net\/forum?id=Bkg6RiCqY7"},{"key":"8820_CR40","unstructured":"Gu J, Bradbury J, Xiong C, Li V.O, Socher R Non-autoregressive neural machine translation. In: International conference on learning representations"}],"container-title":["Neural Computing and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00521-023-08820-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s00521-023-08820-6\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00521-023-08820-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,9,16]],"date-time":"2023-09-16T15:03:29Z","timestamp":1694876609000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s00521-023-08820-6"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,8,6]]},"references-count":40,"journal-issue":{"issue":"30","published-print":{"date-parts":[[2023,10]]}},"alternative-id":["8820"],"URL":"https:\/\/doi.org\/10.1007\/s00521-023-08820-6","relation":{},"ISSN":["0941-0643","1433-3058"],"issn-type":[{"value":"0941-0643","type":"print"},{"value":"1433-3058","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,8,6]]},"assertion":[{"value":"29 June 2022","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"28 June 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"6 August 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}