{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,13]],"date-time":"2026-02-13T08:49:04Z","timestamp":1770972544176,"version":"3.50.1"},"reference-count":41,"publisher":"Oxford University Press (OUP)","issue":"15","license":[{"start":{"date-parts":[[2017,3,24]],"date-time":"2017-03-24T00:00:00Z","timestamp":1490313600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/about_us\/legal\/notices"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2017,8,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Disease named entities play a central role in many areas of biomedical research, and automatic recognition and normalization of such entities have received increasing attention in biomedical research communities. Existing methods typically used pipeline models with two independent phases: (i) a disease named entity recognition (DER) system is used to find the boundaries of mentions in text and (ii) a disease named entity normalization (DEN) system is used to connect the mentions recognized to concepts in a controlled vocabulary. The main problems of such models are: (i) there is error propagation from DER to DEN and (ii) DEN is useful for DER, but pipeline models cannot utilize this.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Methods<\/jats:title>\n                  <jats:p>We propose a transition-based model to jointly perform disease named entity recognition and normalization, casting the output construction process into an incremental state transition process, learning sequences of transition actions globally, which correspond to joint structural outputs. Beam search and online structured learning are used, with learning being designed to guide search. Compared with the only existing method for joint DEN and DER, our method allows non-local features to be used, which significantly improves the accuracies.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>We evaluate our model on two corpora: the BioCreative V Chemical Disease Relation (CDR) corpus and the NCBI disease corpus. Experiments show that our joint framework achieves significantly higher performances compared to competitive pipeline baselines. Our method compares favourably to other state-of-the-art approaches.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and Implementation<\/jats:title>\n                  <jats:p>Data and code are available at https:\/\/github.com\/louyinxia\/jointRN<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btx172","type":"journal-article","created":{"date-parts":[[2017,3,24]],"date-time":"2017-03-24T03:10:34Z","timestamp":1490325034000},"page":"2363-2371","source":"Crossref","is-referenced-by-count":39,"title":["A transition-based joint model for disease named entity recognition and normalization"],"prefix":"10.1093","volume":"33","author":[{"given":"Yinxia","family":"Lou","sequence":"first","affiliation":[{"name":"Computer School, Wuhan University, Wuhan, China"},{"name":"School of Computer and Information Technology, Shangqiu Normal University, Shangqiu, China"}]},{"given":"Yue","family":"Zhang","sequence":"additional","affiliation":[{"name":"Singapore University of Technology and Design, Shangqiu Normal University, Shangqiu, China"}]},{"given":"Tao","family":"Qian","sequence":"additional","affiliation":[{"name":"College of Computer Science and Technology,Hubei University of Science and Technology, Xianning, China"}]},{"given":"Fei","family":"Li","sequence":"additional","affiliation":[{"name":"Computer School, Wuhan University, Wuhan, China"}]},{"given":"Shufeng","family":"Xiong","sequence":"additional","affiliation":[{"name":"Pingdingshan University, Pingdingshan, China"}]},{"given":"Donghong","family":"Ji","sequence":"additional","affiliation":[{"name":"Computer School, Wuhan University, Wuhan, China"}]}],"member":"286","published-online":{"date-parts":[[2017,3,24]]},"reference":[{"key":"2023063012500182100_btx172-B1","author":"Andor","year":"2016"},{"key":"2023063012500182100_btx172-B2","author":"Bohnet","year":"2012"},{"key":"2023063012500182100_btx172-B3","author":"Chowdhury","year":"2010"},{"key":"2023063012500182100_btx172-B4","author":"Collins","year":"2002"},{"key":"2023063012500182100_btx172-B5","author":"Collins","year":"2004"},{"key":"2023063012500182100_btx172-B6","author":"Constant","year":"2016"},{"key":"2023063012500182100_btx172-B7","doi-asserted-by":"crossref","first-page":"914","DOI":"10.1093\/nar\/gku935","article-title":"The comparative toxicogenomics database\u2019s 10th year anniversary: update 2015","volume":"43","author":"Davis","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"2023063012500182100_btx172-B8","author":"Doddington","year":"2002"},{"key":"2023063012500182100_btx172-B9","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.jbi.2013.12.006","article-title":"Ncbi disease corpus: a resource for disease name recognition and concept normalization","volume":"47","author":"Do\u011fan","year":"2014","journal-title":"J. Biomed. Informatics"},{"key":"2023063012500182100_btx172-B10","author":"Ghiasvand","year":"2014"},{"key":"2023063012500182100_btx172-B11","author":"Hatori","year":"2011"},{"key":"2023063012500182100_btx172-B12","doi-asserted-by":"crossref","first-page":"D1032","DOI":"10.1093\/nar\/gkv1146","article-title":"Pcoskb: a knowledgebase on genes, diseases, ontology terms and biochemical pathways associated with polycystic ovary syndrome","volume":"44","author":"Joseph","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"2023063012500182100_btx172-B13","author":"Kaji","year":"2014"},{"key":"2023063012500182100_btx172-B14","doi-asserted-by":"crossref","first-page":"876","DOI":"10.1136\/amiajnl-2012-001173","article-title":"Using rule-based natural language processing to improve disease normalization in biomedical text","volume":"20","author":"Kang","year":"2013","journal-title":"J. Am. Med. Informatics Assoc"},{"key":"2023063012500182100_btx172-B15","doi-asserted-by":"crossref","first-page":"380","DOI":"10.1093\/jamia\/ocv108","article-title":"Normalizing clinical terms using learned edit distance patterns","volume":"23","author":"Kate","year":"2016","journal-title":"J. Am. Med. Informatics Assoc"},{"key":"2023063012500182100_btx172-B16","first-page":"210","article-title":"Importance of artificial neural network in medical diagnosis disease like acute nephritis disease and heart disease","volume":"2","author":"Khan","year":"2013","journal-title":"Int. J. Eng. Sci. Innovative Technol"},{"key":"2023063012500182100_btx172-B17","doi-asserted-by":"crossref","first-page":"2839","DOI":"10.1093\/bioinformatics\/btw343","article-title":"Taggerone: joint named entity recognition and normalization with semi-markov models","volume":"32","author":"Leaman","year":"2016","journal-title":"Bioinformatics"},{"key":"2023063012500182100_btx172-B18","author":"Leaman","year":"2008"},{"key":"2023063012500182100_btx172-B19","doi-asserted-by":"crossref","first-page":"2909","DOI":"10.1093\/bioinformatics\/btt474","article-title":"Dnorm: disease name normalization with pairwise learning to rank","volume":"29","author":"Leaman","year":"2013","journal-title":"Bioinformatics"},{"key":"2023063012500182100_btx172-B20","doi-asserted-by":"crossref","first-page":"28","DOI":"10.1016\/j.jbi.2015.07.010","article-title":"Challenges in clinical natural language processing for automated disorder normalization","volume":"57","author":"Leaman","year":"2015","journal-title":"J. Biomed. Informatics"},{"key":"2023063012500182100_btx172-B21","first-page":"1","article-title":"Audis: an automatic crf-enhanced disease normalization in biomedical text","author":"Lee","year":"2016","journal-title":"Database"},{"key":"2023063012500182100_btx172-B22","first-page":"707","article-title":"Binary codes capable of correcting deletions, insertions and reversals","volume":"10","author":"Levenshtein","year":"1966","journal-title":"Soviet Phys. Doklady"},{"key":"2023063012500182100_btx172-B23","first-page":"1263","article-title":"Joint pos tagging and text normalization for informal text","author":"Li","year":"2015","journal-title":"IJCAI"},{"key":"2023063012500182100_btx172-B24","first-page":"1","article-title":"Biocreative v cdr task corpus: a resource for chemical disease relation extraction","author":"Li","year":"2016","journal-title":"Database"},{"key":"2023063012500182100_btx172-B25","first-page":"240","article-title":"Leadmine: disease identification and concept mapping using wikipedia","volume-title":"Proceedings of the Fifth BioCreative Challenge Evaluation Workshop","author":"Lowe","year":"2015"},{"key":"2023063012500182100_btx172-B26","first-page":"879","article-title":"Joint named entity recognition and disambiguation","volume-title":"Proc. EMNLP","author":"Luo","year":"2015"},{"key":"2023063012500182100_btx172-B27","author":"Lyu","year":"2016"},{"key":"2023063012500182100_btx172-B28","first-page":"54","article-title":"Semeval-2014 task 7: analysis of clinical text","author":"Pradhan","year":"2014"},{"key":"2023063012500182100_btx172-B29","author":"Qian","year":"2015"},{"key":"2023063012500182100_btx172-B30","author":"Sahu","year":"2016"},{"key":"2023063012500182100_btx172-B31","author":"Sampo","year":"2013"},{"key":"2023063012500182100_btx172-B32","author":"S\u00f8Gaard","year":"2011"},{"key":"2023063012500182100_btx172-B33","author":"Stern","year":"2012"},{"key":"2023063012500182100_btx172-B34","author":"Watanabe","year":"2015"},{"key":"2023063012500182100_btx172-B35","author":"Wei","year":"2015"},{"key":"2023063012500182100_btx172-B36","author":"Wei","year":"2015"},{"key":"2023063012500182100_btx172-B37","author":"Zhang","year":"2008"},{"key":"2023063012500182100_btx172-B38","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1162\/coli_a_00037","article-title":"Syntactic processing using the generalized perceptron and beam search","volume":"37","author":"Zhang","year":"2011","journal-title":"Comput. Linguist"},{"key":"2023063012500182100_btx172-B39","author":"Zhang","year":"2011"},{"key":"2023063012500182100_btx172-B40","author":"Zhou","year":"2015"},{"key":"2023063012500182100_btx172-B41","author":"Zhu","year":"2013"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/33\/15\/2363\/50756459\/bioinformatics_33_15_2363.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/33\/15\/2363\/50756459\/bioinformatics_33_15_2363.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,30]],"date-time":"2023-06-30T12:50:26Z","timestamp":1688129426000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/33\/15\/2363\/3089942"}},"subtitle":[],"editor":[{"given":"Jonathan","family":"Wren","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2017,3,24]]},"references-count":41,"journal-issue":{"issue":"15","published-print":{"date-parts":[[2017,8,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btx172","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2017,8,1]]},"published":{"date-parts":[[2017,3,24]]}}}