{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,31]],"date-time":"2026-03-31T08:14:54Z","timestamp":1774944894796,"version":"3.50.1"},"reference-count":20,"publisher":"Oxford University Press (OUP)","issue":"12","license":[{"start":{"date-parts":[[2019,9,24]],"date-time":"2019-09-24T00:00:00Z","timestamp":1569283200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100012455","name":"China Knowledge Centre for Engineering Sciences and Technology","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100012455","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000002","name":"NIH","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2019,12,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Traditional Chinese Medicine (TCM) has been developed for several thousand years and plays a significant role in health care for Chinese people. This paper studies the problem of classifying TCM clinical records into 5 main disease categories in TCM. We explored a number of state-of-the-art deep learning models and found that the recent Bidirectional Encoder Representations from Transformers can achieve better results than other deep learning models and other state-of-the-art methods. We further utilized an unlabeled clinical corpus to fine-tune the BERT language model before training the text classifier. The method only uses Chinese characters in clinical text as input without preprocessing or feature engineering. We evaluated deep learning models and traditional text classifiers on a benchmark data set. Our method achieves a state-of-the-art accuracy 89.39% \u00b1 0.35%, Macro F1 score 88.64% \u00b1 0.40% and Micro F1 score 89.39% \u00b1 0.35%. We also visualized attention weights in our method, which can reveal indicative characters in clinical text.<\/jats:p>","DOI":"10.1093\/jamia\/ocz164","type":"journal-article","created":{"date-parts":[[2019,8,21]],"date-time":"2019-08-21T11:40:21Z","timestamp":1566387621000},"page":"1632-1636","source":"Crossref","is-referenced-by-count":73,"title":["Traditional Chinese medicine clinical records classification with BERT and domain specific corpora"],"prefix":"10.1093","volume":"26","author":[{"given":"Liang","family":"Yao","sequence":"first","affiliation":[{"name":"Department of Preventive Medicine, Northwestern University, Chicago, Illinois, USA"}]},{"given":"Zhe","family":"Jin","sequence":"additional","affiliation":[{"name":"Zhejiang University, College of Computer Science and Technology, Hangzhou, China"}]},{"given":"Chengsheng","family":"Mao","sequence":"additional","affiliation":[{"name":"Department of Preventive Medicine, Northwestern University, Chicago, Illinois, USA"}]},{"given":"Yin","family":"Zhang","sequence":"additional","affiliation":[{"name":"Zhejiang University, College of Computer Science and Technology, Hangzhou, China"}]},{"given":"Yuan","family":"Luo","sequence":"additional","affiliation":[{"name":"Department of Preventive Medicine, Northwestern University, Chicago, Illinois, USA"}]}],"member":"286","published-online":{"date-parts":[[2019,9,24]]},"reference":[{"issue":"7378","key":"2021012411202149700_ocz164-B1","doi-asserted-by":"crossref","first-page":"S82","DOI":"10.1038\/480S82a","article-title":"TCM: Made in China","volume":"480","author":"Cheung","year":"2011","journal-title":"Nature"},{"issue":"4","key":"2021012411202149700_ocz164-B2","doi-asserted-by":"crossref","first-page":"650","DOI":"10.1016\/j.jbi.2010.01.002","article-title":"Text mining for traditional Chinese medical knowledge discovery: a survey","volume":"43","author":"Zhou","year":"2010","journal-title":"J Biomed Inform"},{"issue":"6","key":"2021012411202149700_ocz164-B3","doi-asserted-by":"crossref","first-page":"646","DOI":"10.1136\/jamia.2009.001024","article-title":"A systematic literature review of automated clinical coding and classification systems","volume":"17","author":"Stanfill","year":"2010","journal-title":"J Am Med Inform Assoc"},{"key":"2021012411202149700_ocz164-B4","doi-asserted-by":"crossref","first-page":"494","DOI":"10.1016\/j.eswa.2018.09.034","article-title":"Clinical text classification research trends: systematic literature review and open issues","volume":"116","author":"Mujtaba","year":"2019","journal-title":"Expert Syst Appl"},{"key":"2021012411202149700_ocz164-B5","author":"Yao"},{"key":"2021012411202149700_ocz164-B6","author":"Devlin"},{"key":"2021012411202149700_ocz164-B7","doi-asserted-by":"crossref","first-page":"9","DOI":"10.1016\/j.cmpb.2018.10.011","article-title":"End-to-End syndrome differentiation of Yin deficiency and Yang deficiency in traditional Chinese medicine","volume":"174","author":"Hu","year":"2019","journal-title":"Comput Methods Programs Biomed"},{"key":"2021012411202149700_ocz164-B8","author":"Kim"},{"key":"2021012411202149700_ocz164-B9","author":"Joulin"},{"key":"2021012411202149700_ocz164-B10","author":"Liu"},{"key":"2021012411202149700_ocz164-B11","author":"Yang"},{"key":"2021012411202149700_ocz164-B12","author":"Vaswani"},{"key":"2021012411202149700_ocz164-B13","author":"Peters"},{"key":"2021012411202149700_ocz164-B14","author":"Howard"},{"key":"2021012411202149700_ocz164-B15","author":"Radford","year":"2018"},{"key":"2021012411202149700_ocz164-B16","author":"Alsentzer","year":"2019"},{"key":"2021012411202149700_ocz164-B17","author":"Huang","year":"2019"},{"key":"2021012411202149700_ocz164-B18","author":"Song"},{"key":"2021012411202149700_ocz164-B19","author":"Meng"},{"key":"2021012411202149700_ocz164-B20","author":"Vig","year":"2019"}],"container-title":["Journal of the American Medical Informatics Association"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/jamia\/article-pdf\/26\/12\/1632\/36088620\/ocz164.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/academic.oup.com\/jamia\/article-pdf\/26\/12\/1632\/36088620\/ocz164.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,1,24]],"date-time":"2021-01-24T16:20:36Z","timestamp":1611505236000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/jamia\/article\/26\/12\/1632\/5573314"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,9,24]]},"references-count":20,"journal-issue":{"issue":"12","published-online":{"date-parts":[[2019,9,24]]},"published-print":{"date-parts":[[2019,12,1]]}},"URL":"https:\/\/doi.org\/10.1093\/jamia\/ocz164","relation":{},"ISSN":["1527-974X"],"issn-type":[{"value":"1527-974X","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2019,12]]},"published":{"date-parts":[[2019,9,24]]}}}