{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,3]],"date-time":"2025-12-03T17:52:29Z","timestamp":1764784349516},"reference-count":19,"publisher":"Springer Science and Business Media LLC","issue":"S7","license":[{"start":{"date-parts":[[2019,12,1]],"date-time":"2019-12-01T00:00:00Z","timestamp":1575158400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2019,12,23]],"date-time":"2019-12-23T00:00:00Z","timestamp":1577059200000},"content-version":"vor","delay-in-days":22,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Med Inform Decis Mak"],"published-print":{"date-parts":[[2019,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Background<\/jats:title><jats:p>Clinical Named Entity Recognition is to find the name of diseases, body parts and other related terms from the given text. Because Chinese language is quite different with English language, the machine cannot simply get the graphical and phonetic information form Chinese characters. The method for Chinese should be different from that for English. Chinese characters present abundant information with the graphical features, recent research on Chinese word embedding tries to use graphical information as subword. This paper uses both graphical and phonetic features to improve Chinese Clinical Named Entity Recognition based on the presence of phono-semantic characters.<\/jats:p><\/jats:sec><jats:sec><jats:title>Methods<\/jats:title><jats:p>This paper proposed three different embedding models and tested them on the annotated data. The data have been divided into two sections for exploring the effect of the proportion of phono-semantic characters.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>The model using primary radical and pinyin can improve Clinical Named Entity Recognition in Chinese and get the F-measure of 0.712. More phono-semantic characters does not give a better result.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusions<\/jats:title><jats:p>The paper proves that the use of the combination of graphical and phonetic features can improve the Clinical Named Entity Recognition in Chinese.<\/jats:p><\/jats:sec>","DOI":"10.1186\/s12911-019-0980-z","type":"journal-article","created":{"date-parts":[[2019,12,23]],"date-time":"2019-12-23T01:02:30Z","timestamp":1577062950000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":8,"title":["Improving clinical named entity recognition in Chinese using the graphical and phonetic feature"],"prefix":"10.1186","volume":"19","author":[{"given":"Yifei","family":"Wang","sequence":"first","affiliation":[]},{"given":"Sophia","family":"Ananiadou","sequence":"additional","affiliation":[]},{"given":"Jun\u2019ichi","family":"Tsujii","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2019,12,23]]},"reference":[{"key":"980_CR1","volume-title":"2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","author":"Y Wang","year":"2018","unstructured":"Wang Y, Ananiadou S, Tsujii J. Improve chinese clinical named entity recognition performance by using the graphical and phonetic feature. In: 2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). Piscataway: IEEE: 2018. p. 1582\u20136."},{"key":"980_CR2","volume-title":"The Origin and Early Development of the Chinese Writing System, vol. 78","author":"WG Boltz","year":"1994","unstructured":"Boltz WG. The Origin and Early Development of the Chinese Writing System, vol. 78. University Park: Eisenbrauns; 1994."},{"key":"980_CR3","first-page":"624","volume":"216","author":"Y Wu","year":"2015","unstructured":"Wu Y, Jiang M, Lei J, Xu H. Named entity recognition in chinese clinical text using deep neural network. Stud Health Tech Inf. 2015; 216:624.","journal-title":"Stud Health Tech Inf"},{"key":"980_CR4","volume-title":"Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, vol. 2","author":"N Peng","year":"2016","unstructured":"Peng N, Dredze M. Improving named entity recognition for chinese social media with word segmentation representation learning. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, vol. 2. Stroudsburg, Pennsylvania: ACL: 2016. p. 149\u201355."},{"key":"980_CR5","volume-title":"CoNLL","author":"T Luong","year":"2013","unstructured":"Luong T, Socher R, Manning CD. Better word representations with recursive neural networks for morphology. In: CoNLL. Stroudsburg, Pennsylvania: ACL: 2013. p. 104\u201313."},{"key":"980_CR6","doi-asserted-by":"publisher","first-page":"135","DOI":"10.1162\/tacl_a_00051","volume":"5","author":"P Bojanowski","year":"2017","unstructured":"Bojanowski P, Grave E, Joulin A, Mikolov T. Enriching word vectors with subword information. Trans Assoc Comput Linguist. 2017; 5:135\u201346. MIT Press; Cambridge, Massachusetts.","journal-title":"Trans Assoc Comput Linguist"},{"key":"980_CR7","volume-title":"Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing","author":"J Yu","year":"2017","unstructured":"Yu J, Jian X, Xin H, Song Y. Joint embeddings of chinese words, characters, and fine-grained subcharacter components. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing. Stroudsburg, Pennsylvania: ACL: 2017. p. 286\u201391."},{"key":"980_CR8","unstructured":"Mikolov T, Chen K, Corrado G, Dean J. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781. 2013. https:\/\/arxiv.org\/abs\/1301.3781."},{"key":"980_CR9","volume-title":"Thirty-Second AAAI Conference on Artificial Intelligence","author":"S Cao","year":"2018","unstructured":"Cao S, Lu W, Zhou J, Li X. cw2vec: Learning Chinese word embeddings with stroke n-gram information. In: Thirty-Second AAAI Conference on Artificial Intelligence. Palo Alto: AAAI: 2018."},{"key":"980_CR10","volume-title":"International Conference on Computer Processing of Oriental Languages","author":"C Dong","year":"2016","unstructured":"Dong C, Zhang J, Zong C, Hattori M, Di H. Character-based lstm-crf with radical-level features for chinese named entity recognition. In: International Conference on Computer Processing of Oriental Languages. Berlin: Springer: 2016. p. 239\u201350."},{"key":"980_CR11","volume-title":"Proceedings of the First Workshop on Subword and Character Level Models in NLP","author":"F Dai","year":"2017","unstructured":"Dai F, Cai Z. Glyph-aware embedding of chinese characters. In: Proceedings of the First Workshop on Subword and Character Level Models in NLP. Stroudsburg: ACL: 2017. p. 64\u20139."},{"key":"980_CR12","unstructured":"Huang Z, Xu W, Yu K. Bidirectional lstm-crf models for sequence tagging. arXiv preprint arXiv:1508.01991. 2015. https:\/\/arxiv.org\/abs\/1508.01991."},{"key":"980_CR13","unstructured":". https:\/\/zh.wikipedia.org\/wiki\/Category:%E7%94%9F%E7%89%A9%E5%AD%A6. Accessed 29 Aug 2018."},{"key":"980_CR14","doi-asserted-by":"crossref","unstructured":". https:\/\/zh.wikipedia.org\/wiki\/Category:%E5%8C%BB%E5%AD%A6. Accessed 29 Aug 2018.","DOI":"10.1111\/jce.13468"},{"key":"980_CR15","unstructured":"Google Code Archive - Long-term Storage for Google Code Project Hosting. https:\/\/code.google.com\/archive\/p\/word2vec. Accessed 29 Aug 2018."},{"key":"980_CR16","unstructured":"GitHub - HKUST-KnowComp\/JWE: Joint Embeddings of Chinese Words, Characters, and Fine-grained Subcharacter Components. https:\/\/github.com\/HKUST-KnowComp\/JWE. Accessed 10 Sept 2018."},{"key":"980_CR17","unstructured":"GitHub - Bamtercelboo\/cw2vec: Cw2vec: Learning Chinese Word Embeddings with Stroke N-gram Information. https:\/\/github.com\/bamtercelboo\/cw2vec. Accessed 10 Sept 2018."},{"key":"980_CR18","unstructured":"Unihan Database Lookup. http:\/\/www.unicode.org\/charts\/unihan.html. Accessed 30 Aug 2018."},{"key":"980_CR19","unstructured":"CHISE Project. http:\/\/www.chise.org\/index.en.html. Accessed 08 Feb 2019."}],"container-title":["BMC Medical Informatics and Decision Making"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12911-019-0980-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/s12911-019-0980-z\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12911-019-0980-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,10,9]],"date-time":"2022-10-09T06:15:53Z","timestamp":1665296153000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcmedinformdecismak.biomedcentral.com\/articles\/10.1186\/s12911-019-0980-z"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,12]]},"references-count":19,"journal-issue":{"issue":"S7","published-print":{"date-parts":[[2019,12]]}},"alternative-id":["980"],"URL":"https:\/\/doi.org\/10.1186\/s12911-019-0980-z","relation":{},"ISSN":["1472-6947"],"issn-type":[{"value":"1472-6947","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,12]]},"assertion":[{"value":"23 December 2019","order":1,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"Not applicable","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"273"}}