{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,1,4]],"date-time":"2025-01-04T05:27:08Z","timestamp":1735968428861,"version":"3.32.0"},"reference-count":38,"publisher":"Wiley","issue":"3","license":[{"start":{"date-parts":[[2019,9,1]],"date-time":"2019-09-01T00:00:00Z","timestamp":1567296000000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/onlinelibrary.wiley.com\/termsAndConditions#vor"},{"start":{"date-parts":[[2019,9,1]],"date-time":"2019-09-01T00:00:00Z","timestamp":1567296000000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"},{"start":{"date-parts":[[2019,9,1]],"date-time":"2019-09-01T00:00:00Z","timestamp":1567296000000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["11771242","11401338"],"award-info":[{"award-number":["11771242","11401338"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Quant. Biol."],"published-print":{"date-parts":[[2019,9]]},"abstract":"<jats:sec><jats:title>Background<\/jats:title><jats:p>Traditional Chinese medicine (TCM) has been attracting lots of attentions from various disciplines recently. However, TCM is still mysterious because of its unique philosophy and theoretical thinking. Due to the lack of high quality data, understanding TCM thoroughly faces critical challenges. In this study, we introduce the Zhou Archive, a large\u2010scale database of expert\u2010specific Electronic Medical Records containing information about 73,000+ visits to one TCM doctor for over 35 years. 
Covering the full spectrum of diagnosis\u2010treatment model behind TCM practice, the archive provides an opportunity to understand TCM from the data\u2010driven perspective.<\/jats:p><\/jats:sec><jats:sec><jats:title>Methods<\/jats:title><jats:p>Processing the text data in the archive via a series of data processing steps, we transformed the semi\u2010structured EMRs in the archive to a well\u2010structured feature table. Based on the structured feature table obtained, a series of statistical analyses are implemented to learn principles of TCM clinical practice from the archive, including correlation analysis, enrichment analysis, embedding analysis and association pattern discovery.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>A structured feature table of 14,000+ features is generated at the end of the proposed data processing procedure, with a feature codebook, a term dictionary and a term\u2010feature map as byproducts. Statistical analysis of the feature table reveals underlying principles about the diagnosis\u2010treatment model of TCM, helping us better understand the TCM practice from a data\u2010driven perspective.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusion<\/jats:title><jats:p>Expert\u2010specific EMRs provide opportunities to understand TCM from the data\u2010driven perspective. 
Taking advantage of recent progresses on NLP for Chinese, we can process a large number of TCM EMRs efficiently to gain insights via statistical analysis.<\/jats:p><\/jats:sec>","DOI":"10.1007\/s40484-019-0173-x","type":"journal-article","created":{"date-parts":[[2019,8,2]],"date-time":"2019-08-02T02:02:39Z","timestamp":1564711359000},"page":"210-232","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["Understanding traditional Chinese medicine via statistical learning of expert\u2010specific Electronic Medical Records"],"prefix":"10.1002","volume":"7","author":[{"given":"Yang","family":"Yang","sequence":"first","affiliation":[{"name":"<!--1--> Center for Statistical Science &amp; Department of Industry Engineering Tsinghua University Beijing 100084 China"},{"name":"<!--2--> Department of Mathematical Sciences Tsinghua University Beijing 100084 China"}]},{"given":"Qi","family":"Li","sequence":"additional","affiliation":[{"name":"<!--1--> Center for Statistical Science &amp; Department of Industry Engineering Tsinghua University Beijing 100084 China"}]},{"given":"Zhaoyang","family":"Liu","sequence":"additional","affiliation":[{"name":"<!--1--> Center for Statistical Science &amp; Department of Industry Engineering Tsinghua University Beijing 100084 China"}]},{"given":"Fang","family":"Ye","sequence":"additional","affiliation":[{"name":"<!--3--> Zhou Zhongying\u2019s Studio Nanjing University of Chinese Medicine Nanjing 210046 China"}]},{"given":"Ke","family":"Deng","sequence":"additional","affiliation":[{"name":"<!--1--> Center for Statistical Science &amp; Department of Industry Engineering Tsinghua University Beijing 100084 
China"}]}],"member":"311","published-online":{"date-parts":[[2019,9]]},"reference":[{"key":"e_1_2_10_2_2","doi-asserted-by":"publisher","DOI":"10.1016\/S1003\u20105257(17)30089\u20102"},{"key":"e_1_2_10_3_2","doi-asserted-by":"publisher","DOI":"10.1136\/aim.25.1\u20102.11"},{"key":"e_1_2_10_4_2","doi-asserted-by":"publisher","DOI":"10.1007\/BF02848387"},{"key":"e_1_2_10_5_2","doi-asserted-by":"publisher","DOI":"10.1016\/S1875\u20105364(13)60037\u20100"},{"key":"e_1_2_10_6_2","doi-asserted-by":"publisher","DOI":"10.1155\/2013\/456747"},{"key":"e_1_2_10_7_2","doi-asserted-by":"publisher","DOI":"10.1186\/1752\u20100509\u20105\u2010S1\u2010S10"},{"key":"e_1_2_10_8_2","doi-asserted-by":"publisher","DOI":"10.1126\/scitranslmed.3001270"},{"key":"e_1_2_10_9_2","doi-asserted-by":"publisher","DOI":"10.1002\/ptr.2384"},{"key":"e_1_2_10_10_2","doi-asserted-by":"publisher","DOI":"10.1016\/S1050\u20104648(03)00062\u20107"},{"key":"e_1_2_10_11_2","doi-asserted-by":"publisher","DOI":"10.1016\/S0024\u20103205(02)02302\u20100"},{"key":"e_1_2_10_12_2","doi-asserted-by":"publisher","DOI":"10.4088\/JCP.v66n0214"},{"key":"e_1_2_10_13_2","doi-asserted-by":"publisher","DOI":"10.1007\/11540007_45"},{"key":"e_1_2_10_14_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.artmed.2006.07.005"},{"key":"e_1_2_10_15_2","doi-asserted-by":"publisher","DOI":"10.1142\/S0218339009002971"},{"key":"e_1_2_10_16_2","first-page":"7","article-title":"Epidemiological investigation of constitutional types of Chinese medicine in general population: based on 21,948 epidemiological investigation data of nine provinces in China.","volume":"24","author":"Wang Q.","year":"2009","journal-title":"Zhonghua Zhongyiyao 
Zazhi"},{"key":"e_1_2_10_17_2","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/gks1100"},{"key":"e_1_2_10_18_2","doi-asserted-by":"publisher","DOI":"10.1002\/sim.4417"},{"key":"e_1_2_10_19_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2003.10.004"},{"key":"e_1_2_10_20_2","doi-asserted-by":"publisher","DOI":"10.1093\/jamia\/ocx111"},{"key":"e_1_2_10_21_2","doi-asserted-by":"publisher","DOI":"10.1038\/clpt.2008.89"},{"key":"e_1_2_10_22_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.cell.2013.08.030"},{"key":"e_1_2_10_23_2","doi-asserted-by":"publisher","DOI":"10.1038\/s41598\u2010017\u201005778\u2010z"},{"key":"e_1_2_10_24_2","doi-asserted-by":"publisher","DOI":"10.1001\/jamacardio.2016.3236"},{"key":"e_1_2_10_25_2","doi-asserted-by":"publisher","DOI":"10.1038\/nbt.2749"},{"key":"e_1_2_10_26_2","doi-asserted-by":"publisher","DOI":"10.1542\/peds.2013\u20100819"},{"key":"e_1_2_10_27_2","doi-asserted-by":"crossref","unstructured":"Chang P. C. Tseng H. Dan J.andManning C. D.(2009)Discriminative reordering with Chinese grammatical relations features. In:SSST\u2019 09 Proceedings of the 3rd Workshop on Syntax and Structure in Statistical Translation. pp.51\u201359","DOI":"10.3115\/1626344.1626351"},{"key":"e_1_2_10_28_2","doi-asserted-by":"crossref","unstructured":"Levy R.andManning C. D.(2003)Is it harder to parse Chinese or the Chinese Treebank?In:Proceedings of the 41st Annual Meeting on Association for Computational Linguistics 1 439\u2013446","DOI":"10.3115\/1075096.1075152"},{"key":"e_1_2_10_29_2","unstructured":"Che W. Li Z.andLiu T.(2010)LTP: A Chinese language technology platform. In:COLING\u201910 Proceedings of the 23rd International Conference on Computational Linguistics: Demonstrations pp.13\u201316"},{"key":"e_1_2_10_30_2","unstructured":"Sun M. Chen X. Zhang K. Guo Z. 
Ma J.andLiu Z.(2016)THULAC: An efficient lexical analyzer for Chinese"},{"key":"e_1_2_10_31_2","doi-asserted-by":"publisher","DOI":"10.1162\/coli.2009.35.4.35403"},{"key":"e_1_2_10_32_2","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.1516510113"},{"key":"e_1_2_10_33_2","unstructured":"Levy O.andGoldberg Y.(2014)Neural word embedding as implicit matrix factorization.In:Adv. Neural Inf. Process. Syst. Conference"},{"key":"e_1_2_10_34_2","first-page":"2579","article-title":"Visualizing high\u2010dimensional data using t\u2010SNE.","volume":"9","author":"Maaten L.","year":"2008","journal-title":"J. Mach. Learn. Res."},{"key":"e_1_2_10_35_2","doi-asserted-by":"publisher","DOI":"10.1111\/j.1745-3984.2003.tb01108.x"},{"key":"e_1_2_10_36_2","doi-asserted-by":"crossref","unstructured":"Agrawal R. Imielinski T.andSwami A.(1993)Mining association rules between sets of items in large databases. In:SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data pp.207\u2013216","DOI":"10.1145\/170035.170072"},{"key":"e_1_2_10_37_2","first-page":"580","volume-title":"Readings in database systems (3rd ed.)","author":"Agrawal R.","year":"1994"},{"key":"e_1_2_10_38_2","doi-asserted-by":"publisher","DOI":"10.1002\/sim.4146"},{"key":"e_1_2_10_39_2","doi-asserted-by":"publisher","DOI":"10.1111\/rssb.12032"}],"container-title":["Quantitative 
Biology"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s40484-019-0173-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s40484-019-0173-x\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s40484-019-0173-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"},{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/pdf\/10.1007\/s40484-019-0173-x","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,1,3]],"date-time":"2025-01-03T09:53:44Z","timestamp":1735898024000},"score":1,"resource":{"primary":{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/10.1007\/s40484-019-0173-x"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,9]]},"references-count":38,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2019,9]]}},"alternative-id":["10.1007\/s40484-019-0173-x"],"URL":"https:\/\/doi.org\/10.1007\/s40484-019-0173-x","archive":["Portico"],"relation":{},"ISSN":["2095-4689","2095-4697"],"issn-type":[{"type":"print","value":"2095-4689"},{"type":"electronic","value":"2095-4697"}],"subject":[],"published":{"date-parts":[[2019,9]]},"assertion":[{"value":"28 August 2018","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"16 January 2019","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"26 March 2019","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"2 August 2019","order":4,"name":"first_online","label":"First 
Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"The authors Yang Yang, Qi Li, Zhaoyang Liu, Fang Ye and Ke Deng declare that they have no conflict of interests.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Compliance with Ethics Guidelines"}},{"value":"All procedures were in accordance with the ethical standards of the institution or practice at which the studies were conducted, and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Compliance with Ethics Guidelines"}},{"value":"This content has been made available to all.","name":"free","label":"Free to read"}]}}