{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,26]],"date-time":"2026-01-26T00:58:05Z","timestamp":1769389085798,"version":"3.49.0"},"reference-count":28,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2021,7,17]],"date-time":"2021-07-17T00:00:00Z","timestamp":1626480000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,7,17]],"date-time":"2021-07-17T00:00:00Z","timestamp":1626480000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"the ministry of science and higher education of russian federation","award":["2019-1339"],"award-info":[{"award-number":["2019-1339"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Appl Netw Sci"],"published-print":{"date-parts":[[2021,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>A system of hospitals in large cities can be considered a large and diverse but interconnected system. Widely applied in hospitals, electronic health records (EHR) are crucially different from each other because of the use of different health information systems, internal hospital rules, and individual behavior of physicians. The unstructured (textual) data of EHR is rarely used to assess the citywide quality of healthcare. Within the study, we analyze EHR data, particularly textual unstructured data, as a reflection of the complex multi-agent system of healthcare in the city of Saint Petersburg, Russia. Through analyzing the data collected by the Medical Information and Analytical Center, a method was proposed and evaluated for identifying a common structure, understanding the diversity, and assessing information quality in EHR data through the application of natural language processing techniques.<\/jats:p>","DOI":"10.1007\/s41109-021-00395-2","type":"journal-article","created":{"date-parts":[[2021,7,17]],"date-time":"2021-07-17T10:02:28Z","timestamp":1626516148000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":9,"title":["Citywide quality of health information system through text mining of electronic health records"],"prefix":"10.1007","volume":"6","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-4596-6293","authenticated-orcid":false,"given":"Anastasia A.","family":"Funkner","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Michil P.","family":"Egorov","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sergey A.","family":"Fokin","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Gennady M.","family":"Orlov","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sergey V.","family":"Kovalchuk","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2021,7,17]]},"reference":[{"key":"395_CR1","doi-asserted-by":"publisher","first-page":"49","DOI":"10.1145\/304181.304187","volume":"28","author":"M Ankerst","year":"1999","unstructured":"Ankerst M, Breunig MM, Kriegel HP, Sander J (1999) OPTICS: ordering points to identify the clustering structure. ACM Sigmod Rec 28:49\u201360. https:\/\/doi.org\/10.1145\/304181.304187","journal-title":"ACM Sigmod Rec"},{"key":"395_CR2","doi-asserted-by":"publisher","unstructured":"Balabaeva K, Kovalchuk S (2020) Experiencer detection and automated extraction of a family disease tree from medical texts in russian language. In: Lecture Notes in Computer Science, vol 12140, pp 603\u2013612. https:\/\/doi.org\/10.1007\/978-3-030-50423-6_45","DOI":"10.1007\/978-3-030-50423-6_45"},{"key":"395_CR3","first-page":"43","volume":"270","author":"K Balabaeva","year":"2020","unstructured":"Balabaeva K, Funkner A, Kovalchuk S (2020) Automated spelling correction for clinical text mining in Russian. Digit Pers Heal Med Proc MIE 270:43\u201347","journal-title":"Digit Pers Heal Med Proc MIE"},{"key":"395_CR4","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-24106-7","volume-title":"Data and information quality: dimensions, principles and techniques","author":"C Batini","year":"2016","unstructured":"Batini C, Scannapieco M (2016) Data and information quality: dimensions, principles and techniques, 1st edn. Springer Publishing Company Incorporated","edition":"1"},{"key":"395_CR5","doi-asserted-by":"publisher","first-page":"313","DOI":"10.3233\/978-1-61499-830-3-313","volume":"245","author":"P Bruland","year":"2017","unstructured":"Bruland P, Doods J, Storck M, Dugas M (2017) What information does your EHR contain? Automatic generation of a clinical metadata warehouse (CMDW) to support identification and data access within distributed clinical research networks. Stud Health Technol Inform 245:313\u2013317. https:\/\/doi.org\/10.3233\/978-1-61499-830-3-313","journal-title":"Stud Health Technol Inform"},{"key":"395_CR6","doi-asserted-by":"publisher","first-page":"910","DOI":"10.1136\/amiajnl-2013-002321","volume":"21","author":"HB Burke","year":"2014","unstructured":"Burke HB, Hoang A, Becher D et al (2014) QNOTE: an instrument for measuring the quality of EHR clinical notes. J Am Med Inform Assoc 21:910\u2013916. https:\/\/doi.org\/10.1136\/amiajnl-2013-002321","journal-title":"J Am Med Inform Assoc"},{"key":"395_CR7","doi-asserted-by":"publisher","first-page":"103301","DOI":"10.1016\/j.jbi.2019.103301","volume":"100","author":"S Datta","year":"2019","unstructured":"Datta S, Bernstam EV, Roberts K (2019) A frame semantic overview of NLP-based information extraction for cancer-related EHR notes. J Biomed Inform 100:103301. https:\/\/doi.org\/10.1016\/j.jbi.2019.103301","journal-title":"J Biomed Inform"},{"key":"395_CR8","doi-asserted-by":"publisher","first-page":"100086","DOI":"10.1016\/j.yjbinx.2020.100086","volume":"8","author":"HG Freedman","year":"2020","unstructured":"Freedman HG, Williams H, Miller MA et al (2020) A novel tool for standardizing clinical data in a semantically rich model. J Biomed Informatics X 8:100086. https:\/\/doi.org\/10.1016\/j.yjbinx.2020.100086","journal-title":"J Biomed Informatics X"},{"key":"395_CR9","doi-asserted-by":"publisher","unstructured":"Funkner AA, Kovalchuk SV (2020) Time expressions identification without human-labeled corpus for clinical text mining in Russian. In: Lecture Notes in Computer Science, vol 12140, pp 591\u2013602. https:\/\/doi.org\/10.1007\/978-3-030-50423-6_44","DOI":"10.1007\/978-3-030-50423-6_44"},{"key":"395_CR10","doi-asserted-by":"publisher","unstructured":"Funkner A, Balabaeva K, Kovalchuk S (2020) Negation detection for clinical text mining in Russian. In: Studies in health technology and informatics, vol 270, pp 342\u2013346. https:\/\/doi.org\/10.3233\/SHTI200179","DOI":"10.3233\/SHTI200179"},{"key":"395_CR11","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13326-020-00231-z","volume":"11","author":"MG Kersloot","year":"2020","unstructured":"Kersloot MG, van Putten FJP, Abu-Hanna A et al (2020) Natural language processing algorithms for mapping clinical text fragments onto ontology concepts: a systematic review and recommendations for future studies. J Biomed Semant 11:1\u201321. https:\/\/doi.org\/10.1186\/s13326-020-00231-z","journal-title":"J Biomed Semant"},{"key":"395_CR12","doi-asserted-by":"publisher","unstructured":"Lamy M, Pereira R, Ferreira JC, et al (2019) Extracting clinical information from electronic medical records. In: Advances in intelligent systems and computing. In: Advances in Intelligent Systems and Computing, vol. 806, pp 113\u2013120. https:\/\/doi.org\/10.1007\/978-3-030-01746-0_13","DOI":"10.1007\/978-3-030-01746-0_13"},{"key":"395_CR13","unstructured":"Logan JR, Gorman PN, Middleton B (2001) Measuring the quality of medical records: a method for comparing completeness and correctness of clinical encounter data. In: Proc AMIA Symp, pp 408\u2013412. https:\/\/pubmed.ncbi.nlm.nih.gov\/11825220\/"},{"key":"395_CR14","unstructured":"MIAC (2021) Medical organizations ranking: St. Petersburg citizen\u2019s EHR (Pe\u0439\u0442\u0438\u043d\u0433\u0438 \u043ce\u0434\u0438\u0446\u0438\u043dc\u043a\u0438x op\u0433a\u043d\u0438\u0437a\u0446\u0438\u0439: \u042dM\u041a \u043fe\u0442ep\u0431yp\u0436\u0446a) - in Russian. https:\/\/spbmiac.ru\/ehlektronnoe-zdravookhranenie\/rejjtingi-e-zdravookhraneniya\/rejtingi-mo-emk-peterburzhca\/"},{"key":"395_CR15","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13326-018-0179-8","volume":"9","author":"A N\u00e9v\u00e9ol","year":"2018","unstructured":"N\u00e9v\u00e9ol A, Dalianis H, Velupillai S et al (2018) Clinical natural language processing in languages other than English: opportunities and challenges. J Biomed Semant 9:1\u201313. https:\/\/doi.org\/10.1186\/s13326-018-0179-8","journal-title":"J Biomed Semant"},{"key":"395_CR16","doi-asserted-by":"publisher","first-page":"26","DOI":"10.13063\/2327-9214.1265","volume":"4","author":"R Newton-Dame","year":"2016","unstructured":"Newton-Dame R, McVeigh KH, Schreibstein L et al (2016) Design of the New York city macroscope: innovations in population health surveillance using electronic health records. Gener Evid Methods Improv Patient Outcomes 4:26. https:\/\/doi.org\/10.13063\/2327-9214.1265","journal-title":"Gener Evid Methods Improv Patient Outcomes"},{"key":"395_CR17","doi-asserted-by":"publisher","first-page":"779","DOI":"10.1016\/j.ijmedinf.2014.06.011","volume":"83","author":"L Nguyen","year":"2014","unstructured":"Nguyen L, Bellucci E, Nguyen LT (2014) Electronic health records implementation: an evaluation of information system impact and contingency factors. Int J Med Inform 83:779\u2013796. https:\/\/doi.org\/10.1016\/j.ijmedinf.2014.06.011","journal-title":"Int J Med Inform"},{"key":"395_CR18","unstructured":"Roberts K, Demner-Fushman D (2016) Annotating logical forms for EHR questions. In: Proceedings of the 10th international conference on language resources and evaluation, LREC 2016. pp 3772\u20133778. https:\/\/pubmed.ncbi.nlm.nih.gov\/28503677\/"},{"key":"395_CR19","doi-asserted-by":"publisher","first-page":"244","DOI":"10.1016\/j.procs.2020.11.026","volume":"178","author":"AA Shaikina","year":"2020","unstructured":"Shaikina AA, Funkner AA (2020) Medical corpora comparison using topic modeling. Procedia Comput Sci 178:244\u2013253. https:\/\/doi.org\/10.1016\/j.procs.2020.11.026","journal-title":"Procedia Comput Sci"},{"key":"395_CR20","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3058750","volume":"8","author":"J St-Maurice","year":"2017","unstructured":"St-Maurice J, Burns C (2017) An exploratory case study to understand primary care users and their data quality tradeoffs. J Data Inf Qual 8:1\u201324. https:\/\/doi.org\/10.1145\/3058750","journal-title":"J Data Inf Qual"},{"key":"395_CR21","doi-asserted-by":"publisher","first-page":"247","DOI":"10.1016\/j.jbi.2015.10.009","volume":"58","author":"H Sun","year":"2015","unstructured":"Sun H, Depraetere K, De Roo J et al (2015) Semantic processing of EHR data for clinical research. J Biomed Inform 58:247\u2013259. https:\/\/doi.org\/10.1016\/j.jbi.2015.10.009","journal-title":"J Biomed Inform"},{"key":"395_CR22","doi-asserted-by":"publisher","first-page":"828","DOI":"10.1136\/amiajnl-2013-001635","volume":"20","author":"B Tang","year":"2013","unstructured":"Tang B, Wu Y, Jiang M et al (2013) A hybrid system for temporal information extraction from clinical text. J Am Med Inform Assoc 20:828\u2013835. https:\/\/doi.org\/10.1136\/amiajnl-2013-001635","journal-title":"J Am Med Inform Assoc"},{"key":"395_CR23","doi-asserted-by":"publisher","first-page":"81","DOI":"10.1093\/jamia\/ocw054","volume":"24","author":"S van der Bij","year":"2017","unstructured":"van der Bij S, Khan N, ten Veen P et al (2017) Improving the quality of EHR recording in primary care: a data quality feedback tool. J Am Med Inform Assoc 24:81\u201387. https:\/\/doi.org\/10.1093\/jamia\/ocw054","journal-title":"J Am Med Inform Assoc"},{"key":"395_CR24","doi-asserted-by":"publisher","first-page":"370","DOI":"10.1007\/978-3-319-26123-2_36","volume":"542","author":"K Vorontsov","year":"2015","unstructured":"Vorontsov K, Frei O, Apishev M et al (2015) Bigartm: open source library for regularized multimodal topic modeling of large collections. Commun Comput Inf Sci 542:370\u2013381. https:\/\/doi.org\/10.1007\/978-3-319-26123-2_36","journal-title":"Commun Comput Inf Sci"},{"key":"395_CR25","doi-asserted-by":"publisher","first-page":"e30412","DOI":"10.1371\/journal.pone.0030412","volume":"7","author":"Z Wang","year":"2012","unstructured":"Wang Z, Shah AD, Tate AR et al (2012) Extracting diagnoses and investigation results from unstructured text in electronic health records by semi-supervised machine learning. PLOS ONE 7:e30412. https:\/\/doi.org\/10.1371\/journal.pone.0030412","journal-title":"PLOS ONE"},{"key":"395_CR26","doi-asserted-by":"publisher","first-page":"144","DOI":"10.1136\/amiajnl-2011-000681","volume":"20","author":"NG Weiskopf","year":"2013","unstructured":"Weiskopf NG, Weng C (2013) Methods and dimensions of electronic health record data quality assessment: enabling reuse for clinical research. J Am Med Inform Assoc 20:144\u2013151. https:\/\/doi.org\/10.1136\/amiajnl-2011-000681","journal-title":"J Am Med Inform Assoc"},{"key":"395_CR27","doi-asserted-by":"publisher","first-page":"482","DOI":"10.1055\/s-0038-1634243","volume":"42","author":"JG Williams","year":"2003","unstructured":"Williams JG (2003) Measuring the completeness and currency of codified clinical information. Methods Inf Med 42:482\u2013488. https:\/\/doi.org\/10.1055\/s-0038-1634243","journal-title":"Methods Inf Med"},{"key":"395_CR28","doi-asserted-by":"publisher","first-page":"103276","DOI":"10.1016\/j.jbi.2019.103276","volume":"98","author":"E Yehia","year":"2019","unstructured":"Yehia E, Boshnak H, AbdelGaber S et al (2019) Ontology-based clinical information extraction from physician\u2019s free-text notes. J Biomed Inform 98:103276. https:\/\/doi.org\/10.1016\/j.jbi.2019.103276","journal-title":"J Biomed Inform"}],"container-title":["Applied Network Science"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s41109-021-00395-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s41109-021-00395-2\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s41109-021-00395-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,7,17]],"date-time":"2021-07-17T10:20:43Z","timestamp":1626517243000},"score":1,"resource":{"primary":{"URL":"https:\/\/appliednetsci.springeropen.com\/articles\/10.1007\/s41109-021-00395-2"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,7,17]]},"references-count":28,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2021,12]]}},"alternative-id":["395"],"URL":"https:\/\/doi.org\/10.1007\/s41109-021-00395-2","relation":{},"ISSN":["2364-8228"],"issn-type":[{"value":"2364-8228","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,7,17]]},"assertion":[{"value":"18 March 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"29 June 2021","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"17 July 2021","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"53"}}