{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T04:20:05Z","timestamp":1772166005575,"version":"3.50.1"},"reference-count":40,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2018,6,25]],"date-time":"2018-06-25T00:00:00Z","timestamp":1529884800000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100004440","name":"Wellcome Trust","doi-asserted-by":"publisher","award":["MR\/K006584\/1"],"award-info":[{"award-number":["MR\/K006584\/1"]}],"id":[{"id":"10.13039\/100004440","id-type":"DOI","asserted-by":"publisher"}]},{"name":"UK Infrastructure for Large-scale Clinical Genomics Research","award":["MC_PC_14089"],"award-info":[{"award-number":["MC_PC_14089"]}]},{"name":"National Institute for Health Research (NIHR) Biomedical Research Centre at South London and Maudsley NHS Foundation Trust"},{"name":"National Institute for Health Research (NIHR) Biomedical Research Centre at University College London Hospital"},{"name":"NHS England Enablement"},{"DOI":"10.13039\/100010661","name":"Horizon 2020 Framework Programme","doi-asserted-by":"publisher","award":["644753"],"award-info":[{"award-number":["644753"]}],"id":[{"id":"10.13039\/100010661","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Med Inform Decis Mak"],"published-print":{"date-parts":[[2018,12]]},"DOI":"10.1186\/s12911-018-0623-9","type":"journal-article","created":{"date-parts":[[2018,6,25]],"date-time":"2018-06-25T08:36:59Z","timestamp":1529915819000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":109,"title":["CogStack - experiences of deploying integrated information retrieval and extraction services in a large National Health Service Foundation Trust hospital"],"prefix":"10.1186","volume":"18","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3278-8547","authenticated-orcid":false,"given":"Richard","family":"Jackson","sequence":"first","affiliation":[]},{"given":"Ismail","family":"Kartoglu","sequence":"additional","affiliation":[]},{"given":"Clive","family":"Stringer","sequence":"additional","affiliation":[]},{"given":"Genevieve","family":"Gorrell","sequence":"additional","affiliation":[]},{"given":"Angus","family":"Roberts","sequence":"additional","affiliation":[]},{"given":"Xingyi","family":"Song","sequence":"additional","affiliation":[]},{"given":"Honghan","family":"Wu","sequence":"additional","affiliation":[]},{"given":"Asha","family":"Agrawal","sequence":"additional","affiliation":[]},{"given":"Kenneth","family":"Lui","sequence":"additional","affiliation":[]},{"given":"Tudor","family":"Groza","sequence":"additional","affiliation":[]},{"given":"Damian","family":"Lewsley","sequence":"additional","affiliation":[]},{"given":"Doug","family":"Northwood","sequence":"additional","affiliation":[]},{"given":"Amos","family":"Folarin","sequence":"additional","affiliation":[]},{"given":"Robert","family":"Stewart","sequence":"additional","affiliation":[]},{"given":"Richard","family":"Dobson","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2018,6,25]]},"reference":[{"issue":"10","key":"623_CR1","first-page":"58","volume":"4","author":"DW Simborg","year":"1987","unstructured":"Simborg DW. An emerging standard for health communications: The HL7 standard. Healthc Comput Commun. 1987; 4(10):58\u201360.","journal-title":"Healthc Comput Commun"},{"issue":"4","key":"623_CR2","doi-asserted-by":"publisher","first-page":"261","DOI":"10.1055\/s-0038-1634486","volume":"41","author":"GO Klein","year":"2002","unstructured":"Klein GO. Standardization of health informatics\u2013results and challenges. Methods Inf Med. 2002; 41(4):261\u201370.","journal-title":"Methods Inf Med"},{"key":"623_CR3","volume-title":"Lessons learned from the implementation of clinical messaging systems. AMIA... Annual Symposium proceedings \/ AMIA Symposium. AMIA Symposium","author":"M Barnes","year":"2007","unstructured":"Barnes M. Lessons learned from the implementation of clinical messaging systems. AMIA... Annual Symposium proceedings \/ AMIA Symposium. AMIA Symposium. Montgomery: The American Medical Informatics Institution; 2007, pp. 36\u201340."},{"key":"623_CR4","first-page":"709","volume":"169","author":"R Worden","year":"2011","unstructured":"Worden R, Scott P. Simplifying HL7 Version 3 messages. Stud Health Technol Inform. 2011; 169:709\u201313.","journal-title":"Stud Health Technol Inform"},{"key":"623_CR5","first-page":"817","volume":"116","author":"J Antol\u00edk","year":"2005","unstructured":"Antol\u00edk J. Automatic annotation of medical records. Stud Health Technol Inform. 2005; 116:817\u201322. Cited by 0003.","journal-title":"Stud Health Technol Inform"},{"issue":"5","key":"623_CR6","doi-asserted-by":"publisher","first-page":"922","DOI":"10.1136\/amiajnl-2012-001317","volume":"20","author":"D Albright","year":"2013","unstructured":"Albright D, Lanfranchi A, Fredriksen A, Styler WF, Warner C, Hwang JD, Choi JD, Dligach D, Nielsen RD, Martin J, Ward W, Palmer M, Savova GK. Towards comprehensive syntactic and semantic annotations of the clinical narrative. J Am Med Inform Assoc. 2013; 20(5):922\u201330.","journal-title":"J Am Med Inform Assoc"},{"key":"623_CR7","first-page":"441","volume":"143","author":"N Barrett","year":"2009","unstructured":"Barrett N, Weber-Jahnke JH. Applying natural language processing toolkits to electronic health records - an experience report. Stud Health Technol Inform. 2009; 143:441\u20136.","journal-title":"Stud Health Technol Inform"},{"issue":"4","key":"623_CR8","doi-asserted-by":"publisher","first-page":"222","DOI":"10.1016\/S1532-0464(03)00012-1","volume":"35","author":"C Friedman","year":"2002","unstructured":"Friedman C, Kra P, Rzhetsky A. Two biomedical sublanguages: A description based on the theories of Zellig Harris. J Biomed Inform. 2002; 35(4):222\u201335.","journal-title":"J Biomed Inform"},{"issue":"3","key":"623_CR9","doi-asserted-by":"publisher","first-page":"008721","DOI":"10.1136\/bmjopen-2015-008721","volume":"6","author":"G Perera","year":"2016","unstructured":"Perera G, Broadbent M, Callard F, Chang C-K, Downs J, Dutta R, Fernandes A, Hayes RD, Henderson M, Jackson R, Jewell A, Kadra G, Little R, Pritchard M, Shetty H, Tulloch A, Stewart R. Cohort profile of the South London and Maudsley NHS Foundation Trust Biomedical Research Centre (SLaM BRC) Case Register: Current status and recent enhancement of an Electronic Mental Health Record-derived data resource. BMJ Open. 2016; 6(3):008721.","journal-title":"BMJ Open"},{"key":"623_CR10","doi-asserted-by":"publisher","first-page":"196","DOI":"10.1016\/j.jbi.2014.01.003","volume":"50","author":"KH Jones","year":"2014","unstructured":"Jones KH, Ford DV, Jones C, Dsilva R, Thompson S, Brooks CJ, Heaven ML, Thayer DS, McNerney CL, Lyons RA. A case study of the Secure Anonymous Information Linkage (SAIL) Gateway: A privacy-protecting remote access system for health-related research and evaluation. J Biomed Inform. 2014; 50:196\u2013204.","journal-title":"J Biomed Inform"},{"key":"623_CR11","doi-asserted-by":"publisher","unstructured":"The 100,000 Genomes Project Protocol v3, Genomics England. 2017. \n                    https:\/\/doi.org\/10.6084\/m9.figshare.4530893.v2\n                    \n                  . (from \n                    https:\/\/www.genomicsengland.co.uk\/about-gecip\/publications\/\n                    \n                  ).","DOI":"10.6084\/m9.figshare.4530893.v2"},{"key":"623_CR12","doi-asserted-by":"crossref","unstructured":"Moen H, Ginter F, Marsi E, Peltonen L-M, Salakoski T, Salanter\u00e4 S. Care episode retrieval: Distributional semantic models for information retrieval in the clinical domain. BMC Med Inform Dec Making. 2015; 15(S2).","DOI":"10.1186\/1472-6947-15-S2-S2"},{"key":"623_CR13","first-page":"150","volume":"2016","author":"R McEwan","year":"2016","unstructured":"McEwan R, Melton GB, Knoll BC, Wang Y, Hultman G, Dale JL, Meyer T, Pakhomov SV. NLP-PIER: A Scalable Natural Language Processing, Indexing, and Searching Architecture for Clinical Notes. AMIA Jt Summits Transl Sci Proc. 2016; 2016:150\u20139.","journal-title":"AMIA Jt Summits Transl Sci Proc"},{"issue":"1","key":"623_CR14","doi-asserted-by":"publisher","first-page":"51","DOI":"10.1186\/1471-244X-9-51","volume":"9","author":"R Stewart","year":"2009","unstructured":"Stewart R, Soremekun M, Perera G, Broadbent M, Callard F, Denis M, Hotopf M, Thornicroft G, Lovestone S. The South London and Maudsley NHS Foundation Trust Biomedical Research Centre (SLAM BRC) case register: Development and descriptive data. BMC Psychiatry. 2009; 9(1):51.","journal-title":"BMC Psychiatry"},{"key":"623_CR15","unstructured":"Kartoglu IE. Cognition: DB binary-to-text converter and pseudonymiser for clinical research. 2015. \n                    https:\/\/github.com\/KHP-Informatics\/Cognition-DNC\n                    \n                  ."},{"key":"623_CR16","unstructured":"Jackson R, Kartoglu I. A Open Pipeline for Masking Patient Identifiers in Electronic Health Records, The Farr Institute International Conference 2015. 2015."},{"issue":"1","key":"623_CR17","doi-asserted-by":"publisher","first-page":"71","DOI":"10.1186\/1472-6947-13-71","volume":"13","author":"AC Fernandes","year":"2013","unstructured":"Fernandes AC, Cloete D, Broadbent MT, Hayes RD, Chang C-K, Jackson RG, Roberts A, Tsang J, Soncul M, Liebscher J, Stewart R, Callard F. Development and evaluation of a de-identification procedure for a case register sourced from mental health electronic records. BMC Med Inf Decis Making. 2013; 13(1):71.","journal-title":"BMC Med Inf Decis Making"},{"issue":"5","key":"623_CR18","doi-asserted-by":"publisher","first-page":"607","DOI":"10.1136\/amiajnl-2011-000183","volume":"18","author":"LW D\u2019Avolio","year":"2011","unstructured":"D\u2019Avolio LW, Nguyen TM, Goryachev S, Fiore LD. Automated concept-level information extraction to reduce the need for custom software and rules development. J Am Med Inform Assoc. 2011; 18(5):607\u201313.","journal-title":"J Am Med Inform Assoc"},{"key":"623_CR19","volume-title":"Tika in Action","author":"C Mattmann","year":"2011","unstructured":"Mattmann C, Zitting J. Tika in Action. Greenwich, CT, USA: Manning Publications Co.; 2011."},{"key":"623_CR20","volume-title":"Proceedings of the Ninth International Conference on Document Analysis and Recognition - Volume 02. ICDAR \u201907","author":"R Smith","year":"2007","unstructured":"Smith R. An Overview of the Tesseract OCR Engine. In: Proceedings of the Ninth International Conference on Document Analysis and Recognition - Volume 02. ICDAR \u201907. Washington, DC, USA: IEEE Computer Society: 2007. p. 629\u201333."},{"issue":"11","key":"623_CR21","doi-asserted-by":"publisher","first-page":"112774","DOI":"10.1371\/journal.pone.0112774","volume":"9","author":"S Wu","year":"2014","unstructured":"Wu S, Miller T, Masanz J, Coarr M, Halgrim S, Carrell D, Clark C. Negation\u2019s Not Solved: Generalizability Versus Optimizability in Clinical Natural Language Processing. PLoS ONE. 2014; 9(11):112774.","journal-title":"PLoS ONE"},{"key":"623_CR22","first-page":"24","volume":"22","author":"C Friedman","year":"1998","unstructured":"Friedman C, Hripcsak G. Evaluating natural language processors in the clinical domain. Development. 1998; 22:24.","journal-title":"Development"},{"issue":"suppl 1","key":"623_CR23","doi-asserted-by":"publisher","first-page":"267","DOI":"10.1093\/nar\/gkh061","volume":"32","author":"O Bodenreider","year":"2004","unstructured":"Bodenreider O. The unified medical language system (UMLS): Integrating biomedical terminology. Nucleic Acids Res. 2004; 32(suppl 1):267\u201370.","journal-title":"Nucleic Acids Res"},{"key":"623_CR24","doi-asserted-by":"crossref","unstructured":"Neamatullah I, Douglass MM, Lehman L-wH, Reisner A, Villarroel M, Long WJ, Szolovits P, Moody GB, Mark RG, Clifford GD. Automated de-identification of free-text medical records. BMC Med Inform Decis Mak. 2008;8(1).","DOI":"10.1186\/1472-6947-8-32"},{"key":"623_CR25","doi-asserted-by":"publisher","first-page":"160035","DOI":"10.1038\/sdata.2016.35","volume":"3","author":"AEW Johnson","year":"2016","unstructured":"Johnson AEW, Pollard TJ, Shen L, Lehman L-wH, Feng M, Ghassemi M, Moody B, Szolovits P, Anthony Celi L, Mark RG. MIMIC-III, a freely accessible critical care database. Sci Data. 2016; 3:160035.","journal-title":"Sci Data"},{"key":"623_CR26","unstructured":"University of Sheffield. KConnect UMLS Annotation Task. 2016. \n                    http:\/\/www.dcs.shef.ac.uk\/~genevieve\/kconnect\/annotation-manual.pdf\n                    \n                  ."},{"issue":"D1","key":"623_CR27","doi-asserted-by":"publisher","first-page":"966","DOI":"10.1093\/nar\/gkt1026","volume":"42","author":"S K\u00f6hler","year":"2014","unstructured":"K\u00f6hler S, Doelken SC, Mungall CJ, Bauer S, Firth HV, Bailleul-Forestier I, Black GCM, Brown DL, Brudno M, Campbell J, FitzPatrick DR, Eppig JT, Jackson AP, Freson K, Girdea M, Helbig I, Hurst JA, J\u00e4hn J, Jackson LG, Kelly AM, Ledbetter DH, Mansour S, Martin CL, Moss C, Mumford A, Ouwehand WH, Park S-M, Riggs ER, Scott RH, Sisodiya S, Vooren SV, Wapner RJ, Wilkie AOM, Wright CF, Vulto-van Silfhout AT, de Leeuw N, de Vries BBA, Washingthon NL, Smith CL, Westerfield M, Schofield P, Ruef BJ, Gkoutos GV, Haendel M, Smedley D, Lewis SE, Robinson PN. The Human Phenotype Ontology project: Linking molecular biology and disease through phenotype data. Nucleic Acids Res. 2014; 42(D1):966\u201374.","journal-title":"Nucleic Acids Res"},{"issue":"5","key":"623_CR28","doi-asserted-by":"publisher","first-page":"301","DOI":"10.1006\/jbin.2001.1029","volume":"34","author":"WW Chapman","year":"2001","unstructured":"Chapman WW, Bridewell W, Hanbury P, Cooper GF, Buchanan BG. A Simple Algorithm for Identifying Negated Findings and Diseases in Discharge Summaries. J Biomed Inform. 2001; 34(5):301\u201310.","journal-title":"J Biomed Inform"},{"issue":"0","key":"623_CR29","doi-asserted-by":"publisher","first-page":"005","DOI":"10.1093\/database\/bav005","volume":"2015","author":"T Groza","year":"2015","unstructured":"Groza T, Kohler S, Doelken S, Collier N, Oellrich A, Smedley D, Couto FM, Baynam G, Zankl A, Robinson PN. Automatic concept recognition using the Human Phenotype Ontology reference and test suite corpora. Database. 2015; 2015(0):005.","journal-title":"Database"},{"key":"623_CR30","doi-asserted-by":"publisher","first-page":"20","DOI":"10.1016\/j.jbi.2015.07.020","volume":"58","author":"A Stubbs","year":"2015","unstructured":"Stubbs A, Uzuner \u00d6. Annotating longitudinal clinical narratives for de-identification: The 2014 i2b2\/UTHealth corpus. J Biomed Inform. 2015; 58:20\u201329.","journal-title":"J Biomed Inform"},{"key":"623_CR31","doi-asserted-by":"crossref","unstructured":"Aamot H, Kohl CD, Richter D, Knaup-Gregori P. Pseudonymization of patient identifiers for translational research. BMC Med Inform Dec Making. 2013;13(1).","DOI":"10.1186\/1472-6947-13-75"},{"key":"623_CR32","unstructured":"Crown. Anonymisation: Managing Data Protection Risk Code of Practice, Infomation Commisioners Office. 2012. \n                    https:\/\/ico.org.uk\/media\/1061\/anonymisation-code.pdf\n                    \n                  ."},{"issue":"1","key":"623_CR33","doi-asserted-by":"publisher","first-page":"2","DOI":"10.1136\/amiajnl-2012-001509","volume":"20","author":"BA Malin","year":"2013","unstructured":"Malin BA, Emam KE, O\u2019Keefe CM. Biomedical data privacy: Problems, perspectives, and recent advances. J Am Med Inform Assoc. 2013; 20(1):2\u20136.","journal-title":"J Am Med Inform Assoc"},{"issue":"2","key":"623_CR34","doi-asserted-by":"publisher","first-page":"87","DOI":"10.1111\/acps.12281","volume":"130","author":"P Munk-J\u00f8rgensen","year":"2014","unstructured":"Munk-J\u00f8rgensen P, Okkels N, Golberg D, Ruggeri M, Thornicroft G. Fifty years\u2019 development and future perspectives of psychiatric register research. Acta Psychiatr Scand. 2014; 130(2):87\u201398.","journal-title":"Acta Psychiatr Scand"},{"key":"623_CR35","first-page":"1","volume":"2010","author":"T Botsis","year":"2010","unstructured":"Botsis T, Hartvigsen G, Chen F, Weng C. Secondary use of EHR: Data quality issues and informatics opportunities. AMIA Summits Transl Sci Proc. 2010; 2010:1\u20135.","journal-title":"AMIA Summits Transl Sci Proc"},{"issue":"5","key":"623_CR36","doi-asserted-by":"publisher","first-page":"387","DOI":"10.1016\/j.ijmedinf.2004.11.001","volume":"74","author":"G Mikkelsen","year":"2005","unstructured":"Mikkelsen G, Aasly J. Consequences of impaired data quality on information retrieval in electronic patient records. Int J Med Inform. 2005; 74(5):387\u201394.","journal-title":"Int J Med Inform"},{"issue":"12","key":"623_CR37","doi-asserted-by":"publisher","first-page":"005654","DOI":"10.1136\/bmjopen-2014-005654","volume":"4","author":"F Callard","year":"2014","unstructured":"Callard F, Broadbent M, Denis M, Hotopf M, Soncul M, Wykes T, Lovestone S, Stewart R. Developing a new model for patient recruitment in mental health services: A cohort study using Electronic Health Records. BMJ Open. 2014; 4(12):005654.","journal-title":"BMJ Open"},{"issue":"5","key":"623_CR38","doi-asserted-by":"publisher","first-page":"931","DOI":"10.1136\/amiajnl-2012-001453","volume":"20","author":"JP Ferraro","year":"2013","unstructured":"Ferraro JP, Daume H, DuVall SL, Chapman WW, Harkema H, Haug PJ. Improving performance of natural language processing part-of-speech tagging on clinical narratives through domain adaptation. J Am Med Inform Assoc. 2013; 20(5):931\u20139.","journal-title":"J Am Med Inform Assoc"},{"issue":"5","key":"623_CR39","doi-asserted-by":"publisher","first-page":"967","DOI":"10.1093\/jamia\/ocu048","volume":"22","author":"Y Zhang","year":"2015","unstructured":"Zhang Y, Tang B, Jiang M, Wang J, Xu H. Domain adaptation for semantic role labeling of clinical text. J Am Med Inform Assoc. 2015; 22(5):967\u201379.","journal-title":"J Am Med Inform Assoc"},{"key":"623_CR40","doi-asserted-by":"publisher","first-page":"11","DOI":"10.1016\/j.jbi.2015.06.007","volume":"58","author":"A Stubbs","year":"2015","unstructured":"Stubbs A, Kotfila C, Uzuner \u00d6. Automated systems for the de-identification of longitudinal clinical narratives: Overview of 2014 i2b2\/UTHealth shared task Track 1. J Biomed Inform. 2015; 58:11\u20139.","journal-title":"J Biomed Inform"}],"container-title":["BMC Medical Informatics and Decision Making"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12911-018-0623-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/s12911-018-0623-9\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12911-018-0623-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,9,22]],"date-time":"2019-09-22T01:43:00Z","timestamp":1569116580000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcmedinformdecismak.biomedcentral.com\/articles\/10.1186\/s12911-018-0623-9"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,6,25]]},"references-count":40,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2018,12]]}},"alternative-id":["623"],"URL":"https:\/\/doi.org\/10.1186\/s12911-018-0623-9","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/123299","asserted-by":"object"}]},"ISSN":["1472-6947"],"issn-type":[{"value":"1472-6947","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,6,25]]},"assertion":[{"value":"20 March 2017","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"1 June 2018","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"25 June 2018","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"The creation of the CogStack software was an internal service development project for King\u2019s College Hospital NHS Foundation Trust, and thus did not require ethical approval. As no patient identifiable data was required for the development of the software, no approval was sought from the Health Research Authority according to Confidentiality Advisory Group guidelines (\n                      \n                      ). The validation of the Bio-YODIE software made use of the CRIS dataset, which is approved as an anonymised data resource for secondary analysis by Oxfordshire Research Ethics Committee C (08\/H0606\/71) and governance is provided for all projects and dissemination through a patient-led oversight committee.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable: No individual persons data is presented in this manuscript.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"RJ and RS have received research funding from Roche, Pfizer, J&J and Lundbeck.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}},{"value":"Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Publisher\u2019s Note"}}],"article-number":"47"}}