{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,30]],"date-time":"2025-10-30T06:23:18Z","timestamp":1761805398694},"reference-count":24,"publisher":"Springer Science and Business Media LLC","issue":"S1","license":[{"start":{"date-parts":[[2019,11,1]],"date-time":"2019-11-01T00:00:00Z","timestamp":1572566400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2019,11,12]],"date-time":"2019-11-12T00:00:00Z","timestamp":1573516800000},"content-version":"vor","delay-in-days":11,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Biomed Semant"],"published-print":{"date-parts":[[2019,11]]},"abstract":"<jats:title>Abstract<\/jats:title>\n              <jats:sec>\n                <jats:title>Background<\/jats:title>\n                <jats:p>There is an increasing amount of unstructured medical data that can be analysed for different purposes. However, information extraction from free text data may be particularly inefficient in the presence of spelling errors. Existing approaches use string similarity methods to search for valid words within a text, coupled with a supporting dictionary. However, they are not rich enough to encode both typing and phonetic misspellings.<\/jats:p>\n              <\/jats:sec>\n              <jats:sec>\n                <jats:title>Results<\/jats:title>\n                <jats:p>Experimental results showed a joint string and language-dependent phonetic similarity is more accurate than traditional string distance metrics when identifying misspelt names of drugs in a set of medical records written in Portuguese.<\/jats:p>\n              <\/jats:sec>\n              <jats:sec>\n                <jats:title>Conclusion<\/jats:title>\n                <jats:p>We present a hybrid approach to efficiently perform similarity match that overcomes the loss of information inherit from using either exact match search or string based similarity search methods.<\/jats:p>\n              <\/jats:sec>","DOI":"10.1186\/s13326-019-0216-2","type":"journal-article","created":{"date-parts":[[2019,11,12]],"date-time":"2019-11-12T01:02:36Z","timestamp":1573520556000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":9,"title":["Combining string and phonetic similarity matching to identify misspelt names of drugs in medical records written in Portuguese"],"prefix":"10.1186","volume":"10","author":[{"given":"Hegler","family":"Tissot","sequence":"first","affiliation":[]},{"given":"Richard","family":"Dobson","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2019,11,12]]},"reference":[{"key":"216_CR1","doi-asserted-by":"crossref","unstructured":"Jellouli I, Mohajir ME. An ontology-based approach for web information extraction. In: 2011 Colloquium in Information Science and Technology. IEEE: 2011. https:\/\/doi.org\/10.1109\/cist.2011.6148583.","DOI":"10.1109\/CIST.2011.6148583"},{"issue":"1","key":"216_CR2","doi-asserted-by":"publisher","first-page":"158","DOI":"10.1109\/TKDE.2011.253","volume":"25","author":"P. Shvaiko","year":"2013","unstructured":"Pavel S, Euzenat J. Ontology Matching: State of the Art and Future Challenges. IEEE Trans Knowl Data Eng; 25(1):158\u201376. https:\/\/doi.org\/10.1109\/tkde.2011.253.","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"key":"216_CR3","doi-asserted-by":"crossref","unstructured":"Karystianis G, Sheppard T, Dixon WG, Nenadic G. Modelling and extraction of variability in free-text medication prescriptions from an anonymised primary care electronic medical record research database. BMC Med Inf Dec Mak. 2016;16(1). https:\/\/doi.org\/10.1186\/s12911-016-0255-x.","DOI":"10.1186\/s12911-016-0255-x"},{"issue":"5","key":"216_CR4","first-page":"514","volume":"17","author":"O Uzuner","year":"2010","unstructured":"Uzuner O, Solti I, Cadag E. Extracting medication information from clinical text. JAMIA. 2010; 17(5):514\u20138.","journal-title":"JAMIA"},{"issue":"6","key":"216_CR5","doi-asserted-by":"publisher","first-page":"395","DOI":"10.1038\/nrg3208","volume":"13","author":"PB Jensen","year":"2012","unstructured":"Jensen PB, Jensen LJ, Brunak S. Mining electronic health records: towards better research applications and clinical care. Nat Rev Genet. 2012; 13(6):395\u2013405. https:\/\/doi.org\/10.1038\/nrg3208.","journal-title":"Nat Rev Genet"},{"issue":"12","key":"216_CR6","doi-asserted-by":"publisher","first-page":"832","DOI":"10.1016\/j.ijmedinf.2010.09.005","volume":"79","author":"C Senger","year":"2010","unstructured":"Senger C, Kaltschmidt J, Schmitt SPW, Pruszydlo MG, Haefeli WE. Misspellings in drug information system queries: Characteristics of drug name spelling errors and strategies for their prevention. I J Med Inf. 2010; 79(12):832\u20139.","journal-title":"I J Med Inf"},{"key":"216_CR7","volume-title":"CIKM","author":"S Godbole","year":"2010","unstructured":"Godbole S, Bhattacharya I, Gupta A, Verma A. Building re-usable dictionary repositories for real-world text mining In: Huang J, Koudas N, Jones GJF, Wu X, Collins-Thompson K, An A, editors. CIKM. New York: ACM: 2010. p. 1189\u201398."},{"issue":"8","key":"216_CR8","first-page":"707","volume":"10","author":"VI Levenshtein","year":"1966","unstructured":"Levenshtein VI. Binary codes capable of correcting insertions and reversals. Sov Phys Dokl. 1966; 10(8):707\u201310.","journal-title":"Sov Phys Dokl"},{"key":"216_CR9","volume-title":"Proceedings of the Section on Survey Research","author":"WE Winkler","year":"1990","unstructured":"Winkler WE. String comparator metrics and enhanced decision rules in the fellegi-sunter model of record linkage. In: Proceedings of the Section on Survey Research. Wachington: American Statistical Association: 1990. p. 354\u20139."},{"key":"216_CR10","doi-asserted-by":"crossref","unstructured":"Stvilia B. A model for ontology quality evaluation. First Monday. 2007; 12(12). https:\/\/doi.org\/10.5210\/fm.v12i12.2043. University of Illinois Libraries.","DOI":"10.5210\/fm.v12i12.2043"},{"key":"216_CR11","unstructured":"Brazilian Ministry of Health: Programa Mais Medicos (More Doctors Program). http:\/\/maismedicos.gov.br\/. Accessed 22 May 2015."},{"key":"216_CR12","unstructured":"Bona C. Avalia\u00e7\u00e3o de Processos de Software: Um estudo de caso em XP e ICONIX. Master\u2019s thesis, Programa de P\u00f3s-Gradua\u00e7\u00e3o em Engenharia de Produ\u00e7\u00e3o, Universidade Federal de Santa Catarina (UFSC). 2002."},{"issue":"2","key":"216_CR13","doi-asserted-by":"publisher","first-page":"147","DOI":"10.1002\/j.1538-7305.1950.tb00463.x","volume":"26","author":"R Hamming","year":"1950","unstructured":"Hamming R. Error Detecting and Error Correcting Codes. Bell Syst Tech J. 1950; 26(2):147\u201360.","journal-title":"Bell Syst Tech J"},{"key":"216_CR14","volume-title":"Database and Expert Systems Applications - 25th International Conference, DEXA 2014, Munich, Germany, September 1-4, 2014. Proceedings, Part II","author":"H Tissot","year":"2014","unstructured":"Tissot H, Peschl G, Fabro MDD. Fast phonetic similarity search over large repositories. In: Database and Expert Systems Applications - 25th International Conference, DEXA 2014, Munich, Germany, September 1-4, 2014. Proceedings, Part II. Cham: Springer International Publishing: 2014. p. 74\u201381."},{"key":"216_CR15","volume-title":"Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR \u201996","author":"J Zobel","year":"1996","unstructured":"Zobel J, Dart P. Phonetic string matching: Lessons from information retrieval. In: Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR \u201996. New York: ACM: 1996. p. 166\u201372."},{"key":"216_CR16","doi-asserted-by":"crossref","unstructured":"Droppo J, Acero A. Context dependent phonetic string edit distance for automatic speech recognition. In: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE: 2010. p. 4358\u201361. https:\/\/doi.org\/10.1109\/icassp.2010.5495652.","DOI":"10.1109\/ICASSP.2010.5495652"},{"key":"216_CR17","volume-title":"The Sounds of the World\u2019s Languages","author":"P Ladefoged","year":"1996","unstructured":"Ladefoged P, Maddieson I. The Sounds of the World\u2019s Languages. Oxford: Blackwell; 1996."},{"key":"216_CR18","unstructured":"Tissot H. Normalisation of imprecise temporal expressions extracted from text. PhD thesis, Federal University of Parana, Brazil, Computer Science Department. 2016."},{"key":"216_CR19","unstructured":"Bocek T, Hunt E, Stiller B, Hecht F. Fast similarity search in large dictionaries. Technical Report ifi-2007.02, Department of Informatics, University of Zurich (April 2007). http:\/\/fastss.csg.uzh.ch\/. Accessed 17 Jan 2018."},{"key":"216_CR20","volume-title":"Proceedings of the 12th ACM\/IEEE-CS Joint Conference on Digital Libraries","author":"M Khabsa","year":"2012","unstructured":"Khabsa M, Treeratpituk P, Giles CL. Ackseer: a repository and search engine for automatically extracted acknowledgments from digital libraries. In: Proceedings of the 12th ACM\/IEEE-CS Joint Conference on Digital Libraries. New York: ACM: 2012. p. 185\u201394."},{"issue":"1","key":"216_CR21","doi-asserted-by":"publisher","first-page":"31","DOI":"10.1145\/375360.375365","volume":"33","author":"G Navarro","year":"2001","unstructured":"Navarro G. A guided tour to approximate string matching. ACM Comput Surv. 2001; 33(1):31\u201388.","journal-title":"ACM Comput Surv"},{"key":"216_CR22","volume-title":"Proceedings of the 18th International Conference on World Wide Web, WWW \u201909","author":"S Ji","year":"2009","unstructured":"Ji S, Li G, Li C, Feng J. Efficient interactive fuzzy keyword search. In: Proceedings of the 18th International Conference on World Wide Web, WWW \u201909. New York: ACM: 2009. p. 371\u201380."},{"key":"216_CR23","volume-title":"Scientific and Statistical Database Management. Lecture Notes in Computer Science, vol 7338","author":"D Fenz","year":"2012","unstructured":"Fenz D, Lange D, Rheinl\u00e4nder A, Naumann F, Leser U. Efficient similarity search in very large string sets In: Ailamaki A, Bowers S, editors. Scientific and Statistical Database Management. Lecture Notes in Computer Science, vol 7338. Berlin: Springer Berlin Heidelberg: 2012. p. 262\u201379."},{"key":"216_CR24","volume-title":"Proceedings of the 23rd International Conference on Machine Learning, ICML \u201906","author":"J Davis","year":"2006","unstructured":"Davis J, Goadrich M. The relationship between precision-recall and roc curves. In: Proceedings of the 23rd International Conference on Machine Learning, ICML \u201906. New York: ACM Press: 2006. p. 233\u201340. https:\/\/doi.org\/10.1145\/1143844.1143874."}],"container-title":["Journal of Biomedical Semantics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s13326-019-0216-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/s13326-019-0216-2\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s13326-019-0216-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2020,11,11]],"date-time":"2020-11-11T00:37:25Z","timestamp":1605055045000},"score":1,"resource":{"primary":{"URL":"https:\/\/jbiomedsem.biomedcentral.com\/articles\/10.1186\/s13326-019-0216-2"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,11]]},"references-count":24,"journal-issue":{"issue":"S1","published-print":{"date-parts":[[2019,11]]}},"alternative-id":["216"],"URL":"https:\/\/doi.org\/10.1186\/s13326-019-0216-2","relation":{},"ISSN":["2041-1480"],"issn-type":[{"value":"2041-1480","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,11]]},"assertion":[{"value":"12 November 2019","order":1,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"Not applicable.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"17"}}