{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,20]],"date-time":"2026-02-20T04:39:58Z","timestamp":1771562398365,"version":"3.50.1"},"reference-count":28,"publisher":"SAGE Publications","issue":"4","license":[{"start":{"date-parts":[[2018,9,19]],"date-time":"2018-09-19T00:00:00Z","timestamp":1537315200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Health Informatics J"],"published-print":{"date-parts":[[2019,12]]},"abstract":"<jats:p> This work focuses on adverse drug reaction extraction tackling the class imbalance problem. Adverse drug reactions are infrequent events in electronic health records, nevertheless, it is compulsory to get them documented. Text mining techniques can help to retrieve this kind of valuable information from text. The class imbalance was tackled using different sampling methods, cost-sensitive learning, ensemble learning and one-class classification and the Random Forest classifier was used. The adverse drug reaction extraction model was inferred from a dataset that comprises real electronic health records with an imbalance ratio of 1:222, this means that for each drug\u2013disease pair that is an adverse drug reaction, there are approximately 222 that are not adverse drug reactions. The application of a sampling technique before using cost-sensitive learning offered the best result. On the test set, the f-measure was 0.121 for the minority class and 0.996 for the majority class. <\/jats:p>","DOI":"10.1177\/1460458218799470","type":"journal-article","created":{"date-parts":[[2018,9,19]],"date-time":"2018-09-19T14:07:56Z","timestamp":1537366076000},"page":"1768-1778","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":26,"title":["The class imbalance problem detecting adverse drug reactions in electronic health records"],"prefix":"10.1177","volume":"25","author":[{"given":"Sara","family":"Santiso","sequence":"first","affiliation":[{"name":"IXA Group, University of the Basque Country (UPV-EHU), Spain"}]},{"given":"Arantza","family":"Casillas","sequence":"additional","affiliation":[{"name":"IXA Group, University of the Basque Country (UPV-EHU), Spain"}]},{"given":"Alicia","family":"P\u00e9rez","sequence":"additional","affiliation":[{"name":"IXA Group, University of the Basque Country (UPV-EHU), Spain"}]}],"member":"179","published-online":{"date-parts":[[2018,9,19]]},"reference":[{"key":"bibr1-1460458218799470","doi-asserted-by":"publisher","DOI":"10.1145\/1541880.1541882"},{"key":"bibr2-1460458218799470","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2008.239"},{"key":"bibr3-1460458218799470","doi-asserted-by":"publisher","DOI":"10.7326\/0003-4819-140-10-200405180-00009"},{"key":"bibr4-1460458218799470","unstructured":"Ministerio Sanidad y Consumo (MSC). Estudio nacional sobre los efectos adversos ligados a la hospitalizaci\u00f3n (ENEAS), 2006, http:\/\/www.seguridaddelpaciente.es\/resources\/contenidos\/castellano\/2006\/ENEAS.pdf"},{"key":"bibr5-1460458218799470","doi-asserted-by":"publisher","DOI":"10.1093\/intqhc\/mzr059"},{"key":"bibr6-1460458218799470","unstructured":"World Health Organization (WHO). Safety monitoring of medicinal products: guidelines for setting up and running a pharmacovigilance centre. Uppsala: Uppsala Monitoring Centre, 2000, pp. 1\u201328."},{"key":"bibr7-1460458218799470","doi-asserted-by":"publisher","DOI":"10.1007\/s10618-009-0152-3"},{"key":"bibr8-1460458218799470","doi-asserted-by":"publisher","DOI":"10.1136\/amiajnl-2011-000351"},{"key":"bibr9-1460458218799470","doi-asserted-by":"publisher","DOI":"10.1016\/j.jbi.2015.08.013"},{"key":"bibr10-1460458218799470","first-page":"536","volume-title":"IEEE international conference on bioinformatics and biomedicine","author":"Zhao J"},{"key":"bibr11-1460458218799470","doi-asserted-by":"publisher","DOI":"10.1136\/amiajnl-2010-000022"},{"key":"bibr12-1460458218799470","first-page":"1","volume":"2014","author":"Patki A","year":"2014","journal-title":"Proc BioLinkSig"},{"key":"bibr13-1460458218799470","first-page":"1","volume-title":"Proceedings of the fourth workshop on building and evaluating resources for health and biomedical text processing","author":"Ginn R","year":"2014"},{"key":"bibr14-1460458218799470","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2015.01.068"},{"key":"bibr15-1460458218799470","doi-asserted-by":"publisher","DOI":"10.1093\/jamia\/ocv010"},{"key":"bibr16-1460458218799470","doi-asserted-by":"publisher","DOI":"10.1186\/s12911-017-0443-3"},{"key":"bibr17-1460458218799470","doi-asserted-by":"publisher","DOI":"10.1075\/nlp.11"},{"key":"bibr18-1460458218799470","doi-asserted-by":"publisher","DOI":"10.1023\/A:1010933404324"},{"key":"bibr19-1460458218799470","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-41827-3_67"},{"key":"bibr20-1460458218799470","doi-asserted-by":"publisher","DOI":"10.1145\/1656274.1656278"},{"key":"bibr21-1460458218799470","first-page":"973","volume-title":"International joint conference on artificial intelligence","volume":"17","author":"Elkan C"},{"key":"bibr22-1460458218799470","first-page":"155","volume-title":"Proceedings of the fifth ACM SIGKDD international conference on knowledge discovery and data mining","author":"Domingos P"},{"key":"bibr23-1460458218799470","doi-asserted-by":"publisher","DOI":"10.1007\/BF00058655"},{"key":"bibr24-1460458218799470","first-page":"148","volume-title":"Thirteenth international conference on machine learning","volume":"96","author":"Freund Y"},{"key":"bibr25-1460458218799470","doi-asserted-by":"publisher","DOI":"10.1016\/S0893-6080(05)80023-1"},{"key":"bibr26-1460458218799470","doi-asserted-by":"publisher","DOI":"10.1145\/1961189.1961199"},{"key":"bibr27-1460458218799470","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2009.03.002"},{"key":"bibr28-1460458218799470","doi-asserted-by":"publisher","DOI":"10.1016\/j.patrec.2005.10.010"}],"container-title":["Health Informatics Journal"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/1460458218799470","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.1177\/1460458218799470","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/1460458218799470","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,1]],"date-time":"2025-03-01T04:49:35Z","timestamp":1740804575000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/1460458218799470"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,9,19]]},"references-count":28,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2019,12]]}},"alternative-id":["10.1177\/1460458218799470"],"URL":"https:\/\/doi.org\/10.1177\/1460458218799470","relation":{},"ISSN":["1460-4582","1741-2811"],"issn-type":[{"value":"1460-4582","type":"print"},{"value":"1741-2811","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,9,19]]}}}