{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,18]],"date-time":"2025-12-18T20:06:20Z","timestamp":1766088380753,"version":"3.37.3"},"reference-count":27,"publisher":"Springer Science and Business Media LLC","issue":"3","license":[{"start":{"date-parts":[[2024,10,9]],"date-time":"2024-10-09T00:00:00Z","timestamp":1728432000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,10,9]],"date-time":"2024-10-09T00:00:00Z","timestamp":1728432000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"Hochschule f\u00fcr Angewandte Wissenschaften Hamburg (HAW Hamburg)"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Datenbank Spektrum"],"published-print":{"date-parts":[[2024,11]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Traditionally, disproportionality analysis (DPA) methods are employed for signal detection in pharmacovigilance, but these methods utilize only a\u00a0limited portion of the data available from spontaneous event reports (SERs). This research aims to enhance signal detection by applying machine learning (ML) methods that can leverage additional data. We create a\u00a0dataset by integrating SER data from the FDA Adverse Event Reporting System (FAERS) with biological and chemical data from DrugBank, and information on known adverse drug reactions (ADRs) from Side Effect Resource (SIDER). The known ADRs from SIDER are used to label the dataset for ML training. Using the AutoML library TPOT, ML models are trained on this dataset. Our findings indicate that ML models, even when trained with the same features as DPA methods, achieve higher recall and precision. Moreover, incorporating additional features related to drugs and events significantly boosts the performance of ML models. 
Analysis using the explainable AI (XAI) technique SHAP reveals that the drug name, event name, and fifth-level ATC code are the most influential features for model predictions. These ML models offer a\u00a0promising alternative or supplement to conventional DPA methods for signal detection in pharmacovigilance.<\/jats:p>","DOI":"10.1007\/s13222-024-00486-1","type":"journal-article","created":{"date-parts":[[2024,10,9]],"date-time":"2024-10-09T15:14:11Z","timestamp":1728486851000},"page":"233-242","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["Integration of FAERS, DrugBank and SIDER Data for Machine Learning-based Detection of Adverse Drug Reactions"],"prefix":"10.1007","volume":"24","author":[{"ORCID":"https:\/\/orcid.org\/0009-0003-7345-7394","authenticated-orcid":false,"given":"Tobias","family":"Schreier","sequence":"first","affiliation":[]},{"given":"Marina","family":"Tropmann-Frick","sequence":"additional","affiliation":[]},{"given":"Ruwen","family":"B\u00f6hm","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2024,10,9]]},"reference":[{"key":"486_CR1","doi-asserted-by":"publisher","DOI":"10.3389\/fphar.2020.602365","author":"JH Bae","year":"2020","unstructured":"Bae JH, Baek YH, Lee JE et al (2020) Machine learning for detection of safety signals from spontaneous reporting system data: example of nivolumab and docetaxel. Front Pharmacol. https:\/\/doi.org\/10.3389\/fphar.2020.602365","journal-title":"Front Pharmacol"},{"issue":"4","key":"486_CR2","doi-asserted-by":"publisher","first-page":"315","DOI":"10.1007\/s002280050466","volume":"54","author":"A Bate","year":"1998","unstructured":"Bate A, Lindquist M, Edwards IR et al (1998) A Bayesian neural network method for adverse drug reaction signal generation. Eur J Clin Pharmacol 54(4):315\u2013321. 
https:\/\/doi.org\/10.1007\/s002280050466","journal-title":"Eur J Clin Pharmacol"},{"issue":"6","key":"486_CR3","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1371\/journal.pone.0157753","volume":"11","author":"R B\u00f6hm","year":"2016","unstructured":"B\u00f6hm R, von Hehn L, Herdegen T et al (2016) OpenVigil FDA \u2013 Inspection of U.S. american adverse drug events pharmacovigilance data and novel clinical applications. PLoS ONE 11(6):1\u201320. https:\/\/doi.org\/10.1371\/journal.pone.0157753","journal-title":"PLoS ONE"},{"issue":"1","key":"486_CR4","doi-asserted-by":"publisher","first-page":"321","DOI":"10.1613\/jair.953","volume":"16","author":"NV Chawla","year":"2002","unstructured":"Chawla NV, Bowyer KW, Hall LO et al (2002) SMOTE: synthetic minority over-sampling technique. J\u00a0Artif Intell Res 16(1):321\u2013357. https:\/\/doi.org\/10.1613\/jair.953","journal-title":"J Artif Intell Res"},{"issue":"2","key":"486_CR5","doi-asserted-by":"publisher","first-page":"182","DOI":"10.1109\/4235.996017","volume":"6","author":"K Deb","year":"2002","unstructured":"Deb K, Pratap A, Agarwal S et al (2002) A\u00a0fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE T Evol Comput 6(2):182\u2013197. https:\/\/doi.org\/10.1109\/4235.996017","journal-title":"IEEE T Evol Comput"},{"issue":"6","key":"486_CR6","doi-asserted-by":"publisher","first-page":"483","DOI":"10.1002\/pds.677","volume":"10","author":"SJ Evans","year":"2001","unstructured":"Evans SJ, Waller PC, Davis S (2001) Use of proportional reporting ratios (PRRs) for signal generation from spontaneous adverse drug reaction reports. Pharmacoepidemiol Dr 10(6):483\u2013486. https:\/\/doi.org\/10.1002\/pds.677","journal-title":"Pharmacoepidemiol Dr"},{"issue":"1","key":"486_CR7","first-page":"2171","volume":"13","author":"FA Fortin","year":"2012","unstructured":"Fortin FA, De Rainville FM, Gardner MA et al (2012) DEAP: evolutionary algorithms made easy. 
J\u00a0Mach Learn Res 13(1):2171\u20132175","journal-title":"J Mach Learn Res"},{"key":"486_CR8","volume-title":"Hands-on machine learning with scikit-learn, keras, and tensorflow","author":"A G\u00e9ron","year":"2019","unstructured":"G\u00e9ron A (2019) Hands-on machine learning with scikit-learn, keras, and tensorflow, 2nd\u00a0edn. O\u2019Reilly Media, Inc.","edition":"2nd"},{"key":"486_CR9","unstructured":"International Council for Harmonisation of Technical Requirements for Pharmaceuticals for Human Use (2024) Welcome to the ICH MedDRA website. https:\/\/www.meddra.org\/how-to-use\/support-documentation\/english\/welcome. Accessed: August 29th 2024"},{"key":"486_CR10","doi-asserted-by":"publisher","DOI":"10.1097\/MD.0000000000029387","author":"HR Kim","year":"2022","unstructured":"Kim HR, Sung M, Park JA et al (2022) Analyzing adverse drug reaction using statistical and machine learning methods: A systematic review. Medicine. https:\/\/doi.org\/10.1097\/MD.0000000000029387","journal-title":"Medicine"},{"issue":"1","key":"486_CR11","doi-asserted-by":"publisher","first-page":"1373","DOI":"10.1093\/nar\/gkac956","volume":"51","author":"S Kim","year":"2022","unstructured":"Kim S, Chen J, Cheng T et al (2022) PubChem 2023 update. Nucleic Acids Res 51(1):1373\u20131380. https:\/\/doi.org\/10.1093\/nar\/gkac956","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"486_CR12","doi-asserted-by":"publisher","first-page":"1265","DOI":"10.1093\/nar\/gkad976","volume":"52","author":"C Knox","year":"2023","unstructured":"Knox C, Wilson M, Klinger CM et al (2023) DrugBank 6.0: the DrugBank Knowledgebase for 2024. Nucleic Acids Res 52(1):1265\u20131275. 
https:\/\/doi.org\/10.1093\/nar\/gkad976","journal-title":"Nucleic Acids Res"},{"key":"486_CR13","doi-asserted-by":"publisher","first-page":"343","DOI":"10.1038\/msb.2009.98","volume":"6","author":"M Kuhn","year":"2010","unstructured":"Kuhn M, Campillos M, Letunic I et al (2010) A\u00a0side effect resource to capture phenotypic effects of drugs. Mol Syst Biol 6:343. https:\/\/doi.org\/10.1038\/msb.2009.98","journal-title":"Mol Syst Biol"},{"issue":"1","key":"486_CR14","doi-asserted-by":"publisher","first-page":"1075","DOI":"10.1093\/nar\/gkv1075","volume":"44","author":"M Kuhn","year":"2016","unstructured":"Kuhn M, Letunic I, Jensen LJ et al (2016) The SIDER database of drugs and side effects. Nucleic Acids Res 44(1):1075\u20131079. https:\/\/doi.org\/10.1093\/nar\/gkv1075","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"486_CR15","doi-asserted-by":"publisher","first-page":"250","DOI":"10.1093\/bioinformatics\/btz470","volume":"36","author":"TT Le","year":"2019","unstructured":"Le TT, Fu W, Moore JH (2019) Scaling tree-based automated machine learning to biomedical big data with a\u00a0feature set selector. Bioinformatics 36(1):250\u2013256. https:\/\/doi.org\/10.1093\/bioinformatics\/btz470","journal-title":"Bioinformatics"},{"issue":"7","key":"486_CR16","doi-asserted-by":"publisher","first-page":"1332","DOI":"10.1016\/j.drudis.2019.03.003","volume":"24","author":"CY Lee","year":"2019","unstructured":"Lee CY, Chen YPP (2019) Machine learning on adverse drug reactions for pharmacovigilance. Drug Discov Today 24(7):1332\u20131343. https:\/\/doi.org\/10.1016\/j.drudis.2019.03.003","journal-title":"Drug Discov Today"},{"issue":"1","key":"486_CR17","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/S41598-022-18522-Z\/FIGURES\/3","volume":"12","author":"JE Lee","year":"2022","unstructured":"Lee JE, Kim JH, Bae JH et al (2022) Detecting early safety signals of infliximab using machine learning algorithms in the Korea adverse event reporting system. 
Sci Rep 12(1):1\u201312. https:\/\/doi.org\/10.1038\/S41598-022-18522-Z\/FIGURES\/3","journal-title":"Sci Rep"},{"issue":"1","key":"486_CR18","doi-asserted-by":"publisher","first-page":"28","DOI":"10.1136\/AMIAJNL-2011-000699","volume":"19","author":"M Liu","year":"2012","unstructured":"Liu M, Wu Y, Chen Y et al (2012) Large-scale prediction of adverse drug reactions using chemical, biological, and phenotypic properties of drugs. J\u00a0Am Med Inform Assn 19(1):28\u201335. https:\/\/doi.org\/10.1136\/AMIAJNL-2011-000699","journal-title":"J Am Med Inform Assn"},{"key":"486_CR19","series-title":"NIPS","doi-asserted-by":"publisher","first-page":"4768","DOI":"10.48550\/arXiv.1705.07874","volume-title":"Proceedings of the 31st International Conference on Neural Information Processing Systems","author":"SM Lundberg","year":"2017","unstructured":"Lundberg SM, Lee SI (2017) A\u00a0unified approach to interpreting model predictions. In: Proceedings of the 31st International Conference on Neural Information Processing Systems. NIPS, vol 17. Curran Associates Inc, Red Hook, NY, USA, pp 4768\u20134777 https:\/\/doi.org\/10.48550\/arXiv.1705.07874"},{"issue":"86","key":"486_CR20","first-page":"2579","volume":"9","author":"L van der Maaten","year":"2008","unstructured":"van der Maaten L, Hinton G (2008) Visualizing data using t\u2011SNE. J\u00a0Mach Learn Res 9(86):2579\u20132605 (http:\/\/jmlr.org\/papers\/v9\/vandermaaten08a.html)","journal-title":"J Mach Learn Res"},{"key":"486_CR21","doi-asserted-by":"publisher","first-page":"485","DOI":"10.1145\/2908812.2908918","volume-title":"Proceedings of the Genetic and Evolutionary Computation Conference 2016","author":"RS Olson","year":"2016","unstructured":"Olson RS, Bartley N, Urbanowicz RJ et al (2016) Evaluation of a\u00a0tree-based pipeline optimization tool for automating data science. In: Proceedings of the Genetic and Evolutionary Computation Conference 2016, vol 16. 
Association for Computing Machinery, New York, NY, USA, pp 485\u2013492 https:\/\/doi.org\/10.1145\/2908812.2908918"},{"key":"486_CR22","first-page":"2825","volume":"12","author":"F Pedregosa","year":"2011","unstructured":"Pedregosa F, Varoquaux G, Gramfort A et al (2011) Scikit-learn: Machine learning in Python. J\u00a0Mach Learn Res 12:2825\u20132830","journal-title":"J Mach Learn Res"},{"issue":"6","key":"486_CR23","doi-asserted-by":"publisher","first-page":"743","DOI":"10.1007\/s40264-018-00792-0","volume":"42","author":"M Pham","year":"2019","unstructured":"Pham M, Cheng F, Ramachandran K (2019) A\u00a0comparison study of algorithms to detect drug-adverse event associations: Frequentist, bayesian, and machine-learning approaches. Drug Saf 42(6):743\u2013750. https:\/\/doi.org\/10.1007\/s40264-018-00792-0","journal-title":"Drug Saf"},{"key":"486_CR24","series-title":"chap","doi-asserted-by":"publisher","DOI":"10.5772\/50095","volume-title":"Data Mining Applications in Engineering and Medicine","author":"E Poluzzi","year":"2012","unstructured":"Poluzzi E, Raschi E, Piccinni C et al (2012) Data mining techniques in pharmacovigilance: Analysis of the publicly accessible FDA Adverse Event Reporting System (AERS). In: Karahoca A (ed) Data Mining Applications in Engineering and Medicine. chap, vol 12. IntechOpen, Rijeka https:\/\/doi.org\/10.5772\/50095"},{"key":"486_CR25","series-title":"NIPS\u201908","first-page":"1313","volume-title":"Advances in Neural Information Processing Systems","author":"A Rahimi","year":"2008","unstructured":"Rahimi A, Recht B (2008) Weighted sums of random kitchen sinks: Replacing minimization with randomization in learning. In: Koller D, Schuurmans D, Bengio Y et al (eds) Advances in Neural Information Processing Systems. NIPS\u201908, vol 21. 
Curran Associates Inc, Red Hook, NY, USA, pp 1313\u20131320"},{"issue":"8","key":"486_CR26","doi-asserted-by":"publisher","first-page":"519","DOI":"10.1002\/pds.1001","volume":"13","author":"KJ Rothman","year":"2004","unstructured":"Rothman KJ, Lanes S, Sacks ST (2004) The reporting odds ratio and its advantages over the proportional reporting ratio. Pharmacoepidem Dr 13(8):519\u2013523. https:\/\/doi.org\/10.1002\/pds.1001","journal-title":"Pharmacoepidem Dr"},{"key":"486_CR27","series-title":"SAC","doi-asserted-by":"publisher","first-page":"676","DOI":"10.1145\/3341105.3374068","volume-title":"Proceedings of the 35th Annual ACM Symposium on Applied Computing","author":"CH Wang","year":"2020","unstructured":"Wang CH, Lin WY (2020) Deep learning from spontaneous reporting systems data to detect ADR signals. In: Proceedings of the 35th Annual ACM Symposium on Applied Computing. SAC, vol 20. Association for Computing Machinery, New York, NY, USA, pp 676\u2013678 https:\/\/doi.org\/10.1145\/3341105.3374068"}],"container-title":["Datenbank-Spektrum"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s13222-024-00486-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s13222-024-00486-1\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s13222-024-00486-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,11,19]],"date-time":"2024-11-19T16:43:23Z","timestamp":1732034603000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s13222-024-00486-1"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,10,9]]},"references-count":27,"journal-issue":{"issue":"3",
"published-print":{"date-parts":[[2024,11]]}},"alternative-id":["486"],"URL":"https:\/\/doi.org\/10.1007\/s13222-024-00486-1","relation":{},"ISSN":["1618-2162","1610-1995"],"issn-type":[{"type":"print","value":"1618-2162"},{"type":"electronic","value":"1610-1995"}],"subject":[],"published":{"date-parts":[[2024,10,9]]},"assertion":[{"value":"1 June 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"12 September 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"9 October 2024","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"The authors have no competing interests to declare that are relevant to the content of this article.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}