{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,14]],"date-time":"2026-01-14T18:00:51Z","timestamp":1768413651277,"version":"3.49.0"},"reference-count":35,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2020,2,3]],"date-time":"2020-02-03T00:00:00Z","timestamp":1580688000000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"},{"start":{"date-parts":[[2020,2,3]],"date-time":"2020-02-03T00:00:00Z","timestamp":1580688000000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"National Social Science Foundation of China","award":["14BTQ036"],"award-info":[{"award-number":["14BTQ036"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Med Inform Decis Mak"],"published-print":{"date-parts":[[2020,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Background<\/jats:title><jats:p>Data masking is an inborn defect of measures of disproportionality in adverse drug reactions (ADRs) signal detection. Many previous studies can be roughly classified into three categories: data removal, regression and stratification. However, frequency differences of adverse drug events (ADEs) reports, which would be an important factor of masking, were not considered in these methods. The aim of this study is to explore a novel stratification method for minimizing the impact of frequency differences on real signals masking.<\/jats:p><\/jats:sec><jats:sec><jats:title>Methods<\/jats:title><jats:p>Reports in the Chinese Spontaneous Reporting Database (CSRD) between 2010 and 2011 were selected. The overall dataset was stratified into some clusters by the frequency of drugs, ADRs, and drug-event combinations (DECs) in sequence.<jats:italic>K-means<\/jats:italic>clustering was used to conduct stratification according to data distribution characteristics. The Information Component (IC) was adopted for signal detection in each cluster respectively. By extracting ADRs from drug product labeling, a reference database was introduced for performance evaluation based on<jats:italic>Recall<\/jats:italic>,<jats:italic>Precision<\/jats:italic>and<jats:italic>F<\/jats:italic>-measure. In addition, some DECs from the Adverse Drug Reactions Information Bulletin (ADRIB) issued by CFDA were collected for further reliability evaluation.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>With stratification, the study dataset was divided into 21 clusters, among which the frequency of DRUGs, ADRs or DECs followed the similar order of magnitude respectively.<jats:italic>Recall<\/jats:italic>increased by 34.95% from 29.93 to 40.39%,<jats:italic>Precision<\/jats:italic>reduced by 10.52% from 54.56 to 48.82%, while<jats:italic>F-<\/jats:italic>measure increased by 14.39% from 38.65 to 44.21%. According to ADRIB after 2011, 5 DECs related to<jats:italic>Potassium Magnesium Aspartate<\/jats:italic>, 61 DECs related to<jats:italic>Levofloxacin Hydrochloride<\/jats:italic>and 26 DECs related to<jats:italic>Cefazolin<\/jats:italic>were highlighted.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusions<\/jats:title><jats:p>The proposed method is effectively and reliably for the minimization of data masking effect in signal detection. Considering the decrease of<jats:italic>Precision<\/jats:italic>, it is suggested to be a supplement rather than an alternative to non-stratification method.<\/jats:p><\/jats:sec>","DOI":"10.1186\/s12911-020-1037-z","type":"journal-article","created":{"date-parts":[[2020,2,3]],"date-time":"2020-02-03T16:03:42Z","timestamp":1580745822000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["A stratification method based on clustering for the minimization of data masking effect in signal detection"],"prefix":"10.1186","volume":"20","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-9052-9212","authenticated-orcid":false,"given":"Jian-Xiang","family":"Wei","sequence":"first","affiliation":[]},{"given":"Yue","family":"Ding","sequence":"additional","affiliation":[]},{"given":"Ming","family":"Li","sequence":"additional","affiliation":[]},{"given":"Jun","family":"Sun","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2020,2,3]]},"reference":[{"issue":"11","key":"1037_CR1","doi-asserted-by":"publisher","first-page":"1590","DOI":"10.1001\/jamainternmed.2019.3066","volume":"179","author":"Noam Tau","year":"2019","unstructured":"Tau N, Shochat T, Gafter-Gvili A, Tibau A, Amir E, Shepshelovich D. Association between data sources and US Food and Drug Administration drug safety communications. JAMA Intern Med. Published online September. 2019;03. https:\/\/doi.org\/10.1001\/jamainternmed.2019.3066.","journal-title":"JAMA Internal Medicine"},{"issue":"1","key":"1037_CR2","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1055\/s-0038-1636131","volume":"13","author":"DJ Finney","year":"1974","unstructured":"Finney DJ. Systemic signalling of adverse reactions to drugs. Methods Inf Med. 1974;13(1):1\u201310.","journal-title":"Methods Inf Med"},{"issue":"6","key":"1037_CR3","doi-asserted-by":"publisher","first-page":"483","DOI":"10.1002\/pds.677","volume":"10","author":"SJW Evans","year":"2001","unstructured":"Evans SJW, Waller PC, Davis S. Use of proportional reporting ratios (PRRs) for signal generation from spontaneous adverse drug reaction reports. Pharmacoepidemiol Drug Saf. 2001;10(6):483.","journal-title":"Pharmacoepidemiol Drug Saf"},{"issue":"2","key":"1037_CR4","doi-asserted-by":"publisher","first-page":"127","DOI":"10.1046\/j.1365-2125.2003.01968.x","volume":"57","author":"AM Wilson","year":"2004","unstructured":"Wilson AM, Thabane L, Holbrook A. Application of data mining techniques in pharmacovigilance. Br J Clin Pharmacol. 2004;57(2):127\u201334.","journal-title":"Br J Clin Pharmacol"},{"issue":"5","key":"1037_CR5","doi-asserted-by":"publisher","first-page":"293","DOI":"10.2165\/00002018-200326050-00001","volume":"26","author":"PE Van","year":"2003","unstructured":"Van PE, Diemont W, Van GK. Application of quantitative signal detection in the Dutch spontaneous reporting system for adverse drug reactions. Drug Saf. 2003;26(5):293\u2013301.","journal-title":"Drug Saf"},{"issue":"6","key":"1037_CR6","doi-asserted-by":"publisher","first-page":"393","DOI":"10.2165\/00002018-200225060-00002","volume":"25","author":"A Bate","year":"2002","unstructured":"Bate A, Lindquist M, Edwards IR, Orre R. A data mining approach for signal detection and analysis. Drug Saf. 2002;25(6):393\u20137.","journal-title":"Drug Saf"},{"issue":"3","key":"1037_CR7","doi-asserted-by":"publisher","first-page":"159","DOI":"10.2165\/00002018-200326030-00003","volume":"26","author":"M Hauben","year":"2003","unstructured":"Hauben M, Zhou X. Quantitative methods in pharmacovigilance: focus on signal detection. Drug Saf. 2003;26(3):159\u201386.","journal-title":"Drug Saf"},{"issue":"4","key":"1037_CR8","doi-asserted-by":"publisher","first-page":"315","DOI":"10.1007\/s002280050466","volume":"54","author":"A Bate","year":"1998","unstructured":"Bate A, Lindquist M, Edwards IR, Olsson S, Orre R, Lansner A, et al. A Bayesian neural network method for adverse drug reaction signal generation. Eur J Clin Pharmacol. 1998;54(4):315\u201321.","journal-title":"Eur J Clin Pharmacol"},{"issue":"4","key":"1037_CR9","doi-asserted-by":"publisher","first-page":"473","DOI":"10.1016\/S0167-9473(99)00114-0","volume":"34","author":"R Orre","year":"2000","unstructured":"Orre R, Lansner A, Bate A, Lindquist M. Bayesian neural networks with confidence estimations applied to data mining. Comput Stat Data Anal. 2000;34(4):473\u201393.","journal-title":"Comput Stat Data Anal"},{"issue":"7","key":"1037_CR10","doi-asserted-by":"publisher","first-page":"623","DOI":"10.2165\/00002018-200730070-00011","volume":"30","author":"A Bate","year":"2007","unstructured":"Bate A. Bayesian confidence propagation neural network. Drug Saf. 2007;30(7):623\u20135.","journal-title":"Drug Saf"},{"issue":"3","key":"1037_CR11","doi-asserted-by":"crossref","first-page":"177","DOI":"10.1080\/00031305.1999.10474456","volume":"53","author":"W Dumouchel","year":"1999","unstructured":"Dumouchel W. Bayesian data Mining in Large Frequency Tables, with an application to the FDA spontaneous reporting system. Am Stat. 1999;53(3):177\u201390.","journal-title":"Am Stat"},{"key":"1037_CR12","doi-asserted-by":"publisher","first-page":"152","DOI":"10.1016\/j.ajp.2017.09.005","volume":"30","author":"V Subeesh","year":"2017","unstructured":"Subeesh V, Singh H, Maheswari E, Beulah E. Novel adverse events of vortioxetine: a disproportionality analysis in USFDA adverse event reporting system database. Asian J Psychiatr. 2017;30:152.","journal-title":"Asian J Psychiatr"},{"key":"1037_CR13","doi-asserted-by":"publisher","first-page":"237","DOI":"10.1016\/j.psychres.2017.06.038","volume":"256","author":"S Kim","year":"2017","unstructured":"Kim S, Park K, Kim MS, Yang BR, Choi HJ, Park BJ. Data-mining for detecting signals of adverse drug reactions of fluoxetine using the Korea adverse event reporting system (KAERS) database. Psychiatry Res. 2017;256:237.","journal-title":"Psychiatry Res"},{"key":"1037_CR14","volume-title":"Adverse drug reactions in the ICU\/\/ critical care toxicology","author":"P Moore","year":"2016","unstructured":"Moore P, Burkhart K. Adverse drug reactions in the ICU\/\/ critical care toxicology; 2016."},{"issue":"3","key":"1037_CR15","doi-asserted-by":"publisher","first-page":"251","DOI":"10.1007\/s40264-015-0375-8","volume":"39","author":"M Arnaud","year":"2016","unstructured":"Arnaud M, Salvo F, Ahmed I, Robinson P, Moore N, B\u00e9gaud B, et al. A method for the minimization of competition Bias in signal detection from spontaneous reporting databases. Drug Saf. 2016;39(3):251\u201360.","journal-title":"Drug Saf"},{"issue":"7","key":"1037_CR16","doi-asserted-by":"publisher","first-page":"559","DOI":"10.1002\/pds.771","volume":"12","author":"AL Gould","year":"2003","unstructured":"Gould AL. Practical pharmacovigilance analysis strategies. Pharmacoepidemiol Drug Saf. 2003;12(7):559\u201374.","journal-title":"Pharmacoepidemiol Drug Saf"},{"issue":"5","key":"1037_CR17","doi-asserted-by":"publisher","first-page":"929","DOI":"10.1517\/14740338.4.5.929","volume":"4","author":"M Hauben","year":"2005","unstructured":"Hauben M, Madigan D, Gerrits CM, Walsh L, Van Puijenbroek EP. The role of data mining in pharmacovigilance. Expert Opin Drug Saf. 2005;4(5):929\u201348.","journal-title":"Expert Opin Drug Saf"},{"key":"1037_CR18","doi-asserted-by":"crossref","unstructured":"Ewans SJW. Statistics: analysis and presentation of safety data. In: Talbot J, Waller P, editors. Stephens\u2019 Detection of New Adverse Drug Reactions, 5th edn. Chichester: Wiley; 2004. p. 301\u201328.","DOI":"10.1002\/0470014199.ch6"},{"issue":"11","key":"1037_CR19","doi-asserted-by":"publisher","first-page":"1166","DOI":"10.1002\/pds.2022","volume":"19","author":"A Pariente","year":"2010","unstructured":"Pariente A, Didailler M, Avillach P, Miremont-Salam\u00e9 G, Fourrier-Reglat A, Haramburu F, et al. A potential competition bias in the detection of safety signals from spontaneous reporting databases. Pharmacoepidemiol Drug Saf. 2010;19(11):1166\u201371.","journal-title":"Pharmacoepidemiol Drug Saf"},{"issue":"2","key":"1037_CR20","doi-asserted-by":"publisher","first-page":"208","DOI":"10.1002\/pds.3530","volume":"23","author":"F Maignen","year":"2014","unstructured":"Maignen F, Hauben M, Hung E, Holle LV, Dogne JM. A conceptual approach to the masking effect of measures of disproportionality. Pharmacoepidemiol Drug Saf. 2014;23(2):208\u201317.","journal-title":"Pharmacoepidemiol Drug Saf"},{"issue":"7","key":"1037_CR21","doi-asserted-by":"publisher","first-page":"689","DOI":"10.1007\/s40264-016-0413-1","volume":"39","author":"F Montastruc","year":"2016","unstructured":"Montastruc F, Salvo F, Arnaud M, B\u00e9gaud B, Pariente A. Signal of gastrointestinal congenital malformations with antipsychotics after Minimising competition Bias: a disproportionality analysis using data from Vigibase (\u00ae). Drug Saf. 2016;39(7):689\u201396.","journal-title":"Drug Saf"},{"issue":"12","key":"1037_CR22","doi-asserted-by":"publisher","first-page":"1117","DOI":"10.2165\/11584390-000000000-00000","volume":"33","author":"HW Wang","year":"2010","unstructured":"Wang HW, Hochberg AM, Pearson RK, Hauben M. An experimental investigation of masking in the US FDA adverse event reporting system database. Drug Saf. 2010;33(12):1117\u201333.","journal-title":"Drug Saf"},{"issue":"10","key":"1037_CR23","doi-asserted-by":"crossref","first-page":"1119","DOI":"10.1002\/pds.3474","volume":"22","author":"K Juhlin","year":"2013","unstructured":"Juhlin K, Ye X, Star K, Nor\u00e9n GN. Outlier removal to uncover patterns in adverse drug reaction surveillance-a simple unmasking strategy. Pharmacoepidemiol Drug Saf. 2013;22(10):1119\u201329.","journal-title":"Pharmacoepidemiol Drug Saf"},{"issue":"4","key":"1037_CR24","doi-asserted-by":"crossref","first-page":"197","DOI":"10.1002\/sam.10078","volume":"3","author":"O Caster","year":"2010","unstructured":"Caster O, Madigan D, Bate A. Large-scale regression-based pattern discovery: the example of screening the WHO global drug safety database. Stat Anal Data Min. 2010;3(4):197\u2013208.","journal-title":"Stat Anal Data Min"},{"key":"1037_CR25","volume-title":"Large-scale regression-based pattern discovery in international adverse drug reaction surveillance","author":"O Caster","year":"2008","unstructured":"Caster O, Nor\u00e9n GN, Madigan D, Bate A. Large-scale regression-based pattern discovery in international adverse drug reaction surveillance; 2008."},{"issue":"1","key":"1037_CR26","first-page":"33","volume":"10","author":"YE Xiao-fei","year":"2013","unstructured":"Xiao-fei YE, Gang CHENG, Xiao-jing GUO, Jin-fang XU, Jia HE. Masking effect in the process of signal detection of adverse drug reaction. Chinese J Pharmacovigilance. 2013;10(1):33\u20135.","journal-title":"Chinese J Pharmacovigilance"},{"issue":"21","key":"1037_CR27","doi-asserted-by":"publisher","first-page":"3740","DOI":"10.1002\/sim.2473","volume":"25","author":"GN Nor\u00e9n","year":"2006","unstructured":"Nor\u00e9n GN, Bate A, Orre R, Edwards IR. Extending the methods used to screen the WHO drug safety database towards analysis of complex associations and improved accuracy for rare events. Stat Med. 2006;25(21):3740\u201357.","journal-title":"Stat Med"},{"issue":"11","key":"1037_CR28","doi-asserted-by":"publisher","first-page":"1035","DOI":"10.2165\/00002018-200831110-00008","volume":"31","author":"J Hopstadius","year":"2008","unstructured":"Hopstadius J, Nor\u00e9n GN, Bate A, Edwards IR. Impact of stratification on adverse drug reaction surveillance. Drug Saf. 2008;31(11):1035\u201348.","journal-title":"Drug Saf"},{"issue":"3","key":"1037_CR29","first-page":"142","volume":"7","author":"Q Wei","year":"2010","unstructured":"Wei Q, Xiao-fei Y, Chao W, Jia H. Methods of controlling confounding factors in adverse drug reaction signal detection. Chinese J Pharmacovigilance. 2010;7(3):142\u20134.","journal-title":"Chinese J Pharmacovigilance"},{"issue":"3","key":"1037_CR30","first-page":"158","volume":"08","author":"Z Lu","year":"2011","unstructured":"Lu Z, Xiao-fei Y, Chao W, Wei Q, Wen-min D, Jia H. Application of stratification analysis in adverse drug reaction signal detection. Chinese J Pharmacovigilance. 2011;08(3):158\u201360.","journal-title":"Chinese J Pharmacovigilance"},{"issue":"11","key":"1037_CR31","doi-asserted-by":"publisher","first-page":"1049","DOI":"10.2165\/00002018-200831110-00009","volume":"31","author":"SJW Evans","year":"2008","unstructured":"Evans SJW. Stratification for spontaneous report databases. Drug Saf. 2008;31(11):1049\u201352.","journal-title":"Drug Saf"},{"key":"1037_CR32","unstructured":"Han J. Data Mining: Concepts and Techniques. San Francisco: Margan Kaufmann; 2005."},{"key":"1037_CR33","unstructured":"CFDA. Adverse Drug Reactions Bulletin (No. 50) Alerting Severe Allergic Response to Potassium Magnesium Aspartate Injection. 2012. http:\/\/www.sda.gov.cn\/WS01\/CL0078\/74984.html. Accessed 19 Sept 2012."},{"key":"1037_CR34","unstructured":"CFDA. Adverse Drug Reactions Bulletin (No. 56) Beware of Severe Adverse Effects of Levofloxacin Hydrochloride Injection. 2013. http:\/\/www.sda.gov.cn\/WS01\/CL0078\/82939.html. Accessed 2 Aug 2013."},{"key":"1037_CR35","unstructured":"CFDA. Adverse Drug Reaction Information Bulletin (No. 59) Concerned about Severe Adverse Drug Reactions of Cefazolin Injection. 2014. http:\/\/www.sda.gov.cn\/WS01\/CL0078\/ 96374.html. Accessed 26 Jan 2014."}],"container-title":["BMC Medical Informatics and Decision Making"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12911-020-1037-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/s12911-020-1037-z\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12911-020-1037-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,7,31]],"date-time":"2024-07-31T00:24:34Z","timestamp":1722385474000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcmedinformdecismak.biomedcentral.com\/articles\/10.1186\/s12911-020-1037-z"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,2,3]]},"references-count":35,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2020,12]]}},"alternative-id":["1037"],"URL":"https:\/\/doi.org\/10.1186\/s12911-020-1037-z","relation":{},"ISSN":["1472-6947"],"issn-type":[{"value":"1472-6947","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,2,3]]},"assertion":[{"value":"13 August 2018","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"28 January 2020","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"3 February 2020","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"For all studies mentioned in this manuscript, no approval of the ethical review board was needed according to \u201cEthical Review Involving Human Biomedical Research\u201d (No.11 Order of National Health and Family Planning Commission of China,2017\/content_ 5227817. html); only human subject research with a high impact for patients have to be reviewed.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"18"}}