{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,15]],"date-time":"2026-06-15T19:49:02Z","timestamp":1781552942589,"version":"3.54.5"},"reference-count":31,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2014,9,23]],"date-time":"2014-09-23T00:00:00Z","timestamp":1411430400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Knowl. Discov. Data"],"published-print":{"date-parts":[[2014,11,17]]},"abstract":"<jats:p>Finding the most interesting correlations among items is essential for problems in many commercial, medical, and scientific domains. Although there are numerous measures available for evaluating correlations, different correlation measures provide drastically different results. Piatetsky-Shapiro provided three mandatory properties for any reasonable correlation measure, and Tan et al. proposed several properties to categorize correlation measures; however, it is still hard for users to choose the desirable correlation measures according to their needs. In order to solve this problem, we explore the effectiveness problem in three ways. First, we propose two desirable properties and two optional properties for correlation measure selection and study the property satisfaction for different correlation measures. Second, we study different techniques to adjust correlation measures and propose two new correlation measures: the Simplified \u03c7<jats:sup>2<\/jats:sup>with Continuity Correction and the Simplified \u03c7<jats:sup>2<\/jats:sup>with Support. Third, we analyze the upper and lower bounds of different measures and categorize them by the bound differences. Combining these three directions, we provide guidelines for users to choose the proper measure according to their needs.<\/jats:p>","DOI":"10.1145\/2637484","type":"journal-article","created":{"date-parts":[[2014,10,1]],"date-time":"2014-10-01T13:34:59Z","timestamp":1412170499000},"page":"1-28","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":13,"title":["Selecting the Right Correlation Measure for Binary Data"],"prefix":"10.1145","volume":"9","author":[{"given":"Lian","family":"Duan","sequence":"first","affiliation":[{"name":"New Jersey Institute of Technology"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"W. Nick","family":"Street","sequence":"additional","affiliation":[{"name":"University of Iowa"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yanchi","family":"Liu","sequence":"additional","affiliation":[{"name":"New Jersey Institute of Technology"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Songhua","family":"Xu","sequence":"additional","affiliation":[{"name":"New Jersey Institute of Technology"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Brook","family":"Wu","sequence":"additional","affiliation":[{"name":"New Jersey Institute of Technology"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2014,9,23]]},"reference":[{"key":"e_1_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/170035.170072"},{"key":"e_1_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1007\/s002280050466"},{"key":"e_1_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/253260.253327"},{"key":"e_1_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/253260.253325"},{"key":"e_1_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevE.70.066111"},{"key":"e_1_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2009.89"},{"key":"e_1_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patrec.2013.05.027"},{"key":"e_1_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/2623330.2623629"},{"key":"e_1_2_2_9_1","doi-asserted-by":"crossref","first-page":"177","DOI":"10.1080\/00031305.1999.10474456","article-title":"Bayesian data mining in large frequency tables, with an application to the FDA spontaneous reporting system","volume":"53","author":"Dumouchel W.","year":"1999","unstructured":"W. Dumouchel . 1999 . Bayesian data mining in large frequency tables, with an application to the FDA spontaneous reporting system . American Statistician 53 , 3 (1999), 177 -- 202 . W. Dumouchel. 1999. Bayesian data mining in large frequency tables, with an application to the FDA spontaneous reporting system. American Statistician 53, 3 (1999), 177--202.","journal-title":"American Statistician"},{"key":"e_1_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.5555\/972450.972454"},{"key":"e_1_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1103\/RevModPhys.29.454"},{"key":"e_1_2_2_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/1132960.1132963"},{"key":"e_1_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.is.2003.08.004"},{"key":"e_1_2_2_14_1","unstructured":"R. H. Johnson and D. W. Wichern. 2001. Applied Multivariate Statistical Analysis. Prentice Hall. R. H. Johnson and D. W. Wichern. 2001. Applied Multivariate Statistical Analysis. Prentice Hall."},{"key":"e_1_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1136\/amiajnl-2012-001119"},{"key":"e_1_2_2_16_1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1080\/01621459.1968.11009219","article-title":"Association and estimation in contingency tables","volume":"63","author":"Mosteller F.","year":"1968","unstructured":"F. Mosteller . 1968 . Association and estimation in contingency tables . Journal of the American Statistical Association 63 , 321 (1968), 1 -- 28 . F. Mosteller. 1968. Association and estimation in contingency tables. Journal of the American Statistical Association 63, 321 (1968), 1--28.","journal-title":"Journal of the American Statistical Association"},{"key":"e_1_2_2_17_1","volume-title":"Proceedings of the KDD 2000 Workshop on Postprocessing in Machine Learning and Data Mining.","author":"Tan P.-N.","unstructured":"P.-N. Tan and V. Kumar . 2000. Interestingness measures for association patterns: A perspective . In Proceedings of the KDD 2000 Workshop on Postprocessing in Machine Learning and Data Mining. P.-N. Tan and V. Kumar. 2000. Interestingness measures for association patterns: A perspective. In Proceedings of the KDD 2000 Workshop on Postprocessing in Machine Learning and Data Mining."},{"key":"e_1_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/1401890.1402005"},{"key":"e_1_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2003.1161582"},{"key":"e_1_2_2_20_1","volume-title":"Methods section for the disproportionality paper. (September","author":"OP.","year":"2010","unstructured":"OM OP. 2010. Methods section for the disproportionality paper. (September 2010 ). http:\/\/omop.fnih.org\/MethodsLibrary. OMOP. 2010. Methods section for the disproportionality paper. (September 2010). http:\/\/omop.fnih.org\/MethodsLibrary."},{"key":"e_1_2_2_21_1","volume-title":"Discovery, Analysis, and Presentation of Strong Rules","author":"Piatetsky-Shapiro G.","unstructured":"G. Piatetsky-Shapiro . 1991. Discovery, Analysis, and Presentation of Strong Rules . AAAI\/MIT Press , 229--248. G. Piatetsky-Shapiro. 1991. Discovery, Analysis, and Presentation of Strong Rules. AAAI\/MIT Press, 229--248."},{"key":"e_1_2_2_22_1","volume-title":"The Analysis of Cross-Classifications","author":"Reynold H. T.","unstructured":"H. T. Reynold . 1977. The Analysis of Cross-Classifications . Free Press . H. T. Reynold. 1977. The Analysis of Cross-Classifications. Free Press."},{"key":"e_1_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1148\/radiol.2301031028"},{"key":"e_1_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0306-4379(03)00072-3"},{"key":"e_1_2_2_25_1","unstructured":"P.-N. Tan M. Steinbach and V. Kumar. 2005. Introduction to Data Mining. Addison Wesley. P.-N. Tan M. Steinbach and V. Kumar. 2005. Introduction to Data Mining. Addison Wesley."},{"key":"e_1_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10618-013-0326-x"},{"key":"e_1_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2006.161"},{"key":"e_1_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2006.68"},{"key":"e_1_2_2_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/1183614.1183640"},{"key":"e_1_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1142\/S0218001401000976"},{"key":"e_1_2_2_31_1","volume-title":"Proceedings of the 3rd European Conference on Principles of Data Mining and Knowledge Discovery (PKDD\u201999)","author":"Zhong N.","unstructured":"N. Zhong , Y. Y. Yao , and S. Ohsuga . 1999. Peculiarity oriented multi-database mining . In Proceedings of the 3rd European Conference on Principles of Data Mining and Knowledge Discovery (PKDD\u201999) . Springer-Verlag, London, UK, 136--146. N. Zhong, Y. Y. Yao, and S. Ohsuga. 1999. Peculiarity oriented multi-database mining. In Proceedings of the 3rd European Conference on Principles of Data Mining and Knowledge Discovery (PKDD\u201999). Springer-Verlag, London, UK, 136--146."}],"container-title":["ACM Transactions on Knowledge Discovery from Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2637484","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2637484","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T07:28:17Z","timestamp":1750231697000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2637484"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,9,23]]},"references-count":31,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2014,11,17]]}},"alternative-id":["10.1145\/2637484"],"URL":"https:\/\/doi.org\/10.1145\/2637484","relation":{},"ISSN":["1556-4681","1556-472X"],"issn-type":[{"value":"1556-4681","type":"print"},{"value":"1556-472X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2014,9,23]]},"assertion":[{"value":"2013-08-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2014-06-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2014-09-23","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}