{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,7]],"date-time":"2026-02-07T16:52:32Z","timestamp":1770483152846,"version":"3.49.0"},"reference-count":50,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2010,5,1]],"date-time":"2010-05-01T00:00:00Z","timestamp":1272672000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Knowl. Discov. Data"],"published-print":{"date-parts":[[2010,5]]},"abstract":"<jats:p>In the context of civil rights law, discrimination refers to unfair or unequal treatment of people based on membership to a category or a minority, without regard to individual merit. Discrimination in credit, mortgage, insurance, labor market, and education has been investigated by researchers in economics and human sciences. With the advent of automatic decision support systems, such as credit scoring systems, the ease of data collection opens several challenges to data analysts for the fight against discrimination. In this article, we introduce the problem of discovering discrimination through data mining in a dataset of historical decision records, taken by humans or by automatic systems. We formalize the processes of direct and indirect discrimination discovery by modelling protected-by-law groups and contexts where discrimination occurs in a classification rule based syntax. Basically, classification rules extracted from the dataset allow for unveiling contexts of unlawful discrimination, where the degree of burden over protected-by-law groups is formalized by an extension of the lift measure of a classification rule. In direct discrimination, the extracted rules can be directly mined in search of discriminatory contexts. In indirect discrimination, the mining process needs some background knowledge as a further input, for example, census data, that combined with the extracted rules might allow for unveiling contexts of discriminatory decisions. A strategy adopted for combining extracted classification rules with background knowledge is called an inference model. In this article, we propose two inference models and provide automatic procedures for their implementation. An empirical assessment of our results is provided on the German credit dataset and on the PKDD Discovery Challenge 1999 financial dataset.<\/jats:p>","DOI":"10.1145\/1754428.1754432","type":"journal-article","created":{"date-parts":[[2010,6,1]],"date-time":"2010-06-01T12:21:35Z","timestamp":1275394895000},"page":"1-40","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":100,"title":["Data mining for discrimination discovery"],"prefix":"10.1145","volume":"4","author":[{"given":"Salvatore","family":"Ruggieri","sequence":"first","affiliation":[{"name":"Universit\u00e0 di Pisa, Pisa, Italy"}]},{"given":"Dino","family":"Pedreschi","sequence":"additional","affiliation":[{"name":"Universit\u00e0 di Pisa, Pisa, Italy"}]},{"given":"Franco","family":"Turini","sequence":"additional","affiliation":[{"name":"Universit\u00e0 di Pisa, Pisa, Italy"}]}],"member":"320","published-online":{"date-parts":[[2010,5,28]]},"reference":[{"key":"e_1_2_2_1_1","volume-title":"Proceedings of the International Conference on Very Large Databases. Morgan Kaufmann, 487--499","author":"Agrawal R."},{"key":"e_1_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/342009.335438"},{"key":"e_1_2_2_3_1","unstructured":"Australian Legislation. 2009. (a) Equal Opportunity Act\u2014Victoria State (b) Anti-Discrimination Act\u2014Queensland State. http:\/\/www.austlii.edu.au. Australian Legislation. 2009. (a) Equal Opportunity Act\u2014Victoria State (b) Anti-Discrimination Act\u2014Queensland State. http:\/\/www.austlii.edu.au."},{"key":"e_1_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1057\/palgrave.jors.2601545"},{"key":"e_1_2_2_5_1","unstructured":"Becker G. S. 1957. The Economics of Discrimination. University of Chicago Press. Becker G. S. 1957. The Economics of Discrimination. University of Chicago Press."},{"key":"e_1_2_2_6_1","volume-title":"PKDD 1999 discovery challenge. http:\/\/lisp.vse.cz\/challenge.","author":"Berka P.","year":"1999"},{"key":"e_1_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2006.09.003"},{"key":"e_1_2_2_8_1","unstructured":"Clifton C. 2003. Privacy preserving data mining: How do we mine data when we aren't allowed to see it&quest; In Procedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining Tutorial. http:\/\/www.cs.purdue.edu\/homes\/clifton. Clifton C. 2003. Privacy preserving data mining: How do we mine data when we aren't allowed to see it&quest; In Procedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining Tutorial. http:\/\/www.cs.purdue.edu\/homes\/clifton."},{"key":"e_1_2_2_9_1","unstructured":"European Union Legislation. 2009. (a) Racial Equality Directive (b) Employment Equality Directive. http:\/\/ec.europa.eu\/employment_social\/fundamental_rights. European Union Legislation. 2009. (a) Racial Equality Directive (b) Employment Equality Directive. http:\/\/ec.europa.eu\/employment_social\/fundamental_rights."},{"key":"e_1_2_2_10_1","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1080\/00031305.1992.10475851","article-title":"Statistical reasoning in the legal setting","volume":"46","author":"Gastwirth J. L.","year":"1992","journal-title":"Amer. Statist."},{"key":"e_1_2_2_11_1","doi-asserted-by":"crossref","unstructured":"Goethals B. 2009. Frequent itemset mining implementations repository. http:\/\/fimi.cs.helsinki.fi. Goethals B. 2009. Frequent itemset mining implementations repository. http:\/\/fimi.cs.helsinki.fi.","DOI":"10.1007\/978-0-387-09823-4_16"},{"key":"e_1_2_2_12_1","doi-asserted-by":"publisher","DOI":"10.1093\/imaman\/12.2.139"},{"key":"e_1_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1111\/j.1467-985X.1997.00078.x"},{"key":"e_1_2_2_14_1","unstructured":"Harford T. 2008. Logic of Life. The Random House. Harford T. 2008. Logic of Life. The Random House."},{"key":"e_1_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2005.140"},{"key":"e_1_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.1177\/001979390405700206"},{"key":"e_1_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1002\/pam.20181"},{"key":"e_1_2_2_18_1","unstructured":"Hunter R. 1992. Indirect Discrimination in the Workplace. The Federation Press. Hunter R. 1992. Indirect Discrimination in the Workplace. The Federation Press."},{"key":"e_1_2_2_19_1","volume-title":"Proceedings of the IEEE International Conference on Computer, Control &amp; Communication. IEEE Press.","author":"Kamiran F."},{"key":"e_1_2_2_20_1","unstructured":"Kaye D. and Aickin M. Eds. 1992. Statistical Methods in Discrimination Litigation. Marcel Dekker Inc. Kaye D. and Aickin M. Eds. 1992. Statistical Methods in Discrimination Litigation. Marcel Dekker Inc."},{"key":"e_1_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.2307\/3550667"},{"key":"e_1_2_2_22_1","unstructured":"Knuth D. 1997. Fundamental Algorithms. Addison-Wesley. Knuth D. 1997. Fundamental Algorithms. Addison-Wesley."},{"key":"e_1_2_2_23_1","first-page":"567","article-title":"Sex discrimination in labor markets: The role of statistical evidence","volume":"77","author":"Kuhn P.","year":"1987","journal-title":"Amer. Econ. Rev."},{"key":"e_1_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1008616203852"},{"key":"e_1_2_2_25_1","volume-title":"Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. AAAI Press, 80--86","author":"Liu B."},{"key":"e_1_2_2_26_1","unstructured":"Liu K. 2009. Privacy preserving data mining bibliography. http:\/\/www.csee.umbc.edu\/~kunliu1\/research\/privacy_review.html. Liu K. 2009. Privacy preserving data mining bibliography. http:\/\/www.csee.umbc.edu\/~kunliu1\/research\/privacy_review.html."},{"key":"e_1_2_2_27_1","unstructured":"Makkonen T. 2007. Measuring discrimination: Data collection and the EU equality law. http:\/\/ec.europa.eu\/employment_social\/fundamental_rights. Makkonen T. 2007. Measuring discrimination: Data collection and the EU equality law. http:\/\/ec.europa.eu\/employment_social\/fundamental_rights."},{"key":"e_1_2_2_28_1","unstructured":"Newman D. Hettich S. Blake C. and Merz C. 1998. UCI repository of machine learning databases. http:\/\/archive.ics.uci.edu\/ml. Newman D. Hettich S. Blake C. and Merz C. 1998. UCI repository of machine learning databases. http:\/\/archive.ics.uci.edu\/ml."},{"key":"e_1_2_2_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/1401890.1401959"},{"key":"e_1_2_2_30_1","volume-title":"Proceedings of the SIAM International Conference on Data Mining. SIAM, 581--592","author":"Pedreschi D."},{"key":"e_1_2_2_31_1","doi-asserted-by":"publisher","DOI":"10.5085\/0898-5510-12.1.43"},{"key":"e_1_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1023\/B:APIN.0000047380.15356.7a"},{"key":"e_1_2_2_33_1","volume-title":"Proceedings of the INAP","author":"Rauch J.","year":"2001"},{"key":"e_1_2_2_34_1","unstructured":"Rauch J. and Simunek M. 2009. 4-ft Miner Procedure. http:\/\/lispminer.vse.cz. Rauch J. and Simunek M. 2009. 4-ft Miner Procedure. http:\/\/lispminer.vse.cz."},{"key":"e_1_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1111\/1468-0297.00080"},{"key":"e_1_2_2_36_1","doi-asserted-by":"publisher","DOI":"10.1111\/1467-9906.t01-1-00168"},{"key":"e_1_2_2_37_1","volume-title":"Proceedings of the International Conference on Very Large Databases. Morgan Kaufmann, 407--419","author":"Srikant R."},{"key":"e_1_2_2_38_1","unstructured":"Sweeney L. 2001. Computational disclosure control: A primer on data privacy protection. Ph.D. thesis MIT Cambridge MA. Sweeney L. 2001. Computational disclosure control: A primer on data privacy protection. Ph.D. thesis MIT Cambridge MA."},{"key":"e_1_2_2_39_1","doi-asserted-by":"publisher","DOI":"10.1142\/S021848850200165X"},{"key":"e_1_2_2_40_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0306-4379(03)00072-3"},{"key":"e_1_2_2_41_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0169-2070(00)00034-0"},{"key":"e_1_2_2_42_1","unstructured":"U.K. Legislation. 2009. (a) Sex Discrimination Act (b) Race Relation Act. http:\/\/www.statutelaw.gov.uk. U.K. Legislation. 2009. (a) Sex Discrimination Act (b) Race Relation Act. http:\/\/www.statutelaw.gov.uk."},{"key":"e_1_2_2_43_1","unstructured":"U.S. Federal Legislation. 2009. (a) Equal Credit Opportunity Act (b) Fair Housing Act (c) Intentional Employment Discrimination (d) Equal Pay Act (e) Pregnancy Discrimination Act. http:\/\/www.usdoj.gov. U.S. Federal Legislation. 2009. (a) Equal Credit Opportunity Act (b) Fair Housing Act (c) Intentional Employment Discrimination (d) Equal Pay Act (e) Pregnancy Discrimination Act. http:\/\/www.usdoj.gov."},{"key":"e_1_2_2_44_1","doi-asserted-by":"crossref","unstructured":"Vaidya J. Clifton C. W. and Zhu Y. M. 2006. Privacy Preserving Data Mining. Advances in Information Security. Springer. Vaidya J. Clifton C. W. and Zhu Y. M. 2006. Privacy Preserving Data Mining. Advances in Information Security. Springer.","DOI":"10.1007\/11362197_11"},{"key":"e_1_2_2_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2004.1269668"},{"key":"e_1_2_2_46_1","doi-asserted-by":"publisher","DOI":"10.1111\/1539-6975.00023"},{"key":"e_1_2_2_47_1","first-page":"152","article-title":"Credit scoring methods","volume":"56","author":"Vojtek M.","year":"2006","journal-title":"J. Econ. Finance"},{"key":"e_1_2_2_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2005.142"},{"key":"e_1_2_2_49_1","doi-asserted-by":"publisher","DOI":"10.1145\/1010614.1010616"},{"key":"e_1_2_2_50_1","volume-title":"CPAR: Classification based on Predictive Association Rules. In Proceedings of the SIAM International Conference on Data Mining","author":"Yin X.","year":"2003"}],"container-title":["ACM Transactions on Knowledge Discovery from Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1754428.1754432","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1754428.1754432","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T11:22:50Z","timestamp":1750245770000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1754428.1754432"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,5]]},"references-count":50,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2010,5]]}},"alternative-id":["10.1145\/1754428.1754432"],"URL":"https:\/\/doi.org\/10.1145\/1754428.1754432","relation":{},"ISSN":["1556-4681","1556-472X"],"issn-type":[{"value":"1556-4681","type":"print"},{"value":"1556-472X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2010,5]]},"assertion":[{"value":"2008-07-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2009-08-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2010-05-28","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}