{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,11]],"date-time":"2026-03-11T17:03:19Z","timestamp":1773248599006,"version":"3.50.1"},"reference-count":54,"publisher":"Association for Computing Machinery (ACM)","issue":"3","license":[{"start":{"date-parts":[[2023,12,9]],"date-time":"2023-12-09T00:00:00Z","timestamp":1702080000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100004837","name":"Ministry of Science and Innovation","doi-asserted-by":"crossref","award":["RTI2018-095094-B-C21, and RTI2018-095094-B-C22"],"award-info":[{"award-number":["RTI2018-095094-B-C21, and RTI2018-095094-B-C22"]}],"id":[{"id":"10.13039\/501100004837","id-type":"DOI","asserted-by":"crossref"}]},{"name":"CONSENT","award":["PID2021-125962OB-C31"],"award-info":[{"award-number":["PID2021-125962OB-C31"]}]},{"name":"SECURING"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Knowl. Discov. Data"],"published-print":{"date-parts":[[2024,4,30]]},"abstract":"<jats:p>\n            Privacy protection for personal data and fairness in automated decisions are fundamental requirements for responsible Machine Learning. Both may be enforced through data preprocessing and share a common target: data should remain useful for a task, while becoming uninformative of the sensitive information. The intrinsic connection between privacy and fairness implies that modifications performed to guarantee one of these goals, may have an effect on the other, e.g., hiding a sensitive attribute from a classification algorithm might prevent a biased decision rule having such attribute as a criterion. This work resides at the intersection of algorithmic fairness and privacy. We show how the two goals are compatible, and may be simultaneously achieved, with a small loss in predictive performance. Our results are competitive with both state-of-the-art fairness correcting algorithms and hybrid privacy-fairness methods. Experiments were performed on three widely used benchmark datasets:\n            <jats:italic>Adult Income<\/jats:italic>\n            ,\n            <jats:italic>COMPAS,<\/jats:italic>\n            and\n            <jats:italic>German Credit<\/jats:italic>\n            .\n          <\/jats:p>","DOI":"10.1145\/3617377","type":"journal-article","created":{"date-parts":[[2023,8,24]],"date-time":"2023-08-24T12:31:22Z","timestamp":1692880282000},"page":"1-24","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["Fair and Private Data Preprocessing through Microaggregation"],"prefix":"10.1145","volume":"18","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-1840-9205","authenticated-orcid":false,"given":"Vladimiro","family":"Gonz\u00e1lez-Zelaya","sequence":"first","affiliation":[{"name":"Universidad Panamericana, Mexico"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1787-0654","authenticated-orcid":false,"given":"Juli\u00e1n","family":"Salas","sequence":"additional","affiliation":[{"name":"Universitat Oberta de Catalunya (UOC), Spain"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0507-7731","authenticated-orcid":false,"given":"David","family":"Meg\u00edas","sequence":"additional","affiliation":[{"name":"Universitat Oberta de Catalunya (UOC), Spain"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0978-2446","authenticated-orcid":false,"given":"Paolo","family":"Missier","sequence":"additional","affiliation":[{"name":"Newcastle University, UK"}]}],"member":"320","published-online":{"date-parts":[[2023,12,9]]},"reference":[{"key":"e_1_3_3_2_2","unstructured":"Alekh Agarwal Alina Beygelzimer Miroslav Dud\u00edk John Langford and Hanna Wallach. 2018. A reductions approach to fair classification. arXiv:1803.02453. Retrieved from https:\/\/arxiv.org\/abs\/1803.02453"},{"key":"e_1_3_3_3_2","first-page":"405","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Backurs Arturs","year":"2019","unstructured":"Arturs Backurs, Piotr Indyk, Krzysztof Onak, Baruch Schieber, Ali Vakilian, and Tal Wagner. 2019. Scalable fair clustering. In Proceedings of the International Conference on Machine Learning. 405\u2013413."},{"key":"e_1_3_3_4_2","first-page":"15479","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","author":"Bagdasaryan Eugene","year":"2019","unstructured":"Eugene Bagdasaryan, Omid Poursaeed, and Vitaly Shmatikov. 2019. Differential privacy has disparate impact on model accuracy. In Proceedings of the Advances in Neural Information Processing Systems. 15479\u201315488."},{"key":"e_1_3_3_5_2","first-page":"671","article-title":"Big data\u2019s disparate impact","volume":"104","author":"Barocas Solon","year":"2016","unstructured":"Solon Barocas and Andrew D. Selbst. 2016. Big data\u2019s disparate impact. California Law Review 104 (2016), 671.","journal-title":"California Law Review"},{"key":"e_1_3_3_6_2","article-title":"Fairness in criminal justice risk assessments: The state-of-the-art","author":"Berk Richard","year":"2018","unstructured":"Richard Berk, Hoda Heidari, Shahin Jabbari, Michael Kearns, and Aaron Roth. 2018. Fairness in criminal justice risk assessments: The state-of-the-art. Sociological Methods and Research (2018).","journal-title":"Sociological Methods and Research"},{"key":"e_1_3_3_7_2","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1097-4571(199401)45:1<12::AID-ASI2>3.0.CO;2-L"},{"key":"e_1_3_3_8_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10618-010-0190-x"},{"key":"e_1_3_3_9_2","doi-asserted-by":"publisher","DOI":"10.1145\/3287560.3287586"},{"key":"e_1_3_3_10_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-39804-2_12"},{"key":"e_1_3_3_11_2","doi-asserted-by":"publisher","DOI":"10.5555\/3295222.3295256"},{"key":"e_1_3_3_12_2","doi-asserted-by":"publisher","DOI":"10.1089\/big.2016.0047"},{"key":"e_1_3_3_13_2","unstructured":"Alexandra Chouldechova and Aaron Roth. 2018. The frontiers of fairness in machine learning. arXiv:1810.08810. Retrieved from https:\/\/arxiv.org\/abs\/1810.08810"},{"key":"e_1_3_3_14_2","doi-asserted-by":"publisher","DOI":"10.1145\/3314183.3323847"},{"key":"e_1_3_3_15_2","unstructured":"Anupam Datta Shayak Sen and Michael Carl Tschantz. 2018. Correspondences between privacy and nondiscrimination: Why they should be studied together. arXiv:1808.01735. Retrieved from https:\/\/arxiv.org\/abs\/1808.01735"},{"key":"e_1_3_3_16_2","unstructured":"Sci-kit Learn Developers. 2019. scikit-learn: machine learning in Python. (2019)."},{"key":"e_1_3_3_17_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10618-005-0007-5"},{"key":"e_1_3_3_18_2","unstructured":"Dheeru Dua and Casey Graff. 2017. UCI Machine Learning Repository. (2017). Retrieved from http:\/\/archive.ics.uci.edu\/ml"},{"key":"e_1_3_3_19_2","doi-asserted-by":"publisher","DOI":"10.1145\/2090236.2090255"},{"key":"e_1_3_3_20_2","doi-asserted-by":"publisher","DOI":"10.1561\/0400000042"},{"key":"e_1_3_3_21_2","series-title":"Proceedings of the 1st Conference on Fairness, Accountability and Transparency.","first-page":"35","volume":"81","author":"Ekstrand Michael D.","year":"2018","unstructured":"Michael D. Ekstrand, Rezvan Joshaghani, and Hoda Mehrpouyan. 2018. Privacy for All: Ensuring fair and equitable privacy protections. In Proceedings of the 1st Conference on Fairness, Accountability and Transparency.Sorelle A. Friedler and Christo Wilson (Eds.), Proceedings of Machine Learning Research, Vol. 81. PMLR, 35\u201347."},{"key":"e_1_3_3_22_2","unstructured":"European Parliament and Council of the European Union. 2016. Regulation (EU) 2016\/679 of the European Parliament and of the Council. (2016). Retrieved from https:\/\/data.europa.eu\/eli\/reg\/2016\/679\/oj"},{"key":"e_1_3_3_23_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE48307.2020.00203"},{"key":"e_1_3_3_24_2","unstructured":"Peter I. Frazier. 2018. A tutorial on Bayesian optimization. arXiv:1807.02811. Retrieved from https:\/\/arxiv.org\/abs\/1807.02811"},{"key":"e_1_3_3_25_2","doi-asserted-by":"publisher","DOI":"10.1145\/3287560.3287589"},{"key":"e_1_3_3_26_2","unstructured":"Pratik Gajane and Mykola Pechenizkiy. 2017. On formalizing fairness in prediction with machine learning. arXiv:1710.03184. Retrieved from https:\/\/arxiv.org\/abs\/1710.03184"},{"key":"e_1_3_3_27_2","first-page":"445","volume-title":"Proceedings of the EDBT","author":"Gonz\u00e1lez-Zelaya Vladimiro","year":"2021","unstructured":"Vladimiro Gonz\u00e1lez-Zelaya, Juli\u00e1n Salas, Dennis Prangle, and Paolo Missier. 2021. Optimising fairness through parametrised data sampling. In Proceedings of the EDBT. 445\u2013450."},{"key":"e_1_3_3_28_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10618-014-0393-7"},{"key":"e_1_3_3_29_2","first-page":"3315","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","author":"Hardt Moritz","year":"2016","unstructured":"Moritz Hardt, Eric Price, Nati Srebro, et\u00a0al. 2016. Equality of opportunity in supervised learning. In Proceedings of the Advances in Neural Information Processing Systems. 3315\u20133323."},{"key":"e_1_3_3_30_2","doi-asserted-by":"publisher","DOI":"10.1145\/3357384.3357974"},{"key":"e_1_3_3_31_2","first-page":"3000","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Jagielski Matthew","year":"2019","unstructured":"Matthew Jagielski, Michael Aaron Kearns, Saeed Sharifi-Malvajerdi, Jieming Mao, Alina Oprea, Aaron Roth, and Jonathan Ullman. 2019. Differentially private fair learning. In Proceedings of the International Conference on Machine Learning. PMLR, 3000\u20133008."},{"key":"e_1_3_3_32_2","first-page":"1","volume-title":"Proceedings of the 19th Machine Learning Conf. Belgium and The Netherlands","author":"Kamiran Faisal","year":"2010","unstructured":"Faisal Kamiran and Toon Calders. 2010. Classification with no discrimination by preferential sampling. In Proceedings of the 19th Machine Learning Conf. Belgium and The Netherlands. 1\u20136."},{"key":"e_1_3_3_33_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10115-011-0463-8"},{"key":"e_1_3_3_34_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-33486-3_3"},{"key":"e_1_3_3_35_2","first-page":"656","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","author":"Kilbertus Niki","year":"2017","unstructured":"Niki Kilbertus, Mateo Rojas Carulla, Giambattista Parascandolo, Moritz Hardt, Dominik Janzing, and Bernhard Sch\u00f6lkopf. 2017. Avoiding discrimination through causal reasoning. In Proceedings of the Advances in Neural Information Processing Systems. 656\u2013666."},{"key":"e_1_3_3_36_2","doi-asserted-by":"publisher","DOI":"10.1145\/3178876.3186133"},{"key":"e_1_3_3_37_2","first-page":"4066","volume-title":"Proceedings of the Advances in Neural Information Processing Systems","author":"Kusner Matt J.","year":"2017","unstructured":"Matt J. Kusner, Joshua Loftus, Chris Russell, and Ricardo Silva. 2017. Counterfactual fairness. In Proceedings of the Advances in Neural Information Processing Systems. 4066\u20134076."},{"key":"e_1_3_3_38_2","article-title":"How we analyzed the COMPAS recidivism algorithm","volume":"9","author":"Larson Jeff","year":"2016","unstructured":"Jeff Larson, Surya Mattu, Lauren Kirchner, and Julia Angwin. 2016. How we analyzed the COMPAS recidivism algorithm. ProPublica 9 (2016).","journal-title":"ProPublica"},{"key":"e_1_3_3_39_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2007.367856"},{"key":"e_1_3_3_40_2","doi-asserted-by":"publisher","DOI":"10.1002\/widm.1356"},{"key":"e_1_3_3_41_2","doi-asserted-by":"publisher","DOI":"10.1145\/3351095.3372872"},{"key":"e_1_3_3_42_2","first-page":"arXiv\u20132006","article-title":"A variational approach to privacy and fairness","author":"Rodr\u00edguez-G\u00e1lvez Borja","year":"2020","unstructured":"Borja Rodr\u00edguez-G\u00e1lvez, Ragnar Thobaben, and Mikael Skoglund. 2020. A variational approach to privacy and fairness. arXiv (2020), arXiv\u20132006.","journal-title":"arXiv"},{"key":"e_1_3_3_43_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.engappai.2020.103531"},{"key":"e_1_3_3_44_2","doi-asserted-by":"publisher","DOI":"10.2307\/2529685"},{"key":"e_1_3_3_45_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11786-018-0344-6"},{"key":"e_1_3_3_46_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-57524-3_24"},{"key":"e_1_3_3_47_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-00305-0_28"},{"key":"e_1_3_3_48_2","unstructured":"Babak Salimi Luke Rodriguez Bill Howe and Dan Suciu. 2019. Capuchin: Causal database repair for algorithmic fairness. arXiv:1902.08283. Retrieved from https:\/\/arxiv.org\/abs\/1902.08283"},{"key":"e_1_3_3_49_2","doi-asserted-by":"publisher","DOI":"10.1109\/69.971193"},{"key":"e_1_3_3_50_2","doi-asserted-by":"publisher","DOI":"10.1142\/S0218488502001648"},{"key":"e_1_3_3_51_2","doi-asserted-by":"publisher","DOI":"10.1145\/3308560.3317584"},{"key":"e_1_3_3_52_2","doi-asserted-by":"publisher","DOI":"10.1145\/3038912.3052660"},{"key":"e_1_3_3_53_2","doi-asserted-by":"publisher","DOI":"10.5555\/3322706.3362016"},{"key":"e_1_3_3_54_2","unstructured":"Muhammad Bilal Zafar Isabel Valera Manuel Gomez Rodriguez and Krishna P. Gummadi. 2015. Fairness constraints: Mechanisms for fair classification. arXiv:1507.05259. Retrieved from https:\/\/arxiv.org\/abs\/1507.05259"},{"key":"e_1_3_3_55_2","first-page":"325","volume-title":"Proceedings of the International Conference on Machine Learning","author":"Zemel Rich","year":"2013","unstructured":"Rich Zemel, Yu Wu, Kevin Swersky, Toni Pitassi, and Cynthia Dwork. 2013. Learning fair representations. In Proceedings of the International Conference on Machine Learning. 325\u2013333."}],"container-title":["ACM Transactions on Knowledge Discovery from Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3617377","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3617377","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:45:54Z","timestamp":1750178754000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3617377"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,12,9]]},"references-count":54,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2024,4,30]]}},"alternative-id":["10.1145\/3617377"],"URL":"https:\/\/doi.org\/10.1145\/3617377","relation":{},"ISSN":["1556-4681","1556-472X"],"issn-type":[{"value":"1556-4681","type":"print"},{"value":"1556-472X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,12,9]]},"assertion":[{"value":"2022-07-20","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-08-18","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-12-09","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}