{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,21]],"date-time":"2026-01-21T16:25:41Z","timestamp":1769012741178,"version":"3.49.0"},"reference-count":45,"publisher":"Springer Science and Business Media LLC","issue":"6","license":[{"start":{"date-parts":[[2025,5,21]],"date-time":"2025-05-21T00:00:00Z","timestamp":1747785600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,5,21]],"date-time":"2025-05-21T00:00:00Z","timestamp":1747785600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100018755","name":"Universit\u00e4t Trier","doi-asserted-by":"crossref","id":[{"id":"10.13039\/100018755","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Int J Data Sci Anal"],"published-print":{"date-parts":[[2025,11]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>In this paper, we propose , a  package providing a framework for fair classification in machine learning. In this framework, the fair learning process is divided into three stages. Each stage aims to reduce unfairness, such as disparate impact and disparate mistreatment, in the final prediction. For the pre-processing stage, we present a resampling method that addresses unfairness coming from data imbalances. The in-processing phase consists of a classification method. This can be either one coming from the  package, or a user-defined one. For this phase, we incorporate fair ML methods that can handle unfairness to a certain degree through their optimization process. In the post-processing, we discuss the choice of the cutoff value for fair prediction. With simulations, we show the performance of the single phases and their combinations.<\/jats:p>","DOI":"10.1007\/s41060-025-00793-0","type":"journal-article","created":{"date-parts":[[2025,5,21]],"date-time":"2025-05-21T10:01:55Z","timestamp":1747821715000},"page":"5703-5718","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["FairML: a Julia package for fair classification"],"prefix":"10.1007","volume":"20","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5771-6179","authenticated-orcid":false,"given":"Jan Pablo","family":"Burgard","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0002-5862-3465","authenticated-orcid":false,"given":"Jo\u00e3o Vitor","family":"Pamplona","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2025,5,21]]},"reference":[{"key":"793_CR1","doi-asserted-by":"crossref","unstructured":"Aghaei, S., Azizi, M.J., Vayanos, P.: Learning optimal and fair decision trees for non-discriminative decision-making. CoRR arXiv:1903.10598 (2019)","DOI":"10.1609\/aaai.v33i01.33011418"},{"key":"793_CR2","doi-asserted-by":"publisher","unstructured":"Agrawal, A., Chen, J., Vollmer, S., Blaom, A.: ashryaagr\/Fairness.jl. https:\/\/doi.org\/10.5281\/zenodo.3977197","DOI":"10.5281\/zenodo.3977197"},{"key":"793_CR3","doi-asserted-by":"publisher","DOI":"10.3389\/fpsyg.2021.666182","author":"R Bono","year":"2021","unstructured":"Bono, R., Alarc\u00f3n, R., Blanca, M.J.: Report quality of generalized linear mixed models in psychology: a systematic review. Front. Psychol. (2021). https:\/\/doi.org\/10.3389\/fpsyg.2021.666182","journal-title":"Front. Psychol."},{"issue":"1","key":"793_CR4","doi-asserted-by":"publisher","first-page":"65","DOI":"10.1137\/14100067","volume":"59","author":"J Bezanson","year":"2017","unstructured":"Bezanson, J., Edelman, A., Karpinski, S., Shah, V.B.: Julia: a fresh approach to numerical computing. SIAM Rev. 59(1), 65\u201398 (2017). https:\/\/doi.org\/10.1137\/14100067","journal-title":"SIAM Rev."},{"key":"793_CR5","unstructured":"Berman, E., Ginesin, J.: The state of julia for scientific machine learning. arXiv preprint arXiv:2410.10908 (2024)"},{"key":"793_CR6","unstructured":"Berk, R., Heidari, H., Jabbari, S., Joseph, M., Kearns, M., Morgenstern, J., Neel, S., Roth, A.: A convex framework for fair regression. arXiv preprint arXiv:1706.02409 (2017)"},{"key":"793_CR7","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s12859-015-0784-9","volume":"16","author":"R Blagus","year":"2015","unstructured":"Blagus, R., Lusa, L.: Joint use of over-and under-sampling techniques and cross-validation for the development and assessment of prediction models. BMC Bioinform. 16, 1\u201310 (2015)","journal-title":"BMC Bioinform."},{"key":"793_CR8","unstructured":"Burgard, J.P., Pamplona, J.V.: Fair generalized linear mixed models (2024). arXiv preprint arXiv:2405.09273"},{"key":"793_CR9","unstructured":"Burgard, J.P., Pamplona, J.V.: Fair mixed effects support vector machine (2024). arXiv:2405.06433"},{"issue":"16","key":"793_CR10","doi-asserted-by":"publisher","first-page":"1","DOI":"10.18637\/jss.v098.i16","volume":"98","author":"M Besan\u00e7on","year":"2021","unstructured":"Besan\u00e7on, M., Papamarkou, T., Anthoff, D., Arslan, A., Byrne, S., Lin, D., Pearson, J.: Distributions.jl: definition and modeling of probability distributions in the juliastats ecosystem. J. Stat. Softw. 98(16), 1\u201330 (2021). https:\/\/doi.org\/10.18637\/jss.v098.i16","journal-title":"J. Stat. Softw."},{"issue":"3","key":"793_CR11","doi-asserted-by":"publisher","first-page":"317","DOI":"10.1093\/icvts\/ivy163","volume":"27","author":"F Barili","year":"2018","unstructured":"Barili, F., Parolari, A., Kappetein, P.A., Freemantle, N.: Statistical primer: heterogeneity, random-or fixed-effects model analyses. Interact. Cardiovasc. Thorac. Surg. 27(3), 317\u2013321 (2018)","journal-title":"Interact. Cardiovasc. Thorac. Surg."},{"issue":"1","key":"793_CR12","doi-asserted-by":"publisher","first-page":"108","DOI":"10.1006\/jmps.1999.1279","volume":"44","author":"MW Browne","year":"2000","unstructured":"Browne, M.W.: Cross-validation methods. J. Math. Psychol. 44(1), 108\u2013132 (2000)","journal-title":"J. Math. Psychol."},{"key":"793_CR13","first-page":"671","volume":"104","author":"S Barocas","year":"2016","unstructured":"Barocas, S., Selbst, A.D.: Big data\u2019s disparate impact. Calif. L. Rev. 104, 671 (2016)","journal-title":"Calif. L. Rev."},{"issue":"4","key":"793_CR14","doi-asserted-by":"publisher","first-page":"1","DOI":"10.18637\/jss.v107.i04","volume":"107","author":"M Bouchet-Valat","year":"2023","unstructured":"Bouchet-Valat, M., Kami\u0144ski, B.: Dataframes.jl: flexible and fast tabular data in julia. J. Stat. Softw. 107(4), 1\u201332 (2023). https:\/\/doi.org\/10.18637\/jss.v107.i04","journal-title":"J. Stat. Softw."},{"key":"793_CR15","unstructured":"Cruz, A.F., Bel\u00e9m, C., Jesus, S., Bravo, J., Saleiro, P., Bizarro, P.: FairGBM: gradient boosting with fairness constraints (2023). arXiv:2209.07850"},{"key":"793_CR16","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0112653","volume":"9","author":"M Casals","year":"2014","unstructured":"Casals, M., Girabent-Farr\u00e9s, M., Carrasco, J.L.: Methodological quality and reporting of generalized linear mixed models in clinical medicine (2000\u20132012): a systematic review. PLoS ONE 9, e112653 (2014)","journal-title":"PLoS ONE"},{"key":"793_CR17","unstructured":"Caton, S., Haas, C.: Fairness in machine learning: a survey. ACM Comput. Surv. arXiv preprint arXiv:2010.04053 (2020)"},{"key":"793_CR18","doi-asserted-by":"publisher","unstructured":"Christ, S., Schwabeneder, D., Rackauckas, C., Borregaard, M.K., Breloff, T.: Plots.jl: a user extendable plotting api for the julia programming language (2023). https:\/\/doi.org\/10.5334\/jors.431","DOI":"10.5334\/jors.431"},{"issue":"3","key":"793_CR19","doi-asserted-by":"publisher","first-page":"453","DOI":"10.1017\/S1368980012002911","volume":"16","author":"KC Cheong","year":"2013","unstructured":"Cheong, K.C., Yusoff, A.F., Ghazali, S.M., Lim, K.H., Selvarajah, S., Haniff, J., Khor, G.L., Shahar, S., Abd, R.J., Zainuddin, A.A., et al.: Optimal bmi cut-off values for predicting diabetes, hypertension and hypercholesterolaemia in a multi-ethnic population. Public Health Nutr. 16(3), 453\u2013459 (2013)","journal-title":"Public Health Nutr."},{"key":"793_CR20","doi-asserted-by":"crossref","unstructured":"Das, S., Donini, M., Gelman, J., Haas, K., Hardt, M., Katzman, J., Kenthapadi, K., Larroy, P., Yilmaz, P., Zafar, M.B.: Fairness measures for machine learning in finance. J. Financ. Data Sci. (2021)","DOI":"10.3905\/jfds.2021.1.075"},{"key":"793_CR21","unstructured":"Do, H., Putzel, P., Martin, A.S., Smyth, P., Zhong, J.: Fair generalized linear models with a convex penalty. In: International Conference on Machine Learning, pp. 5286\u20135308. PMLR (2022)"},{"key":"793_CR22","volume-title":"Permutation Tests: A Practical Guide to Resampling Methods for Testing Hypotheses","author":"P Good","year":"2013","unstructured":"Good, P.: Permutation Tests: A Practical Guide to Resampling Methods for Testing Hypotheses. Springer, New York (2013)"},{"key":"793_CR23","unstructured":"Green, B.: Fair risk assessments: a precarious approach for criminal justice reform. In: 5th Workshop on Fairness, Accountability, and Transparency in Machine Learning, pp. 1\u20135 (2018)"},{"key":"793_CR24","unstructured":"Hsieh, C.J., Chang, K., Lin, C.J.: A dual coordinate descent method for large-scale linear svm. In: Proceedings of the Twenty-fifth International Conference on Machine Learning, pp. 1369\u20131398 (2008)"},{"issue":"4","key":"793_CR25","doi-asserted-by":"publisher","first-page":"18","DOI":"10.1109\/5254.708428","volume":"13","author":"MA Hearst","year":"1998","unstructured":"Hearst, M.A., Dumais, S.T., Osuna, E., Platt, J., Scholkopf, B.: Support vector machines. IEEE Intell. Syst. Appl. 13(4), 18\u201328 (1998)","journal-title":"IEEE Intell. Syst. Appl."},{"key":"793_CR26","unstructured":"Hardt, M., Price, E., Srebro, N.: Equality of opportunity in supervised learning. Advances in neural information processing systems 29. arXiv preprint arXiv:1610.02413 (2016)"},{"key":"793_CR27","unstructured":"Jesus, S., Saleiro, P., Jorge, B.M., Ribeiro, R.P., Gama, J., Bizarro, P., Ghani, R., et al.: Aequitas flow: streamlining fair ml experimentation. arXiv preprint arXiv:2405.05809 (2024)"},{"key":"793_CR28","doi-asserted-by":"publisher","DOI":"10.1007\/s12532-023-00239-3","author":"M Lubin","year":"2023","unstructured":"Lubin, M., Dowson, O., Garcia, J.D., Huchette, J., Legat, B., Vielma, J.P.: JuMP 1.0: recent improvements to a modeling language for mathematical optimization. Math. Program. Comput. (2023). https:\/\/doi.org\/10.1007\/s12532-023-00239-3","journal-title":"Math. Program. Comput."},{"key":"793_CR29","volume-title":"Sampling: Design and Analysis","author":"SL Lohr","year":"2009","unstructured":"Lohr, S.L.: Sampling: Design and Analysis, 2nd edn. Brooks\/Cole, Florence (2009)","edition":"2"},{"key":"793_CR30","doi-asserted-by":"crossref","unstructured":"Mohammed, R., Rawashdeh, J., Abdullah, M.: Machine learning with oversampling and undersampling techniques: overview study and experimental results. In: 2020 11th International Conference on Information and Communication Systems (ICICS), pp. 243\u2013248. IEEE (2020)","DOI":"10.1109\/ICICS49469.2020.239556"},{"key":"793_CR31","unstructured":"Menon, A.K., Williamson, R.C.: The cost of fairness in binary classification. In: Conference on Fairness, Accountability and Transparency, pp. 107\u2013118. PMLR (2018)"},{"key":"793_CR32","volume-title":"Mp Applied Linear Regression Models-Revised Edition with Student cd","author":"DJ Neter","year":"2004","unstructured":"Neter, D.J., Kutner, M.H., Nachtsheim, C.J.: Mp Applied Linear Regression Models-Revised Edition with Student cd. McGraw-Hill Education, New York (2004)"},{"key":"793_CR33","unstructured":"Olfat, M., Aswani, A.: Spectral algorithms for computing fair support vector machines. CoRR arXiv:1710.05895 (2017)"},{"key":"793_CR34","doi-asserted-by":"publisher","unstructured":"Radovanovi\u0107, S., Petrovi\u0107, A., Deliba\u0161i\u0107, B., Suknovi\u0107, M.: Enforcing fairness in logistic regression algorithm. In: 2020 International Conference on INnovations in Intelligent SysTems and Applications (INISTA), pp. 1\u20137 (2020). https:\/\/doi.org\/10.1109\/INISTA49547.2020.9194676","DOI":"10.1109\/INISTA49547.2020.9194676"},{"issue":"3","key":"793_CR35","doi-asserted-by":"publisher","first-page":"0148140","DOI":"10.1371\/journal.pone.0148140","volume":"11","author":"Q Ren","year":"2016","unstructured":"Ren, Q., Su, C., Wang, H., Wang, Z., Du, W., Zhang, B.: Prospective study of optimal obesity index cut-off values for predicting incidence of hypertension in 18\u201365-year-old Chinese adults. PLoS ONE 11(3), 0148140 (2016)","journal-title":"PLoS ONE"},{"key":"793_CR36","unstructured":"Scutari, M.: fairml: a statistician\u2019s take on fair machine learning modelling (2023). arXiv:2305.02009"},{"issue":"6","key":"793_CR37","first-page":"937","volume":"25","author":"V Vapnik","year":"1964","unstructured":"Vapnik, V., Chervonenkis, A.Y.: A class of algorithms for pattern recognition learning. Avtomat. i Telemekh 25(6), 937\u2013945 (1964)","journal-title":"Avtomat. i Telemekh"},{"key":"793_CR38","doi-asserted-by":"publisher","first-page":"25","DOI":"10.1007\/s10107-004-0559-y","volume":"106","author":"A W\u00e4chter","year":"2006","unstructured":"W\u00e4chter, A., Biegler, L.T.: On the implementation of an interior-point filter line-search algorithm for large-scale nonlinear programming. Math. Program. 106, 25\u201357 (2006)","journal-title":"Math. Program."},{"issue":"2","key":"793_CR39","doi-asserted-by":"publisher","first-page":"100","DOI":"10.1038\/ng.2876","volume":"46","author":"J Yang","year":"2014","unstructured":"Yang, J., Zaitlen, N.A., Goddard, M.E., Visscher, P.M., Price, A.L.: Advantages and pitfalls in the application of mixed-model association methods. Nat. Genet. 46(2), 100\u2013106 (2014)","journal-title":"Nat. Genet."},{"key":"793_CR40","doi-asserted-by":"crossref","unstructured":"Zhang, W., Bifet, A., Zhang, X., Weiss, J.C., Nejdl, W.: FARF: a fair and adaptive random forests classifier. CoRR arXiv:2108.07403 (2021)","DOI":"10.1007\/978-3-030-75765-6_20"},{"issue":"57","key":"793_CR41","first-page":"1","volume":"23","author":"H Zhao","year":"2022","unstructured":"Zhao, H., Gordon, G.J.: Inherent tradeoffs in learning fair representations. J. Mach. Learn. Res. 23(57), 1\u201326 (2022)","journal-title":"J. Mach. Learn. Res."},{"issue":"3","key":"793_CR42","doi-asserted-by":"publisher","first-page":"8756","DOI":"10.3182\/20140824-6-ZA-1003.00794","volume":"47","author":"A Zughrat","year":"2014","unstructured":"Zughrat, A., Mahfouf, M., Yang, Y.Y., Thornton, S.: Support vector machines for class imbalance rail data classification with bootstrapping-based over-sampling and under-sampling. IFAC Proc. Vol. 47(3), 8756\u20138761 (2014)","journal-title":"IFAC Proc. Vol."},{"key":"793_CR43","unstructured":"Zafar, M.B., Valera, I., Gomez-Rodriguez, M., Gummadi, K.P.: Fairness constraints: mechanisms for fair classification. In: Singh, A., Zhu, J. (eds.) Proceedings of the 20th International Conference on Artificial Intelligence and Statistics. Proceedings of Machine Learning Research, vol. 54, pp. 962\u2013970 (2017)"},{"issue":"75","key":"793_CR44","first-page":"1","volume":"20","author":"MB Zafar","year":"2019","unstructured":"Zafar, M.B., Valera, I., Gomez-Rodriguez, M., Gummadi, K.P.: Fairness constraints: a flexible approach for fair classification. J. Mach. Learn. Res. 20(75), 1\u201342 (2019)","journal-title":"J. Mach. Learn. Res."},{"key":"793_CR45","doi-asserted-by":"crossref","unstructured":"Zafar, M., Valera, I., Rodriguez, M., Gummadi, K.P.: Fairness beyond disparate treatment & disparate impact: learning classification without disparate mistreatment. Conference: Proceedings of the 26th international conference on world wide web (2016)","DOI":"10.1145\/3038912.3052660"}],"container-title":["International Journal of Data Science and Analytics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s41060-025-00793-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s41060-025-00793-0\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s41060-025-00793-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,9,27]],"date-time":"2025-09-27T12:17:10Z","timestamp":1758975430000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s41060-025-00793-0"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,5,21]]},"references-count":45,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2025,11]]}},"alternative-id":["793"],"URL":"https:\/\/doi.org\/10.1007\/s41060-025-00793-0","relation":{},"ISSN":["2364-415X","2364-4168"],"issn-type":[{"value":"2364-415X","type":"print"},{"value":"2364-4168","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,5,21]]},"assertion":[{"value":"12 December 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"15 April 2025","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"21 May 2025","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"All authors certify that they have no affiliations with or involvement in any organization or entity with any financial interest or nonfinancial interest in the subject matter or materials discussed in this manuscript.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval"}},{"value":"Not applicable.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Clinical trial number"}}]}}