{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,9]],"date-time":"2026-01-09T21:15:32Z","timestamp":1767993332530,"version":"3.49.0"},"reference-count":36,"publisher":"World Scientific Pub Co Pte Ltd","issue":"04n05","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Artif. Intell. Tools"],"published-print":{"date-parts":[[2025,8]]},"abstract":"<jats:p> Fraud detection through the classification of highly imbalanced Big Data is an exciting area of Machine Learning research. On the one hand, in certain fraud detection application domains, the use of One-Class classifiers is an overlooked opportunity. On the other hand, for researchers faced with the task of building Machine Learning models for identifying fraud, when only legitimate transaction data is available, One-Class Classifiers are indispensable. We investigate the efficacy of SHapley Additive exPlanations (SHAP) as a feature selection technique for One-Class classification tasks. In this study we utilize authentic data from the Credit Card fraud and Medicare insurance fraud application domains. Our contribution is to show that researchers can use SHAP in conjunction with One-Class Classifiers to do feature selection on highly imbalanced datasets, and then build models, with the selected features, that yield performance similar to, or better than, models built using all features. Our results in Big Medicare data fraud detection show that an over 90% data reduction through feature selection can nevertheless coincide with the best performance in terms of Area under the Precision Recall Curve. <\/jats:p>","DOI":"10.1142\/s0218213025400019","type":"journal-article","created":{"date-parts":[[2025,1,10]],"date-time":"2025-01-10T04:19:19Z","timestamp":1736482759000},"source":"Crossref","is-referenced-by-count":1,"title":["SHAP as a Data Reduction Technique for Highly Imbalanced Big Data"],"prefix":"10.1142","volume":"34","author":[{"suffix":"III","given":"John T.","family":"Hancock","sequence":"first","affiliation":[{"name":"College of Engineering and Computer Science, Florida Atlantic University, 777 Glades Road, Boca Raton, Florida 33431, USA"}]},{"given":"Richard A.","family":"Bauder","sequence":"additional","affiliation":[{"name":"College of Engineering and Computer Science, Florida Atlantic University, 777 Glades Road, Boca Raton, Florida 33431, USA"}]},{"given":"Taghi M.","family":"Khoshgoftaar","sequence":"additional","affiliation":[{"name":"College of Engineering and Computer Science, Florida Atlantic University, 777 Glades Road, Boca Raton, Florida 33431, USA"}]}],"member":"219","published-online":{"date-parts":[[2025,2,19]]},"reference":[{"key":"S0218213025400019BIB001","doi-asserted-by":"publisher","DOI":"10.1109\/ICPR.2006.595"},{"key":"S0218213025400019BIB002","first-page":"4765","volume":"30","author":"Lundberg S. M.","year":"2017","journal-title":"Advances in Neural Information Processing Systems"},{"key":"S0218213025400019BIB003","first-page":"451","volume-title":"Joint European Conference on Machine Learning and Knowledge Discovery in Databases","author":"Boyd K.","year":"2013"},{"key":"S0218213025400019BIB004","doi-asserted-by":"publisher","DOI":"10.1109\/ICISA.2014.6847442"},{"key":"S0218213025400019BIB005","doi-asserted-by":"publisher","DOI":"10.1109\/ICTAI59109.2023.00021"},{"key":"S0218213025400019BIB010","doi-asserted-by":"publisher","DOI":"10.1007\/s10742-016-0154-8"},{"key":"S0218213025400019BIB011","doi-asserted-by":"publisher","DOI":"10.1109\/IRI58017.2023.00028"},{"key":"S0218213025400019BIB012","doi-asserted-by":"publisher","DOI":"10.1016\/j.ifacol.2022.09.550"},{"key":"S0218213025400019BIB013","doi-asserted-by":"publisher","DOI":"10.5220\/0011665300003411"},{"key":"S0218213025400019BIB014","first-page":"432","volume-title":"28th International Workshop on Intelligent Computing in Engineering","author":"Abad\u00edaa J. J. P.","year":"2021"},{"key":"S0218213025400019BIB015","doi-asserted-by":"publisher","DOI":"10.1186\/s40537-023-00684-w"},{"key":"S0218213025400019BIB016","first-page":"52","volume":"6791","author":"Masci J.","year":"2011","journal-title":"Artificial Neural Networks and Machine Learning"},{"key":"S0218213025400019BIB017","doi-asserted-by":"publisher","DOI":"10.14569\/IJACSA.2019.0101101"},{"issue":"1","key":"S0218213025400019BIB018","first-page":"191","volume":"41","author":"Cessie S. Le","year":"1992","journal-title":"Journal of the Royal Statistical Society: Series C Applied Statistics"},{"key":"S0218213025400019BIB019","doi-asserted-by":"publisher","DOI":"10.1023\/A:1010933404324"},{"key":"S0218213025400019BIB020","doi-asserted-by":"publisher","DOI":"10.1145\/2939672.2939785"},{"key":"S0218213025400019BIB021","first-page":"3146","volume":"30","author":"Ke G.","year":"2017","journal-title":"Advances in Neural Information Processing Systems"},{"key":"S0218213025400019BIB022","doi-asserted-by":"publisher","DOI":"10.1007\/s10994-006-6226-1"},{"key":"S0218213025400019BIB023","first-page":"1","volume":"31","author":"Prokhorenkova L.","year":"2018","journal-title":"Advances in Neural Information Processing Systems"},{"key":"S0218213025400019BIB024","doi-asserted-by":"publisher","DOI":"10.1016\/0169-7439(87)80084-9"},{"key":"S0218213025400019BIB026","doi-asserted-by":"publisher","DOI":"10.1007\/s42979-023-01809-x"},{"key":"S0218213025400019BIB031","doi-asserted-by":"publisher","DOI":"10.1109\/IRI.2016.11"},{"key":"S0218213025400019BIB032","doi-asserted-by":"publisher","DOI":"10.1186\/s40537-021-00514-x"},{"key":"S0218213025400019BIB033","first-page":"2825","volume":"12","author":"Pedregosa F.","year":"2011","journal-title":"Journal of Machine Learning Research"},{"key":"S0218213025400019BIB034","doi-asserted-by":"publisher","DOI":"10.1111\/j.2517-6161.1977.tb01600.x"},{"key":"S0218213025400019BIB035","doi-asserted-by":"publisher","DOI":"10.1109\/IRI58017.2023.00060"},{"key":"S0218213025400019BIB036","doi-asserted-by":"publisher","DOI":"10.1186\/s40537-023-00724-5"},{"issue":"3","key":"S0218213025400019BIB037","first-page":"61","volume":"10","author":"Platt J.","year":"1999","journal-title":"Advances in Large Margin Classifiers"},{"key":"S0218213025400019BIB038","doi-asserted-by":"publisher","DOI":"10.1080\/00450618.2012.733025"},{"key":"S0218213025400019BIB039","doi-asserted-by":"publisher","DOI":"10.1214\/17-EJS1338SI"},{"key":"S0218213025400019BIB040","doi-asserted-by":"publisher","DOI":"10.1109\/5.726791"},{"key":"S0218213025400019BIB041","doi-asserted-by":"publisher","DOI":"10.1515\/9781400881970-018"},{"key":"S0218213025400019BIB043","doi-asserted-by":"publisher","DOI":"10.1109\/ICTAI56018.2022.00202"},{"key":"S0218213025400019BIB044","doi-asserted-by":"publisher","DOI":"10.1109\/IRI58017.2023.00053"},{"key":"S0218213025400019BIB045","doi-asserted-by":"publisher","DOI":"10.4135\/9781412983327"},{"key":"S0218213025400019BIB046","doi-asserted-by":"publisher","DOI":"10.2307\/3001913"}],"container-title":["International Journal on Artificial Intelligence Tools"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0218213025400019","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,21]],"date-time":"2025-10-21T07:44:09Z","timestamp":1761032649000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/10.1142\/S0218213025400019"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,2,19]]},"references-count":36,"journal-issue":{"issue":"04n05","published-print":{"date-parts":[[2025,8]]}},"alternative-id":["10.1142\/S0218213025400019"],"URL":"https:\/\/doi.org\/10.1142\/s0218213025400019","relation":{},"ISSN":["0218-2130","1793-6349"],"issn-type":[{"value":"0218-2130","type":"print"},{"value":"1793-6349","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,2,19]]},"article-number":"2540001"}}