{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,14]],"date-time":"2025-10-14T00:56:30Z","timestamp":1760403390389,"version":"build-2065373602"},"reference-count":29,"publisher":"Association for Computing Machinery (ACM)","issue":"4","funder":[{"DOI":"10.13039\/501100007414","name":"Qassim University","doi-asserted-by":"crossref","award":["QU-J-PG-2-2025-52583"],"award-info":[{"award-number":["QU-J-PG-2-2025-52583"]}],"id":[{"id":"10.13039\/501100007414","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Comput. Healthcare"],"published-print":{"date-parts":[[2025,10,31]]},"abstract":"<jats:p>Machine learning is widely used across various fields, including e-health, to enhance efficiency, classify events, and make accurate predictions, such as diagnosing diseases and prescribing medications. However, machine learning models are increasingly vulnerable to data poisoning attacks, which manipulate training data to degrade model accuracy and cause incorrect predictions. This study focuses on detecting data poisoning in e-health applications by simulating label-flipping attacks at different rates (5%, 25%, 50%, 75%) on breast cancer and diabetes datasets. The performance of machine learning models in disease detection was evaluated before and after poisoning, alongside their ability to detect poisoned data. Results show that models perform significantly better on clean data, with a marked deterioration at higher poisoning rates (50%\u201375%). The Random Forest (RF) and Gradient Boosting (GB) models proved most effective in detecting poisoned data, particularly at higher rates of poisoning. Conversely, the Logistic Regression (LR) and Multi-layer Perceptron (MLP) models tended to overgeneralize, leading to false positives, especially in the breast cancer dataset. This study highlights the importance of safeguarding ML models in e-health from data poisoning threats.<\/jats:p>","DOI":"10.1145\/3728369","type":"journal-article","created":{"date-parts":[[2025,4,4]],"date-time":"2025-04-04T12:18:54Z","timestamp":1743769134000},"page":"1-19","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Prevention of Data Poisonous Threats on Machine Learning Models in e-Health"],"prefix":"10.1145","volume":"6","author":[{"ORCID":"https:\/\/orcid.org\/0009-0003-5779-3961","authenticated-orcid":false,"given":"Etidal","family":"Alruwaili","sequence":"first","affiliation":[{"name":"Department of Information Technology, College of Computer, Qassim University, Buraidah, Saudi Arabia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5173-3656","authenticated-orcid":false,"given":"Tarek","family":"Moulahi","sequence":"additional","affiliation":[{"name":"Department of Information Technology, College of Computer, Qassim University, Buraidah, Saudi Arabia"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2025,10,13]]},"reference":[{"key":"e_1_3_1_2_2","unstructured":"D. Reinsel J. Gantz and J. Rydning. 2018. The Digitization of the World from Edge to Core. Retrieved from https:\/\/www.seagate.com\/files\/www-content\/our-story\/trends\/files\/idc-seagate-dataage-whitepaper.pdf"},{"key":"e_1_3_1_3_2","doi-asserted-by":"publisher","DOI":"10.1038\/s41591-018-0316-z"},{"key":"e_1_3_1_4_2","unstructured":"GeeksforGeeks. 2019. ML | What is Machine Learning?\u2014GeeksforGeeks. Retrieved April 10 2019 from https:\/\/www.geeksforgeeks.org\/ml-machine-learning\/"},{"key":"e_1_3_1_5_2","doi-asserted-by":"crossref","unstructured":"Arawinkumaar Selvakkumar S. Pal and Zahra Jadidi. 2021. Addressing adversarial machine learning attacks in smart healthcare perspectives. arXiv:2112.08862. Retrieved from https:\/\/arxiv.org\/abs\/2112.08862","DOI":"10.1007\/978-3-030-98886-9_21"},{"key":"e_1_3_1_6_2","doi-asserted-by":"publisher","DOI":"10.1109\/MC.2021.3057686"},{"key":"e_1_3_1_7_2","unstructured":"guest_blog. 2022. Machine Learning: Adversarial Attacks and Defense. Analytics Vidhya. Retrieved from September 1 2022 https:\/\/www.analyticsvidhya.com\/blog\/2022\/09\/machine-learning-adversarial-attacks-and-defense\/"},{"key":"e_1_3_1_8_2","first-page":"1","volume-title":"Proceedings of the 2022 10th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO","author":"Gupta N.","year":"2022","unstructured":"N. Gupta. 2022. Machine learning applications in healthcare. In Proceedings of the 2022 10th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO). IEEE, 1\u20136."},{"key":"e_1_3_1_9_2","doi-asserted-by":"publisher","DOI":"10.1056\/NEJMra1814259"},{"key":"e_1_3_1_10_2","doi-asserted-by":"publisher","DOI":"10.1038\/s41591-018-0300-7"},{"key":"e_1_3_1_11_2","doi-asserted-by":"publisher","DOI":"10.1038\/nature21056"},{"key":"e_1_3_1_12_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2022.3162397"},{"key":"e_1_3_1_13_2","doi-asserted-by":"publisher","DOI":"10.1145\/3538707"},{"key":"e_1_3_1_14_2","unstructured":"A. E. Cin\u00e0 K. Grosse A. Demontis B. Biggio F. Roli and M. Pelillo. 2022. Machine learning security against data poisoning: Are we there yet? arXiv:2204.05986. Retrieved from https:\/\/arxiv.org\/abs\/2204.05986"},{"key":"e_1_3_1_15_2","unstructured":"W. Qiu. 2022. A survey on poisoning attacks against supervised machine learning. arXiv:2202.02510. Retrieved from https:\/\/arxiv.org\/abs\/2202.02510"},{"key":"e_1_3_1_16_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.dcan.2021.07.009"},{"key":"e_1_3_1_17_2","doi-asserted-by":"publisher","DOI":"10.1109\/GLOBECOM42002.2020.9322472"},{"key":"e_1_3_1_18_2","article-title":"Data poisoning attack on random forest classification model","author":"Chang J. Y.","year":"2020","unstructured":"J. Y. Chang and E. G. Im. 2020. Data poisoning attack on random forest classification model. In Proceedings of the Smart Media and Applications (SMA \u201920).","journal-title":"Proceedings of the Smart Media and Applications (SMA \u201920)"},{"key":"e_1_3_1_19_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-08421-8_34"},{"key":"e_1_3_1_20_2","doi-asserted-by":"publisher","DOI":"10.1007\/s12243-023-00951-0"},{"key":"e_1_3_1_21_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.cosrev.2019.100199"},{"key":"e_1_3_1_22_2","doi-asserted-by":"publisher","DOI":"10.1109\/SP.2018.00057"},{"key":"e_1_3_1_23_2","doi-asserted-by":"publisher","DOI":"10.1109\/UBMYK48245.2019.8965459"},{"key":"e_1_3_1_24_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-13453-2_1"},{"key":"e_1_3_1_25_2","doi-asserted-by":"publisher","DOI":"10.3390\/su12166434"},{"key":"e_1_3_1_26_2","doi-asserted-by":"publisher","DOI":"10.1007\/s00500-019-03968-7"},{"key":"e_1_3_1_27_2","first-page":"4503","volume-title":"Applied Intelligence","author":"Zhang H.","year":"2021","unstructured":"H. Zhang, N. Cheng, Y. Zhang, and Z. Li. 2021. Label flipping attacks against Naive Bayes on spam filtering systems. Applied Intelligence 51, 7 (2021), 4503\u20134514."},{"key":"e_1_3_1_28_2","unstructured":"Pima Indians Diabetes Database. 2024. Www.kaggle.com. Retrieved March 2024 from https:\/\/www.kaggle.com\/datasets\/uciml\/pima-indians-diabetes-database"},{"key":"e_1_3_1_29_2","unstructured":"Breast Cancer Dataset. 2024. Www.kaggle.com. Retrieved March 2024 from https:\/\/www.kaggle.com\/datasets\/yasserh\/breast-cancer-dataset"},{"key":"e_1_3_1_30_2","unstructured":"E. Eid. 2024. Retrieved April 15 2024 from https:\/\/github.com\/Etidal21\/ML"}],"container-title":["ACM Transactions on Computing for Healthcare"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3728369","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,13]],"date-time":"2025-10-13T23:29:45Z","timestamp":1760398185000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3728369"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,10,13]]},"references-count":29,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2025,10,31]]}},"alternative-id":["10.1145\/3728369"],"URL":"https:\/\/doi.org\/10.1145\/3728369","relation":{},"ISSN":["2691-1957","2637-8051"],"issn-type":[{"type":"print","value":"2691-1957"},{"type":"electronic","value":"2637-8051"}],"subject":[],"published":{"date-parts":[[2025,10,13]]},"assertion":[{"value":"2024-04-22","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-03-28","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-10-13","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}