{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,8]],"date-time":"2026-01-08T20:01:37Z","timestamp":1767902497895,"version":"3.49.0"},"reference-count":47,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2021,3,5]],"date-time":"2021-03-05T00:00:00Z","timestamp":1614902400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Novel Approaches for Predicting Unstructured Short Periods of Physical Activities in Youth","award":["R21HL093407-01A1"],"award-info":[{"award-number":["R21HL093407-01A1"]}]},{"name":"NIH","award":["1R01HD083431-01A1"],"award-info":[{"award-number":["1R01HD083431-01A1"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Knowl. Discov. Data"],"published-print":{"date-parts":[[2021,4,30]]},"abstract":"<jats:p>The study of model bias and variance with respect to decision boundaries is critically important in supervised learning and artificial intelligence. There is generally a tradeoff between the two, as fine-tuning of the decision boundary of a classification model to accommodate more boundary training samples (i.e., higher model complexity) may improve training accuracy (i.e., lower bias) but hurt generalization against unseen data (i.e., higher variance). By focusing on just classification boundary fine-tuning and model complexity, it is difficult to reduce both bias and variance. To overcome this dilemma, we take a different perspective and investigate a new approach to handle inaccuracy and uncertainty in the training data labels, which are inevitable in many applications where labels are conceptual entities and labeling is performed by human annotators. 
The process of classification can be undermined by uncertainty in the labels of the training data; extending a boundary to accommodate an inaccurately labeled point will increase both bias and variance. Our novel method can reduce both bias and variance by estimating the pointwise label uncertainty of the training set and accordingly adjusting the training sample weights such that those samples with high uncertainty are weighted down and those with low uncertainty are weighted up. In this way, uncertain samples have a smaller contribution to the objective function of the model\u2019s learning algorithm and exert less pull on the decision boundary. In a real-world physical activity recognition case study, the data present many labeling challenges, and we show that this new approach improves model performance and reduces model variance.<\/jats:p>","DOI":"10.1145\/3429447","type":"journal-article","created":{"date-parts":[[2021,3,5]],"date-time":"2021-03-05T11:11:56Z","timestamp":1614942716000},"page":"1-18","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":10,"title":["Mitigating Class-Boundary Label Uncertainty to Reduce Both Model Bias and Variance"],"prefix":"10.1145","volume":"15","author":[{"given":"Matthew","family":"Almeida","sequence":"first","affiliation":[{"name":"University of Massachusetts Boston, Boston, MA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yong","family":"Zhuang","sequence":"additional","affiliation":[{"name":"University of Massachusetts Boston, Boston, MA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3383-551X","authenticated-orcid":false,"given":"Wei","family":"Ding","sequence":"additional","affiliation":[{"name":"University of Massachusetts Boston, Boston, MA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Scott E.","family":"Crouter","sequence":"additional","affiliation":[{"name":"University of 
Tennessee Knoxville, Knoxville, TN"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ping","family":"Chen","sequence":"additional","affiliation":[{"name":"University of Massachusetts Boston, Boston, MA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2021,3,5]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"Yaser S. Abu-Mostafa Malik Magdon-Ismail and Hsuan-Tien Lin. 2012. Learning from Data. AMLBook."},{"key":"e_1_2_1_2_1","volume-title":"Proceedings of the KDD Workshop","volume":"10","author":"Donald"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1214\/18-AOS1688"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-29894-4_5"},{"key":"e_1_2_1_5_1","first-page":"2973","article-title":"Semi-supervised novelty detection","author":"Blanchard Gilles","year":"2010","journal-title":"Journal of Machine Learning Research 11"},{"key":"e_1_2_1_6_1","unstructured":"Fran\u00e7ois Chollet et\u00a0al. 2015. Keras. Retrieved from https:\/\/keras.io."},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1249\/MSS.0000000000000502"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASSP.1980.1163420"},{"key":"e_1_2_1_9_1","volume-title":"Proceedings of the 18th International Society for Music Information Retrieval Conference. DOI:https:\/\/arxiv.org\/abs\/1612","author":"Defferrard Micha\u00ebl","year":"2017"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.strusafe.2008.06.020"},{"key":"e_1_2_1_11_1","volume-title":"Proceedings of the 2009 SIAM International Conference on Data Mining. 
Society for Industrial and Applied Mathematics.","author":"Ding Wei"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10994-017-5663-3"},{"key":"e_1_2_1_13_1","volume-title":"Proceedings of the 30th International Conference on Neural Information Processing Systems. ACM, 1632--1640","author":"Fawzi Alhussein","year":"2016"},{"key":"e_1_2_1_14_1","volume-title":"Proceedings of the 10th International Conference on Speech and Computer.","author":"Ganchev Todor","year":"2005"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISIT.2017.8006749"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIT.2018.2807481"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1162\/neco.1992.4.1.1"},{"key":"e_1_2_1_18_1","volume-title":"The Nature of Mathematical Modeling","author":"Gershenfeld Neil"},{"key":"e_1_2_1_19_1","volume-title":"Proceedings of the 5th International Conference on Learning Representations.","author":"Goldberger Jacob","year":"2016"},{"key":"e_1_2_1_20_1","volume-title":"Deep Learning","author":"Goodfellow Ian"},{"key":"e_1_2_1_21_1","volume-title":"Proceedings of the International Conference on Learning Representations.","author":"Goodfellow Ian J","year":"2014"},{"key":"e_1_2_1_22_1","first-page":"R14","volume-title":"Proceedings of the 3rd International Conference on Learning Representations (ICLR'15)","author":"Gu Shixiang","year":"2015"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1007\/BF02985802"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.243"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1022899518027"},{"key":"e_1_2_1_26_1","volume-title":"Proceedings of the 35th International Conference on Machine Learning.","author":"Jiang Lu","year":"2018"},{"key":"e_1_2_1_27_1","first-page":"9","article-title":"Sample estimate of the entropy of a random vector","volume":"23","author":"Kozachenko L. 
F.","year":"1987","journal-title":"Problemy Peredachi Informatsii"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/5.726791"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2019.8682345"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0893-6080(05)80131-5"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2015.2456899"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.25080\/Majora-7b98e3ed-003"},{"key":"e_1_2_1_33_1","volume-title":"Proceedings of the International Conference on Machine Learning. 125--134","author":"Menon Aditya","year":"2015"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2014.2300480"},{"key":"e_1_2_1_35_1","volume-title":"Proceedings of the Advances in Neural Information Processing Systems, J. C. Burges, L. Bottou, M. Welling, Z. Ghahramani, and K. Q. Weinberger (Eds.)","volume":"26","author":"Natarajan Nagarajan","year":"2013"},{"key":"e_1_2_1_36_1","volume-title":"Retrieved","author":"Ng Andrew","year":"2017"},{"key":"e_1_2_1_37_1","volume-title":"Proceedings of the Thirty-Third Conference on Uncertainty in Artificial Intelligence (UAI'17)","author":"Northcutt Curtis G.","year":"2017"},{"key":"e_1_2_1_38_1","volume-title":"A Guide to NumPy","author":"Oliphant Travis E."},{"key":"e_1_2_1_39_1","volume-title":"Proceedings of the 19th International Society for Music Information Retrieval Conference (ISMIR'18)","author":"Park Jiyoung","year":"2018"},{"key":"e_1_2_1_40_1","unstructured":"Mengye Ren Wenyuan Zeng Bin Yang and Raquel Urtasun. 2018. Learning to reweight examples for robust deep learning. 
arXiv:1803.09050."},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.5555\/1367985.1367993"},{"key":"e_1_2_1_42_1","volume-title":"Proceedings of the 18th International Conference on Artificial Intelligence and Statistics. 838--846","author":"Scott Clayton","year":"2015"},{"key":"e_1_2_1_43_1","volume-title":"Proceedings of the Conference on Learning Theory. 489--511","author":"Scott Clayton","year":"2013"},{"key":"e_1_2_1_44_1","doi-asserted-by":"crossref","volume-title":"Understanding Machine Learning: From Theory to Algorithms","author":"Shalev-Shwartz Shai","DOI":"10.1017\/CBO9781107298019"},{"key":"e_1_2_1_46_1","doi-asserted-by":"crossref","unstructured":"Tom Stepinski Wei Ding and R. Vilalta. 2012. Detecting impact craters in planetary images using machine learning. In Intelligent Data Analysis for Real-Life Applications: Theory and Practice. IGI Global 146\u2013159.","DOI":"10.4018\/978-1-4666-1806-0.ch008"},{"key":"e_1_2_1_47_1","unstructured":"Sethu Vijayakumar. 2007. The Bias-Variance Tradeoff (PDF). 
Retrieved from http:\/\/www.inf.ed.ac.uk\/teaching\/courses\/mlsc\/Notes\/Lecture4\/BiasVariance.pdf."},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1145\/2487575.2488220"}],"container-title":["ACM Transactions on Knowledge Discovery from Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3429447","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3429447","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:48:03Z","timestamp":1750193283000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3429447"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,3,5]]},"references-count":47,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2021,4,30]]}},"alternative-id":["10.1145\/3429447"],"URL":"https:\/\/doi.org\/10.1145\/3429447","relation":{},"ISSN":["1556-4681","1556-472X"],"issn-type":[{"value":"1556-4681","type":"print"},{"value":"1556-472X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,3,5]]},"assertion":[{"value":"2019-07-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2020-10-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-03-05","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}