{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,21]],"date-time":"2025-11-21T18:01:39Z","timestamp":1763748099325,"version":"build-2065373602"},"reference-count":63,"publisher":"MDPI AG","issue":"7","license":[{"start":{"date-parts":[[2020,3,27]],"date-time":"2020-03-27T00:00:00Z","timestamp":1585267200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>Human activity recognition (HAR) is a popular field of study. The outcomes of the projects in this area have the potential to impact on the quality of life of people with conditions such as dementia. HAR is focused primarily on applying machine learning classifiers on data from low level sensors such as accelerometers. The performance of these classifiers can be improved through an adequate training process. In order to improve the training process, multivariate outlier detection was used in order to improve the quality of data in the training set and, subsequently, performance of the classifier. The impact of the technique was evaluated with KNN and random forest (RF) classifiers. In the case of KNN, the performance of the classifier was improved from 55.9% to 63.59%.<\/jats:p>","DOI":"10.3390\/s20071858","type":"journal-article","created":{"date-parts":[[2020,4,1]],"date-time":"2020-04-01T03:44:13Z","timestamp":1585712653000},"page":"1858","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":11,"title":["Evaluating the Impact of a Two-Stage Multivariate Data Cleansing Approach to Improve to the Performance of Machine Learning Classifiers: A Case Study in Human Activity Recognition"],"prefix":"10.3390","volume":"20","author":[{"given":"Dionicio","family":"Neira-Rodado","sequence":"first","affiliation":[{"name":"Department of Industrial Agroindustrial and Operations Management GIAO, Universidad de la Costa, Barranquilla 080002, Colombia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0882-7902","authenticated-orcid":false,"given":"Chris","family":"Nugent","sequence":"additional","affiliation":[{"name":"School of Computing, Ulster University, Shore Road, Newtownabbey, County Antrim BT37 0QB, Northern Ireland, UK"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2368-7354","authenticated-orcid":false,"given":"Ian","family":"Cleland","sequence":"additional","affiliation":[{"name":"School of Computing, Ulster University, Shore Road, Newtownabbey, County Antrim BT37 0QB, Northern Ireland, UK"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Javier","family":"Velasquez","sequence":"additional","affiliation":[{"name":"Department of Industrial Agroindustrial and Operations Management GIAO, Universidad de la Costa, Barranquilla 080002, Colombia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Amelec","family":"Viloria","sequence":"additional","affiliation":[{"name":"Department of Industrial Agroindustrial and Operations Management GIAO, Universidad de la Costa, Barranquilla 080002, Colombia"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2020,3,27]]},"reference":[{"key":"ref_1","unstructured":"Prins, J., and Mader, D. (2007). Multivariate Control Charts for Grouped and Individual Observations. Manuf. Syst., 37\u201341."},{"key":"ref_2","first-page":"217","article-title":"Detection and accommodation of outliers","volume":"6","author":"Fallon","year":"1997","journal-title":"Environ. Sampl. Monit. Primer"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"49","DOI":"10.1111\/coin.12045","article-title":"User activities outliers detection; Integration of statistical and computational intelligence techniques","volume":"32","author":"Mahmoud","year":"2016","journal-title":"Comput. Intell."},{"key":"ref_4","first-page":"171","article-title":"Aumento de la potencia del gr\u00e1fico de control multivariante T 2 de Hotelling utilizando se\u00f1ales adicionales de falta de control","volume":"43","author":"Aparisi","year":"2001","journal-title":"Estad\u00edstica Espa\u00f1ola"},{"key":"ref_5","unstructured":"Bauder, R.A., and Khoshgoftaar, T.M. (2017, January 22\u201324). Multivariate anomaly detection in medicare using model residuals and probabilistic programming. Proceedings of the FLAIRS 2017\u201430th International Florida Artificial Intelligence Research Society Conference, Marco Island, FL, USA."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1155\/2012\/531864","article-title":"An Improvement of the Hotelling T 2 Statistic in Monitoring Multivariate Quality Characteristics","volume":"2012","author":"Shabbak","year":"2012","journal-title":"Math. Probl. Eng."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"263","DOI":"10.1016\/S0169-7161(04)24010-X","article-title":"Multivariate Outlier Detection and Robustness","volume":"24","author":"Hubert","year":"2004","journal-title":"Handb. Stat."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1016\/j.pmcj.2016.09.009","article-title":"Adaptive sliding window segmentation for physical activity recognition using a single tri-axial accelerometer","volume":"38","author":"Noor","year":"2017","journal-title":"Pervasive Mob. Comput."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"1081","DOI":"10.1016\/j.jalz.2017.02.010","article-title":"Population-based stroke and dementia incidence trends: Age and sex variations","volume":"13","author":"Cerasuolo","year":"2017","journal-title":"Alzheimer\u2019s Dement."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"11312","DOI":"10.3390\/s150511312","article-title":"The Elderly\u2019s Independent Living in Smart Homes: A Characterization of Activities and Sensing Infrastructure Survey to Facilitate Services Development","volume":"15","author":"Ni","year":"2015","journal-title":"Sensors"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Cleland, I., Donnelly, M.P., Nugent, C.D., Hallberg, J., Espinilla, M., and Garcia-Constantino, M. (2018, January 19\u201323). Collection of a Diverse, Realistic and Annotated Dataset for Wearable Activity Recognition. Proceedings of the 2018 IEEE International Conference on Pervasive Computing and Communications Workshops (PerCom Workshops), Athens, Greece.","DOI":"10.1109\/PERCOMW.2018.8480322"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Kleinberger, T., Becker, M., Ras, E., Holzinger, A., and M\u00fcller, P. (2007). LNCS 4555-Ambient Intelligence in Assisted Living: Enable Elderly People to Handle Future Interfaces, Springer.","DOI":"10.1007\/978-3-540-73281-5_11"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Irvine, N., Nugent, C., Zhang, S., Wang, H., Ng, W.W., Cleland, I., and Espinilla, M. (2018). The impact of dataset quality on the performance of data-driven approaches for human activity recognition. Data Science and Knowledge Engineering for Sensing Decision Support, World Scientific.","DOI":"10.1142\/9789813273238_0162"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"70","DOI":"10.1016\/j.patrec.2014.04.011","article-title":"Human activity recognition from 3D data: A review","volume":"48","author":"Aggarwal","year":"2014","journal-title":"Pattern Recognit. Lett."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Chen, Y., and Xue, Y. (2015, January 9\u201312). A Deep Learning Approach to Human Activity Recognition Based on Single Accelerometer. Proceedings of the 2015 IEEE International Conference on Systems, Man, and Cybernetics, Kowloon, China.","DOI":"10.1109\/SMC.2015.263"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Qi, W., Su, H., Yang, C., Ferrigno, G., De Momi, E., and Aliverti, A. (2019). A Fast and Robust Deep Convolutional Neural Networks for Complex Human Activity Recognition Using Smartphone. Sensors, 19.","DOI":"10.3390\/s19173731"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"1321","DOI":"10.1109\/JSEN.2014.2370945","article-title":"Wearable Sensors for Human Activity Monitoring: A Review","volume":"15","author":"Mukhopadhyay","year":"2015","journal-title":"IEEE Sens. J."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"790","DOI":"10.1109\/TSMCC.2012.2198883","article-title":"Sensor-based Activity Recognition","volume":"42","author":"Chen","year":"2012","journal-title":"IEEE Trans. Syst. Man Cybern. Part C"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"1359","DOI":"10.1109\/TKDE.2010.184","article-title":"A Pattern Mining Approach to Sensor-Based Human Activity Recognition","volume":"23","author":"Gu","year":"2011","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"386","DOI":"10.1109\/JSEN.2016.2628346","article-title":"A Survey on Activity Detection and Classification Using Wearable Sensors","volume":"17","author":"Cornacchia","year":"2017","journal-title":"IEEE Sens. J."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"233","DOI":"10.1016\/j.eswa.2018.03.056","article-title":"Deep learning algorithms for human activity recognition using mobile and wearable sensor networks: State of the art and research challenges","volume":"105","author":"Nweke","year":"2018","journal-title":"Expert Syst. Appl."},{"key":"ref_22","unstructured":"Alsheikh, M.A., Selim, A., Niyato, D., Doyle, L., Lin, S., and Tan, H.-P. (2015). Deep Activity Recognition Models with Triaxial Accelerometers. arXiv."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/2499621","article-title":"A tutorial on human activity recognition using body-worn inertial sensors","volume":"46","author":"Bulling","year":"2014","journal-title":"ACM Comput. Surv."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"645","DOI":"10.1007\/s00779-010-0293-9","article-title":"Preprocessing Techniques for Context Recognition from Accelerometer Data","volume":"14","author":"Figo","year":"2010","journal-title":"Pers. Ubiquitous Comput."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Pires, I., Garcia, N.M., Pombo, N., and Fl\u00f3rez-Revuelta, F. (2016). From Data Acquisition to Data Fusion: A Comprehensive Review and a Roadmap for the Identification of Activities of Daily Living Using Mobile Devices. Sensors, 16.","DOI":"10.3390\/s16020184"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"78","DOI":"10.1145\/2347736.2347755","article-title":"A Few Useful Things to Know About Machine Learning","volume":"55","author":"Domingos","year":"2012","journal-title":"Commun. ACM"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"25","DOI":"10.21739\/IBR.2015.12.19.4.25","article-title":"A Study on the Effect of Entrepreneurship of Korean and Chinese University Students on Entrepreneurial Intention: Focused on Mediating of Self-efficacy","volume":"19","author":"Yang","year":"2015","journal-title":"Int. Bus. Rev."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"1798","DOI":"10.1109\/TPAMI.2013.50","article-title":"Representation learning: A review and new perspectives","volume":"35","author":"Bengio","year":"2013","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"LeCun, Y., Kavukcuoglu, K., and Farabet, C. (June, January 30). Convolutional networks and applications in vision. Proceedings of the ISCAS 2010\u20132010 IEEE International Symposium on Circuits and Systems: Nano-Bio Circuit Fabrics and Systems, Paris, France.","DOI":"10.1109\/ISCAS.2010.5537907"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"2278","DOI":"10.1109\/5.726791","article-title":"Gradient-based learning applied to document recognition","volume":"86","author":"LeCun","year":"1998","journal-title":"Proc. IEEE"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"29","DOI":"10.1007\/978-3-642-25446-8_4","article-title":"Sequential deep learning for human action recognition","volume":"Volume 7065","author":"Baccouche","year":"2011","journal-title":"Lecture Notes in Computer Science"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Wang, K., Wang, X., Lin, L., Wang, M., and Zuo, W. (2014). 3D Human activity recognition with reconfigurable convolutional neural networks. Proceedings of the 2014 ACM Conference on Multimedia, Orlando, FL, USA, 3\u20137 November 2014, Association for Computing Machinery.","DOI":"10.1145\/2647868.2654912"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Veeriah, V., Zhuang, N., and Qi, G.-J. (2015). Differential Recurrent Neural Networks for Action Recognition. arXiv.","DOI":"10.1109\/ICCV.2015.460"},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"388","DOI":"10.1016\/j.bbe.2017.04.004","article-title":"Physical activity recognition by smartphones, a survey","volume":"Volume 37","author":"Morales","year":"2017","journal-title":"Biocybernetics and Biomedical Engineering"},{"key":"ref_35","unstructured":"Gjoreski, H., Bizjak, J., Gjoreski, M., and Gams, M. (2016, January 9\u201315). Comparing Deep and Classical Machine Learning Methods for Human Activity Recognition using Wrist Accelerometer. Proceedings of the IJCAI 2016 Workshop on Deep Learning for Artificial Intelligence, New York, NY, USA."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Maimon, O., and Rokach, L. (2010). Data Mining and Knowledge Discovery Handbook, Springer.","DOI":"10.1007\/978-0-387-09823-4"},{"key":"ref_37","unstructured":"Gorunescu, F. (2011). Data Mining: Concepts, Models and Techniques, Springer Science & Business Media."},{"key":"ref_38","unstructured":"Han, J., Kamber, M., and Pei, J. (2011). Data Mining. Concepts and Techniques, Morgan Kaufmann. [3rd ed.]."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"177","DOI":"10.1007\/s10462-004-0751-8","article-title":"Class Noise vs. Attribute Noise: A Quantitative Study","volume":"22","author":"Zhu","year":"2004","journal-title":"Artif. Intell. Rev."},{"key":"ref_40","first-page":"245","article-title":"Identifying Learners Robust to Low Quality Data","volume":"33","author":"Folleco","year":"2009","journal-title":"Informatica"},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"1175","DOI":"10.1016\/j.patrec.2008.01.009","article-title":"Fuzzy relevance vector machine for learning from unbalanced data and noise","volume":"29","author":"Li","year":"2008","journal-title":"Pattern Recognit. Lett."},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"487","DOI":"10.3233\/IDA-2006-10602","article-title":"Class noise detection using frequent itemsets","volume":"10","author":"Khoshgoftaar","year":"2006","journal-title":"Intell. Data Anal."},{"key":"ref_43","first-page":"69","article-title":"Identification of Multivariate Outliers\u2013Problems and Challenges","volume":"247","author":"Majewska","year":"2015","journal-title":"Studia Ekonomiczne."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"285","DOI":"10.1080\/00949658208810595","article-title":"Outlier detection using robust measures of scale","volume":"15","author":"Iglewicz","year":"1982","journal-title":"J. Stat. Comput. Simul."},{"key":"ref_45","first-page":"37","article-title":"The Masking Breakdown Point of Multivariate Outlier Identification Rules The Masking Breakdown Point of Multivariate Outlier Identification Rules","volume":"94","author":"Becker","year":"2012","journal-title":"J. Am. Stat. Assoc."},{"key":"ref_46","unstructured":"(2020, March 27). Identifying multivariate outliers, Stata Technical Bulletin STB-11. September 1993. Available online: https:\/\/www.researchgate.net\/publication\/24136964_Identifying_multivariate_outliers."},{"key":"ref_47","first-page":"295","article-title":"A Comparison of Multivariate Outlier Detection Methods for Clinical Laboratory Safety Data","volume":"50","author":"Penny","year":"2001","journal-title":"J. R. Stat. Soc. Ser. D (Stat.)"},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Rousseeuw, P.J., and Leroy, A.M. (1987). Robust Regression and Outlier Detection, Wiley.","DOI":"10.1002\/0471725382"},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"1385","DOI":"10.1214\/aos\/1176349264","article-title":"Asymptotics for the Minimum Covariance Determinant Estimator","volume":"21","author":"Butler","year":"1993","journal-title":"Ann. Stat."},{"key":"ref_50","doi-asserted-by":"crossref","first-page":"4942","DOI":"10.1016\/j.csda.2006.07.028","article-title":"Robust forecasting of mortality and fertility rates: A functional data approach","volume":"51","author":"Hyndman","year":"2007","journal-title":"Comput. Stat. Data Anal."},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"10779","DOI":"10.1007\/s11042-016-3267-8","article-title":"Towards Online and Personalized Daily Activity Recognition, Habit Modeling, and Anomaly Detection for the Solitary Elderly through Unobtrusive Sensing","volume":"76","author":"Meng","year":"2017","journal-title":"Multimed. Tools Appl."},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"771","DOI":"10.3233\/JIFS-181315","article-title":"Improving data classification accuracy in sensor networks using hybrid outlier detection in HAR","volume":"37","author":"Gopalakrishnan","year":"2019","journal-title":"J. Intell. Fuzzy Syst."},{"key":"ref_53","doi-asserted-by":"crossref","first-page":"103397","DOI":"10.1016\/j.jbi.2020.103397","article-title":"FilterK: A new outlier detection method for k-means clustering of physical activity","volume":"104","author":"Jones","year":"2020","journal-title":"J. Biomed. Inform."},{"key":"ref_54","doi-asserted-by":"crossref","first-page":"74422","DOI":"10.1109\/ACCESS.2019.2921096","article-title":"Outlier Detection in Wearable Sensor Data for Human Activity Recognition (HAR) Based on DRNNs","volume":"7","year":"2019","journal-title":"IEEE Access"},{"key":"ref_55","doi-asserted-by":"crossref","unstructured":"Sunderland, K.M., Beaton, D., Fraser, J., Kwan, D., McLaughlin, P.M., Montero-Odasso, M., Peltsch, A.J., Pieruccini-Faria, F., Sahlas, D.J., and Swartz, R.H. (2019). The utility of multivariate outlier detection techniques for data quality evaluation in large studies: An application within the ONDRI project. BMC Med. Res. Methodol., 19.","DOI":"10.1186\/s12874-019-0737-5"},{"key":"ref_56","doi-asserted-by":"crossref","unstructured":"Zhao, S., Li, W., and Cao, J. (2018). A user-adaptive algorithm for activity recognition based on K-means clustering, local outlier factor, and multivariate gaussian distribution. Sensors, 18.","DOI":"10.3390\/s18061850"},{"key":"ref_57","doi-asserted-by":"crossref","unstructured":"Marubini, E., and Orenti, A. (2014). Detecting outliers and\/or leverage points: A robust two-stage procedure with bootstrap cut-off points. Epidemiol. Biostat. Public Health, 11.","DOI":"10.2427\/9094"},{"key":"ref_58","doi-asserted-by":"crossref","unstructured":"Dehghani, A., Sarbishei, O., Glatard, T., and Shihab, E. (2019). A Quantitative Comparison of Overlapping and Non-Overlapping Sliding Windows for Human Activity Recognition Using Inertial Sensors. Sensors, 19.","DOI":"10.3390\/s19225026"},{"key":"ref_59","doi-asserted-by":"crossref","first-page":"801","DOI":"10.1249\/MSS.0000000000001144","article-title":"Activity recognition in youth using single accelerometer placed at wrist or ankle","volume":"49","author":"Mannini","year":"2017","journal-title":"Med. Sci. Sports Exerc."},{"key":"ref_60","doi-asserted-by":"crossref","first-page":"59192","DOI":"10.1109\/ACCESS.2018.2873502","article-title":"Sensor-based datasets for human activity recognition\u2014A systematic review of literature","volume":"6","author":"Quero","year":"2018","journal-title":"IEEE Access"},{"key":"ref_61","doi-asserted-by":"crossref","first-page":"206","DOI":"10.1016\/j.jmva.2004.08.002","article-title":"High breakdown estimators for principal components: The projection-pursuit approach revisited","volume":"95","author":"Croux","year":"2005","journal-title":"J. Multivar. Anal."},{"key":"ref_62","doi-asserted-by":"crossref","unstructured":"Qi, W., and Aliverti, A. (2019). A Multimodal Wearable System for Continuous and Real-time Breathing Pattern Monitoring During Daily Activity. IEEE J. Biomed. Health Inform.","DOI":"10.1109\/JBHI.2019.2963048"},{"key":"ref_63","first-page":"1","article-title":"Non-intrusive human activity recognition and abnormal behavior detection on elderly people: A review","volume":"53","author":"Lentzas","year":"2019","journal-title":"Artif. Intell. Rev."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/20\/7\/1858\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T09:12:21Z","timestamp":1760173941000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/20\/7\/1858"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,3,27]]},"references-count":63,"journal-issue":{"issue":"7","published-online":{"date-parts":[[2020,4]]}},"alternative-id":["s20071858"],"URL":"https:\/\/doi.org\/10.3390\/s20071858","relation":{},"ISSN":["1424-8220"],"issn-type":[{"type":"electronic","value":"1424-8220"}],"subject":[],"published":{"date-parts":[[2020,3,27]]}}}