{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,13]],"date-time":"2026-01-13T16:45:25Z","timestamp":1768322725445,"version":"3.49.0"},"reference-count":32,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2023,6,22]],"date-time":"2023-06-22T00:00:00Z","timestamp":1687392000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["J. Data and Information Quality"],"published-print":{"date-parts":[[2023,6,30]]},"abstract":"<jats:p>Missing data is one of the most persistent problems found in data that hinders information and value extraction. Handling missing data is a preprocessing task that has been extensively studied by the research community and remains an active research topic due to its impact and pervasiveness. Many surveys have been conducted to evaluate traditional and state-of-the-art techniques, however, the accuracy of missing data imputation techniques is evaluated without differentiating between isolated and sequence missing instances. In this article, we highlight the presence of both of these types of missing data at different percentages in real-world time-series datasets. We demonstrate that existing imputation techniques have different estimation accuracies for isolated and sequence missing instances. We then propose using a hybrid approach that differentiate between the two types of missing data to yield improved overall imputation accuracy.<\/jats:p>","DOI":"10.1145\/3575809","type":"journal-article","created":{"date-parts":[[2023,1,19]],"date-time":"2023-01-19T13:16:46Z","timestamp":1674134206000},"page":"1-15","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":4,"title":["Experience: Differentiating Between Isolated and Sequence Missing Data"],"prefix":"10.1145","volume":"15","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-2092-6666","authenticated-orcid":false,"given":"Amal","family":"Tawakuli","sequence":"first","affiliation":[{"name":"University of Luxembourg"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6832-148X","authenticated-orcid":false,"given":"Daniel","family":"Kaiser","sequence":"additional","affiliation":[{"name":"University of Luxembourg"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7374-3927","authenticated-orcid":false,"given":"Thomas","family":"Engel","sequence":"additional","affiliation":[{"name":"University of Luxembourg"}]}],"member":"320","published-online":{"date-parts":[[2023,6,22]]},"reference":[{"key":"e_1_3_1_2_2","doi-asserted-by":"publisher","DOI":"10.4236\/ojs.2021.114030"},{"key":"e_1_3_1_3_2","volume-title":"200+ Financial Indicators of US Stocks (2014-2018)","author":"Carbone Nicolas","year":"2020","unstructured":"Nicolas Carbone. 2020. 200+ Financial Indicators of US Stocks (2014-2018). Retrieved from https:\/\/www.kaggle.com\/cnic92\/200-financial-indicators-of-us-stocks-20142018\/metadata."},{"key":"e_1_3_1_4_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.oceaneng.2019.106220"},{"issue":"1","key":"e_1_3_1_5_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1111\/j.2517-6161.1977.tb01600.x","article-title":"Maximum likelihood from incomplete data via the EM algorithm","volume":"39","author":"Dempster A. P.","year":"1977","unstructured":"A. P. Dempster, N. M. Laird, and D. B. Rubin. 1977. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society 39, 1 (1977), 1\u201338.","journal-title":"Journal of the Royal Statistical Society"},{"key":"e_1_3_1_6_2","doi-asserted-by":"publisher","DOI":"10.1186\/2193-1801-2-222"},{"key":"e_1_3_1_7_2","article-title":"Missing data problem in the monitoring system: A review","author":"Du Jinghan","year":"2015","unstructured":"Jinghan Du, Minghua Hu, and Weining Zhang. 2015. Missing data problem in the monitoring system: A review. IEEE Sensors Journal (2015).","journal-title":"IEEE Sensors Journal"},{"key":"e_1_3_1_8_2","volume-title":"Applied Missing Data Analysis","author":"Enders Craig K.","year":"2010","unstructured":"Craig K. Enders. 2010. Applied Missing Data Analysis. Guilford Publications."},{"key":"e_1_3_1_9_2","doi-asserted-by":"publisher","DOI":"10.3390\/s20102832"},{"key":"e_1_3_1_10_2","volume-title":"Data Preprocessing in Data Mining","author":"Garc\u00eda Salvador","year":"2014","unstructured":"Salvador Garc\u00eda, Juli\u00e1n Luengo, and Francisco Herrera. 2014. Data Preprocessing in Data Mining. Springer-Verlag GmbH."},{"key":"e_1_3_1_11_2","doi-asserted-by":"publisher","DOI":"10.1007\/3-540-44668-0_93"},{"key":"e_1_3_1_12_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.scitotenv.2020.139140"},{"key":"e_1_3_1_13_2","doi-asserted-by":"publisher","DOI":"10.1016\/C2009-0-61819-5"},{"key":"e_1_3_1_14_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2019.03.029"},{"key":"e_1_3_1_15_2","doi-asserted-by":"publisher","DOI":"10.14778\/3377369.3377383"},{"key":"e_1_3_1_16_2","first-page":"111","article-title":"Data preprocessing for supervised learning","volume":"1","author":"Kotsiantis Sotiris","year":"2006","unstructured":"Sotiris Kotsiantis, Dimitris Kanellopoulos, and P. Pintelas. 2006. Data preprocessing for supervised learning. International Journal of Computer Science 1, 02(2006), 111\u2013117.","journal-title":"International Journal of Computer Science"},{"key":"e_1_3_1_17_2","doi-asserted-by":"publisher","DOI":"10.1098\/rspa.2015.0257"},{"key":"e_1_3_1_18_2","doi-asserted-by":"publisher","DOI":"10.1109\/JIOT.2020.2970467"},{"key":"e_1_3_1_19_2","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2288-14-75"},{"key":"e_1_3_1_20_2","doi-asserted-by":"publisher","DOI":"10.3390\/electronics10243167"},{"issue":"9","key":"e_1_3_1_21_2","article-title":"Missing data and multiple imputation in clinical epidemiological research","volume":"15","author":"Pedersen Alma B.","year":"2017","unstructured":"Alma B. Pedersen, Ellen M. Mikkelsen, Deirdre Cronin-Fenton, Nickolaj R. Kristensen, Tra My Pham, Lars Pedersen, and Irene Petersen. 2017. Missing data and multiple imputation in clinical epidemiological research. Clin Epidemiol 15, 9(2017).","journal-title":"Clin Epidemiol"},{"key":"e_1_3_1_22_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.dt.2019.07.016"},{"key":"e_1_3_1_23_2","doi-asserted-by":"publisher","DOI":"10.1109\/ISWC.2012.13"},{"key":"e_1_3_1_24_2","doi-asserted-by":"publisher","DOI":"10.1093\/biomet\/63.3.581"},{"key":"e_1_3_1_25_2","doi-asserted-by":"publisher","DOI":"10.1162\/089976601750264965"},{"key":"e_1_3_1_26_2","doi-asserted-by":"publisher","DOI":"10.3390\/s20185045"},{"key":"e_1_3_1_27_2","volume-title":"fillmissing","author":"Inc The MathWorks,","unstructured":"The MathWorks, Inc. [n. d.]. fillmissing. Retrieved from https:\/\/nl.mathworks.com\/help\/matlab\/ref\/fillmissing.html. Accessed 2-4-2023."},{"key":"e_1_3_1_28_2","doi-asserted-by":"publisher","DOI":"10.1109\/METRICS.2005.21"},{"issue":"2","key":"e_1_3_1_29_2","article-title":"On field calibration of an electronic nose for benzene estimation in an urban pollution monitoring scenario","volume":"129","author":"Vito S. De","year":"2008","unstructured":"S. De Vito, E. Massera, M. Piga, L. Martinotto, and G. Di Francia. 2008. On field calibration of an electronic nose for benzene estimation in an urban pollution monitoring scenario. Sensors and Actuators B: Chemical 129, 2(2008).","journal-title":"Sensors and Actuators B: Chemical"},{"key":"e_1_3_1_30_2","doi-asserted-by":"publisher","DOI":"10.1109\/JIOT.2019.2909038"},{"key":"e_1_3_1_31_2","unstructured":"Jingguang Zhou and Zili Huang. [n. d.]. Recover missing sensor data with iterative imputing network. ([n. d.]). arXiv:1711.07878v1. Retrieved from https:\/\/arxiv.org\/abs\/1711.07878v1."},{"key":"e_1_3_1_32_2","doi-asserted-by":"publisher","DOI":"10.1145\/3194206.3194214"},{"issue":"188","key":"e_1_3_1_33_2","article-title":"An efficient ensemble method for missing value imputation in microarray gene expression data","volume":"22","author":"Zhu Xinshan","year":"2021","unstructured":"Xinshan Zhu, Jiayu Wang, Biao Sun, Chao Ren, Ting Yang, and Jie Ding. 2021. An efficient ensemble method for missing value imputation in microarray gene expression data. BMC Bioinformatic 22, 188 (2021).","journal-title":"BMC Bioinformatic"}],"container-title":["Journal of Data and Information Quality"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3575809","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3575809","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T17:51:21Z","timestamp":1750182681000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3575809"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,6,22]]},"references-count":32,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2023,6,30]]}},"alternative-id":["10.1145\/3575809"],"URL":"https:\/\/doi.org\/10.1145\/3575809","relation":{},"ISSN":["1936-1955","1936-1963"],"issn-type":[{"value":"1936-1955","type":"print"},{"value":"1936-1963","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,6,22]]},"assertion":[{"value":"2022-02-25","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-11-14","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-06-22","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}