{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,17]],"date-time":"2026-03-17T19:25:23Z","timestamp":1773775523838,"version":"3.50.1"},"reference-count":27,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2021,1,13]],"date-time":"2021-01-13T00:00:00Z","timestamp":1610496000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"FCT","award":["UIDB\/00127\/2020 and POCI-01-0145-FEDER-032636"],"award-info":[{"award-number":["UIDB\/00127\/2020 and POCI-01-0145-FEDER-032636"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["J. Data and Information Quality"],"published-print":{"date-parts":[[2021,3,31]]},"abstract":"<jats:p>Spatio-temporal data can be used to study and simulate the movement and behavior of objects and natural phenomena. However, the use of real-world data raises several challenges related to its acquisition, representation, and quality. This article presents a data cleaning process, based on consistency rules and checks, that uses geometric operations to detect and remove outliers or inaccurate data in a spatio-temporal series. The proposal consists of selecting key frames and applying the process iteratively until the data have the desired quality. The case study consists of extracting and cleaning spatio-temporal data from a video tracking the propagation of a controlled fire captured using drones. The source data was generated using segmentation techniques to obtain the regions representing the burned area across time. The main issues concern noisy data (e.g., the height of flames is highly variable) and occlusion due to smoke. The results show that the quality assessment and improvement method proposed in this work can identify and remove inconsistencies from a dataset of more than 22,500 polygons in just a few iterations. The quality of the corrected dataset is verified using metrics and graph analysis.<\/jats:p>","DOI":"10.1145\/3428155","type":"journal-article","created":{"date-parts":[[2021,1,13]],"date-time":"2021-01-13T11:17:11Z","timestamp":1610536631000},"page":"1-13","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":9,"title":["Experience"],"prefix":"10.1145","volume":"13","author":[{"given":"Rog\u00e9rio Lu\u00eds C.","family":"Costa","sequence":"first","affiliation":[{"name":"IEETA, University of Aveiro, Leiria, Portugal"}]},{"given":"Enrico","family":"Miranda","sequence":"additional","affiliation":[{"name":"IEETA, University of Aveiro, Leiria, Portugal"}]},{"given":"Paulo","family":"Dias","sequence":"additional","affiliation":[{"name":"DETI - IEETA, University of Aveiro, Aveiro, Portugal"}]},{"given":"Jos\u00e9","family":"Moreira","sequence":"additional","affiliation":[{"name":"DETI - IEETA, University of Aveiro, Aveiro, Portugal"}]}],"member":"320","published-online":{"date-parts":[[2021,1,13]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"Proceedings of the 2004 ACM Symposium on Applied Computing (SAC\u201904)","author":"Adam N. R.","unstructured":"N. R. Adam , V. P. Janeja , and V. Atluri . 2004. Neighborhood based detection of anomalies in high dimensional spatio-temporal sensor datasets . In Proceedings of the 2004 ACM Symposium on Applied Computing (SAC\u201904) . 576--583. N. R. Adam, V. P. Janeja, and V. Atluri. 2004. Neighborhood based detection of anomalies in high dimensional spatio-temporal sensor datasets. In Proceedings of the 2004 ACM Symposium on Applied Computing (SAC\u201904). 576--583."},{"key":"e_1_2_1_2_1","volume-title":"2011 IEEE 27th International Conference on Data Engineering. 399--409","author":"Aggarwal C. C.","unstructured":"C. C. Aggarwal , Y. Zhao , and P. S. Yu . 2011. Outlier detection in graph streams . In 2011 IEEE 27th International Conference on Data Engineering. 399--409 . C. C. Aggarwal, Y. Zhao, and P. S. Yu. 2011. Outlier detection in graph streams. In 2011 IEEE 27th International Conference on Data Engineering. 399--409."},{"key":"e_1_2_1_3_1","volume-title":"Database Systems for Advanced Applications","author":"Assent Ira","unstructured":"Ira Assent , Philipp Kranen , Corinna Baldauf , and Thomas Seidl . 2012. AnyOut: Anytime outlier detection on streaming data . In Database Systems for Advanced Applications . Springer , Berlin , 228--242. Ira Assent, Philipp Kranen, Corinna Baldauf, and Thomas Seidl. 2012. AnyOut: Anytime outlier detection on streaming data. In Database Systems for Advanced Applications. Springer, Berlin, 228--242."},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1002\/for.768"},{"key":"e_1_2_1_5_1","volume-title":"28th International Conference on Information Technology Interfaces. 179--184","author":"Birant D.","unstructured":"D. Birant and A. Kut . 2006. Spatio-temporal outlier detection in large databases . In 28th International Conference on Information Technology Interfaces. 179--184 . D. Birant and A. Kut. 2006. Spatio-temporal outlier detection in large databases. In 28th International Conference on Information Technology Interfaces. 179--184."},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/1541880.1541882"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1111\/j.1467-9671.2006.00256.x"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/3341105.3374019"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/2950109"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.3138\/FM57-6770-U75U-7727"},{"key":"e_1_2_1_11_1","doi-asserted-by":"crossref","unstructured":"J. Duarte P. Dias and J. Moreira. 2018. A framework for the management of deformable moving objects. In Geospatial Technologies for All. Springer 327--346.  J. Duarte P. Dias and J. Moreira. 2018. A framework for the management of deformable moving objects. In Geospatial Technologies for All. Springer 327--346.","DOI":"10.1007\/978-3-319-78208-9_17"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.5555\/3001460.3001507"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/342009.335426"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2013.184"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.envsoft.2009.08.010"},{"key":"e_1_2_1_16_1","volume-title":"Noise reduction of NDVI time series: An empirical comparison of selected techniques. Remote Sensing of Environment 113 (January","author":"Hrid J. N.","year":"2008","unstructured":"J. N. Hrid and Greg Mcdermid . 2008. Noise reduction of NDVI time series: An empirical comparison of selected techniques. Remote Sensing of Environment 113 (January 2008 ), 248--258. J. N. Hrid and Greg Mcdermid. 2008. Noise reduction of NDVI time series: An empirical comparison of selected techniques. Remote Sensing of Environment 113 (January 2008), 248--258."},{"key":"e_1_2_1_17_1","volume-title":"2008 5th International Conference on Fuzzy Systems and Knowledge Discovery","volume":"2","author":"Jiang S.","unstructured":"S. Jiang and Q. An . 2008. Clustering-based outlier detection method . In 2008 5th International Conference on Fuzzy Systems and Knowledge Discovery , Vol. 2 . 429--433. S. Jiang and Q. An. 2008. Clustering-based outlier detection method. In 2008 5th International Conference on Fuzzy Systems and Knowledge Discovery, Vol. 2. 429--433."},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1080\/03610929708831995"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10586-017-1117-8"},{"key":"e_1_2_1_20_1","volume-title":"Multivariate time series anomaly detection: A framework of hidden Markov models. Applied Soft Computing 60 (June","author":"Li Jinbo","year":"2017","unstructured":"Jinbo Li , Witold Pedrycz , and Iqbal Jamal . 2017. Multivariate time series anomaly detection: A framework of hidden Markov models. Applied Soft Computing 60 (June 2017 ), 229--240. Jinbo Li, Witold Pedrycz, and Iqbal Jamal. 2017. Multivariate time series anomaly detection: A framework of hidden Markov models. Applied Soft Computing 60 (June 2017), 229--240."},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/1869790.1869856"},{"key":"e_1_2_1_22_1","volume-title":"MoST Forest Fire Dataset. Retrieved","year":"2020","unstructured":"MoST-Team. 2020. MoST Forest Fire Dataset. Retrieved 26 February 2020 , from http:\/\/most.web.ua.pt\/Video15minImagesWKT.zip. MoST-Team. 2020. MoST Forest Fire Dataset. Retrieved 26 February 2020, from http:\/\/most.web.ua.pt\/Video15minImagesWKT.zip."},{"key":"e_1_2_1_23_1","volume-title":"Retrieved","year":"2020","unstructured":"PostGis-Team. 2020 . PostGIS Reference . Retrieved July 13, 2020, from https:\/\/postgis.net\/docs\/reference.html. PostGis-Team. 2020. PostGIS Reference. Retrieved July 13, 2020, from https:\/\/postgis.net\/docs\/reference.html."},{"key":"e_1_2_1_24_1","unstructured":"PostGis-Team. 2020. ST_SimplifyPreserveTopology - PostGIS dev Manual. Retrieved 26 February 2020 from https:\/\/postgis.net\/docs\/ST_SimplifyPreserveTopology.html.  PostGis-Team. 2020. ST_SimplifyPreserveTopology - PostGIS dev Manual. Retrieved 26 February 2020 from https:\/\/postgis.net\/docs\/ST_SimplifyPreserveTopology.html."},{"key":"e_1_2_1_25_1","volume-title":"Advances in Spatial and Temporal Databases","author":"T\u00f8ssebro Erlend","unstructured":"Erlend T\u00f8ssebro and Ralf Hartmut G\u00fcting . 2001. Creating representations for continuously moving regions from observations . In Advances in Spatial and Temporal Databases . Springer , Berlin , 321--344. Erlend T\u00f8ssebro and Ralf Hartmut G\u00fcting. 2001. Creating representations for continuously moving regions from observations. In Advances in Spatial and Temporal Databases. Springer, Berlin, 321--344."},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1093\/biomet\/87.4.789"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-12519-5_7"}],"container-title":["Journal of Data and Information Quality"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3428155","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3428155","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T21:24:23Z","timestamp":1750195463000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3428155"}},"subtitle":["Quality Assessment and Improvement on a Forest Fire Dataset"],"short-title":[],"issued":{"date-parts":[[2021,1,13]]},"references-count":27,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2021,3,31]]}},"alternative-id":["10.1145\/3428155"],"URL":"https:\/\/doi.org\/10.1145\/3428155","relation":{},"ISSN":["1936-1955","1936-1963"],"issn-type":[{"value":"1936-1955","type":"print"},{"value":"1936-1963","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,1,13]]},"assertion":[{"value":"2020-02-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2020-10-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-01-13","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}