{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,5]],"date-time":"2025-12-05T14:28:12Z","timestamp":1764944892922},"reference-count":51,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2018,5,29]],"date-time":"2018-05-29T00:00:00Z","timestamp":1527552000000},"content-version":"vor","delay-in-days":59,"URL":"http:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Centre for Visual Analytics Science and Technology"},{"name":"Austrian Federal Ministry of Science, Research, and Economy in the exceptional Laura Bassi Centres of Excellence initiative","award":["#822746"],"award-info":[{"award-number":["#822746"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["J. Data and Information Quality"],"published-print":{"date-parts":[[2018,3,31]]},"abstract":"<jats:p>\n            During data preprocessing, analysts spend a significant part of their time and effort profiling the quality of the data along with cleansing and transforming the data for further analysis. While quality metrics\u2014ranging from general to domain-specific measures\u2014support assessment of the quality of a dataset, there are hardly any approaches to visually support the analyst in customizing and applying such metrics. Yet, visual approaches could facilitate users\u2019 involvement in data quality assessment. We present\n            <jats:italic>MetricDoc<\/jats:italic>\n            , an interactive environment for assessing data quality that provides customizable, reusable quality metrics in combination with immediate visual feedback. Moreover, we provide an overview visualization of these quality metrics along with error visualizations that facilitate interactive navigation of the data to determine the causes of quality issues present in the data. In this article, we describe the architecture, design, and evaluation of\n            <jats:italic>MetricDoc<\/jats:italic>\n            , which underwent several design cycles, including heuristic evaluation and expert reviews as well as a focus group with data quality, human-computer interaction, and visual analytics experts.\n          <\/jats:p>","DOI":"10.1145\/3190578","type":"journal-article","created":{"date-parts":[[2018,5,29]],"date-time":"2018-05-29T12:24:26Z","timestamp":1527596666000},"page":"1-26","update-policy":"http:\/\/dx.doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":21,"title":["Visual Interactive Creation, Customization, and Analysis of Data Quality Metrics"],"prefix":"10.1145","volume":"10","author":[{"given":"Christian","family":"Bors","sequence":"first","affiliation":[{"name":"Institute of Visual Computing 8 Human-Centered Technology, TU Wien, Austria"}]},{"given":"Theresia","family":"Gschwandtner","sequence":"additional","affiliation":[{"name":"Institute of Visual Computing 8 Human-Centered Technology, TU Wien, Austria"}]},{"given":"Simone","family":"Kriglstein","sequence":"additional","affiliation":[{"name":"Institute of Visual Computing 8 Human-Centered Technology, TU Wien, Austria"}]},{"given":"Silvia","family":"Miksch","sequence":"additional","affiliation":[{"name":"Institute of Visual Computing 8 Human-Centered Technology, TU Wien, Austria"}]},{"given":"Margit","family":"Pohl","sequence":"additional","affiliation":[{"name":"Institute of Visual Computing 8 Human-Centered Technology, TU Wien, Austria"}]}],"member":"320","published-online":{"date-parts":[[2018,5,29]]},"reference":[{"key":"e_1_2_2_1_1","volume-title":"A survey of data quality tools.Datenbank-Spektrum 14, 15--21","author":"Barateiro Jos\u00e9","year":"2005","unstructured":"Jos\u00e9 Barateiro and Helena Galhardas . 2005. A survey of data quality tools.Datenbank-Spektrum 14, 15--21 ( 2005 ), 48. Jos\u00e9 Barateiro and Helena Galhardas. 2005. A survey of data quality tools.Datenbank-Spektrum 14, 15--21 (2005), 48."},{"key":"e_1_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/1541880.1541883"},{"key":"e_1_2_2_3_1","volume-title":"Data Quality: Concepts, Methodologies and Techniques (Data-Centric Systems and Applications)","author":"Batini Carlo","year":"2006","unstructured":"Carlo Batini and Monica Scannapieco . 2006 . Data Quality: Concepts, Methodologies and Techniques (Data-Centric Systems and Applications) . Springer Verlag New York ,Secaucus, NJ. Carlo Batini and Monica Scannapieco. 2006. Data Quality: Concepts, Methodologies and Techniques (Data-Centric Systems and Applications). Springer Verlag New York,Secaucus, NJ."},{"key":"e_1_2_2_4_1","volume-title":"Proceedings of SIGRAD 2012: Interactive Visual Analysis of Data. 39--48","author":"Bernard J\u00fcrgen","year":"2012","unstructured":"J\u00fcrgen Bernard , Tobias Ruppert , Oliver Goroll , Thorsten May , and J\u00f6rn Kohlhammer . 2012 . Visual-interactive preprocessing of time series data . In Proceedings of SIGRAD 2012: Interactive Visual Analysis of Data. 39--48 . J\u00fcrgen Bernard, Tobias Ruppert, Oliver Goroll, Thorsten May, and J\u00f6rn Kohlhammer. 2012. Visual-interactive preprocessing of time series data. In Proceedings of SIGRAD 2012: Interactive Visual Analysis of Data. 39--48."},{"key":"e_1_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2011.185"},{"key":"e_1_2_2_6_1","volume-title":"A Guide to the Business Analysis Body of Knowledger","author":"Brennan K.","unstructured":"K. Brennan . 2009. A Guide to the Business Analysis Body of Knowledger . International Institute of Business Analysis . K. Brennan. 2009. A Guide to the Business Analysis Body of Knowledger. International Institute of Business Analysis."},{"key":"e_1_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/2740908.2742135"},{"key":"e_1_2_2_8_1","volume-title":"Case Method Fast-Track: A Rad Approach","author":"Clegg Dai","unstructured":"Dai Clegg and Richard Barker . 1994. Case Method Fast-Track: A Rad Approach . Addison-Wesley Longman Publishing Co. Dai Clegg and Richard Barker. 1994. Case Method Fast-Track: A Rad Approach. Addison-Wesley Longman Publishing Co."},{"key":"e_1_2_2_9_1","volume-title":"Understanding Your Users: A Practical Guide to User Requirements Methods, Tools, and Techniques","author":"Courage Catherine","unstructured":"Catherine Courage and Kathy Baxter . 2004. Understanding Your Users: A Practical Guide to User Requirements Methods, Tools, and Techniques . Morgan Kaufmann Publishers . Catherine Courage and Kathy Baxter. 2004. Understanding Your Users: A Practical Guide to User Requirements Methods, Tools, and Techniques. Morgan Kaufmann Publishers."},{"key":"e_1_2_2_10_1","volume-title":"Handbook of Data Quality","author":"Dasu Tamraparni","unstructured":"Tamraparni Dasu . 2013. Data glitches: Monsters in your data . In Handbook of Data Quality , Shazia Sadiq (Ed.). Springer , Berlin , 163--178. Tamraparni Dasu. 2013. Data glitches: Monsters in your data. In Handbook of Data Quality, Shazia Sadiq (Ed.). Springer, Berlin, 163--178."},{"key":"e_1_2_2_11_1","unstructured":"DataTables. 2017. \n      DataTables\n       &verbar; Table plug-in for j\n      Query\n    .\n   Retrieved from https:\/\/datatables.net\/ (accessed \n  May\n  2017\n  ).  DataTables. 2017. DataTables &verbar; Table plug-in for jQuery. Retrieved from https:\/\/datatables.net\/ (accessed May 2017)."},{"key":"e_1_2_2_12_1","unstructured":"Jeremy Debattista Makx Dekkers Christophe Guret Deirdre Lee Nandana Mihindukulasooriya and Amrapali Zaveri. 2016. Data on the Web Best Practices: Data Quality Vocabulary. Retrieved from https:\/\/www.w3.org\/TR\/vocab-dqv\/.  Jeremy Debattista Makx Dekkers Christophe Guret Deirdre Lee Nandana Mihindukulasooriya and Amrapali Zaveri. 2016. Data on the Web Best Practices: Data Quality Vocabulary. Retrieved from https:\/\/www.w3.org\/TR\/vocab-dqv\/."},{"key":"e_1_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/1842993.1843029"},{"key":"e_1_2_2_15_1","volume-title":"Scapin","author":"Freitas Carla M. D. S.","year":"2014","unstructured":"Carla M. D. S. Freitas , Marcelo S. Pimenta , and Dominique L . Scapin . 2014 . User-centered evaluation of information visualization techniques: Making the HCI-InfoVis connection explicit. In Handbook of Human Centric Visualization, Weidong Huang (Ed.). Springer , 315--336. Carla M. D. S. Freitas, Marcelo S. Pimenta, and Dominique L. Scapin. 2014. User-centered evaluation of information visualization techniques: Making the HCI-InfoVis connection explicit. In Handbook of Human Centric Visualization, Weidong Huang (Ed.). Springer, 315--336."},{"key":"e_1_2_2_16_1","volume-title":"The Essential Guide to User Interface Design: An Introduction to GUI Design Principles and Techniques","author":"Galitz Wilbert O.","unstructured":"Wilbert O. Galitz . 2007. The Essential Guide to User Interface Design: An Introduction to GUI Design Principles and Techniques . Wiley 8 Sons. Wilbert O. Galitz. 2007. The Essential Guide to User Interface Design: An Introduction to GUI Design Principles and Techniques. Wiley 8 Sons."},{"key":"e_1_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/2637748.2638423"},{"key":"e_1_2_2_18_1","series-title":"Lecture Notes in Computer Science (LNCS 7465): Multidisciplinary Research and Practice for Information Systems (Proceedings of the CD-ARES\u201912)","volume-title":"A taxonomy of dirty timeoriented data","author":"Gschwandtner Theresia","unstructured":"Theresia Gschwandtner , Johannes G\u00e4rtner , Wolfgang Aigner , and Silvia Miksch . 2012. A taxonomy of dirty timeoriented data . In Lecture Notes in Computer Science (LNCS 7465): Multidisciplinary Research and Practice for Information Systems (Proceedings of the CD-ARES\u201912) , Gerald Quirchmayr, Josef Basl, Ilsun You, Lida Xu, and EdgarWeippl (Eds.). Springer , Berlin , 58--72. Theresia Gschwandtner, Johannes G\u00e4rtner, Wolfgang Aigner, and Silvia Miksch. 2012. A taxonomy of dirty timeoriented data. In Lecture Notes in Computer Science (LNCS 7465): Multidisciplinary Research and Practice for Information Systems (Proceedings of the CD-ARES\u201912), Gerald Quirchmayr, Josef Basl, Ilsun You, Lida Xu, and EdgarWeippl (Eds.). Springer, Berlin, 58--72."},{"key":"e_1_2_2_19_1","volume-title":"Pyla","author":"Hartson Rex","year":"2012","unstructured":"Rex Hartson and Pardha A . Pyla . 2012 . The UX Book: Process and Guidelines for Ensuring a Quality User Experience. Morgan Kaufmann . Rex Hartson and Pardha A. Pyla. 2012. The UX Book: Process and Guidelines for Ensuring a Quality User Experience. Morgan Kaufmann."},{"key":"e_1_2_2_20_1","volume-title":"Conference on Innovative Data Systems Research (CIDR\u201915)","author":"Heer Jeffrey","year":"2015","unstructured":"Jeffrey Heer , Joseph Hellerstein , and Sean Kandel . 2015 . Predictive interaction for data transformation . In Conference on Innovative Data Systems Research (CIDR\u201915) . Jeffrey Heer, Joseph Hellerstein, and Sean Kandel. 2015. Predictive interaction for data transformation. In Conference on Innovative Data Systems Research (CIDR\u201915)."},{"key":"e_1_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2013.126"},{"key":"e_1_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1177\/1473871611415994"},{"key":"e_1_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/1978942.1979444"},{"key":"e_1_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/2254556.2254659"},{"key":"e_1_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/2945.981847"},{"key":"e_1_2_2_26_1","volume-title":"Human-Centered Visualization Environments (Lecture Notes in Computer Science), Andreas Kerren, Achim Ebert, and J\u00f6rg Meyer (Eds.)","author":"Kerren Andreas","unstructured":"Andreas Kerren , Achim Ebert , and J\u00f6rg Meyer . 2006. Introduction to human-centered visualization environments . In Human-Centered Visualization Environments (Lecture Notes in Computer Science), Andreas Kerren, Achim Ebert, and J\u00f6rg Meyer (Eds.) . Springer , 1--9. Andreas Kerren, Achim Ebert, and J\u00f6rg Meyer. 2006. Introduction to human-centered visualization environments. In Human-Centered Visualization Environments (Lecture Notes in Computer Science), Andreas Kerren, Achim Ebert, and J\u00f6rg Meyer (Eds.). Springer, 1--9."},{"key":"e_1_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1021564703268"},{"key":"e_1_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1136\/bmj.311.7000.299"},{"key":"e_1_2_2_29_1","volume-title":"Handbook of Human Centric Visualization","author":"Kriglstein Simone","unstructured":"Simone Kriglstein , Margit Pohl , and Michael Smuc . 2014. Pep up your time machine: Recommendations for the design of information visualizations of time-dependent data . In Handbook of Human Centric Visualization , Weidong Huang (Ed.). Springer New York , 203--225. Simone Kriglstein, Margit Pohl, and Michael Smuc. 2014. Pep up your time machine: Recommendations for the design of information visualizations of time-dependent data. In Handbook of Human Centric Visualization, Weidong Huang (Ed.). Springer New York, 203--225."},{"key":"e_1_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/2669557.2669571"},{"key":"e_1_2_2_31_1","volume-title":"Human centered design in practice: A case study with the ontology visualization tool knoocks","author":"Kriglstein Simone","unstructured":"Simone Kriglstein and G\u00fcnter Wallner . 2013. Human centered design in practice: A case study with the ontology visualization tool knoocks . In Computer Vision, Imaging and Computer Graphics. Theory and Applications, Gabriela Csurka, Martin Kraus, Leonid Mestetskiy, Paul Richard, and Jos Braz (Eds.). Springer , 123--141. Simone Kriglstein and G\u00fcnter Wallner. 2013. Human centered design in practice: A case study with the ontology visualization tool knoocks. In Computer Vision, Imaging and Computer Graphics. Theory and Applications, Gabriela Csurka, Martin Kraus, Leonid Mestetskiy, Paul Richard, and Jos Braz (Eds.). Springer, 123--141."},{"key":"e_1_2_2_32_1","volume-title":"Wassink","author":"Kulyk Olga A.","year":"2006","unstructured":"Olga A. Kulyk , Robert Kosara , Jaime Urquiza-Fuentes , and Ingo H. C . Wassink . 2006 . Human-centered aspects. In Human-Centered Visualization Environments (Lecture Notes in Computer Science), Andreas Kerren, Achim Ebert, and J\u00f6rg Meyer (Eds.). Springer, 13--75. Olga A. Kulyk, Robert Kosara, Jaime Urquiza-Fuentes, and Ingo H. C. Wassink. 2006. Human-centered aspects. In Human-Centered Visualization Environments (Lecture Notes in Computer Science), Andreas Kerren, Achim Ebert, and J\u00f6rg Meyer (Eds.). Springer, 13--75."},{"key":"e_1_2_2_33_1","volume-title":"Proceedings of the IFIP 17th World Computer Congress - TC13 Stream on Usability: Gaining a Competitive Edge. Kluwer, B.V., 133--148","author":"Maguire Martin","year":"2002","unstructured":"Martin Maguire and Nigel Bevan . 2002 . User requirements analysis: A review of supporting methods . In Proceedings of the IFIP 17th World Computer Congress - TC13 Stream on Usability: Gaining a Competitive Edge. Kluwer, B.V., 133--148 . Martin Maguire and Nigel Bevan. 2002. User requirements analysis: A review of supporting methods. In Proceedings of the IFIP 17th World Computer Congress - TC13 Stream on Usability: Gaining a Competitive Edge. Kluwer, B.V., 133--148."},{"key":"e_1_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cag.2013.11.002"},{"key":"e_1_2_2_36_1","series-title":"A K Peters Visualization Series","volume-title":"Visualization Analysis and Design","author":"Munzner Tamara","unstructured":"Tamara Munzner . 2014. Visualization Analysis and Design . A K Peters Visualization Series , CRC Press . Tamara Munzner. 2014. Visualization Analysis and Design. A K Peters Visualization Series, CRC Press."},{"key":"e_1_2_2_37_1","volume-title":"Usability Inspection Methods","author":"Nielsen Jakob","unstructured":"Jakob Nielsen . 1994. Usability Inspection Methods . Wiley 8 Sons, Inc., Chapter Heuristic Evaluation, 25--62. Jakob Nielsen. 1994. Usability Inspection Methods. Wiley 8 Sons, Inc., Chapter Heuristic Evaluation, 25--62."},{"key":"e_1_2_2_38_1","unstructured":"Paulo Oliveira F\u00e1tima Rodrigues and Pedro Rangel Henriques. 2005. A formal definition of data quality problems. In IQ.  Paulo Oliveira F\u00e1tima Rodrigues and Pedro Rangel Henriques. 2005. A formal definition of data quality problems. In IQ."},{"key":"e_1_2_2_39_1","unstructured":"Open\n      Refine\n    . 2017. OpenRefine.\n   Retrieved from https:\/\/github.com\/OpenRefine\/OpenRefine https:\/\/github.com\/OpenRefine\/OpenRefine (accessed \n  May\n  2017\n  ).  Open Refine. 2017. OpenRefine. Retrieved from https:\/\/github.com\/OpenRefine\/OpenRefine https:\/\/github.com\/OpenRefine\/OpenRefine (accessed May 2017)."},{"key":"e_1_2_2_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/505248.506010"},{"key":"e_1_2_2_41_1","doi-asserted-by":"publisher","DOI":"10.1093\/intqhc\/8.5.499"},{"key":"e_1_2_2_42_1","volume-title":"R: A Language and Environment for Statistical Computing","author":"Team R Core","year":"2017","unstructured":"R Core Team . 2017 . R: A Language and Environment for Statistical Computing . R Foundation for Statistical Computing, Vienna, Austria . Retrieved from https:\/\/www.R-project.org. R Core Team. 2017. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. Retrieved from https:\/\/www.R-project.org."},{"key":"e_1_2_2_43_1","first-page":"4","article-title":"Data cleaning: Problems and current approaches","volume":"23","author":"Rahm Erhard","year":"2000","unstructured":"Erhard Rahm and Hong-Hai Do . 2000 . Data cleaning: Problems and current approaches . IEEE Bulletin of the Technical Committee on Data Engineering 23 , 4 (March 2000), 3--13. Erhard Rahm and Hong-Hai Do. 2000. Data cleaning: Problems and current approaches. IEEE Bulletin of the Technical Committee on Data Engineering 23, 4 (March 2000), 3--13.","journal-title":"IEEE Bulletin of the Technical Committee on Data Engineering"},{"key":"e_1_2_2_44_1","doi-asserted-by":"publisher","DOI":"10.1145\/191666.191776"},{"key":"e_1_2_2_45_1","doi-asserted-by":"crossref","unstructured":"Shazia Sadiq (Ed.). 2013. Handbook of Data Quality. Springer Verlag Berlin.   Shazia Sadiq (Ed.). 2013. Handbook of Data Quality. Springer Verlag Berlin.","DOI":"10.1007\/978-3-642-36257-6"},{"key":"e_1_2_2_46_1","volume-title":"Qualitative Content Analysis in Practice","author":"Schreier Margrit","unstructured":"Margrit Schreier . 2012. Qualitative Content Analysis in Practice . SAGE Publications . Margrit Schreier. 2012. Qualitative Content Analysis in Practice. SAGE Publications."},{"key":"e_1_2_2_47_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2012.213"},{"key":"e_1_2_2_48_1","doi-asserted-by":"publisher","DOI":"10.1080\/10447318.2012.687676"},{"key":"e_1_2_2_49_1","doi-asserted-by":"publisher","DOI":"10.1145\/2669557.2669580"},{"key":"e_1_2_2_50_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2004.1260759"},{"key":"e_1_2_2_51_1","unstructured":"Trifacta. 2016. Trifacta Wrangler. Retrieved from https:\/\/www.trifacta.com\/trifacta-wrangler\/.  Trifacta. 2016. Trifacta Wrangler. Retrieved from https:\/\/www.trifacta.com\/trifacta-wrangler\/."},{"key":"e_1_2_2_52_1","unstructured":"Edward R. Tufte. 2006. Beautiful Evidence. Graphics Press.   Edward R. Tufte. 2006. Beautiful Evidence. Graphics Press."},{"key":"e_1_2_2_53_1","doi-asserted-by":"publisher","DOI":"10.1145\/1168149.1168162"}],"container-title":["Journal of Data and Information Quality"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3190578","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,31]],"date-time":"2022-12-31T21:05:47Z","timestamp":1672520747000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3190578"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,3,31]]},"references-count":51,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2018,3,31]]}},"alternative-id":["10.1145\/3190578"],"URL":"https:\/\/doi.org\/10.1145\/3190578","relation":{},"ISSN":["1936-1955","1936-1963"],"issn-type":[{"value":"1936-1955","type":"print"},{"value":"1936-1963","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,3,31]]},"assertion":[{"value":"2018-07-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2018-02-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2018-05-29","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}