{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,8]],"date-time":"2026-04-08T08:54:29Z","timestamp":1775638469689,"version":"3.50.1"},"reference-count":81,"publisher":"Association for Computing Machinery (ACM)","issue":"13","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2022,9]]},"abstract":"<jats:p>When exploring a new dataset, Data Scientists often apply analysis queries, look for insights in the resulting dataframe, and repeat to apply further queries. We propose in this paper a novel solution that assists data scientists in this laborious process. In a nutshell, our solution pinpoints the most interesting (sets of) rows in each obtained dataframe. Uniquely, our definition of interest is based on the contribution of each row to the interestingness of different columns of the entire dataframe, which, in turn, is defined using standard measures such as diversity and exceptionality. Intuitively, interesting rows are ones that explain why (some column of) the analysis query result is interesting as a whole. Rows are correlated in their contribution and so the interesting score for a set of rows may not be directly computed based on that of individual rows. We address the resulting computational challenge by restricting attention to semantically-related sets, based on multiple notions of semantic relatedness; these sets serve as more informative explanations. Our experimental study across multiple real-world datasets shows the usefulness of our system in various scenarios.<\/jats:p>","DOI":"10.14778\/3565838.3565841","type":"journal-article","created":{"date-parts":[[2023,1,20]],"date-time":"2023-01-20T23:09:56Z","timestamp":1674256196000},"page":"3854-3868","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":11,"title":["FEDEX"],"prefix":"10.14778","volume":"15","author":[{"given":"Daniel","family":"Deutch","sequence":"first","affiliation":[{"name":"Tel Aviv University"}]},{"given":"Amir","family":"Gilad","sequence":"additional","affiliation":[{"name":"Duke University"}]},{"given":"Tova","family":"Milo","sequence":"additional","affiliation":[{"name":"Tel Aviv University"}]},{"given":"Amit","family":"Mualem","sequence":"additional","affiliation":[{"name":"Tel Aviv University"}]},{"given":"Amit","family":"Somech","sequence":"additional","affiliation":[{"name":"Bar-Ilan University"}]}],"member":"320","published-online":{"date-parts":[[2023,1,20]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining. 6--15","author":"Agarwal Deepak","year":"2007","unstructured":"Deepak Agarwal , Dhiman Barman , Dimitrios Gunopulos , Neal E Young , Flip Korn , and Divesh Srivastava . 2007 . Efficient and effective explanation of change in hierarchical summaries . In Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining. 6--15 . Deepak Agarwal, Dhiman Barman, Dimitrios Gunopulos, Neal E Young, Flip Korn, and Divesh Srivastava. 2007. Efficient and effective explanation of change in hierarchical summaries. In Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining. 6--15."},{"key":"e_1_2_1_2_1","volume-title":"Proceedings of the thirtieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems. 153--164","author":"Amsterdamer Yael","year":"2011","unstructured":"Yael Amsterdamer , Daniel Deutch , and Val Tannen . 2011 . Provenance for aggregate queries . In Proceedings of the thirtieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems. 153--164 . Yael Amsterdamer, Daniel Deutch, and Val Tannen. 2011. Provenance for aggregate queries. In Proceedings of the thirtieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems. 153--164."},{"key":"e_1_2_1_3_1","doi-asserted-by":"crossref","first-page":"939","DOI":"10.14778\/3380750.3380762","article-title":"On detecting cherry-picked trendlines","volume":"13","author":"Asudeh Abolfazl","year":"2020","unstructured":"Abolfazl Asudeh , Hosagrahar Visvesvaraya Jagadish , You Wu , and Cong Yu . 2020 . On detecting cherry-picked trendlines . Proceedings of the VLDB Endowment 13 , 6 (2020), 939 -- 952 . Abolfazl Asudeh, Hosagrahar Visvesvaraya Jagadish, You Wu, and Cong Yu. 2020. On detecting cherry-picked trendlines. Proceedings of the VLDB Endowment 13, 6 (2020), 939--952.","journal-title":"Proceedings of the VLDB Endowment"},{"key":"e_1_2_1_4_1","volume-title":"Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data. 871--876","author":"Bao Zhifeng","year":"2015","unstructured":"Zhifeng Bao , Yong Zeng , HV Jagadish , and Tok Wang Ling . 2015 . Exploratory keyword search with interactive input . In Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data. 871--876 . Zhifeng Bao, Yong Zeng, HV Jagadish, and Tok Wang Ling. 2015. Exploratory keyword search with interactive input. In Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data. 871--876."},{"key":"e_1_2_1_5_1","doi-asserted-by":"crossref","unstructured":"Ori Bar El Tova Milo and Amit Somech. 2020. Automatically generating data exploration sessions using deep reinforcement learning. In SIGMOD. 1527--1537.  Ori Bar El Tova Milo and Amit Somech. 2020. Automatically generating data exploration sessions using deep reinforcement learning. In SIGMOD. 1527--1537.","DOI":"10.1145\/3318464.3389779"},{"key":"e_1_2_1_6_1","doi-asserted-by":"crossref","first-page":"507","DOI":"10.1145\/2714064.2660207","article-title":"Checkcell: Data debugging for spreadsheets","volume":"49","author":"Barowy Daniel W","year":"2014","unstructured":"Daniel W Barowy , Dimitar Gochev , and Emery D Berger . 2014 . Checkcell: Data debugging for spreadsheets . ACM SIGPLAN Notices 49 , 10 (2014), 507 -- 523 . Daniel W Barowy, Dimitar Gochev, and Emery D Berger. 2014. Checkcell: Data debugging for spreadsheets. ACM SIGPLAN Notices 49, 10 (2014), 507--523.","journal-title":"ACM SIGPLAN Notices"},{"key":"e_1_2_1_7_1","doi-asserted-by":"crossref","first-page":"285","DOI":"10.1177\/109442810033005","article-title":"On the use of the coefficient of variation as a measure of diversity","volume":"3","author":"Bedeian Arthur G","year":"2000","unstructured":"Arthur G Bedeian and Kevin W Mossholder . 2000 . On the use of the coefficient of variation as a measure of diversity . Organizational Research Methods 3 , 3 (2000), 285 -- 297 . Arthur G Bedeian and Kevin W Mossholder. 2000. On the use of the coefficient of variation as a measure of diversity. Organizational Research Methods 3, 3 (2000), 285--297.","journal-title":"Organizational Research Methods"},{"key":"e_1_2_1_8_1","volume-title":"Proceedings of the 29th ACM International Conference on Information & Knowledge Management. 3261--3264","author":"Behar Rachel","year":"2020","unstructured":"Rachel Behar and Sara Cohen . 2020 . Optimal End-Biased Histograms for Hierarchical Data . In Proceedings of the 29th ACM International Conference on Information & Knowledge Management. 3261--3264 . Rachel Behar and Sara Cohen. 2020. Optimal End-Biased Histograms for Hierarchical Data. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management. 3261--3264."},{"key":"e_1_2_1_9_1","unstructured":"Rachel Behar and Sara Cohen. 2020. Optimal Histograms with Outliers.. In Extending database technology (EDBT). 181--192.  Rachel Behar and Sara Cohen. 2020. Optimal Histograms with Outliers.. In Extending database technology (EDBT). 181--192."},{"key":"e_1_2_1_10_1","first-page":"1529","article-title":"ExRank: An exploratory ranking interface","volume":"9","author":"Bespinyowong Ramon","year":"2016","unstructured":"Ramon Bespinyowong , Wei Chen , HV Jagadish , and Yuxin Ma . 2016 . ExRank: An exploratory ranking interface . PVLBD 9 , 13 (2016), 1529 -- 1532 . Ramon Bespinyowong, Wei Chen, HV Jagadish, and Yuxin Ma. 2016. ExRank: An exploratory ranking interface. PVLBD 9, 13 (2016), 1529--1532.","journal-title":"PVLBD"},{"key":"e_1_2_1_11_1","volume-title":"Proceedings of the 24th ACM International on Conference on Information and Knowledge Management. 713--722","author":"Bidoit Nicole","year":"2015","unstructured":"Nicole Bidoit , Melanie Herschel , and Aikaterini Tzompanaki . 2015 . Efficient computation of polynomial explanations of why-not questions . In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management. 713--722 . Nicole Bidoit, Melanie Herschel, and Aikaterini Tzompanaki. 2015. Efficient computation of polynomial explanations of why-not questions. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management. 713--722."},{"key":"e_1_2_1_12_1","unstructured":"Nicole Bidoit Melanie Herschel and Katerina Tzompanaki. 2014. Query-based why-not provenance with nedexplain. In Extending database technology (EDBT).  Nicole Bidoit Melanie Herschel and Katerina Tzompanaki. 2014. Query-based why-not provenance with nedexplain. In Extending database technology (EDBT)."},{"key":"e_1_2_1_13_1","volume-title":"Proceedings 17th international conference on data engineering. IEEE, 421--430","author":"Borzsony Stephan","year":"2001","unstructured":"Stephan Borzsony , Donald Kossmann , and Konrad Stocker . 2001 . The skyline operator . In Proceedings 17th international conference on data engineering. IEEE, 421--430 . Stephan Borzsony, Donald Kossmann, and Konrad Stocker. 2001. The skyline operator. In Proceedings 17th international conference on data engineering. IEEE, 421--430."},{"key":"e_1_2_1_14_1","unstructured":"Stan Brown. 2011. Measures of shape: Skewness and kurtosis.  Stan Brown. 2011. Measures of shape: Skewness and kurtosis."},{"key":"e_1_2_1_15_1","doi-asserted-by":"crossref","unstructured":"P. Buneman S. Khanna and W.C. Tan. 2001. Why and Where: A Characterization of Data Provenance. In ICDT. 316--330.  P. Buneman S. Khanna and W.C. Tan. 2001. Why and Where: A Characterization of Data Provenance. In ICDT. 316--330.","DOI":"10.1007\/3-540-44503-X_20"},{"key":"e_1_2_1_16_1","doi-asserted-by":"crossref","first-page":"355","DOI":"10.1007\/s10115-006-0039-1","article-title":"Summarization-compressing data into an informative representation","volume":"12","author":"Chandola Varun","year":"2007","unstructured":"Varun Chandola and Vipin Kumar . 2007 . Summarization-compressing data into an informative representation . Knowledge and Information Systems 12 , 3 (2007), 355 -- 378 . Varun Chandola and Vipin Kumar. 2007. Summarization-compressing data into an informative representation. Knowledge and Information Systems 12, 3 (2007), 355--378.","journal-title":"Knowledge and Information Systems"},{"key":"e_1_2_1_17_1","volume-title":"Proceedings of the 2009 ACM SIGMOD International Conference on Management of data. 523--534","author":"Chapman Adriane","year":"2009","unstructured":"Adriane Chapman and HV Jagadish . 2009 . Why not? . In Proceedings of the 2009 ACM SIGMOD International Conference on Management of data. 523--534 . Adriane Chapman and HV Jagadish. 2009. Why not?. In Proceedings of the 2009 ACM SIGMOD International Conference on Management of data. 523--534."},{"key":"e_1_2_1_18_1","volume-title":"Text-to-viz: Automatic generation of infographics from proportion-related natural language statements","author":"Cui Weiwei","year":"2019","unstructured":"Weiwei Cui , Xiaoyu Zhang , Yun Wang , He Huang , Bei Chen , Lei Fang , Haidong Zhang , Jian-Guan Lou , and Dongmei Zhang . 2019 . Text-to-viz: Automatic generation of infographics from proportion-related natural language statements . IEEE transactions on visualization and computer graphics 26, 1 (2019), 906--916. Weiwei Cui, Xiaoyu Zhang, Yun Wang, He Huang, Bei Chen, Lei Fang, Haidong Zhang, Jian-Guan Lou, and Dongmei Zhang. 2019. Text-to-viz: Automatic generation of infographics from proportion-related natural language statements. IEEE transactions on visualization and computer graphics 26, 1 (2019), 906--916."},{"key":"e_1_2_1_19_1","unstructured":"Credit Card Customers Dataset. 2021. https:\/\/www.kaggle.com\/sakshigoyal7\/credit-card-customers\/tasks?taskId=2729.  Credit Card Customers Dataset. 2021. https:\/\/www.kaggle.com\/sakshigoyal7\/credit-card-customers\/tasks?taskId=2729."},{"key":"e_1_2_1_20_1","unstructured":"Spotify Dataset. 2021. https:\/\/www.kaggle.com\/mrmorj\/dataset-of-songs-in-spotify.  Spotify Dataset. 2021. https:\/\/www.kaggle.com\/mrmorj\/dataset-of-songs-in-spotify."},{"key":"e_1_2_1_21_1","volume-title":"Advances in Intelligent","author":"Bie Tijl De","unstructured":"Tijl De Bie . 2013. Subjective interestingness in exploratory data mining . In Advances in Intelligent Data Analysis XII. Springer , 19--31. Tijl De Bie. 2013. Subjective interestingness in exploratory data mining. In Advances in Intelligent Data Analysis XII. Springer, 19--31."},{"key":"e_1_2_1_22_1","first-page":"577","article-title":"Provenance for Natural Language Queries","volume":"10","author":"Deutch Daniel","year":"2017","unstructured":"Daniel Deutch , Nave Frost , and Amir Gilad . 2017 . Provenance for Natural Language Queries . PVLDB 10 , 5 (2017), 577 -- 588 . Daniel Deutch, Nave Frost, and Amir Gilad. 2017. Provenance for Natural Language Queries. PVLDB 10, 5 (2017), 577--588.","journal-title":"PVLDB"},{"key":"e_1_2_1_23_1","volume-title":"2016 IEEE 32nd International Conference on Data Engineering (ICDE). IEEE, 1358--1361","author":"Deutch Daniel","year":"2016","unstructured":"Daniel Deutch and Amir Gilad . 2016 . Qplain: Query by explanation . In 2016 IEEE 32nd International Conference on Data Engineering (ICDE). IEEE, 1358--1361 . Daniel Deutch and Amir Gilad. 2016. Qplain: Query by explanation. In 2016 IEEE 32nd International Conference on Data Engineering (ICDE). IEEE, 1358--1361."},{"key":"e_1_2_1_24_1","volume-title":"AIDE: An Active Learning-based Approach for Interactive Data Exploration. TKDE","author":"Dimitriadou Kyriaki","year":"2016","unstructured":"Kyriaki Dimitriadou , Olga Papaemmanouil , and Yanlei Diao . 2016 . AIDE: An Active Learning-based Approach for Interactive Data Exploration. TKDE (2016). Kyriaki Dimitriadou, Olga Papaemmanouil, and Yanlei Diao. 2016. AIDE: An Active Learning-based Approach for Interactive Data Exploration. TKDE (2016)."},{"key":"e_1_2_1_25_1","volume-title":"Proceedings of the 2019 International Conference on Management of Data. 317--332","author":"Ding Rui","year":"2019","unstructured":"Rui Ding , Shi Han , Yong Xu , Haidong Zhang , and Dongmei Zhang . 2019 . Quick-insights: Quick and automatic discovery of insights from multi-dimensional data . In Proceedings of the 2019 International Conference on Management of Data. 317--332 . Rui Ding, Shi Han, Yong Xu, Haidong Zhang, and Dongmei Zhang. 2019. Quick-insights: Quick and automatic discovery of insights from multi-dimensional data. In Proceedings of the 2019 International Conference on Management of Data. 317--332."},{"key":"e_1_2_1_26_1","volume-title":"Proceedings of the 22nd international conference on World Wide Web. 379--390","author":"Dong Xin Luna","year":"2013","unstructured":"Xin Luna Dong and Divesh Srivastava . 2013 . Compact explanation of data fusion decisions . In Proceedings of the 22nd international conference on World Wide Web. 379--390 . Xin Luna Dong and Divesh Srivastava. 2013. Compact explanation of data fusion decisions. In Proceedings of the 22nd international conference on World Wide Web. 379--390."},{"key":"e_1_2_1_27_1","volume-title":"Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data. 2741--2744","author":"Fariha Anna","year":"2020","unstructured":"Anna Fariha , Ashish Tiwari , Arjun Radhakrishna , and Sumit Gulwani . 2020 . Extune: Explaining tuple non-conformance . In Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data. 2741--2744 . Anna Fariha, Ashish Tiwari, Arjun Radhakrishna, and Sumit Gulwani. 2020. Extune: Explaining tuple non-conformance. In Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data. 2741--2744."},{"key":"e_1_2_1_28_1","doi-asserted-by":"crossref","first-page":"9","DOI":"10.1145\/1132960.1132963","article-title":"Interestingness measures for data mining: A survey","volume":"38","author":"Geng Liqiang","year":"2006","unstructured":"Liqiang Geng and Howard J Hamilton . 2006 . Interestingness measures for data mining: A survey . ACM Computing Surveys (CSUR) 38 , 3 (2006), 9 --es. Liqiang Geng and Howard J Hamilton. 2006. Interestingness measures for data mining: A survey. ACM Computing Surveys (CSUR) 38, 3 (2006), 9--es.","journal-title":"ACM Computing Surveys (CSUR)"},{"key":"e_1_2_1_29_1","doi-asserted-by":"crossref","unstructured":"T.J. Green G. Karvounarakis and V. Tannen. 2007. Provenance semirings. In PODS. 31--40.  T.J. Green G. Karvounarakis and V. Tannen. 2007. Provenance semirings. In PODS. 31--40.","DOI":"10.1145\/1265530.1265535"},{"key":"e_1_2_1_30_1","volume-title":"Mark Wiebe, Pearu Peterson, Pierre G\u00e9rard-Marchant, Kevin Sheppard, Tyler Reddy, Warren Weckesser, Hameer Abbasi, Christoph Gohlke, and Travis E. Oliphant.","author":"Harris Charles R.","year":"2020","unstructured":"Charles R. Harris , K. Jarrod Millman , St\u00e9fan J. van der Walt , Ralf Gommers , Pauli Virtanen , David Cournapeau , Eric Wieser , Julian Taylor , Sebastian Berg , Nathaniel J. Smith , Robert Kern , Matti Picus , Stephan Hoyer , Marten H. van Kerkwijk , Matthew Brett , Allan Haldane , Jaime Fern\u00e1ndez del R\u00edo , Mark Wiebe, Pearu Peterson, Pierre G\u00e9rard-Marchant, Kevin Sheppard, Tyler Reddy, Warren Weckesser, Hameer Abbasi, Christoph Gohlke, and Travis E. Oliphant. 2020 . Array programming with NumPy. Nature 585, 7825 (2020), 357--362. Charles R. Harris, K. Jarrod Millman, St\u00e9fan J. van der Walt, Ralf Gommers, Pauli Virtanen, David Cournapeau, Eric Wieser, Julian Taylor, Sebastian Berg, Nathaniel J. Smith, Robert Kern, Matti Picus, Stephan Hoyer, Marten H. van Kerkwijk, Matthew Brett, Allan Haldane, Jaime Fern\u00e1ndez del R\u00edo, Mark Wiebe, Pearu Peterson, Pierre G\u00e9rard-Marchant, Kevin Sheppard, Tyler Reddy, Warren Weckesser, Hameer Abbasi, Christoph Gohlke, and Travis E. Oliphant. 2020. Array programming with NumPy. Nature 585, 7825 (2020), 357--362."},{"key":"e_1_2_1_31_1","volume-title":"Knowledge discovery and measures of interest","author":"Hilderman Robert J","unstructured":"Robert J Hilderman and Howard J Hamilton . 2013. Knowledge discovery and measures of interest . Vol. 638 . Springer Science & Business Media . Robert J Hilderman and Howard J Hamilton. 2013. Knowledge discovery and measures of interest. Vol. 638. Springer Science & Business Media."},{"key":"e_1_2_1_32_1","first-page":"1761","article-title":"LERI: Local Exploration for Rare-Category Identification","volume":"32","author":"Huang Hao","year":"2019","unstructured":"Hao Huang , Qian Yan , Wei Lu , Huaizhong Lin , Yunjun Gao , and Lei Chen . 2019 . LERI: Local Exploration for Rare-Category Identification . IEEE Transactions on Knowledge and Data Engineering 32 , 9 (2019), 1761 -- 1772 . Hao Huang, Qian Yan, Wei Lu, Huaizhong Lin, Yunjun Gao, and Lei Chen. 2019. LERI: Local Exploration for Rare-Category Identification. IEEE Transactions on Knowledge and Data Engineering 32, 9 (2019), 1761--1772.","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"key":"e_1_2_1_33_1","doi-asserted-by":"crossref","first-page":"90","DOI":"10.1109\/MCSE.2007.55","article-title":"Matplotlib: A 2D graphics environment","volume":"9","author":"Hunter J. D.","year":"2007","unstructured":"J. D. Hunter . 2007 . Matplotlib: A 2D graphics environment . Computing in Science & Engineering 9 , 3 (2007), 90 -- 95 . J. D. Hunter. 2007. Matplotlib: A 2D graphics environment. Computing in Science & Engineering 9, 3 (2007), 90--95.","journal-title":"Computing in Science & Engineering"},{"key":"e_1_2_1_34_1","volume-title":"Proceedings of the 2004 ACM SIGMOD international conference on Management of data. 647--658","author":"Ilyas Ihab F","year":"2004","unstructured":"Ihab F Ilyas , Volker Markl , Peter Haas , Paul Brown , and Ashraf Aboulnaga . 2004 . CORDS: Automatic discovery of correlations and soft functional dependencies . In Proceedings of the 2004 ACM SIGMOD international conference on Management of data. 647--658 . Ihab F Ilyas, Volker Markl, Peter Haas, Paul Brown, and Ashraf Aboulnaga. 2004. CORDS: Automatic discovery of correlations and soft functional dependencies. In Proceedings of the 2004 ACM SIGMOD international conference on Management of data. 647--658."},{"key":"e_1_2_1_35_1","doi-asserted-by":"crossref","first-page":"422","DOI":"10.1145\/582415.582418","article-title":"Cumulated gain-based evaluation of IR techniques","volume":"20","author":"J\u00e4rvelin Kalervo","year":"2002","unstructured":"Kalervo J\u00e4rvelin and Jaana Kek\u00e4l\u00e4inen . 2002 . Cumulated gain-based evaluation of IR techniques . ACM Trans. Inf. Syst. 20 , 4 (2002), 422 -- 446 . Kalervo J\u00e4rvelin and Jaana Kek\u00e4l\u00e4inen. 2002. Cumulated gain-based evaluation of IR techniques. ACM Trans. Inf. Syst. 20, 4 (2002), 422--446.","journal-title":"ACM Trans. Inf. Syst."},{"key":"e_1_2_1_36_1","doi-asserted-by":"crossref","first-page":"46","DOI":"10.1109\/TKDE.2017.2685998","article-title":"Interactive data exploration with smart drill-down","volume":"31","author":"Joglekar Manas","year":"2017","unstructured":"Manas Joglekar , Hector Garcia-Molina , and Aditya Parameswaran . 2017 . Interactive data exploration with smart drill-down . IEEE Transactions on Knowledge and Data Engineering 31 , 1 (2017), 46 -- 60 . Manas Joglekar, Hector Garcia-Molina, and Aditya Parameswaran. 2017. Interactive data exploration with smart drill-down. IEEE Transactions on Knowledge and Data Engineering 31, 1 (2017), 46--60.","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"key":"e_1_2_1_37_1","unstructured":"Maurice George Kendall. 1948. Rank correlation methods. (1948).  Maurice George Kendall. 1948. Rank correlation methods. (1948)."},{"key":"e_1_2_1_38_1","unstructured":"Mary Beth Kery Marissa Radensky Mahima Arya Bonnie E John and Brad A Myers. 2018. The story in the notebook: Exploratory data science using a literate programming tool. In CHI.  Mary Beth Kery Marissa Radensky Mahima Arya Bonnie E John and Brad A Myers. 2018. The story in the notebook: Exploratory data science using a literate programming tool. In CHI."},{"key":"e_1_2_1_39_1","doi-asserted-by":"crossref","first-page":"22","DOI":"10.14778\/1880172.1880175","article-title":"SnipSuggest: Context-Aware Autocompletion for SQL","volume":"4","author":"Khoussainova Nodira","year":"2010","unstructured":"Nodira Khoussainova , YongChul Kwon , Magdalena Balazinska , and Dan Suciu . 2010 . SnipSuggest: Context-Aware Autocompletion for SQL . Proc. VLDB Endow. 4 , 1 (2010), 22 -- 33 . Nodira Khoussainova, YongChul Kwon, Magdalena Balazinska, and Dan Suciu. 2010. SnipSuggest: Context-Aware Autocompletion for SQL. Proc. VLDB Endow. 4, 1 (2010), 22--33.","journal-title":"Proc. VLDB Endow."},{"key":"e_1_2_1_40_1","volume-title":"Proceedings of the 28th ACM International Conference on Information and Knowledge Management. 2877--2880","author":"Guilly Marie Le","year":"2019","unstructured":"Marie Le Guilly , Jean-Marc Petit , Vasile-Marian Scuturici , and Ihab F Ilyas . 2019 . ExplIQuE: Interactive Databases Exploration with SQL . In Proceedings of the 28th ACM International Conference on Information and Knowledge Management. 2877--2880 . Marie Le Guilly, Jean-Marc Petit, Vasile-Marian Scuturici, and Ihab F Ilyas. 2019. ExplIQuE: Interactive Databases Exploration with SQL. In Proceedings of the 28th ACM International Conference on Information and Knowledge Management. 2877--2880."},{"key":"e_1_2_1_41_1","volume-title":"Lux: Always-on Visualization Recommendations for Exploratory Data Science. arXiv preprint arXiv:2105.00121","author":"Jung-Lin Lee Doris","year":"2021","unstructured":"Doris Jung-Lin Lee , Dixin Tang , Kunal Agarwal , Thyne Boonmark , Caitlyn Chen , Jake Kang , Ujjaini Mukhopadhyay , Jerry Song , Micah Yong , Marti A Hearst , 2021 . Lux: Always-on Visualization Recommendations for Exploratory Data Science. arXiv preprint arXiv:2105.00121 (2021). Doris Jung-Lin Lee, Dixin Tang, Kunal Agarwal, Thyne Boonmark, Caitlyn Chen, Jake Kang, Ujjaini Mukhopadhyay, Jerry Song, Micah Yong, Marti A Hearst, et al. 2021. Lux: Always-on Visualization Recommendations for Exploratory Data Science. arXiv preprint arXiv:2105.00121 (2021)."},{"key":"e_1_2_1_42_1","volume-title":"Proceedings of the 2021 International Conference on Management of Data. 1051--1063","author":"Li Chenjie","year":"2021","unstructured":"Chenjie Li , Zhengjie Miao , Qitian Zeng , Boris Glavic , and Sudeepa Roy . 2021 . Putting Things into Context: Rich Explanations for Query Answers using Join Graphs . In Proceedings of the 2021 International Conference on Management of Data. 1051--1063 . Chenjie Li, Zhengjie Miao, Qitian Zeng, Boris Glavic, and Sudeepa Roy. 2021. Putting Things into Context: Rich Explanations for Query Answers using Join Graphs. In Proceedings of the 2021 International Conference on Management of Data. 1051--1063."},{"key":"e_1_2_1_43_1","doi-asserted-by":"crossref","first-page":"817","DOI":"10.1109\/69.824588","article-title":"Finding interesting patterns using user expectations","volume":"11","author":"Liu Bing","year":"1999","unstructured":"Bing Liu , Wynne Hsu , Lai-Fun Mun , and Hing-Yan Lee . 1999 . Finding interesting patterns using user expectations . IEEE Transactions on Knowledge and Data Engineering 11 , 6 (1999), 817 -- 832 . Bing Liu, Wynne Hsu, Lai-Fun Mun, and Hing-Yan Lee. 1999. Finding interesting patterns using user expectations. IEEE Transactions on Knowledge and Data Engineering 11, 6 (1999), 817--832.","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"key":"e_1_2_1_44_1","volume-title":"2015 IEEE 31st International Conference on Data Engineering. IEEE, 1476--1479","author":"Liu Xiufeng","year":"2015","unstructured":"Xiufeng Liu , Lukasz Golab , and Ihab F Ilyas . 2015 . SMAS: A smart meter data analytics system . In 2015 IEEE 31st International Conference on Data Engineering. IEEE, 1476--1479 . Xiufeng Liu, Lukasz Golab, and Ihab F Ilyas. 2015. SMAS: A smart meter data analytics system. In 2015 IEEE 31st International Conference on Data Engineering. IEEE, 1476--1479."},{"key":"e_1_2_1_45_1","doi-asserted-by":"crossref","unstructured":"Yuyu Luo Xuedi Qin Nan Tang and Guoliang Li. 2018. DeepEye: Towards Automatic Data Visualization. ICDE.  Yuyu Luo Xuedi Qin Nan Tang and Guoliang Li. 2018. DeepEye: Towards Automatic Data Visualization. ICDE.","DOI":"10.1145\/3183713.3193545"},{"key":"e_1_2_1_46_1","volume-title":"A survey of interestingness measures for knowledge discovery. The knowledge engineering review 20, 1","author":"McGarry Ken","year":"2005","unstructured":"Ken McGarry . 2005. A survey of interestingness measures for knowledge discovery. The knowledge engineering review 20, 1 ( 2005 ), 39--61. Ken McGarry. 2005. A survey of interestingness measures for knowledge discovery. The knowledge engineering review 20, 1 (2005), 39--61."},{"key":"e_1_2_1_47_1","first-page":"59","article-title":"Causality in databases","volume":"33","author":"Meliou Alexandra","year":"2010","unstructured":"Alexandra Meliou , Wolfgang Gatterbauer , Joseph Y Halpern , Christoph Koch , Katherine F Moore , and Dan Suciu . 2010 . Causality in databases . IEEE Data Engineering Bulletin 33 (2010), 59 -- 67 . Alexandra Meliou, Wolfgang Gatterbauer, Joseph Y Halpern, Christoph Koch, Katherine F Moore, and Dan Suciu. 2010. Causality in databases. IEEE Data Engineering Bulletin 33 (2010), 59--67.","journal-title":"IEEE Data Engineering Bulletin"},{"key":"e_1_2_1_48_1","volume-title":"Proceedings of the 2019 International Conference on Management of Data. 485--502","author":"Miao Zhengjie","year":"2019","unstructured":"Zhengjie Miao , Qitian Zeng , Boris Glavic , and Sudeepa Roy . 2019 . Going beyond provenance: Explaining query answers with pattern-based counterbalances . In Proceedings of the 2019 International Conference on Management of Data. 485--502 . Zhengjie Miao, Qitian Zeng, Boris Glavic, and Sudeepa Roy. 2019. Going beyond provenance: Explaining query answers with pattern-based counterbalances. In Proceedings of the 2019 International Conference on Management of Data. 485--502."},{"key":"e_1_2_1_49_1","unstructured":"Tova Milo Chai Ozeri and Amit Somech. 2019. Predicting \"What is Interesting\" by Mining Interactive-Data-Analysis Session Logs. In EDBT. 456--467.  Tova Milo Chai Ozeri and Amit Somech. 2019. Predicting \"What is Interesting\" by Mining Interactive-Data-Analysis Session Logs. In EDBT. 456--467."},{"key":"e_1_2_1_50_1","doi-asserted-by":"crossref","unstructured":"Tova Milo and Amit Somech. 2018. Next-Step Suggestions for Modern Interactive Data Analysis Platforms. In KDD. ACM 576--585.  Tova Milo and Amit Somech. 2018. Next-Step Suggestions for Modern Interactive Data Analysis Platforms. In KDD. ACM 576--585.","DOI":"10.1145\/3219819.3219848"},{"key":"e_1_2_1_51_1","doi-asserted-by":"crossref","first-page":"395","DOI":"10.1007\/s00778-015-0382-5","article-title":"Conditional heavy hitters: detecting interesting correlations in data streams","volume":"24","author":"Mirylenka Katsiaryna","year":"2015","unstructured":"Katsiaryna Mirylenka , Graham Cormode , Themis Palpanas , and Divesh Srivastava . 2015 . Conditional heavy hitters: detecting interesting correlations in data streams . The VLDB Journal 24 , 3 (2015), 395 -- 414 . Katsiaryna Mirylenka, Graham Cormode, Themis Palpanas, and Divesh Srivastava. 2015. Conditional heavy hitters: detecting interesting correlations in data streams. The VLDB Journal 24, 3 (2015), 395--414.","journal-title":"The VLDB Journal"},{"key":"e_1_2_1_52_1","volume-title":"2013 IEEE 29th International Conference on Data Engineering (ICDE). IEEE, 1069--1080","author":"Mirylenka Katsiaryna","year":"2013","unstructured":"Katsiaryna Mirylenka , Themis Palpanas , Graham Cormode , and Divesh Srivastava . 2013 . Finding interesting correlations with conditional heavy hitters . In 2013 IEEE 29th International Conference on Data Engineering (ICDE). IEEE, 1069--1080 . Katsiaryna Mirylenka, Themis Palpanas, Graham Cormode, and Divesh Srivastava. 2013. Finding interesting correlations with conditional heavy hitters. In 2013 IEEE 29th International Conference on Data Engineering (ICDE). IEEE, 1069--1080."},{"key":"e_1_2_1_53_1","unstructured":"The pandas development team. 2020. pandas-dev\/pandas: Pandas. 10.5281\/zenodo.3509134  The pandas development team. 2020. pandas-dev\/pandas: Pandas. 10.5281\/zenodo.3509134"},{"key":"e_1_2_1_54_1","doi-asserted-by":"crossref","unstructured":"Judea Pearl et al. 2009. Causal inference in statistics: An overview. Statistics surveys 3 (2009) 96--146.  Judea Pearl et al. 2009. Causal inference in statistics: An overview. Statistics surveys 3 (2009) 96--146.","DOI":"10.1214\/09-SS057"},{"key":"e_1_2_1_55_1","unstructured":"Products and Sales dataset. 2018. https:\/\/data.world\/classrooms\/guide-to-data-analysis-with-sql-and-datadotworld.  Products and Sales dataset. 2018. https:\/\/data.world\/classrooms\/guide-to-data-analysis-with-sql-and-datadotworld."},{"key":"e_1_2_1_56_1","doi-asserted-by":"crossref","first-page":"1322","DOI":"10.14778\/2809974.2809992","article-title":"Learning user preferences by adaptive pairwise comparison","volume":"8","author":"Qian Li","year":"2015","unstructured":"Li Qian , Jinyang Gao , and HV Jagadish . 2015 . Learning user preferences by adaptive pairwise comparison . Proceedings of the VLDB Endowment 8 , 11 (2015), 1322 -- 1333 . Li Qian, Jinyang Gao, and HV Jagadish. 2015. Learning user preferences by adaptive pairwise comparison. Proceedings of the VLDB Endowment 8, 11 (2015), 1322--1333.","journal-title":"Proceedings of the VLDB Endowment"},{"key":"e_1_2_1_57_1","doi-asserted-by":"crossref","first-page":"93","DOI":"10.1007\/s00778-019-00588-3","article-title":"Making data visualization more efficient and effective: a survey","volume":"29","author":"Qin Xuedi","year":"2020","unstructured":"Xuedi Qin , Yuyu Luo , Nan Tang , and Guoliang Li . 2020 . Making data visualization more efficient and effective: a survey . The VLDB Journal 29 , 1 (2020), 93 -- 117 . Xuedi Qin, Yuyu Luo, Nan Tang, and Guoliang Li. 2020. Making data visualization more efficient and effective: a survey. The VLDB Journal 29, 1 (2020), 93--117.","journal-title":"The VLDB Journal"},{"key":"e_1_2_1_58_1","unstructured":"Rath repository. 2018. https:\/\/github.com\/snknitin\/-SeeDB.  Rath repository. 2018. https:\/\/github.com\/snknitin\/-SeeDB."},{"key":"e_1_2_1_59_1","unstructured":"Rath repository. 2022. https:\/\/github.com\/Kanaries\/Rath.  Rath repository. 2022. https:\/\/github.com\/Kanaries\/Rath."},{"key":"e_1_2_1_60_1","volume-title":"Introduction to probability and statistics for engineers and scientists","author":"Ross Sheldon M","unstructured":"Sheldon M Ross . 2004. Introduction to probability and statistics for engineers and scientists . Elsevier . Sheldon M Ross. 2004. Introduction to probability and statistics for engineers and scientists. Elsevier."},{"key":"e_1_2_1_61_1","unstructured":"Sudeepa Roy and Dan Suciu. 2014. A formal approach to finding explanations for database queries. In SIGMOD Curtis E. Dyreson Feifei Li and M. Tamer \u00d6zsu (Eds.). 1579--1590.  Sudeepa Roy and Dan Suciu. 2014. A formal approach to finding explanations for database queries. In SIGMOD Curtis E. Dyreson Feifei Li and M. Tamer \u00d6zsu (Eds.). 1579--1590."},{"key":"e_1_2_1_62_1","doi-asserted-by":"crossref","first-page":"224","DOI":"10.1007\/s007780100046","article-title":"User-cognizant multidimensional analysis","volume":"10","author":"Sarawagi Sunita","year":"2001","unstructured":"Sunita Sarawagi . 2001 . User-cognizant multidimensional analysis . The VLDB Journal 10 , 2 (2001), 224 -- 239 . Sunita Sarawagi. 2001. User-cognizant multidimensional analysis. The VLDB Journal 10, 2 (2001), 224--239.","journal-title":"The VLDB Journal"},{"key":"e_1_2_1_63_1","doi-asserted-by":"crossref","unstructured":"Sunita Sarawagi Rakesh Agrawal and Nimrod Megiddo. 1998. Discovery-driven exploration of OLAP data cubes. In EDBT.  Sunita Sarawagi Rakesh Agrawal and Nimrod Megiddo. 1998. Discovery-driven exploration of OLAP data cubes. In EDBT.","DOI":"10.1007\/BFb0100984"},{"key":"e_1_2_1_64_1","volume-title":"Introduction to information retrieval","author":"Sch\u00fctze Hinrich","unstructured":"Hinrich Sch\u00fctze , Christopher D Manning , and Prabhakar Raghavan . 2008. Introduction to information retrieval . Vol. 39 . Cambridge University Press Cambridge . Hinrich Sch\u00fctze, Christopher D Manning, and Prabhakar Raghavan. 2008. Introduction to information retrieval. Vol. 39. Cambridge University Press Cambridge."},{"key":"e_1_2_1_65_1","doi-asserted-by":"crossref","first-page":"1469","DOI":"10.14778\/3397230.3397242","article-title":"Guided exploration of user groups","volume":"13","author":"Seleznova Mariia","year":"2020","unstructured":"Mariia Seleznova , Behrooz Omidvar-Tehrani , Sihem Amer-Yahia , and Eric Simon . 2020 . Guided exploration of user groups . Proceedings of the VLDB Endowment (PVLDB) 13 , 9 (2020), 1469 -- 1482 . Mariia Seleznova, Behrooz Omidvar-Tehrani, Sihem Amer-Yahia, and Eric Simon. 2020. Guided exploration of user groups. Proceedings of the VLDB Endowment (PVLDB) 13, 9 (2020), 1469--1482.","journal-title":"Proceedings of the VLDB Endowment (PVLDB)"},{"key":"e_1_2_1_66_1","doi-asserted-by":"crossref","first-page":"1118","DOI":"10.1109\/TKDE.2016.2515590","article-title":"Cluster-driven navigation of the query space","volume":"28","author":"Sellam Thibault","year":"2016","unstructured":"Thibault Sellam and Martin Kersten . 2016 . Cluster-driven navigation of the query space . IEEE Transactions on Knowledge and Data Engineering 28 , 5 (2016), 1118 -- 1131 . Thibault Sellam and Martin Kersten. 2016. Cluster-driven navigation of the query space. IEEE Transactions on Knowledge and Data Engineering 28, 5 (2016), 1118--1131.","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"key":"e_1_2_1_67_1","volume-title":"Proceedings of the 2021 International Conference on Management of Data. 1571--1583","author":"Shafieinejad Masoumeh","year":"2021","unstructured":"Masoumeh Shafieinejad , Florian Kerschbaum , and Ihab F Ilyas . 2021 . PCOR: Private Contextual Outlier Release via Differentially Private Search . In Proceedings of the 2021 International Conference on Management of Data. 1571--1583 . Masoumeh Shafieinejad, Florian Kerschbaum, and Ihab F Ilyas. 2021. PCOR: Private Contextual Outlier Release via Differentially Private Search. In Proceedings of the 2021 International Conference on Management of Data. 1571--1583."},{"key":"e_1_2_1_68_1","first-page":"453","article-title":"Calliope: Automatic visual data story generation from a spreadsheet","volume":"27","author":"Shi Danqing","year":"2020","unstructured":"Danqing Shi , Xinyue Xu , Fuling Sun , Yang Shi , and Nan Cao . 2020 . Calliope: Automatic visual data story generation from a spreadsheet . IEEE Transactions on Visualization and Computer Graphics 27 , 2 (2020), 453 -- 463 . Danqing Shi, Xinyue Xu, Fuling Sun, Yang Shi, and Nan Cao. 2020. Calliope: Automatic visual data story generation from a spreadsheet. IEEE Transactions on Visualization and Computer Graphics 27, 2 (2020), 453--463.","journal-title":"IEEE Transactions on Visualization and Computer Graphics"},{"key":"e_1_2_1_69_1","volume-title":"DBExplorer: Exploratory Search in Databases. EDBT","author":"Singh Manish","year":"2016","unstructured":"Manish Singh , Michael J Cafarella , and HV Jagadish . 2016. DBExplorer: Exploratory Search in Databases. EDBT ( 2016 ). Manish Singh, Michael J Cafarella, and HV Jagadish. 2016. DBExplorer: Exploratory Search in Databases. EDBT (2016)."},{"key":"e_1_2_1_70_1","volume-title":"Augmenting visualizations with interactive data facts to facilitate interpretation and communication","author":"Srinivasan Arjun","year":"2018","unstructured":"Arjun Srinivasan , Steven M Drucker , Alex Endert , and John Stasko . 2018. Augmenting visualizations with interactive data facts to facilitate interpretation and communication . IEEE transactions on visualization and computer graphics 25, 1 ( 2018 ), 672--681. Arjun Srinivasan, Steven M Drucker, Alex Endert, and John Stasko. 2018. Augmenting visualizations with interactive data facts to facilitate interpretation and communication. IEEE transactions on visualization and computer graphics 25, 1 (2018), 672--681."},{"key":"e_1_2_1_71_1","unstructured":"fedex Repository. 2022. https:\/\/github.com\/TAU-DB\/FEDEx.  fedex Repository. 2022. https:\/\/github.com\/TAU-DB\/FEDEx."},{"key":"e_1_2_1_72_1","volume-title":"Proceedings of the 2017 ACM International Conference on Management of Data. 1509--1524","author":"Tang Bo","year":"2017","unstructured":"Bo Tang , Shi Han , Man Lung Yiu , Rui Ding , and Dongmei Zhang . 2017 . Extracting top-k insights from multi-dimensional data . In Proceedings of the 2017 ACM International Conference on Management of Data. 1509--1524 . Bo Tang, Shi Han, Man Lung Yiu, Rui Ding, and Dongmei Zhang. 2017. Extracting top-k insights from multi-dimensional data. In Proceedings of the 2017 ACM International Conference on Management of Data. 1509--1524."},{"key":"e_1_2_1_73_1","volume-title":"Proceedings of the 34th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems. 31--43","author":"Cate Balder","year":"2015","unstructured":"Balder ten Cate , Cristina Civili , Evgeny Sherkhonov , and Wang-Chiew Tan . 2015 . High-level why-not explanations using ontologies . In Proceedings of the 34th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems. 31--43 . Balder ten Cate, Cristina Civili, Evgeny Sherkhonov, and Wang-Chiew Tan. 2015. High-level why-not explanations using ontologies. In Proceedings of the 34th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems. 31--43."},{"key":"e_1_2_1_74_1","doi-asserted-by":"crossref","first-page":"1986","DOI":"10.14778\/2367502.2367554","article-title":"Maprat: Meaningful explanation, interactive exploration and geo-visualization of collaborative ratings","volume":"5","author":"Thirumuruganathan Saravanan","year":"2012","unstructured":"Saravanan Thirumuruganathan , Mahashweta Das , Shrikant Desai , Sihem Amer-Yahia , Gautam Das , and Cong Yu . 2012 . Maprat: Meaningful explanation, interactive exploration and geo-visualization of collaborative ratings . Proceedings of the VLDB Endowment (PVLDB) 5 , 12 (2012), 1986 -- 1989 . Saravanan Thirumuruganathan, Mahashweta Das, Shrikant Desai, Sihem Amer-Yahia, Gautam Das, and Cong Yu. 2012. Maprat: Meaningful explanation, interactive exploration and geo-visualization of collaborative ratings. Proceedings of the VLDB Endowment (PVLDB) 5, 12 (2012), 1986--1989.","journal-title":"Proceedings of the VLDB Endowment (PVLDB)"},{"key":"e_1_2_1_75_1","doi-asserted-by":"crossref","first-page":"259","DOI":"10.1007\/s10618-010-0187-5","article-title":"Maximal exceptions with minimal descriptions","volume":"21","author":"van Leeuwen Matthijs","year":"2010","unstructured":"Matthijs van Leeuwen . 2010 . Maximal exceptions with minimal descriptions . Data Mining and Knowledge Discovery 21 , 2 (2010), 259 -- 276 . Matthijs van Leeuwen. 2010. Maximal exceptions with minimal descriptions. Data Mining and Knowledge Discovery 21, 2 (2010), 259--276.","journal-title":"Data Mining and Knowledge Discovery"},{"key":"e_1_2_1_76_1","doi-asserted-by":"crossref","first-page":"2182","DOI":"10.14778\/2831360.2831371","article-title":"SEEDB: Efficient Data-Driven Visualization Recommendations to Support Visual Analytics","volume":"8","author":"Vartak Manasi","year":"2015","unstructured":"Manasi Vartak , Sajjadur Rahman , Samuel Madden , Aditya G. Parameswaran , and Neoklis Polyzotis . 2015 . SEEDB: Efficient Data-Driven Visualization Recommendations to Support Visual Analytics . Proc. VLDB Endow. 8 , 13 (2015), 2182 -- 2193 . Manasi Vartak, Sajjadur Rahman, Samuel Madden, Aditya G. Parameswaran, and Neoklis Polyzotis. 2015. SEEDB: Efficient Data-Driven Visualization Recommendations to Support Visual Analytics. Proc. VLDB Endow. 8, 13 (2015), 2182--2193.","journal-title":"Proc. VLDB Endow."},{"key":"e_1_2_1_77_1","volume-title":"Datashot: Automatic generation of fact sheets from tabular data","author":"Wang Yun","year":"2019","unstructured":"Yun Wang , Zhida Sun , Haidong Zhang , Weiwei Cui , Ke Xu , Xiaojuan Ma , and Dongmei Zhang . 2019 . Datashot: Automatic generation of fact sheets from tabular data . IEEE transactions on visualization and computer graphics 26, 1 (2019), 895--905. Yun Wang, Zhida Sun, Haidong Zhang, Weiwei Cui, Ke Xu, Xiaojuan Ma, and Dongmei Zhang. 2019. Datashot: Automatic generation of fact sheets from tabular data. IEEE transactions on visualization and computer graphics 26, 1 (2019), 895--905."},{"key":"e_1_2_1_78_1","volume-title":"Voyager: Exploratory analysis via faceted browsing of visualization recommendations. TVCG","author":"Wongsuphasawat Kanit","year":"2016","unstructured":"Kanit Wongsuphasawat , Dominik Moritz , Anushka Anand , Jock Mackinlay , Bill Howe , and Jeffrey Heer . 2016 . Voyager: Exploratory analysis via faceted browsing of visualization recommendations. TVCG (2016). Kanit Wongsuphasawat, Dominik Moritz, Anushka Anand, Jock Mackinlay, Bill Howe, and Jeffrey Heer. 2016. Voyager: Exploratory analysis via faceted browsing of visualization recommendations. TVCG (2016)."},{"key":"e_1_2_1_79_1","doi-asserted-by":"crossref","first-page":"553","DOI":"10.14778\/2536354.2536356","article-title":"Scorpion: Explaining Away Outliers in Aggregate Queries","volume":"6","author":"Wu Eugene","year":"2013","unstructured":"Eugene Wu and Samuel Madden . 2013 . Scorpion: Explaining Away Outliers in Aggregate Queries . Proc. VLDB Endow. 6 , 8 (2013), 553 -- 564 . Eugene Wu and Samuel Madden. 2013. Scorpion: Explaining Away Outliers in Aggregate Queries. Proc. VLDB Endow. 6, 8 (2013), 553--564.","journal-title":"Proc. VLDB Endow."},{"key":"e_1_2_1_80_1","doi-asserted-by":"crossref","unstructured":"Cong Yan and Yeye He. 2020. Auto-Suggest: Learning-to-Recommend Data Preparation Steps Using Data Science Notebooks. In SIGMOD. 1539--1554.  Cong Yan and Yeye He. 2020. Auto-Suggest: Learning-to-Recommend Data Preparation Steps Using Data Science Notebooks. In SIGMOD. 1539--1554.","DOI":"10.1145\/3318464.3389738"},{"key":"e_1_2_1_81_1","doi-asserted-by":"crossref","first-page":"547","DOI":"10.1016\/j.future.2019.12.011","article-title":"Visual exploration of rating datasets and user groups","volume":"105","author":"Zegarra Fabian Colque","year":"2020","unstructured":"Fabian Colque Zegarra , Juan C Carbajal Ipenza , Behrooz Omidvar-Tehrani , Viviane P Moreira , Sihem Amer-Yahia , and Jo\u00e3o LD Comba . 2020 . Visual exploration of rating datasets and user groups . Future Generation Computer Systems 105 (2020), 547 -- 561 . Fabian Colque Zegarra, Juan C Carbajal Ipenza, Behrooz Omidvar-Tehrani, Viviane P Moreira, Sihem Amer-Yahia, and Jo\u00e3o LD Comba. 2020. Visual exploration of rating datasets and user groups. Future Generation Computer Systems 105 (2020), 547--561.","journal-title":"Future Generation Computer Systems"}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/3565838.3565841","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,20]],"date-time":"2023-01-20T23:15:29Z","timestamp":1674256529000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/3565838.3565841"}},"subtitle":["An Explainability Framework for Data Exploration Steps"],"short-title":[],"issued":{"date-parts":[[2022,9]]},"references-count":81,"journal-issue":{"issue":"13","published-print":{"date-parts":[[2022,9]]}},"alternative-id":["10.14778\/3565838.3565841"],"URL":"https:\/\/doi.org\/10.14778\/3565838.3565841","relation":{},"ISSN":["2150-8097"],"issn-type":[{"value":"2150-8097","type":"print"}],"subject":[],"published":{"date-parts":[[2022,9]]},"assertion":[{"value":"2023-01-20","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}