{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,29]],"date-time":"2025-09-29T08:17:36Z","timestamp":1759133856808,"version":"3.41.0"},"reference-count":85,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2018,2,13]],"date-time":"2018-02-13T00:00:00Z","timestamp":1518480000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["1453582 and 1422977"],"award-info":[{"award-number":["1453582 and 1422977"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Knowl. Discov. Data"],"published-print":{"date-parts":[[2018,2,28]]},"abstract":"<jats:p>\n            With the proliferation of large datasets, sampling has become pervasive in data analysis. Sampling has numerous benefits\u2014from reducing the computation time and cost to increasing the scope of interactive analysis. A popular task in data science, well-suited toward sampling, is the computation of fast-but-approximate aggregations over sampled data. Aggregation is a foundational block of data analysis, with data cube being its primary construct. We observe that such aggregation queries are typically issued in an ad-hoc, interactive setting. In contrast to one-off queries, a typical query session consists of a series of quick queries, interspersed with the user inspecting the results and formulating the next query. The similarity between session queries opens up opportunities for reusing computation of not just query results, but also error estimates. Error estimates need to be provided alongside sampled results for the results to be meaningful. 
We propose\n            <jats:italic>Sesame<\/jats:italic>\n            , a rewrite and caching framework that accelerates the entire interactive session of aggregation queries over sampled data. We focus on two unique and computationally expensive aspects of this use case: query speculation in the presence of sampling, and error computation, and provide novel strategies for result and error reuse. We demonstrate that our approach outperforms conventional sampled aggregation techniques by at least an order of magnitude, without modifying the underlying database.\n          <\/jats:p>","DOI":"10.1145\/3070648","type":"journal-article","created":{"date-parts":[[2018,2,13]],"date-time":"2018-02-13T15:40:40Z","timestamp":1518536440000},"page":"1-26","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":10,"title":["A Session-Based Approach to Fast-But-Approximate Interactive Data Cube Exploration"],"prefix":"10.1145","volume":"12","author":[{"given":"Niranjan","family":"Kamat","sequence":"first","affiliation":[{"name":"The Ohio State University, Columbus, OH"}]},{"given":"Arnab","family":"Nandi","sequence":"additional","affiliation":[{"name":"The Ohio State University, Columbus, OH"}]}],"member":"320","published-online":{"date-parts":[[2018,2,13]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/2465351.2465355"},{"volume-title":"Narasayya","year":"2000","author":"Agrawal 
Sanjay","key":"e_1_2_1_2_1"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/379240.379251"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jmp.2010.08.009"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/872757.872822"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDEW.2013.6547461"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/2463676.2463683"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/BigData.2013.6691708"},{"volume-title":"Zdonik","year":"2013","author":"Cetintemel Ugur","key":"e_1_2_1_9_1"},{"volume-title":"LeVeque","year":"1982","author":"Chan Tony F.","key":"e_1_2_1_10_1"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1007\/11733836_27"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/1242524.1242526"},{"volume-title":"Mowry","year":"2007","author":"Chen Shimin","key":"e_1_2_1_13_1"},{"key":"e_1_2_1_14_1","doi-asserted-by":"crossref","unstructured":"Rada Chirkova and Jun Yang. 2011. Materialized Views. 
Foundations and Trends in Databases.","DOI":"10.1561\/1900000020"},{"volume-title":"Sampling Techniques","author":"Cochran William","key":"e_1_2_1_15_1"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/1807167.1807295"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/2588555.2610523"},{"volume-title":"Proceedings of the AAAI\/IAAI.","year":"2002","author":"Elomaa Tapio","key":"e_1_2_1_18_1"},{"volume-title":"Proceedings of the Workshop on Machine Learning for Scientific data Analysis (IBERAMIA\u201904)","author":"Estrada Alfonso","key":"e_1_2_1_19_1"},{"volume-title":"Garofalakis and others","year":"2001","author":"Minos","key":"e_1_2_1_20_1"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00778-013-0307-0"},{"volume-title":"Proceedings of the Bell Labs TR.","year":"1998","author":"Gibbons Phillip B.","key":"e_1_2_1_22_1"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1009726021843"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.5555\/645328.650038"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1007\/s007780100054"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.5555\/2041746.2041755"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/233269.233333"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/2.781635"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/253260.253291"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.14778\/2824032.2824103"},{"volume-title":"John and Pat Langley","year":"1996","author":"George","key":"e_1_2_1_31_1"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/2939502.2939503"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/564691.564695"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2014.6816674"},{"volume-title":"Proceedings of the KDD.","year":"1997","author":"Kamber 
Micheline","key":"e_1_2_1_35_1"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/2254556.2254659"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/1142473.1142579"},{"volume-title":"Ng","year":"1997","author":"Knorr Edwin M.","key":"e_1_2_1_38_1"},{"key":"e_1_2_1_39_1","unstructured":"Donald E. Knuth. 2014. Art of Computer Programming Volume 2: Seminumerical Algorithms."},{"key":"e_1_2_1_40_1","unstructured":"Marcel Kornacker and Justin Erickson. 2012. Cloudera impala: Real-time queries in Apache Hadoop for real. (2012)."},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/304182.304215"},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/MCG.2017.6"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/1376616.1376695"},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2013.179"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2014.2346452"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1111\/cgf.12129"},{"volume-title":"The age of big data. New York Times 11","year":"2012","author":"Lohr Steve","key":"e_1_2_1_47_1"},{"volume-title":"Advanced Data Warehouse Design: From Conventional to Spatial and Temporal Applications. 
Springer.","year":"2008","author":"Malinowski Elzbieta","key":"e_1_2_1_48_1"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1145\/2206869.2206874"},{"volume-title":"Byers","year":"2011","author":"Manyika James","key":"e_1_2_1_50_1"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1145\/3025453.3025456"},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2011.257"},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1145\/375663.375666"},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.14778\/2556549.2556563"},{"volume-title":"Proceedings of the CIDR.","year":"2009","author":"Olston Christopher","key":"e_1_2_1_55_1"},{"volume-title":"Comba","year":"2017","author":"Pahins C\u00edcero A. L.","key":"e_1_2_1_56_1"},{"key":"e_1_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.14778\/3402707.3402748"},{"key":"e_1_2_1_58_1","first-page":"38","article-title":"Interactive data exploration via machine learning models","volume":"39","author":"Papaemmanouil Olga","year":"2016","journal-title":"IEEE Data Eng. 
Bull."},{"volume-title":"Proceedings of the UMass TR.","year":"2017","author":"Peng Liping","key":"e_1_2_1_59_1"},{"volume-title":"Jermaine","year":"2014","author":"Perez Luis L.","key":"e_1_2_1_60_1"},{"volume-title":"Proceedings of the VLDB.","year":"2007","author":"Poess Meikel","key":"e_1_2_1_61_1"},{"key":"e_1_2_1_62_1","doi-asserted-by":"publisher","DOI":"10.1145\/312129.312188"},{"key":"e_1_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1145\/2484838.2484874"},{"volume-title":"Proceedings of the IJKBO.","author":"Philipp","key":"e_1_2_1_64_1"},{"key":"e_1_2_1_65_1","doi-asserted-by":"publisher","DOI":"10.1145\/233269.233361"},{"key":"e_1_2_1_66_1","doi-asserted-by":"publisher","DOI":"10.1145\/342009.335419"},{"volume-title":"Proceedings of the DMDW.","year":"1999","author":"Sapia Carsten","key":"e_1_2_1_67_1"},{"key":"e_1_2_1_68_1","doi-asserted-by":"publisher","DOI":"10.1145\/342009.336564"},{"key":"e_1_2_1_69_1","doi-asserted-by":"publisher","DOI":"10.1145\/2514.2517"},{"volume-title":"Boncz","year":"2011","author":"Sidirourgos Lefteris","key":"e_1_2_1_70_1"},{"key":"e_1_2_1_71_1","doi-asserted-by":"publisher","DOI":"10.1145\/564691.564745"},{"volume-title":"Proceedings of the TODS.","year":"1978","author":"Smith A. 
J.","key":"e_1_2_1_72_1"},{"key":"e_1_2_1_73_1","doi-asserted-by":"publisher","DOI":"10.1109\/2945.981851"},{"key":"e_1_2_1_74_1","doi-asserted-by":"publisher","DOI":"10.14778\/2350229.2350267"},{"key":"e_1_2_1_75_1","doi-asserted-by":"publisher","DOI":"10.5555\/1855087"},{"key":"e_1_2_1_76_1","doi-asserted-by":"publisher","DOI":"10.1145\/2831244.2831247"},{"key":"e_1_2_1_77_1","doi-asserted-by":"publisher","DOI":"10.1145\/2746285.2746312"},{"key":"e_1_2_1_78_1","doi-asserted-by":"publisher","DOI":"10.1080\/00401706.1962.10490022"},{"volume-title":"Proceedings of the CIDR.","year":"2017","author":"Wu Eugene","key":"e_1_2_1_79_1"},{"key":"e_1_2_1_80_1","doi-asserted-by":"publisher","DOI":"10.14778\/1687627.1687678"},{"key":"e_1_2_1_81_1","doi-asserted-by":"publisher","DOI":"10.1145\/1386118.1386122"},{"key":"e_1_2_1_82_1","doi-asserted-by":"publisher","DOI":"10.5555\/582987.785951"},{"key":"e_1_2_1_83_1","doi-asserted-by":"publisher","DOI":"10.1145\/2588555.2594532"},{"volume-title":"Proceedings of the TVCG.","year":"2016","author":"Zgraggen Emanuel","key":"e_1_2_1_84_1"},{"key":"e_1_2_1_85_1","doi-asserted-by":"publisher","DOI":"10.1145\/375663.375722"}],"container-title":["ACM Transactions on Knowledge Discovery from 
Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3070648","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3070648","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3070648","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T03:30:27Z","timestamp":1750217427000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3070648"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,2,13]]},"references-count":85,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2018,2,28]]}},"alternative-id":["10.1145\/3070648"],"URL":"https:\/\/doi.org\/10.1145\/3070648","relation":{},"ISSN":["1556-4681","1556-472X"],"issn-type":[{"type":"print","value":"1556-4681"},{"type":"electronic","value":"1556-472X"}],"subject":[],"published":{"date-parts":[[2018,2,13]]},"assertion":[{"value":"2015-12-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2017-03-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2018-02-13","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}