{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T05:03:45Z","timestamp":1750309425339,"version":"3.41.0"},"reference-count":33,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2024,5,14]],"date-time":"2024-05-14T00:00:00Z","timestamp":1715644800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["SIGMOD Rec."],"published-print":{"date-parts":[[2024,5,14]]},"abstract":"<jats:p>Modern analytical engines rely on Approximate Query Processing (AQP) to provide faster response times than the hardware allows for exact query answering. However, existing AQP methods impose steep performance penalties as workload unpredictability increases. While offline AQP relies on predictable workloads to a priori create samples that match the queries, as soon as workload predictability diminishes, returning to existing online AQP methods that create query-specific samples with little reuse across queries results in significantly smaller gains in response times. As a result, existing approaches cannot fully exploit the benefits of sampling under increased unpredictability.<\/jats:p>\n          <jats:p>We propose LAQy, a framework for building, expanding, and merging samples to adapt to the changes in workload predicates. We propose lazy sampling to overcome the unpredictability issues that cause fast-but-specialized samples to be query-specific and design it for a scale-up analytical engine to show the adaptivity and practicality of our framework in a modern system. LAQy speeds up online sampling processing as a function of data access and computation reuse, making sampler placement after expensive operators more practical.<\/jats:p>","DOI":"10.1145\/3665252.3665261","type":"journal-article","created":{"date-parts":[[2024,5,14]],"date-time":"2024-05-14T22:04:33Z","timestamp":1715724273000},"page":"33-42","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Efficient and Reusable Lazy Sampling"],"prefix":"10.1145","volume":"53","author":[{"given":"Viktor","family":"Sanca","sequence":"first","affiliation":[{"name":"EPFL"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Periklis","family":"Chrysogelos","sequence":"additional","affiliation":[{"name":"Oracle"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Anastasia","family":"Ailamaki","sequence":"additional","affiliation":[{"name":"EPFL"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2024,5,14]]},"reference":[{"key":"e_1_2_1_1_1","first-page":"23","volume-title":"Proceedings of the 31st ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, PODS 2012","author":"Agarwal P. K.","year":"2012","unstructured":"P. K. Agarwal, G. Cormode, Z. Huang, J. M. Phillips, Z. Wei, and K. Yi. Mergeable summaries. In M. Benedikt, M. Kr\u00a8otzsch, and M. Lenzerini, editors, Proceedings of the 31st ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, PODS 2012, Scottsdale, AZ, USA, May 20--24, 2012, pages 23--34. ACM, 2012."},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/2465351.2465355"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/3399666.3399924"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1093\/biomet\/69.3.653"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/1007568.1007602"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/3035918.3056097"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.14778\/3303753.3303760"},{"key":"e_1_2_1_8_1","volume-title":"Synopses for massive data: Samples, histograms, wavelets, sketches. Found. Trends Databases, 4(1--3):1--294","author":"Cormode G.","year":"2012","unstructured":"G. Cormode, M. N. Garofalakis, P. J. Haas, and C. Jermaine. Synopses for massive data: Samples, histograms, wavelets, sketches. Found. Trends Databases, 4(1--3):1--294, 2012."},{"key":"e_1_2_1_9_1","volume-title":"I. \"Zliobaitundefined, A. Bifet, M. Pechenizkiy, and A. Bouchachia. A survey on concept drift adaptation. ACM Comput. Surv., 46(4), mar","author":"J.","year":"2014","unstructured":"J. a. Gama, I. \"Zliobaitundefined, A. Bifet, M. Pechenizkiy, and A. Bouchachia. A survey on concept drift adaptation. ACM Comput. Surv., 46(4), mar 2014."},{"key":"e_1_2_1_10_1","volume-title":"Technical report","author":"Gibbons P. B.","year":"1998","unstructured":"P. B. Gibbons, V. Poosala, S. Acharya, Y. Bartal, Y. Matias, S. Muthukrishnan, S. Ramaswamy, and T. Suel. Aqua: System and techniques for approximate query answering. Technical report, Technical report, Bell Labs, 1998."},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/3186728.3164145"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.14778\/3384345.3384349"},{"key":"e_1_2_1_13_1","unstructured":"S. Igescu V. Sanca E. Zapridou and A. Ailamaki. Improving k-means clustering using speculation. In R. Bordawekar C. Cappiello V. Efthymiou L. Ehrlinger V. Gadepally S. Galhotra S. Geisler S. Groppe L. Gruenwald A. Y. Halevy H. Harmouch O. Hassanzadeh I. F. Ilyas E. Jim\u00b4enez-Ruiz S. Krishnan T. Lahiri G. Li J. Lu W. Mauerer U. F. Minhas F. Naumann M. T. \u00a8Ozsu E. K. Rezig K. Srinivas M. Stonebraker S. R. Valluri M. Vidal H. Wang J. Wang Y. Wu X. Xue M. Za\u00a8?t and K. Zeng editors Joint Proceedings of Workshops at the 49th International Conference on Very Large Data Bases (VLDB 2023) Vancouver Canada August 28 - September 1 2023 volume 3462 of CEUR Workshop Proceedings. CEUR-WS.org 2023."},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.14778\/3352063.3352130"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/2882903.2882940"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.14778\/2994509.2994516"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.14778\/2735479.2735485"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/3035918.3056099"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1007\/s41019-018-0074-4"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/3299869.3324958"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.14778\/3425879.3425882"},{"key":"e_1_2_1_22_1","doi-asserted-by":"crossref","unstructured":"R. B. Miller. Response time in man-computer conversational transactions. In American Federation of Information Processing Societies: Proceedings of the AFIPS '68 Fall Joint Computer Conference December 9--11 1968 San Francisco California USA - Part I volume 33 of AFIPS Conference Proceedings pages 267--277. AFIPS \/ ACM \/ Thomson Book Company Washington D.C. 1968.","DOI":"10.1145\/1476589.1476628"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.14778\/2002938.2002940"},{"volume-title":"35th IEEE","author":"Olma M.","key":"e_1_2_1_24_1","unstructured":"M. Olma, O. Papapetrou, R. Appuswamy, and A. Ailamaki. Taster: Self-tuning, elastic and online approximate query processing. In 35th IEEE"},{"key":"e_1_2_1_25_1","volume-title":"The Star Schema Benchmark and Augmented Fact Table Indexing, page 237--252","author":"O'Neil P.","year":"2009","unstructured":"P. O'Neil, E. O'Neil, X. Chen, and S. Revilak. The Star Schema Benchmark and Augmented Fact Table Indexing, page 237--252. Springer-Verlag, Berlin, Heidelberg, 2009."},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/3183713.3196905"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/3183713.3183747"},{"key":"e_1_2_1_28_1","first-page":"2043","volume-title":"Proceedings of the 2020 International Conference on Management of Data, SIGMOD Conference 2020, online conference [Portland, OR, USA], June 14--19, 2020","author":"Raza A.","year":"2020","unstructured":"A. Raza, P. Chrysogelos, A. G. Anadiotis, and A. Ailamaki. Adaptive HTAP through elastic resource scheduling. In D. Maier, R. Pottinger, A. Doan, W. Tan, A. Alawini, and H. Q. Ngo, editors, Proceedings of the 2020 International Conference on Management of Data, SIGMOD Conference 2020, online conference [Portland, OR, USA], June 14--19, 2020, pages 2043--2054. ACM, 2020."},{"key":"e_1_2_1_29_1","first-page":"1","volume-title":"DaMoN","author":"Sanca V.","year":"2022","unstructured":"V. Sanca and A. Ailamaki. Sampling-based AQP in modern analytical engines. In DaMoN, pages 4:1--4:8. ACM, 2022."},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.5555\/832277.834354"},{"key":"e_1_2_1_31_1","volume-title":"CIDR","author":"Sioulas P.","year":"2021","unstructured":"P. Sioulas, V. Sanca, I. Mytilinis, and A. Ailamaki. Accelerating complex analytics using speculation. In CIDR, 2021."},{"key":"e_1_2_1_32_1","series-title":"Proceedings of Machine Learning Research","first-page":"10054","volume-title":"Proceedings of the 38th International Conference on Machine Learning, ICML","author":"Tahmasbi A.","year":"2021","unstructured":"A. Tahmasbi, E. Jothimurugesan, S. Tirthapura, and P. B. Gibbons. Driftsurf: Stable-state \/ reactive-state learning under concept drift. In M. Meila and T. Zhang, editors, Proceedings of the 38th International Conference on Machine Learning, ICML 2021, 18--24 July 2021, Virtual Event, volume 139 of Proceedings of Machine Learning Research, pages 10054--10064. PMLR, 2021."},{"key":"e_1_2_1_33_1","first-page":"345","volume-title":"Weighted Reservoir Sampling: Randomly Sampling Streams","author":"Wyman C.","year":"2021","unstructured":"C. Wyman. Ray Tracing Gems II: Next Generation Real-Time Rendering with DXR, Vulkan, and OptiX, chapter 22, Weighted Reservoir Sampling: Randomly Sampling Streams, pages 345--349. Apress, Berkeley, CA, 2021."}],"container-title":["ACM SIGMOD Record"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3665252.3665261","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3665252.3665261","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T00:58:33Z","timestamp":1750294713000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3665252.3665261"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,5,14]]},"references-count":33,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2024,5,14]]}},"alternative-id":["10.1145\/3665252.3665261"],"URL":"https:\/\/doi.org\/10.1145\/3665252.3665261","relation":{},"ISSN":["0163-5808"],"issn-type":[{"type":"print","value":"0163-5808"}],"subject":[],"published":{"date-parts":[[2024,5,14]]},"assertion":[{"value":"2024-05-14","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}