{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,18]],"date-time":"2025-11-18T11:48:42Z","timestamp":1763466522293},"reference-count":35,"publisher":"Association for Computing Machinery (ACM)","issue":"12","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2015,8]]},"abstract":"<jats:p>Materialized views (MVs), stored pre-computed results, are widely used to facilitate fast queries on large datasets. When new records arrive at a high rate, it is infeasible to continuously update (maintain) MVs and a common solution is to defer maintenance by batching updates together. Between batches the MVs become increasingly stale with incorrect, missing, and superfluous rows leading to increasingly inaccurate query results. We propose Stale View Cleaning (SVC) which addresses this problem from a data cleaning perspective. In SVC, we efficiently clean a sample of rows from a stale MV, and use the clean sample to estimate aggregate query results. While approximate, the estimated query results reflect the most recent data. As sampling can be sensitive to long-tailed distributions, we further explore an outlier indexing technique to give increased accuracy when the data distributions are skewed. SVC complements existing deferred maintenance approaches by giving accurate and bounded query answers between maintenance. We evaluate our method on a generated dataset from the TPC-D benchmark and a real video distribution application. Experiments confirm our theoretical results: (1) cleaning an MV sample is more efficient than full view maintenance, (2) the estimated results are more accurate than using the stale MV, and (3) SVC is applicable for a wide variety of MVs.<\/jats:p>","DOI":"10.14778\/2824032.2824037","type":"journal-article","created":{"date-parts":[[2015,9,16]],"date-time":"2015-09-16T12:18:17Z","timestamp":1442405897000},"page":"1370-1381","source":"Crossref","is-referenced-by-count":15,"title":["Stale view cleaning"],"prefix":"10.14778","volume":"8","author":[{"given":"Sanjay","family":"Krishnan","sequence":"first","affiliation":[{"name":"UC Berkeley"}]},{"given":"Jiannan","family":"Wang","sequence":"additional","affiliation":[{"name":"UC Berkeley"}]},{"given":"Michael J.","family":"Franklin","sequence":"additional","affiliation":[{"name":"UC Berkeley"}]},{"given":"Ken","family":"Goldberg","sequence":"additional","affiliation":[{"name":"UC Berkeley"}]},{"given":"Tim","family":"Kraska","sequence":"additional","affiliation":[{"name":"Brown University"}]}],"member":"320","published-online":{"date-parts":[[2015,8]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"Conviva. http:\/\/www.conviva.com\/. Conviva. http:\/\/www.conviva.com\/."},{"key":"e_1_2_1_2_1","first-page":"481","volume-title":"SIGMOD Conference","author":"Agarwal S.","year":"2014"},{"key":"e_1_2_1_3_1","first-page":"29","volume-title":"EuroSys","author":"Agarwal S.","year":"2013"},{"key":"e_1_2_1_4_1","first-page":"445","volume-title":"SIGMOD Conference","author":"Chalamalla A.","year":"2014"},{"key":"e_1_2_1_5_1","first-page":"534","volume-title":"ICDE","author":"Chaudhuri S.","year":"2001"},{"key":"e_1_2_1_6_1","unstructured":"S. Chaudhuri and V. Narasayya. TPC-D data generation with skew. ftp.research.microsoft.com\/users\/viveknar\/tpcdskew. S. Chaudhuri and V. Narasayya. TPC-D data generation with skew. ftp.research.microsoft.com\/users\/viveknar\/tpcdskew."},{"issue":"4","key":"e_1_2_1_7_1","doi-asserted-by":"crossref","first-page":"295","DOI":"10.1561\/1900000020","article-title":"Materialized views","volume":"4","author":"Chirkova R.","year":"2012","journal-title":"Foundations and Trends in Databases"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1137\/070710111"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1561\/1900000004"},{"key":"e_1_2_1_10_1","volume-title":"CRC Press","author":"Cox D. R.","year":"1979"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00778-002-0083-8"},{"key":"e_1_2_1_12_1","volume-title":"VLDB","author":"Garofalakis M. N.","year":"2001"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/581751.581753"},{"issue":"2","key":"e_1_2_1_14_1","first-page":"3","article-title":"Maintenance of materialized views: Problems, techniques, and applications","volume":"18","author":"Gupta A.","year":"1995","journal-title":"IEEE Data Eng. Bull."},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.is.2004.11.011"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-00975-4_20"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2007.190664"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.14778\/2336664.2336670"},{"key":"e_1_2_1_19_1","doi-asserted-by":"crossref","unstructured":"S. Krishnan J. Wang M. J. Franklin K. Goldberg and T. Kraska. Stale view cleaning: Getting fresh answers from stale materialized views. http:\/\/www.ocf.berkeley.edu\/~sanjayk\/pubs\/svc-2014.pdf 2014. S. Krishnan J. Wang M. J. Franklin K. Goldberg and T. Kraska. Stale view cleaning: Getting fresh answers from stale materialized views. http:\/\/www.ocf.berkeley.edu\/~sanjayk\/pubs\/svc-2014.pdf 2014.","DOI":"10.14778\/2824032.2824037"},{"key":"e_1_2_1_20_1","doi-asserted-by":"crossref","first-page":"175","DOI":"10.1145\/1247480.1247502","volume-title":"SIGMOD","author":"Larson P.-A.","year":"2007"},{"key":"e_1_2_1_21_1","first-page":"259","volume-title":"VLDB","author":"Yang H. Z.","year":"1985"},{"key":"e_1_2_1_22_1","doi-asserted-by":"crossref","unstructured":"P. L'Ecuyer and R. Simard. Testu01: Ac library for empirical testing of random number generators. ACM Transactions on Mathematical Software (TOMS) 33(4):22 2007. 10.1145\/1268776.1268777 P. L'Ecuyer and R. Simard. Testu01: Ac library for empirical testing of random number generators. ACM Transactions on Mathematical Software (TOMS) 33(4):22 2007. 10.1145\/1268776.1268777","DOI":"10.1145\/1268776.1268777"},{"issue":"3","key":"e_1_2_1_23_1","doi-asserted-by":"crossref","first-page":"421","DOI":"10.1111\/cgf.12129","article-title":"imMens: Real-time visual querying of big data","volume":"32","author":"Liu Z.","year":"2013","journal-title":"Comput. Graph. Forum"},{"key":"e_1_2_1_24_1","first-page":"505","volume-title":"SIGMOD Conference","author":"Meliou A.","year":"2011"},{"issue":"2","key":"e_1_2_1_25_1","doi-asserted-by":"crossref","first-page":"226","DOI":"10.1080\/15427951.2004.10129088","article-title":"A brief history of generative models for power law and lognormal distributions","volume":"1","author":"Mitzenmacher M.","year":"2003","journal-title":"Internet Mathematics"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.14778\/2556549.2556563"},{"key":"e_1_2_1_27_1","volume-title":"University of California","author":"Olken F.","year":"1993"},{"key":"e_1_2_1_28_1","first-page":"160","volume-title":"VLDB","author":"Olken F.","year":"1986"},{"key":"e_1_2_1_29_1","first-page":"632","volume-title":"ICDE","author":"Olken F.","year":"1992"},{"issue":"4","key":"e_1_2_1_30_1","first-page":"3","article-title":"Data cleaning: Problems and current approaches","volume":"23","author":"Rahm E.","year":"2000","journal-title":"IEEE Data Eng. Bull."},{"key":"e_1_2_1_31_1","first-page":"331","volume-title":"SIGMOD Conference","author":"Srinivasan V.","year":"1992"},{"key":"e_1_2_1_32_1","first-page":"469","volume-title":"SIGMOD Conference","author":"Wang J.","year":"2014"},{"key":"e_1_2_1_33_1","volume-title":"Strata","author":"Weil K.","year":"2011"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.14778\/2536354.2536356"},{"key":"e_1_2_1_35_1","first-page":"277","volume-title":"SIGMOD","author":"Zeng K.","year":"2014"}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/2824032.2824037","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,8,14]],"date-time":"2023-08-14T05:49:37Z","timestamp":1691992177000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/2824032.2824037"}},"subtitle":["getting fresh answers from stale materialized views"],"short-title":[],"issued":{"date-parts":[[2015,8]]},"references-count":35,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2015,8]]}},"alternative-id":["10.14778\/2824032.2824037"],"URL":"https:\/\/doi.org\/10.14778\/2824032.2824037","relation":{},"ISSN":["2150-8097"],"issn-type":[{"value":"2150-8097","type":"print"}],"subject":[],"published":{"date-parts":[[2015,8]]}}}