{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:15:02Z","timestamp":1750306502956,"version":"3.41.0"},"reference-count":17,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2015,8,12]],"date-time":"2015-08-12T00:00:00Z","timestamp":1439337600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["SIGMOD Rec."],"published-print":{"date-parts":[[2015,8,12]]},"abstract":"<jats:p>As a consequence of ever more powerful computing hardware and increasingly precise instruments, our capacity to produce scientific data by far outpaces our ability to efficiently store and analyse it. Few of today's tools to analyse scientific data are able to handle the deluge captured by instruments or generated by supercomputers.<\/jats:p>\n          <jats:p>In many scenarios, however, it suffices to analyse a small subset of the data in detail. What scientists analysing the data consequently need are efficient means to explore the full dataset using approximate query results and to identify the subsets of interest. Once found, interesting areas can still be scrutinised using a precise, but also more time-consuming analysis. Data synopses fit the bill as they provide fast (but approximate) query execution on massive amounts of data. Generating data synopses after the data is stored, however, requires us to analyse all the data again, and is thus inefficient.<\/jats:p>\n          <jats:p>What we propose is to generate the synopsis for simulation applications on-the-fly when the data is captured. Doing so typically means changing the simulation or data capturing code and is tedious and typically just a one-off solution that is not generally applicable. 
In contrast, our vision gives scientists a high-level language and the infrastructure needed to generate code that creates data synopses on-the-fly, as the simulation runs. In this paper we discuss the data management challenges associated with our approach.<\/jats:p>","DOI":"10.1145\/2814710.2814715","type":"journal-article","created":{"date-parts":[[2015,8,24]],"date-time":"2015-08-24T14:08:55Z","timestamp":1440425335000},"page":"23-28","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["On-the-Fly Data Synopses"],"prefix":"10.1145","volume":"44","author":[{"given":"Thomas","family":"Heinis","sequence":"first","affiliation":[{"name":"Imperial College, London, UK"}]},{"given":"David A.","family":"Ham","sequence":"additional","affiliation":[{"name":"Imperial College, London, UK"}]}],"member":"320","published-online":{"date-parts":[[2015,8,12]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/2465351.2465355"},{"key":"e_1_2_1_2_1","volume-title":"Would-be Worlds: How Simulation is Changing the Frontiers of Science","author":"Casti J. L.","year":"1996","unstructured":"J. L. Casti . Would-be Worlds: How Simulation is Changing the Frontiers of Science . Springer , 1996 . J. L. Casti. Would-be Worlds: How Simulation is Changing the Frontiers of Science. Springer, 1996."},{"volume-title":"Approximate Query Processing Using Wavelets. In VLDB '06","author":"Chakrabarti K.","key":"e_1_2_1_3_1","unstructured":"K. Chakrabarti , M. N. Garofalakis , R. Rastogi , and K. Shim . Approximate Query Processing Using Wavelets. In VLDB '06 . K. Chakrabarti, M. N. Garofalakis, R. Rastogi, and K. Shim. Approximate Query Processing Using Wavelets. In VLDB '06."},{"volume-title":"Sketching Streams Through the Net: Distributed Approximate Query Tracking. In VLDB '05","author":"Cormode G.","key":"e_1_2_1_4_1","unstructured":"G. Cormode and M. Garofalakis . 
Sketching Streams Through the Net: Distributed Approximate Query Tracking. In VLDB '05 . G. Cormode and M. Garofalakis. Sketching Streams Through the Net: Distributed Approximate Query Tracking. In VLDB '05."},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1561\/1900000004"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.14778\/2732967.2732971"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.5555\/2015039.2015535"},{"volume-title":"VLDB '05","author":"Ioannidis Y.","key":"e_1_2_1_8_1","unstructured":"Y. Ioannidis . The History of Histograms (Abridged) . In VLDB '05 . Y. Ioannidis. The History of Histograms (Abridged). In VLDB '05."},{"key":"e_1_2_1_9_1","volume-title":"VLDB","author":"Kersten M. L.","year":"2011","unstructured":"M. L. Kersten , S. Idreos , S. Manegold , and E. Liarou . The Researcher's Guide to the Data Deluge: Querying a Scientific Database in Just a Few Seconds . VLDB , 2011 . M. L. Kersten, S. Idreos, S. Manegold, and E. Liarou. The Researcher's Guide to the Data Deluge: Querying a Scientific Database in Just a Few Seconds. VLDB, 2011."},{"key":"e_1_2_1_10_1","volume-title":"Proceedings of the IEEE International Conference on Big Data","author":"Lawrence B.","year":"2013","unstructured":"B. Lawrence , V. Bennett , J. Churchill , M. Juckes , P. Kershaw , S. Pascoe , S. Pepler , M. Pritchard , and A. Stephens . Storing and manipulating environmental bigdata with JASMIN . In Proceedings of the IEEE International Conference on Big Data ,, 2013 . B. Lawrence, V. Bennett, J. Churchill, M. Juckes, P. Kershaw, S. Pascoe, S. Pepler, M. Pritchard, and A. Stephens. Storing and manipulating environmental bigdata with JASMIN. In Proceedings of the IEEE International Conference on Big Data,, 2013."},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.5555\/2331176"},{"key":"e_1_2_1_12_1","volume-title":"ICCS","author":"Markall G. R.","year":"2010","unstructured":"G. R. Markall , D. A. Ham , and P. H. J. 
Kelly . Towards Generating Optimised Finite Element Solvers for GPUs from High-level Specifications. Procedia Computer Science, 1(1) . ICCS 2010 . G. R. Markall, D. A. Ham, and P. H. J. Kelly. Towards Generating Optimised Finite Element Solvers for GPUs from High-level Specifications. Procedia Computer Science, 1(1). ICCS 2010."},{"key":"e_1_2_1_13_1","volume-title":"Proceedings of the 2nd European Future Technologies Conference and Exhibition","author":"Markram H.","year":"2011","unstructured":"H. Markram and et al. Introducing the Human Brain Project. volume 7, pages 39\u201342, 2011 . Proceedings of the 2nd European Future Technologies Conference and Exhibition 2011 . H. Markram and et al. Introducing the Human Brain Project. volume 7, pages 39\u201342, 2011. Proceedings of the 2nd European Future Technologies Conference and Exhibition 2011."},{"key":"e_1_2_1_14_1","volume-title":"CIDR","author":"M\u00fchleisen H.","year":"2015","unstructured":"H. M\u00fchleisen , M. L. Kersten , and S. Manegold . Capturing the Laws of (Data) Nature . CIDR , 2015 . H. M\u00fchleisen, M. L. Kersten, and S. Manegold. Capturing the Laws of (Data) Nature. CIDR, 2015."},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/971697.602294"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-07518-1_21"},{"key":"e_1_2_1_17_1","volume-title":"Simulations in the Natural and Social Sciences","author":"Stephan H.","year":"2005","unstructured":"H. Stephan . The World as a Process : Simulations in the Natural and Social Sciences , 2005 . H. Stephan. 
The World as a Process: Simulations in the Natural and Social Sciences, 2005."}],"container-title":["ACM SIGMOD Record"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2814710.2814715","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2814710.2814715","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T05:48:53Z","timestamp":1750225733000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2814710.2814715"}},"subtitle":["Efficient Data Exploration in the Simulation Sciences"],"short-title":[],"issued":{"date-parts":[[2015,8,12]]},"references-count":17,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2015,8,12]]}},"alternative-id":["10.1145\/2814710.2814715"],"URL":"https:\/\/doi.org\/10.1145\/2814710.2814715","relation":{},"ISSN":["0163-5808"],"issn-type":[{"type":"print","value":"0163-5808"}],"subject":[],"published":{"date-parts":[[2015,8,12]]},"assertion":[{"value":"2015-08-12","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}