{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,23]],"date-time":"2026-06-23T06:48:18Z","timestamp":1782197298046,"version":"3.54.5"},"reference-count":33,"publisher":"Association for Computing Machinery (ACM)","issue":"2","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2022,10]]},"abstract":"<jats:p>\n            In data streaming, why-provenance can explain why a given outcome is observed but offers no help in understanding why an expected outcome is missing. Explaining\n            <jats:italic>missing answers<\/jats:italic>\n            has been addressed in DBMSs, but these solutions are not directly applicable to the streaming setting, because of the extra challenges posed by limited storage and by the unbounded nature of data streams.\n          <\/jats:p>\n          <jats:p>\n            With our framework,\n            <jats:italic>Erebus<\/jats:italic>\n            , we tackle the unaddressed challenges behind explaining missing answers in streaming applications.\n            <jats:italic>Erebus<\/jats:italic>\n            allows users to define\n            <jats:italic>expectations<\/jats:italic>\n            about the results of a query, verifying at runtime if such expectations hold, and also providing explanations when expected and observed outcomes diverge (missing answers). To the best of our knowledge,\n            <jats:italic>Erebus<\/jats:italic>\n            is the first such solution in data streaming. Our thorough evaluation on real data shows that\n            <jats:italic>Erebus<\/jats:italic>\n            can explain the (missing) answers with small overheads, both in low- and higher-end devices, even when large portions of the processed data are part of such explanations.\n          <\/jats:p>","DOI":"10.14778\/3565816.3565825","type":"journal-article","created":{"date-parts":[[2022,11,24]],"date-time":"2022-11-24T00:35:16Z","timestamp":1669250116000},"page":"230-242","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":8,"title":["Erebus"],"prefix":"10.14778","volume":"16","author":[{"given":"Dimitris","family":"Palyvos-Giannas","sequence":"first","affiliation":[{"name":"Chalmers University of Technology, Gothenburg, Sweden"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Katerina","family":"Tzompanaki","sequence":"additional","affiliation":[{"name":"CY Cergy-Paris University, Cergy, France"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Marina","family":"Papatriantafilou","sequence":"additional","affiliation":[{"name":"Chalmers University of Technology"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Vincenzo","family":"Gulisano","sequence":"additional","affiliation":[{"name":"Chalmers University of Technology"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2022,11,23]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.14778\/2824032.2824076"},{"key":"e_1_2_1_2_1","volume-title":"Retrieved","year":"2021","unstructured":"Apache. 2021. Beam . Retrieved November 5, 2021 from https:\/\/beam.apache.org\/ Apache. 2021. Beam. Retrieved November 5, 2021 from https:\/\/beam.apache.org\/"},{"key":"e_1_2_1_3_1","unstructured":"Apache. 2021. Heron. Retrieved November 5 2021 from https:\/\/heron.incubator.apache.org\/  Apache. 2021. Heron. Retrieved November 5 2021 from https:\/\/heron.incubator.apache.org\/"},{"key":"e_1_2_1_4_1","unstructured":"Apache. 2021. Storm. Retrieved November 5 2021 from https:\/\/storm.apache.org\/  Apache. 2021. Storm. Retrieved November 5 2021 from https:\/\/storm.apache.org\/"},{"key":"e_1_2_1_5_1","volume-title":"Retrieved","year":"2022","unstructured":"Apache. 2022 . Kafka . Retrieved March 24, 2022 from https:\/\/kafka.apache.org\/ Apache. 2022. Kafka. Retrieved March 24, 2022 from https:\/\/kafka.apache.org\/"},{"key":"e_1_2_1_6_1","volume-title":"Proceedings of the Thirtieth International Conference on Very Large Data Bases -","volume":"30","author":"Arasu Arvind","year":"2004","unstructured":"Arvind Arasu , Mitch Cherniack , Eduardo Galvez , David Maier , Anurag S. Maskey , Esther Ryvkina , Michael Stonebraker , and Richard Tibbetts . 2004 . Linear Road: A Stream Data Management Benchmark . In Proceedings of the Thirtieth International Conference on Very Large Data Bases - Volume 30 (Toronto, Canada) (VLDB '04). VLDB Endowment, Toronto, Canada, 480--491. http:\/\/dl.acm.org\/citation.cfm?id=1316689.1316732 Arvind Arasu, Mitch Cherniack, Eduardo Galvez, David Maier, Anurag S. Maskey, Esther Ryvkina, Michael Stonebraker, and Richard Tibbetts. 2004. Linear Road: A Stream Data Management Benchmark. In Proceedings of the Thirtieth International Conference on Very Large Data Bases - Volume 30 (Toronto, Canada) (VLDB '04). VLDB Endowment, Toronto, Canada, 480--491. http:\/\/dl.acm.org\/citation.cfm?id=1316689.1316732"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/2806416.2806426"},{"key":"e_1_2_1_8_1","volume-title":"Immutably Answering Why-Not Questions for Equivalent Conjunctive Queries. In 6th USENIX Workshop on the Theory and Practice of Provenance (TaPP","author":"Bidoit Nicole","year":"2014","unstructured":"Nicole Bidoit , Melanie Herschel , and Katerina Tzompanaki . 2014 . Immutably Answering Why-Not Questions for Equivalent Conjunctive Queries. In 6th USENIX Workshop on the Theory and Practice of Provenance (TaPP 2014). USENIX Association, Cologne. Nicole Bidoit, Melanie Herschel, and Katerina Tzompanaki. 2014. Immutably Answering Why-Not Questions for Equivalent Conjunctive Queries. In 6th USENIX Workshop on the Theory and Practice of Provenance (TaPP 2014). USENIX Association, Cologne."},{"key":"e_1_2_1_9_1","unstructured":"Nicole Bidoit Melanie Herschel and Katerina Tzompanaki. 2014. Query-Based Why-Not Provenance with NedExplain. In Extending database technology (EDBT).  Nicole Bidoit Melanie Herschel and Katerina Tzompanaki. 2014. Query-Based Why-Not Provenance with NedExplain. In Extending database technology (EDBT)."},{"key":"e_1_2_1_10_1","first-page":"28","article-title":"Apache Flink: Stream and batch processing in a single engine","volume":"36","author":"Carbone Paris","year":"2015","unstructured":"Paris Carbone , Asterios Katsifodimos , Stephan Ewen , Volker Markl , Seif Haridi , and Kostas Tzoumas . 2015 . Apache Flink: Stream and batch processing in a single engine . Bulletin of the IEEE Computer Society Technical Committee on Data Engineering 36 , 4 (2015), 28 -- 38 . Paris Carbone, Asterios Katsifodimos, Stephan Ewen, Volker Markl, Seif Haridi, and Kostas Tzoumas. 2015. Apache Flink: Stream and batch processing in a single engine. Bulletin of the IEEE Computer Society Technical Committee on Data Engineering 36, 4 (2015), 28--38.","journal-title":"Bulletin of the IEEE Computer Society Technical Committee on Data Engineering"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00895"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/1559845.1559901"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1561\/1900000006"},{"key":"e_1_2_1_14_1","unstructured":"Sean Chester and Ira Assent. 2015. Explanations for Skyline Query Results. In EDBT. 349--360.  Sean Chester and Ira Assent. 2015. Explanations for Skyline Query Results. In EDBT. 349--360."},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/357775.357777"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1016\/0004-3702(91)90006-6"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/3448016.3457249"},{"key":"e_1_2_1_18_1","volume-title":"Edgewise: A Better Stream Processing Engine for the Edge. In USENIX Annual Technical Conference (ATC) 19","author":"Fu Xinwei","year":"2019","unstructured":"Xinwei Fu , Talha Ghaffar , James C Davis , and Dongyoon Lee . 2019 . Edgewise: A Better Stream Processing Engine for the Edge. In USENIX Annual Technical Conference (ATC) 19 . USENIX, WA, USA, 929--946. Xinwei Fu, Talha Ghaffar, James C Davis, and Dongyoon Lee. 2019. Edgewise: A Better Stream Processing Engine for the Edge. In USENIX Annual Technical Conference (ATC) 19. USENIX, WA, USA, 929--946."},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.14778\/2752939.2752943"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/2633689"},{"key":"e_1_2_1_21_1","volume-title":"Retrieved","year":"2020","unstructured":"HardKernel. 2020 . Odroid-XU4 . Retrieved November 12, 2020 from http:\/\/www.hardkernel.com HardKernel. 2020. Odroid-XU4. Retrieved November 12, 2020 from http:\/\/www.hardkernel.com"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2012.158"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00778-017-0486-1"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2013.6544890"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/2611286.2611333"},{"key":"e_1_2_1_26_1","volume-title":"Retrieved","year":"2022","unstructured":"MovieLens. 2022 . MovieLens . Retrieved March 24, 2022 from https:\/\/www.kaggle.com\/rounakbanik\/the-movies-dataset MovieLens. 2022. MovieLens. Retrieved March 24, 2022 from https:\/\/www.kaggle.com\/rounakbanik\/the-movies-dataset"},{"key":"e_1_2_1_27_1","unstructured":"Open Source. 2022. Erebus Implementation. Retrieved August 5 2022 from https:\/\/github.com\/dmpalyvos\/erebus  Open Source. 2022. Erebus Implementation. Retrieved August 5 2022 from https:\/\/github.com\/dmpalyvos\/erebus"},{"key":"e_1_2_1_28_1","volume-title":"Retrieved","author":"JDK.","year":"2021","unstructured":"Open JDK. 2021 . Java Microbenchmark Harness . Retrieved February 24, 2022 from https:\/\/github.com\/openjdk\/jmh OpenJDK. 2021. Java Microbenchmark Harness. Retrieved February 24, 2022 from https:\/\/github.com\/openjdk\/jmh"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.parco.2019.102552"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.14778\/3430915.3430928"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/3448016.3452818"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/1107499.1107504"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/2740070.2626335"}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/3565816.3565825","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,28]],"date-time":"2022-12-28T09:33:21Z","timestamp":1672220001000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/3565816.3565825"}},"subtitle":["explaining the outputs of data streaming queries"],"short-title":[],"issued":{"date-parts":[[2022,10]]},"references-count":33,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2022,10]]}},"alternative-id":["10.14778\/3565816.3565825"],"URL":"https:\/\/doi.org\/10.14778\/3565816.3565825","relation":{},"ISSN":["2150-8097"],"issn-type":[{"value":"2150-8097","type":"print"}],"subject":[],"published":{"date-parts":[[2022,10]]},"assertion":[{"value":"2022-11-23","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}