{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,6,4]],"date-time":"2024-06-04T11:18:14Z","timestamp":1717499894835},"reference-count":23,"publisher":"Association for Computing Machinery (ACM)","issue":"12","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2022,8]]},"abstract":"<jats:p>Cloud data warehouses (CDWs) bring large-scale data and compute power closer to users in enterprises. However, existing tools for analyzing data in CDWs are either limited in ad-hoc transformations or difficult to use for business users. Here we introduce Sigma Workbook, a new interactive system that enables business users to easily perform visual analysis of data in CDWs at scale. For this, Sigma Workbook provides an accessible spreadsheet-like interface for analysis through direct manipulation. Sigma Workbook dynamically constructs matching SQL queries from user interactions, building on the versatility and expressivity of SQL. Constructed queries are directly executed on CDWs, leveraging the superior characteristics of the new generation CDWs, including scalability. We demonstrate Sigma Workbook through 3 real-life use cases---cohort analysis, sessionization, and data augmentation---and underline Workbook's ease of use, scalability, and expressivity.<\/jats:p>","DOI":"10.14778\/3554821.3554871","type":"journal-article","created":{"date-parts":[[2022,9,29]],"date-time":"2022-09-29T22:28:39Z","timestamp":1664490519000},"page":"3670-3673","source":"Crossref","is-referenced-by-count":2,"title":["Sigma workbook"],"prefix":"10.14778","volume":"15","author":[{"given":"James","family":"Gale","sequence":"first","affiliation":[{"name":"Sigma Computing"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Max","family":"Seiden","sequence":"additional","affiliation":[{"name":"Sigma Computing"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Deepanshu","family":"Utkarsh","sequence":"additional","affiliation":[{"name":"Sigma Computing"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jason","family":"Frantz","sequence":"additional","affiliation":[{"name":"Sigma Computing"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Rob","family":"Woollen","sequence":"additional","affiliation":[{"name":"Sigma Computing"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"\u00c7a\u011fatay","family":"Demiralp","sequence":"additional","affiliation":[{"name":"Sigma Computing"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2022,9,29]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"Amazon. 2013--2022. Redshift. https:\/\/aws.amazon.com\/redshift.  Amazon. 2013--2022. Redshift. https:\/\/aws.amazon.com\/redshift."},{"key":"e_1_2_1_2_1","unstructured":"Bureau of Transportation Statistics. 1987--2020. Airline On-Time Performance Data. https:\/\/www.transtats.bts.gov\/ONTIME.  Bureau of Transportation Statistics. 1987--2020. Airline On-Time Performance Data. https:\/\/www.transtats.bts.gov\/ONTIME."},{"key":"e_1_2_1_3_1","doi-asserted-by":"crossref","unstructured":"Bakke et al. 2011. A spreadsheet-based user interface for managing plural relationships in structured data. In CHI.  Bakke et al. 2011. A spreadsheet-based user interface for managing plural relationships in structured data. In CHI.","DOI":"10.1145\/1978942.1979313"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.14778\/2824032.2824121"},{"key":"e_1_2_1_5_1","doi-asserted-by":"crossref","unstructured":"Bendre et al. 2018. Towards a holistic integration of spreadsheets with databases: A scalable storage engine for presentational data management. In ICDE.  Bendre et al. 2018. Towards a holistic integration of spreadsheets with databases: A scalable storage engine for presentational data management. In ICDE.","DOI":"10.1109\/ICDE.2018.00020"},{"key":"e_1_2_1_6_1","volume-title":"Photon: A Fast Query Engine for Lakehouse Systems. In SIGMOD.","author":"Behm","year":"2022","unstructured":"Behm et al. 2022 . Photon: A Fast Query Engine for Lakehouse Systems. In SIGMOD. Behm et al. 2022. Photon: A Fast Query Engine for Lakehouse Systems. In SIGMOD."},{"key":"e_1_2_1_7_1","doi-asserted-by":"crossref","unstructured":"Dageville et al. 2016. The Snowflake Elastic Data Warehouse. In SIGMOD.  Dageville et al. 2016. The Snowflake Elastic Data Warehouse. In SIGMOD.","DOI":"10.1145\/2882903.2903741"},{"key":"e_1_2_1_8_1","doi-asserted-by":"crossref","unstructured":"Gupta et al. 2015. Amazon redshift and the case for simpler data warehouses. In SIGMOD.  Gupta et al. 2015. Amazon redshift and the case for simpler data warehouses. In SIGMOD.","DOI":"10.1145\/2723372.2742795"},{"key":"e_1_2_1_9_1","volume-title":"Sigma Worksheet: Interactive Construction of OLAP Queries. arXiv:2012.00697","author":"Gale","year":"2021","unstructured":"Gale et al. 2021 . Sigma Worksheet: Interactive Construction of OLAP Queries. arXiv:2012.00697 Gale et al. 2021. Sigma Worksheet: Interactive Construction of OLAP Queries. arXiv:2012.00697"},{"key":"e_1_2_1_10_1","doi-asserted-by":"crossref","unstructured":"Haas et al. 2017. Bringing the Web up to Speed with WebAssembly. In SIGPLAN.  Haas et al. 2017. Bringing the Web up to Speed with WebAssembly. In SIGPLAN.","DOI":"10.1145\/3062341.3062363"},{"key":"e_1_2_1_11_1","doi-asserted-by":"crossref","unstructured":"Jiang et al. 2016. Cohort Query Processing. In VLDB.  Jiang et al. 2016. Cohort Query Processing. In VLDB.","DOI":"10.14778\/3015270.3015271"},{"key":"e_1_2_1_12_1","doi-asserted-by":"crossref","unstructured":"Melnik et al. 2010. Dremel: interactive analysis of web-scale datasets. In VLDB.  Melnik et al. 2010. Dremel: interactive analysis of web-scale datasets. In VLDB.","DOI":"10.14778\/1920841.1920886"},{"key":"e_1_2_1_13_1","unstructured":"Raman et al. 1999. Scalable Spreadsheets for Interactive Data Analysis. In SIGMOD.  Raman et al. 1999. Scalable Spreadsheets for Interactive Data Analysis. In SIGMOD."},{"key":"e_1_2_1_14_1","doi-asserted-by":"crossref","unstructured":"Rahman et al. 2020. Benchmarking Spreadsheet Systems. In SIGMOD.  Rahman et al. 2020. Benchmarking Spreadsheet Systems. In SIGMOD.","DOI":"10.1145\/3318464.3389782"},{"key":"e_1_2_1_15_1","volume-title":"Reactive Vega: A Streaming Dataflow Architecture for Declarative Interactive Visualization","author":"Satyanarayan","year":"2016","unstructured":"Satyanarayan et al. 2016 . Reactive Vega: A Streaming Dataflow Architecture for Declarative Interactive Visualization . In IEEE TVCG (Proc. InfoVis) . Satyanarayan et al. 2016. Reactive Vega: A Streaming Dataflow Architecture for Declarative Interactive Visualization. In IEEE TVCG (Proc. InfoVis)."},{"key":"e_1_2_1_16_1","doi-asserted-by":"crossref","unstructured":"Witkowski et al. 2003. Spreadsheets in RDBMS for OLAP. In SIGMOD.  Witkowski et al. 2003. Spreadsheets in RDBMS for OLAP. In SIGMOD.","DOI":"10.1145\/872757.872767"},{"key":"e_1_2_1_17_1","unstructured":"Google. 2011--2022. BigQuery. https:\/\/cloud.google.com\/bigquery.  Google. 2011--2022. BigQuery. https:\/\/cloud.google.com\/bigquery."},{"key":"e_1_2_1_18_1","doi-asserted-by":"crossref","unstructured":"Bin Liu and HV Jagadish. 2009. A spreadsheet algebra for a direct data manipulation query interface. In ICDE.  Bin Liu and HV Jagadish. 2009. A spreadsheet algebra for a direct data manipulation query interface. In ICDE.","DOI":"10.1109\/ICDE.2009.34"},{"key":"e_1_2_1_19_1","volume-title":"Miller","author":"Nardi Bonnie A.","year":"1990","unstructured":"Bonnie A. Nardi and James R . Miller . 1990 . The Spreadsheet Interface: A Basis for End User Programming. In INTERACT. Bonnie A. Nardi and James R. Miller. 1990. The Spreadsheet Interface: A Basis for End User Programming. In INTERACT."},{"key":"e_1_2_1_20_1","unstructured":"PostgreSQL. 1996--2022. PostgreSQL. https:\/\/www.postgresql.org\/.  PostgreSQL. 1996--2022. PostgreSQL. https:\/\/www.postgresql.org\/."},{"key":"e_1_2_1_21_1","unstructured":"Snowflake. 2015--2022. Snowflake. https:\/\/www.snowflake.com\/workloads\/data-warehouse-modernization\/.  Snowflake. 2015--2022. Snowflake. https:\/\/www.snowflake.com\/workloads\/data-warehouse-modernization\/."},{"key":"e_1_2_1_22_1","doi-asserted-by":"crossref","unstructured":"Jerzy Tyszkiewicz. 2010. Spreadsheet as a relational database engine. In SIGMOD.  Jerzy Tyszkiewicz. 2010. Spreadsheet as a relational database engine. In SIGMOD.","DOI":"10.1145\/1807167.1807191"},{"key":"e_1_2_1_23_1","volume-title":"Power BI: Chort Analysis. https:\/\/finance-bi.com\/power-bi-cohort-analysis.","author":"Zanna Luca","year":"2019","unstructured":"Luca Zanna . 2019 . Power BI: Chort Analysis. https:\/\/finance-bi.com\/power-bi-cohort-analysis. Luca Zanna. 2019. Power BI: Chort Analysis. https:\/\/finance-bi.com\/power-bi-cohort-analysis."}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/3554821.3554871","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,28]],"date-time":"2022-12-28T11:33:54Z","timestamp":1672227234000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/3554821.3554871"}},"subtitle":["a spreadsheet for cloud data warehouses"],"short-title":[],"issued":{"date-parts":[[2022,8]]},"references-count":23,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2022,8]]}},"alternative-id":["10.14778\/3554821.3554871"],"URL":"https:\/\/doi.org\/10.14778\/3554821.3554871","relation":{},"ISSN":["2150-8097"],"issn-type":[{"value":"2150-8097","type":"print"}],"subject":[],"published":{"date-parts":[[2022,8]]}}}