{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,19]],"date-time":"2026-03-19T02:22:26Z","timestamp":1773886946127,"version":"3.50.1"},"reference-count":61,"publisher":"Association for Computing Machinery (ACM)","issue":"11","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2021,7]]},"abstract":"<jats:p>Groupjoins, the combined execution of a join and a subsequent group by, are common in analytical queries, and occur in about 1\/8 of the queries in TPC-H and TPC-DS. While they were originally invented to improve performance, efficient parallel execution of groupjoins can be limited by contention, which limits their usefulness in a many-core system. Having an efficient implementation of groupjoins is highly desirable, as groupjoins are not only used to fuse group by and join but are also introduced by the unnesting component of the query optimizer to avoid nested-loops evaluation of aggregates. Furthermore, the query optimizer needs be able to reason over the result of aggregation in order to schedule it correctly. Traditional selectivity and cardinality estimations quickly reach their limits when faced with computed columns from nested aggregates, which leads to poor cost estimations and thus, suboptimal query plans.<\/jats:p>\n          <jats:p>\n            In this paper, we present techniques to efficiently estimate, plan, and execute groupjoins and nested aggregates. We propose two novel techniques,\n            <jats:italic>aggregate estimates<\/jats:italic>\n            to predict the result distribution of aggregates, and\n            <jats:italic>parallel groupjoin execution<\/jats:italic>\n            for a scalable execution of groupjoins. The resulting system has significantly better estimates and a contention-free evaluation of groupjoins, which can speed up some TPC-H queries up to a factor of 2.\n          <\/jats:p>","DOI":"10.14778\/3476249.3476288","type":"journal-article","created":{"date-parts":[[2021,10,27]],"date-time":"2021-10-27T16:46:23Z","timestamp":1635353183000},"page":"2383-2396","source":"Crossref","is-referenced-by-count":8,"title":["A practical approach to groupjoin and nested aggregates"],"prefix":"10.14778","volume":"14","author":[{"given":"Philipp","family":"Fent","sequence":"first","affiliation":[{"name":"Technische Universit\u00e4t M\u00fcnchen"}]},{"given":"Thomas","family":"Neumann","sequence":"additional","affiliation":[{"name":"Technische Universit\u00e4t M\u00fcnchen"}]}],"member":"320","published-online":{"date-parts":[[2021,10,27]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"A class of distributions which includes the normal ones. Scandinavian journal of statistics","author":"Azzalini Adelchi","year":"1985"},{"key":"e_1_2_1_2_1","doi-asserted-by":"crossref","volume-title":"The skew-normal and related families","author":"Azzalini Adelchi","DOI":"10.1017\/CBO9781139248891"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/3448016.3452831"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.14778\/1687553.1687563"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/3399666.3399924"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/1379022.1375591"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-04936-6_5"},{"key":"e_1_2_1_8_1","unstructured":"Peter A. Boncz Marcin Zukowski and Niels Nes. 2005. MonetDB X100: Hyper-Pipelining Query Execution. In CIDR. www.cidrdb.org 225--237.  Peter A. Boncz Marcin Zukowski and Niels Nes. 2005. MonetDB X100: Hyper-Pipelining Query Execution. In CIDR. www.cidrdb.org 225--237."},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.5555\/645484.656553"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/376284.375694"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.5555\/1325851.1325893"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.5555\/648291.754504"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1561\/1900000004"},{"key":"e_1_2_1_14_1","volume-title":"Getting Swole: Generating Access-Aware Code with Predicate Pullups","author":"Crotty Andrew","year":"2020"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.5555\/645914.671634"},{"key":"e_1_2_1_16_1","volume-title":"Extreme value theory: an introduction","author":"Haan Laurens De"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.14778\/3389133.3389138"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.14778\/3329772.3329780"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00778-017-0476-3"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.14778\/1687553.1687556"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/1247480.1247598"},{"key":"e_1_2_1_22_1","volume-title":"Freitag and Thomas Neumann","author":"Michael","year":"2019"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/376284.375748"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/191843.191886"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.14778\/3384345.3384349"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/3299869.3320211"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/2882903.2915241"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/2882903.2882940"},{"key":"e_1_2_1_29_1","unstructured":"Martin Kersten Panagiotis Koutsourakis Niels Nes and Ying Zhan. 2021. Bridging the Chasm between Science and Reality. In CIDR. www.cidrdb.org.  Martin Kersten Panagiotis Koutsourakis Niels Nes and Ying Zhan. 2021. Bridging the Chasm between Science and Reality. In CIDR. www.cidrdb.org."},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.14778\/1687553.1687564"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/319732.319745"},{"key":"e_1_2_1_32_1","unstructured":"Andreas Kipf Michael Freitag Dimitri Vorona Peter Boncz Thomas Neumann and Alfons Kemper. 2019. Estimating Filtered Group-By Queries is Hard: Deep Learning to the Rescue. In AIDB.  Andreas Kipf Michael Freitag Dimitri Vorona Peter Boncz Thomas Neumann and Alfons Kemper. 2019. Estimating Filtered Group-By Queries is Hard: Deep Learning to the Rescue. In AIDB."},{"key":"e_1_2_1_33_1","volume-title":"Learned Cardinalities: Estimating Correlated Joins with Deep Learning. In CIDR. www.cidrdb.org.","author":"Kipf Andreas","year":"2019"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/3448016.3457288"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/2588555.2610507"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.14778\/2850583.2850594"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00778-017-0480-7"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.14778\/3291264.3291273"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00778-017-0475-4"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1007\/11547273_12"},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.14778\/3151113.3151114"},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.14778\/3402707.3402723"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.14778\/1687627.1687738"},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.5555\/645918.756653"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.5555\/1182635.1164217"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.14778\/2002938.2002940"},{"key":"e_1_2_1_47_1","volume-title":"Freitag","author":"Neumann Thomas","year":"2020"},{"key":"e_1_2_1_48_1","volume-title":"BTW (LNI)","author":"Neumann Thomas"},{"key":"e_1_2_1_49_1","volume-title":"BTW (LNI)","author":"Neumann Thomas"},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1145\/2987443.2987489"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1080\/02664760050120542"},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1145\/2485278.2485284"},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.14778\/2536222.2536233"},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1145\/3299869.3324961"},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1145\/2882903.2882917"},{"key":"e_1_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1145\/235968.233360"},{"key":"e_1_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.1145\/16894.16888"},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1145\/3209950.3209952"},{"key":"e_1_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.5555\/645921.673154"},{"key":"e_1_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.14778\/2733004.2733022"},{"key":"e_1_2_1_61_1","volume-title":"Patel","author":"Zhang Zuyu","year":"2019"}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/3476249.3476288","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,28]],"date-time":"2022-12-28T10:03:50Z","timestamp":1672221830000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/3476249.3476288"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,7]]},"references-count":61,"journal-issue":{"issue":"11","published-print":{"date-parts":[[2021,7]]}},"alternative-id":["10.14778\/3476249.3476288"],"URL":"https:\/\/doi.org\/10.14778\/3476249.3476288","relation":{},"ISSN":["2150-8097"],"issn-type":[{"value":"2150-8097","type":"print"}],"subject":[],"published":{"date-parts":[[2021,7]]}}}