{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,8]],"date-time":"2026-01-08T03:00:04Z","timestamp":1767841204613,"version":"3.49.0"},"reference-count":64,"publisher":"Association for Computing Machinery (ACM)","issue":"11","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2024,7]]},"abstract":"<jats:p>Chiplet-based CPUs, which combine multiple independent dies on a single package, allow hardware to scale to higher CPU core counts at the cost of more memory heterogeneity and performance variability. This introduces challenges when existing query engines are deployed on chiplet-based CPUs, as current designs make assumptions about uniform memory access, cache locality and consistent core performance, e.g., leading to ineffective CPU utilization.<\/jats:p>\n          <jats:p>In this paper, we analyse the performance impact when query engines ignore chiplet-specific properties. We demonstrate that a na\u00efve deployment can result in a significant degradation of query processing efficiency, exhibiting non-linear scaling even within a single CPU socket domain. 
Based on comprehensive experiments, we explore approaches to deploy query engines on chiplet-based CPUs with improved performance: we show that distributing processing tasks according to a chiplet-aware strategy achieves higher resource utilization and scalability, yielding an up to 7\u00d7 speedup compared to hardware-oblivious approaches.<\/jats:p>","DOI":"10.14778\/3681954.3682011","type":"journal-article","created":{"date-parts":[[2024,8,30]],"date-time":"2024-08-30T16:23:36Z","timestamp":1725035016000},"page":"3428-3441","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":3,"title":["OLAP on Modern Chiplet-Based Processors"],"prefix":"10.14778","volume":"17","author":[{"given":"Alessandro","family":"Fogli","sequence":"first","affiliation":[{"name":"Imperial College London"}]},{"given":"Bo","family":"Zhao","sequence":"additional","affiliation":[{"name":"Aalto University"}]},{"given":"Peter","family":"Pietzuch","sequence":"additional","affiliation":[{"name":"Imperial College London"}]},{"given":"Maximilian","family":"Bandle","sequence":"additional","affiliation":[{"name":"TU Munich"}]},{"given":"Jana","family":"Giceva","sequence":"additional","affiliation":[{"name":"TU Munich"}]}],"member":"320","published-online":{"date-parts":[[2024,8,30]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"2023. Apache Hadoop. https:\/\/hadoop.apache.org. Accessed: 2023-6-19."},{"key":"e_1_2_1_2_1","unstructured":"2023. Introduction to Greenplum. https:\/\/docs.greenplum.org\/6-10\/install_guide\/preinstall_concepts.html. Accessed: 2023-6-19."},{"key":"e_1_2_1_3_1","unstructured":"2023. SingleStore Documentation. https:\/\/docs.singlestore.com\/v7.3\/introduction\/documentation-overview\/. Accessed: 2023-6-19."},{"key":"e_1_2_1_4_1","unstructured":"2023. Teradata online documentation. https:\/\/docs.teradata.com\/r\/Teradata-VantageTM-SQL-Fundamentals\/June-2022\/Introduction-to-SQL-Fundamentals. 
Accessed: 2023-6-19."},{"key":"e_1_2_1_5_1","unstructured":"2023. VMware Greenplum 6.24 documentation. https:\/\/docs.greenplum.org\/6-12\/common\/gpdb-features.html. Accessed: 2023-6-19."},{"key":"e_1_2_1_6_1","unstructured":"2024. Amazon Athena. https:\/\/docs.aws.amazon.com\/whitepapers\/latest\/big-data-analytics-options\/amazon-athena.html. Accessed: 2023-6-19."},{"key":"e_1_2_1_7_1","unstructured":"2024. Amazon Redshift provisioned clusters. https:\/\/docs.aws.amazon.com\/redshift\/latest\/mgmt\/working-with-clusters.html. Accessed: 2023-6-19."},{"key":"e_1_2_1_8_1","unstructured":"2024. Analyzing Unconventional Logic Semiconductors - A Shift Away from Semiconductor Manufacturers. https:\/\/hacarus.com\/ai-lab\/03312022-graviton3\/. Accessed: 2024-3-1."},{"key":"e_1_2_1_9_1","unstructured":"2024. Chiplet Market Update. https:\/\/chipletsummit.com\/proceeding_files\/a0q5f000001WuE0\/20230125_PLEN_Hackenberg.PDF. Accessed: 2024-5-12."},{"key":"e_1_2_1_10_1","unstructured":"2024. Configuring NUMA for SingleStore. https:\/\/support.singlestore.com\/hc\/en-us\/articles\/360058633252-Configuring-NUMA-for-SingleStore. Accessed: 2024-3-1."},{"key":"e_1_2_1_11_1","unstructured":"2024. Core to Core Latency Data on Large Systems. https:\/\/chipsandcheese.com\/2023\/11\/07\/core-to-core-latency-data-on-large-systems\/. Accessed: 2024-5-12."},{"key":"e_1_2_1_12_1","unstructured":"2024. The evolution of single-core bandwidth in multicore processors. https:\/\/sites.utexas.edu\/jdm4372\/2023\/04\/25\/the-evolution-of-single-core-bandwidth-in-multicore-processors\/. Accessed: 2024-5-12."},{"key":"e_1_2_1_13_1","unstructured":"2024. The evolution of single-core bandwidth in multicore systems --- update. https:\/\/sites.utexas.edu\/jdm4372\/2023\/12\/19\/the-evolution-of-single-core-bandwidth-in-multicore-systems-update\/. Accessed: 2024-5-12."},{"key":"e_1_2_1_14_1","unstructured":"2024. Intel\u00ae Xeon\u00ae Platinum 8380 Processor. 
https:\/\/ark.intel.com\/content\/www\/us\/en\/ark\/products\/212287\/intel-xeon-platinum-8380-processor-60m-cache-2-30-ghz.html. Accessed: 2024-5-12."},{"key":"e_1_2_1_15_1","unstructured":"2024. Intel\u00ae Xeon\u00ae Platinum 8460H Processor. https:\/\/ark.intel.com\/content\/www\/us\/en\/ark\/products\/231744\/intel-xeon-platinum-8460h-processor-105m-cache-2-20-ghz.html. Accessed: 2024-5-12."},{"key":"e_1_2_1_16_1","unstructured":"2024. Mastering Dynamic Resource Allocation in Apache Spark. https:\/\/www.sparkcodehub.com\/spark-dynamic-allocation. Accessed: 2024-5-12."},{"key":"e_1_2_1_17_1","unstructured":"2024. MySQL Documentation. https:\/\/dev.mysql.com\/doc\/. Accessed: 2024-3-1."},{"key":"e_1_2_1_18_1","unstructured":"2024. NUMA Balancing. https:\/\/www.kernel.org\/doc\/html\/latest\/admin-guide\/sysctl\/kernel.html#numa-balancing. Accessed: 2023-6-19."},{"key":"e_1_2_1_19_1","unstructured":"2024. numactl(8) --- Linux manual page. https:\/\/man7.org\/linux\/man-pages\/man8\/numactl.8.html. Accessed: 2024-5-12."},{"key":"e_1_2_1_20_1","unstructured":"2024. Oracle Exadata Database Machine X8-2. https:\/\/www.oracle.com\/technetwork\/database\/exadata\/exadata-x8-2-ds-5444350.pdf. Accessed: 2023-6-19."},{"key":"e_1_2_1_21_1","unstructured":"2024. Overview of ClickHouse Architecture. https:\/\/clickhouse.com\/docs\/en\/development\/architecture. Accessed: 2024-3-1."},{"key":"e_1_2_1_22_1","unstructured":"2024. Sapphire Rapids: Golden Cove Hits Servers. https:\/\/chipsandcheese-com.translate.goog\/2023\/03\/12\/a-peek-at-sapphire-rapids\/?_x_tr_sl=en&_x_tr_tl=it&_x_tr_hl=it&_x_tr_pto=sc. Accessed: 2024-5-12."},{"key":"e_1_2_1_23_1","unstructured":"2024. TPC Benchmark\u2122 DS. https:\/\/www.tpc.org\/TPC_Documents_Current_Versions\/pdf\/TPC-DS_v3.2.0.pdf. Accessed: 2024-3-1."},{"key":"e_1_2_1_24_1","unstructured":"2024. TPC Benchmark\u2122 H. https:\/\/www.tpc.org\/TPC_Documents_Current_Versions\/pdf\/TPC-H_v3.0.1.pdf. 
Accessed: 2024-3-1."},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.14778\/3415478.3415545"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/NFV-SDN53031.2021.9665051"},{"key":"e_1_2_1_27_1","volume-title":"Spark SQL: Relational Data Processing in Spark. Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data","author":"Armbrust Michael","year":"2015","unstructured":"Michael Armbrust, Reynold Xin, Cheng Lian, Yin Huai, Davies Liu, Joseph K. Bradley, Xiangrui Meng, Tomer Kaftan, Michael J. Franklin, Ali Ghodsi, and Matei A. Zaharia. 2015. Spark SQL: Relational Data Processing in Spark. Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data (2015)."},{"key":"e_1_2_1_28_1","volume-title":"Amazon Redshift Re-invented. SIGMOD\/PODS '22: International Conference on Management of Data","author":"Armenatzoglou Nikos","year":"2022","unstructured":"Nikos Armenatzoglou, Sanuj Basu, Naga Bhanoori, Mengchu Cai, Naresh Chainani, Kiran Chinta, Venkatraman Govindaraju, Todd J Green, Monish Gupta, Sebastian Hillig, Eric Hotinger, Yan Leshinksy, Jintian Liang, Michael McCreedy, Fabian Nagel, Ippokratis Pandis, Panos Parchas, Rahul Pathak, Orestis Polychroniou, Foyzur Rahman, Gaurav Saxena, Gokul Soundararajan, Sriram Subramanian, and Doug Terry. 2022. Amazon Redshift Re-invented. SIGMOD\/PODS '22: International Conference on Management of Data (2022)."},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.14778\/3055540.3055545"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/TNSM.2019.2942475"},{"key":"e_1_2_1_31_1","doi-asserted-by":"crossref","unstructured":"Carsten Binnig Andrew Crotty Alex Galakatos Tim Kraska and Erfan Zamanian. 2015. The end of slow networks: It's time for a redesign. (2015).","DOI":"10.14778\/2904483.2904485"},{"key":"e_1_2_1_32_1","doi-asserted-by":"crossref","unstructured":"Peter Boncz Angelos-Christos Anatiotis and Steffen Kl\u00e4be. 
2018. JCC-H: Adding join crossing correlations with skew to TPC-H. (2018) 103--119.","DOI":"10.1007\/978-3-319-72401-0_8"},{"key":"e_1_2_1_33_1","volume-title":"Black-box Concurrent Data Structures for NUMA Architectures. ASPLOS '17: Architectural Support for Programming Languages and Operating Systems","author":"Calciu Irina","year":"2017","unstructured":"Irina Calciu, Siddhartha Sen, Mahesh Balakrishnan, and Marcos K Aguilera. 2017. Black-box Concurrent Data Structures for NUMA Architectures. ASPLOS '17: Architectural Support for Programming Languages and Operating Systems (2017)."},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.14778\/3007263.3007277"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/3577193.3593702"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.14778\/3389133.3389138"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/3489517.3530428"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/2790755.2790797"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/3357526.3357564"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/2814328"},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/2723372.2742795"},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/3282307"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.14778\/1454159.1454211"},{"key":"e_1_2_1_44_1","unstructured":"T Kiefer B Schlegel and W Lehner. 2013. Experimental Evaluation of NUMA Effects on Database Management Systems. BTW."},{"key":"e_1_2_1_45_1","volume-title":"A numa api for linux","author":"Kleen Andi","year":"2005","unstructured":"Andi Kleen. 2005. A numa api for linux. Novel Inc (2005)."},{"key":"e_1_2_1_46_1","unstructured":"B Lepers V Qu\u00e9ma and A Fedorova. 2015. 
Thread and Memory Placement on NUMA Systems: Asymmetry Matters. USENIX Annual Technical Conference."},{"key":"e_1_2_1_47_1","doi-asserted-by":"crossref","unstructured":"Mian Liao Daniel H. Zhou P. Wang and Minjie Chen. 2023. Power Systems on Chiplet: Inductor-Linked Multi-Output Switched-Capacitor Multi-Rail Power Delivery on Chiplets. 2023 Fourth International Symposium on 3D Power Electronics Integration and Manufacturing (3D-PEIM) (2023) 1--7.","DOI":"10.1109\/3D-PEIM55914.2023.10052630"},{"key":"e_1_2_1_48_1","volume-title":"High Bandwidth Packaging Interconnect. 2016 IEEE 66th Electronic Components and Technology Conference (ECTC)","author":"Mahajan Ravi","year":"2016","unstructured":"Ravi Mahajan, Robert Sankman, N. Patel, Dae woo Kim, Kemal Aygun, Zhiguo Qian, Yidnekachew S. Mekonnen, Islam A. Salama, Sujit Sharan, Deepti Iyengar, and D. Mallik. 2016. Embedded Multi-die Interconnect Bridge (EMIB) - A High Density, High Bandwidth Packaging Interconnect. 2016 IEEE 66th Electronic Components and Technology Conference (ECTC) (2016), 557--565."},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1109\/MM.2021.3086541"},{"key":"e_1_2_1_51_1","unstructured":"John D. McCalpin. 2021. Mapping Core and L3 Slice Numbering to Die Locations in Intel Xeon Scalable Processors. (2021)."},{"key":"e_1_2_1_52_1","volume-title":"Bandwidth Limits in the Intel Xeon Max (Sapphire Rapids with HBM) Processors. ISC Workshops","author":"McCalpin John D.","year":"2023","unstructured":"John D. McCalpin. 2023. Bandwidth Limits in the Intel Xeon Max (Sapphire Rapids with HBM) Processors. 
ISC Workshops (2023)."},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1109\/AERO.2016.7500830"},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA52012.2021.00014"},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISSCC19947.2020.9063103"},{"key":"e_1_2_1_56_1","volume-title":"Sapphire Rapids: The Next-Generation Intel Xeon Scalable Processor. 2022 IEEE International SolidState Circuits Conference (ISSCC) 65","author":"Nassif Nevine","year":"2022","unstructured":"Nevine Nassif, Ashley Munch, Carleton L. Molnar, Gerald Pasdast, Sitaraman V. Lyer, Zibing Yang, Oscar Mendoza, Mark Huddart, Srikrishnan Venkataraman, Sireesha Kandula, Rafi Marom, Alexander M. Kern, William J. Bowhill, David Mulvihill, Srikanth Nimmagadda, Varma Kalidindi, Jonathan Krause, Mohammad MinHazul Haq, Roopali Sharma, and Kevin Duda. 2022. Sapphire Rapids: The Next-Generation Intel Xeon Scalable Processor. 2022 IEEE International SolidState Circuits Conference (ISSCC) 65 (2022), 44--46."},{"key":"e_1_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.1145\/3241039"},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.14778\/2350229.2350260"},{"key":"e_1_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.1145\/1966445.1966448"},{"key":"e_1_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.1109\/Cluster48925.2021.00087"},{"key":"e_1_2_1_61_1","volume-title":"Presto: SQL on Everything. 2019 IEEE 35th International Conference on Data Engineering (ICDE) (2019)","author":"Sethi Raghav","year":"2019","unstructured":"Raghav Sethi, Martin Traverso, Dain Sundstrom, Dave Phillips, Wenlei Xie, Yutian Sun, Nezih Yegitbasi, Haozhun Jin, Eric Hwang, Nileema Shingte, and Christopher Berner. 2019. Presto: SQL on Everything. 
2019 IEEE 35th International Conference on Data Engineering (ICDE) (2019), 1802--1813."},{"key":"e_1_2_1_62_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCPMT.2022.3207195"},{"key":"e_1_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1109\/MM.2020.2974217"},{"key":"e_1_2_1_64_1","volume-title":"Memory Performance of AMD EPYC Rome and Intel Cascade Lake SP Server Processors. Proceedings of the 2022 ACM\/SPEC on International Conference on Performance Engineering","author":"Velten Markus","year":"2022","unstructured":"Markus Velten, Robert Sch\u00f6ne, Thomas Ilsche, and Daniel Hackenberg. 2022. Memory Performance of AMD EPYC Rome and Intel Cascade Lake SP Server Processors. Proceedings of the 2022 ACM\/SPEC on International Conference on Performance Engineering (2022)."},{"key":"e_1_2_1_65_1","volume-title":"Research on A Chiplet-based DSA (Domain-Specific Architectures) Scalable Convolutional Acceleration Architecture. 2022 23rd International Conference on Electronic Packaging Technology (ICEPT)","author":"Zhu Shiliang","year":"2022","unstructured":"Shiliang Zhu, Min Miao, Zhuanzhuan Zhang, and Xiaolong Duan. 2022. Research on A Chiplet-based DSA (Domain-Specific Architectures) Scalable Convolutional Acceleration Architecture. 
2022 23rd International Conference on Electronic Packaging Technology (ICEPT) (2022), 1--6."}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/3681954.3682011","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,9,4]],"date-time":"2024-09-04T18:33:00Z","timestamp":1725474780000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/3681954.3682011"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,7]]},"references-count":64,"journal-issue":{"issue":"11","published-print":{"date-parts":[[2024,7]]}},"alternative-id":["10.14778\/3681954.3682011"],"URL":"https:\/\/doi.org\/10.14778\/3681954.3682011","relation":{},"ISSN":["2150-8097"],"issn-type":[{"value":"2150-8097","type":"print"}],"subject":[],"published":{"date-parts":[[2024,7]]},"assertion":[{"value":"2024-08-30","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}