{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,20]],"date-time":"2026-01-20T09:51:26Z","timestamp":1768902686778,"version":"3.49.0"},"reference-count":42,"publisher":"Association for Computing Machinery (ACM)","issue":"3","license":[{"start":{"date-parts":[[2024,9,30]],"date-time":"2024-09-30T00:00:00Z","timestamp":1727654400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"NSERC Discovery","award":["RGPIN-2019-04613 and DGECR-2019-00120"],"award-info":[{"award-number":["RGPIN-2019-04613 and DGECR-2019-00120"]}]},{"DOI":"10.13039\/100027925","name":"Alliance","doi-asserted-by":"crossref","award":["ALLRP-552042-2020"],"award-info":[{"award-number":["ALLRP-552042-2020"]}],"id":[{"id":"10.13039\/100027925","id-type":"DOI","asserted-by":"crossref"}]},{"name":"CFI John R. Evans Leaders"},{"name":"Huawei Canada and AMD-Xilinx"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Reconfigurable Technol. Syst."],"published-print":{"date-parts":[[2024,9,30]]},"abstract":"<jats:p>Today\u2019s big data query engines are constantly under pressure to keep up with the rapidly increasing demand for faster processing of more complex workloads. In the past few years, FPGA-based database acceleration efforts have demonstrated promising performance improvement with good energy efficiency. However, few studies target the programming and design automation support to leverage the FPGA accelerator benefits in query processing. Most of them rely on the SQL query plan generated by CPU query engines and manually map the query plan onto the FPGA accelerators, which is tedious and error-prone. Moreover, such CPU-oriented query plans do not consider the utilization of FPGA accelerators and could lose more optimization opportunities.<\/jats:p>\n          <jats:p>In this article, we present SQL2FPGA, an FPGA accelerator-aware compiler to automatically map SQL queries onto the heterogeneous CPU-FPGA platforms. Our SQL2FPGA front-end takes an optimized logical plan of an SQL query from a database query engine and transforms it into a unified operator-level intermediate representation. To generate an optimized FPGA-aware physical plan, SQL2FPGA implements a set of compiler optimization passes to (1) improve operator acceleration coverage by the FPGA, (2) eliminate redundant computation during physical execution, and (3) minimize data transfer overhead between operators on the CPU and FPGA. Furthermore, it also leverages machine learning techniques to predict and identify the optimal platform, either CPU or FPGA, for the physical execution of individual query operations. Finally, SQL2FPGA generates the associated query acceleration code for heterogeneous CPU-FPGA system deployment. Compared to the widely used Apache Spark SQL framework running on the CPU, SQL2FPGA\u2014using two AMD\/Xilinx HBM-based Alveo U280 FPGA boards and Ver.2020 AMD\/Xilinx FPGA overlay designs\u2014achieves an average performance speedup of 10.1x and 13.9x across all 22 TPC-H benchmark queries in a scale factor of 1 GB (SF1) and 30 GB (SF30), respectively. While evaluated on AMD\/Xilinx Alveo U50 FPGA boards, SQL2FPGA using Ver. 2022 AMD\/Xilinx FPGA overlay designs also achieve an average speedup of 9.6x at SF1 scale factor.<\/jats:p>","DOI":"10.1145\/3674843","type":"journal-article","created":{"date-parts":[[2024,7,2]],"date-time":"2024-07-02T20:44:52Z","timestamp":1719953092000},"page":"1-28","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":4,"title":["SQL2FPGA: Automated Acceleration of SQL Query Processing on Modern CPU-FPGA Platforms"],"prefix":"10.1145","volume":"17","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3315-7368","authenticated-orcid":false,"given":"Alec","family":"Lu","sequence":"first","affiliation":[{"name":"School of Engineering Science, Simon Fraser University, Burnaby, BC, Canada"}]},{"ORCID":"https:\/\/orcid.org\/0009-0000-9709-2367","authenticated-orcid":false,"given":"Jahanvi","family":"Narendra Agrawal","sequence":"additional","affiliation":[{"name":"Simon Fraser University, Burnaby, BC, Canada"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0603-9697","authenticated-orcid":false,"given":"Zhenman","family":"Fang","sequence":"additional","affiliation":[{"name":"School of Engineering Science, Simon Fraser University, Burnaby, BC, Canada"}]}],"member":"320","published-online":{"date-parts":[[2024,9,30]]},"reference":[{"key":"e_1_3_1_2_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2012.64"},{"key":"e_1_3_1_3_2","doi-asserted-by":"publisher","DOI":"10.1145\/2723372.2742797"},{"key":"e_1_3_1_4_2","doi-asserted-by":"publisher","DOI":"10.1145\/2436696.2443836"},{"key":"e_1_3_1_5_2","doi-asserted-by":"publisher","DOI":"10.1145\/1735688.1735706"},{"key":"e_1_3_1_6_2","doi-asserted-by":"publisher","DOI":"10.1109\/FPL.2014.6927502"},{"key":"e_1_3_1_7_2","doi-asserted-by":"publisher","DOI":"10.1145\/2554688.2554787"},{"key":"e_1_3_1_8_2","doi-asserted-by":"publisher","DOI":"10.1109\/FCCM.2012.18"},{"key":"e_1_3_1_9_2","doi-asserted-by":"publisher","DOI":"10.1109\/FCCM.2013.38"},{"key":"e_1_3_1_10_2","doi-asserted-by":"publisher","DOI":"10.1145\/2000064.2000108"},{"key":"e_1_3_1_11_2","doi-asserted-by":"publisher","DOI":"10.1007\/s00778-019-00581-w"},{"key":"e_1_3_1_12_2","unstructured":"Phil Francisco. 2011. The Netezza Data Appliance Architecture: A Platform for High Performance Data Warehousing and Analytics. IBM Corp."},{"key":"e_1_3_1_13_2","doi-asserted-by":"publisher","DOI":"10.1145\/1620585.1620588"},{"key":"e_1_3_1_14_2","doi-asserted-by":"publisher","DOI":"10.1145\/3299869.3314041"},{"key":"e_1_3_1_15_2","doi-asserted-by":"publisher","DOI":"10.1145\/3337801.3337810"},{"key":"e_1_3_1_16_2","unstructured":"MonetDB. 2022. MonetDB: The Database System to Speed Up Your Analytical Jobs. Retrieved December 20 2022 from https:\/\/www.monetdb.org\/"},{"key":"e_1_3_1_17_2","doi-asserted-by":"publisher","DOI":"10.1145\/1807167.1807307"},{"key":"e_1_3_1_18_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICFPT56656.2022.9974596"},{"key":"e_1_3_1_19_2","doi-asserted-by":"publisher","DOI":"10.1109\/HOTCHIPS.2016.7936221"},{"key":"e_1_3_1_20_2","doi-asserted-by":"publisher","DOI":"10.1109\/FCCM.2017.37"},{"key":"e_1_3_1_21_2","doi-asserted-by":"publisher","DOI":"10.14778\/3476311.3476391"},{"key":"e_1_3_1_22_2","doi-asserted-by":"publisher","DOI":"10.1145\/2882903.2915224"},{"key":"e_1_3_1_23_2","first-page":"85","article-title":"Scikit-Learn: Machine Learning in Python","volume":"12","author":"Pedregosa Fabian","year":"2011","unstructured":"Fabian Pedregosa, Ga\u00ebl Varoquaux, Alexandre Gramfort, Vincent Michel, Bertrand Thirion, Olivier Grisel, Mathieu Blondel, Peter Prettenhofer, Ron Weiss, Vincent Dubourg, Jake Vanderplas, Alexandre Passos, David Cournapeau, Matthieu Brucher, Matthieu Perrot, and Edouard Duchesnay. 2011. Scikit-Learn: Machine Learning in Python. Journal of Machine Learning Research 12, 85 (2011), 2825\u20132830.","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_3_1_24_2","unstructured":"PostgreSQL. 2022. PostgreSQL: The World\u2019s Most Advanced Open Source Relational Database. Retrieved December 20 2022 from https:\/\/www.postgresql.org\/"},{"key":"e_1_3_1_25_2","doi-asserted-by":"publisher","DOI":"10.1007\/BF00116251"},{"key":"e_1_3_1_26_2","doi-asserted-by":"publisher","DOI":"10.1145\/3035918.3058746"},{"key":"e_1_3_1_27_2","unstructured":"Apache Spark. 2022. Unified Engine for Large-Scale Data Analytics. Retrieved December 20 2022 from https:\/\/spark.apache.org\/"},{"key":"e_1_3_1_28_2","doi-asserted-by":"publisher","DOI":"10.1145\/2370816.2370874"},{"key":"e_1_3_1_29_2","doi-asserted-by":"publisher","DOI":"10.1109\/SBAC-PAD.2013.21"},{"key":"e_1_3_1_30_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10766-014-0327-4"},{"key":"e_1_3_1_31_2","unstructured":"TPC. 2022a. TPC-DS is a Decision Support Benchmark. Retrieved December 20 2022 from https:\/\/www.tpc.org\/tpcds\/"},{"key":"e_1_3_1_32_2","unstructured":"TPC. 2022b. TPC-H is a Decision Support Benchmark. Retrieved December 20 2022 from https:\/\/www.tpc.org\/tpch\/"},{"key":"e_1_3_1_33_2","doi-asserted-by":"publisher","DOI":"10.1109\/FPL.2016.7577329"},{"key":"e_1_3_1_34_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2019.00067"},{"key":"e_1_3_1_35_2","doi-asserted-by":"publisher","DOI":"10.14778\/2732967.2732972"},{"key":"e_1_3_1_36_2","first-page":"44","volume-title":"Proceedings of Annual IEEE\/ACM International Symposium on Code Generation and Optimization (CGO \u201914)","author":"Wu Haicheng","year":"2014","unstructured":"Haicheng Wu, Gregory Diamos, Tim Sheard, Molham Aref, Sean Baxter, Michael Garland, and Sudhakar Yalamanchili. 2014a. Red Fox: An Execution Environment for Relational Query Processing on GPUs. In Proceedings of Annual IEEE\/ACM International Symposium on Code Generation and Optimization (CGO \u201914). ACM, New York, NY, 44\u201354."},{"key":"e_1_3_1_37_2","doi-asserted-by":"publisher","DOI":"10.1145\/2541940.2541961"},{"key":"e_1_3_1_38_2","unstructured":"Xilinx. 2022. Vitis Database Library. Retrieved December 20 2022 from https:\/\/www.xilinx.com\/products\/design-tools\/vitis\/vitis-libraries\/vitis-database.html"},{"key":"e_1_3_1_39_2","unstructured":"Xilinx. 2023a. Alveo U280 Data Center Accelerator Card Data Sheet (DS963). Retrieved December 20 2022 from https:\/\/docs.xilinx.com\/r\/en-US\/ds963-u280\/Summary"},{"key":"e_1_3_1_40_2","unstructured":"Xilinx. 2023b. Alveo U50 Data Center Accelerator Card Data Sheet. Retrieved December 20 2022 from https:\/\/www.xilinx.com\/content\/dam\/xilinx\/support\/documents\/data_sheets\/ds965-u50.pdf"},{"key":"e_1_3_1_41_2","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO50266.2020.00041"},{"key":"e_1_3_1_42_2","doi-asserted-by":"publisher","DOI":"10.5555\/3386691.3386713"},{"key":"e_1_3_1_43_2","doi-asserted-by":"publisher","DOI":"10.1145\/2845087"}],"container-title":["ACM Transactions on Reconfigurable Technology and Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3674843","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3674843","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T00:05:56Z","timestamp":1750291556000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3674843"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,9,30]]},"references-count":42,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2024,9,30]]}},"alternative-id":["10.1145\/3674843"],"URL":"https:\/\/doi.org\/10.1145\/3674843","relation":{},"ISSN":["1936-7406","1936-7414"],"issn-type":[{"value":"1936-7406","type":"print"},{"value":"1936-7414","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,9,30]]},"assertion":[{"value":"2024-01-11","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-06-17","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-09-30","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}