{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,24]],"date-time":"2025-07-24T11:10:07Z","timestamp":1753355407150,"version":"3.32.0"},"reference-count":52,"publisher":"Association for Computing Machinery (ACM)","issue":"12","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2024,8]]},"abstract":"<jats:p>Query optimization is a key component in database management systems (DBMS) and distributed data processing platforms. Recent research in the database community incorporated techniques from artificial intelligence to enhance query optimization. Various learning models have been extended and applied to the query optimization tasks, including query execution plan, query rewriting, and cost estimation. The tasks involved in query optimization differ based on the type of data being processed, such as relational data or spatial geometries. This tutorial reviews recent learning-based approaches for spatial query optimization tasks. We go over methods designed specifically for spatial data, as well as solutions proposed for high-dimensional data. Additionally, we present learning-based spatial indexing and spatial partitioning methods, which are also vital components in spatial data processing. We also identify several open research problems in these fields.<\/jats:p>","DOI":"10.14778\/3685800.3685846","type":"journal-article","created":{"date-parts":[[2024,11,8]],"date-time":"2024-11-08T17:25:21Z","timestamp":1731086721000},"page":"4245-4248","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Spatial Query Optimization With Learning"],"prefix":"10.14778","volume":"17","author":[{"given":"Xin","family":"Zhang","sequence":"first","affiliation":[{"name":"University of California, Riverside, Riverside, USA"}]},{"given":"Ahmed","family":"Eldawy","sequence":"additional","affiliation":[{"name":"University of California, Riverside, Riverside, USA"}]}],"member":"320","published-online":{"date-parts":[[2024,11,8]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/3397536.3426358"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/3557915.3560935"},{"key":"e_1_2_1_3_1","unstructured":"Lance Ashdown et al. 2024. Oracle Database SQL Tuning Guide 19c. https:\/\/docs.oracle.com\/en\/database\/oracle\/oracle-database\/19\/tgsql\/index.html"},{"key":"e_1_2_1_4_1","volume-title":"Maliva: Using Machine Learning to Rewrite Visualization Queries Under Time Constraints.. In EDBT. 157--170.","author":"Bai Qiushi","year":"2023","unstructured":"Qiushi Bai, Sadeem Alsudais, Chen Li, and Shuang Zhao. 2023. Maliva: Using Machine Learning to Rewrite Visualization Queries Under Time Constraints.. In EDBT. 157--170."},{"volume-title":"Proceedings of the 30th International Conference on Advances in Geographic Information Systems. 1--4.","author":"Alberto","key":"e_1_2_1_5_1","unstructured":"Alberto Belussi et al. 2022. Spatial embedding: a generic machine learning model for spatial query optimization. In Proceedings of the 30th International Conference on Advances in Geographic Information Systems. 1--4."},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/3555041.3589401"},{"key":"e_1_2_1_7_1","volume-title":"Defining and designing spatial queries: the role of spatial relationships. Geo-spatial Information Science","author":"Carniel Anderson Chaves","year":"2023","unstructured":"Anderson Chaves Carniel. 2023. Defining and designing spatial queries: the role of spatial relationships. Geo-spatial Information Science (2023), 1--25."},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.14778\/3598581.3598597"},{"key":"e_1_2_1_9_1","doi-asserted-by":"crossref","unstructured":"Graham Cormode Minos Garofalakis Peter J Haas Chris Jermaine et al. 2011. Synopses for massive data: Samples histograms wavelets sketches. Foundations and Trends\u00ae in Databases 4 1--3 (2011) 1--294.","DOI":"10.1561\/1900000004"},{"key":"e_1_2_1_10_1","volume-title":"Rw-tree: A learned workload-aware framework for r-tree construction","author":"Dong Haowen","year":"2022","unstructured":"Haowen Dong, Chengliang Chai, Yuyu Luo, Jiabin Liu, Jianhua Feng, and Chaoqun Zhan. 2022. Rw-tree: A learned workload-aware framework for r-tree construction. In ICDE. IEEE, 2073--2085."},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.14778\/3329772.3329780"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.14778\/3137765.3137828"},{"key":"e_1_2_1_13_1","doi-asserted-by":"crossref","unstructured":"Ahmed Eldawy Mohamed F Mokbel et al. 2016. The era of big spatial data: A survey. Foundations and Trends\u00ae in Databases 6 3--4 (2016) 163--273.","DOI":"10.1561\/1900000054"},{"key":"e_1_2_1_14_1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3588917","article-title":"The rlr-tree: A reinforcement learning based r-tree for spatial data","volume":"1","author":"Gu Tu","year":"2023","unstructured":"Tu Gu, Kaiyu Feng, Gao Cong, Cheng Long, Zheng Wang, and Sheng Wang. 2023. The rlr-tree: A reinforcement learning based r-tree for spatial data. Proceedings of the ACM on Management of Data 1, 1 (2023), 1--26.","journal-title":"Proceedings of the ACM on Management of Data"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2019.00242"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/3514221.3517896"},{"key":"e_1_2_1_17_1","volume-title":"ASM: Harmonizing Autoregressive Model, Sampling, and Multi-dimensional Statistics Merging for Cardinality Estimation. PACMOD 2, 1","author":"Kim Kyoungmin","year":"2024","unstructured":"Kyoungmin Kim, Sangoh Lee, Injung Kim, and Wook-Shin Han. 2024. ASM: Harmonizing Autoregressive Model, Sampling, and Multi-dimensional Statistics Merging for Cardinality Estimation. PACMOD 2, 1 (2024)."},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/3448016.3457542"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/3318464.3389703"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.14778\/3626292.3626302"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2023.120022"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE55515.2023.00124"},{"key":"e_1_2_1_23_1","volume-title":"Proceedings of the VLDB Endowment 14","author":"Ryan","year":"2021","unstructured":"Ryan Marcus et al. 2021. Flow-loss: Learning cardinality estimates that matter. Proceedings of the VLDB Endowment 14, 11 (2021)."},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10707-018-00335-w"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.14778\/3583140.3583164"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/3318464.3389727"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.14778\/3407790.3407829"},{"key":"e_1_2_1_28_1","first-page":"4","article-title":"Sample-Efficient Cardinality Estimation Using Geometric Deep Learning","volume":"17","author":"Reiner Silvan","year":"2024","unstructured":"Silvan Reiner and Michael Grossniklaus. 2024. Sample-Efficient Cardinality Estimation Using Geometric Deep Learning. Proc. VLDB Endow. 17, 4 (mar 2024), 740--752.","journal-title":"Proc. VLDB Endow."},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE48307.2020.00169"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/3274895.3274987"},{"key":"e_1_2_1_31_1","volume-title":"Spatial Databases: A Tour","author":"Shekhar Shashi Shekhar","year":"2003","unstructured":"Shashi Shekhar Shekhar and Sanjay Chawla. 2003. Spatial Databases: A Tour. Prentice Hall."},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.14778\/3485450.3485459"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE53745.2022.00301"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/3474717.3484217"},{"key":"e_1_2_1_35_1","volume-title":"A learning-based framework for spatial join processing: estimation, optimization and tuning. The VLDB Journal","author":"Vu Tin","year":"2024","unstructured":"Tin Vu, Alberto Belussi, Sara Migliorini, and Ahmed Eldawy. 2024. A learning-based framework for spatial join processing: estimation, optimization and tuning. The VLDB Journal (2024), 1--23."},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.14778\/3368289.3368302"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.3390\/ijgi12040171"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/3448016.3452772"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/3514221.3526125"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.14778\/3611479.3611528"},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/3448016.3452830"},{"key":"e_1_2_1_42_1","first-page":"1","article-title":"FactorJoin: a new cardinality estimation framework for join queries","volume":"1","author":"Wu Ziniu","year":"2023","unstructured":"Ziniu Wu, Parimarjan Negi, Mohammad Alizadeh, Tim Kraska, and Samuel Madden. 2023. FactorJoin: a new cardinality estimation framework for join queries. Proceedings of the ACM on Management of Data 1, 1 (2023), 1--27.","journal-title":"Proceedings of the ACM on Management of Data"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.14778\/3611540.3611576"},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.14778\/3565838.3565846"},{"key":"e_1_2_1_45_1","first-page":"100731","article-title":"Clustering-based method for big spatial data partitioning. Measurement","volume":"27","author":"Zein Alaa Aldin","year":"2023","unstructured":"Alaa Aldin Zein, Salah Dowaji, and Mohamad Iyad Al-Khayatt. 2023. Clustering-based method for big spatial data partitioning. Measurement: Sensors 27 (2023), 100731.","journal-title":"Sensors"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10115-021-01547-7"},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1109\/TBDATA.2022.3186857"},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1145\/3514221.3526156"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.14778\/3529337.3529349"},{"key":"e_1_2_1_50_1","doi-asserted-by":"crossref","unstructured":"Johan Kok Zhi Kang et al. 2021. Efficient deep learning pipelines for accurate cost estimations over large scale query workload. In SIGMOD. 1014--1022.","DOI":"10.1145\/3448016.3457546"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.14778\/3485450.3485456"},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.14778\/3461535.3461539"}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/3685800.3685846","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,12,31]],"date-time":"2024-12-31T05:29:25Z","timestamp":1735622965000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/3685800.3685846"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,8]]},"references-count":52,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2024,8]]}},"alternative-id":["10.14778\/3685800.3685846"],"URL":"https:\/\/doi.org\/10.14778\/3685800.3685846","relation":{},"ISSN":["2150-8097"],"issn-type":[{"type":"print","value":"2150-8097"}],"subject":[],"published":{"date-parts":[[2024,8]]},"assertion":[{"value":"2024-11-08","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}