{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,20]],"date-time":"2026-02-20T19:39:33Z","timestamp":1771616373296,"version":"3.50.1"},"reference-count":100,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2024,12,18]],"date-time":"2024-12-18T00:00:00Z","timestamp":1734480000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,12,18]],"date-time":"2024-12-18T00:00:00Z","timestamp":1734480000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100004564","name":"Ministarstvo Prosvete, Nauke i Tehnolo\u0161kog Razvoja","doi-asserted-by":"publisher","award":["451-03-65\/2024-03\/200107"],"award-info":[{"award-number":["451-03-65\/2024-03\/200107"]}],"id":[{"id":"10.13039\/501100004564","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Ministarstvo Nauke, Tehnolo\u0161kog razvoja i inovacija, Serbia","award":["451-03-66\/2024-03\/20022"],"award-info":[{"award-number":["451-03-66\/2024-03\/20022"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Big Data"],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Modern database management systems (DBMS), primarily designed as general-purpose systems, face the challenging task of efficiently handling data from diverse sources for both analytical services and online transactional processing (OLTP). The volume of data has grown significantly, with distributions ranging from linear to highly skewed, sometimes involving very slow changes or rapid, intensive updates. 
Recent research in this field has been significantly influenced by advances in machine learning (ML), particularly deep learning (DL), and these developments have led to the application of various ML algorithms to enhance the efficiency of different parts of the query execution engine. While previous research studies were mostly focused on identifying drawbacks to individual components, such as the query optimizer, there is a notable lack of studies examining the applicability and effectiveness of various machine learning approaches across multiple aspects of the query execution engine. This article aims to provide a systematic review of approaches that apply deep learning models at various levels within the query execution engine. We categorize these approaches into three groups based on how such models are applied: improving performance of index structures and consequently data manipulation algorithms, query optimization tasks, and externally controlling query optimizers through parameter tuning. Furthermore, we discuss the key challenges associated with implementing deep learning algorithms in DBMS.<\/jats:p>","DOI":"10.1186\/s40537-024-01025-1","type":"journal-article","created":{"date-parts":[[2024,12,18]],"date-time":"2024-12-18T14:05:12Z","timestamp":1734530712000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["A systematic review of deep learning applications in database query execution"],"prefix":"10.1186","volume":"11","author":[{"given":"Bogdan","family":"Milicevic","sequence":"first","affiliation":[]},{"given":"Zoran","family":"Babovic","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2024,12,18]]},"reference":[{"key":"1025_CR1","doi-asserted-by":"publisher","first-page":"72","DOI":"10.1145\/3524284","volume":"65","author":"D Abadi","year":"2022","unstructured":"Abadi D, et al. The seattle report on database research. Commun ACM. 
2022;65:72\u20139.","journal-title":"Commun ACM"},{"key":"1025_CR2","doi-asserted-by":"crossref","unstructured":"Akdere M, \u00c7etintemel U, Riondato M, Upfal E, Zdonik SB. Learning-based query performance modeling and prediction. In 2012 IEEE 28th International Conference on Data Engineering. 2012: 390\u2013401.","DOI":"10.1109\/ICDE.2012.64"},{"key":"1025_CR3","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2403.06456","author":"A Al-Mamun","year":"2024","unstructured":"Al-Mamun A, Wu H, He Q, Wang J, Aref WG. A survey of learned indexes for the multi-dimensional space. arXiv preprint. 2024. https:\/\/doi.org\/10.48550\/arXiv.2403.06456.","journal-title":"arXiv preprint"},{"issue":"1","key":"1025_CR4","doi-asserted-by":"publisher","first-page":"73","DOI":"10.1186\/s40537-023-00731-6","volume":"10","author":"Z Babovi\u0107","year":"2023","unstructured":"Babovi\u0107 Z, et al. Research in computing-intensive simulations for nature-oriented civil-engineering and related scientific fields, using machine learning and big data: an overview of open problems. J Big Data. 2023;10(1):73.","journal-title":"J Big Data"},{"key":"1025_CR5","doi-asserted-by":"publisher","DOI":"10.1145\/3448016.3452831","author":"M Bandle","year":"2021","unstructured":"Bandle M, Giceva J, Neumann T. To partition or not to partition, that is the join question in a real system. ACM SIGMOD. 2021. https:\/\/doi.org\/10.1145\/3448016.3452831.","journal-title":"ACM SIGMOD"},{"key":"1025_CR6","volume-title":"Main-memory hash joins on multi-core CPUs: tuning to the underlying hardware","author":"C Balkesen","year":"2013","unstructured":"Balkesen C, Teubner J, Alonso G, \u00d6zsu MT. Main-memory hash joins on multi-core CPUs: tuning to the underlying hardware. Oslo: ICDE; 2013."},{"key":"1025_CR7","first-page":"85","volume":"7","author":"C Balkesen","year":"2013","unstructured":"Balkesen C, Alonso G, Teubner J, \u00d6zsu MT. Multi-core, main-memory joins: sort vs. Hash revisited. PVLDB. 
2013;7:85\u201396.","journal-title":"PVLDB"},{"key":"1025_CR8","first-page":"1","volume-title":"Teaching graduate students how to review research articles and respond to reviewer comments","author":"V Bankovic","year":"2020","unstructured":"Bankovic V, et al. Teaching graduate students how to review research articles and respond to reviewer comments. Amsterdam: Elsevier; 2020. p. 1\u201363."},{"key":"1025_CR9","unstructured":"Bao for PostgreSQL prototype. (n.d.). Retrieved May, 2024 from https:\/\/learned.systems\/bao"},{"key":"1025_CR10","unstructured":"Bao online appendix. (n.d.). Retrieved May, 2024 from https:\/\/rm.cab\/bao_appendix"},{"key":"1025_CR11","first-page":"1","volume-title":"A systematic approach to generation of new ideas for PhD research in computing","author":"V Blagojevic","year":"2017","unstructured":"Blagojevic V, et al. A systematic approach to generation of new ideas for PhD research in computing. Amsterdam: Elsevier; 2017. p. 1\u201331."},{"key":"1025_CR12","volume-title":"Encyclopedia of robotics","author":"S Calinon","year":"2018","unstructured":"Calinon S. Learning from demonstration (programming by demonstration). In: Ang MH, Khatib O, Siciliano B, editors. Encyclopedia of robotics. Berlin: Springer; 2018."},{"key":"1025_CR13","doi-asserted-by":"publisher","first-page":"199","DOI":"10.1007\/s007780100049","volume":"10","author":"K Chakrabarti","year":"2001","unstructured":"Chakrabarti K, Garofalakis M, Rastogi R, Shim K. Approximate query processing using wavelets. VLDB J. 2001;10:199\u2013223.","journal-title":"VLDB J"},{"key":"1025_CR14","doi-asserted-by":"crossref","unstructured":"Chaudhuri S. An Overview of Query Optimization in Relational Systems. Proceedings of the ACM SIGMOD International Conference on Management of Data. 1998.","DOI":"10.1145\/275487.275492"},{"key":"1025_CR15","unstructured":"Dai Y, Xu Y, Ganesan A, Alagappan R, Kroth B, Arpaci-Dusseau AC, Arpaci-Dusseau RH. 
From WiscKey to Bourbon: A Learned Index for Log-Structured Merge Trees. OSDI. 2020: 155\u2013171."},{"key":"1025_CR16","unstructured":"Dell'Era A. Join Over Histograms. 2007, Available on www.adellera.it\/investigations\/join_over_histograms."},{"key":"1025_CR17","doi-asserted-by":"publisher","first-page":"199","DOI":"10.1145\/376284.375685","volume":"30","author":"A Deshpande","year":"2001","unstructured":"Deshpande A, Garofalakis M, Rastogi R. Independence is good. ACM SIGMOD Rec. 2001;30:199\u2013210.","journal-title":"ACM SIGMOD Rec"},{"key":"1025_CR18","first-page":"969","volume":"20","author":"J Ding","year":"2020","unstructured":"Ding J, et al. Alex: an updatable adaptive learned index. ACM SIGMOD. 2020;20:969\u201384.","journal-title":"ACM SIGMOD"},{"key":"1025_CR19","first-page":"4062","volume":"15","author":"J Ding","year":"2022","unstructured":"Ding J, Marcus RC, Kipf A, Nathan V, Nrusimha A, Vaidya K, van Renen A, Kraska T. SageDB: an instance-optimized data analytics system. PVLDB. 2022;15:4062\u201378.","journal-title":"PVLDB"},{"key":"1025_CR20","first-page":"1","volume":"1","author":"L Doshi","year":"2023","unstructured":"Doshi L, et al. Kepler: robust learning for parametric query optimization. ACM. 2023;1:1\u201325.","journal-title":"ACM"},{"key":"1025_CR21","doi-asserted-by":"publisher","DOI":"10.1145\/1989323.1989359","author":"J Duggan","year":"2011","unstructured":"Duggan J, Cetintemel U, Papaemmanouil O, Upfal E. Performance prediction for concurrent database workloads. ACM SIGMOD. 2011. https:\/\/doi.org\/10.1145\/1989323.1989359.","journal-title":"ACM SIGMOD"},{"key":"1025_CR22","volume-title":"Predicting multiple metrics for queries: better decisions enabled by machine learning","author":"A Ganapathi","year":"2009","unstructured":"Ganapathi A, et al. Predicting multiple metrics for queries: better decisions enabled by machine learning. 
Piscataway: IEEE; 2009."},{"key":"1025_CR23","doi-asserted-by":"publisher","first-page":"241","DOI":"10.1007\/s10115-015-0830-y","volume":"46","author":"A Gani","year":"2016","unstructured":"Gani A, Siddiqa A, Shamshirband S, Hanum F. A survey on indexing techniques for big data: taxonomy and performance evaluation. Knowl Inform Syst. 2016;46:241\u201384.","journal-title":"Knowl Inform Syst"},{"key":"1025_CR24","first-page":"349","volume-title":"B-tree indexes and CPU caches","author":"G Graefe","year":"2001","unstructured":"Graefe G, Larson PA. B-tree indexes and CPU caches. Oslo: ICDE; 2001. p. 349\u201358."},{"key":"1025_CR25","doi-asserted-by":"crossref","unstructured":"Graefe G, McKenna WJ. The volcano optimizer generator: Extensibility and efficient search. In Proceedings of IEEE 9th international conference on data engineering. 1993: 209\u2013218.","DOI":"10.1109\/ICDE.1993.344061"},{"key":"1025_CR26","first-page":"1","volume":"1","author":"T Gu","year":"2023","unstructured":"Gu T, Feng K, Cong G, Long C, Wang Z, Wang S. The RLR-tree: a reinforcement learning based R-tree for spatial data. ACM. 2023;1:1\u201326.","journal-title":"ACM"},{"key":"1025_CR27","unstructured":"Hardesty L. Building Systems That Automatically Adjust to Workloads and Data, Amazon Science Blog. Accessed on Apr 2024. 2022."},{"key":"1025_CR28","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2109.05877","author":"Y Han","year":"2021","unstructured":"Han Y, et al. Cardinality estimation in dbms: a comprehensive benchmark evaluation. arXiv preprint. 2021. https:\/\/doi.org\/10.48550\/arXiv.2109.05877.","journal-title":"arXiv preprint"},{"key":"1025_CR29","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v32i1.11757","author":"T Hester","year":"2018","unstructured":"Hester T, et al. Deep Q-learning from demonstrations. AAAI. 2018. 
https:\/\/doi.org\/10.1609\/aaai.v32i1.11757.","journal-title":"AAAI"},{"key":"1025_CR30","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3447772","volume":"54","author":"A Hogan","year":"2021","unstructured":"Hogan A, et al. Knowledge graphs. ACM Comput Surv. 2021;54:1\u201337.","journal-title":"ACM Comput Surv"},{"key":"1025_CR31","doi-asserted-by":"crossref","unstructured":"Ioannidis Y. 2003 The history of histograms (abridged). In Proceedings 2003 VLDB Conference. 2003: 19\u201330.","DOI":"10.1016\/B978-012722442-8\/50011-2"},{"issue":"2","key":"1025_CR32","doi-asserted-by":"publisher","first-page":"111","DOI":"10.1145\/356924.356928","volume":"16","author":"M Jarke","year":"1984","unstructured":"Jarke M, Koch J. Query optimization in database systems. ACM Comput Surv. 1984;16(2):111\u201352.","journal-title":"ACM Comput Surv"},{"key":"1025_CR33","doi-asserted-by":"publisher","first-page":"209","DOI":"10.1023\/A:1017984413808","volume":"49","author":"M Kearns","year":"2002","unstructured":"Kearns M, Singh S. Near-optimal reinforcement learning in polynomial time. Mach Learn. 2002;49:209\u201332.","journal-title":"Mach Learn"},{"key":"1025_CR34","doi-asserted-by":"publisher","DOI":"10.1145\/1807167.1807206","author":"C Kim","year":"2010","unstructured":"Kim C, et al. FAST: fast architecture-sensitive tree search on modern CPUs and GPUs. ACM SIGMOD. 2010. https:\/\/doi.org\/10.1145\/1807167.1807206.","journal-title":"ACM SIGMOD"},{"key":"1025_CR35","doi-asserted-by":"crossref","unstructured":"Kim K, Jung J, Seo I, Han WS, Choi K, Chong J. Learned cardinality estimation: An in-depth study. In Proceedings of the 2022 International Conference on Management of Data. 2022: 1214\u20131227.","DOI":"10.1145\/3514221.3526154"},{"key":"1025_CR36","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1809.00677","author":"A Kipf","year":"2019","unstructured":"Kipf A, Kipf T, Radke B, Leis V, Boncz PA, Kemper A. 
Learned cardinalities: estimating correlated joins with deep learning. CIDR. 2019. https:\/\/doi.org\/10.48550\/arXiv.1809.00677.","journal-title":"CIDR"},{"key":"1025_CR37","doi-asserted-by":"publisher","DOI":"10.1109\/SSDM.1999.787640","volume-title":"Range selectivity estimation for continuous attributes","author":"F Korn","year":"1999","unstructured":"Korn F, Johnson T, Jagadish HV. Range selectivity estimation for continuous attributes. Piscataway: IEEE; 1999."},{"key":"1025_CR38","unstructured":"Kraska T. et al. SageDB: A Learned Database System. Conference on Innovative Data Systems Research (CIDR). 2019."},{"key":"1025_CR39","first-page":"489","volume":"18","author":"T Kraska","year":"2018","unstructured":"Kraska T, Beutel A, Chi EH, Dean J, Polyzotis N. The case for learned index structures. ACM SIGMOD. 2018;18:489\u2013504.","journal-title":"ACM SIGMOD"},{"key":"1025_CR40","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1808.03196","author":"S Krishnan","year":"2018","unstructured":"Krishnan S, Yang Z, Goldberg K, Hellerstein JM, Stoica I. Learning to optimize join queries with deep reinforcement learning. CoRR. 2018. https:\/\/doi.org\/10.48550\/arXiv.1808.03196.","journal-title":"CoRR"},{"key":"1025_CR41","doi-asserted-by":"publisher","DOI":"10.1145\/3318464.3389752","author":"A Kristo","year":"2020","unstructured":"Kristo A, Vaidya K, \u00c7etintemel U, Misra S, Kraska T. The case for a learned sorting algorithm. ACM SIGMOD. 2020. https:\/\/doi.org\/10.1145\/3318464.3389752.","journal-title":"ACM SIGMOD"},{"key":"1025_CR42","first-page":"1","volume":"1","author":"M Kurmanji","year":"2023","unstructured":"Kurmanji M, Triantafillou P. Detect, distill and update: learned DB systems facing out of distribution data. ACM. 2023;1:1\u201327.","journal-title":"ACM"},{"key":"1025_CR43","doi-asserted-by":"publisher","first-page":"86","DOI":"10.1007\/s41019-020-00149-7","volume":"6","author":"H Lan","year":"2021","unstructured":"Lan H, Bao Z, Peng Y. 
A survey on advancing the dbms query optimizer: cardinality estimation, cost model, and plan enumeration. Data Sci Eng. 2021;6:86\u2013101.","journal-title":"Data Sci Eng"},{"key":"1025_CR44","first-page":"1","volume":"1","author":"H Lan","year":"2023","unstructured":"Lan H, Bao Z, Culpepper JS, Borovica-Gajic R. Updatable learned indexes meet disk-resident DBMS: from evaluations to design choices. ACM. 2023;1:1\u201322.","journal-title":"ACM"},{"key":"1025_CR45","doi-asserted-by":"publisher","DOI":"10.14778\/3654621.3654625","author":"C Lehmann","year":"2023","unstructured":"Lehmann C, Sulimov P, Stockinger K. Is your learned query optimizer behaving as you expect? a machine learning perspective. arXiv. 2023. https:\/\/doi.org\/10.14778\/3654621.3654625.","journal-title":"arXiv"},{"key":"1025_CR46","first-page":"204","volume":"9","author":"V Leis","year":"2015","unstructured":"Leis V, Gubichev A, Mirchev A, Boncz P, Kemper A, Neumann T. How good are query optimizers, really? PVLDB. 2015;9:204\u201315.","journal-title":"PVLDB"},{"key":"1025_CR47","unstructured":"Leis V, Radke B, Gubichev A, Kemper A, Neumann T. Cardinality Estimation Done Right: Index-Based Join Sampling. In CIDR. 2017."},{"key":"1025_CR48","doi-asserted-by":"publisher","first-page":"643","DOI":"10.1007\/s00778-017-0480-7","volume":"27","author":"V Leis","year":"2018","unstructured":"Leis V, Radke B, Gubichev A, Mirchev A, Boncz P, Kemper A, Neumann T. Query optimization through the looking glass, and what we found running the join order benchmark. VLDB J. 2018;27:643\u201368.","journal-title":"VLDB J"},{"key":"1025_CR49","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2024.3364183","author":"M Li","year":"2024","unstructured":"Li M, et al. A survey of multi-dimensional indexes: past and future trends. IEEE Trans Knowl Data Eng. 2024. 
https:\/\/doi.org\/10.1109\/TKDE.2024.3364183.","journal-title":"IEEE Trans Knowl Data Eng"},{"key":"1025_CR50","first-page":"2118","volume":"12","author":"G Li","year":"2019","unstructured":"Li G, Zhou X, Li S, Gao B. QTune: a query-aware database tuning system with deep reinforcement learning. PVLDB. 2019;12:2118\u201330.","journal-title":"PVLDB"},{"key":"1025_CR51","doi-asserted-by":"crossref","unstructured":"Lipton RJ, Naughton JF, Schneider DA. Practical selectivity estimation through adaptive sampling. In Proceedings of the 1990 ACM SIGMOD international conference on Management of data. 1990: 1\u201311.","DOI":"10.1145\/93597.93611"},{"key":"1025_CR52","unstructured":"Lohman G. Is Query Optimization a Solved Problem?. ACM SIGMOD Blog, ACM Blog. 2014: 14."},{"key":"1025_CR53","unstructured":"Marcus R. More Bao Results: Learned Distributed Query Optimization on Vertica, Redshift, and Azure Synapse. Retrieved May, 2024 from https:\/\/learnedsystems.mit.edu\/bao-distributed\/. 2021."},{"key":"1025_CR54","doi-asserted-by":"publisher","DOI":"10.1145\/3448016.3452838","author":"R Marcus","year":"2021","unstructured":"Marcus R, et al. Bao: making learned query optimization practical. ACM SIGMOD. 2021. https:\/\/doi.org\/10.1145\/3448016.3452838.","journal-title":"ACM SIGMOD"},{"key":"1025_CR55","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1809.10212","author":"R Marcus","year":"2018","unstructured":"Marcus R, Papaemmanouil O. Towards a hands-free query optimizer through deep learning. arXiv. 2018. https:\/\/doi.org\/10.48550\/arXiv.1809.10212.","journal-title":"arXiv"},{"key":"1025_CR56","doi-asserted-by":"publisher","DOI":"10.14778\/3342263.3342646","author":"R Marcus","year":"2019","unstructured":"Marcus R, Papaemmanouil O. Plan-structured deep neural network models for query performance prediction. arXiv. 2019. 
https:\/\/doi.org\/10.14778\/3342263.3342646.","journal-title":"arXiv"},{"key":"1025_CR57","doi-asserted-by":"publisher","DOI":"10.14778\/3342263.3342644","author":"R Marcus","year":"2019","unstructured":"Marcus R, Negi P, Mao H, Zhang C, Alizadeh M, Kraska T, Papaemmanouil O. Neo: a learned query optimizer. Proc VLDB Endow. 2019. https:\/\/doi.org\/10.14778\/3342263.3342644.","journal-title":"Proc VLDB Endow"},{"key":"1025_CR58","doi-asserted-by":"publisher","DOI":"10.1145\/3183713.3196926","author":"S Mudgal","year":"2018","unstructured":"Mudgal S, et al. Deep learning for entity matching. ACM SIGMOD. 2018. https:\/\/doi.org\/10.1145\/3183713.3196926.","journal-title":"ACM SIGMOD"},{"key":"1025_CR59","doi-asserted-by":"publisher","DOI":"10.1145\/3318464.3380579","author":"V Nathan","year":"2020","unstructured":"Nathan V, Ding J, Alizadeh M, Kraska T. Learning multi-dimensional indexes. ACM SIGMOD. 2020. https:\/\/doi.org\/10.1145\/3318464.3380579.","journal-title":"ACM SIGMOD"},{"key":"1025_CR60","doi-asserted-by":"publisher","DOI":"10.1145\/3209889.3209890","author":"J Ortiz","year":"2018","unstructured":"Ortiz J, Balazinska M, Gehrke J, Keerthi SS. Learning state representations for query optimization with deep reinforcement learning. arXiv. 2018. https:\/\/doi.org\/10.1145\/3209889.3209890.","journal-title":"arXiv"},{"issue":"2","key":"1025_CR61","doi-asserted-by":"publisher","first-page":"294","DOI":"10.1145\/235968.233342","volume":"25","author":"V Poosala","year":"1996","unstructured":"Poosala V, Haas PJ, Ioannidis YE, Shekita EJ. Improved histograms for selectivity estimation of range predicates. ACM Sigmod Record. 1996;25(2):294\u2013305.","journal-title":"ACM Sigmod Record"},{"key":"1025_CR62","unstructured":"PostgreSQL DBMS. (n.d.), Optimizer Commit History, https:\/\/github.com\/postgres\/postgres\/commits\/master\/src\/backend\/optimizer\/. 
Retrieved May, 2024"},{"key":"1025_CR63","first-page":"96","volume":"9","author":"S Richter","year":"2015","unstructured":"Richter S, Alvarez V, Dittrich J. A seven-dimensional analysis of hashing methods and its implications on query processing. PVLDB. 2015;9:96\u2013107.","journal-title":"PVLDB"},{"key":"1025_CR64","doi-asserted-by":"publisher","DOI":"10.1145\/3514221.3526158","author":"I Sabek","year":"2022","unstructured":"Sabek I, Ukyab TS, Kraska T. LSched: a workload-aware learned query scheduler for analytical database systems. ACM SIGMOD. 2022. https:\/\/doi.org\/10.1145\/3514221.3526158.","journal-title":"ACM SIGMOD"},{"issue":"6","key":"1025_CR65","doi-asserted-by":"publisher","first-page":"420","DOI":"10.1007\/s42979-021-00815-1","volume":"2","author":"IH Sarker","year":"2021","unstructured":"Sarker IH. Deep learning: a comprehensive overview on techniques, taxonomy, applications and research directions. SN Comput Sci. 2021;2(6):420.","journal-title":"SN Comput Sci"},{"key":"1025_CR66","doi-asserted-by":"publisher","DOI":"10.1145\/3555041.3589677","author":"G Saxena","year":"2023","unstructured":"Saxena G, et al. Auto-WLM: machine learning enhanced workload management in amazon redshift. ACM SIGMOD Companion. 2023. https:\/\/doi.org\/10.1145\/3555041.3589677.","journal-title":"ACM SIGMOD Companion"},{"key":"1025_CR67","doi-asserted-by":"publisher","DOI":"10.1145\/3639299","author":"T Schwabe","year":"2023","unstructured":"Schwabe T, Acosta M. Cardinality estimation over knowledge graphs with embeddings and graph neural networks. arXiv. 2023. https:\/\/doi.org\/10.1145\/3639299.","journal-title":"arXiv"},{"key":"1025_CR68","doi-asserted-by":"publisher","DOI":"10.1145\/582095.582099","author":"PG Selinger","year":"1979","unstructured":"Selinger PG, Astrahan MM, Chamberlin DD, Lorie RA, Price TG. Access path selection in a relational database management system. ACM SIGMOD. 1979. 
https:\/\/doi.org\/10.1145\/582095.582099.","journal-title":"ACM SIGMOD"},{"key":"1025_CR69","first-page":"1","volume":"1","author":"Y Sheng","year":"2023","unstructured":"Sheng Y, et al. WISK: a workload-aware learned index for spatial keyword queries. ACM. 2023;1:1\u201327.","journal-title":"ACM"},{"issue":"8","key":"1025_CR70","doi-asserted-by":"publisher","first-page":"1992","DOI":"10.14778\/3594512.3594528","volume":"16","author":"Z Sun","year":"2023","unstructured":"Sun Z, Zhou X, Li G. Learned index: a comprehensive experimental evaluation. Proc VLDB Endow. 2023;16(8):1992\u20132004.","journal-title":"Proc VLDB Endow"},{"key":"1025_CR71","unstructured":"Stillger M, Lohman GM, Markl V, Kandil M. LEO-DB2's learning optimizer. In VLDB. 2001. 1: 19\u201328."},{"issue":"1","key":"1025_CR72","doi-asserted-by":"publisher","first-page":"85","DOI":"10.14778\/3485450.3485459","volume":"15","author":"J Sun","year":"2021","unstructured":"Sun J, Zhang J, Sun Z, Li G, Tang N. Learned cardinality estimation: a design space exploration and a comparative evaluation. Proc VLDB Endow. 2021;15(1):85\u201397.","journal-title":"Proc VLDB Endow"},{"key":"1025_CR73","doi-asserted-by":"crossref","unstructured":"Swami A. Optimization of large join queries: Combining heuristics and combinatorial techniques. In Proceedings of the 1989 ACM SIGMOD International Conference on Management of data. 1989: 367\u2013376.","DOI":"10.1145\/67544.66961"},{"key":"1025_CR74","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2017.8202133","author":"J Tobin","year":"2017","unstructured":"Tobin J, Fong R, Ray A, Schneider J, Zaremba W, Abbeel P. Domain randomization for transferring deep neural networks from simulation to the real world. arXiv. 2017. 
https:\/\/doi.org\/10.1109\/IROS.2017.8202133.","journal-title":"arXiv"},{"key":"1025_CR75","volume-title":"The vertica query optimizer: the case for specialized query optimizers","author":"N Tran","year":"2014","unstructured":"Tran N, Lamb A, Shrinivas L, Bodagala S, Dave J. The vertica query optimizer: the case for specialized query optimizers. Piscataway: IEEE; 2014."},{"key":"1025_CR76","doi-asserted-by":"crossref","unstructured":"Trummer I, Wang J, Maram D, Moseley S, Jo S, Antonakakis J. SkinnerDB. ACM SIGMOD. 2019.","DOI":"10.1145\/3299869.3300088"},{"key":"1025_CR77","first-page":"852","volume":"4","author":"K Tzoumas","year":"2011","unstructured":"Tzoumas K, Deshpande A, Jensen CS. Lightweight graphical models for selectivity estimation without independence assumptions. PVLDB. 2011;4:852\u201363.","journal-title":"PVLDB"},{"key":"1025_CR78","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1007\/s00778-012-0293-7","volume":"22","author":"K Tzoumas","year":"2012","unstructured":"Tzoumas K, Deshpande A, Jensen CS. Efficiently adapting graphical models for selectivity estimation. VLDB J. 2012;22:3\u201327.","journal-title":"VLDB J"},{"key":"1025_CR79","first-page":"1241","volume":"14","author":"D Van Aken","year":"2021","unstructured":"Van Aken D, Yang D, Brillard S, Fiorino A, Zhang B, Billian C, Pavlo A. An inquiry into machine learning-based automatic configuration tuning services on real-world database management systems. PVLDB. 2021;14:1241\u201353.","journal-title":"PVLDB"},{"key":"1025_CR80","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v30i1.10295","author":"H Van Hasselt","year":"2016","unstructured":"Van Hasselt H, Guez A, Silver D. Deep reinforcement learning with double Q-learning. AAAI. 2016. https:\/\/doi.org\/10.1609\/aaai.v30i1.10295.","journal-title":"AAAI"},{"key":"1025_CR81","first-page":"51","volume-title":"Lecture notes in computer science","author":"F Waas","year":"2000","unstructured":"Waas F, Pellenkoft A. 
Join order selection (good enough is easy). In: Goos G, Hartmanis J, editors. Lecture notes in computer science. Berlin: Springer; 2000. p. 51\u201367."},{"key":"1025_CR82","doi-asserted-by":"publisher","first-page":"17","DOI":"10.1145\/3003665.3003669","volume":"45","author":"W Wang","year":"2016","unstructured":"Wang W, et al. Database meets deep learning: challenges and opportunities. ACM SIGMOD Rec. 2016;45:17\u201322.","journal-title":"ACM SIGMOD Rec"},{"key":"1025_CR83","first-page":"1640","volume":"14","author":"X Wang","year":"2021","unstructured":"Wang X, et al. Are we ready for learned cardinality estimation? PVLDB. 2021;14:1640\u201354.","journal-title":"PVLDB"},{"key":"1025_CR84","doi-asserted-by":"publisher","first-page":"17","DOI":"10.1145\/3003665.3003669","volume":"45","author":"W Wang","year":"2016","unstructured":"Wang W, Zhang M, Chen G, Jagadish HV, Ooi BC, Tan K-L. Database meets deep learning. ACM SIGMOD Rec. 2016;45:17\u201322.","journal-title":"ACM SIGMOD Rec"},{"key":"1025_CR85","first-page":"925","volume":"6","author":"W Wu","year":"2013","unstructured":"Wu W, Chi Y, Hac\u0131g\u00fcm\u00fc\u015f H, Naughton JF. Towards predicting query execution time for concurrent and dynamic database workloads. PVLDB. 2013;6:925\u201336.","journal-title":"PVLDB"},{"key":"1025_CR86","volume-title":"Predicting query execution time: are optimizer cost models really unusable?","author":"W Wu","year":"2013","unstructured":"Wu W, Chi Y, Zhu S, Tatemura J, Hac\u0131g\u00fcm\u00fc\u015f H, Naughton JF. Predicting query execution time: are optimizer cost models really unusable? Piscataway: IEEE; 2013."},{"key":"1025_CR87","first-page":"210","volume":"12","author":"C Wu","year":"2018","unstructured":"Wu C, Jindal A, Amizadeh S, Patel H, Le W, Qiao S, Rao S. Towards a learning optimizer for shared clouds. PVLDB. 
2018;12:210\u201322.","journal-title":"PVLDB"},{"key":"1025_CR88","first-page":"1","volume":"1","author":"Z Wu","year":"2023","unstructured":"Wu Z, Negi P, Alizadeh M, Kraska T, Madden S. FactorJoin: a new cardinality estimation framework for join queries. ACM. 2023;1:1\u201327.","journal-title":"ACM"},{"key":"1025_CR89","first-page":"3","volume":"13","author":"Z Yang","year":"2019","unstructured":"Yang Z, et al. Deep unsupervised cardinality estimation. PVLDB. 2019;13:3\u2013279.","journal-title":"PVLDB"},{"key":"1025_CR90","doi-asserted-by":"publisher","first-page":"279","DOI":"10.14778\/3368289.3368294","volume":"13","author":"Z Yang","year":"2019","unstructured":"Yang Z, et al. Deep unsupervised cardinality estimation. Proc VLDB Endow. 2019;13:279\u201392.","journal-title":"Proc VLDB Endow"},{"key":"1025_CR91","doi-asserted-by":"publisher","first-page":"61","DOI":"10.14778\/3421424.3421432","volume":"14","author":"Z Yang","year":"2020","unstructured":"Yang Z, et al. NeuroCard. Proc VLDB Endow. 2020;14:61\u201373.","journal-title":"Proc VLDB Endow"},{"key":"1025_CR92","volume-title":"Machine learning for query optimization","author":"Z Yang","year":"2022","unstructured":"Yang Z. Machine learning for query optimization. Berkeley: EECS Department, University of California; 2022."},{"key":"1025_CR93","doi-asserted-by":"publisher","DOI":"10.1145\/3514221.3517885","author":"Z Yang","year":"2022","unstructured":"Yang Z, Chiang W-L, Luan S, Mittal G, Luo M, Stoica I. Balsa: learning a query optimizer without expert demonstrations. ACM SIGMOD. 2022. https:\/\/doi.org\/10.1145\/3514221.3517885.","journal-title":"ACM SIGMOD"},{"key":"1025_CR94","first-page":"1297","volume-title":"Reinforcement learning with tree-LSTM for join order selection","author":"X Yu","year":"2020","unstructured":"Yu X, Li G, Chai C, Tang N. Reinforcement learning with tree-LSTM for join order selection. Piscataway: IEEE ICDE; 2020. p. 
1297\u2013308."},{"key":"1025_CR95","doi-asserted-by":"publisher","DOI":"10.1145\/3299869.3300085","author":"J Zhang","year":"2019","unstructured":"Zhang J, et al. An end-to-end automatic cloud database tuning system using deep reinforcement learning. ACM SIGMOD. 2019. https:\/\/doi.org\/10.1145\/3299869.3300085.","journal-title":"ACM SIGMOD"},{"key":"1025_CR96","doi-asserted-by":"publisher","DOI":"10.1145\/3183713.3183739","author":"Z Zhao","year":"2018","unstructured":"Zhao Z, Christensen R, Li F, Hu X, Yi K. Random sampling over joins revisited. ACM SIGMOD. 2018. https:\/\/doi.org\/10.1145\/3183713.3183739.","journal-title":"ACM SIGMOD"},{"issue":"12","key":"1025_CR97","doi-asserted-by":"publisher","first-page":"12470","DOI":"10.1109\/TKDE.2023.3266893","volume":"35","author":"X Zhao","year":"2023","unstructured":"Zhao X, Zhou X, Li G. Automatic database knob tuning: a survey. IEEE Trans Knowl Data Eng. 2023;35(12):12470\u201390.","journal-title":"IEEE Trans Knowl Data Eng"},{"key":"1025_CR98","doi-asserted-by":"publisher","first-page":"1096","DOI":"10.1109\/TKDE.2020.2994641","volume":"34","author":"X Zhou","year":"2020","unstructured":"Zhou X, Chai C, Li G, Sun J. Database meets artificial intelligence: a survey. IEEE Trans Knowl Data Eng. 2020;34:1096\u2013116.","journal-title":"IEEE Trans Knowl Data Eng"},{"key":"1025_CR99","first-page":"1","volume":"1","author":"X Zhou","year":"2023","unstructured":"Zhou X, Li G, Feng J, Liu L, Guo W. Grep: a graph learning-based database partitioning system. ACM. 2023;1:1\u201324.","journal-title":"ACM"},{"key":"1025_CR100","doi-asserted-by":"publisher","first-page":"100304","DOI":"10.1016\/j.bdr.2021.100304","volume":"27","author":"B Zou","year":"2022","unstructured":"Zou B, You J, Wang Q, Wen X, Jia L. Survey on learnable databases: a machine learning perspective. Big Data Res. 
2022;27:100304.","journal-title":"Big Data Res"}],"container-title":["Journal of Big Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-024-01025-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s40537-024-01025-1\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-024-01025-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,12,18]],"date-time":"2024-12-18T15:17:19Z","timestamp":1734535039000},"score":1,"resource":{"primary":{"URL":"https:\/\/journalofbigdata.springeropen.com\/articles\/10.1186\/s40537-024-01025-1"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,12,18]]},"references-count":100,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2024,12]]}},"alternative-id":["1025"],"URL":"https:\/\/doi.org\/10.1186\/s40537-024-01025-1","relation":{},"ISSN":["2196-1115"],"issn-type":[{"value":"2196-1115","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,12,18]]},"assertion":[{"value":"4 May 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"13 October 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 December 2024","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"The authors provided consent for 
publication.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"173"}}