{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,8]],"date-time":"2026-05-08T22:36:22Z","timestamp":1778279782631,"version":"3.51.4"},"reference-count":57,"publisher":"Association for Computing Machinery (ACM)","issue":"3","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2023,11]]},"abstract":"<jats:p>In recent years, a wide spectrum of database tuning systems have emerged to automatically optimize database performance. However, these systems require a significant number of workload runs to deliver a satisfactory level of database performance, which is time-consuming and resource-intensive. While many attempts have been made to address this issue by using advanced search optimizers, empirical studies have shown that no single optimizer can dominate the rest across tuning tasks with different characteristics. Choosing an inferior optimizer may significantly increase the tuning cost. Unfortunately, current practices typically adopt a single optimizer or follow simple heuristics without considering the task characteristics. Consequently, they fail to choose the most suitable optimizer for a specific task. Furthermore, constructing a compact search space can significantly improve the tuning efficiency. However, current practices neglect the setting of the value range for each knob and rely on a large number of workload runs to select important knobs, resulting in a considerable amount of unnecessary exploration in ineffective regions.<\/jats:p>\n          <jats:p>To pursue efficient database tuning, in this paper, we argue that it is imperative to have an approach that can judiciously determine a precise space and search optimizer for an arbitrary tuning task. To this end, we propose OpAdviser, which exploits the information learned from historical tuning tasks to guide the search space construction and search optimizer selection. Our design can greatly accelerate the tuning process and further reduce the required workload runs. Given a tuning task, OpAdviser learns the geometries of search space, including important knobs and their effective regions, from relevant previous tasks. It then constructs the target search space from the geometries according to the on-the-fly task similarity, which allows for adaptive adjustment of the target space. OpAdviser also employs a pairwise ranking model to capture the relationship from task characteristics to optimizer rankings. This ranking model is invoked during tuning and predicts the best optimizer to be used for the current iteration. We conduct extensive evaluations across a diverse set of workloads, where OpAdviser achieves 9.2% higher throughput and significantly reduces the number of workload runs with an average speedup of ~3.4x compared to state-of-the-art tuning systems.<\/jats:p>","DOI":"10.14778\/3632093.3632114","type":"journal-article","created":{"date-parts":[[2024,1,20]],"date-time":"2024-01-20T11:26:31Z","timestamp":1705749991000},"page":"539-552","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":24,"title":["An Efficient Transfer Learning Based Configuration Adviser for Database Tuning"],"prefix":"10.14778","volume":"17","author":[{"given":"Xinyi","family":"Zhang","sequence":"first","affiliation":[{"name":"Peking University"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hong","family":"Wu","sequence":"additional","affiliation":[{"name":"Alibaba Group"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yang","family":"Li","sequence":"additional","affiliation":[{"name":"Peking University"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhengju","family":"Tang","sequence":"additional","affiliation":[{"name":"Peking University"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jian","family":"Tan","sequence":"additional","affiliation":[{"name":"Alibaba Group"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Feifei","family":"Li","sequence":"additional","affiliation":[{"name":"Alibaba Group"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Bin","family":"Cui","sequence":"additional","affiliation":[{"name":"Peking University"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2024,1,20]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"2015. TPC-H benchmark. http:\/\/www.tpc.org\/tpch\/."},{"key":"e_1_2_1_2_1","unstructured":"2022. InnoDB Startup Options and System Variables. https:\/\/dev.mysql.com\/doc\/refman\/5.7\/en\/innodb-parameters.html."},{"key":"e_1_2_1_3_1","unstructured":"2022. Server System Variables. https:\/\/dev.mysql.com\/doc\/refman\/5.7\/en\/server-system-variables.html."},{"key":"e_1_2_1_4_1","volume-title":"Database Tuning Advisor for Microsoft SQL Server","author":"Agrawal Sanjay","year":"2005","unstructured":"Sanjay Agrawal, Surajit Chaudhuri, Lubor Koll\u00e1r, Arunprasad P. Marathe, Vivek R. Narasayya, and Manoj Syamala. 2004. Database Tuning Advisor for Microsoft SQL Server 2005. In VLDB. Morgan Kaufmann, 1110--1121."},{"key":"e_1_2_1_5_1","volume-title":"Automatic Database Management System Tuning Through Large-scale Machine Learning. In SIGMOD Conference. ACM, 1009--1024","author":"Aken Dana Van","year":"2017","unstructured":"Dana Van Aken, Andrew Pavlo, Geoffrey J. Gordon, and Bohan Zhang. 2017. Automatic Database Management System Tuning Through Large-scale Machine Learning. In SIGMOD Conference. ACM, 1009--1024."},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.14778\/3450980.3450992"},{"key":"e_1_2_1_7_1","volume-title":"Transfer Learning for Bayesian Optimization: A Survey. CoRR abs\/2302.05927","author":"Bai Tianyi","year":"2023","unstructured":"Tianyi Bai, Yang Li, Yu Shen, Xinyi Zhang, Wentao Zhang, and Bin Cui. 2023. Transfer Learning for Bayesian Optimization: A Survey. CoRR abs\/2302.05927 (2023)."},{"key":"e_1_2_1_8_1","first-page":"23","article-title":"From ranknet to lambdarank to lambdamart: An overview","volume":"11","author":"Burges Christopher JC","year":"2010","unstructured":"Christopher JC Burges. 2010. From ranknet to lambdarank to lambdamart: An overview. Learning 11, 23--581 (2010), 81.","journal-title":"Learning"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/3488932.3517401"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.14778\/3457390.3457404"},{"key":"e_1_2_1_11_1","doi-asserted-by":"crossref","unstructured":"Surajit Chaudhuri and Gerhard Weikum. 2006. Foundations of Automated Database Tuning. In VLDB. ACM 1265.","DOI":"10.1109\/ICDE.2006.72"},{"key":"e_1_2_1_12_1","doi-asserted-by":"crossref","first-page":"225","DOI":"10.1007\/s41019-022-00190-8","article-title":"Representation Learning in Multi-view Clustering","volume":"7","author":"Chen Mansheng","year":"2022","unstructured":"Mansheng Chen, Jia-Qi Lin, Xiang-Long Li, Bao-Yu Liu, Chang-Dong Wang, Dong Huang, and Jian-Huang Lai. 2022. Representation Learning in Multi-view Clustering: A Literature Review. Data Sci. Eng. 7, 3 (2022), 225--241.","journal-title":"A Literature Review. Data Sci. Eng."},{"key":"e_1_2_1_13_1","volume-title":"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In NAACL-HLT (1)","author":"Devlin Jacob","year":"2019","unstructured":"Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In NAACL-HLT (1). Association for Computational Linguistics, 4171--4186."},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.14778\/2732240.2732246"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.14778\/1687627.1687767"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11390-021-1488-4"},{"key":"e_1_2_1_17_1","doi-asserted-by":"crossref","unstructured":"Ayat Fekry Lucian Carata Thomas F. J.-M. Pasquier Andrew Rice and Andy Hopper. 2020. To Tune or Not to Tune?: In Search of Optimal Configurations for Data Analytics. In KDD. ACM 2494--2504.","DOI":"10.1145\/3394486.3403299"},{"key":"e_1_2_1_18_1","doi-asserted-by":"crossref","unstructured":"Ralf Herbrich Thore Graepel and Klaus Obermayer. 1999. Support vector learning for ordinal regression. (1999).","DOI":"10.1049\/cp:19991091"},{"key":"e_1_2_1_19_1","doi-asserted-by":"crossref","first-page":"402","DOI":"10.1007\/s41019-022-00193-5","article-title":"Dimensionality Reduction in Surrogate Modeling: A Review of Combined Methods","volume":"7","author":"Jeffery Hou Chun Kit","year":"2022","unstructured":"Chun Kit Jeffery Hou and Kamran Behdinan. 2022. Dimensionality Reduction in Surrogate Modeling: A Review of Combined Methods. Data Sci. Eng. 7, 4 (2022), 402--427.","journal-title":"Data Sci. Eng."},{"key":"e_1_2_1_20_1","volume-title":"Survey on performance optimization for database systems. Sci. China Inf. Sci. 66, 2","author":"Huang Shiyue","year":"2023","unstructured":"Shiyue Huang, Yanzhao Qin, Xinyi Zhang, Yaofeng Tu, Zhongliang Li, and Bin Cui. 2023. Survey on performance optimization for database systems. Sci. China Inf. Sci. 66, 2 (2023)."},{"key":"e_1_2_1_21_1","first-page":"1","volume-title":"Proc. ACM Manag. Data 1","author":"Huang Shiyue","year":"2023","unstructured":"Shiyue Huang, Ziwei Wang, Xinyi Zhang, Yaofeng Tu, Zhongliang Li, and Bin Cui. 2023. DBPA: A Benchmark for Transactional Database Performance Anomalies. Proc. ACM Manag. Data 1, 1 (2023), 72:1--72:26."},{"key":"e_1_2_1_22_1","volume-title":"LION (Lecture Notes in Computer Science)","author":"Hutter Frank","unstructured":"Frank Hutter, Holger H. Hoos, and Kevin Leyton-Brown. 2011. Sequential Model-Based Optimization for General Algorithm Configuration. In LION (Lecture Notes in Computer Science), Vol. 6683. Springer, 507--523."},{"key":"e_1_2_1_23_1","volume-title":"HotStorage","author":"Kanellis Konstantinos","unstructured":"Konstantinos Kanellis, Ramnatthan Alagappan, and Shivaram Venkataraman. 2020. Too Many Knobs to Tune? Towards Faster Database Tuning by Pre-selecting Important Knobs. In HotStorage. USENIX Association."},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.14778\/3551793.3551844"},{"key":"e_1_2_1_25_1","unstructured":"Aaron Klein. 2017. RoBO : A Flexible and Robust Bayesian Optimization Framework in Python."},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10619-020-07288-w"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/3318464.3380591"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.14778\/2850583.2850594"},{"key":"e_1_2_1_29_1","unstructured":"Stefan Lessmann Robert Stahlbock and Sven F Crone. 2005. Optimizing hyper-parameters of support vector machines by genetic algorithms.. In IC-AI. 74--82."},{"key":"e_1_2_1_30_1","volume-title":"Lewis and Jason Catlett","author":"David","year":"1994","unstructured":"David D. Lewis and Jason Catlett. 1994. Heterogeneous Uncertainty Sampling for Supervised Learning. In ICML. Morgan Kaufmann, 148--156."},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.14778\/3352063.3352129"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.14778\/3611540.3611548"},{"key":"e_1_2_1_33_1","volume-title":"ContTune: Continuous Tuning by Conservative Bayesian Optimization for Distributed Stream Data Processing Systems. CoRR abs\/2309.12239","author":"Lian Jinqing","year":"2023","unstructured":"Jinqing Lian, Xinyi Zhang, Yingxia Shao, Zenglin Pu, Qingfeng Xiang, Yawen Li, and Bin Cui. 2023. ContTune: Continuous Tuning by Conservative Bayesian Optimization for Distributed Stream Data Processing Systems. CoRR abs\/2309.12239 (2023)."},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11390-021-0471-4"},{"key":"e_1_2_1_35_1","unstructured":"Timothy P. Lillicrap Jonathan J. Hunt Alexander Pritzel Nicolas Heess Tom Erez Yuval Tassa David Silver and Daan Wierstra. 2016. Continuous control with deep reinforcement learning. In ICLR (Poster)."},{"key":"e_1_2_1_36_1","volume-title":"Lundberg and Su-In Lee","author":"Scott","year":"2017","unstructured":"Scott M. Lundberg and Su-In Lee. 2017. A Unified Approach to Interpreting Model Predictions. In NIPS. 4765--4774."},{"key":"e_1_2_1_37_1","volume-title":"Query-based Workload Forecasting for Self-Driving Database Management Systems. In SIGMOD Conference. ACM, 631--645","author":"Ma Lin","unstructured":"Lin Ma, Dana Van Aken, Ahmed Hefny, Gustavo Mezerhane, Andrew Pavlo, and Geoffrey J. Gordon. 2018. Query-based Workload Forecasting for Self-Driving Database Management Systems. In SIGMOD Conference. ACM, 631--645."},{"key":"e_1_2_1_38_1","volume-title":"Latin Hypercube Sampling as a Tool in Uncertainty Analysis of Computer Models","author":"McKay Michael D.","unstructured":"Michael D. McKay. 1992. Latin Hypercube Sampling as a Tool in Uncertainty Analysis of Computer Models. In WSC. ACM Press, 557--564."},{"key":"e_1_2_1_39_1","volume-title":"ICML (Proceedings of Machine Learning Research)","volume":"97","author":"Nayebi Amin","year":"2019","unstructured":"Amin Nayebi, Alexander Munteanu, and Matthias Poloczek. 2019. A Framework for Bayesian Optimization in Embedded Subspaces. In ICML (Proceedings of Machine Learning Research), Vol. 97. PMLR, 4752--4761."},{"key":"e_1_2_1_40_1","unstructured":"Valerio Perrone and Huibin Shen. 2019. Learning search spaces for Bayesian optimization: Another view of hyperparameter transfer learning. In NeurIPS. 12751--12761."},{"key":"e_1_2_1_41_1","volume-title":"Ensemble learning: A survey. WIREs Data Mining Knowl. Discov. 8, 4","author":"Sagi Omer","year":"2018","unstructured":"Omer Sagi and Lior Rokach. 2018. Ensemble learning: A survey. WIREs Data Mining Knowl. Discov. 8, 4 (2018)."},{"key":"e_1_2_1_42_1","volume-title":"Shasha and Philippe Bonnet","author":"Dennis","year":"2002","unstructured":"Dennis E. Shasha and Philippe Bonnet. 2002. Database Tuning: Principles, Experiments, and Troubleshooting Techniques. In VLDB. Morgan Kaufmann."},{"key":"e_1_2_1_43_1","volume-title":"Shasha and Steve Rozen","author":"Dennis","year":"1992","unstructured":"Dennis E. Shasha and Steve Rozen. 1992. Database Tuning. In VLDB. Morgan Kaufmann, 313."},{"key":"e_1_2_1_44_1","unstructured":"Adam J. Storm Christian Garcia-Arellano Sam Lightstone Yixin Diao and Maheswaran Surendra. 2006. Adaptive Self-tuning Memory in DB2. In VLDB. ACM 1081--1092."},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1145\/3514221.3517843"},{"key":"e_1_2_1_46_1","volume-title":"A self-tuning client-side metadata prefetching scheme for wide area network file systems. Sci. China Inf. Sci. 65, 3","author":"Wei Bing","year":"2022","unstructured":"Bing Wei, Limin Xiao, Yao Song, Guangjun Qin, Jinbin Zhu, Baicheng Yan, Chaobo Wang, and Zhisheng Huo. 2022. A self-tuning client-side metadata prefetching scheme for wide area network file systems. Sci. China Inf. Sci. 65, 3 (2022)."},{"key":"e_1_2_1_47_1","doi-asserted-by":"crossref","unstructured":"Gerhard Weikum Axel Monkeberg Christof Hasse and Peter Zabback. 2002. Self-tuning Database Technology and Information Services: from Wishful Thinking to Viable Engineering. In VLDB. Morgan Kaufmann 20--31.","DOI":"10.1016\/B978-155860869-6\/50011-1"},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/4235.585893"},{"key":"e_1_2_1_49_1","volume-title":"Attribute augmented and weighted naive Bayes. Sci. China Inf. Sci. 65, 12","author":"Zhang Huan","year":"2022","unstructured":"Huan Zhang, Liangxiao Jiang, and Chaoqun Li. 2022. Attribute augmented and weighted naive Bayes. Sci. China Inf. Sci. 65, 12 (2022)."},{"key":"e_1_2_1_50_1","volume-title":"An End-to-End Automatic Cloud Database Tuning System Using Deep Reinforcement Learning. In SIGMOD Conference. ACM, 415--432","author":"Zhang Ji","year":"2019","unstructured":"Ji Zhang, Yu Liu, Ke Zhou, Guoliang Li, Zhili Xiao, Bin Cheng, Jiashu Xing, Yangtao Wang, Tianheng Cheng, Li Liu, Minwei Ran, and Zekang Li. 2019. An End-to-End Automatic Cloud Database Tuning System Using Deep Reinforcement Learning. In SIGMOD Conference. ACM, 415--432."},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.14778\/3538598.3538604"},{"key":"e_1_2_1_52_1","first-page":"2","volume-title":"Proc. ACM Manag. Data 1","author":"Zhang Xinyi","year":"2023","unstructured":"Xinyi Zhang, Zhuo Chang, Hong Wu, Yang Li, Jia Chen, Jian Tan, Feifei Li, and Bin Cui. 2023. A Unified and Efficient Coordinating Framework for Autonomous DBMS Tuning. Proc. ACM Manag. Data 1, 2 (2023), 186:1--186:26."},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1145\/3448016.3457291"},{"key":"e_1_2_1_54_1","volume-title":"Towards Dynamic and Safe Configuration Tuning for Cloud Databases. CoRR abs\/2203.14473","author":"Zhang Xinyi","year":"2022","unstructured":"Xinyi Zhang, Hong Wu, Yang Li, Jian Tan, Feifei Li, and Bin Cui. 2022. Towards Dynamic and Safe Configuration Tuning for Cloud Databases. CoRR abs\/2203.14473 (2022)."},{"key":"e_1_2_1_55_1","volume-title":"Towards Dynamic and Safe Configuration Tuning for Cloud Databases. In SIGMOD Conference. ACM, 631--645","author":"Zhang Xinyi","year":"2022","unstructured":"Xinyi Zhang, Hong Wu, Yang Li, Jian Tan, Feifei Li, and Bin Cui. 2022. Towards Dynamic and Safe Configuration Tuning for Cloud Databases. In SIGMOD Conference. ACM, 631--645."},{"key":"e_1_2_1_56_1","doi-asserted-by":"crossref","unstructured":"Yuqing Zhu Jianxun Liu Mengying Guo Yungang Bao Wenlong Ma Zhuoyue Liu Kunpeng Song and Yingchun Yang. 2017. BestConfig: tapping the performance potential of systems via automatic configuration tuning. In SoCC. ACM 338--350.","DOI":"10.1145\/3127479.3128605"},{"key":"e_1_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.1109\/JPROC.2020.3004555"}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/3632093.3632114","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,1,20]],"date-time":"2024-01-20T11:27:12Z","timestamp":1705750032000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/3632093.3632114"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,11]]},"references-count":57,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2023,11]]}},"alternative-id":["10.14778\/3632093.3632114"],"URL":"https:\/\/doi.org\/10.14778\/3632093.3632114","relation":{},"ISSN":["2150-8097"],"issn-type":[{"value":"2150-8097","type":"print"}],"subject":[],"published":{"date-parts":[[2023,11]]},"assertion":[{"value":"2024-01-20","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}