{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,8]],"date-time":"2026-04-08T09:01:17Z","timestamp":1775638877757,"version":"3.50.1"},"reference-count":55,"publisher":"Association for Computing Machinery (ACM)","issue":"4","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2024,12]]},"abstract":"<jats:p>With the continued migration of storage to cloud database systems, the impact of slow queries in such systems on services and user experience is increasing. Root-cause diagnosis plays an indispensable role in facilitating slow-query detection and revision. This paper proposes a method capable of both identifying possible root cause types for slow queries and ranking these according to their potential for accelerating slow queries. This enables prioritizing root causes with the highest impact, in turn improving slow-query revision effectiveness. To enable more accurate and detailed diagnoses, we propose the multimodal Ranking for the Root Causes of slow queries (RCRank) framework, which formulates root cause analysis as a multimodal machine learning problem and leverages multimodal information from query statements, execution plans, execution logs, and key performance indicators. To obtain expressive embeddings from its heterogeneous multimodal input, RCRank integrates self-supervised pre-training that enhances cross-modal alignment and task relevance. Next, the framework integrates root-cause-adaptive cross Transformers that enable adaptive fusion of multimodal features with varying characteristics. Finally, the framework offers a unified model that features an impact-aware training objective for identifying and ranking root causes. We report on experiments on real and synthetic datasets, finding that RCRank is capable of consistently outperforming the state-of-the-art methods at root cause identification and ranking according to a range of metrics.<\/jats:p>","DOI":"10.14778\/3717755.3717774","type":"journal-article","created":{"date-parts":[[2025,5,20]],"date-time":"2025-05-20T15:51:49Z","timestamp":1747756309000},"page":"1169-1182","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["RCRank: Multimodal Ranking of Root Causes of Slow Queries in Cloud Database Systems"],"prefix":"10.14778","volume":"18","author":[{"given":"Biao","family":"Ouyang","sequence":"first","affiliation":[{"name":"East China Normal University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yingying","family":"Zhang","sequence":"additional","affiliation":[{"name":"Alibaba Cloud Computing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hanyin","family":"Cheng","sequence":"additional","affiliation":[{"name":"East China Normal University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yang","family":"Shu","sequence":"additional","affiliation":[{"name":"East China Normal University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chenjuan","family":"Guo","sequence":"additional","affiliation":[{"name":"East China Normal University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Bin","family":"Yang","sequence":"additional","affiliation":[{"name":"East China Normal University, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Qingsong","family":"Wen","sequence":"additional","affiliation":[{"name":"Squirrel Ai Learning, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Lunting","family":"Fan","sequence":"additional","affiliation":[{"name":"Alibaba Cloud Computing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Christian S.","family":"Jensen","sequence":"additional","affiliation":[{"name":"Aalborg University, Denmark"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2025,5,20]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"Zdonik","author":"Akdere Mert","year":"2012","unstructured":"Mert Akdere, Ugur \u00c7etintemel, Matteo Riondato, Eli Upfal, and Stanley B. Zdonik. 2012. Learning-based Query Performance Modeling and Prediction. In ICDE. 390\u2013401."},{"key":"e_1_2_1_2_1","doi-asserted-by":"crossref","unstructured":"Dana Van Aken Andrew Pavlo Geoffrey J. Gordon and Bohan Zhang. 2017. Automatic Database Management System Tuning Through Large-scale Machine Learning. In SIGMOD. 1009\u20131024.","DOI":"10.1145\/3035918.3064029"},{"key":"e_1_2_1_3_1","first-page":"2","volume-title":"Proc. ACM Manag. Data 1","author":"Campos David","year":"2023","unstructured":"David Campos, Miao Zhang, Bin Yang, Tung Kieu, Chenjuan Guo, and Christian S. Jensen. 2023. LightTS: Lightweight Time Series Classification with Adaptive Ensemble Distillation. Proc. ACM Manag. Data 1, 2 (2023), 171:1\u2013171:27."},{"key":"e_1_2_1_4_1","volume-title":"Pathformer: Multi-scale Transformers with Adaptive Pathways for Time Series Forecasting. In ICLR.","author":"Chen Peng","year":"2024","unstructured":"Peng Chen, Yingying Zhang, Yunyao Cheng, Yang Shu, Yihang Wang, Qingsong Wen, Bin Yang, and Chenjuan Guo. 2024. Pathformer: Multi-scale Transformers with Adaptive Pathways for Time Series Forecasting. In ICLR."},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.14778\/3636218.3636231"},{"key":"e_1_2_1_6_1","doi-asserted-by":"crossref","unstructured":"Sudipto Das Miroslav Grbic Igor Ilic Isidora Jovandic Andrija Jovanovic Vivek R. Narasayya Miodrag Radulovic Maja Stikic Gaoxiang Xu and Surajit Chaudhuri. 2019. Automatically Indexing Millions of Databases in Microsoft Azure SQL Database. In SIGMOD. 666\u2013679.","DOI":"10.1145\/3299869.3314035"},{"key":"e_1_2_1_7_1","volume-title":"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In NAACL-HLT. 4171\u20134186.","author":"Devlin Jacob","year":"2019","unstructured":"Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In NAACL-HLT. 4171\u20134186."},{"key":"e_1_2_1_8_1","volume-title":"Narasayya","author":"Ding Bailu","year":"2019","unstructured":"Bailu Ding, Sudipto Das, Ryan Marcus, Wentao Wu, Surajit Chaudhuri, and Vivek R. Narasayya. 2019. AI Meets AI: Leveraging Query Executions to Improve Index Recommendations. In SIGMOD. 1241\u20131258."},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCC.2020.3007016"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/3394171.3413678"},{"key":"e_1_2_1_11_1","doi-asserted-by":"crossref","unstructured":"Vimalkumar Jeyakumar Omid Madani Ali Parandeh Ashutosh Kulshreshtha Weifei Zeng and Navindra Yadav. 2019. ExplainIt! - A Declarative Root-cause Analysis Engine for Time Series Data. In SIGMOD. 333\u2013348.","DOI":"10.1145\/3299869.3314048"},{"key":"e_1_2_1_12_1","volume-title":"Learned Cardinalities: Estimating Correlated Joins with Deep Learning. In CIDR.","author":"Kipf Andreas","year":"2019","unstructured":"Andreas Kipf, Thomas Kipf, Bernhard Radke, Viktor Leis, Peter A. Boncz, and Alfons Kemper. 2019. Learned Cardinalities: Estimating Correlated Joins with Deep Learning. In CIDR."},{"key":"e_1_2_1_13_1","volume-title":"Ani Kristo, Guillaume Leclerc, Samuel Madden, Hongzi Mao, and Vikram Nathan.","author":"Kraska Tim","year":"2019","unstructured":"Tim Kraska, Mohammad Alizadeh, Alex Beutel, Ed H. Chi, Ani Kristo, Guillaume Leclerc, Samuel Madden, Hongzi Mao, and Vikram Nathan. 2019. SageDB: A Learned Database System. In CIDR."},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.14778\/2850583.2850594"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.14778\/3476311.3476380"},{"key":"e_1_2_1_16_1","volume-title":"Hoi","author":"Li Junnan","year":"2022","unstructured":"Junnan Li, Dongxu Li, Caiming Xiong, and Steven C. H. Hoi. 2022. BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation. In ICML. 12888\u201312900."},{"key":"e_1_2_1_17_1","unstructured":"Junnan Li Ramprasaath R. Selvaraju Akhilesh Gotmare Shafiq R. Joty Caiming Xiong and Steven Chu-Hong Hoi. 2021. Align before Fuse: Vision and Language Representation Learning with Momentum Distillation. In NeurIPS. 9694\u20139705."},{"key":"e_1_2_1_18_1","unstructured":"Greg Linden. 2006. Akamai online retail performance report: Milliseconds are critical. http:\/\/glinden.blogspot.com\/2006\/11\/marissa-mayer-at-web-20.html"},{"key":"e_1_2_1_19_1","volume-title":"A Systematic Survey of Prompting Methods in Natural Language Processing. ACM Comput. Surv. 55, 9","author":"Liu Pengfei","year":"2023","unstructured":"Pengfei Liu, Weizhe Yuan, Jinlan Fu, Zhengbao Jiang, Hiroaki Hayashi, and Graham Neubig. 2023. Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing. ACM Comput. Surv. 55, 9 (2023), 195:1\u2013195:35."},{"key":"e_1_2_1_20_1","unstructured":"Jiasen Lu Dhruv Batra Devi Parikh and Stefan Lee. 2019. ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks. In NeurIPS. 13\u201323."},{"key":"e_1_2_1_21_1","doi-asserted-by":"crossref","unstructured":"Pengfei Luo Tong Xu Shiwei Wu Chen Zhu Linli Xu and Enhong Chen. 2023. Multi-Grained Multimodal Interaction Network for Entity Linking. In SIGKDD. 1583\u20131594.","DOI":"10.1145\/3580305.3599439"},{"key":"e_1_2_1_22_1","volume-title":"Ahmed Hefny, Gustavo Mezerhane, Andrew Pavlo, and Geoffrey J. Gordon.","author":"Ma Lin","year":"2018","unstructured":"Lin Ma, Dana Van Aken, Ahmed Hefny, Gustavo Mezerhane, Andrew Pavlo, and Geoffrey J. Gordon. 2018. Query-based Workload Forecasting for Self-Driving Database Management Systems. In SIGMOD. 631\u2013645."},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.14778\/3389133.3389136"},{"key":"e_1_2_1_24_1","volume-title":"Bao: Making Learned Query Optimization Practical. In SIGMOD. 1275\u20131288.","author":"Marcus Ryan","year":"2021","unstructured":"Ryan Marcus, Parimarjan Negi, Hongzi Mao, Nesime Tatbul, Mohammad Alizadeh, and Tim Kraska. 2021. Bao: Making Learned Query Optimization Practical. In SIGMOD. 1275\u20131288."},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.14778\/3342263.3342644"},{"key":"e_1_2_1_26_1","first-page":"1","article-title":"Deep Reinforcement Learning for Join Order Enumeration","volume":"3","author":"Marcus Ryan","year":"2018","unstructured":"Ryan Marcus and Olga Papaemmanouil. 2018. Deep Reinforcement Learning for Join Order Enumeration. In SIGMOD. 3:1\u20133:4.","journal-title":"SIGMOD."},{"key":"e_1_2_1_27_1","volume-title":"Jensen","author":"Miao Hao","year":"2024","unstructured":"Hao Miao, Yan Zhao, Chenjuan Guo, Bin Yang, Zheng Kai, Feiteng Huang, Jiandong Xie, and Christian S. Jensen. 2024. A Unified Replay-based Continuous Learning Framework for Spatio-Temporal Prediction on Streaming Data. ICDE (2024)."},{"key":"e_1_2_1_28_1","first-page":"336","article-title":"Large-Scale Pretraining for Visual Dialog: A Simple State-of-the-Art Baseline","volume":"12363","author":"Murahari Vishvak","year":"2020","unstructured":"Vishvak Murahari, Dhruv Batra, Devi Parikh, and Abhishek Das. 2020. Large-Scale Pretraining for Visual Dialog: A Simple State-of-the-Art Baseline. In ECCV, Vol. 12363. 336\u2013352.","journal-title":"ECCV"},{"key":"e_1_2_1_29_1","unstructured":"Long Ouyang Jeffrey Wu Xu Jiang Diogo Almeida Carroll L. Wainwright Pamela Mishkin Chong Zhang Sandhini Agarwal Katarina Slama Alex Ray John Schulman Jacob Hilton Fraser Kelton Luke Miller Maddie Simens Amanda Askell Peter Welinder Paul F. Christiano Jan Leike and Ryan Lowe. 2022. Training language models to follow instructions with human feedback. In NeurIPS."},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.14778\/3611540.3611566"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.14778\/3665844.3665863"},{"key":"e_1_2_1_32_1","doi-asserted-by":"crossref","unstructured":"Rainer Schlosser Jan Kossmann and Martin Boissier. 2019. Efficient Scalable Multi-attribute Index Selection Using Recursive Strategies. In ICDE. 1238\u20131249.","DOI":"10.1109\/ICDE.2019.00113"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.14778\/2140436.2140444"},{"key":"e_1_2_1_34_1","volume-title":"Carl Vondrick, Kevin Murphy, and Cordelia Schmid.","author":"Sun Chen","year":"2019","unstructured":"Chen Sun, Austin Myers, Carl Vondrick, Kevin Murphy, and Cordelia Schmid. 2019. VideoBERT: A Joint Model for Video and Language Representation Learning. In ICCV. 7463\u20137472."},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.14778\/3368289.3368296"},{"key":"e_1_2_1_36_1","doi-asserted-by":"crossref","unstructured":"Xiu Tang Sai Wu Mingli Song Shanshan Ying Feifei Li and Gang Chen. 2022. PreQR: Pre-training Representation for SQL Understanding. In SIGMOD. 204\u2013216.","DOI":"10.1145\/3514221.3517878"},{"key":"e_1_2_1_37_1","volume-title":"Air quality prediction with physics-guided dual neural odes in open systems. ICLR","author":"Tian Jindong","year":"2025","unstructured":"Jindong Tian, Yuxuan Liang, Ronghui Xu, Peng Chen, Chenjuan Guo, Aoying Zhou, Lujia Pan, Zhongwen Rao, and Bin Yang. 2025. Air quality prediction with physics-guided dual neural odes in open systems. ICLR (2025)."},{"key":"e_1_2_1_38_1","unstructured":"Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan N. Gomez Lukasz Kaiser and Illia Polosukhin. 2017. Attention is All you Need. In NeurIPS. 5998\u20136008."},{"key":"e_1_2_1_39_1","volume-title":"Quoc V. Le, and Denny Zhou.","author":"Wei Jason","year":"2022","unstructured":"Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter, Fei Xia, Ed H. Chi, Quoc V. Le, and Denny Zhou. 2022. Chain-of-Thought Prompting Elicits Reasoning in Large Language Models. In NeurIPS."},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00778-024-00872-x"},{"key":"e_1_2_1_41_1","first-page":"1","volume-title":"Proc. ACM Manag. Data 1","author":"Wu Xinle","year":"2023","unstructured":"Xinle Wu, Dalin Zhang, Miao Zhang, Chenjuan Guo, Bin Yang, and Christian S. Jensen. 2023. AutoCTS+: Joint Neural Architecture and Hyperparameter Search for Correlated Time Series Forecasting. Proc. ACM Manag. Data 1, 1 (2023), 97:1\u201397:26."},{"key":"e_1_2_1_42_1","doi-asserted-by":"crossref","unstructured":"Sean Bin Yang Chenjuan Guo Jilin Hu Jian Tang and Bin Yang. 2021. Unsupervised Path Representation Learning with Curriculum Negative Sampling. In IJCAI. 3286\u20133292.","DOI":"10.24963\/ijcai.2021\/452"},{"key":"e_1_2_1_43_1","unstructured":"Shunyu Yao Dian Yu Jeffrey Zhao Izhak Shafran Tom Griffiths Yuan Cao and Karthik Narasimhan. 2023. Tree of Thoughts: Deliberate Problem Solving with Large Language Models. In NeurIPS."},{"key":"e_1_2_1_44_1","doi-asserted-by":"crossref","unstructured":"Dong Young Yoon Ning Niu and Barzan Mozafari. 2016. DBSherlock: A Performance Diagnostic Tool for Transactional Databases. In SIGMOD. 1599\u20131614.","DOI":"10.1145\/2882903.2915218"},{"key":"e_1_2_1_45_1","doi-asserted-by":"crossref","unstructured":"Wenmeng Yu Hua Xu Ziqi Yuan and Jiele Wu. 2021. Learning Modality-Specific Representations with Self-Supervised Multi-Task Learning for Multimodal Sentiment Analysis. In AAAI. 10790\u201310797.","DOI":"10.1609\/aaai.v35i12.17289"},{"key":"e_1_2_1_46_1","doi-asserted-by":"crossref","unstructured":"Yi Zhang Mingyuan Chen Jundong Shen and Chongjun Wang. 2022. Tailor Versatile Multi-Modal Learning for Multi-Label Emotion Recognition. In AAAI. 9100\u20139108.","DOI":"10.1609\/aaai.v36i8.20895"},{"key":"e_1_2_1_47_1","doi-asserted-by":"crossref","unstructured":"Chenyu Zhao Minghua Ma Zhenyu Zhong Shenglin Zhang Zhiyuan Tan Xiao Xiong LuLu Yu Jiayi Feng Yongqian Sun Yuzhi Zhang Dan Pei Qingwei Lin and Dongmei Zhang. 2023. Robust Multimodal Failure Detection for Microservice Systems. In SIGKDD. 5639\u20135649.","DOI":"10.1145\/3580305.3599902"},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.14778\/3636218.3636230"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.14778\/3529337.3529349"},{"key":"e_1_2_1_50_1","first-page":"12736","article-title":"Fused Acoustic and Text Encoding for Multimodal Bilingual Pretraining and Speech Translation","volume":"139","author":"Zheng Renjie","year":"2021","unstructured":"Renjie Zheng, Junkun Chen, Mingbo Ma, and Liang Huang. 2021. Fused Acoustic and Text Encoding for Multimodal Bilingual Pretraining and Speech Translation. In ICML, Vol. 139. 12736\u201312746.","journal-title":"ICML"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.14778\/3476311.3476334"},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.14778\/3675034.3675043"},{"key":"e_1_2_1_53_1","doi-asserted-by":"crossref","unstructured":"Jun-Peng Zhu Peng Cai Boyan Niu Zheming Ni Kai Xu Jiajun Huang Jianwei Wan Shengbo Ma Bing Wang Donghui Zhang et al. 2024. Chat2Query: A Zero-Shot Automatic Exploratory Data Analysis System with Large Language Models. In ICDE. 5429\u20135432.","DOI":"10.1109\/ICDE60146.2024.00420"},{"key":"e_1_2_1_54_1","volume-title":"AutoTQA: Towards Autonomous Tabular Question Answering through Multi-Agent Large Language Models. PVLDB 17(12)","author":"Zhu Jun-Peng","year":"2024","unstructured":"Jun-Peng Zhu, Peng Cai, Kai Xu, Li Li, Yishen Sun, Shuai Zhou, Haihuang Su, Liu Tang, and Qi Liu. 2024. AutoTQA: Towards Autonomous Tabular Question Answering through Multi-Agent Large Language Models. PVLDB 17(12) (2024), 3920\u20133933."},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.14778\/3583140.3583160"}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/3717755.3717774","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,5,20]],"date-time":"2025-05-20T16:17:29Z","timestamp":1747757849000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/3717755.3717774"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,12]]},"references-count":55,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2024,12]]}},"alternative-id":["10.14778\/3717755.3717774"],"URL":"https:\/\/doi.org\/10.14778\/3717755.3717774","relation":{},"ISSN":["2150-8097"],"issn-type":[{"value":"2150-8097","type":"print"}],"subject":[],"published":{"date-parts":[[2024,12]]},"assertion":[{"value":"2025-05-20","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}