{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,20]],"date-time":"2025-06-20T04:08:53Z","timestamp":1750392533386,"version":"3.41.0"},"reference-count":52,"publisher":"Association for Computing Machinery (ACM)","issue":"FSE","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Proc. ACM Softw. Eng."],"published-print":{"date-parts":[[2025,6,19]]},"abstract":"<jats:p>Microservice-based systems (MSS) may fail with various fault types, due to their complex and dynamic nature. While existing AIOps methods excel at detecting abnormal traces and locating the responsible service(s), human efforts from practitioners are still required for further root cause analysis to diagnose specific fault types and analyze failure reasons for detected abnormal traces, particularly when abnormal traces do not stem directly from specific services. In this paper, we propose a novel AIOps framework, TraFaultDia, to automatically classify abnormal traces into fault categories for MSS. We treat the classification process as a series of multi-class classification tasks, where each task represents an attempt to classify abnormal traces into specific fault categories for a MSS. TraFaultDia is trained on several abnormal trace classification tasks with a few labeled instances from a MSS using a meta-learning approach. After training, TraFaultDia can quickly adapt to new, unseen abnormal trace classification tasks with a few labeled instances across MSS. TraFaultDia\u2019s use cases are scalable depending on how fault categories are built from anomalies within MSS. We evaluated TraFaultDia on two representative MSS, TrainTicket and OnlineBoutique, with open datasets. In these datasets, each fault category is tied to the faulty system component(s) (service\/pod) with a root cause. Our TraFaultDia automatically classifies abnormal traces into these fault categories, thus enabling the automatic identification of faulty system components and root causes without manual analysis. Our results show that, within the MSS it is trained on, TraFaultDia achieves an average accuracy of 93.26% and 85.20% across 50 new, unseen abnormal trace classification tasks for TrainTicket and OnlineBoutique respectively, when provided with 10 labeled instances for each fault category per task in each system. In the cross-system context, when TraFaultDia is applied to a MSS different from the one it is trained on, TraFaultDia gets an average accuracy of 92.19% and 84.77% for the same set of 50 new, unseen abnormal trace classification tasks of the respective systems, also with 10 labeled instances provided for each fault category per task in each system.<\/jats:p>","DOI":"10.1145\/3715742","type":"journal-article","created":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T15:16:02Z","timestamp":1750346162000},"page":"576-598","source":"Crossref","is-referenced-by-count":0,"title":["Cross-System Categorization of Abnormal Traces in Microservice-Based Systems via Meta-Learning"],"prefix":"10.1145","volume":"2","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-0175-005X","authenticated-orcid":false,"given":"Yuqing","family":"Wang","sequence":"first","affiliation":[{"name":"University of Helsinki, Helsinki, Finland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2841-5879","authenticated-orcid":false,"given":"Mika V.","family":"M\u00e4ntyl\u00e4","sequence":"additional","affiliation":[{"name":"University of Helsinki, Helsinki, Finland"},{"name":"University of Oulu, Oulu, Finland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4463-2945","authenticated-orcid":false,"given":"Serge","family":"Demeyer","sequence":"additional","affiliation":[{"name":"University of Antwerp, Antwerp, Belgium"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2714-8155","authenticated-orcid":false,"given":"Mutlu","family":"Beyaz\u0131t","sequence":"additional","affiliation":[{"name":"University of Antwerp, Antwerp, Belgium"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7081-5385","authenticated-orcid":false,"given":"Joanna","family":"Kisaakye","sequence":"additional","affiliation":[{"name":"University of Antwerp, Antwerp, Belgium"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0006-7276-5696","authenticated-orcid":false,"given":"Jesse","family":"Nyyss\u00f6l\u00e4","sequence":"additional","affiliation":[{"name":"University of Helsinki, Helsinki, Finland"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2025,6,19]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/3643991.3644890"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2020.3037496"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/COMPSAC51774.2021.00121"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.comcom.2023.03.028"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-20984-0_18"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISSRE5003.2020.00013"},{"key":"e_1_2_1_7_1","doi-asserted-by":"crossref","unstructured":"Jacob Cohen. 2013. Statistical power analysis for the behavioral sciences. routledge.","DOI":"10.4324\/9780203771587"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIT.1967.1053964"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-67425-4_12"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-39821-6_8"},{"key":"e_1_2_1_11_1","volume-title":"International conference on machine learning (ICML\u201917)","author":"Finn Chelsea","year":"2017","unstructured":"Chelsea Finn, Pieter Abbeel, and Sergey Levine. 2017. Model-agnostic meta-learning for fast adaptation of deep networks. In International conference on machine learning (ICML\u201917). JMLR.org, 1126\u20131135. https:\/\/dl.acm.org\/doi\/10.5555\/3305381.3305498"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4612-4380-9_6"},{"key":"e_1_2_1_13_1","unstructured":"FudanSELab. 2024. DeepTraLog. https:\/\/github.com\/FudanSELab\/DeepTraLog"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/3297858.3304013"},{"key":"e_1_2_1_15_1","volume-title":"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. https:\/\/github.com\/google-research\/bert Accessed: 2024-03-14","author":"Research Google","year":"2018","unstructured":"Google Research. 2018. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. https:\/\/github.com\/google-research\/bert Accessed: 2024-03-14"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/3459637.3482209"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10515-024-00428-x"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICWS.2017.13"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP40776.2020.9053762"},{"key":"e_1_2_1_20_1","first-page":"31158","article-title":"Root cause analysis of failures in microservices through causal discovery","volume":"35","author":"Ikram Azam","year":"2022","unstructured":"Azam Ikram, Sarthak Chakraborty, Subrata Mitra, Shiv Saini, Saurabh Bagchi, and Murat Kocaoglu. 2022. Root cause analysis of failures in microservices through causal discovery. Advances in Neural Information Processing Systems, 35 (2022), 31158\u201331170. https:\/\/dl.acm.org\/doi\/10.5555\/3600270.3602529","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_2_1_21_1","unstructured":"IntelligentDDS. 2024. Nezha. https:\/\/github.com\/IntelligentDDS\/Nezha"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.01199"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-26348-4_4"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1186\/s13677-022-00296-4"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/3323933.3324085"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neunet.2023.10.036"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/ASE51524.2021.9678773"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICSE48619.2023.00150"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10664-021-10063-9"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-00928-1_62"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/SANER60148.2024.00046"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/CCGRID.2019.00038"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/CLOUD.2019.00038"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.3390\/s22239492"},{"key":"e_1_2_1_35_1","unstructured":"OpenTelemetry. 2024. OpenTelemetry. https:\/\/opentelemetry.io\/ Accessed: 2024-03-14"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/CloudNet59005.2023.10490038"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.5555\/3294996.3295163"},{"key":"e_1_2_1_38_1","unstructured":"A Vaswani. 2017. Attention is all you need. Advances in Neural Information Processing Systems https:\/\/dl.acm.org\/doi\/10.5555\/3295222.3295349"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.5555\/3157382.3157504"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10462-022-10321-2"},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10664-020-09910-y"},{"key":"e_1_2_1_42_1","unstructured":"Yuqing Wang Mika V M\u00e4ntyl\u00e4 Serge Demeyer Mutlu Beyazit Joanna Kisaakye and Jesse Nyyss\u00f6l\u00e4. 2025. TraFaultDia. https:\/\/github.com\/yuqwang\/TraFaultDia Accessed: 2025-02-09"},{"key":"e_1_2_1_43_1","unstructured":"Yuqing Wang Mika V. M\u00e4ntyl\u00e4 Jesse Nyyss\u00f6l\u00e4 Ke Ping and Liqiang Wang. 2024. Cross-System Software Log-based Anomaly Detection Using Meta-Learning. arxiv:2412.15445. arxiv:2412.15445"},{"key":"e_1_2_1_44_1","unstructured":"Yonghui Wu Mike Schuster Zhifeng Chen Quoc V Le Mohammad Norouzi Wolfgang Macherey Maxim Krikun Yuan Cao Qin Gao and Klaus Macherey. 2016. Google\u2019s neural machine translation system: Bridging the gap between human and machine translation. arXiv preprint arXiv:1609.08144 arxiv:1609.08144"},{"key":"e_1_2_1_45_1","volume-title":"International Conference on Learning Representations. arxiv:1810","author":"Ye Han-Jia","year":"2021","unstructured":"Han-Jia Ye and Wei-Lun Chao. 2021. How to Train Your MAML to Excel in Few-Shot Classification. In International Conference on Learning Representations. arxiv:1810.09502"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1145\/3611643.3616249"},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01287"},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1145\/3597503.3639205"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1145\/3510003.3510180"},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISSRE55969.2022.00032"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1145\/3383313.3412268"},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSE.2018.2887384"}],"container-title":["Proceedings of the ACM on Software Engineering"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3715742","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T15:22:28Z","timestamp":1750346548000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3715742"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,6,19]]},"references-count":52,"journal-issue":{"issue":"FSE","published-print":{"date-parts":[[2025,6,19]]}},"alternative-id":["10.1145\/3715742"],"URL":"https:\/\/doi.org\/10.1145\/3715742","relation":{},"ISSN":["2994-970X"],"issn-type":[{"value":"2994-970X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,6,19]]}}}