{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,26]],"date-time":"2025-12-26T07:12:06Z","timestamp":1766733126761,"version":"3.44.0"},"reference-count":88,"publisher":"Association for Computing Machinery (ACM)","issue":"7","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2025,3]]},"abstract":"<jats:p>Motif Discovery involves identifying recurring patterns and locating their occurrences within a time series without prior knowledge about their shape or location. In practice, Motif Discovery faces several data-related challenges, leading to various definitions of the problem and multiple algorithms addressing these challenges to different extents. However, there has been no systematic evaluation and comparison of these diverse approaches. Consequently, this paper presents a comprehensive literature review covering data-related challenges, motif definitions, and algorithms. We also analyze the strengths and limitations of algorithms carefully chosen to represent the literature diversity. The analysis is structured around key research questions identified from our review. Our experimental findings provide practical guidelines for selecting Motif Discovery algorithms suitable for a given task and suggest directions for future research.<\/jats:p>","DOI":"10.14778\/3734839.3734857","type":"journal-article","created":{"date-parts":[[2025,8,29]],"date-time":"2025-08-29T16:01:06Z","timestamp":1756483266000},"page":"2226-2239","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Time Series Motif Discovery: A Comprehensive Evaluation"],"prefix":"10.14778","volume":"18","author":[{"given":"Valerio","family":"Guerrmi","sequence":"first","affiliation":[{"name":"Univ. Paris Saclay, Univ. Paris Cit\u00e9, CNRS, SSA, ENS Paris Saclay, INSERM, Centre Borelli"}]},{"given":"Thibaut","family":"Germain","sequence":"additional","affiliation":[{"name":"Univ. Paris Saclay, Univ. Paris Cit\u00e9, CNRS, SSA, ENS Paris Saclay, INSERM, Centre Borelli"}]},{"given":"Charles","family":"Truong","sequence":"additional","affiliation":[{"name":"Univ. Paris Saclay, Univ. Paris Cit\u00e9, CNRS, SSA, ENS Paris Saclay, INSERM, Centre Borelli"}]},{"given":"Laurent","family":"Oudre","sequence":"additional","affiliation":[{"name":"Univ. Paris Saclay, Univ. Paris Cit\u00e9, CNRS, SSA, ENS Paris Saclay, INSERM, Centre Borelli"}]},{"given":"Paul","family":"Boniol","sequence":"additional","affiliation":[{"name":"Inria, ENS, DIENS, CNRS, PSL"}]}],"member":"320","published-online":{"date-parts":[[2025,8,29]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1007\/3-540-57301-1_5"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM50108.2020.00099"},{"key":"e_1_2_1_3_1","volume-title":"Finding motif sets in time series. arXiv preprint arXiv:1407.3685","author":"Bagnall Anthony","year":"2014","unstructured":"Anthony Bagnall, Jon Hills, and Jason Lines. 2014. Finding motif sets in time series. arXiv preprint arXiv:1407.3685 (2014)."},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10618-016-0483-9"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-49055-7_38"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1017\/dce.2023.25"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/369133.369172"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1137\/1.9781611972801.73"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1007\/11871637_44"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.14778\/3565838.3565840"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.1999.754915"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/956750.956808"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.5201\/ipol.2024.494"},{"key":"e_1_2_1_14_1","volume-title":"Workshop Computational Intelligence VDI\/VDE-Gesellschaft Mess-und Automatisierungstechnik (GMA).","author":"Deppe Sahar","year":"2015","unstructured":"Sahar Deppe and Volker Lohweg. 2015. Shift-Invariant Feature Extraction for Time-Series Motif Discovery. In 25. Workshop Computational Intelligence VDI\/VDE-Gesellschaft Mess-und Automatisierungstechnik (GMA)."},{"key":"e_1_2_1_15_1","volume-title":"Biological sequence analysis: probabilistic models of proteins and nucleic acids","author":"Durbin Richard","unstructured":"Richard Durbin, Sean R Eddy, Anders Krogh, and Graeme Mitchison. 1998. Biological sequence analysis: probabilistic models of proteins and nucleic acids. Cambridge university press."},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1007\/11893318_12"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10618-018-0570-1"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10115-018-1279-6"},{"key":"e_1_2_1_19_1","unstructured":"Yixin Gao S. Vedula Carol E. Reiley N. Ahmidi B. Varadarajan Henry C. Lin L. Tao L. Zappella B. B\u00e9jar D. Yuh C. C. Chen R. Vidal S. Khudanpur and Gregory Hager. 2014. JHU-ISI Gesture and Skill Assessment Working Set (JIGSAWS ): A Surgical Activity Dataset for Human Motion Modeling."},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/347090.347109"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDEW61823.2024.00028"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2024.3417303"},{"key":"e_1_2_1_23_1","volume-title":"Leon Glass, Jeffrey M Hausdorff, Plamen Ch Ivanov, Roger G Mark, Joseph E Mietus, George B Moody, Chung-Kang Peng, and H Eugene Stanley.","author":"Goldberger Ary L","year":"2000","unstructured":"Ary L Goldberger, Luis AN Amaral, Leon Glass, Jeffrey M Hausdorff, Plamen Ch Ivanov, Roger G Mark, Joseph E Mietus, George B Moody, Chung-Kang Peng, and H Eugene Stanley. 2000. PhysioBank, PhysioToolkit, and PhysioNet: components of a new research resource for complex physiologic signals. circulation 101, 23 (2000), e215\u2013e220."},{"key":"e_1_2_1_24_1","volume-title":"Latent time-series motifs. ACM Transactions on Knowledge Discovery from Data (TKDD) 11, 1","author":"Grabocka Josif","year":"2016","unstructured":"Josif Grabocka, Nicolas Schilling, and Lars Schmidt-Thieme. 2016. Latent time-series motifs. ACM Transactions on Knowledge Discovery from Data (TKDD) 11, 1 (2016), 1\u201320."},{"key":"e_1_2_1_25_1","volume-title":"Stormo","author":"Hertz Gerald Z","year":"1999","unstructured":"Gerald Z Hertz and Gary D. Stormo. 1999. Identifying DNA and protein patterns with statistically significant alignments of multiple sequences. Bioinformatics (Oxford, England) 15, 7 (1999), 563\u2013577."},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/bti745"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","unstructured":"Mohammed Kadous. 1995. Australian Sign Language Signs. UCI Machine Learning Repository. 10.24432\/C5XG6C","DOI":"10.24432\/C5XG6C"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.5555\/645496.657890"},{"key":"e_1_2_1_29_1","volume-title":"Dimensionality reduction for fast similarity search in large time series databases. Knowledge and information Systems 3","author":"Keogh Eamonn","year":"2001","unstructured":"Eamonn Keogh, Kaushik Chakrabarti, Michael Pazzani, and Sharad Mehrotra. 2001. Dimensionality reduction for fast similarity search in large time series databases. Knowledge and information Systems 3 (2001), 263\u2013286."},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/375663.375680"},{"key":"e_1_2_1_31_1","volume-title":"Clustering of time-series subsequences is meaningless: implications for previous and future research. Knowledge and information systems 8","author":"Keogh Eamonn","year":"2005","unstructured":"Eamonn Keogh and Jessica Lin. 2005. Clustering of time-series subsequences is meaningless: implications for previous and future research. Knowledge and information systems 8 (2005), 154\u2013177."},{"key":"e_1_2_1_32_1","volume-title":"The Hungarian method for the assignment problem. Naval research logistics quarterly 2, 1\u20132","author":"Kuhn Harold W","year":"1955","unstructured":"Harold W Kuhn. 1955. The Hungarian method for the assignment problem. Naval research logistics quarterly 2, 1\u20132 (1955), 83\u201397."},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1137\/1.9781611972818.86"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2015.7113316"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/1814245.1814255"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1137\/1.9781611972825.77"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/3183713.3183744"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1007\/11430919_41"},{"key":"e_1_2_1_39_1","volume-title":"Proc. of the 2nd Workshop on Temporal Data Mining. 53\u201368","author":"Lonardi JLEKS","year":"2002","unstructured":"JLEKS Lonardi and Pranav Patel. 2002. Finding motifs in time series. In Proc. of the 2nd Workshop on Temporal Data Mining. 53\u201368."},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICBK.2019.00031"},{"key":"e_1_2_1_41_1","first-page":"e215","article-title":"Pulse Transit Time PPG Dataset","volume":"10","author":"Mehrgardt Philip","year":"2022","unstructured":"Philip Mehrgardt, Matloob Khushi, Simon Poon, and Anusha Withana. 2022. Pulse Transit Time PPG Dataset. PhysioNet 10 (2022), e215\u2013e220.","journal-title":"PhysioNet"},{"key":"e_1_2_1_42_1","volume-title":"Proceedings of the national conference on artificial intelligence","volume":"22","author":"Minnen David","year":"2007","unstructured":"David Minnen, Charles L Isbell, Irfan Essa, and Thad Starner. 2007. Discovering multivariate motifs using subsequence density estimation and greedy mixture learning. In Proceedings of the national conference on artificial intelligence, Vol. 22. Menlo Park, CA; Cambridge, MA; London; AAAI Press; MIT Press; 1999, 615."},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISWC.2006.286337"},{"key":"e_1_2_1_44_1","first-page":"2814","article-title":"Improving Activity Discovery with Automatic Neighborhood Estimation","volume":"7","author":"Minnen David","year":"2007","unstructured":"David Minnen, Thad Starner, Irfan A Essa, and Charles Lee Isbell Jr. 2007. Improving Activity Discovery with Automatic Neighborhood Estimation.. In IJCAI, Vol. 7. 2814\u20132819.","journal-title":"IJCAI"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-05458-2_3"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10115-014-0793-4"},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1145\/1835804.1835941"},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2009.15"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1137\/1.9781611972795.41"},{"key":"e_1_2_1_50_1","unstructured":"Abdullah Mueen Sheng Zhing Yan Zhu Michael Yeh Kaveh Kamgar Krishnamurthy Viswanathan Chetan Gupta and Eamonn Keogh. 2022. The Fastest Similarity Search Algorithm for Time Series Subsequences under Euclidean Distance. http:\/\/www.cs.unm.edu\/~mueen\/FastestSimilaritySearch.html."},{"key":"e_1_2_1_51_1","volume-title":"An electrical load measurements dataset of United Kingdom households from a two-year longitudinal study. Scientific data 4, 1","author":"Murray David","year":"2017","unstructured":"David Murray, Lina Stankovic, and Vladimir Stankovic. 2017. An electrical load measurements dataset of United Kingdom households from a two-year longitudinal study. Scientific data 4, 1 (2017), 1\u201312."},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.3233\/ICA-210650"},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1109\/ECTICON.2011.5947877"},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1109\/ECTICon.2012.6254126"},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2002.1183920"},{"key":"e_1_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1145\/2814710.2814719"},{"key":"e_1_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.1145\/2949741.2949758"},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.14778\/3529337.3529354"},{"key":"e_1_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.1145\/3575813.3595198"},{"key":"e_1_2_1_60_1","first-page":"269","article-title":"Combinatorial approaches to finding subtle signals in DNA sequences","volume":"8","author":"Pevzner Pavel A","year":"2000","unstructured":"Pavel A Pevzner, Sing-Hoi Sze, et al. 2000. Combinatorial approaches to finding subtle signals in DNA sequences.. In ISMB, Vol. 8. 269\u2013278.","journal-title":"ISMB"},{"key":"e_1_2_1_61_1","volume-title":"MDL-based time series clustering. Knowledge and information systems 33","author":"Rakthanmanon Thanawin","year":"2012","unstructured":"Thanawin Rakthanmanon, Eamonn J Keogh, Stefano Lonardi, and Scott Evans. 2012. MDL-based time series clustering. Knowledge and information systems 33 (2012), 371\u2013399."},{"key":"e_1_2_1_62_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-25957-2_8"},{"key":"e_1_2_1_63_1","volume-title":"Motif Discovery Using Similarity-Constraints Deep Neural Networks. In International Conference on Database Systems for Advanced Applications. Springer, 587\u2013603","author":"Rong Chuitian","year":"2020","unstructured":"Chuitian Rong, Ziliang Chen, Chunbin Lin, and Jianming Wang. 2020. Motif Discovery Using Similarity-Constraints Deep Neural Networks. In International Conference on Database Systems for Advanced Applications. Springer, 587\u2013603."},{"key":"e_1_2_1_64_1","volume-title":"Efficiently Mining Frequent Representative Motifs in Large Collections of Time Series. In 2023 IEEE International Conference on Big Data (BigData). IEEE, 66\u201375","author":"Rotman Stijn J","year":"2023","unstructured":"Stijn J Rotman, Boris Cule, and Len Feremans. 2023. Efficiently Mining Frequent Representative Motifs in Large Collections of Time Series. In 2023 IEEE International Conference on Big Data (BigData). IEEE, 66\u201375."},{"key":"e_1_2_1_65_1","doi-asserted-by":"publisher","DOI":"10.1038\/s41598-022-10656-4"},{"key":"e_1_2_1_66_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.01107"},{"key":"e_1_2_1_67_1","doi-asserted-by":"publisher","DOI":"10.14778\/3574245.3574257"},{"key":"e_1_2_1_68_1","doi-asserted-by":"publisher","DOI":"10.14778\/3538598.3538602"},{"key":"e_1_2_1_69_1","volume-title":"Interactive discovery of variable-length time series patterns. ACM Transactions on Knowledge Discovery from Data (TKDD) 12, 1","author":"Senin Pavel","year":"2018","unstructured":"Pavel Senin, Jessica Lin, Xing Wang, Tim Oates, Sunil Gandhi, Arnold P Boedihardjo, Crystal Chen, and Susan Frankenstein. 2018. Grammarviz 3.0: Interactive discovery of variable-length time series patterns. ACM Transactions on Knowledge Discovery from Data (TKDD) 12, 1 (2018), 1\u201328."},{"key":"e_1_2_1_70_1","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/5.4.293"},{"key":"e_1_2_1_71_1","doi-asserted-by":"publisher","DOI":"10.1109\/DEXA.1999.795160"},{"key":"e_1_2_1_72_1","doi-asserted-by":"publisher","DOI":"10.1145\/1644873.1644875"},{"key":"e_1_2_1_73_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10994-005-5829-2"},{"key":"e_1_2_1_74_1","doi-asserted-by":"publisher","DOI":"10.1007\/3-540-45065-3_22"},{"key":"e_1_2_1_75_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2008.03.022"},{"key":"e_1_2_1_76_1","volume-title":"Precision and Recall for Time Series. arXiv preprint arXiv:1803.03639","author":"Tatbul N","year":"2018","unstructured":"N Tatbul. 2018. Precision and Recall for Time Series. arXiv preprint arXiv:1803.03639 (2018)."},{"key":"e_1_2_1_77_1","volume-title":"Workshop comPutational intElligEncE. 277","author":"Torkamani Sahar","year":"2014","unstructured":"Sahar Torkamani and Volker Lohweg. 2014. Identification of multi-scale motifs. In ProcEEDings 24. Workshop comPutational intElligEncE. 277."},{"key":"e_1_2_1_78_1","volume-title":"Workshop Computational Intelligence, ser. Schriftenreihe des Instituts f\u00fcr Angewandte Informatik-Automatisierungstechnik am Karlsruher Institut f\u00fcr Technologie","volume":"54","author":"Torkamani Sahar","year":"2015","unstructured":"Sahar Torkamani, Volker Lohweg, F Hoffmann, and E H\u00fcllermeier. 2015. Shift-invariant feature extraction for time-series motif discovery. In 25. Workshop Computational Intelligence, ser. Schriftenreihe des Instituts f\u00fcr Angewandte Informatik-Automatisierungstechnik am Karlsruher Institut f\u00fcr Technologie, Vol. 54. 23\u201345."},{"key":"e_1_2_1_79_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-11680-8_13"},{"key":"e_1_2_1_80_1","volume-title":"LoCoMotif: Discovering time-warped motifs in time series. Data Mining and Knowledge Discovery","author":"Wesenbeeck Daan Van","year":"2024","unstructured":"Daan Van Wesenbeeck, Aras Yurtman, Wannes Meert, and Hendrik Blockeel. 2024. LoCoMotif: Discovering time-warped motifs in time series. Data Mining and Knowledge Discovery (2024), 1\u201330."},{"key":"e_1_2_1_81_1","doi-asserted-by":"publisher","DOI":"10.1145\/2505515.2505620"},{"key":"e_1_2_1_82_1","doi-asserted-by":"publisher","DOI":"10.1145\/1281192.1281282"},{"key":"e_1_2_1_83_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2017.66"},{"key":"e_1_2_1_84_1","volume-title":"2016 IEEE 16th international conference on data mining (ICDM). Ieee, 1317\u20131322","author":"Michael Yeh Chin-Chia","year":"2016","unstructured":"Chin-Chia Michael Yeh, Yan Zhu, Liudmila Ulanova, Nurjahan Begum, Yifei Ding, Hoang Anh Dau, Diego Furtado Silva, Abdullah Mueen, and Eamonn Keogh. 2016. Matrix profile I: all pairs similarity joins for time series: a unifying view that includes motifs, discords and shapelets. In 2016 IEEE 16th international conference on data mining (ICDM). Ieee, 1317\u20131322."},{"key":"e_1_2_1_85_1","volume-title":"Recent Advances in Information and Communication Technology: Proceedings of the 10th International Conference on Computing and Information Technology (IC2IT2014)","author":"Yin Myat Su","year":"2014","unstructured":"Myat Su Yin, Songsri Tangsripairoj, and Benjarath Pupacdi. 2014. Variable length motif-based time series classification. In Recent Advances in Information and Communication Technology: Proceedings of the 10th International Conference on Computing and Information Technology (IC2IT2014). Springer, 73\u201382."},{"key":"e_1_2_1_86_1","volume-title":"2013 IEEE 13th international conference on data mining. IEEE, 1265\u20131270","author":"Yingchareonthawornchai Sorrachai","year":"2013","unstructured":"Sorrachai Yingchareonthawornchai, Haemwaan Sivaraks, Thanawin Rakthanmanon, and Chotirat Ann Ratanamahatana. 2013. Efficient proper length time series motif discovery. In 2013 IEEE 13th international conference on data mining. IEEE, 1265\u20131270."},{"key":"e_1_2_1_87_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2018.00099"},{"key":"e_1_2_1_88_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2016.0085"}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/3734839.3734857","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,8,29]],"date-time":"2025-08-29T16:02:16Z","timestamp":1756483336000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/3734839.3734857"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,3]]},"references-count":88,"journal-issue":{"issue":"7","published-print":{"date-parts":[[2025,3]]}},"alternative-id":["10.14778\/3734839.3734857"],"URL":"https:\/\/doi.org\/10.14778\/3734839.3734857","relation":{},"ISSN":["2150-8097"],"issn-type":[{"value":"2150-8097","type":"print"}],"subject":[],"published":{"date-parts":[[2025,3]]},"assertion":[{"value":"2025-08-29","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}