{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,30]],"date-time":"2026-04-30T23:07:14Z","timestamp":1777590434720,"version":"3.51.4"},"reference-count":43,"publisher":"Springer Science and Business Media LLC","issue":"4","license":[{"start":{"date-parts":[[2020,6,3]],"date-time":"2020-06-03T00:00:00Z","timestamp":1591142400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2020,6,3]],"date-time":"2020-06-03T00:00:00Z","timestamp":1591142400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Data Min Knowl Disc"],"published-print":{"date-parts":[[2020,7]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>A new symbolic representation of time series, called ABBA, is introduced. It is based on an adaptive polygonal chain approximation of the time series into a sequence of tuples, followed by a mean-based clustering to obtain the symbolic representation. We show that the reconstruction error of this representation can be modelled as a random walk with pinned start and end points, a so-called Brownian bridge. This insight allows us to make ABBA essentially parameter-free, except for the approximation tolerance which must be chosen. Extensive comparisons with the SAX and 1d-SAX representations are included in the form of performance profiles, showing that ABBA is often able to better preserve the essential shape information of time series compared to other approaches, in particular when time warping measures are used. Advantages and applications of ABBA are discussed, including its in-built differencing property and use for anomaly detection, and Python implementations provided.<\/jats:p>","DOI":"10.1007\/s10618-020-00689-6","type":"journal-article","created":{"date-parts":[[2020,6,3]],"date-time":"2020-06-03T15:37:06Z","timestamp":1591198626000},"page":"1175-1200","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":33,"title":["ABBA: adaptive Brownian bridge-based symbolic aggregation of time series"],"prefix":"10.1007","volume":"34","author":[{"given":"Steven","family":"Elsworth","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1494-4478","authenticated-orcid":false,"given":"Stefan","family":"G\u00fcttel","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2020,6,3]]},"reference":[{"issue":"2","key":"689_CR1","doi-asserted-by":"publisher","first-page":"378","DOI":"10.1007\/s10618-018-0596-4","volume":"33","author":"A Abanda","year":"2019","unstructured":"Abanda A, Mori U, Lozano JA (2019) A review on distance based time series classification. Data Min Knowl Discov 33(2):378\u2013412. https:\/\/doi.org\/10.1007\/s10618-018-0596-4","journal-title":"Data Min Knowl Discov"},{"key":"689_CR2","doi-asserted-by":"publisher","first-page":"16","DOI":"10.1016\/j.is.2015.04.007","volume":"53","author":"S Aghabozorgi","year":"2015","unstructured":"Aghabozorgi S, Shirkhorshidi AS, Wah TY (2015) Time-series clustering\u2014a decade review. Inf Syst 53:16\u201338. https:\/\/doi.org\/10.1016\/j.is.2015.04.007","journal-title":"Inf Syst"},{"key":"689_CR3","doi-asserted-by":"crossref","unstructured":"Arthur D, Vassilvitskii S (2006) How slow is the $$k$$-means method? In: Symposium on computational geometry, ACM, New York, vol 6, pp 1\u201310","DOI":"10.1145\/1137856.1137880"},{"issue":"4","key":"689_CR4","doi-asserted-by":"publisher","first-page":"83","DOI":"10.1145\/3161602","volume":"51","author":"G Atluri","year":"2018","unstructured":"Atluri G, Karpatne A, Kumar V (2018) Spatio-temporal data mining: a survey of problems and methods. ACM Comput Surv (CSUR) 51(4):83","journal-title":"ACM Comput Surv (CSUR)"},{"key":"689_CR5","unstructured":"Barnaghi PM, Bakar AA, Othman ZA (2012) Enhanced symbolic aggregate approximation method for financial time series data representation. In: 6th International conference on new trends in information science, service science and data mining (ISSDM2012), IEEE, pp 790\u2013795"},{"issue":"2","key":"689_CR6","doi-asserted-by":"publisher","first-page":"400","DOI":"10.1007\/s10618-014-0349-y","volume":"29","author":"MG Baydogan","year":"2015","unstructured":"Baydogan MG, Runger G (2015) Learning a symbolic representation for multivariate time series classification. Data Min Knowl Discov 29(2):400\u2013422","journal-title":"Data Min Knowl Discov"},{"issue":"3","key":"689_CR7","first-page":"1","volume":"7","author":"Y Benyahmed","year":"2015","unstructured":"Benyahmed Y, Bakar AA, Hamdan AR, Abdullah SMS (2015) A time-weighted average-based PAA representation for time series symbolization. Int J Adv Soft Comput Appl 7(3):1\u201315","journal-title":"Int J Adv Soft Comput Appl"},{"key":"689_CR8","first-page":"359","volume":"10","author":"DJ Berndt","year":"1994","unstructured":"Berndt DJ, Clifford J (1994) Using dynamic time warping to find patterns in time series. KDD Workshop 10:359\u2013370","journal-title":"KDD Workshop"},{"key":"689_CR9","doi-asserted-by":"publisher","unstructured":"Bettaiah V, Ranganath HS (2014) An analysis of time series representation methods: data mining applications perspective. In: Proceedings of the 2014 ACM Southeast Regional Conference, ACM, pp 16:1\u201316:6. https:\/\/doi.org\/10.1145\/2638404.2638475","DOI":"10.1145\/2638404.2638475"},{"key":"689_CR10","doi-asserted-by":"crossref","unstructured":"Bondu A, Boull\u00e9 M, Cornu\u00e9jols A (2016) Symbolic representation of time series: a hierarchical coclustering formalization. In: International workshop on advanced analysis and learning on temporal data, Springer, pp 3\u201316","DOI":"10.1007\/978-3-319-44412-3_1"},{"issue":"1","key":"689_CR11","doi-asserted-by":"publisher","first-page":"131","DOI":"10.1007\/s10994-006-8364-x","volume":"65","author":"M Boull\u00e9","year":"2006","unstructured":"Boull\u00e9 M (2006) MODL: a Bayes optimal discretization method for continuous attributes. Mach Learn 65(1):131\u2013165. https:\/\/doi.org\/10.1007\/s10994-006-8364-x","journal-title":"Mach Learn"},{"key":"689_CR12","doi-asserted-by":"crossref","unstructured":"Chiu B, Keogh E, Lonardi S (2003) Probabilistic discovery of time series motifs. In: Proceedings of the ninth ACM SIGKDD international conference on knowledge discovery and data mining, ACM, pp 493\u2013498","DOI":"10.1145\/956750.956808"},{"key":"689_CR13","unstructured":"Dau HA, Keogh E, Kamgar K, Yeh CCM, Zhu Y, Gharghabi S, Ratanamahatana CA, Yanping, Hu B, Begum N, Bagnall A, Mueen A, Batista G (2018) The UCR time series classification archive. https:\/\/www.cs.ucr.edu\/~eamonn\/time_series_data_2018\/. Accessed 14 Mar 2019"},{"issue":"2, Ser. A","key":"689_CR14","doi-asserted-by":"publisher","first-page":"201","DOI":"10.1007\/s101070100263","volume":"91","author":"ED Dolan","year":"2002","unstructured":"Dolan ED, Mor\u00e9 JJ (2002) Benchmarking optimization software with performance profiles. Math Program 91(2, Ser. A):201\u2013213. https:\/\/doi.org\/10.1007\/s101070100263","journal-title":"Math Program"},{"key":"689_CR15","unstructured":"Elsworth S, G\u00fcttel S (2020) Time series forecasting using LSTM networks: a symbolic approach. Manchester Institute for Mathematical Sciences, The University of Manchester, UK. arXiv:2003.05672"},{"key":"689_CR16","doi-asserted-by":"crossref","unstructured":"Esmael B, Arnaout A, Fruhwirth RK, Thonhauser G (2012) Multivariate time series classification by combining trend-based and value-based approximations. In: International conference on computational science and its applications, Springer, pp 392\u2013403","DOI":"10.1007\/978-3-642-31128-4_29"},{"issue":"1","key":"689_CR17","doi-asserted-by":"publisher","first-page":"164","DOI":"10.1016\/j.engappai.2010.09.007","volume":"24","author":"T Fu","year":"2011","unstructured":"Fu T (2011) A review on time series data mining. Eng Appl Artif Intell 24(1):164\u2013181. https:\/\/doi.org\/10.1016\/j.engappai.2010.09.007","journal-title":"Eng Appl Artif Intell"},{"key":"689_CR18","doi-asserted-by":"publisher","first-page":"3793","DOI":"10.1109\/JSEN.2013.2271562","volume":"13","author":"F Ganz","year":"2013","unstructured":"Ganz F, Barnaghi P, Carrez F (2013) Information abstraction for heterogeneous real world internet data. IEEE Sensors J 13:3793\u20133805. https:\/\/doi.org\/10.1109\/JSEN.2013.2271562","journal-title":"IEEE Sensors J"},{"issue":"6","key":"689_CR19","doi-asserted-by":"publisher","first-page":"1683","DOI":"10.1109\/TKDE.2014.2377746","volume":"27","author":"J Grabocka","year":"2014","unstructured":"Grabocka J, Wistuba M, Schmidt-Thieme L (2014) Scalable classification of repetitive time series through frequencies of local polynomials. IEEE Trans Knowl Data Eng 27(6):1683\u20131695","journal-title":"IEEE Trans Knowl Data Eng"},{"key":"689_CR20","unstructured":"Gr\u00f8nlund A, Larsen KG, Mathiasen A, Nielsen JS (2017) Fast exact k-means, k-medians and Bregman divergence clustering in 1D. arXiv:1701.07204"},{"issue":"9","key":"689_CR21","doi-asserted-by":"publisher","first-page":"2250","DOI":"10.1109\/TKDE.2013.184","volume":"26","author":"M Gupta","year":"2013","unstructured":"Gupta M, Gao J, Aggarwal CC, Han J (2013) Outlier detection for temporal data: a survey. IEEE Trans Knowl Data Eng 26(9):2250\u20132267","journal-title":"IEEE Trans Knowl Data Eng"},{"key":"689_CR22","doi-asserted-by":"crossref","unstructured":"Keogh E, Pazzani MJ (2001) Derivative dynamic time warping. In: Proceedings of the 2001 SIAM international conference on data mining, pp 1\u201311","DOI":"10.1137\/1.9781611972719.1"},{"issue":"4","key":"689_CR23","doi-asserted-by":"publisher","first-page":"349","DOI":"10.1023\/A:1024988512476","volume":"7","author":"E Keogh","year":"2003","unstructured":"Keogh E, Kasetty S (2003) On the need for time series data mining benchmarks: a survey and empirical demonstration. Data Min Knowl Discov 7(4):349\u2013371. https:\/\/doi.org\/10.1023\/A:1024988512476","journal-title":"Data Min Knowl Discov"},{"issue":"3","key":"689_CR24","doi-asserted-by":"publisher","first-page":"358","DOI":"10.1007\/s10115-004-0154-9","volume":"7","author":"E Keogh","year":"2005","unstructured":"Keogh E, Ratanamahatana CA (2005) Exact indexing of dynamic time warping. Knowl Inf Syst 7(3):358\u2013386. https:\/\/doi.org\/10.1007\/s10115-004-0154-9","journal-title":"Knowl Inf Syst"},{"key":"689_CR25","doi-asserted-by":"publisher","unstructured":"Keogh E, Chu S, Hart D, Pazzani M (2001) An online algorithm for segmenting time series. In: Proceedings of the 2001 IEEE international conference on data mining, pp 289\u2013296. https:\/\/doi.org\/10.1109\/ICDM.2001.989531","DOI":"10.1109\/ICDM.2001.989531"},{"key":"689_CR26","doi-asserted-by":"crossref","unstructured":"Keogh E, Lonardi S, Chiu B (2002) Finding surprising patterns in a time series database in linear time and space. In: Proceedings of the eighth ACM SIGKDD international conference on knowledge discovery and data mining, ACM, pp 550\u2013556","DOI":"10.1145\/775047.775128"},{"issue":"1","key":"689_CR27","doi-asserted-by":"publisher","first-page":"97","DOI":"10.1016\/S0304-4076(99)00031-7","volume":"95","author":"JY Kim","year":"2000","unstructured":"Kim JY (2000) Detection of change in persistence of a linear time series. J Econom 95(1):97\u2013116. https:\/\/doi.org\/10.1016\/S0304-4076(99)00031-7","journal-title":"J Econom"},{"key":"689_CR28","doi-asserted-by":"crossref","unstructured":"Li G, Zhang L, Yang L (2012) TSX: a novel symbolic representation for financial time series. In: PRICAI 2012: trends in artificial intelligence, Springer, pp 262\u2013273","DOI":"10.1007\/978-3-642-32695-0_25"},{"key":"689_CR29","doi-asserted-by":"crossref","unstructured":"Lin J, Keogh E, Lonardi S, Lankford JP, Nystrom DM (2004a) Visually mining and monitoring massive time series. In: Proceedings of the tenth ACM SIGKDD international conference on knowledge discovery and data mining, ACM, pp 460\u2013469","DOI":"10.1145\/1014052.1014104"},{"key":"689_CR30","doi-asserted-by":"crossref","unstructured":"Lin J, Keogh E, Lonardi S, Lankford JP, Nystrom DM (2004b) Viztree: a tool for visually mining and monitoring massive time series databases. In: Proceedings of the 30th international conference on very large data bases, vol 30 (VLDB \u201904), VLDB Endowment, pp 1269\u20131272","DOI":"10.1016\/B978-012088469-8\/50124-8"},{"issue":"2","key":"689_CR31","doi-asserted-by":"publisher","first-page":"61","DOI":"10.1057\/palgrave.ivs.9500089","volume":"4","author":"J Lin","year":"2005","unstructured":"Lin J, Keogh E, Lonardi S (2005) Visualizing and discovering non-trivial patterns in large time series databases. Inf Vis 4(2):61\u201382","journal-title":"Inf Vis"},{"issue":"2","key":"689_CR32","doi-asserted-by":"publisher","first-page":"107","DOI":"10.1007\/s10618-007-0064-z","volume":"15","author":"J Lin","year":"2007","unstructured":"Lin J, Keogh E, Wei L, Lonardi S (2007) Experiencing SAX: a novel symbolic representation of time series. Data Min Knowl Discov 15(2):107\u2013144. https:\/\/doi.org\/10.1007\/s10618-007-0064-z","journal-title":"Data Min Knowl Discov"},{"key":"689_CR33","doi-asserted-by":"publisher","unstructured":"Lkhagva B, Suzuki Y, Kawagoe K (2006) New time series data representation ESAX for financial applications. In: 22nd international conference on data engineering workshops (ICDEW\u201906), pp x115\u2013x115. https:\/\/doi.org\/10.1109\/ICDEW.2006.99","DOI":"10.1109\/ICDEW.2006.99"},{"key":"689_CR34","doi-asserted-by":"publisher","unstructured":"Luo G, Yi K, Cheng SW, Li Z, Fan W, He C, Mu Y (2015) Piecewise linear approximation of streaming time series data with max-error guarantees. In: 2015 IEEE 31st international conference on data engineering, pp 173\u2013184. https:\/\/doi.org\/10.1109\/ICDE.2015.7113282","DOI":"10.1109\/ICDE.2015.7113282"},{"key":"689_CR35","doi-asserted-by":"crossref","unstructured":"Malinowski S, Guyet T, Quiniou R, Tavenard R (2013) 1d-SAX: a novel symbolic representation for time series. In: Advances in intelligent data analysis XII, Springer, pp 273\u2013284","DOI":"10.1007\/978-3-642-41398-8_24"},{"issue":"2","key":"689_CR36","doi-asserted-by":"publisher","first-page":"262","DOI":"10.1145\/321941.321946","volume":"23","author":"EM McCreight","year":"1976","unstructured":"McCreight EM (1976) A space-economical suffix tree construction algorithm. J ACM (JACM) 23(2):262\u2013272","journal-title":"J ACM (JACM)"},{"key":"689_CR37","doi-asserted-by":"crossref","unstructured":"M\u00f6rchen F, Ultsch A (2006) Finding persisting states for knowledge discovery in time series. In: From data and information analysis to knowledge engineering, Springer, pp 278\u2013285","DOI":"10.1007\/3-540-31314-1_33"},{"key":"689_CR38","first-page":"2825","volume":"12","author":"F Pedregosa","year":"2011","unstructured":"Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay E (2011) Scikit-learn: machine learning in Python. J Mach Learn Res 12:2825\u20132830","journal-title":"J Mach Learn Res"},{"issue":"12","key":"689_CR39","doi-asserted-by":"publisher","first-page":"1816","DOI":"10.14778\/2824032.2824078","volume":"8","author":"T Pelkonen","year":"2015","unstructured":"Pelkonen T, Franklin S, Teller J, Cavallaro P, Huang Q, Meza J, Veeraraghavan K (2015) Gorilla: a fast, scalable, in-memory time series database. Proc VLDB Endow 8(12):1816\u20131827","journal-title":"Proc VLDB Endow"},{"key":"689_CR40","doi-asserted-by":"crossref","unstructured":"Pham ND, Le QL, Dang TK (2010) HOT aSAX: a novel adaptive symbolic representation for time series discords discovery. In: Proceedings of the second international conference on intelligent information and database systems: part I, Springer, pp 113\u2013121","DOI":"10.1007\/978-3-642-12145-6_12"},{"key":"689_CR41","doi-asserted-by":"crossref","unstructured":"Shieh J, Keogh E (2008) iSAX: indexing and mining terabyte sized time series. In: Proceedings of the 14th ACM sigkdd international conference on knowledge discovery and data mining, ACM, pp 623\u2013631","DOI":"10.1145\/1401890.1401966"},{"issue":"2","key":"689_CR42","doi-asserted-by":"publisher","first-page":"29","DOI":"10.32614\/RJ-2011-015","volume":"3","author":"H Wang","year":"2011","unstructured":"Wang H, Song M (2011) Ckmeans.1d.dp: optimal $$k$$-means clustering in one dimension by dynamic programming. R J 3(2):29\u201333","journal-title":"R J"},{"key":"689_CR43","doi-asserted-by":"publisher","unstructured":"Zhang K, Li Y, Chai Y, Huang L (2018) Trend-based symbolic aggregate approximation for time series representation. In: 2018 Chinese control and decision conference, IEEE, pp 2234\u20132240. https:\/\/doi.org\/10.1109\/CCDC.2018.8407498","DOI":"10.1109\/CCDC.2018.8407498"}],"container-title":["Data Mining and Knowledge Discovery"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10618-020-00689-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10618-020-00689-6\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10618-020-00689-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,6,3]],"date-time":"2021-06-03T01:30:59Z","timestamp":1622683859000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10618-020-00689-6"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,6,3]]},"references-count":43,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2020,7]]}},"alternative-id":["689"],"URL":"https:\/\/doi.org\/10.1007\/s10618-020-00689-6","relation":{},"ISSN":["1384-5810","1573-756X"],"issn-type":[{"value":"1384-5810","type":"print"},{"value":"1573-756X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,6,3]]},"assertion":[{"value":"29 May 2019","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"14 May 2020","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"3 June 2020","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}