{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,21]],"date-time":"2025-02-21T19:43:45Z","timestamp":1740167025485,"version":"3.37.3"},"reference-count":98,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2020,2,7]],"date-time":"2020-02-07T00:00:00Z","timestamp":1581033600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2020,2,7]],"date-time":"2020-02-07T00:00:00Z","timestamp":1581033600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["1447634"],"award-info":[{"award-number":["1447634"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"name":"MassMutual Life Insurance"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["EPJ Data Sci."],"published-print":{"date-parts":[[2020,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>We introduce a qualitative, shape-based, timescale-independent time-domain transform used to extract local dynamics from sociotechnical time series\u2014termed the Discrete Shocklet Transform (DST)\u2014and an associated similarity search routine, the Shocklet Transform And Ranking (STAR) algorithm, that indicates time windows during which panels of time series display qualitatively-similar anomalous behavior. After distinguishing our algorithms from other methods used in anomaly detection and time series similarity search, such as the matrix profile, seasonal-hybrid ESD, and discrete wavelet transform-based procedures, we demonstrate the DST\u2019s ability to identify mechanism-driven dynamics at a wide range of timescales and its relative insensitivity to functional parameterization. As an application, we analyze a sociotechnical data source (usage frequencies for a subset of words on Twitter) and highlight our algorithms\u2019 utility by using them to extract both a typology of mechanistic local dynamics and a data-driven narrative of socially-important events as perceived by English-language Twitter.<\/jats:p>","DOI":"10.1140\/epjds\/s13688-020-0220-x","type":"journal-article","created":{"date-parts":[[2020,2,7]],"date-time":"2020-02-07T11:04:06Z","timestamp":1581073446000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":4,"title":["The shocklet transform: a decomposition method for the identification of local, mechanism-driven dynamics in sociotechnical time series"],"prefix":"10.1140","volume":"9","author":[{"given":"David Rushing","family":"Dewhurst","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Thayer","family":"Alshaabi","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Dilan","family":"Kiley","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Michael V.","family":"Arnold","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Joshua R.","family":"Minot","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Christopher M.","family":"Danforth","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Peter Sheridan","family":"Dodds","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2020,2,7]]},"reference":[{"issue":"2","key":"220_CR1","doi-asserted-by":"publisher","DOI":"10.1145\/1883612.1883613","volume":"43","author":"P Chaovalit","year":"2011","unstructured":"Chaovalit P, Gangopadhyay A, Karabatis G, Chen Z (2011) Discrete wavelet transform-based time series analysis and mining. ACM Comput Surv (CSUR) 43(2):6","journal-title":"ACM Comput Surv (CSUR)"},{"key":"220_CR2","doi-asserted-by":"publisher","first-page":"565","DOI":"10.1109\/ICDM.2017.66","volume-title":"2017 IEEE international conference on data mining (ICDM)","author":"C-CM Yeh","year":"2017","unstructured":"Yeh C-CM, Kavantzas N, Keogh E (2017) Matrix profile vi: meaningful multidimensional motif discovery. In: 2017 IEEE international conference on data mining (ICDM). IEEE Press, New York, pp\u00a0565\u2013574"},{"key":"220_CR3","unstructured":"Zhu Y, Imamura M, Nikovski D, Keogh E (2018) Introducing time series chains: a new primitive for time series data mining. Knowl Inf Syst: 1\u201327"},{"issue":"3\u20134","key":"220_CR4","doi-asserted-by":"publisher","first-page":"388","DOI":"10.1016\/S0378-4371(02)00552-6","volume":"309","author":"ZR Struzik","year":"2002","unstructured":"Struzik ZR, Siebes AP (2002) Wavelet transform based multifractal formalism in outlier detection and localisation for financial time series. Phys A, Stat Mech Appl 309(3\u20134):388\u2013402","journal-title":"Phys A, Stat Mech Appl"},{"key":"220_CR5","doi-asserted-by":"publisher","first-page":"212","DOI":"10.1109\/ICDE.2002.994711","volume-title":"Proceedings 18th international conference on data engineering","author":"I Popivanov","year":"2002","unstructured":"Popivanov I, Miller RJ (2002) Similarity search over time-series data using wavelets. In: Proceedings 18th international conference on data engineering. IEEE Press, New York, pp\u00a0212\u2013221"},{"issue":"12","key":"220_CR6","doi-asserted-by":"publisher","first-page":"2391","DOI":"10.1175\/1520-0477(1995)076<2391:CSDUWT>2.0.CO;2","volume":"76","author":"K-M Lau","year":"1995","unstructured":"Lau K-M, Weng H (1995) Climate signal detection using wavelet transform: how to make a time series sing. Bull Am Meteorol Soc 76(12):2391\u20132402","journal-title":"Bull Am Meteorol Soc"},{"issue":"5","key":"220_CR7","doi-asserted-by":"publisher","first-page":"12-1-12-16","DOI":"10.1029\/2001WR000509","volume":"38","author":"B. Whitcher","year":"2002","unstructured":"Whitcher B, Byers SD, Guttorp P, Percival DB (2002) Testing for homogeneity of variance in time series: long memory, wavelets, and the Nile river. Water Resour Res 38(5)","journal-title":"Water Resources Research"},{"issue":"3","key":"220_CR8","doi-asserted-by":"publisher","first-page":"634","DOI":"10.1016\/j.camwa.2010.05.010","volume":"60","author":"R Ben\u00edtez","year":"2010","unstructured":"Ben\u00edtez R, Bol\u00f3s V, Ram\u00edrez M (2010) A wavelet-based tool for studying non-periodicity. Comput Math Appl 60(3):634\u2013641","journal-title":"Comput Math Appl"},{"key":"220_CR9","first-page":"205","volume-title":"Vision interface","author":"S Mann","year":"1991","unstructured":"Mann S, Haykin S (1991) The chirplet transform: a generalization of Gabor\u2019s logon transform. In: Vision interface, vol\u00a091, pp\u00a0205\u2013212"},{"key":"220_CR10","doi-asserted-by":"publisher","first-page":"77","DOI":"10.1109\/NRC.2002.999697","volume-title":"Proceedings of the 2002 IEEE radar conference (IEEE cat. no. 02CH37322)","author":"G Wang","year":"2002","unstructured":"Wang G, Xia X-G, Root BT, Chen VC (2002) Moving target detection in over-the-horizon radar using adaptive chirplet transform. In: Proceedings of the 2002 IEEE radar conference (IEEE cat. no. 02CH37322). IEEE Press, New York, pp\u00a077\u201384"},{"issue":"7","key":"220_CR11","doi-asserted-by":"publisher","first-page":"675","DOI":"10.1016\/j.soildyn.2006.11.007","volume":"27","author":"P Spanos","year":"2007","unstructured":"Spanos P, Giaralis A, Politis N (2007) Time\u2013frequency representation of earthquake accelerograms and inelastic structural response records using the adaptive chirplet decomposition and empirical mode decomposition. Soil Dyn Earthq Eng 27(7):675\u2013689","journal-title":"Soil Dyn Earthq Eng"},{"key":"220_CR12","doi-asserted-by":"crossref","unstructured":"Taebi A, Mansy H (2016) Effect of noise on time-frequency analysis of vibrocardiographic signals. J Bioeng & Biomed Sci 6(4)","DOI":"10.4172\/2155-9538.1000202"},{"issue":"3\/4","key":"220_CR13","doi-asserted-by":"publisher","first-page":"523","DOI":"10.2307\/2333401","volume":"42","author":"E Page","year":"1955","unstructured":"Page E (1955) A test for a change in a parameter occurring at an unknown point. Biometrika 42(3\/4):523\u2013527","journal-title":"Biometrika"},{"issue":"2","key":"220_CR14","doi-asserted-by":"publisher","first-page":"617","DOI":"10.1109\/18.119727","volume":"38","author":"S Mallat","year":"1992","unstructured":"Mallat S, Hwang WL (1992) Singularity detection and processing with wavelets. IEEE Trans Inf Theory 38(2):617\u2013643","journal-title":"IEEE Trans Inf Theory"},{"issue":"12","key":"220_CR15","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0026752","volume":"6","author":"PS Dodds","year":"2011","unstructured":"Dodds PS, Harris KD, Kloumann IM, Bliss CA, Danforth CM (2011) Temporal patterns of happiness and information in a global social network: hedonometrics and Twitter. PLoS ONE 6(12):26752","journal-title":"PLoS ONE"},{"key":"220_CR16","unstructured":"Li Q, Shah S, Thomas M, Anderson K, Liu X, Nourbakhsh A, Fang R (2017) How much data do you need? Twitter decahose data analysis"},{"issue":"1","key":"220_CR17","doi-asserted-by":"publisher","DOI":"10.1140\/epjds\/s13688-017-0121-9","volume":"6","author":"AJ Reagan","year":"2017","unstructured":"Reagan AJ, Danforth CM, Tivnan B, Williams JR, Dodds PS (2017) Sentiment analysis methods for understanding large-scale texts: a case for using continuum-scored words and word shift graphs. EPJ Data Sci 6(1):28","journal-title":"EPJ Data Sci"},{"issue":"1","key":"220_CR18","doi-asserted-by":"publisher","DOI":"10.1038\/s41598-017-12961-9","volume":"7","author":"AG Reece","year":"2017","unstructured":"Reece AG, Reagan AJ, Lix KL, Dodds PS, Danforth CM, Langer EJ (2017) Forecasting the onset and course of mental illness with Twitter data. Sci Rep 7(1):13006","journal-title":"Sci Rep"},{"key":"220_CR19","doi-asserted-by":"publisher","DOI":"10.1038\/srep02625","volume":"3","author":"MR Frank","year":"2013","unstructured":"Frank MR, Mitchell L, Dodds PS, Danforth CM (2013) Happiness and the patterns of life: a study of geolocated tweets. Sci Rep 3:2625","journal-title":"Sci Rep"},{"issue":"5","key":"220_CR20","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0064417","volume":"8","author":"L Mitchell","year":"2013","unstructured":"Mitchell L, Frank MR, Harris KD, Dodds PS, Danforth CM (2013) The geography of happiness: connecting Twitter sentiment and expression, demographics, and objective characteristics of place. PLoS ONE 8(5):64417","journal-title":"PLoS ONE"},{"key":"220_CR21","doi-asserted-by":"publisher","first-page":"1396","DOI":"10.1109\/ICDMW.2015.39","volume-title":"2015 IEEE international conference on data mining workshop (ICDMW)","author":"R Lemahieu","year":"2015","unstructured":"Lemahieu R, Van Canneyt S, De Boom C, Dhoedt B (2015) Optimizing the popularity of Twitter messages through user categories. In: 2015 IEEE international conference on data mining workshop (ICDMW). IEEE Press, New York, pp\u00a01396\u20131401"},{"issue":"45","key":"220_CR22","doi-asserted-by":"publisher","first-page":"17599","DOI":"10.1073\/pnas.0704916104","volume":"104","author":"F Wu","year":"2007","unstructured":"Wu F, Huberman BA (2007) Novelty and collective attention. Proc Natl Acad Sci 104(45):17599\u201317601","journal-title":"Proc Natl Acad Sci"},{"issue":"1","key":"220_CR23","doi-asserted-by":"publisher","DOI":"10.1038\/s41562-018-0474-5","volume":"3","author":"C Candia","year":"2019","unstructured":"Candia C, Jara-Figueroa C, Rodriguez-Sickert C, Barab\u00e1si A-L, Hidalgo CA (2019) The universal decay of collective memory and attention. Nat Hum Behav 3(1):82","journal-title":"Nat Hum Behav"},{"issue":"41","key":"220_CR24","doi-asserted-by":"publisher","first-page":"15649","DOI":"10.1073\/pnas.0803685105","volume":"105","author":"R Crane","year":"2008","unstructured":"Crane R, Sornette D (2008) Robust dynamic classes revealed by measuring the response function of a social system. Proc Natl Acad Sci 105(41):15649\u201315653","journal-title":"Proc Natl Acad Sci"},{"issue":"1","key":"220_CR25","doi-asserted-by":"publisher","DOI":"10.1038\/s41467-019-09311-w","volume":"10","author":"P Lorenz-Spreen","year":"2019","unstructured":"Lorenz-Spreen P, M\u00f8nsted BM, H\u00f6vel P, Lehmann S (2019) Accelerating dynamics of collective attention. Nat Commun 10(1):1759","journal-title":"Nat Commun"},{"key":"220_CR26","unstructured":"De Domenico M, Altmann EG (2019) Unraveling the origin of social bursts in collective attention. arXiv preprint. arXiv:1903.06588"},{"key":"220_CR27","doi-asserted-by":"crossref","unstructured":"Ierley G, Kostinski A (2019) A universal rank-order transform to extract signals from noisy data. arXiv preprint. arXiv:1906.08729","DOI":"10.1103\/PhysRevX.9.031039"},{"key":"220_CR28","unstructured":"Nakamoto S, et al. (2008) Bitcoin: a peer-to-peer electronic cash system"},{"key":"220_CR29","doi-asserted-by":"publisher","first-page":"1443","DOI":"10.1109\/IEEM.2014.7058877","volume-title":"2014 IEEE international conference on industrial engineering and engineering management","author":"A Al Shehhi","year":"2014","unstructured":"Al Shehhi A, Oudah M, Aung Z (2014) Investigating factors behind choosing a cryptocurrency. In: 2014 IEEE international conference on industrial engineering and engineering management. IEEE Press, New York, pp\u00a01443\u20131447"},{"issue":"2","key":"220_CR30","doi-asserted-by":"publisher","first-page":"107","DOI":"10.1007\/s10618-007-0064-z","volume":"15","author":"J Lin","year":"2007","unstructured":"Lin J, Keogh E, Wei L, Lonardi S (2007) Experiencing sax: a novel symbolic representation of time series. Data Min Knowl Discov 15(2):107\u2013144","journal-title":"Data Min Knowl Discov"},{"issue":"1","key":"220_CR31","doi-asserted-by":"publisher","first-page":"65","DOI":"10.1016\/j.ic.2006.08.004","volume":"205","author":"K Yang","year":"2007","unstructured":"Yang K, Shahabi C (2007) An efficient k nearest neighbor search for multivariate time series. Inf Comput 205(1):65\u201398","journal-title":"Inf Comput"},{"key":"220_CR32","doi-asserted-by":"publisher","first-page":"260","DOI":"10.1109\/ICDM.2014.153","volume-title":"2014 IEEE international conference on data mining","author":"DC Kale","year":"2014","unstructured":"Kale DC, Gong D, Che Z, Liu Y, Medioni G, Wetzel R, Ross P (2014) An examination of multivariate time series hashing with applications to health care. In: 2014 IEEE international conference on data mining. IEEE Press, New York, pp\u00a0260\u2013269"},{"key":"220_CR33","unstructured":"Driemel A, Silvestri F (2017) Locality-sensitive hashing of curves. arXiv preprint. arXiv:1703.04040"},{"key":"220_CR34","first-page":"122","volume-title":"Pacific-Asia conference on knowledge discovery and data mining","author":"EJ Keogh","year":"2000","unstructured":"Keogh EJ, Pazzani MJ (2000) A simple dimensionality reduction technique for fast similarity search in large time series databases. In: Pacific-Asia conference on knowledge discovery and data mining. Springer, Berlin, pp\u00a0122\u2013133"},{"key":"220_CR35","first-page":"488","volume-title":"Proceedings of the ninth international conference on information and knowledge management","author":"Y-L Wu","year":"2000","unstructured":"Wu Y-L, Agrawal D, El Abbadi A (2000) A comparison of dft and dwt based similarity search in time-series databases. In: Proceedings of the ninth international conference on information and knowledge management. ACM, New York, pp\u00a0488\u2013495"},{"issue":"3","key":"220_CR36","doi-asserted-by":"publisher","first-page":"686","DOI":"10.1109\/TKDE.2003.1198399","volume":"15","author":"F-P Chan","year":"2003","unstructured":"Chan F-P, Fu A-C, Yu C (2003) Haar wavelets for efficient similarity search of time-series: with and without time warping. IEEE Trans Knowl Data Eng 15(3):686\u2013705","journal-title":"IEEE Trans Knowl Data Eng"},{"key":"220_CR37","doi-asserted-by":"publisher","first-page":"771","DOI":"10.1007\/11430919_90","volume-title":"Pacific-Asia conference on knowledge discovery and data mining","author":"C Ratanamahatana","year":"2005","unstructured":"Ratanamahatana C, Keogh E, Bagnall AJ, Lonardi S (2005) A novel bit level time series representation with implication of similarity search and clustering. In: Pacific-Asia conference on knowledge discovery and data mining. Springer, Berlin, pp\u00a0771\u2013777"},{"key":"220_CR38","first-page":"8","volume-title":"Fifth IEEE international conference on data mining (ICDM\u201905)","author":"E Keogh","year":"2005","unstructured":"Keogh E, Lin J, Fu A (2005) Hot sax: efficiently finding the most unusual time series subsequence. In: Fifth IEEE international conference on data mining (ICDM\u201905). IEEE Press, New York, p\u00a08"},{"key":"220_CR39","doi-asserted-by":"publisher","first-page":"1317","DOI":"10.1109\/ICDM.2016.0179","volume-title":"2016 IEEE 16th international conference on data mining (ICDM)","author":"C-CM Yeh","year":"2016","unstructured":"Yeh C-CM, Zhu Y, Ulanova L, Begum N, Ding Y, Dau HA, Silva DF, Mueen A, Keogh E (2016) Matrix profile I: all pairs similarity joins for time series: a unifying view that includes motifs, discords and shapelets. In: 2016 IEEE 16th international conference on data mining (ICDM). IEEE Press, New York, pp\u00a01317\u20131322"},{"key":"220_CR40","unstructured":"Eastman JR, Fulk M (1993) Long sequence time series evaluation using standardized principal components. Photogramm Eng Remote Sens 59(6)"},{"issue":"4","key":"220_CR41","doi-asserted-by":"publisher","first-page":"529","DOI":"10.1017\/S0266466600005995","volume":"13","author":"D Harris","year":"1997","unstructured":"Harris D (1997) Principal components analysis of cointegrated time series. Econom Theory 13(4):529\u2013557","journal-title":"Econom Theory"},{"issue":"2","key":"220_CR42","doi-asserted-by":"publisher","DOI":"10.1007\/s11222-008-9082-y","volume":"19","author":"JRG Lansangan","year":"2009","unstructured":"Lansangan JRG, Barrios EB (2009) Principal components analysis of nonstationary time series data. Stat Comput 19(2):173","journal-title":"Stat Comput"},{"key":"220_CR43","unstructured":"Mueen A, Viswanathan K, Gupta C, Keogh E (2017) The fastest similarity search algorithm for time series subsequences under Euclidean distance"},{"issue":"1","key":"220_CR44","doi-asserted-by":"publisher","first-page":"160","DOI":"10.1287\/ijoc.2013.0554","volume":"26","author":"O Seref","year":"2013","unstructured":"Seref O, Fan Y-J, Chaovalitwongse WA (2013) Mathematical programming formulations and algorithms for discrete k-median clustering of time-series data. INFORMS J Comput 26(1):160\u2013172","journal-title":"INFORMS J Comput"},{"key":"220_CR45","volume-title":"Proc. Workshop on clustering high dimensionality data and its applications","author":"M Vlachos","year":"2003","unstructured":"Vlachos M, Lin J, Keogh E, Gunopulos D (2003) A wavelet-based anytime algorithm for k-means clustering of time series. In: Proc. Workshop on clustering high dimensionality data and its applications. Citeseer"},{"issue":"3","key":"220_CR46","doi-asserted-by":"publisher","first-page":"298","DOI":"10.1006\/nimg.1998.0391","volume":"9","author":"C Goutte","year":"1999","unstructured":"Goutte C, Toft P, Rostrup E, Nielsen F, Hansen LK (1999) On clustering fmri time series. NeuroImage 9(3):298\u2013310","journal-title":"NeuroImage"},{"key":"220_CR47","series-title":"Proceedings","doi-asserted-by":"publisher","first-page":"393","DOI":"10.1109\/BIBE.2003.1188978","volume-title":"Third IEEE symposium on bioinformatics and bioengineering, 2003","author":"D Jiang","year":"2003","unstructured":"Jiang D, Pei J, Zhang A (2003) Dhc: a density-based hierarchical clustering method for time series gene expression data. In: Third IEEE symposium on bioinformatics and bioengineering, 2003. Proceedings. IEEE Press, New York, pp\u00a0393\u2013400"},{"key":"220_CR48","doi-asserted-by":"publisher","first-page":"499","DOI":"10.1137\/1.9781611972764.48","volume-title":"Proceedings of the 2006 SIAM international conference on data mining","author":"PP Rodrigues","year":"2006","unstructured":"Rodrigues PP, Gama J, Pedroso JP (2006) Odac: hierarchical clustering of time series data streams. In: Proceedings of the 2006 SIAM international conference on data mining. SIAM, Philadelphia, pp\u00a0499\u2013503"},{"issue":"5","key":"220_CR49","doi-asserted-by":"publisher","first-page":"615","DOI":"10.1109\/TKDE.2007.190727","volume":"20","author":"PP Rodrigues","year":"2008","unstructured":"Rodrigues PP, Gama J, Pedroso J (2008) Hierarchical clustering of time-series data streams. IEEE Trans Knowl Data Eng 20(5):615\u2013627","journal-title":"IEEE Trans Knowl Data Eng"},{"key":"220_CR50","first-page":"8","volume-title":"Fifth IEEE international conference on data mining (ICDM\u201905)","author":"A Denton","year":"2005","unstructured":"Denton A (2005) Kernel-density-based clustering of time series subsequences using a continuous random-walk noise model. In: Fifth IEEE international conference on data mining (ICDM\u201905). IEEE Press, New York, p\u00a08"},{"issue":"1","key":"220_CR51","doi-asserted-by":"publisher","first-page":"208","DOI":"10.1016\/j.datak.2006.01.013","volume":"60","author":"D Birant","year":"2007","unstructured":"Birant D, Kut A (2007) St-dbscan: an algorithm for clustering spatial\u2013temporal data. Data Knowl Eng 60(1):208\u2013221","journal-title":"Data Knowl Eng"},{"key":"220_CR52","doi-asserted-by":"publisher","first-page":"91","DOI":"10.1109\/INISTA.2011.5946052","volume-title":"2011 international symposium on innovations in intelligent systems and applications","author":"M \u00c7elik","year":"2011","unstructured":"\u00c7elik M, Dada\u015fer-\u00c7elik F, Dokuz A\u015e (2011) Anomaly detection in temperature data using dbscan algorithm. In: 2011 international symposium on innovations in intelligent systems and applications. IEEE Press, New York, pp\u00a091\u201395"},{"key":"220_CR53","doi-asserted-by":"publisher","first-page":"557","DOI":"10.1145\/775047.775129","volume-title":"Proceedings of the eighth ACM SIGKDD international conference on knowledge discovery and data mining","author":"M Kumar","year":"2002","unstructured":"Kumar M, Patel NR, Woo J (2002) Clustering seasonality patterns in the presence of errors. In: Proceedings of the eighth ACM SIGKDD international conference on knowledge discovery and data mining. ACM, New York, pp\u00a0557\u2013563"},{"key":"220_CR54","first-page":"17","volume-title":"Proceedings of the IJCAI-99 workshop on neural, symbolic and reinforcement learning methods for sequence learning","author":"T Oates","year":"1999","unstructured":"Oates T, Firoiu L, Cohen PR (1999) Clustering time series with hidden Markov models and dynamic time warping. In: Proceedings of the IJCAI-99 workshop on neural, symbolic and reinforcement learning methods for sequence learning, pp\u00a017\u201321. Citeseer"},{"issue":"8","key":"220_CR55","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevLett.79.1475","volume":"79","author":"T Schreiber","year":"1997","unstructured":"Schreiber T, Schmitz A (1997) Classification of time series data with nonlinear similarity measures. Phys Rev Lett 79(8):1475","journal-title":"Phys Rev Lett"},{"key":"220_CR56","doi-asserted-by":"publisher","first-page":"273","DOI":"10.1109\/ICDM.2001.989529","volume-title":"Proceedings 2001 IEEE international conference on data mining","author":"K Kalpakis","year":"2001","unstructured":"Kalpakis K, Gada D, Puttagunta V (2001) Distance measures for effective clustering of arima time-series. In: Proceedings 2001 IEEE international conference on data mining. IEEE Press, New York, pp\u00a0273\u2013280"},{"key":"220_CR57","doi-asserted-by":"crossref","first-page":"49","DOI":"10.1145\/1014052.1014061","volume-title":"Proceedings of the tenth ACM SIGKDD international conference on knowledge discovery and data mining","author":"AJ Bagnall","year":"2004","unstructured":"Bagnall AJ, Janacek GJ (2004) Clustering time series from arma models with clipped data. In: Proceedings of the tenth ACM SIGKDD international conference on knowledge discovery and data mining. ACM, New York, pp\u00a049\u201358"},{"issue":"8","key":"220_CR58","doi-asserted-by":"publisher","first-page":"1675","DOI":"10.1016\/j.patcog.2003.12.018","volume":"37","author":"Y Xiong","year":"2004","unstructured":"Xiong Y, Yeung D-Y (2004) Time series clustering with arma mixtures. Pattern Recognit 37(8):1675\u20131689","journal-title":"Pattern Recognit"},{"issue":"1","key":"220_CR59","doi-asserted-by":"publisher","first-page":"78","DOI":"10.1198\/073500107000000106","volume":"26","author":"S Fr\u00f6hwirth-Schnatter","year":"2008","unstructured":"Fr\u00f6hwirth-Schnatter S, Kaufmann S (2008) Model-based clustering of multiple time series. J Bus Econ Stat 26(1):78\u201389","journal-title":"J Bus Econ Stat"},{"issue":"2","key":"220_CR60","doi-asserted-by":"publisher","first-page":"154","DOI":"10.1007\/s10115-004-0172-7","volume":"8","author":"E Keogh","year":"2005","unstructured":"Keogh E, Lin J (2005) Clustering of time-series subsequences is meaningless: implications for previous and future research. Knowl Inf Syst 8(2):154\u2013177","journal-title":"Knowl Inf Syst"},{"issue":"3","key":"220_CR61","doi-asserted-by":"publisher","first-page":"606","DOI":"10.1007\/s10618-016-0483-9","volume":"31","author":"A Bagnall","year":"2017","unstructured":"Bagnall A, Lines J, Bostrom A, Large J, Keogh E (2017) The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances. Data Min Knowl Discov 31(3):606\u2013660","journal-title":"Data Min Knowl Discov"},{"key":"220_CR62","first-page":"79","volume-title":"Vldb","author":"AC Gilbert","year":"2001","unstructured":"Gilbert AC, Kotidis Y, Muthukrishnan S, Strauss M (2001) Surfing wavelets on streams: one-pass summaries for approximate aggregate queries. In: Vldb, vol\u00a01, pp\u00a079\u201388"},{"key":"220_CR63","first-page":"523","volume-title":"International conference on intelligent data engineering and automated learning","author":"S Ahmad","year":"2004","unstructured":"Ahmad S, Taskaya-Temizel T, Ahmad K (2004) Summarizing time series: learning patterns in \u2018volatile\u2019 series. In: International conference on intelligent data engineering and automated learning. Springer, Berlin, pp\u00a0523\u2013532"},{"key":"220_CR64","doi-asserted-by":"crossref","first-page":"171","DOI":"10.1145\/2001858.2001954","volume-title":"Proceedings of the 13th annual conference companion on genetic and evolutionary computation","author":"R Castillo-Ortega","year":"2011","unstructured":"Castillo-Ortega R, Mar\u00edn N, S\u00e1nchez D, Tettamanzi AG (2011) A multi-objective memetic algorithm for the linguistic summarization of time series. In: Proceedings of the 13th annual conference companion on genetic and evolutionary computation. ACM, New York, pp\u00a0171\u2013172"},{"key":"220_CR65","first-page":"416","volume-title":"EUSFLAT","author":"R Castillo Ortega","year":"2011","unstructured":"Castillo Ortega R, Mar\u00edn N, S\u00e1nchez D, Tettamanzi AG (2011) Linguistic summarization of time series data using genetic algorithms. In: EUSFLAT, vol\u00a01. Atlantis Press, pp\u00a0416\u2013423"},{"key":"220_CR66","doi-asserted-by":"publisher","first-page":"230","DOI":"10.1007\/978-3-540-73451-2_25","volume-title":"International conference on rough sets and intelligent systems paradigms","author":"J Kacprzyk","year":"2007","unstructured":"Kacprzyk J, Wilbik A, Zadro\u017cny S (2007) Linguistic summarization of time series under different granulation of describing features. In: International conference on rough sets and intelligent systems paradigms. Springer, Berlin, pp\u00a0230\u2013240"},{"issue":"12","key":"220_CR67","doi-asserted-by":"publisher","first-page":"1485","DOI":"10.1016\/j.fss.2008.01.025","volume":"159","author":"J Kacprzyk","year":"2008","unstructured":"Kacprzyk J, Wilbik A, Zadro\u017cny S (2008) Linguistic summarization of time series using a fuzzy quantifier driven aggregation. Fuzzy Sets Syst 159(12):1485\u20131499","journal-title":"Fuzzy Sets Syst"},{"issue":"5","key":"220_CR68","first-page":"411","volume":"25","author":"J Kacprzyk","year":"2010","unstructured":"Kacprzyk J, Wilbik A, Zadro\u017cny S (2010) An approach to the linguistic summarization of time series using a fuzzy quantifier driven aggregation. Int J Intell Syst 25(5):411\u2013439","journal-title":"Int J Intell Syst"},{"key":"220_CR69","doi-asserted-by":"publisher","first-page":"507","DOI":"10.1145\/1557019.1557078","volume-title":"Proceedings of the 15th ACM SIGKDD international conference on knowledge discovery and data mining","author":"L Li","year":"2009","unstructured":"Li L, McCann J, Pollard NS, Faloutsos C (2009) Dynammo: mining and summarization of coevolving sequences with missing values. In: Proceedings of the 15th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, New York, pp\u00a0507\u2013516"},{"issue":"3","key":"220_CR70","doi-asserted-by":"publisher","DOI":"10.1145\/1541880.1541882","volume":"41","author":"V Chandola","year":"2009","unstructured":"Chandola V, Banerjee A, Kumar V (2009) Anomaly detection: a survey. ACM Comput Surv (CSUR 41(3):15","journal-title":"ACM Comput Surv (CSUR"},{"issue":"1","key":"220_CR71","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevLett.88.013901","volume":"88","author":"G Gbur","year":"2001","unstructured":"Gbur G, Visser T, Wolf E (2001) Anomalous behavior of spectra near phase singularities of focused waves. Phys Rev Lett 88(1):013901","journal-title":"Phys Rev Lett"},{"issue":"3","key":"220_CR72","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevE.62.R3023","volume":"62","author":"V Plerou","year":"2000","unstructured":"Plerou V, Gopikrishnan P, Amaral LAN, Gabaix X, Stanley HE (2000) Economic fluctuations and anomalous diffusion. Phys Rev E 62(3):3023","journal-title":"Phys Rev E"},{"issue":"4","key":"220_CR73","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevLett.106.048103","volume":"106","author":"J-H Jeon","year":"2011","unstructured":"Jeon J-H, Tejedor V, Burov S, Barkai E, Selhuber-Unkel C, Berg-S\u00f8rensen K, Oddershede L, Metzler R (2011) In vivo anomalous diffusion and weak ergodicity breaking of lipid granules. Phys Rev Lett 106(4):048103","journal-title":"Phys Rev Lett"},{"key":"220_CR74","unstructured":"Palfrey TR, Prisbrey JE (1997) Anomalous behavior in public goods experiments: how much and why?. Am Econ Rev: 829\u2013846"},{"issue":"3","key":"220_CR75","doi-asserted-by":"publisher","first-page":"678","DOI":"10.1257\/aer.89.3.678","volume":"89","author":"CM Capra","year":"1999","unstructured":"Capra CM, Goeree JK, Gomez R, Holt CA (1999) Anomalous behavior in a traveler\u2019s dilemma? Am Econ Rev 89(3):678\u2013690","journal-title":"Am Econ Rev"},{"issue":"2","key":"220_CR76","doi-asserted-by":"publisher","first-page":"165","DOI":"10.1080\/00401706.1983.10487848","volume":"25","author":"B Rosner","year":"1983","unstructured":"Rosner B (1983) Percentage points for a generalized esd many-outlier procedure. Technometrics 25(2):165\u2013172","journal-title":"Technometrics"},{"key":"220_CR77","volume-title":"6th $\\{\\mathrm{USENIX}\\}$ workshop on hot topics in cloud computing (HotCloud 14)","author":"O Vallis","year":"2014","unstructured":"Vallis O, Hochenbaum J, Kejariwal A (2014) A novel technique for long-term anomaly detection in the cloud. In: 6th $\\{\\mathrm{USENIX}\\}$ workshop on hot topics in cloud computing (HotCloud 14)"},{"key":"220_CR78","doi-asserted-by":"publisher","first-page":"1939","DOI":"10.1145\/2783258.2788611","volume-title":"Proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining","author":"N Laptev","year":"2015","unstructured":"Laptev N, Amizadeh S, Flint I (2015) Generic and scalable framework for automated time-series anomaly detection. In: Proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, New York, pp\u00a01939\u20131947"},{"key":"220_CR79","first-page":"8","volume-title":"Fifth IEEE international conference on data mining (ICDM\u201905)","author":"PK Chan","year":"2005","unstructured":"Chan PK, Mahoney MV (2005) Modeling multiple time series for anomaly detection. In: Fifth IEEE international conference on data mining (ICDM\u201905). IEEE Press, New York, p\u00a08"},{"key":"220_CR80","doi-asserted-by":"publisher","first-page":"413","DOI":"10.1137\/1.9781611972795.36","volume-title":"Proceedings of the 2009 SIAM international conference on data mining","author":"H Cheng","year":"2009","unstructured":"Cheng H, Tan P-N, Potter C, Klooster S (2009) Detection and characterization of anomalies in multivariate time series. In: Proceedings of the 2009 SIAM international conference on data mining. SIAM, Philadelphia, pp\u00a0413\u2013424"},{"key":"220_CR81","doi-asserted-by":"publisher","first-page":"1074","DOI":"10.1109\/ICDM.2012.73","volume-title":"2012 IEEE 12th international conference on data mining","author":"H Qiu","year":"2012","unstructured":"Qiu H, Liu Y, Subrahmanya NA, Li W (2012) Granger causality for time-series anomaly detection. In: 2012 IEEE 12th international conference on data mining. IEEE Press, New York, pp\u00a01074\u20131079"},{"issue":"3","key":"220_CR82","doi-asserted-by":"publisher","first-page":"948","DOI":"10.1016\/j.ijforecast.2015.06.001","volume":"32","author":"HN Akouemo","year":"2016","unstructured":"Akouemo HN, Povinelli RJ (2016) Probabilistic anomaly detection in natural gas time series data. Int J Forecast 32(3):948\u2013956","journal-title":"Int J Forecast"},{"issue":"4","key":"220_CR83","doi-asserted-by":"publisher","first-page":"969","DOI":"10.2307\/2527348","volume":"39","author":"Marcelle Chauvet","year":"1998","unstructured":"Chauvet M (1998) An econometric characterization of business cycle dynamics with factor structure and regime switching. Int Econ Rev: 969\u2013996","journal-title":"International Economic Review"},{"issue":"1","key":"220_CR84","doi-asserted-by":"publisher","first-page":"96","DOI":"10.1198\/073500104000000613","volume":"23","author":"M Dueker","year":"2005","unstructured":"Dueker M (2005) Dynamic forecasts of qualitative variables: a qual var model of US recessions. J Bus Econ Stat 23(1):96\u2013104","journal-title":"J Bus Econ Stat"},{"issue":"1","key":"220_CR85","doi-asserted-by":"publisher","first-page":"76","DOI":"10.1016\/j.jmacro.2011.10.002","volume":"34","author":"P \u00d6sterholm","year":"2012","unstructured":"\u00d6sterholm P (2012) The limited usefulness of macroeconomic Bayesian vars when estimating the probability of a US recession. J Macroecon 34(1):76\u201386","journal-title":"J Macroecon"},{"issue":"5","key":"220_CR86","doi-asserted-by":"publisher","first-page":"573","DOI":"10.1002\/(SICI)1099-1255(199609)11:5<573::AID-JAE413>3.0.CO;2-T","volume":"11","author":"JD Hamilton","year":"1996","unstructured":"Hamilton JD, Lin G (1996) Stock market volatility and the business cycle. J Appl Econom 11(5):573\u2013593","journal-title":"J Appl Econom"},{"issue":"1","key":"220_CR87","doi-asserted-by":"publisher","first-page":"45","DOI":"10.1162\/003465398557320","volume":"80","author":"A Estrella","year":"1998","unstructured":"Estrella A, Mishkin FS (1998) Predicting US recessions: financial variables as leading indicators. Rev Econ Stat 80(1):45\u201361","journal-title":"Rev Econ Stat"},{"issue":"3","key":"220_CR88","doi-asserted-by":"publisher","first-page":"383","DOI":"10.1016\/S0169-2070(01)00092-9","volume":"17","author":"M Qi","year":"2001","unstructured":"Qi M (2001) Predicting US recessions with leading indicators via neural network models. Int J Forecast 17(3):383\u2013401","journal-title":"Int J Forecast"},{"issue":"6","key":"220_CR89","doi-asserted-by":"publisher","first-page":"455","DOI":"10.1002\/for.2345","volume":"34","author":"TJ Berge","year":"2015","unstructured":"Berge TJ (2015) Predicting recessions with leading indicators: model averaging and selection over the business cycle. J Forecast 34(6):455\u2013471","journal-title":"J Forecast"},{"issue":"1","key":"220_CR90","doi-asserted-by":"publisher","first-page":"201","DOI":"10.1093\/biomet\/63.1.201","volume":"63","author":"A O\u2019hagan","year":"1976","unstructured":"O\u2019hagan A, Leonard T (1976) Bayes estimation subject to uncertainty about parameter constraints. Biometrika 63(1):201\u2013203","journal-title":"Biometrika"},{"issue":"2\u20133","key":"220_CR91","doi-asserted-by":"publisher","first-page":"253","DOI":"10.1016\/0304-4076(87)90027-3","volume":"35","author":"JS Cramer","year":"1987","unstructured":"Cramer JS (1987) Mean and variance of r2 in small and moderate samples. J Econom 35(2\u20133):253\u2013266","journal-title":"J Econom"},{"issue":"4","key":"220_CR92","doi-asserted-by":"publisher","first-page":"375","DOI":"10.1016\/0165-1765(92)90021-P","volume":"38","author":"ML Carrodus","year":"1992","unstructured":"Carrodus ML, Giles DE (1992) The exact distribution of r2 when the regression disturbances are autocorrelated. Econ Lett 38(4):375\u2013380","journal-title":"Econ Lett"},{"issue":"4","key":"220_CR93","doi-asserted-by":"publisher","first-page":"373","DOI":"10.1023\/A:1024940629314","volume":"7","author":"J Kleinberg","year":"2003","unstructured":"Kleinberg J (2003) Bursty and hierarchical structure in streams. Data Min Knowl Discov 7(4):373\u2013397","journal-title":"Data Min Knowl Discov"},{"key":"220_CR94","doi-asserted-by":"publisher","first-page":"497","DOI":"10.1145\/1557019.1557077","volume-title":"Proceedings of the 15th ACM SIGKDD international conference on knowledge discovery and data mining","author":"J Leskovec","year":"2009","unstructured":"Leskovec J, Backstrom L, Kleinberg J (2009) Meme-tracking and the dynamics of the news cycle. In: Proceedings of the 15th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, New York, pp\u00a0497\u2013506"},{"key":"220_CR95","first-page":"993","volume":"3","author":"DM Blei","year":"2003","unstructured":"Blei DM, Ng AY, Jordan MI (2003) Latent Dirichlet allocation. J Mach Learn Res 3:993\u20131022","journal-title":"J Mach Learn Res"},{"key":"220_CR96","doi-asserted-by":"publisher","first-page":"231","DOI":"10.1109\/VAST.2011.6102461","volume-title":"2011 IEEE conference on visual analytics science and technology (VAST)","author":"W Dou","year":"2011","unstructured":"Dou W, Wang X, Chang R, Ribarsky W (2011) Paralleltopics: a probabilistic approach to exploring document collections. In: 2011 IEEE conference on visual analytics science and technology (VAST). IEEE Press, New York, pp\u00a0231\u2013240"},{"issue":"16","key":"220_CR97","doi-asserted-by":"publisher","first-page":"6483","DOI":"10.1073\/pnas.0808904106","volume":"106","author":"M\u00c1 Serrano","year":"2009","unstructured":"Serrano M\u00c1, Bogun\u00e1 M, Vespignani A (2009) Extracting the multiscale backbone of complex weighted networks. Proc Natl Acad Sci 106(16):6483\u20136488","journal-title":"Proc Natl Acad Sci"},{"issue":"6","key":"220_CR98","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevE.70.066111","volume":"70","author":"A Clauset","year":"2004","unstructured":"Clauset A, Newman ME, Moore C (2004) Finding community structure in very large networks. Phys Rev E 70(6):066111","journal-title":"Phys Rev E"}],"container-title":["EPJ Data Science"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1140\/epjds\/s13688-020-0220-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1140\/epjds\/s13688-020-0220-x\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1140\/epjds\/s13688-020-0220-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,9,26]],"date-time":"2023-09-26T11:51:12Z","timestamp":1695729072000},"score":1,"resource":{"primary":{"URL":"https:\/\/epjdatascience.springeropen.com\/articles\/10.1140\/epjds\/s13688-020-0220-x"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,2,7]]},"references-count":98,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2020,12]]}},"alternative-id":["220"],"URL":"https:\/\/doi.org\/10.1140\/epjds\/s13688-020-0220-x","relation":{},"ISSN":["2193-1127"],"issn-type":[{"type":"electronic","value":"2193-1127"}],"subject":[],"published":{"date-parts":[[2020,2,7]]},"assertion":[{"value":"28 June 2019","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"13 January 2020","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"7 February 2020","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"The authors declare that they have no competing interests.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"3"}}