{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,28]],"date-time":"2026-03-28T22:24:34Z","timestamp":1774736674061,"version":"3.50.1"},"reference-count":39,"publisher":"Oxford University Press (OUP)","issue":"13","license":[{"start":{"date-parts":[[2018,6,27]],"date-time":"2018-06-27T00:00:00Z","timestamp":1530057600000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001711","name":"SNSF","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100001711","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Significant Pattern Mining"},{"name":"SPHN"},{"name":"PHRT"},{"name":"Personalized Swiss Sepsis Study"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2018,7,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Motivation<\/jats:title><jats:p>Most modern intensive care units record the physiological and vital signs of patients. These data can be used to extract signatures, commonly known as biomarkers, that help physicians understand the biological complexity of many syndromes. However, most biological biomarkers suffer from either poor predictive performance or weak explanatory power. Recent developments in time series classification focus on discovering shapelets, i.e. subsequences that are most predictive in terms of class membership. Shapelets have the advantage of combining a high predictive performance with an interpretable component\u2014their shape. Currently, most shapelet discovery methods do not rely on statistical tests to verify the significance of individual shapelets. Therefore, identifying associations between the shapelets of physiological biomarkers and patients that exhibit certain phenotypes of interest enables the discovery and subsequent ranking of physiological signatures that are interpretable, statistically validated and accurate predictors of clinical endpoints.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>We present a novel and scalable method for scanning time series and identifying discriminative patterns that are statistically significant. The significance of a shapelet is evaluated while considering the problem of multiple hypothesis testing and mitigating it by efficiently pruning untestable shapelet candidates with Tarone\u2019s method. We demonstrate the utility of our method by discovering patterns in three of a patient\u2019s vital signs: heart rate, respiratory rate and systolic blood pressure that are indicators of the severity of a future sepsis event, i.e. an inflammatory response to an infective agent that can lead to organ failure and death, if not treated in time.<\/jats:p><\/jats:sec><jats:sec><jats:title>Availability and implementation<\/jats:title><jats:p>We make our method and the scripts that are required to reproduce the experiments publicly available at https:\/\/github.com\/BorgwardtLab\/S3M.<\/jats:p><\/jats:sec><jats:sec><jats:title>Supplementary information<\/jats:title><jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p><\/jats:sec>","DOI":"10.1093\/bioinformatics\/bty246","type":"journal-article","created":{"date-parts":[[2018,4,13]],"date-time":"2018-04-13T11:09:48Z","timestamp":1523617788000},"page":"i438-i446","source":"Crossref","is-referenced-by-count":18,"title":["Association mapping in biomedical time series via statistically significant shapelet mining"],"prefix":"10.1093","volume":"34","author":[{"given":"Christian","family":"Bock","sequence":"first","affiliation":[{"name":"Department of Biosystems Science and Engineering, ETH Zurich, Basel, Switzerland"},{"name":"SIB Swiss Institute of Bioinformatics, Switzerland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Thomas","family":"Gumbsch","sequence":"additional","affiliation":[{"name":"Department of Biosystems Science and Engineering, ETH Zurich, Basel, Switzerland"},{"name":"SIB Swiss Institute of Bioinformatics, Switzerland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Michael","family":"Moor","sequence":"additional","affiliation":[{"name":"Department of Biosystems Science and Engineering, ETH Zurich, Basel, Switzerland"},{"name":"SIB Swiss Institute of Bioinformatics, Switzerland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Bastian","family":"Rieck","sequence":"additional","affiliation":[{"name":"Department of Biosystems Science and Engineering, ETH Zurich, Basel, Switzerland"},{"name":"SIB Swiss Institute of Bioinformatics, Switzerland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Damian","family":"Roqueiro","sequence":"additional","affiliation":[{"name":"Department of Biosystems Science and Engineering, ETH Zurich, Basel, Switzerland"},{"name":"SIB Swiss Institute of Bioinformatics, Switzerland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Karsten","family":"Borgwardt","sequence":"additional","affiliation":[{"name":"Department of Biosystems Science and Engineering, ETH Zurich, Basel, Switzerland"},{"name":"SIB Swiss Institute of Bioinformatics, Switzerland"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2018,6,27]]},"reference":[{"key":"2023051604250472200_bty246-B1","doi-asserted-by":"crossref","first-page":"e6642.","DOI":"10.1371\/journal.pone.0006642","article-title":"Continuous multi-parameter heart rate variability analysis heralds onset of sepsis in adults","volume":"4","author":"Ahmad","year":"2009","journal-title":"PLoS One"},{"key":"2023051604250472200_bty246-B2","doi-asserted-by":"crossref","first-page":"81","DOI":"10.1016\/j.ijmedinf.2006.11.006","article-title":"Predictive data mining in clinical medicine: current issues and guidelines","volume":"77","author":"Bellazzi","year":"2008","journal-title":"Int. J. Med. Inform"},{"key":"2023051604250472200_bty246-B3","first-page":"7","article-title":"Biomarkers for sepsis: what is and what might be?","volume":"10","author":"Biron","year":"2015","journal-title":"Biomarker Insights"},{"key":"2023051604250472200_bty246-B4","first-page":"3","article-title":"Teoria statistica delle classi e calcolo delle probabilit\u00e0","volume":"8","author":"Bonferroni","year":"1936","journal-title":"Pubblicazioni Del R. Istituto Superiore Di Scienze Economiche e Commerciali Di Firenze"},{"key":"2023051604250472200_bty246-B5","doi-asserted-by":"crossref","first-page":"69","DOI":"10.1016\/j.compbiomed.2016.05.003","article-title":"A computational approach to early sepsis detection","volume":"74","author":"Calvert","year":"2016","journal-title":"Comp. Biol. Med"},{"key":"2023051604250472200_bty246-B6","doi-asserted-by":"crossref","first-page":"e0180060.","DOI":"10.1371\/journal.pone.0180060","article-title":"Heart rate variability as predictor of mortality in sepsis: a prospective cohort study","volume":"12","author":"de Castilho","year":"2017","journal-title":"PLoS One"},{"key":"2023051604250472200_bty246-B7","doi-asserted-by":"crossref","first-page":"580","DOI":"10.1097\/CCM.0b013e31827e83af","article-title":"Surviving sepsis campaign: international guidelines for management of severe sepsis and septic shock 2012","volume":"41","author":"Dellinger","year":"2013","journal-title":"Crit. Care Med"},{"key":"2023051604250472200_bty246-B8","doi-asserted-by":"crossref","first-page":"e28.","DOI":"10.2196\/medinform.5909","article-title":"Prediction of sepsis in the intensive care unit with minimal electronic health record data: a machine learning approach","volume":"4","author":"Desautels","year":"2016","journal-title":"JMIR Med. Inform"},{"key":"2023051604250472200_bty246-B9","doi-asserted-by":"crossref","first-page":"87","DOI":"10.2307\/2340521","article-title":"On the interpretation of \u03c72 from contingency tables, and the calculation of p","volume":"85","author":"Fisher","year":"1922","journal-title":"J. R. Stat. Soc"},{"key":"2023051604250472200_bty246-B10","doi-asserted-by":"crossref","first-page":"195.","DOI":"10.1186\/1471-2105-13-195","article-title":"Early classification of multivariate temporal observations by extraction of interpretable shapelets","volume":"13","author":"Ghalwash","year":"2012","journal-title":"BMC Bioinform"},{"key":"2023051604250472200_bty246-B11","author":"Ghalwash","year":"2013"},{"key":"2023051604250472200_bty246-B12","first-page":"201","author":"Ghalwash","year":"2013"},{"key":"2023051604250472200_bty246-B13","first-page":"392","author":"Grabocka","year":"2014"},{"key":"2023051604250472200_bty246-B14","doi-asserted-by":"crossref","first-page":"429","DOI":"10.1007\/s10115-015-0905-9","article-title":"Fast classification of univariate and multivariate time series through shapelet discovery","volume":"49","author":"Grabocka","year":"2016","journal-title":"Knowl. Inform. Syst"},{"key":"2023051604250472200_bty246-B15","doi-asserted-by":"crossref","first-page":"299ra122.","DOI":"10.1126\/scitranslmed.aab3719","article-title":"A targeted real-time early warning score (TREWScore) for septic shock","volume":"7","author":"Henry","year":"2015","journal-title":"Sci. Transl. Med"},{"key":"2023051604250472200_bty246-B16","doi-asserted-by":"crossref","first-page":"16045.","DOI":"10.1038\/nrdp.2016.45","article-title":"Sepsis and septic shock","volume":"2","author":"Hotchkiss","year":"2016","journal-title":"Nat. Rev. Dis. Primers"},{"key":"2023051604250472200_bty246-B17","doi-asserted-by":"crossref","first-page":"32","DOI":"10.1093\/jamia\/ocx084","article-title":"The MIMIC Code Repository: enabling reproducibility in critical care research","volume":"25","author":"Johnson","year":"2018","journal-title":"J. Am. Med. Inform. Assoc"},{"key":"2023051604250472200_bty246-B18","doi-asserted-by":"crossref","first-page":"160035.","DOI":"10.1038\/sdata.2016.35","article-title":"MIMIC-III, a freely accessible critical care database","volume":"3","author":"Johnson","year":"2016","journal-title":"Sci. Data"},{"key":"2023051604250472200_bty246-B19","doi-asserted-by":"crossref","first-page":"1053","DOI":"10.1007\/s10618-016-0473-y","article-title":"Generalized random shapelet forests","volume":"30","author":"Karlsson","year":"2016","journal-title":"Data Mining Knowl. Discov"},{"key":"2023051604250472200_bty246-B20","doi-asserted-by":"crossref","first-page":"1308","DOI":"10.1001\/jama.2014.2637","article-title":"Mortality related to severe sepsis and septic shock among critically ill patients in Australia and New Zealand, 2000\u20132012","volume":"311","author":"Kaukonen","year":"2014","journal-title":"JAMA"},{"key":"2023051604250472200_bty246-B21","doi-asserted-by":"crossref","first-page":"154","DOI":"10.1007\/s10115-004-0172-7","article-title":"Clustering of time-series subsequences is meaningless: implications for previous and future research","volume":"8","author":"Keogh","year":"2005","journal-title":"Knowl. Inform. Syst"},{"key":"2023051604250472200_bty246-B22","volume-title":"Analyzing Network Data in Biology and Medicin: A Textbook for Training Biological, Medical and Computational Inter-Disciplinary Scientists","author":"Llinares-L\u00f3pez","year":"2018"},{"key":"2023051604250472200_bty246-B23","author":"Llinares-L\u00f3pez","year":"2015"},{"key":"2023051604250472200_bty246-B24","first-page":"2290","author":"Marshall","year":"2009"},{"key":"2023051604250472200_bty246-B25","first-page":"1154","author":"Mueen","year":"2011"},{"key":"2023051604250472200_bty246-B26","first-page":"2279","volume-title":"Advances in Neural Information Processing Systems 29 (NIPS","author":"Papaxanthos","year":"2016"},{"key":"2023051604250472200_bty246-B27","doi-asserted-by":"crossref","first-page":"8","DOI":"10.1016\/S1441-2772(23)02010-0","article-title":"The outcome of patients with sepsis and septic shock presenting to emergency departments in Australia and New Zealand","volume":"9","author":"Peake","year":"2007","journal-title":"Crit. Care Resuscit"},{"key":"2023051604250472200_bty246-B28","doi-asserted-by":"crossref","first-page":"157","DOI":"10.1080\/14786440009463897","article-title":"X. On the criterion that a given system of deviations from the probable in the case of a correlated system of variables is such that it can be reasonably supposed to have arisen from random sampling","volume":"50","author":"Pearson","year":"1900","journal-title":"Lond. Edinb. Dubl. Phil. Mag. J. Sci"},{"key":"2023051604250472200_bty246-B29","doi-asserted-by":"crossref","first-page":"290","DOI":"10.1001\/jama.2016.20328","article-title":"Prognostic accuracy of the SOFA score, SIRS criteria, and qSOFA score for in-hospital mortality among adults with suspected infection admitted to the intensive care unit","volume":"317","author":"Raith","year":"2017","journal-title":"JAMA"},{"key":"2023051604250472200_bty246-B30","first-page":"668","volume-title":"Fast-Shapelets: A Scalable Algorithm for Discovering Time Series Shapelets","author":"Rakthanmanon","year":"2013"},{"key":"2023051604250472200_bty246-B31","doi-asserted-by":"crossref","first-page":"762","DOI":"10.1001\/jama.2016.0288","article-title":"Assessment of clinical criteria for sepsis: for the Third International Consensus Definitions for Sepsis and Septic Shock (Sepsis-3)","volume":"315","author":"Seymour","year":"2016","journal-title":"JAMA"},{"key":"2023051604250472200_bty246-B32","first-page":"739","author":"Shashikumar","year":"2017"},{"key":"2023051604250472200_bty246-B33","doi-asserted-by":"crossref","first-page":"801","DOI":"10.1001\/jama.2016.0287","article-title":"The third international consensus definitions for sepsis and septic shock (Sepsis-3)","volume":"315","author":"Singer","year":"2016","journal-title":"JAMA"},{"key":"2023051604250472200_bty246-B34","doi-asserted-by":"crossref","first-page":"515","DOI":"10.2307\/2531456","article-title":"A modified Bonferroni method for discrete data","volume":"46","author":"Tarone","year":"1990","journal-title":"Biometrics"},{"key":"2023051604250472200_bty246-B35","doi-asserted-by":"crossref","first-page":"12996","DOI":"10.1073\/pnas.1302233110","article-title":"Statistical significance of combinatorial regulations","volume":"110","author":"Terada","year":"2013","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023051604250472200_bty246-B36","doi-asserted-by":"crossref","first-page":"707","DOI":"10.1007\/BF01709751","article-title":"The SOFA (sepsis-related organ failure assessment) score to describe organ dysfunction\/failure","volume":"22","author":"Vincent","year":"1996","journal-title":"Intensive Care Med"},{"key":"2023051604250472200_bty246-B37","doi-asserted-by":"crossref","first-page":"119","DOI":"10.2481\/dsj.5.119","article-title":"The impact of data mining techniques on medical diagnostics","volume":"5","author":"Wasan","year":"2006","journal-title":"Data Sci. J"},{"key":"2023051604250472200_bty246-B38","author":"Wistuba","year":"2015"},{"key":"2023051604250472200_bty246-B39","first-page":"947","author":"Ye","year":"2009"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/34\/13\/i438\/50316254\/bioinformatics_34_13_i438.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/34\/13\/i438\/50316254\/bioinformatics_34_13_i438.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,7,6]],"date-time":"2024-07-06T00:18:07Z","timestamp":1720225087000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/34\/13\/i438\/5045733"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,6,27]]},"references-count":39,"journal-issue":{"issue":"13","published-print":{"date-parts":[[2018,7,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bty246","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2018,7,1]]},"published":{"date-parts":[[2018,6,27]]}}}