{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,9]],"date-time":"2026-02-09T10:15:15Z","timestamp":1770632115153,"version":"3.49.0"},"reference-count":32,"publisher":"Oxford University Press (OUP)","issue":"13","license":[{"start":{"date-parts":[[2021,1,20]],"date-time":"2021-01-20T00:00:00Z","timestamp":1611100800000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100002341","name":"Academy of Finland","doi-asserted-by":"publisher","award":["292660"],"award-info":[{"award-number":["292660"]}],"id":[{"id":"10.13039\/501100002341","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,7,27]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Motivation<\/jats:title><jats:p>Longitudinal study designs are indispensable for studying disease progression. Inferring covariate effects from longitudinal data, however, requires interpretable methods that can model complicated covariance structures and detect non-linear effects of both categorical and continuous covariates, as well as their interactions. Detecting disease effects is hindered by the fact that they often occur rapidly near the disease initiation time, and this time point cannot be exactly observed. An additional challenge is that the effect magnitude can be heterogeneous over the subjects.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>We present lgpr, a widely applicable and interpretable method for non-parametric analysis of longitudinal data using additive Gaussian processes. We demonstrate that it outperforms previous approaches in identifying the relevant categorical and continuous covariates in various settings. Furthermore, it implements important novel features, including the ability to account for the heterogeneity of covariate effects, their temporal uncertainty, and appropriate observation models for different types of biomedical data. The lgpr tool is implemented as a comprehensive and user-friendly R-package.<\/jats:p><\/jats:sec><jats:sec><jats:title>Availability and implementation<\/jats:title><jats:p>lgpr is available at jtimonen.github.io\/lgpr-usage with documentation, tutorials, test data and code for reproducing the experiments of this article.<\/jats:p><\/jats:sec><jats:sec><jats:title>Supplementary information<\/jats:title><jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p><\/jats:sec>","DOI":"10.1093\/bioinformatics\/btab021","type":"journal-article","created":{"date-parts":[[2021,1,8]],"date-time":"2021-01-08T23:49:12Z","timestamp":1610149752000},"page":"1860-1867","source":"Crossref","is-referenced-by-count":11,"title":["<i>lgpr:<\/i>an interpretable non-parametric method for inferring covariate effects from longitudinal data"],"prefix":"10.1093","volume":"37","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-2341-6765","authenticated-orcid":false,"given":"Juho","family":"Timonen","sequence":"first","affiliation":[{"name":"Department of Computer Science, Aalto University , Espoo 00076, Finland"}]},{"given":"Henrik","family":"Mannerstr\u00f6m","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Aalto University , Espoo 00076, Finland"}]},{"given":"Aki","family":"Vehtari","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Aalto University , Espoo 00076, Finland"}]},{"given":"Harri","family":"L\u00e4hdesm\u00e4ki","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Aalto University , Espoo 00076, Finland"}]}],"member":"286","published-online":{"date-parts":[[2021,1,21]]},"reference":[{"key":"2023051611455230700_btab021-B1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.18637\/jss.v067.i01","article-title":"Fitting linear mixed-effects models using lme4","volume":"67","author":"Bates","year":"2015","journal-title":"J. Stat. Softw"},{"key":"2023051611455230700_btab021-B2","article-title":"A conceptual introduction to Hamiltonian Monte Carlo","author":"Betancourt","year":"2017","journal-title":"arXiv:1701.02434"},{"key":"2023051611455230700_btab021-B3","article-title":"Stan: a probabilistic programming language","volume":"76, 1\u201332","author":"Carpenter","year":"2017","journal-title":"J. Stat. Softw"},{"key":"2023051611455230700_btab021-B4","doi-asserted-by":"crossref","first-page":"1798","DOI":"10.1038\/s41467-019-09785-8","article-title":"An additive Gaussian process regression model for interpretable non-parametric analysis of longitudinal data","volume":"10","author":"Cheng","year":"2019","journal-title":"Nat. Commun"},{"key":"2023051611455230700_btab021-B5","doi-asserted-by":"crossref","DOI":"10.1093\/oso\/9780198524847.001.0001","volume-title":"Analysis of Longitudinal Data","author":"Diggle","year":"2002"},{"key":"2023051611455230700_btab021-B6","first-page":"226","article-title":"Additive Gaussian processes","volume":"24","author":"Duvenaud","year":"2011","journal-title":"Adv. Neur. Inf. Proc. Syst"},{"key":"2023051611455230700_btab021-B7","first-page":"189","article-title":"Why we (usually) don\u2019t have to worry about multiple comparisons","volume":"5","author":"Gelman","year":"2012","journal-title":"J. Res. Educ. Eff"},{"key":"2023051611455230700_btab021-B8","doi-asserted-by":"crossref","first-page":"307","DOI":"10.1080\/00031305.2018.1549100","article-title":"R-squared for Bayesian regression models","volume":"73","author":"Gelman","year":"2019","journal-title":"Am. Stat"},{"key":"2023051611455230700_btab021-B9","doi-asserted-by":"crossref","first-page":"29","DOI":"10.1093\/biomet\/85.1.29","article-title":"Model choice in generalised linear models: a Bayesian approach via Kullback-Leibler projections","volume":"85","author":"Goutis","year":"1998","journal-title":"Biometrika"},{"key":"2023051611455230700_btab021-B10","first-page":"297","article-title":"Generalized additive models","volume":"1","author":"Hastie","year":"1986","journal-title":"Stat. Sci"},{"key":"2023051611455230700_btab021-B11","first-page":"1593","article-title":"The No-U-Turn Sampler: adaptively setting path lengths in Hamiltonian Monte Carlo","volume":"15","author":"Hoffman","year":"2014","journal-title":"J. Mach. Learn. Res"},{"key":"2023051611455230700_btab021-B12","doi-asserted-by":"crossref","first-page":"2024","DOI":"10.2337\/db19-0287","article-title":"Early detection of peripheral blood cell signature in children developing \u03b2-cell autoimmunity at a young age","volume":"68","author":"Kallionp\u00e4\u00e4","year":"2019","journal-title":"Diabetes"},{"key":"2023051611455230700_btab021-B13","doi-asserted-by":"crossref","first-page":"123","DOI":"10.1214\/10-BA505","article-title":"Bayesian functional ANOVA modeling using Gaussian process prior distributions","volume":"5","author":"Kaufman","year":"2010","journal-title":"Bayesian Anal"},{"key":"2023051611455230700_btab021-B14","doi-asserted-by":"crossref","first-page":"1","DOI":"10.18637\/jss.v082.i13","article-title":"lmerTest package: tests in linear mixed effects models","volume":"82","author":"Kuznetsova","year":"2017","journal-title":"J. Stat. Softw"},{"key":"2023051611455230700_btab021-B15","doi-asserted-by":"crossref","first-page":"100","DOI":"10.1016\/j.jprot.2017.10.004","article-title":"Temporal expression profiling of plasma proteins reveals oxidative stress in early stages of Type 1 Diabetes progression","volume":"172","author":"Liu","year":"2018","journal-title":"J. Proteomics"},{"key":"2023051611455230700_btab021-B16","doi-asserted-by":"crossref","first-page":"118","DOI":"10.1111\/j.2041-210X.2010.00021.x","article-title":"Do not log-transform count data","volume":"1","author":"O\u2019Hara","year":"2010","journal-title":"Methods Ecol. Evol"},{"key":"2023051611455230700_btab021-B17","doi-asserted-by":"crossref","first-page":"1189","DOI":"10.2337\/db06-0880","article-title":"The heterogeneity of diabetes","volume":"56","author":"Pietropaolo","year":"2007","journal-title":"Diabetes"},{"key":"2023051611455230700_btab021-B18","doi-asserted-by":"crossref","first-page":"711","DOI":"10.1007\/s11222-016-9649-y","article-title":"Comparison of Bayesian predictive methods for model selection","volume":"27","author":"Piironen","year":"2017","journal-title":"Stat. Comput"},{"key":"2023051611455230700_btab021-B19","doi-asserted-by":"crossref","first-page":"29","DOI":"10.2333\/bhmk.26.29","article-title":"Accuracy versus interpretability in flexible modeling: implementing a tradeoff using Gaussian process models","volume":"26","author":"Plate","year":"1999","journal-title":"Behaviourmetrika"},{"key":"2023051611455230700_btab021-B20","doi-asserted-by":"crossref","first-page":"1168","DOI":"10.1080\/01621459.2015.1076725","article-title":"Bayesian nonparametric longitudinal data analysis","volume":"111","author":"Quintana","year":"2016","journal-title":"J. Am. Stat. Assoc"},{"key":"2023051611455230700_btab021-B21","volume-title":"R: A Language and Environment for Statistical Computing","year":"2018"},{"key":"2023051611455230700_btab021-B22","volume-title":"Gaussian Processes for Machine Learning","author":"Rasmussen","year":"2006"},{"key":"2023051611455230700_btab021-B23","doi-asserted-by":"crossref","first-page":"20110550","DOI":"10.1098\/rsta.2011.0550","article-title":"Gaussian processes for time-series modelling","volume":"371","author":"Roberts","year":"2013","journal-title":"Phil. Trans. R. Soc. A"},{"key":"2023051611455230700_btab021-B24","doi-asserted-by":"crossref","DOI":"10.1186\/1471-2105-12-77","article-title":"pROC: an open-source package for R and S+ to analyze and compare ROC curves","volume":"12","author":"Robin","year":"2011","journal-title":"BMC Bioinformatics"},{"key":"2023051611455230700_btab021-B25","doi-asserted-by":"crossref","first-page":"4931","DOI":"10.1038\/s41467-018-07261-3","article-title":"Longitudinal single-cell RNA sequencing of patient-derived primary cells reveals drug-induced infidelity in stem cell hierarchy","volume":"9","author":"Sharma","year":"2018","journal-title":"Nat. Commun"},{"key":"2023051611455230700_btab021-B26","first-page":"1674","article-title":"Input warping for Bayesian optimization of non-stationary functions","volume":"31","author":"Snoek","year":"2014","journal-title":"Int. Conf. Mach. Learn"},{"key":"2023051611455230700_btab021-B27","volume-title":"Generalized Linear Mixed Models: Modern Concepts, Methods and Applications. Chapman & Hall\/CRC Texts in Statistical Science","author":"Stroup","year":"2012"},{"key":"2023051611455230700_btab021-B28","first-page":"1175","article-title":"GPstuff: Bayesian modeling with Gaussian processes","volume":"14","author":"Vanhatalo","year":"2013","journal-title":"J. Mach. Learn. Res"},{"key":"2023051611455230700_btab021-B29","doi-asserted-by":"crossref","first-page":"842","DOI":"10.1016\/j.cell.2016.04.007","article-title":"Variation in microbiome LPS immunogenicity contributes to autoimmunity in humans","volume":"165","author":"Vatanen","year":"2016","journal-title":"Cell"},{"key":"2023051611455230700_btab021-B30","volume-title":"Bayesian Anal","author":"Vehtari","year":"2020"},{"key":"2023051611455230700_btab021-B31","doi-asserted-by":"crossref","DOI":"10.1201\/9781420010404","volume-title":"Generalized Additive Models: An Introduction with R. Texts in Statistical Science","author":"Wood","year":"2006"},{"key":"2023051611455230700_btab021-B32","doi-asserted-by":"crossref","first-page":"2473","DOI":"10.1001\/jama.2013.6285","article-title":"Seroconversion to multiple islet autoantibodies and risk of progression to diabetes in children","volume":"309","author":"Ziegler","year":"2013","journal-title":"J. Am. Med. Assoc"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btab021\/36158834\/btab021.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/13\/1860\/50340113\/btab021.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/13\/1860\/50340113\/btab021.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,8,22]],"date-time":"2024-08-22T03:07:25Z","timestamp":1724296045000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/37\/13\/1860\/6104850"}},"subtitle":[],"editor":[{"given":"Pier Luigi","family":"Martelli","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2021,1,21]]},"references-count":32,"journal-issue":{"issue":"13","published-print":{"date-parts":[[2021,7,27]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btab021","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,7,1]]},"published":{"date-parts":[[2021,1,21]]}}}