{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,31]],"date-time":"2026-01-31T09:20:07Z","timestamp":1769851207271,"version":"3.49.0"},"reference-count":52,"publisher":"Oxford University Press (OUP)","issue":"1","license":[{"start":{"date-parts":[[2022,12,13]],"date-time":"2022-12-13T00:00:00Z","timestamp":1670889600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/100000062","name":"National Institute of Diabetes and Digestive and Kidney Diseases","doi-asserted-by":"publisher","award":["U24DK097771"],"award-info":[{"award-number":["U24DK097771"]}],"id":[{"id":"10.13039\/100000062","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,1,19]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Type 1 diabetes (T1D) outcome prediction plays a vital role in identifying novel risk factors, ensuring early patient care and designing cohort studies. TEDDY is a longitudinal cohort study that collects a vast amount of multi-omics and clinical data from its participants to explore the progression and markers of T1D. However, missing data in the omics profiles make the outcome prediction a difficult task. TEDDY collected time series gene expression for less than 6% of enrolled participants. Additionally, for the participants whose gene expressions are collected, 79% time steps are missing. This study introduces an advanced bioinformatics framework for gene expression imputation and islet autoimmunity (IA) prediction. The imputation model generates synthetic data for participants with partially or entirely missing gene expression. The prediction model integrates the synthetic gene expression with other risk factors to achieve better predictive performance. Comprehensive experiments on TEDDY datasets show that: (1) Our pipeline can effectively integrate synthetic gene expression with family history, HLA genotype and SNPs to better predict IA status at 2 years (sensitivity 0.622, AUC 0.715) compared with the individual datasets and state-of-the-art results in the literature (AUC 0.682). (2) The synthetic gene expression contains predictive signals as strong as the true gene expression, reducing reliance on expensive and long-term longitudinal data collection. (3) Time series gene expression is crucial to the proposed improvement and shows significantly better predictive ability than cross-sectional gene expression. (4) Our pipeline is robust to limited data availability. Availability: Code is available at https:\/\/github.com\/compbiolabucf\/TEDDY<\/jats:p>","DOI":"10.1093\/bib\/bbac537","type":"journal-article","created":{"date-parts":[[2022,12,14]],"date-time":"2022-12-14T00:46:37Z","timestamp":1670978797000},"source":"Crossref","is-referenced-by-count":4,"title":["Incomplete time-series gene expression in integrative study for islet autoimmunity prediction"],"prefix":"10.1093","volume":"24","author":[{"given":"Khandakar","family":"Tanvir Ahmed","sequence":"first","affiliation":[{"name":"University of Central Florida Department of Computer Science, , Orlando, FL 32816 , USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sze","family":"Cheng","sequence":"additional","affiliation":[{"name":"University of Minnesota Twin Cities Department of Biochemistry, Molecular Biology and Biophysics, , Minneapolis, MN 55455 , USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Qian","family":"Li","sequence":"additional","affiliation":[{"name":"St. Jude Children\u2019s Research Hospital Department of Biostatistics, , Memphis, TN 38105 , USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jeongsik","family":"Yong","sequence":"additional","affiliation":[{"name":"University of Minnesota Twin Cities Department of Biochemistry, Molecular Biology and Biophysics, , Minneapolis, MN 55455 , USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wei","family":"Zhang","sequence":"additional","affiliation":[{"name":"University of Central Florida Department of Computer Science, , Orlando, FL 32816 , USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2022,12,13]]},"reference":[{"issue":"3","key":"2023011917140140500_ref1","doi-asserted-by":"crossref","first-page":"241","DOI":"10.1016\/0895-4356(90)90005-A","article-title":"The individual over time: time series applications in health care research","volume":"43","author":"Crabtree","year":"1990","journal-title":"J Clin Epidemiol"},{"issue":"3","key":"2023011917140140500_ref2","doi-asserted-by":"crossref","first-page":"c214","DOI":"10.1159\/000235241","article-title":"Cohort studies: prospective versus retrospective","volume":"113","author":"Euser","year":"2009","journal-title":"Nephron Clin Pract"},{"key":"2023011917140140500_ref3","article-title":"Prospective cohort studies in medical research","author":"Hammoudeh","year":"2018","journal-title":"IntechOpen"},{"key":"2023011917140140500_ref4","first-page":"1651","volume-title":"International conference on artificial intelligence and statistics","author":"Fortuin","year":"2020"},{"key":"2023011917140140500_ref5","doi-asserted-by":"crossref","first-page":"2621","DOI":"10.1109\/SMC42975.2020.9283191","volume-title":"2020 IEEE International Conference on Systems, Man, and Cybernetics (SMC)","author":"Saad","year":"2020"},{"issue":"1","key":"2023011917140140500_ref6","doi-asserted-by":"crossref","first-page":"78","DOI":"10.1007\/s40484-019-0192-7","article-title":"Imputation of single-cell gene expression with an autoencoder neural network","volume":"8","author":"Badsha","year":"2020","journal-title":"Quantitative Biology"},{"issue":"15","key":"2023011917140140500_ref7","doi-asserted-by":"crossref","first-page":"e85","DOI":"10.1093\/nar\/gkaa506","article-title":"scIGANs: single-cell RNA-seq imputation using generative adversarial networks","volume":"48","author":"Yungang","year":"2020","journal-title":"Nucleic Acids Res"},{"key":"2023011917140140500_ref8","doi-asserted-by":"crossref","first-page":"489","DOI":"10.3389\/fgene.2021.624128","article-title":"Deep Learning Enables Fast and Accurate Imputation of Gene Expression","volume":"12","author":"Vi\u00f1as","year":"2021","journal-title":"Front Genet"},{"key":"2023011917140140500_ref9","doi-asserted-by":"crossref","DOI":"10.3389\/fgene.2020.570255","article-title":"A review of integrative imputation for multi-omics datasets","volume":"11","author":"Song","year":"2020","journal-title":"Front Genet"},{"issue":"1","key":"2023011917140140500_ref10","doi-asserted-by":"crossref","first-page":"18","DOI":"10.2174\/1574893608999140109120957","article-title":"A review on missing value imputation algorithms for microarray gene expression data","volume":"9","author":"Moorthy","year":"2014","journal-title":"Current Bioinformatics"},{"issue":"7","key":"2023011917140140500_ref11","doi-asserted-by":"crossref","DOI":"10.1093\/gigascience\/giaa076","article-title":"Imputing missing RNA-sequencing data from DNA methylation by using a transfer learning\u2013based neural network","volume":"9","author":"Zhou","year":"2020","journal-title":"GigaScience"},{"issue":"1","key":"2023011917140140500_ref12","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12859-016-1273-5","article-title":"Handling missing rows in multi-omics data integration: multiple imputation in multiple factor analysis framework","volume":"17","author":"Voillet","year":"2016","journal-title":"BMC bioinformatics"},{"issue":"1","key":"2023011917140140500_ref13","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12859-016-1122-6","article-title":"An integrative imputation method based on multi-omics datasets","volume":"17","author":"Lin","year":"2016","journal-title":"BMC bioinformatics"},{"key":"2023011917140140500_ref14","doi-asserted-by":"crossref","first-page":"255","DOI":"10.1007\/978-1-4939-9442-7_12","article-title":"Missing-values imputation algorithms for microarray gene expression data","volume":"1986","author":"Moorthy","year":"2019","journal-title":"Microarray Bioinformatics"},{"issue":"1","key":"2023011917140140500_ref15","doi-asserted-by":"crossref","first-page":"131","DOI":"10.1109\/TITB.2008.2007421","article-title":"Autoregressive-model-based missing value estimation for DNA microarray time series data","volume":"13","author":"Choong","year":"2009","journal-title":"IEEE Trans Inf Technol Biomed"},{"key":"2023011917140140500_ref16","article-title":"Multivariate time series imputation with generative adversarial networks","volume":"31","author":"Luo","year":"2018","journal-title":"Advances in neural information processing systems"},{"issue":"1","key":"2023011917140140500_ref17","doi-asserted-by":"crossref","DOI":"10.1002\/met.1873","article-title":"Missing data imputation of high-resolution temporal climate time series data","volume":"27","author":"Afrifa-Yamoah","year":"2020","journal-title":"Meteorological Applications"},{"key":"2023011917140140500_ref18","article-title":"Brits: Bidirectional recurrent imputation for time series","volume":"31","author":"Cao","year":"2018","journal-title":"Advances in neural information processing systems"},{"issue":"5","key":"2023011917140140500_ref19","doi-asserted-by":"crossref","first-page":"1477","DOI":"10.1109\/TBME.2018.2874712","article-title":"Estimating missing data in temporal data streams using multi-directional recurrent neural networks","volume":"66","author":"Yoon","year":"2018","journal-title":"IEEE Transactions on Biomedical Engineering"},{"key":"2023011917140140500_ref20","volume-title":"The environmental determinants of diabetes in the young (TEDDY) study","author":"Teddy"},{"issue":"4","key":"2023011917140140500_ref21","doi-asserted-by":"crossref","first-page":"99","DOI":"10.1297\/cpe.23.99","article-title":"Type 1 diabetes and autoimmunity","volume":"23","author":"Kawasaki","year":"2014","journal-title":"Clinical pediatric endocrinology"},{"issue":"6","key":"2023011917140140500_ref22","doi-asserted-by":"crossref","first-page":"1051","DOI":"10.2337\/dc18-2282","article-title":"Predicting islet cell autoimmunity and type 1 diabetes: an 8-year TEDDY study progress report","volume":"42","author":"Krischer","year":"2019","journal-title":"Diabetes Care"},{"issue":"2","key":"2023011917140140500_ref23","doi-asserted-by":"crossref","first-page":"143","DOI":"10.1111\/1753-0407.13093","article-title":"Prediction of the development of islet autoantibodies through integration of environmental, genetic, and metabolic markers","volume":"13","author":"Webb-Robertson","year":"2021","journal-title":"J Diabetes"},{"issue":"9","key":"2023011917140140500_ref24","doi-asserted-by":"crossref","first-page":"3268","DOI":"10.2337\/db13-0159","article-title":"Cord serum lipidome in prediction of islet autoimmunity and type 1 diabetes","volume":"62","author":"Ore\u0161i\u010d","year":"2013","journal-title":"Diabetes"},{"issue":"12","key":"2023011917140140500_ref25","doi-asserted-by":"crossref","first-page":"2521","DOI":"10.1007\/s00125-014-3362-1","article-title":"Feature ranking of type 1 diabetes susceptibility genes improves prediction of type 1 diabetes","volume":"57","author":"Winkler","year":"2014","journal-title":"Diabetologia"},{"issue":"3","key":"2023011917140140500_ref26","doi-asserted-by":"crossref","first-page":"337","DOI":"10.2337\/dc15-1111","article-title":"A type 1 diabetes genetic risk score can aid discrimination between type 1 and type 2 diabetes in young adults","volume":"39","author":"Oram","year":"2016","journal-title":"Diabetes Care"},{"issue":"9","key":"2023011917140140500_ref27","doi-asserted-by":"crossref","first-page":"602","DOI":"10.1136\/jmedgenet-2018-105532","article-title":"Progression from islet autoimmunity to clinical type 1 diabetes is influenced by genetic factors: results from the prospective TEDDY study","volume":"56","author":"Beyerlein","year":"2019","journal-title":"J Med Genet"},{"issue":"4","key":"2023011917140140500_ref28","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pmed.1002548","article-title":"Genetic scores to stratify risk of developing multiple islet autoantibodies and type 1 diabetes: a prospective study in children","volume":"15","author":"Bonifacio","year":"2018","journal-title":"PLoS Med"},{"issue":"4","key":"2023011917140140500_ref29","doi-asserted-by":"crossref","first-page":"847","DOI":"10.2337\/db18-0882","article-title":"Genetic contribution to the divergence in type 1 diabetes risk between children from the general population and children from affected families","volume":"68","author":"Hippich","year":"2019","journal-title":"Diabetes"},{"issue":"11","key":"2023011917140140500_ref30","doi-asserted-by":"crossref","first-page":"2188","DOI":"10.2337\/dc08-0935","article-title":"Glucose and C-peptide changes in the perionset period of type 1 diabetes in the Diabetes Prevention Trial\u2013Type 1","volume":"31","author":"Sosenko","year":"2008","journal-title":"Diabetes Care"},{"issue":"9","key":"2023011917140140500_ref31","doi-asserted-by":"crossref","first-page":"1887","DOI":"10.2337\/dc18-0087","article-title":"A type 1 diabetes genetic risk score predicts progression of islet autoimmunity and development of type 1 diabetes in individuals at risk","volume":"41","author":"Redondo","year":"2018","journal-title":"Diabetes Care"},{"issue":"8","key":"2023011917140140500_ref32","doi-asserted-by":"crossref","first-page":"1247","DOI":"10.1038\/s41591-020-0930-4","article-title":"A combined risk score enhances prediction of type 1 diabetes among susceptible children","volume":"26","author":"Ferrat","year":"2020","journal-title":"Nat Med"},{"key":"2023011917140140500_ref33","article-title":"A paradigm for class prediction using gene expression profiles","author":"Radmacher","journal-title":"J Comput Biol"},{"key":"2023011917140140500_ref34","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1016\/j.ymeth.2019.02.009","article-title":"Deep-Resp-Forest: A deep forest model to predict anti-cancer drug response","volume":"166","author":"Ran","year":"2019","journal-title":"Methods"},{"issue":"1","key":"2023011917140140500_ref35","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41598-018-19635-0","article-title":"Robust phenotype prediction from gene expression data using differential shrinkage of co-regulated genes","volume":"8","author":"Zarringhalam","year":"2018","journal-title":"Sci Rep"},{"issue":"6","key":"2023011917140140500_ref36","doi-asserted-by":"crossref","DOI":"10.1093\/bib\/bbab264","article-title":"In silico model for miRNA-mediated regulatory network in cancer","volume":"22","author":"Ahmed","year":"2021","journal-title":"Brief Bioinform"},{"issue":"14","key":"2023011917140140500_ref37","doi-asserted-by":"crossref","first-page":"i501","DOI":"10.1093\/bioinformatics\/btz318","article-title":"MOLI: multi-omics late integration with deep neural networks for drug response prediction","volume":"35","author":"Sharifi-Noghabi","year":"2019","journal-title":"Bioinformatics"},{"issue":"14","key":"2023011917140140500_ref38","doi-asserted-by":"crossref","first-page":"2441","DOI":"10.1093\/bioinformatics\/bty148","article-title":"Network-based integration of multi-omics data for prioritizing cancer genes","volume":"34","author":"Dimitrakopoulos","year":"2018","journal-title":"Bioinformatics"},{"issue":"17","key":"2023011917140140500_ref39","doi-asserted-by":"crossref","first-page":"3055","DOI":"10.1093\/bioinformatics\/bty1054","article-title":"DIABLO: an integrative approach for identifying key molecular drivers from multi-omics assays","volume":"35","author":"Singh","year":"2019","journal-title":"Bioinformatics"},{"issue":"6","key":"2023011917140140500_ref40","doi-asserted-by":"crossref","first-page":"1248","DOI":"10.1158\/1078-0432.CCR-17-0853","article-title":"Deep learning\u2013based multi-omics integration robustly predicts survival in liver cancer","volume":"24","author":"Chaudhary","year":"2018","journal-title":"Clin Cancer Res"},{"issue":"1","key":"2023011917140140500_ref41","doi-asserted-by":"crossref","first-page":"179","DOI":"10.1093\/bioinformatics\/btab608","article-title":"Multi-omics data integration by generative adversarial network","volume":"38","author":"Ahmed","year":"2021","journal-title":"Bioinformatics"},{"issue":"587","key":"2023011917140140500_ref42","doi-asserted-by":"crossref","DOI":"10.1126\/scitranslmed.abd5666","article-title":"Transcriptional networks in at-risk individuals identify signatures of type 1 diabetes progression","volume":"13","author":"Xhonneux","year":"2021","journal-title":"Sci Transl Med"},{"key":"2023011917140140500_ref43","first-page":"1150","article-title":"The environmental determinants of diabetes in the young (TEDDY) study","volume":"1","author":"TEDDY Study Group","year":"2008","journal-title":"Ann N Y Acad Sci"},{"issue":"3","key":"2023011917140140500_ref44","doi-asserted-by":"crossref","first-page":"263","DOI":"10.1111\/pedi.12812","article-title":"Predicting progression to type 1 diabetes from ages 3 to 6 in islet autoantibody positive TEDDY children","volume":"20","author":"Jacobsen","year":"2019","journal-title":"Pediatr Diabetes"},{"issue":"3","key":"2023011917140140500_ref45","doi-asserted-by":"crossref","first-page":"465","DOI":"10.2337\/db19-0756","article-title":"Longitudinal metabolome-wide signals prior to the appearance of a first islet autoantibody in children participating in the TEDDY study","volume":"69","author":"Li","year":"2020","journal-title":"Diabetes"},{"issue":"5","key":"2023011917140140500_ref46","doi-asserted-by":"crossref","first-page":"808","DOI":"10.2337\/dc14-2426","article-title":"Predictors of progression from the appearance of islet autoantibodies to early childhood diabetes: The Environmental Determinants of Diabetes in the Young (TEDDY)","volume":"38","author":"Steck","year":"2015","journal-title":"Diabetes Care"},{"key":"2023011917140140500_ref47","doi-asserted-by":"crossref","first-page":"70","DOI":"10.1016\/j.neucom.2017.11.077","article-title":"Feature selection in machine learning: A new perspective","volume":"300","author":"Cai","year":"2018","journal-title":"Neurocomputing"},{"issue":"3","key":"2023011917140140500_ref48","doi-asserted-by":"crossref","first-page":"993","DOI":"10.1016\/j.ejor.2017.08.040","article-title":"High dimensional data classification and feature selection using support vector machines","volume":"265","author":"Ghaddar","year":"2018","journal-title":"European Journal of Operational Research"},{"key":"2023011917140140500_ref49","first-page":"2825","article-title":"Scikit-learn: Machine Learning in Python","volume":"12","author":"Pedregosa","year":"2011","journal-title":"Journal of Machine Learning Research"},{"key":"2023011917140140500_ref50","first-page":"8026","article-title":"Pytorch: An imperative style, high-performance deep learning library","volume":"32","author":"Paszke","year":"2019","journal-title":"Advances in neural information processing systems"},{"issue":"8","key":"2023011917140140500_ref51","doi-asserted-by":"crossref","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","article-title":"Long short-term memory","volume":"9","author":"Hochreiter","year":"1997","journal-title":"Neural Comput"},{"issue":"5","key":"2023011917140140500_ref52","doi-asserted-by":"crossref","first-page":"1017","DOI":"10.2337\/dc17-2335","article-title":"Racial\/ethnic minority youth with recent-onset type 1 diabetes have poor prognostic factors","volume":"41","author":"Redondo","year":"2018","journal-title":"Diabetes Care"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/24\/1\/bbac537\/48782134\/bbac537.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/24\/1\/bbac537\/48782134\/bbac537.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,19]],"date-time":"2023-01-19T17:43:51Z","timestamp":1674150231000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbac537\/6895461"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,12,13]]},"references-count":52,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2023,1,19]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbac537","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2023,1]]},"published":{"date-parts":[[2022,12,13]]},"article-number":"bbac537"}}