{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,21]],"date-time":"2025-08-21T16:46:37Z","timestamp":1755794797365,"version":"3.44.0"},"publisher-location":"New York, NY, USA","reference-count":25,"publisher":"ACM","funder":[{"DOI":"10.13039\/https:\/\/doi.org\/10.13039\/100000050","name":"National Heart, Lung, and Blood Institute","doi-asserted-by":"publisher","award":["R01 HL158626"],"award-info":[{"award-number":["R01 HL158626"]}],"id":[{"id":"10.13039\/https:\/\/doi.org\/10.13039\/100000050","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Graduate Fellowship for STEM Diversity"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2025,8,3]]},"DOI":"10.1145\/3711896.3736898","type":"proceedings-article","created":{"date-parts":[[2025,8,1]],"date-time":"2025-08-01T13:30:13Z","timestamp":1754055013000},"page":"1219-1228","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Cross-Validation for Longitudinal Datasets with Unstable Correlations"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-4151-2294","authenticated-orcid":false,"given":"Meera","family":"Krishnamoorthy","sequence":"first","affiliation":[{"name":"Computer Science and Engineering, University of Michigan - Ann Arbor, Ann Arbor, MI, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0535-9659","authenticated-orcid":false,"given":"Michael","family":"Sjoding","sequence":"additional","affiliation":[{"name":"Internal Medicine, University of Michigan - Ann Arbor, Ann Arbor, MI, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1057-7722","authenticated-orcid":false,"given":"Jenna","family":"Wiens","sequence":"additional","affiliation":[{"name":"Computer Science and Engineering, University of Michigan Ann Arbor, Ann Arbor, MI, USA"}]}],"member":"320","published-online":{"date-parts":[[2025,8,3]]},"reference":[{"key":"e_1_3_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1080\/01621459.2023.2197686"},{"key":"e_1_3_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1093\/jamiaopen\/ooaa006"},{"key":"e_1_3_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1056\/NEJMc2104626"},{"key":"e_1_3_2_2_4_1","unstructured":"Mehak Gupta Brennan Gallamoza Nicolas Cutrona Pranjal Dhakal Raphael Poulain and Rahmatollah Beheshti. 2022. An Extensive Data Processing Pipeline for MIMIC-IV. arXiv:2204.13841 [cs.LG] https:\/\/arxiv.org\/abs\/2204.13841"},{"volume-title":"The Elements of Statistical Learning: Data Mining, Inference, and Prediction","author":"Hastie Trevor","key":"e_1_3_2_2_5_1","unstructured":"Trevor Hastie, Robert Tibshirani, and Jerome Friedman. 2009. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer."},{"key":"e_1_3_2_2_6_1","volume-title":"Simple data balancing achieves competitive worst-group-accuracy. CoRR","author":"Idrissi Badr Youbi","year":"2021","unstructured":"Badr Youbi Idrissi, Mart\u00edn Arjovsky, Mohammad Pezeshki, and David Lopez-Paz. 2021. Simple data balancing achieves competitive worst-group-accuracy. CoRR, Vol. abs\/2110.14503 (2021). arXiv:2110.14503 https:\/\/arxiv.org\/abs\/2110.14503"},{"key":"e_1_3_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1038\/s41597-022-01899-x"},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1136\/bmj-2021-068576"},{"key":"e_1_3_2_2_9_1","first-page":"5338","volume-title":"Proceedings of the 37th International Conference on Machine Learning(Proceedings of Machine Learning Research","author":"Koh Pang Wei","year":"2020","unstructured":"Pang Wei Koh, Thao Nguyen, Yew Siang Tang, Stephen Mussmann, Emma Pierson, Been Kim, and Percy Liang. 2020. Concept Bottleneck Models. In Proceedings of the 37th International Conference on Machine Learning(Proceedings of Machine Learning Research, Vol. 119), , Hal Daum\u00e9 III and Aarti Singh(Eds.). PMLR, 5338-5348. https:\/\/proceedings.mlr.press\/v119\/koh20a.html"},{"key":"e_1_3_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11606-019-05008-4"},{"key":"e_1_3_2_2_11_1","volume-title":"Proceedings of the 38th International Conference on Machine Learning(Proceedings of Machine Learning Research","volume":"6792","author":"Liu Evan Z","year":"2021","unstructured":"Evan Z Liu, Behzad Haghgoo, Annie S Chen, Aditi Raghunathan, Pang Wei Koh, Shiori Sagawa, Percy Liang, and Chelsea Finn. 2021. Just Train Twice: Improving Group Robustness without Training Group Information. In Proceedings of the 38th International Conference on Machine Learning(Proceedings of Machine Learning Research, Vol. 139), Marina Meila and Tong Zhang(Eds.). PMLR, 6781-6792. https:\/\/proceedings.mlr.press\/v139\/liu21f.html"},{"key":"e_1_3_2_2_12_1","volume-title":"Nov 2021 Sub (1975-2019) - Linked To County Attributes - Time Dependent (1990-2019) Income\/Rurality","author":"National Cancer Institute. 2022. Surveillance, Epidemiology, and End Results (SEER) Program (www.seer.cancer.gov) SEER*Stat Database","year":"1969","unstructured":"National Cancer Institute. 2022. Surveillance, Epidemiology, and End Results (SEER) Program (www.seer.cancer.gov) SEER*Stat Database: Incidence - SEER Research Data, 8 Registries, Nov 2021 Sub (1975-2019) - Linked To County Attributes - Time Dependent (1990-2019) Income\/Rurality, 1969-2020 Counties, National Cancer Institute, DCCPS, Surveillance Research Program. (2022)."},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.5555\/1953048.2078195"},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1002\/bimj.202200302"},{"key":"e_1_3_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1111\/ecog.02881"},{"key":"e_1_3_2_2_16_1","volume-title":"Epic overhauls popular sepsis algorithm criticized for faulty alarms. STAT(October 3","author":"Ross Casey","year":"2022","unstructured":"Casey Ross. 2022. Epic overhauls popular sepsis algorithm criticized for faulty alarms. STAT(October 3 2022)."},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1038\/s42256-019-0048-x"},{"key":"e_1_3_2_2_18_1","volume-title":"Tatsunori B. Hashimoto, and Percy Liang.","author":"Sagawa Shiori","year":"2019","unstructured":"Shiori Sagawa, Pang Wei Koh, Tatsunori B. Hashimoto, and Percy Liang. 2019. Distributionally Robust Neural Networks for Group Shifts: On the Importance of Regularization for Worst-Case Generalization. CoRR, Vol. abs\/1911.08731 (2019). arXiv:1911.08731 http:\/\/arxiv.org\/abs\/1911.08731"},{"key":"e_1_3_2_2_19_1","volume-title":"Selecting a classification method by cross-validation. Machine learning","author":"Schaffer Cullen","year":"1993","unstructured":"Cullen Schaffer. 1993. Selecting a classification method by cross-validation. Machine learning, Vol. 13 (1993), 135-143."},{"key":"e_1_3_2_2_20_1","article-title":"Covariate shift adaptation by importance weighted cross validation","volume":"8","author":"Sugiyama Masashi","year":"2007","unstructured":"Masashi Sugiyama, Matthias Krauledat, and Klaus-Robert M\u00fcller. 2007. Covariate shift adaptation by importance weighted cross validation. Journal of Machine Learning Research, Vol. 8, 5 (2007).","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_3_2_2_21_1","doi-asserted-by":"publisher","unstructured":"Alex Tsun. 2020. Probability & Statistics with Applications to Computing. https:\/\/doi.org\/10.1080\/01621459.2023.2197686","DOI":"10.1080\/01621459.2023.2197686"},{"key":"e_1_3_2_2_22_1","unstructured":"Jing Wang Laurel Hopkins Tyler Hallman W. Douglas Robinson and Rebecca Hutchinson. 2023. Cross-validation for Geospatial Data: Estimating Generalization Performance in Geostatistical Problems. Transactions on Machine Learning Research(2023). https:\/\/openreview.net\/forum?id=VgJhYu7FmQ"},{"key":"e_1_3_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1001\/jamainternmed.2021.2626"},{"key":"e_1_3_2_2_24_1","unstructured":"Yuzhe Yang Haoran Zhang Dina Katabi and Marzyeh Ghassemi. 2023. Change is Hard: A Closer Look at Subpopulation Shift. arXiv:2302.12254 [cs.LG]"},{"key":"e_1_3_2_2_25_1","volume-title":"Lipton","author":"Zhou Helen","year":"2022","unstructured":"Helen Zhou, Yuwen Chen, and Zachary C. Lipton. 2022. Model Evaluation in Medical Datasets Over Time. arXiv:2211.07165 [cs.LG]"}],"event":{"name":"KDD '25: The 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining","sponsor":["SIGMOD ACM Special Interest Group on Management of Data","SIGKDD ACM Special Interest Group on Knowledge Discovery in Data"],"location":"Toronto ON Canada","acronym":"KDD '25"},"container-title":["Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.2"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3711896.3736898","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,8,16]],"date-time":"2025-08-16T14:25:52Z","timestamp":1755354352000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3711896.3736898"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,8,3]]},"references-count":25,"alternative-id":["10.1145\/3711896.3736898","10.1145\/3711896"],"URL":"https:\/\/doi.org\/10.1145\/3711896.3736898","relation":{},"subject":[],"published":{"date-parts":[[2025,8,3]]},"assertion":[{"value":"2025-08-03","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}