{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,13]],"date-time":"2026-06-13T04:57:50Z","timestamp":1781326670064,"version":"3.54.1"},"reference-count":72,"publisher":"Association for Computing Machinery (ACM)","issue":"4","funder":[{"DOI":"10.13039\/501100004836","name":"Danmarks Frie Forskningsfond","doi-asserted-by":"publisher","award":["0171-00061B & 0171-00062B"],"award-info":[{"award-number":["0171-00061B & 0171-00062B"]}],"id":[{"id":"10.13039\/501100004836","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Proc. ACM Manag. Data"],"published-print":{"date-parts":[[2025,9,22]]},"abstract":"<jats:p>Training deep learning models is a repetitive and resource-intensive process. Data scientists often train several models before landing on a set of parameters (e.g., hyper-parameter tuning) and model architecture (e.g., neural architecture search), among other things that yield the highest accuracy. The computational efficiency of these training tasks depends highly on how well the training data is supplied to the training process. The repetitive nature of these tasks results in the same data processing pipelines running over and over, exacerbating the need for and costs of computational resources.<\/jats:p>\n                  <jats:p>\n                    In this paper, we present T\n                    <jats:sc>ensor<\/jats:sc>\n                    S\n                    <jats:sc>ocket<\/jats:sc>\n                    to reduce the computational needs of deep learning training by enabling simultaneous training processes to share the same data loader. T\n                    <jats:sc>ensor<\/jats:sc>\n                    S\n                    <jats:sc>ocket<\/jats:sc>\n                    mitigates CPU-side bottlenecks in cases where the collocated training workloads have high throughput on GPU, but are held back by lower data-loading throughput on CPU. T\n                    <jats:sc>ensor<\/jats:sc>\n                    S\n                    <jats:sc>ocket<\/jats:sc>\n                    achieves this by reducing redundant computations and data duplication across collocated training processes and leveraging modern GPU-GPU interconnects. While doing so, T\n                    <jats:sc>ensor<\/jats:sc>\n                    S\n                    <jats:sc>ocket<\/jats:sc>\n                    is able to train and balance differently-sized models and serve multiple batch sizes simultaneously and is hardware- and pipeline-agnostic in nature.\n                  <\/jats:p>\n                  <jats:p>\n                    Our evaluation shows that T\n                    <jats:sc>ensor<\/jats:sc>\n                    S\n                    <jats:sc>ocket<\/jats:sc>\n                    enables scenarios that are infeasible without data sharing, increases training throughput by up to 100%, and when utilizing cloud instances, achieves cost savings of 50% by reducing the hardware resource needs on the CPU side. Furthermore, T\n                    <jats:sc>ensor<\/jats:sc>\n                    S\n                    <jats:sc>ocket<\/jats:sc>\n                    outperforms the state-of-the-art solutions for shared data loading such as CoorDL and Joader; it is easier to deploy and maintain and either achieves higher or matches their throughput while requiring fewer CPU resources.\n                  <\/jats:p>","DOI":"10.1145\/3749185","type":"journal-article","created":{"date-parts":[[2025,9,23]],"date-time":"2025-09-23T17:17:03Z","timestamp":1758647823000},"page":"1-26","source":"Crossref","is-referenced-by-count":4,"title":["TensorSocket: Shared Data Loading for Deep Learning Training"],"prefix":"10.1145","volume":"3","author":[{"ORCID":"https:\/\/orcid.org\/0009-0006-3451-5602","authenticated-orcid":false,"given":"Ties","family":"Robroek","sequence":"first","affiliation":[{"name":"IT University of Copenhagen, Copenhagen, Denmark"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0000-8159-4346","authenticated-orcid":false,"given":"Neil Kim","family":"Nielsen","sequence":"additional","affiliation":[{"name":"IT University of Copenhagen, Copenhagen, Denmark"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6838-4854","authenticated-orcid":false,"given":"P\u0131nar","family":"T\u00f6z\u00fcn","sequence":"additional","affiliation":[{"name":"IT University of Copenhagen, Copenhagen, Denmark"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2025,9,23]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"Amazon. 2023a. https:\/\/aws.amazon.com\/ec2\/. Accessed: 2023-11-30."},{"key":"e_1_2_1_2_1","unstructured":"Amazon. 2023b. https:\/\/aws.amazon.com\/ec2\/pricing\/on-demand\/. Accessed: 2024-01-22."},{"key":"e_1_2_1_3_1","volume-title":"Carbontracker: Tracking and Predicting the Carbon Footprint of Training Deep Learning Models. ICML Workshop on Challenges in Deploying and monitoring Machine Learning Systems. arXiv:2007","author":"Wolff Anthony Lasse F.","year":"2020","unstructured":"Lasse F. Wolff Anthony, Benjamin Kanding, and Raghavendra Selvan. 2020. Carbontracker: Tracking and Predicting the Carbon Footprint of Training Deep Learning Models. ICML Workshop on Challenges in Deploying and monitoring Machine Learning Systems. arXiv:2007.03051."},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/3620678.3624666"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/IISWC63097.2024.00013"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/3572751.3572765"},{"key":"e_1_2_1_7_1","volume-title":"Training for Speech Recognition on Coprocessors. In International Workshop on Accelerating Analytics and Data Management Systems Using Modern Processor and Storage Architectures, ADMS@VLDB 2020","author":"Baunsgaard Sebastian","year":"2020","unstructured":"Sebastian Baunsgaard, Sebastian Benjamin Wrede, and P\u0131nar T\u00f6z\u00fcn. 2020. Training for Speech Recognition on Coprocessors. In International Workshop on Accelerating Analytics and Data Management Systems Using Modern Processor and Storage Architectures, ADMS@VLDB 2020, Tokyo, Japan, August 31, 2020, Rajesh Bordawekar and Tirthankar Lahiri (Eds.). Association for Computing Machinery, Japan, 1-10."},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE55515.2023.00077"},{"key":"e_1_2_1_9_1","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, USA, 113-123","author":"Cubuk Ekin D.","unstructured":"Ekin D. Cubuk, Barret Zoph, Dandelion Mane, Vijay Vasudevan, and Quoc V. Le. 2019. AutoAugment: Learning Augmentation Strategies From Data. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, USA, 113-123."},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/2541940.2541941"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/3531146.3533234"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/3642970.3655830"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.14778\/2168651.2168654"},{"key":"e_1_2_1_15_1","unstructured":"Google. 2023. https:\/\/cloud.google.com\/compute?hl=en. Accessed: 2023-11-30."},{"key":"e_1_2_1_16_1","first-page":"689","volume-title":"2022 USENIX Annual Technical Conference (USENIX ATC 22)","author":"Graur Dan","year":"2022","unstructured":"Dan Graur, Damien Aymon, Dan Kluser, Tanguy Albrici, Chandramohan A. Thekkath, and Ana Klimovic. 2022. Cachew: Machine Learning Input Data Processing as a Service. In 2022 USENIX Annual Technical Conference (USENIX ATC 22). USENIX Association, Carlsbad, CA, 689-706. https:\/\/www.usenix.org\/conference\/atc22\/presentation\/graur"},{"key":"e_1_2_1_17_1","first-page":"649","volume-title":"Proceedings of the 2024 USENIX Annual Technical Conference, USENIX ATC 2024","author":"Graur Dan","year":"2024","unstructured":"Dan Graur, Oto Mraz, Muyu Li, Mohammad Sepehr Pourghannad, Chandramohan A. Thekkath, and Ana Klimovic. 2024. Pecan: Cost-Efficient ML Data Preprocessing with Automatic Transformation Ordering and Hybrid Placement. In Proceedings of the 2024 USENIX Annual Technical Conference, USENIX ATC 2024, Santa Clara, CA, USA, July 10-12, 2024, Saurabh Bagchi and Yiying Zhang (Eds.). USENIX Association, USA, 649-665. https:\/\/www.usenix.org\/conference\/atc24\/presentation\/graur"},{"key":"e_1_2_1_18_1","first-page":"11","article-title":"StagedDB: Designing Database Servers for Modern Hardware","volume":"28","author":"Harizopoulos Stavros","year":"2005","unstructured":"Stavros Harizopoulos and Anastassia Ailamaki. 2005. StagedDB: Designing Database Servers for Modern Hardware. IEEE Data Eng. Bull., Vol. 28, 2 (2005), 11-16. http:\/\/sites.computer.org\/debull\/A05june\/stavros.ps","journal-title":"IEEE Data Eng. Bull."},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/1066157.1066201"},{"key":"e_1_2_1_20_1","unstructured":"iostat. 2024. https:\/\/linux.die.net\/man\/1\/iostat"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/3514221.3517848"},{"key":"e_1_2_1_22_1","first-page":"947","volume-title":"Analysis of Large-Scale Multi-Tenant GPU Clusters for DNN Training Workloads. In 2019 USENIX Annual Technical Conference (USENIX ATC 19)","author":"Jeon Myeongjae","year":"2019","unstructured":"Myeongjae Jeon, Shivaram Venkataraman, Amar Phanishayee, Junjie Qian, Wencong Xiao, and Fan Yang. 2019. Analysis of Large-Scale Multi-Tenant GPU Clusters for DNN Training Workloads. In 2019 USENIX Annual Technical Conference (USENIX ATC 19). USENIX, USA, 947-960."},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.5555\/1325851.1325894"},{"key":"e_1_2_1_24_1","volume-title":"The Case for Unifying Data Loading in Machine Learning Clusters. In 11th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud 19)","author":"Kakaraparthy Aarati","year":"2019","unstructured":"Aarati Kakaraparthy, Abhay Venkatesh, Amar Phanishayee, and Shivaram Venkataraman. 2019. The Case for Unifying Data Loading in Machine Learning Clusters. In 11th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud 19). USENIX Association, Renton, WA, 12. https:\/\/www.usenix.org\/conference\/hotcloud19\/presentation\/kakaraparthy"},{"key":"e_1_2_1_25_1","unstructured":"kernel.org 2021. top(1) - Linux manual page. kernel.org. https:\/\/man7.org\/linux\/man-pages\/man1\/top.1.html."},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.14778\/3636218.3636238"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.14778\/3342263.3342276"},{"key":"e_1_2_1_28_1","first-page":"33","article-title":"Plumber: Diagnosing and removing performance bottlenecks in machine learning data pipelines","volume":"4","author":"Kuchnik Michael","year":"2022","unstructured":"Michael Kuchnik, Ana Klimovic, Jiri Simsa, Virginia Smith, and George Amvrosiadis. 2022. Plumber: Diagnosing and removing performance bottlenecks in machine learning data pipelines. Proceedings of Machine Learning and Systems, Vol. 4 (2022), 33-51. https:\/\/proceedings.mlsys.org\/paper_files\/paper\/2022\/hash\/d0e90e9a9310570dfa643aa3b2da6e89-Abstract.html","journal-title":"Proceedings of Machine Learning and Systems"},{"key":"e_1_2_1_29_1","first-page":"230","volume-title":"Proceedings of Machine Learning and Systems, I. Dhillon, D. Papailiopoulos, and V. Sze (Eds.)","volume":"2","author":"Li Liam","year":"2020","unstructured":"Liam Li, Kevin Jamieson, Afshin Rostamizadeh, Ekaterina Gonina, Jonathan Ben-tzur, Moritz Hardt, Benjamin Recht, and Ameet Talwalkar. 2020. A System for Massively Parallel Hyperparameter Tuning. In Proceedings of Machine Learning and Systems, I. Dhillon, D. Papailiopoulos, and V. Sze (Eds.), Vol. 2. MLSys, USA, 230-246."},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01246-5_2"},{"key":"e_1_2_1_31_1","unstructured":"LMPerf. 2023. https:\/\/mlcommons.org\/benchmarks\/storage\/."},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/3592980.3595314"},{"key":"e_1_2_1_33_1","unstructured":"Microsoft Azure. 2023. https:\/\/azure.microsoft.com\/en-us\/products\/virtual-machines. Accessed: 2023-11-30."},{"key":"e_1_2_1_34_1","unstructured":"Microsoft Research. 2024. https:\/\/github.com\/msr-fiddle\/CoorDL\/tree\/master. Accessed: 2024-07-10."},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.14778\/3446095.3446100"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.14778\/3476311.3476374"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.14778\/3407790.3407816"},{"key":"e_1_2_1_38_1","first-page":"1","article-title":"DICER: Diligent Cache Partitioning for Efficient Workload Consolidation. In ICPP. Association for Computing Machinery","volume":"15","author":"Nikas Konstantinos","year":"2019","unstructured":"Konstantinos Nikas, Nikela Papadopoulou, Dimitra Giantsidi, Vasileios Karakostas, Georgios Goumas, and Nectarios Koziris. 2019. DICER: Diligent Cache Partitioning for Efficient Workload Consolidation. In ICPP. Association for Computing Machinery, Japan, 15:1-15:10.","journal-title":"Japan"},{"key":"e_1_2_1_39_1","unstructured":"NVIDIA. 2012. https:\/\/developer.nvidia.com\/nvidia-system-management-interface"},{"key":"e_1_2_1_40_1","unstructured":"NVIDIA. 2022. Data Center GPU Manager Documentation. Technical Report. NVIDIA. https:\/\/docs.nvidia.com\/datacenter\/dcgm\/latest\/dcgm-user-guide\/."},{"key":"e_1_2_1_41_1","unstructured":"NVIDIA. 2023. https:\/\/github.com\/NVIDIA\/DALI. Accessed: 2023-05-19."},{"key":"e_1_2_1_42_1","unstructured":"NVIDIA. 2023. https:\/\/docs.nvidia.com\/deploy\/mps\/index.html. Accessed: 2023-05-27."},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2015.7178964"},{"key":"e_1_2_1_44_1","first-page":"8024","article-title":"PyTorch: An Imperative Style, High-Performance Deep Learning Library. In Advances in Neural Information Processing Systems 32. Curran Associates, Inc","author":"Paszke Adam","year":"2019","unstructured":"Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, Alban Desmaison, Andreas Kopf, Edward Yang, Zachary DeVito, Martin Raison, Alykhan Tejani, Sasank Chilamkurthy, Benoit Steiner, Lu Fang, Junjie Bai, and Soumith Chintala. 2019. PyTorch: An Imperative Style, High-Performance Deep Learning Library. In Advances in Neural Information Processing Systems 32. Curran Associates, Inc., Canada, 8024-8035. http:\/\/papers.neurips.cc\/paper\/9015-pytorch-an-imperative-style-high-performance-deep-learning-library.pdf","journal-title":"Canada"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/MC.2022.3148714"},{"key":"e_1_2_1_46_1","first-page":"1","article-title":"Tunability: Importance of hyperparameters of machine learning algorithms","volume":"20","author":"Probst Philipp","year":"2019","unstructured":"Philipp Probst, Anne-Laure Boulesteix, and Bernd Bischl. 2019. Tunability: Importance of hyperparameters of machine learning algorithms. Journal of Machine Learning Research, Vol. 20, 1 (jan 2019), 1934-1965.","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.14778\/2536360.2536364"},{"key":"e_1_2_1_48_1","unstructured":"Aditya Ramesh Prafulla Dhariwal Alex Nichol Casey Chu and Mark Chen. 2022. arXiv:2204.06125 [cs.CV]"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1145\/3595360.3595851"},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1145\/3642970.3655827"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1145\/3381831"},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P18-1238"},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","unstructured":"Janne Spijkervet and John Ashley Burgoyne. 2021 673-681 pages. https:\/\/doi.org\/10.5281\/zenodo.5624573","DOI":"10.5281\/zenodo.5624573"},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1145\/3627703.3629578"},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.18653\/V1\/P19-1355"},{"key":"e_1_2_1_56_1","volume-title":"Hashimoto","author":"Taori Rohan","year":"2023","unstructured":"Rohan Taori, Ishaan Gulrajani, Tianyi Zhang, Yann Dubois, Xuechen Li, Carlos Guestrin, Percy Liang, and Tatsunori B. Hashimoto. 2023. https:\/\/github.com\/tatsu-lab\/stanford_alpaca."},{"key":"e_1_2_1_57_1","unstructured":"torchtune maintainers and contributors. 2024. torchtune: PyTorch's finetuning library. PyTorch. https\/\/github.com\/pytorch\/torchtune"},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.14778\/2733085.2733095"},{"key":"e_1_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.14778\/3579075.3579083"},{"key":"e_1_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.1145\/3715275.3732006"},{"key":"e_1_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.1145\/3448016.3457286"},{"key":"e_1_2_1_62_1","unstructured":"Phil Wang. 2022. https:\/\/github.com\/lucidrains\/DALLE2-pytorch."},{"key":"e_1_2_1_63_1","first-page":"599","article-title":"Horizontally Fused Training Array: An Effective Hardware Utilization Squeezer for Training Novel Deep Learning Models","volume":"3","author":"Wang Shang","year":"2021","unstructured":"Shang Wang, Peiming Yang, Yuxuan Zheng, Xin Li, and Gennady Pekhimenko. 2021. Horizontally Fused Training Array: An Effective Hardware Utilization Squeezer for Training Novel Deep Learning Models. Proceedings of Machine Learning and Systems, Vol. 3 (2021), 599-623.","journal-title":"Proceedings of Machine Learning and Systems"},{"key":"e_1_2_1_64_1","first-page":"945","volume-title":"MLaaS in the Wild: Workload Analysis and Scheduling in Large-Scale Heterogeneous GPU Clusters. In 19th USENIX Symposium on Networked Systems Design and Implementation (NSDI 22)","author":"Weng Qizhen","year":"2022","unstructured":"Qizhen Weng, Wencong Xiao, Yinghao Yu, Wei Wang, Cheng Wang, Jian He, Yong Li, Liping Zhang, Wei Lin, and Yu Ding. 2022. MLaaS in the Wild: Workload Analysis and Scheduling in Large-Scale Heterogeneous GPU Clusters. In 19th USENIX Symposium on Networked Systems Design and Implementation (NSDI 22). USENIX Association, USA, 945-960. https:\/\/www.usenix.org\/conference\/nsdi22\/presentation\/weng"},{"key":"e_1_2_1_65_1","doi-asserted-by":"publisher","unstructured":"Ross Wightman. 2019. https:\/\/github.com\/rwightman\/pytorch-image-models. https:\/\/doi.org\/10.5281\/zenodo.4414861","DOI":"10.5281\/zenodo.4414861"},{"key":"e_1_2_1_66_1","first-page":"17146","article-title":"A Deep Learning Dataloader, with Shared Data Preparation","volume":"35","author":"Xu Jingwei","year":"2022","unstructured":"Jingwei Xu, Guochang Wang, Yuan Yao, Zenan Li, Chun Cao, and Hanghang Tong. 2022. A Deep Learning Dataloader, with Shared Data Preparation. Advances in Neural Information Processing Systems, Vol. 35 (2022), 17146-17156.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_2_1_67_1","unstructured":"Jingwei Xu Guochang Wang Yuan Yao Zenan Li Chun Cao and Hanghang Tong. 2024. https:\/\/github.com\/XieJiann\/Joader. Accessed: 2024-07-10."},{"key":"e_1_2_1_68_1","unstructured":"Qwen: An Yang Baosong Yang Beichen Zhang Binyuan Hui Bo Zheng Bowen Yu Chengyuan Li Dayiheng Liu Fei Huang Haoran Wei Huan Lin Jian Yang Jianhong Tu Jianwei Zhang Jianxin Yang Jiaxi Yang Jingren Zhou Junyang Lin Kai Dang Keming Lu Keqin Bao Kexin Yang Le Yu Mei Li Mingfeng Xue Pei Zhang Qin Zhu Rui Men Runji Lin Tianhao Li Tianyi Tang Tingyu Xia Xingzhang Ren Xuancheng Ren Yang Fan Yang Su Yichang Zhang Yu Wan Yuqiong Liu Zeyu Cui Zhenru Zhang and Zihan Qiu. 2025. Qwen2.5 Technical Report. arXiv:2412.15115 [cs.CL] https:\/\/arxiv.org\/abs\/2412.15115"},{"key":"e_1_2_1_69_1","doi-asserted-by":"publisher","DOI":"10.1145\/3578356.3592589"},{"key":"e_1_2_1_70_1","unstructured":"ZeroMQ. 2023. https:\/\/zeromq.org\/. Accessed: 2023-12-11."},{"key":"e_1_2_1_71_1","doi-asserted-by":"publisher","unstructured":"Mark Zhao Emanuel Adamiak and Christos Kozyrakis. 2024. https:\/\/doi.org\/10.48550\/arXiv.2401.08895 arXiv:2401.08895 [cs].","DOI":"10.48550\/arXiv.2401.08895"},{"key":"e_1_2_1_72_1","doi-asserted-by":"publisher","DOI":"10.1145\/3470496.3533044"}],"container-title":["Proceedings of the ACM on Management of Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3749185","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,6,13]],"date-time":"2026-06-13T04:38:20Z","timestamp":1781325500000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3749185"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,9,22]]},"references-count":72,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2025,9,22]]}},"alternative-id":["10.1145\/3749185"],"URL":"https:\/\/doi.org\/10.1145\/3749185","relation":{},"ISSN":["2836-6573"],"issn-type":[{"value":"2836-6573","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,9,22]]}}}