{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,19]],"date-time":"2025-09-19T05:56:39Z","timestamp":1758261399640,"version":"3.44.0"},"reference-count":62,"publisher":"Association for Computing Machinery (ACM)","issue":"3","funder":[{"name":"Natural Science Foundation Project of CQ CSTC","award":["cstc2022NSCQ-MSX0789"],"award-info":[{"award-number":["cstc2022NSCQ-MSX0789"]}]},{"name":"Opening Project of State Key Laboratory for Novel Software Technology","award":["KFKT2024B47"],"award-info":[{"award-number":["KFKT2024B47"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Archit. Code Optim."],"published-print":{"date-parts":[[2025,9,30]]},"abstract":"<jats:p>\n            The computational SSD (\n            <jats:italic toggle=\"yes\">CompSSD<\/jats:italic>\n            ), with high computing capabilities, can function not only as a storage device but also as a computing node. The data cache of the\n            <jats:italic toggle=\"yes\">CompSSD<\/jats:italic>\n            device stores both the output data from host-side tasks and the input data for tasks executed on the\n            <jats:italic toggle=\"yes\">CompSSD<\/jats:italic>\n            . However, current cache management strategies are optimized for traditional SSDs and are incompatible with the unique requirements of\n            <jats:italic toggle=\"yes\">CompSSD<\/jats:italic>\n            . To address the issue of cache management for\n            <jats:italic toggle=\"yes\">CompSSD<\/jats:italic>\n            , this article proposes a novel cache division scheme, to dynamically divide the cache into two parts, for separately buffering output data from host-side tasks and input data used by\n            <jats:italic toggle=\"yes\">CompSSD<\/jats:italic>\n            -side tasks. To this end, we construct a mathematical model that periodically estimate an optimal cache division ratio, by considering the factors of the ratios of read\/write data amount, the cache hits, and the overhead of data transfer between the storage device and the host. Besides, we propose a scheme of proactive data flushing to write the output data to the underlying flash arrays, without impacts on I\/O responsiveness. The trace-driven experiments show that our scheme can improve the overall I\/O latency by\n            <jats:monospace>35.4%<\/jats:monospace>\n            on average, in contrast to existing cache management schemes for\n            <jats:italic toggle=\"yes\">CompSSD<\/jats:italic>\n            devices.\n          <\/jats:p>","DOI":"10.1145\/3747845","type":"journal-article","created":{"date-parts":[[2025,7,22]],"date-time":"2025-07-22T11:13:49Z","timestamp":1753182829000},"page":"1-20","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Supports of Data Cache Division for Computational Solid-state Drives"],"prefix":"10.1145","volume":"22","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7242-6287","authenticated-orcid":false,"given":"Zhibing","family":"Sha","sequence":"first","affiliation":[{"name":"Southwest University","place":["Chongqing, China"]}]},{"ORCID":"https:\/\/orcid.org\/0009-0006-3086-8953","authenticated-orcid":false,"given":"Shuaiwen","family":"Yu","sequence":"additional","affiliation":[{"name":"Southwest University","place":["Chongqing, China"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0759-4377","authenticated-orcid":false,"given":"Chengyong","family":"Tang","sequence":"additional","affiliation":[{"name":"Southwest University","place":["Chongqing, China"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8406-8461","authenticated-orcid":false,"given":"Zhigang","family":"Cai","sequence":"additional","affiliation":[{"name":"Computer Science, Southwest University of China","place":["Beibei, China"]}]},{"ORCID":"https:\/\/orcid.org\/0009-0003-7870-1122","authenticated-orcid":false,"given":"Peng","family":"Tang","sequence":"additional","affiliation":[{"name":"Southwest University","place":["Chongqing, China"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8433-618X","authenticated-orcid":false,"given":"Ming","family":"Huang","sequence":"additional","affiliation":[{"name":"Southwest University","place":["Chongqing, China"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5235-6496","authenticated-orcid":false,"given":"Jun","family":"Li","sequence":"additional","affiliation":[{"name":"Nanjing University of Posts and Telecommunications","place":["Nanjing, China"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6149-6650","authenticated-orcid":false,"given":"Jianwei","family":"Liao","sequence":"additional","affiliation":[{"name":"Southwest University","place":["Chongqing, China"]}]}],"member":"320","published-online":{"date-parts":[[2025,9,18]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"publisher","DOI":"10.1109\/JPROC.2017.2678018"},{"key":"e_1_3_2_3_2","volume-title":"Proceedings of USENIX Annual Technical Conference.","author":"Jaffer S.","year":"2019","unstructured":"S. Jaffer, S. Maneas, A. Hwang, and B. Schroeder. 2019. Evaluating file system reliability on solid state drives. In Proceedings of USENIX Annual Technical Conference. 2019."},{"key":"e_1_3_2_4_2","doi-asserted-by":"crossref","unstructured":"J. Wang D. Park Y. Papakonstantinou and S. Swanson. 2016. SSD in-storage computing for search engines. IEEE Transactions on Computers (2016) 1\u20131.","DOI":"10.1109\/TC.2016.2608818"},{"key":"e_1_3_2_5_2","doi-asserted-by":"publisher","DOI":"10.1109\/MM.2014.55"},{"key":"e_1_3_2_6_2","doi-asserted-by":"publisher","DOI":"10.1145\/3286588"},{"key":"e_1_3_2_7_2","unstructured":"Cosmos OpenSSD Platform. Retrieved from http:\/\/www.openssd-project.org"},{"volume-title":"Proceedings of the Design, Automation, and Test in Europe.","author":"Xu X.","key":"e_1_3_2_8_2","unstructured":"X. Xu, Z. Cai, J. Liao, and Y. Ishiakwa. 2020. Frequent access pattern-based prefetching inside of solid-state drives. In Proceedings of the Design, Automation, and Test in Europe."},{"volume-title":"Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data. 1221\u20131230","author":"Do Jaeyoung","key":"e_1_3_2_9_2","unstructured":"Jaeyoung Do, Yang-Suk Kee, Jignesh M. Patel, Chanik Park, Kwanghyun Park, and David J. DeWitt. 2013. Query processing on smart ssds: Opportunities and challenges. In Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data. 1221\u20131230."},{"volume-title":"Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data. 1221\u20131230","author":"Do Jaeyoung","key":"e_1_3_2_10_2","unstructured":"Jaeyoung Do, Yang-Suk Kee, Jignesh M. Patel, Chanik Park, Kwanghyun Park, and David J. DeWitt. 2013. Query processing on smart ssds: Opportunities and challenges. In Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data. 1221\u20131230."},{"key":"e_1_3_2_11_2","doi-asserted-by":"publisher","DOI":"10.1109\/LCA.2020.3009347"},{"volume-title":"Proceedings of the 50th Annual IEEE\/ACM International Symposium on Microarchitecture. 219\u2013231","author":"Koo Gunjae","key":"e_1_3_2_12_2","unstructured":"Gunjae Koo, Kiran Kumar Matam, I. Te, H. V. Krishna Giri Narra, Jing Li, Hung-Wei Tseng, Steven Swanson, and Murali Annavaram. 2017. Summarizer: trading communication with computing near storage. In Proceedings of the 50th Annual IEEE\/ACM International Symposium on Microarchitecture. 219\u2013231."},{"key":"e_1_3_2_13_2","doi-asserted-by":"publisher","DOI":"10.1145\/3632951"},{"key":"e_1_3_2_14_2","doi-asserted-by":"publisher","DOI":"10.1145\/3415580"},{"key":"e_1_3_2_15_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.sysarc.2021.102334"},{"volume-title":"Proceedings of the 48th International Conference on Parallel Processing.","author":"Liu J.","key":"e_1_3_2_16_2","unstructured":"J. Liu, F. Wang, and D. Feng. 2019. CostPI: Cost-effective performance isolation for shared NVMe SSDs. In Proceedings of the 48th International Conference on Parallel Processing."},{"volume-title":"Proceedings of the 39th International Conference on Computer-Aided Design. 1\u20139.","author":"Liu Weiguang","key":"e_1_3_2_17_2","unstructured":"Weiguang Liu, Jinhua Cui, Junwei Liu, and Laurence T. Yang. 2020. MLCache: A space-efficient cache scheme based on reuse distance and machine learning for NVMe SSDs. In Proceedings of the 39th International Conference on Computer-Aided Design. 1\u20139."},{"key":"e_1_3_2_18_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2022.3143295"},{"key":"e_1_3_2_19_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCAD.2024.3492993"},{"volume-title":"Proceedings of the IEEE Conference on Mass Storage Systems and Technologies.","author":"Kang Y.","key":"e_1_3_2_20_2","unstructured":"Y. Kang, Y. S. Kee, E. L. Miller, and C. Park. 2013. Enabling cost-effective data processing with smart SSD. In Proceedings of the IEEE Conference on Mass Storage Systems and Technologies."},{"key":"e_1_3_2_21_2","doi-asserted-by":"publisher","DOI":"10.1145\/3472769"},{"key":"e_1_3_2_22_2","doi-asserted-by":"publisher","DOI":"10.1109\/TC.2022.3155956"},{"volume-title":"Proceedings of the USENIX Conference on File and Storage Technologies.","author":"Megiddo N.","key":"e_1_3_2_23_2","unstructured":"N. Megiddo and D. Modha. 2003. ARC: A self-tuning, low overhead replacement cache. In Proceedings of the USENIX Conference on File and Storage Technologies."},{"key":"e_1_3_2_24_2","volume-title":"16th USENIX Conference on File and Storage Technologies (FAST 18)","author":"Tavakkol Arash","year":"2018","unstructured":"Arash Tavakkol, Juan G\u00f3mez-Luna, Mohammad Sadrosadati, Saugata Ghose, and Onur Mutlu. 2018. MQSim: A framework for enabling realistic studies of modern Multi-QueueSSD devices. In 16th USENIX Conference on File and Storage Technologies (FAST 18). 49\u201366."},{"key":"e_1_3_2_25_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2022.3221410"},{"key":"e_1_3_2_26_2","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS54959.2023.00053"},{"key":"e_1_3_2_27_2","doi-asserted-by":"publisher","DOI":"10.1145\/1519065.1519081"},{"key":"e_1_3_2_28_2","unstructured":"2020. Alibaba block traces. Retrieved from https:\/\/github.com\/alibaba\/block-traces"},{"key":"e_1_3_2_29_2","doi-asserted-by":"publisher","DOI":"10.1145\/3078468.3078479"},{"key":"e_1_3_2_30_2","volume-title":"Automation & Test in Europe Conference & Exhibition (DATE). IEEE, 1\u20136.","author":"Yu Shuaiwen","year":"2024","unstructured":"Shuaiwen Yu, Zhibing Sha, Chengyong Tang, Zhigang Cai, Peng Tang, Min Huang, Jun Li, and Jianwei Liao. 2024. Adaptive dram cache division for computational solid-state drives. In 2024 Design, Automation & Test in Europe Conference & Exhibition (DATE). IEEE, 1\u20136."},{"key":"e_1_3_2_31_2","doi-asserted-by":"publisher","DOI":"10.1145\/3205289.3205319"},{"key":"e_1_3_2_32_2","doi-asserted-by":"publisher","DOI":"10.1145\/3131850"},{"key":"e_1_3_2_33_2","volume-title":"In-Situ Data Analytics on Extreme-Scale Machines. In 11th USENIX Conference on File and Storage Technologies (FAST 13)","author":"Tiwari Devesh","year":"2013","unstructured":"Devesh Tiwari, Simona Boboila, Sudharshan Vazhkudai, Youngjae Kim, Xiaosong Ma, Peter Desnoyers, and Yan Solihin. 2013. Active Flash: Towards Energy-Efficient, In-Situ Data Analytics on Extreme-Scale Machines. In 11th USENIX Conference on File and Storage Technologies (FAST 13). 119\u2013132."},{"key":"e_1_3_2_34_2","doi-asserted-by":"publisher","DOI":"10.1145\/3307650.3322275"},{"key":"e_1_3_2_35_2","doi-asserted-by":"publisher","DOI":"10.1145\/3445814.3446763"},{"key":"e_1_3_2_36_2","volume-title":"Proceedings of the 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems. 635\u2013654","author":"Ghiasi Nika Mansouri","year":"2022","unstructured":"Nika Mansouri Ghiasi, Jisung Park, Harun Mustafa, Jeremie Kim, Ataberk Olgun, Arvid Gollwitzer, Damla Senol Cali, Can Firtina, Haiyu Mao, Nour Almadhoun Alserr, and others. 2022. GenStore: A high-performance in-storage processing system for genome sequence analysis. In Proceedings of the 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems. 635\u2013654."},{"key":"e_1_3_2_37_2","doi-asserted-by":"publisher","DOI":"10.1145\/3470496.3527433"},{"key":"e_1_3_2_38_2","doi-asserted-by":"publisher","DOI":"10.1145\/3352460.3358320"},{"key":"e_1_3_2_39_2","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA57654.2024.00034"},{"volume-title":"Proceedings of the USENIX Annual Technical Conference.","author":"Ruan Z.","key":"e_1_3_2_40_2","unstructured":"Z. Ruan, T. He, and J. Cong. 2019. INSIDER: Designing In-Storage computing system for emerging High-Performance drive. In Proceedings of the USENIX Annual Technical Conference."},{"key":"e_1_3_2_41_2","doi-asserted-by":"publisher","DOI":"10.1145\/3342195.3387557"},{"key":"e_1_3_2_42_2","doi-asserted-by":"publisher","DOI":"10.1145\/301453.301487"},{"key":"e_1_3_2_43_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2021.3052028"},{"key":"e_1_3_2_44_2","volume-title":"Yookun Cho, and Chong Sang Kim.","author":"Lee Donghee","year":"2001","unstructured":"Donghee Lee, Jongmoo Choi, Jong-Hun Kim, Sam H Noh, Sang Lyul Min, Yookun Cho, and Chong Sang Kim. 2001. LRFU: A spectrum of policies that subsumes the least recently used and least frequently used policies. IEEE transactions on Computers 50, 12 (2001), 1352\u20131361."},{"key":"e_1_3_2_45_2","doi-asserted-by":"publisher","DOI":"10.1109\/TC.2015.2401029"},{"key":"e_1_3_2_46_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCAD.2022.3229293"},{"key":"e_1_3_2_47_2","doi-asserted-by":"publisher","DOI":"10.1109\/TVLSI.2017.2712366"},{"key":"e_1_3_2_48_2","doi-asserted-by":"publisher","DOI":"10.1145\/2925426.2926263"},{"key":"e_1_3_2_49_2","volume-title":"et\u00a0al","author":"Sha Z.","year":"2022","unstructured":"Z. Sha, Z. Cai, and F. Trahay, et\u00a0al. 2022. Unifying temporal and spatial locality for cache management inside SSDs. In Proceedings of the Design, Automation, and Test in Europe."},{"key":"e_1_3_2_50_2","doi-asserted-by":"publisher","DOI":"10.1145\/3392717.3392766"},{"key":"e_1_3_2_51_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.peva.2025.102479"},{"key":"e_1_3_2_52_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCAD.2024.3412970"},{"key":"e_1_3_2_53_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2022.3143295"},{"key":"e_1_3_2_54_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2022.3221410"},{"key":"e_1_3_2_55_2","doi-asserted-by":"publisher","DOI":"10.1109\/TCAD.2024.3492993"},{"key":"e_1_3_2_56_2","volume-title":"IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems","author":"Wu Jiaojiao","year":"2025","unstructured":"Jiaojiao Wu, Li Cai, Zhigang Cai, Fengxiang Zhang, and Jianwei Liao. 2025. Improving I\/O performance and fairness in NVMe SSDs with pooling portions of cache partitions. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (2025), 1\u20131."},{"key":"e_1_3_2_57_2","doi-asserted-by":"publisher","DOI":"10.1109\/RTSS59052.2023.00028"},{"volume-title":"Proceedings of the Real-Time Systems.","author":"Altmeyer S.","key":"e_1_3_2_58_2","unstructured":"S. Altmeyer, R. I. Davis, and C. Maiza. 2012. Improved cache related pre-emption delay aware response time analysis for fixed priority pre-emptive systems. In Proceedings of the Real-Time Systems."},{"key":"e_1_3_2_59_2","doi-asserted-by":"publisher","DOI":"10.1109\/RTCSA.2008.42"},{"key":"e_1_3_2_60_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11241-015-9246-8"},{"key":"e_1_3_2_61_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11241-015-9246-8"},{"volume-title":"Proceedings of the Euromicro Conference on Real-Time Systems.","author":"Kim H.","key":"e_1_3_2_62_2","unstructured":"H. Kim, A. Kandhalu, and R. Rajkumar. 2013. A coordinated approach for practical OS-level cache management in multi-core real-time systems. In Proceedings of the Euromicro Conference on Real-Time Systems."},{"key":"e_1_3_2_63_2","doi-asserted-by":"crossref","unstructured":"H. Kim and R. Rajkumar. 2016. Real-time cache management for multi-core virtualization In Proceedings of the International Conference on Embedded Software.","DOI":"10.1145\/2968478.2968480"}],"container-title":["ACM Transactions on Architecture and Code Optimization"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3747845","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,9,18]],"date-time":"2025-09-18T20:45:42Z","timestamp":1758228342000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3747845"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,9,18]]},"references-count":62,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2025,9,30]]}},"alternative-id":["10.1145\/3747845"],"URL":"https:\/\/doi.org\/10.1145\/3747845","relation":{},"ISSN":["1544-3566","1544-3973"],"issn-type":[{"type":"print","value":"1544-3566"},{"type":"electronic","value":"1544-3973"}],"subject":[],"published":{"date-parts":[[2025,9,18]]},"assertion":[{"value":"2024-12-05","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-06-27","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-09-18","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}