{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,2]],"date-time":"2026-01-02T07:32:38Z","timestamp":1767339158623,"version":"build-2065373602"},"reference-count":56,"publisher":"Association for Computing Machinery (ACM)","issue":"4","funder":[{"name":"National Key Research and Development Program of China","award":["2023YFB4502100"],"award-info":[{"award-number":["2023YFB4502100"]}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["62262042 and 62172361"],"award-info":[{"award-number":["62262042 and 62172361"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Major Projects of Zhejiang Province","award":["LD24F020012"],"award-info":[{"award-number":["LD24F020012"]}]},{"name":"Pioneer and Leading Goose R&D Program of Zhejiang Province","award":["2024SSYS0002"],"award-info":[{"award-number":["2024SSYS0002"]}]},{"DOI":"10.13039\/501100004479","name":"Jiangxi Provincial Natural Science Foundation","doi-asserted-by":"crossref","award":["20224BAB202017"],"award-info":[{"award-number":["20224BAB202017"]}],"id":[{"id":"10.13039\/501100004479","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Storage"],"published-print":{"date-parts":[[2025,11,30]]},"abstract":"<jats:p>Delta compression can complement data deduplication by further minimizing redundancy through the compression of non-duplicate data chunks. When adding delta compression to deduplication-based backup systems, however, two primary challenges arise that degrade performance of inline deduplication. First, extra I\/Os are introduced along the critical paths of backup and restoration for retrieving base chunks, slowing the system. Second, rewriting techniques prohibit specific data chunks from serving as base chunks for delta compression to improve restore performance, resulting in a loss of compression efficiency.<\/jats:p>\n                  <jats:p>In this paper, we introduce LoopDelta, a framework that seamlessly integrates delta compression into inline deduplication for backup storage, addressing the aforementioned challenges by using three techniques: (1) dual-locality-based similarity tracking leverages both logical and physical locality to detect most of the similar chunks, which, due to their locality, can be prefetched by piggybacking on routine operations during deduplication, thereby eliminating extra I\/Os during backup; (2) cache-aware filter identifies base chunks requiring extra I\/Os during restore and prevents their referencing, thus eliminating extra restore I\/Os; and (3) inversed delta compression, which reverses the roles of base and target chunks in the traditional delta compression approach, thereby allowing for the delta compression of data chunks that are otherwise prohibited as base chunks due to rewriting techniques. Experiments show that LoopDelta increases the compression ratio by 1.28 to 11.33 times over basic deduplication, without significantly affecting backup throughput, and enhances restore performance by up to 3.57 times.<\/jats:p>","DOI":"10.1145\/3721485","type":"journal-article","created":{"date-parts":[[2025,3,5]],"date-time":"2025-03-05T05:11:38Z","timestamp":1741151498000},"page":"1-30","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["An Efficient Delta Compression Framework Seamlessly Integrated into Inline Deduplication"],"prefix":"10.1145","volume":"21","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7716-1214","authenticated-orcid":false,"given":"Yucheng","family":"Zhang","sequence":"first","affiliation":[{"name":"School of Mathematics and Computer Sciences, Nanchang University","place":["Nanchang, China"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0002-8980-6352","authenticated-orcid":false,"given":"Wenbin","family":"Zeng","sequence":"additional","affiliation":[{"name":"School of Mathematics and Computer Sciences, Nanchang University","place":["Nanchang, China"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1477-9751","authenticated-orcid":false,"given":"Hong","family":"Jiang","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering, University of Texas at Arlington","place":["Arlington, United States"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4674-6006","authenticated-orcid":false,"given":"Dan","family":"Feng","sequence":"additional","affiliation":[{"name":"School of Computer Science and Technology, Huazhong University of Science and Technology","place":["Wuhan, China"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9293-8028","authenticated-orcid":false,"given":"Zichen","family":"Xu","sequence":"additional","affiliation":[{"name":"School of Mathematics and Computer Sciences, Nanchang University","place":["Nanchang, China"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7075-4153","authenticated-orcid":false,"given":"Shuibing","family":"He","sequence":"additional","affiliation":[{"name":"College of Computer Science and Technology, Zhejiang University","place":["Hangzhou, China"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6440-7550","authenticated-orcid":false,"given":"Mingzhe","family":"Zhang","sequence":"additional","affiliation":[{"name":"Ant Group","place":["Beijing, China"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0000-1872-3715","authenticated-orcid":false,"given":"Dan","family":"Wu","sequence":"additional","affiliation":[{"name":"Nanchang University","place":["Nanchang, China"]}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2025,11,3]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"publisher","DOI":"10.1109\/MC.2017.187"},{"key":"e_1_3_2_3_2","doi-asserted-by":"publisher","DOI":"10.5555\/2813767.2813779"},{"key":"e_1_3_2_4_2","doi-asserted-by":"publisher","DOI":"10.1145\/1534530.1534539"},{"key":"e_1_3_2_5_2","doi-asserted-by":"publisher","DOI":"10.5555\/2591272.2591282"},{"key":"e_1_3_2_6_2","doi-asserted-by":"publisher","DOI":"10.1007\/3-540-45123-4_1"},{"key":"e_1_3_2_7_2","doi-asserted-by":"publisher","DOI":"10.5555\/3323298.3323311"},{"key":"e_1_3_2_8_2","first-page":"309","volume-title":"The 16th USENIX Conference on File and Storage Technologies (FAST\u201918)","author":"Cao Zhichao","year":"2018","unstructured":"Zhichao Cao, Hao Wen, Fenggang Wu, and David H. C. Du. 2018. ALACC: Accelerating restore performance of data deduplication systems using adaptive look-ahead window assisted chunk caching. In The 16th USENIX Conference on File and Storage Technologies (FAST\u201918). USENIX Association, Oakland, CA, USA, 309\u2013324."},{"key":"e_1_3_2_9_2","first-page":"1","volume-title":"The 2010 Conference on USENIX Annual Technical Conference (ATC\u201910)","author":"Debnath Biplob","year":"2010","unstructured":"Biplob Debnath, Sudipta Sengupta, and Jin Li. 2010. ChunkStash: Speeding up inline storage deduplication using flash memory. In The 2010 Conference on USENIX Annual Technical Conference (ATC\u201910). USENIX Association, Boston, MA, USA, 1\u201316."},{"key":"e_1_3_2_10_2","doi-asserted-by":"publisher","DOI":"10.5555\/3129633.3129637"},{"key":"e_1_3_2_11_2","first-page":"113","volume-title":"The 2003 USENIX Conference on USENIX Annual Technical Conference (ATC\u201903)","author":"Douglis Fred","year":"2003","unstructured":"Fred Douglis and Arun Iyengar. 2003. Application-specific delta-encoding via resemblance detection. In The 2003 USENIX Conference on USENIX Annual Technical Conference (ATC\u201903). USENIX Association, San Antonio, TX, USA, 113\u2013126."},{"key":"e_1_3_2_12_2","first-page":"1","volume-title":"The 8th USENIX Conference on File and Storage Technologies (FAST\u201910)","author":"Erik Kruus","year":"2010","unstructured":"Kruus Erik, Ungureanu Cristian, and Dubnicki Cezary. 2010. Bimodal content defined chunking for backup streams. In The 8th USENIX Conference on File and Storage Technologies (FAST\u201910). USENIX Association, San Jose, CA, USA, 1\u201314."},{"key":"e_1_3_2_13_2","unstructured":"Facebook. 2024. Zstandard. https:\/\/github.com\/facebook\/zstd. zstd."},{"key":"e_1_3_2_14_2","doi-asserted-by":"publisher","DOI":"10.5555\/2643634.2643653"},{"key":"e_1_3_2_15_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2015.2410781"},{"key":"e_1_3_2_16_2","doi-asserted-by":"publisher","DOI":"10.5555\/2750482.2750507"},{"key":"e_1_3_2_17_2","first-page":"1","volume-title":"The 2011 USENIX Conference on USENIX Annual Technical Conference (ATC\u201911)","author":"Guo Fanglu","year":"2011","unstructured":"Fanglu Guo and Petros Efstathopoulos. 2011. Building a high-performance deduplication system. In The 2011 USENIX Conference on USENIX Annual Technical Conference (ATC\u201911). USENIX Association, Portland, OR, USA, 1\u201314."},{"key":"e_1_3_2_18_2","unstructured":"Diwaker Gupta Sangmin Lee Michael Vrable Stefan Savage Alex C. Snoeren George Varghese Geoffrey M. Voelker and Amin Vahdat. 2008. Difference engine: Harnessing memory redundancy in virtual machines. In The 5th Symposium on Operating Systems Design and Implementation (OSDI\u201908) USENIX Association San Diego CA USA 309\u2013322."},{"key":"e_1_3_2_19_2","first-page":"844","volume-title":"The 33rd IEEE International Conference on Computer Communications (INFOCOM\u201914)","author":"Hua Yu","unstructured":"Yu Hua, Xue Liu, and Dan Feng. April 27 - May 02, 2014. Neptune: Efficient remote communication services for cloud backups. In The 33rd IEEE International Conference on Computer Communications (INFOCOM\u201914). IEEE, Toronto, Canada, 844\u2013852."},{"key":"e_1_3_2_20_2","first-page":"281","volume-title":"The 3rd USENIX Conference on File and Storage Technologies (FAST\u201905)","author":"Jain Navendu","year":"2005","unstructured":"Navendu Jain, Michael Dahlin, and Renu Tewari. 2005. TAPER: Tiered approach for eliminating redundancy in replica synchronization. In The 3rd USENIX Conference on File and Storage Technologies (FAST\u201905). USENIX Association, San Francisco, CA, USA, 281\u2013294."},{"key":"e_1_3_2_21_2","doi-asserted-by":"publisher","DOI":"10.1145\/2367589.2367600"},{"key":"e_1_3_2_22_2","first-page":"59","volume-title":"The 2004 USENIX Annual Technical Conference (ATC\u201904)","author":"Kulkarni Purushottam","year":"2004","unstructured":"Purushottam Kulkarni, Fred Douglis, Jason D. LaVoie, and John M. Tracey. 2004. Redundancy elimination within large collections of files. In The 2004 USENIX Annual Technical Conference (ATC\u201904). USENIX Association, Boston, MA, USA, 59\u201372."},{"key":"e_1_3_2_23_2","doi-asserted-by":"publisher","DOI":"10.5555\/2591272.2591292"},{"key":"e_1_3_2_24_2","first-page":"111","volume-title":"The 7th USENIX Conference on File and Storage Technologies (FAST\u201909)","volume":"9","author":"Lillibridge Mark","year":"2009","unstructured":"Mark Lillibridge, Kave Eshghi, Deepavali Bhagwat, Vinay Deolalikar, Greg Trezise, and Peter Camble. 2009. Sparse indexing: Large scale, inline deduplication using sampling and locality. In The 7th USENIX Conference on File and Storage Technologies (FAST\u201909), Vol. 9. USENIX Association, San Jose, CA, USA, 111\u2013123."},{"key":"e_1_3_2_25_2","unstructured":"Joshua P. MacDonald. 2000. File System Support for Delta Compression. University of California Berkeley Berkeley CA."},{"key":"e_1_3_2_26_2","doi-asserted-by":"publisher","DOI":"10.1145\/2485732.2485748"},{"key":"e_1_3_2_27_2","first-page":"229","volume-title":"The 9th USENIX Conference on File and Storage Technologies (FAST\u201911)","author":"Meyer Dutch T.","year":"2011","unstructured":"Dutch T. Meyer and William J. Bolosky. 2011. A study of practical deduplication. In The 9th USENIX Conference on File and Storage Technologies (FAST\u201911). USENIX Association, San Jose, CA, USA, 229\u2013241."},{"key":"e_1_3_2_28_2","doi-asserted-by":"publisher","DOI":"10.1109\/HPCC.2011.82"},{"key":"e_1_3_2_29_2","doi-asserted-by":"publisher","DOI":"10.1145\/3357223.3362731"},{"key":"e_1_3_2_30_2","first-page":"247","volume-title":"The 20th USENIX Conference on File and Storage Technologies (FAST\u201922)","author":"Park Jisung","year":"2022","unstructured":"Jisung Park, Jeonggyun Kim, Yeseong Kim, Sungjin Lee, and Onur Mutlu. 2022. DeepSketch: A new machine learning-based reference search technique for post-deduplication delta compression. In The 20th USENIX Conference on File and Storage Technologies (FAST\u201922). USENIX Association, Santa Clara, CA, USA, 247\u2013264."},{"key":"e_1_3_2_31_2","first-page":"89","volume-title":"The 1st USENIX Conference on File and Storage Technologies (FAST\u201902)","author":"Quinlan Sean","year":"2002","unstructured":"Sean Quinlan and Sean Dorward. 2002. Venti: A new approach to archival storage. In The 1st USENIX Conference on File and Storage Technologies (FAST\u201902). USENIX Association, Monterey, CA, USA, 89\u2013101."},{"key":"e_1_3_2_32_2","volume-title":"Fingerprinting by Random Polynomials","author":"Rabin Michael O.","year":"1981","unstructured":"Michael O. Rabin. 1981. Fingerprinting by Random Polynomials. Center for Research in Computing Techn., Aiken Computation Laboratory, Univ."},{"key":"e_1_3_2_33_2","doi-asserted-by":"publisher","DOI":"10.1145\/1987816.1987837"},{"key":"e_1_3_2_34_2","doi-asserted-by":"publisher","DOI":"10.1145\/2385603.2385606"},{"key":"e_1_3_2_35_2","doi-asserted-by":"publisher","DOI":"10.5555\/2342806.2342816"},{"key":"e_1_3_2_36_2","doi-asserted-by":"publisher","DOI":"10.1145\/3664817"},{"key":"e_1_3_2_37_2","volume-title":"The 33rd Symposium on Mass Storage Systems and Technologies (MSST\u201917)","author":"Tan Yujuan","year":"2017","unstructured":"Yujuan Tan, Jian Wen, Zhichao Yan, Hong Jiang, Srisa-an Witawas, Baiping Wang, and Hao Luo. 2017. FGDEFRAG: A fine-grained defragmentation approach to improve restore performance. In The 33rd Symposium on Mass Storage Systems and Technologies (MSST\u201917). IEEE Computer Society Press, Santa Clara, CA."},{"key":"e_1_3_2_38_2","doi-asserted-by":"publisher","DOI":"10.1145\/1629080.1629084"},{"key":"e_1_3_2_39_2","first-page":"1","volume-title":"The 10th USENIX Conference on File and Storage Technologies (FAST\u201912)","author":"Wallace Grant","year":"2012","unstructured":"Grant Wallace, Fred Douglis, Hangwei Qian, Philip Shilane, Stephen Smaldone, Mark Chamness, and Windsor Hsu. 2012. Characteristics of backup workloads in production systems. In The 10th USENIX Conference on File and Storage Technologies (FAST\u201912). USENIX Association, San Jose, CA, USA, 1\u201314."},{"key":"e_1_3_2_40_2","doi-asserted-by":"publisher","DOI":"10.1002\/cpe.6558"},{"key":"e_1_3_2_41_2","doi-asserted-by":"publisher","DOI":"10.1109\/JPROC.2016.2571298"},{"key":"e_1_3_2_42_2","first-page":"285","volume-title":"The 2011 Conference on USENIX Annual Technical Conference (ATC\u201911)","author":"Xia Wen","year":"2011","unstructured":"Wen Xia, Hong Jiang, Dan Feng, and Yu Hua. 2011. SiLo: A similarity-locality based near-exact deduplication scheme with low RAM overhead and high throughput. In The 2011 Conference on USENIX Annual Technical Conference (ATC\u201911). USENIX Association, Portland, OR, 285\u2013298."},{"key":"e_1_3_2_43_2","volume-title":"The 7th USENIX Conference on Hot Topics in Storage and File Systems (HotStorage\u201915)","author":"Xia Wen","year":"2015","unstructured":"Wen Xia, Chunguang Li, Hong Jiang, Dan Feng, Yu Hua, Leihua Qin, and Yucheng Zhang. 2015. Edelta: A word-enlarging based fast delta compression approach. In The 7th USENIX Conference on Hot Topics in Storage and File Systems (HotStorage\u201915). USENIX Association, Santa Clara, CA."},{"key":"e_1_3_2_44_2","first-page":"101","volume-title":"The 2016 Conference on USENIX Annual Technical Conference (ATC\u201916)","author":"Xia Wen","year":"2016","unstructured":"Wen Xia, Yukun Zhou, Hong Jiang, Dan Feng, Yu Hua, Yuchong Hu, Qing Liu, and Yucheng Zhang. 2016. FastCDC: A fast and efficient content-defined chunking approach for data deduplication. In The 2016 Conference on USENIX Annual Technical Conference (ATC\u201916). USENIX Association, Denver, CO, 101\u2013114."},{"key":"e_1_3_2_45_2","first-page":"592","volume-title":"The 21st Annual International Conference on Mobile Computing and Networking (MobiCom\u201915)","author":"Yong Cui","year":"2015","unstructured":"Cui Yong, Lai Zeqi, Wang Xin, Dai Ningwei, and Miao Congcong. 2015. QuickSync: Improving synchronization efficiency for mobile cloud storage services. In The 21st Annual International Conference on Mobile Computing and Networking (MobiCom\u201915). ACM Association, Paris, France, 592\u2013603."},{"key":"e_1_3_2_46_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2005.47"},{"key":"e_1_3_2_47_2","first-page":"133","volume-title":"The 2023 Conference on USENIX Annual Technical Conference (ATC\u201923)","author":"Zhang Yucheng","year":"2023","unstructured":"Yucheng Zhang, Hong Jiang, Dan Feng, Nan Jiang, Taorong Qiu, and Wei Huang. 2023. LoopDelta: Embedding locality-aware opportunistic delta compression in inline deduplication for highly efficient data reduction. In The 2023 Conference on USENIX Annual Technical Conference (ATC\u201923). USENIX Association, Boston, MA, USA, 133\u2013148."},{"key":"e_1_3_2_48_2","first-page":"1337","volume-title":"The 34th IEEE International Conference on Computer Communications (INFOCOM\u201915)","author":"Zhang Yucheng","unstructured":"Yucheng Zhang, Hong Jiang, Dan Feng, Wen Xia, Min Fu, Fangting Huang, and Yukun Zhou. April 26th - May 1st, 2015. AE: An asymmetric extremum content defined chunking algorithm for fast and bandwidth-efficient data deduplication. In The 34th IEEE International Conference on Computer Communications (INFOCOM\u201915). IEEE, Hong Kong, China, 1337\u20131345."},{"key":"e_1_3_2_49_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCD53106.2021.00078"},{"key":"e_1_3_2_50_2","doi-asserted-by":"publisher","DOI":"10.1109\/TC.2023.3318404"},{"key":"e_1_3_2_51_2","first-page":"121","volume-title":"The 17th USENIX Conference on File and Storage Technologies (FAST\u201919)","author":"Zhang Yucheng","year":"2019","unstructured":"Yucheng Zhang, Wen Xia, Dan Feng, Hong Jiang, Yu Hua, and Qiang Wang. 2019. Finesse: Fine-grained feature locality based fast resemblance detection for post-deduplication delta compression. In The 17th USENIX Conference on File and Storage Technologies (FAST\u201919). USENIX Association, Boston, MA, USA, 121\u2013128."},{"key":"e_1_3_2_52_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2020.2991030"},{"key":"e_1_3_2_53_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.future.2024.03.004"},{"key":"e_1_3_2_54_2","first-page":"269","volume-title":"The 6th USENIX Conference on File and Storage Technologies (FAST\u201908)","author":"Zhu Benjamin","year":"2008","unstructured":"Benjamin Zhu, Kai Li, and Patterson Hugo. 2008. Avoiding the disk bottleneck in the data domain deduplication file system. In The 6th USENIX Conference on File and Storage Technologies (FAST\u201908). USENIX Association, San Jose, CA, USA, 269\u2013282."},{"key":"e_1_3_2_55_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE51399.2021.00048"},{"key":"e_1_3_2_56_2","first-page":"19","volume-title":"The 2022 USENIX Annual Technical Conference (ATC\u201922)","author":"Zou Xiangyu","year":"2022","unstructured":"Xiangyu Zou, Wen Xia, Philip Shilane, Haijun Zhang, and Xuan Wang. 2022. Building a high-performance fine-grained deduplication framework for backup storage with high deduplication ratio. In The 2022 USENIX Annual Technical Conference (ATC\u201922). USENIX Association, Carlsbad, CA, USA, 19\u201336."},{"key":"e_1_3_2_57_2","first-page":"171","volume-title":"The 19th USENIX Conference on File and Storage Technologies (FAST\u201921)","author":"Zou Xiangyu","year":"2021","unstructured":"Xiangyu Zou, Jingsong Yuan, Philip Shilane, Wen Xia, Haijun Zhang, and Xuan Wang. 2021. The dilemma between deduplication and locality: Can both be achieved?. In The 19th USENIX Conference on File and Storage Technologies (FAST\u201921). USENIX Association, 171\u2013185."}],"container-title":["ACM Transactions on Storage"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3721485","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,11,3]],"date-time":"2025-11-03T13:32:38Z","timestamp":1762176758000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3721485"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,11,3]]},"references-count":56,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2025,11,30]]}},"alternative-id":["10.1145\/3721485"],"URL":"https:\/\/doi.org\/10.1145\/3721485","relation":{},"ISSN":["1553-3077","1553-3093"],"issn-type":[{"type":"print","value":"1553-3077"},{"type":"electronic","value":"1553-3093"}],"subject":[],"published":{"date-parts":[[2025,11,3]]},"assertion":[{"value":"2024-08-30","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-01-24","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2025-11-03","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}