{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,8]],"date-time":"2025-10-08T15:32:02Z","timestamp":1759937522537,"version":"3.41.0"},"reference-count":43,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2018,5,11]],"date-time":"2018-05-11T00:00:00Z","timestamp":1525996800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"China 863","award":["2015AA015305"],"award-info":[{"award-number":["2015AA015305"]}]},{"name":"ONR","award":["12055763"],"award-info":[{"award-number":["12055763"]}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["61433019 and U1435217"],"award-info":[{"award-number":["61433019 and U1435217"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/100000001","name":"NSF","doi-asserted-by":"publisher","award":["CNS-1251137, CNS- 1302246, CNS-1305360, and CNS-1622832"],"award-info":[{"award-number":["CNS-1251137, CNS- 1302246, CNS-1305360, and CNS-1622832"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100011685","name":"Dell-EMC","doi-asserted-by":"crossref","award":["2016-2017"],"award-info":[{"award-number":["2016-2017"]}],"id":[{"id":"10.13039\/100011685","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Storage"],"published-print":{"date-parts":[[2018,5,31]]},"abstract":"<jats:p>Deduplication has become essential in disk-based backup systems, but there have been few long-term studies of backup workloads. Most past studies either were of a small static snapshot or covered only a short period that was not representative of how a backup system evolves over time. 
For this article, we first collected 21 months of data from a shared user file system; 33 users and over 4,000 snapshots are covered. We then analyzed the dataset, examining a variety of essential characteristics across two dimensions: single-node deduplication and cluster deduplication. For single-node deduplication analysis, our primary focus was individual-user data. Despite apparently similar roles and behavior among all of our users, we found significant differences in their deduplication ratios. Moreover, the data that some users share with others had a much higher deduplication ratio than average. For cluster deduplication analysis, we implemented seven published data-routing algorithms and created a detailed comparison of their performance with respect to deduplication ratio, load distribution, and communication overhead. We found that per-file routing achieves a higher deduplication ratio than routing by super-chunk (multiple consecutive chunks), but it also leads to high data skew (imbalance of space usage across nodes). We also found that large chunking sizes are better for cluster deduplication, as they significantly reduce data-routing overhead, while their negative impact on deduplication ratios is small and acceptable. 
We draw interesting conclusions from both single-node and cluster deduplication analysis and make recommendations for future deduplication systems design.<\/jats:p>","DOI":"10.1145\/3183890","type":"journal-article","created":{"date-parts":[[2018,5,11]],"date-time":"2018-05-11T12:15:27Z","timestamp":1526040927000},"page":"1-27","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":10,"title":["Cluster and Single-Node Analysis of Long-Term Deduplication Patterns"],"prefix":"10.1145","volume":"14","author":[{"given":"Zhen \u201cJason\u201d","family":"Sun","sequence":"first","affiliation":[{"name":"National University of Defense Technology, Hunan, P.R.China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Geoff","family":"Kuenning","sequence":"additional","affiliation":[{"name":"Harvey Mudd College, CA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sonam","family":"Mandal","sequence":"additional","affiliation":[{"name":"Stony Brook University, NY, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Philip","family":"Shilane","sequence":"additional","affiliation":[{"name":"Dell EMC, PA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Vasily","family":"Tarasov","sequence":"additional","affiliation":[{"name":"IBM Research, California, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Nong","family":"Xiao","sequence":"additional","affiliation":[{"name":"National University of Defense Technology, Hunan, P.R.China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Erez","family":"Zadok","sequence":"additional","affiliation":[{"name":"Stony Brook University, NY, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2018,5,11]]},"reference":[{"volume-title":"Proceedings of the IEEE International Symposium on the Modeling, Analysis, and Simulation of Computer 
and Telecommunication Systems Conference (MASCOTS\u201909)","author":"Bhagwat D.","key":"e_1_2_1_1_1","unstructured":"D. Bhagwat , K. Eshghi , D. Long , and M. Lillibridge . 2009. Extreme binning: Scalable, parallel deduplication for chunk-based file backup . In Proceedings of the IEEE International Symposium on the Modeling, Analysis, and Simulation of Computer and Telecommunication Systems Conference (MASCOTS\u201909) . IEEE Computer Society, 1--9. D. Bhagwat, K. Eshghi, D. Long, and M. Lillibridge. 2009. Extreme binning: Scalable, parallel deduplication for chunk-based file backup. In Proceedings of the IEEE International Symposium on the Modeling, Analysis, and Simulation of Computer and Telecommunication Systems Conference (MASCOTS\u201909). IEEE Computer Society, 1--9."},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.5555\/3129633.3129663"},{"volume-title":"Proceedings of the USENIX Annual Technical Conference. USENIX, 16","author":"Debnath B.","key":"e_1_2_1_3_1","unstructured":"B. Debnath , S. Sengupta , and J. Li . 2010. ChunkStash: Speeding up inline storage deduplication using flash memory . In Proceedings of the USENIX Annual Technical Conference. USENIX, 16 . B. Debnath, S. Sengupta, and J. Li. 2010. ChunkStash: Speeding up inline storage deduplication using flash memory. In Proceedings of the USENIX Annual Technical Conference. USENIX, 16."},{"volume-title":"Proceedings of the 9th USENIX Conference on File and Storage Technologies (FAST\u201911)","author":"Dong W.","key":"e_1_2_1_4_1","unstructured":"W. Dong , F. Douglis , K. Li , H. Patterson , S. Reddy , and P. Shilane . 2011. Tradeoffs in scalable data routing for deduplication clusters . In Proceedings of the 9th USENIX Conference on File and Storage Technologies (FAST\u201911) . USENIX, 15--29. W. Dong, F. Douglis, K. Li, H. Patterson, S. Reddy, and P. Shilane. 2011. Tradeoffs in scalable data routing for deduplication clusters. 
In Proceedings of the 9th USENIX Conference on File and Storage Technologies (FAST\u201911). USENIX, 15--29."},{"volume-title":"Proceedings of the USENIX Large Installation System Administration Conference. USENIX, 13--13","author":"Douglis F.","key":"e_1_2_1_5_1","unstructured":"F. Douglis , D. Bhardwaj , H. Qian , and P. Shilane . 2011. Content-aware load balancing for distributed backup . In Proceedings of the USENIX Large Installation System Administration Conference. USENIX, 13--13 . F. Douglis, D. Bhardwaj, H. Qian, and P. Shilane. 2011. Content-aware load balancing for distributed backup. In Proceedings of the USENIX Large Installation System Administration Conference. USENIX, 13--13."},{"volume-title":"Proceedings of the USENIX Annual Technical Conference. USENIX, 285--296","author":"El-Shimi A.","key":"e_1_2_1_6_1","unstructured":"A. El-Shimi , R. Kalach , A. Kumar , A. Oltean , J. Li , and S. Sengupta . 2012. Primary data deduplication\u2014Large scale study and system design . In Proceedings of the USENIX Annual Technical Conference. USENIX, 285--296 . A. El-Shimi, R. Kalach, A. Kumar, A. Oltean, J. Li, and S. Sengupta. 2012. Primary data deduplication\u2014Large scale study and system design. In Proceedings of the USENIX Annual Technical Conference. USENIX, 285--296."},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/2391229.2391246"},{"key":"e_1_2_1_9_1","unstructured":"FSL-data-set 2016. FSLHomes data set and tools. Retrieved from tracer.filesystems.org.  FSL-data-set 2016. FSLHomes data set and tools. Retrieved from tracer.filesystems.org."},{"key":"e_1_2_1_10_1","volume-title":"Proceedings of the Annual Technical Conference. USENIX, 181--192","author":"Fu Min","year":"2014","unstructured":"Min Fu , Dan Feng , Yu Hua , Xubin He , and Zuoning Chen . 2014 . Accelerating restore and garbage collection in deduplication-based backup systems via exploiting history information . In Proceedings of the Annual Technical Conference. 
USENIX, 181--192 . Min Fu, Dan Feng, Yu Hua, Xubin He, and Zuoning Chen. 2014. Accelerating restore and garbage collection in deduplication-based backup systems via exploiting history information. In Proceedings of the Annual Technical Conference. USENIX, 181--192."},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.5555\/2442626.2442649"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11390-013-1394-5"},{"volume-title":"USENIX Annual Technical Conference. USENIX, 151--164","author":"George A.","key":"e_1_2_1_13_1","unstructured":"A. George and B. Medha . 2015. Identifying trends in enterprise data protection systems . In USENIX Annual Technical Conference. USENIX, 151--164 . A. George and B. Medha. 2015. Identifying trends in enterprise data protection systems. In USENIX Annual Technical Conference. USENIX, 151--164."},{"volume-title":"Proceedings of the 30th Symposium on Mass Storage Systems and Technologies (MSST\u201914)","author":"Gharaibeh A.","key":"e_1_2_1_14_1","unstructured":"A. Gharaibeh , C. Constantinescu , M. Lu , A. Sharma , R. Routray , P. Sarkar , D. Pease , and M. Ripeanu . 2014. DedupT: Deduplication for tape systems . In Proceedings of the 30th Symposium on Mass Storage Systems and Technologies (MSST\u201914) . IEEE Computer Society, 1--11. A. Gharaibeh, C. Constantinescu, M. Lu, A. Sharma, R. Routray, P. Sarkar, D. Pease, and M. Ripeanu. 2014. DedupT: Deduplication for tape systems. In Proceedings of the 30th Symposium on Mass Storage Systems and Technologies (MSST\u201914). IEEE Computer Society, 1--11."},{"key":"e_1_2_1_15_1","unstructured":"Jhon Gratz and David Reinsel. 2010. The Digital Universe Decade\u2014Are You Ready? IDC White Paper.  Jhon Gratz and David Reinsel. 2010. The Digital Universe Decade\u2014Are You Ready? IDC White Paper."},{"volume-title":"Proceedings of the USENIX Annual Technical Conference. USENIX, 25--25","author":"Guo F.","key":"e_1_2_1_16_1","unstructured":"F. Guo and P. 
Efstathopoulos . 2011. Building a high-performance deduplication system . In Proceedings of the USENIX Annual Technical Conference. USENIX, 25--25 . F. Guo and P. Efstathopoulos. 2011. Building a high-performance deduplication system. In Proceedings of the USENIX Annual Technical Conference. USENIX, 25--25."},{"key":"e_1_2_1_17_1","volume-title":"Proceedings of the International Conference on Image, Vision and Computing (ICIVC\u201912)","author":"Jianting M.","year":"2012","unstructured":"M. Jianting . 2012 . A deduplication-based data archiving system . In Proceedings of the International Conference on Image, Vision and Computing (ICIVC\u201912) . ACM, 1--12. M. Jianting. 2012. A deduplication-based data archiving system. In Proceedings of the International Conference on Image, Vision and Computing (ICIVC\u201912). ACM, 1--12."},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/1534530.1534540"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/1837915.1837921"},{"volume-title":"Proceedings of the USENIX Annual Technical Conference. USENIX, 111--124","author":"Li M.","key":"e_1_2_1_20_1","unstructured":"M. Li , C. Qin , and P. Lee . 2015. CDStore: Toward reliable, secure, and cost-efficient cloud storage via convergent dispersal . In Proceedings of the USENIX Annual Technical Conference. USENIX, 111--124 . M. Li, C. Qin, and P. Lee. 2015. CDStore: Toward reliable, secure, and cost-efficient cloud storage via convergent dispersal. In Proceedings of the USENIX Annual Technical Conference. USENIX, 111--124."},{"volume-title":"Proceedings of the 11th USENIX Conference on File and Storage Technologies (FAST\u201913)","author":"Lillibridge M.","key":"e_1_2_1_21_1","unstructured":"M. Lillibridge and K. Eshghi . 2013. Improving restore speed for backup systems that use inline chunk-based deduplication . In Proceedings of the 11th USENIX Conference on File and Storage Technologies (FAST\u201913) . USENIX, 183--197. M. Lillibridge and K. 
Eshghi. 2013. Improving restore speed for backup systems that use inline chunk-based deduplication. In Proceedings of the 11th USENIX Conference on File and Storage Technologies (FAST\u201913). USENIX, 183--197."},{"volume-title":"Proceedings of the 7th USENIX Conference on File and Storage Technologies (FAST\u201909)","author":"Lillibridge M.","key":"e_1_2_1_22_1","unstructured":"M. Lillibridge , K. Eshghi , D. Bhagwat , V. Deolalikar , G. Trezise , and P. Camble . 2009. Sparse indexing: Large scale, inline deduplication using sampling and locality . In Proceedings of the 7th USENIX Conference on File and Storage Technologies (FAST\u201909) . USENIX, 111--123. M. Lillibridge, K. Eshghi, D. Bhagwat, V. Deolalikar, G. Trezise, and P. Camble. 2009. Sparse indexing: Large scale, inline deduplication using sampling and locality. In Proceedings of the 7th USENIX Conference on File and Storage Technologies (FAST\u201909). USENIX, 111--123."},{"volume-title":"Proceedings of the 7th USENIX Conference on Hot Topics in Storage and File Systems. USENIX, 11","author":"Lin X.","key":"e_1_2_1_23_1","unstructured":"X. Lin , F. Douglis , J. Li , X. Li , R. Ricci , S. Smaldone , and G. Wallace . 2015. Metadata considered harmful \u2026 to deduplication . In Proceedings of the 7th USENIX Conference on Hot Topics in Storage and File Systems. USENIX, 11 . X. Lin, F. Douglis, J. Li, X. Li, R. Ricci, S. Smaldone, and G. Wallace. 2015. Metadata considered harmful \u2026 to deduplication. In Proceedings of the 7th USENIX Conference on Hot Topics in Storage and File Systems. USENIX, 11."},{"volume-title":"Proceedings of the IEEE International Conference on Software Testing, Verification and Validation. IEEE Computer Society, 1--14","author":"Lin X.","key":"e_1_2_1_24_1","unstructured":"X. Lin , M. Hibler , E. Eide , and R. Ricci . 2015. Using deduplicating storage for efficient disk image deployment . 
In Proceedings of the IEEE International Conference on Software Testing, Verification and Validation. IEEE Computer Society, 1--14 . X. Lin, M. Hibler, E. Eide, and R. Ricci. 2015. Using deduplicating storage for efficient disk image deployment. In Proceedings of the IEEE International Conference on Software Testing, Verification and Validation. IEEE Computer Society, 1--14."},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/2367589.2367606"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/1534530.1534541"},{"volume-title":"Proceedings of the IEEE International Symposium on the Modeling, Analysis, and Simulation of Computer and Telecommunication Systems Conference (MSST\u201910)","author":"Meister D.","key":"e_1_2_1_27_1","unstructured":"D. Meister and A. Brinkmann . 2010. dedupv1: Improving deduplication throughput using solid state drives (SSD) . In Proceedings of the IEEE International Symposium on the Modeling, Analysis, and Simulation of Computer and Telecommunication Systems Conference (MSST\u201910) . IEEE Computer Society,1--6. D. Meister and A. Brinkmann. 2010. dedupv1: Improving deduplication throughput using solid state drives (SSD). In Proceedings of the IEEE International Symposium on the Modeling, Analysis, and Simulation of Computer and Telecommunication Systems Conference (MSST\u201910). IEEE Computer Society,1--6."},{"volume-title":"Proceedings of the 11th USENIX Conference on File and Storage Technologies (FAST\u201913)","author":"Meister D.","key":"e_1_2_1_28_1","unstructured":"D. Meister , A. Brinkmann , and T. Suss . 2013. File recipe compression in data deduplication systems . In Proceedings of the 11th USENIX Conference on File and Storage Technologies (FAST\u201913) . USENIX,175--182. D. Meister, A. Brinkmann, and T. Suss. 2013. File recipe compression in data deduplication systems. In Proceedings of the 11th USENIX Conference on File and Storage Technologies (FAST\u201913). 
USENIX,175--182."},{"volume-title":"Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis (SC\u201912)","author":"Meister D.","key":"e_1_2_1_29_1","unstructured":"D. Meister , J. Kaiser , A. Brinkmann , T. Cortes , M. Kuhn , and J. Kunkel . 2012. A study on data deduplication in hpc storage systems . In Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis (SC\u201912) . IEEE Computer Society, 7. D. Meister, J. Kaiser, A. Brinkmann, T. Cortes, M. Kuhn, and J. Kunkel. 2012. A study on data deduplication in hpc storage systems. In Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis (SC\u201912). IEEE Computer Society, 7."},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/2078861.2078864"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/IISWC.2010.5650369"},{"volume-title":"Proceedings of the 10th USENIX Conference on File and Storage Technologies (FAST\u201912)","author":"Srinivasan K.","key":"e_1_2_1_32_1","unstructured":"K. Srinivasan , T. Bisson , G. Goodson , and K. Voruganti . 2012. iDedup: Latency-aware, inline data deduplication for primary storage . In Proceedings of the 10th USENIX Conference on File and Storage Technologies (FAST\u201912) . K. Srinivasan, T. Bisson, G. Goodson, and K. Voruganti. 2012. iDedup: Latency-aware, inline data deduplication for primary storage. In Proceedings of the 10th USENIX Conference on File and Storage Technologies (FAST\u201912)."},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/MSST.2016.7897080"},{"key":"e_1_2_1_34_1","first-page":"270","article-title":"SORT: A similarity-ownership based routing scheme to improve data read performance for deduplication clusters","volume":"3","author":"Tan Yujuan","year":"2011","unstructured":"Yujuan Tan , Dan Feng , Fangting Huang , and Zhichao Yan . 2011 . 
SORT: A similarity-ownership based routing scheme to improve data read performance for deduplication clusters . Int. J. Adv. Comput. Technol. 3 , 9 (2011), 270 -- 277 . Yujuan Tan, Dan Feng, Fangting Huang, and Zhichao Yan. 2011. SORT: A similarity-ownership based routing scheme to improve data read performance for deduplication clusters. Int. J. Adv. Comput. Technol. 3, 9 (2011), 270--277.","journal-title":"Int. J. Adv. Comput. Technol."},{"volume-title":"Proceedings of the USENIX Annual Technical Conference. USENIX, 261--272","author":"Tarasov V.","key":"e_1_2_1_35_1","unstructured":"V. Tarasov , A. Mudrankitony , W. Buik , P. Shilane , G. Kuenning , and E. Zadok . 2012. Generating realistic datasets for deduplication analysis . In Proceedings of the USENIX Annual Technical Conference. USENIX, 261--272 . V. Tarasov, A. Mudrankitony, W. Buik, P. Shilane, G. Kuenning, and E. Zadok. 2012. Generating realistic datasets for deduplication analysis. In Proceedings of the USENIX Annual Technical Conference. USENIX, 261--272."},{"volume-title":"Proceedings of the 8th USENIX Conference on File and Storage Technologies (FAST\u201910)","author":"Ungureanu C.","key":"e_1_2_1_36_1","unstructured":"C. Ungureanu , B. Atkin , A. Aranya , S. Gokhale , S. Rago , G. Calkowski , C. Dubnicki , and A. Bohra . 2010. HydraFS: A high-throughput file system for the HYDRAstor content-addressable storage system . In Proceedings of the 8th USENIX Conference on File and Storage Technologies (FAST\u201910) . USENIX, 225--239. C. Ungureanu, B. Atkin, A. Aranya, S. Gokhale, S. Rago, G. Calkowski, C. Dubnicki, and A. Bohra. 2010. HydraFS: A high-throughput file system for the HYDRAstor content-addressable storage system. In Proceedings of the 8th USENIX Conference on File and Storage Technologies (FAST\u201910). 
USENIX, 225--239."},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/MASCOTS.2015.40"},{"key":"e_1_2_1_38_1","volume-title":"Big Data: What It Is and Why You Should Care. White Paper.","author":"Villars R.","year":"2011","unstructured":"R. Villars , C. Olofson , and M. Eastwood . 2011 . Big Data: What It Is and Why You Should Care. White Paper. R. Villars, C. Olofson, and M. Eastwood. 2011. Big Data: What It Is and Why You Should Care. White Paper."},{"volume-title":"Proceedings of the 10th USENIX Conference on File and Storage Technologies (FAST\u201912)","author":"Wallace G.","key":"e_1_2_1_39_1","unstructured":"G. Wallace , F. Douglis , H. Qian , P. Shilane , S. Smaldone , M. Chamness , and W. Hsu . 2012. Characteristics of backup workloads in production systems . In Proceedings of the 10th USENIX Conference on File and Storage Technologies (FAST\u201912) . USENIX, 33--48. G. Wallace, F. Douglis, H. Qian, P. Shilane, S. Smaldone, M. Chamness, and W. Hsu. 2012. Characteristics of backup workloads in production systems. In Proceedings of the 10th USENIX Conference on File and Storage Technologies (FAST\u201912). USENIX, 33--48."},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/MSST.2010.5496987"},{"volume-title":"Proceedings of the USENIX Annual Technical Conference. USENIX, 26--28","author":"Xia W.","key":"e_1_2_1_41_1","unstructured":"W. Xia , H. Jiang , D. Feng , and Y. Hua . 2011. SiLo: A similarity-locality based near-exact deduplication scheme with low RAM overhead and high throughput . In Proceedings of the USENIX Annual Technical Conference. USENIX, 26--28 . W. Xia, H. Jiang, D. Feng, and Y. Hua. 2011. SiLo: A similarity-locality based near-exact deduplication scheme with low RAM overhead and high throughput. In Proceedings of the USENIX Annual Technical Conference. 
USENIX, 26--28."},{"volume-title":"Proceedings of the IEEE International Parallel & Distributed Processing Symposium (IPDPS\u201910)","author":"Yang T.","key":"e_1_2_1_42_1","unstructured":"T. Yang , H. Jiang , D. Feng , Z. Niu , K. Zhou , and Y. Wan . 2010. DEBAR: A scalable high-performance de-duplication storage system for backup and archiving . In Proceedings of the IEEE International Parallel & Distributed Processing Symposium (IPDPS\u201910) . IEEE Computer Society, 1--12. T. Yang, H. Jiang, D. Feng, Z. Niu, K. Zhou, and Y. Wan. 2010. DEBAR: A scalable high-performance de-duplication storage system for backup and archiving. In Proceedings of the IEEE International Parallel & Distributed Processing Symposium (IPDPS\u201910). IEEE Computer Society, 1--12."},{"volume-title":"Proceedings of the 31st Symposium on Mass Storage Systems and Technologies (MSST\u201915)","author":"Zhou Y.","key":"e_1_2_1_43_1","unstructured":"Y. Zhou , D. Feng , W. Xia , M. Fu , F. Huang , Y. Zhang , and C. Li . 2015. SecDep: A user-aware efficient fine-grained secure deduplication scheme with multi-level key management . In Proceedings of the 31st Symposium on Mass Storage Systems and Technologies (MSST\u201915) . IEEE Computer Society, 1--14. Y. Zhou, D. Feng, W. Xia, M. Fu, F. Huang, Y. Zhang, and C. Li. 2015. SecDep: A user-aware efficient fine-grained secure deduplication scheme with multi-level key management. In Proceedings of the 31st Symposium on Mass Storage Systems and Technologies (MSST\u201915). IEEE Computer Society, 1--14."},{"volume-title":"Proceedings of the 6th USENIX Conference on File and Storage Technologies. USENIX, 1--14","author":"Zhu B.","key":"e_1_2_1_44_1","unstructured":"B. Zhu , K. Li , and H. Patterson . 2008. Avoiding the disk bottleneck in the data domain deduplication file system . In Proceedings of the 6th USENIX Conference on File and Storage Technologies. USENIX, 1--14 . B. Zhu, K. Li, and H. Patterson. 2008. 
Avoiding the disk bottleneck in the data domain deduplication file system. In Proceedings of the 6th USENIX Conference on File and Storage Technologies. USENIX, 1--14."}],"container-title":["ACM Transactions on Storage"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3183890","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3183890","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3183890","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T01:08:29Z","timestamp":1750208909000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3183890"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,5,11]]},"references-count":43,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2018,5,31]]}},"alternative-id":["10.1145\/3183890"],"URL":"https:\/\/doi.org\/10.1145\/3183890","relation":{},"ISSN":["1553-3077","1553-3093"],"issn-type":[{"type":"print","value":"1553-3077"},{"type":"electronic","value":"1553-3093"}],"subject":[],"published":{"date-parts":[[2018,5,11]]},"assertion":[{"value":"2017-09-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2018-01-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2018-05-11","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}