{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,15]],"date-time":"2025-12-15T14:10:37Z","timestamp":1765807837893,"version":"3.37.3"},"reference-count":21,"publisher":"Wiley","license":[{"start":{"date-parts":[[2020,10,14]],"date-time":"2020-10-14T00:00:00Z","timestamp":1602633600000},"content-version":"unspecified","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61872284","2020GY-012","18JK0466","2019YFD1100901","RC1707","QN1726","ZR18050"],"award-info":[{"award-number":["61872284","2020GY-012","18JK0466","2019YFD1100901","RC1707","QN1726","ZR18050"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Natural Science Research in Shaanxi","award":["61872284","2020GY-012","18JK0466","2019YFD1100901","RC1707","QN1726","ZR18050"],"award-info":[{"award-number":["61872284","2020GY-012","18JK0466","2019YFD1100901","RC1707","QN1726","ZR18050"]}]},{"DOI":"10.13039\/501100009103","name":"Education Department of Shaanxi Province","doi-asserted-by":"publisher","award":["61872284","2020GY-012","18JK0466","2019YFD1100901","RC1707","QN1726","ZR18050"],"award-info":[{"award-number":["61872284","2020GY-012","18JK0466","2019YFD1100901","RC1707","QN1726","ZR18050"]}],"id":[{"id":"10.13039\/501100009103","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Thirteenth Five-Year\u201d National Key R&D Program Project","award":["61872284","2020GY-012","18JK0466","2019YFD1100901","RC1707","QN1726","ZR18050"],"award-info":[{"award-number":["61872284","2020GY-012","18JK0466","2019YFD1100901","RC1707","QN1726","ZR18050"]}]},{"DOI":"10.13039\/501100005392","name":"Xi'an University of Architecture and Technology","doi-asserted-by":"publisher","award":["61872284","2020GY-012","18JK0466","2019YFD1100901","RC1707","QN1726","ZR18050"],"award-info":[{"award-number":["61872284","2020GY-012","18JK0466","2019YFD1100901","RC1707","QN1726","ZR18050"]}],"id":[{"id":"10.13039\/501100005392","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Scientific Programming"],"published-print":{"date-parts":[[2020,10,14]]},"abstract":"<jats:p>Deduplication is a popular data reduction technology in storage systems which has significant advantages, such as finding and eliminating duplicate data, reducing data storage capacity required, increasing resource utilization, and saving storage costs. The file features are a key factor that is used to calculate the similarity between files, but the similarity calculated by the single feature has some limitations especially for the similar files. The storage node feature reflects the load condition of the node, which is the key factor to be considered in the data routing. This paper introduces a multifeature data routing strategy (DRMF). The routing strategy is made based on the features of the cluster, including routing communication, file similarity calculation, and the determination of the target node. The mutual information exchange is achieved by routing communication, routing servers, and storage nodes. The storage node calculates the similarity between the files stored, and then the file is routed according to the information provided by the routing server. The routing server determines the target node of the route according to the similar results and the node load features. The system prototype is designed and implemented; also, we develop a system to process the feature of cluster and determine the specific parameters of various features of experiments. In the end, we simulate the multifeature data routing and single-feature data routing, respectively, and compare the deduplication rate and data slope between the two strategies. The experimental results show that the proposed data routing strategy using multiple features can improve the deduplication rate of the cluster and maintain a lower data skew rate compared with the single-feature-based routing strategy MCS; DRMF can improve the deduplication rate of the cluster and maintain a lower data skew rate.<\/jats:p>","DOI":"10.1155\/2020\/8869237","type":"journal-article","created":{"date-parts":[[2020,10,15]],"date-time":"2020-10-15T02:20:36Z","timestamp":1602728436000},"page":"1-11","source":"Crossref","is-referenced-by-count":5,"title":["Research on Multifeature Data Routing Strategy in Deduplication"],"prefix":"10.1155","volume":"2020","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-4051-3137","authenticated-orcid":true,"given":"Qinlu","family":"He","sequence":"first","affiliation":[{"name":"School of Information and Control Engineering, Xi\u2019an University of Architecture and Technology, Xi\u2019an 710043, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6058-4832","authenticated-orcid":true,"given":"Genqing","family":"Bian","sequence":"additional","affiliation":[{"name":"School of Information and Control Engineering, Xi\u2019an University of Architecture and Technology, Xi\u2019an 710043, China"}]},{"given":"Bilin","family":"Shao","sequence":"additional","affiliation":[{"name":"School of Management, Xi\u2019an University of Architecture and Technology, Xi\u2019an 710043, China"}]},{"given":"Weiqi","family":"Zhang","sequence":"additional","affiliation":[{"name":"School of Information and Control Engineering, Xi\u2019an University of Architecture and Technology, Xi\u2019an 710043, China"}]}],"member":"311","reference":[{"volume-title":"Extracting Value from Chaos, an IDC White Paper Sponsored by EMC","year":"2019","author":"J. Gantz","key":"1"},{"issue":"4","key":"2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3369737","article-title":"Sketching volume capacities in deduplicated storage","volume":"15","author":"D. Harnik","year":"2019","journal-title":"ACM Transactions on Storage (TOS)"},{"key":"3","first-page":"647","article-title":"Data domain cloud tier: backup here, backup there, deduplicated everywhere! 2019","volume":"19","author":"A. Duggal","year":"2019","journal-title":"USENIX Annual Technical Conference (ATC)"},{"first-page":"277","article-title":"Estimating unseen deduplication\u2014from theory to practice","author":"D. Harnik","key":"4"},{"first-page":"65","article-title":"The quick migration of file servers","author":"K. Matsuzawa","key":"5"},{"first-page":"325","article-title":"UKSM: swift memory deduplication via hierarchical and adaptive memory region distilling","author":"N. Xia","key":"6"},{"first-page":"309","article-title":"ALACC: accelerating restore performance of data deduplication systems using adaptive look-ahead window assisted chunk caching","author":"Z. Cao","key":"7"},{"first-page":"705","article-title":"Can\u2019t we all get along? redesigning protection storage for modern workloads","author":"Y. Allu","key":"8"},{"first-page":"77","article-title":"CAFTL: a content-aware flash translation layer enhancing the lifespan of flash memory based solid state drives","author":"F. Chen","key":"9"},{"key":"10","doi-asserted-by":"publisher","DOI":"10.1109\/tpds.2013.167"},{"key":"11","doi-asserted-by":"crossref","first-page":"17","DOI":"10.1109\/TST.2015.7040510","article-title":"I-sieve: an inline high performance deduplication system used in cloud storage","volume":"20","author":"J. Wang","year":"2015","journal-title":"Tsinghua Science and Technology"},{"first-page":"472","article-title":"HPDV: a highly parallel deduplication cluster for virtual machine images","author":"C. Lin","key":"12"},{"key":"13","article-title":"Application-aware big data deduplication in cloud environment","volume":"1","author":"Y. Fu","year":"2017","journal-title":"IEEE Transactions on Cloud Computing"},{"first-page":"29","article-title":"The logic of physical garbage collection in deduplicating storage","author":"F. Douglis","key":"14"},{"first-page":"536","article-title":"RMD: a resemblance and mergence based approach for high performance deduplication","author":"P. Zhang","key":"15"},{"key":"16","doi-asserted-by":"publisher","DOI":"10.1109\/tc.2015.2456015"},{"key":"17","doi-asserted-by":"publisher","DOI":"10.1016\/j.jss.2017.02.039"},{"key":"18","doi-asserted-by":"publisher","DOI":"10.1016\/j.dam.2015.09.018"},{"first-page":"1","article-title":"Content-aware load balancing for distributed backup","author":"F. Douglis","key":"19"},{"key":"20","doi-asserted-by":"publisher","DOI":"10.3166\/ria.32.s1.25-40"},{"key":"21","doi-asserted-by":"publisher","DOI":"10.3166\/isi.23.5.159-173"}],"container-title":["Scientific Programming"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/downloads.hindawi.com\/journals\/sp\/2020\/8869237.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/downloads.hindawi.com\/journals\/sp\/2020\/8869237.xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/downloads.hindawi.com\/journals\/sp\/2020\/8869237.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,10,9]],"date-time":"2023-10-09T20:04:31Z","timestamp":1696881871000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.hindawi.com\/journals\/sp\/2020\/8869237\/"}},"subtitle":[],"editor":[{"given":"Cristian","family":"Mateos","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2020,10,14]]},"references-count":21,"alternative-id":["8869237","8869237"],"URL":"https:\/\/doi.org\/10.1155\/2020\/8869237","relation":{},"ISSN":["1875-919X","1058-9244"],"issn-type":[{"type":"electronic","value":"1875-919X"},{"type":"print","value":"1058-9244"}],"subject":[],"published":{"date-parts":[[2020,10,14]]}}}