{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,24]],"date-time":"2025-08-24T23:06:49Z","timestamp":1756076809281,"version":"3.41.0"},"reference-count":34,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2011,7,1]],"date-time":"2011-07-01T00:00:00Z","timestamp":1309478400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Storage"],"published-print":{"date-parts":[[2011,7]]},"abstract":"<jats:p>Two schemes proposed to cope with unrecoverable or latent media errors and enhance the reliability of RAID systems are examined. The first scheme is the established, widely used, disk scrubbing scheme, which operates by periodically accessing disk drives to detect media-related unrecoverable errors. These errors are subsequently corrected by rebuilding the sectors affected. The second scheme is the recently proposed intradisk redundancy scheme, which uses a further level of redundancy inside each disk, in addition to the RAID redundancy across multiple disks. A new model is developed to evaluate the extent to which disk scrubbing reduces the unrecoverable sector errors. The probability of encountering unrecoverable sector errors is derived analytically under very general conditions regarding the characteristics of the read\/write process of uniformly distributed random workloads and for a broad spectrum of disk scrubbing schemes, which includes the deterministic and random scrubbing schemes. We show that the deterministic scrubbing scheme is the most efficient one. We also derive closed-form expressions for the percentage of unrecoverable sector errors that the scrubbing scheme detects and corrects, the throughput performance, and the minimum scrubbing period achievable under operation with random, uniformly distributed I\/O requests. Our results demonstrate that the reliability improvement due to disk scrubbing depends on the scrubbing frequency and the load of the system, and, for heavy-write workloads, may not reach the reliability level achieved by a simple interleaved parity-check (IPC)-based intradisk redundancy scheme, which is insensitive to the load. In fact, for small unrecoverable sector error probabilities, the IPC-based intradisk redundancy scheme achieves essentially the same reliability as that of a system operating without unrecoverable sector errors. For heavy loads, the reliability achieved by the scrubbing scheme can be orders of magnitude less than that of the intradisk redundancy scheme. Finally, the I\/O and throughput performances are evaluated by means of analysis and event-driven simulation.<\/jats:p>","DOI":"10.1145\/1970348.1970350","type":"journal-article","created":{"date-parts":[[2011,7,21]],"date-time":"2011-07-21T13:27:09Z","timestamp":1311254829000},"page":"1-42","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":26,"title":["Disk Scrubbing Versus Intradisk Redundancy for RAID Storage Systems"],"prefix":"10.1145","volume":"7","author":[{"given":"Ilias","family":"Iliadis","sequence":"first","affiliation":[{"name":"IBM Zurich Research Laboratory"}]},{"given":"Robert","family":"Haas","sequence":"additional","affiliation":[{"name":"IBM Zurich Research Laboratory"}]},{"given":"Xiao-Yu","family":"Hu","sequence":"additional","affiliation":[{"name":"IBM Zurich Research Laboratory"}]},{"given":"Evangelos","family":"Eleftheriou","sequence":"additional","affiliation":[{"name":"IBM Zurich Research Laboratory"}]}],"member":"320","published-online":{"date-parts":[[2011,7]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/1269899.1254917"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/1217935.1217957"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/12.364531"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/176979.176981"},{"volume-title":"Proceedings of the 3rd USENIX Conference on File and Storage Technologies (FAST). USENIX Association","author":"Corbett P.","key":"e_1_2_1_5_1","unstructured":"Corbett , P. , English , R. , Goel , A. , Grcanac , T. , Kleiman , S. , Leong , J. , and Sankar , S . 2004. Row-diagonal parity for double disk failure correction . In Proceedings of the 3rd USENIX Conference on File and Storage Technologies (FAST). USENIX Association , Berkeley, CA, 1--14. Corbett, P., English, R., Goel, A., Grcanac, T., Kleiman, S., Leong, J., and Sankar, S. 2004. Row-diagonal parity for double disk failure correction. In Proceedings of the 3rd USENIX Conference on File and Storage Technologies (FAST). USENIX Association, Berkeley, CA, 1--14."},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/1140103.1140326"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/1353452.1353453"},{"key":"e_1_2_1_8_1","unstructured":"DiskSim. 2007. The DiskSim simulation environment (Ver. 3.0). http:\/\/www.pdl.cmu.edu\/DiskSim\/. DiskSim . 2007. The DiskSim simulation environment (Ver. 3.0). http:\/\/www.pdl.cmu.edu\/DiskSim\/."},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/DSN.2007.41"},{"volume-title":"Proceedings of the USENIX Workshop on Hot Topics in Storage and File Systems. USENIX Association","author":"Greenan K. M.","key":"e_1_2_1_10_1","unstructured":"Greenan , K. M. , Plank , J. S. , and Wylie , J. J . 2010. Mean time to meaningless: MTTDL, Markov models, and storage system reliability . In Proceedings of the USENIX Workshop on Hot Topics in Storage and File Systems. USENIX Association , Berkeley, CA, 1--5. Greenan, K. M., Plank, J. S., and Wylie, J. J. 2010. Mean time to meaningless: MTTDL, Markov models, and storage system reliability. In Proceedings of the USENIX Workshop on Hot Topics in Storage and File Systems. USENIX Association, Berkeley, CA, 1--5."},{"key":"e_1_2_1_11_1","unstructured":"Hafner J. L. Deenadhayalan V. Kanungo T. and Rao K. 2004. Performance metrics for erasure codes in storage systems. IBM Res. rep. RJ 10321. Hafner J. L. Deenadhayalan V. Kanungo T. and Rao K. 2004. Performance metrics for erasure codes in storage systems. IBM Res. rep. RJ 10321."},{"key":"e_1_2_1_12_1","unstructured":"Hitachi Global Storage Technologies. 2007. Hitachi disk drive product datasheets. http:\/\/www.hitachigst.com\/. Hitachi Global Storage Technologies . 2007. Hitachi disk drive product datasheets. http:\/\/www.hitachigst.com\/."},{"key":"e_1_2_1_13_1","unstructured":"HP Labs. 2006. Pivate software. http:\/\/tesla.hpl.hp.com\/private_software\/. HP Labs . 2006. Pivate software. http:\/\/tesla.hpl.hp.com\/private_software\/."},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/MASCOT.2009.5366195"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/1384529.1375485"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/NAS.2008.20"},{"volume-title":"Queueing Systems, Volume 1: Theory","author":"Kleinrock L.","key":"e_1_2_1_17_1","unstructured":"Kleinrock , L. 1975. Queueing Systems, Volume 1: Theory . Wiley , New York . Kleinrock, L. 1975. Queueing Systems, Volume 1: Theory. Wiley, New York."},{"volume-title":"Proceedings of the 38th Annual IEEE\/IFIP International Conference on Dependable Systems and Networks (DSN). IEEE","author":"Mi N.","key":"e_1_2_1_18_1","unstructured":"Mi , N. , Riska , A. , Smirni , E. , and Riedel , E . 2008. Enhancing data availability in disk drives through background activities . In Proceedings of the 38th Annual IEEE\/IFIP International Conference on Dependable Systems and Networks (DSN). IEEE , Los Alamitos, CA, 492--501. Mi, N., Riska, A., Smirni, E., and Riedel, E. 2008. Enhancing data availability in disk drives through background activities. In Proceedings of the 38th Annual IEEE\/IFIP International Conference on Dependable Systems and Networks (DSN). IEEE, Los Alamitos, CA, 492--501."},{"volume-title":"Proceedings of the 8th USENIX Conference on File and Storage Technologies (FAST). USENIX Association","author":"Oprea A.","key":"e_1_2_1_19_1","unstructured":"Oprea , A. and Juels , A . 2010. A clean-slate look at disk scrubbing . In Proceedings of the 8th USENIX Conference on File and Storage Technologies (FAST). USENIX Association , Berkeley, CA, 57--70. Oprea, A. and Juels, A. 2010. A clean-slate look at disk scrubbing. In Proceedings of the 8th USENIX Conference on File and Storage Technologies (FAST). USENIX Association, Berkeley, CA, 57--70."},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/1179559.1179568"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/50202.50214"},{"volume-title":"Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST). USENIX Association","author":"Pinheiro E.","key":"e_1_2_1_22_1","unstructured":"Pinheiro , E. , Weber , W.-D. , and Barroso , L. A . 2007. Failure trends in a large disk drive population . In Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST). USENIX Association , Berkeley, CA, 17--28. Pinheiro, E., Weber, W.-D., and Barroso, L. A. 2007. Failure trends in a large disk drive population. In Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST). USENIX Association, Berkeley, CA, 17--28."},{"volume-title":"Proceedings of the USENIX Annual Technical Conference, USENIX Association","author":"Riska A.","key":"e_1_2_1_23_1","unstructured":"Riska , A. and Riedel , E . 2006. Disk drive level workload characterization . In Proceedings of the USENIX Annual Technical Conference, USENIX Association , Berkeley, CA, 97--102. Riska, A. and Riedel, E. 2006. Disk drive level workload characterization. In Proceedings of the USENIX Annual Technical Conference, USENIX Association, Berkeley, CA, 97--102."},{"volume-title":"Proceedings of the USENIX Annual Technical Conference. USENIX Association","author":"Riska A.","key":"e_1_2_1_24_1","unstructured":"Riska , A. and Riedel , E . 2008. Idle read after write: IRAW . In Proceedings of the USENIX Annual Technical Conference. USENIX Association , Berkeley, CA, 43--56. Riska, A. and Riedel, E. 2008. Idle read after write: IRAW. In Proceedings of the USENIX Annual Technical Conference. USENIX Association, Berkeley, CA, 43--56."},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/2.268881"},{"key":"e_1_2_1_26_1","unstructured":"Sawyer D. C. 1994. Dependability analysis of parallel systems using a simulation-based approach. NASA-CR-195762. Sawyer D. C. 1994. Dependability analysis of parallel systems using a simulation-based approach. NASA-CR-195762."},{"volume-title":"Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST). USENIX Association","author":"Schroeder B.","key":"e_1_2_1_27_1","unstructured":"Schroeder , B. and Gibson , G. A . 2007. Disk failures in the real world: What does an MTTF of 1,000,000 hours mean to you? In Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST). USENIX Association , Berkeley, CA, 1--16. Schroeder, B. and Gibson, G. A. 2007. Disk failures in the real world: What does an MTTF of 1,000,000 hours mean to you? In Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST). USENIX Association, Berkeley, CA, 1--16."},{"volume-title":"Proceedings of the 8th USENIX Conference on File and Storage Technologies. USENIX","author":"Schroeder B.","key":"e_1_2_1_28_1","unstructured":"Schroeder , B. , Damouras , S. , and Gill , P . 2010. Understanding latent sector errors and how to protect against them . In Proceedings of the 8th USENIX Conference on File and Storage Technologies. USENIX , Berkeley, CA, 71--84. Schroeder, B., Damouras, S., and Gill, P. 2010. Understanding latent sector errors and how to protect against them. In Proceedings of the 8th USENIX Conference on File and Storage Technologies. USENIX, Berkeley, CA, 71--84."},{"volume-title":"Proceedings of the 12th Annual IEEE\/ACM International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS). IEEE","author":"Schwarz T. J. E.","key":"e_1_2_1_29_1","unstructured":"Schwarz , T. J. E. , Xin , Q. , Miller , E. L. , Long , D. D. E. , Hospodor , A. , and Ng , S . 2004. Disk scrubbing in large archival storage systems . In Proceedings of the 12th Annual IEEE\/ACM International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS). IEEE , Los Alamitos, CA, 409--418. Schwarz, T. J. E., Xin, Q., Miller, E. L., Long, D. D. E., Hospodor, A., and Ng, S. 2004. Disk scrubbing in large archival storage systems. In Proceedings of the 12th Annual IEEE\/ACM International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS). IEEE, Los Alamitos, CA, 409--418."},{"volume-title":"Proceedings of the 51th IEEE Annual Reliability and Maintainability Symposium (RAMS). IEEE","author":"Shah S.","key":"e_1_2_1_30_1","unstructured":"Shah , S. and Elerath , J. G . 2005. Reliability analysis of disk drive failure mechanisms . In Proceedings of the 51th IEEE Annual Reliability and Maintainability Symposium (RAMS). IEEE , Los Alamitos, CA, 226--231. Shah, S. and Elerath, J. G. 2005. Reliability analysis of disk drive failure mechanisms. In Proceedings of the 51th IEEE Annual Reliability and Maintainability Symposium (RAMS). IEEE, Los Alamitos, CA, 226--231."},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/1629075.1629076"},{"key":"e_1_2_1_32_1","volume-title":"Probabilistic and Statistics with Reliability, Queueing and Computer Science Applications","author":"Trivedi K. S.","unstructured":"Trivedi , K. S. 2002. Probabilistic and Statistics with Reliability, Queueing and Computer Science Applications 2 nd Ed. Wiley , New York . Trivedi, K. S. 2002. Probabilistic and Statistics with Reliability, Queueing and Computer Science Applications 2nd Ed. Wiley, New York.","edition":"2"},{"volume-title":"Proceedings of the USENIX Workshop on Power Aware Computing and Systems (HotPower). USENIX Association","author":"Wang G.","key":"e_1_2_1_33_1","unstructured":"Wang , G. , Butt , A. R. , and Gniady , C . 2008. On the impact of disk scrubbing on energy savings . In Proceedings of the USENIX Workshop on Power Aware Computing and Systems (HotPower). USENIX Association , Berkeley, CA, 1--5. Wang, G., Butt, A. R., and Gniady, C. 2008. On the impact of disk scrubbing on energy savings. In Proceedings of the USENIX Workshop on Power Aware Computing and Systems (HotPower). USENIX Association, Berkeley, CA, 1--5."},{"volume-title":"Stochastic Modeling and the Theory of Queues","author":"Wolff R. W.","key":"e_1_2_1_34_1","unstructured":"Wolff , R. W. 1989. Stochastic Modeling and the Theory of Queues . Prentice Hall , Englewood Cliffs, NJ . Wolff, R. W. 1989. Stochastic Modeling and the Theory of Queues. Prentice Hall, Englewood Cliffs, NJ."}],"container-title":["ACM Transactions on Storage"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1970348.1970350","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1970348.1970350","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T10:52:52Z","timestamp":1750243972000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1970348.1970350"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,7]]},"references-count":34,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2011,7]]}},"alternative-id":["10.1145\/1970348.1970350"],"URL":"https:\/\/doi.org\/10.1145\/1970348.1970350","relation":{},"ISSN":["1553-3077","1553-3093"],"issn-type":[{"type":"print","value":"1553-3077"},{"type":"electronic","value":"1553-3093"}],"subject":[],"published":{"date-parts":[[2011,7]]},"assertion":[{"value":"2010-02-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2011-03-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2011-07-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}