{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,31]],"date-time":"2026-03-31T02:09:21Z","timestamp":1774922961008,"version":"3.50.1"},"reference-count":31,"publisher":"Association for Computing Machinery (ACM)","issue":"3","license":[{"start":{"date-parts":[[2007,10,1]],"date-time":"2007-10-01T00:00:00Z","timestamp":1191196800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Storage"],"published-print":{"date-parts":[[2007,10]]},"abstract":"<jats:p>For five years, we collected annual snapshots of file-system metadata from over 60,000 Windows PC file systems in a large corporation. In this article, we use these snapshots to study temporal changes in file size, file age, file-type frequency, directory size, namespace structure, file-system population, storage capacity and consumption, and degree of file modification. We present a generative model that explains the namespace structure and the distribution of directory sizes. We find significant temporal trends relating to the popularity of certain file types, the origin of file content, the way the namespace is used, and the degree of variation among file systems, as well as more pedestrian changes in size and capacities. We give examples of consequent lessons for designers of file systems and related software.<\/jats:p>","DOI":"10.1145\/1288783.1288788","type":"journal-article","created":{"date-parts":[[2007,11,15]],"date-time":"2007-11-15T14:26:02Z","timestamp":1195136762000},"page":"9","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":203,"title":["A five-year study of file-system metadata"],"prefix":"10.1145","volume":"3","author":[{"given":"Nitin","family":"Agrawal","sequence":"first","affiliation":[{"name":"University of Wisconsin, Madison, WI"}]},{"given":"William J.","family":"Bolosky","sequence":"additional","affiliation":[{"name":"Microsoft Research, Redmond, WA"}]},{"given":"John R.","family":"Douceur","sequence":"additional","affiliation":[{"name":"Microsoft Research, Redmond, WA"}]},{"given":"Jacob R.","family":"Lorch","sequence":"additional","affiliation":[{"name":"Microsoft Research, Redmond, WA"}]}],"member":"320","published-online":{"date-parts":[[2007,10]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.5555\/1060289.1060291"},{"key":"e_1_2_1_2_1","volume-title":"Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST)","author":"Agrawal N.A.","unstructured":"Agrawal , N.A. , Bolosky , W.J. , Douceur , J.R. , and Lorch , J.R . 2007. A five-year study of file system metadata . In Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST) , San Jose, CA, 31--45. Agrawal, N.A., Bolosky, W.J., Douceur, J.R., and Lorch, J.R. 2007. A five-year study of file system metadata. In Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST), San Jose, CA, 31--45."},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/502034.502040"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/277851.277897"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/111048.111053"},{"key":"e_1_2_1_6_1","volume-title":"Proceedings of the 4th USENIX Windows Systems Symposium","author":"Bolosky W.J.","unstructured":"Bolosky , W.J. , Corbin , S. , Goebel , D. , and Douceur , J.R . 2000. Single instance storage in Windows 2000 . In Proceedings of the 4th USENIX Windows Systems Symposium , Seattle, WA. Bolosky, W.J., Corbin, S., Goebel, D., and Douceur, J.R. 2000. Single instance storage in Windows 2000. In Proceedings of the 4th USENIX Windows Systems Symposium, Seattle, WA."},{"key":"e_1_2_1_7_1","volume-title":"ZFS: The last word in file systems","author":"Bonwick J.","year":"2006","unstructured":"Bonwick , J. 2006 . ZFS: The last word in file systems . http:\/\/www.opensolaris.org\/os\/community\/zfs\/docs\/zfs_last.pdf. Bonwick, J. 2006. ZFS: The last word in file systems. http:\/\/www.opensolaris.org\/os\/community\/zfs\/docs\/zfs_last.pdf."},{"key":"e_1_2_1_8_1","unstructured":"Chapman G. 2002. Why does Explorer think I only want to see my documents&quest; http:\/\/pubs.logicalexpressions.com\/Pub0009\/LPMArticle.asp?ID=189.  Chapman G. 2002. Why does Explorer think I only want to see my documents&quest; http:\/\/pubs.logicalexpressions.com\/Pub0009\/LPMArticle.asp?ID=189."},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.5555\/1060289.1060316"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/301453.301480"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/378420.378824"},{"key":"e_1_2_1_12_1","volume-title":"Proceedings of the International Symposium on Performance Evaluation of Computer and Telecommunication Systems (SPECTS)","author":"Evans K.M.","unstructured":"Evans , K.M. and Kuenning , G.H . 2002. A study of irregularities in file-size distributions . In Proceedings of the International Symposium on Performance Evaluation of Computer and Telecommunication Systems (SPECTS) , San Diego, CA. Evans, K.M. and Kuenning, G.H. 2002. A study of irregularities in file-size distributions. In Proceedings of the International Symposium on Performance Evaluation of Computer and Telecommunication Systems (SPECTS), San Diego, CA."},{"key":"e_1_2_1_13_1","volume-title":"Mathematical Statistics","author":"Freund J.E.","unstructured":"Freund , J.E. 1992. Mathematical Statistics , 5 th ed. Prentice Hall . Freund, J.E. 1992. Mathematical Statistics, 5th ed. Prentice Hall.","edition":"5"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/277851.277894"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA.2005.20"},{"key":"e_1_2_1_16_1","volume-title":"Unix file size survey --","author":"Irlam G.","year":"1993","unstructured":"Irlam , G. 1993. Unix file size survey -- 1993 . http:\/\/www.base.com\/gordoni\/ufs93.html. Irlam, G. 1993. Unix file size survey -- 1993. http:\/\/www.base.com\/gordoni\/ufs93.html."},{"key":"e_1_2_1_17_1","volume-title":"The Art of Computer Programming, Volume 2: Seminumerical Algorithms","author":"Knuth D.E.","unstructured":"Knuth , D.E. 1981. The Art of Computer Programming, Volume 2: Seminumerical Algorithms , 2 nd ed. Addison-Wesley . Knuth, D.E. 1981. The Art of Computer Programming, Volume 2: Seminumerical Algorithms, 2nd ed. Addison-Wesley.","edition":"2"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1016\/0377-0427(92)90252-S"},{"key":"e_1_2_1_19_1","volume-title":"Proceedings of the 1st International Conference on Autonomic Computing (ICAC)","author":"Mesnier M.","unstructured":"Mesnier , M. , Thereska , E. , Ganger , G.R. , Ellard , D. , and Seltzer , M . 2004. File classification in self-&ast; storage systems . In Proceedings of the 1st International Conference on Autonomic Computing (ICAC) , New York. Mesnier, M., Thereska, E., Ganger, G.R., Ellard, D., and Seltzer, M. 2004. File classification in self-&ast; storage systems. In Proceedings of the 1st International Conference on Autonomic Computing (ICAC), New York."},{"key":"e_1_2_1_20_1","unstructured":"Microsoft. 2006. SetFileTime. http:\/\/msdn.microsoft.com\/library\/default.asp?url=\/library\/en-us\/wcecoreos5\/html\/wce50lrfsetfiletime.asp.  Microsoft. 2006. SetFileTime. http:\/\/msdn.microsoft.com\/library\/default.asp?url=\/library\/en-us\/wcecoreos5\/html\/wce50lrfsetfiletime.asp."},{"key":"e_1_2_1_21_1","volume-title":"Inside the Windows 95 file system. O'Reilly","author":"Mitchell S.","unstructured":"Mitchell , S. 1997. Inside the Windows 95 file system. O'Reilly , Sebastopol, CA . Mitchell, S. 1997. Inside the Windows 95 file system. O'Reilly, Sebastopol, CA."},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1080\/15427951.2004.10129092"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1002\/spe.4380140407"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/323647.323631"},{"key":"e_1_2_1_25_1","unstructured":"Reiser H. 2006. Three reasons why ReiserFS is great for you. http:\/\/www.namesys.com\/.  Reiser H. 2006. Three reasons why ReiserFS is great for you. http:\/\/www.namesys.com\/."},{"key":"e_1_2_1_26_1","volume-title":"Proceedings of the USENIX Annual Technical Conference","author":"Roselli D.","unstructured":"Roselli , D. , Lorch , J.R. , and Anderson , T.E . 2000. A comparison of file system workloads . In Proceedings of the USENIX Annual Technical Conference , San Diego, CA, 41--54. Roselli, D., Lorch, J.R., and Anderson, T.E. 2000. A comparison of file system workloads. In Proceedings of the USENIX Annual Technical Conference, San Diego, CA, 41--54."},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/800216.806597"},{"key":"e_1_2_1_28_1","volume-title":"Proceedings of the 16th IFIP Working Group 7.3 International Symposium on Computer Performance Modeling and Evaluation. 3--25","author":"Sienknecht T.F.","unstructured":"Sienknecht , T.F. , Friedrich , R.J. , Martinka , J.J. , and Friedenbach , P.M . 1994. The implications of distributed data in a commercial environment on the design of hierarchical storage management . In Proceedings of the 16th IFIP Working Group 7.3 International Symposium on Computer Performance Modeling and Evaluation. 3--25 . Sienknecht, T.F., Friedrich, R.J., Martinka, J.J., and Friedenbach, P.M. 1994. The implications of distributed data in a commercial environment on the design of hierarchical storage management. In Proceedings of the 16th IFIP Working Group 7.3 International Symposium on Computer Performance Modeling and Evaluation. 3--25."},{"key":"e_1_2_1_29_1","volume-title":"Tech. Rep. TR-35-94","author":"Smith K.","year":"1994","unstructured":"Smith , K. and Seltzer , M . 1994 . File layout and file system performance. Tech. Rep. TR-35-94 , Harvard University . Smith, K. and Seltzer, M. 1994. File layout and file system performance. Tech. Rep. TR-35-94, Harvard University."},{"key":"e_1_2_1_30_1","volume-title":"Inside Windows NT","author":"Solomon D.A.","unstructured":"Solomon , D.A. 1998. Inside Windows NT , 2 nd ed. Microsoft Press , Redmond, WA . Solomon, D.A. 1998. Inside Windows NT, 2nd ed. Microsoft Press, Redmond, WA.","edition":"2"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/319151.319158"}],"container-title":["ACM Transactions on Storage"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1288783.1288788","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1288783.1288788","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T14:58:03Z","timestamp":1750258683000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1288783.1288788"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,10]]},"references-count":31,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2007,10]]}},"alternative-id":["10.1145\/1288783.1288788"],"URL":"https:\/\/doi.org\/10.1145\/1288783.1288788","relation":{},"ISSN":["1553-3077","1553-3093"],"issn-type":[{"value":"1553-3077","type":"print"},{"value":"1553-3093","type":"electronic"}],"subject":[],"published":{"date-parts":[[2007,10]]},"assertion":[{"value":"2007-10-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}