{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,13]],"date-time":"2025-12-13T06:46:04Z","timestamp":1765608364365,"version":"3.41.0"},"reference-count":39,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2012,11,1]],"date-time":"2012-11-01T00:00:00Z","timestamp":1351728000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100004085","name":"Ministry of Education, Science and Technology","doi-asserted-by":"publisher","award":["2012-0006423"],"award-info":[{"award-number":["2012-0006423"]}],"id":[{"id":"10.13039\/501100004085","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100002994","name":"Ministry of Knowledge Economy","doi-asserted-by":"publisher","award":["10041244"],"award-info":[{"award-number":["10041244"]}],"id":[{"id":"10.13039\/501100002994","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000015","name":"U.S. Department of Energy","doi-asserted-by":"publisher","award":["DE-AC05-00OR22725"],"award-info":[{"award-number":["DE-AC05-00OR22725"]}],"id":[{"id":"10.13039\/100000015","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100003662","name":"Korea Evaluation Institute of Industrial Technology","doi-asserted-by":"publisher","award":["10041244"],"award-info":[{"award-number":["10041244"]}],"id":[{"id":"10.13039\/501100003662","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Web"],"published-print":{"date-parts":[[2012,11]]},"abstract":"<jats:p>With the ever-increasing popularity of Social Network Services (SNSs), an understanding of the characteristics of these services and their effects on the behavior of their host servers is critical. However, there has been a lack of research on the workload characterization of servers running SNS applications such as blog services. To fill this void, we empirically characterized real-world Web server logs collected from one of the largest South Korean blog hosting sites for 12 consecutive days. The logs consist of more than 96 million HTTP requests and 4.7TB of network traffic. Our analysis reveals the following: (i) The transfer size of nonmultimedia files and blog articles can be modeled using a truncated Pareto distribution and a log-normal distribution, respectively; (ii) user access for blog articles does not show temporal locality, but is strongly biased towards those posted with image or audio files. We additionally discuss the potential performance improvement through clustering of small files on a blog page into contiguous disk blocks, which benefits from the observed file access patterns. Trace-driven simulations show that, on average, the suggested approach achieves 60.6% better system throughput and reduces the processing time for file access by 30.8% compared to the best performance of the Ext4 filesystem.<\/jats:p>","DOI":"10.1145\/2382616.2382619","type":"journal-article","created":{"date-parts":[[2012,12,4]],"date-time":"2012-12-04T20:10:57Z","timestamp":1354651857000},"page":"1-26","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":11,"title":["Workload Characterization and Performance Implications of Large-Scale Blog Servers"],"prefix":"10.1145","volume":"6","author":[{"given":"Myeongjae","family":"Jeon","sequence":"first","affiliation":[{"name":"Rice University"}]},{"given":"Youngjae","family":"Kim","sequence":"additional","affiliation":[{"name":"Oak Ridge National Laboratory"}]},{"given":"Jeaho","family":"Hwang","sequence":"additional","affiliation":[{"name":"Korea Advanced Institute of Science and Technology"}]},{"given":"Joonwon","family":"Lee","sequence":"additional","affiliation":[{"name":"Sungkyunkwan University"}]},{"given":"Euiseong","family":"Seo","sequence":"additional","affiliation":[{"name":"Sungkyunkwan University"}]}],"member":"320","published-online":{"date-parts":[[2012,11]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1198\/016214505000000411"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/65.844498"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/90.649565"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/301464.301560"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/988672.988743"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.peva.2011.07.008"},{"key":"e_1_2_1_7_1","unstructured":"Bucy J. S. Schindler J. Schlosser S. W. and Ganger G. R. 2008. The disksim simulation environment version 4.0 reference manual. Tech. rep. CMU-PDL-08-101 Carnegie Mellon University. Bucy J. S. Schindler J. Schlosser S. W. and Ganger G. R. 2008. The disksim simulation environment version 4.0 reference manual. Tech. rep. CMU-PDL-08-101 Carnegie Mellon University."},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/1518701.1518847"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/1526709.1526806"},{"key":"e_1_2_1_10_1","volume-title":"Proceedings of the 16th International Conference on Distributed Computing Systems.","author":"Challenger J.","year":"1996","unstructured":"Challenger , J. 1996 . A distributed web server and its performance analysis on multiple platforms . In Proceedings of the 16th International Conference on Distributed Computing Systems. Challenger, J. 1996. A distributed web server and its performance analysis on multiple platforms. In Proceedings of the 16th International Conference on Distributed Computing Systems."},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/90.650143"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1010012224103"},{"volume-title":"Proceedings of the Global Telecommunication Conference.","author":"Dingle A.","key":"e_1_2_1_13_1","unstructured":"Dingle , A. , MacNair , E. , and Nguyen , T . 1999. An analysis of web server performance . In Proceedings of the Global Telecommunication Conference. Dingle, A., MacNair, E., and Nguyen, T. 1999. An analysis of web server performance. In Proceedings of the Global Telecommunication Conference."},{"volume-title":"Proceedings of the International Conference on Weblogs and Social Media.","author":"Duarte F.","key":"e_1_2_1_14_1","unstructured":"Duarte , F. , Mattos , B. , Bestavros , A. , Almeida , V. , and Almeida , J . 2007. Traffic characteristics and communication patterns in blogosphere . In Proceedings of the International Conference on Weblogs and Social Media. Duarte, F., Mattos, B., Bestavros, A., Almeida, V., and Almeida, J. 2007. Traffic characteristics and communication patterns in blogosphere. In Proceedings of the International Conference on Weblogs and Social Media."},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/1188455.1188570"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/1298306.1298310"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/2019643.2019646"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/1557019.1557064"},{"volume-title":"Proceedings of the 7th International Symposium on High Performance Distributed Computing.","author":"Holmedahl V.","key":"e_1_2_1_19_1","unstructured":"Holmedahl , V. , Smith , B. , and Yang , T . 1998. Cooperative caching of dynamic content on a distributed web server . In Proceedings of the 7th International Symposium on High Performance Distributed Computing. Holmedahl, V., Smith, B., and Yang, T. 1998. Cooperative caching of dynamic content on a distributed web server. In Proceedings of the 7th International Symposium on High Performance Distributed Computing."},{"volume-title":"Proceedings of the USENIX Symposium on Internet Technologies and Systems.","author":"Iyengar A.","key":"e_1_2_1_20_1","unstructured":"Iyengar , A. and Challenger , J . 1997. Improving web server performance by caching dynamic data . In Proceedings of the USENIX Symposium on Internet Technologies and Systems. Iyengar, A. and Challenger, J. 1997. Improving web server performance by caching dynamic data. In Proceedings of the USENIX Symposium on Internet Technologies and Systems."},{"volume-title":"Proceedings of the 2nd IEEE Workshop on Workload Characterization.","author":"Kant K.","key":"e_1_2_1_21_1","unstructured":"Kant , K. and Won , Y . 1999. Performance impact of uncached file accesses in specweb99 . In Proceedings of the 2nd IEEE Workshop on Workload Characterization. Kant, K. and Won, Y. 1999. Performance impact of uncached file accesses in specweb99. In Proceedings of the 2nd IEEE Workshop on Workload Characterization."},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.5555\/1702135.1702163"},{"volume-title":"Proceedings of the 7th SIAM International Conference on Data Mining.","author":"Leskovec J.","key":"e_1_2_1_23_1","unstructured":"Leskovec , J. , Mcglohon , M. , Faloutsos , C. , Glance , N. , and Hurst , M . 2007. Cascading behavior in large blog graphs . In Proceedings of the 7th SIAM International Conference on Data Mining. Leskovec, J., Mcglohon, M., Faloutsos, C., Glance, N., and Hurst, M. 2007. Cascading behavior in large blog graphs. In Proceedings of the 7th SIAM International Conference on Data Mining."},{"volume-title":"Proceedings of the 3rd USENIX Conference on File and Storage Technologies.","author":"Li Z.","key":"e_1_2_1_24_1","unstructured":"Li , Z. , Chen , Z. , Srinivasan , S. M. , and Zhou , Y . 2004. C-miner: Mining block correlations in storage systems . In Proceedings of the 3rd USENIX Conference on File and Storage Technologies. Li, Z., Chen, Z., Srinivasan, S. M., and Zhou, Y. 2004. C-miner: Mining block correlations in storage systems. In Proceedings of the 3rd USENIX Conference on File and Storage Technologies."},{"key":"e_1_2_1_25_1","volume-title":"Sciences: Keys and Clues. BioScience.","author":"Limpert E.","year":"2001","unstructured":"Limpert , E. , Stahel , W. A. , and Abbt , M . 2001 . Log-Normal Distributions across the Sciences: Keys and Clues. BioScience. Limpert, E., Stahel, W. A., and Abbt, M. 2001. Log-Normal Distributions across the Sciences: Keys and Clues. BioScience."},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/IISWC.2007.4362168"},{"volume-title":"Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software.","author":"Ohara M.","key":"e_1_2_1_27_1","unstructured":"Ohara , M. , Nagpurkar , P. , Ueda , Y. , and Ishizaki , K . 2009. The data-centricity of web 2.0 workloads and its impact on server performance . In Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software. Ohara, M., Nagpurkar, P., Ueda, Y., and Ishizaki, K. 2009. The data-centricity of web 2.0 workloads and its impact on server performance. In Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software."},{"volume-title":"Proceedings of the 12th International Conference on Modelling Tools and Techniques for Computer and Communication System Performance Evaluation.","author":"Oke A.","key":"e_1_2_1_28_1","unstructured":"Oke , A. and Bunt , R. B . 2002. Hierarchical workload characterization for a busy web server . In Proceedings of the 12th International Conference on Modelling Tools and Techniques for Computer and Communication System Performance Evaluation. Oke, A. and Bunt, R. B. 2002. Hierarchical workload characterization for a busy web server. In Proceedings of the 12th International Conference on Modelling Tools and Techniques for Computer and Communication System Performance Evaluation."},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/224056.224064"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/190314.190338"},{"key":"e_1_2_1_31_1","volume-title":"Proceedings of the 18th International World Wide Web Conference.","author":"Rodriguez P.","year":"2009","unstructured":"Rodriguez , P. 2009 . Web infrastructure for the 21st century . In Proceedings of the 18th International World Wide Web Conference. Rodriguez, P. 2009. Web infrastructure for the 21st century. In Proceedings of the 18th International World Wide Web Conference."},{"volume-title":"Proceedings of the USENIX Annual Technical Conference.","author":"Shriver E.","key":"e_1_2_1_32_1","unstructured":"Shriver , E. , Gabber , E. , Huang , L. , and Stein , C. A . 2001 . Proceedings of the USENIX Annual Technical Conference. Shriver, E., Gabber, E., Huang, L., and Stein, C. A. 2001. Proceedings of the USENIX Annual Technical Conference."},{"volume-title":"Proceedings of the IEEE International Symposium on Workload Characterization.","author":"Stewart C.","key":"e_1_2_1_33_1","unstructured":"Stewart , C. , Leventi , M. , and Shen , K . 2008. Empirical examination of a collaborative web application . In Proceedings of the IEEE International Symposium on Workload Characterization. Stewart, C., Leventi, M., and Shen, K. 2008. Empirical examination of a collaborative web application. In Proceedings of the IEEE International Symposium on Workload Characterization."},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/258612.258680"},{"volume-title":"Proceedings of the International Instrumentation and Measurement Technology Conference.","author":"Veres S.","key":"e_1_2_1_35_1","unstructured":"Veres , S. and Ionescu , D . 2009. Measurement-Based traffic characterization for web 2.0 applications . In Proceedings of the International Instrumentation and Measurement Technology Conference. Veres, S. and Ionescu, D. 2009. Measurement-Based traffic characterization for web 2.0 applications. In Proceedings of the International Instrumentation and Measurement Technology Conference."},{"volume-title":"Proceedings of the 6th USENIX Conference on File and Storage Technologies.","author":"Wachs M.","key":"e_1_2_1_36_1","unstructured":"Wachs , M. , Abd-El-Malek , M. , Thereska , E. , and Ganger , G. R . 2007. Argon: Performance insulation for shared storage servers . In Proceedings of the 6th USENIX Conference on File and Storage Technologies. Wachs, M., Abd-El-Malek, M., Thereska, E., and Ganger, G. R. 2007. Argon: Performance insulation for shared storage servers. In Proceedings of the 6th USENIX Conference on File and Storage Technologies."},{"volume-title":"Proceedings of the 11th IEEE\/ACM International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems.","author":"Wang J.","key":"e_1_2_1_37_1","unstructured":"Wang , J. and Li , D . 2003.A light-weight, temporary file system for large-scale web servers . In Proceedings of the 11th IEEE\/ACM International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems. Wang, J. and Li, D. 2003.A light-weight, temporary file system for large-scale web servers. In Proceedings of the 11th IEEE\/ACM International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems."},{"key":"e_1_2_1_38_1","unstructured":"Williams A. Arlitt M. Williamson C. and Barker K. 2005. Web Workload Characterization: Ten Years Later. Springer. Williams A. Arlitt M. Williamson C. and Barker K. 2005. Web Workload Characterization: Ten Years Later . Springer."},{"volume-title":"Human Behavior and the Principle of Least-Effort","author":"Zipf G. K.","key":"e_1_2_1_39_1","unstructured":"Zipf , G. K. 1949. Human Behavior and the Principle of Least-Effort . Addison-Wesley . Zipf, G. K. 1949. Human Behavior and the Principle of Least-Effort. Addison-Wesley."}],"container-title":["ACM Transactions on the Web"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2382616.2382619","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2382616.2382619","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T09:34:38Z","timestamp":1750239278000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2382616.2382619"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,11]]},"references-count":39,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2012,11]]}},"alternative-id":["10.1145\/2382616.2382619"],"URL":"https:\/\/doi.org\/10.1145\/2382616.2382619","relation":{},"ISSN":["1559-1131","1559-114X"],"issn-type":[{"type":"print","value":"1559-1131"},{"type":"electronic","value":"1559-114X"}],"subject":[],"published":{"date-parts":[[2012,11]]},"assertion":[{"value":"2011-09-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2012-08-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2012-11-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}