{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,14]],"date-time":"2025-10-14T20:08:08Z","timestamp":1760472488393,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":14,"publisher":"ACM","license":[{"start":{"date-parts":[[2017,5,15]],"date-time":"2017-05-15T00:00:00Z","timestamp":1494806400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"BMBF","award":["01IS14013A"],"award-info":[{"award-number":["01IS14013A"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2017,5,15]]},"DOI":"10.1145\/3075564.3078888","type":"proceedings-article","created":{"date-parts":[[2017,6,7]],"date-time":"2017-06-07T12:47:29Z","timestamp":1496839649000},"page":"367-372","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":7,"title":["Addressing Hadoop's Small File Problem With an Appendable Archive File Format"],"prefix":"10.1145","author":[{"given":"Thomas","family":"Renner","sequence":"first","affiliation":[{"name":"Technische Universit\u00e4t Berlin, Germany"}]},{"given":"Johannes","family":"M\u00fcller","sequence":"additional","affiliation":[{"name":"Technische Universit\u00e4t Berlin, Germany"}]},{"given":"Lauritz","family":"Thamsen","sequence":"additional","affiliation":[{"name":"Technische Universit\u00e4t Berlin, Germany"}]},{"given":"Odej","family":"Kao","sequence":"additional","affiliation":[{"name":"Technische Universit\u00e4t Berlin, Germany"}]}],"member":"320","published-online":{"date-parts":[[2017,5,15]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Apache flink: Stream and batch processing in a single engine. Data Engineering","author":"Carbone Paris","year":"2015","unstructured":"Paris Carbone , Stephan Ewen , Seif Haridi , Asterios Katsifodimos , Volker Markl , and Kostas Tzoumas . 2015. Apache flink: Stream and batch processing in a single engine. Data Engineering ( 2015 ), 28. Paris Carbone, Stephan Ewen, Seif Haridi, Asterios Katsifodimos, Volker Markl, and Kostas Tzoumas. 2015. Apache flink: Stream and batch processing in a single engine. Data Engineering (2015), 28."},{"key":"e_1_3_2_1_2_1","volume-title":"Ronald L Rivest, and Clifford Stein.","author":"Cormen Thomas","year":"2001","unstructured":"Thomas H.. Cormen , Charles Eric Leiserson , Ronald L Rivest, and Clifford Stein. 2001 . Introduction to algorithms. Vol. 6 . MIT press Cambridge . Thomas H.. Cormen, Charles Eric Leiserson, Ronald L Rivest, and Clifford Stein. 2001. Introduction to algorithms. Vol. 6. MIT press Cambridge."},{"key":"e_1_3_2_1_3_1","volume-title":"Proceedings of the 6th Conference on Symposium on Operating Systems Design & Implementation (OSDI'04)","author":"Dean Jeffrey","year":"2004","unstructured":"Jeffrey Dean and Sanjay Ghemawat . 2004 . MapReduce: Simplified Data Processing on Large Clusters . In Proceedings of the 6th Conference on Symposium on Operating Systems Design & Implementation (OSDI'04) . USENIX Association, 10--10. Jeffrey Dean and Sanjay Ghemawat. 2004. MapReduce: Simplified Data Processing on Large Clusters. In Proceedings of the 6th Conference on Symposium on Operating Systems Design & Implementation (OSDI'04). USENIX Association, 10--10."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/SCC.2010.72"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICBNMT.2010.5705223"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/CCGrid.2014.51"},{"volume-title":"2009 IEEE International Conference on Cluster Computing and Workshops. 1--8.","author":"Liu X.","key":"e_1_3_2_1_7_1","unstructured":"X. Liu , J. Han , Y. Zhong , C. Han , and X. He . 2009. Implementing WebGIS on Hadoop: A case study of improving small file I\/O performance on HDFS . In 2009 IEEE International Conference on Cluster Computing and Workshops. 1--8. X. Liu, J. Han, Y. Zhong, C. Han, and X. He. 2009. Implementing WebGIS on Hadoop: A case study of improving small file I\/O performance on HDFS. In 2009 IEEE International Conference on Cluster Computing and Workshops. 1--8."},{"key":"e_1_3_2_1_8_1","volume":"200","author":"Mackey G.","unstructured":"G. Mackey , S. Sehrish , and J. Wang. 200 9. Improving metadata management for small files in HDFS. In 2009 IEEE International Conference on Cluster Computing and Workshops. 1--4. G. Mackey, S. Sehrish, and J. Wang. 2009. Improving metadata management for small files in HDFS. In 2009 IEEE International Conference on Cluster Computing and Workshops. 1--4.","journal-title":"J. Wang."},{"key":"e_1_3_2_1_9_1","unstructured":"S. Radia and S. Srinivas. 2010. Scaling HDFS Cluster Using Namenode Federation HDFS-1052. (2010).  S. Radia and S. Srinivas. 2010. Scaling HDFS Cluster Using Namenode Federation HDFS-1052. (2010)."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/MSST.2010.5496972"},{"volume-title":"Computer Science and Software Engineering (JCSSE), 2014 11th International Joint Conference on. 200--205","author":"Vorapongkitipun C.","key":"e_1_3_2_1_11_1","unstructured":"C. Vorapongkitipun and N. Nupairoj . 2014. Improving performance of small-file accessing in Hadoop . In Computer Science and Software Engineering (JCSSE), 2014 11th International Joint Conference on. 200--205 . C. Vorapongkitipun and N. Nupairoj. 2014. Improving performance of small-file accessing in Hadoop. In Computer Science and Software Engineering (JCSSE), 2014 11th International Joint Conference on. 200--205."},{"key":"e_1_3_2_1_12_1","volume-title":"Workshops and Phd Forum (IPDPSW), 2010 IEEE International Symposium on. IEEE, 1--9.","author":"Xie Jiong","year":"2010","unstructured":"Jiong Xie , Shu Yin , Xiaojun Ruan , Zhiyang Ding , Yun Tian , James Majors , Adam Manzanares , and Xiao Qin . 2010 . Improving mapreduce performance through data placement in heterogeneous hadoop clusters. In Parallel & Distributed Processing , Workshops and Phd Forum (IPDPSW), 2010 IEEE International Symposium on. IEEE, 1--9. Jiong Xie, Shu Yin, Xiaojun Ruan, Zhiyang Ding, Yun Tian, James Majors, Adam Manzanares, and Xiao Qin. 2010. Improving mapreduce performance through data placement in heterogeneous hadoop clusters. In Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW), 2010 IEEE International Symposium on. IEEE, 1--9."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-11194-0_5"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/2517349.2522737"}],"event":{"name":"CF '17: Computing Frontiers Conference","sponsor":["SIGMICRO ACM Special Interest Group on Microarchitectural Research and Processing"],"location":"Siena Italy","acronym":"CF '17"},"container-title":["Proceedings of the Computing Frontiers Conference"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3075564.3078888","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3075564.3078888","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T03:03:42Z","timestamp":1750215822000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3075564.3078888"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,5,15]]},"references-count":14,"alternative-id":["10.1145\/3075564.3078888","10.1145\/3075564"],"URL":"https:\/\/doi.org\/10.1145\/3075564.3078888","relation":{},"subject":[],"published":{"date-parts":[[2017,5,15]]},"assertion":[{"value":"2017-05-15","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}