{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,27]],"date-time":"2026-03-27T02:38:33Z","timestamp":1774579113922,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":37,"publisher":"ACM","license":[{"start":{"date-parts":[[2014,6,23]],"date-time":"2014-06-23T00:00:00Z","timestamp":1403481600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2014,6,23]]},"DOI":"10.1145\/2600212.2600229","type":"proceedings-article","created":{"date-parts":[[2014,6,20]],"date-time":"2014-06-20T13:06:05Z","timestamp":1403269565000},"page":"165-176","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":95,"title":["MRONLINE"],"prefix":"10.1145","author":[{"given":"Min","family":"Li","sequence":"first","affiliation":[{"name":"Virginia Tech, Blacksburg, VA, USA"}]},{"given":"Liangzhao","family":"Zeng","sequence":"additional","affiliation":[{"name":"IBM TJ Watson Research Center, Yorktown Heights, NY, USA"}]},{"given":"Shicong","family":"Meng","sequence":"additional","affiliation":[{"name":"IBM TJ Watson Research Center, Yorktown Heights, NY, USA"}]},{"given":"Jian","family":"Tan","sequence":"additional","affiliation":[{"name":"IBM TJ Watson Research Center, Yorktown Heights, NY, USA"}]},{"given":"Li","family":"Zhang","sequence":"additional","affiliation":[{"name":"IBM TJ Watson Research Center, Yorktown Heights, NY, USA"}]},{"given":"Ali R.","family":"Butt","sequence":"additional","affiliation":[{"name":"Virginia Tech, Blacksburg, VA, USA"}]},{"given":"Nicholas","family":"Fuller","sequence":"additional","affiliation":[{"name":"IBM TJ Watson Research Center, Yorktown Heights, NY, USA"}]}],"member":"320","published-online":{"date-parts":[[2014,6,23]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"7 tips for improving MapReduce performance","year":"2009","unstructured":"Cloudera. 7 tips for improving MapReduce performance , 2009 . http:\/\/blog.cloudera.com\/blog\/2009\/12\/7-tips-forimproving-mapreduce-performance\/. Cloudera. 7 tips for improving MapReduce performance, 2009. http:\/\/blog.cloudera.com\/blog\/2009\/12\/7-tips-forimproving-mapreduce-performance\/."},{"key":"e_1_3_2_1_2_1","volume-title":"Optimizing MapReduce job performance","year":"2012","unstructured":"Cloudera. Optimizing MapReduce job performance , 2012 . http:\/\/www.slideshare.net\/cloudera\/mr-perf. Cloudera. Optimizing MapReduce job performance, 2012. http:\/\/www.slideshare.net\/cloudera\/mr-perf."},{"key":"e_1_3_2_1_3_1","volume-title":"Proc. USENIX OSDI","author":"Dean J.","year":"2004","unstructured":"J. Dean and S. Ghemawat . MapReduce: Simplified data processing on large clusters . In Proc. USENIX OSDI , 2004 . J. Dean and S. Ghemawat. MapReduce: Simplified data processing on large clusters. In Proc. USENIX OSDI, 2004."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.14778\/1920841.1920908"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.14778\/1687627.1687767"},{"key":"e_1_3_2_1_6_1","volume-title":"Mumak: MapReduce simulator","author":"Foundation A. S.","year":"2009","unstructured":"A. S. Foundation . Mumak: MapReduce simulator , 2009 . https:\/\/issues.apache.org\/jira\/browse\/MAPREDUCE-728. A. S. Foundation. Mumak: MapReduce simulator, 2009. https:\/\/issues.apache.org\/jira\/browse\/MAPREDUCE-728."},{"key":"e_1_3_2_1_7_1","unstructured":"A. S. Foundation. Apache Giraph 2013. http:\/\/giraph.apache.org\/.  A. S. Foundation. Apache Giraph 2013. http:\/\/giraph.apache.org\/."},{"key":"e_1_3_2_1_8_1","unstructured":"A. S. Foundation. Hadoop-2.1.0-Beta 2013. http:\/\/www.trieuvan.com\/apache\/hadoop\/common\/hadoop-2.1.0-beta\/.  A. S. Foundation. Hadoop-2.1.0-Beta 2013. http:\/\/www.trieuvan.com\/apache\/hadoop\/common\/hadoop-2.1.0-beta\/."},{"key":"e_1_3_2_1_9_1","volume-title":"Grep example","author":"Foundation A. S.","year":"2014","unstructured":"A. S. Foundation . Grep example , 2014 . http:\/\/wiki.apache.org\/hadoop\/Grep. A. S. Foundation. Grep example, 2014. http:\/\/wiki.apache.org\/hadoop\/Grep."},{"key":"e_1_3_2_1_10_1","volume-title":"Terasort example","author":"Foundation A. S.","year":"2014","unstructured":"A. S. Foundation . Terasort example , 2014 . https:\/\/hadoop.apache.org\/docs\/current\/api\/org\/apache\/hadoop\/examples\/terasort\/package-summary.html. A. S. Foundation. Terasort example, 2014. https:\/\/hadoop.apache.org\/docs\/current\/api\/org\/apache\/hadoop\/examples\/terasort\/package-summary.html."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/1165389.945450"},{"key":"e_1_3_2_1_12_1","volume-title":"Proc. USENIX NSDI","author":"Ghodsi A.","year":"2011","unstructured":"A. Ghodsi , M. Zaharia , B. Hindman , A. Konwinski , S. Shenker , and I. Stoica . Dominant resource fairness: fair allocation of multiple resource types . In Proc. USENIX NSDI , 2011 . A. Ghodsi, M. Zaharia, B. Hindman, A. Konwinski, S. Shenker, and I. Stoica. Dominant resource fairness: fair allocation of multiple resource types. In Proc. USENIX NSDI, 2011."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.14778\/3402707.3402746"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.14778\/3402755.3402792"},{"key":"e_1_3_2_1_15_1","volume-title":"Proc. Conference on Innovative Data System Research","author":"Herodotou H.","year":"2011","unstructured":"H. Herodotou , H. Lim , G. Luo , N. Borisov , L. Dong , F. B. Cetin , and S. Babu . Starfish: A self-tuning system for big data analytics . In Proc. Conference on Innovative Data System Research , 2011 . H. Herodotou, H. Lim, G. Luo, N. Borisov, L. Dong, F. B. Cetin, and S. Babu. Starfish: A self-tuning system for big data analytics. In Proc. Conference on Innovative Data System Research, 2011."},{"key":"e_1_3_2_1_16_1","volume-title":"Advanced Hadoop tuning and optimizations","year":"2009","unstructured":"Impetus. Advanced Hadoop tuning and optimizations , 2009 . http:\/\/www.slideshare.net\/ImpetusInfo\/ppt-on-advancedhadoop-tuning-n-optimisation. Impetus. Advanced Hadoop tuning and optimizations, 2009. http:\/\/www.slideshare.net\/ImpetusInfo\/ppt-on-advancedhadoop-tuning-n-optimisation."},{"key":"e_1_3_2_1_17_1","volume-title":"Hadoop performance tuning","year":"2012","unstructured":"Impetus. Hadoop performance tuning , 2012 . https:\/\/hadoop-toolkit.googlecode.com\/files\/Whitepaper-HadoopPerformanceTuning.pdf. Impetus. Hadoop performance tuning, 2012. https:\/\/hadoop-toolkit.googlecode.com\/files\/Whitepaper-HadoopPerformanceTuning.pdf."},{"key":"e_1_3_2_1_18_1","volume-title":"Freebase data dumps","author":"G. Inc.","year":"2013","unstructured":"G. Inc. Freebase data dumps , 2013 . https:\/\/developers.google.com\/freebase\/data. G. Inc. Freebase data dumps, 2013. https:\/\/developers.google.com\/freebase\/data."},{"key":"e_1_3_2_1_19_1","volume-title":"Spark: Lightning-fast cluster computing","author":"Incubator A.","year":"2013","unstructured":"A. Incubator . Spark: Lightning-fast cluster computing , 2013 . http:\/\/spark.incubator.apache.org\/. A. Incubator. Spark: Lightning-fast cluster computing, 2013. http:\/\/spark.incubator.apache.org\/."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.14778\/1978665.1978670"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.14778\/1920841.1920903"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.14778\/2180912.2180913"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/2213836.2213840"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/2371536.2371547"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-40047-6_42"},{"key":"e_1_3_2_1_26_1","volume-title":"Cloud9: A hadoop toolkit for working with big data","author":"Lin J.","year":"2010","unstructured":"J. Lin and C. Dyer . Cloud9: A hadoop toolkit for working with big data , 2010 . http:\/\/lintool.github.io\/Cloud9\/index.html. J. Lin and C. Dyer. Cloud9: A hadoop toolkit for working with big data, 2010. http:\/\/lintool.github.io\/Cloud9\/index.html."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/1807167.1807184"},{"key":"e_1_3_2_1_28_1","volume-title":"Storm: Distributed and fault-tolerant realtime computation","year":"2013","unstructured":"Twitter. Storm: Distributed and fault-tolerant realtime computation , 2013 . http:\/\/storm-project.net\/. Twitter. Storm: Distributed and fault-tolerant realtime computation, 2013. http:\/\/storm-project.net\/."},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/2523616.2523633"},{"key":"e_1_3_2_1_30_1","volume-title":"Proc. IEEE MASCOTS","author":"Wang G.","year":"2009","unstructured":"G. Wang , A. R. Butt , P. Pandey , and K. Gupta . A simulation approach to evaluating design decisions in mapreduce setups . In Proc. IEEE MASCOTS , 2009 . G. Wang, A. R. Butt, P. Pandey, and K. Gupta. A simulation approach to evaluating design decisions in mapreduce setups. In Proc. IEEE MASCOTS, 2009."},{"key":"e_1_3_2_1_31_1","volume-title":"The Definitive Guide. O'Reilly","author":"White T.","year":"2012","unstructured":"T. White . Hadoop : The Definitive Guide. O'Reilly , 2012 . T. White. Hadoop: The Definitive Guide. O'Reilly, 2012."},{"key":"e_1_3_2_1_32_1","volume-title":"Wikipedia data dumps","year":"2014","unstructured":"Wikipedia. Wikipedia data dumps , 2014 . http:\/\/dumps.wikimedia.org\/enwiki\/latest\/. Wikipedia. Wikipedia data dumps, 2014. http:\/\/dumps.wikimedia.org\/enwiki\/latest\/."},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/988672.988711"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/885651.781052"},{"key":"e_1_3_2_1_35_1","volume-title":"Proc. USENIX Conference on Hot Topics in Cloud Computing","author":"Zaharia M.","year":"2010","unstructured":"M. Zaharia , M. Chowdhury , M. J. Franklin , S. Shenker , and I. Stoica . Spark: Cluster computing with working sets . In Proc. USENIX Conference on Hot Topics in Cloud Computing , 2010 . M. Zaharia, M. Chowdhury, M. J. Franklin, S. Shenker, and I. Stoica. Spark: Cluster computing with working sets. In Proc. USENIX Conference on Hot Topics in Cloud Computing, 2010."},{"key":"e_1_3_2_1_36_1","volume-title":"Proc. USENIX NSDI","author":"Zhang J.","year":"2012","unstructured":"J. Zhang , H. Zhou , R. Chen , X. Fan , Z. Guo , H. Lin , J. Y. Li , W. Lin , J. Zhou , and L. Zhou . Optimizing data shuffling in data-parallel computation by understanding user-defined functions . In Proc. USENIX NSDI , 2012 . J. Zhang, H. Zhou, R. Chen, X. Fan, Z. Guo, H. Lin, J. Y. Li, W. Lin, J. Zhou, and L. Zhou. Optimizing data shuffling in data-parallel computation by understanding user-defined functions. In Proc. USENIX NSDI, 2012."},{"key":"e_1_3_2_1_37_1","volume-title":"Proc. USENIX ATC","author":"Zheng W.","year":"2009","unstructured":"W. Zheng , R. Bianchini , G. J. Janakiraman , J. R. Santos , and Y. Turner . JustRunIt: Experiment-based management of virtualized data centers . In Proc. USENIX ATC , 2009 . W. Zheng, R. Bianchini, G. J. Janakiraman, J. R. Santos, and Y. Turner. JustRunIt: Experiment-based management of virtualized data centers. In Proc. USENIX ATC, 2009."}],"event":{"name":"HPDC'14: The 23rd International Symposium on High-Performance Parallel and Distributed Computing","location":"Vancouver BC Canada","acronym":"HPDC'14","sponsor":["SIGARCH ACM Special Interest Group on Computer Architecture"]},"container-title":["Proceedings of the 23rd international symposium on High-performance parallel and distributed computing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2600212.2600229","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2600212.2600229","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T08:10:26Z","timestamp":1750234226000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2600212.2600229"}},"subtitle":["MapReduce online performance tuning"],"short-title":[],"issued":{"date-parts":[[2014,6,23]]},"references-count":37,"alternative-id":["10.1145\/2600212.2600229","10.1145\/2600212"],"URL":"https:\/\/doi.org\/10.1145\/2600212.2600229","relation":{},"subject":[],"published":{"date-parts":[[2014,6,23]]},"assertion":[{"value":"2014-06-23","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}