{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,22]],"date-time":"2026-03-22T05:57:11Z","timestamp":1774159031566,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":18,"publisher":"ACM","license":[{"start":{"date-parts":[[2013,6,22]],"date-time":"2013-06-22T00:00:00Z","timestamp":1371859200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2013,6,22]]},"DOI":"10.1145\/2463676.2465338","type":"proceedings-article","created":{"date-parts":[[2013,6,25]],"date-time":"2013-06-25T19:13:21Z","timestamp":1372187601000},"page":"939-942","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":24,"title":["Machine learning for big data"],"prefix":"10.1145","author":[{"given":"Tyson","family":"Condie","sequence":"first","affiliation":[{"name":"Microsoft, Redmond, WA, USA"}]},{"given":"Paul","family":"Mineiro","sequence":"additional","affiliation":[{"name":"Microsoft, Redmond, WA, USA"}]},{"given":"Neoklis","family":"Polyzotis","sequence":"additional","affiliation":[{"name":"UC Santa Cruz, Santa Cruz, CA, USA"}]},{"given":"Markus","family":"Weimer","sequence":"additional","affiliation":[{"name":"Microsoft, Redmond, WA, USA"}]}],"member":"320","published-online":{"date-parts":[[2013,6,22]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"A Reliable Effective Terascale Linear Learning System,\" arXiv.org","author":"Agarwal A.","year":"2012","unstructured":"A. Agarwal , O. Chapelle , M. Dudik and J. Langford , \" A Reliable Effective Terascale Linear Learning System,\" arXiv.org , 2012 . A. Agarwal, O. Chapelle, M. Dudik and J. Langford, \"A Reliable Effective Terascale Linear Learning System,\" arXiv.org, 2012."},{"key":"e_1_3_2_1_2_1","volume-title":"Large Scale Distributed Deep Networks,\" in Advances in Neural Information Processing Systems","author":"Dean J.","year":"2013","unstructured":"J. Dean , G. Corrado , R. Monga , K. Chen , M. Devin , Q. Le , M. Mao , A. Senior , P. Tucker , K. Yang and A. Ng , \" Large Scale Distributed Deep Networks,\" in Advances in Neural Information Processing Systems , 2013 . J. Dean, G. Corrado, R. Monga, K. Chen, M. Devin, Q. Le, M. Mao, A. Senior, P. Tucker, K. Yang and A. Ng, \"Large Scale Distributed Deep Networks,\" in Advances in Neural Information Processing Systems, 2013."},{"key":"e_1_3_2_1_3_1","unstructured":"The Apache Project \"Apache Hadoop NextGen MapReduce (YARN) \" The Apache Project {Online}. Available: http:\/\/hadoop.apache.org\/docs\/r0.23.0\/hadoop-yarn\/hadoop-yarn-site\/YARN.html.  The Apache Project \"Apache Hadoop NextGen MapReduce (YARN) \" The Apache Project {Online}. Available: http:\/\/hadoop.apache.org\/docs\/r0.23.0\/hadoop-yarn\/hadoop-yarn-site\/YARN.html."},{"key":"e_1_3_2_1_4_1","volume-title":"A Common Substrate for Cluster Computing,\" in HotCloud","author":"Hindman B.","year":"2009","unstructured":"B. Hindman , A. Konwinski , M. Zaharia and I. Stoica , \" A Common Substrate for Cluster Computing,\" in HotCloud , 2009 . B. Hindman, A. Konwinski, M. Zaharia and I. Stoica, \"A Common Substrate for Cluster Computing,\" in HotCloud, 2009."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/293347.293351"},{"key":"e_1_3_2_1_6_1","volume-title":"Map-Reduce for Machine Learning on Multicore,\" in Advances in Neural Information Processing Systems 19","author":"Chu C.-T.","year":"2007","unstructured":"C.-T. Chu , S. K. Kim , Y.-A. Lin , Y. Yu , G. Bradski and A. Y. Ng , \" Map-Reduce for Machine Learning on Multicore,\" in Advances in Neural Information Processing Systems 19 , Cambridge, MA , 2007 . C.-T. Chu, S. K. Kim, Y.-A. Lin, Y. Yu, G. Bradski and A. Y. Ng, \"Map-Reduce for Machine Learning on Multicore,\" in Advances in Neural Information Processing Systems 19, Cambridge, MA, 2007."},{"key":"e_1_3_2_1_7_1","unstructured":"The Apache Foundation \"Apache Pig \" 11 12 2012. {Online}. Available: http:\/\/pig.apache.org\/.  The Apache Foundation \"Apache Pig \" 11 12 2012. {Online}. Available: http:\/\/pig.apache.org\/."},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/1327452.1327492"},{"key":"e_1_3_2_1_9_1","unstructured":"The Apache Mahout Project \"Apache Mahout \" 17 9 2012. {Online}. Available: http:\/\/mahout.apache.org\/. {Accessed 17 9 2012}.  The Apache Mahout Project \"Apache Mahout \" 17 9 2012. {Online}. Available: http:\/\/mahout.apache.org\/. {Accessed 17 9 2012}."},{"key":"e_1_3_2_1_10_1","volume-title":"Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing,\" in USENIX NSDI","author":"Zaharia M.","year":"2012","unstructured":"M. Zaharia , M. Chowdhury , T. Das , A. Dave , J. Ma , M. McCauley , M. J. Franklin , S. Shenker and I. Stoica , \" Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing,\" in USENIX NSDI , San Jose, CA , 2012 . M. Zaharia, M. Chowdhury, T. Das, A. Dave, J. Ma, M. McCauley, M. J. Franklin, S. Shenker and I. Stoica, \"Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing,\" in USENIX NSDI, San Jose, CA, 2012."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/1807167.1807184"},{"key":"e_1_3_2_1_12_1","unstructured":"The Apache Software Foundation \"Apache Giraph \" {Online}. Available: http:\/\/giraph.apache.org\/.  The Apache Software Foundation \"Apache Giraph \" {Online}. Available: http:\/\/giraph.apache.org\/."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.14778\/2212351.2212354"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.tics.2007.09.004"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1162\/neco.2006.18.7.1527"},{"key":"e_1_3_2_1_16_1","volume-title":"Available: http:\/\/www.nytimes.com\/2012\/06\/26\/technology\/in-a-big-network-of-computers-evidence-of-machine-learning.html. {Accessed","author":"Markoff J.","year":"2012","unstructured":"J. Markoff , \"How Many Computers to Identify a Cat? 16,000,\" The New York Times, 25 June 2012. {Online}. Available: http:\/\/www.nytimes.com\/2012\/06\/26\/technology\/in-a-big-network-of-computers-evidence-of-machine-learning.html. {Accessed 11 December 2012 }. J. Markoff, \"How Many Computers to Identify a Cat? 16,000,\" The New York Times, 25 June 2012. {Online}. Available: http:\/\/www.nytimes.com\/2012\/06\/26\/technology\/in-a-big-network-of-computers-evidence-of-machine-learning.html. {Accessed 11 December 2012}."},{"key":"e_1_3_2_1_17_1","volume-title":"WWW 2012 Tutorial: New Templates for Scalable Data Analysis","author":"Smola A.","year":"2012","unstructured":"A. Smola , A. Ahmed and M. Weimer , \" WWW 2012 Tutorial: New Templates for Scalable Data Analysis ,\" June 2012 . {Online}. Available: http:\/\/www2012.wwwconference.org\/program\/tutorials\/ and http:\/\/cs.markusweimer.com\/2012\/04\/06\/www-2012-tutorial-new-templates-for-scalable-data-analysis\/. A. Smola, A. Ahmed and M. Weimer, \"WWW 2012 Tutorial: New Templates for Scalable Data Analysis,\" June 2012. {Online}. Available: http:\/\/www2012.wwwconference.org\/program\/tutorials\/ and http:\/\/cs.markusweimer.com\/2012\/04\/06\/www-2012-tutorial-new-templates-for-scalable-data-analysis\/."},{"key":"e_1_3_2_1_18_1","volume-title":"The Yahoo! Music Dataset and KDD-Cup'11,\" in Proceedings of KDDCup","author":"Dror G.","year":"2011","unstructured":"G. Dror , N. Koenigstein , Y. Koren and M. Weimer , \" The Yahoo! Music Dataset and KDD-Cup'11,\" in Proceedings of KDDCup 2011 , San Diego , CA , 2011. G. Dror, N. Koenigstein, Y. Koren and M. Weimer, \"The Yahoo! Music Dataset and KDD-Cup'11,\" in Proceedings of KDDCup 2011, San Diego, CA, 2011."}],"event":{"name":"SIGMOD\/PODS'13: International Conference on Management of Data","location":"New York New York USA","acronym":"SIGMOD\/PODS'13","sponsor":["SIGMOD ACM Special Interest Group on Management of Data"]},"container-title":["Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2463676.2465338","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2463676.2465338","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T08:39:06Z","timestamp":1750235946000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2463676.2465338"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2013,6,22]]},"references-count":18,"alternative-id":["10.1145\/2463676.2465338","10.1145\/2463676"],"URL":"https:\/\/doi.org\/10.1145\/2463676.2465338","relation":{},"subject":[],"published":{"date-parts":[[2013,6,22]]},"assertion":[{"value":"2013-06-22","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}