{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,9,9]],"date-time":"2024-09-09T11:22:10Z","timestamp":1725880930474},"publisher-location":"Cham","reference-count":22,"publisher":"Springer International Publishing","isbn-type":[{"type":"print","value":"9783319543338"},{"type":"electronic","value":"9783319543345"}],"license":[{"start":{"date-parts":[[2017,1,1]],"date-time":"2017-01-01T00:00:00Z","timestamp":1483228800000},"content-version":"unspecified","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2017]]},"DOI":"10.1007\/978-3-319-54334-5_4","type":"book-chapter","created":{"date-parts":[[2017,2,17]],"date-time":"2017-02-17T01:53:38Z","timestamp":1487296418000},"page":"45-60","source":"Crossref","is-referenced-by-count":0,"title":["Benchmarking Spark Machine Learning Using BigBench"],"prefix":"10.1007","author":[{"given":"Sweta","family":"Singh","sequence":"first","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2017,2,18]]},"reference":[{"key":"4_CR1","unstructured":"Apache Spark. http:\/\/spark.apache.org\/"},{"key":"4_CR2","unstructured":"dashDB. http:\/\/www.ibm.com\/analytics\/us\/en\/technology\/cloud-data-services\/dashdb\/"},{"key":"4_CR3","unstructured":"dashDB Local. http:\/\/www.ibm.com\/analytics\/us\/en\/technology\/cloud-data-services\/dashdb-local\/"},{"key":"4_CR4","unstructured":"UCI Machine Learning Repository. http:\/\/archive.ics.uci.edu\/ml\/"},{"key":"4_CR5","unstructured":"IBM SPSS. http:\/\/www.ibm.com\/analytics\/us\/en\/technology\/spss\/spss.html"},{"key":"4_CR6","unstructured":"ftp:\/\/public.dhe.ibm.com\/software\/analytics\/spss\/documentation\/modeler\/16.0\/en\/modeler_applications_guide_book.pdf"},{"key":"4_CR7","doi-asserted-by":"crossref","unstructured":"Ghazal, A., et al.: BigBench: towards an industry standard benchmark for big data analytics. In: Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data. ACM (2013)","DOI":"10.1145\/2463676.2463712"},{"key":"4_CR8","series-title":"Lecture Notes in Computer Science","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1007\/978-3-319-10596-3_1","volume-title":"Advancing Big Data Benchmarks","author":"B Chowdhury","year":"2014","unstructured":"Chowdhury, B., Rabl, T., Saadatpanah, P., Du, J., Jacobsen, H.-A.: A BigBench implementation in the hadoop ecosystem. In: Rabl, T., Jacobsen, H.-A., Raghunath, N., Poess, M., Bhandarkar, M., Baru, C. (eds.) WBDB 2013. LNCS, vol. 8585, pp. 3\u201318. Springer, Heidelberg (2014). doi: 10.1007\/978-3-319-10596-3_1"},{"key":"4_CR9","doi-asserted-by":"publisher","first-page":"44","DOI":"10.1007\/978-3-319-15350-6_4","volume-title":"TPCTC 2014","author":"C Baru","year":"2015","unstructured":"Baru, C., et al.: Discussion of BigBench: a proposed industry standard performance benchmark for big data. In: Nambiar, R., Poess, M. (eds.) TPCTC 2014. LNCS, vol. 8904, pp. 44\u201363. Springer, Cham (2015). doi: 10.1007\/978-3-319-15350-6_4"},{"key":"4_CR10","series-title":"Lecture Notes in Computer Science","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-04936-6","volume-title":"Performance Characterization and Benchmarking","year":"2014","unstructured":"Nambiar, R., Poess, M. (eds.): TPCTC 2013. LNCS, vol. 8391. Springer, Heidelberg (2014). doi: 10.1007\/978-3-319-04936-6"},{"key":"4_CR11","unstructured":"Meng, X., et al.: Mllib: Machine learning in apache spark. JMLR 17(34), 1\u20137 (2016)"},{"key":"4_CR12","series-title":"Lecture Notes in Computer Science","doi-asserted-by":"publisher","first-page":"26","DOI":"10.1007\/978-3-319-31409-9_3","volume-title":"Performance Evaluation and Benchmarking: Traditional to Big Data to Internet of Things","author":"D Agrawal","year":"2016","unstructured":"Agrawal, D., et al.: SparkBench \u2013 a spark performance testing suite. In: Nambiar, R., Poess, M. (eds.) TPCTC 2015. LNCS, vol. 9508, pp. 26\u201344. Springer, Heidelberg (2016). doi: 10.1007\/978-3-319-31409-9_3"},{"key":"4_CR13","doi-asserted-by":"publisher","unstructured":"Su, X., Khoshgoftaar, T.M.: A survey of collaborative filtering techniques. Adv. Artif. Intell. 2009, 19 (2009). Article ID 421425, doi: 10.1155\/2009\/421425","DOI":"10.1155\/2009\/421425"},{"issue":"8","key":"4_CR14","doi-asserted-by":"crossref","first-page":"30","DOI":"10.1109\/MC.2009.263","volume":"42","author":"Y Koren","year":"2009","unstructured":"Koren, Y., Bell, R., Volinsky, C.: Matrix factorization techniques for recommender systems. Computer 42(8), 30\u201337 (2009)","journal-title":"Computer"},{"key":"4_CR15","series-title":"Lecture Notes in Computer Science","doi-asserted-by":"publisher","first-page":"337","DOI":"10.1007\/978-3-540-68880-8_32","volume-title":"Algorithmic Aspects in Information and Management","author":"Y Zhou","year":"2008","unstructured":"Zhou, Y., Wilkinson, D., Schreiber, R., Pan, R.: Large-scale parallel collaborative filtering for the netflix prize. In: Fleischer, R., Xu, J. (eds.) AAIM 2008. LNCS, vol. 5034, pp. 337\u2013348. Springer, Heidelberg (2008). doi: 10.1007\/978-3-540-68880-8_32"},{"key":"4_CR16","doi-asserted-by":"crossref","unstructured":"Jain, P., Netrapalli, P., Sanghavi, S.: Low-rank matrix completion using alternating minimization. In: Proceedings of the Forty-Fifth Annual ACM Symposium on Theory of Computing. ACM (2013)","DOI":"10.1145\/2488608.2488693"},{"key":"4_CR17","unstructured":"Transaction Processing Performance Council. http:\/\/www.tpc.org"},{"key":"4_CR18","unstructured":"Zaharia, M., Chowdhury, M., Das, T., Dave, A., Ma,\u00a0J., McCauley, M., Franklin, M., Shenker, S., Stoica, I.: Resilient distributed datasets: a fault-tolerant abstraction for in-memory cluster computing. In: Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation. USENIX Association, p. 2 (2012)"},{"key":"4_CR19","unstructured":"Zaharia, M., Chowdhury, M., Franklin, M.J., Shenker, S., Stoica, I.: Spark: cluster computing with working sets. In: Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing, Boston, 22\u201325 June 2010, p. 10 (2010)"},{"key":"4_CR20","doi-asserted-by":"crossref","unstructured":"Pil\u00e1szy, I., Zibriczky, D., Tikk, D.: Fast als-based matrix factorization for explicit and implicit feedback datasets. In: Proceedings of the Fourth ACM Conference on Recommender Systems. ACM (2010)","DOI":"10.1145\/1864708.1864726"},{"key":"4_CR21","doi-asserted-by":"crossref","first-page":"202","DOI":"10.1214\/11-STS368","volume":"27","author":"A Feuerverger","year":"2012","unstructured":"Feuerverger, A., He, Y., Khatri, S.: Statistical significance of the Netflix challenge. Stat. Sci. 27, 202\u2013231 (2012)","journal-title":"Stat. Sci."},{"key":"4_CR22","first-page":"3367","volume":"16","author":"T Hastie","year":"2015","unstructured":"Hastie, T., et al.: Matrix completion and low-rank SVD via fast alternating least squares. J. Mach. Learn. Res. 16, 3367\u20133402 (2015)","journal-title":"J. Mach. Learn. Res."}],"container-title":["Lecture Notes in Computer Science","Performance Evaluation and Benchmarking. Traditional - Big Data - Interest of Things"],"original-title":[],"link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/978-3-319-54334-5_4","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2017,6,25]],"date-time":"2017-06-25T06:48:11Z","timestamp":1498373291000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/978-3-319-54334-5_4"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017]]},"ISBN":["9783319543338","9783319543345"],"references-count":22,"URL":"https:\/\/doi.org\/10.1007\/978-3-319-54334-5_4","relation":{},"ISSN":["0302-9743","1611-3349"],"issn-type":[{"type":"print","value":"0302-9743"},{"type":"electronic","value":"1611-3349"}],"subject":[],"published":{"date-parts":[[2017]]}}}