{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,1]],"date-time":"2025-06-01T22:29:01Z","timestamp":1748816941056},"publisher-location":"Cham","reference-count":31,"publisher":"Springer International Publishing","isbn-type":[{"type":"print","value":"9783319596464"},{"type":"electronic","value":"9783319596471"}],"license":[{"start":{"date-parts":[[2017,1,1]],"date-time":"2017-01-01T00:00:00Z","timestamp":1483228800000},"content-version":"unspecified","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2017]]},"DOI":"10.1007\/978-3-319-59647-1_31","type":"book-chapter","created":{"date-parts":[[2017,5,12]],"date-time":"2017-05-12T22:53:30Z","timestamp":1494629610000},"page":"421-438","source":"Crossref","is-referenced-by-count":10,"title":["An Executable Sequential Specification for Spark Aggregation"],"prefix":"10.1007","author":[{"given":"Yu-Fang","family":"Chen","sequence":"first","affiliation":[]},{"given":"Chih-Duo","family":"Hong","sequence":"additional","affiliation":[]},{"given":"Ond\u0159ej","family":"Leng\u00e1l","sequence":"additional","affiliation":[]},{"given":"Shin-Cheng","family":"Mu","sequence":"additional","affiliation":[]},{"given":"Nishant","family":"Sinha","sequence":"additional","affiliation":[]},{"given":"Bow-Yaw","family":"Wang","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2017,5,14]]},"reference":[{"key":"31_CR1","unstructured":"Apache Spark. https:\/\/github.com\/apache\/spark"},{"key":"31_CR2","unstructured":"IBM DB2 Version 9.7. Partitioned Tables. https:\/\/ibm.biz\/BdHyYR"},{"key":"31_CR3","unstructured":"The Scalaz project. https:\/\/github.com\/scalaz"},{"key":"31_CR4","unstructured":"PureSpark. https:\/\/github.com\/guluchen\/purespark"},{"key":"31_CR5","doi-asserted-by":"crossref","unstructured":"Bennett, J., Grout, R., Pebay, P., Roe, D., Thompson, D.: Numerically stable, single-pass, parallel statistics algorithms. In: CLUSTER, pp. 1\u20138 (2009)","DOI":"10.1109\/CLUSTR.2009.5289161"},{"key":"31_CR6","doi-asserted-by":"crossref","unstructured":"Bird, R.S.: An introduction to the theory of lists. In: Broy, M. (eds) Logic of Programming and Calculi of Discrete Design. NATO ASI Series (Series F: Computer and Systems Sciences), vol. 36, pp. 5\u201342. Springer, Heidelberg (1987)","DOI":"10.1007\/978-3-642-87374-4_1"},{"key":"31_CR7","doi-asserted-by":"crossref","unstructured":"Bocchino Jr., R.L., Adve, V.S., Dig, D., Adve, S.V., Heumann, S., Komuravelli, R., Overbey, J., Simmons, P., Sung, H., Vakilian, M.: A type and effect system for deterministic parallel Java. In: OOPSLA, pp. 97\u2013116 (2009)","DOI":"10.1145\/1640089.1640097"},{"key":"31_CR8","doi-asserted-by":"crossref","unstructured":"Bocchino Jr., R.L., Heumann, S., Honarmand, N., Adve, S.V., Adve, V.S., Welc, A., Shpeisman, T.: Safe nondeterminism in a deterministic-by-default parallel language. SIGPLAN Not. 46(1), 535\u2013548 (2011)","DOI":"10.1145\/1925844.1926447"},{"issue":"3\u20134","key":"31_CR9","first-page":"203","volume":"18","author":"Z Budimlic","year":"2010","unstructured":"Budimlic, Z., Burke, M.G., Cav\u00e9, V., Knobe, K., Lowney, G., Newton, R., Palsberg, J., Peixotto, D.M., Sarkar, V., Schlimbach, F., Tasirlar, S.: Concurrent collections. Sci. Program. 18(3\u20134), 203\u2013217 (2010)","journal-title":"Sci. Program."},{"issue":"6","key":"31_CR10","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1145\/1743546.1743572","volume":"53","author":"J Burnim","year":"2010","unstructured":"Burnim, J., Sen, K.: Asserting and checking determinism for multithreaded programs. Commun. ACM 53(6), 97\u2013105 (2010)","journal-title":"Commun. ACM"},{"key":"31_CR11","doi-asserted-by":"crossref","unstructured":"Chaudhuri, S.: An overview of query optimization in relational systems. In: PODS 1998 (1998)","DOI":"10.1145\/275487.275492"},{"key":"31_CR12","doi-asserted-by":"crossref","unstructured":"Chen, Y., Hong, C., Leng\u00e1l, O., Mu, S., Sinha, N., Wang, B.: An executable sequential specification for Spark aggregation arXiv:1702.02439 [cs.DC] (2017)","DOI":"10.1007\/978-3-319-59647-1_31"},{"key":"31_CR13","series-title":"Lecture Notes in Computer Science","doi-asserted-by":"publisher","first-page":"131","DOI":"10.1007\/978-3-662-46681-0_9","volume-title":"Tools and Algorithms for the Construction and Analysis of Systems","author":"Y-F Chen","year":"2015","unstructured":"Chen, Y.-F., Hong, C.-D., Sinha, N., Wang, B.-Y.: Commutativity of reducers. In: Baier, C., Tinelli, C. (eds.) TACAS 2015. LNCS, vol. 9035, pp. 131\u2013146. Springer, Heidelberg (2015). doi: 10.1007\/978-3-662-46681-0_9"},{"key":"31_CR14","doi-asserted-by":"crossref","unstructured":"Chu, C., Kim, S.K., Lin, Y., Yu, Y., Bradski, G.R., Ng, A.Y., Olukotun, K.: Map-Reduce for machine learning on multicore. In: NIPS, pp. 281\u2013288 (2006)","DOI":"10.7551\/mitpress\/7503.003.0040"},{"issue":"1","key":"31_CR15","doi-asserted-by":"crossref","first-page":"72","DOI":"10.1145\/1629175.1629198","volume":"53","author":"J Dean","year":"2010","unstructured":"Dean, J., Ghemawat, S.: MapReduce: a flexible data processing tool. Commun. ACM 53(1), 72\u201377 (2010)","journal-title":"Commun. ACM"},{"issue":"7","key":"31_CR16","doi-asserted-by":"crossref","first-page":"1734","DOI":"10.1002\/cpe.3333","volume":"27","author":"J D\u00f6rre","year":"2015","unstructured":"D\u00f6rre, J., Apel, S., Lengauer, C.: Modeling and optimizing MapReduce programs. Concurrency Comput. Pract. Experience 27(7), 1734\u20131766 (2015)","journal-title":"Concurrency Comput. Pract. Experience"},{"key":"31_CR17","series-title":"Lecture Notes in Computer Science","doi-asserted-by":"publisher","first-page":"254","DOI":"10.1007\/978-3-642-28869-2_13","volume-title":"Programming Languages and Systems","author":"K Emoto","year":"2012","unstructured":"Emoto, K., Fischer, S., Hu, Z.: Generate, test, and aggregate. In: Seidl, H. (ed.) ESOP 2012. LNCS, vol. 7211, pp. 254\u2013273. Springer, Heidelberg (2012). doi: 10.1007\/978-3-642-28869-2_13"},{"issue":"11","key":"31_CR18","doi-asserted-by":"crossref","first-page":"1111","DOI":"10.14778\/3402707.3402746","volume":"4","author":"H Herodotou","year":"2011","unstructured":"Herodotou, H., Babu, S.: Profiling, what-if analysis, and cost-based optimization of MapReduce programs. Proc. VLDB Endowment 4(11), 1111\u20131122 (2011)","journal-title":"Proc. VLDB Endowment"},{"key":"31_CR19","doi-asserted-by":"crossref","unstructured":"Herodotou, H., Borisov, N., Babu, S.: Query optimization techniques for partitioned tables. In: SIGMOD 2011, pp. 49\u201360 (2011)","DOI":"10.1145\/1989323.1989330"},{"issue":"1","key":"31_CR20","doi-asserted-by":"crossref","first-page":"121","DOI":"10.1145\/234313.234367","volume":"28","author":"YE Ioannidis","year":"1996","unstructured":"Ioannidis, Y.E.: Query optimization. ACM Comput. Surv. 28(1), 121\u2013123 (1996)","journal-title":"ACM Comput. Surv."},{"key":"31_CR21","doi-asserted-by":"crossref","unstructured":"Karloff, H., Suri, S., Vassilvitskii, S.: A model of computation for MapReduce. In: SODA, pp. 938\u2013948 (2010)","DOI":"10.1137\/1.9781611973075.76"},{"key":"31_CR22","doi-asserted-by":"crossref","unstructured":"Leijen, D., F\u00e4hndrich, M., Burckhardt, S.: Prettier concurrency: Purely functional concurrent revisions. In: Haskell, pp. 83\u201394 (2011)","DOI":"10.1145\/2034675.2034686"},{"key":"31_CR23","doi-asserted-by":"crossref","unstructured":"Liu, C., Zhang, J., Zhou, H., McDirmid, S., Guo, Z., Moscibroda, T.: Automating distributed partial aggregation. In: SoCC, pp. 1:1\u20131:12 (2014)","DOI":"10.1145\/2670979.2670980"},{"key":"31_CR24","doi-asserted-by":"crossref","unstructured":"Radoi, C., Fink, S.J., Rabbah, R.M., Sridharan, M.: Translating imperative code to MapReduce. In: OOPSLA, pp. 909\u2013927 (2014)","DOI":"10.1145\/2660193.2660228"},{"key":"31_CR25","doi-asserted-by":"crossref","unstructured":"Sakr, S., Liu, A., Fayoumi, A.G.: The family of MapReduce and large-scale data processing systems. ACM Comput. Surv. 46(1), 11:1\u201311:44 (2013)","DOI":"10.1145\/2522968.2522979"},{"key":"31_CR26","doi-asserted-by":"crossref","unstructured":"Tian, Y., Tatikonda, S., Reinwald, B.: Scalable and numerically stable descriptive statistics in SystemML. In: ICDE, pp. 1351\u20131359 (2012)","DOI":"10.1109\/ICDE.2012.12"},{"key":"31_CR27","doi-asserted-by":"crossref","unstructured":"Xiao, T., Zhang, J., Zhou, H., Guo, Z., McDirmid, S., Lin, W., Chen, W., Zhou, L.: Nondeterminism in MapReduce considered harmful? an empirical study on non-commutative aggregators in MapReduce programs. In: Companion Proceedings of ICSE, pp. 44\u201353 (2014)","DOI":"10.1145\/2591062.2591177"},{"key":"31_CR28","doi-asserted-by":"crossref","unstructured":"Xu, Z., Hirzel, M., Rothermel, G.: Semantic characterization of MapReduce workloads. In: IISWC, pp. 87\u201397 (2013)","DOI":"10.1109\/IISWC.2013.6704673"},{"key":"31_CR29","unstructured":"Zaharia, M., Chowdhury, M., Das, T., Dave, A., Ma, J., McCauly, M., Franklin, M.J., Shenker, S., Stoica, I.: Resilient distributed datasets: a fault-tolerant abstraction for in-memory cluster computing. In: NSDI, pp. 15\u201328 (2012)"},{"issue":"11","key":"31_CR30","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1145\/2934664","volume":"59","author":"M Zaharia","year":"2016","unstructured":"Zaharia, M., Xin, R.S., Wendell, P., Das, T., Armbrust, M., Dave, A., Meng, X., Rosen, J., Venkataraman, S., Franklin, M.J., Ghodsi, A., Gonzalez, J., Shenker, S., Stoica, I.: Apache spark: a unified engine for big data processing. Commun. ACM 59(11), 56\u201365 (2016)","journal-title":"Commun. ACM"},{"key":"31_CR31","doi-asserted-by":"crossref","unstructured":"Zhang, Z., Cherkasova, L., Verma, A., Loo, B.T.: Performance modeling and optimization of deadline-driven Pig programs. ACM Trans. Auton. Adapt. Syst. 8(3), 14:1\u201314:28 (2013)","DOI":"10.1145\/2518017.2518019"}],"container-title":["Lecture Notes in Computer Science","Networked Systems"],"original-title":[],"link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/978-3-319-59647-1_31","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,8,23]],"date-time":"2023-08-23T11:04:14Z","timestamp":1692788654000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/978-3-319-59647-1_31"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017]]},"ISBN":["9783319596464","9783319596471"],"references-count":31,"URL":"https:\/\/doi.org\/10.1007\/978-3-319-59647-1_31","relation":{},"ISSN":["0302-9743","1611-3349"],"issn-type":[{"type":"print","value":"0302-9743"},{"type":"electronic","value":"1611-3349"}],"subject":[],"published":{"date-parts":[[2017]]}}}