{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:29:19Z","timestamp":1750220959125,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":48,"publisher":"ACM","license":[{"start":{"date-parts":[[2019,4,4]],"date-time":"2019-04-04T00:00:00Z","timestamp":1554336000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2019,4,4]]},"DOI":"10.1145\/3297663.3310300","type":"proceedings-article","created":{"date-parts":[[2019,4,5]],"date-time":"2019-04-05T13:27:26Z","timestamp":1554470846000},"page":"91-102","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":5,"title":["Analyzing and Optimizing Java Code Generation for Apache Spark Query Plan"],"prefix":"10.1145","author":[{"given":"Kazuaki","family":"Ishizaki","sequence":"first","affiliation":[{"name":"IBM Research, Tokyo, Japan"}]}],"member":"320","published-online":{"date-parts":[[2019,4,4]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/3088525.3088527"},{"key":"e_1_3_2_1_2_1","unstructured":"Sameer Agarwal Davies Liu and Reynold Xin. 2016. Apache Spark as a Compiler: Joining a Billion Rows per Second on a Laptop. (2016). https:\/\/databricks.com\/blog\/2016\/05\/23\/apache-spark-as-a-compiler-joining-a-billion-rows-per-second-on-a-laptop.html  Sameer Agarwal Davies Liu and Reynold Xin. 2016. Apache Spark as a Compiler: Joining a Billion Rows per Second on a Laptop. (2016). https:\/\/databricks.com\/blog\/2016\/05\/23\/apache-spark-as-a-compiler-joining-a-billion-rows-per-second-on-a-laptop.html"},{"key":"e_1_3_2_1_3_1","volume-title":"https:\/\/hadoop.apache.org","author":"Hadoop Apache","year":"2007","unstructured":"Apache Hadoop . 2007. ( 2007 ). https:\/\/hadoop.apache.org . Apache Hadoop. 2007. (2007). https:\/\/hadoop.apache.org."},{"key":"e_1_3_2_1_4_1","volume-title":"https:\/\/parquet.apache.org","author":"Parquet Apache","year":"2013","unstructured":"Apache Parquet . 2013. ( 2013 ). https:\/\/parquet.apache.org . Apache Parquet. 2013. (2013). https:\/\/parquet.apache.org."},{"key":"e_1_3_2_1_5_1","unstructured":"Michael Armbrust Wenchen Fan Reynold Xin and Matei Zaharia. 2016. Introducing Apache Spark Datasets. (2016). https:\/\/databricks.com\/blog\/2016\/01\/04\/introducing-apache-spark-datasets.html  Michael Armbrust Wenchen Fan Reynold Xin and Matei Zaharia. 2016. Introducing Apache Spark Datasets. (2016). https:\/\/databricks.com\/blog\/2016\/01\/04\/introducing-apache-spark-datasets.html"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/2723372.2742797"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/BDCloud.2015.37"},{"key":"e_1_3_2_1_8_1","volume-title":"Conference on Innovative Data Systems Research (CIDR). 225--237","author":"Boncz Peter A.","year":"2005","unstructured":"Peter A. Boncz , Marcin Zukowski , and Niels Nes . 2005 . MonetDB\/X100: Hyper-Pipelining Query Execution . In Conference on Innovative Data Systems Research (CIDR). 225--237 . Peter A. Boncz, Marcin Zukowski, and Niels Nes. 2005. MonetDB\/X100: Hyper-Pipelining Query Execution. In Conference on Innovative Data Systems Research (CIDR). 225--237."},{"key":"e_1_3_2_1_9_1","unstructured":"Luca Canali. 2016. Voice from CERN: Apache Spark 2.0 Performance Improvements Investigated With Flame Graphs. (2016). https:\/\/databricks.com\/blog\/2016\/10\/03\/voice-from-cern-apache-spark-2-0-performance-improvements-investigated-with-flame-graphs.html  Luca Canali. 2016. Voice from CERN: Apache Spark 2.0 Performance Improvements Investigated With Flame Graphs. (2016). https:\/\/databricks.com\/blog\/2016\/10\/03\/voice-from-cern-apache-spark-2-0-performance-improvements-investigated-with-flame-graphs.html"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/2907294.2907310"},{"key":"e_1_3_2_1_11_1","volume-title":"TVM: An Automated End-to-End Optimizing Compiler for Deep Learning. In 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI 18)","author":"Chen Tianqi","year":"2018","unstructured":"Tianqi Chen , Thierry Moreau , Ziheng Jiang , Lianmin Zheng , Eddie Yan , Haichen Shen , Meghan Cowan , Leyuan Wang , Yuwei Hu , Luis Ceze , Carlos Guestrin , and Arvind Krishnamurthy . 2018 . TVM: An Automated End-to-End Optimizing Compiler for Deep Learning. In 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI 18) . 578--594. Tianqi Chen, Thierry Moreau, Ziheng Jiang, Lianmin Zheng, Eddie Yan, Haichen Shen, Meghan Cowan, Leyuan Wang, Yuwei Hu, Luis Ceze, Carlos Guestrin, and Arvind Krishnamurthy. 2018. TVM: An Automated End-to-End Optimizing Compiler for Deep Learning. In 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI 18). 578--594."},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISPASS.2016.7482079"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/318898.318923"},{"key":"e_1_3_2_1_14_1","unstructured":"Databricks. 2014. Spark Performance Tests. (2014). https:\/\/github.com\/databricks\/spark-perf\/  Databricks. 2014. Spark Performance Tests. (2014). https:\/\/github.com\/databricks\/spark-perf\/"},{"key":"e_1_3_2_1_15_1","volume-title":"Proceedings of the 6th Conference on Symposium on Operating Systems Design & Implementation (OSDI'04)","author":"Dean Jeffrey","year":"2004","unstructured":"Jeffrey Dean and Sanjay Ghemawat . 2004 . MapReduce: Simplified Data Processing on Large Clusters . In Proceedings of the 6th Conference on Symposium on Operating Systems Design & Implementation (OSDI'04) . 10--10. Jeffrey Dean and Sanjay Ghemawat. 2004. MapReduce: Simplified Data Processing on Large Clusters. In Proceedings of the 6th Conference on Symposium on Operating Systems Design & Implementation (OSDI'04). 10--10."},{"key":"e_1_3_2_1_16_1","volume-title":"Flare: Optimizing Apache Spark with Native Compilation for Scale-Up Architectures and Medium-Size Data. In 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI 18)","author":"Essertel Gregory","year":"2018","unstructured":"Gregory Essertel , Ruby Tahboub , James Decker , Kevin Brown , Kunle Olukotun , and Tiark Rompf . 2018 . Flare: Optimizing Apache Spark with Native Compilation for Scale-Up Architectures and Medium-Size Data. In 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI 18) . 799--815. Gregory Essertel, Ruby Tahboub, James Decker, Kevin Brown, Kunle Olukotun, and Tiark Rompf. 2018. Flare: Optimizing Apache Spark with Native Compilation for Scale-Up Architectures and Medium-Size Data. In 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI 18). 799--815."},{"key":"e_1_3_2_1_17_1","volume-title":"Proceedings of the 11th USENIX Conference on Operating Systems Design and Implementation (OSDI'14)","author":"Gonzalez Joseph E.","year":"2014","unstructured":"Joseph E. Gonzalez , Reynold Xin , Ankur Dave , Daniel Crankshaw , Michael J. Franklin , and Ion Stoica . 2014 . GraphX: Graph Processing in a Distributed Dataflow Framework . In Proceedings of the 11th USENIX Conference on Operating Systems Design and Implementation (OSDI'14) . 599--613. Joseph E. Gonzalez, Reynold Xin, Ankur Dave, Daniel Crankshaw, Michael J. Franklin, and Ion Stoica. 2014. GraphX: Graph Processing in a Distributed Dataflow Framework. In Proceedings of the 11th USENIX Conference on Operating Systems Design and Implementation (OSDI'14). 599--613."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/69.273032"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/2907294.2907307"},{"key":"e_1_3_2_1_20_1","unstructured":"Kazuaki Ishizaki. 2016a. SPARK-15985: Eliminate redundant cast from an array without null or a map without null. (2016). https:\/\/github.com\/apache\/spark\/pull\/13704  Kazuaki Ishizaki. 2016a. SPARK-15985: Eliminate redundant cast from an array without null or a map without null. (2016). https:\/\/github.com\/apache\/spark\/pull\/13704"},{"key":"e_1_3_2_1_21_1","unstructured":"Kazuaki Ishizaki. 2016b. SPARK-16213: Reduce runtime overhead of a program that creates an primitive array in DataFrame. (2016). https:\/\/github.com\/apache\/spark\/pull\/13704  Kazuaki Ishizaki. 2016b. SPARK-16213: Reduce runtime overhead of a program that creates an primitive array in DataFrame. (2016). https:\/\/github.com\/apache\/spark\/pull\/13704"},{"key":"e_1_3_2_1_22_1","unstructured":"Kazuaki Ishizaki. 2016c. SPARK-17490: Optimize SerializeFromObject() for a primitive array. (2016). https:\/\/github.com\/apache\/spark\/pull\/13704  Kazuaki Ishizaki. 2016c. SPARK-17490: Optimize SerializeFromObject() for a primitive array. (2016). https:\/\/github.com\/apache\/spark\/pull\/13704"},{"key":"e_1_3_2_1_23_1","unstructured":"Kazuaki Ishizaki. 2017. SPARK-20822: Generate code to directly get value from ColumnVector for table cache. (2017). https:\/\/github.com\/apache\/spark\/pull\/13704  Kazuaki Ishizaki. 2017. SPARK-20822: Generate code to directly get value from ColumnVector for table cache. (2017). https:\/\/github.com\/apache\/spark\/pull\/13704"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/2967938.2967957"},{"key":"e_1_3_2_1_25_1","volume-title":"Open-Source SQL Engine for Hadoop. In Conference on Innovative Data Systems Research (CIDR).","author":"Kornacker Marcel","year":"2015","unstructured":"Marcel Kornacker , Alexander Behm , Victor Bittorf , Taras Bobrovytsky , Casey Ching , Alan Choi , Justin Erickson , Martin Grund , Daniel Hecht , Matthew Jacobs , Ishaan Joshi , Lenni Kuff , Dileep Kumar , Alex Leblang , Nong Li , Ippokratis Pandis , Henry Robinson , David Rorke , Silvius Rus , John Russell , Dimitris Tsirogiannis , Skye Wanderman-Milne , and Michael Yoder . 2015 . Impala: A Modern , Open-Source SQL Engine for Hadoop. In Conference on Innovative Data Systems Research (CIDR). Marcel Kornacker, Alexander Behm, Victor Bittorf, Taras Bobrovytsky, Casey Ching, Alan Choi, Justin Erickson, Martin Grund, Daniel Hecht, Matthew Jacobs, Ishaan Joshi, Lenni Kuff, Dileep Kumar, Alex Leblang, Nong Li, Ippokratis Pandis, Henry Robinson, David Rorke, Silvius Rus, John Russell, Dimitris Tsirogiannis, Skye Wanderman-Milne, and Michael Yoder. 2015. Impala: A Modern, Open-Source SQL Engine for Hadoop. In Conference on Innovative Data Systems Research (CIDR)."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.5555\/977395.977673"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/2742854.2747283"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/BigData.2016.7840611"},{"key":"e_1_3_2_1_29_1","volume-title":"2007 IEEE 23rd International Conference on Data Engineering","author":"Madden Samuel R.","year":"2007","unstructured":"Samuel R. Madden , Daniel S. Myers , David J. DeWitt , and Daniel J. Abadi . 2007. Materialization Strategies in a Column-Oriented DBMS . 2007 IEEE 23rd International Conference on Data Engineering ( 2007 ), 466--475. Samuel R. Madden, Daniel S. Myers, David J. DeWitt, and Daniel J. Abadi. 2007. Materialization Strategies in a Column-Oriented DBMS. 2007 IEEE 23rd International Conference on Data Engineering (2007), 466--475."},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/2814270.2814313"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.5555\/2946645.2946679"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.14778\/2002938.2002940"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.5555\/2789770.2789791"},{"key":"e_1_3_2_1_34_1","volume-title":"Proceedings of the Conference on Innovative Data Systems Research (CIDR).","author":"Palkar Shoumik","year":"2017","unstructured":"Shoumik Palkar , James J. Thomas , Anil Shanbhag , Deepak Narayanan , Holger Pirk , Malte Schwarzkopf , Saman Amarasinghe , and Matei Zaharia . 2017 . Weld: A Common Runtime for High Performance Data Analytics . In Proceedings of the Conference on Innovative Data Systems Research (CIDR). Shoumik Palkar, James J. Thomas, Anil Shanbhag, Deepak Narayanan, Holger Pirk, Malte Schwarzkopf, Saman Amarasinghe, and Matei Zaharia. 2017. Weld: A Common Runtime for High Performance Data Analytics. In Proceedings of the Conference on Innovative Data Systems Research (CIDR)."},{"key":"e_1_3_2_1_35_1","unstructured":"Adam Paszke Sam Gross Soumith Chintala Gregory Chanan Edward Yang Zachary DeVito Zeming Lin Alban Desmaison Luca Antiga and Adam Lerer. 2017. Automatic differentiation in PyTorch. (2017).  Adam Paszke Sam Gross Soumith Chintala Gregory Chanan Edward Yang Zachary DeVito Zeming Lin Alban Desmaison Luca Antiga and Adam Lerer. 2017. Automatic differentiation in PyTorch. (2017)."},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.14778\/2536222.2536233"},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2006.40"},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.14778\/2831360.2831365"},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.14778\/2367502.2367513"},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-39038-8_3"},{"key":"e_1_3_2_1_41_1","volume-title":"The 2016 International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA'16)","author":"Taneja Rohit","year":"2016","unstructured":"Rohit Taneja , Raj Krishnamurhty , and Gang Liu . 2016 . The 2016 International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA'16) . Rohit Taneja, Raj Krishnamurhty, and Gang Liu. 2016. The 2016 International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA'16)."},{"key":"e_1_3_2_1_42_1","unstructured":"The Transaction Processing Council. 2017. TPC-H Standard Specification Revision 2.17.3. (2017).  The Transaction Processing Council. 2017. TPC-H Standard Specification Revision 2.17.3. (2017)."},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/2882903.2903740"},{"key":"e_1_3_2_1_44_1","first-page":"31","article-title":"Runtime Code Generation in Cloudera Impala","volume":"37","author":"Wanderman-Milne Skye","year":"2014","unstructured":"Skye Wanderman-Milne and Nong Li . 2014 . Runtime Code Generation in Cloudera Impala . IEEE Data Engineering Bulletin , Vol. 37 , 1 (2014), 31 -- 37 . Skye Wanderman-Milne and Nong Li. 2014. Runtime Code Generation in Cloudera Impala. IEEE Data Engineering Bulletin, Vol. 37, 1 (2014), 31--37.","journal-title":"IEEE Data Engineering Bulletin"},{"key":"e_1_3_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2017.111"},{"key":"e_1_3_2_1_46_1","volume-title":"Project Tungsten: Bringing Apache Spark Closer to Bare Metal.","author":"Xin Reynold","year":"2015","unstructured":"Reynold Xin and Josh Rosen . 2015 . Project Tungsten: Bringing Apache Spark Closer to Bare Metal. (2015). https:\/\/databricks.com\/blog\/2015\/04\/28\/project-tungsten-bringing-spark-closer-to-bare-metal.html Reynold Xin and Josh Rosen. 2015. Project Tungsten: Bringing Apache Spark Closer to Bare Metal. (2015). https:\/\/databricks.com\/blog\/2015\/04\/28\/project-tungsten-bringing-spark-closer-to-bare-metal.html"},{"key":"e_1_3_2_1_47_1","volume-title":"Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation (NSDI'12)","author":"Zaharia Matei","year":"2012","unstructured":"Matei Zaharia , Mosharaf Chowdhury , Tathagata Das , Ankur Dave , Justin Ma , Murphy McCauley , Michael J. Franklin , Scott Shenker , and Ion Stoica . 2012 . Resilient Distributed Datasets: A Fault-tolerant Abstraction for In-memory Cluster Computing . In Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation (NSDI'12) . 1. Matei Zaharia, Mosharaf Chowdhury, Tathagata Das, Ankur Dave, Justin Ma, Murphy McCauley, Michael J. Franklin, Scott Shenker, and Ion Stoica. 2012. Resilient Distributed Datasets: A Fault-tolerant Abstraction for In-memory Cluster Computing. In Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation (NSDI'12). 1."},{"key":"e_1_3_2_1_48_1","volume-title":"Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing (HotCloud'10)","author":"Zaharia Matei","year":"2010","unstructured":"Matei Zaharia , Mosharaf Chowdhury , Michael J. Franklin , Scott Shenker , and Ion Stoica . 2010 . Spark: Cluster Computing with Working Sets . In Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing (HotCloud'10) . 1. Matei Zaharia, Mosharaf Chowdhury, Michael J. Franklin, Scott Shenker, and Ion Stoica. 2010. Spark: Cluster Computing with Working Sets. In Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing (HotCloud'10). 1."}],"event":{"name":"ICPE '19: Tenth ACM\/SPEC International Conference on Performance Engineering","sponsor":["SIGMETRICS ACM Special Interest Group on Measurement and Evaluation","SIGSOFT ACM Special Interest Group on Software Engineering"],"location":"Mumbai India","acronym":"ICPE '19"},"container-title":["Proceedings of the 2019 ACM\/SPEC International Conference on Performance Engineering"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3297663.3310300","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3297663.3310300","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T23:54:10Z","timestamp":1750204450000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3297663.3310300"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,4,4]]},"references-count":48,"alternative-id":["10.1145\/3297663.3310300","10.1145\/3297663"],"URL":"https:\/\/doi.org\/10.1145\/3297663.3310300","relation":{},"subject":[],"published":{"date-parts":[[2019,4,4]]},"assertion":[{"value":"2019-04-04","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}