{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,21]],"date-time":"2025-02-21T03:41:57Z","timestamp":1740109317915,"version":"3.37.3"},"reference-count":29,"publisher":"Springer Science and Business Media LLC","license":[{"start":{"date-parts":[[2021,2,5]],"date-time":"2021-02-05T00:00:00Z","timestamp":1612483200000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"},{"start":{"date-parts":[[2021,2,5]],"date-time":"2021-02-05T00:00:00Z","timestamp":1612483200000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"funder":[{"name":"Key Research Program of Zhejiang Province","award":["2021C01109"],"award-info":[{"award-number":["2021C01109"]}]},{"DOI":"10.13039\/501100004731","name":"Zhejiang Provincial Natural Science Foundation","doi-asserted-by":"crossref","award":["LZ21F020007"],"award-info":[{"award-number":["LZ21F020007"]}],"id":[{"id":"10.13039\/501100004731","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Grid State Foundation","award":["5211XT190033"],"award-info":[{"award-number":["5211XT190033"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Knowl Inf Syst"],"DOI":"10.1007\/s10115-020-01542-4","type":"journal-article","created":{"date-parts":[[2021,2,8]],"date-time":"2021-02-08T15:11:31Z","timestamp":1612797091000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["AQUA+: Query Optimization for Hybrid Database-MapReduce 
System"],"prefix":"10.1007","author":[{"given":"Zhifei","family":"Pang","sequence":"first","affiliation":[]},{"given":"Sai","family":"Wu","sequence":"additional","affiliation":[]},{"given":"Haichao","family":"Huang","sequence":"additional","affiliation":[]},{"given":"Zhouzhenyan","family":"Hong","sequence":"additional","affiliation":[]},{"given":"Yuqing","family":"Xie","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2021,2,5]]},"reference":[{"key":"1542_CR1","doi-asserted-by":"crossref","unstructured":"Abouzied A, Abadi DJ, Bajda-Pawlikowski K, Silberschatz A (2019) \u2018Integration of large-scale data processing systems and traditional parallel database technology\u2019, Proc. VLDB Endow. f12(12),\u00a02290\u20132299. http:\/\/www.vldb.org\/pvldb\/vol12\/p2290-abouzied.pdf","DOI":"10.14778\/3352063.3352145"},{"key":"1542_CR2","unstructured":"Abouzied A, Bajda-Pawlikowski K, Huang J, Abadi DJ, Silberschatz A (2010) Hadoopdb in action: building real world applications, in \u2018SIGMOD Conference\u2019, pp.\u00a01111\u20131114"},{"key":"1542_CR3","doi-asserted-by":"crossref","unstructured":"Afrati FN, Ullman JD (2010) Optimizing joins in a map-reduce environment, In Proceedings of the 13th International Conference on Extending Database Technology (pp. 99-110)","DOI":"10.1145\/1739041.1739056"},{"issue":"4","key":"1542_CR4","doi-asserted-by":"publisher","first-page":"602","DOI":"10.1145\/319628.319650","volume":"6","author":"PA Bernstein","year":"1981","unstructured":"Bernstein PA, Goodman N, Wong E, Reeve CL, Rothnie JB Jr (1981) Query processing in a system for distributed databases (sdd-1). ACM Trans. Database Syst. 6(4):602\u2013625","journal-title":"ACM Trans. 
Database Syst."},{"key":"1542_CR5","unstructured":"Bittorf M, Bobrovytsky T, Erickson C C A C J, Hecht M GD, Kuff M J I JL, Leblang D KA, Robinson N L I PH, Rus D RS, Wanderman J R D TS, Yoder MM (2015) Impala: A modern, open-source sql engine for hadoop, in Proceedings of the 7th Biennial Conference on Innovative Data Systems Research"},{"key":"1542_CR6","doi-asserted-by":"publisher","unstructured":"Camacho-Rodr\u00edguez J, Chauhan A, Gates A, Koifman E, O\u2019Malley O, Garg V, Haindrich Z, Shelukhin S, Jayachandran P, Seth S, Jaiswal D, Bouguerra S, Bangarwa N, Hariappan S, Agarwal A, Dere J, Dai D, Nair T, Dembla N, Vijayaraghavan G, Hagleitner G (2019) Apache hive: From mapreduce to enterprise-grade big data warehousing, in P.\u00a0A. Boncz, S.\u00a0Manegold, A.\u00a0Ailamaki, A.\u00a0Deshpande and T.\u00a0Kraska, eds, \u2018Proceedings of the 2019 International Conference on Management of Data, SIGMOD Conference 2019, Amsterdam, The Netherlands, June 30 - July 5, 2019\u2019, ACM, pp.\u00a01773\u20131786. https:\/\/doi.org\/10.1145\/3299869.3314045","DOI":"10.1145\/3299869.3314045"},{"key":"1542_CR7","unstructured":"Chaudhuri S (1998) An overview of query optimization in relational systems, in PODS, pp.\u00a034\u201343"},{"issue":"3","key":"1542_CR8","doi-asserted-by":"publisher","first-page":"416","DOI":"10.1109\/69.506709","volume":"8","author":"M-S Chen","year":"1996","unstructured":"Chen M-S, Yu PS, Wu K-L (1996) Optimization of parallel execution for multi-join queries. IEEE Trans on Knowl and Data Eng 8(3):416\u2013428","journal-title":"IEEE Trans on Knowl and Data Eng"},{"issue":"1\u20132","key":"1542_CR9","doi-asserted-by":"publisher","first-page":"1459","DOI":"10.14778\/1920841.1921020","volume":"3","author":"S Chen","year":"2010","unstructured":"Chen S (2010) Cheetah: a high performance, custom data warehouse on top of mapreduce. Proc. VLDB Endow. 3(1\u20132):1459\u20131468","journal-title":"Proc. 
VLDB Endow."},{"key":"1542_CR10","doi-asserted-by":"publisher","unstructured":"Chen Z, Gehrke J, Korn F (2001) Query optimization in compressed database systems, in Proceedings of the 2001 ACM SIGMOD international conference on Management of data, Santa Barbara, CA, USA, May 21-24, 2001, pp.\u00a0271\u2013282. https:\/\/doi.org\/10.1145\/375663.375692","DOI":"10.1145\/375663.375692"},{"key":"1542_CR11","first-page":"21","volume":"10","author":"T Condie","year":"2010","unstructured":"Condie T, Conway N, Alvaro P, Hellerstein JM, Elmeleegy K, Sears R (2010) Mapreduce online. In NSDI 10:21\u201329","journal-title":"Mapreduce online. In NSDI"},{"key":"1542_CR12","unstructured":"Dean J, Ghemawat S (2004) Mapreduce: simplified data processing on large clusters, in OSDI, 137\u2013150"},{"issue":"2","key":"1542_CR13","doi-asserted-by":"publisher","first-page":"149","DOI":"10.1145\/235968.233328","volume":"25","author":"MJ Franklin","year":"1996","unstructured":"Franklin MJ, J\u00f3nsson BT, Kossmann D (1996) Performance tradeoffs for client-server query processing. SIGMOD Rec. 25(2):149\u2013160","journal-title":"SIGMOD Rec."},{"issue":"2","key":"1542_CR14","first-page":"1402","volume":"2","author":"E Friedman","year":"2009","unstructured":"Friedman E, Pawlowski PM, Cieslewicz J (2009) Sql\/mapreduce: a practical approach to self-describing, polymorphic, and parallelizable user-defined functions. PVLDB 2(2):1402\u20131413","journal-title":"PVLDB"},{"key":"1542_CR15","doi-asserted-by":"crossref","unstructured":"Ganguly S, Hasan W, Krishnamurthy R (1992) Query optimization for parallel execution. SIGMOD Rec. 21(2):","DOI":"10.1145\/141484.130291"},{"issue":"2","key":"1542_CR16","doi-asserted-by":"publisher","first-page":"100","DOI":"10.1089\/big.2013.0011","volume":"1","author":"M Hausenblas","year":"2013","unstructured":"Hausenblas M, Nadeau J (2013) Apache drill: interactive ad-hoc analysis at scale. 
Big Data 1(2):100\u2013104","journal-title":"Big Data"},{"issue":"8","key":"1542_CR17","doi-asserted-by":"publisher","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","volume":"9","author":"S Hochreiter","year":"1997","unstructured":"Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735\u20131780. https:\/\/doi.org\/10.1162\/neco.1997.9.8.1735","journal-title":"Neural Comput"},{"issue":"2","key":"1542_CR18","doi-asserted-by":"publisher","first-page":"111","DOI":"10.1145\/356924.356928","volume":"16","author":"M Jarke","year":"1984","unstructured":"Jarke M, Koch J (1984) Query optimization in database systems. ACM Comput. Surv. 16(2):111\u2013152","journal-title":"ACM Comput. Surv."},{"key":"1542_CR19","unstructured":"Jia Y (2009) Running tpc-h queries on hive, in http:\/\/issues.apache.org\/jira\/browse\/HIVE-600"},{"key":"1542_CR20","doi-asserted-by":"crossref","unstructured":"Lin Y, Agrawal D, Chen C, Ooi BC, Wu S (2011) Llama: leveraging columnar storage for scalable join processing in the mapreduce framework, in SIGMOD Conference, 961\u2013972","DOI":"10.1145\/1989323.1989424"},{"key":"1542_CR21","doi-asserted-by":"crossref","unstructured":"Olston C, Reed B, Srivastava U, Kumar R, Tomkins A (2008) Pig latin: a not-so-foreign language for data processing, in SIGMOD Conference, 1099\u20131110","DOI":"10.1145\/1376616.1376726"},{"key":"1542_CR22","volume-title":"Principles of distributed database systems","author":"MT Ozsu","year":"2007","unstructured":"Ozsu MT (2007) Principles of distributed database systems, 3rd edn. 
Prentice Hall Press, NJ, USA","edition":"3"},{"key":"1542_CR23","doi-asserted-by":"crossref","unstructured":"Poosala V, Haas PJ, Ioannidis YE, Shekita EJ (1996) Improved histograms for selectivity estimation of range predicates, in SIGMOD Conference, 294\u2013305","DOI":"10.1145\/235968.233342"},{"key":"1542_CR24","doi-asserted-by":"crossref","unstructured":"Stewart RJ, Trinder PW, Loidl H-W (2011) Comparing high level mapreduce query languages, in APPT, 58\u201372","DOI":"10.1007\/978-3-642-24151-2_5"},{"key":"1542_CR25","doi-asserted-by":"crossref","unstructured":"Tai KS, Socher R, Manning CD (2015) Improved semantic representations from tree-structured long short-term memory networks, in Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, ACL 2015, July 26-31, 2015, Beijing, China, Volume 1: Long Papers, 1556\u20131566. https:\/\/www.aclweb.org\/anthology\/P15-1150\/","DOI":"10.3115\/v1\/P15-1150"},{"issue":"2","key":"1542_CR26","first-page":"1626","volume":"2","author":"A Thusoo","year":"2009","unstructured":"Thusoo A, Sarma JS, Jain N, Shao Z, Chakka P, Anthony S, Liu H, Wyckoff P, Murthy R (2009) Hive - a warehousing solution over a map-reduce framework. PVLDB 2(2):1626\u20131629","journal-title":"PVLDB"},{"key":"1542_CR27","unstructured":"Thusoo A, Sarma JS, Jain N, Shao Z, Chakka P, Zhang N, Anthony S, Liu H, Murthy R (2010) Hive - a petabyte scale data warehouse using hadoop, In ICDE, 996\u20131005"},{"key":"1542_CR28","unstructured":"Traverso M (2013) Presto: Interacting with petabytes of data at facebook. 
Retrieved February 4, 2014"},{"key":"1542_CR29","doi-asserted-by":"crossref","unstructured":"Wu S, Li F, Mehrotra S, Ooi BC (2011) Query optimization for massively parallel data processing, In SoCC , 12","DOI":"10.1145\/2038916.2038928"}],"container-title":["Knowledge and Information Systems"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10115-020-01542-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s10115-020-01542-4\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10115-020-01542-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,4,1]],"date-time":"2021-04-01T13:53:47Z","timestamp":1617285227000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s10115-020-01542-4"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,2,5]]},"references-count":29,"alternative-id":["1542"],"URL":"https:\/\/doi.org\/10.1007\/s10115-020-01542-4","relation":{},"ISSN":["0219-1377","0219-3116"],"issn-type":[{"type":"print","value":"0219-1377"},{"type":"electronic","value":"0219-3116"}],"subject":[],"published":{"date-parts":[[2021,2,5]]},"assertion":[{"value":"25 January 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"25 December 2020","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"30 December 2020","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 February 2021","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article 
History"}}]}}