{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,3,26]],"date-time":"2025-03-26T04:48:15Z","timestamp":1742964495022,"version":"3.40.3"},"publisher-location":"Cham","reference-count":20,"publisher":"Springer International Publishing","isbn-type":[{"type":"print","value":"9783030902865"},{"type":"electronic","value":"9783030902872"}],"license":[{"start":{"date-parts":[[2022,1,1]],"date-time":"2022-01-01T00:00:00Z","timestamp":1640995200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.springer.com\/tdm"},{"start":{"date-parts":[[2022,1,1]],"date-time":"2022-01-01T00:00:00Z","timestamp":1640995200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022]]},"DOI":"10.1007\/978-3-030-90287-2_6","type":"book-chapter","created":{"date-parts":[[2022,3,14]],"date-time":"2022-03-14T16:05:35Z","timestamp":1647273935000},"page":"107-125","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["A Data Mining Approach to\u00a0Guide the\u00a0Physical Design of\u00a0Distributed Big Data Warehouses"],"prefix":"10.1007","author":[{"given":"Yassine","family":"Ramdane","sequence":"first","affiliation":[]},{"given":"Nadia","family":"Kabachi","sequence":"additional","affiliation":[]},{"given":"Omar","family":"Boussaid","sequence":"additional","affiliation":[]},{"given":"Fadila","family":"Bentayeb","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2022,3,15]]},"reference":[{"issue":"1","key":"6_CR1","doi-asserted-by":"publisher","first-page":"922","DOI":"10.14778\/1687627.1687731","volume":"2","author":"A Abouzeid","year":"2009","unstructured":"Abouzeid, A., Bajda-Pawlikowski, K., Abadi, D., Silberschatz, A., & Rasin, A. (2009). Hadoopdb: An architectural hybrid of mapreduce and dbms technologies for analytical workloads. Proceedings of the VLDB Endowment,\u00a02(1), 922\u2013933.","journal-title":"Proceedings of the VLDB Endowment"},{"issue":"9","key":"6_CR2","doi-asserted-by":"publisher","first-page":"1282","DOI":"10.1109\/TKDE.2011.47","volume":"23","author":"FN Afrati","year":"2011","unstructured":"Afrati, F. N., & Ullman, J. D. (2011). Optimizing multiway joins in a map-reduce environment. IEEE Transactions on Knowledge and Data Engineering,\u00a023(9), 1282\u20131298.","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"key":"6_CR3","doi-asserted-by":"crossref","unstructured":"Arres, B., Kabachi, N., & Boussaid, O. (2015). Optimizing olap cubes construction by improving data placement on multi-nodes clusters. In 2015 23rd Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP) (pp. 520\u2013524). IEEE.","DOI":"10.1109\/PDP.2015.45"},{"key":"6_CR4","first-page":"111","volume":"6","author":"H Azez","year":"2015","unstructured":"Azez, H., Khafagy, M. H., & Omara, F. A. (2015). Joum: An indexing methodology for improving join in hive star schema. International Journal of Scientific and Engineering Research,\u00a06, 111\u2013119.","journal-title":"International Journal of Scientific and Engineering Research"},{"key":"6_CR5","doi-asserted-by":"crossref","unstructured":"Blanas, S., Patel, J.\u00a0M., Ercegovac, V., Rao, J., Shekita, E.\u00a0J., & Tian, Y. (2010). A comparison of join algorithms for log processing in mapreduce. In Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data (pp. 975\u2013986). ACM.","DOI":"10.1145\/1807167.1807273"},{"key":"6_CR6","doi-asserted-by":"publisher","first-page":"74","DOI":"10.1016\/j.procs.2016.05.299","volume":"80","author":"JJ Brito","year":"2016","unstructured":"Brito, J. J., Mosqueiro, T., Ciferri, R. R., & de Aguiar Ciferri, C. D. (2016). Faster cloud star joins with reduced disk spill and network communication. Procedia Computer Science,\u00a080, 74\u201385.","journal-title":"Procedia Computer Science"},{"issue":"1\u20132","key":"6_CR7","doi-asserted-by":"publisher","first-page":"515","DOI":"10.14778\/1920841.1920908","volume":"3","author":"J Dittrich","year":"2010","unstructured":"Dittrich, J., Quian\u00e9-Ruiz, J.-A., Jindal, A., Kargin, Y., Setty, V., & Schad, J. (2010). Hadoop++: Making a yellow elephant run like a cheetah. Proceedings of the VLDB Endowment,\u00a03(1\u20132), 515\u2013529.","journal-title":"Proceedings of the VLDB Endowment"},{"issue":"9","key":"6_CR8","doi-asserted-by":"publisher","first-page":"575","DOI":"10.14778\/2002938.2002943","volume":"4","author":"MY Eltabakh","year":"2011","unstructured":"Eltabakh, M. Y., Tian, Y., \u00d6zcan, F., Gemulla, R., Krettek, A., & McPherson, J. (2011). Cohadoop: Flexible data placement and its exploitation in hadoop. Proceedings of the VLDB Endowment,\u00a04(9), 575\u2013585.","journal-title":"Proceedings of the VLDB Endowment"},{"key":"6_CR9","unstructured":"Field, A. (2013). Discovering Statistics Using IBM SPSS Statistics. California: Sage."},{"key":"6_CR10","doi-asserted-by":"crossref","unstructured":"Golfarelli, M., & Baldacci, L. (2018). A cost model for spark sql. IEEE Transactions on Knowledge and Data Engineering.","DOI":"10.1109\/TKDE.2018.2850339"},{"key":"6_CR11","unstructured":"Gravetter, F. J., & Wallnau, L. B. (2016). Statistics for the Behavioral Sciences. Cengage Learning."},{"issue":"5","key":"6_CR12","doi-asserted-by":"publisher","first-page":"589","DOI":"10.14778\/3055540.3055551","volume":"10","author":"Y Lu","year":"2017","unstructured":"Lu, Y., Shanbhag, A., Jindal, A., & Madden, S. (2017). Adaptdb: Adaptive partitioning for distributed joins. Proceedings of the VLDB Endowment,\u00a010(5), 589\u2013600.","journal-title":"Proceedings of the VLDB Endowment"},{"key":"6_CR13","doi-asserted-by":"crossref","unstructured":"Malinen, M.\u00a0I., & Fr\u00e4nti, P. (2014). Balanced k-means for clustering. In Joint IAPR International Workshops on Statistical Techniques in Pattern Recognition (SPR) and Structural and Syntactic Pattern Recognition (SSPR) (pp. 32\u201341). Springer.","DOI":"10.1007\/978-3-662-44415-3_4"},{"key":"6_CR14","doi-asserted-by":"crossref","unstructured":"Petridis, P., Gounaris, A., & Torres, J. (2016). Spark parameter tuning via trial-and-error. In INNS Conference on Big Data (pp. 226\u2013237). Springer.","DOI":"10.1007\/978-3-319-47898-2_24"},{"issue":"3","key":"6_CR15","doi-asserted-by":"publisher","first-page":"319","DOI":"10.1002\/spe.2308","volume":"46","author":"V Purdil\u0103","year":"2016","unstructured":"Purdil\u0103, V., & Pentiuc, \u015e-G. (2016). Single-scan: A fast star-join query processing algorithm. Practice and Experience,\u00a046(3), 319\u2013339.","journal-title":"Practice and Experience"},{"key":"6_CR16","doi-asserted-by":"crossref","unstructured":"Ramdane, Y., Boussaid, O., Kabachi, N., & Bentayeb, F. (2018). Partitioning and bucketing techniques to speed up query processing in spark-sql. In 2018 IEEE 24th International Conference on Parallel and Distributed Systems (ICPADS) (pp. 142\u2013151). IEEE.","DOI":"10.1109\/PADSW.2018.8644891"},{"key":"6_CR17","unstructured":"Ramdane, Y., Omar, B., Nadia, K., & Fadila, B. (2019). Conception physique d\u2019un entrep\u00f4t de donn\u00e9es distribu\u00e9es bas\u00e9e sur k-means \u00e9quilibr\u00e9. In EGC (pp. 177\u2013188)."},{"key":"6_CR18","doi-asserted-by":"crossref","unstructured":"Sun, L., Franklin, M.\u00a0J., Krishnan, S., & Xin, R.\u00a0S. (2014). Fine-grained partitioning for aggressive data skipping. In Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data (pp. 1115\u20131126). ACM.","DOI":"10.1145\/2588555.2610515"},{"key":"6_CR19","doi-asserted-by":"publisher","first-page":"287","DOI":"10.1016\/j.future.2016.06.027","volume":"78","author":"Z Tang","year":"2018","unstructured":"Tang, Z., Zhang, X., Li, K., & Li, K. (2018). An intermediate data placement algorithm for load balancing in spark computing environment. Future Generation Computer Systems,\u00a078, 287\u2013301.","journal-title":"Future Generation Computer Systems"},{"key":"6_CR20","doi-asserted-by":"crossref","unstructured":"Zamanian, E., Binnig, C., & Salama, A. (2015). Locality-aware partitioning in parallel database systems. In Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data (pp. 17\u201330). ACM.","DOI":"10.1145\/2723372.2723718"}],"container-title":["Studies in Computational Intelligence","Advances in Knowledge Discovery and Management"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/978-3-030-90287-2_6","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,3,14]],"date-time":"2022-03-14T16:12:06Z","timestamp":1647274326000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/978-3-030-90287-2_6"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022]]},"ISBN":["9783030902865","9783030902872"],"references-count":20,"URL":"https:\/\/doi.org\/10.1007\/978-3-030-90287-2_6","relation":{},"ISSN":["1860-949X","1860-9503"],"issn-type":[{"type":"print","value":"1860-949X"},{"type":"electronic","value":"1860-9503"}],"subject":[],"published":{"date-parts":[[2022]]},"assertion":[{"value":"15 March 2022","order":1,"name":"first_online","label":"First Online","group":{"name":"ChapterHistory","label":"Chapter History"}}]}}