{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,20]],"date-time":"2026-02-20T19:39:27Z","timestamp":1771616367905,"version":"3.50.1"},"reference-count":80,"publisher":"Springer Science and Business Media LLC","issue":"9","license":[{"start":{"date-parts":[[2022,8,12]],"date-time":"2022-08-12T00:00:00Z","timestamp":1660262400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,8,12]],"date-time":"2022-08-12T00:00:00Z","timestamp":1660262400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100002347","name":"Bundesministerium f\u00fcr Bildung und Forschung","doi-asserted-by":"publisher","award":["01 IS 18047"],"award-info":[{"award-number":["01 IS 18047"]}],"id":[{"id":"10.13039\/501100002347","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100015068","name":"Universidade de Santiago de Compostela","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100015068","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Appl Intell"],"published-print":{"date-parts":[[2023,5]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>This paper provides an in-depth survey on the integration of machine learning and array databases. First,machine learning support in modern database management systems is introduced. From straightforward implementations of linear algebra operations in SQL to machine learning capabilities of specialized database managers designed to process specific types of data, a number of different approaches are overviewed. Then, the paper covers the database features already implemented in current machine learning systems. Features such as rewriting, compression, and caching allow users to implement more efficient machine learning applications. The underlying linear algebra computations in some of the most used machine learning algorithms are studied in order to determine which linear algebra operations should be efficiently implemented by array databases. An exhaustive overview of array data and relevant array database managers is also provided. Those database features that have been proven of special importance for efficient execution of machine learning algorithms are analyzed in detail for each relevant array database management system. Finally, current state of array databases capabilities for machine learning implementation is shown through two example implementations in Rasdaman and SciDB.<\/jats:p>","DOI":"10.1007\/s10489-022-03979-2","type":"journal-article","created":{"date-parts":[[2022,8,12]],"date-time":"2022-08-12T05:03:09Z","timestamp":1660280589000},"page":"9799-9822","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":7,"title":["A survey on machine learning in array databases"],"prefix":"10.1007","volume":"53","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-0555-8735","authenticated-orcid":false,"given":"Sebasti\u00e1n","family":"Villarroya","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Peter","family":"Baumann","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2022,8,12]]},"reference":[{"key":"3979_CR1","doi-asserted-by":"publisher","unstructured":"Kim M, Candan KS (2014) TensorDB: In-database tensor manipulation with tensor-relational query plans. In: Proceedings of the 23rd ACM International conference on conference on information and knowledge management. CIKM \u201914, pp 2039\u20132041. ACM. https:\/\/doi.org\/10.1145\/2661829.2661842","DOI":"10.1145\/2661829.2661842"},{"issue":"2","key":"3979_CR2","doi-asserted-by":"publisher","first-page":"1481","DOI":"10.14778\/1687553.1687576","volume":"2","author":"J Cohen","year":"2009","unstructured":"Cohen J, Dolan B, Dunlap M, Hellerstein JM, Welton C (2009) MAD skills: New analysis practices for big data. Proc VLDB Endow 2(2):1481\u20131492. https:\/\/doi.org\/10.14778\/1687553.1687576","journal-title":"Proc VLDB Endow"},{"key":"3979_CR3","doi-asserted-by":"publisher","unstructured":"Feng X, Kumar A, Recht B, R\u00e9 C (2012) Towards a unified architecture for in-RDBMS analytics. In: Proceedings of the 2012 ACM SIGMOD International conference on management of data. SIGMOD \u201912, pp 325\u2013336. ACM. https:\/\/doi.org\/10.1145\/2213836.2213874https:\/\/doi.org\/10.1145\/2213836.2213874","DOI":"10.1145\/2213836.2213874 10.1145\/2213836.2213874"},{"key":"3979_CR4","doi-asserted-by":"crossref","unstructured":"Zhang Y, Zhang W, Yang J (2010) I\/O-efficient statistical computing with RIOT. 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010), pp 1157\u20131160","DOI":"10.1109\/ICDE.2010.5447819"},{"issue":"1","key":"3979_CR5","doi-asserted-by":"publisher","first-page":"24","DOI":"10.1145\/3277006.3277013","volume":"47","author":"S Luo","year":"2018","unstructured":"Luo S, Gao ZJ, Gubanov M, Perez LL, Jermaine C (2018) Scalable linear algebra on a relational database system. SIGMOD Rec 47(1):24\u201331. https:\/\/doi.org\/10.1145\/3277006.3277013","journal-title":"SIGMOD Rec"},{"issue":"12","key":"3979_CR6","doi-asserted-by":"publisher","first-page":"1700","DOI":"10.14778\/2367502.2367510","volume":"5","author":"JM Hellerstein","year":"2012","unstructured":"Hellerstein JM, R\u00e9 C, Schoppmann F, Wang DZ, Fratkin E, Gorajek A, Ng KS, Welton C, Feng X, Li K, Kumar A (2012) The MADlib analytics library: Or MAD skills, the SQL. Proc VLDB Endow 5(12):1700\u20131711. https:\/\/doi.org\/10.14778\/2367502.2367510https:\/\/doi.org\/10.14778\/2367502.2367510","journal-title":"Proc VLDB Endow"},{"key":"3979_CR7","doi-asserted-by":"publisher","unstructured":"Cheng Y, Qin C, Rusu F (2012) GLADE: Big data analytics made easy. In: Proceedings of the 2012 ACM SIGMOD International conference on management of data. SIGMOD \u201912, pp 697\u2013700. ACM,. https:\/\/doi.org\/10.1145\/2213836.2213936","DOI":"10.1145\/2213836.2213936"},{"issue":"1","key":"3979_CR8","doi-asserted-by":"publisher","first-page":"1400","DOI":"10.14778\/3236187.3236194","volume":"11","author":"JV D\u2019silva","year":"2018","unstructured":"D\u2019silva JV, De Moor F, Kemme B (2018) AIDA: Abstraction for advanced in-database analytics. Proc VLDB Endow 11(1):1400\u20131413. https:\/\/doi.org\/10.14778\/3236187.3236194","journal-title":"Proc VLDB Endow"},{"key":"3979_CR9","doi-asserted-by":"publisher","unstructured":"Deshpande A, Madden S (2006) MauveDB: Supporting model-based user views in database systems. In: Proceedings of the 2006 ACM SIGMOD International conference on management of data. SIGMOD \u201906, pp 73\u201384. ACM. https:\/\/doi.org\/10.1145\/1142473.1142483https:\/\/doi.org\/10.1145\/1142473.1142483","DOI":"10.1145\/1142473.1142483 10.1145\/1142473.1142483"},{"key":"3979_CR10","unstructured":"Schelter S, Palumbo A, Quinn S, Marthi S, Musselman A (2016) Samsara: Declarative machine learning on distributed dataflow systems. In: NIPS MLSYs workshop, pp 1\u20138"},{"key":"3979_CR11","unstructured":"Sujeeth AK, Lee H, Brown KJ, Chafi H, Wu M, Atreya AR, Olukotun K, Rompf T, Odersky M (2011) OptiML: An implicitly parallel domain-specific language for machine learning. In: Proceedings of the 28th International conference on international conference on machine learning. ICML\u201911, pp 609\u2013616. Omnipress. http:\/\/dl.acm.org\/citation.cfm?id=3104482.3104559. Accessed 12 Oct 2019"},{"key":"3979_CR12","unstructured":"Abadi M, Barham P, Chen J, Chen Z, Davis A, Dean J, Devin M, Ghemawat S, Irving G, Isard M, Kudlur M, Levenberg J, Monga R, Moore S, Murray DG, Steiner B, Tucker P, Vasudevan V, Warden P, Wicke M, Yu Y, Zheng X (2016) Tensorflow: A system for large-scale machine learning. In: Proceedings of the 12th USENIX Conference on operating systems design and implementation. OSDI\u201916, pp 265\u2013283. USENIX Association. http:\/\/dl.acm.org\/citation.cfm?id=3026877.3026899. Accessed 12 Oct 2019"},{"issue":"13","key":"3979_CR13","doi-asserted-by":"publisher","first-page":"1425","DOI":"10.14778\/3007263.3007279","volume":"9","author":"M Boehm","year":"2016","unstructured":"Boehm M, Dusenberry MW, Eriksson D, Evfimievski AV, Manshadi FM, Pansare N, Reinwald B, Reiss FR, Sen P, Surve AC, Tatikonda S (2016) SystemML: Declarative machine learning on Spark. Proc VLDB Endow 9(13):1425\u20131436. https:\/\/doi.org\/10.14778\/3007263.3007279","journal-title":"Proc VLDB Endow"},{"key":"3979_CR14","doi-asserted-by":"publisher","unstructured":"Park Y, Qing J, Shen X, Mozafari B (2019) BlinkML: Efficient maximum likelihood estimation with probabilistic guarantees. In: Proceedings of the 2019 International conference on management of data. SIGMOD \u201919, pp 1135\u20131152. ACM. https:\/\/doi.org\/10.1145\/3299869.3300077","DOI":"10.1145\/3299869.3300077"},{"key":"3979_CR15","doi-asserted-by":"publisher","unstructured":"Yu Y, Tang M, Aref WG, Malluhi QM, Abbas MM, Ouzzani M (2017) In-memory distributed matrix computation processing and optimization. In: 2017 IEEE 33rd International conference on data engineering (ICDE), pp 1047\u20131058. https:\/\/doi.org\/10.1109\/ICDE.2017.150","DOI":"10.1109\/ICDE.2017.150"},{"key":"3979_CR16","doi-asserted-by":"publisher","unstructured":"Bosagh Zadeh R, Meng X, Ulanov A, Yavuz B, Pu L, Venkataraman S, Sparks E, Staple A, Zaharia M (2016) Matrix computations and optimization in Apache Spark. In: Proceedings of the 22Nd ACM SIGKDD International conference on knowledge discovery and data mining. KDD \u201916, pp 31\u201338. ACM. https:\/\/doi.org\/10.1145\/2939672.2939675","DOI":"10.1145\/2939672.2939675"},{"key":"3979_CR17","doi-asserted-by":"publisher","unstructured":"Villarroya S, Baumann P (2020) On the integration of machine learning and array databases. In: 2020 IEEE 36th International conference on data engineering (ICDE), pp 1786\u20131789. IEEE Computer Society. https:\/\/doi.org\/10.1109\/ICDE48307.2020.00170","DOI":"10.1109\/ICDE48307.2020.00170"},{"key":"3979_CR18","doi-asserted-by":"crossref","unstructured":"Rodriges Zalipynis RA (2021) Towards machine learning in distributed array DBMS : Networking considerations. In: Renault, e.\u0301, Boumerdassi, S, M\u00fchlethaler, P. (eds.) Machine Learning for Networking, pp 284\u2013304","DOI":"10.1007\/978-3-030-70866-5_19"},{"key":"3979_CR19","doi-asserted-by":"publisher","first-page":"329","DOI":"10.1007\/s10619-018-7229-1","volume":"37","author":"C Ordo\u00f1ez","year":"2019","unstructured":"Ordo\u00f1ez C, Zhang Y, Johnsson SL (2019) Scalable machine learning computing a data summarization matrix with a parallel array DBMS. Distributed and Parallel Databases 37:329\u2013350. https:\/\/doi.org\/10.1007\/s10619-018-7229-1","journal-title":"Distributed and Parallel Databases"},{"issue":"1","key":"3979_CR20","first-page":"149","volume":"12","author":"J Baxter","year":"2000","unstructured":"Baxter J (2000) A model of inductive bias learning. J Artif Int Res 12(1):149\u2013198","journal-title":"J Artif Int Res"},{"key":"3979_CR21","doi-asserted-by":"crossref","unstructured":"Caruana R (1993) Multitask learning: a knowledge-based source of inductive bias. In: Proceedings of the 10th International conference on international conference on machine learning. ICML\u201993, pp 41\u201348. Morgan Kaufmann Publishers Inc. http:\/\/dl.acm.org\/citation.cfm?id=3091529.3091535. Accessed 12 Oct 2019","DOI":"10.1016\/B978-1-55860-307-3.50012-5"},{"key":"3979_CR22","doi-asserted-by":"publisher","unstructured":"Faghmous JH, Le M, Uluyol M, Kumar V, Chatterjee S (2013) A parameter-free spatio-temporal pattern mining model to catalog global ocean dynamics. In: 2013 IEEE 13th International conference on data mining, pp 151\u2013160. https:\/\/doi.org\/10.1109\/ICDM.2013.162","DOI":"10.1109\/ICDM.2013.162"},{"key":"3979_CR23","unstructured":"Liu Y, Bahadori MT, Li H (2012) Sparse-GEV: Sparse latent space model for multivariate extreme value time series modeling. In: Proceedings of the 29th international coference on international conference on machine learning. ICML\u201912, pp 1195\u20131202. Omnipress. http:\/\/dl.acm.org\/citation.cfm?id=3042573.3042727. Accessed 12 Oct 2019"},{"issue":"7","key":"3979_CR24","doi-asserted-by":"publisher","first-page":"434","DOI":"10.1097\/RLI.0000000000000358","volume":"52","author":"AS Becker","year":"2017","unstructured":"Becker AS, Marcon M, Ghafoor S, Wurnig MC, Frauenfelder T, Boss A (2017) Deep learning in mammography: Diagnostic accuracy of a multipurpose image analysis software in the detection of breast cancer. Invest Radiol 52(7):434\u2013440","journal-title":"Invest Radiol"},{"key":"3979_CR25","doi-asserted-by":"publisher","unstructured":"Liu S, Liu S, Cai W, Pujol S, Kikinis R, Feng D (2014) Early diagnosis of Alzheimer\u2019s disease with deep learning. In: 2014 IEEE 11th International symposium on biomedical imaging (ISBI), pp 1015\u20131018. https:\/\/doi.org\/10.1109\/ISBI.2014.6868045","DOI":"10.1109\/ISBI.2014.6868045"},{"issue":"4","key":"3979_CR26","doi-asserted-by":"publisher","first-page":"427","DOI":"10.1007\/s10278-017-9955-8","volume":"30","author":"H Lee","year":"2017","unstructured":"Lee H, Tajmir S, Lee J, Zissen M, Yeshiwas BA, Alkasab TK, Choy G, Do S (2017) Fully automated deep learning system for bone age assessment. J Digital Imaging 30(4):427\u2013441. https:\/\/doi.org\/10.1007\/s10278-017-9955-8","journal-title":"J Digital Imaging"},{"issue":"5","key":"3979_CR27","doi-asserted-by":"publisher","first-page":"1322","DOI":"10.1109\/TMI.2016.2532122","volume":"35","author":"M Kallenberg","year":"2016","unstructured":"Kallenberg M, Petersen K, Nielsen M, Ng AY, Diao P, Igel C, Vachon CM, Holland K, Winkel RR, Karssemeijer N, Lillholm M (2016) Unsupervised deep learning applied to breast density segmentation and mammographic risk scoring. IEEE Trans Med Imaging 35(5):1322\u20131331. https:\/\/doi.org\/10.1109\/TMI.2016.2532122","journal-title":"IEEE Trans Med Imaging"},{"issue":"1","key":"3979_CR28","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/978-3-031-01869-5","volume":"14","author":"M Boehm","year":"2019","unstructured":"Boehm M, Kumar A, Yang J (2019) Data management in machine learning systems. Synthesis Lectures on Data Management 14 (1):1\u2013173. https:\/\/doi.org\/10.2200\/S00895ED1V01Y201901DTM057https:\/\/doi.org\/10.2200\/S00895ED1V01Y201901DTM057","journal-title":"Synthesis Lectures on Data Management"},{"issue":"7","key":"3979_CR29","doi-asserted-by":"publisher","first-page":"822","DOI":"10.14778\/3317315.3317323","volume":"12","author":"D Jankov","year":"2019","unstructured":"Jankov D, Luo S, Yuan B, Cai Z, Zou J, Jermaine C, Gao ZJ (2019) Declarative recursive computation on an RDBMS: Or, why you should use a database for distributed machine learning. Proc VLDB Endow 12(7):822\u2013835. https:\/\/doi.org\/10.14778\/3317315.3317323","journal-title":"Proc VLDB Endow"},{"key":"3979_CR30","doi-asserted-by":"publisher","unstructured":"Kumar A, Naughton J, Patel JM (2015) Learning generalized linear models over normalized data. In: Proceedings of the 2015 ACM SIGMOD International conference on management of data. SIGMOD \u201915, pp 1969\u20131984. ACM. https:\/\/doi.org\/10.1145\/2723372.2723713","DOI":"10.1145\/2723372.2723713"},{"key":"3979_CR31","doi-asserted-by":"publisher","unstructured":"Schleich M, Olteanu D, Ciucanu R (2016) Learning linear regression models over factorized joins. In: Proceedings of the 2016 International conference on management of data. SIGMOD \u201916, pp 3\u201318. ACM, New York, NY, USA. https:\/\/doi.org\/10.1145\/2882903.2882939","DOI":"10.1145\/2882903.2882939"},{"key":"3979_CR32","doi-asserted-by":"publisher","unstructured":"Nikolic M, Olteanu D (2018) Incremental view maintenance with triple lock factorization benefits. In: Proceedings of the 2018 International conference on management of data. SIGMOD \u201918, pp 365\u2013380. ACM. https:\/\/doi.org\/10.1145\/3183713.3183758","DOI":"10.1145\/3183713.3183758"},{"issue":"5","key":"3979_CR33","doi-asserted-by":"publisher","first-page":"337","DOI":"10.14778\/2535573.2488340","volume":"6","author":"S Rendle","year":"2013","unstructured":"Rendle S (2013) Scaling factorization machines to relational data. Proc VLDB Endow 6(5):337\u2013348. https:\/\/doi.org\/10.14778\/2535573.2488340https:\/\/doi.org\/10.14778\/2535573.2488340","journal-title":"Proc VLDB Endow"},{"issue":"12","key":"3979_CR34","doi-asserted-by":"publisher","first-page":"1864","DOI":"10.14778\/2824032.2824087","volume":"8","author":"A Kumar","year":"2015","unstructured":"Kumar A, Jalal M, Yan B, Naughton J, Patel JM (2015) Demonstration of santoku: Optimizing machine learning over normalized data. Proc VLDB Endow 8(12):1864\u20131867. https:\/\/doi.org\/10.14778\/2824032.2824087","journal-title":"Proc VLDB Endow"},{"issue":"11","key":"3979_CR35","doi-asserted-by":"publisher","first-page":"1214","DOI":"10.14778\/3137628.3137633","volume":"10","author":"L Chen","year":"2017","unstructured":"Chen L, Kumar A, Naughton J, Patel JM (2017) Towards linear algebra over normalized data. Proc VLDB Endow 10(11):1214\u20131225. https:\/\/doi.org\/10.14778\/3137628.3137633","journal-title":"Proc VLDB Endow"},{"key":"3979_CR36","doi-asserted-by":"publisher","unstructured":"Ghoting A, Krishnamurthy R, Pednault E, Reinwald B, Sindhwani V, Tatikonda S, Tian Y, Vaithyanathan S (2011) SystemML: Declarative machine learning on MapReduce. In: 2011 IEEE 27th International conference on data engineering, pp 231\u2013242. https:\/\/doi.org\/10.1109\/ICDE.2011.5767930","DOI":"10.1109\/ICDE.2011.5767930"},{"key":"3979_CR37","doi-asserted-by":"publisher","unstructured":"Li S, Chen L, Kumar A (2019) Enabling and optimizing non-linear feature interactions in factorized linear algebra. In: Proceedings of the 2019 International conference on management of data. SIGMOD \u201919, pp 1571\u20131588. ACM. https:\/\/doi.org\/10.1145\/3299869.3319878","DOI":"10.1145\/3299869.3319878"},{"key":"3979_CR38","doi-asserted-by":"publisher","unstructured":"Abo Khamis M, Ngo HQ, Nguyen X, Olteanu D, Schleich M (2018) In-database learning with sparse tensors. In: Proceedings of the 37th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems. SIGMOD\/PODS \u201918, pp 325\u2013340. ACM. https:\/\/doi.org\/10.1145\/3196959.3196960","DOI":"10.1145\/3196959.3196960"},{"issue":"1","key":"3979_CR39","doi-asserted-by":"publisher","first-page":"107","DOI":"10.1007\/s10994-006-5833-1","volume":"62","author":"M Richardson","year":"2006","unstructured":"Richardson M, Domingos P (2006) Markov logic networks. Mach Learn 62(1):107\u2013136. https:\/\/doi.org\/10.1007\/s10994-006-5833-1","journal-title":"Mach Learn"},{"key":"3979_CR40","doi-asserted-by":"publisher","unstructured":"Getoor L (2013) Probabilistic soft logic: A scalable approach for markov random fields over continuous-valued variables. In: Proceedings of the 7th International conference on theory, practice, and applications of rules on the Web. RuleML\u201913, pp 1\u20131. Springer, Berlin, Heidelberg. https:\/\/doi.org\/10.1007\/978-3-642-39617-5_1","DOI":"10.1007\/978-3-642-39617-5_1"},{"issue":"6","key":"3979_CR41","doi-asserted-by":"publisher","first-page":"373","DOI":"10.14778\/1978665.1978669","volume":"4","author":"F Niu","year":"2011","unstructured":"Niu F, R\u00e9 C, Doan A, Shavlik J (2011) Tuffy: Scaling up statistical inference in markov logic networks using an RDBMS. Proc VLDB Endow 4(6):373\u2013384. https:\/\/doi.org\/10.14778\/1978665.1978669","journal-title":"Proc VLDB Endow"},{"key":"3979_CR42","doi-asserted-by":"publisher","unstructured":"Niu F, Zhang C, Re C, Shavlik J (2012) Scaling inference for markov logic via dual decomposition. In: Proceedings of the 2012 IEEE 12th International conference on data mining. ICDM \u201912, pp 1032\u20131037. IEEE Computer Society. https:\/\/doi.org\/10.1109\/ICDM.2012.96","DOI":"10.1109\/ICDM.2012.96"},{"key":"3979_CR43","doi-asserted-by":"publisher","unstructured":"Zhang C, R\u00e9 C (2013) Towards high-throughput gibbs sampling at scale: A study across storage managers. In: Proceedings of the 2013 ACM SIGMOD International conference on management of data. SIGMOD \u201913, pp 397\u2013408. ACM, New York. https:\/\/doi.org\/10.1145\/2463676.2463702","DOI":"10.1145\/2463676.2463702"},{"key":"3979_CR44","unstructured":"Zhang C, R\u00e9 C, Sadeghian A, Shan Z, Shin J, Wang F, Wu S (2014) Feature engineering for knowledge base construction. IEEE Data Eng Bull"},{"key":"3979_CR45","doi-asserted-by":"publisher","unstructured":"Lu Y, Chowdhery A, Kandula S (2016) Optasia: A relational platform for efficient large-scale video analytics. In: Proceedings of the Seventh ACM Symposium on Cloud Computing. SoCC \u201916, pp 57\u201370. ACM, New York, NY, USA. https:\/\/doi.org\/10.1145\/2987550.2987564","DOI":"10.1145\/2987550.2987564"},{"key":"3979_CR46","unstructured":"Zhang H, Ananthanarayanan G, Bodik P, Philipose M, Bahl P, Freedman MJ (2017) Live video analytics at scale with approximation and delay-tolerance. In: Proceedings of the 14th USENIX conference on networked systems design and implementation. NSDI\u201917, pp 377\u2013392. USENIX Association,. http:\/\/dl.acm.org\/citation.cfm?id=3154630.3154661. Accessed 13 Oct 2019"},{"key":"3979_CR47","doi-asserted-by":"publisher","unstructured":"Watcharapichat P, Morales VL, Fernandez RC, Pietzuch P (2016) Ako: Decentralised deep learning with partial gradient exchange. In: Proceedings of the Seventh ACM symposium on cloud computing. SoCC \u201916, pp 84\u201397. ACM. https:\/\/doi.org\/10.1145\/2987550.2987586","DOI":"10.1145\/2987550.2987586"},{"key":"3979_CR48","unstructured":"Duan S, Babu S (2007) Processing forecasting queries. In: Proceedings of the 33rd international conference on very large data bases. VLDB \u201907, pp 711\u2013722. VLDB Endowment. http:\/\/dl.acm.org\/citation.cfm?id=1325851.1325933. Accessed 13 Oct 2019"},{"key":"3979_CR49","unstructured":"Fischer U (2015) Forecasting in database systems. In: Seidl, T, Ritter, N, Sch\u00f6ning, H, Sattler, K-U, H\u00e4rder, T, Friedrich, S, Wingerath, W (eds.) Datenbanksysteme F\u00fcr Business, Technologie und Web (BTW 2015), pp 483\u2013492. Gesellschaft f\u00fcr Informatik e.V."},{"key":"3979_CR50","unstructured":"Low Y, Bickson D, Gonzalez J, Guestrin C, Kyrola A, Hellerstein J (2010) Graphlab: A new framework for parallel machine learning. In: UAI"},{"key":"3979_CR51","doi-asserted-by":"publisher","unstructured":"Baumann P, Dehmel A, Furtado P, Ritsch R, Widmann N (1998) The multidimensional database system RasDaMan. In: Proceedings of the 1998 ACM SIGMOD International conference on management of data. SIGMOD \u201998, pp 575\u2013577. ACM. https:\/\/doi.org\/10.1145\/276304.276386","DOI":"10.1145\/276304.276386"},{"key":"3979_CR52","doi-asserted-by":"crossref","unstructured":"Stonebraker M, Brown P, Poliakov A, Raman S (2011) The architecture of sciDB. In: Proceedings of the 23rd international conference on scientific and statistical database management. SSDBM\u201911, pp 1\u201316. Springer. http:\/\/dl.acm.org\/citation.cfm?id=2032397.2032399. Accessed 13 Oct 2019","DOI":"10.1007\/978-3-642-22351-8_1"},{"key":"3979_CR53","doi-asserted-by":"publisher","unstructured":"Huang B, Babu S, Yang J (2013) Cumulon: Optimizing statistical data analysis in the cloud. In: Proceedings of the 2013 ACM SIGMOD International conference on management of data. SIGMOD \u201913, pp 1\u201312. ACM, New York, NY, USA. https:\/\/doi.org\/10.1145\/2463676.2465273","DOI":"10.1145\/2463676.2465273"},{"key":"3979_CR54","doi-asserted-by":"publisher","unstructured":"Sparks ER, Talwalkar A, Haas D, Franklin MJ, Jordan MI, Kraska T (2015) Automating model search for large scale machine learning. In: Proceedings of the Sixth ACM symposium on cloud computing. SoCC \u201915, pp 368\u2013380. ACM. https:\/\/doi.org\/10.1145\/2806777.2806945","DOI":"10.1145\/2806777.2806945"},{"issue":"1","key":"3979_CR55","doi-asserted-by":"publisher","first-page":"51","DOI":"10.1145\/2949741.2949754","volume":"45","author":"A Alexandrov","year":"2016","unstructured":"Alexandrov A, Katsifodimos A, Krastev G, Markl V (2016) Implicit parallelism through deep language embedding. SIGMOD Rec 45(1):51\u201358. https:\/\/doi.org\/10.1145\/2949741.2949754","journal-title":"SIGMOD Rec"},{"key":"3979_CR56","unstructured":"Russ R (2007) NetCDF-4 : Software implementing an enhanced data model for the geosciences"},{"key":"3979_CR57","doi-asserted-by":"publisher","unstructured":"Baumann P (2016) Array Databases. In: Liu L, \u00d6zsu M (eds) Encyclopedia of Database Systems. Springer, New York, NY. https:\/\/doi.org\/10.1007\/978-1-4899-7993-3_2061-2","DOI":"10.1007\/978-1-4899-7993-3_2061-2"},{"key":"3979_CR58","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s40537-020-00399-2","volume":"8","author":"P Baumann","year":"2021","unstructured":"Baumann P, Misev D, Merticariu V, Huu BP (2021) Array databases: concepts, standards, implementations. J Big Data 8:1\u201361. https:\/\/doi.org\/10.1186\/s40537-020-00399-2","journal-title":"J Big Data"},{"issue":"4","key":"3979_CR59","doi-asserted-by":"publisher","first-page":"401","DOI":"10.1007\/BF01231603","volume":"3","author":"P Baumann","year":"1994","unstructured":"Baumann P (1994) Management of multidimensional discrete data. VLDB J 3(4):401\u2013444. https:\/\/doi.org\/10.1007\/BF01231603","journal-title":"VLDB J"},{"key":"3979_CR60","doi-asserted-by":"publisher","unstructured":"Sarawagi S, Stonebraker M (1994) Efficient organization of large multidimensional arrays. In: Proceedings of 1994 IEEE 10th International conference on data engineering, pp 328\u2013336. https:\/\/doi.org\/10.1109\/ICDE.1994.283048","DOI":"10.1109\/ICDE.1994.283048"},{"key":"3979_CR61","doi-asserted-by":"publisher","unstructured":"Liaukevich V, Mi\u0161ev D, Baumann P, Merticariu V (2017) Location and processing aware datacube caching. In: Proceedings of the 29th international conference on scientific and statistical database management. SSDBM \u201917, pp 34\u20131346. ACM. https:\/\/doi.org\/10.1145\/3085504.3085539","DOI":"10.1145\/3085504.3085539"},{"key":"3979_CR62","doi-asserted-by":"publisher","unstructured":"Cheng Y, Rusu F (2013) Astronomical data processing in EXTASCID. In: Proceedings of the 25th international conference on scientific and statistical database management. SSDBM, pp. 47\u20131474. ACM. https:\/\/doi.org\/10.1145\/2484838.2484875","DOI":"10.1145\/2484838.2484875"},{"key":"3979_CR63","doi-asserted-by":"publisher","unstructured":"Zhang Y, Kersten M, Ivanova M, Nes N (2011) SciQL: Bridging the gap between science and relational DBMS. In: Proceedings of the 15th Symposium on International Database Engineering & Applications. IDEAS \u201911, pp 124\u2013133. ACM. https:\/\/doi.org\/10.1145\/2076623.2076639","DOI":"10.1145\/2076623.2076639"},{"key":"3979_CR64","unstructured":"PostGIS (2019 ) Post GIS Raster Manual. http:\/\/postgis.net\/docs\/manual-dev\/using_raster_dataman.html. Accessed 14 Oct 2019"},{"key":"3979_CR65","unstructured":"Teradata (2019) Array Data Type. https:\/\/docs.teradata.com\/r\/Teradata-Database-SQL-Data-Types-and-Literals\/June-2017\/ARRAY\/VARRAY-Data-Type. Accessed 14 Oct 2019"},{"key":"3979_CR66","unstructured":"GeoServer, Oracle Georaster User Manual (2019). https:\/\/docs.geoserver.org\/latest\/en\/user\/data\/raster\/oraclegeoraster.html. Accessed 14 Oct 2019"},{"key":"3979_CR67","unstructured":"Information technology database languages \u2014 SQL \u2014 Part 15: Multi-dimensional arrays (SQL\/MDA) (2019) Standard, International Organization for Standardization"},{"issue":"4","key":"3979_CR68","doi-asserted-by":"publisher","first-page":"447","DOI":"10.1007\/s10707-009-0087-2","volume":"14","author":"P Baumann","year":"2010","unstructured":"Baumann P (2010) The OGC web coverage processing service (WCPS) standard. GeoInformatica 14(4):447\u2013479. https:\/\/doi.org\/10.1007\/s10707-009-0087-2. Accessed 14 Oct 2019","journal-title":"GeoInformatica"},{"key":"3979_CR69","unstructured":"Nexedi (2016) Wendelin.core Tutorial. https:\/\/www.nexedi.com\/wendelin-Core.Tutorial.2016. Accessed 14 Oct 2019"},{"key":"3979_CR70","unstructured":"TensorFlow (2019) An end-to-end open source machine learning platform. https:\/\/www.tensorflow.org\/. Accessed 15 Oct 2019"},{"key":"3979_CR71","unstructured":"Xtensor (2019) Multi-dimensional arrays with broadcasting and lazy computing. https:\/\/xtensor.readthedocs.io\/en\/latest\/. Accessed 15 Oct 2019"},{"key":"3979_CR72","unstructured":"OPeNDAP (2019) Advanced Software for Remote Data Retrieval. https:\/\/www.opendap.org\/. Accessed 15 Oct 2019"},{"key":"3979_CR73","unstructured":"Ophidia (2019) High Performance Data Mining & Analytics for eScience. http:\/\/ophidia.cmcc.it\/. Accessed 15 Oct 2019"},{"key":"3979_CR74","unstructured":"Google Earth Engine (2019) A planetary-scale platform for Earth science data & analysis. https:\/\/earthengine.google.com\/. Accessed 15 Oct 2019"},{"issue":"4","key":"3979_CR75","doi-asserted-by":"publisher","first-page":"349","DOI":"10.14778\/3025111.3025117","volume":"10","author":"S Papadopoulos","year":"2016","unstructured":"Papadopoulos S, Datta K, Madden S, Mattson T (2016) The TileDB array data storage manager. Proc VLDB Endow 10(4):349\u2013360. https:\/\/doi.org\/10.14778\/3025111.3025117","journal-title":"Proc VLDB Endow"},{"key":"3979_CR76","unstructured":"Boost (2019) C++ Libraries. https:\/\/www.boost.org\/doc\/libs\/1_71_0\/libs\/geometry\/doc\/html\/index.html. Accessed 15 Oct 2019"},{"key":"3979_CR77","unstructured":"Open Data Cube (2019) An Open Source Geospatial Data Management & Analysis Platform. https:\/\/www.opendatacube.org\/. Accessed 15 Oct 2019"},{"key":"3979_CR78","unstructured":"xarray (2019) N-D labeled arrays and datasets in Python. http:\/\/xarray.pydata.org\/en\/stable\/. Accessed 15 Oct 2019"},{"key":"3979_CR79","doi-asserted-by":"publisher","unstructured":"McKinney W (2010) Data structures for statistical computing in Python. In: St\u00e9fan van der Walt, Jarrod Millman (eds.) Proceedings of the 9th python in science conference, pp 56\u201361. https:\/\/doi.org\/10.25080\/Majora-92bf1922-00a","DOI":"10.25080\/Majora-92bf1922-00a"},{"issue":"4","key":"3979_CR80","doi-asserted-by":"publisher","first-page":"447","DOI":"10.1007\/s10707-009-0087-2","volume":"14","author":"P Baumann","year":"2010","unstructured":"Baumann P (2010) The OGC web coverage processing service (WCPS) standard. Geoinformatica 14(4):447\u2013479. https:\/\/doi.org\/10.1007\/s10707-009-0087-2","journal-title":"Geoinformatica"}],"container-title":["Applied Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10489-022-03979-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10489-022-03979-2\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10489-022-03979-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,19]],"date-time":"2023-05-19T11:32:09Z","timestamp":1684495929000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10489-022-03979-2"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,8,12]]},"references-count":80,"journal-issue":{"issue":"9","published-print":{"date-parts":[[2023,5]]}},"alternative-id":["3979"],"URL":"https:\/\/doi.org\/10.1007\/s10489-022-03979-2","relation":{},"ISSN":["0924-669X","1573-7497"],"issn-type":[{"value":"0924-669X","type":"print"},{"value":"1573-7497","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,8,12]]},"assertion":[{"value":"7 July 2022","order":1,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"12 August 2022","order":2,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}