{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,21]],"date-time":"2026-04-21T14:01:21Z","timestamp":1776780081693,"version":"3.51.2"},"reference-count":133,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2016,8,2]],"date-time":"2016-08-02T00:00:00Z","timestamp":1470096000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"IBM CAS research and NSERC"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Comput. Surv."],"published-print":{"date-parts":[[2017,6,30]]},"abstract":"<jats:p>With almost everything now online, organizations look at the Big Data collected to gain insights for improving their services. In the analytics process, derivation of such insights requires experimenting-with and integrating different analytics techniques, while handling the Big Data high arrival velocity and large volumes. Existing solutions cover bits-and-pieces of the analytics process, leaving it to organizations to assemble their own ecosystem or buy an off-the-shelf ecosystem that can have unnecessary components to them. We build on this point by dividing the Big Data Analytics problem into six main pillars. We characterize and show examples of solutions designed for each of these pillars. We then integrate these six pillars into a taxonomy to provide an overview of the possible state-of-the-art analytics ecosystems. In the process, we highlight a number of ecosystems to meet organizations different needs. Finally, we identify possible areas of research for building future Big Data Analytics Ecosystems.<\/jats:p>","DOI":"10.1145\/2963143","type":"journal-article","created":{"date-parts":[[2016,8,4]],"date-time":"2016-08-04T13:26:34Z","timestamp":1470317194000},"page":"1-36","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":35,"title":["The Six Pillars for Building Big Data Analytics Ecosystems"],"prefix":"10.1145","volume":"49","author":[{"given":"Shadi","family":"Khalifa","sequence":"first","affiliation":[{"name":"Queen's University, ON, Canada"}]},{"given":"Yehia","family":"Elshater","sequence":"additional","affiliation":[{"name":"Queen's University, ON, Canada"}]},{"given":"Kiran","family":"Sundaravarathan","sequence":"additional","affiliation":[{"name":"Queen's University, ON, Canada"}]},{"given":"Aparna","family":"Bhat","sequence":"additional","affiliation":[{"name":"Queen's University, ON, Canada"}]},{"given":"Patrick","family":"Martin","sequence":"additional","affiliation":[{"name":"Queen's University, ON, Canada"}]},{"given":"Fahim","family":"Imam","sequence":"additional","affiliation":[{"name":"Queen's University, ON, Canada"}]},{"given":"Dan","family":"Rope","sequence":"additional","affiliation":[{"name":"IBM, Washington D.C., United States"}]},{"given":"Mike","family":"Mcroberts","sequence":"additional","affiliation":[{"name":"IBM, Washington D.C., United States"}]},{"given":"Craig","family":"Statchuk","sequence":"additional","affiliation":[{"name":"IBM, Canada, Ottawa, ON, Canada"}]}],"member":"320","published-online":{"date-parts":[[2016,8,2]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/CLUSTER.2011.26"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.14778\/1687553.1687625"},{"key":"e_1_2_1_3_1","first-page":"33","article-title":"A survey on ontology reasoners and comparison","volume":"57","author":"Abburu Sunitha","year":"2012","journal-title":"Int\u2019l Journal of Computer Applications."},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/1807167.1807294"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.14778\/2367502.2367533"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.14778\/2536222.2536229"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/1966445.1966472"},{"key":"e_1_2_1_8_1","volume-title":"CouchDB: the definitive guide. O\u2019Reilly","author":"Anderson J. Chris"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1561\/1900000036"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2012.136"},{"key":"e_1_2_1_11_1","volume-title":"Proc. of the 7th Int\u2019l Conf. on Parallel Processing and Applied Mathematics (PPAM\u201907)","author":"Barker Adam","year":"2007"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/2463676.2463683"},{"key":"e_1_2_1_13_1","unstructured":"Sean Bechhofer Frank van Harmelen Jim Hendler Ian Horrocks Deborah L. McGuinness Peter F. Patel-Schneider and Lynn Andrea Stein. 2004. OWL web ontology language reference. W3C Recommendation.  Sean Bechhofer Frank van Harmelen Jim Hendler Ian Horrocks Deborah L. McGuinness Peter F. Patel-Schneider and Lynn Andrea Stein. 2004. OWL web ontology language reference. W3C Recommendation."},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-010-0645-5"},{"key":"e_1_2_1_15_1","volume-title":"Proc. of the RapidMiner Community Meeting and Conference.","author":"Bockermann Christian","year":"2012"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.5555\/1753235.1753251"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/MC.2012.37"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.14778\/1920841.1920881"},{"key":"e_1_2_1_19_1","volume-title":"Redis in Action","author":"Carlson Josiah L."},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/1978915.1978919"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2012.55"},{"key":"e_1_2_1_22_1","unstructured":"Badrish Chandramouli Jonathan Goldstein Mike Barnett Robert DeLine Danyel Fisher John C. Platt James F. Terwilliger and John Wernsing. 2014. The Trill Incremental Analytics Engine. MSR-TR-2014-54.  Badrish Chandramouli Jonathan Goldstein Mike Barnett Robert DeLine Danyel Fisher John C. Platt James F. Terwilliger and John Wernsing. 2014. The Trill Incremental Analytics Engine. MSR-TR-2014-54."},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/1365815.1365816"},{"key":"e_1_2_1_24_1","volume-title":"Retrieved January 20th","author":"Chapman Pete","year":"2000"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.5555\/1780582.1780585"},{"key":"e_1_2_1_27_1","volume-title":"MongoDB: The Definitive Guide. O\u2019Reilly Media","author":"Chodorow Kristina"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.5555\/1855711.1855732"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.14778\/1454159.1454167"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/DEST.2007.372004"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/1327452.1327492"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/1294261.1294281"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.future.2008.06.012"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1155\/2005\/128026"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.3233\/SW-2011-0034"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-03915-7_25"},{"key":"e_1_2_1_37_1","volume-title":"Proc. of the ECML-PKDD Workshop on Service-Oriented Knowledge Discovery.13--24","author":"Diamantini Claudia","year":"2009"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/2723372.2735377"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00778-013-0319-9"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/1851476.1851593"},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/BigDataCongress.2015.33"},{"key":"e_1_2_1_42_1","volume-title":"Retrieved January 20th","author":"EMC.","year":"2013"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.14778\/1988776.1988778"},{"key":"e_1_2_1_44_1","volume-title":"HBase: The Definitive Guide. O\u2019Reilly Media","author":"George Lars"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1145\/945445.945450"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/MIS.2010.9"},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICMLA.2005.65"},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1186\/2192-113X-2-22"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1006\/knac.1993.1008"},{"key":"e_1_2_1_50_1","volume-title":"White Paper. Atos Scientific Community. Retrieved January 20th","author":"G\u00fcemes Celestino","year":"2013"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1145\/2287016.2287022"},{"key":"e_1_2_1_52_1","volume-title":"A developer's guide to Amazon SimpleDB","author":"Habeeb Mocky"},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.14778\/2350229.2350259"},{"key":"e_1_2_1_54_1","volume-title":"Proc. of the 25th CASCON Conf. 26--34","author":"Hamdaqa Mohammad","year":"2015"},{"key":"e_1_2_1_55_1","volume-title":"Proc. of the 1996 SiGMOD. 27--34","author":"Han Jiawei","year":"1996"},{"key":"e_1_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2011.5767933"},{"key":"e_1_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.14778\/2367502.2367510"},{"key":"e_1_2_1_58_1","volume-title":"Proc. of the 8th USENIX Conf. on Networked Systems Design and Implementation (NSDI\u201911)","author":"Hindman Benjamin","year":"2011"},{"key":"e_1_2_1_59_1","volume-title":"Retrieved January 20th","author":"IBM.","year":"2012"},{"key":"e_1_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.1145\/1272998.1273005"},{"key":"e_1_2_1_61_1","volume-title":"Proc. of the 26th Int\u2019l Conf. on Very Large Data Bases (VLDB\u201900)","author":"Johnson Theodore"},{"key":"e_1_2_1_62_1","doi-asserted-by":"publisher","DOI":"10.5555\/1855533.1855555"},{"key":"e_1_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1145\/2790798.2790812"},{"key":"e_1_2_1_64_1","doi-asserted-by":"publisher","DOI":"10.1109\/BigDataCongress.2016.23"},{"key":"e_1_2_1_65_1","volume-title":"Proc. of the 3rd Planning to Learn Workshop (WS9) At the European Conf. on Artificial Intelligence (ECAI\u201910)","author":"Kietz Jorg-Uwe","year":"2010"},{"key":"e_1_2_1_66_1","doi-asserted-by":"publisher","DOI":"10.1145\/1989323.1989355"},{"key":"e_1_2_1_67_1","doi-asserted-by":"publisher","DOI":"10.14778\/2850578.2850582"},{"key":"e_1_2_1_68_1","doi-asserted-by":"publisher","DOI":"10.1109\/BigDataCongress.2015.12"},{"key":"e_1_2_1_69_1","volume-title":"Proc. of the 2nd IEEE Int\u2019l Conf. on Comp. Sci. and Info. Tech. (ICCSIT\u201909)","author":"Kotsiantis S. B."},{"key":"e_1_2_1_70_1","volume-title":"Proc. of the 6th Biennial Conf. on Innovative Data Systems Research (CIDR\u201913)","author":"Kraska Tim","year":"2013"},{"key":"e_1_2_1_71_1","doi-asserted-by":"publisher","DOI":"10.1145\/1773912.1773922"},{"key":"e_1_2_1_72_1","doi-asserted-by":"publisher","DOI":"10.14778\/2367502.2367520"},{"key":"e_1_2_1_73_1","doi-asserted-by":"publisher","DOI":"10.14778\/2336664.2336675"},{"key":"e_1_2_1_74_1","doi-asserted-by":"publisher","DOI":"10.14778\/2809974.2809979"},{"key":"e_1_2_1_75_1","doi-asserted-by":"publisher","DOI":"10.1145\/2503009"},{"key":"e_1_2_1_76_1","doi-asserted-by":"publisher","DOI":"10.1145\/1989323.1989424"},{"key":"e_1_2_1_77_1","doi-asserted-by":"publisher","DOI":"10.1145\/2628194.2628251"},{"key":"e_1_2_1_78_1","volume-title":"McKinsey Global Institute. Retrieved January 20th","author":"Manyika James","year":"2011"},{"key":"e_1_2_1_79_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2013.61"},{"key":"e_1_2_1_80_1","unstructured":"A. Martin D. Maladhy and V. P. Venkatesan. 2011. A framework for business intelligence application using ontological classification. Int\u2019l Journal of Engineering Science and Technology. 3. 1213--1221.  A. Martin D. Maladhy and V. P. Venkatesan. 2011. A framework for business intelligence application using ontological classification. Int\u2019l Journal of Engineering Science and Technology. 3. 1213--1221."},{"key":"e_1_2_1_81_1","doi-asserted-by":"publisher","DOI":"10.14778\/1920841.1920886"},{"key":"e_1_2_1_82_1","doi-asserted-by":"publisher","DOI":"10.5555\/645922.673500"},{"key":"e_1_2_1_83_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.future.2012.01.008"},{"key":"e_1_2_1_84_1","doi-asserted-by":"publisher","DOI":"10.1109\/VAST.2012.6400554"},{"key":"e_1_2_1_85_1","volume-title":"Intelligent Technologies for Information Analysis","author":"Morik Katharina"},{"key":"e_1_2_1_86_1","doi-asserted-by":"publisher","DOI":"10.1145\/2517349.2522738"},{"key":"e_1_2_1_87_1","volume-title":"Proc. of the Fifteenth National\/Tenth Conf. on Artificial Intelligence\/Innovative Applications of Artificial Intelligence (AAAI\u201998\/IAAI\u201998)","author":"Nau Dana S.","year":"1998"},{"key":"e_1_2_1_88_1","doi-asserted-by":"publisher","DOI":"10.5555\/975615"},{"key":"e_1_2_1_89_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDMW.2010.172"},{"key":"e_1_2_1_90_1","volume-title":"Inc. Retrieved January 20th","author":"Inc.","year":"2012"},{"key":"e_1_2_1_91_1","doi-asserted-by":"publisher","DOI":"10.1145\/1376616.1376726"},{"key":"e_1_2_1_92_1","doi-asserted-by":"publisher","DOI":"10.1145\/1989323.1989439"},{"key":"e_1_2_1_93_1","doi-asserted-by":"publisher","DOI":"10.1145\/1463434.1463497"},{"key":"e_1_2_1_94_1","doi-asserted-by":"publisher","DOI":"10.1145\/2063384.2063462"},{"key":"e_1_2_1_95_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10618-014-0363-0"},{"key":"e_1_2_1_96_1","volume-title":"Neo4j in Action. O\u2019Reilly Media","author":"Partner Jonas"},{"key":"e_1_2_1_97_1","doi-asserted-by":"publisher","DOI":"10.1145\/2095536.2095583"},{"key":"e_1_2_1_98_1","volume-title":"Proc. of RCOMM","author":"Prekopcsak Zoltan","year":"2011"},{"key":"e_1_2_1_99_1","volume-title":"Proc. of the 2013 CASCON Conf.. 192--199","author":"Rais-Ghasem Mohsen","year":"2013"},{"key":"e_1_2_1_100_1","doi-asserted-by":"publisher","DOI":"10.5555\/822086.823346"},{"key":"e_1_2_1_101_1","doi-asserted-by":"publisher","DOI":"10.1007\/11767138_20"},{"key":"e_1_2_1_102_1","volume-title":"Proc. of the 27th VLDB Conf. 653--656","author":"Sadri Reza","year":"2001"},{"key":"e_1_2_1_103_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISDA.2009.145"},{"key":"e_1_2_1_104_1","doi-asserted-by":"publisher","DOI":"10.1145\/2480741.2480748"},{"key":"e_1_2_1_105_1","doi-asserted-by":"publisher","DOI":"10.1145\/2502081.2502082"},{"key":"e_1_2_1_106_1","doi-asserted-by":"publisher","DOI":"10.14778\/2367502.2367513"},{"key":"e_1_2_1_107_1","doi-asserted-by":"publisher","DOI":"10.1109\/MSST.2010.5496972"},{"key":"e_1_2_1_108_1","volume-title":"Peter Baer Galvin, and Greg Gagne","author":"Silberschatz Abraham","year":"2008"},{"key":"e_1_2_1_109_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.engappai.2005.06.004"},{"key":"e_1_2_1_110_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.websem.2007.03.004"},{"key":"e_1_2_1_111_1","volume-title":"Case-based reasoning for diagnosis and solution planning","author":"Soltani Sima"},{"key":"e_1_2_1_112_1","volume-title":"Proc. of the 2013 IEEE 13th Int\u2019l Conf. on Data Mining (ICDM). 1187--1192","author":"Sparks Evan"},{"key":"e_1_2_1_113_1","doi-asserted-by":"publisher","DOI":"10.5555\/2208461.2208479"},{"key":"e_1_2_1_114_1","volume-title":"Proc. of Big Learning Workshop at NIPS.","author":"Talwalkara Ameet"},{"key":"e_1_2_1_115_1","doi-asserted-by":"publisher","DOI":"10.1145\/1083784.1083805"},{"key":"e_1_2_1_116_1","volume-title":"Getting Started with OrientDB","author":"Tesoriero Claudio"},{"key":"e_1_2_1_117_1","doi-asserted-by":"publisher","DOI":"10.14778\/1687553.1687609"},{"key":"e_1_2_1_118_1","doi-asserted-by":"publisher","DOI":"10.1007\/11814771_26"},{"key":"e_1_2_1_119_1","volume-title":"Retrieved January 20th","author":"Turner David","year":"2012"},{"key":"e_1_2_1_120_1","volume-title":"Workflow Management: Models, Methods, and Systems","author":"van der Aalst Wil","year":"2004"},{"key":"e_1_2_1_121_1","doi-asserted-by":"publisher","DOI":"10.1145\/2523616.2523633"},{"key":"e_1_2_1_122_1","volume-title":"Proc. of the 2nd IEEE Annual Conf. on Pervasive Computing and Communications Workshops (PERCOMW\u201904)","author":"Wang Xiao Hang","year":"2004"},{"key":"e_1_2_1_123_1","doi-asserted-by":"publisher","DOI":"10.1109\/CLUSTER.2010.24"},{"key":"e_1_2_1_124_1","volume-title":"Hadoop: The Definitive Guide","author":"White Tom","year":"2009","edition":"1"},{"key":"e_1_2_1_125_1","doi-asserted-by":"publisher","DOI":"10.5555\/2228298.2228335"},{"key":"e_1_2_1_126_1","doi-asserted-by":"publisher","DOI":"10.1145\/2463676.2465288"},{"key":"e_1_2_1_127_1","doi-asserted-by":"publisher","DOI":"10.1145\/2737182.2737186"},{"key":"e_1_2_1_128_1","doi-asserted-by":"publisher","DOI":"10.1145\/1084805.1084814"},{"key":"e_1_2_1_129_1","doi-asserted-by":"publisher","DOI":"10.1109\/BigData.Congress.2013.10"},{"key":"e_1_2_1_130_1","doi-asserted-by":"publisher","DOI":"10.1145\/1755913.1755940"},{"key":"e_1_2_1_131_1","volume-title":"Proc. of the 2nd USENIX Conf. on Hot Topics in Cloud Computing (HotCloud\u201910)","author":"Zaharia Matei","year":"2010"},{"key":"e_1_2_1_132_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2011.260"},{"key":"e_1_2_1_133_1","doi-asserted-by":"publisher","DOI":"10.1145\/2038916.2038929"},{"key":"e_1_2_1_134_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00778-012-0280-z"}],"container-title":["ACM Computing Surveys"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2963143","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2963143","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:54:04Z","timestamp":1750222444000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2963143"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,8,2]]},"references-count":133,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2017,6,30]]}},"alternative-id":["10.1145\/2963143"],"URL":"https:\/\/doi.org\/10.1145\/2963143","relation":{},"ISSN":["0360-0300","1557-7341"],"issn-type":[{"value":"0360-0300","type":"print"},{"value":"1557-7341","type":"electronic"}],"subject":[],"published":{"date-parts":[[2016,8,2]]},"assertion":[{"value":"2015-07-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2016-05-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2016-08-02","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}