{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,24]],"date-time":"2026-03-24T06:06:16Z","timestamp":1774332376446,"version":"3.50.1"},"reference-count":51,"publisher":"Association for Computing Machinery (ACM)","issue":"3","license":[{"start":{"date-parts":[[2022,2,23]],"date-time":"2022-02-23T00:00:00Z","timestamp":1645574400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Commun. ACM"],"published-print":{"date-parts":[[2022,3]]},"abstract":"<jats:p>Given the complexity of data science projects and related demand for human expertise, automation has the potential to transform the data science process.<\/jats:p>","DOI":"10.1145\/3495256","type":"journal-article","created":{"date-parts":[[2022,2,23]],"date-time":"2022-02-23T15:20:04Z","timestamp":1645629604000},"page":"76-87","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":40,"title":["Automating data science"],"prefix":"10.1145","volume":"65","author":[{"given":"Tijl","family":"De Bie","sequence":"first","affiliation":[{"name":"Ghent University, Belgium"}]},{"given":"Luc","family":"De Raedt","sequence":"additional","affiliation":[{"name":"KU Leuven, Belgium and \u00d6rebro University, Sweden"}]},{"given":"Jos\u00e9","family":"Hern\u00e1ndez-Orallo","sequence":"additional","affiliation":[{"name":"Universitat Polit\u00e8cnica de Val\u00e8ncia, Spain"}]},{"given":"Holger H.","family":"Hoos","sequence":"additional","affiliation":[{"name":"Leiden University, The Netherlands and University of British Columbia in Vancouver, Canada"}]},{"given":"Padhraic","family":"Smyth","sequence":"additional","affiliation":[{"name":"University of California, Irvine"}]},{"given":"Christopher K. I.","family":"Williams","sequence":"additional","affiliation":[{"name":"University of Edinburgh, U.K and Alan Turing Institute, London, U.K"}]}],"member":"320","published-online":{"date-parts":[[2022,2,23]]},"reference":[{"key":"e_1_2_1_1_1","first-page":"13","article-title":"Guidelines for human-AI interaction. In Proceedings of the 2019 CHI Conf. on Human Factors","volume":"1","author":"Amershi S.","year":"2019","unstructured":"Amershi, S. et al. Guidelines for human-AI interaction. In Proceedings of the 2019 CHI Conf. on Human Factors in Computing Systems, 2019, 1--13.","journal-title":"Computing Systems"},{"key":"e_1_2_1_2_1","volume-title":"et al. Plant functional trait change across a warming tundra biome. Nature 562, 7725","author":"Bjorkman A.","year":"2018","unstructured":"Bjorkman, A. et al. Plant functional trait change across a warming tundra biome. Nature 562, 7725 (2018), 57."},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10618-011-0229-7"},{"key":"e_1_2_1_4_1","volume-title":"Metalearning: Applications to Data Mining","author":"Brazdil P.","year":"2008","unstructured":"Brazdil, P., Carrier, C., Soares, C., and Vilalta, R. Metalearning: Applications to Data Mining. Springer Science & Business Media, 2008."},{"key":"e_1_2_1_5_1","volume-title":"et al. CRISP-DM 1.0 Step-by-step data mining guide","author":"Chapman P.","year":"2000","unstructured":"Chapman, P. et al. CRISP-DM 1.0 Step-by-step data mining guide, 2000."},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.330129"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.5555\/861869"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-41398-8_3"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.2200\/S00692ED1V01Y201601AIM032"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1080\/10618600.2017.1384734"},{"key":"e_1_2_1_11_1","first-page":"324","article-title":"Analytics using machine learning-guided simulations with application to healthcare scenarios","volume":"277","author":"Elbattah M.","year":"2018","unstructured":"Elbattah, M. and Molloy, O. Analytics using machine learning-guided simulations with application to healthcare scenarios. Analytics and Knowledge Mgmt. Auerbach Publications, 2018, 277--324.","journal-title":"Analytics and Knowledge Mgmt. Auerbach Publications"},{"key":"e_1_2_1_12_1","first-page":"2962","article-title":"Efficient and robust automated machine learning","volume":"28","author":"Feurer M.","year":"2015","unstructured":"Feurer, M., Klein, A., Eggensperger, K., Springenberg, J., Blum, M., and Hutter, F. Efficient and robust automated machine learning. Advances in Neural Information Processing Systems 28, 2015, 2962--2970.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1017\/S0269888900004136"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/1132960.1132963"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/2578855.2535850"},{"key":"e_1_2_1_16_1","first-page":"5","article-title":"A survey of methods for explaining black box models","volume":"51","author":"Guidotti R.","year":"2018","unstructured":"Guidotti, R., Monreale, A., Ruggieri, S., Turini, F., Giannotti, F., and Pedreschi, D. A survey of methods for explaining black box models. ACM Computing Surveys 51, 5 (2018). 93.","journal-title":"ACM Computing Surveys"},{"key":"e_1_2_1_17_1","volume-title":"Proceedings of the Workshop on Automatic Machine Learning 64","author":"Guyon I.","year":"2016","unstructured":"Guyon, I., et al. A brief review of the ChaLearn AutoML Challenge: Any-time any-dataset learning without human intervention. In Proceedings of the Workshop on Automatic Machine Learning 64 (2016), 21--30. F. Hutter, L. Kotthoff, and J. Vanschoren, Eds."},{"key":"e_1_2_1_18_1","volume-title":"Proceedings of Conf. on Innovative Data Systems Research","author":"Heer J.","year":"2015","unstructured":"Heer, J., Hellerstein, J., and Kandel, S. Predictive interaction for data transformation. In Proceedings of Conf. on Innovative Data Systems Research, 2015."},{"key":"e_1_2_1_19_1","volume-title":"Data Wrangling. Encyclopedia of Big Data Technologies","author":"Heer J.","year":"2019","unstructured":"Heer, J., Hellerstein, J., and Kandel, S. Data Wrangling. Encyclopedia of Big Data Technologies. S. Sakr and A. Zomaya, Eds. Springer, 2019."},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.3233\/AIC-160705"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-25566-3_40"},{"key":"e_1_2_1_22_1","volume-title":"Systems, Challenges","author":"Hutter F.","year":"2019","unstructured":"Hutter, F., Kotthoff, L., and Vanschoren, J., Eds. Automated Machine Learning---Methods, Systems, Challenges. Springer, 2019."},{"key":"e_1_2_1_23_1","first-page":"175","article-title":"Visual analytics: Definition, process, and challenges","volume":"154","author":"Keim D.","year":"2008","unstructured":"Keim, D., Andrienko, G., Fekete, J., G\u00f6rg, C., Kohlhammer, J., and Melan\u00e7on, G. Visual analytics: Definition, process, and challenges. Information Visualization. Springer, 2008, 154--175.","journal-title":"Information Visualization. Springer"},{"key":"e_1_2_1_24_1","volume-title":"Functional genomic hypothesis generation and experimentation by a robot scientist. Nature 427, 6971","author":"King R.","year":"2004","unstructured":"King, R., et al. Functional genomic hypothesis generation and experimentation by a robot scientist. Nature 427, 6971 (2004, 247."},{"key":"e_1_2_1_25_1","volume-title":"Automatic model selection and hyperparameter optimization in WEKA. J. Machine Learning Research 18, 1","author":"Kotthoff L.","year":"2017","unstructured":"Kotthoff, L., Thornton, C., Hoos, H., Hutter, F., and Leyton-Brown, K. Auto-WEKA 2.0: Automatic model selection and hyperparameter optimization in WEKA. J. Machine Learning Research 18, 1 (2017), 826--830."},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.5555\/29379"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/2594291.2594333"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01246-5_2"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v28i1.8904"},{"key":"e_1_2_1_30_1","unstructured":"Mansinghka V. Tibbetts R. Baxter J. Shafto P. and Eaves B. BayesDB: A probabilistic programming system for querying the probable implications of data. 2015; arXiv:1512.05006."},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2019.2962680"},{"key":"e_1_2_1_32_1","volume-title":"Data engineering for data analytics: A classification of the issues, and case studies. 2020","author":"Nazabal A.","year":"2004","unstructured":"Nazabal, A., Williams, C., Colavizza, G., Smith, C., and Williams, A. Data engineering for data analytics: A classification of the issues, and case studies. 2020; arXiv:2004.12929."},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2019"},{"key":"e_1_2_1_34_1","volume-title":"Proceedings of the 30th Intern. Conf. on Neural Information Processing Systems","author":"Ratner A.","year":"2016","unstructured":"Ratner, A., De Sa, C., Wu, S., Selsam, D., and R\u00e9, C. Data programming: Creating large training sets, quickly. In Proceedings of the 30th Intern. Conf. on Neural Information Processing Systems, 2016, 3574--3582."},{"key":"e_1_2_1_35_1","first-page":"1","article-title":"Interactive intent modeling: Information discovery beyond search","volume":"58","author":"Ruotsalo T.","year":"2014","unstructured":"Ruotsalo, T., Jacucci, G., Myllym\u00e4ki, P., and Kaski, S. Interactive intent modeling: Information discovery beyond search. Commun. ACM 58, 1 (Jan. 2014), 86--92.","journal-title":"Commun. ACM"},{"key":"e_1_2_1_36_1","first-page":"2503","article-title":"Hidden technical debt in machine learning systems","volume":"28","author":"Sculley D.","year":"2015","unstructured":"Sculley, D. et al. Hidden technical debt in machine learning systems. Advances in Neural Info. Processing Systems 28, (2015), 2503--2511.","journal-title":"Advances in Neural Info. Processing Systems"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/2480741.2480748"},{"key":"e_1_2_1_38_1","doi-asserted-by":"crossref","first-page":"4","DOI":"10.1080\/10618600.1998.10474794","article-title":"Intelligent support for exploratory data analysis","volume":"7","author":"St. Amant R.","year":"1998","unstructured":"St. Amant, R. and Cohen, P. Intelligent support for exploratory data analysis. J. Computational and Graphical Statistics 7, 4 (1998), 545--558.","journal-title":"J. Computational and Graphical Statistics"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/3219819.3220057"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1038\/d41586-019-02849-1"},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/2487575.2487629"},{"key":"e_1_2_1_42_1","author":"Tukey","year":"1977","unstructured":"Tukey, J. Exploratory Data Analysis. Pearson, 1977.","journal-title":"J. Exploratory Data Analysis. Pearson"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/1464291.1464366"},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1145\/2641190.2641198"},{"key":"e_1_2_1_45_1","volume-title":"Proceedings of the Intern. Conf. on Very Large Data Bases 8","author":"Vartak M.","year":"2015","unstructured":"Vartak, M., Rahman, S., Madden, S., Parameswaran, A., and Polyzotis, N. SeeDB: Efficient data-driven visualization recommendations to support visual analytics. In Proceedings of the Intern. Conf. on Very Large Data Bases 8 (2015), 2182."},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1145\/2939502.2939516"},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1145\/3359313"},{"key":"e_1_2_1_48_1","volume-title":"Proceedings of the 2015 IEEE Intern. Congress on Big Data, 716--719","author":"Wasay A.","unstructured":"Wasay, A., Athanassoulis, M., and Idreos, S. Queriosity: Automated data exploration. In Proceedings of the 2015 IEEE Intern. Congress on Big Data, 716--719."},{"key":"e_1_2_1_49_1","first-page":"1","article-title":"Voyager: Exploratory analysis via faceted browsing of visualization recommendations","volume":"22","author":"Wongsuphasawat K.","year":"2015","unstructured":"Wongsuphasawat, K., Moritz, D., Anand, A., Mackinlay, J., Howe, B., and Heer, J. Voyager: Exploratory analysis via faceted browsing of visualization recommendations. IEEE Trans. Visualization and Computer Graphics 22, 1 (2015), 649--658.","journal-title":"IEEE Trans. Visualization and Computer Graphics"},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1080\/01431161.2018.1452075"},{"key":"e_1_2_1_51_1","volume-title":"Proceedings of the 2018 CHI Conf. on Human Factors in Computing Systems, 1--12","author":"Zgraggen E.","unstructured":"Zgraggen, E., Zhao, Z., Zeleznik, R., and Kraska, T. Investigating the effect of the multiple comparisons problem in visual analysis. In Proceedings of the 2018 CHI Conf. on Human Factors in Computing Systems, 1--12."}],"container-title":["Communications of the ACM"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3495256","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3495256","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3495256","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T17:49:23Z","timestamp":1750182563000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3495256"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,2,23]]},"references-count":51,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2022,3]]}},"alternative-id":["10.1145\/3495256"],"URL":"https:\/\/doi.org\/10.1145\/3495256","relation":{},"ISSN":["0001-0782","1557-7317"],"issn-type":[{"value":"0001-0782","type":"print"},{"value":"1557-7317","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,2,23]]},"assertion":[{"value":"2022-02-23","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}