{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:16:42Z","timestamp":1750220202570,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":24,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,9,21]],"date-time":"2022-09-21T00:00:00Z","timestamp":1663718400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,9,21]]},"DOI":"10.1145\/3537674.3554748","type":"proceedings-article","created":{"date-parts":[[2022,9,7]],"date-time":"2022-09-07T16:08:30Z","timestamp":1662566910000},"page":"142-149","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Pressure Test: Finding Appropriate Data Size for Practice in Data Science Education"],"prefix":"10.1145","author":[{"given":"Yong","family":"Zheng","sequence":"first","affiliation":[{"name":"Illinois Institute of Technology, USA"}]},{"given":"Arnold","family":"Liu","sequence":"additional","affiliation":[{"name":"Illinois Institute of Technology, USA"}]},{"given":"Shuaiqi","family":"Zheng","sequence":"additional","affiliation":[{"name":"Illinois Institute of Technology, USA"}]}],"member":"320","published-online":{"date-parts":[[2022,9,21]]},"reference":[{"volume-title":"Innovations in Electronics and Communication Engineering","author":"Bhathal Gurjit\u00a0Singh","key":"e_1_3_2_1_2_1","unstructured":"Gurjit\u00a0Singh Bhathal and Amardeep Singh . 2019. Big data computing with distributed computing frameworks . In Innovations in Electronics and Communication Engineering . Springer , 467\u2013477. Gurjit\u00a0Singh Bhathal and Amardeep Singh. 2019. Big data computing with distributed computing frameworks. In Innovations in Electronics and Communication Engineering. Springer, 467\u2013477."},{"key":"e_1_3_2_1_3_1","volume-title":"Random forests. Machine learning 45, 1","author":"Breiman Leo","year":"2001","unstructured":"Leo Breiman . 2001. Random forests. Machine learning 45, 1 ( 2001 ), 5\u201332. Leo Breiman. 2001. Random forests. Machine learning 45, 1 (2001), 5\u201332."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2016.05.513"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDSP.2015.7251923"},{"key":"e_1_3_2_1_6_1","volume-title":"Data mining curriculum: A proposal (Version 1.0). Intensive working group of ACM SIGKDD curriculum committee 140","author":"Chakrabarti Soumen","year":"2006","unstructured":"Soumen Chakrabarti , Martin Ester , Usama Fayyad , Johannes Gehrke , Jiawei Han , Shinichi Morishita , Gregory Piatetsky-Shapiro , and Wei Wang . 2006. Data mining curriculum: A proposal (Version 1.0). Intensive working group of ACM SIGKDD curriculum committee 140 ( 2006 ), 1\u201310. Soumen Chakrabarti, Martin Ester, Usama Fayyad, Johannes Gehrke, Jiawei Han, Shinichi Morishita, Gregory Piatetsky-Shapiro, and Wei Wang. 2006. Data mining curriculum: A proposal (Version 1.0). Intensive working group of ACM SIGKDD curriculum committee 140 (2006), 1\u201310."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1146\/annurev-statistics-060116-053930"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPSW.2017.122"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2016.05.517"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1080\/00031305.2015.1077729"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1080\/00031305.2017.1356747"},{"key":"e_1_3_2_1_12_1","volume-title":"Performance analysis of distributed computing frameworks for big data analytics: hadoop vs spark. Computaci\u00f3n y Sistemas 24, 2","author":"Ketu Shwet","year":"2020","unstructured":"Shwet Ketu , Pramod\u00a0Kumar Mishra , and Sonali Agarwal . 2020. Performance analysis of distributed computing frameworks for big data analytics: hadoop vs spark. Computaci\u00f3n y Sistemas 24, 2 ( 2020 ), 669\u2013686. Shwet Ketu, Pramod\u00a0Kumar Mishra, and Sonali Agarwal. 2020. Performance analysis of distributed computing frameworks for big data analytics: hadoop vs spark. Computaci\u00f3n y Sistemas 24, 2 (2020), 669\u2013686."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2019.2939196"},{"key":"e_1_3_2_1_14_1","volume-title":"Matthew Wiener","author":"Liaw Andy","year":"2002","unstructured":"Andy Liaw , Matthew Wiener , 2002 . Classification and regression by randomForest. R news 2, 3 (2002), 18\u201322. Andy Liaw, Matthew Wiener, 2002. Classification and regression by randomForest. R news 2, 3 (2002), 18\u201322."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.5555\/3154557"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"crossref","unstructured":"Alexander\u00a0J McLeod Michael Bliemel and Nancy Jones. 2017. Examining the adoption of big data and analytics curriculum. Business Process Management Journal(2017).  Alexander\u00a0J McLeod Michael Bliemel and Nancy Jones. 2017. Examining the adoption of big data and analytics curriculum. Business Process Management Journal(2017).","DOI":"10.1108\/BPMJ-12-2015-0174"},{"key":"e_1_3_2_1_17_1","volume-title":"Data science and its relationship to big data and data-driven decision making. Big data 1, 1","author":"Provost Foster","year":"2013","unstructured":"Foster Provost and Tom Fawcett . 2013. Data science and its relationship to big data and data-driven decision making. Big data 1, 1 ( 2013 ), 51\u201359. Foster Provost and Tom Fawcett. 2013. Data science and its relationship to big data and data-driven decision making. Big data 1, 1 (2013), 51\u201359."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jaccedu.2016.12.008"},{"key":"e_1_3_2_1_19_1","volume-title":"Big data and data science: what should we teach?Expert Systems 33, 4","author":"Song Il-Yeol","year":"2016","unstructured":"Il-Yeol Song and Yongjun Zhu . 2016. Big data and data science: what should we teach?Expert Systems 33, 4 ( 2016 ), 364\u2013373. Il-Yeol Song and Yongjun Zhu. 2016. Big data and data science: what should we teach?Expert Systems 33, 4 (2016), 364\u2013373."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.3233\/EFI-160977"},{"key":"e_1_3_2_1_21_1","volume-title":"Big data for education: Data mining, data analytics, and web dashboards. Governance studies at Brookings 4, 1","author":"West M","year":"2012","unstructured":"Darrell\u00a0 M West . 2012. Big data for education: Data mining, data analytics, and web dashboards. Governance studies at Brookings 4, 1 ( 2012 ), 1\u201310. Darrell\u00a0M West. 2012. Big data for education: Data mining, data analytics, and web dashboards. Governance studies at Brookings 4, 1 (2012), 1\u201310."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1080\/08993408.2018.1486120"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/JPROC.2015.2388958"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"crossref","unstructured":"Tian Zheng. 2017. Teaching Data Science in a Statistical Curriculum: Can We Teach More by Teaching Less?Journal of Computational and Graphical Statistics 26 4(2017) 772\u2013774.  Tian Zheng. 2017. Teaching Data Science in a Statistical Curriculum: Can We Teach More by Teaching Less?Journal of Computational and Graphical Statistics 26 4(2017) 772\u2013774.","DOI":"10.1080\/10618600.2017.1385473"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/3349266.3351380"}],"event":{"name":"SIGITE '22: The 23rd Annual Conference on Information Technology Education","sponsor":["SIGITE ACM Special Interest Group on Information Technology Education"],"location":"Chicago IL USA","acronym":"SIGITE '22"},"container-title":["Proceedings of the 23rd Annual Conference on Information Technology Education"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3537674.3554748","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3537674.3554748","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:02:50Z","timestamp":1750186970000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3537674.3554748"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,9,21]]},"references-count":24,"alternative-id":["10.1145\/3537674.3554748","10.1145\/3537674"],"URL":"https:\/\/doi.org\/10.1145\/3537674.3554748","relation":{},"subject":[],"published":{"date-parts":[[2022,9,21]]},"assertion":[{"value":"2022-09-21","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}