{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,30]],"date-time":"2026-01-30T05:11:59Z","timestamp":1769749919991,"version":"3.49.0"},"reference-count":91,"publisher":"Association for Computing Machinery (ACM)","issue":"FSE","funder":[{"name":"European Research Council","award":["101093186"],"award-info":[{"award-number":["101093186"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Proc. ACM Softw. Eng."],"published-print":{"date-parts":[[2025,6,19]]},"abstract":"<jats:p>Literate programming environments like Jupyter and R Markdown notebooks, coupled with easy-to-use languages like Python and R, put a plethora of statistical methods right at a data analyst\u2019s fingertips. But are these methods being used correctly? Statistical methods make statistical assumptions about samples being analyzed, and in many cases produce reasonable looking results even if assumptions are not met.<\/jats:p>\n          <jats:p>We propose an approach that allows library developers to annotate functions with statistical assumptions, phrases them as hypotheses about the data, and inserts hypothesis tests investigating the likelihood that the assumption is met; this way, analysts using these functions will have their data checked automatically. We implement this approach in two tools: prob-check-py for Python, and prob-check-r for R, and to evaluate them we identify common hypothesis testing and statistical modeling functions, annotate them with the relevant statistical assumptions, and run 128 Kaggle notebooks that use those methods to identify misuses. Our investigation reveals statistically significant evidence against assumptions in 84.38% of surveyed notebooks, and in 53.36% of calls to annotated functions. In the case of hypothesis tests, had an equivalent test that did not make these assumptions been chosen, a different conclusion would have been drawn in 11.51% of cases.<\/jats:p>","DOI":"10.1145\/3729391","type":"journal-article","created":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T15:16:02Z","timestamp":1750346162000},"page":"2735-2758","source":"Crossref","is-referenced-by-count":1,"title":["Expressing and Checking Statistical Assumptions"],"prefix":"10.1145","volume":"2","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-0381-0477","authenticated-orcid":false,"given":"Alexi","family":"Turcotte","sequence":"first","affiliation":[{"name":"CISPA Helmholtz Center for Information Security, Saabr\u00fccken, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0005-2640-4673","authenticated-orcid":false,"given":"Zheyuan","family":"Wu","sequence":"additional","affiliation":[{"name":"Saarland University, Saarbr\u00fccken, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2025,6,19]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"Proceedings of the 41st ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI","author":"Allamanis Miltiadis","year":"2020","unstructured":"Miltiadis Allamanis, Earl T. Barr, Soline Ducousso, and Zheng Gao. 2020. Typilus: neural type hints. In Proceedings of the 41st ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI 2020). Association for Computing Machinery, New York, NY, USA. 91\u2013105. isbn:9781450376136 https:\/\/doi.org\/10.1145\/3385412.3385997 10.1145\/3385412.3385997"},{"key":"e_1_2_1_2_1","unstructured":"arkaradeniz. 2024. AB_Testing. See URL: https:\/\/www.kaggle.com\/code\/arkaradeniz\/ab-testing"},{"key":"e_1_2_1_3_1","volume-title":"International Conference on Computational Science and Its Applications. 1166\u20131176","author":"Badea Bogdan","year":"2006","unstructured":"Bogdan Badea and Adriana Vlad. 2006. Revealing statistical independence of two experimental data sets: an improvement on Spearman\u2019s algorithm. In International Conference on Computational Science and Its Applications. 1166\u20131176. https:\/\/doi.org\/10.1007\/11751540_127 10.1007\/11751540_127"},{"key":"e_1_2_1_4_1","volume-title":"Proceedings of the 19th international conference on Architectural support for programming languages and operating systems. 51\u201366","author":"Bornholt James","year":"2014","unstructured":"James Bornholt, Todd Mytkowicz, and Kathryn S McKinley. 2014. Uncertain<T>: a first-order type for uncertain data. In Proceedings of the 19th international conference on Architectural support for programming languages and operating systems. 51\u201366. https:\/\/doi.org\/10.1145\/2644865.2541958 10.1145\/2644865.2541958"},{"key":"e_1_2_1_5_1","unstructured":"car Team. 2024. leveneTest. See URL: https:\/\/rdrr.io\/cran\/car\/man\/leveneTest.html"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1080\/00949650902731377"},{"key":"e_1_2_1_7_1","unstructured":"carlolepelaars. 2019. Statistical Tutorial. See URL: https:\/\/www.kaggle.com\/code\/carlolepelaars\/statistics-tutorial"},{"key":"e_1_2_1_8_1","unstructured":"cbhyphen. 2024. Post College Salaries Exploration in R. See URL: https:\/\/www.kaggle.com\/code\/cbhyphen\/post-college-salaries-exploration-in-r"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.eja.2022.126666"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/2544173.2509548"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/1925844.1926410"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1111\/j.1468-0084.2008.00537.x"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1142\/S0219720009004230"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1214\/09-STS301"},{"key":"e_1_2_1_15_1","unstructured":"gcmadhan. 2024. COVID Analysis Prediction. See URL: https:\/\/www.kaggle.com\/code\/gcmadhan\/covid-analysis-prediction"},{"key":"e_1_2_1_16_1","first-page":"1","article-title":"Probability plotting methods for the analysis of data","volume":"55","author":"Gnanadesikan Ramanathan","year":"1968","unstructured":"Ramanathan Gnanadesikan and Martin B Wilk. 1968. Probability plotting methods for the analysis of data. Biometrika, 55, 1 (1968), 1\u201317.","journal-title":"Biometrika"},{"key":"e_1_2_1_17_1","volume-title":"Computer Aided Verification: 32nd International Conference, CAV 2020, Los Angeles, CA, USA, July 21\u201324, 2020, Proceedings, Part II 32","author":"Gopinathan Kiran","year":"2020","unstructured":"Kiran Gopinathan and Ilya Sergey. 2020. Certifying certainty and uncertainty in approximate membership query structures. In Computer Aided Verification: 32nd International Conference, CAV 2020, Los Angeles, CA, USA, July 21\u201324, 2020, Proceedings, Part II 32. 279\u2013303. https:\/\/doi.org\/10.1007\/978-3-030-53291-8_16 10.1007\/978-3-030-53291-8_16"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","unstructured":"Yimeng Guo Zhifei Chen Lin Chen Wenjie Xu Yanhui Li Yuming Zhou and Baowen Xu. 2024. Generating Python Type Annotations from Type Inference: How Far Are We? ACM Transactions on Software Engineering and Methodology https:\/\/doi.org\/10.1145\/3652153 10.1145\/3652153","DOI":"10.1145\/3652153"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1080\/03610918.2013.844835"},{"key":"e_1_2_1_20_1","volume-title":"Functional and Logic Programming: 8th International Symposium, FLOPS 2006, Fuji-Susono, Japan, April 24-26, 2006. Proceedings 8. 208\u2013225","author":"Hinze Ralf","year":"2006","unstructured":"Ralf Hinze, Johan Jeuring, and Andres L\u00f6h. 2006. Typed contracts for functional programming. In Functional and Logic Programming: 8th International Symposium, FLOPS 2006, Fuji-Susono, Japan, April 24-26, 2006. Proceedings 8. 208\u2013225. https:\/\/doi.org\/10.1007\/11737414_15 10.1007\/11737414_15"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.3389\/fpsyg.2012.00137"},{"key":"e_1_2_1_22_1","volume-title":"2019 IEEE\/ACM 41st International Conference on Software Engineering (ICSE). 608\u2013618","author":"Joshi Keyur","year":"2019","unstructured":"Keyur Joshi, Vimuth Fernando, and Sasa Misailovic. 2019. Statistical algorithmic profiling for randomized approximate programs. In 2019 IEEE\/ACM 41st International Conference on Software Engineering (ICSE). 608\u2013618. https:\/\/doi.org\/10.1109\/ICSE.2019.00071 10.1109\/ICSE.2019.00071"},{"key":"e_1_2_1_23_1","volume-title":"Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology. 591\u2013603","author":"Jun Eunice","year":"2019","unstructured":"Eunice Jun, Maureen Daum, Jared Roesch, Sarah Chasins, Emery Berger, Rene Just, and Katharina Reinecke. 2019. Tea: A high-level language and runtime system for automating statistical analysis. In Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology. 591\u2013603. https:\/\/doi.org\/10.1145\/3332165.3347940 10.1145\/3332165.3347940"},{"key":"e_1_2_1_24_1","volume-title":"Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems. 1\u201316","author":"Jun Eunice","year":"2022","unstructured":"Eunice Jun, Audrey Seo, Jeffrey Heer, and Ren\u00e9 Just. 2022. Tisane: Authoring statistical models via formal reasoning from conceptual and data relationships. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems. 1\u201316. https:\/\/doi.org\/10.1145\/3491102.3501888 10.1145\/3491102.3501888"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.29220\/CSAM.2024.31.1.001"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.22237\/jmasm"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.11138\/cderm"},{"key":"e_1_2_1_28_1","volume-title":"Proceedings of the 23rd International Conference on Evaluation and Assessment in Software Engineering. 134\u2013143","author":"Kitchenham Barbara","year":"2019","unstructured":"Barbara Kitchenham, Lech Madeyski, and Pearl Brereton. 2019. Problems with statistical practice in human-centric software engineering experiments. In Proceedings of the 23rd International Conference on Evaluation and Assessment in Software Engineering. 134\u2013143. https:\/\/doi.org\/10.1145\/3319008.3319009 10.1145\/3319008.3319009"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.22237\/jmasm"},{"key":"e_1_2_1_30_1","volume-title":"Proc. ACM Program. Lang., 4, POPL","author":"Lazarek Lukas","year":"2019","unstructured":"Lukas Lazarek, Alexis King, Samanvitha Sundar, Robert Bruce Findler, and Christos Dimoulas. 2019. Does blame shifting work? Proc. ACM Program. Lang., 4, POPL (2019), Article 65, dec, 29 pages. https:\/\/doi.org\/10.1145\/3371133 10.1145\/3371133"},{"key":"e_1_2_1_31_1","unstructured":"lmtest. 2024. gqtest. See URL: https:\/\/www.rdocumentation.org\/packages\/lmtest\/versions\/0.9-40\/topics\/gqtest"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.11613\/bm.2013.018"},{"key":"e_1_2_1_33_1","volume-title":"Freund\u2019s mathematical statistics. 6","author":"Miller Irwin","unstructured":"Irwin Miller and Marylees Miller. 1999. John E. Freund\u2019s mathematical statistics. 6, Prentice Hall Upper Saddle River, NJ."},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.4103\/aca.ACA_248_18"},{"key":"e_1_2_1_35_1","first-page":"01","article-title":"Power Comparisons of Shapiro-Wilk, Kolmogorov-Smirnov, Lilliefors and Anderson-Darling Tests","volume":"2","author":"Razali Nornadiah Mohd","year":"2011","unstructured":"Nornadiah Mohd Razali and Bee Yap. 2011. Power Comparisons of Shapiro-Wilk, Kolmogorov-Smirnov, Lilliefors and Anderson-Darling Tests. J. Stat. Model. Analytics, 2 (2011), 01.","journal-title":"J. Stat. Model. Analytics"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cam.2023.115649"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1002\/sim.8086"},{"key":"e_1_2_1_38_1","volume-title":"Statistical assumptions of substantive analyses across the general linear model: a mini-review. Frontiers in psychology, 3","author":"Nimon Kim F","year":"2012","unstructured":"Kim F Nimon. 2012. Statistical assumptions of substantive analyses across the general linear model: a mini-review. Frontiers in psychology, 3 (2012), 322."},{"key":"e_1_2_1_39_1","unstructured":"nortest. 2025. ad.test. See URL: https:\/\/www.rdocumentation.org\/packages\/nortest\/versions\/1.0-4\/topics\/ad.test"},{"key":"e_1_2_1_40_1","article-title":"Assessing Normality: Applications in Multi-Group Designs","volume":"9","author":"Othman Abdul R","year":"2015","unstructured":"Abdul R Othman, HJ Keselman, and Rand Wilcox. 2015. Assessing Normality: Applications in Multi-Group Designs.. Malaysian Journal of Mathematical Sciences, 9, 1 (2015).","journal-title":"Malaysian Journal of Mathematical Sciences"},{"key":"e_1_2_1_41_1","first-page":"1806","volume-title":"Meeting the assumptions of statistical tests: an important and often forgotten step to reporting valid results. Jornal Brasileiro de Pneumologia, 44","author":"Patino Cecilia Maria","year":"2018","unstructured":"Cecilia Maria Patino and Juliana Carvalho Ferreira. 2018. Meeting the assumptions of statistical tests: an important and often forgotten step to reporting valid results. Jornal Brasileiro de Pneumologia, 44, 05 (2018), 353\u2013353. https:\/\/doi.org\/10.1590\/S1806-37562018000000303 10.1590\/S1806-37562018000000303"},{"key":"e_1_2_1_42_1","first-page":"1","article-title":"Use of statistical techniques in analysis of biological data","volume":"1","author":"Perveen Farzana","year":"2012","unstructured":"Farzana Perveen and Zahid Hussain. 2012. Use of statistical techniques in analysis of biological data. Basic Research Journal of Agricultural Science and Review, 1, 1 (2012), 1\u201310.","journal-title":"Basic Research Journal of Agricultural Science and Review"},{"key":"e_1_2_1_43_1","unstructured":"Python Team. 2020. Type Hints for Python. https:\/\/docs.python.org\/3\/library\/typing.html"},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00184-008-0194-3"},{"key":"e_1_2_1_45_1","unstructured":"R Core Team. 2024. aov. See URL: https:\/\/www.rdocumentation.org\/packages\/stats\/versions\/3.6.2\/topics\/aov"},{"key":"e_1_2_1_46_1","unstructured":"R Core Team. 2024. bartlett.test. See URL: https:\/\/www.rdocumentation.org\/packages\/stats\/versions\/3.6.2\/topics\/bartlett.test"},{"key":"e_1_2_1_47_1","unstructured":"R Core Team. 2024. cor.test. See URL: https:\/\/www.rdocumentation.org\/packages\/stats\/versions\/3.6.2\/topics\/cor.test"},{"key":"e_1_2_1_48_1","unstructured":"R Core Team. 2024. ks.test. See URL: https:\/\/www.rdocumentation.org\/packages\/stats\/versions\/3.6.2\/topics\/ks.test"},{"key":"e_1_2_1_49_1","unstructured":"R Core Team. 2024. lm. See URL: https:\/\/www.rdocumentation.org\/packages\/stats\/versions\/3.6.2\/topics\/lm"},{"key":"e_1_2_1_50_1","unstructured":"R Core Team. 2024. The R Stats Package. See URL: https:\/\/stat.ethz.ch\/R-manual\/R-devel\/library\/stats\/html\/00Index.html"},{"key":"e_1_2_1_51_1","unstructured":"R Core Team. 2024. shapiro.test. See URL: https:\/\/www.rdocumentation.org\/packages\/stats\/versions\/3.6.2\/topics\/shapiro.test"},{"key":"e_1_2_1_52_1","unstructured":"R Core Team. 2024. t.test. See URL: https:\/\/www.rdocumentation.org\/packages\/stats\/versions\/3.6.2\/topics\/t.test"},{"key":"e_1_2_1_53_1","volume-title":"Proceedings of the 16th ACM SIGPLAN International Symposium on Dynamic Languages. 57\u201370","author":"Rak-Amnouykit Ingkarat","year":"2020","unstructured":"Ingkarat Rak-Amnouykit, Daniel McCrevan, Ana Milanova, Martin Hirzel, and Julian Dolby. 2020. Python 3 types in the wild: a tale of two type systems. In Proceedings of the 16th ACM SIGPLAN International Symposium on Dynamic Languages. 57\u201370. https:\/\/doi.org\/10.1145\/3426422.3426981 10.1145\/3426422.3426981"},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00362-009-0224-x"},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2288-12-81"},{"key":"e_1_2_1_56_1","volume-title":"Testing statistical hypotheses. 3","author":"Romano Joseph P","unstructured":"Joseph P Romano. 1986. Testing statistical hypotheses. 3, Springer."},{"key":"e_1_2_1_57_1","volume-title":"Proceedings of the 35th ACM SIGPLAN Conference on Programming Language Design and Implementation. 112\u2013122","author":"Sampson Adrian","year":"2014","unstructured":"Adrian Sampson, Pavel Panchekha, Todd Mytkowicz, Kathryn S McKinley, Dan Grossman, and Luis Ceze. 2014. Expressing and verifying probabilistic assertions. In Proceedings of the 35th ACM SIGPLAN Conference on Programming Language Design and Implementation. 112\u2013122. https:\/\/doi.org\/10.1145\/2594291.2594294 10.1145\/2594291.2594294"},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1111\/j.1365-2230.2006.02206.x"},{"key":"e_1_2_1_59_1","unstructured":"scikit. 2024. sklearn.linear_model.LinearRegression. See URL: https:\/\/scikit-learn.org\/stable\/modules\/generated\/sklearn.linear_model.LinearRegression.html"},{"key":"e_1_2_1_60_1","unstructured":"SciPy Community. 2024. scipy.stats.anderson. See URL: https:\/\/docs.scipy.org\/doc\/scipy\/reference\/generated\/scipy.stats.anderson.html"},{"key":"e_1_2_1_61_1","unstructured":"SciPy Community. 2024. scipy.stats.bartlett. See URL: https:\/\/docs.scipy.org\/doc\/scipy\/reference\/generated\/scipy.stats.bartlett.html"},{"key":"e_1_2_1_62_1","unstructured":"SciPy Community. 2024. scipy.stats.f_oneway. See URL: https:\/\/docs.scipy.org\/doc\/scipy\/reference\/generated\/scipy.stats.f_oneway.html"},{"key":"e_1_2_1_63_1","unstructured":"SciPy Community. 2024. scipy.stats.kstest. See URL: https:\/\/docs.scipy.org\/doc\/scipy\/reference\/generated\/scipy.stats.kstest.html"},{"key":"e_1_2_1_64_1","unstructured":"SciPy Community. 2024. scipy.stats.levene. See URL: https:\/\/docs.scipy.org\/doc\/scipy\/reference\/generated\/scipy.stats.levene.html"},{"key":"e_1_2_1_65_1","unstructured":"SciPy Community. 2024. scipy.stats.linregress. See URL: https:\/\/docs.scipy.org\/doc\/scipy\/reference\/generated\/scipy.stats.linregress.html"},{"key":"e_1_2_1_66_1","unstructured":"SciPy Community. 2024. scipy.stats.shapiro. See URL: https:\/\/docs.scipy.org\/doc\/scipy\/reference\/generated\/scipy.stats.shapiro.html"},{"key":"e_1_2_1_67_1","unstructured":"SciPy Community. 2024. scipy.stats.spearmanr. See URL: https:\/\/docs.scipy.org\/doc\/scipy\/reference\/generated\/scipy.stats.spearmanr.html"},{"key":"e_1_2_1_68_1","unstructured":"SciPy Community. 2024. scipy.stats.ttest_1samp. See URL: https:\/\/docs.scipy.org\/doc\/scipy\/reference\/generated\/scipy.stats.ttest_1samp.html"},{"key":"e_1_2_1_69_1","unstructured":"SciPy Community. 2024. scipy.stats.ttest_ind. See URL: https:\/\/docs.scipy.org\/doc\/scipy\/reference\/generated\/scipy.stats.ttest_ind.html"},{"key":"e_1_2_1_70_1","unstructured":"SciPy Community. 2024. scipy.stats.ttest_rel. See URL: https:\/\/docs.scipy.org\/doc\/scipy\/reference\/generated\/scipy.stats.ttest_rel.html"},{"key":"e_1_2_1_71_1","unstructured":"SciPy Community. 2024. Statistical functions (scipy.stats). See URL: https:\/\/docs.scipy.org\/doc\/scipy\/reference\/stats.html"},{"key":"e_1_2_1_72_1","volume-title":"Computer Aided Verification: 16th International Conference, CAV 2004, Boston, MA, USA, July 13-17, 2004. Proceedings 16","author":"Sen Koushik","year":"2004","unstructured":"Koushik Sen, Mahesh Viswanathan, and Gul Agha. 2004. Statistical model checking of black-box probabilistic systems. In Computer Aided Verification: 16th International Conference, CAV 2004, Boston, MA, USA, July 13-17, 2004. Proceedings 16. 202\u2013215. https:\/\/doi.org\/10.1007\/978-3-540-27813-9_16 10.1007\/978-3-540-27813-9_16"},{"key":"e_1_2_1_73_1","unstructured":"shashwatwork. 2021. Guide to Statistical Hypothesis Testing in Python. See URL: https:\/\/www.kaggle.com\/code\/shashwatwork\/guide-to-statistical-hypothesis-tests-in-python"},{"key":"e_1_2_1_74_1","unstructured":"shilongzhuang. 2023. Statistical Testing Guide for Beginners. See URL: https:\/\/www.kaggle.com\/code\/shilongzhuang\/statistical-testing-guide-for-beginners"},{"key":"e_1_2_1_75_1","doi-asserted-by":"publisher","DOI":"10.1002\/sim.2175"},{"key":"e_1_2_1_76_1","volume-title":"Practical business statistics","author":"Siegel Andrew F","unstructured":"Andrew F Siegel. 2016. Practical business statistics. Academic Press."},{"key":"e_1_2_1_77_1","unstructured":"slamnz. 2024. IBM Employee Attrition Analysis by Numerics. See URL: https:\/\/www.kaggle.com\/code\/slamnz\/ibm-employee-attrition-analysis-by-numerics"},{"key":"e_1_2_1_78_1","unstructured":"StatLint. 2024. IBM Employee Attrition Analysis by Numerics. See URL: https:\/\/www.kaggle.com\/code\/statlint\/ibm-employee-attrition-analysis-by-numerics"},{"key":"e_1_2_1_79_1","unstructured":"statsmodels. 2024. het_goldfeldquandt. See URL: https:\/\/www.statsmodels.org\/dev\/generated\/statsmodels.stats.diagnostic.het_goldfeldquandt.html"},{"key":"e_1_2_1_80_1","doi-asserted-by":"publisher","DOI":"10.1080\/03610920600628999"},{"key":"e_1_2_1_81_1","unstructured":"Meltem Sonmez Turan. 2016. IID Testing in SP 800 90B. See URL: https:\/\/csrc.nist.gov\/csrc\/media\/events\/random-bit-generation-workshop-2016\/documents\/presentations\/sessionii-3-meltem-sonmez-turan-presentation.pdf"},{"key":"e_1_2_1_82_1","volume-title":"Proc. ACM Program. Lang., 4, OOPSLA","author":"Turcotte Alexi","year":"2020","unstructured":"Alexi Turcotte, Aviral Goel, Filip K\u0159ikava, and Jan Vitek. 2020. Designing types for R, empirically. Proc. ACM Program. Lang., 4, OOPSLA (2020), Article 181, nov, 25 pages. https:\/\/doi.org\/10.1145\/3428249 10.1145\/3428249"},{"key":"e_1_2_1_83_1","volume-title":"Proceedings of the 14th Workshop on Implementation, Compilation, Optimization of Object-Oriented Languages, Programs and Systems (ICOOOLPS \u201919)","author":"Turcotte Alexi","year":"2019","unstructured":"Alexi Turcotte and Jan Vitek. 2019. Towards a Type System for R. In Proceedings of the 14th Workshop on Implementation, Compilation, Optimization of Object-Oriented Languages, Programs and Systems (ICOOOLPS \u201919). Association for Computing Machinery, New York, NY, USA. Article 4, 5 pages. isbn:9781450368629 https:\/\/doi.org\/10.1145\/3340670.3342426 10.1145\/3340670.3342426"},{"key":"e_1_2_1_84_1","volume-title":"Testing statistical assumptions in research","author":"Verma JP","unstructured":"JP Verma and Abdel-Salam G Abdel-Salam. 2019. Testing statistical assumptions in research. John Wiley & Sons."},{"key":"e_1_2_1_85_1","volume-title":"Proceedings of the 33rd annual acm conference on human factors in computing systems. 2693\u20132702","author":"Wacharamanotham Chat","year":"2015","unstructured":"Chat Wacharamanotham, Krishna Subramanian, Sarah Theres V\u00f6lkel, and Jan Borchers. 2015. Statsplorer: Guiding novices in statistical analysis. In Proceedings of the 33rd annual acm conference on human factors in computing systems. 2693\u20132702. https:\/\/doi.org\/10.1145\/2702123.2702347 10.1145\/2702123.2702347"},{"key":"e_1_2_1_86_1","volume-title":"All of statistics: a concise course in statistical inference","author":"Wasserman Larry","unstructured":"Larry Wasserman. 2004. All of statistics: a concise course in statistical inference. Springer Science & Business Media."},{"key":"e_1_2_1_87_1","volume-title":"Applying contemporary statistical techniques","author":"Wilcox Rand R","unstructured":"Rand R Wilcox. 2003. Applying contemporary statistical techniques. Elsevier."},{"key":"e_1_2_1_88_1","volume-title":"Proceedings of the ACM on Programming Languages, 2, OOPSLA","author":"Williams Jack","year":"2018","unstructured":"Jack Williams, J Garrett Morris, and Philip Wadler. 2018. The root cause of blame: contracts for intersection and union types. Proceedings of the ACM on Programming Languages, 2, OOPSLA (2018), 1\u201329. https:\/\/doi.org\/10.1145\/3276504 10.1145\/3276504"},{"key":"e_1_2_1_89_1","doi-asserted-by":"crossref","first-page":"2141","DOI":"10.1080\/00949655.2010.520163","article-title":"Comparisons of various types of normality tests","volume":"81","author":"Yap Bee Wah","year":"2011","unstructured":"Bee Wah Yap and Chiaw Hock Sim. 2011. Comparisons of various types of normality tests. Journal of Statistical Computation and Simulation, 81, 12 (2011), 2141\u20132155.","journal-title":"Journal of Statistical Computation and Simulation"},{"key":"e_1_2_1_90_1","doi-asserted-by":"publisher","DOI":"10.1080\/00949655.2010.520163"},{"key":"e_1_2_1_91_1","doi-asserted-by":"publisher","DOI":"10.1080\/00221309.1996.9921274"}],"container-title":["Proceedings of the ACM on Software Engineering"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3729391","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T15:19:06Z","timestamp":1750346346000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3729391"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,6,19]]},"references-count":91,"journal-issue":{"issue":"FSE","published-print":{"date-parts":[[2025,6,19]]}},"alternative-id":["10.1145\/3729391"],"URL":"https:\/\/doi.org\/10.1145\/3729391","relation":{},"ISSN":["2994-970X"],"issn-type":[{"value":"2994-970X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,6,19]]}}}