{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,20]],"date-time":"2026-02-20T03:15:55Z","timestamp":1771557355106,"version":"3.50.1"},"reference-count":176,"publisher":"Springer Science and Business Media LLC","issue":"4","license":[{"start":{"date-parts":[[2021,8,6]],"date-time":"2021-08-06T00:00:00Z","timestamp":1628208000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,8,6]],"date-time":"2021-08-06T00:00:00Z","timestamp":1628208000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100003385","name":"Georg-August-Universit\u00e4t G\u00f6ttingen","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100003385","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Adv Data Anal Classif"],"published-print":{"date-parts":[[2022,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>The research on and application of artificial intelligence (AI) has triggered a comprehensive scientific, economic, social and political discussion. Here we argue that statistics, as an interdisciplinary scientific field, plays a substantial role both for the theoretical and practical understanding of AI and for its future development. Statistics might even be considered a core element of AI. With its specialist knowledge of data evaluation, starting with the precise formulation of the research question and passing through a study design stage on to analysis and interpretation of the results, statistics is a natural partner for other disciplines in teaching, research and practice. This paper aims at highlighting the relevance of statistical methodology in the context of AI development. In particular, we discuss contributions of statistics to the field of artificial intelligence concerning methodological development, planning and design of studies, assessment of data quality and data collection, differentiation of causality and associations and assessment of uncertainty in results. Moreover, the paper also discusses the equally necessary and meaningful extensions of curricula in schools and universities to integrate statistical aspects into AI teaching.<\/jats:p>","DOI":"10.1007\/s11634-021-00455-6","type":"journal-article","created":{"date-parts":[[2021,8,6]],"date-time":"2021-08-06T11:03:15Z","timestamp":1628247795000},"page":"823-846","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":60,"title":["Is there a role for statistics in artificial intelligence?"],"prefix":"10.1007","volume":"16","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-0291-4378","authenticated-orcid":false,"given":"Sarah","family":"Friedrich","sequence":"first","affiliation":[]},{"given":"Gerd","family":"Antes","sequence":"additional","affiliation":[]},{"given":"Sigrid","family":"Behr","sequence":"additional","affiliation":[]},{"given":"Harald","family":"Binder","sequence":"additional","affiliation":[]},{"given":"Werner","family":"Brannath","sequence":"additional","affiliation":[]},{"given":"Florian","family":"Dumpert","sequence":"additional","affiliation":[]},{"given":"Katja","family":"Ickstadt","sequence":"additional","affiliation":[]},{"given":"Hans A.","family":"Kestler","sequence":"additional","affiliation":[]},{"given":"Johannes","family":"Lederer","sequence":"additional","affiliation":[]},{"given":"Heinz","family":"Leitg\u00f6b","sequence":"additional","affiliation":[]},{"given":"Markus","family":"Pauly","sequence":"additional","affiliation":[]},{"given":"Ansgar","family":"Steland","sequence":"additional","affiliation":[]},{"given":"Adalbert","family":"Wilhelm","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5347-7441","authenticated-orcid":false,"given":"Tim","family":"Friede","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2021,8,6]]},"reference":[{"issue":"1","key":"455_CR1","doi-asserted-by":"publisher","first-page":"136","DOI":"10.1016\/j.asoc.2005.06.001","volume":"7","author":"L Aburto","year":"2007","unstructured":"Aburto L, Weber R (2007) Improved supply chain management based on hybrid demand forecasts. Appl Soft Comput 7(1):136\u2013144","journal-title":"Appl Soft Comput"},{"key":"455_CR2","unstructured":"AInow (2020) https:\/\/ainowinstitute.org\/, accessed 02.02.2020"},{"key":"455_CR3","unstructured":"Athey S, Imbens GW (2015) Machine learning for estimating heterogeneous causal effects. Stanford University, Graduate School of Business, Tech. rep"},{"key":"455_CR4","doi-asserted-by":"crossref","unstructured":"Athey S, Imbens GW (2017) The econometrics of randomized experiments. Handbook of Economic Field Experiments, vol 1. Elsevier, Amsterdam, pp 73\u2013140","DOI":"10.1016\/bs.hefe.2016.10.003"},{"issue":"2","key":"455_CR5","doi-asserted-by":"publisher","first-page":"1148","DOI":"10.1214\/18-AOS1709","volume":"47","author":"S Athey","year":"2019","unstructured":"Athey S, Tibshirani J, Wager S (2019) Generalized random forests. Ann Stat 47(2):1148\u20131178","journal-title":"Ann Stat"},{"issue":"1","key":"455_CR6","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1162\/coli.2008.07-055-r2-06-29","volume":"35","author":"S Barrachina","year":"2009","unstructured":"Barrachina S, Bender O, Casacuberta F, Civera J, Cubel E, Khadivi S, Lagarda A, Ney H, Tom\u00e1s J, Vidal E, Vilar JM (2009) Statistical approaches to computer-assisted translation. Comput Linguistics 35(1):3\u201328. https:\/\/doi.org\/10.1162\/coli.2008.07-055-r2-06-29","journal-title":"Comput Linguistics"},{"issue":"1","key":"455_CR7","doi-asserted-by":"publisher","first-page":"11","DOI":"10.1016\/j.jarmac.2018.01.001","volume":"7","author":"DM Bartels","year":"2018","unstructured":"Bartels DM, Hastie R, Urminsky O (2018) Connecting laboratory and field research in judgment and decision making: causality and the breadth of external validity. J Appl Res Memory Cogn 7(1):11\u201315. https:\/\/doi.org\/10.1016\/j.jarmac.2018.01.001","journal-title":"J Appl Res Memory Cogn"},{"issue":"1","key":"455_CR8","doi-asserted-by":"publisher","first-page":"85","DOI":"10.1214\/aos\/1105988581","volume":"32","author":"PL Bartlett","year":"2004","unstructured":"Bartlett PL, Bickel PJ, B\u00fchlmann P, Freund Y, Friedman J, Hastie T, Jiang W, Jordan MJ, Koltchinskii V, Lugosi G et al (2004) Discussions of boosting papers, and rejoinders. Ann Stat 32(1):85\u2013134","journal-title":"Ann Stat"},{"key":"455_CR9","unstructured":"Beck M, Dumpert F, Feuerhake J (2018) Machine Learning in Official Statistics. arXiv preprint arXiv:1812.10422"},{"issue":"7391","key":"455_CR10","doi-asserted-by":"publisher","first-page":"531","DOI":"10.1038\/483531a","volume":"483","author":"CG Begley","year":"2012","unstructured":"Begley CG, Ellis LM (2012) Raise standards for preclinical cancer research. Nature 483(7391):531\u2013533","journal-title":"Nature"},{"key":"455_CR11","volume-title":"Dynamic programming","author":"R Bellman","year":"1957","unstructured":"Bellman R (1957) Dynamic programming. Princeton University Press, Princeton, New Jersey"},{"issue":"3","key":"455_CR12","doi-asserted-by":"publisher","first-page":"299","DOI":"10.1177\/009286151004400312","volume":"44","author":"N Benda","year":"2010","unstructured":"Benda N, Branson M, Maurer W, Friede T (2010) Aspects of modernizing drug development using clinical scenario planning and evaluation. Drug Inf J DIJ\/Drug Inf Assoc 44(3):299\u2013315","journal-title":"Drug Inf J DIJ\/Drug Inf Assoc"},{"key":"455_CR13","doi-asserted-by":"publisher","DOI":"10.1002\/9780470090183","volume-title":"Symbolic data analysis: conceptual statistics and data mining","author":"L Billard","year":"2006","unstructured":"Billard L, Diday E (2006) Symbolic data analysis: conceptual statistics and data mining. Wiley, Chichester, West Sussex"},{"key":"455_CR14","volume-title":"Pattern recognition and machine learning","author":"CM Bishop","year":"2006","unstructured":"Bishop CM (2006) Pattern recognition and machine learning. Springer, New York"},{"issue":"2","key":"455_CR15","doi-asserted-by":"publisher","first-page":"95","DOI":"10.1177\/2472630319890316","volume":"25","author":"A Blasiak","year":"2020","unstructured":"Blasiak A, Khong J, Kee T (2020) CURATE.AI: optimizing personalized medicine with artificial intelligence. SLAS TECHNOLOGY: Trans Life Sci Innov 25(2):95\u2013105","journal-title":"SLAS TECHNOLOGY: Trans Life Sci Innov"},{"issue":"3","key":"455_CR16","doi-asserted-by":"publisher","first-page":"977","DOI":"10.1111\/biom.12861","volume":"74","author":"T Bluhmki","year":"2018","unstructured":"Bluhmki T, Schmoor C, Dobler D, Pauly M, Finke J, Schumacher M, Beyersmann J (2018) A wild bootstrap approach for the Aalen\u2013Johansen estimator. Biometrics 74(3):977\u2013985","journal-title":"Biometrics"},{"key":"455_CR17","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-57155-8","volume-title":"Analysis of symbolic data","author":"HH Bock","year":"2000","unstructured":"Bock HH, Diday E (2000) Analysis of symbolic data. Springer, Heidelberg"},{"issue":"8","key":"455_CR18","doi-asserted-by":"publisher","first-page":"1183","DOI":"10.1002\/sim.8470","volume":"39","author":"F Bonofiglio","year":"2020","unstructured":"Bonofiglio F, Schumacher M, Binder H (2020) Recovery of original individual person data (ipd) inferences from empirical ipd summaries only: applications to distributed computing under disclosure constraints. Stat Med 39(8):1183\u20131198","journal-title":"Stat Med"},{"issue":"4","key":"455_CR19","doi-asserted-by":"publisher","first-page":"287","DOI":"10.1016\/s0149-7189(96)00029-8","volume":"19","author":"SL Braver","year":"1996","unstructured":"Braver SL, Smith MC (1996) Maximizing both external and internal validity in longitudinal true experiments with voluntary treatments: The \u201ccombined modified\u201d design. Eval Prog Planning 19(4):287\u2013300. https:\/\/doi.org\/10.1016\/s0149-7189(96)00029-8","journal-title":"Eval Prog Planning"},{"issue":"2","key":"455_CR20","doi-asserted-by":"publisher","first-page":"123","DOI":"10.1007\/bf00058655","volume":"24","author":"L Breiman","year":"1996","unstructured":"Breiman L (1996) Bagging predictors. Mach Learn 24(2):123\u2013140. https:\/\/doi.org\/10.1007\/bf00058655","journal-title":"Mach Learn"},{"issue":"1","key":"455_CR21","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1023\/A:1010933404324","volume":"45","author":"L Breiman","year":"2001","unstructured":"Breiman L (2001) Random forests. Mach Learn 45(1):5\u201332","journal-title":"Mach Learn"},{"key":"455_CR22","doi-asserted-by":"publisher","first-page":"37","DOI":"10.1016\/j.spl.2018.02.016","volume":"136","author":"P B\u00fchlmann","year":"2018","unstructured":"B\u00fchlmann P, van de Geer S (2018) Statistics for big data: A perspective. Stat Prob Lett 136:37\u201341","journal-title":"Stat Prob Lett"},{"key":"455_CR23","unstructured":"Bundespolizeipr\u00e4sidium Potsdam (2018) Abschlussbericht Teilprojekt 1 \u201cBiometrische Gesichtserkennung\u201d. https:\/\/www.bundespolizei.de\/Web\/DE\/04Aktuelles\/01Meldungen\/2018\/10\/181011_abschlussbericht_gesichtserkennung_down.pdf?__blob=publicationFile=1, accessed 07.05.2020"},{"key":"455_CR24","unstructured":"Bundesregierung (2018) Artificial intelligence strategy. https:\/\/www.ki-strategie-deutschland.de\/home.html?file=files\/downloads\/Nationale_KI-Strategie_engl.pdf, accessed 07.05.2020"},{"issue":"1089","key":"455_CR25","doi-asserted-by":"publisher","first-page":"20170545","DOI":"10.1259\/bjr.20170545","volume":"91","author":"JR Burt","year":"2018","unstructured":"Burt JR, Torosdagli N, Khosravan N, RaviPrakash H, Mortazi A, Tissavirasingham F, Hussein S, Bagci U (2018) Deep learning beyond cats and dogs: recent advances in diagnosing breast cancer with deep neural networks. British J Radiol 91(1089):20170545","journal-title":"British J Radiol"},{"issue":"24","key":"455_CR26","doi-asserted-by":"publisher","first-page":"4279","DOI":"10.1002\/sim.2673","volume":"25","author":"A Burton","year":"2006","unstructured":"Burton A, Altman DG, Royston P, Holder RL (2006) The design of simulation studies in medical statistics. Stat Med 25(24):4279\u20134292","journal-title":"Stat Med"},{"key":"455_CR27","unstructured":"Catalogue of bias collaboration, Lee H, Aronson JK, Nunan D (2019) Catalogue of bias: Collider bias. https:\/\/catalogofbias.org\/biases\/collider-bias, accessed 12.02.2020"},{"issue":"1","key":"455_CR28","doi-asserted-by":"publisher","first-page":"10","DOI":"10.1109\/tnnls.2017.2716952","volume":"29","author":"CLP Chen","year":"2018","unstructured":"Chen CLP, Liu Z (2018) Broad learning system: an effective and efficient incremental learning system without the need for deep architecture. IEEE Trans Neural Netw Learn Syst 29(1):10\u201324. https:\/\/doi.org\/10.1109\/tnnls.2017.2716952","journal-title":"IEEE Trans Neural Netw Learn Syst"},{"issue":"6","key":"455_CR29","doi-asserted-by":"publisher","first-page":"1241","DOI":"10.1016\/j.drudis.2018.01.039","volume":"23","author":"H Chen","year":"2018","unstructured":"Chen H, Engkvist O, Wang Y, Olivecrona M, Blaschke T (2018) The rise of deep learning in drug discovery. Drug Discovery Today 23(6):1241\u20131250","journal-title":"Drug Discovery Today"},{"issue":"2","key":"455_CR30","doi-asserted-by":"publisher","first-page":"302","DOI":"10.1109\/72.80341","volume":"2","author":"S Chen","year":"1991","unstructured":"Chen S, Cowan CFN, Grant PM (1991) Orthogonal least squares learning algorithm for radial basis function networks. IEEE Trans Neural Netw 2(2):302\u2013309. https:\/\/doi.org\/10.1109\/72.80341","journal-title":"IEEE Trans Neural Netw"},{"issue":"4","key":"455_CR31","first-page":"417","volume":"35","author":"WG Cochran","year":"1973","unstructured":"Cochran WG, Rubin DB (1973) Controlling bias in observational studies: A review. Sankhy\u0101: The Ind J Stat Ser A 35(4):417\u2013446","journal-title":"Sankhy\u0101: The Ind J Stat Ser A"},{"issue":"10181","key":"455_CR32","doi-asserted-by":"publisher","first-page":"1577","DOI":"10.1016\/S0140-6736(19)30037-6","volume":"393","author":"GS Collins","year":"2019","unstructured":"Collins GS, Moons KG (2019) Reporting of artificial intelligence prediction models. The Lancet 393(10181):1577\u20131579","journal-title":"The Lancet"},{"issue":"2","key":"455_CR33","doi-asserted-by":"publisher","first-page":"211","DOI":"10.1161\/CIRCULATIONAHA.114.014508","volume":"131","author":"GS Collins","year":"2015","unstructured":"Collins GS, Reitsma JB, Altman DG, Moons KG (2015) Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD) the TRIPOD statement. Circulation 131(2):211\u2013219","journal-title":"Circulation"},{"issue":"3","key":"455_CR34","doi-asserted-by":"publisher","first-page":"273","DOI":"10.1007\/bf00994018","volume":"20","author":"C Cortes","year":"1995","unstructured":"Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20(3):273\u2013297. https:\/\/doi.org\/10.1007\/bf00994018","journal-title":"Mach Learn"},{"key":"455_CR35","unstructured":"Dastin J (2018) Amazon scraps secret AI recruiting tool that showed bias against women. Reuters (2018). https:\/\/www.reuters.com\/article\/us-amazon-com-jobs-automation-insight\/amazon-scraps-secret-ai-recruiting-tool-that-showed-bias-against-women-idUSKCN1MK08G, accessed 27.11.2019"},{"key":"455_CR36","unstructured":"Data Ethics Commission of the Federal Government, Federal Ministry of the Interior, Building and Community (2019) Opinion of the data ethics commission. https:\/\/www.bmi.bund.de\/SharedDocs\/downloads\/EN\/themen\/it-digital-policy\/datenethikkommission-abschlussgutachten-lang.pdf?__blob=publicationFile&v=4, accessed 07.05.2020"},{"key":"455_CR37","unstructured":"DataSHIELD (2018) https:\/\/www.datashield.ac.uk"},{"issue":"4","key":"455_CR38","doi-asserted-by":"publisher","first-page":"20","DOI":"10.1145\/3008665.3008674","volume":"2","author":"E Davis","year":"2016","unstructured":"Davis E (2016) AI amusements: the tragic tale of Tay the chatbot. AI Matters 2(4):20\u201324","journal-title":"AI Matters"},{"key":"455_CR39","volume-title":"A probabilistic theory of pattern recognition","author":"L Devroye","year":"2013","unstructured":"Devroye L, Gy\u00f6rfi L, Lugosi G (2013) A probabilistic theory of pattern recognition, vol 31. Springer, New York"},{"issue":"1","key":"455_CR40","doi-asserted-by":"publisher","first-page":"169","DOI":"10.1111\/j.1467-9469.2006.00528.x","volume":"34","author":"V Didelez","year":"2007","unstructured":"Didelez V (2007) Graphical models for composable finite Markov processes. Scand J Stat 34(1):169\u2013185","journal-title":"Scand J Stat"},{"issue":"3","key":"455_CR41","doi-asserted-by":"publisher","first-page":"699","DOI":"10.1093\/biomet\/asx026","volume":"104","author":"D Dobler","year":"2017","unstructured":"Dobler D, Beyersmann J, Pauly M (2017) Non-strange weird resampling for complex survival data. Biometrika 104(3):699\u2013711","journal-title":"Biometrika"},{"key":"455_CR42","doi-asserted-by":"publisher","first-page":"3895","DOI":"10.1016\/S1573-4471(07)04061-2","volume":"4","author":"E Duflo","year":"2007","unstructured":"Duflo E, Glennerster R, Kremer M (2007) Using randomization in development economics research: A toolkit. Handbook of development economics 4:3895\u20133962","journal-title":"Handbook of development economics"},{"key":"455_CR43","unstructured":"Duke-Margolis (2018) https:\/\/healthpolicy.duke.edu\/sites\/default\/files\/2020-03\/characterizing_rwd.pdf. Accessed 13 May 2020"},{"key":"455_CR44","unstructured":"Duke-Margolis (2019) https:\/\/healthpolicy.duke.edu\/sites\/default\/files\/2019-11\/rwd_reliability.pdf. Accessed 13 May 2020"},{"key":"455_CR45","doi-asserted-by":"publisher","first-page":"4","DOI":"10.1016\/j.spl.2018.02.028","volume":"136","author":"DB Dunson","year":"2018","unstructured":"Dunson DB (2018) Statistics in the big data era: Failures of the machine. Stat Prob Lett 136:4\u20139","journal-title":"Stat Prob Lett"},{"key":"455_CR46","unstructured":"European Commission (2020a) https:\/\/ec.europa.eu\/info\/resources-partners\/machine-translation-public-administrations-etranslation_en#translateonline, accessed 13.05.2020"},{"key":"455_CR47","unstructured":"European Commission (2020b) On Artificial Intelligence - A European approach to excellence and trust. https:\/\/ec.europa.eu\/info\/sites\/info\/files\/commission-white-paper-artificial-intelligence-feb2020_en.pdf, accessed 29.07.2020"},{"key":"455_CR48","unstructured":"European Statistical System (2019) Quality assurance framework of the european statistical system. https:\/\/ec.europa.eu\/eurostat\/documents\/64157\/4392716\/ESS-QAF-V1-2final.pdf\/bbf5970c-1adf-46c8-afc3-58ce177a0646, accessed 07.05.2020"},{"issue":"3","key":"455_CR49","first-page":"37","volume":"17","author":"U Fayyad","year":"1996","unstructured":"Fayyad U, Piatetsky-Shapiro G, Smyth P (1996) From data mining to knowledge discovery in databases. AI Magazine 17(3):37\u201337","journal-title":"AI Magazine"},{"key":"455_CR50","unstructured":"FDA (2019) https:\/\/www.fda.gov\/media\/122535\/download, accessed 13.05.2020"},{"issue":"4","key":"455_CR51","doi-asserted-by":"publisher","first-page":"456","DOI":"10.1177\/2515245920952393","volume":"3","author":"JK Flake","year":"2020","unstructured":"Flake JK, Fried EI (2020) Measurement schmeasurement: questionable measurement practices and how to avoid them. Adv Methods Practices Psychol Sci 3(4):456\u2013465. https:\/\/doi.org\/10.1177\/2515245920952393","journal-title":"Adv Methods Practices Psychol Sci"},{"key":"455_CR52","unstructured":"Forbes (2018) https:\/\/www.forbes.com\/sites\/bernardmarr\/2018\/03\/05\/heres-why-data-is-not-the-new-oil\/#45b487143aa9, accessed 27.04.2020"},{"issue":"1","key":"455_CR53","doi-asserted-by":"publisher","first-page":"119","DOI":"10.1006\/jcss.1997.1504","volume":"55","author":"Y Freund","year":"1997","unstructured":"Freund Y, Schapire RE (1997) A decision-theoretic generalization of on-line learning and an application to boosting. J Comput Syst Sci 55(1):119\u2013139. https:\/\/doi.org\/10.1006\/jcss.1997.1504","journal-title":"J Comput Syst Sci"},{"issue":"6","key":"455_CR54","doi-asserted-by":"publisher","first-page":"713","DOI":"10.1177\/009286151004400607","volume":"44","author":"T Friede","year":"2010","unstructured":"Friede T, Nicholas R, Stallard N, Todd S, Parsons N, Vald\u00e9s-M\u00e1rquez E, Chataway J (2010) Refinement of the clinical scenario evaluation framework for assessment of competing development strategies with an application to multiple sclerosis. Drug Inf J: DIJ\/Drug Inf Assoc 44(6):713\u2013718","journal-title":"Drug Inf J: DIJ\/Drug Inf Assoc"},{"key":"455_CR55","doi-asserted-by":"publisher","unstructured":"Friedrich S, Gro\u00df S, K\u00f6nig IR, Engelhardt S, Bahls M, Heinz J, Huber C, Kaderali L, Kelm M, Leha A, R\u00fchl J, Schaller J, Scherer C, Vollmer M, Seidler T, Friede T (2021) Applications of AI\/ML approaches in cardiovascular medicine: A systematic review with recommendations. European Heart Journal - Digital Health. https:\/\/doi.org\/10.1093\/ehjdh\/ztab054","DOI":"10.1093\/ehjdh\/ztab054"},{"key":"455_CR56","doi-asserted-by":"publisher","unstructured":"Gabler S, H\u00e4der S (2018) Repr\u00e4sentativit\u00e4t: Versuch einer Begriffsbestimmung. In: Telefonumfragen in Deutschland, Springer Fachmedien Wiesbaden, pp 81\u2013112, https:\/\/doi.org\/10.1007\/978-3-658-23950-3_5, https:\/\/doi.org\/10.1007%2F978-3-658-23950-3_5","DOI":"10.1007\/978-3-658-23950-3_5"},{"key":"455_CR57","unstructured":"Gal Y, Ghahramani Z (2016) Dropout as a Bayesian approximation: Representing model uncertainty in deep learning. In: Balcan MF, Weinberger KQ (eds) Proceedings of The 33rd international conference on machine learning, PMLR, New York, New York, USA, Proceedings of Machine Learning Research, vol\u00a048, pp 1050\u20131059"},{"key":"455_CR58","unstructured":"Garnelo M, Rosenbaum D, Maddison CJ, Ramalho T, Saxton D, Shanahan M, Teh YW, Rezende DJ, Eslami S (2018) Conditional neural processes. arXiv preprint arXiv:1807.01613"},{"issue":"6","key":"455_CR59","doi-asserted-by":"publisher","first-page":"1929","DOI":"10.1093\/ije\/dyu188","volume":"43","author":"A Gaye","year":"2014","unstructured":"Gaye A, Marcon Y, Isaeva J, LaFlamme P, Turner A, Jones EM, Minion J, Boyd AW, Newby CJ, Nuotio ML (2014) DataSHIELD: taking the analysis to the data, not the data to the analysis. Int J Epidemiol 43(6):1929\u20131944","journal-title":"Int J Epidemiol"},{"issue":"3","key":"455_CR60","doi-asserted-by":"publisher","first-page":"413","DOI":"10.1093\/ije\/15.3.413","volume":"15","author":"S Greenland","year":"1986","unstructured":"Greenland S, Robins JM (1986) Identifiability, exchangeability, and epidemiological confounding. Int J Epidemiol 15(3):413\u2013419","journal-title":"Int J Epidemiol"},{"issue":"1","key":"455_CR61","doi-asserted-by":"publisher","first-page":"29","DOI":"10.1214\/ss\/1009211805","volume":"14","author":"S Greenland","year":"1999","unstructured":"Greenland S, Robins JM, Pearl J (1999) Confounding and collapsibility in causal inference. Stat Sci 14(1):29\u201346","journal-title":"Stat Sci"},{"issue":"3","key":"455_CR62","doi-asserted-by":"publisher","first-page":"20","DOI":"10.1109\/MC.2015.62","volume":"48","author":"VN Gudivada","year":"2015","unstructured":"Gudivada VN, Baeza-Yates R, Raghavan VV (2015) Big data: Promises and problems. Computer 48(3):20\u201323. https:\/\/doi.org\/10.1109\/MC.2015.62","journal-title":"Computer"},{"key":"455_CR63","doi-asserted-by":"publisher","DOI":"10.1007\/b97848","volume-title":"A distribution-free theory of nonparametric regression","author":"L Gy\u00f6rfi","year":"2002","unstructured":"Gy\u00f6rfi L, Kohler M, Krzyzak A, Walk H (2002) A distribution-free theory of nonparametric regression. Springer, New York. https:\/\/doi.org\/10.1007\/b97848"},{"issue":"7829","key":"455_CR64","doi-asserted-by":"publisher","first-page":"E14","DOI":"10.1038\/s41586-020-2766-y","volume":"586","author":"B Haibe-Kains","year":"2020","unstructured":"Haibe-Kains B, Adam GA, Hosny A, Khodakarami F, Waldron L, Wang B, McIntosh C, Goldenberg A, Kundaje A, Greene CS et al (2020) Transparency and reproducibility in artificial intelligence. Nature 586(7829):E14\u2013E16","journal-title":"Nature"},{"issue":"4","key":"455_CR65","doi-asserted-by":"publisher","first-page":"301","DOI":"10.1056\/NEJMp1006304","volume":"363","author":"MA Hamburg","year":"2010","unstructured":"Hamburg MA, Collins FS (2010) The path to personalized medicine. N Engl J Med 363(4):301\u2013304","journal-title":"N Engl J Med"},{"issue":"4","key":"455_CR66","doi-asserted-by":"publisher","first-page":"673","DOI":"10.1086\/322086","volume":"109","author":"JJ Heckman","year":"2001","unstructured":"Heckman JJ (2001) Micro data, heterogeneity, and the evaluation of public policy: nobel lecture. J Political Econ 109(4):673\u2013748. https:\/\/doi.org\/10.1086\/322086","journal-title":"J Political Econ"},{"issue":"3","key":"455_CR67","doi-asserted-by":"publisher","first-page":"431","DOI":"10.1002\/bimj.201700067","volume":"60","author":"G Heinze","year":"2018","unstructured":"Heinze G, Wallisch C, Dunkler D (2018) Variable selection-a review and recommendations for the practicing statistician. Biomet J 60(3):431\u2013449","journal-title":"Biomet J"},{"key":"455_CR68","doi-asserted-by":"crossref","unstructured":"Higgins JP, Altman DG, G\u00f8tzsche PC, J\u00fcni P, Moher D, Oxman AD, Savovi\u0107 J, Schulz KF, Weeks L, Sterne JA (2011) The Cochrane Collaboration\u2019s tool for assessing risk of bias in randomised trials. Bmj 343:d5928","DOI":"10.1136\/bmj.d5928"},{"issue":"1\u20132","key":"455_CR69","first-page":"28","volume":"49","author":"W Hilberg","year":"1995","unstructured":"Hilberg W (1995) Karl Steinbuch, ein zu Unrecht vergessener Pionier der k\u00fcnstlichen neuronalen Systeme. Frequenz 49(1\u20132):28\u201336","journal-title":"Frequenz"},{"issue":"5","key":"455_CR70","doi-asserted-by":"publisher","first-page":"295","DOI":"10.1177\/003591576505800503","volume":"58","author":"AB Hill","year":"1965","unstructured":"Hill AB (1965) The environment and disease: association or causation? Proc Royal Soc Med 58(5):295\u2013300","journal-title":"Proc Royal Soc Med"},{"issue":"1\u20133","key":"455_CR71","doi-asserted-by":"publisher","first-page":"489","DOI":"10.1016\/j.neucom.2005.12.126","volume":"70","author":"GB Huang","year":"2006","unstructured":"Huang GB, Zhu QY, Siew CK (2006) Extreme learning machine: Theory and applications. Neurocomputing 70(1\u20133):489\u2013501. https:\/\/doi.org\/10.1016\/j.neucom.2005.12.126","journal-title":"Neurocomputing"},{"issue":"3","key":"455_CR72","doi-asserted-by":"publisher","first-page":"251","DOI":"10.1080\/00401706.1991.10484833","volume":"33","author":"BH Juang","year":"1991","unstructured":"Juang BH, Rabiner LR (1991) Hidden markov models for speech recognition. Technometrics 33(3):251\u2013272. https:\/\/doi.org\/10.1080\/00401706.1991.10484833","journal-title":"Technometrics"},{"key":"455_CR73","doi-asserted-by":"publisher","first-page":"70","DOI":"10.1016\/j.compag.2018.02.016","volume":"147","author":"A Kamilaris","year":"2018","unstructured":"Kamilaris A, Prenafeta-Bold\u00fa FX (2018) Deep learning in agriculture: A survey. Comput Electron Agric 147:70\u201390","journal-title":"Comput Electron Agric"},{"issue":"4","key":"455_CR74","doi-asserted-by":"publisher","first-page":"783","DOI":"10.1214\/aoms\/1177699361","volume":"37","author":"S Karlin","year":"1966","unstructured":"Karlin S, Studden WJ (1966) Optimal experimental designs. Ann Math Stat 37(4):783\u2013815","journal-title":"Ann Math Stat"},{"issue":"2","key":"455_CR75","doi-asserted-by":"publisher","first-page":"137","DOI":"10.1016\/j.stamet.2005.08.005","volume":"3","author":"AF Karr","year":"2006","unstructured":"Karr AF, Sanil AP, Banks DL (2006) Data quality: A statistical perspective. Stat Methodol 3(2):137\u2013173","journal-title":"Stat Methodol"},{"key":"455_CR76","doi-asserted-by":"publisher","DOI":"10.7551\/mitpress\/3897.001.0001","volume-title":"An introduction to computational learning theory","author":"MJ Kearns","year":"1994","unstructured":"Kearns MJ, Vazirani U (1994) An introduction to computational learning theory. The MIT Press, Cambridge, MA. https:\/\/doi.org\/10.7551\/mitpress\/3897.001.0001"},{"issue":"10","key":"455_CR77","doi-asserted-by":"publisher","first-page":"947","DOI":"10.2514\/8.5282","volume":"30","author":"HJ Kelley","year":"1960","unstructured":"Kelley HJ (1960) Gradient theory of optimal flight paths. ARS J 30(10):947\u2013954. https:\/\/doi.org\/10.2514\/8.5282","journal-title":"ARS J"},{"issue":"16","key":"455_CR78","doi-asserted-by":"publisher","first-page":"2197","DOI":"10.1002\/sim.8532","volume":"39","author":"RH Keogh","year":"2020","unstructured":"Keogh RH, Shaw PA, Gustafson P, Carroll RJ, Deffner V, Dodd KW, K\u00fcchenhoff H, Tooze JA, Wallace MP, Kipnis V et al (2020) Stratos guidance document on measurement error and misclassification of variables in observational epidemiology: part 1\u2013basic theory and simple methods of adjustment. Stat Med 39(16):2197\u20132231","journal-title":"Stat Med"},{"issue":"4","key":"455_CR79","doi-asserted-by":"publisher","first-page":"20","DOI":"10.1038\/scientificamericanmind0716-20","volume":"27","author":"C Koch","year":"2016","unstructured":"Koch C (2016) How the computer beat the go player. Sci Am Mind 27(4):20\u201323. https:\/\/doi.org\/10.1038\/scientificamericanmind0716-20","journal-title":"Sci Am Mind"},{"key":"455_CR80","doi-asserted-by":"publisher","unstructured":"Kohavi R, Tang D, Xu Y, Hemkens LG, Ioannidis JPA (2020) Online randomized controlled experiments at scale: lessons and extensions to medicine. Trials 21(1), https:\/\/doi.org\/10.1186\/s13063-020-4084-y, https:\/\/doi.org\/10.1186%2Fs13063-020-4084-y","DOI":"10.1186\/s13063-020-4084-y"},{"key":"455_CR81","doi-asserted-by":"publisher","unstructured":"Kozielski M, Doetsch P, Ney H (2013) Improvements in RWTH\u2019s System for Off-Line Handwriting Recognition. In: 2013 12th international conference on document analysis and recognition, IEEE, https:\/\/doi.org\/10.1109\/icdar.2013.190, https:\/\/doi.org\/10.1109%2Ficdar.2013.190","DOI":"10.1109\/icdar.2013.190"},{"key":"455_CR82","doi-asserted-by":"crossref","unstructured":"Kruskal W, Mosteller F (1979a) Representative sampling, I: non-scientific literature. International Statistical Review\/Revue Internationale de Statistique pp 13\u201324","DOI":"10.2307\/1403202"},{"key":"455_CR83","doi-asserted-by":"crossref","unstructured":"Kruskal W, Mosteller F (1979b) Representative sampling. Scientific literature, excluding statistics. International Statistical Review\/Revue Internationale de Statistique, II, pp 111\u2013127","DOI":"10.2307\/1402564"},{"key":"455_CR84","doi-asserted-by":"crossref","unstructured":"Kruskal W, Mosteller F (1979c) Representative sampling. The current statistical literature. International Statistical Review\/Revue Internationale de Statistique, III, pp 245\u2013265","DOI":"10.2307\/1402647"},{"key":"455_CR85","doi-asserted-by":"crossref","unstructured":"Kruskal W, Mosteller F (1980) Representative sampling, IV: The history of the concept in statistics, 1895-1939. International Statistical Review\/Revue Internationale de Statistique pp 169\u2013195","DOI":"10.2307\/1403151"},{"key":"455_CR86","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4419-9782-1","volume-title":"Targeted learning: causal inference for observational and experimental data","author":"MJ Van der Laan","year":"2011","unstructured":"Van der Laan MJ, Rose S (2011) Targeted learning: causal inference for observational and experimental data. Springer, New York"},{"key":"455_CR87","first-page":"273","volume":"6","author":"J Langford","year":"2005","unstructured":"Langford J (2005) Tutorial on practical prediction theory for classification. J Mach Learn Res 6:273\u2013306","journal-title":"J Mach Learn Res"},{"issue":"6176","key":"455_CR88","doi-asserted-by":"publisher","first-page":"1203","DOI":"10.1126\/science.1248506","volume":"343","author":"D Lazer","year":"2014","unstructured":"Lazer D, Kennedy R, King G, Vespignani A (2014) The parable of Google Flu: traps in big data analysis. Science 343(6176):1203\u20131205","journal-title":"Science"},{"issue":"4\u20135","key":"455_CR89","doi-asserted-by":"publisher","first-page":"421","DOI":"10.1177\/0278364917710318","volume":"37","author":"S Levine","year":"2018","unstructured":"Levine S, Pastor P, Krizhevsky A, Ibarz J, Quillen D (2018) Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection. Int J Robot Res 37(4\u20135):421\u2013436","journal-title":"Int J Robot Res"},{"issue":"10","key":"455_CR90","first-page":"e310","volume":"25","author":"EJD Lin","year":"2019","unstructured":"Lin EJD, Hefner JL, Zeng X, Moosavinasab S, Huber T, Klima J, Liu C, Lin SM (2019) A deep learning model for pediatric patient risk stratification. Am J Managed Care 25(10):e310\u2013e315","journal-title":"Am J Managed Care"},{"key":"455_CR91","doi-asserted-by":"publisher","first-page":"m3164","DOI":"10.1136\/bmj.m3164","volume":"370","author":"X Liu","year":"2020","unstructured":"Liu X, Rivera SC, Moher D, Calvert MJ, Denniston AK (2020) Reporting guidelines for clinical trial reports for interventions involving artificial intelligence: the CONSORT-AI extension. British Med J 370:m3164","journal-title":"British Med J"},{"issue":"6","key":"455_CR92","doi-asserted-by":"publisher","first-page":"537","DOI":"10.1177\/1745691612460688","volume":"7","author":"MC Makel","year":"2012","unstructured":"Makel MC, Plucker JA, Hegarty B (2012) Replications in psychology research: How often do they really occur? Perspec Psychol Sci 7(6):537\u2013542","journal-title":"Perspec Psychol Sci"},{"issue":"4","key":"455_CR93","doi-asserted-by":"publisher","first-page":"574","DOI":"10.1080\/07350015.2015.1086655","volume":"34","author":"MW McCracken","year":"2016","unstructured":"McCracken MW, Ng S (2016) FRED-MD: a monthly database for macroeconomic research. J Business Econ Stat 34(4):574\u2013589. https:\/\/doi.org\/10.1080\/07350015.2015.1086655","journal-title":"J Business Econ Stat"},{"key":"455_CR94","unstructured":"MedTechIntelligence (2018) https:\/\/www.medtechintelligence.com\/news_article\/apple-watch-4-gets-fda-clearance\/, accessed 13.05.2020"},{"issue":"4","key":"455_CR95","doi-asserted-by":"publisher","first-page":"417","DOI":"10.1111\/j.1467-9868.2010.00740.x","volume":"72","author":"N Meinshausen","year":"2010","unstructured":"Meinshausen N, B\u00fchlmann P (2010) Stability selection. J Royal Stat Soc: Ser B (Statistical Methodology) 72(4):417\u2013473","journal-title":"J Royal Stat Soc: Ser B (Statistical Methodology)"},{"issue":"2","key":"455_CR96","doi-asserted-by":"publisher","first-page":"685","DOI":"10.1214\/18-AOAS1161SF","volume":"12","author":"XL Meng","year":"2018","unstructured":"Meng XL (2018) Statistical paradises and paradoxes in big data (I): Law of large populations, big data paradox, and the 2016 US presidential election. Ann Appl Stat 12(2):685\u2013726","journal-title":"Ann Appl Stat"},{"issue":"1\u20134","key":"455_CR97","doi-asserted-by":"publisher","first-page":"218","DOI":"10.1080\/07474938.2013.808567","volume":"33","author":"XL Meng","year":"2014","unstructured":"Meng XL, Xie X (2014) I got more data, my model is more refined, but my estimator is getting worse! Am I just dumb? Econom Rev 33(1\u20134):218\u2013250","journal-title":"Econom Rev"},{"key":"455_CR98","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.artint.2018.07.007","volume":"267","author":"T Miller","year":"2019","unstructured":"Miller T (2019) Explanation in artificial intelligence: Insights from the social sciences. Artif Intell 267:1\u201338","journal-title":"Artif Intell"},{"key":"455_CR99","doi-asserted-by":"publisher","unstructured":"(2014) Handbook of missing data methodology. Chapman and Hall\/CRC, Boca Raton, FL,. https:\/\/doi.org\/10.1201\/b17622","DOI":"10.1201\/b17622"},{"key":"455_CR100","unstructured":"Molnar C (2019) Interpretable machine learning. https:\/\/christophm.github.io\/interpretable-ml-book\/. Accessed 29 July 2020"},{"issue":"4","key":"455_CR101","first-page":"87","volume":"27","author":"J Moor","year":"2006","unstructured":"Moor J (2006) The Dartmouth College artificial intelligence conference: The next fifty years. AI Magazine 27(4):87\u201387","journal-title":"AI Magazine"},{"issue":"11","key":"455_CR102","doi-asserted-by":"publisher","first-page":"2074","DOI":"10.1002\/sim.8086","volume":"38","author":"TP Morris","year":"2019","unstructured":"Morris TP, White IR, Crowther MJ (2019) Using simulation studies to evaluate statistical methods. Stat Med 38(11):2074\u20132102","journal-title":"Stat Med"},{"key":"455_CR103","unstructured":"New York A (2018) https:\/\/www.nytimes.com\/2018\/12\/18\/technology\/facebook-privacy.html, accessed 27.04.2020"},{"key":"455_CR104","doi-asserted-by":"publisher","unstructured":"Ng S (2018) Opportunities and challenges: lessons from analyzing terabytes of scanner data. In: Honore B, Pakes A, Piazzesi M, Samuelson L (eds) Advances in economics and econometrics, Cambridge University Press, pp 1\u201334, https:\/\/doi.org\/10.1017\/9781108227223.001, https:\/\/doi.org\/10.1017%2F9781108227223.001","DOI":"10.1017\/9781108227223.001"},{"issue":"3","key":"455_CR105","first-page":"e1356","volume":"10","author":"E Ntoutsi","year":"2020","unstructured":"Ntoutsi E, Fafalios P, Gadiraju U, Iosifidis V, Nejdl W, Vidal ME, Ruggieri S, Turini F, Papadopoulos S, Krasanakis E et al (2020) Bias in data-driven artificial intelligence systems. An introductory survey. Wiley Interdisciplin Rev: Data Mining Knowl Discovery 10(3):e1356","journal-title":"Wiley Interdisciplin Rev: Data Mining Knowl Discovery"},{"key":"455_CR106","unstructured":"Nuffield Foundation (2019) Ethical and societal implications of algorithms, data, and artificial intelligence: a roadmap for research. https:\/\/www.nuffieldfoundation.org\/sites\/default\/files\/files\/Ethical-and-Societal-Implications-of-Data-and-AI-report-Nuffield-Foundat.pdf, accessed 27.04.2021"},{"key":"455_CR107","unstructured":"Osband I, Blundell C, Pritzel A, Van\u00a0Roy B (2016) Deep exploration via bootstrapped DQN. In: Advances in neural information processing systems, pp 4026\u20134034"},{"issue":"6","key":"455_CR108","doi-asserted-by":"publisher","first-page":"528","DOI":"10.1177\/1745691612465253","volume":"7","author":"H Pashler","year":"2012","unstructured":"Pashler H, Wagenmakers EJ (2012) Editors\u2019 introduction to the special section on replicability in psychological science: A crisis of confidence? Perspect Psychol Sci 7(6):528\u2013530","journal-title":"Perspect Psychol Sci"},{"key":"455_CR109","volume-title":"Probabilistic reasoning in intelligent systems: Networks of plausible inference","author":"J Pearl","year":"1988","unstructured":"Pearl J (1988) Probabilistic reasoning in intelligent systems: Networks of plausible inference. Morgan Kaufmann Publisher Inc, San Francisco, CA"},{"key":"455_CR110","unstructured":"Pearl J (1993) Aspects of graphical models connected with causality. In: Proceedings of the 49th session of the international statistical science institute"},{"key":"455_CR111","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9780511803161","volume-title":"Causality","author":"J Pearl","year":"2009","unstructured":"Pearl J (2009) Causality. Cambridge University Press, New York"},{"issue":"1","key":"455_CR112","doi-asserted-by":"publisher","first-page":"75","DOI":"10.1111\/j.1467-9531.2010.01228.x","volume":"40","author":"J Pearl","year":"2010","unstructured":"Pearl J (2010) The foundations of causal inference. Sociol Methodol 40(1):75\u2013149","journal-title":"Sociol Methodol"},{"key":"455_CR113","doi-asserted-by":"crossref","unstructured":"Pearl J (2018) Theoretical impediments to machine learning with seven sparks from the causal revolution. arXiv preprint arXiv:18010.4016v1","DOI":"10.1145\/3159652.3176182"},{"key":"455_CR114","unstructured":"Peltola T (2018) Local interpretable model-agnostic explanations of bayesian predictive models via Kullback\u2013Leibler projections. arXiv preprint arXiv:18100.2678v1"},{"key":"455_CR115","doi-asserted-by":"crossref","DOI":"10.1093\/oso\/9780198509844.001.0001","volume-title":"The statistical evaluation of medical tests for classification and prediction","author":"MS Pepe","year":"2003","unstructured":"Pepe MS (2003) The statistical evaluation of medical tests for classification and prediction. Oxford University Press, New York"},{"issue":"20","key":"455_CR116","doi-asserted-by":"publisher","first-page":"1909","DOI":"10.1056\/NEJMoa1901183","volume":"381","author":"MV Perez","year":"2019","unstructured":"Perez MV, Mahaffey KW, Hedlin H, Rumsfeld JS, Garcia A, Ferris T, Balasubramanian V, Russo AM, Rajmane A, Cheung L et al (2019) Large-scale assessment of a smartwatch to identify atrial fibrillation. N Engl J Med 381(20):1909\u20131917","journal-title":"N Engl J Med"},{"key":"455_CR117","unstructured":"Porta M (ed) (2016) A Dictionary of Epidemiology, 6th edn. Oxford University Press, New York"},{"key":"455_CR118","doi-asserted-by":"publisher","first-page":"49","DOI":"10.1016\/j.spl.2019.03.017","volume":"151","author":"B Ramosaj","year":"2019","unstructured":"Ramosaj B, Pauly M (2019a) Consistent estimation of residual variance with random forest Out-Of-Bag errors. Stat Probab Lett 151:49\u201357","journal-title":"Stat Probab Lett"},{"issue":"4","key":"455_CR119","doi-asserted-by":"publisher","first-page":"1741","DOI":"10.1007\/s00180-019-00900-3","volume":"34","author":"B Ramosaj","year":"2019","unstructured":"Ramosaj B, Pauly M (2019b) Predicting missing values: a comparative study on non-parametric approaches for imputation. Comput Stat 34(4):1741\u20131764","journal-title":"Comput Stat"},{"issue":"10","key":"455_CR120","doi-asserted-by":"publisher","first-page":"3099","DOI":"10.1093\/bioinformatics\/btaa082","volume":"36","author":"B Ramosaj","year":"2020","unstructured":"Ramosaj B, Amro L, Pauly M (2020) A cautionary tale on using imputation methods for inference in matched pairs design. Bioinformatics 36(10):3099\u20133106","journal-title":"Bioinformatics"},{"key":"455_CR121","doi-asserted-by":"publisher","unstructured":"Ribeiro M, Singh S, Guestrin C (2016a) \u201cWhy Should I Trust You?\u201d: Explaining the predictions of any classifier. In: Proceedings of the 2016 conference of the north american chapter of the association for computational linguistics: Demonstrations, Association for Computational Linguistics, https:\/\/doi.org\/10.18653\/v1\/n16-3020, https:\/\/doi.org\/10.18653%2Fv1%2Fn16-3020","DOI":"10.18653\/v1\/n16-3020"},{"key":"455_CR122","unstructured":"Ribeiro MT, Singh S, Guestrin C (2016b) Model-agnostic interpretability of machine learning. arXiv preprint arXiv:16060.5386v1"},{"issue":"14","key":"455_CR123","doi-asserted-by":"publisher","first-page":"484","DOI":"10.1093\/bioinformatics\/btz361","volume":"35","author":"J Richter","year":"2019","unstructured":"Richter J, Madjar K, Rahnenf\u00fchrer J (2019) Model-based optimization of subgroup weights for survival analysis. Bioinformatics 35(14):484\u2013491","journal-title":"Bioinformatics"},{"key":"455_CR124","doi-asserted-by":"publisher","first-page":"m3210","DOI":"10.1136\/bmj.m3210","volume":"370","author":"SC Rivera","year":"2020","unstructured":"Rivera SC, Liu X, Chan AW, Denniston AK, Calvert MJ (2020) Guidelines for clinical trial protocols for interventions involving artificial intelligence: the SPIRIT-AI extension. British Med J 370:m3210","journal-title":"British Med J"},{"key":"455_CR125","doi-asserted-by":"crossref","unstructured":"Robins JM, Hern\u00e1n M\u00c1, Brumback B (2000) Marginal structural models and causal inference in epidemiology. Epidemiology 11(5):550\u2013560. https:\/\/doi.org\/10.1097\/00001648-200009000-00011","DOI":"10.1097\/00001648-200009000-00011"},{"issue":"5","key":"455_CR126","doi-asserted-by":"publisher","first-page":"1266","DOI":"10.1111\/j.1467-8276.2009.01295.x","volume":"91","author":"BE Roe","year":"2009","unstructured":"Roe BE, Just DR (2009) Internal and external validity in economics research: tradeoffs between experiments, field experiments, natural experiments, and field data. Am J Agricult Econom 91(5):1266\u20131271. https:\/\/doi.org\/10.1111\/j.1467-8276.2009.01295.x","journal-title":"Am J Agricult Econom"},{"key":"455_CR127","doi-asserted-by":"publisher","unstructured":"Rosenbaum P (2002) Observational studies. In: Springer Series in Statistics, Springer New York, pp 1\u201317, https:\/\/doi.org\/10.1007\/978-1-4757-3692-2_1, https:\/\/doi.org\/10.1007%2F978-1-4757-3692-2_1","DOI":"10.1007\/978-1-4757-3692-2_1"},{"key":"455_CR128","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4419-1213-8","volume-title":"Design of observational studies","author":"P Rosenbaum","year":"2010","unstructured":"Rosenbaum P (2010) Design of observational studies. Springer, New York. https:\/\/doi.org\/10.1007\/978-1-4419-1213-8"},{"key":"455_CR129","doi-asserted-by":"publisher","DOI":"10.4159\/9780674982697","volume-title":"Observation and experiment","author":"P Rosenbaum","year":"2017","unstructured":"Rosenbaum P (2017) Observation and experiment. Harvard University Press, Cambridge, MA. https:\/\/doi.org\/10.4159\/9780674982697"},{"issue":"6","key":"455_CR130","doi-asserted-by":"publisher","first-page":"386","DOI":"10.1037\/h0042519","volume":"65","author":"F Rosenblatt","year":"1958","unstructured":"Rosenblatt F (1958) The perceptron: A probabilistic model for information storage and organization in the brain. Psychol Rev 65(6):386\u2013408. https:\/\/doi.org\/10.1037\/h0042519","journal-title":"Psychol Rev"},{"key":"455_CR131","unstructured":"Ross A, Lage I, Doshi-Velez F (2017) The neural lasso: Local linear sparsity for interpretable explanations. In: Workshop on transparent and interpretable machine learning in safety critical environments, 31st conference on neural information processing systems, Long Beach, CA"},{"key":"455_CR132","doi-asserted-by":"publisher","first-page":"293","DOI":"10.1177\/0962280219833079","volume":"29","author":"C R\u00f6ver","year":"2020","unstructured":"R\u00f6ver C, Friede T (2020) Dynamically borrowing strength from another study through shrinkage estimation. Stat Methods Med Res 29:293\u2013308","journal-title":"Stat Methods Med Res"},{"issue":"5","key":"455_CR133","doi-asserted-by":"publisher","first-page":"688","DOI":"10.1037\/h0037350","volume":"66","author":"DB Rubin","year":"1974","unstructured":"Rubin DB (1974) Estimating causal effects of treatments in randomized and nonrandomized studies. J Educ Psychol 66(5):688","journal-title":"J Educ Psychol"},{"issue":"3","key":"455_CR134","doi-asserted-by":"publisher","first-page":"581","DOI":"10.1093\/biomet\/63.3.581","volume":"63","author":"DB Rubin","year":"1976","unstructured":"Rubin DB (1976) Inference and missing data. Biometrika 63(3):581\u2013592","journal-title":"Biometrika"},{"key":"455_CR135","doi-asserted-by":"publisher","DOI":"10.1017\/cbo9780511810725","volume-title":"Matched sampling for causal effects","author":"DB Rubin","year":"2006","unstructured":"Rubin DB (2006) Matched sampling for causal effects. Cambridge University Press, Cambridge, MA. https:\/\/doi.org\/10.1017\/cbo9780511810725"},{"issue":"3","key":"455_CR136","doi-asserted-by":"publisher","first-page":"808","DOI":"10.1214\/08-AOAS187","volume":"2","author":"DB Rubin","year":"2008","unstructured":"Rubin DB (2008) For objective causal inference, design trumps analysis. Ann Appl Stat 2(3):808\u2013840","journal-title":"Ann Appl Stat"},{"key":"455_CR137","doi-asserted-by":"crossref","unstructured":"Sauerbrei W, Perperoglou A, Schmid M, Abrahamowicz M, Becher H, Binder H, Dunkler D, Harrell FE, Royston P, Heinze G, others for TG2 of the STRATOS initiative (2020) State of the art in selection of variables and functional forms in multivariable analysis - outstanding issues. Diagnostic Prognostic Res 4:1\u201318","DOI":"10.1186\/s41512-020-00074-3"},{"key":"455_CR138","doi-asserted-by":"publisher","first-page":"85","DOI":"10.1016\/j.neunet.2014.09.003","volume":"61","author":"J Schmidhuber","year":"2015","unstructured":"Schmidhuber J (2015) Deep learning in neural networks: An overview. Neural Netw 61:85\u2013117. https:\/\/doi.org\/10.1016\/j.neunet.2014.09.003","journal-title":"Neural Netw"},{"key":"455_CR139","volume-title":"Precision medicine in cancer therapy, cancer treatment and research","author":"NJ Schork","year":"2019","unstructured":"Schork NJ (2019) Artificial intelligence and personalized medicine. In: Von Hoff D, Han H (eds) Precision medicine in cancer therapy, cancer treatment and research. Springer, Cham"},{"issue":"4","key":"455_CR140","doi-asserted-by":"publisher","first-page":"1716","DOI":"10.1214\/15-AOS1321","volume":"43","author":"E Scornet","year":"2015","unstructured":"Scornet E, Biau G, Vert JP (2015) Consistency of random forests. Ann Stat 43(4):1716\u20131741","journal-title":"Ann Stat"},{"issue":"3","key":"455_CR141","doi-asserted-by":"publisher","first-page":"278","DOI":"10.1177\/0962280210395740","volume":"22","author":"SR Seaman","year":"2013","unstructured":"Seaman SR, White IR (2013) Review of inverse probability weighting for dealing with missing data. Stat Methods Med Res 22(3):278\u2013295","journal-title":"Stat Methods Med Res"},{"issue":"3","key":"455_CR142","doi-asserted-by":"publisher","first-page":"417","DOI":"10.1017\/S0140525X00005756","volume":"3","author":"J Searle","year":"1980","unstructured":"Searle J (1980) Minds, Brains and Programs. Behavioral Brain Sci 3(3):417\u2013457","journal-title":"Behavioral Brain Sci"},{"key":"455_CR143","volume-title":"Experimental and quasi-experimental designs for generalized causal inference","author":"WR Shadish","year":"2002","unstructured":"Shadish WR, Cook TD, Campbell DT (2002) Experimental and quasi-experimental designs for generalized causal inference. Houghton Mifflin, Boston"},{"issue":"16","key":"455_CR144","doi-asserted-by":"publisher","first-page":"2232","DOI":"10.1002\/sim.8531","volume":"39","author":"PA Shaw","year":"2020","unstructured":"Shaw PA, Gustafson P, Carroll RJ, Deffner V, Dodd KW, Keogh RH, Kipnis V, Tooze JA, Wallace MP, K\u00fcchenhoff H et al (2020) Stratos guidance document on measurement error and misclassification of variables in observational epidemiology: Part 2\u2013more complex methods of adjustment and advanced topics. Stat Med 39(16):2232\u20132263","journal-title":"Stat Med"},{"issue":"6419","key":"455_CR145","doi-asserted-by":"publisher","first-page":"1140","DOI":"10.1126\/science.aar6404","volume":"362","author":"D Silver","year":"2018","unstructured":"Silver D, Hubert T, Schrittwieser J, Antonoglou I, Lai M, Guez A, Lanctot M, Sifre L, Kumaran D, Graepel T, Lillicrap T, Simonyan K, Hassabis D (2018) A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play. Science 362(6419):1140\u20131144. https:\/\/doi.org\/10.1126\/science.aar6404","journal-title":"Science"},{"key":"455_CR146","first-page":"25","volume-title":"Machine learning","author":"HA Simon","year":"1983","unstructured":"Simon HA (1983) Why should machines learn? In: Michalski RS, Carbonell JG, Mitchell TM (eds) Machine learning. Morgan Kaufmann, San Francisco, CA, pp 25\u201337"},{"issue":"6","key":"455_CR147","doi-asserted-by":"publisher","first-page":"1123","DOI":"10.1177\/1745691617708630","volume":"12","author":"DJ Simons","year":"2017","unstructured":"Simons DJ, Shoda Y, Lindsay DS (2017) Constraints on generality (COG): A proposed addition to all empirical papers. Perspect Psychol Sci 12(6):1123\u20131128","journal-title":"Perspect Psychol Sci"},{"issue":"2","key":"455_CR148","doi-asserted-by":"crossref","first-page":"238","DOI":"10.1111\/j.2517-6161.1951.tb00088.x","volume":"13","author":"EH Simpson","year":"1951","unstructured":"Simpson EH (1951) The interpretation of interaction in contingency tables. J Roy Stat Soc: Ser B (Methodol) 13(2):238\u2013241","journal-title":"J Roy Stat Soc: Ser B (Methodol)"},{"issue":"2","key":"455_CR149","doi-asserted-by":"publisher","first-page":"149","DOI":"10.3233\/HSM-1985-5207","volume":"5","author":"RJ Solomonoff","year":"1985","unstructured":"Solomonoff RJ (1985) The time scale of artificial intelligence: Reflections on social effects. Human Syst Manag 5(2):149\u2013153","journal-title":"Human Syst Manag"},{"issue":"1","key":"455_CR150","first-page":"1929","volume":"15","author":"N Srivastava","year":"2014","unstructured":"Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R (2014) Dropout: a simple way to prevent neural networks from overfitting. J Mach Learn Res 15(1):1929\u20131958","journal-title":"J Mach Learn Res"},{"key":"455_CR151","volume-title":"Scientific method: how science works, fails to work, and pretends to work","author":"J Staddon","year":"2017","unstructured":"Staddon J (2017) Scientific method: how science works, fails to work, and pretends to work. Taylor & Francis Group, New York"},{"issue":"1","key":"455_CR152","doi-asserted-by":"publisher","first-page":"36","DOI":"10.1007\/BF00293853","volume":"1","author":"K Steinbuch","year":"1961","unstructured":"Steinbuch K (1961) Die Lernmatrix. Kybernetik 1(1):36\u201345","journal-title":"Kybernetik"},{"key":"455_CR153","volume-title":"Reinforcement learning: An introduction","author":"RS Sutton","year":"2018","unstructured":"Sutton RS, Barto AG (2018) Reinforcement learning: An introduction. MIT press, Cambridge, MA"},{"key":"455_CR154","doi-asserted-by":"crossref","unstructured":"Teichmann M, Weber M, Zoellner M, Cipolla R, Urtasun R (2018) Multinet: Real-time joint semantic reasoning for autonomous driving. In: 2018 IEEE intelligent vehicles symposium (IV), IEEE, pp 1013\u20131020","DOI":"10.1109\/IVS.2018.8500504"},{"key":"455_CR155","unstructured":"The Economist (2017) https:\/\/www.economist.com\/leaders\/2017\/05\/06\/the-worlds-most-valuable-resource-is-no-longer-oil-but-data, accessed 27.04.2020"},{"key":"455_CR156","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.datak.2017.08.004","volume":"112","author":"V Theodorou","year":"2017","unstructured":"Theodorou V, Abell\u00f3 A, Thiele M, Lehner W (2017) Frequent patterns in ETL workflows: An empirical approach. Data Knowl Eng 112:1\u201316. https:\/\/doi.org\/10.1016\/j.datak.2017.08.004","journal-title":"Data Knowl Eng"},{"key":"455_CR157","unstructured":"Thurow M, Dumpert F, Ramosaj B, Pauly M (2021) Goodness (of fit) of imputation accuracy: The GoodImpact analysis. arXiv preprint arXiv:2101.07532"},{"issue":"1","key":"455_CR158","doi-asserted-by":"publisher","first-page":"267","DOI":"10.1111\/j.2517-6161.1996.tb02080.x","volume":"58","author":"R Tibshirani","year":"1996","unstructured":"Tibshirani R (1996) Regression shrinkage and selection via the lasso. J Royal Stat Soc Ser B Stat Methodol 58(1):267\u2013288. https:\/\/doi.org\/10.1111\/j.2517-6161.1996.tb02080.x","journal-title":"J Royal Stat Soc Ser B Stat Methodol"},{"issue":"4","key":"455_CR159","doi-asserted-by":"publisher","first-page":"385","DOI":"10.1002\/(sici)1097-0258(19970228)16:4h385::aid-sim380i3.0.co;2-3","volume":"16","author":"R Tibshirani","year":"1997","unstructured":"Tibshirani R (1997) The LASSO method for variable selection in the Cox model. Stat Med 16(4):385\u2013395. https:\/\/doi.org\/10.1002\/(sici)1097-0258(19970228)16:4h385::aid-sim380i3.0.co;2-3","journal-title":"Stat Med"},{"issue":"1","key":"455_CR160","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1214\/aoms\/1177704711","volume":"33","author":"JW Tukey","year":"1962","unstructured":"Tukey JW (1962) The future of data analysis. Ann Math Stat 33(1):1\u201367","journal-title":"Ann Math Stat"},{"key":"455_CR161","unstructured":"UNECE (2020) Machine learning for official statistics \u2013 HLG-MOS machine learning project. https:\/\/statswiki.unece.org\/display\/ML\/HLG-MOS+Machine+Learning+Project"},{"issue":"11","key":"455_CR162","doi-asserted-by":"publisher","first-page":"1134","DOI":"10.1145\/1968.1972","volume":"27","author":"LG Valiant","year":"1984","unstructured":"Valiant LG (1984) A theory of the learnable. Commun ACM 27(11):1134\u20131142. https:\/\/doi.org\/10.1145\/1968.1972","journal-title":"Commun ACM"},{"issue":"05","key":"455_CR163","doi-asserted-by":"publisher","first-page":"51-2716","DOI":"10.5860\/choice.51-2716","volume":"51","author":"LG Valiant","year":"2013","unstructured":"Valiant LG (2013) Probably approximately correct: nature\u2019s algorithms for learning and prospering in a complex world. Choice Rev Online 51(05):51-2716\u201351-2716. https:\/\/doi.org\/10.5860\/choice.51-2716","journal-title":"Choice Rev Online"},{"key":"455_CR164","doi-asserted-by":"publisher","DOI":"10.1201\/9780429492259","volume-title":"Flexible imputation of missing data","author":"S Van Buuren","year":"2018","unstructured":"Van Buuren S (2018) Flexible imputation of missing data. CRC Press, Boca Raton, FL"},{"key":"455_CR165","volume-title":"Statistical learning theory","author":"V Vapnik","year":"1998","unstructured":"Vapnik V (1998) Statistical learning theory. Wiley, New York"},{"issue":"523","key":"455_CR166","doi-asserted-by":"publisher","first-page":"1228","DOI":"10.1080\/01621459.2017.1319839","volume":"113","author":"S Wager","year":"2018","unstructured":"Wager S, Athey S (2018) Estimation and inference of heterogeneous treatment effects using random forests. J Am Stat Assoc 113(523):1228\u20131242","journal-title":"J Am Stat Assoc"},{"key":"455_CR167","unstructured":"Wager S, Wang S, Liang PS (2013) Dropout training as adaptive regularization. In: Burges CJC, Bottou L, Welling M, Ghahramani Z, Weinberger KQ (eds) Advances in neural information processing systems, vol 26. Curran Associates, Inc. https:\/\/proceedings.neurips.cc\/paper\/2013\/file\/38db3aed920cf82ab059bfccbd02be6a-Paper.pdf"},{"issue":"1","key":"455_CR168","first-page":"1625","volume":"15","author":"S Wager","year":"2014","unstructured":"Wager S, Hastie T, Efron B (2014) Confidence intervals for random forests: The jackknife and the infinitesimal jackknife. J Mach Learn Res 15(1):1625\u20131651","journal-title":"J Mach Learn Res"},{"issue":"4","key":"455_CR169","doi-asserted-by":"crossref","first-page":"284","DOI":"10.1080\/00031305.1996.10473554","volume":"50","author":"B Warner","year":"1996","unstructured":"Warner B, Misra M (1996) Understanding neural networks as statistical tools. Am Stat 50(4):284\u2013293","journal-title":"Am Stat"},{"issue":"3","key":"455_CR170","doi-asserted-by":"publisher","first-page":"189","DOI":"10.1007\/s41060-018-0102-5","volume":"6","author":"C Weihs","year":"2018","unstructured":"Weihs C, Ickstadt K (2018) Data science: the impact of statistics. Int J Data Sci Anal 6(3):189\u2013194","journal-title":"Int J Data Sci Anal"},{"issue":"526","key":"455_CR171","doi-asserted-by":"publisher","first-page":"804","DOI":"10.1080\/01621459.2018.1448825","volume":"114","author":"SL Wickramasuriya","year":"2019","unstructured":"Wickramasuriya SL, Athanasopoulos G, Hyndman RJ (2019) Optimal forecast reconciliation for hierarchical and grouped time series through trace minimization. J Am Stat Assoc 114(526):804\u2013819","journal-title":"J Am Stat Assoc"},{"key":"455_CR172","unstructured":"Wikipedia (2020) https:\/\/en.wikipedia.org\/wiki\/Simpson%27s_paradox#\/media\/File:Simpson\u2019s_paradox_continuous.svg. Accessed 28 July 2020"},{"key":"455_CR173","unstructured":"Wiredcom (2019) https:\/\/www.wired.com\/story\/ubers-self-driving-car-didnt-know-pedestrians-could-jaywalk\/. 13 May 2020"},{"issue":"3","key":"455_CR174","doi-asserted-by":"publisher","first-page":"54","DOI":"10.1145\/3144592.3144598","volume":"47","author":"MJ Wolf","year":"2017","unstructured":"Wolf MJ, Miller K, Grodzinsky FS (2017) Why we should have seen that coming: comments on Microsoft\u2019s tay experiment, and wider implications. ACM SIGCAS Comput Soc 47(3):54\u201364","journal-title":"ACM SIGCAS Comput Soc"},{"key":"455_CR175","unstructured":"Zaremba W, Sutskever I, Vinyals O (2014) Recurrent Neural Network Regularization. arXiv preprint arXiv:1409.2329v5"},{"issue":"4","key":"455_CR176","doi-asserted-by":"publisher","first-page":"627","DOI":"10.1093\/nsr\/nwx044","volume":"4","author":"J Zhu","year":"2017","unstructured":"Zhu J, Chen J, Hu W, Zhang B (2017) Big Learning with Bayesian methods. National Sci Rev 4(4):627\u2013651. https:\/\/doi.org\/10.1093\/nsr\/nwx044","journal-title":"National Sci Rev"}],"container-title":["Advances in Data Analysis and Classification"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11634-021-00455-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11634-021-00455-6\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11634-021-00455-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,9,5]],"date-time":"2024-09-05T23:39:55Z","timestamp":1725579595000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11634-021-00455-6"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,8,6]]},"references-count":176,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2022,12]]}},"alternative-id":["455"],"URL":"https:\/\/doi.org\/10.1007\/s11634-021-00455-6","relation":{},"ISSN":["1862-5347","1862-5355"],"issn-type":[{"value":"1862-5347","type":"print"},{"value":"1862-5355","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,8,6]]},"assertion":[{"value":"13 September 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"6 July 2021","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"7 July 2021","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"6 August 2021","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}