{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,1]],"date-time":"2026-04-01T00:19:01Z","timestamp":1775002741536,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":69,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,7,26]],"date-time":"2022-07-26T00:00:00Z","timestamp":1658793600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["IIS-2008461, IIS-2040989, 1922658"],"award-info":[{"award-number":["IIS-2008461, IIS-2040989, 1922658"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100006785","name":"Google","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100006785","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Center for Research on Computation and Society, Harvard SEAS"},{"name":"Amazon"},{"name":"Harvard Data Science Institute"},{"DOI":"10.13039\/100004326","name":"Bayer","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100004326","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,7,26]]},"DOI":"10.1145\/3514094.3534198","type":"proceedings-article","created":{"date-parts":[[2022,7,27]],"date-time":"2022-07-27T22:25:13Z","timestamp":1658960713000},"page":"686-699","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Towards Robust Off-Policy Evaluation via Human Inputs"],"prefix":"10.1145","author":[{"given":"Harvineet","family":"Singh","sequence":"first","affiliation":[{"name":"New York University, New York City, NY, 
USA"}]},{"given":"Shalmali","family":"Joshi","sequence":"additional","affiliation":[{"name":"Harvard University, Cambridge, MA, USA"}]},{"given":"Finale","family":"Doshi-Velez","sequence":"additional","affiliation":[{"name":"Harvard University, Cambridge, MA, USA"}]},{"given":"Himabindu","family":"Lakkaraju","sequence":"additional","affiliation":[{"name":"Harvard University, Cambridge, MA, USA"}]}],"member":"320","published-online":{"date-parts":[[2022,7,27]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Online decision-making with high-dimensional covariates. Forthcoming in Operations Research","author":"Bastani Hamsa","year":"2015","unstructured":"Hamsa Bastani and Mohsen Bayati . 2015. Online decision-making with high-dimensional covariates. Forthcoming in Operations Research ( 2015 ). Hamsa Bastani and Mohsen Bayati. 2015. Online decision-making with high-dimensional covariates. Forthcoming in Operations Research (2015)."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1287\/mnsc.1120.1641"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10107-017-1125-8"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1287\/moor.2018.0936"},{"key":"#cr-split#-e_1_3_2_1_5_1.1","unstructured":"Cl\u00e9ment L. Canonne. 2022. A short note on an inequality between KL and TV. https:\/\/doi.org\/10.48550\/ARXIV.2202.07198 10.48550\/ARXIV.2202.07198"},{"key":"#cr-split#-e_1_3_2_1_5_1.2","unstructured":"Cl\u00e9ment L. Canonne. 2022. A short note on an inequality between KL and TV. https:\/\/doi.org\/10.48550\/ARXIV.2202.07198"},{"key":"e_1_3_2_1_6_1","volume-title":"Nicola Gnecco, and Jonas Peters.","author":"Christiansen Rune","year":"2020","unstructured":"Rune Christiansen , Niklas Pfister , Martin Emil Jakobsen , Nicola Gnecco, and Jonas Peters. 2020 . A causal framework for distribution generalization. arXiv e-prints (2020), arXiv--2006. 
Rune Christiansen, Niklas Pfister, Martin Emil Jakobsen, Nicola Gnecco, and Jonas Peters. 2020. A causal framework for distribution generalization. arXiv e-prints (2020), arXiv--2006."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1056\/NEJMoa0809329"},{"key":"e_1_3_2_1_8_1","volume-title":"Retiring Adult: New Datasets for Fair Machine Learning. In Advances in Neural Information Processing Systems","author":"Ding Frances","year":"2021","unstructured":"Frances Ding , Moritz Hardt , John Miller , and Ludwig Schmidt . 2021 . Retiring Adult: New Datasets for Fair Machine Learning. In Advances in Neural Information Processing Systems , A. Beygelzimer, Y. Dauphin, P. Liang, and J. Wortman Vaughan (Eds.). https:\/\/openreview.net\/forum?id=bYi_2708mKK Frances Ding, Moritz Hardt, John Miller, and Ludwig Schmidt. 2021. Retiring Adult: New Datasets for Fair Machine Learning. In Advances in Neural Information Processing Systems, A. Beygelzimer, Y. Dauphin, P. Liang, and J. Wortman Vaughan (Eds.). https:\/\/openreview.net\/forum?id=bYi_2708mKK"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.5555\/1953048.2021068"},{"key":"e_1_3_2_1_10_1","volume-title":"Learning models with uniform performance via distributionally robust optimization. arXiv preprint arXiv:1810.08750","author":"Duchi John","year":"2018","unstructured":"John Duchi and Hongseok Namkoong . 2018. Learning models with uniform performance via distributionally robust optimization. arXiv preprint arXiv:1810.08750 ( 2018 ). John Duchi and Hongseok Namkoong. 2018. Learning models with uniform performance via distributionally robust optimization. arXiv preprint arXiv:1810.08750 (2018)."},{"key":"e_1_3_2_1_11_1","volume-title":"Distributionally robust losses against mixture covariate shifts. Under review","author":"Duchi John C","year":"2019","unstructured":"John C Duchi , Tatsunori Hashimoto , and Hongseok Namkoong . 2019. Distributionally robust losses against mixture covariate shifts. 
Under review ( 2019 ). John C Duchi, Tatsunori Hashimoto, and Hongseok Namkoong. 2019. Distributionally robust losses against mixture covariate shifts. Under review (2019)."},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1214\/14-STS500"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i04.5797"},{"key":"e_1_3_2_1_14_1","volume-title":"Popcorn: Partially observed prediction constrained reinforcement learning. arXiv preprint arXiv:2001.04032","author":"Futoma Joseph","year":"2020","unstructured":"Joseph Futoma , Michael C Hughes , and Finale Doshi-Velez . 2020 . Popcorn: Partially observed prediction constrained reinforcement learning. arXiv preprint arXiv:2001.04032 (2020). Joseph Futoma, Michael C Hughes, and Finale Doshi-Velez. 2020. Popcorn: Partially observed prediction constrained reinforcement learning. arXiv preprint arXiv:2001.04032 (2020)."},{"key":"e_1_3_2_1_15_1","volume-title":"Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572","author":"Goodfellow Ian J","year":"2014","unstructured":"Ian J Goodfellow , Jonathon Shlens , and Christian Szegedy . 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 ( 2014 ). Ian J Goodfellow, Jonathon Shlens, and Christian Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 (2014)."},{"key":"e_1_3_2_1_16_1","volume-title":"David Sontag, and Finale Doshi-Velez.","author":"Gottesman Omer","year":"2018","unstructured":"Omer Gottesman , Fredrik Johansson , Joshua Meier , Jack Dent , Donghun Lee , Srivatsan Srinivasan , Linying Zhang , Yi Ding , David Wihl , Xuefeng Peng , Jiayu Yao , Isaac Lage , Christopher Mosch , Li-wei H. Lehman , Matthieu Komorowski , Matthieu Komorowski , Aldo Faisal , Leo Anthony Celi , David Sontag, and Finale Doshi-Velez. 2018 . Evaluating Reinforcement Learning Algorithms in Observational Health Settings . 
https:\/\/doi.org\/10.48550\/ARXIV.1805.12298 10.48550\/ARXIV.1805.12298 Omer Gottesman, Fredrik Johansson, Joshua Meier, Jack Dent, Donghun Lee, Srivatsan Srinivasan, Linying Zhang, Yi Ding, David Wihl, Xuefeng Peng, Jiayu Yao, Isaac Lage, Christopher Mosch, Li-wei H. Lehman, Matthieu Komorowski, Matthieu Komorowski, Aldo Faisal, Leo Anthony Celi, David Sontag, and Finale Doshi-Velez. 2018. Evaluating Reinforcement Learning Algorithms in Observational Health Settings. https:\/\/doi.org\/10.48550\/ARXIV.1805.12298"},{"key":"e_1_3_2_1_17_1","unstructured":"Tobias Hatt Daniel Tschernutter and Stefan Feuerriegel. 2021. Generalizing Off-Policy Learning under Sample Selection Bias. arXiv:2112.01387 [stat.ML]  Tobias Hatt Daniel Tschernutter and Stefan Feuerriegel. 2021. Generalizing Off-Policy Learning under Sample Selection Bias. arXiv:2112.01387 [stat.ML]"},{"key":"e_1_3_2_1_18_1","volume-title":"Optimal loading dose for the initiation of warfarin: a systematic review. BMC cardiovascular disorders 10, 1","author":"Heneghan Carl","year":"2010","unstructured":"Carl Heneghan , Sally Tyndel , Clare Bankhead , Yi Wan , David Keeling , Rafael Perera , and Alison Ward . 2010. Optimal loading dose for the initiation of warfarin: a systematic review. BMC cardiovascular disorders 10, 1 ( 2010 ), 1--12. Carl Heneghan, Sally Tyndel, Clare Bankhead, Yi Wan, David Keeling, Rafael Perera, and Alison Ward. 2010. Optimal loading dose for the initiation of warfarin: a systematic review. BMC cardiovascular disorders 10, 1 (2010), 1--12."},{"key":"e_1_3_2_1_19_1","volume-title":"The Collected Works of Wassily Hoeffding","author":"Hoeffding Wassily","unstructured":"Wassily Hoeffding . 1994. Probability inequalities for sums of bounded random variables . In The Collected Works of Wassily Hoeffding . Springer , 409--426. Wassily Hoeffding. 1994. Probability inequalities for sums of bounded random variables. In The Collected Works of Wassily Hoeffding. 
Springer, 409--426."},{"key":"e_1_3_2_1_20_1","volume-title":"International Conference on Machine Learning. 2029--2037","author":"Hu Weihua","year":"2018","unstructured":"Weihua Hu, Gang Niu, Issei Sato, and Masashi Sugiyama. 2018. Does distributionally robust supervised learning give robust classifiers? In International Conference on Machine Learning. 2029--2037."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1287\/moor.1040.0129"},{"key":"e_1_3_2_1_22_1","volume-title":"Conference on Learning Theory. PMLR","author":"Jeong Sookyo","year":"2020","unstructured":"Sookyo Jeong and Hongseok Namkoong. 2020. Robust causal inference under covariate shift via worst-case subpopulation treatment effects. In Conference on Learning Theory. PMLR, 2079--2084."},{"key":"e_1_3_2_1_23_1","unstructured":"Nan Jiang. 2020. Notes on Tabular Methods. https:\/\/nanjiang.cs.illinois.edu\/files\/cs598\/note3.pdf. Accessed: 2021-06-04."},{"key":"e_1_3_2_1_24_1","unstructured":"Alistair E. W. Johnson, Tom J. Pollard, and Tristan Naumann. 2018. Generalizability of predictive models for intensive care unit patients. arXiv:1812.02275 [cs.LG]"},{"key":"e_1_3_2_1_25_1","volume-title":"Near-optimal reinforcement learning in polynomial time. 
Machine learning 49, 2","author":"Kearns Michael","year":"2002","unstructured":"Michael Kearns and Satinder Singh . 2002. Near-optimal reinforcement learning in polynomial time. Machine learning 49, 2 ( 2002 ), 209--232. Michael Kearns and Satinder Singh. 2002. Near-optimal reinforcement learning in polynomial time. Machine learning 49, 2 (2002), 209--232."},{"key":"e_1_3_2_1_26_1","volume-title":"Counterfactually Guided Policy Transfer in Clinical Settings. arXiv preprint arXiv:2006.11654","author":"Killian Taylor W","year":"2020","unstructured":"Taylor W Killian , Marzyeh Ghassemi , and Shalmali Joshi . 2020. Counterfactually Guided Policy Transfer in Clinical Settings. arXiv preprint arXiv:2006.11654 ( 2020 ). Taylor W Killian, Marzyeh Ghassemi, and Shalmali Joshi. 2020. Counterfactually Guided Policy Transfer in Clinical Settings. arXiv preprint arXiv:2006.11654 (2020)."},{"key":"e_1_3_2_1_27_1","volume-title":"Inouye","author":"Kulinski Sean","year":"2020","unstructured":"Sean Kulinski , Saurabh Bagchi , and David I . Inouye . 2020 . Feature Shift Detection: Localizing Which Features Have Shifted via Conditional Distribution Tests. In Neural Information Processing Systems (NeurIPS) . Sean Kulinski, Saurabh Bagchi, and David I. Inouye. 2020. Feature Shift Detection: Localizing Which Features Have Shifted via Conditional Distribution Tests. In Neural Information Processing Systems (NeurIPS)."},{"key":"e_1_3_2_1_28_1","unstructured":"Cassidy Laidlaw Sahil Singla and Soheil Feizi. 2021. Perceptual Adversarial Robustness: Defense Against Unseen Threat Models. In ICLR.  Cassidy Laidlaw Sahil Singla and Soheil Feizi. 2021. Perceptual Adversarial Robustness: Defense Against Unseen Threat Models. In ICLR."},{"key":"e_1_3_2_1_29_1","volume-title":"Markov chains and mixing times","author":"Levin David A","unstructured":"David A Levin and Yuval Peres . 2017. Markov chains and mixing times . Vol. 107 . American Mathematical Soc . David A Levin and Yuval Peres. 2017. 
Markov chains and mixing times. Vol. 107. American Mathematical Soc."},{"key":"e_1_3_2_1_30_1","unstructured":"Sergey Levine Aviral Kumar George Tucker and Justin Fu. 2020. Offline Reinforcement Learning: Tutorial Review and Perspectives on Open Problems. arXiv:2005.01643 [cs.LG]  Sergey Levine Aviral Kumar George Tucker and Justin Fu. 2020. Offline Reinforcement Learning: Tutorial Review and Perspectives on Open Problems. arXiv:2005.01643 [cs.LG]"},{"key":"e_1_3_2_1_31_1","volume-title":"Advances in Neural Information Processing Systems","author":"Li Gen","year":"2020","unstructured":"Gen Li , Yuting Wei , Yuejie Chi , Yuantao Gu , and Yuxin Chen . 2020. Breaking the Sample Size Barrier in Model-Based Reinforcement Learning with a Generative Model . In Advances in Neural Information Processing Systems , H. Larochelle, M. Ranzato, R. Hadsell, M. F. Balcan, and H. Lin (Eds.), Vol. 33 . Curran Associates, Inc. , 12861--12872. https:\/\/proceedings.neurips.cc\/paper\/ 2020 \/file\/96ea64f3a1aa2fd00c72faacf0cb8ac9-Paper.pdf Gen Li, Yuting Wei, Yuejie Chi, Yuantao Gu, and Yuxin Chen. 2020. Breaking the Sample Size Barrier in Model-Based Reinforcement Learning with a Generative Model. In Advances in Neural Information Processing Systems, H. Larochelle, M. Ranzato, R. Hadsell, M. F. Balcan, and H. Lin (Eds.), Vol. 33. Curran Associates, Inc., 12861--12872. https:\/\/proceedings.neurips.cc\/paper\/2020\/file\/96ea64f3a1aa2fd00c72faacf0cb8ac9-Paper.pdf"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1080\/10691898.2018.1434342"},{"key":"e_1_3_2_1_33_1","unstructured":"Mike Li Hongseok Namkoong and Shangzhou Xia. 2021. Evaluating model performance under worst-case subpopulations. In Advances in Neural Information Processing Systems A. Beygelzimer Y. Dauphin P. Liang and J. Wortman Vaughan (Eds.). https:\/\/openreview.net\/forum?id=nehzxAdyJxF  Mike Li Hongseok Namkoong and Shangzhou Xia. 2021. Evaluating model performance under worst-case subpopulations. 
In Advances in Neural Information Processing Systems A. Beygelzimer Y. Dauphin P. Liang and J. Wortman Vaughan (Eds.). https:\/\/openreview.net\/forum?id=nehzxAdyJxF"},{"key":"e_1_3_2_1_34_1","volume-title":"International Conference on Learning Representations.","author":"Madry Aleksander","year":"2018","unstructured":"Aleksander Madry , Aleksandar Makelov , Ludwig Schmidt , Dimitris Tsipras , and Adrian Vladu . 2018 . Towards Deep Learning Models Resistant to Adversarial Attacks . In International Conference on Learning Representations. Aleksander Madry, Aleksandar Makelov, Ludwig Schmidt, Dimitris Tsipras, and Adrian Vladu. 2018. Towards Deep Learning Models Resistant to Adversarial Attacks. In International Conference on Learning Representations."},{"key":"e_1_3_2_1_35_1","unstructured":"Sara Magliacane Thijs van Ommen Tom Claassen Stephan Bongers Philip Versteeg and Joris M Mooij. 2018. Domain adaptation by using causal inference to predict invariant conditional distributions. In Advances in Neural Information Processing Systems. 10846--10856.  Sara Magliacane Thijs van Ommen Tom Claassen Stephan Bongers Philip Versteeg and Joris M Mooij. 2018. Domain adaptation by using causal inference to predict invariant conditional distributions. In Advances in Neural Information Processing Systems. 10846--10856."},{"key":"e_1_3_2_1_36_1","volume-title":"Proceedings of the 37th International Conference on Machine Learning (Proceedings of Machine Learning Research","author":"Maini Pratyush","year":"2020","unstructured":"Pratyush Maini , Eric Wong , and Zico Kolter . 2020 . Adversarial Robustness Against the Union of Multiple Perturbation Models . In Proceedings of the 37th International Conference on Machine Learning (Proceedings of Machine Learning Research , Vol. 119), Hal Daum\u00e9 III and Aarti Singh (Eds.). PMLR, 6640--6650. http:\/\/proceedings.mlr.press\/v119\/maini20a.html Pratyush Maini, Eric Wong, and Zico Kolter. 2020. 
Adversarial Robustness Against the Union of Multiple Perturbation Models. In Proceedings of the 37th International Conference on Machine Learning (Proceedings of Machine Learning Research, Vol. 119), Hal Daum\u00e9 III and Aarti Singh (Eds.). PMLR, 6640--6650. http:\/\/proceedings.mlr.press\/v119\/maini20a.html"},{"key":"e_1_3_2_1_37_1","volume-title":"Proceedings of the 37th International Conference on Machine Learning (Proceedings of Machine Learning Research","author":"Miller John","year":"2020","unstructured":"John Miller , Karl Krauth , Benjamin Recht , and Ludwig Schmidt . 2020 . The Effect of Natural Distribution Shift on Question Answering Models . In Proceedings of the 37th International Conference on Machine Learning (Proceedings of Machine Learning Research , Vol. 119), Hal Daum\u00e9 III and Aarti Singh (Eds.). PMLR, 6905--6916. http:\/\/proceedings.mlr.press\/v119\/miller20a.html John Miller, Karl Krauth, Benjamin Recht, and Ludwig Schmidt. 2020. The Effect of Natural Distribution Shift on Question Answering Models. In Proceedings of the 37th International Conference on Machine Learning (Proceedings of Machine Learning Research, Vol. 119), Hal Daum\u00e9 III and Aarti Singh (Eds.). PMLR, 6905--6916. http:\/\/proceedings.mlr.press\/v119\/miller20a.html"},{"key":"e_1_3_2_1_38_1","unstructured":"Weibin Mo Zhengling Qi and Yufeng Liu. 2020. Learning optimal distributionally robust individualized treatment rules. J. Amer. Statist. Assoc. (2020) 1--16.  Weibin Mo Zhengling Qi and Yufeng Liu. 2020. Learning optimal distributionally robust individualized treatment rules. J. Amer. Statist. Assoc. (2020) 1--16."},{"key":"e_1_3_2_1_39_1","unstructured":"Hongseok Namkoong Ramtin Keramati Steve Yadlowsky and Emma Brunskill. 2020. Off-policy Policy Evaluation For Sequential Decisions Under Unobserved Confounding. In Advances in Neural Information Processing Systems.  Hongseok Namkoong Ramtin Keramati Steve Yadlowsky and Emma Brunskill. 2020. 
Off-policy Policy Evaluation For Sequential Decisions Under Unobserved Confounding. In Advances in Neural Information Processing Systems."},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1287\/opre.1050.0216"},{"key":"e_1_3_2_1_41_1","volume-title":"Proceedings of the 36th International Conference on Machine Learning (Proceedings of Machine Learning Research","volume":"4890","author":"Oberst Michael","year":"2019","unstructured":"Michael Oberst and David Sontag . 2019 . Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models . In Proceedings of the 36th International Conference on Machine Learning (Proceedings of Machine Learning Research , Vol. 97), Kamalika Chaudhuri and Ruslan Salakhutdinov (Eds.). PMLR, 4881-- 4890 . http:\/\/proceedings.mlr.press\/v97\/oberst19a.html Michael Oberst and David Sontag. 2019. Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models. In Proceedings of the 36th International Conference on Machine Learning (Proceedings of Machine Learning Research, Vol. 97), Kamalika Chaudhuri and Ruslan Salakhutdinov (Eds.). PMLR, 4881--4890. http:\/\/proceedings.mlr.press\/v97\/oberst19a.html"},{"key":"e_1_3_2_1_42_1","volume-title":"Regularizing towards Causal Invariance: Linear Models with Proxies. arXiv preprint arXiv:2103.02477","author":"Oberst Michael","year":"2021","unstructured":"Michael Oberst , Nikolaj Thams , Jonas Peters , and David Sontag . 2021. Regularizing towards Causal Invariance: Linear Models with Proxies. arXiv preprint arXiv:2103.02477 ( 2021 ). Michael Oberst, Nikolaj Thams, Jonas Peters, and David Sontag. 2021. Regularizing towards Causal Invariance: Linear Models with Proxies. 
arXiv preprint arXiv:2103.02477 (2021)."},{"key":"e_1_3_2_1_43_1","volume-title":"PyTorch: An Imperative Style","author":"Paszke Adam","unstructured":"Adam Paszke , Sam Gross , Francisco Massa , Adam Lerer , James Bradbury , Gregory Chanan , Trevor Killeen , Zeming Lin , Natalia Gimelshein , Luca Antiga , Alban Desmaison , Andreas Kopf , Edward Yang , Zachary DeVito , Martin Raison , Alykhan Tejani , Sasank Chilamkurthy , Benoit Steiner , Lu Fang , Junjie Bai , and Soumith Chintala . 2019. PyTorch: An Imperative Style , High-Performance Deep Learning Library . In Advances in Neural Information Processing Systems 32, H. Wallach, H. Larochelle, A. Beygelzimer, F. d'Alch\u00e9-Buc, E. Fox, and R. Garnett (Eds.). Curran Associates, Inc., 8024--8035. https:\/\/proceedings.neurips.cc\/paper\/2019\/file\/bdbca288fee7f92f2bfa9f7012727740-Paper.pdf Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, Alban Desmaison, Andreas Kopf, Edward Yang, Zachary DeVito, Martin Raison, Alykhan Tejani, Sasank Chilamkurthy, Benoit Steiner, Lu Fang, Junjie Bai, and Soumith Chintala. 2019. PyTorch: An Imperative Style, High-Performance Deep Learning Library. In Advances in Neural Information Processing Systems 32, H. Wallach, H. Larochelle, A. Beygelzimer, F. d'Alch\u00e9-Buc, E. Fox, and R. Garnett (Eds.). Curran Associates, Inc., 8024--8035. https:\/\/proceedings.neurips.cc\/paper\/2019\/file\/bdbca288fee7f92f2bfa9f7012727740-Paper.pdf"},{"key":"e_1_3_2_1_44_1","volume-title":"Anke Schmeink, Gerd Ascheid, Christoph Thiemermann, Andreas Schuppert, Ryan Kindle, et al.","author":"Peine Arne","year":"2021","unstructured":"Arne Peine , Ahmed Hallawa , Johannes Bickenbach , Guido Dartmann , Lejla Begic Fazlic , Anke Schmeink, Gerd Ascheid, Christoph Thiemermann, Andreas Schuppert, Ryan Kindle, et al. 2021 . 
Development and validation of a reinforcement learning algorithm to dynamically optimize mechanical ventilation in critical care. NPJ digital medicine 4, 1 (2021), 1--12. Arne Peine, Ahmed Hallawa, Johannes Bickenbach, Guido Dartmann, Lejla Begic Fazlic, Anke Schmeink, Gerd Ascheid, Christoph Thiemermann, Andreas Schuppert, Ryan Kindle, et al. 2021. Development and validation of a reinforcement learning algorithm to dynamically optimize mechanical ventilation in critical care. NPJ digital medicine 4, 1 (2021), 1--12."},{"key":"e_1_3_2_1_45_1","volume-title":"August 1--3","author":"Peters Jonas","year":"2022","unstructured":"Jonas Peters , Peter B\u00fchlmann , and Nicolai Meinshausen . 2016. Causal inference by using invariant prediction: identification and confidence intervals. Journal AIES'22 , August 1--3 , 2022 , Oxford, United Kingdom Singh et al. of the Royal Statistical Society : Series B (Statistical Methodology) 78, 5 (2016), 947--1012. Jonas Peters, Peter B\u00fchlmann, and Nicolai Meinshausen. 2016. Causal inference by using invariant prediction: identification and confidence intervals. Journal AIES'22, August 1--3, 2022, Oxford, United Kingdom Singh et al. of the Royal Statistical Society: Series B (Statistical Methodology) 78, 5 (2016), 947--1012."},{"key":"e_1_3_2_1_46_1","unstructured":"Marek Petrik and Reazul Hasan Russel. 2019. Beyond confidence regions: Tight bayesian ambiguity sets for robust mdps. In Advances in Neural Information Processing Systems. 7049--7058.  Marek Petrik and Reazul Hasan Russel. 2019. Beyond confidence regions: Tight bayesian ambiguity sets for robust mdps. In Advances in Neural Information Processing Systems. 7049--7058."},{"key":"e_1_3_2_1_47_1","volume-title":"Proceedings of the 34th International Conference on Machine Learning (Proceedings of Machine Learning Research","volume":"2826","author":"Pinto Lerrel","year":"2017","unstructured":"Lerrel Pinto , James Davidson , Rahul Sukthankar , and Abhinav Gupta . 2017 . 
Robust Adversarial Reinforcement Learning . In Proceedings of the 34th International Conference on Machine Learning (Proceedings of Machine Learning Research , Vol. 70), Doina Precup and Yee Whye Teh (Eds.). PMLR, 2817-- 2826 . http:\/\/proceedings.mlr.press\/v70\/pinto17a.html Lerrel Pinto, James Davidson, Rahul Sukthankar, and Abhinav Gupta. 2017. Robust Adversarial Reinforcement Learning. In Proceedings of the 34th International Conference on Machine Learning (Proceedings of Machine Learning Research, Vol. 70), Doina Precup and Yee Whye Teh (Eds.). PMLR, 2817--2826. http:\/\/proceedings.mlr.press\/v70\/pinto17a.html"},{"key":"e_1_3_2_1_48_1","volume-title":"Robust Batch Policy Learning in Markov Decision Processes. arXiv preprint arXiv:2011.04185","author":"Qi Zhengling","year":"2020","unstructured":"Zhengling Qi and Peng Liao . 2020. Robust Batch Policy Learning in Markov Decision Processes. arXiv preprint arXiv:2011.04185 ( 2020 ). Zhengling Qi and Peng Liao. 2020. Robust Batch Policy Learning in Markov Decision Processes. arXiv preprint arXiv:2011.04185 (2020)."},{"key":"e_1_3_2_1_49_1","volume-title":"Proceedings of the 37th International Conference on Machine Learning (Proceedings of Machine Learning Research","author":"Raghunathan Aditi","year":"2020","unstructured":"Aditi Raghunathan , Sang Michael Xie , Fanny Yang , John Duchi , and Percy Liang . 2020 . Understanding and Mitigating the Tradeoff between Robustness and Accuracy . In Proceedings of the 37th International Conference on Machine Learning (Proceedings of Machine Learning Research , Vol. 119), Hal Daum\u00e9 III and Aarti Singh (Eds.). PMLR, 7909--7919. https:\/\/proceedings.mlr.press\/v119\/raghunathan20a.html Aditi Raghunathan, Sang Michael Xie, Fanny Yang, John Duchi, and Percy Liang. 2020. Understanding and Mitigating the Tradeoff between Robustness and Accuracy. In Proceedings of the 37th International Conference on Machine Learning (Proceedings of Machine Learning Research, Vol. 
119), Hal Daum\u00e9 III and Aarti Singh (Eds.). PMLR, 7909--7919. https:\/\/proceedings.mlr.press\/v119\/raghunathan20a.html"},{"key":"e_1_3_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.21314\/JOR.2000.038"},{"key":"e_1_3_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.5555\/3291125.3291161"},{"key":"e_1_3_2_1_52_1","volume-title":"Anchor regression: heterogeneous data meets causality. arXiv preprint arXiv:1801.06229","author":"Rothenh\u00e4usler Dominik","year":"2018","unstructured":"Dominik Rothenh\u00e4usler, Nicolai Meinshausen, Peter B\u00fchlmann, and Jonas Peters. 2018. Anchor regression: heterogeneous data meets causality. arXiv preprint arXiv:1801.06229 (2018)."},{"key":"e_1_3_2_1_53_1","doi-asserted-by":"crossref","unstructured":"Alexander Shapiro, Darinka Dentcheva, and Andrzej Ruszczy\u0144ski. 2014. Lectures on stochastic programming: modeling and theory. SIAM.","DOI":"10.1137\/1.9781611973433"},{"key":"e_1_3_2_1_54_1","volume-title":"Distributional Robust Batch Contextual Bandits. arXiv preprint arXiv:2006.05630","author":"Si Nian","year":"2020","unstructured":"Nian Si, Fan Zhang, Zhengyuan Zhou, and Jose Blanchet. 2020. Distributional Robust Batch Contextual Bandits. arXiv preprint arXiv:2006.05630 (2020)."},{"key":"e_1_3_2_1_55_1","volume-title":"Certifying Some Distributional Robustness with Principled Adversarial Training. 
In International Conference on Learning Representations.","author":"Sinha Aman","year":"2018","unstructured":"Aman Sinha, Hongseok Namkoong, and John Duchi. 2018. Certifying Some Distributional Robustness with Principled Adversarial Training. In International Conference on Learning Representations."},{"key":"e_1_3_2_1_56_1","volume-title":"NIPS workshop on Machine Learning and Computer Security.","author":"Staib Matthew","year":"2017","unstructured":"Matthew Staib and Stefanie Jegelka. 2017. Distributionally robust deep learning as a generalization of adversarial training. In NIPS workshop on Machine Learning and Computer Security."},{"key":"e_1_3_2_1_57_1","volume-title":"Proceedings of The 24th International Conference on Artificial Intelligence and Statistics (Proceedings of Machine Learning Research","volume":"2619","author":"Subbaswamy Adarsh","year":"2021","unstructured":"Adarsh Subbaswamy, Roy Adams, and Suchi Saria. 2021. Evaluating Model Robustness and Stability to Dataset Shift. In Proceedings of The 24th International Conference on Artificial Intelligence and Statistics (Proceedings of Machine Learning Research, Vol. 130), Arindam Banerjee and Kenji Fukumizu (Eds.). PMLR, 2611--2619. 
http:\/\/proceedings.mlr.press\/v130\/subbaswamy21a.html"},{"key":"e_1_3_2_1_58_1","volume-title":"The 22nd International Conference on Artificial Intelligence and Statistics. 3118--3127","author":"Subbaswamy Adarsh","year":"2019","unstructured":"Adarsh Subbaswamy, Peter Schulam, and Suchi Saria. 2019. Preventing failures due to dataset shift: Learning predictive models that transport. In The 22nd International Conference on Artificial Intelligence and Statistics. 3118--3127."},{"key":"e_1_3_2_1_59_1","volume-title":"Reinforcement learning: An introduction","author":"Sutton Richard S","unstructured":"Richard S Sutton and Andrew G Barto. 2018. Reinforcement learning: An introduction. MIT press."},{"key":"e_1_3_2_1_60_1","volume-title":"International Conference on Machine Learning. 181--189","author":"Tamar Aviv","year":"2014","unstructured":"Aviv Tamar, Shie Mannor, and Huan Xu. 2014. Scaling up robust MDPs using function approximation. In International Conference on Machine Learning. 181--189."},{"key":"e_1_3_2_1_61_1","volume-title":"Lin (Eds.)","volume":"33","author":"Taori Rohan","year":"2020","unstructured":"Rohan Taori , Achal Dave , Vaishaal Shankar , Nicholas Carlini , Benjamin Recht , and Ludwig Schmidt . 2020 . Measuring Robustness to Natural Distribution Shifts in Image Classification. In Advances in Neural Information Processing Systems, H. Larochelle, M. Ranzato, R. Hadsell, M. F. Balcan, and H . Lin (Eds.) , Vol. 33 . Curran Associates, Inc. , 18583--18599. 
https:\/\/proceedings.neurips.cc\/paper\/2020\/file\/d8330f857a17c53d217014ee776bfd50-Paper.pdf"},{"key":"e_1_3_2_1_62_1","volume-title":"International Conference on Machine Learning. 2139--2148","author":"Thomas Philip","year":"2016","unstructured":"Philip Thomas and Emma Brunskill. 2016. Data-efficient off-policy policy evaluation for reinforcement learning. In International Conference on Machine Learning. 2139--2148."},{"key":"e_1_3_2_1_63_1","volume-title":"Advances in Neural Information Processing Systems","volume":"33","author":"Uehara Masatoshi","year":"2020","unstructured":"Masatoshi Uehara, Masahiro Kato, and Shota Yasui. 2020. Off-Policy Evaluation and Learning for External Validity under a Covariate Shift. In Advances in Neural Information Processing Systems, H. Larochelle, M. Ranzato, R. Hadsell, M. F. Balcan, and H. Lin (Eds.), Vol. 33. Curran Associates, Inc., 49--61. 
https:\/\/proceedings.neurips.cc\/paper\/2020\/file\/0084ae4bc24c0795d1e6a4f58444d39b-Paper.pdf"},{"key":"e_1_3_2_1_64_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIT.2014.2320500"},{"key":"e_1_3_2_1_65_1","doi-asserted-by":"publisher","DOI":"10.1038\/s41592-019-0686-2"},{"key":"e_1_3_2_1_66_1","doi-asserted-by":"publisher","DOI":"10.1287\/moor.1120.0566"},{"key":"e_1_3_2_1_67_1","volume-title":"Proceedings of the 37th International Conference on Machine Learning (Proceedings of Machine Learning Research","author":"Zhang Amy","year":"2020","unstructured":"Amy Zhang, Clare Lyle, Shagun Sodhani, Angelos Filos, Marta Kwiatkowska, Joelle Pineau, Yarin Gal, and Doina Precup. 2020. Invariant Causal Prediction for Block MDPs. In Proceedings of the 37th International Conference on Machine Learning (Proceedings of Machine Learning Research, Vol. 119), Hal Daum\u00e9 III and Aarti Singh (Eds.). PMLR, 11214--11224. http:\/\/proceedings.mlr.press\/v119\/zhang20t.html"},{"key":"e_1_3_2_1_68_1","volume-title":"Proceedings of The 24th International Conference on Artificial Intelligence and Statistics (Proceedings of Machine Learning Research","volume":"130","author":"Zhou Zhengqing","year":"2021","unstructured":"Zhengqing Zhou, Zhengyuan Zhou, Qinxun Bai, Linhai Qiu, Jose Blanchet, and Peter Glynn. 2021. Finite-Sample Regret Bound for Distributionally Robust Offline Tabular Reinforcement Learning. In Proceedings of The 24th International Conference on Artificial Intelligence and Statistics (Proceedings of Machine Learning Research, Vol. 
130), Arindam Banerjee and Kenji Fukumizu (Eds.). PMLR, 3331--3339. https:\/\/proceedings.mlr.press\/v130\/zhou21d.html"}],"event":{"name":"AIES '22: AAAI\/ACM Conference on AI, Ethics, and Society","location":"Oxford United Kingdom","acronym":"AIES '22","sponsor":["SIGAI ACM Special Interest Group on Artificial Intelligence","AAAI"]},"container-title":["Proceedings of the 2022 AAAI\/ACM Conference on AI, Ethics, and Society"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3514094.3534198","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3514094.3534198","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3514094.3534198","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:02:37Z","timestamp":1750186957000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3514094.3534198"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,7,26]]},"references-count":69,"alternative-id":["10.1145\/3514094.3534198","10.1145\/3514094"],"URL":"https:\/\/doi.org\/10.1145\/3514094.3534198","relation":{},"subject":[],"published":{"date-parts":[[2022,7,26]]},"assertion":[{"value":"2022-07-27","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}