{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,7,23]],"date-time":"2026-07-23T16:04:57Z","timestamp":1784822697242,"version":"3.55.0"},"reference-count":402,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2022,11,9]],"date-time":"2022-11-09T00:00:00Z","timestamp":1667952000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"National Science Foundation","award":["IIS1714741, CNS1815636, IIS1845081, IIS1907704, DRL2025244, IIS1928278, IIS1955285, IOS2107215, IOS2035472"],"award-info":[{"award-number":["IIS1714741, CNS1815636, IIS1845081, IIS1907704, DRL2025244, IIS1928278, IIS1955285, IOS2107215, IOS2035472"]}]},{"DOI":"10.13039\/100000183","name":"Army Research Office","doi-asserted-by":"crossref","award":["W911NF-21-1-0198"],"award-info":[{"award-number":["W911NF-21-1-0198"]}],"id":[{"id":"10.13039\/100000183","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Intell. Syst. Technol."],"published-print":{"date-parts":[[2023,2,28]]},"abstract":"<jats:p>In the past few decades,<jats:bold>artificial intelligence (AI)<\/jats:bold>technology has experienced swift developments, changing everyone\u2019s daily life and profoundly altering the course of human society. The intention behind developing AI was and is to benefit humans by reducing labor, increasing everyday conveniences, and promoting social good. However, recent research and AI applications indicate that AI can cause unintentional harm to humans by, for example, making unreliable decisions in safety-critical scenarios or undermining fairness by inadvertently discriminating against a group or groups. Consequently, trustworthy AI has recently garnered increased attention regarding the need to avoid the adverse effects that AI could bring to people, so people can fully trust and live in harmony with AI technologies.<\/jats:p><jats:p>A tremendous amount of research on trustworthy AI has been conducted and witnessed in recent years. In this survey, we present a comprehensive appraisal of trustworthy AI from a computational perspective to help readers understand the latest technologies for achieving trustworthy AI. Trustworthy AI is a large and complex subject, involving various dimensions. In this work, we focus on six of the most crucial dimensions in achieving trustworthy AI: (i) Safety &amp; Robustness, (ii) Nondiscrimination &amp; Fairness, (iii) Explainability, (iv) Privacy, (v) Accountability &amp; Auditability, and (vi) Environmental Well-being. For each dimension, we review the recent related technologies according to a taxonomy and summarize their applications in real-world systems. We also discuss the accordant and conflicting interactions among different dimensions and discuss potential aspects for trustworthy AI to investigate in the future.<\/jats:p>","DOI":"10.1145\/3546872","type":"journal-article","created":{"date-parts":[[2022,7,12]],"date-time":"2022-07-12T11:19:37Z","timestamp":1657624777000},"page":"1-59","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":171,"title":["Trustworthy AI: A Computational Perspective"],"prefix":"10.1145","volume":"14","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5290-7163","authenticated-orcid":false,"given":"Haochen","family":"Liu","sequence":"first","affiliation":[{"name":"Michigan State University, East Lansing, MI, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9594-1919","authenticated-orcid":false,"given":"Yiqi","family":"Wang","sequence":"additional","affiliation":[{"name":"Michigan State University, East Lansing, MI, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4049-1233","authenticated-orcid":false,"given":"Wenqi","family":"Fan","sequence":"additional","affiliation":[{"name":"The Hong Kong Polytechnic University, Hong Kong"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8217-5688","authenticated-orcid":false,"given":"Xiaorui","family":"Liu","sequence":"additional","affiliation":[{"name":"Michigan State University, East Lansing, MI, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6227-7844","authenticated-orcid":false,"given":"Yaxin","family":"Li","sequence":"additional","affiliation":[{"name":"Michigan State University, East Lansing, MI, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9007-6169","authenticated-orcid":false,"given":"Shaili","family":"Jain","sequence":"additional","affiliation":[{"name":"Twitter, San Francisco, CA, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8052-9200","authenticated-orcid":false,"given":"Yunhao","family":"Liu","sequence":"additional","affiliation":[{"name":"Tsinghua University, Beijing, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6369-6995","authenticated-orcid":false,"given":"Anil","family":"Jain","sequence":"additional","affiliation":[{"name":"Michigan State University, East Lansing, MI, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7125-3898","authenticated-orcid":false,"given":"Jiliang","family":"Tang","sequence":"additional","affiliation":[{"name":"Michigan State University, East Lansing, MI, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2022,11,9]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"publisher","DOI":"10.1109\/IEEESTD.2008.4601584"},{"key":"e_1_3_2_3_2","unstructured":"2017. The Montreal Declaration of Responsible AI. https:\/\/www.montrealdeclaration-responsibleai.com\/the-declaration. Accessed March 18 2021."},{"key":"e_1_3_2_4_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ijmedinf.2018.01.007"},{"key":"e_1_3_2_5_2","unstructured":"2019. Governance Principles for the New Generation Artificial Intelligence\u2013Developing Responsible Artificial Intelligence. https:\/\/www.chinadaily.com.cn\/a\/201906\/17\/WS5d07486ba3103dbf14328ab7.html. Accessed March 18 2021."},{"key":"e_1_3_2_6_2","unstructured":"2021. Federated AI Technology Enabler. https:\/\/fate.fedai.org\/."},{"key":"e_1_3_2_7_2","unstructured":"2021. LEAF: A Benchmark for Federated Settings. https:\/\/leaf.cmu.edu\/."},{"key":"e_1_3_2_8_2","unstructured":"2021. A List of Homomorphic Encryption Libraries Software or Resources. https:\/\/github.com\/jonaschn\/awesome-he."},{"key":"e_1_3_2_9_2","unstructured":"2021. A List of MPC Software or Resources. https:\/\/github.com\/rdragos\/awesome-mpc."},{"key":"e_1_3_2_10_2","unstructured":"2021. OenDP: Open Source Tools for Differential Privacy. https:\/\/opendp.org\/."},{"key":"e_1_3_2_11_2","unstructured":"2021. Opacus: Train PyTorch Models with Differential Privacy. https:\/\/opacus.ai\/."},{"key":"e_1_3_2_12_2","unstructured":"2021. Paddle Federated Learning. https:\/\/github.com\/PaddlePaddle\/PaddleFL."},{"key":"e_1_3_2_13_2","unstructured":"2021. A Technical Analysis of Confidential Computing. https:\/\/confidentialcomputing.io\/wp-content\/uploads\/sites\/85\/2021\/03\/CCC-Tech-Analysis-Confidential-Computing-V1.pdf. Accessed Jan 2021."},{"key":"e_1_3_2_14_2","unstructured":"2021. TensorFlow Federated. https:\/\/github.com\/tensorflow\/federated."},{"key":"e_1_3_2_15_2","unstructured":"2021. TensorFlow Privacy. https:\/\/github.com\/tensorflow\/privacy."},{"key":"e_1_3_2_16_2","doi-asserted-by":"publisher","DOI":"10.1145\/2976749.2978318"},{"key":"e_1_3_2_17_2","doi-asserted-by":"publisher","DOI":"10.3390\/sym13122439"},{"key":"e_1_3_2_18_2","doi-asserted-by":"publisher","DOI":"10.1145\/3214303"},{"key":"e_1_3_2_19_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2018.2870052"},{"key":"e_1_3_2_20_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.33012412"},{"key":"e_1_3_2_21_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10115-017-1116-3"},{"key":"e_1_3_2_22_2","first-page":"120","volume-title":"International Conference on Machine Learning","author":"Agarwal Alekh","year":"2019","unstructured":"Alekh Agarwal, Miroslav Dudik, and Zhiwei Steven Wu. 2019. Fair regression: Quantitative definitions and reduction-based algorithms. In International Conference on Machine Learning. PMLR, 120\u2013129."},{"key":"e_1_3_2_23_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.33011418"},{"key":"e_1_3_2_24_2","doi-asserted-by":"publisher","DOI":"10.1145\/3319535.3339819"},{"key":"e_1_3_2_25_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10207-007-0049-3"},{"key":"e_1_3_2_26_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2018.2807385"},{"key":"e_1_3_2_27_2","doi-asserted-by":"publisher","DOI":"10.1109\/MSEC.2018.2888775"},{"key":"e_1_3_2_28_2","volume-title":"International Conference on Learning Representations","author":"Ancona Marco","year":"2018","unstructured":"Marco Ancona, Enea Ceolini, Cengiz \u00d6ztireli, and Markus Gross. 2018. Towards better understanding of gradient-based attribution methods for deep neural networks. In International Conference on Learning Representations."},{"key":"e_1_3_2_29_2","doi-asserted-by":"crossref","unstructured":"Rohan Anil Badih Ghazi Vineet Gupta Ravi Kumar and Pasin Manurangsi. 2021. Large-Scale Differentially Private BERT. arxiv:2108.01624 [cs.LG]","DOI":"10.18653\/v1\/2022.findings-emnlp.484"},{"key":"e_1_3_2_30_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.inffus.2019.12.012"},{"issue":"130","key":"e_1_3_2_31_2","first-page":"1","article-title":"AI explainability 360: An extensible toolkit for understanding data and machine learning models","volume":"21","author":"Arya Vijay","year":"2020","unstructured":"Vijay Arya, Rachel K. E. Bellamy, Pin-Yu Chen, Amit Dhurandhar, Michael Hind, Samuel C. Hoffman, Stephanie Houde, Q. Vera Liao, Ronny Luss, Aleksandra Mojsilovic, et\u00a0al. 2020. AI explainability 360: An extensible toolkit for understanding data and machine learning models. Journal of Machine Learning Research 21, 130 (2020), 1\u20136.","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_3_2_32_2","first-page":"405","volume-title":"International Conference on Machine Learning","author":"Backurs Arturs","year":"2019","unstructured":"Arturs Backurs, Piotr Indyk, Krzysztof Onak, Baruch Schieber, Ali Vakilian, and Tal Wagner. 2019. Scalable fair clustering. In International Conference on Machine Learning. PMLR, 405\u2013413."},{"key":"e_1_3_2_33_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10994-010-5188-5"},{"key":"e_1_3_2_34_2","doi-asserted-by":"publisher","DOI":"10.1109\/FOCS.2014.56"},{"key":"e_1_3_2_35_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.winlp-1.25"},{"key":"e_1_3_2_36_2","doi-asserted-by":"publisher","DOI":"10.1145\/3360544"},{"key":"e_1_3_2_37_2","article-title":"AI fairness 360: An extensible toolkit for detecting, understanding, and mitigating unwanted algorithmic bias","author":"Bellamy Rachel K. E.","year":"2018","unstructured":"Rachel K. E. Bellamy, Kuntal Dey, Michael Hind, Samuel C. Hoffman, Stephanie Houde, Kalapriya Kannan, Pranay Lohia, Jacquelyn Martino, Sameep Mehta, Aleksandra Mojsilovic, et\u00a0al. 2018. AI fairness 360: An extensible toolkit for detecting, understanding, and mitigating unwanted algorithmic bias. arXiv preprint arXiv:1810.01943 (2018).","journal-title":"arXiv preprint arXiv:1810.01943"},{"key":"e_1_3_2_38_2","article-title":"Principles and practice of explainable machine learning","author":"Belle Vaishak","year":"2020","unstructured":"Vaishak Belle and Ioannis Papantonis. 2020. Principles and practice of explainable machine learning. arXiv preprint arXiv:2009.11698 (2020).","journal-title":"arXiv preprint arXiv:2009.11698"},{"key":"e_1_3_2_39_2","article-title":"A convex framework for fair regression","author":"Berk Richard","year":"2017","unstructured":"Richard Berk, Hoda Heidari, Shahin Jabbari, Matthew Joseph, Michael Kearns, Jamie Morgenstern, Seth Neel, and Aaron Roth. 2017. A convex framework for fair regression. arXiv preprint arXiv:1706.02409 (2017).","journal-title":"arXiv preprint arXiv:1706.02409"},{"key":"e_1_3_2_40_2","doi-asserted-by":"publisher","DOI":"10.1177\/0049124118782533"},{"key":"e_1_3_2_41_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-40994-3_25"},{"key":"e_1_3_2_42_2","article-title":"Poisoning attacks against support vector machines","author":"Biggio Battista","year":"2012","unstructured":"Battista Biggio, Blaine Nelson, and Pavel Laskov. 2012. Poisoning attacks against support vector machines. arXiv preprint arXiv:1206.6389 (2012).","journal-title":"arXiv preprint arXiv:1206.6389"},{"key":"e_1_3_2_43_2","volume-title":"Pattern Recognition and Machine Learning","author":"Bishop Christopher M.","year":"2006","unstructured":"Christopher M. Bishop. 2006. Pattern Recognition and Machine Learning. Springer."},{"key":"e_1_3_2_44_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.485"},{"key":"e_1_3_2_45_2","article-title":"Adversarial attacks on node embeddings","author":"Bojcheski Aleksandar","year":"2018","unstructured":"Aleksandar Bojcheski and Stephan G\u00fcnnemann. 2018. Adversarial attacks on node embeddings. arXiv preprint arXiv:1809.01093 (2018).","journal-title":"arXiv preprint arXiv:1809.01093"},{"key":"e_1_3_2_46_2","unstructured":"Aleksandar Bojchevski and Stephan G\u00fcnnemann. 2019. Adversarial attacks on node embeddings via graph poisoning. arxiv:1809.01093 [cs.LG]"},{"key":"e_1_3_2_47_2","first-page":"4349","volume-title":"Advances in Neural Information Processing Systems","author":"Bolukbasi Tolga","year":"2016","unstructured":"Tolga Bolukbasi, Kai-Wei Chang, James Y. Zou, Venkatesh Saligrama, and Adam T. Kalai. 2016. Man is to computer programmer as woman is to homemaker? Debiasing word embeddings. In Advances in Neural Information Processing Systems. 4349\u20134357."},{"key":"e_1_3_2_48_2","article-title":"Identifying and reducing gender bias in word-level language models","author":"Bordia Shikha","year":"2019","unstructured":"Shikha Bordia and Samuel R. Bowman. 2019. Identifying and reducing gender bias in word-level language models. arXiv preprint arXiv:1904.03035 (2019).","journal-title":"arXiv preprint arXiv:1904.03035"},{"key":"e_1_3_2_49_2","doi-asserted-by":"publisher","DOI":"10.1145\/3308560.3317593"},{"key":"e_1_3_2_50_2","article-title":"Compositional fairness constraints for graph embeddings","author":"Bose Avishek Joey","year":"2019","unstructured":"Avishek Joey Bose and William L. Hamilton. 2019. Compositional fairness constraints for graph embeddings. arXiv preprint arXiv:1905.10674 (2019).","journal-title":"arXiv preprint arXiv:1905.10674"},{"key":"e_1_3_2_51_2","doi-asserted-by":"publisher","DOI":"10.1111\/j.1468-0386.2007.00378.x"},{"key":"e_1_3_2_52_2","doi-asserted-by":"publisher","DOI":"10.1109\/MSP.2012.2230218"},{"key":"e_1_3_2_53_2","doi-asserted-by":"publisher","DOI":"10.1145\/3406325.3451131"},{"key":"e_1_3_2_54_2","article-title":"Toward trustworthy AI development: Mechanisms for supporting verifiable claims","author":"Brundage Miles","year":"2020","unstructured":"Miles Brundage, Shahar Avin, Jasmine Wang, Haydn Belfield, Gretchen Krueger, Gillian Hadfield, Heidy Khlaaf, Jingying Yang, Helen Toner, Ruth Fong, et\u00a0al. 2020. Toward trustworthy AI development: Mechanisms for supporting verifiable claims. arXiv preprint arXiv:2004.07213 (2020).","journal-title":"arXiv preprint arXiv:2004.07213"},{"key":"e_1_3_2_55_2","first-page":"803","volume-title":"International Conference on Machine Learning","author":"Brunet Marc-Etienne","year":"2019","unstructured":"Marc-Etienne Brunet, Colleen Alkalay-Houlihan, Ashton Anderson, and Richard Zemel. 2019. Understanding the origins of bias in word embeddings. In International Conference on Machine Learning. PMLR, 803\u2013811."},{"issue":"4","key":"e_1_3_2_56_2","first-page":"53","article-title":"A (very) brief history of artificial intelligence","volume":"26","author":"Buchanan Bruce G.","year":"2005","unstructured":"Bruce G. Buchanan. 2005. A (very) brief history of artificial intelligence. AI Magazine 26, 4 (2005), 53\u201353.","journal-title":"AI Magazine"},{"key":"e_1_3_2_57_2","article-title":"Notes from the AI frontier: Modeling the impact of AI on the world economy","author":"Bughin Jacques","year":"2018","unstructured":"Jacques Bughin, Jeongmin Seong, James Manyika, Michael Chui, and Raoul Joshi. 2018. Notes from the AI frontier: Modeling the impact of AI on the world economy. McKinsey Global Institute (2018).","journal-title":"McKinsey Global Institute"},{"key":"e_1_3_2_58_2","first-page":"77","volume-title":"Conference on Fairness, Accountability and Transparency","author":"Buolamwini Joy","year":"2018","unstructured":"Joy Buolamwini and Timnit Gebru. 2018. Gender shades: Intersectional accuracy disparities in commercial gender classification. In Conference on Fairness, Accountability and Transparency. 77\u201391."},{"key":"e_1_3_2_59_2","doi-asserted-by":"publisher","DOI":"10.1177\/2053951715622512"},{"key":"e_1_3_2_60_2","first-page":"622","volume-title":"Asian Conference on Machine Learning","author":"Cai Ermao","year":"2017","unstructured":"Ermao Cai, Da-Cheng Juan, Dimitrios Stamoulis, and Diana Marculescu. 2017. Neuralpower: Predict and deploy energy-efficient convolutional neural networks. In Asian Conference on Machine Learning. PMLR, 622\u2013637."},{"key":"e_1_3_2_61_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDMW.2009.83"},{"key":"e_1_3_2_62_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10618-010-0190-x"},{"key":"e_1_3_2_63_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-30487-3_3"},{"key":"e_1_3_2_64_2","doi-asserted-by":"publisher","DOI":"10.1145\/3289602.3293898"},{"key":"e_1_3_2_65_2","first-page":"267","volume-title":"28th \\( \\lbrace \\) USENIX \\( \\rbrace \\) Security Symposium ( \\( \\lbrace \\) USENIX \\( \\rbrace \\) Security 19)","author":"Carlini Nicholas","year":"2019","unstructured":"Nicholas Carlini, Chang Liu, \u00dalfar Erlingsson, Jernej Kos, and Dawn Song. 2019. The secret sharer: Evaluating and testing unintended memorization in neural networks. In 28th \\( \\lbrace \\) USENIX \\( \\rbrace \\) Security Symposium ( \\( \\lbrace \\) USENIX \\( \\rbrace \\) Security 19). 267\u2013284."},{"key":"e_1_3_2_66_2","first-page":"2633","volume-title":"30th USENIX Security Symposium (USENIX Security 21)","author":"Carlini Nicholas","year":"2021","unstructured":"Nicholas Carlini, Florian Tramer, Eric Wallace, Matthew Jagielski, Ariel Herbert-Voss, Katherine Lee, Adam Roberts, Tom Brown, Dawn Song, Ulfar Erlingsson, et\u00a0al. 2021. Extracting training data from large language models. In 30th USENIX Security Symposium (USENIX Security 21). 2633\u20132650."},{"key":"e_1_3_2_67_2","doi-asserted-by":"publisher","DOI":"10.1145\/3128572.3140444"},{"key":"e_1_3_2_68_2","doi-asserted-by":"crossref","unstructured":"Nicholas Carlini and David Wagner. 2017. Towards Evaluating the Robustness of Neural Networks. arxiv:1608.04644 [cs.CR]","DOI":"10.1109\/SP.2017.49"},{"key":"e_1_3_2_69_2","first-page":"1","volume-title":"2018 IEEE Security and Privacy Workshops (SPW)","author":"Carlini Nicholas","year":"2018","unstructured":"Nicholas Carlini and David Wagner. 2018. Audio adversarial examples: Targeted attacks on speech-to-text. In 2018 IEEE Security and Privacy Workshops (SPW). IEEE, 1\u20137."},{"key":"e_1_3_2_70_2","unstructured":"Yair Carmon Aditi Raghunathan Ludwig Schmidt Percy Liang and John C. Duchi. 2019. Unlabeled Data Improves Adversarial Robustness. arxiv:1905.13736 [stat.ML]"},{"key":"e_1_3_2_71_2","article-title":"Many cars tone deaf to women\u2019s voices","author":"Carty S.","year":"2011","unstructured":"S. Carty. 2011. Many cars tone deaf to women\u2019s voices. AOL Autos (2011).","journal-title":"AOL Autos"},{"key":"e_1_3_2_72_2","article-title":"Fairness in machine learning: A survey","author":"Caton Simon","year":"2020","unstructured":"Simon Caton and Christian Haas. 2020. Fairness in machine learning: A survey. arXiv preprint arXiv:2010.04053 (2020).","journal-title":"arXiv preprint arXiv:2010.04053"},{"key":"e_1_3_2_73_2","article-title":"How to be fair and diverse?","volume":"1610","author":"Celis L.","year":"2016","unstructured":"L. Celis, Amit Deshpande, Tarun Kathuria, and N. Vishnoi. 2016. How to be fair and diverse?ArXiv abs\/1610.07183 (2016).","journal-title":"ArXiv"},{"key":"e_1_3_2_74_2","article-title":"Improved adversarial learning for fair classification","author":"Celis L. Elisa","year":"2019","unstructured":"L. Elisa Celis and Vijay Keswani. 2019. Improved adversarial learning for fair classification. arXiv preprint arXiv:1901.10443 (2019).","journal-title":"arXiv preprint arXiv:1901.10443"},{"key":"e_1_3_2_75_2","article-title":"Adversarial attacks and defences: A survey","author":"Chakraborty Anirban","year":"2018","unstructured":"Anirban Chakraborty, Manaar Alam, Vishal Dey, Anupam Chattopadhyay, and Debdeep Mukhopadhyay. 2018. Adversarial attacks and defences: A survey. arXiv preprint arXiv:1810.00069 (2018).","journal-title":"arXiv preprint arXiv:1810.00069"},{"key":"e_1_3_2_76_2","doi-asserted-by":"publisher","DOI":"10.1109\/ASPDAC.2018.8297302"},{"key":"e_1_3_2_77_2","first-page":"1122","volume-title":"International Conference on Machine Learning","author":"Chen Hongge","year":"2019","unstructured":"Hongge Chen, Huan Zhang, Duane Boning, and Cho-Jui Hsieh. 2019. Robust decision trees against adversarial examples. In International Conference on Machine Learning. PMLR, 1122\u20131131."},{"key":"e_1_3_2_78_2","article-title":"Why is my classifier discriminatory?","author":"Chen Irene","year":"2018","unstructured":"Irene Chen, Fredrik D. Johansson, and David Sontag. 2018. Why is my classifier discriminatory?arXiv preprint arXiv:1805.12002 (2018).","journal-title":"arXiv preprint arXiv:1805.12002"},{"key":"e_1_3_2_79_2","article-title":"Bias and debias in recommender system: A survey and future directions","author":"Chen Jiawei","year":"2020","unstructured":"Jiawei Chen, Hande Dong, Xiang Wang, Fuli Feng, Meng Wang, and Xiangnan He. 2020. Bias and debias in recommender system: A survey and future directions. arXiv preprint arXiv:2010.03240 (2020).","journal-title":"arXiv preprint arXiv:2010.03240"},{"key":"e_1_3_2_80_2","unstructured":"Jinyin Chen Yangyang Wu Xuanheng Xu Yixian Chen Haibin Zheng and Qi Xuan. 2018. Fast Gradient Attack on Network Embedding. arxiv:1809.02797 [physics.soc-ph]"},{"key":"e_1_3_2_81_2","doi-asserted-by":"publisher","DOI":"10.1145\/3385003.3410925"},{"key":"e_1_3_2_82_2","first-page":"1032","volume-title":"International Conference on Machine Learning","author":"Chen Xingyu","year":"2019","unstructured":"Xingyu Chen, Brandon Fain, Liang Lyu, and Kamesh Munagala. 2019. Proportionally fair clustering. In International Conference on Machine Learning. PMLR, 1032\u20131041."},{"key":"e_1_3_2_83_2","unstructured":"Xinyun Chen Chang Liu Bo Li Kimberly Lu and Dawn Song. 2017. Targeted Backdoor Attacks on Deep Learning Systems Using Data Poisoning. arxiv:1712.05526 [cs.CR]"},{"key":"e_1_3_2_84_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.eng.2020.01.007"},{"key":"e_1_3_2_85_2","article-title":"Seq2sick: Evaluating the robustness of sequence-to-sequence models with adversarial examples","author":"Cheng Minhao","year":"2018","unstructured":"Minhao Cheng, Jinfeng Yi, Huan Zhang, Pin-Yu Chen, and Cho-Jui Hsieh. 2018. Seq2sick: Evaluating the robustness of sequence-to-sequence models with adversarial examples. arXiv preprint arXiv:1803.01128 (2018).","journal-title":"arXiv preprint arXiv:1803.01128"},{"key":"e_1_3_2_86_2","article-title":"A survey of model compression and acceleration for deep neural networks","author":"Cheng Yu","year":"2017","unstructured":"Yu Cheng, Duo Wang, Pan Zhou, and Tao Zhang. 2017. A survey of model compression and acceleration for deep neural networks. arXiv preprint arXiv:1710.09282 (2017).","journal-title":"arXiv preprint arXiv:1710.09282"},{"key":"e_1_3_2_87_2","volume-title":"Transformers. Zip: Compressing Transformers with Pruning and Quantization","author":"Cheong Robin","year":"2019","unstructured":"Robin Cheong and Robel Daniel. 2019. Transformers. Zip: Compressing Transformers with Pruning and Quantization. Technical Report. Technical report, Stanford University, Stanford, California."},{"key":"e_1_3_2_88_2","volume-title":"AAAI","author":"Chiappa S.","year":"2019","unstructured":"S. Chiappa. 2019. Path-specific counterfactual fairness. In AAAI."},{"key":"e_1_3_2_89_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-662-53887-6_1"},{"key":"e_1_3_2_90_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W19-3824"},{"key":"e_1_3_2_91_2","article-title":"Towards the limit of network quantization","author":"Choi Yoojin","year":"2016","unstructured":"Yoojin Choi, Mostafa El-Khamy, and Jungwon Lee. 2016. Towards the limit of network quantization. arXiv preprint arXiv:1612.01543 (2016).","journal-title":"arXiv preprint arXiv:1612.01543"},{"key":"e_1_3_2_92_2","doi-asserted-by":"publisher","DOI":"10.1089\/big.2016.0047"},{"key":"e_1_3_2_93_2","article-title":"Fairer and more accurate, but for whom?","author":"Chouldechova Alexandra","year":"2017","unstructured":"Alexandra Chouldechova and Max G\u2019Sell. 2017. Fairer and more accurate, but for whom?arXiv preprint arXiv:1707.00046 (2017).","journal-title":"arXiv preprint arXiv:1707.00046"},{"key":"e_1_3_2_94_2","first-page":"1310","volume-title":"International Conference on Machine Learning","author":"Cohen Jeremy","year":"2019","unstructured":"Jeremy Cohen, Elan Rosenfeld, and Zico Kolter. 2019. Certified adversarial robustness via randomized smoothing. In International Conference on Machine Learning. PMLR, 1310\u20131320."},{"key":"e_1_3_2_95_2","volume-title":"Seventh International AAAI Conference on Weblogs and Social Media","author":"Cohen Raviv","year":"2013","unstructured":"Raviv Cohen and Derek Ruths. 2013. Classifying political orientation on Twitter: It\u2019s not easy!. In Seventh International AAAI Conference on Weblogs and Social Media."},{"key":"e_1_3_2_96_2","first-page":"2990","volume-title":"International Conference on Machine Learning","author":"Cohen Taco","year":"2016","unstructured":"Taco Cohen and Max Welling. 2016. Group equivariant convolutional networks. In International Conference on Machine Learning. PMLR, 2990\u20132999."},{"key":"e_1_3_2_97_2","article-title":"Independent high-level expert group on artificial intelligence (2019)","author":"Commission EC HLEG AI-European","year":"2019","unstructured":"EC HLEG AI-European Commission et\u00a0al. 2019. Independent high-level expert group on artificial intelligence (2019). Ethics Guidelines for Trustworthy AI (2019).","journal-title":"Ethics Guidelines for Trustworthy AI"},{"key":"e_1_3_2_98_2","article-title":"The measure and mismeasure of fairness: A critical review of fair machine learning","author":"Corbett-Davies Sam","year":"2018","unstructured":"Sam Corbett-Davies and Sharad Goel. 2018. The measure and mismeasure of fairness: A critical review of fair machine learning. arXiv preprint arXiv:1808.00023 (2018).","journal-title":"arXiv preprint arXiv:1808.00023"},{"key":"e_1_3_2_99_2","doi-asserted-by":"publisher","DOI":"10.1145\/3097983.3098095"},{"key":"e_1_3_2_100_2","article-title":"Algorithmic bias: A counterfactual perspective","author":"Cowgill Bo","year":"2017","unstructured":"Bo Cowgill and Catherine Tucker. 2017. Algorithmic bias: A counterfactual perspective. NSF Trustworthy Algorithms (2017).","journal-title":"NSF Trustworthy Algorithms"},{"key":"e_1_3_2_101_2","unstructured":"Francesco Croce Maksym Andriushchenko Vikash Sehwag Edoardo Debenedetti Nicolas Flammarion Mung Chiang Prateek Mittal and Matthias Hein. 2021. RobustBench: A Standardized Adversarial Robustness Benchmark. arxiv:2010.09670 [cs.LG]"},{"key":"e_1_3_2_102_2","unstructured":"Francesco Croce and Matthias Hein. 2020. Minimally Distorted Adversarial Examples with a Fast Adaptive Boundary Attack. arxiv:1907.02044 [cs.LG]"},{"key":"e_1_3_2_103_2","unstructured":"Francesco Croce and Matthias Hein. 2020. Reliable Evaluation of Adversarial Robustness with an Ensemble of Diverse Parameter-free Attacks. arxiv:2003.01690 [cs.LG]"},{"key":"e_1_3_2_104_2","doi-asserted-by":"publisher","DOI":"10.1145\/3314183.3323847"},{"key":"e_1_3_2_105_2","first-page":"72","volume-title":"Proceedings of the Second Workshop on Gender Bias in Natural Language Processing","author":"Curry Amanda Cercas","year":"2020","unstructured":"Amanda Cercas Curry, Judy Robertson, and Verena Rieser. 2020. Conversational assistants and gender stereotypes: Public perceptions and desiderata for voice personas. In Proceedings of the Second Workshop on Gender Bias in Natural Language Processing. 72\u201378."},{"key":"e_1_3_2_106_2","doi-asserted-by":"publisher","DOI":"10.1145\/3437963.3441752"},{"key":"e_1_3_2_107_2","unstructured":"Hanjun Dai Hui Li Tian Tian Xin Huang Lin Wang Jun Zhu and Le Song. 2018. Adversarial Attack on Graph Structured Data. arxiv:1806.02371 [cs.LG]"},{"key":"e_1_3_2_108_2","doi-asserted-by":"publisher","DOI":"10.1145\/1014052.1014066"},{"key":"e_1_3_2_109_2","article-title":"A survey of the state of explainable AI for natural language processing","author":"Danilevsky Marina","year":"2020","unstructured":"Marina Danilevsky, Kun Qian, Ranit Aharonov, Yannis Katsis, Ban Kawas, and Prithviraj Sen. 2020. A survey of the state of explainable AI for natural language processing. arXiv preprint arXiv:2010.00711 (2020).","journal-title":"arXiv preprint arXiv:2010.00711"},{"key":"e_1_3_2_110_2","doi-asserted-by":"publisher","DOI":"10.5555\/2612156.2612159"},{"key":"e_1_3_2_111_2","article-title":"An overview of privacy in machine learning","author":"Cristofaro Emiliano De","year":"2020","unstructured":"Emiliano De Cristofaro. 2020. An overview of privacy in machine learning. arXiv preprint arXiv:2005.08679 (2020).","journal-title":"arXiv preprint arXiv:2005.08679"},{"key":"e_1_3_2_112_2","article-title":"Universal transformers","author":"Dehghani Mostafa","year":"2018","unstructured":"Mostafa Dehghani, Stephan Gouws, Oriol Vinyals, Jakob Uszkoreit, and \u0141ukasz Kaiser. 2018. Universal transformers. arXiv preprint arXiv:1807.03819 (2018).","journal-title":"arXiv preprint arXiv:1807.03819"},{"key":"e_1_3_2_113_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-981-10-5209-5"},{"key":"e_1_3_2_114_2","article-title":"Assessing the consequences of text preprocessing decisions","author":"Denny Matthew James","year":"2016","unstructured":"Matthew James Denny and Arthur Spirling. 2016. Assessing the consequences of text preprocessing decisions. Available at SSRN (2016).","journal-title":"Available at SSRN"},{"key":"e_1_3_2_115_2","first-page":"4171","volume-title":"Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers)","author":"Devlin Jacob","year":"2019","unstructured":"Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). 4171\u20134186."},{"key":"e_1_3_2_116_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.emnlp-main.656"},{"key":"e_1_3_2_117_2","article-title":"AdverTorch v0.1: An adversarial robustness toolbox based on Pytorch","author":"Ding Gavin Weiguang","year":"2019","unstructured":"Gavin Weiguang Ding, Luyu Wang, and Xiaomeng Jin. 2019. AdverTorch v0.1: An adversarial robustness toolbox based on Pytorch. arXiv preprint arXiv:1902.07623 (2019).","journal-title":"arXiv preprint arXiv:1902.07623"},{"key":"e_1_3_2_118_2","doi-asserted-by":"publisher","DOI":"10.1145\/3278721.3278729"},{"key":"e_1_3_2_119_2","article-title":"Towards a rigorous science of interpretable machine learning","author":"Doshi-Velez Finale","year":"2017","unstructured":"Finale Doshi-Velez and Been Kim. 2017. Towards a rigorous science of interpretable machine learning. arXiv preprint arXiv:1702.08608 (2017).","journal-title":"arXiv preprint arXiv:1702.08608"},{"key":"e_1_3_2_120_2","doi-asserted-by":"publisher","DOI":"10.1145\/3359786"},{"key":"e_1_3_2_121_2","doi-asserted-by":"publisher","DOI":"10.5555\/1791834.1791836"},{"key":"e_1_3_2_122_2","doi-asserted-by":"publisher","DOI":"10.1145\/2090236.2090255"},{"key":"e_1_3_2_123_2","doi-asserted-by":"publisher","DOI":"10.1007\/11681878_14"},{"key":"e_1_3_2_124_2","doi-asserted-by":"publisher","DOI":"10.1561\/0400000042"},{"key":"e_1_3_2_125_2","volume-title":"Proceedings of Algorithmic Learning Theory,","volume":"83","author":"Ensign Danielle","year":"2018","unstructured":"Danielle Ensign, Sorelle A. Friedler, Scott Neville, Carlos Scheidegger, and Suresh Venkatasubramanian. 2018. Decision making with limited feedback: Error bounds for predictive policing and recidivism prediction. In Proceedings of Algorithmic Learning Theory, Vol. 83."},{"key":"e_1_3_2_126_2","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2012.48"},{"key":"e_1_3_2_127_2","article-title":"On the connection between adversarial robustness and saliency map interpretability","author":"Etmann Christian","year":"2019","unstructured":"Christian Etmann, Sebastian Lunz, Peter Maass, and Carola-Bibiane Sch\u00f6nlieb. 2019. On the connection between adversarial robustness and saliency map interpretability. arXiv preprint arXiv:1905.04172 (2019).","journal-title":"arXiv preprint arXiv:1905.04172"},{"key":"e_1_3_2_128_2","doi-asserted-by":"publisher","DOI":"10.1561\/3300000019"},{"key":"e_1_3_2_129_2","article-title":"Robust physical-world attacks on deep learning models","author":"Eykholt Kevin","year":"2017","unstructured":"Kevin Eykholt, Ivan Evtimov, Earlence Fernandes, Bo Li, Amir Rahmati, Chaowei Xiao, Atul Prakash, Tadayoshi Kohno, and Dawn Song. 2017. Robust physical-world attacks on deep learning models. arXiv preprint arXiv:1707.08945 (2017).","journal-title":"arXiv preprint arXiv:1707.08945"},{"key":"e_1_3_2_130_2","doi-asserted-by":"publisher","DOI":"10.5555\/3367032.3367224"},{"key":"e_1_3_2_131_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE51399.2021.00140"},{"key":"e_1_3_2_132_2","article-title":"Jointly attacking graph neural network and its explanations","author":"Fan Wenqi","year":"2021","unstructured":"Wenqi Fan, Wei Jin, Xiaorui Liu, Han Xu, Xianfeng Tang, Suhang Wang, Qing Li, Jiliang Tang, Jianping Wang, and Charu Aggarwal. 2021. Jointly attacking graph neural network and its explanations. arXiv preprint arXiv:2108.03388 (2021).","journal-title":"arXiv preprint arXiv:2108.03388"},{"key":"e_1_3_2_133_2","doi-asserted-by":"publisher","DOI":"10.1145\/3308558.3313488"},{"key":"e_1_3_2_134_2","article-title":"A graph neural network framework for social recommendations","author":"Fan Wenqi","year":"2020","unstructured":"Wenqi Fan, Yao Ma, Qing Li, Jianping Wang, Guoyong Cai, Jiliang Tang, and Dawei Yin. 2020. A graph neural network framework for social recommendations. IEEE Transactions on Knowledge and Data Engineering (2020).","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"key":"e_1_3_2_135_2","doi-asserted-by":"publisher","DOI":"10.1145\/3298689.3347011"},{"key":"e_1_3_2_136_2","doi-asserted-by":"publisher","DOI":"10.1145\/3274694.3274706"},{"key":"e_1_3_2_137_2","doi-asserted-by":"publisher","DOI":"10.1145\/2783258.2783311"},{"key":"e_1_3_2_138_2","doi-asserted-by":"publisher","DOI":"10.1145\/2783258.2783311"},{"key":"e_1_3_2_139_2","doi-asserted-by":"publisher","DOI":"10.1145\/3357713.3384290"},{"key":"e_1_3_2_140_2","article-title":"Learning fair representations via an adversarial framework","author":"Feng Rui","year":"2019","unstructured":"Rui Feng, Yang Yang, Yuehan Lyu, Chenhao Tan, Yizhou Sun, and Chunping Wang. 2019. Learning fair representations via an adversarial framework. arXiv preprint arXiv:1904.13341 (2019).","journal-title":"arXiv preprint arXiv:1904.13341"},{"key":"e_1_3_2_141_2","doi-asserted-by":"publisher","DOI":"10.1126\/science.aaw4399"},{"key":"e_1_3_2_142_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11023-018-9482-5"},{"key":"e_1_3_2_143_2","unstructured":"World Economic Forum. 2020. The Future of Jobs Report 2020 . World Economic Forum Geneva Switzerland."},{"key":"e_1_3_2_144_2","doi-asserted-by":"publisher","DOI":"10.1145\/2810103.2813677"},{"key":"e_1_3_2_145_2","first-page":"17","volume-title":"23rd \\( \\lbrace \\) USENIX \\( \\rbrace \\) Security Symposium ( \\( \\lbrace \\) USENIX \\( \\rbrace \\) Security 14)","author":"Fredrikson Matthew","year":"2014","unstructured":"Matthew Fredrikson, Eric Lantz, Somesh Jha, Simon Lin, David Page, and Thomas Ristenpart. 2014. Privacy in pharmacogenetics: An end-to-end case study of personalized warfarin dosing. In 23rd \\( \\lbrace \\) USENIX \\( \\rbrace \\) Security Symposium ( \\( \\lbrace \\) USENIX \\( \\rbrace \\) Security 14). 17\u201332."},{"key":"e_1_3_2_146_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.jpdc.2019.07.007"},{"key":"e_1_3_2_147_2","doi-asserted-by":"crossref","first-page":"3356","DOI":"10.18653\/v1\/2020.findings-emnlp.301","volume-title":"Findings of the Association for Computational Linguistics: EMNLP 2020","author":"Gehman Samuel","year":"2020","unstructured":"Samuel Gehman, Suchin Gururangan, Maarten Sap, Yejin Choi, and Noah A. Smith. 2020. RealToxicityPrompts: Evaluating neural toxic degeneration in language models. In Findings of the Association for Computational Linguistics: EMNLP 2020. 3356\u20133369."},{"key":"e_1_3_2_148_2","doi-asserted-by":"publisher","DOI":"10.1038\/s42256-020-00257-z"},{"key":"e_1_3_2_149_2","doi-asserted-by":"publisher","DOI":"10.1145\/1536414.1536440"},{"key":"e_1_3_2_150_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-20465-4_9"},{"key":"e_1_3_2_151_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.33013681"},{"key":"e_1_3_2_152_2","doi-asserted-by":"publisher","DOI":"10.1109\/DSAA.2018.00018"},{"key":"e_1_3_2_153_2","doi-asserted-by":"publisher","DOI":"10.1145\/3278721.3278722"},{"key":"e_1_3_2_154_2","doi-asserted-by":"publisher","DOI":"10.1145\/3278721.3278722"},{"key":"e_1_3_2_155_2","article-title":"Lipstick on a pig: Debiasing methods cover up systematic gender biases in word embeddings but do not remove them","author":"Gonen Hila","year":"2019","unstructured":"Hila Gonen and Yoav Goldberg. 2019. Lipstick on a pig: Debiasing methods cover up systematic gender biases in word embeddings but do not remove them. arXiv preprint arXiv:1903.03862 (2019).","journal-title":"arXiv preprint arXiv:1903.03862"},{"key":"e_1_3_2_156_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.findings-emnlp.180"},{"key":"e_1_3_2_157_2","article-title":"Adversarial and clean data are not twins","author":"Gong Zhitao","year":"2017","unstructured":"Zhitao Gong, Wenlu Wang, and Wei-Shinn Ku. 2017. Adversarial and clean data are not twins. arXiv preprint arXiv:1704.04960 (2017).","journal-title":"arXiv preprint arXiv:1704.04960"},{"key":"e_1_3_2_158_2","doi-asserted-by":"publisher","DOI":"10.4324\/9781315701301"},{"key":"e_1_3_2_159_2","article-title":"Explaining and harnessing adversarial examples","author":"Goodfellow Ian J.","year":"2014","unstructured":"Ian J. Goodfellow, Jonathon Shlens, and Christian Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 (2014).","journal-title":"arXiv preprint arXiv:1412.6572"},{"key":"e_1_3_2_160_2","doi-asserted-by":"publisher","DOI":"10.1145\/3287560.3287563"},{"key":"e_1_3_2_161_2","article-title":"On the (statistical) detection of adversarial examples","author":"Grosse Kathrin","year":"2017","unstructured":"Kathrin Grosse, Praveen Manoharan, Nicolas Papernot, Michael Backes, and Patrick McDaniel. 2017. On the (statistical) detection of adversarial examples. arXiv preprint arXiv:1702.06280 (2017).","journal-title":"arXiv preprint arXiv:1702.06280"},{"key":"e_1_3_2_162_2","doi-asserted-by":"publisher","DOI":"10.1145\/3236009"},{"key":"e_1_3_2_163_2","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2012.72"},{"key":"e_1_3_2_164_2","doi-asserted-by":"publisher","DOI":"10.1145\/3020078.3021745"},{"key":"e_1_3_2_165_2","doi-asserted-by":"publisher","DOI":"10.1145\/3007787.3001163"},{"key":"e_1_3_2_166_2","article-title":"Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding","author":"Han Song","year":"2015","unstructured":"Song Han, Huizi Mao, and William J. Dally. 2015. Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding. arXiv preprint arXiv:1510.00149 (2015).","journal-title":"arXiv preprint arXiv:1510.00149"},{"key":"e_1_3_2_167_2","article-title":"Deep speech: Scaling up end-to-end speech recognition","author":"Hannun Awni","year":"2014","unstructured":"Awni Hannun, Carl Case, Jared Casper, Bryan Catanzaro, Greg Diamos, Erich Elsen, Ryan Prenger, Sanjeev Satheesh, Shubho Sengupta, Adam Coates, et\u00a0al. 2014. Deep speech: Scaling up end-to-end speech recognition. arXiv preprint arXiv:1412.5567 (2014).","journal-title":"arXiv preprint arXiv:1412.5567"},{"key":"e_1_3_2_168_2","volume-title":"NIPS","author":"Hardt Moritz","year":"2016","unstructured":"Moritz Hardt, E. Price, and Nathan Srebro. 2016. Equality of opportunity in supervised learning. In NIPS."},{"key":"e_1_3_2_169_2","article-title":"FedML: A research library and benchmark for federated machine learning","author":"He Chaoyang","year":"2020","unstructured":"Chaoyang He, Songze Li, Jinhyun So, Mi Zhang, Hongyi Wang, Xiaoyang Wang, Praneeth Vepakomma, Abhishek Singh, Hang Qiu, Li Shen, Peilin Zhao, Yan Kang, Yang Liu, Ramesh Raskar, Qiang Yang, Murali Annavaram, and Salman Avestimehr. 2020. FedML: A research library and benchmark for federated machine learning. arXiv preprint arXiv:2007.13518 (2020).","journal-title":"arXiv preprint arXiv:2007.13518"},{"key":"e_1_3_2_170_2","article-title":"Calibration for the (computationally-identifiable) masses","volume":"1711","author":"H\u00e9bert-Johnson \u00darsula","year":"2017","unstructured":"\u00darsula H\u00e9bert-Johnson, M. P. Kim, O. Reingold, and G. N. Rothblum. 2017. Calibration for the (computationally-identifiable) masses. ArXiv abs\/1711.08513 (2017).","journal-title":"ArXiv"},{"key":"e_1_3_2_171_2","doi-asserted-by":"publisher","DOI":"10.1145\/3278721.3278777"},{"key":"e_1_3_2_172_2","article-title":"What shapes feature representations? Exploring datasets, architectures, and training","author":"Hermann Katherine L.","year":"2020","unstructured":"Katherine L. Hermann and Andrew K. Lampinen. 2020. What shapes feature representations? Exploring datasets, architectures, and training. arXiv preprint arXiv:2006.12433 (2020).","journal-title":"arXiv preprint arXiv:2006.12433"},{"key":"e_1_3_2_173_2","doi-asserted-by":"publisher","DOI":"10.1515\/til-2019-0004"},{"key":"e_1_3_2_174_2","article-title":"Distilling the knowledge in a neural network","author":"Hinton Geoffrey","year":"2015","unstructured":"Geoffrey Hinton, Oriol Vinyals, and Jeff Dean. 2015. Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531 (2015).","journal-title":"arXiv preprint arXiv:1503.02531"},{"key":"e_1_3_2_175_2","article-title":"Loss-aware weight quantization of deep networks","author":"Hou Lu","year":"2018","unstructured":"Lu Hou and James T. Kwok. 2018. Loss-aware weight quantization of deep networks. arXiv preprint arXiv:1802.08635 (2018).","journal-title":"arXiv preprint arXiv:1802.08635"},{"key":"e_1_3_2_176_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11948-017-9975-2"},{"key":"e_1_3_2_177_2","unstructured":"Hongsheng Hu Zoran Salcic Gillian Dobbie and Xuyun Zhang. 2021. Membership Inference Attacks on Machine Learning: A Survey. arxiv:2103.07853 [cs.LG]"},{"key":"e_1_3_2_178_2","volume-title":"International Conference on Learning Representations","author":"Huang Hanxun","year":"2020","unstructured":"Hanxun Huang, Xingjun Ma, Sarah Monazam Erfani, James Bailey, and Yisen Wang. 2020. Unlearnable examples: Making personal data unexploitable. In International Conference on Learning Representations."},{"key":"e_1_3_2_179_2","article-title":"Unlearnable examples: Making personal data unexploitable","author":"Huang Hanxun","year":"2021","unstructured":"Hanxun Huang, Xingjun Ma, Sarah Monazam Erfani, James Bailey, and Yisen Wang. 2021. Unlearnable examples: Making personal data unexploitable. arXiv preprint arXiv:2101.04898 (2021).","journal-title":"arXiv preprint arXiv:2101.04898"},{"key":"e_1_3_2_180_2","article-title":"Multilingual Twitter corpus and baselines for evaluating demographic bias in hate speech recognition","author":"Huang Xiaolei","year":"2020","unstructured":"Xiaolei Huang, Linzi Xing, Franck Dernoncourt, and Michael J. Paul. 2020. Multilingual Twitter corpus and baselines for evaluating demographic bias in hate speech recognition. arXiv preprint arXiv:2002.10361 (2020).","journal-title":"arXiv preprint arXiv:2002.10361"},{"key":"e_1_3_2_181_2","doi-asserted-by":"publisher","DOI":"10.1145\/3287560.3287600"},{"key":"e_1_3_2_182_2","doi-asserted-by":"publisher","DOI":"10.1109\/BigData47090.2019.9006487"},{"key":"e_1_3_2_183_2","article-title":"Speeding up convolutional neural networks with low rank expansions","author":"Jaderberg Max","year":"2014","unstructured":"Max Jaderberg, Andrea Vedaldi, and Andrew Zisserman. 2014. Speeding up convolutional neural networks with low rank expansions. arXiv preprint arXiv:1405.3866 (2014).","journal-title":"arXiv preprint arXiv:1405.3866"},{"key":"e_1_3_2_184_2","first-page":"22205","volume-title":"Advances in Neural Information Processing Systems","author":"Jagielski Matthew","year":"2020","unstructured":"Matthew Jagielski, Jonathan Ullman, and Alina Oprea. 2020. Auditing differentially private machine learning: How private is private SGD? In Advances in Neural Information Processing Systems, H. Larochelle, M. Ranzato, R. Hadsell, M. F. Balcan, and H. Lin (Eds.), Vol. 33. Curran Associates, Inc., 22205\u201322216. https:\/\/proceedings.neurips.cc\/paper\/2020\/file\/fc4ddc15f9f4b4b06ef7844d6bb53abf-Paper.pdf."},{"key":"e_1_3_2_185_2","article-title":"Differential privacy and machine learning: A survey and review","author":"Ji Zhanglong","year":"2014","unstructured":"Zhanglong Ji, Zachary C. Lipton, and Charles Elkan. 2014. Differential privacy and machine learning: A survey and review. arXiv preprint arXiv:1412.7584 (2014).","journal-title":"arXiv preprint arXiv:1412.7584"},{"key":"e_1_3_2_186_2","first-page":"702","volume-title":"International Conference on Artificial Intelligence and Statistics","author":"Jiang Heinrich","year":"2020","unstructured":"Heinrich Jiang and Ofir Nachum. 2020. Identifying and correcting label bias in machine learning. In International Conference on Artificial Intelligence and Statistics. PMLR, 702\u2013712."},{"key":"e_1_3_2_187_2","doi-asserted-by":"publisher","DOI":"10.1038\/s42256-020-00236-4"},{"key":"e_1_3_2_188_2","article-title":"Adversarial attacks and defenses on graphs: A review and empirical study","author":"Jin Wei","year":"2020","unstructured":"Wei Jin, Yaxin Li, Han Xu, Yiqi Wang, and Jiliang Tang. 2020. Adversarial attacks and defenses on graphs: A review and empirical study. arXiv preprint arXiv:2003.00653 (2020).","journal-title":"arXiv preprint arXiv:2003.00653"},{"key":"e_1_3_2_189_2","doi-asserted-by":"crossref","unstructured":"Wei Jin Yao Ma Xiaorui Liu Xianfeng Tang Suhang Wang and Jiliang Tang. 2020. Graph Structure Learning for Robust Graph Neural Networks. arxiv:2005.10203 [cs.LG]","DOI":"10.1145\/3394486.3403049"},{"key":"e_1_3_2_190_2","article-title":"Constance: Modeling annotation contexts to improve stance classification","author":"Joseph Kenneth","year":"2017","unstructured":"Kenneth Joseph, Lisa Friedland, William Hobbs, Oren Tsur, and David Lazer. 2017. Constance: Modeling annotation contexts to improve stance classification. arXiv preprint arXiv:1708.06309 (2017).","journal-title":"arXiv preprint arXiv:1708.06309"},{"key":"e_1_3_2_191_2","article-title":"Fair algorithms for infinite and contextual bandits","author":"Joseph Matthew","year":"2016","unstructured":"Matthew Joseph, M. Kearns, Jamie H. Morgenstern, Seth Neel, and A. Roth. 2016. Fair algorithms for infinite and contextual bandits. arXiv: Learning (2016).","journal-title":"arXiv: Learning"},{"key":"e_1_3_2_192_2","series-title":"Proceedings of Thirty Fourth Conference on Learning Theory","first-page":"2717","volume":"134","author":"Kairouz Peter","year":"2021","unstructured":"Peter Kairouz, Monica Ribero Diaz, Keith Rush, and Abhradeep Thakurta. 2021. (Nearly) dimension independent private ERM with AdaGrad rates via publicly estimated subspaces. In Proceedings of Thirty Fourth Conference on Learning Theory(Proceedings of Machine Learning Research, Vol. 134), Mikhail Belkin and Samory Kpotufe (Eds.). PMLR, 2717\u20132746. https:\/\/proceedings.mlr.press\/v134\/kairouz21a.html."},{"key":"e_1_3_2_193_2","article-title":"Censored and fair universal representations using generative adversarial models","author":"Kairouz Peter","year":"2019","unstructured":"Peter Kairouz, Jiachun Liao, Chong Huang, and Lalitha Sankar. 2019. Censored and fair universal representations using generative adversarial models. arXiv preprint arXiv:1910.00411 (2019).","journal-title":"arXiv preprint arXiv:1910.00411"},{"key":"e_1_3_2_194_2","doi-asserted-by":"publisher","DOI":"10.1038\/s42256-020-0186-1"},{"key":"e_1_3_2_195_2","doi-asserted-by":"publisher","DOI":"10.1109\/IC4.2009.4909197"},{"key":"e_1_3_2_196_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10115-011-0463-8"},{"key":"e_1_3_2_197_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10115-011-0463-8"},{"key":"e_1_3_2_198_2","volume-title":"Discrimination and Privacy in the Information Society","author":"Kamiran F.","year":"2013","unstructured":"F. Kamiran and I. \u017dliobait\u0117. 2013. Explainable and non-explainable discrimination in classification. In Discrimination and Privacy in the Information Society."},{"key":"e_1_3_2_199_2","volume-title":"ECML\/PKDD","author":"Kamishima Toshihiro","year":"2012","unstructured":"Toshihiro Kamishima, S. Akaho, Hideki Asoh, and J. Sakuma. 2012. Fairness-aware classifier with prejudice remover regularizer. In ECML\/PKDD."},{"key":"e_1_3_2_200_2","doi-asserted-by":"publisher","DOI":"10.5555\/2627817.2627918"},{"key":"e_1_3_2_201_2","first-page":"5132","volume-title":"International Conference on Machine Learning","author":"Karimireddy Sai Praneeth","year":"2020","unstructured":"Sai Praneeth Karimireddy, Satyen Kale, Mehryar Mohri, Sashank Reddi, Sebastian Stich, and Ananda Theertha Suresh. 2020. SCAFFOLD: Stochastic controlled averaging for federated learning. In International Conference on Machine Learning. PMLR, 5132\u20135143."},{"key":"e_1_3_2_202_2","first-page":"4519","volume-title":"International Conference on Artificial Intelligence and Statistics","author":"Khaled Ahmed","year":"2020","unstructured":"Ahmed Khaled, Konstantin Mishchenko, and Peter Richt\u00e1rik. 2020. Tighter theory for local SGD on identical and heterogeneous data. In International Conference on Artificial Intelligence and Statistics. PMLR, 4519\u20134529."},{"key":"e_1_3_2_203_2","volume-title":"NIPS","author":"Kilbertus Niki","year":"2017","unstructured":"Niki Kilbertus, Mateo Rojas-Carulla, Giambattista Parascandolo, Moritz Hardt, D. Janzing, and B. Sch\u00f6lkopf. 2017. Avoiding discrimination through causal reasoning. In NIPS."},{"key":"e_1_3_2_204_2","volume-title":"NeurIPS","author":"Kim M. P.","year":"2018","unstructured":"M. P. Kim, O. Reingold, and G. N. Rothblum. 2018. Fairness through computationally-bounded awareness. In NeurIPS."},{"key":"e_1_3_2_205_2","article-title":"Sequence-level knowledge distillation","author":"Kim Yoon","year":"2016","unstructured":"Yoon Kim and Alexander M. Rush. 2016. Sequence-level knowledge distillation. arXiv preprint arXiv:1606.07947 (2016).","journal-title":"arXiv preprint arXiv:1606.07947"},{"key":"e_1_3_2_206_2","article-title":"Semi-supervised classification with graph convolutional networks","author":"Kipf Thomas N.","year":"2016","unstructured":"Thomas N. Kipf and Max Welling. 2016. Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907 (2016).","journal-title":"arXiv preprint arXiv:1609.02907"},{"key":"e_1_3_2_207_2","article-title":"Examining gender and race bias in two hundred sentiment analysis systems","author":"Kiritchenko Svetlana","year":"2018","unstructured":"Svetlana Kiritchenko and Saif M. Mohammad. 2018. Examining gender and race bias in two hundred sentiment analysis systems. arXiv preprint arXiv:1805.04508 (2018).","journal-title":"arXiv preprint arXiv:1805.04508"},{"key":"e_1_3_2_208_2","first-page":"270","article-title":"Artificial intelligence: Definition, trends, techniques, and cases","volume":"1","author":"Kok Joost N.","year":"2009","unstructured":"Joost N. Kok, Egbert J. Boers, Walter A. Kosters, Peter van der Putten, and Mannes Poel. 2009. Artificial intelligence: Definition, trends, techniques, and cases. Artificial Intelligence 1 (2009), 270\u2013299.","journal-title":"Artificial Intelligence"},{"key":"e_1_3_2_209_2","doi-asserted-by":"publisher","DOI":"10.1145\/3178876.3186133"},{"key":"e_1_3_2_210_2","first-page":"1097","volume-title":"Advances in Neural Information Processing Systems","author":"Krizhevsky Alex","year":"2012","unstructured":"Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. 2012. Imagenet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems. 1097\u20131105."},{"key":"e_1_3_2_211_2","volume-title":"NIPS","author":"Kusner Matt J.","year":"2017","unstructured":"Matt J. Kusner, Joshua R. Loftus, Chris Russell, and Ricardo Silva. 2017. Counterfactual fairness. In NIPS."},{"key":"e_1_3_2_212_2","article-title":"Quantifying the carbon emissions of machine learning","author":"Lacoste Alexandre","year":"2019","unstructured":"Alexandre Lacoste, Alexandra Luccioni, Victor Schmidt, and Thomas Dandres. 2019. Quantifying the carbon emissions of machine learning. arXiv preprint arXiv:1910.09700 (2019).","journal-title":"arXiv preprint arXiv:1910.09700"},{"key":"e_1_3_2_213_2","doi-asserted-by":"publisher","DOI":"10.1287\/mnsc.2018.3093"},{"key":"e_1_3_2_214_2","article-title":"ALBERT: A lite BERT for self-supervised learning of language representations","author":"Lan Zhenzhong","year":"2019","unstructured":"Zhenzhong Lan, Mingda Chen, Sebastian Goodman, Kevin Gimpel, Piyush Sharma, and Radu Soricut. 2019. ALBERT: A lite BERT for self-supervised learning of language representations. arXiv preprint arXiv:1909.11942 (2019).","journal-title":"arXiv preprint arXiv:1909.11942"},{"issue":"10","key":"e_1_3_2_215_2","first-page":"1995","article-title":"Convolutional networks for images, speech, and time series","volume":"3361","author":"LeCun Yann","year":"1995","unstructured":"Yann LeCun, Yoshua Bengio, et\u00a0al. 1995. Convolutional networks for images, speech, and time series. The Handbook of Brain Theory and Neural Networks 3361, 10 (1995), 1995.","journal-title":"The Handbook of Brain Theory and Neural Networks"},{"key":"e_1_3_2_216_2","doi-asserted-by":"publisher","DOI":"10.1145\/3342195.3387532"},{"key":"e_1_3_2_217_2","doi-asserted-by":"publisher","DOI":"10.1518\/hfes.46.1.50.30392"},{"key":"e_1_3_2_218_2","article-title":"Discrete attacks and submodular optimization with applications to text classification","volume":"1812","author":"Lei Qi","year":"2018","unstructured":"Qi Lei, Lingfei Wu, Pin-Yu Chen, Alexandros G. Dimakis, Inderjit S. Dhillon, and Michael Witbrock. 2018. Discrete attacks and submodular optimization with applications to text classification. CoRR abs\/1812.00151 (2018). arxiv:1812.00151http:\/\/arxiv.org\/abs\/1812.00151.","journal-title":"CoRR"},{"key":"e_1_3_2_219_2","doi-asserted-by":"publisher","DOI":"10.1109\/IVS.2011.5940562"},{"key":"e_1_3_2_220_2","doi-asserted-by":"publisher","DOI":"10.1109\/BDCloud-SocialCom-SustainCom.2016.76"},{"key":"e_1_3_2_221_2","volume-title":"Proceedings of Machine Learning and Systems 2020, MLSys 2020, Austin, TX, USA, March 2-4, 2020","author":"Li Tian","year":"2020","unstructured":"Tian Li, Anit Kumar Sahu, Manzil Zaheer, Maziar Sanjabi, Ameet Talwalkar, and Virginia Smith. 2020. Federated optimization in heterogeneous networks. In Proceedings of Machine Learning and Systems 2020, MLSys 2020, Austin, TX, USA, March 2-4, 2020, Inderjit S. Dhillon, Dimitris S. Papailiopoulos, and Vivienne Sze (Eds.). mlsys.org. https:\/\/proceedings.mlsys.org\/book\/316.pdf."},{"key":"e_1_3_2_222_2","volume-title":"International Conference on Learning Representations","author":"Li Xiang","year":"2020","unstructured":"Xiang Li, Kaixuan Huang, Wenhao Yang, Shusen Wang, and Zhihua Zhang. 2020. On the convergence of FedAvg on non-IID data. In International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=HJxNAnVtDS."},{"key":"e_1_3_2_223_2","unstructured":"Xuechen Li Florian Tram\u00e8r Percy Liang and Tatsunori Hashimoto. 2021. Large Language Models Can Be Strong Differentially Private Learners. arxiv:2110.05679 [cs.LG]"},{"key":"e_1_3_2_224_2","unstructured":"Yaxin Li Wei Jin Han Xu and Jiliang Tang. 2020. DeepRobust: A PyTorch Library for Adversarial Attacks and Defenses. arxiv:2005.06149 [cs.LG]"},{"key":"e_1_3_2_225_2","doi-asserted-by":"publisher","DOI":"10.3390\/e23010018"},{"issue":"1","key":"e_1_3_2_226_2","first-page":"25","article-title":"Designing monitoring systems for continuous certification of cloud services: Deriving meta-requirements and design guidelines","volume":"44","author":"Lins Sebastian","year":"2019","unstructured":"Sebastian Lins, Stephan Schneider, Jakub Szefer, Shafeeq Ibraheem, and Ali Sunyaev. 2019. Designing monitoring systems for continuous certification of cloud services: Deriving meta-requirements and design guidelines. Communications of the Association for Information Systems 44, 1 (2019), 25.","journal-title":"Communications of the Association for Information Systems"},{"key":"e_1_3_2_227_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.coling-main.390"},{"key":"e_1_3_2_228_2","article-title":"Say what I want: Towards the dark side of neural dialogue models","author":"Liu Haochen","year":"2019","unstructured":"Haochen Liu, Tyler Derr, Zitao Liu, and Jiliang Tang. 2019. Say what I want: Towards the dark side of neural dialogue models. arXiv preprint arXiv:1909.06044 (2019).","journal-title":"arXiv preprint arXiv:1909.06044"},{"key":"e_1_3_2_229_2","article-title":"The authors matter: Understanding and mitigating implicit bias in deep text classification","author":"Liu Haochen","year":"2021","unstructured":"Haochen Liu, Wei Jin, Hamid Karimi, Zitao Liu, and Jiliang Tang. 2021. The authors matter: Understanding and mitigating implicit bias in deep text classification. arXiv preprint arXiv:2105.02778 (2021).","journal-title":"arXiv preprint arXiv:2105.02778"},{"key":"e_1_3_2_230_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.emnlp-main.64"},{"key":"e_1_3_2_231_2","article-title":"DIG: A turnkey library for diving into graph deep learning research","author":"Liu Meng","year":"2021","unstructured":"Meng Liu, Youzhi Luo, Limei Wang, Yaochen Xie, Hao Yuan, Shurui Gui, Haiyang Yu, Zhao Xu, Jingtun Zhang, Yi Liu, et\u00a0al. 2021. DIG: A turnkey library for diving into graph deep learning research. arXiv preprint arXiv:2103.12608 (2021).","journal-title":"arXiv preprint arXiv:2103.12608"},{"key":"e_1_3_2_232_2","volume-title":"International Conference on Learning Representations","author":"Liu Xiaorui","year":"2021","unstructured":"Xiaorui Liu, Yao Li, Rongrong Wang, Jiliang Tang, and Ming Yan. 2021. Linear convergent decentralized optimization with compression. In International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=84gjULz1t5."},{"key":"e_1_3_2_233_2","doi-asserted-by":"publisher","DOI":"10.1145\/2744769.2744900"},{"key":"e_1_3_2_234_2","article-title":"Calibrated fairness in bandits","author":"Liu Yang","year":"2017","unstructured":"Yang Liu, Goran Radanovic, Christos Dimitrakakis, Debmalya Mandal, and David C. Parkes. 2017. Calibrated fairness in bandits. arXiv preprint arXiv:1707.01875 (2017).","journal-title":"arXiv preprint arXiv:1707.01875"},{"key":"e_1_3_2_235_2","article-title":"Learning to pivot with adversarial networks","author":"Louppe Gilles","year":"2016","unstructured":"Gilles Louppe, Michael Kagan, and Kyle Cranmer. 2016. Learning to pivot with adversarial networks. arXiv preprint arXiv:1611.01046 (2016).","journal-title":"arXiv preprint arXiv:1611.01046"},{"key":"e_1_3_2_236_2","first-page":"189","volume-title":"Logic, Language, and Security","author":"Lu Kaiji","year":"2020","unstructured":"Kaiji Lu, Piotr Mardziel, Fangjing Wu, Preetam Amancharla, and Anupam Datta. 2020. Gender bias in neural natural language processing. In Logic, Language, and Security. Springer, 189\u2013202."},{"key":"e_1_3_2_237_2","first-page":"4765","article-title":"A unified approach to interpreting model predictions","volume":"30","author":"Lundberg Scott M.","year":"2017","unstructured":"Scott M. Lundberg and Su-In Lee. 2017. A unified approach to interpreting model predictions. Advances in Neural Information Processing Systems 30 (2017), 4765\u20134774.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_238_2","article-title":"Parameterized explainer for graph neural network","author":"Luo Dongsheng","year":"2020","unstructured":"Dongsheng Luo, Wei Cheng, Dongkuan Xu, Wenchao Yu, Bo Zong, Haifeng Chen, and Xiang Zhang. 2020. Parameterized explainer for graph neural network. arXiv preprint arXiv:2011.04573 (2020).","journal-title":"arXiv preprint arXiv:2011.04573"},{"key":"e_1_3_2_239_2","doi-asserted-by":"publisher","DOI":"10.1145\/3308558.3313607"},{"key":"e_1_3_2_240_2","unstructured":"Yao Ma Suhang Wang Tyler Derr Lingfei Wu and Jiliang Tang. 2019. Attacking Graph Convolutional Networks via Rewiring. arxiv:1906.03750 [cs.LG]"},{"key":"e_1_3_2_241_2","article-title":"Towards deep learning models resistant to adversarial attacks","author":"Madry Aleksander","year":"2017","unstructured":"Aleksander Madry, Aleksandar Makelov, Ludwig Schmidt, Dimitris Tsipras, and Adrian Vladu. 2017. Towards deep learning models resistant to adversarial attacks. arXiv preprint arXiv:1706.06083 (2017).","journal-title":"arXiv preprint arXiv:1706.06083"},{"key":"e_1_3_2_242_2","article-title":"Collaborative filtering and the missing at random assumption","author":"Marlin Benjamin","year":"2012","unstructured":"Benjamin Marlin, Richard S. Zemel, Sam Roweis, and Malcolm Slaney. 2012. Collaborative filtering and the missing at random assumption. arXiv preprint arXiv:1206.5267 (2012).","journal-title":"arXiv preprint arXiv:1206.5267"},{"key":"e_1_3_2_243_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10551-018-3921-3"},{"key":"e_1_3_2_244_2","article-title":"On measuring social biases in sentence encoders","author":"May Chandler","year":"2019","unstructured":"Chandler May, Alex Wang, Shikha Bordia, Samuel R. Bowman, and Rachel Rudinger. 2019. On measuring social biases in sentence encoders. arXiv preprint arXiv:1903.10561 (2019).","journal-title":"arXiv preprint arXiv:1903.10561"},{"key":"e_1_3_2_245_2","doi-asserted-by":"publisher","DOI":"10.5465\/amr.1995.9508080335"},{"issue":"4","key":"e_1_3_2_246_2","first-page":"12","article-title":"A proposal for the Dartmouth summer research project on artificial intelligence, August 31, 1955","volume":"27","author":"McCarthy John","year":"2006","unstructured":"John McCarthy, Marvin L. Minsky, Nathaniel Rochester, and Claude E. Shannon. 2006. A proposal for the Dartmouth summer research project on artificial intelligence, August 31, 1955. AI Magazine 27, 4 (2006), 12\u201312.","journal-title":"AI Magazine"},{"issue":"1","key":"e_1_3_2_247_2","article-title":"Advances and open problems in federated learning","volume":"14","author":"McMahan H. Brendan","year":"2021","unstructured":"H. Brendan McMahan et\u00a0al. 2021. Advances and open problems in federated learning. Foundations and Trends in Machine Learning 14, 1 (2021).","journal-title":"Foundations and Trends in Machine Learning"},{"key":"e_1_3_2_248_2","doi-asserted-by":"publisher","DOI":"10.1145\/1557019.1557090"},{"key":"e_1_3_2_249_2","doi-asserted-by":"publisher","DOI":"10.1109\/FOCS.2007.66"},{"key":"e_1_3_2_250_2","article-title":"A survey on bias and fairness in machine learning","author":"Mehrabi Ninareh","year":"2019","unstructured":"Ninareh Mehrabi, Fred Morstatter, Nripsuta Saxena, Kristina Lerman, and Aram Galstyan. 2019. A survey on bias and fairness in machine learning. arXiv preprint arXiv:1908.09635 (2019).","journal-title":"arXiv preprint arXiv:1908.09635"},{"key":"e_1_3_2_251_2","article-title":"The cost of fairness in classification","volume":"1705","author":"Menon A.","year":"2017","unstructured":"A. Menon and R. Williamson. 2017. The cost of fairness in classification. ArXiv abs\/1705.09055 (2017).","journal-title":"ArXiv"},{"key":"e_1_3_2_252_2","first-page":"107","volume-title":"Conference on Fairness, Accountability and Transparency","author":"Menon Aditya Krishna","year":"2018","unstructured":"Aditya Krishna Menon and Robert C. Williamson. 2018. The cost of fairness in binary classification. In Conference on Fairness, Accountability and Transparency. PMLR, 107\u2013118."},{"key":"e_1_3_2_253_2","article-title":"Are sixteen heads really better than one?","author":"Michel Paul","year":"2019","unstructured":"Paul Michel, Omer Levy, and Graham Neubig. 2019. Are sixteen heads really better than one?arXiv preprint arXiv:1905.10650 (2019).","journal-title":"arXiv preprint arXiv:1905.10650"},{"key":"e_1_3_2_254_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.artint.2018.07.007"},{"key":"e_1_3_2_255_2","doi-asserted-by":"publisher","DOI":"10.1093\/bib\/bbx044"},{"key":"e_1_3_2_256_2","doi-asserted-by":"publisher","DOI":"10.1145\/2636342"},{"key":"e_1_3_2_257_2","doi-asserted-by":"publisher","DOI":"10.1145\/3287560.3287574"},{"key":"e_1_3_2_258_2","doi-asserted-by":"publisher","DOI":"10.1109\/SP.2017.12"},{"key":"e_1_3_2_259_2","volume-title":"Interpretable Machine Learning","author":"Molnar Christoph","year":"2020","unstructured":"Christoph Molnar. 2020. Interpretable Machine Learning. Lulu.com."},{"key":"e_1_3_2_260_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.17"},{"key":"e_1_3_2_261_2","article-title":"How police technology aggravates racial inequity: A taxonomy of problems and a path forward","volume":"3340898","author":"Moy Laura","year":"2019","unstructured":"Laura Moy. 2019. How police technology aggravates racial inequity: A taxonomy of problems and a path forward. Available at SSRN 3340898 (2019).","journal-title":"Available at SSRN"},{"key":"e_1_3_2_262_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N18-1100"},{"key":"e_1_3_2_263_2","first-page":"1931","article-title":"Fair inference on outcomes","author":"Nabi Razieh","year":"2018","unstructured":"Razieh Nabi and I. Shpitser. 2018. Fair inference on outcomes. Proceedings of the AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence(2018), 1931\u20131940.","journal-title":"Proceedings of the AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence"},{"key":"e_1_3_2_264_2","doi-asserted-by":"publisher","DOI":"10.1145\/2508859.2516751"},{"key":"e_1_3_2_265_2","article-title":"Adversarial over-sensitivity and over-stability strategies for dialogue models","author":"Niu Tong","year":"2018","unstructured":"Tong Niu and Mohit Bansal. 2018. Adversarial over-sensitivity and over-stability strategies for dialogue models. arXiv preprint arXiv:1809.02079 (2018).","journal-title":"arXiv preprint arXiv:1809.02079"},{"key":"e_1_3_2_266_2","doi-asserted-by":"publisher","DOI":"10.1007\/s42979-020-00390-x"},{"key":"e_1_3_2_267_2","article-title":"InterpretML: A unified framework for machine learning interpretability","author":"Nori Harsha","year":"2019","unstructured":"Harsha Nori, Samuel Jenkins, Paul Koch, and Rich Caruana. 2019. InterpretML: A unified framework for machine learning interpretability. arXiv preprint arXiv:1909.09223 (2019).","journal-title":"arXiv preprint arXiv:1909.09223"},{"key":"e_1_3_2_268_2","unstructured":"Future of Life Institute. 2017. Asilomar AI Principles. https:\/\/futureoflife.org\/ai-principles\/ Accessed March 18 2021."},{"key":"e_1_3_2_269_2","doi-asserted-by":"publisher","DOI":"10.3389\/fdata.2019.00013"},{"key":"e_1_3_2_270_2","article-title":"Google photos identified two black people as\u2019 Gorillas\u2019","volume":"1","author":"Pachal Pete","year":"2015","unstructured":"Pete Pachal. 2015. Google photos identified two black people as\u2019 Gorillas\u2019. Mashable, July 1 (2015).","journal-title":"Mashable, July"},{"key":"e_1_3_2_271_2","doi-asserted-by":"publisher","DOI":"10.1145\/3351095.3372843"},{"key":"e_1_3_2_272_2","article-title":"cleverhans v1.0.0: An adversarial machine learning library","author":"Papernot Nicolas","year":"2016","unstructured":"Nicolas Papernot, Ian Goodfellow, Ryan Sheatsley, Reuben Feinman, and Patrick McDaniel. 2016. cleverhans v1.0.0: An adversarial machine learning library. arXiv preprint arXiv:1610.00768 (2016).","journal-title":"arXiv preprint arXiv:1610.00768"},{"key":"e_1_3_2_273_2","doi-asserted-by":"publisher","DOI":"10.1109\/ISPASS.2019.00042"},{"key":"e_1_3_2_274_2","article-title":"Reducing gender bias in abusive language detection","author":"Park Ji Ho","year":"2018","unstructured":"Ji Ho Park, Jamin Shin, and Pascale Fung. 2018. Reducing gender bias in abusive language detection. arXiv preprint arXiv:1808.07231 (2018).","journal-title":"arXiv preprint arXiv:1808.07231"},{"key":"e_1_3_2_275_2","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/E14-1078"},{"key":"e_1_3_2_276_2","volume-title":"NIPS","author":"Pleiss Geoff","year":"2017","unstructured":"Geoff Pleiss, M. Raghavan, Felix Wu, J. Kleinberg, and Kilian Q. Weinberger. 2017. On Fairness and Calibration. In NIPS."},{"key":"e_1_3_2_277_2","article-title":"Large image datasets: A pyrrhic win for computer vision?","author":"Prabhu Vinay Uday","year":"2020","unstructured":"Vinay Uday Prabhu and Abeba Birhane. 2020. Large image datasets: A pyrrhic win for computer vision?arXiv preprint arXiv:2006.16923 (2020).","journal-title":"arXiv preprint arXiv:2006.16923"},{"key":"e_1_3_2_278_2","first-page":"1","article-title":"Assessing gender bias in machine translation: A case study with Google translate","author":"Prates Marcelo O. R.","year":"2019","unstructured":"Marcelo O. R. Prates, Pedro H. Avelar, and Lu\u00eds C. Lamb. 2019. Assessing gender bias in machine translation: A case study with Google translate. Neural Computing and Applications (2019), 1\u201319.","journal-title":"Neural Computing and Applications"},{"key":"e_1_3_2_279_2","doi-asserted-by":"crossref","first-page":"331","DOI":"10.1007\/978-3-030-28954-6_18","volume-title":"Explainable AI: Interpreting, Explaining and Visualizing Deep Learning","author":"Preuer Kristina","year":"2019","unstructured":"Kristina Preuer, G\u00fcnter Klambauer, Friedrich Rippmann, Sepp Hochreiter, and Thomas Unterthiner. 2019. Interpretable deep learning in drug discovery. In Explainable AI: Interpreting, Explaining and Visualizing Deep Learning. Springer, 331\u2013345."},{"key":"e_1_3_2_280_2","article-title":"Toward a better trade-off between performance and fairness with kernel-based distribution matching","author":"Prost Flavien","year":"2019","unstructured":"Flavien Prost, Hai Qian, Qiuwen Chen, Ed H. Chi, Jilin Chen, and Alex Beutel. 2019. Toward a better trade-off between performance and fairness with kernel-based distribution matching. arXiv preprint arXiv:1910.11779 (2019).","journal-title":"arXiv preprint arXiv:1910.11779"},{"key":"e_1_3_2_281_2","doi-asserted-by":"publisher","DOI":"10.1007\/BF00116251"},{"key":"e_1_3_2_282_2","volume-title":"ICML 2021 Workshop on Adversarial Machine Learning","author":"Radiya-Dixit Evani","year":"2021","unstructured":"Evani Radiya-Dixit and Florian Tramer. 2021. Data poisoning won\u2019t save you from facial recognition. In ICML 2021 Workshop on Adversarial Machine Learning."},{"key":"e_1_3_2_283_2","article-title":"Certified defenses against adversarial examples","author":"Raghunathan Aditi","year":"2018","unstructured":"Aditi Raghunathan, Jacob Steinhardt, and Percy Liang. 2018. Certified defenses against adversarial examples. arXiv preprint arXiv:1801.09344 (2018).","journal-title":"arXiv preprint arXiv:1801.09344"},{"key":"e_1_3_2_284_2","doi-asserted-by":"publisher","DOI":"10.1145\/3351095.3372873"},{"key":"e_1_3_2_285_2","doi-asserted-by":"publisher","DOI":"10.1162\/neco_a_00990"},{"key":"e_1_3_2_286_2","doi-asserted-by":"publisher","DOI":"10.1145\/2939672.2939778"},{"key":"e_1_3_2_287_2","first-page":"8093","volume-title":"International Conference on Machine Learning","author":"Rice Leslie","year":"2020","unstructured":"Leslie Rice, Eric Wong, and Zico Kolter. 2020. Overfitting in adversarially robust deep learning. In International Conference on Machine Learning. PMLR, 8093\u20138104."},{"key":"e_1_3_2_288_2","doi-asserted-by":"publisher","DOI":"10.1038\/s41746-020-00323-1"},{"key":"e_1_3_2_289_2","article-title":"A survey of privacy attacks in machine learning","author":"Rigaki Maria","year":"2020","unstructured":"Maria Rigaki and Sebastian Garcia. 2020. A survey of privacy attacks in machine learning. arXiv preprint arXiv:2007.07646 (2020).","journal-title":"arXiv preprint arXiv:2007.07646"},{"key":"e_1_3_2_290_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2013.355"},{"key":"e_1_3_2_291_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ijhcs.2003.09.005"},{"key":"e_1_3_2_292_2","first-page":"375","volume-title":"Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA)","author":"Rodrigues Crefeda Faviola","year":"2018","unstructured":"Crefeda Faviola Rodrigues, Graham Riley, and Mikel Luj\u00e1n. 2018. SyNERGY: An energy measurement and prediction framework for convolutional neural networks on Jetson TX1. In Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA). The Steering Committee of The World Congress in Computer Science, Computer, 375\u2013382."},{"key":"e_1_3_2_293_2","article-title":"Fitnets: Hints for thin deep nets","author":"Romero Adriana","year":"2014","unstructured":"Adriana Romero, Nicolas Ballas, Samira Ebrahimi Kahou, Antoine Chassang, Carlo Gatta, and Yoshua Bengio. 2014. Fitnets: Hints for thin deep nets. arXiv preprint arXiv:1412.6550 (2014).","journal-title":"arXiv preprint arXiv:1412.6550"},{"key":"e_1_3_2_294_2","article-title":"Are face-detection cameras racist","volume":"1","author":"Rose Adam","year":"2010","unstructured":"Adam Rose. 2010. Are face-detection cameras racist. Time Business 1 (2010).","journal-title":"Time Business"},{"key":"e_1_3_2_295_2","doi-asserted-by":"publisher","DOI":"10.1145\/3195970.3196023"},{"key":"e_1_3_2_296_2","unstructured":"Benjamin I. P. Rubinstein and Francesco Alda. 2017. diffpriv: An R package for easy differential privacy. (2017). https:\/\/github.com\/brubinstein\/diffpriv."},{"key":"e_1_3_2_297_2","doi-asserted-by":"publisher","DOI":"10.1038\/s42256-019-0048-x"},{"key":"e_1_3_2_298_2","unstructured":"Stuart Russell and Peter Norvig. 2002. Artificial intelligence: A modern approach. (2002)."},{"issue":"2","key":"e_1_3_2_299_2","first-page":"7","article-title":"Improving smiling detection with race and gender diversity","volume":"1","author":"Ryu Hee Jung","year":"2017","unstructured":"Hee Jung Ryu, Margaret Mitchell, and Hartwig Adam. 2017. Improving smiling detection with race and gender diversity. 1, 2 (2017), 7. arXiv preprint arXiv:1712.00193.","journal-title":"arXiv preprint arXiv:1712.00193"},{"key":"e_1_3_2_300_2","first-page":"8307","volume-title":"International Conference on Machine Learning","author":"Saadatpanah Parsa","year":"2020","unstructured":"Parsa Saadatpanah, Ali Shafahi, and Tom Goldstein. 2020. Adversarial attacks on copyright detection systems. In International Conference on Machine Learning. PMLR, 8307\u20138315."},{"key":"e_1_3_2_301_2","doi-asserted-by":"publisher","DOI":"10.1109\/Trustcom.2015.357"},{"key":"e_1_3_2_302_2","first-page":"229","volume-title":"International Conference on Information Security and Cryptology","author":"Sadeghi Ahmad-Reza","year":"2009","unstructured":"Ahmad-Reza Sadeghi, Thomas Schneider, and Immo Wehrenberg. 2009. Efficient privacy-preserving face recognition. In International Conference on Information Security and Cryptology. Springer, 229\u2013244."},{"key":"e_1_3_2_303_2","doi-asserted-by":"publisher","DOI":"10.1146\/annurev.bioeng.8.061505.095802"},{"key":"e_1_3_2_304_2","article-title":"Aequitas: A bias and fairness audit toolkit","author":"Saleiro Pedro","year":"2018","unstructured":"Pedro Saleiro, Benedict Kuester, Loren Hinkson, Jesse London, Abby Stevens, Ari Anisfeld, Kit T. Rodolfa, and Rayid Ghani. 2018. Aequitas: A bias and fairness audit toolkit. arXiv preprint arXiv:1811.05577 (2018).","journal-title":"arXiv preprint arXiv:1811.05577"},{"key":"e_1_3_2_305_2","first-page":"4349","article-title":"Auditing algorithms: Research methods for detecting discrimination on internet platforms","volume":"22","author":"Sandvig Christian","year":"2014","unstructured":"Christian Sandvig, Kevin Hamilton, Karrie Karahalios, and Cedric Langbort. 2014. Auditing algorithms: Research methods for detecting discrimination on internet platforms. Data and Discrimination: Converting Critical Concerns into Productive Inquiry 22 (2014), 4349\u20134357.","journal-title":"Data and Discrimination: Converting Critical Concerns into Productive Inquiry"},{"key":"e_1_3_2_306_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1163"},{"key":"e_1_3_2_307_2","doi-asserted-by":"publisher","DOI":"10.1145\/3306618.3314248"},{"key":"e_1_3_2_308_2","doi-asserted-by":"publisher","DOI":"10.1109\/TNN.2008.2005605"},{"key":"e_1_3_2_309_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.74"},{"key":"e_1_3_2_310_2","first-page":"6103","volume-title":"Advances in Neural Information Processing Systems","author":"Shafahi Ali","year":"2018","unstructured":"Ali Shafahi, W. Ronny Huang, Mahyar Najibi, Octavian Suciu, Christoph Studer, Tudor Dumitras, and Tom Goldstein. 2018. Poison frogs! Targeted clean-label poisoning attacks on neural networks. In Advances in Neural Information Processing Systems. 6103\u20136113."},{"key":"e_1_3_2_311_2","article-title":"Adversarial training for free!","author":"Shafahi Ali","year":"2019","unstructured":"Ali Shafahi, Mahyar Najibi, Amin Ghiasi, Zheng Xu, John Dickerson, Christoph Studer, Larry S. Davis, Gavin Taylor, and Tom Goldstein. 2019. Adversarial training for free!arXiv preprint arXiv:1904.12843 (2019).","journal-title":"arXiv preprint arXiv:1904.12843"},{"key":"e_1_3_2_312_2","article-title":"Predictive biases in natural language processing models: A conceptual framework and overview","author":"Shah Deven","year":"2019","unstructured":"Deven Shah, H. Andrew Schwartz, and Dirk Hovy. 2019. Predictive biases in natural language processing models: A conceptual framework and overview. arXiv preprint arXiv:1912.11078 (2019).","journal-title":"arXiv preprint arXiv:1912.11078"},{"key":"e_1_3_2_313_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.468"},{"key":"e_1_3_2_314_2","first-page":"1589","volume-title":"29th \\( \\lbrace \\) USENIX \\( \\rbrace \\) Security Symposium ( \\( \\lbrace \\) USENIX \\( \\rbrace \\) Security 20)","author":"Shan Shawn","year":"2020","unstructured":"Shawn Shan, Emily Wenger, Jiayun Zhang, Huiying Li, Haitao Zheng, and Ben Y. Zhao. 2020. Fawkes: Protecting privacy against unauthorized deep learning models. In 29th \\( \\lbrace \\) USENIX \\( \\rbrace \\) Security Symposium ( \\( \\lbrace \\) USENIX \\( \\rbrace \\) Security 20). 1589\u20131604."},{"key":"e_1_3_2_315_2","first-page":"2217","volume-title":"International Conference on Machine Learning","author":"Shang Wenling","year":"2016","unstructured":"Wenling Shang, Kihyuk Sohn, Diogo Almeida, and Honglak Lee. 2016. Understanding and improving convolutional neural networks via concatenated rectified linear units. In International Conference on Machine Learning. PMLR, 2217\u20132225."},{"key":"e_1_3_2_316_2","doi-asserted-by":"publisher","DOI":"10.1038\/s41598-020-69250-1"},{"key":"e_1_3_2_317_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D19-1339"},{"key":"e_1_3_2_318_2","doi-asserted-by":"publisher","DOI":"10.1109\/SP.2017.41"},{"key":"e_1_3_2_319_2","first-page":"14","volume-title":"International Conference on Emerging Trends in Information and Communication Security","author":"Shyong K.","year":"2006","unstructured":"K. Shyong, Dan Frankowski, John Riedl, et\u00a0al. 2006. Do you trust your recommendations? An exploration of security and privacy issues in recommender systems. In International Conference on Emerging Trends in Information and Communication Security. Springer, 14\u201329."},{"key":"e_1_3_2_320_2","article-title":"Deep inside convolutional networks: Visualising image classification models and saliency maps","author":"Simonyan Karen","year":"2013","unstructured":"Karen Simonyan, Andrea Vedaldi, and Andrew Zisserman. 2013. Deep inside convolutional networks: Visualising image classification models and saliency maps. arXiv preprint arXiv:1312.6034 (2013).","journal-title":"arXiv preprint arXiv:1312.6034"},{"key":"e_1_3_2_321_2","article-title":"Darts: Deceiving autonomous cars with toxic signs","author":"Sitawarin Chawin","year":"2018","unstructured":"Chawin Sitawarin, Arjun Nitin Bhagoji, Arsalan Mosenia, Mung Chiang, and Prateek Mittal. 2018. Darts: Deceiving autonomous cars with toxic signs. arXiv preprint arXiv:1802.06430 (2018).","journal-title":"arXiv preprint arXiv:1802.06430"},{"key":"e_1_3_2_322_2","doi-asserted-by":"publisher","DOI":"10.9785\/cri-2019-200402"},{"key":"e_1_3_2_323_2","doi-asserted-by":"publisher","DOI":"10.1145\/3319535.3354211"},{"key":"e_1_3_2_324_2","doi-asserted-by":"publisher","DOI":"10.1109\/GlobalSIP.2013.6736861"},{"key":"e_1_3_2_325_2","doi-asserted-by":"publisher","DOI":"10.1145\/3240765.3240796"},{"key":"e_1_3_2_326_2","article-title":"Evaluating gender bias in machine translation","author":"Stanovsky Gabriel","year":"2019","unstructured":"Gabriel Stanovsky, Noah A. Smith, and Luke Zettlemoyer. 2019. Evaluating gender bias in machine translation. arXiv preprint arXiv:1906.00591 (2019).","journal-title":"arXiv preprint arXiv:1906.00591"},{"key":"e_1_3_2_327_2","article-title":"Energy and policy considerations for deep learning in NLP","author":"Strubell Emma","year":"2019","unstructured":"Emma Strubell, Ananya Ganesh, and Andrew McCallum. 2019. Energy and policy considerations for deep learning in NLP. arXiv preprint arXiv:1906.02243 (2019).","journal-title":"arXiv preprint arXiv:1906.02243"},{"key":"e_1_3_2_328_2","article-title":"Patient knowledge distillation for BERT model compression","author":"Sun Siqi","year":"2019","unstructured":"Siqi Sun, Yu Cheng, Zhe Gan, and Jingjing Liu. 2019. Patient knowledge distillation for BERT model compression. arXiv preprint arXiv:1908.09355 (2019).","journal-title":"arXiv preprint arXiv:1908.09355"},{"key":"e_1_3_2_329_2","doi-asserted-by":"publisher","DOI":"10.1109\/JPROC.2017.2761740"},{"key":"e_1_3_2_330_2","article-title":"Intriguing properties of neural networks","author":"Szegedy Christian","year":"2013","unstructured":"Christian Szegedy, Wojciech Zaremba, Ilya Sutskever, Joan Bruna, Dumitru Erhan, Ian Goodfellow, and Rob Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 (2013).","journal-title":"arXiv preprint arXiv:1312.6199"},{"key":"e_1_3_2_331_2","article-title":"Distilling task-specific knowledge from BERT into simple neural networks","author":"Tang Raphael","year":"2019","unstructured":"Raphael Tang, Yao Lu, Linqing Liu, Lili Mou, Olga Vechtomova, and Jimmy Lin. 2019. Distilling task-specific knowledge from BERT into simple neural networks. arXiv preprint arXiv:1903.12136 (2019).","journal-title":"arXiv preprint arXiv:1903.12136"},{"key":"e_1_3_2_332_2","article-title":"Better safe than sorry: Preventing delusive adversaries with adversarial training","volume":"34","author":"Tao Lue","year":"2021","unstructured":"Lue Tao, Lei Feng, Jinfeng Yi, Sheng-Jun Huang, and Songcan Chen. 2021. Better safe than sorry: Preventing delusive adversaries with adversarial training. Advances in Neural Information Processing Systems 34 (2021).","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_333_2","article-title":"Google\u2019s speech recognition has a gender bias","volume":"12","author":"Tatman R.","year":"2016","unstructured":"R. Tatman. 2016. Google\u2019s speech recognition has a gender bias. Making Noise and Hearing Things 12 (2016).","journal-title":"Making Noise and Hearing Things"},{"key":"e_1_3_2_334_2","first-page":"1","article-title":"Trustworthy artificial intelligence","author":"Thiebes Scott","year":"2020","unstructured":"Scott Thiebes, Sebastian Lins, and Ali Sunyaev. 2020. Trustworthy artificial intelligence. Electronic Markets (2020), 1\u201318.","journal-title":"Electronic Markets"},{"key":"e_1_3_2_335_2","article-title":"A survey on explainable artificial intelligence (XAI): Toward medical XAI","author":"Tjoa Erico","year":"2020","unstructured":"Erico Tjoa and Cuntai Guan. 2020. A survey on explainable artificial intelligence (XAI): Toward medical XAI. IEEE Transactions on Neural Networks and Learning Systems (2020).","journal-title":"IEEE Transactions on Neural Networks and Learning Systems"},{"key":"e_1_3_2_336_2","doi-asserted-by":"publisher","DOI":"10.1109\/EuroSP.2017.29"},{"key":"e_1_3_2_337_2","volume-title":"International Conference on Learning Representations","author":"Tramer Florian","year":"2021","unstructured":"Florian Tramer and Dan Boneh. 2021. Differentially private learning needs better features (or much more data). In International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=YTWGvpFOQD-."},{"key":"e_1_3_2_338_2","first-page":"601","volume-title":"25th \\( \\lbrace \\) USENIX \\( \\rbrace \\) Security Symposium ( \\( \\lbrace \\) USENIX \\( \\rbrace \\) Security 16)","author":"Tram\u00e8r Florian","year":"2016","unstructured":"Florian Tram\u00e8r, Fan Zhang, Ari Juels, Michael K. Reiter, and Thomas Ristenpart. 2016. Stealing machine learning models via prediction apis. In 25th \\( \\lbrace \\) USENIX \\( \\rbrace \\) Security Symposium ( \\( \\lbrace \\) USENIX \\( \\rbrace \\) Security 16). 601\u2013618."},{"key":"e_1_3_2_339_2","article-title":"Robustness may be at odds with accuracy","author":"Tsipras Dimitris","year":"2018","unstructured":"Dimitris Tsipras, Shibani Santurkar, Logan Engstrom, Alexander Turner, and Aleksander Madry. 2018. Robustness may be at odds with accuracy. arXiv preprint arXiv:1805.12152 (2018).","journal-title":"arXiv preprint arXiv:1805.12152"},{"key":"e_1_3_2_340_2","article-title":"Big questions for social media big data: Representativeness, validity and other methodological pitfalls","author":"Tufekci Zeynep","year":"2014","unstructured":"Zeynep Tufekci. 2014. Big questions for social media big data: Representativeness, validity and other methodological pitfalls. arXiv preprint arXiv:1403.7400 (2014).","journal-title":"arXiv preprint arXiv:1403.7400"},{"key":"e_1_3_2_341_2","doi-asserted-by":"publisher","DOI":"10.1038\/s41573-019-0024-5"},{"key":"e_1_3_2_342_2","article-title":"Getting gender right in neural machine translation","author":"Vanmassenhove Eva","year":"2019","unstructured":"Eva Vanmassenhove, Christian Hardmeier, and Andy Way. 2019. Getting gender right in neural machine translation. arXiv preprint arXiv:1909.05088 (2019).","journal-title":"arXiv preprint arXiv:1909.05088"},{"key":"e_1_3_2_343_2","first-page":"841","article-title":"Counterfactual explanations without opening the black box: Automated decisions and the GDPR","volume":"31","author":"Wachter Sandra","year":"2017","unstructured":"Sandra Wachter, Brent Mittelstadt, and Chris Russell. 2017. Counterfactual explanations without opening the black box: Automated decisions and the GDPR. Harv. JL & Tech. 31 (2017), 841.","journal-title":"Harv. JL & Tech."},{"key":"e_1_3_2_344_2","article-title":"FinPrivacy: A privacy-preserving mechanism for fingerprint identification","author":"Wang Tao","year":"2020","unstructured":"Tao Wang, Zhigao Zheng, A. Bashir, Alireza Jolfaei, and Yanyan Xu. 2020. FinPrivacy: A privacy-preserving mechanism for fingerprint identification. ACM Transactions on Internet Technology (2020).","journal-title":"ACM Transactions on Internet Technology"},{"key":"e_1_3_2_345_2","doi-asserted-by":"publisher","DOI":"10.1109\/CCGrid49817.2020.00-15"},{"key":"e_1_3_2_346_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00894"},{"key":"e_1_3_2_347_2","doi-asserted-by":"publisher","DOI":"10.1080\/01621459.1965.10480775"},{"key":"e_1_3_2_348_2","doi-asserted-by":"publisher","DOI":"10.1109\/TIFS.2020.2988575"},{"key":"e_1_3_2_349_2","doi-asserted-by":"publisher","DOI":"10.1145\/3306618.3314289"},{"key":"e_1_3_2_350_2","doi-asserted-by":"publisher","DOI":"10.1145\/3351095.3372833"},{"key":"e_1_3_2_351_2","doi-asserted-by":"publisher","DOI":"10.29297\/orbit.v1i2.49"},{"key":"e_1_3_2_352_2","doi-asserted-by":"publisher","DOI":"10.1109\/4235.585893"},{"key":"e_1_3_2_353_2","first-page":"5286","volume-title":"International Conference on Machine Learning","author":"Wong Eric","year":"2018","unstructured":"Eric Wong and Zico Kolter. 2018. Provable defenses against adversarial examples via the convex outer adversarial polytope. In International Conference on Machine Learning. PMLR, 5286\u20135295."},{"key":"e_1_3_2_354_2","unstructured":"Eric Wong Leslie Rice and J. Zico Kolter. 2020. Fast is Better Than Free: Revisiting Adversarial Training. arxiv:2001.03994 [cs.LG]"},{"key":"e_1_3_2_355_2","first-page":"6808","volume-title":"International Conference on Machine Learning","author":"Wong Eric","year":"2019","unstructured":"Eric Wong, Frank Schmidt, and Zico Kolter. 2019. Wasserstein adversarial examples via projected sinkhorn iterations. In International Conference on Machine Learning. PMLR, 6808\u20136817."},{"key":"e_1_3_2_356_2","article-title":"Adversarial weight perturbation helps robust generalization","volume":"33","author":"Wu Dongxian","year":"2020","unstructured":"Dongxian Wu, Shu-Tao Xia, and Yisen Wang. 2020. Adversarial weight perturbation helps robust generalization. Advances in Neural Information Processing Systems 33 (2020).","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_357_2","first-page":"10377","volume-title":"International Conference on Machine Learning","author":"Wu Kaiwen","year":"2020","unstructured":"Kaiwen Wu, Allen Wang, and Yaoliang Yu. 2020. Stronger and faster Wasserstein adversarial attacks. In International Conference on Machine Learning. PMLR, 10377\u201310387."},{"key":"e_1_3_2_358_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICCAD45719.2019.8942149"},{"key":"e_1_3_2_359_2","doi-asserted-by":"publisher","DOI":"10.1109\/BigData.2018.8622525"},{"key":"e_1_3_2_360_2","article-title":"To be robust or to be fair: Towards fairness in adversarial training","author":"Xu Han","year":"2020","unstructured":"Han Xu, Xiaorui Liu, Yaxin Li, and Jiliang Tang. 2020. To be robust or to be fair: Towards fairness in adversarial training. arXiv preprint arXiv:2010.06121 (2020).","journal-title":"arXiv preprint arXiv:2010.06121"},{"key":"e_1_3_2_361_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11633-019-1211-x"},{"key":"e_1_3_2_362_2","doi-asserted-by":"publisher","DOI":"10.1007\/s41666-020-00082-4"},{"key":"e_1_3_2_363_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.naacl-main.235"},{"key":"e_1_3_2_364_2","doi-asserted-by":"publisher","DOI":"10.1145\/3453688.3461752"},{"key":"e_1_3_2_365_2","doi-asserted-by":"publisher","DOI":"10.1145\/3298981"},{"key":"e_1_3_2_366_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.643"},{"key":"e_1_3_2_367_2","doi-asserted-by":"publisher","DOI":"10.1109\/SFCS.1982.38"},{"key":"e_1_3_2_368_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.winlp-1.27"},{"key":"e_1_3_2_369_2","first-page":"9240","article-title":"GNNExplainer: Generating explanations for graph neural networks","volume":"32","author":"Ying Rex","year":"2019","unstructured":"Rex Ying, Dylan Bourgeois, Jiaxuan You, Marinka Zitnik, and Jure Leskovec. 2019. GNNExplainer: Generating explanations for graph neural networks. Advances in Neural Information Processing Systems 32 (2019), 9240.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_370_2","unstructured":"Da Yu Saurabh Naik Arturs Backurs Sivakanth Gopi Huseyin A. Inan Gautam Kamath Janardhan Kulkarni Yin Tat Lee Andre Manoel Lukas Wutschitz Sergey Yekhanin and Huishuai Zhang. 2021. Differentially Private Fine-tuning of Language Models. arxiv:2110.06500 [cs.LG]"},{"key":"e_1_3_2_371_2","volume-title":"International Conference on Learning Representations","author":"Yu Da","year":"2021","unstructured":"Da Yu, Huishuai Zhang, Wei Chen, and Tie-Yan Liu. 2021. Do not let privacy overbill utility: Gradient embedding perturbation for private learning. In International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=7aogOj_VYO0."},{"key":"e_1_3_2_372_2","series-title":"Proceedings of the 38th International Conference on Machine Learning, ICML 2021, 18-24 July 2021, Virtual Event","first-page":"12208","volume":"139","author":"Yu Da","year":"2021","unstructured":"Da Yu, Huishuai Zhang, Wei Chen, Jian Yin, and Tie-Yan Liu. 2021. Large scale private learning via low-rank reparametrization. In Proceedings of the 38th International Conference on Machine Learning, ICML 2021, 18-24 July 2021, Virtual Event(Proceedings of Machine Learning Research, Vol. 139), Marina Meila and Tong Zhang (Eds.). PMLR, 12208\u201312218. http:\/\/proceedings.mlr.press\/v139\/yu21f.html."},{"key":"e_1_3_2_373_2","unstructured":"Da Yu Huishuai Zhang Wei Chen Jian Yin and Tie-Yan Liu. 2021. Indiscriminate Poisoning Attacks Are Shortcuts. arxiv:2111.00898 [cs.LG]"},{"key":"e_1_3_2_374_2","article-title":"Building ethics into artificial intelligence","author":"Yu Han","year":"2018","unstructured":"Han Yu, Zhiqi Shen, Chunyan Miao, Cyril Leung, Victor R. Lesser, and Qiang Yang. 2018. Building ethics into artificial intelligence. arXiv preprint arXiv:1812.02953 (2018).","journal-title":"arXiv preprint arXiv:1812.02953"},{"key":"e_1_3_2_375_2","doi-asserted-by":"publisher","DOI":"10.1145\/3394486.3403085"},{"key":"e_1_3_2_376_2","article-title":"Explainability in graph neural networks: A taxonomic survey","author":"Yuan Hao","year":"2020","unstructured":"Hao Yuan, Haiyang Yu, Shurui Gui, and Shuiwang Ji. 2020. Explainability in graph neural networks: A taxonomic survey. arXiv preprint arXiv:2012.15445 (2020).","journal-title":"arXiv preprint arXiv:2012.15445"},{"key":"e_1_3_2_377_2","doi-asserted-by":"publisher","DOI":"10.1145\/3038912.3052660"},{"key":"e_1_3_2_378_2","article-title":"Recurrent neural network regularization","author":"Zaremba Wojciech","year":"2014","unstructured":"Wojciech Zaremba, Ilya Sutskever, and Oriol Vinyals. 2014. Recurrent neural network regularization. arXiv preprint arXiv:1409.2329 (2014).","journal-title":"arXiv preprint arXiv:1409.2329"},{"key":"e_1_3_2_379_2","volume-title":"ICML","author":"Zemel R.","year":"2013","unstructured":"R. Zemel, Ledell Yu Wu, Kevin Swersky, T. Pitassi, and C. Dwork. 2013. Learning fair representations. In ICML."},{"key":"e_1_3_2_380_2","doi-asserted-by":"publisher","DOI":"10.1200\/CCI.19.00047"},{"key":"e_1_3_2_381_2","doi-asserted-by":"publisher","DOI":"10.1145\/3278721.3278779"},{"key":"e_1_3_2_382_2","article-title":"Demographics should not be the reason of toxicity: Mitigating discrimination in text classifications with instance weighting","author":"Zhang Guanhua","year":"2020","unstructured":"Guanhua Zhang, Bing Bai, Junqi Zhang, Kun Bai, Conghui Zhu, and Tiejun Zhao. 2020. Demographics should not be the reason of toxicity: Mitigating discrimination in text classifications with instance weighting. arXiv preprint arXiv:2004.14088 (2020).","journal-title":"arXiv preprint arXiv:2004.14088"},{"key":"e_1_3_2_383_2","first-page":"7472","volume-title":"International Conference on Machine Learning","author":"Zhang Hongyang","year":"2019","unstructured":"Hongyang Zhang, Yaodong Yu, Jiantao Jiao, Eric Xing, Laurent El Ghaoui, and Michael Jordan. 2019. Theoretically principled trade-off between robustness and accuracy. In International Conference on Machine Learning. PMLR, 7472\u20137482."},{"key":"e_1_3_2_384_2","volume-title":"IJCAI","author":"Zhang L.","year":"2017","unstructured":"L. Zhang, Yongkai Wu, and Xintao Wu. 2017. A causal framework for discovering and removing direct and indirect discrimination. In IJCAI."},{"key":"e_1_3_2_385_2","article-title":"Graph embedding for recommendation against attribute inference attacks","author":"Zhang Shijie","year":"2021","unstructured":"Shijie Zhang, Hongzhi Yin, Tong Chen, Zi Huang, Lizhen Cui, and Xiangliang Zhang. 2021. Graph embedding for recommendation against attribute inference attacks. arXiv preprint arXiv:2101.12549 (2021).","journal-title":"arXiv preprint arXiv:2101.12549"},{"issue":"3","key":"e_1_3_2_386_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3374217","article-title":"Adversarial attacks on deep-learning models in natural language processing: A survey","volume":"11","author":"Zhang Wei Emma","year":"2020","unstructured":"Wei Emma Zhang, Quan Z. Sheng, Ahoud Alhazmi, and Chenliang Li. 2020. Adversarial attacks on deep-learning models in natural language processing: A survey. ACM Transactions on Intelligent Systems and Technology (TIST) 11, 3 (2020), 1\u201341.","journal-title":"ACM Transactions on Intelligent Systems and Technology (TIST)"},{"key":"e_1_3_2_387_2","volume-title":"29th \\( \\lbrace \\) USENIX \\( \\rbrace \\) Security Symposium ( \\( \\lbrace \\) USENIX \\( \\rbrace \\) Security 20)","author":"Zhang Xinyang","year":"2020","unstructured":"Xinyang Zhang, Ningfei Wang, Hua Shen, Shouling Ji, Xiapu Luo, and Ting Wang. 2020. Interpretable deep learning under fire. In 29th \\( \\lbrace \\) USENIX \\( \\rbrace \\) Security Symposium ( \\( \\lbrace \\) USENIX \\( \\rbrace \\) Security 20)."},{"key":"e_1_3_2_388_2","article-title":"Explainable recommendation: A survey and new perspectives","author":"Zhang Yongfeng","year":"2018","unstructured":"Yongfeng Zhang and Xu Chen. 2018. Explainable recommendation: A survey and new perspectives. arXiv preprint arXiv:1804.11192 (2018).","journal-title":"arXiv preprint arXiv:1804.11192"},{"key":"e_1_3_2_389_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00033"},{"key":"e_1_3_2_390_2","doi-asserted-by":"publisher","DOI":"10.1145\/2600428.2609579"},{"key":"e_1_3_2_391_2","article-title":"Identifying significant predictive bias in classifiers","author":"Zhang Zhe","year":"2016","unstructured":"Zhe Zhang and Daniel B. Neill. 2016. Identifying significant predictive bias in classifiers. arXiv preprint arXiv:1611.08292 (2016).","journal-title":"arXiv preprint arXiv:1611.08292"},{"key":"e_1_3_2_392_2","article-title":"IDLG: Improved deep leakage from gradients","author":"Zhao Bo","year":"2020","unstructured":"Bo Zhao, Konda Reddy Mopuri, and Hakan Bilen. 2020. IDLG: Improved deep leakage from gradients. arXiv preprint arXiv:2001.02610 (2020).","journal-title":"arXiv preprint arXiv:2001.02610"},{"key":"e_1_3_2_393_2","article-title":"Gender bias in contextualized word embeddings","author":"Zhao Jieyu","year":"2019","unstructured":"Jieyu Zhao, Tianlu Wang, Mark Yatskar, Ryan Cotterell, Vicente Ordonez, and Kai-Wei Chang. 2019. Gender bias in contextualized word embeddings. arXiv preprint arXiv:1904.03310 (2019).","journal-title":"arXiv preprint arXiv:1904.03310"},{"key":"e_1_3_2_394_2","article-title":"Men also like shopping: Reducing gender bias amplification using corpus-level constraints","author":"Zhao Jieyu","year":"2017","unstructured":"Jieyu Zhao, Tianlu Wang, Mark Yatskar, Vicente Ordonez, and Kai-Wei Chang. 2017. Men also like shopping: Reducing gender bias amplification using corpus-level constraints. arXiv preprint arXiv:1707.09457 (2017).","journal-title":"arXiv preprint arXiv:1707.09457"},{"key":"e_1_3_2_395_2","article-title":"Gender bias in coreference resolution: Evaluation and debiasing methods","author":"Zhao Jieyu","year":"2018","unstructured":"Jieyu Zhao, Tianlu Wang, Mark Yatskar, Vicente Ordonez, and Kai-Wei Chang. 2018. Gender bias in coreference resolution: Evaluation and debiasing methods. arXiv preprint arXiv:1804.06876 (2018).","journal-title":"arXiv preprint arXiv:1804.06876"},{"key":"e_1_3_2_396_2","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.319"},{"key":"e_1_3_2_397_2","volume-title":"International Conference on Learning Representations","author":"Zhou Yingxue","year":"2021","unstructured":"Yingxue Zhou, Steven Wu, and Arindam Banerjee. 2021. Bypassing the ambient dimension: Private {SGD} with gradient subspace identification. In International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=7dpmlkBuJFC."},{"key":"e_1_3_2_398_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-63076-8_2"},{"key":"e_1_3_2_399_2","article-title":"A survey on measuring indirect discrimination in machine learning","author":"Zliobaite Indre","year":"2015","unstructured":"Indre Zliobaite. 2015. A survey on measuring indirect discrimination in machine learning. arXiv preprint arXiv:1511.00148 (2015).","journal-title":"arXiv preprint arXiv:1511.00148"},{"key":"e_1_3_2_400_2","doi-asserted-by":"crossref","unstructured":"James Zou and Londa Schiebinger. 2018. AI Can Be Sexist and Racist\u2013it\u2019s Time to Make it Fair.","DOI":"10.1038\/d41586-018-05707-8"},{"key":"e_1_3_2_401_2","doi-asserted-by":"publisher","DOI":"10.1145\/3219819.3220078"},{"key":"e_1_3_2_402_2","article-title":"Adversarial attacks on graph neural networks via meta learning","author":"Z\u00fcgner Daniel","year":"2019","unstructured":"Daniel Z\u00fcgner and Stephan G\u00fcnnemann. 2019. Adversarial attacks on graph neural networks via meta learning. arXiv preprint arXiv:1902.08412 (2019).","journal-title":"arXiv preprint arXiv:1902.08412"},{"key":"e_1_3_2_403_2","doi-asserted-by":"publisher","DOI":"10.1145\/3219819.3220078"}],"container-title":["ACM Transactions on Intelligent Systems and Technology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3546872","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3546872","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:00:41Z","timestamp":1750186841000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3546872"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,11,9]]},"references-count":402,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2023,2,28]]}},"alternative-id":["10.1145\/3546872"],"URL":"https:\/\/doi.org\/10.1145\/3546872","relation":{},"ISSN":["2157-6904","2157-6912"],"issn-type":[{"value":"2157-6904","type":"print"},{"value":"2157-6912","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,11,9]]},"assertion":[{"value":"2021-09-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-06-07","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-11-09","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}