{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,21]],"date-time":"2026-02-21T09:13:11Z","timestamp":1771665191614,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":5,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,8,14]],"date-time":"2021-08-14T00:00:00Z","timestamp":1628899200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,8,14]]},"DOI":"10.1145\/3447548.3469468","type":"proceedings-article","created":{"date-parts":[[2021,8,12]],"date-time":"2021-08-12T06:13:10Z","timestamp":1628748790000},"page":"4147-4148","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":6,"title":["2nd International Workshop on Data Quality Assessment for Machine Learning"],"prefix":"10.1145","author":[{"given":"Hima","family":"Patel","sequence":"first","affiliation":[{"name":"IBM Research India, Bengaluru, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Fuyuki","family":"Ishikawa","sequence":"additional","affiliation":[{"name":"National Institute of Informatics, Tokyo, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Laure","family":"Berti-Equille","sequence":"additional","affiliation":[{"name":"IRD, Montpellier, France"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Nitin","family":"Gupta","sequence":"additional","affiliation":[{"name":"IBM Research India, Delhi, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sameep","family":"Mehta","sequence":"additional","affiliation":[{"name":"IBM Research India, Bengaluru, 
India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Satoshi","family":"Masuda","sequence":"additional","affiliation":[{"name":"IBM Research Japan, Tokyo, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shashank","family":"Mujumdar","sequence":"additional","affiliation":[{"name":"IBM Research India, Delhi, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shazia","family":"Afzal","sequence":"additional","affiliation":[{"name":"IBM Research India, Delhi, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Srikanta","family":"Bedathur","sequence":"additional","affiliation":[{"name":"Indian Institute of Technology, Delhi, Delhi, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yasuharu","family":"Nishi","sequence":"additional","affiliation":[{"name":"University of Electro-Communications, Tokyo, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2021,8,14]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"1st International Workshop on Data Assessment and Readiness for AI. In PAKDD (Workshops).","author":"Bandyopadhyay Bortik","year":"2021","unstructured":"Bortik Bandyopadhyay, Sambaran Bandyopadhyay, Srikanta Bedathur, Nitin Gupta, Sameep Mehta, Shashank Mujumdar, Srinivasan Parthasarathy, and Hima Patel. 2021. 1st International Workshop on Data Assessment and Readiness for AI. 
In PAKDD (Workshops)."},{"key":"e_1_3_2_1_2_1","volume-title":"Ruhi Sharma Mittal, and Vitobha Munigala","author":"Jain Abhinav","year":"2020","unstructured":"Abhinav Jain, Hima Patel, Lokesh Nagalapatti, Nitin Gupta, Sameep Mehta, Shanmukha Guttula, Shashank Mujumdar, Shazia Afzal, Ruhi Sharma Mittal, and Vitobha Munigala. 2020. Overview and Importance of Data Quality for Machine Learning Tasks. In KDD."},{"key":"e_1_3_2_1_3_1","volume-title":"Vitobha Munigala, Naveen Panwar, Sambaran Bandyopadhyay, and Satoshi Musda.","author":"Patel Hima","year":"2021","unstructured":"Hima Patel, Nitin Gupta, Sameep Mehta, Shanmukha Guttula, Shashank Mujumdar, Shazia Afzal, Ruhi Sharma Mittal, Vitobha Munigala, Naveen Panwar, Sambaran Bandyopadhyay, and Satoshi Musda. 2021. Data Quality for Machine Learning Tasks. In KDD."},{"key":"e_1_3_2_1_4_1","volume-title":"Dataset cartography: Mapping and diagnosing datasets with training dynamics. arXiv","author":"Swayamdipta Swabha","year":"2020","unstructured":"Swabha Swayamdipta, Roy Schwartz, Nicholas Lourie, Yizhong Wang, Hannaneh Hajishirzi, Noah A Smith, and Yejin Choi. 2020. Dataset cartography: Mapping and diagnosing datasets with training dynamics. 
arXiv (2020)."},{"key":"e_1_3_2_1_5_1","unstructured":"Jinsung Yoon Sercan Arik and Tomas Pfister. 2020. Data valuation using reinforcement learning. In ICML."}],"event":{"name":"KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining","location":"Virtual Event Singapore","acronym":"KDD '21","sponsor":["SIGMOD ACM Special Interest Group on Management of Data","SIGKDD ACM Special Interest Group on Knowledge Discovery in Data"]},"container-title":["Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3447548.3469468","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3447548.3469468","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:18:37Z","timestamp":1750191517000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3447548.3469468"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,8,14]]},"references-count":5,"alternative-id":["10.1145\/3447548.3469468","10.1145\/3447548"],"URL":"https:\/\/doi.org\/10.1145\/3447548.3469468","relation":{},"subject":[],"published":{"date-parts":[[2021,8,14]]},"assertion":[{"value":"2021-08-14","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}