{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,10]],"date-time":"2026-02-10T19:40:55Z","timestamp":1770752455603,"version":"3.50.0"},"publisher-location":"New York, NY, USA","reference-count":43,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,10,10]],"date-time":"2022-10-10T00:00:00Z","timestamp":1665360000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,10,10]]},"DOI":"10.1145\/3503161.3548340","type":"proceedings-article","created":{"date-parts":[[2022,10,10]],"date-time":"2022-10-10T15:43:12Z","timestamp":1665416592000},"page":"4175-4184","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":9,"title":["A Baseline for Detecting Out-of-Distribution Examples in Image Captioning"],"prefix":"10.1145","author":[{"given":"Gal","family":"Shalev","sequence":"first","affiliation":[{"name":"Bar-Ilan University, Ramat Gan, Israel"}]},{"given":"Gabi","family":"Shalev","sequence":"additional","affiliation":[{"name":"Bar-Ilan University, Ramat Gan, Israel"}]},{"given":"Joseph","family":"Keshet","sequence":"additional","affiliation":[{"name":"Technion, Haifa, Israel"}]}],"member":"320","published-online":{"date-parts":[[2022,10,10]]},"reference":[{"key":"e_1_3_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00904"},{"key":"e_1_3_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00636"},{"key":"e_1_3_2_2_3_1","first-page":"99","article-title":"On a measure of divergence between two statistical populations defined by their probability distributions","volume":"35","author":"Bhattacharyya Anil","year":"1943","unstructured":"Anil Bhattacharyya . 1943 . On a measure of divergence between two statistical populations defined by their probability distributions . Bull. Calcutta Math. Soc. 35 (1943), 99 -- 109 . Anil Bhattacharyya. 1943. On a measure of divergence between two statistical populations defined by their probability distributions. Bull. Calcutta Math. Soc. 35 (1943), 99--109.","journal-title":"Bull. Calcutta Math. Soc."},{"key":"e_1_3_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01059"},{"key":"e_1_3_2_2_5_1","volume-title":"Real-Time Self-Driving Car Navigation Using Deep Neural Network. In 2018 4th International Conference on Green Technology and Sustainable Development (GTSD). IEEE, 7--12","author":"Do Truong-Dong","year":"2018","unstructured":"Truong-Dong Do , Minh-Thien Duong , Quoc-Vu Dang , and My-Ha Le . 2018 . Real-Time Self-Driving Car Navigation Using Deep Neural Network. In 2018 4th International Conference on Green Technology and Sustainable Development (GTSD). IEEE, 7--12 . Truong-Dong Do, Minh-Thien Duong, Quoc-Vu Dang, and My-Ha Le. 2018. Real-Time Self-Driving Car Navigation Using Deep Neural Network. In 2018 4th International Conference on Green Technology and Sustainable Development (GTSD). IEEE, 7--12."},{"key":"e_1_3_2_2_6_1","volume-title":"Image captioning as an assistive technology: Lessons learned from vizwiz 2020 challenge. Journal of Artificial Intelligence Research","author":"Dognin Pierre","year":"2022","unstructured":"Pierre Dognin , Igor Melnyk , Youssef Mroueh , Inkit Padhi , Mattia Rigotti , Jarret Ross , Yair Schiff , Richard A Young , and Brian Belgodere . 2022. Image captioning as an assistive technology: Lessons learned from vizwiz 2020 challenge. Journal of Artificial Intelligence Research ( 2022 ). Pierre Dognin, Igor Melnyk, Youssef Mroueh, Inkit Padhi, Mattia Rigotti, Jarret Ross, Yair Schiff, Richard A Young, and Brian Belgodere. 2022. Image captioning as an assistive technology: Lessons learned from vizwiz 2020 challenge. Journal of Artificial Intelligence Research (2022)."},{"key":"e_1_3_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P18-1082"},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.5555\/1888089.1888092"},{"key":"e_1_3_2_2_9_1","volume-title":"Actigraphy-based sleep\/wake pattern detection using convolutional neural networks. arXiv preprint arXiv:1802.07945","author":"Granovsky Lena","year":"2018","unstructured":"Lena Granovsky , Gabi Shalev , Nancy Yacovzada , Yotam Frank , and Shai Fine . 2018. Actigraphy-based sleep\/wake pattern detection using convolutional neural networks. arXiv preprint arXiv:1802.07945 ( 2018 ). Lena Granovsky, Gabi Shalev, Nancy Yacovzada, Yotam Frank, and Shai Fine. 2018. Actigraphy-based sleep\/wake pattern detection using convolutional neural networks. arXiv preprint arXiv:1802.07945 (2018)."},{"key":"e_1_3_2_2_10_1","volume-title":"International Conference on Machine Learning. PMLR, 1321--1330","author":"Guo Chuan","year":"2017","unstructured":"Chuan Guo , Geoff Pleiss , Yu Sun , and Kilian Q Weinberger . 2017 . On calibration of modern neural networks . In International Conference on Machine Learning. PMLR, 1321--1330 . Chuan Guo, Geoff Pleiss, Yu Sun, and Kilian Q Weinberger. 2017. On calibration of modern neural networks. In International Conference on Machine Learning. PMLR, 1321--1330."},{"key":"e_1_3_2_2_11_1","volume-title":"Proceedings of the International Conference on Learning Representations","author":"Hendrycks Dan","year":"2019","unstructured":"Dan Hendrycks and Thomas Dietterich . 2019 . Benchmarking Neural Network Robustness to Common Corruptions and Perturbations . Proceedings of the International Conference on Learning Representations (2019). Dan Hendrycks and Thomas Dietterich. 2019. Benchmarking Neural Network Robustness to Common Corruptions and Perturbations. Proceedings of the International Conference on Learning Representations (2019)."},{"key":"e_1_3_2_2_12_1","volume-title":"Proceedings of International Conference on Learning Representations","author":"Hendrycks Dan","year":"2017","unstructured":"Dan Hendrycks and Kevin Gimpel . 2017 . A Baseline for Detecting Misclas- sified and Out-of-Distribution Examples in Neural Networks . Proceedings of International Conference on Learning Representations (2017). Dan Hendrycks and Kevin Gimpel. 2017. A Baseline for Detecting Misclas- sified and Out-of-Distribution Examples in Neural Networks. Proceedings of International Conference on Learning Representations (2017)."},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2020.acl-main.244"},{"key":"e_1_3_2_2_14_1","volume-title":"Image captioning: Transforming objects into words. arXiv preprint arXiv:1906.05963","author":"Herdade Simao","year":"2019","unstructured":"Simao Herdade , Armin Kappeler , Kofi Boakye , and Joao Soares . 2019. Image captioning: Transforming objects into words. arXiv preprint arXiv:1906.05963 ( 2019 ). Simao Herdade, Armin Kappeler, Kofi Boakye, and Joao Soares. 2019. Image captioning: Transforming objects into words. arXiv preprint arXiv:1906.05963 (2019)."},{"key":"e_1_3_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.5555\/2566972.2566993"},{"key":"e_1_3_2_2_16_1","volume-title":"Proceedings of International Conference on Learning Representations","author":"Holtzman Ari","year":"2020","unstructured":"Ari Holtzman , Jan Buys , Maxwell Forbes , and Yejin Choi . 2020 . The curious case of neural text degeneration . Proceedings of International Conference on Learning Representations (2020). Ari Holtzman, Jan Buys, Maxwell Forbes, and Yejin Choi. 2020. The curious case of neural text degeneration. Proceedings of International Conference on Learning Representations (2020)."},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01096"},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v35i2.16258"},{"key":"e_1_3_2_2_19_1","unstructured":"Alexander B. Jung Kentaro Wada Jon Crall Satoshi Tanaka Jake Graving etal 2019. imgaug. https:\/\/github.com\/aleju\/imgaug. Online; accessed 25-Sept-2019.  Alexander B. Jung Kentaro Wada Jon Crall Satoshi Tanaka Jake Graving et al. 2019. imgaug. https:\/\/github.com\/aleju\/imgaug. Online; accessed 25-Sept-2019."},{"key":"e_1_3_2_2_20_1","volume-title":"Oodformer: Out-of-distribution detection transformer. BMVC","author":"Koner Rajat","year":"2021","unstructured":"Rajat Koner , Poulami Sinhamahapatra , Karsten Roscher , Stephan G\u00fcnnemann , and Volker Tresp . 2021 . Oodformer: Out-of-distribution detection transformer. BMVC (2021). Rajat Koner, Poulami Sinhamahapatra, Karsten Roscher, Stephan G\u00fcnnemann, and Volker Tresp. 2021. Oodformer: Out-of-distribution detection transformer. BMVC (2021)."},{"key":"e_1_3_2_2_21_1","volume-title":"Openimages: A public dataset for large-scale multi-label and multi-class image classification. Dataset available from https:\/\/github. com\/openimages 2, 3","author":"Krasin Ivan","year":"2017","unstructured":"Ivan Krasin , Tom Duerig , Neil Alldrin , Vittorio Ferrari , Sami Abu-El-Haija , Alina Kuznetsova , Hassan Rom , Jasper Uijlings , Stefan Popov , Andreas Veit , 2017 . Openimages: A public dataset for large-scale multi-label and multi-class image classification. Dataset available from https:\/\/github. com\/openimages 2, 3 (2017), 2--3. Ivan Krasin, Tom Duerig, Neil Alldrin, Vittorio Ferrari, Sami Abu-El-Haija, Alina Kuznetsova, Hassan Rom, Jasper Uijlings, Stefan Popov, Andreas Veit, et al. 2017. Openimages: A public dataset for large-scale multi-label and multi-class image classification. Dataset available from https:\/\/github. com\/openimages 2, 3 (2017), 2--3."},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.5555\/2390524.2390575"},{"key":"e_1_3_2_2_23_1","volume-title":"Proceedings of International Conference on Learning Representations","author":"Lee Kimin","year":"2018","unstructured":"Kimin Lee , Honglak Lee , Kibok Lee , and Jinwoo Shin . 2018 . Training confidence- calibrated classifiers for detecting out-of-distribution samples . Proceedings of International Conference on Learning Representations (2018). Kimin Lee, Honglak Lee, Kibok Lee, and Jinwoo Shin. 2018. Training confidence- calibrated classifiers for detecting out-of-distribution samples. Proceedings of International Conference on Learning Representations (2018)."},{"key":"e_1_3_2_2_24_1","volume-title":"A simple unified framework for detecting out-of-distribution samples and adversarial attacks. Advances in neural information processing systems 31","author":"Lee Kimin","year":"2018","unstructured":"Kimin Lee , Kibok Lee , Honglak Lee , and Jinwoo Shin . 2018. A simple unified framework for detecting out-of-distribution samples and adversarial attacks. Advances in neural information processing systems 31 ( 2018 ). Kimin Lee, Kibok Lee, Honglak Lee, and Jinwoo Shin. 2018. A simple unified framework for detecting out-of-distribution samples and adversarial attacks. Advances in neural information processing systems 31 (2018)."},{"key":"e_1_3_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00902"},{"key":"e_1_3_2_2_26_1","volume-title":"k Folden: k-Fold Ensemble for Out-Of-Distribution Detection. EMNLP","author":"Li Xiaoya","year":"2021","unstructured":"Xiaoya Li , Jiwei Li , Xiaofei Sun , Chun Fan , Tianwei Zhang , Fei Wu , Yuxian Meng , and Jun Zhang . 2021. k Folden: k-Fold Ensemble for Out-Of-Distribution Detection. EMNLP ( 2021 ). Xiaoya Li, Jiwei Li, Xiaofei Sun, Chun Fan, Tianwei Zhang, Fei Wu, Yuxian Meng, and Jun Zhang. 2021. k Folden: k-Fold Ensemble for Out-Of-Distribution Detection. EMNLP (2021)."},{"key":"e_1_3_2_2_27_1","volume-title":"Proceedings of International Conference on Learning Representations","author":"Liang Shiyu","year":"2018","unstructured":"Shiyu Liang , Yixuan Li , and Rayadurgam Srikant . 2018 . Enhancing the reliabil- ity of out-of-distribution image detection in neural networks . Proceedings of International Conference on Learning Representations (2018). Shiyu Liang, Yixuan Li, and Rayadurgam Srikant. 2018. Enhancing the reliabil- ity of out-of-distribution image detection in neural networks. Proceedings of International Conference on Learning Representations (2018)."},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"e_1_3_2_2_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00754"},{"key":"e_1_3_2_2_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/3025453.3025814"},{"key":"e_1_3_2_2_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298640"},{"key":"e_1_3_2_2_32_1","volume-title":"Proceedings of the 40th annual meeting on association for computational linguistics. Association for Computational Linguistics, 311--318","author":"Papineni Kishore","year":"2002","unstructured":"Kishore Papineni , Salim Roukos , Todd Ward , and Wei-Jing Zhu . 2002 . BLEU: a method for automatic evaluation of machine translation . In Proceedings of the 40th annual meeting on association for computational linguistics. Association for Computational Linguistics, 311--318 . Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2002. BLEU: a method for automatic evaluation of machine translation. In Proceedings of the 40th annual meeting on association for computational linguistics. Association for Computational Linguistics, 311--318."},{"key":"e_1_3_2_2_33_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2005.12.003"},{"key":"e_1_3_2_2_34_1","volume-title":"Generating Diverse and Informative Natural Language Fashion Feedback. CVPR Workshop on Language and Vision","author":"Sadeh Gil","year":"2019","unstructured":"Gil Sadeh , Lior Fritz , Gabi Shalev , and Eduard Oks . 2019 . Generating Diverse and Informative Natural Language Fashion Feedback. CVPR Workshop on Language and Vision (2019). Gil Sadeh, Lior Fritz, Gabi Shalev, and Eduard Oks. 2019. Generating Diverse and Informative Natural Language Fashion Feedback. CVPR Workshop on Language and Vision (2019)."},{"key":"e_1_3_2_2_35_1","volume-title":"CVPR Workshop on Language and Vision","author":"Sadeh Gil","year":"2019","unstructured":"Gil Sadeh , Lior Fritz , Gabi Shalev , and Eduard Oks . 2019 . Joint visual-textual embedding for multimodal style search . CVPR Workshop on Language and Vision (2019). Gil Sadeh, Lior Fritz, Gabi Shalev, and Eduard Oks. 2019. Joint visual-textual embedding for multimodal style search. CVPR Workshop on Language and Vision (2019)."},{"key":"e_1_3_2_2_36_1","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0212356"},{"key":"e_1_3_2_2_37_1","unstructured":"Gabi Shalev Yossi Adi and Joseph Keshet. 2018. Out-of-distribution detection using multiple semantic label representations. In Advances in Neural Information Processing Systems. 7375--7385.  Gabi Shalev Yossi Adi and Joseph Keshet. 2018. Out-of-distribution detection using multiple semantic label representations. In Advances in Neural Information Processing Systems. 7375--7385."},{"key":"e_1_3_2_2_38_1","volume-title":"Redesigning the classi- fication layer by randomizing the class representation vectors. arXiv preprint arXiv:2011.08704","author":"Shalev Gabi","year":"2020","unstructured":"Gabi Shalev , Gal-Lev Shalev , and Joseph Keshet . 2020. Redesigning the classi- fication layer by randomizing the class representation vectors. arXiv preprint arXiv:2011.08704 ( 2020 ). Gabi Shalev, Gal-Lev Shalev, and Joseph Keshet. 2020. Redesigning the classi- fication layer by randomizing the class representation vectors. arXiv preprint arXiv:2011.08704 (2020)."},{"key":"e_1_3_2_2_39_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.maiworkshop-1.2"},{"key":"e_1_3_2_2_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW.2016.61"},{"key":"e_1_3_2_2_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7299087"},{"key":"e_1_3_2_2_42_1","volume-title":"Rami Eisawy, Franz Pfister, and Nassir Navab.","author":"Venkatakrishnan Abinav Ravi","year":"2020","unstructured":"Abinav Ravi Venkatakrishnan , Seong Tae Kim , Rami Eisawy, Franz Pfister, and Nassir Navab. 2020 . Self-supervised out-of-distribution detection in brain CT scans. arXiv preprint arXiv:2011.05428 (2020). Abinav Ravi Venkatakrishnan, Seong Tae Kim, Rami Eisawy, Franz Pfister, and Nassir Navab. 2020. Self-supervised out-of-distribution detection in brain CT scans. arXiv preprint arXiv:2011.05428 (2020)."},{"key":"e_1_3_2_2_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298935"}],"event":{"name":"MM '22: The 30th ACM International Conference on Multimedia","location":"Lisboa Portugal","acronym":"MM '22","sponsor":["SIGMM ACM Special Interest Group on Multimedia"]},"container-title":["Proceedings of the 30th ACM International Conference on Multimedia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3503161.3548340","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3503161.3548340","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:00:43Z","timestamp":1750186843000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3503161.3548340"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,10,10]]},"references-count":43,"alternative-id":["10.1145\/3503161.3548340","10.1145\/3503161"],"URL":"https:\/\/doi.org\/10.1145\/3503161.3548340","relation":{},"subject":[],"published":{"date-parts":[[2022,10,10]]},"assertion":[{"value":"2022-10-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}