{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:23:28Z","timestamp":1750220608392,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":33,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,6,8]],"date-time":"2020-06-08T00:00:00Z","timestamp":1591574400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,6,8]]},"DOI":"10.1145\/3372278.3390679","type":"proceedings-article","created":{"date-parts":[[2020,6,2]],"date-time":"2020-06-02T04:35:27Z","timestamp":1591072527000},"page":"100-107","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":5,"title":["Actor-Critic Sequence Generation for Relative Difference Captioning"],"prefix":"10.1145","author":[{"given":"Zhengcong","family":"Fei","sequence":"first","affiliation":[{"name":"Institute of Computing Technology, CAS &amp; University of Chinese Academy of Science, Beijing, China"}]}],"member":"320","published-online":{"date-parts":[[2020,6,8]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Proc. ICML. 1206--1214","author":"Ammar Haitham Bou","year":"2014","unstructured":"Haitham Bou Ammar , Eric Eaton , Paul Ruvolo , and Matthew Taylor . 2014 . Online multi-task learning for policy gradient methods . In Proc. ICML. 1206--1214 . Haitham Bou Ammar, Eric Eaton, Paul Ruvolo, and Matthew Taylor. 2014. Online multi-task learning for policy gradient methods. In Proc. ICML. 1206--1214."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00636"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.321"},{"key":"e_1_3_2_1_4_1","volume-title":"Proc. NIPS. 2121--2129","author":"Frome Andrea","year":"2013","unstructured":"Andrea Frome , Greg S Corrado , Jon Shlens , Samy Bengio , Jeff Dean , Marctextquotesingle Aurelio Ranzato , and Tomas Mikolov . 2013 . DeViSE: A Deep Visual-Semantic Embedding Model . In Proc. NIPS. 2121--2129 . Andrea Frome, Greg S Corrado, Jon Shlens, Samy Bengio, Jeff Dean, Marctextquotesingle Aurelio Ranzato, and Tomas Mikolov. 2013. DeViSE: A Deep Visual-Semantic Embedding Model. In Proc. NIPS. 2121--2129."},{"key":"e_1_3_2_1_5_1","volume-title":"Proc. VLDB. 518--529","author":"Gionis Aristides","year":"1999","unstructured":"Aristides Gionis , Piotr Indyk , Rajeev Motwani , 1999 . Similarity search in high dimensions via hashing . In Proc. VLDB. 518--529 . Aristides Gionis, Piotr Indyk, Rajeev Motwani, et al. 1999. Similarity search in high dimensions via hashing. In Proc. VLDB. 518--529."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01249-6_47"},{"key":"e_1_3_2_1_7_1","volume-title":"Playing Atari Games with Deep Reinforcement Learning and Human Checkpoint Replay. arXiv preprint arXiv:1607.05077","author":"Hosu Ionel Alexandru","year":"2016","unstructured":"Ionel Alexandru Hosu and Traian Rebedea . 2016. Playing Atari Games with Deep Reinforcement Learning and Human Checkpoint Replay. arXiv preprint arXiv:1607.05077 ( 2016 ). Ionel Alexandru Hosu and Traian Rebedea. 2016. Playing Atari Games with Deep Reinforcement Learning and Human Checkpoint Replay. arXiv preprint arXiv:1607.05077 (2016)."},{"key":"e_1_3_2_1_8_1","volume-title":"Proc. NIPS. 1008--1014","author":"Konda Vijay R","year":"2000","unstructured":"Vijay R Konda and John N Tsitsiklis . 2000 . Actor-critic algorithms . In Proc. NIPS. 1008--1014 . Vijay R Konda and John N Tsitsiklis. 2000. Actor-critic algorithms. In Proc. NIPS. 1008--1014."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"crossref","unstructured":"G Kulkarni V Premraj V Ordonez S Dhar S. Li Y Choi A. C. Berg and T. L. Berg. 2013. Babytalk: understanding and generating simple image descriptions. IEEE Trans. Pattern Anal. Mach. Intell. (2013) 2891--2903.  G Kulkarni V Premraj V Ordonez S Dhar S. Li Y Choi A. C. Berg and T. L. Berg. 2013. Babytalk: understanding and generating simple image descriptions. IEEE Trans. Pattern Anal. Mach. Intell. (2013) 2891--2903.","DOI":"10.1109\/TPAMI.2012.162"},{"key":"e_1_3_2_1_10_1","volume-title":"Scene Graph Generation from Objects, Phrases and Caption Regions. arXiv preprint arXiv:1707.09700","author":"Li Yikang","year":"2017","unstructured":"Yikang Li , Wanli Ouyang , Bolei Zhou , Kun Wang , and Xiaogang Wang . 2017. Scene Graph Generation from Objects, Phrases and Caption Regions. arXiv preprint arXiv:1707.09700 ( 2017 ). Yikang Li, Wanli Ouyang, Bolei Zhou, Kun Wang, and Xiaogang Wang. 2017. Scene Graph Generation from Objects, Phrases and Caption Regions. arXiv preprint arXiv:1707.09700 (2017)."},{"key":"e_1_3_2_1_11_1","volume-title":"Proc. ACL Workshops, 74--81","author":"Lin Chin-Yew","year":"2004","unstructured":"Chin-Yew Lin . 2004 . ROUGE: A Package for Automatic Evaluation of summaries . Proc. ACL Workshops, 74--81 . Chin-Yew Lin. 2004. ROUGE: A Package for Automatic Evaluation of summaries. Proc. ACL Workshops, 74--81."},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00728"},{"key":"e_1_3_2_1_13_1","volume-title":"Proc. ACL. 747--756","author":"Mitchell Margaret","year":"2012","unstructured":"Margaret Mitchell , Xufeng Han , Jesse Dodge , Alyssa Mensch , and Iii Hal Daum\u00e9 . 2012 . Midge: generating image descriptions from computer vision detections . In Proc. ACL. 747--756 . Margaret Mitchell, Xufeng Han, Jesse Dodge, Alyssa Mensch, and Iii Hal Daum\u00e9. 2012. Midge: generating image descriptions from computer vision detections. In Proc. ACL. 747--756."},{"key":"e_1_3_2_1_14_1","volume-title":"Nature","volume":"518","author":"Mnih Volodymyr","year":"2015","unstructured":"Volodymyr Mnih , Koray Kavukcuoglu , David Silver , Andrei A Rusu , Joel Veness , Marc G Bellemare , Alex Graves , Martin Riedmiller , Andreas K Fidjeland , Georg Ostrovski , 2015 . Human-level control through deep reinforcement learning . Nature , Vol. 518 , 7540 (2015), 529--533. Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A Rusu, Joel Veness, Marc G Bellemare, Alex Graves, Martin Riedmiller, Andreas K Fidjeland, Georg Ostrovski, et al. 2015. Human-level control through deep reinforcement learning. Nature, Vol. 518, 7540 (2015), 529--533."},{"key":"e_1_3_2_1_15_1","volume-title":"Proc. ACL. 311--318","author":"Papineni Kishore","year":"2002","unstructured":"Kishore Papineni , Salim Roukos , Todd Ward , and Wei Jing Zhu . 2002 . BLEU: a Method for Automatic Evaluation of Machine Translation . In Proc. ACL. 311--318 . Kishore Papineni, Salim Roukos, Todd Ward, and Wei Jing Zhu. 2002. BLEU: a Method for Automatic Evaluation of Machine Translation. In Proc. ACL. 311--318."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.128"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.131"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-015-0816-y"},{"key":"e_1_3_2_1_19_1","volume-title":"Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, et al.","author":"Silver David","year":"2016","unstructured":"David Silver , Aja Huang , Chris J Maddison , Arthur Guez , Laurent Sifre , George Van Den Driessche , Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, et al. 2016 . Mastering the game of Go with deep neural networks and tree search. nature, Vol. 529 , 7587 (2016), 484. David Silver, Aja Huang, Chris J Maddison, Arthur Guez, Laurent Sifre, George Van Den Driessche, Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, et al. 2016. Mastering the game of Go with deep neural networks and tree search. nature, Vol. 529, 7587 (2016), 484."},{"key":"e_1_3_2_1_20_1","unstructured":"Richard S Sutton Andrew G Barto etal 1998. Introduction to reinforcement learning. Vol. 135. MIT press Cambridge.  Richard S Sutton Andrew G Barto et al. 1998. Introduction to reinforcement learning. Vol. 135. MIT press Cambridge."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.463"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.120"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298935"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1007\/BF00992696"},{"key":"e_1_3_2_1_25_1","volume-title":"A Study of Reinforcement Learning for Neural Machine Translation. arXiv preprint arXiv:1808.08866","author":"Wu Lijun","year":"2018","unstructured":"Lijun Wu , Fei Tian , Tao Qin , Jianhuang Lai , and Tie-Yan Liu . 2018. A Study of Reinforcement Learning for Neural Machine Translation. arXiv preprint arXiv:1808.08866 ( 2018 ). Lijun Wu, Fei Tian, Tao Qin, Jianhuang Lai, and Tie-Yan Liu. 2018. A Study of Reinforcement Learning for Neural Machine Translation. arXiv preprint arXiv:1808.08866 (2018)."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.29"},{"key":"e_1_3_2_1_27_1","volume-title":"Proc. ICML. 2048--2057","author":"Xu Kelvin","year":"2015","unstructured":"Kelvin Xu , Jimmy Ba , Ryan Kiros , Kyunghyun Cho , Aaron Courville , Ruslan Salakhutdinov , Richard Zemel , and Yoshua Bengio . 2015 . Show, Attend and Tell: Neural Image Caption Generation with Visual Attention . In Proc. ICML. 2048--2057 . Kelvin Xu, Jimmy Ba, Ryan Kiros, Kyunghyun Cho, Aaron Courville, Ruslan Salakhutdinov, Richard Zemel, and Yoshua Bengio. 2015. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention. In Proc. ICML. 2048--2057."},{"key":"e_1_3_2_1_28_1","volume-title":"Tell-the-difference: Fine-grained Visual Descriptor via a Discriminating Referee. arXiv preprint arXiv:1910.06426","author":"Xu Shuangjie","year":"2019","unstructured":"Shuangjie Xu , Feng Xu , Yu Cheng , and Pan Zhou . 2019 . Tell-the-difference: Fine-grained Visual Descriptor via a Discriminating Referee. arXiv preprint arXiv:1910.06426 (2019). Shuangjie Xu, Feng Xu, Yu Cheng, and Pan Zhou. 2019. Tell-the-difference: Fine-grained Visual Descriptor via a Discriminating Referee. arXiv preprint arXiv:1910.06426 (2019)."},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01264-9_42"},{"key":"e_1_3_2_1_30_1","unstructured":"Ting Yao Yingwei Pan Yehao Li Zhaofan Qiu and Tao Mei. 2016. Boosting Image Captioning with Attributes. (2016) 4904--4912.  Ting Yao Yingwei Pan Yehao Li Zhaofan Qiu and Tao Mei. 2016. Boosting Image Captioning with Attributes. (2016) 4904--4912."},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.594"},{"key":"e_1_3_2_1_32_1","volume-title":"Hospedales","author":"Zhang Li","year":"2017","unstructured":"Li Zhang , Flood Sung , Feng Liu , Tao Xiang , Shaogang Gong , Yongxin Yang , and Timothy M . Hospedales . 2017 . Actor-Critic Sequence Training for Image Captioning . arXiv preprint arXiv:1706.09601 (2017). Li Zhang, Flood Sung, Feng Liu, Tao Xiang, Shaogang Gong, Yongxin Yang, and Timothy M. Hospedales. 2017. Actor-Critic Sequence Training for Image Captioning. arXiv preprint arXiv:1706.09601 (2017)."},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2017.7989381"}],"event":{"name":"ICMR '20: International Conference on Multimedia Retrieval","sponsor":["SIGMM ACM Special Interest Group on Multimedia"],"location":"Dublin Ireland","acronym":"ICMR '20"},"container-title":["Proceedings of the 2020 International Conference on Multimedia Retrieval"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3372278.3390679","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3372278.3390679","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T21:32:10Z","timestamp":1750195930000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3372278.3390679"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,6,8]]},"references-count":33,"alternative-id":["10.1145\/3372278.3390679","10.1145\/3372278"],"URL":"https:\/\/doi.org\/10.1145\/3372278.3390679","relation":{},"subject":[],"published":{"date-parts":[[2020,6,8]]},"assertion":[{"value":"2020-06-08","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}