{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,16]],"date-time":"2025-10-16T06:59:37Z","timestamp":1760597977616,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":46,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,6,8]],"date-time":"2020-06-08T00:00:00Z","timestamp":1591574400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100006502","name":"Defense Sciences Office, DARPA","doi-asserted-by":"publisher","award":["FA87501820018"],"award-info":[{"award-number":["FA87501820018"]}],"id":[{"id":"10.13039\/100006502","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000185","name":"Defense Advanced Research Projects Agency","doi-asserted-by":"publisher","award":["HR00111990063"],"award-info":[{"award-number":["HR00111990063"]}],"id":[{"id":"10.13039\/100000185","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,6,8]]},"DOI":"10.1145\/3372278.3390674","type":"proceedings-article","created":{"date-parts":[[2020,6,2]],"date-time":"2020-06-02T04:35:27Z","timestamp":1591072527000},"page":"53-62","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":3,"title":["Forward and Backward Multimodal NMT for Improved Monolingual and Multilingual Cross-Modal Retrieval"],"prefix":"10.1145","author":[{"given":"Po-Yao","family":"Huang","sequence":"first","affiliation":[{"name":"Carnegie Mellon University, Pittsburgh, PA, USA"}]},{"given":"Xiaojun","family":"Chang","sequence":"additional","affiliation":[{"name":"Monash University, Melbourne, Australia"}]},{"given":"Alexander","family":"Hauptmann","sequence":"additional","affiliation":[{"name":"Carnegie Mellon University, PITTSBURGH, PA, USA"}]},{"given":"Eduard","family":"Hovy","sequence":"additional","affiliation":[{"name":"Carnegie Mellon University, PITTSBURGH, PA, USA"}]}],"member":"320","published-online":{"date-parts":[[2020,6,8]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"crossref","unstructured":"Peter Anderson Xiaodong He Chris Buehler Damien Teney Mark Johnson Stephen Gould and Lei Zhang. 2018. Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering. In CVPR.  Peter Anderson Xiaodong He Chris Buehler Damien Teney Mark Johnson Stephen Gould and Lei Zhang. 2018. Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering. In CVPR.","DOI":"10.1109\/CVPR.2018.00636"},{"key":"e_1_3_2_1_2_1","volume-title":"International Conference on Machine Learning. 1247--1255","author":"Andrew Galen","year":"2013","unstructured":"Galen Andrew , Raman Arora , Jeff Bilmes , and Karen Livescu . 2013 . Deep canonical correlation analysis . In International Conference on Machine Learning. 1247--1255 . Galen Andrew, Raman Arora, Jeff Bilmes, and Karen Livescu. 2013. Deep canonical correlation analysis. In International Conference on Machine Learning. 1247--1255."},{"key":"e_1_3_2_1_3_1","volume-title":"Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473","author":"Bahdanau Dzmitry","year":"2014","unstructured":"Dzmitry Bahdanau , Kyunghyun Cho , and Yoshua Bengio . 2014a. Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473 ( 2014 ). Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2014a. Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473 (2014)."},{"key":"e_1_3_2_1_4_1","volume-title":"Neural Machine Translation by Jointly Learning to Align and Translate. CoRR","author":"Bahdanau Dzmitry","year":"2014","unstructured":"Dzmitry Bahdanau , Kyunghyun Cho , and Yoshua Bengio . 2014b. Neural Machine Translation by Jointly Learning to Align and Translate. CoRR , Vol. abs\/ 1409 .0473 ( 2014 ). Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2014b. Neural Machine Translation by Jointly Learning to Align and Translate. CoRR, Vol. abs\/1409.0473 (2014)."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W17-4746"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D17-1105"},{"key":"e_1_3_2_1_7_1","volume-title":"On the Properties of Neural Machine Translation: Encoder--Decoder Approaches. Syntax, Semantics and Structure in Statistical Translation","author":"Cho Kyunghyun","year":"2014","unstructured":"Kyunghyun Cho , Bart van Merri\u00ebnboer , Dzmitry Bahdanau , and Yoshua Bengio . 2014. On the Properties of Neural Machine Translation: Encoder--Decoder Approaches. Syntax, Semantics and Structure in Statistical Translation ( 2014 ), 103. Kyunghyun Cho, Bart van Merri\u00ebnboer, Dzmitry Bahdanau, and Yoshua Bengio. 2014. On the Properties of Neural Machine Translation: Encoder--Decoder Approaches. Syntax, Semantics and Structure in Statistical Translation (2014), 103."},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.323"},{"key":"e_1_3_2_1_9_1","unstructured":"Bo Dai and Dahua Lin. 2017. Contrastive learning for image captioning. In Advances in Neural Information Processing Systems. 898--907.  Bo Dai and Dahua Lin. 2017. Contrastive learning for image captioning. In Advances in Neural Information Processing Systems. 898--907."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.201"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W16-3210"},{"key":"e_1_3_2_1_12_1","volume-title":"Jamie Ryan Kiros, and Sanja Fidler","author":"Faghri Fartash","year":"2018","unstructured":"Fartash Faghri , David J Fleet , Jamie Ryan Kiros, and Sanja Fidler . 2018 . VSE+: Improving Visual-Semantic Embeddings with Hard Negatives . (2018). https:\/\/github.com\/fartashf\/vsepp Fartash Faghri, David J Fleet, Jamie Ryan Kiros, and Sanja Fidler. 2018. VSE+: Improving Visual-Semantic Embeddings with Hard Negatives. (2018). https:\/\/github.com\/fartashf\/vsepp"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D17-1303"},{"key":"e_1_3_2_1_14_1","unstructured":"Di He Yingce Xia Tao Qin Liwei Wang Nenghai Yu Tieyan Liu and Wei-Ying Ma. 2016a. Dual learning for machine translation. In Advances in Neural Information Processing Systems. 820--828.  Di He Yingce Xia Tao Qin Liwei Wang Nenghai Yu Tieyan Liu and Wei-Ying Ma. 2016a. Dual learning for machine translation. In Advances in Neural Information Processing Systems. 820--828."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D19-1154"},{"volume-title":"Proceedings of the 27th ACM International Conference on Multimedia (MM '19)","author":"Huang Po-Yao","key":"e_1_3_2_1_17_1","unstructured":"Po-Yao Huang , Guoliang Kang , Wenhe Liu , Xiaojun Chang , and Alexander G. Hauptmann . 2019 b. Annotation Efficient Cross-Modal Retrieval with Adversarial Attentive Alignment . In Proceedings of the 27th ACM International Conference on Multimedia (MM '19) . Association for Computing Machinery, New York, NY, USA, 1758--1767. https:\/\/doi.org\/10.1145\/3343031.3350894 10.1145\/3343031.3350894 Po-Yao Huang, Guoliang Kang, Wenhe Liu, Xiaojun Chang, and Alexander G. Hauptmann. 2019 b. Annotation Efficient Cross-Modal Retrieval with Adversarial Attentive Alignment. In Proceedings of the 27th ACM International Conference on Multimedia (MM '19). Association for Computing Machinery, New York, NY, USA, 1758--1767. https:\/\/doi.org\/10.1145\/3343031.3350894"},{"key":"e_1_3_2_1_18_1","volume-title":"Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval (ICMR '18)","author":"Huang Po-Yao","year":"2060","unstructured":"Po-Yao Huang , Junwei Liang , Jean-Baptiste Lamare , and Alexander G. Hauptmann . 2018. Multimodal Filtering of Social Media for Temporal Monitoring and Event Analysis . In Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval (ICMR '18) . Association for Computing Machinery, New York, NY, USA, 450--457. https:\/\/doi.org\/10.1145\/3 2060 25.3206079 10.1145\/3206025.3206079 Po-Yao Huang, Junwei Liang, Jean-Baptiste Lamare, and Alexander G. Hauptmann. 2018. Multimodal Filtering of Social Media for Temporal Monitoring and Event Analysis. In Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval (ICMR '18). Association for Computing Machinery, New York, NY, USA, 450--457. https:\/\/doi.org\/10.1145\/3206025.3206079"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W16-2360"},{"volume-title":"Proceedings of the 2019 on International Conference on Multimedia Retrieval (ICMR '19)","author":"Huang Po-Yao","key":"e_1_3_2_1_20_1","unstructured":"Po-Yao Huang , Vaibhav, Xiaojun Chang , and Alexander G. Hauptmann . 2019 c. Improving What Cross-Modal Retrieval Models Learn through Object-Oriented Inter- and Intra-Modal Attention Networks . In Proceedings of the 2019 on International Conference on Multimedia Retrieval (ICMR '19) . Association for Computing Machinery, New York, NY, USA, 244--252. https:\/\/doi.org\/10.1145\/3323873.3325043 10.1145\/3323873.3325043 Po-Yao Huang, Vaibhav, Xiaojun Chang, and Alexander G. Hauptmann. 2019 c. Improving What Cross-Modal Retrieval Models Learn through Object-Oriented Inter- and Intra-Modal Attention Networks. In Proceedings of the 2019 on International Conference on Multimedia Retrieval (ICMR '19). Association for Computing Machinery, New York, NY, USA, 244--252. https:\/\/doi.org\/10.1145\/3323873.3325043"},{"key":"e_1_3_2_1_21_1","volume-title":"Learning semantic concepts and order for image and sentence matching. arXiv preprint arXiv:1712.02036","author":"Huang Yan","year":"2017","unstructured":"Yan Huang , Qi Wu , and Liang Wang . 2017. Learning semantic concepts and order for image and sentence matching. arXiv preprint arXiv:1712.02036 ( 2017 ). Yan Huang, Qi Wu, and Liang Wang. 2017. Learning semantic concepts and order for image and sentence matching. arXiv preprint arXiv:1712.02036 (2017)."},{"key":"e_1_3_2_1_22_1","volume-title":"Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016","author":"Kaiser Lukasz","year":"2016","unstructured":"Lukasz Kaiser and Samy Bengio . 2016 . Can Active Memory Replace Attention? . In Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016 , December 5 --10 , 2016, Barcelona, Spain. 3774--3782. Lukasz Kaiser and Samy Bengio. 2016. Can Active Memory Replace Attention?. In Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, December 5--10, 2016, Barcelona, Spain. 3774--3782."},{"key":"e_1_3_2_1_23_1","volume-title":"Neural machine translation in linear time. arXiv preprint arXiv:1610.10099","author":"Kalchbrenner Nal","year":"2016","unstructured":"Nal Kalchbrenner , Lasse Espeholt , Karen Simonyan , Aaron van den Oord , Alex Graves , and Koray Kavukcuoglu . 2016. Neural machine translation in linear time. arXiv preprint arXiv:1610.10099 ( 2016 ). Nal Kalchbrenner, Lasse Espeholt, Karen Simonyan, Aaron van den Oord, Alex Graves, and Koray Kavukcuoglu. 2016. Neural machine translation in linear time. arXiv preprint arXiv:1610.10099 (2016)."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298932"},{"key":"e_1_3_2_1_25_1","volume-title":"Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980","author":"Kingma Diederik P","year":"2014","unstructured":"Diederik P Kingma and Jimmy Ba . 2014 . Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014). Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)."},{"key":"e_1_3_2_1_26_1","volume-title":"Unifying Visual-Semantic Embeddings with Multimodal Neural Language Models. NIPS Workshop","author":"Kiros Ryan","year":"2014","unstructured":"Ryan Kiros , Ruslan Salakhutdinov , and Richard S. Zemel . 2014 . Unifying Visual-Semantic Embeddings with Multimodal Neural Language Models. NIPS Workshop ( 2014 ). Ryan Kiros, Ruslan Salakhutdinov, and Richard S. Zemel. 2014. Unifying Visual-Semantic Embeddings with Multimodal Neural Language Models. NIPS Workshop (2014)."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.5555\/1557769.1557821"},{"key":"e_1_3_2_1_28_1","volume-title":"Stacked Cross Attention for Image-Text Matching. arXiv preprint arXiv:1803.08024","author":"Lee Kuang-Huei","year":"2018","unstructured":"Kuang-Huei Lee , Xi Chen , Gang Hua , Houdong Hu , and Xiaodong He. 2018. Stacked Cross Attention for Image-Text Matching. arXiv preprint arXiv:1803.08024 ( 2018 ). Kuang-Huei Lee, Xi Chen, Gang Hua, Houdong Hu, and Xiaodong He. 2018. Stacked Cross Attention for Image-Text Matching. arXiv preprint arXiv:1803.08024 (2018)."},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"e_1_3_2_1_30_1","volume-title":"Manning","author":"Luong Minh-Thang","year":"2015","unstructured":"Minh-Thang Luong , Hieu Pham , and Christopher D . Manning . 2015 . Effective Approaches to Attention-based Neural Machine Translation. In Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics , Lisbon, Portugal, 1412--1421. http:\/\/aclweb.org\/anthology\/D15--1166 Minh-Thang Luong, Hieu Pham, and Christopher D. Manning. 2015. Effective Approaches to Attention-based Neural Machine Translation. In Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics, Lisbon, Portugal, 1412--1421. http:\/\/aclweb.org\/anthology\/D15--1166"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.232"},{"key":"e_1_3_2_1_32_1","volume-title":"Gideon Maillette de Buy Wenniger, and Peyman Passban","author":"Poncelas Alberto","year":"2018","unstructured":"Alberto Poncelas , Dimitar Shterionov , Andy Way , Gideon Maillette de Buy Wenniger, and Peyman Passban . 2018 . Investigating Backtranslation in Neural Machine Translation . arXiv preprint arXiv:1804.06189 (2018). Alberto Poncelas, Dimitar Shterionov, Andy Way, Gideon Maillette de Buy Wenniger, and Peyman Passban. 2018. Investigating Backtranslation in Neural Machine Translation. arXiv preprint arXiv:1804.06189 (2018)."},{"key":"e_1_3_2_1_33_1","unstructured":"Shaoqing Ren Kaiming He Ross Girshick and Jian Sun. 2015. Faster r-cnn: Towards real-time object detection with region proposal networks. In Advances in neural information processing systems. 91--99.  Shaoqing Ren Kaiming He Ross Girshick and Jian Sun. 2015. Faster r-cnn: Towards real-time object detection with region proposal networks. In Advances in neural information processing systems. 91--99."},{"key":"e_1_3_2_1_34_1","volume-title":"Edinburgh neural machine translation systems for wmt 16. arXiv preprint arXiv:1606.02891","author":"Sennrich Rico","year":"2016","unstructured":"Rico Sennrich , Barry Haddow , and Alexandra Birch . 2016a. Edinburgh neural machine translation systems for wmt 16. arXiv preprint arXiv:1606.02891 ( 2016 ). Rico Sennrich, Barry Haddow, and Alexandra Birch. 2016a. Edinburgh neural machine translation systems for wmt 16. arXiv preprint arXiv:1606.02891 (2016)."},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P16-1009"},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P16-1162"},{"key":"e_1_3_2_1_37_1","volume-title":"Proceedings of the 27th International Conference on Computational Linguistics, COLING 2018","author":"Shi Haoyue","year":"2018","unstructured":"Haoyue Shi , Jiayuan Mao , Tete Xiao , Yuning Jiang , and Jian Sun . 2018 . Learning Visually-Grounded Semantics from Contrastive Adversarial Samples . In Proceedings of the 27th International Conference on Computational Linguistics, COLING 2018 , Santa Fe, New Mexico, USA, August 20--26 , 2018. 3715--3727. https:\/\/aclanthology.info\/papers\/C18--1315\/c18--1315 Haoyue Shi, Jiayuan Mao, Tete Xiao, Yuning Jiang, and Jian Sun. 2018. Learning Visually-Grounded Semantics from Contrastive Adversarial Samples. In Proceedings of the 27th International Conference on Computational Linguistics, COLING 2018, Santa Fe, New Mexico, USA, August 20--26, 2018. 3715--3727. https:\/\/aclanthology.info\/papers\/C18--1315\/c18--1315"},{"key":"e_1_3_2_1_38_1","volume-title":"Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2019","author":"Song Yale","year":"2019","unstructured":"Yale Song and Mohammad Soleymani . 2019 . Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2019 , Long Beach, CA, USA, June 16--20 , 2019. Computer Vision Foundation \/ IEEE, 1979--1988. https:\/\/doi.org\/10.1109\/CVPR.2019.00208 10.1109\/CVPR.2019.00208 Yale Song and Mohammad Soleymani. 2019. Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2019, Long Beach, CA, USA, June 16--20, 2019. Computer Vision Foundation \/ IEEE, 1979--1988. https:\/\/doi.org\/10.1109\/CVPR.2019.00208"},{"key":"e_1_3_2_1_39_1","volume-title":"Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014","author":"Sutskever Ilya","year":"2014","unstructured":"Ilya Sutskever , Oriol Vinyals , and Quoc V. Le . 2014. Sequence to Sequence Learning with Neural Networks . In Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014 , December 8 --13 2014 , Montreal, Quebec, Canada. 3104--3112. Ilya Sutskever, Oriol Vinyals, and Quoc V. Le. 2014. Sequence to Sequence Learning with Neural Networks. In Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, December 8--13 2014, Montreal, Quebec, Canada. 3104--3112."},{"key":"e_1_3_2_1_40_1","unstructured":"Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan N Gomez \u0141ukasz Kaiser and Illia Polosukhin. 2017. Attention is all you need. In Advances in Neural Information Processing Systems. 5998--6008.  Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan N Gomez \u0141ukasz Kaiser and Illia Polosukhin. 2017. Attention is all you need. In Advances in Neural Information Processing Systems. 5998--6008."},{"key":"e_1_3_2_1_41_1","volume-title":"Order-Embeddings of Images and Language. CoRR","author":"Vendrov Ivan","year":"2015","unstructured":"Ivan Vendrov , Ryan Kiros , Sanja Fidler , and Raquel Urtasun . 2015a. Order-Embeddings of Images and Language. CoRR , Vol. abs\/ 1511 .06361 ( 2015 ). Ivan Vendrov, Ryan Kiros, Sanja Fidler, and Raquel Urtasun. 2015a. Order-Embeddings of Images and Language. CoRR, Vol. abs\/1511.06361 (2015)."},{"key":"e_1_3_2_1_42_1","volume-title":"Order-embeddings of images and language. arXiv preprint arXiv:1511.06361","author":"Vendrov Ivan","year":"2015","unstructured":"Ivan Vendrov , Ryan Kiros , Sanja Fidler , and Raquel Urtasun . 2015b. Order-embeddings of images and language. arXiv preprint arXiv:1511.06361 ( 2015 ). Ivan Vendrov, Ryan Kiros, Sanja Fidler, and Raquel Urtasun. 2015b. Order-embeddings of images and language. arXiv preprint arXiv:1511.06361 (2015)."},{"key":"e_1_3_2_1_43_1","volume-title":"Learning two-branch neural networks for image-text matching tasks","author":"Wang Liwei","year":"2018","unstructured":"Liwei Wang , Yin Li , Jing Huang , and Svetlana Lazebnik . 2018. Learning two-branch neural networks for image-text matching tasks . IEEE Transactions on Pattern Analysis and Machine Intelligence ( 2018 ). Liwei Wang, Yin Li, Jing Huang, and Svetlana Lazebnik. 2018. Learning two-branch neural networks for image-text matching tasks. IEEE Transactions on Pattern Analysis and Machine Intelligence (2018)."},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.541"},{"key":"e_1_3_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00166"},{"key":"e_1_3_2_1_46_1","volume-title":"Dual-Path Convolutional Image-Text Embedding. CoRR","author":"Zheng Zhedong","year":"2017","unstructured":"Zhedong Zheng , Liang Zheng , Michael Garrett , Yi Yang , and Yi-Dong Shen . 2017. Dual-Path Convolutional Image-Text Embedding. CoRR , Vol. abs\/ 1711 .05535 ( 2017 ). arxiv: 1711.05535 http:\/\/arxiv.org\/abs\/1711.05535 Zhedong Zheng, Liang Zheng, Michael Garrett, Yi Yang, and Yi-Dong Shen. 2017. Dual-Path Convolutional Image-Text Embedding. CoRR, Vol. abs\/1711.05535 (2017). arxiv: 1711.05535 http:\/\/arxiv.org\/abs\/1711.05535"}],"event":{"name":"ICMR '20: International Conference on Multimedia Retrieval","sponsor":["SIGMM ACM Special Interest Group on Multimedia"],"location":"Dublin Ireland","acronym":"ICMR '20"},"container-title":["Proceedings of the 2020 International Conference on Multimedia Retrieval"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3372278.3390674","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3372278.3390674","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3372278.3390674","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T21:32:10Z","timestamp":1750195930000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3372278.3390674"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,6,8]]},"references-count":46,"alternative-id":["10.1145\/3372278.3390674","10.1145\/3372278"],"URL":"https:\/\/doi.org\/10.1145\/3372278.3390674","relation":{},"subject":[],"published":{"date-parts":[[2020,6,8]]},"assertion":[{"value":"2020-06-08","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}