{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,31]],"date-time":"2026-01-31T16:11:52Z","timestamp":1769875912168,"version":"3.49.0"},"publisher-location":"New York, NY, USA","reference-count":16,"publisher":"ACM","license":[{"start":{"date-parts":[[2017,6,6]],"date-time":"2017-06-06T00:00:00Z","timestamp":1496707200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2017,6,6]]},"DOI":"10.1145\/3078971.3079038","type":"proceedings-article","created":{"date-parts":[[2017,5,25]],"date-time":"2017-05-25T16:27:32Z","timestamp":1495729652000},"page":"416-419","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":7,"title":["Generative Adversarial Networks for Multimodal Representation Learning in Video Hyperlinking"],"prefix":"10.1145","author":[{"given":"Vedran","family":"Vukoti\u0107","sequence":"first","affiliation":[{"name":"INRIA \/ IRISA &amp; INSA Rennes, Rennes, France"}]},{"given":"Christian","family":"Raymond","sequence":"additional","affiliation":[{"name":"INRIA \/ IRISA &amp; INSA Rennes, Rennes, France"}]},{"given":"Guillaume","family":"Gravier","sequence":"additional","affiliation":[{"name":"CNRS &amp; INRIA \/ IRISA, Rennes, France"}]}],"member":"320","published-online":{"date-parts":[[2017,6,6]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Proceedings of TRECVID","volume":"2016","author":"Awad George","year":"2016","unstructured":"George Awad , Jonathan Fiscus , Martial Michel , David Joy , Wessel Kraaij , Alan F Smeaton , Georges Qu\u00e9not , Maria Eskevich , Robin Aly , and Roeland Ordelman . 2016 . Trecvid 2016: Evaluating video search, video event detection, localization, and hyperlinking . In Proceedings of TRECVID , Vol. 2016 . George Awad, Jonathan Fiscus, Martial Michel, David Joy, Wessel Kraaij, Alan F Smeaton, Georges Qu\u00e9not, Maria Eskevich, Robin Aly, and Roeland Ordelman. 2016. Trecvid 2016: Evaluating video search, video event detection, localization, and hyperlinking. In Proceedings of TRECVID, Vol. 2016."},{"key":"e_1_3_2_1_2_1","volume-title":"Multimodality and Monomodality for Video Hyperlinking. In Working Notes of the TRECVid 2016 Workshop.","author":"Bois R\u00e9mi","year":"2016","unstructured":"R\u00e9mi Bois , Vedran Vukoti\u0107 , Ronan Sicre , Christian Raymond , Guillaume Gravier , and Pascale S\u00e9billot . 2016 . IRISA at TRECVid2016: Crossmodality , Multimodality and Monomodality for Video Hyperlinking. In Working Notes of the TRECVid 2016 Workshop. R\u00e9mi Bois, Vedran Vukoti\u0107, Ronan Sicre, Christian Raymond, Guillaume Gravier, and Pascale S\u00e9billot. 2016. IRISA at TRECVid2016: Crossmodality, Multimodality and Monomodality for Video Hyperlinking. In Working Notes of the TRECVid 2016 Workshop."},{"key":"e_1_3_2_1_3_1","unstructured":"Miriam Cha Youngjune Gwon and H. T. Kung. 2015. Multimodal sparse representation learning and applications. CoRR abs\/1511.06238 (2015).  Miriam Cha Youngjune Gwon and H. T. Kung. 2015. Multimodal sparse representation learning and applications. CoRR abs\/1511.06238 (2015)."},{"key":"e_1_3_2_1_4_1","volume-title":"Infogan: Interpretable representation learning by information maximizing generative adversarial nets. In Advances in Neural Information Processing Systems. 2172--2180.","author":"Chen Xi","year":"2016","unstructured":"Xi Chen , Yan Duan , Rein Houthooft , John Schulman , Ilya Sutskever , and Pieter Abbeel . 2016 . Infogan: Interpretable representation learning by information maximizing generative adversarial nets. In Advances in Neural Information Processing Systems. 2172--2180. Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya Sutskever, and Pieter Abbeel. 2016. Infogan: Interpretable representation learning by information maximizing generative adversarial nets. In Advances in Neural Information Processing Systems. 2172--2180."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/2647868.2654902"},{"key":"e_1_3_2_1_6_1","unstructured":"Ian Goodfellow Jean Pouget-Abadie Mehdi Mirza Bing Xu David Warde-Farley Sherjil Ozair Aaron Courville and Yoshua Bengio. 2014. Generative adversarial nets. In Advances in neural information processing systems. 2672--2680.   Ian Goodfellow Jean Pouget-Abadie Mehdi Mirza Bing Xu David Warde-Farley Sherjil Ozair Aaron Courville and Yoshua Bengio. 2014. Generative adversarial nets. In Advances in neural information processing systems. 2672--2680."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2015-43"},{"key":"e_1_3_2_1_8_1","volume-title":"Adversarial autoencoders. arXiv preprint arXiv:1511.05644","author":"Makhzani Alireza","year":"2015","unstructured":"Alireza Makhzani , Jonathon Shlens , Navdeep Jaitly , Ian Goodfellow , and Brendan Frey . 2015. Adversarial autoencoders. arXiv preprint arXiv:1511.05644 ( 2015 ). Alireza Makhzani, Jonathon Shlens, Navdeep Jaitly, Ian Goodfellow, and Brendan Frey. 2015. Adversarial autoencoders. arXiv preprint arXiv:1511.05644 (2015)."},{"key":"e_1_3_2_1_9_1","volume-title":"Conditional generative adversarial nets. arXiv preprint arXiv:1411.1784","author":"Mirza Mehdi","year":"2014","unstructured":"Mehdi Mirza and Simon Osindero . 2014. Conditional generative adversarial nets. arXiv preprint arXiv:1411.1784 ( 2014 ). Mehdi Mirza and Simon Osindero. 2014. Conditional generative adversarial nets. arXiv preprint arXiv:1411.1784 (2014)."},{"key":"e_1_3_2_1_10_1","volume-title":"Intl. Conf. on Machine Learning.","author":"Ngiam Jiquan","year":"2011","unstructured":"Jiquan Ngiam , Aditya Khosla , Mingyu Kim , Juhan Nam , Honglak Lee , and Andrew Y Ng . 2011 . Multimodal deep learning . In Intl. Conf. on Machine Learning. Jiquan Ngiam, Aditya Khosla, Mingyu Kim, Juhan Nam, Honglak Lee, and Andrew Y Ng. 2011. Multimodal deep learning. In Intl. Conf. on Machine Learning."},{"key":"e_1_3_2_1_11_1","unstructured":"Guim Perarnau Joost van de Weijer Bogdan Raducanu and Jose M\u00c1lvarez. 2016. Invertible Conditional GANs for image editing. (2016).  Guim Perarnau Joost van de Weijer Bogdan Raducanu and Jose M\u00c1lvarez. 2016. Invertible Conditional GANs for image editing. (2016)."},{"key":"e_1_3_2_1_12_1","volume-title":"ICLR","author":"Radford Alec","year":"2015","unstructured":"Alec Radford , Luke Metz , and Soumith Chintala . 2015 . Unsupervised representation learning with deep convolutional generative adversarial networks . In ICLR 2016. Alec Radford, Luke Metz, and Soumith Chintala. 2015. Unsupervised representation learning with deep convolutional generative adversarial networks. In ICLR 2016."},{"key":"e_1_3_2_1_13_1","volume-title":"Proceedings of The 33rd International Conference on Machine Learning","volume":"3","author":"Reed Scott","year":"2016","unstructured":"Scott Reed , Zeynep Akata , Xinchen Yan , Lajanugen Logeswaran , Bernt Schiele , and Honglak Lee . 2016 . Generative adversarial text to image synthesis . In Proceedings of The 33rd International Conference on Machine Learning , Vol. 3 . Scott Reed, Zeynep Akata, Xinchen Yan, Lajanugen Logeswaran, Bernt Schiele, and Honglak Lee. 2016. Generative adversarial text to image synthesis. In Proceedings of The 33rd International Conference on Machine Learning, Vol. 3."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/2911996.2912064"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/2983563.2983567"},{"key":"e_1_3_2_1_16_1","volume-title":"StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks. arXiv preprint arXiv:1612.03242","author":"Zhang Han","year":"2016","unstructured":"Han Zhang , Tao Xu , Hongsheng Li , Shaoting Zhang , Xiaolei Huang , Xiao-gang Wang, and Dimitris Metaxas . 2016. StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks. arXiv preprint arXiv:1612.03242 ( 2016 ). Han Zhang, Tao Xu, Hongsheng Li, Shaoting Zhang, Xiaolei Huang, Xiao-gang Wang, and Dimitris Metaxas. 2016. StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks. arXiv preprint arXiv:1612.03242 (2016)."}],"event":{"name":"ICMR '17: International Conference on Multimedia Retrieval","location":"Bucharest Romania","acronym":"ICMR '17","sponsor":["SIGMM ACM Special Interest Group on Multimedia"]},"container-title":["Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3078971.3079038","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3078971.3079038","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T03:03:24Z","timestamp":1750215804000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3078971.3079038"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,6,6]]},"references-count":16,"alternative-id":["10.1145\/3078971.3079038","10.1145\/3078971"],"URL":"https:\/\/doi.org\/10.1145\/3078971.3079038","relation":{},"subject":[],"published":{"date-parts":[[2017,6,6]]},"assertion":[{"value":"2017-06-06","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}