{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,10]],"date-time":"2026-01-10T19:22:28Z","timestamp":1768072948014,"version":"3.49.0"},"publisher-location":"New York, NY, USA","reference-count":32,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,4,20]],"date-time":"2020-04-20T00:00:00Z","timestamp":1587340800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,4,20]]},"DOI":"10.1145\/3366423.3380001","type":"proceedings-article","created":{"date-parts":[[2020,5,4]],"date-time":"2020-05-04T08:11:44Z","timestamp":1588579904000},"page":"2521-2527","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":15,"title":["Recommending Themes for Ad Creative Design via Visual-Linguistic Representations"],"prefix":"10.1145","author":[{"given":"Yichao","family":"Zhou","sequence":"first","affiliation":[{"name":"University of California Los Angeles, USA"}]},{"given":"Shaunak","family":"Mishra","sequence":"additional","affiliation":[{"name":"Yahoo Research, USA"}]},{"given":"Manisha","family":"Verma","sequence":"additional","affiliation":[{"name":"Yahoo Research, USA"}]},{"given":"Narayan","family":"Bhamidipati","sequence":"additional","affiliation":[{"name":"Yahoo Research, USA"}]},{"given":"Wei","family":"Wang","sequence":"additional","affiliation":[{"name":"University of California Los Angeles, USA"}]}],"member":"320","published-online":{"date-parts":[[2020,4,20]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"2019. Automatic Understanding of Image and Video Advertisements. 
http:\/\/people.cs.pitt.edu\/~kovashka\/ads."},{"key":"e_1_3_2_1_2_1","unstructured":"2019. Banner blindness. https:\/\/en.wikipedia.org\/wiki\/Banner_blindness."},{"key":"e_1_3_2_1_3_1","unstructured":"2019. Facebook business: Optimize your ad results by refreshing your creative. https:\/\/www.facebook.com\/business\/m\/test-ads-on-facebook."},{"key":"e_1_3_2_1_4_1","unstructured":"2019. Marketing Land: Social media ad fatigue. https:\/\/marketingland.com\/ad-fatigue-social-media-combat-224234."},{"key":"e_1_3_2_1_5_1","unstructured":"2019. Match Zoo. https:\/\/github.com\/NTMC-Community\/MatchZoo."},{"key":"e_1_3_2_1_6_1","unstructured":"2019. Shutterstock: Search millions of royalty free stock images photos videos and music. https:\/\/www.shutterstock.com\/."},{"key":"e_1_3_2_1_7_1","unstructured":"2019. Taboola-trends. https:\/\/trends.taboola.com\/."},{"key":"e_1_3_2_1_8_1","volume-title":"VQA: Visual Question Answering. 
In The IEEE International Conference on Computer Vision (ICCV).","author":"Antol Stanislaw","year":"2015"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/3132847.3132868"},{"key":"e_1_3_2_1_10_1","volume-title":"Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: System Demonstrations.","author":"Boudin Florian","year":"2016"},{"key":"e_1_3_2_1_11_1","volume-title":"Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805(2018).","author":"Devlin Jacob","year":"2018"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P17-1102"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/2983323.2983769"},{"key":"e_1_3_2_1_14_1","volume-title":"Data mining: concepts and techniques","author":"Han Jiawei"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"crossref","unstructured":"Zaeem Hussain Mingda Zhang Xiaozhong Zhang Keren Ye Christopher Thomas Zuha Agha Nathan Ong and Adriana Kovashka. 2017. Automatic Understanding of Image and Video Advertisements. In CVPR.","DOI":"10.1109\/CVPR.2017.123"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/582415.582418"},{"key":"e_1_3_2_1_17_1","volume-title":"Unicoder-vl: A universal encoder for vision and language by cross-modal pre-training. arXiv preprint arXiv:1908.06066(2019).","author":"Li Gen","year":"2019"},{"key":"e_1_3_2_1_18_1","volume-title":"Visualbert: A simple and performant baseline for vision and language. 
arXiv preprint arXiv:1908.03557(2019).","author":"Li Liunian\u00a0Harold","year":"2019"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/1835804.1835811"},{"key":"e_1_3_2_1_20_1","unstructured":"Jiasen Lu Dhruv Batra Devi Parikh and Stefan Lee. 2019. ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks. In NeurIPS."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/2487575.2488200"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"crossref","unstructured":"Shaunak Mishra Manisha Verma and Jelena Gligorijevic. 2019. Guiding Creative Design in Online Advertising(RecSys).","DOI":"10.1145\/3298689.3347022"},{"key":"e_1_3_2_1_23_1","volume-title":"Glove: Global vectors for word representation. In EMNLP.","author":"Pennington Jeffrey","year":"2014"},{"key":"e_1_3_2_1_24_1","unstructured":"Shaoqing Ren Kaiming He Ross Girshick and Jian Sun. 2015. Faster r-cnn: Towards real-time object detection with region proposal networks. In Advances in neural information processing systems. 91\u201399."},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1080\/00913367.2015.1018460"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/K18-3013"},{"key":"e_1_3_2_1_27_1","volume-title":"Vl-bert: Pre-training of generic visual-linguistic representations. 
arXiv preprint arXiv:1908.08530(2019).","author":"Su Weijie","year":"2019"},{"key":"e_1_3_2_1_28_1","volume-title":"LXMERT: Learning Cross-Modality Encoder Representations from Transformers. In EMNLP-IJCNLP.","author":"Tan Hao","year":"2019"},{"key":"e_1_3_2_1_29_1","unstructured":"Yonghui Wu Mike Schuster Zhifeng Chen Quoc\u00a0V Le Mohammad Norouzi Wolfgang Macherey Maxim Krikun Yuan Cao Qin Gao Klaus Macherey 2016. Google\u2019s neural machine translation system: Bridging the gap between human and machine translation. arXiv preprint arXiv:1609.08144(2016)."},{"key":"e_1_3_2_1_30_1","volume-title":"Munich","author":"Ye Keren","year":"2018"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1080\/02650487.2019.1575109"},{"key":"e_1_3_2_1_32_1","volume-title":"Understanding Consumer Journey using Attention based Recurrent Neural Networks. 
KDD","author":"Zhou Yichao","year":"2019"}],"event":{"name":"WWW '20: The Web Conference 2020","location":"Taipei Taiwan","acronym":"WWW '20","sponsor":["SIGWEB ACM Special Interest Group on Hypertext, Hypermedia, and Web"]},"container-title":["Proceedings of The Web Conference 2020"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3366423.3380001","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3366423.3380001","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T23:13:42Z","timestamp":1750202022000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3366423.3380001"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,4,20]]},"references-count":32,"alternative-id":["10.1145\/3366423.3380001","10.1145\/3366423"],"URL":"https:\/\/doi.org\/10.1145\/3366423.3380001","relation":{},"subject":[],"published":{"date-parts":[[2020,4,20]]},"assertion":[{"value":"2020-04-20","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}