{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:34:19Z","timestamp":1750221259641,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":11,"publisher":"ACM","license":[{"start":{"date-parts":[[2018,9,10]],"date-time":"2018-09-10T00:00:00Z","timestamp":1536537600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2018,9,10]]},"DOI":"10.1145\/3234944.3234948","type":"proceedings-article","created":{"date-parts":[[2018,9,13]],"date-time":"2018-09-13T12:54:52Z","timestamp":1536843292000},"page":"219-222","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["Classifying Community QA Questions That Contain an Image"],"prefix":"10.1145","author":[{"given":"Kenta","family":"Tamaki","sequence":"first","affiliation":[{"name":"Waseda University, Tokyo, Japan"}]},{"given":"Riku","family":"Togashi","sequence":"additional","affiliation":[{"name":"Yahoo Japan Corporation, Tokyo, Japan"}]},{"given":"Sosuke","family":"Kato","sequence":"additional","affiliation":[{"name":"Waseda University, Tokyo, Japan"}]},{"given":"Sumio","family":"Fujita","sequence":"additional","affiliation":[{"name":"Yahoo Japan Corporation, Tokyo, Japan"}]},{"given":"Hideyuki","family":"Maeda","sequence":"additional","affiliation":[{"name":"CyberAgent, Inc., Tokyo, Japan"}]},{"given":"Tetsuya","family":"Sakai","sequence":"additional","affiliation":[{"name":"Waseda University, Tokyo, Japan"}]}],"member":"320","published-online":{"date-parts":[[2018,9,10]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.279"},{"key":"e_1_3_2_1_2_1","volume-title":"Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding Conference on Empirical Methods in Natural Language Processing. ACL, 457--468","author":"Fukui Akira","year":"2016","unstructured":"Akira Fukui , Dong Huk Park , Daylen Yang , Anna Rohrbach , Trevor Darrell , and Marcus Rohrbach . 2016 . Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding Conference on Empirical Methods in Natural Language Processing. ACL, 457--468 . Akira Fukui, Dong Huk Park, Daylen Yang, Anna Rohrbach, Trevor Darrell, and Marcus Rohrbach . 2016. Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding Conference on Empirical Methods in Natural Language Processing. ACL, 457--468."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.41"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_2_1_5_1","unstructured":"Alex Krizhevsky Ilya Sutskever and Geoffrey E Hinton . 2012. Imagenet classification with deep convolutional neural networks Advances in neural information processing systems. 1097--1105.   Alex Krizhevsky Ilya Sutskever and Geoffrey E Hinton . 2012. Imagenet classification with deep convolutional neural networks Advances in neural information processing systems. 1097--1105."},{"key":"e_1_3_2_1_6_1","volume-title":"Midge: Generating image descriptions from computer vision detections Proceedings of the 13th Conference of the European","author":"Mitchell Margaret","year":"2012","unstructured":"Margaret Mitchell , Xufeng Han , Jesse Dodge , Alyssa Mensch , Amit Goyal , Alex Berg , Kota Yamaguchi , Tamara Berg , Karl Stratos , and Hal Daum\u00e9 III . 2012 . Midge: Generating image descriptions from computer vision detections Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics. Association for Computational Linguistics , 747--756. Margaret Mitchell, Xufeng Han, Jesse Dodge, Alyssa Mensch, Amit Goyal, Alex Berg, Kota Yamaguchi, Tamara Berg, Karl Stratos, and Hal Daum\u00e9 III . 2012. Midge: Generating image descriptions from computer vision detections Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics. Association for Computational Linguistics, 747--756."},{"key":"e_1_3_2_1_7_1","volume-title":"Grounding of Textual Phrases in Images by Reconstruction Computer Vision -- ECCV","author":"Rohrbach Anna","year":"2016","unstructured":"Anna Rohrbach , Marcus Rohrbach , Ronghang Hu , Trevor Darrell , and Bernt Schiele . 2016. Grounding of Textual Phrases in Images by Reconstruction Computer Vision -- ECCV 2016 , bibfieldeditorBastian Leibe, Jiri Matas , Nicu Sebe, and Max Welling (Eds.). Springer International Publishing , Cham, 817--834. Anna Rohrbach, Marcus Rohrbach, Ronghang Hu, Trevor Darrell, and Bernt Schiele . 2016. Grounding of Textual Phrases in Images by Reconstruction Computer Vision -- ECCV 2016, bibfieldeditorBastian Leibe, Jiri Matas, Nicu Sebe, and Max Welling (Eds.). Springer International Publishing, Cham, 817--834."},{"key":"e_1_3_2_1_8_1","volume-title":"2017 IEEE International Conference on. IEEE, 829--834","author":"Saito Kuniaki","year":"2017","unstructured":"Kuniaki Saito , Andrew Shin , Yoshitaka Ushiku , and Tatsuya Harada . 2017 . Dualnet: Domain-invariant network for visual question answering Multimedia and Expo (ICME) , 2017 IEEE International Conference on. IEEE, 829--834 . Kuniaki Saito, Andrew Shin, Yoshitaka Ushiku, and Tatsuya Harada . 2017. Dualnet: Domain-invariant network for visual question answering Multimedia and Expo (ICME), 2017 IEEE International Conference on. IEEE, 829--834."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/2641383.2641385"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1162\/089976600300015349"},{"key":"e_1_3_2_1_11_1","unstructured":"Xiang Zhang Junbo Zhao and Yann LeCun . 2015. Character-level convolutional networks for text classification Advances in neural information processing systems. 649--657.   Xiang Zhang Junbo Zhao and Yann LeCun . 2015. Character-level convolutional networks for text classification Advances in neural information processing systems. 649--657."}],"event":{"name":"ICTIR '18: The 2018 ACM SIGIR International Conference on the Theory of Information Retrieval","sponsor":["SIGIR ACM Special Interest Group on Information Retrieval"],"location":"Tianjin China","acronym":"ICTIR '18"},"container-title":["Proceedings of the 2018 ACM SIGIR International Conference on Theory of Information Retrieval"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3234944.3234948","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3234944.3234948","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T02:08:15Z","timestamp":1750212495000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3234944.3234948"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,9,10]]},"references-count":11,"alternative-id":["10.1145\/3234944.3234948","10.1145\/3234944"],"URL":"https:\/\/doi.org\/10.1145\/3234944.3234948","relation":{},"subject":[],"published":{"date-parts":[[2018,9,10]]},"assertion":[{"value":"2018-09-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}