{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,17]],"date-time":"2025-10-17T13:41:23Z","timestamp":1760708483976,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":35,"publisher":"ACM","license":[{"start":{"date-parts":[[2012,10,29]],"date-time":"2012-10-29T00:00:00Z","timestamp":1351468800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2012,10,29]]},"DOI":"10.1145\/2393347.2393424","type":"proceedings-article","created":{"date-parts":[[2012,11,14]],"date-time":"2012-11-14T20:36:17Z","timestamp":1352925377000},"page":"549-558","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":22,"title":["Efficient image annotation for automatic sentence generation"],"prefix":"10.1145","author":[{"given":"Yoshitaka","family":"Ushiku","sequence":"first","affiliation":[{"name":"The University of Tokyo, Tokyo, Japan"}]},{"given":"Tatsuya","family":"Harada","sequence":"additional","affiliation":[{"name":"The University of Tokyo &amp; JST PRESTO, Tokyo, Japan"}]},{"given":"Yasuo","family":"Kuniyoshi","sequence":"additional","affiliation":[{"name":"The University of Tokyo, Tokyo, Japan"}]}],"member":"320","published-online":{"date-parts":[[2012,10,29]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"ACL","author":"Aker A.","year":"2010","unstructured":"A. Aker and R. Gaizauskas . Generating image descriptions using dependency relational patterns . In ACL , 2010 . A. Aker and R. Gaizauskas. Generating image descriptions using dependency relational patterns. In ACL, 2010."},{"key":"e_1_3_2_1_2_1","volume-title":"COMPSTAT","author":"Bottou L.","year":"2010","unstructured":"L. Bottou . Large-Scale Machine Learning with Stochastic Gradient Descent . In COMPSTAT , 2010 . L. Bottou. Large-Scale Machine Learning with Stochastic Gradient Descent. In COMPSTAT, 2010."},{"key":"e_1_3_2_1_3_1","first-page":"551","volume":"7","author":"Crammer K.","year":"2006","unstructured":"K. Crammer , O. Dekel , J. Keshet , S. Shalev-Shwartz , and Y. Singer . Online Passive-Aggressive Algorithms. JMLR , 7 : 551 -- 585 , 2006 . K. Crammer, O. Dekel, J. Keshet, S. Shalev-Shwartz, and Y. Singer. Online Passive-Aggressive Algorithms. JMLR, 7:551--585, 2006.","journal-title":"Online Passive-Aggressive Algorithms. JMLR"},{"key":"e_1_3_2_1_4_1","volume-title":"NIPS","author":"Crammer K.","year":"2010","unstructured":"K. Crammer and D. D. Lee . Learning via Gaussian Herding . In NIPS , 2010 . K. Crammer and D. D. Lee. Learning via Gaussian Herding. In NIPS, 2010."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.5555\/1289189.1289273"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.5555\/1888089.1888092"},{"key":"e_1_3_2_1_8_1","article-title":"How Many Words is a Picture Worth?","author":"Feng Y.","year":"2010","unstructured":"Y. Feng and M. Lapata . How Many Words is a Picture Worth? Automatic Caption Generation for News Images. In ACL , 2010 . Y. Feng and M. Lapata. How Many Words is a Picture Worth? Automatic Caption Generation for News Images. In ACL, 2010.","journal-title":"Automatic Caption Generation for News Images. In ACL"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1007\/11871842_19"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2009.5459266"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/860435.860459"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2011.5995466"},{"key":"e_1_3_2_1_13_1","volume-title":"CoNLL","author":"Li S.","year":"2011","unstructured":"S. Li , G. Kulkarni , T. L. Berg , A. C. Berg , and Y. Choi . Composing Simple Image Descriptions using Web-scale N-grams . In CoNLL , 2011 . S. Li, G. Kulkarni, T. L. Berg, A. C. Berg, and Y. Choi. Composing Simple Image Descriptions using Web-scale N-grams. In CoNLL, 2011."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2011.5995477"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-88693-8_33"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1023\/B:VISI.0000029664.99615.94"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-88690-7_24"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.5244\/C.24.94"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1016\/0031-3203(95)00067-4"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1011139631724"},{"key":"e_1_3_2_1_22_1","volume-title":"NIPS","author":"Ordonez V.","year":"2011","unstructured":"V. Ordonez , G. Kulkarni , and T. L. Berg . Im2Text: Describing Images Using 1 Million Captioned Photographs . In NIPS , 2011 . V. Ordonez, G. Kulkarni, and T. L. Berg. Im2Text: Describing Images Using 1 Million Captioned Photographs. In NIPS, 2011."},{"key":"e_1_3_2_1_23_1","volume-title":"IAPR Workshop on Computer Vision","author":"Otsu N.","year":"1988","unstructured":"N. Otsu and T. Kurita . A new scheme for practical flexible and intelligent vision systems . In IAPR Workshop on Computer Vision , 1988 . N. Otsu and T. Kurita. A new scheme for practical flexible and intelligent vision systems. In IAPR Workshop on Computer Vision, 1988."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.3115\/1073083.1073135"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.5555\/1888089.1888101"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2011.5995711"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1108\/eb026562"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2011.5995504"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/MUE.2011.22"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2008.128"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/2072298.2072058"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2010.5540018"},{"key":"e_1_3_2_1_33_1","volume-title":"CVPR","author":"Wang X.-J.","year":"2010","unstructured":"X.-J. Wang , L. Zhang , M. Liu , Y. Li , and W.-Y. Ma . ARISTA - Image Search to Annotation on Billions of Web Photos . In CVPR , 2010 . X.-J. Wang, L. Zhang, M. Liu, Y. Li, and W.-Y. Ma. ARISTA - Image Search to Annotation on Billions of Web Photos. In CVPR, 2010."},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.5555\/2283696.2283856"},{"key":"e_1_3_2_1_35_1","volume-title":"EMNLP","author":"Yang Y.","year":"2011","unstructured":"Y. Yang , C. L. Teo , H. Daum\u00e9 III, and Y. Aloimonos . Corpus-Guided Sentence Generation of Natural Images . In EMNLP , 2011 . Y. Yang, C. L. Teo, H. Daum\u00e9 III, and Y. Aloimonos. Corpus-Guided Sentence Generation of Natural Images. In EMNLP, 2011."},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/JPROC.2010.2050411"}],"event":{"name":"MM '12: ACM Multimedia Conference","sponsor":["SIGMM ACM Special Interest Group on Multimedia"],"location":"Nara Japan","acronym":"MM '12"},"container-title":["Proceedings of the 20th ACM international conference on Multimedia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2393347.2393424","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2393347.2393424","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T09:34:46Z","timestamp":1750239286000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2393347.2393424"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,10,29]]},"references-count":35,"alternative-id":["10.1145\/2393347.2393424","10.1145\/2393347"],"URL":"https:\/\/doi.org\/10.1145\/2393347.2393424","relation":{},"subject":[],"published":{"date-parts":[[2012,10,29]]},"assertion":[{"value":"2012-10-29","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}