{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:08:09Z","timestamp":1750306089215,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":29,"publisher":"ACM","license":[{"start":{"date-parts":[[2017,8,20]],"date-time":"2017-08-20T00:00:00Z","timestamp":1503187200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001691","name":"Japan Society for the Promotion of Science","doi-asserted-by":"publisher","award":["16K16099"],"award-info":[{"award-number":["16K16099"]}],"id":[{"id":"10.13039\/501100001691","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2017,8,20]]},"DOI":"10.1145\/3106668.3106675","type":"proceedings-article","created":{"date-parts":[[2017,7,31]],"date-time":"2017-07-31T12:10:27Z","timestamp":1501503027000},"page":"39-44","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Learning Food Appearance by a Supervision with Recipe Text"],"prefix":"10.1145","author":[{"given":"Atsushi","family":"Hashimoto","sequence":"first","affiliation":[{"name":"Kyoto University, Yoshida Honmachi, Sakyo-ku, Kyoto, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Takumi","family":"Fujino","sequence":"additional","affiliation":[{"name":"Kyoto University, Yoshida Honmachi, Sakyo-ku, Kyoto, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jun","family":"Harashima","sequence":"additional","affiliation":[{"name":"Cookpad Inc., Shibuya-ku, Tokyo, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Masaaki","family":"Iiyama","sequence":"additional","affiliation":[{"name":"Kyoto University, Yoshida Honmachi, Sakyo-ku, Kyoto, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Michihiko","family":"Minoh","sequence":"additional","affiliation":[{"name":"Kyoto University, Yoshida Honmachi, Sakyo-ku, Kyoto, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2017,8,20]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2007.61"},{"key":"e_1_3_2_1_2_1","volume-title":"Proc. of IJCAI.","author":"Song Young Chol","year":"2016","unstructured":"Young Chol Song , Iftekhar Naim , Abdullah Al Mamun , Kaustubh Kulkarni , Parag Singla , Jiebo Luo , Daniel Gildea , and Henry Kautz . 2016 . Unsupervised Alignment of Actions in Video with Text Descriptions . In Proc. of IJCAI. Young Chol Song, Iftekhar Naim, Abdullah Al Mamun, Kaustubh Kulkarni, Parag Singla, Jiebo Luo, Daniel Gildea, and Henry Kautz. 2016. Unsupervised Alignment of Actions in Video with Text Descriptions. In Proc. of IJCAI."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.309"},{"key":"e_1_3_2_1_4_1","volume-title":"Optimal packing and covering in the plane are NP-complete. Information processing letters 12, 3","author":"Fowler Robert J","year":"1981","unstructured":"Robert J Fowler , Michael S Paterson , and Steven L Tanimoto . 1981. Optimal packing and covering in the plane are NP-complete. Information processing letters 12, 3 ( 1981 ), 133--137. Robert J Fowler, Michael S Paterson, and Steven L Tanimoto. 1981. Optimal packing and covering in the plane are NP-complete. Information processing letters 12, 3 (1981), 133--137."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2011.5995448"},{"key":"e_1_3_2_1_6_1","volume-title":"Proc. of the 10th International Conference on Language Resources and Evaluation.","author":"Harashima Jun","year":"2016","unstructured":"Jun Harashima , Michiaki Ariga , Kenta Murata , and Masayuki Ioki . 2016 . A large-scale recipe and meal data collection as infrastructure for food research . In Proc. of the 10th International Conference on Language Resources and Evaluation. Jun Harashima, Michiaki Ariga, Kenta Murata, and Masayuki Ioki. 2016. A large-scale recipe and meal data collection as infrastructure for food research. In Proc. of the 10th International Conference on Language Resources and Evaluation."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1080\/10447318.2016.1191744"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.494"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298932"},{"key":"e_1_3_2_1_10_1","unstructured":"Andrej Karpathy Armand Joulin and Fei Fei F Li. 2014. Deep fragment embeddings for bidirectional image sentence mapping. In Advances in neural information processing systems. 1889--1897.  Andrej Karpathy Armand Joulin and Fei Fei F Li. 2014. Deep fragment embeddings for bidirectional image sentence mapping. In Advances in neural information processing systems. 1889--1897."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2012.162"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/2370216.2370248"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W15-2206"},{"key":"e_1_3_2_1_15_1","first-page":"37","article-title":"Mapping Video Segments to a Work Flow based on Path Search","volume":"115","author":"Matsumura Yuki","year":"2016","unstructured":"Yuki Matsumura , Atsushi Hashimoto , Shinsuke Mori , Takuya Funatomi , Masaaki Iiyama , and Michihiko Minoh . 2016 . Mapping Video Segments to a Work Flow based on Path Search . SIGMVE Tech. Report 115 , 495 (2016), 37 -- 42 . Yuki Matsumura, Atsushi Hashimoto, Shinsuke Mori, Takuya Funatomi, Masaaki Iiyama, and Michihiko Minoh. 2016. Mapping Video Segments to a Work Flow based on Path Search. SIGMVE Tech. Report 115, 495 (2016), 37--42.","journal-title":"SIGMVE Tech. Report"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.320"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.1999.791201"},{"key":"e_1_3_2_1_18_1","volume-title":"Qiguang Liu, Liang Huang, Henry Kautz, Jiebo Luo, and Daniel Gildea.","author":"Naim Iftekhar","year":"2015","unstructured":"Iftekhar Naim , Young Chol Song , Qiguang Liu, Liang Huang, Henry Kautz, Jiebo Luo, and Daniel Gildea. 2015 . Discriminative unsupervised alignment of natural language instructions with corresponding video segments. North American Chapter of the Association for Computational Linguistics Human Language Technologies ( 2015). Iftekhar Naim, Young Chol Song, Qiguang Liu, Liang Huang, Henry Kautz, Jiebo Luo, and Daniel Gildea. 2015. Discriminative unsupervised alignment of natural language instructions with corresponding video segments. North American Chapter of the Association for Computational Linguistics Human Language Technologies (2015)."},{"key":"e_1_3_2_1_19_1","volume-title":"Proc. of AAAI. 1558--1564","author":"Naim Iftekhar","year":"2014","unstructured":"Iftekhar Naim , Young Chol Song , Qiguang Liu , Henry A Kautz , Jiebo Luo , and Daniel Gildea . 2014 . Unsupervised Alignment of Natural Language Instructions with Video Segments .. In Proc. of AAAI. 1558--1564 . Iftekhar Naim, Young Chol Song, Qiguang Liu, Henry A Kautz, Jiebo Luo, and Daniel Gildea. 2014. Unsupervised Alignment of Natural Language Instructions with Video Segments.. In Proc. of AAAI. 1558--1564."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298668"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2011.6126383"},{"key":"e_1_3_2_1_22_1","unstructured":"Shaoqing Ren Kaiming He Ross Girshick and Jian Sun. 2015. Faster R-CNN: Towards real-time object detection with region proposal networks. In Advances in neural information processing systems. 91--99.  Shaoqing Ren Kaiming He Ross Girshick and Jian Sun. 2015. Faster R-CNN: Towards real-time object detection with region proposal networks. In Advances in neural information processing systems. 91--99."},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2013.61"},{"key":"e_1_3_2_1_24_1","volume-title":"Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556","author":"Simonyan Karen","year":"2014","unstructured":"Karen Simonyan and Andrew Zisserman . 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 ( 2014 ). Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)."},{"volume-title":"Proc","author":"Tsung-Wei Ke","key":"e_1_3_2_1_25_1","unstructured":"Ke Tsung-Wei , Lin Che-Wei , Liu Tyng-Luh , and Geiger Davi . 2016. Variational Convolutional Networks for Human-Centric Annotations . In Proc . of ACCV. Springer . Ke Tsung-Wei, Lin Che-Wei, Liu Tyng-Luh, and Geiger Davi. 2016. Variational Convolutional Networks for Human-Centric Annotations. In Proc. of ACCV. Springer."},{"key":"e_1_3_2_1_26_1","volume-title":"Theo Gevers, and Arnold WM Smeulders.","author":"Uijlings Jasper RR","year":"2013","unstructured":"Jasper RR Uijlings , Koen EA van de Sande , Theo Gevers, and Arnold WM Smeulders. 2013 . Selective search for object recognition. International journal of computer vision 104, 2 (2013), 154--171. Jasper RR Uijlings, Koen EA van de Sande, Theo Gevers, and Arnold WM Smeulders. 2013. Selective search for object recognition. International journal of computer vision 104, 2 (2013), 154--171."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISM.2010.48"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.5555\/2145432.2145484"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2003.1238361"}],"event":{"name":"CEA2017: 9th Workshop on Multimedia for Cooking and Eating Activities in conjunction with The 2017 International Joint Conference on Artificial Intelligence","sponsor":["The International Joint Conferences on Artificial Intelligence, Inc. (IJCAI)"],"location":"Melbourne Australia","acronym":"CEA2017"},"container-title":["Proceedings of the 9th Workshop on Multimedia for Cooking and Eating Activities in conjunction with The 2017 International Joint Conference on Artificial Intelligence"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3106668.3106675","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3106668.3106675","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T03:30:20Z","timestamp":1750217420000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3106668.3106675"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,8,20]]},"references-count":29,"alternative-id":["10.1145\/3106668.3106675","10.1145\/3106668"],"URL":"https:\/\/doi.org\/10.1145\/3106668.3106675","relation":{},"subject":[],"published":{"date-parts":[[2017,8,20]]},"assertion":[{"value":"2017-08-20","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}