{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,27]],"date-time":"2025-10-27T16:20:52Z","timestamp":1761582052724,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":15,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,12,18]],"date-time":"2020-12-18T00:00:00Z","timestamp":1608249600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,12,18]]},"DOI":"10.1145\/3443279.3443292","type":"proceedings-article","created":{"date-parts":[[2021,2,1]],"date-time":"2021-02-01T22:50:44Z","timestamp":1612219844000},"page":"125-130","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Natural Language Model for Image Caption"],"prefix":"10.1145","author":[{"given":"Chunjie","family":"Guo","sequence":"first","affiliation":[{"name":"Nanjing University of Aeronautics and Astronautics, Nanjing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Weiwei","family":"Yang","sequence":"additional","affiliation":[{"name":"Nanjing University of Aeronautics and Nanjing Astronautics, Nanjing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Huan","family":"Peng","sequence":"additional","affiliation":[{"name":"University of Aeronautics and Astronautics, Nanjing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2021,2]]},"reference":[{"key":"e_1_3_2_1_1_1","first-page":"2048","volume-title":"Ryan et al., \"Show, attend and tell: neural image caption generation with visual attention,\" in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Xu K.","year":"2015","unstructured":"K. 
Xu , J. Ba , K. Ryan et al., \"Show, attend and tell: neural image caption generation with visual attention,\" in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pp. 2048 -- 2057 , Boston, MA , USA , June 2015 . K. Xu, J. Ba, K. Ryan et al., \"Show, attend and tell: neural image caption generation with visual attention,\" in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2048--2057, Boston, MA, USA, June 2015."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298935"},{"key":"e_1_3_2_1_3_1","volume-title":"USA","author":"Anderson P.","year":"2018","unstructured":"P. Anderson , X. He , C. Buehler et al., \"Bottom-up and top-down attention for image captioning,\" in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT , USA , June 2018 . P. Anderson, X. He, C. Buehler et al., \"Bottom-up and top-down attention for image captioning,\" in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, June 2018."},{"key":"e_1_3_2_1_4_1","first-page":"1","volume":"99","author":"Park C. C.","year":"2018","unstructured":"C. C. Park , B. Kim , and G. Kim , \"Towards personalized image captioning via multimodal memory networks,\" IEEE Transactions on Pattern Analysis and Machine Intelligence , vol. 99 , p. 1 , 2018 . C. C. Park, B. Kim, and G. Kim, \"Towards personalized image captioning via multimodal memory networks,\" IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 99, p. 1, 2018.","journal-title":"\"Towards personalized image captioning via multimodal memory networks,\" IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"e_1_3_2_1_5_1","volume-title":"USA","author":"Anderson P.","year":"2018","unstructured":"P. Anderson , X. He , C. 
Buehler et al., \"Bottom-up and top-down attention for image captioning,\" in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT , USA , June 2018 . P. Anderson, X. He, C. Buehler et al., \"Bottom-up and top-down attention for image captioning,\" in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, June 2018."},{"key":"e_1_3_2_1_6_1","volume-title":"USA","author":"Aneja J.","year":"2018","unstructured":"J. Aneja , A. Deshpande , and S. Alexander , \" Convolutional image captioning,\" in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT , USA , June 2018 . J. Aneja, A. Deshpande, and S. Alexander, \"Convolutional image captioning,\" in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, June 2018."},{"key":"e_1_3_2_1_7_1","volume-title":"USA","author":"Yao T.","year":"2016","unstructured":"T. Yao , Y. Pan , Y. Li , Z. Qiu , and T. Mei , \" Boosting image captioning with attributes,\" in Proceedings of the IEEE Conference on International Conference on Computer Vision, pp. 4904--4912, Las Vegas, NV , USA , June 2016 . T. Yao, Y. Pan, Y. Li, Z. Qiu, and T. Mei, \"Boosting image captioning with attributes,\" in Proceedings of the IEEE Conference on International Conference on Computer Vision, pp. 4904--4912, Las Vegas, NV, USA, June 2016."},{"key":"e_1_3_2_1_8_1","volume-title":"Venice","author":"Pedersoli M.","year":"2017","unstructured":"M. Pedersoli , T. Lucas , C. Schmid , and J. Verbeek , \" Areas of attention for image captioning,\" in Proceedings of the IEEE Conference on International Conference on Computer Vision, pp. 1251--1259 , Venice , Italy , October 2017 . M. Pedersoli, T. Lucas, C. Schmid, and J. Verbeek, \"Areas of attention for image captioning,\" in Proceedings of the IEEE Conference on International Conference on Computer Vision, pp. 
1251--1259, Venice, Italy, October 2017."},{"key":"e_1_3_2_1_9_1","first-page":"2506","author":"Tavakoli H. R.","year":"2017","unstructured":"H. R. Tavakoli , R. Shetty , B. Ali , and J. Laaksonen , \"Paying attention to descriptions generated by image captioning models,\" in Proceedings of the IEEE Conference on International Conference on Computer Vision , pp. 2506 - 2515 , Venice, Italy, October 2017 . H. R. Tavakoli, R. Shetty, B. Ali, and J. Laaksonen, \"Paying attention to descriptions generated by image captioning models,\" in Proceedings of the IEEE Conference on International Conference on Computer Vision, pp. 2506-2515, Venice, Italy, October 2017.","journal-title":"\"Paying attention to descriptions generated by image captioning models,\" in Proceedings of the IEEE Conference on International Conference on Computer Vision"},{"key":"e_1_3_2_1_10_1","first-page":"521","volume-title":"Show, adapt and tell: adversarial training of cross-domain image captioner,\" in Proceedings of the IEEE Conference on International Conference on Computer Vision and Pattern Recognition","author":"Chen T.-H.","year":"2017","unstructured":"T.-H. Chen , Y.-H. Liao , C.-Y. Chuang , W.-T. Hsu , J. Fu , and M. Sun , \" Show, adapt and tell: adversarial training of cross-domain image captioner,\" in Proceedings of the IEEE Conference on International Conference on Computer Vision and Pattern Recognition , pp. 521 -- 530 , Honolulu, HI , USA , July 2017 . T.-H. Chen, Y.-H. Liao, C.-Y. Chuang, W.-T. Hsu, J. Fu, and M. Sun, \"Show, adapt and tell: adversarial training of cross-domain image captioner,\" in Proceedings of the IEEE Conference on International Conference on Computer Vision and Pattern Recognition, pp. 521--530, Honolulu, HI, USA, July 2017."},{"key":"e_1_3_2_1_11_1","first-page":"1","volume":"99","author":"Park C. C.","year":"2018","unstructured":"C. C. Park , B. Kim , and G. 
Kim , \"Towards personalized image captioning via multimodal memory networks,\" IEEE Transactions on Pattern Analysis and Machine Intelligence , vol. 99 , p. 1 , 2018 . C. C. Park, B. Kim, and G. Kim, \"Towards personalized image captioning via multimodal memory networks,\" IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 99, p. 1, 2018.","journal-title":"\"Towards personalized image captioning via multimodal memory networks,\" IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"e_1_3_2_1_12_1","volume-title":"USA","author":"Chen X.","year":"2018","unstructured":"X. Chen , Ma Lin , W. Jiang , J. Yao , and W. Liu , \" Regularizing RNNs for caption generation by reconstructing the past with the present,\" in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT , USA , June 2018 . X. Chen, Ma Lin, W. Jiang, J. Yao, and W. Liu, \"Regularizing RNNs for caption generation by reconstructing the past with the present,\" in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, June 2018."},{"key":"e_1_3_2_1_13_1","volume-title":"USA","author":"Zhou R.","year":"2017","unstructured":"R. Zhou , X. Wang , N. Zhang , X. Lv , and L.-J. Li , \"Deep reinforcement learning-based image captioning with embedding reward,\" in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1151--1159, Honolulu, HI , USA , July 2017 . R. Zhou, X. Wang, N. Zhang, X. Lv, and L.-J. Li, \"Deep reinforcement learning-based image captioning with embedding reward,\" in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1151--1159, Honolulu, HI, USA, July 2017."},{"key":"e_1_3_2_1_14_1","volume-title":"USA","author":"You Q.","year":"2018","unstructured":"Q. You , Z. Zhang , and J. 
Luo , \" End-to-end convolutional semantic embeddings,\" in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5735--5744, Salt Lake City, UT , USA , June 2018 . Q. You, Z. Zhang, and J. Luo, \"End-to-end convolutional semantic embeddings,\" in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5735--5744, Salt Lake City, UT, USA, June 2018."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"crossref","unstructured":"S. Yagcioglu E. Erdem A. Erdem and R. Cak\u0131c\u0131 \"A distributed representation based query expansion approach for image captioning \" in Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing vol. 10 no. 3115 Beijing China July 2015.  S. Yagcioglu E. Erdem A. Erdem and R. Cak\u0131c\u0131 \"A distributed representation based query expansion approach for image captioning \" in Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing vol. 10 no. 
3115 Beijing China July 2015.","DOI":"10.3115\/v1\/P15-2018"}],"event":{"name":"NLPIR 2020: 4th International Conference on Natural Language Processing and Information Retrieval","sponsor":["FernUniversit\u00e4t in Hagen"],"location":"Seoul Republic of Korea","acronym":"NLPIR 2020"},"container-title":["Proceedings of the 4th International Conference on Natural Language Processing and Information Retrieval"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3443279.3443292","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3443279.3443292","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:03:03Z","timestamp":1750197783000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3443279.3443292"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,12,18]]},"references-count":15,"alternative-id":["10.1145\/3443279.3443292","10.1145\/3443279"],"URL":"https:\/\/doi.org\/10.1145\/3443279.3443292","relation":{},"subject":[],"published":{"date-parts":[[2020,12,18]]},"assertion":[{"value":"2021-02-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}