{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,25]],"date-time":"2025-09-25T16:58:37Z","timestamp":1758819517355,"version":"3.41.2"},"reference-count":31,"publisher":"World Scientific Pub Co Pte Ltd","issue":"02","funder":[{"DOI":"10.13039\/501100001809","name":"Natural Science Foundation of China","doi-asserted-by":"crossref","award":["51777122"],"award-info":[{"award-number":["51777122"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Comp. Intel. Appl."],"published-print":{"date-parts":[[2020,6]]},"abstract":"<jats:p> Attention-based encoder\u2013decoder framework has greatly improved image caption generation tasks. The attention mechanism plays a transitional role by transforming static image features into sequential captions. To generate reasonable captions, it is of great significance to detect spatial characteristics of images. In this paper, we propose a spatial relational attention approach to consider spatial positions and attributes. Image features are firstly weighted by the attention mechanism. Then they are concatenated with contextual features to form a spatial\u2013visual tensor. The tensor is feature extracted by a fully convolutional network to produce visual concepts for the decoder network. The fully convolutional layers maintain spatial topology of images. Experiments conducted on the three benchmark datasets, namely Flickr8k, Flickr30k and MSCOCO, demonstrate the effectiveness of our proposed approach. Captions generated by the spatial relational attention method precisely capture spatial relations of objects. <\/jats:p>","DOI":"10.1142\/s146902682050011x","type":"journal-article","created":{"date-parts":[[2020,7,17]],"date-time":"2020-07-17T09:02:04Z","timestamp":1594976524000},"source":"Crossref","is-referenced-by-count":3,"title":["Spatial Relational Attention Using Fully Convolutional Networks for Image Caption Generation"],"prefix":"10.1142","volume":"19","author":[{"given":"Teng","family":"Jiang","sequence":"first","affiliation":[{"name":"Department of Automation, Shanghai Jiao Tong University, Shanghai, 200240, P. R. China"}]},{"given":"Liang","family":"Gong","sequence":"additional","affiliation":[{"name":"Department of Automation, Shanghai Jiao Tong University, Shanghai, 200240, P. R. China"}]},{"given":"Yupu","family":"Yang","sequence":"additional","affiliation":[{"name":"Department of Automation, Shanghai Jiao Tong University, Shanghai, 200240, P. R. China"}]}],"member":"219","published-online":{"date-parts":[[2020,6,26]]},"reference":[{"key":"S146902682050011XBIB001","first-page":"2048","volume-title":"Proc. 32th Int. Conf. Machine Learning (JMLR, 2015)","author":"Xu K.","year":"2015"},{"key":"S146902682050011XBIB002","doi-asserted-by":"publisher","DOI":"10.1145\/2964284.2964299"},{"key":"S146902682050011XBIB003","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2016.2642953"},{"key":"S146902682050011XBIB004","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2016.2599174"},{"key":"S146902682050011XBIB005","first-page":"2553","volume-title":"Proc. Adv. Neural Inf. Process. Syst.","author":"Krizhevsky A.","year":"2012"},{"volume-title":"Proc. Int. Conf. Learn. Represent","year":"2015","author":"Simonyan K.","key":"S146902682050011XBIB006"},{"key":"S146902682050011XBIB007","doi-asserted-by":"publisher","DOI":"10.1109\/TNNLS.2016.2582924"},{"volume-title":"Proc. Int. Conf. Learn. Represent.","year":"2015","author":"Bahdanau D.","key":"S146902682050011XBIB008"},{"key":"S146902682050011XBIB009","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2016.7472621"},{"key":"S146902682050011XBIB010","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-15561-1_2"},{"key":"S146902682050011XBIB011","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2012.162"},{"key":"S146902682050011XBIB012","first-page":"359","volume-title":"Proc. 50th Annual Meeting Association for Computational Linguistics (ACL, 2012)","author":"Kuznetsova P.","year":"2012"},{"key":"S146902682050011XBIB013","first-page":"790","volume-title":"Proc. 51st Annual Meeting Association for Computational Linguistics (ACL, 2013)","author":"Kuznetsova P.","year":"2013"},{"key":"S146902682050011XBIB014","first-page":"595","volume-title":"Proc. 31th Int. Conf. Machine Learning (JMLR, 2014)","author":"Kiros R.","year":"2014"},{"volume-title":"Proc. Int. Conf. Learn. Represent.","year":"2015","author":"Mao J.","key":"S146902682050011XBIB015"},{"key":"S146902682050011XBIB016","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298878"},{"key":"S146902682050011XBIB017","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298935"},{"key":"S146902682050011XBIB018","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298932"},{"key":"S146902682050011XBIB019","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.503"},{"key":"S146902682050011XBIB020","doi-asserted-by":"publisher","DOI":"10.1145\/3126686.3126717"},{"key":"S146902682050011XBIB021","first-page":"2361","volume-title":"Proc. Adv. Neural Inf. Process. Syst.","author":"Yang Z.","year":"2016"},{"key":"S146902682050011XBIB022","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.345"},{"key":"S146902682050011XBIB023","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00834"},{"key":"S146902682050011XBIB024","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-10590-1_53"},{"key":"S146902682050011XBIB025","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"S146902682050011XBIB026","doi-asserted-by":"publisher","DOI":"10.1613\/jair.3994"},{"key":"S146902682050011XBIB027","first-page":"311","volume-title":"Proc. 40th Annu. Meeting Association for Computational Linguistics (ACL, 2002)","author":"Papineni K.","year":"2002"},{"key":"S146902682050011XBIB028","first-page":"65","volume-title":"Proc. ACL Workshop (ACL, 2005)","author":"Banerjee S.","year":"2005"},{"key":"S146902682050011XBIB029","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7299087"},{"volume-title":"Proc. Int. Conf. Learn. Represent.","year":"2015","author":"Kingma D. P.","key":"S146902682050011XBIB030"},{"issue":"1","key":"S146902682050011XBIB031","first-page":"1929","volume":"15","author":"Srivastava N.","year":"2014","journal-title":"J. Mach. Learn. Res."}],"container-title":["International Journal of Computational Intelligence and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S146902682050011X","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,10,10]],"date-time":"2022-10-10T03:17:20Z","timestamp":1665371840000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/10.1142\/S146902682050011X"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,6]]},"references-count":31,"journal-issue":{"issue":"02","published-print":{"date-parts":[[2020,6]]}},"alternative-id":["10.1142\/S146902682050011X"],"URL":"https:\/\/doi.org\/10.1142\/s146902682050011x","relation":{},"ISSN":["1469-0268","1757-5885"],"issn-type":[{"type":"print","value":"1469-0268"},{"type":"electronic","value":"1757-5885"}],"subject":[],"published":{"date-parts":[[2020,6]]},"article-number":"2050011"}}