{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,30]],"date-time":"2025-07-30T13:06:27Z","timestamp":1753880787936,"version":"3.41.2"},"reference-count":42,"publisher":"World Scientific Pub Co Pte Ltd","issue":"01","funder":[{"DOI":"10.13039\/501100001809","name":"NSFC","doi-asserted-by":"crossref","award":["62176027"],"award-info":[{"award-number":["62176027"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100004608","name":"Natural Science Foundation of Jiangsu Province","doi-asserted-by":"crossref","award":["BK20200462"],"award-info":[{"award-number":["BK20200462"]}],"id":[{"id":"10.13039\/501100004608","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Natural Science Foundation of Chongqing, China","award":["cstc2020jcyj-zdxmX0014"],"award-info":[{"award-number":["cstc2020jcyj-zdxmX0014"]}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["62201434"],"award-info":[{"award-number":["62201434"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Central Universities","award":["XJS222211"],"award-info":[{"award-number":["XJS222211"]}]},{"name":"Key Fund Project of the Ministry of Education","award":["8091B012207"],"award-info":[{"award-number":["8091B012207"]}]},{"name":"Human Resources and Social Security Bureau","award":["cx2020073"],"award-info":[{"award-number":["cx2020073"]}]},{"name":"Guangdong Oppo Mobile Telecommunications Corporation Ltd.,","award":["H20221694"],"award-info":[{"award-number":["H20221694"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Patt. Recogn. Artif. Intell."],"published-print":{"date-parts":[[2024,1]]},"abstract":"<jats:p> In this paper, a parallel decoder and a word group prediction module are proposed to speed up decoding and improve the effect of captions. The features of the image extracted by the encoder are linearly projected to different word groups, and then a unique relaxed mask matrix is designed to improve the decoding speed and the caption effect. First, since image captioning is composed of many words, sentences can also be broken down into word groups or words according to their syntactic structure, and we achieve this function through constituency parsing. Second, we make full use of the extracted features to predict the size of word groups. Then, a new embedding representing the information of the word is proposed based on word embedding. Finally, with the help of word groups, we design a mask matrix to modify the decoding process so that each step of the model can produce one or more words in parallel. Experiments on public datasets demonstrate that our method can reduce the time complexity while maintaining competitive performance. <\/jats:p>","DOI":"10.1142\/s0218001423540290","type":"journal-article","created":{"date-parts":[[2023,12,17]],"date-time":"2023-12-17T13:30:27Z","timestamp":1702819827000},"source":"Crossref","is-referenced-by-count":0,"title":["Transformer with a Parallel Decoder for Image Captioning"],"prefix":"10.1142","volume":"38","author":[{"ORCID":"https:\/\/orcid.org\/0009-0001-2603-2145","authenticated-orcid":false,"given":"Peilang","family":"Wei","sequence":"first","affiliation":[{"name":"College of Computer Science, Chongqing University, Chongqing 400044, P. R. China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8780-5455","authenticated-orcid":false,"given":"Xu","family":"Liu","sequence":"additional","affiliation":[{"name":"Academy of Advanced Interdisciplinary Research, Xidian University, Xi\u2019an 710071, P. R. China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1314-5631","authenticated-orcid":false,"given":"Jun","family":"Luo","sequence":"additional","affiliation":[{"name":"School of Mechanical and Vehicle Engineering, Chongqing University, Chongqing 400044, P. R. China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9830-3955","authenticated-orcid":false,"given":"Huayan","family":"Pu","sequence":"additional","affiliation":[{"name":"School of Mechanical and Vehicle Engineering, Chongqing University, Chongqing 400044, P. R. China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4635-6112","authenticated-orcid":false,"given":"Xiaoxu","family":"Huang","sequence":"additional","affiliation":[{"name":"College of Materials Science and Engineering, Chongqing University, Chongqing 400044, P. R. China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3321-027X","authenticated-orcid":false,"given":"Shilong","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Mechanical and Vehicle Engineering, Chongqing University, Chongqing 400044, P. R. China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6617-0473","authenticated-orcid":false,"given":"Huajun","family":"Cao","sequence":"additional","affiliation":[{"name":"School of Mechanical and Vehicle Engineering, Chongqing University, Chongqing 400044, P. R. China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0002-4491-3701","authenticated-orcid":false,"given":"Shouhong","family":"Yang","sequence":"additional","affiliation":[{"name":"School of Marxism Studies, Chongqing University, Chongqing 400044, P. R. China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5261-8580","authenticated-orcid":false,"given":"Xu","family":"Zhuang","sequence":"additional","affiliation":[{"name":"OPPO Inc., Chengdu 610000, P. R. China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0003-1875-7693","authenticated-orcid":false,"given":"Jason","family":"Wang","sequence":"additional","affiliation":[{"name":"OPPO Inc., Chengdu 610000, P. R. China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0008-9371-7700","authenticated-orcid":false,"given":"Hong","family":"Yue","sequence":"additional","affiliation":[{"name":"CICT Connected and Intelligent Technologies Co., Ltd, Chongqing 400044, P. R. China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2525-8070","authenticated-orcid":false,"given":"Cheng","family":"Ji","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Nanjing University of Science and Technology, Jiangsu 210094, P. R. China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1874-3641","authenticated-orcid":false,"given":"Mingliang","family":"Zhou","sequence":"additional","affiliation":[{"name":"College of Computer Science, Chongqing University, Chongqing 400044, P. R. China"}]}],"member":"219","published-online":{"date-parts":[[2024,1,29]]},"reference":[{"key":"S0218001423540290BIB001","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46454-1_24"},{"key":"S0218001423540290BIB002","first-page":"65","volume-title":"Proc. ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and\/or Summarization","author":"Banerjee S.","year":"2005"},{"key":"S0218001423540290BIB004","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01059"},{"key":"S0218001423540290BIB005","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-58601-0_42"},{"key":"S0218001423540290BIB008","doi-asserted-by":"publisher","DOI":"10.1145\/3394171.3413901"},{"key":"S0218001423540290BIB009","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v35i2.16219"},{"key":"S0218001423540290BIB012","doi-asserted-by":"publisher","DOI":"10.1002\/int.22955"},{"key":"S0218001423540290BIB015","first-page":"606","volume-title":"Proc. AAAI Conf. Artificial Intelligence","author":"Gupta A.","year":"2012"},{"key":"S0218001423540290BIB016","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"S0218001423540290BIB017","doi-asserted-by":"publisher","DOI":"10.1002\/int.22819"},{"key":"S0218001423540290BIB018","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00473"},{"key":"S0218001423540290BIB019","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v35i2.16258"},{"key":"S0218001423540290BIB020","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298932"},{"key":"S0218001423540290BIB022","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2012.162"},{"key":"S0218001423540290BIB024","first-page":"74","volume-title":"Text Summarization Branches Out","author":"Lin C.-Y.","year":"2004"},{"key":"S0218001423540290BIB025","first-page":"6850","volume-title":"Advances in Neural Information Processing Systems","author":"Liu F.","year":"2019"},{"key":"S0218001423540290BIB027","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.100"},{"key":"S0218001423540290BIB028","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.345"},{"key":"S0218001423540290BIB029","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v35i3.16328"},{"key":"S0218001423540290BIB031","first-page":"747","volume-title":"Proc. 13th Conf. European Chapter of the Association for Computational Linguistics","author":"Mitchell M.","year":"2012"},{"key":"S0218001423540290BIB032","doi-asserted-by":"publisher","DOI":"10.1142\/S0218001421600090"},{"key":"S0218001423540290BIB033","doi-asserted-by":"publisher","DOI":"10.1142\/S021800142255014X"},{"key":"S0218001423540290BIB034","first-page":"1143","volume-title":"Advances in Neural Information Processing Systems","author":"Ordonez V.","year":"2011"},{"key":"S0218001423540290BIB035","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01098"},{"key":"S0218001423540290BIB036","first-page":"311","volume-title":"Proc. 40th Annual Meeting of the Association for Computational Linguistics","author":"Papineni K.","year":"2002"},{"key":"S0218001423540290BIB037","doi-asserted-by":"publisher","DOI":"10.1142\/S0218001421520029"},{"key":"S0218001423540290BIB038","first-page":"91","volume-title":"Advances in Neural Information Processing Systems","author":"Ren S.","year":"2015"},{"key":"S0218001423540290BIB039","doi-asserted-by":"publisher","DOI":"10.1002\/int.23082"},{"key":"S0218001423540290BIB040","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.131"},{"key":"S0218001423540290BIB041","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.306"},{"key":"S0218001423540290BIB042","first-page":"5998","volume-title":"Advances in Neural Information Processing Systems","author":"Vaswani A.","year":"2017"},{"key":"S0218001423540290BIB043","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7299087"},{"key":"S0218001423540290BIB044","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298935"},{"key":"S0218001423540290BIB045","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2016.2587640"},{"key":"S0218001423540290BIB046","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.33015377"},{"key":"S0218001423540290BIB047","doi-asserted-by":"publisher","DOI":"10.1155\/2020\/3062706"},{"key":"S0218001423540290BIB048","first-page":"2048","volume-title":"Int. Conf. Machine Learning","author":"Xu K.","year":"2015"},{"key":"S0218001423540290BIB049","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v35i4.16421"},{"key":"S0218001423540290BIB050","doi-asserted-by":"publisher","DOI":"10.1007\/s00500-021-06622-3"},{"key":"S0218001423540290BIB051","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.503"},{"key":"S0218001423540290BIB052","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.01521"},{"key":"S0218001423540290BIB053","doi-asserted-by":"publisher","DOI":"10.1109\/ICCVW54120.2021.00350"}],"container-title":["International Journal of Pattern Recognition and Artificial Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0218001423540290","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,2,29]],"date-time":"2024-02-29T07:53:33Z","timestamp":1709193213000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/10.1142\/S0218001423540290"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,1]]},"references-count":42,"journal-issue":{"issue":"01","published-print":{"date-parts":[[2024,1]]}},"alternative-id":["10.1142\/S0218001423540290"],"URL":"https:\/\/doi.org\/10.1142\/s0218001423540290","relation":{},"ISSN":["0218-0014","1793-6381"],"issn-type":[{"type":"print","value":"0218-0014"},{"type":"electronic","value":"1793-6381"}],"subject":[],"published":{"date-parts":[[2024,1]]},"article-number":"2354029"}}