{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,30]],"date-time":"2025-07-30T13:38:54Z","timestamp":1753882734560,"version":"3.41.2"},"reference-count":42,"publisher":"World Scientific Pub Co Pte Ltd","issue":"05","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Patt. Recogn. Artif. Intell."],"published-print":{"date-parts":[[2025,4]]},"abstract":"<jats:p> Generating natural language descriptions of an image, namely image captioning, has received much attention in computer vision and natural language processing. Recent image captioning models are mainly based on the encoder-decoder framework in which visual information is extracted by an encoder, e.g. using convolutional neural network (CNN), and captions are generated by a decoder, e.g. using recurrent neural network (RNN). Although this framework is promising for image captioning, there are still issues in the RNN decoder for exploiting the visual information to generate grammatically and semantically correct captions. More specifically, the RNN decoder has limited ability in dealing with long-term complex dependencies, leading to ineffective use of contextual information from the encoded data. To address this issue, in this paper, we introduce a multi-layer gated recurrent unit (ML-GRU) within the conventional RNN decoder, which enables the modulation of the relevant information flow inside the unit, and thus leads to the generation of semantically coherent captions. The proposed ML-GRU-based RNN decoder has been extensively evaluated on the MSCOCO dataset, and experimental results demonstrate the advantage of our proposed approach over the state-of-the-art approaches across multiple performance metrics. <\/jats:p>","DOI":"10.1142\/s0218001424540181","type":"journal-article","created":{"date-parts":[[2024,11,15]],"date-time":"2024-11-15T13:02:24Z","timestamp":1731675744000},"source":"Crossref","is-referenced-by-count":1,"title":["Multi-Layer Gated Recurrent Unit-Based Recurrent Neural Network for Image Captioning"],"prefix":"10.1142","volume":"39","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3389-3867","authenticated-orcid":false,"given":"\u00d6zkan","family":"\u00c7ayl\u0131","sequence":"first","affiliation":[{"name":"Centre for Vision, Speech and Signal Processing, University of Surrey, Guildford, GU2 7XH, United Kingdom"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3164-1981","authenticated-orcid":false,"given":"Volkan","family":"K\u0131l\u0131\u00e7","sequence":"additional","affiliation":[{"name":"Department of Electrical and Electronics Engineering, \u0130zmir Katip \u00c7elebi University, \u00c7i\u011fli 35620, T\u00fcrkiye"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9434-5880","authenticated-orcid":false,"given":"Aytu\u011f","family":"Onan","sequence":"additional","affiliation":[{"name":"Department of Computer Engineering, \u0130zmir Katip \u00c7elebi University, \u00c7i\u011fli 35620, T\u00fcrkiye"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8393-5703","authenticated-orcid":false,"given":"Wenwu","family":"Wang","sequence":"additional","affiliation":[{"name":"Centre for Vision, Speech and Signal Processing, University of Surrey, Guildford, GU2 7XH, United Kingdom"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"219","published-online":{"date-parts":[[2025,4,28]]},"reference":[{"doi-asserted-by":"publisher","key":"S0218001424540181BIB001","DOI":"10.1007\/978-3-319-46454-1_24"},{"issue":"35","key":"S0218001424540181BIB002","first-page":"380","author":"Ayd\u0131n S.","year":"2022","journal-title":"Euro. J. Sci. Technol."},{"issue":"26","key":"S0218001424540181BIB003","first-page":"191","author":"Baran M.","year":"2021","journal-title":"Euro. J. Sci. Technol."},{"doi-asserted-by":"publisher","key":"S0218001424540181BIB004","DOI":"10.1109\/CVPR.2017.195"},{"doi-asserted-by":"publisher","key":"S0218001424540181BIB007","DOI":"10.1109\/CVPR.2015.7298878"},{"issue":"32","key":"S0218001424540181BIB008","first-page":"221","author":"Fetiler B.","year":"2021","journal-title":"Euro. J. Sci. Technol."},{"doi-asserted-by":"publisher","key":"S0218001424540181BIB009","DOI":"10.1109\/CVPR.2017.108"},{"doi-asserted-by":"publisher","key":"S0218001424540181BIB010","DOI":"10.1007\/978-3-030-58520-4_25"},{"doi-asserted-by":"publisher","key":"S0218001424540181BIB011","DOI":"10.1109\/CVPR.2016.90"},{"doi-asserted-by":"publisher","key":"S0218001424540181BIB012","DOI":"10.1002\/cpe.6866"},{"doi-asserted-by":"publisher","key":"S0218001424540181BIB013","DOI":"10.1162\/neco.1997.9.8.1735"},{"doi-asserted-by":"publisher","key":"S0218001424540181BIB014","DOI":"10.1145\/3295748"},{"doi-asserted-by":"publisher","key":"S0218001424540181BIB015","DOI":"10.1109\/ICCV.2015.277"},{"doi-asserted-by":"publisher","key":"S0218001424540181BIB016","DOI":"10.1007\/978-3-030-58621-8_37"},{"doi-asserted-by":"publisher","key":"S0218001424540181BIB017","DOI":"10.1109\/CVPR.2015.7298932"},{"issue":"31","key":"S0218001424540181BIB018","first-page":"461","author":"Keskin R.","year":"2021","journal-title":"Euro. J. Sci. Technol."},{"doi-asserted-by":"publisher","key":"S0218001424540181BIB019","DOI":"10.1109\/SIU53274.2021.9477901"},{"issue":"2","key":"S0218001424540181BIB020","first-page":"181","volume":"4","author":"K\u0131l\u0131\u00e7 V.","year":"2021","journal-title":"Sakarya Univ. J. Comput. Inform. Sci."},{"doi-asserted-by":"publisher","key":"S0218001424540181BIB022","DOI":"10.3115\/1626355.1626389"},{"key":"S0218001424540181BIB024","first-page":"1","volume-title":"Proc. ACL-04 Workshop","volume":"8","author":"Lin C.-Y.","year":"2004"},{"doi-asserted-by":"publisher","key":"S0218001424540181BIB025","DOI":"10.1007\/978-3-319-10602-1_48"},{"doi-asserted-by":"publisher","key":"S0218001424540181BIB027","DOI":"10.23919\/ELECO47770.2019.8990630"},{"doi-asserted-by":"publisher","key":"S0218001424540181BIB028","DOI":"10.23919\/ELECO47770.2019.8990395"},{"doi-asserted-by":"publisher","key":"S0218001424540181BIB030","DOI":"10.1109\/CVPR.2018.00896"},{"doi-asserted-by":"publisher","key":"S0218001424540181BIB031","DOI":"10.1109\/TNNLS.2020.2979670"},{"key":"S0218001424540181BIB032","first-page":"311","volume-title":"Proc. 40th Annual Meeting on Association for Computational Linguistics","author":"Papineni K.","year":"2002"},{"key":"S0218001424540181BIB033","first-page":"1310","volume-title":"Int. Conf. Machine Learning","author":"Pascanu R.","year":"2013"},{"doi-asserted-by":"publisher","key":"S0218001424540181BIB034","DOI":"10.1109\/ICCV.2015.303"},{"doi-asserted-by":"publisher","key":"S0218001424540181BIB035","DOI":"10.3390\/app9102024"},{"doi-asserted-by":"publisher","key":"S0218001424540181BIB036","DOI":"10.1109\/CVPR.2016.308"},{"doi-asserted-by":"publisher","key":"S0218001424540181BIB037","DOI":"10.1016\/j.neucom.2018.12.026"},{"doi-asserted-by":"publisher","key":"S0218001424540181BIB038","DOI":"10.1017\/S1351324918000098"},{"doi-asserted-by":"publisher","key":"S0218001424540181BIB040","DOI":"10.1109\/CVPR52688.2022.01739"},{"doi-asserted-by":"publisher","key":"S0218001424540181BIB041","DOI":"10.1007\/978-3-030-67835-7_6"},{"issue":"35","key":"S0218001424540181BIB042","first-page":"610","author":"Uslu B.","year":"2022","journal-title":"Euro. J. Sci. Technol."},{"doi-asserted-by":"publisher","key":"S0218001424540181BIB043","DOI":"10.1109\/CVPR.2015.7299087"},{"doi-asserted-by":"publisher","key":"S0218001424540181BIB044","DOI":"10.1109\/CVPR.2015.7298935"},{"doi-asserted-by":"publisher","key":"S0218001424540181BIB046","DOI":"10.1016\/j.neucom.2020.03.087"},{"key":"S0218001424540181BIB047","first-page":"2048","volume-title":"Int. Conf. Machine Learning","author":"Xu K.","year":"2015"},{"doi-asserted-by":"publisher","key":"S0218001424540181BIB050","DOI":"10.1109\/CVPR52729.2023.02247"},{"issue":"4","key":"S0218001424540181BIB051","first-page":"1","volume":"8","author":"Zhang L.","year":"2018","journal-title":"Wiley Interdiscipl. Rev.: Data Min. Knowl. Discov."},{"doi-asserted-by":"publisher","key":"S0218001424540181BIB052","DOI":"10.1109\/CVPR.2018.00907"}],"container-title":["International Journal of Pattern Recognition and Artificial Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0218001424540181","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,4,28]],"date-time":"2025-04-28T01:16:23Z","timestamp":1745802983000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/10.1142\/S0218001424540181"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,4]]},"references-count":42,"journal-issue":{"issue":"05","published-print":{"date-parts":[[2025,4]]}},"alternative-id":["10.1142\/S0218001424540181"],"URL":"https:\/\/doi.org\/10.1142\/s0218001424540181","relation":{},"ISSN":["0218-0014","1793-6381"],"issn-type":[{"type":"print","value":"0218-0014"},{"type":"electronic","value":"1793-6381"}],"subject":[],"published":{"date-parts":[[2025,4]]},"article-number":"2454018"}}