{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,17]],"date-time":"2026-06-17T21:43:07Z","timestamp":1781732587565,"version":"3.54.5"},"publisher-location":"New York, NY, USA","reference-count":50,"publisher":"ACM","license":[{"start":{"date-parts":[[2018,10,10]],"date-time":"2018-10-10T00:00:00Z","timestamp":1539129600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2018,10,10]]},"DOI":"10.1145\/3286606.3286863","type":"proceedings-article","created":{"date-parts":[[2018,12,14]],"date-time":"2018-12-14T14:12:50Z","timestamp":1544796770000},"page":"1-6","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":23,"title":["Automatic Caption Generation for Medical Images"],"prefix":"10.1145","author":[{"given":"Imane","family":"Allaouzi","sequence":"first","affiliation":[{"name":"LIST\/FSTT, Abdelmalek Essaadi University Tangier, Morocco"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"M.","family":"Ben Ahmed","sequence":"additional","affiliation":[{"name":"LIST\/FSTT, Abdelmalek Essaadi University Tangier, Morocco"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"B.","family":"Benamrou","sequence":"additional","affiliation":[{"name":"MMC\/FSTT, Abdelmalek Essaadi University Tangier, Morocco"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"M.","family":"Ouardouz","sequence":"additional","affiliation":[{"name":"MMC\/FSTT, Abdelmalek Essaadi University Tangier, Morocco"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2018,10,10]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2012.162"},{"key":"e_1_3_2_1_2_1","unstructured":"Siming Li. Kulkarni Girish Berg Tamara L. Berg Alexander C. and Choi Yejin. 2011. Composing simple image descriptions using web-scale n-grams. In Computational Natural Language Learning. ACL. Siming Li. Kulkarni Girish Berg Tamara L. Berg Alexander C. and Choi Yejin. 2011. Composing simple image descriptions using web-scale n-grams. In Computational Natural Language Learning. ACL."},{"key":"e_1_3_2_1_3_1","unstructured":"Yang Yezhou Teo Ching Lik Daume III Hal and Aloimonos Yiannis. 2011. Corpus-guided sentence generation of natural images. In EMNLP ACL 444--454. Yang Yezhou Teo Ching Lik Daume III Hal and Aloimonos Yiannis. 2011. Corpus-guided sentence generation of natural images. In EMNLP ACL 444--454."},{"key":"e_1_3_2_1_4_1","volume-title":"Midge: Generating image descriptions from computer vision detections. In European","author":"Mitchell","year":"2012","unstructured":"Mitchell , Margaret, Han , Xufeng, Dodge , Jesse, Mensch , Alyssa, Goyal , Amit , Berg, Alex, Yamaguchi , Kota, Berg , Tamara, Stratos, Karl, and Daume III, Hal. 2012 . Midge: Generating image descriptions from computer vision detections. In European Chapter of the Association for Computational Linguistics. ACL, 747--756. Mitchell, Margaret, Han, Xufeng, Dodge, Jesse, Mensch, Alyssa, Goyal, Amit, Berg, Alex, Yamaguchi, Kota, Berg, Tamara, Stratos, Karl, and Daume III, Hal. 2012. Midge: Generating image descriptions from computer vision detections. In European Chapter of the Association for Computational Linguistics. ACL, 747--756."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"crossref","unstructured":"Elliott Desmond and Keller Frank. 2013. Image description using visual dependency representations. In EMNLP. Elliott Desmond and Keller Frank. 2013. Image description using visual dependency representations. In EMNLP.","DOI":"10.18653\/v1\/D13-1128"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.5555\/2566972.2566993"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00177"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.5555\/1888089.1888092"},{"key":"e_1_3_2_1_9_1","volume-title":"TREETALK: Composition and Compression of Trees for Image Descriptions, TACL","author":"Kuznetsova P.","year":"2014","unstructured":"Kuznetsova , P. , Ordonez , V. , Berg , T. L. , and Choi, Y. 2014 . TREETALK: Composition and Compression of Trees for Image Descriptions, TACL , vol. 2 , no. 10, 351--362. Kuznetsova, P., Ordonez, V., Berg, T. L., and Choi, Y. 2014. TREETALK: Composition and Compression of Trees for Image Descriptions, TACL, vol. 2, no. 10, 351--362."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"crossref","unstructured":"Cho K. Merrienboer B. Gulcehre C. Bougares F. Schwenk H. and Bengio Y. 2014. Learning phrase representations using RNN encoder-decoder for statistical machine translation. In EMNLP. Cho K. Merrienboer B. Gulcehre C. Bougares F. Schwenk H. and Bengio Y. 2014. Learning phrase representations using RNN encoder-decoder for statistical machine translation. In EMNLP.","DOI":"10.3115\/v1\/D14-1179"},{"key":"e_1_3_2_1_11_1","volume-title":"NIPS'14 Proceedings of the 27th International Conference on Neural Information Processing Systems","volume":"2","author":"Sutskever I.","unstructured":"Sutskever , I. , Vinyals , O. , and Le , Q. V . 2014. Sequence to sequence learning with neural networks , in NIPS'14 Proceedings of the 27th International Conference on Neural Information Processing Systems , vol. 2 , 3104--3112. Sutskever, I., Vinyals, O., and Le, Q. V. 2014. Sequence to sequence learning with neural networks, in NIPS'14 Proceedings of the 27th International Conference on Neural Information Processing Systems, vol. 2, 3104--3112."},{"key":"e_1_3_2_1_12_1","volume-title":"NIPS'12 Proceedings of the 25th International Conference on Neural Information Processing Systems","volume":"1","author":"Krizhevsky A.","unstructured":"Krizhevsky , A. , Sutskever , I. , and Hinton , G. E . 2012. Imagenet classification with deep convolutional neural networks , in NIPS'12 Proceedings of the 25th International Conference on Neural Information Processing Systems , vol. 1 , 1097--1105. Krizhevsky, A., Sutskever, I., and Hinton, G. E. 2012. Imagenet classification with deep convolutional neural networks, in NIPS'12 Proceedings of the 25th International Conference on Neural Information Processing Systems, vol. 1, 1097--1105."},{"key":"e_1_3_2_1_13_1","unstructured":"Simonyan K. and Zisserman A. 2014. Very deep convolutional networks for large-scale image recognition arXiv preprint arXiv:1409.1556 Simonyan K. and Zisserman A. 2014. Very deep convolutional networks for large-scale image recognition arXiv preprint arXiv:1409.1556"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/2671188.2749391"},{"key":"e_1_3_2_1_15_1","unstructured":"Gong Y. Jia Y. Leung T. Toshev A. Ioffe S. 2013. Deep convolutional ranking for multilabel image annotation arXiv:1312.4894. Gong Y. Jia Y. Leung T. Toshev A. Ioffe S. 2013. Deep convolutional ranking for multilabel image annotation arXiv:1312.4894."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.251"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298878"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298935"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.5555\/2566972.2566993"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00166"},{"key":"e_1_3_2_1_21_1","volume-title":"Computer Vision-ECCV 2014","author":"Lin TY.","unstructured":"Lin , TY. , Maire , M. , Belongie , S , Hays , J. , Perona , P. , Ramanan , D. , Microsoft COCO : common objects in context . In: Fleet D, Pajdla T, Schiele B, Tuytelaars T, editors. Computer Vision-ECCV 2014 ; New York. Springer , 740--755. Lin, TY., Maire, M., Belongie, S, Hays, J., Perona, P., Ramanan, D., et al. 2014.Microsoft COCO: common objects in context. In: Fleet D, Pajdla T, Schiele B, Tuytelaars T, editors. Computer Vision-ECCV 2014; New York. Springer, 740--755."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCT.2014.74"},{"key":"e_1_3_2_1_23_1","volume-title":"Y.","author":"Lecun Y.","year":"1998","unstructured":"Lecun , Y. , and Bottou , L. , and Bengio , Y. , and Haffner, P. 1998 . Gradient-based learning applied to document recognition, Proceedings of the IEEE , 2278--2324. Lecun, Y., and Bottou, L., and Bengio, Y., and Haffner, P. 1998. Gradient-based learning applied to document recognition, Proceedings of the IEEE, 2278--2324."},{"key":"e_1_3_2_1_24_1","unstructured":"Simonyan K. and Zisserman A. 2014. 'Very Deep Convolutional Networks for Large-Scale Image Recognition'. arXiv preprint arXiv:1409.1556. Simonyan K. and Zisserman A. 2014. 'Very Deep Convolutional Networks for Large-Scale Image Recognition'. arXiv preprint arXiv:1409.1556."},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"crossref","unstructured":"Szegedy C. etal 2014. Going Deeper with Convolutions arXiv preprint arXiv:1409.4886. Szegedy C. et al. 2014. Going Deeper with Convolutions arXiv preprint arXiv:1409.4886.","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"e_1_3_2_1_26_1","unstructured":"Kaiming He. Zhang X. Ren S. and Sun J. 2015. Deep Residual Learning for Image Recognition. arXiv preprint arXiv:1512.03385. Kaiming He. Zhang X. Ren S. and Sun J. 2015. Deep Residual Learning for Image Recognition. arXiv preprint arXiv:1512.03385."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1162\/neco.1997.9.8.1735"},{"key":"e_1_3_2_1_28_1","volume-title":"Proceedings of the 32nd International Conference on Machine Learning (ICML-15)","author":"Jozefowicz R.","unstructured":"Jozefowicz , R. , Zaremba , W. , and Sutskever , I . 2015. An empirical exploration of recurrent network architectures , in Proceedings of the 32nd International Conference on Machine Learning (ICML-15) , 2342--2350. Jozefowicz, R., Zaremba, W., and Sutskever, I. 2015. An empirical exploration of recurrent network architectures, in Proceedings of the 32nd International Conference on Machine Learning (ICML-15), 2342--2350."},{"key":"e_1_3_2_1_29_1","volume-title":"International Conference on Security, Pattern Analysis, and Cybernetics (SPAC), Shenzhen, 515--519","author":"Wu L.","unstructured":"Wu , L. , Wan , C. , Wu , Y. , and Liu , J . 2017. Generative caption for diabetic retinopathy images , International Conference on Security, Pattern Analysis, and Cybernetics (SPAC), Shenzhen, 515--519 . Wu, L., Wan, C., Wu, Y., and Liu, J. 2017. Generative caption for diabetic retinopathy images, International Conference on Security, Pattern Analysis, and Cybernetics (SPAC), Shenzhen, 515--519."},{"key":"e_1_3_2_1_30_1","volume-title":"CLEF2018 Working Notes. CEUR Workshop Proceedings","author":"Rahman M.M.","year":"2018","unstructured":"Rahman , M.M. 2018 . A cross modal deep learning based approach for caption prediction and concept detection by cs morgan state . In: CLEF2018 Working Notes. CEUR Workshop Proceedings , Avignon, France, CEUR-WS.org Rahman, M.M. 2018. A cross modal deep learning based approach for caption prediction and concept detection by cs morgan state. In: CLEF2018 Working Notes. CEUR Workshop Proceedings, Avignon, France, CEUR-WS.org"},{"key":"e_1_3_2_1_31_1","unstructured":"Lyndon D. Kumar A. Kim J. 2017. Neural captioning for the ImageCLEF 2017 medical image challenges. Lyndon D. Kumar A. Kim J. 2017. Neural captioning for the ImageCLEF 2017 medical image challenges."},{"key":"e_1_3_2_1_32_1","unstructured":"Hasan S.A. Ling Y. Liu J. Sreenivasan R. Anand S. Arora T. Datla V.V. Lee K. Qadir A. Swisher C. Farri O. 2017. PRNA at ImageCLEF 2017 caption prediction and concept detection tasks. Hasan S.A. Ling Y. Liu J. Sreenivasan R. Anand S. Arora T. Datla V.V. Lee K. Qadir A. Swisher C. Farri O. 2017. PRNA at ImageCLEF 2017 caption prediction and concept detection tasks."},{"key":"e_1_3_2_1_33_1","volume-title":"CLEF2018 Working Notes. CEUR Workshop Proceedings","author":"Su Y.","year":"2018","unstructured":"Su , Y. , Liu , F. 2018 . UMass at ImageCLEF caption prediction 2018 task . In: CLEF2018 Working Notes. CEUR Workshop Proceedings , Avignon, France, CEUR-WS.org Su, Y., Liu, F. 2018. UMass at ImageCLEF caption prediction 2018 task. In: CLEF2018 Working Notes. CEUR Workshop Proceedings, Avignon, France, CEUR-WS.org"},{"key":"e_1_3_2_1_34_1","unstructured":"Jing B. Xie P. Xing E. 2017. On the automatic generation of medical imaging reports. arXiv preprint arXiv:1711.08195 Jing B. Xie P. Xing E. 2017. On the automatic generation of medical imaging reports. arXiv preprint arXiv:1711.08195"},{"key":"e_1_3_2_1_35_1","volume-title":"Tienet: Text-image embedding network for common thorax disease classification and reporting in chest x-rays. arXiv preprint arXiv:1801.04334, CVPR.","author":"Wang X.","year":"2018","unstructured":"Wang , X. , Peng , Y. , Lu , L. , Lu , Z. , and Summers , R. M . 2018 . Tienet: Text-image embedding network for common thorax disease classification and reporting in chest x-rays. arXiv preprint arXiv:1801.04334, CVPR. Wang, X., Peng, Y., Lu, L., Lu, Z., and Summers, R. M. 2018. Tienet: Text-image embedding network for common thorax disease classification and reporting in chest x-rays. arXiv preprint arXiv:1801.04334, CVPR."},{"key":"e_1_3_2_1_36_1","volume-title":"CLEF2017 working notes, CEUR.","author":"Ben Abacha A.","year":"2017","unstructured":"Ben Abacha , A. , Garc\u00eda Seco de Herrera, A., Gayen , S. , Demner-Fushman , D. , Antani , S. 2017 . NLM at ImageCLEF 2017 caption task . CLEF2017 working notes, CEUR. Ben Abacha, A., Garc\u00eda Seco de Herrera, A., Gayen, S., Demner-Fushman, D., Antani, S. 2017. NLM at ImageCLEF 2017 caption task. CLEF2017 working notes, CEUR."},{"key":"e_1_3_2_1_37_1","unstructured":"PubMed Homepage https:\/\/www.ncbi.nlm.nih.gov\/pmc\/ last accessed 2018\/5\/30. PubMed Homepage https:\/\/www.ncbi.nlm.nih.gov\/pmc\/ last accessed 2018\/5\/30."},{"key":"e_1_3_2_1_38_1","volume-title":"CLEF2018 Working Notes. CEUR Workshop Proceedings","author":"Wang X.","year":"2018","unstructured":"Wang , X. , Zhang , Y. , Guo , Z. , Li , J. 2018 . ImageSem at ImageCLEF 2018 caption task: Image retrieval and transfer learning . In: CLEF2018 Working Notes. CEUR Workshop Proceedings , Avignon, France, CEUR-WS.org. Wang, X., Zhang, Y., Guo, Z., Li, J. 2018. ImageSem at ImageCLEF 2018 caption task: Image retrieval and transfer learning. In: CLEF2018 Working Notes. CEUR Workshop Proceedings, Avignon, France, CEUR-WS.org."},{"key":"e_1_3_2_1_39_1","volume-title":"A cross-modal concept detection and caption prediction approach in ImageCLEFcaption track of ImageCLEF","author":"Rahman M.","year":"2017","unstructured":"Rahman , M. , Lagree , T. , Taylor , M. 2017. A cross-modal concept detection and caption prediction approach in ImageCLEFcaption track of ImageCLEF 2017 . Rahman, M., Lagree, T., Taylor, M. 2017. A cross-modal concept detection and caption prediction approach in ImageCLEFcaption track of ImageCLEF 2017."},{"key":"e_1_3_2_1_40_1","unstructured":"Liang S. Li X. Zhu Y. Li X. Jiang S. 2017. ISIA at ImageCLEF 2017 image caption task. Liang S. Li X. Zhu Y. Li X. Jiang S. 2017. ISIA at ImageCLEF 2017 image caption task."},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1093\/jamia\/ocv080"},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-66179-7_37"},{"key":"e_1_3_2_1_43_1","volume-title":"A.","author":"Eickhoff C.","year":"2017","unstructured":"Eickhoff , C. , Schwall , I. , Garc\u00eda Seco de Herrera , A. , and M\u00fcller, H. 2017 . Overview of ImageCLEFcaption 2017 - the Image Caption Prediction and Concept Extraction Tasks to Understand Biomedical Images, CLEF working notes, CEUR. Eickhoff, C., Schwall, I., Garc\u00eda Seco de Herrera, A., and M\u00fcller, H. 2017. Overview of ImageCLEFcaption 2017 - the Image Caption Prediction and Concept Extraction Tasks to Understand Biomedical Images, CLEF working notes, CEUR."},{"key":"e_1_3_2_1_44_1","volume-title":"CLEF2018 Working Notes. CEUR-WS.org","author":"Garcia Seco","year":"2018","unstructured":"Garcia Seco de Herrera, A., Eickhoff , C. , Andrearczyk , V. , M\u00fcller , H. 2018 . Overview of the ImageCLEF 2018 Caption Prediction tasks . In: CLEF2018 Working Notes. CEUR-WS.org , Avignon, France. Garcia Seco de Herrera, A., Eickhoff, C., Andrearczyk, V., M\u00fcller, H. 2018. Overview of the ImageCLEF 2018 Caption Prediction tasks. In: CLEF2018 Working Notes. CEUR-WS.org, Avignon, France."},{"key":"e_1_3_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.3115\/1073083.1073135"},{"key":"e_1_3_2_1_46_1","volume-title":"Rouge: A package for automatic evaluation of summaries, in Text summarization branches out: Proceedings of the ACL-04 workshop","author":"Lin C. Y.","year":"2004","unstructured":"Lin , C. Y. 2004 . Rouge: A package for automatic evaluation of summaries, in Text summarization branches out: Proceedings of the ACL-04 workshop , vol. 8 . Lin, C. Y. 2004. Rouge: A package for automatic evaluation of summaries, in Text summarization branches out: Proceedings of the ACL-04 workshop, vol. 8."},{"key":"e_1_3_2_1_47_1","volume-title":"Proceedings of the ninth Workshop on Statistical Machine Translation, 376--380","author":"Denkowski M.","unstructured":"Denkowski , M. , and Lavie , A . 2014. Meteor universal: Language specific translation evaluation for any target language , in Proceedings of the ninth Workshop on Statistical Machine Translation, 376--380 . Denkowski, M., and Lavie, A. 2014. Meteor universal: Language specific translation evaluation for any target language, in Proceedings of the ninth Workshop on Statistical Machine Translation, 376--380."},{"key":"e_1_3_2_1_48_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 4566--4575","author":"Vedantam R.","unstructured":"Vedantam , R. , Zitnick , C. L. , and Parikh , D . 2015. Cider: Consensus-based image description evaluation , in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 4566--4575 . Vedantam, R., Zitnick, C. L., and Parikh, D. 2015. Cider: Consensus-based image description evaluation, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 4566--4575."},{"key":"e_1_3_2_1_49_1","first-page":"382","article-title":"Spice: Semantic propositional image caption evaluation","volume":"2016","author":"Anderson P.","year":"2016","unstructured":"Anderson , P. , Fernando , B. , Johnson , M. , and Gould , S. 2016 . Spice: Semantic propositional image caption evaluation , in Computer Vision - ECCV 2016 , 382 -- 398 . Anderson, P., Fernando, B., Johnson, M., and Gould, S. 2016. Spice: Semantic propositional image caption evaluation, in Computer Vision - ECCV 2016, 382--398.","journal-title":"Computer Vision - ECCV"},{"key":"e_1_3_2_1_50_1","doi-asserted-by":"crossref","unstructured":"Kilickaya M. Erdem A. Ikizler-Cinbis N. and Erdem E. 2016. Re-evaluating automatic metrics for image captioning arXiv preprint arXiv:1612.07600. Kilickaya M. Erdem A. Ikizler-Cinbis N. and Erdem E. 2016. Re-evaluating automatic metrics for image captioning arXiv preprint arXiv:1612.07600.","DOI":"10.18653\/v1\/E17-1019"}],"event":{"name":"SCA '18: 3rd International Conference on Smart City Applications","location":"Tetouan Morocco","acronym":"SCA '18"},"container-title":["Proceedings of the 3rd International Conference on Smart City Applications"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3286606.3286863","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3286606.3286863","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,4]],"date-time":"2026-04-04T14:08:14Z","timestamp":1775311694000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3286606.3286863"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,10,10]]},"references-count":50,"alternative-id":["10.1145\/3286606.3286863","10.1145\/3286606"],"URL":"https:\/\/doi.org\/10.1145\/3286606.3286863","relation":{},"subject":[],"published":{"date-parts":[[2018,10,10]]},"assertion":[{"value":"2018-10-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}