{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,15]],"date-time":"2026-05-15T06:46:15Z","timestamp":1778827575615,"version":"3.51.4"},"reference-count":34,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2024,2,23]],"date-time":"2024-02-23T00:00:00Z","timestamp":1708646400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,2,23]],"date-time":"2024-02-23T00:00:00Z","timestamp":1708646400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100019581","name":"Munzur University","doi-asserted-by":"crossref","id":[{"id":"10.13039\/100019581","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Int J Multimed Info Retr"],"published-print":{"date-parts":[[2024,3]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Semantic segmentation is the most important stage of making sense of the visual traffic scene for autonomous driving. In recent years, convolutional neural networks (CNN)-based methods for semantic segmentation of urban traffic scenes are among the trending studies. However, the methods developed in the studies carried out so far are insufficient in terms of accuracy performance criteria. In this study, a new CNN-based semantic segmentation method with higher accuracy performance is proposed. A new module, the Attentional Atrous Feature Pooling (AAFP) Module, has been developed for the proposed method. This module is located between the encoder and decoder in the general network structure and aims to obtain multi-scale information and add attentional features to large and small objects. As a result of experimental tests with the CamVid data set, an accuracy value of approximately 2% higher was achieved with a mIoU value of 70.59% compared to other state-of-art methods. Therefore, the proposed method can semantically segment objects in the urban traffic scene better than other methods.<\/jats:p>","DOI":"10.1007\/s13735-023-00313-5","type":"journal-article","created":{"date-parts":[[2024,2,23]],"date-time":"2024-02-23T13:02:29Z","timestamp":1708693349000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":15,"title":["A new CNN-based semantic object segmentation for autonomous vehicles in urban traffic scenes"],"prefix":"10.1007","volume":"13","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-2497-8348","authenticated-orcid":false,"given":"G\u00fcrkan","family":"Do\u011fan","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3244-2615","authenticated-orcid":false,"given":"Burhan","family":"Ergen","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2024,2,23]]},"reference":[{"issue":"8","key":"313_CR1","doi-asserted-by":"publisher","first-page":"5119","DOI":"10.1109\/TIE.2015.2410258","volume":"62","author":"K Jo","year":"2015","unstructured":"Jo K, Kim J, Kim D, Jang C, Sunwoo M (2015) Development of autonomous car - Part II: a case study on the implementation of an autonomous driving system based on distributed architecture. IEEE Trans Industr Electron 62(8):5119\u20135132. https:\/\/doi.org\/10.1109\/TIE.2015.2410258","journal-title":"IEEE Trans Industr Electron"},{"issue":"2","key":"313_CR2","doi-asserted-by":"publisher","first-page":"71","DOI":"10.1007\/s13735-021-00209-2","volume":"10","author":"AM Hafiz","year":"2021","unstructured":"Hafiz AM, Parah SA, Bhat RA (2021) Reinforcement learning applied to machine vision: state of the art. Int J Multimed Inform Retriev 10(2):71\u201382. https:\/\/doi.org\/10.1007\/s13735-021-00209-2","journal-title":"Int J Multimed Inform Retriev"},{"issue":"6","key":"313_CR3","doi-asserted-by":"publisher","first-page":"3258","DOI":"10.1109\/TITS.2020.2980426","volume":"22","author":"G Dong","year":"2021","unstructured":"Dong G, Yan Y, Shen C, Wang H (2021) Real-time high-performance semantic image segmentation of urban street scenes. IEEE Trans Intell Transp Syst 22(6):3258\u20133274. https:\/\/doi.org\/10.1109\/TITS.2020.2980426","journal-title":"IEEE Trans Intell Transp Syst"},{"key":"313_CR4","doi-asserted-by":"publisher","first-page":"50333","DOI":"10.1109\/ACCESS.2018.2868801","volume":"6","author":"L Fan","year":"2018","unstructured":"Fan L, Kong H, Wang WC, Yan J (2018) Semantic segmentation with global encoding and dilated decoder in street scenes. IEEE Access 6:50333\u201350343. https:\/\/doi.org\/10.1109\/ACCESS.2018.2868801","journal-title":"IEEE Access"},{"issue":"1","key":"313_CR5","doi-asserted-by":"publisher","first-page":"580","DOI":"10.1007\/s10489-021-02446-8","volume":"52","author":"X Hu","year":"2022","unstructured":"Hu X, Jing L, Sehar U (2022) Joint pyramid attention network for real-time semantic segmentation of urban scenes. Appl Intell 52(1):580\u2013594. https:\/\/doi.org\/10.1007\/s10489-021-02446-8","journal-title":"Appl Intell"},{"issue":"4","key":"313_CR6","doi-asserted-by":"publisher","first-page":"653","DOI":"10.1007\/s13735-022-00251-8","volume":"11","author":"A Benoughidene","year":"2022","unstructured":"Benoughidene A, Titouna F (2022) A novel method for video shot boundary detection using CNN-LSTM approach. Int J Multimed Inf Retr 11(4):653\u2013667. https:\/\/doi.org\/10.1007\/s13735-022-00251-8","journal-title":"Int J Multimed Inf Retr"},{"issue":"4","key":"313_CR7","doi-asserted-by":"publisher","first-page":"599","DOI":"10.1007\/s13735-022-00248-3","volume":"11","author":"S Cai","year":"2022","unstructured":"Cai S, Wang C, Ding J, Yu J, Fan J (2022) FDAM: full-dimension attention module for deep convolutional neural networks. Int J Multimed Inf Retr 11(4):599\u2013610. https:\/\/doi.org\/10.1007\/s13735-022-00248-3","journal-title":"Int J Multimed Inf Retr"},{"issue":"4","key":"313_CR8","doi-asserted-by":"publisher","first-page":"619","DOI":"10.1007\/s13735-022-00246-5","volume":"11","author":"MJ Parseh","year":"2022","unstructured":"Parseh MJ, Rahmanimanesh M, Keshavarzi P, Azimifar Z (2022) Semantic-aware visual scene representation. Int J Multimed Inf Retr 11(4):619\u2013638. https:\/\/doi.org\/10.1007\/s13735-022-00246-5","journal-title":"Int J Multimed Inf Retr"},{"issue":"3","key":"313_CR9","doi-asserted-by":"publisher","first-page":"315","DOI":"10.1007\/s13735-022-00242-9","volume":"11","author":"AE Ilesanmi","year":"2022","unstructured":"Ilesanmi AE, Ilesanmi T, Idowu OP, Torigian DA, Udupa JK (2022) Organ segmentation from computed tomography images using the 3D convolutional neural network: a systematic review. Int J Multimed Inf Retr 11(3):315\u2013331. https:\/\/doi.org\/10.1007\/s13735-022-00242-9","journal-title":"Int J Multimed Inf Retr"},{"key":"313_CR10","doi-asserted-by":"publisher","unstructured":"Shotton J, Johnson M, and Cipolla R (2008) Semantic texton forests for image categorization and segmentation. In: 26th IEEE Conference on Computer Vision and Pattern Recognition, CVPR. https:\/\/doi.org\/10.1109\/CVPR.2008.4587503","DOI":"10.1109\/CVPR.2008.4587503"},{"issue":"1","key":"313_CR11","doi-asserted-by":"publisher","first-page":"7","DOI":"10.1023\/A:1011174803800","volume":"43","author":"J Malik","year":"2001","unstructured":"Malik J, Belongie S, Leung T, Shi J (2001) Contour and texture analysis for image segmentation. Int J Comput Vis 43(1):7\u201327. https:\/\/doi.org\/10.1023\/A:1011174803800","journal-title":"Int J Comput Vis"},{"issue":"1","key":"313_CR12","doi-asserted-by":"publisher","first-page":"2","DOI":"10.1007\/s11263-007-0109-1","volume":"81","author":"J Shotton","year":"2009","unstructured":"Shotton J, Winn J, Rother C, Criminisi A (2009) TextonBoost for image understanding: multi-class object recognition and segmentation by jointly modeling texture, layout, and context. Int J Comput Vis 81(1):2\u201323. https:\/\/doi.org\/10.1007\/s11263-007-0109-1","journal-title":"Int J Comput Vis"},{"issue":"4","key":"313_CR13","doi-asserted-by":"publisher","first-page":"640","DOI":"10.1109\/TPAMI.2016.2572683","volume":"39","author":"E Shelhamer","year":"2017","unstructured":"Shelhamer E, Long J, Darrell T (2017) Fully convolutional networks for semantic segmentation. IEEE Trans Pattern Anal Mach Intell 39(4):640\u2013651. https:\/\/doi.org\/10.1109\/TPAMI.2016.2572683","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"313_CR14","doi-asserted-by":"publisher","DOI":"10.1049\/cit2.12118","author":"Z Guangzhe","year":"2022","unstructured":"Guangzhe Z, Yimeng Z, Ge M, Min Y (2022) Bilateral U-Net semantic segmentation with spatial attention mechanism. CAAI Trans Intell Technol. https:\/\/doi.org\/10.1049\/cit2.12118","journal-title":"CAAI Trans Intell Technol"},{"issue":"12","key":"313_CR15","doi-asserted-by":"publisher","first-page":"2481","DOI":"10.1109\/TPAMI.2016.2644615","volume":"39","author":"V Badrinarayanan","year":"2017","unstructured":"Badrinarayanan V, Kendall A, Cipolla R (2017) SegNet: a deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans Pattern Anal Mach Intell 39(12):2481\u20132495. https:\/\/doi.org\/10.1109\/TPAMI.2016.2644615","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"313_CR16","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-24574-4_28","author":"O Ronneberger","year":"2015","unstructured":"Ronneberger O, Fischer P, Brox T (2015) U-Net: convolutional networks for biomedical image segmentation. IEEE Access. https:\/\/doi.org\/10.1007\/978-3-319-24574-4_28","journal-title":"IEEE Access"},{"key":"313_CR17","doi-asserted-by":"publisher","unstructured":"Chen LC, Zhu Y, Papandreou G, Schroff F, and Adam H (2018) Encoder-decoder with atrous separable convolution for semantic image segmentation. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol 11211 LNCS, pp 833\u2013851, https:\/\/doi.org\/10.1007\/978-3-030-01234-2_49","DOI":"10.1007\/978-3-030-01234-2_49"},{"key":"313_CR18","doi-asserted-by":"publisher","unstructured":"Peng C, Zhang X, Yu G, Luo G, and Sun J (2017) Large kernel matters - Improve semantic segmentation by global convolutional network. In: Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, vol 2017-Janua, pp 1743\u20131751. https:\/\/doi.org\/10.1109\/CVPR.2017.189.","DOI":"10.1109\/CVPR.2017.189"},{"key":"313_CR19","doi-asserted-by":"publisher","unstructured":"Wang P et al (2018) Understanding Convolution for Semantic Segmentation. In: Proceedings - 2018 IEEE Winter Conference on Applications of Computer Vision, WACV 2018, vol 2018-Janua, pp 1451\u20131460. https:\/\/doi.org\/10.1109\/WACV.2018.00163","DOI":"10.1109\/WACV.2018.00163"},{"key":"313_CR20","doi-asserted-by":"publisher","unstructured":"Zhao H, Shi J, Qi X, Wang X, and Jia J (2017) Pyramid scene parsing network. In: Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, vol 2017-Janua, pp 6230\u20136239. https:\/\/doi.org\/10.1109\/CVPR.2017.660","DOI":"10.1109\/CVPR.2017.660"},{"key":"313_CR21","doi-asserted-by":"publisher","unstructured":"Lin G, Milan A, Shen C, and Reid I (2017) RefineNet: Multi-path refinement networks for high-resolution semantic segmentation. In: Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, vol. 2017-Janua, pp. 5168\u20135177. https:\/\/doi.org\/10.1109\/CVPR.2017.549","DOI":"10.1109\/CVPR.2017.549"},{"issue":"2","key":"313_CR22","doi-asserted-by":"publisher","first-page":"88","DOI":"10.1016\/j.patrec.2008.04.005","volume":"30","author":"GJ Brostow","year":"2009","unstructured":"Brostow GJ, Fauqueur J, Cipolla R (2009) Semantic object classes in video: a high-definition ground truth database. Pattern Recognit Lett 30(2):88\u201397. https:\/\/doi.org\/10.1016\/j.patrec.2008.04.005","journal-title":"Pattern Recognit Lett"},{"key":"313_CR23","doi-asserted-by":"publisher","unstructured":"He K, Zhang X, Ren S, and Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol 2016-Decem, pp 770\u2013778. https:\/\/doi.org\/10.1109\/CVPR.2016.90","DOI":"10.1109\/CVPR.2016.90"},{"key":"313_CR24","unstructured":"Kingma DP and Ba JL (2015) Adam: a method for stochastic optimization. In: 3rd International Conference on Learning Representations, ICLR 2015 - Conference Track Proceedings, pp 1\u201315"},{"key":"313_CR25","unstructured":"Keras, \u201cReduceLROnPlateau. (2022)"},{"issue":"3","key":"313_CR26","doi-asserted-by":"publisher","first-page":"297","DOI":"10.2307\/1932409","volume":"26","author":"LR Dice","year":"1945","unstructured":"Dice LR (1945) Measures of the amount of ecologic association between species. Ecology 26(3):297\u2013302. https:\/\/doi.org\/10.2307\/1932409","journal-title":"Ecology"},{"key":"313_CR27","doi-asserted-by":"publisher","unstructured":"Milletari F, Navab N, and Ahmadi SA (2016) V-Net: Fully convolutional neural networks for volumetric medical image segmentation. In: Proceedings - 2016 4th International Conference on 3D Vision, 3DV 2016, pp 565\u2013571. https:\/\/doi.org\/10.1109\/3DV.2016.79","DOI":"10.1109\/3DV.2016.79"},{"key":"313_CR28","doi-asserted-by":"publisher","first-page":"111119","DOI":"10.1016\/j.measurement.2022.111119","volume":"195","author":"G Do\u011fan","year":"2022","unstructured":"Do\u011fan G, Ergen B (2022) A new mobile convolutional neural network-based approach for pixel-wise road surface crack detection. Measurement 195:111119. https:\/\/doi.org\/10.1016\/j.measurement.2022.111119","journal-title":"Measurement"},{"key":"313_CR29","unstructured":"Paszke A, Chaurasia A, Kim S, Culurciello E (2016) ENet: a deep neural network architecture for real-time semantic segmentation. Comput Vis Pattern Recog. arXiv:1606.02147"},{"issue":"2","key":"313_CR30","doi-asserted-by":"publisher","first-page":"1183","DOI":"10.1109\/TII.2018.2849348","volume":"15","author":"X Zhang","year":"2019","unstructured":"Zhang X, Chen Z, Jonathan Wu QM, Cai L, Lu D, Li X (2019) Fast semantic segmentation for scene perception. IEEE Trans Ind Inform 15(2):1183\u20131192. https:\/\/doi.org\/10.1109\/TII.2018.2849348","journal-title":"IEEE Trans Ind Inform"},{"issue":"1","key":"313_CR31","doi-asserted-by":"publisher","first-page":"263","DOI":"10.1109\/TITS.2017.2750080","volume":"19","author":"E Romera","year":"2018","unstructured":"Romera E, Alvarez JM, Bergasa LM, Arroyo R (2018) ERFNet: efficient residual factorized ConvNet for real-time semantic segmentation. IEEE Trans Intell Transp Syst 19(1):263\u2013272. https:\/\/doi.org\/10.1109\/TITS.2017.2750080","journal-title":"IEEE Trans Intell Transp Syst"},{"key":"313_CR32","doi-asserted-by":"publisher","unstructured":"Li H, Xiong P, Fan H, and Sun J (2019) DFANet: Deep feature aggregation for real-time semantic segmentation. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol 2019-Jun, pp 9514\u20139523. https:\/\/doi.org\/10.1109\/CVPR.2019.00975","DOI":"10.1109\/CVPR.2019.00975"},{"key":"313_CR33","doi-asserted-by":"publisher","unstructured":"Orsic M, Kreso I, Bevandic P, and Segvic S (2019) In defense of pre-trained imagenet architectures for real-time semantic segmentation of road-driving images. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol 2019-Jun, pp 12599\u201312608. https:\/\/doi.org\/10.1109\/CVPR.2019.01289","DOI":"10.1109\/CVPR.2019.01289"},{"key":"313_CR34","doi-asserted-by":"publisher","unstructured":"Zhao H, Qi X, Shen X, Shi J, and Jia J (2018) ICNet for Real-time semantic segmentation on high-resolution images. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol 11207 LNCS, pp 418\u2013434. https:\/\/doi.org\/10.1007\/978-3-030-01219-9_25","DOI":"10.1007\/978-3-030-01219-9_25"}],"container-title":["International Journal of Multimedia Information Retrieval"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s13735-023-00313-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s13735-023-00313-5\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s13735-023-00313-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,3,28]],"date-time":"2024-03-28T11:16:35Z","timestamp":1711624595000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s13735-023-00313-5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,2,23]]},"references-count":34,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2024,3]]}},"alternative-id":["313"],"URL":"https:\/\/doi.org\/10.1007\/s13735-023-00313-5","relation":{},"ISSN":["2192-6611","2192-662X"],"issn-type":[{"value":"2192-6611","type":"print"},{"value":"2192-662X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,2,23]]},"assertion":[{"value":"1 January 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"25 September 2023","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"29 November 2023","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"23 February 2024","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}},{"value":"This article does not contain any data, or other information from studies or experimentation, with the involvement of human or animal subjects.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethical approval"}}],"article-number":"11"}}