{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,20]],"date-time":"2026-03-20T18:03:41Z","timestamp":1774029821148,"version":"3.50.1"},"reference-count":51,"publisher":"Springer Science and Business Media LLC","issue":"5","license":[{"start":{"date-parts":[[2023,3,30]],"date-time":"2023-03-30T00:00:00Z","timestamp":1680134400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,3,30]],"date-time":"2023-03-30T00:00:00Z","timestamp":1680134400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"National Innovation and Development Project of Industrial Internet","award":["TC190H3WR"],"award-info":[{"award-number":["TC190H3WR"]}]},{"name":"2021 Open Project of Jiangsu Provincial Key Laboratory of Advanced Robotics","award":["KJS2160"],"award-info":[{"award-number":["KJS2160"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Complex Intell. Syst."],"published-print":{"date-parts":[[2023,10]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Most existing visual simultaneous localization and mapping (SLAM) algorithms rely heavily on the static world assumption. Combined with deep learning, semantic SLAM has become a popular solution for dynamic scenes. However, most semantic SLAM methods show poor real-time performance when dealing with dynamic scenes. To handle this problem, a real-time semantic SLAM method is proposed in this paper, combining knowledge distillation and dynamic probability propagation strategy. First, to improve the execution speed, a multi-level knowledge distillation method is adopted to obtain a lightweight segmentation model, which is more suitable for continuous frames to create an independent semantic segmentation thread. This segmentation thread only accepts keyframes as input so that the system can avoid time delay caused by processing each frame. Second, a static semantic keyframe selection strategy is proposed based on the segmentation results. In this way, those keyframes containing more static information will be selected to reduce the participation of dynamic objects. By combining segmentation results and data matching algorithm, our system can realize the update and propagation of dynamic probability, reducing the influence of dynamic points in the pose optimization process. Validation results based on the KITTI and TUM datasets show that our method can effectively deal with dynamic feature points and improve running speed simultaneously.<\/jats:p>","DOI":"10.1007\/s40747-023-01031-5","type":"journal-article","created":{"date-parts":[[2023,3,30]],"date-time":"2023-03-30T09:02:58Z","timestamp":1680166978000},"page":"5653-5677","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":23,"title":["A real-time semantic visual SLAM for dynamic environment based on deep learning and dynamic probabilistic propagation"],"prefix":"10.1007","volume":"9","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6598-1036","authenticated-orcid":false,"given":"Liang","family":"Chen","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9620-0186","authenticated-orcid":false,"given":"Zhi","family":"Ling","sequence":"additional","affiliation":[]},{"given":"Yu","family":"Gao","sequence":"additional","affiliation":[]},{"given":"Rongchuan","family":"Sun","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9532-4272","authenticated-orcid":false,"given":"Sheng","family":"Jin","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,3,30]]},"reference":[{"key":"1031_CR1","doi-asserted-by":"publisher","DOI":"10.1186\/s41074-017-0027-2","author":"T Taketomi","year":"2017","unstructured":"Taketomi T, Uchiyama H, Ikeda S (2017) Visual SLAM algorithms: a survey from 2010 to 2016. IPSJ Trans Comput Vis Appl. https:\/\/doi.org\/10.1186\/s41074-017-0027-2","journal-title":"IPSJ Trans Comput Vis Appl"},{"key":"1031_CR2","doi-asserted-by":"publisher","DOI":"10.1007\/s40747-021-00323-y","author":"X Dong","year":"2021","unstructured":"Dong X, Cheng L, Peng H, Li T (2021) FSD-SLAM: a fast semi-direct SLAM algorithm. Complex Intell Syst. https:\/\/doi.org\/10.1007\/s40747-021-00323-y","journal-title":"Complex Intell Syst"},{"key":"1031_CR3","doi-asserted-by":"publisher","first-page":"1309","DOI":"10.1109\/TRO.2016.2624754","volume":"32","author":"C Cadena","year":"2016","unstructured":"Cadena C, Carlone L, Carrillo H et al (2016) Past, present, and future of simultaneous localization and mapping: toward the robust-perception age. IEEE T Robot 32:1309\u20131332. https:\/\/doi.org\/10.1109\/TRO.2016.2624754","journal-title":"IEEE T Robot"},{"key":"1031_CR4","doi-asserted-by":"publisher","first-page":"4076","DOI":"10.1109\/LRA.2018.2860039","volume":"3","author":"B Bescos","year":"2018","unstructured":"Bescos B, F\u00e1cil JM, Civera J et al (2018) DynaSLAM: tracking, mapping, and inpainting in dynamic scenes. IEEE Robot Autom Lett 3:4076\u20134083. https:\/\/doi.org\/10.1109\/LRA.2018.2860039","journal-title":"IEEE Robot Autom Lett"},{"key":"1031_CR5","doi-asserted-by":"publisher","DOI":"10.1155\/2021\/5538840","author":"X Zhao","year":"2021","unstructured":"Zhao X, Zuo T, Hu X (2021) OFM-SLAM: a visual semantic SLAM for dynamic indoor environments. Math Porbl Eng. https:\/\/doi.org\/10.1155\/2021\/5538840","journal-title":"Math Porbl Eng"},{"key":"1031_CR6","doi-asserted-by":"publisher","first-page":"106153","DOI":"10.1016\/j.asoc.2020.106153","volume":"90","author":"S Jin","year":"2020","unstructured":"Jin S, Chen L, Sun R et al (2020) A novel vSLAM framework with unsupervised semantic segmentation based on adversarial transfer learning. Appl Soft Comput 90:106153. https:\/\/doi.org\/10.1016\/j.asoc.2020.106153","journal-title":"Appl Soft Comput"},{"key":"1031_CR7","doi-asserted-by":"publisher","first-page":"214685","DOI":"10.1109\/ACCESS.2020.3041038","volume":"8","author":"X Long","year":"2020","unstructured":"Long X, Zhang W, Zhao B (2020) PSPNet-SLAM: a semantic SLAM detect dynamic object by pyramid scene parsing network. IEEE Access 8:214685\u2013214695. https:\/\/doi.org\/10.1109\/ACCESS.2020.3041038","journal-title":"IEEE Access"},{"key":"1031_CR8","doi-asserted-by":"publisher","unstructured":"He K, Gkioxari G, Doll\u00e1r P et al (2017) Mask r-cnn. In: IEEE international conference on computer vision (ICCV). IEEE, pp 2961\u20132969. https:\/\/doi.org\/10.1109\/ICCV.2017.322","DOI":"10.1109\/ICCV.2017.322"},{"key":"1031_CR9","doi-asserted-by":"publisher","first-page":"834","DOI":"10.1109\/TPAMI.2017.2699184","volume":"40","author":"LC Chen","year":"2018","unstructured":"Chen LC, Papandreou G, Kokkinos I et al (2018) Deeplab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE Trans Pattern Anal Mach Intell 40:834\u2013848. https:\/\/doi.org\/10.1109\/TPAMI.2017.2699184","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"1031_CR10","doi-asserted-by":"publisher","unstructured":"Zhao H, Shi J, Qi X et al (2017) Pyramid scene parsing network. In: IEEE conference on computer vision and pattern recognition (CVPR). IEEE, pp 2881\u20132890. https:\/\/doi.org\/10.1109\/CVPR.2017.660","DOI":"10.1109\/CVPR.2017.660"},{"key":"1031_CR11","doi-asserted-by":"publisher","unstructured":"Yu C, Liu Z, Liu X J et al (2018) DS-SLAM: A semantic visual SLAM towards dynamic environments. In: IEEE\/RSJ International conference on intelligent robots and systems (IROS). IEEE, pp 1168\u20131174. https:\/\/doi.org\/10.1109\/IROS.2018.8593691","DOI":"10.1109\/IROS.2018.8593691"},{"key":"1031_CR12","doi-asserted-by":"publisher","first-page":"166528","DOI":"10.1109\/ACCESS.2019.2952161","volume":"7","author":"L Cui","year":"2019","unstructured":"Cui L, Ma C (2019) SOF-SLAM: a semantic visual SLAM for dynamic environments. IEEE Access 7:166528\u2013166539. https:\/\/doi.org\/10.1109\/ACCESS.2019.2952161","journal-title":"IEEE Access"},{"key":"1031_CR13","doi-asserted-by":"publisher","unstructured":"Xu H, Yang C, Feng Y (2019) FD-SLAM: real-time tracking and mapping in dynamic environments. In: IEEE international conference on unmanned systems and artificial intelligence (ICUSAI). IEEE, pp 166\u2013171. https:\/\/doi.org\/10.1109\/ICUSAI47366.2019.9124850","DOI":"10.1109\/ICUSAI47366.2019.9124850"},{"key":"1031_CR14","doi-asserted-by":"publisher","first-page":"2481","DOI":"10.1109\/TPAMI.2016.2644615","volume":"39","author":"V Badrinarayanan","year":"2017","unstructured":"Badrinarayanan V, Kendall A, Cipolla R (2017) Segnet: a deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans Pattern Anal Mach Intell 39:2481\u20132495. https:\/\/doi.org\/10.1109\/TPAMI.2016.2644615","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"1031_CR15","doi-asserted-by":"publisher","first-page":"202","DOI":"10.3390\/ijgi9040202","volume":"9","author":"J Cheng","year":"2020","unstructured":"Cheng J, Wang Z, Zhou H et al (2020) DM-SLAM: a feature-based SLAM system for rigid dynamic scenes. ISPRS Int J Geo Inf 9:202. https:\/\/doi.org\/10.3390\/ijgi9040202","journal-title":"ISPRS Int J Geo Inf"},{"key":"1031_CR16","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9780511811685","volume-title":"Multiple view geometry in computer vision","author":"R Hartley","year":"2004","unstructured":"Hartley R, Zisserman A (2004) Multiple view geometry in computer vision. Cambridge, New York"},{"key":"1031_CR17","doi-asserted-by":"publisher","unstructured":"Liu PR, Meng MQH, Liu PX et al (2006) Optical flow and active contour for moving object segmentation and detection in monocular robot. In: IEEE International conference on robotics and automation (ICRA). IEEE, pp 4075\u20134080. https:\/\/doi.org\/10.1109\/ROBOT.2006.1642328","DOI":"10.1109\/ROBOT.2006.1642328"},{"key":"1031_CR18","doi-asserted-by":"publisher","unstructured":"Zinbi Y, Chahir Y, Elmoataz A (2008) Moving object segmentation using optical flow with active contour model. In: International conference on information and communication technologies: from theory to applications (ICTTA). IEEE, pp 1\u20135. https:\/\/doi.org\/10.1109\/ICTTA.2008.4530112","DOI":"10.1109\/ICTTA.2008.4530112"},{"key":"1031_CR19","doi-asserted-by":"publisher","first-page":"1874","DOI":"10.1109\/TRO.2021.3075644","volume":"37","author":"C Campos","year":"2021","unstructured":"Campos C, Elvira R, Rodr\u00edguez JJG et al (2021) ORB-SLAM3: an accurate open-source library for visual, visual\u2013inertial, and multimap slam. IEEE Trans Robot 37:1874\u20131890. https:\/\/doi.org\/10.1109\/TRO.2021.3075644","journal-title":"IEEE Trans Robot"},{"key":"1031_CR20","unstructured":"Hinton G, Vinyals O, Dean J (2015) Distilling the knowledge in a neural network. Preprint at arXiv:1503.02531"},{"key":"1031_CR21","unstructured":"Xie J, Shuai B, Hu JF et al (2018) Improving fast segmentation with teacher-student learning. Preprint at arXiv:1810.08476."},{"key":"1031_CR22","doi-asserted-by":"publisher","unstructured":"Hou Y, Ma Z, Liu C et al (2020) Inter-region affinity distillation for road marking segmentation. In: IEEE conference on computer vision and pattern recognition (CVPR). IEEE, pp 12486\u201312495. https:\/\/doi.org\/10.1109\/CVPR42600.2020.01250","DOI":"10.1109\/CVPR42600.2020.01250"},{"key":"1031_CR23","doi-asserted-by":"publisher","unstructured":"He T, Shen C, Tian Z et al (2019) Knowledge adaptation for efficient semantic segmentation. In: IEEE conference on computer vision and pattern recognition (CVPR). IEEE, pp 578\u2013587. https:\/\/doi.org\/10.1109\/CVPR42600.2020.01250","DOI":"10.1109\/CVPR42600.2020.01250"},{"key":"1031_CR24","doi-asserted-by":"publisher","first-page":"4616","DOI":"10.3390\/s20164616","volume":"20","author":"S Park","year":"2020","unstructured":"Park S, Heo YS (2020) Knowledge distillation for semantic segmentation using channel and spatial correlations and adaptive cross entropy. Sensors 20:4616. https:\/\/doi.org\/10.3390\/s20164616","journal-title":"Sensors"},{"key":"1031_CR25","doi-asserted-by":"publisher","unstructured":"Gadde R, Jampani V, Gehler PV (2017) Semantic video cnns through representation warping. In: IEEE conference on computer vision and pattern recognition (CVPR). IEEE, pp 4453\u20134462. https:\/\/doi.org\/10.1109\/ICCV.2017.477","DOI":"10.1109\/ICCV.2017.477"},{"key":"1031_CR26","doi-asserted-by":"publisher","unstructured":"Zhu Yi, Sapra K, Reda FA et al (2019) Improving semantic segmentation via video propagation and label relaxation. In: IEEE conference on computer vision and pattern recognition (CVPR). IEEE, pp 8856\u20138865. https:\/\/doi.org\/10.1109\/CVPR.2019.00906","DOI":"10.1109\/CVPR.2019.00906"},{"key":"1031_CR27","doi-asserted-by":"publisher","first-page":"1255","DOI":"10.1109\/TRO.2017.2705103","volume":"33","author":"R Mur-Artal","year":"2017","unstructured":"Mur-Artal R, Tard\u00f3s JD (2017) ORB-SLAM2: an open-source slam system for monocular, stereo, and rgb-d cameras. IEEE Trans Robot 33:1255\u20131262. https:\/\/doi.org\/10.1109\/TRO.2017.2705103","journal-title":"IEEE Trans Robot"},{"key":"1031_CR28","doi-asserted-by":"publisher","first-page":"162335","DOI":"10.1109\/ACCESS.2020.2991441","volume":"8","author":"Y Ai","year":"2020","unstructured":"Ai Y, Rui T, Lu M et al (2020) DDL-SLAM: a robust RGB-D SLAM in dynamic environments combined with deep learning. IEEE Access 8:162335\u2013162342. https:\/\/doi.org\/10.1109\/ACCESS.2020.2991441","journal-title":"IEEE Access"},{"key":"1031_CR29","doi-asserted-by":"publisher","first-page":"149","DOI":"10.1016\/j.knosys.2019.04.025","volume":"178","author":"Q Jin","year":"2019","unstructured":"Jin Q, Meng Z, Pham TD et al (2019) DUNet: a deformable network for retinal vessel segmentation. Knowl Based Syst 178:149\u2013162. https:\/\/doi.org\/10.1016\/j.knosys.2019.04.025","journal-title":"Knowl Based Syst"},{"key":"1031_CR30","doi-asserted-by":"publisher","unstructured":"Chen X, Xue J, Fang J et al (2020) Using detection, tracking and prediction in visual SLAM to achieve real-time semantic mapping of dynamic scenarios. In: IEEE intelligent vehicles symposium. IEEE, pp 666\u2013671. https:\/\/doi.org\/10.1109\/IV47402.2020.9304693","DOI":"10.1109\/IV47402.2020.9304693"},{"key":"1031_CR31","doi-asserted-by":"crossref","unstructured":"Liu W, Anguelov D, Erhan D et al (2016) SSD: single shot multibox detector. In: European conference on computer vision. Springer, pp 21\u201337","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"1031_CR32","doi-asserted-by":"publisher","unstructured":"Zhong F, Wang S, Zhang Z et al (2018) Detect-SLAM: making object detection and SLAM mutually beneficial. In: IEEE winter conference on applications of computer vision (WACV). IEEE, pp 1001\u20131010. https:\/\/doi.org\/10.1109\/WACV.2018.00115","DOI":"10.1109\/WACV.2018.00115"},{"key":"1031_CR33","doi-asserted-by":"publisher","unstructured":"Elvira R, Tardos JD, Montiel JMM (2019) ORBSLAM-Atlas: A robust and accurate multi-map system. In: IEEE\/RSJ international conference on intelligent robots and systems (IROS). IEEE, pp 6253\u20136259. https:\/\/doi.org\/10.1109\/IROS40897.2019.8967572","DOI":"10.1109\/IROS40897.2019.8967572"},{"key":"1031_CR34","doi-asserted-by":"crossref","unstructured":"Liu Y, Shen C, Yu C et al (2020) Efficient semantic video segmentation with per-frame inference. In: European conference on computer vision. Springer, pp 352\u2013368","DOI":"10.1007\/978-3-030-58607-2_21"},{"key":"1031_CR35","unstructured":"Nair V, Hinton GE (2010) Rectified linear units improve restricted boltzmann machines. In: International conference on international conference on machine learning (ICML). ACM, pp 807\u2013814"},{"key":"1031_CR36","doi-asserted-by":"publisher","unstructured":"Sturm J, Engelhard N, Endres F et al (2012) A benchmark for the evaluation of RGB-D SLAM systems. In: IEEE\/RSJ international conference on intelligent robots and systems (IROS). IEEE, pp 573\u2013580. https:\/\/doi.org\/10.1109\/IROS.2012.6385773","DOI":"10.1109\/IROS.2012.6385773"},{"key":"1031_CR37","doi-asserted-by":"publisher","unstructured":"Geiger A, Lenz P, Urtasun R (2012) Are we ready for autonomous driving? The KITTI vision benchmark suite. In: IEEE conference on computer vision and pattern recognition. IEEE, pp 3354\u20133361. https:\/\/doi.org\/10.1109\/CVPR.2012.6248074","DOI":"10.1109\/CVPR.2012.6248074"},{"key":"1031_CR38","doi-asserted-by":"publisher","unstructured":"Wang L, Li D, Zhu Y et al (2020) Dual super-resolution learning for semantic segmentation. In: IEEE\/CVF conference on computer vision and pattern recognition (CVPR). IEEE, pp 3773\u20133782. https:\/\/doi.org\/10.1109\/CVPR42600.2020.00383","DOI":"10.1109\/CVPR42600.2020.00383"},{"key":"1031_CR39","doi-asserted-by":"publisher","unstructured":"T\u00fcrkmen S, Heikkil\u00e4 J (2019) An efficient solution for semantic segmentation: Shufflenet v2 with atrous separable convolutions. In: Scandinavian conference on image analysis. Springer, pp 41\u201353. https:\/\/doi.org\/10.1007\/978-3-030-20205-7_4","DOI":"10.1007\/978-3-030-20205-7_4"},{"key":"1031_CR40","doi-asserted-by":"publisher","first-page":"1169","DOI":"10.1109\/TIP.2020.3042065","volume":"30","author":"T Wu","year":"2020","unstructured":"Wu T, Tang S, Zhang R et al (2020) Cgnet: a light-weight context guided network for semantic segmentation. IEEE Trans Image Process 30:1169\u20131179. https:\/\/doi.org\/10.1109\/TIP.2020.3042065","journal-title":"IEEE Trans Image Process"},{"key":"1031_CR41","doi-asserted-by":"publisher","unstructured":"Zhao H, Qi X, Shen X et al (2018) Icnet for real-time semantic segmentation on high-resolution images. In: European conference on computer vision (ECCV). Springer pp, 418\u2013434. https:\/\/doi.org\/10.1007\/978-3-030-01219-9_25","DOI":"10.1007\/978-3-030-01219-9_25"},{"key":"1031_CR42","doi-asserted-by":"publisher","unstructured":"Mehta S, Rastegari M, Caspi A et al (2018) Espnet: efficient spatial pyramid of dilated convolutions for semantic segmentation. In: European conference on computer vision (ECCV). Springer, pp 561\u2013580. https:\/\/doi.org\/10.1007\/978-3-030-01249-6_34","DOI":"10.1007\/978-3-030-01249-6_34"},{"key":"1031_CR43","doi-asserted-by":"publisher","unstructured":"Cordts M, Omran M, Ramos S et al (2016) The cityscapes dataset for semantic urban scene understanding. In: IEEE conference on computer vision and pattern recognition (CVPR). IEEE, pp 3213\u20133223. https:\/\/doi.org\/10.1109\/CVPR.2016.350","DOI":"10.1109\/CVPR.2016.350"},{"key":"1031_CR44","doi-asserted-by":"publisher","unstructured":"Wang Y, Zhou W, Jiang T et al (2020) Intra-class feature variation distillation for semantic segmentation. In: European conference on computer vision (ECCV). Springer, pp 346\u2013362. https:\/\/doi.org\/10.1007\/978-3-030-58571-6_21","DOI":"10.1007\/978-3-030-58571-6_21"},{"key":"1031_CR45","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2020.3001940","author":"Y Liu","year":"2020","unstructured":"Liu Y, Shu C, Wang J et al (2020) Structured knowledge distillation for dense prediction. IEEE Trans Pattern Anal. https:\/\/doi.org\/10.1109\/TPAMI.2020.3001940","journal-title":"IEEE Trans Pattern Anal"},{"key":"1031_CR46","doi-asserted-by":"publisher","unstructured":"Shu C, Liu Y, Gao J et al. Channel-wise distillation for semantic segmentation. In: IEEE\/CVF international conference on computer vision (ICCV). IEEE, pp 5291\u20135300. https:\/\/doi.org\/10.1109\/ICCV48922.2021.00526","DOI":"10.1109\/ICCV48922.2021.00526"},{"key":"1031_CR47","doi-asserted-by":"publisher","DOI":"10.1109\/JSEN.2021.3051691","author":"J Jiao","year":"2021","unstructured":"Jiao J, Wang C, Li N et al (2021) An adaptive visual dynamic-SLAM method based on fusing the semantic information. IEEE Sens J. https:\/\/doi.org\/10.1109\/JSEN.2021.3051691","journal-title":"IEEE Sens J"},{"key":"1031_CR48","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.robot.2019.03.012","volume":"117","author":"L Xiao","year":"2019","unstructured":"Xiao L, Wang J, Qiu X et al (2019) Dynamic-SLAM: semantic monocular visual localization and mapping based on deep learning in dynamic environment. Robot Auton Syst 117:1\u201316. https:\/\/doi.org\/10.1016\/j.robot.2019.03.012","journal-title":"Robot Auton Syst"},{"key":"1031_CR49","doi-asserted-by":"publisher","unstructured":"Zhang Z, Zhang J, Tang Q (2019) Mask R-CNN based semantic RGB-D SLAM for dynamic scenes. In: IEEE\/ASME international conference on advanced intelligent mechatronics (AIM). IEEE, pp 1151\u20131156. https:\/\/doi.org\/10.1109\/AIM.2019.8868400","DOI":"10.1109\/AIM.2019.8868400"},{"key":"1031_CR50","doi-asserted-by":"publisher","unstructured":"Xu H, Zhang S, Liu P (2019) DOS-SLAM: A real-time dynamic object segmentation visual SLAM system. In: The 2019 2nd international conference on algorithms, computing and artificial intelligence. ACM, pp 85\u201390. https:\/\/doi.org\/10.1145\/3377713.3377731","DOI":"10.1145\/3377713.3377731"},{"key":"1031_CR51","doi-asserted-by":"publisher","unstructured":"He K, Gkioxari G, Dollar P, Girshick R (2017) Mask R-CNN. In: IEEE international conference on computer vision (ICCV). IEEE, pp 2980\u20132988. https:\/\/doi.org\/10.1109\/ICCV.2017.322","DOI":"10.1109\/ICCV.2017.322"}],"container-title":["Complex &amp; Intelligent Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-023-01031-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s40747-023-01031-5\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-023-01031-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,9,22]],"date-time":"2023-09-22T17:24:21Z","timestamp":1695403461000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s40747-023-01031-5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,3,30]]},"references-count":51,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2023,10]]}},"alternative-id":["1031"],"URL":"https:\/\/doi.org\/10.1007\/s40747-023-01031-5","relation":{},"ISSN":["2199-4536","2198-6053"],"issn-type":[{"value":"2199-4536","type":"print"},{"value":"2198-6053","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,3,30]]},"assertion":[{"value":"29 September 2022","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"3 March 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"30 March 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}},{"value":"This article does not contain any studies with human participants or animals performed by any of the authors.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval"}},{"value":"Not applicable.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent to participate"}},{"value":"Not applicable.","order":5,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}}]}}