{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,8]],"date-time":"2025-09-08T05:49:43Z","timestamp":1757310583295,"version":"3.37.3"},"reference-count":47,"publisher":"Springer Science and Business Media LLC","issue":"6","license":[{"start":{"date-parts":[[2022,4,22]],"date-time":"2022-04-22T00:00:00Z","timestamp":1650585600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,4,22]],"date-time":"2022-04-22T00:00:00Z","timestamp":1650585600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61988101"],"award-info":[{"award-number":["61988101"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"National Natural Science Fund for Distinguished Young Scholars","award":["61725301","61925305"],"award-info":[{"award-number":["61725301","61925305"]}]},{"name":"Programme of Introducing Talents of Discipline to Universities","award":["B17017"],"award-info":[{"award-number":["B17017"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Complex Intell. Syst."],"published-print":{"date-parts":[[2022,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Pedestrian detection has been researched for decades. Recently, an anchor-free method CSP is proposed to generate the pedestrian bounding box directly. When the predicted center deviates from the ground truth in the testing phase, the CSP model generates deviated pedestrian bounding box, which leads to false detection in occlusion situations. To handle this problem, we refine the scale regression branch of the CSP model to generate a more accurate prediction. The new scale regression branch outputs the distances between the center and the four edges of the pedestrian bounding box. Even if the predicted center deviates from the ground truth, an accurate bounding box can still be obtained. Moreover, we integrate a self-attention module into our model to take full advantage of the features in different depth layers. Our proposed model achieves better performance than the state-of-the-art detectors in comparison experiments on the two datasets, i.e., Citypersons and Caltech.<\/jats:p>","DOI":"10.1007\/s40747-022-00728-3","type":"journal-article","created":{"date-parts":[[2022,4,22]],"date-time":"2022-04-22T08:03:01Z","timestamp":1650614581000},"page":"4797-4809","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":4,"title":["Self-attention-guided scale-refined detector for pedestrian detection"],"prefix":"10.1007","volume":"8","author":[{"given":"Xinchen","family":"Lin","sequence":"first","affiliation":[]},{"given":"Chaoqiang","family":"Zhao","sequence":"additional","affiliation":[]},{"given":"Chen","family":"Zhang","sequence":"additional","affiliation":[]},{"given":"Feng","family":"Qian","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2022,4,22]]},"reference":[{"key":"728_CR1","unstructured":"Nam W, Doll\u00e1r P, Han JH (2014) Local decorrelation for improved pedestrian detection. In: Proceedings of Advances in neural information processing systems, pp 424\u2013432"},{"issue":"8","key":"728_CR2","doi-asserted-by":"publisher","first-page":"6469","DOI":"10.1109\/JIOT.2020.3043716","volume":"8","author":"L Liu","year":"2020","unstructured":"Liu L, Lu S, Zhong R, Wu B, Yao Y, Zhang Q, Shi W (2020) Computing systems for autonomous driving: state of the art and challenges. IEEE Internet Things J 8(8):6469\u20136486","journal-title":"IEEE Internet Things J"},{"key":"728_CR3","doi-asserted-by":"crossref","unstructured":"Ghanem S, Kanungo P, Panda G, Satapathy SC, Sharma R (2021) Lane detection under artificial colored light in tunnels and on highways: an iot-based framework for smart city infrastructure. Complex Intell Syst","DOI":"10.1007\/s40747-021-00381-2"},{"issue":"1","key":"728_CR4","doi-asserted-by":"publisher","first-page":"79","DOI":"10.1007\/s12652-016-0369-0","volume":"8","author":"D Varga","year":"2017","unstructured":"Varga D, Szir\u00e1nyi T (2017) Robust real-time pedestrian detection in surveillance videos. J Ambient Intell Humaniz Comput 8(1):79\u201385","journal-title":"J Ambient Intell Humaniz Comput"},{"key":"728_CR5","doi-asserted-by":"crossref","unstructured":"Han Q, Yin Q, Zheng X, Chen Z (2021) Remote sensing image building detection method based on mask r-cnn. Complex & Intelligent Systems","DOI":"10.1007\/s40747-021-00322-z"},{"key":"728_CR6","doi-asserted-by":"crossref","unstructured":"Khan MA, Kadry S, Parwekar P, Damasevicius R, Naqvi SR (2021) Human gait analysis for osteoarthritis prediction: a framework of deep learning and kernel extreme learning machine. Complex Intell Syst","DOI":"10.1007\/s40747-020-00244-2"},{"key":"728_CR7","doi-asserted-by":"crossref","unstructured":"Kareem Z, Zaidan A, Ahmed M, Zaidan B, Albahri O, Alamoodi A, Malik R, Albahri A, Ameen H, Garfan S et al (2021) An approach to pedestrian walking behaviour classification in wireless communication and network failure contexts. Complex Intell Syst","DOI":"10.1007\/s40747-021-00542-3"},{"issue":"4","key":"728_CR8","doi-asserted-by":"publisher","first-page":"522","DOI":"10.1109\/TSMC.2016.2606400","volume":"48","author":"W Fang","year":"2018","unstructured":"Fang W, Yao X, Zhao X, Yin J, Xiong N (2018) A stochastic control approach to maximize profit on service provisioning for mobile cloudlet platforms. IEEE Trans Syst Man Cybern Syst 48(4):522\u2013534","journal-title":"IEEE Trans Syst Man Cybern Syst"},{"issue":"7","key":"728_CR9","doi-asserted-by":"publisher","first-page":"4254","DOI":"10.1109\/TII.2019.2905659","volume":"15","author":"B Lin","year":"2019","unstructured":"Lin B, Zhu F, Zhang J, Chen J, Chen X, Xiong NN, Lloret Mauri J (2019) A time-driven data placement strategy for a scientific workflow combining edge computing and cloud computing. IEEE Trans Ind Inform 15(7):4254\u20134265","journal-title":"IEEE Trans Ind Inform"},{"key":"728_CR10","doi-asserted-by":"crossref","unstructured":"Hosang J, Omran M, Benenson R, Schiele B (2015) Taking a deeper look at pedestrians. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp 4073\u20134082","DOI":"10.1109\/CVPR.2015.7299034"},{"key":"728_CR11","doi-asserted-by":"crossref","unstructured":"Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp 580\u2013587","DOI":"10.1109\/CVPR.2014.81"},{"issue":"6","key":"728_CR12","doi-asserted-by":"publisher","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","volume":"39","author":"S Ren","year":"2016","unstructured":"Ren S, He K, Girshick R, Sun J (2016) Faster r-cnn: towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell 39(6):1137\u20131149","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"728_CR13","doi-asserted-by":"crossref","unstructured":"Liu W, Liao S, Ren W, Hu W, Yu Y (2019) High-level semantic feature detection: A new perspective for pedestrian detection. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition, pp 5182\u20135191","DOI":"10.1109\/CVPR.2019.00533"},{"key":"728_CR14","doi-asserted-by":"publisher","first-page":"3085","DOI":"10.1109\/TMM.2020.3020691","volume":"23","author":"J Zhang","year":"2020","unstructured":"Zhang J, Lin L, Zhu J, Li Y, Chen Y-c, Hu Y, Hoi CS (2020) Attribute-aware pedestrian detection in a crowd. IEEE Trans Multimed 23:3085\u20133097","journal-title":"IEEE Trans Multimed"},{"key":"728_CR15","doi-asserted-by":"crossref","unstructured":"Woo S, Park J, Lee J-Y, Kweon IS (2018) Cbam: convolutional block attention module. In: Proceedings of the European conference on computer vision (ECCV). pp 3\u201319","DOI":"10.1007\/978-3-030-01234-2_1"},{"issue":"4","key":"728_CR16","doi-asserted-by":"publisher","first-page":"743","DOI":"10.1109\/TPAMI.2011.155","volume":"34","author":"P Dollar","year":"2012","unstructured":"Dollar P, Wojek C, Schiele B, Perona P (2012) Pedestrian detection: an evaluation of the state of the art. IEEE Trans Pattern Anal Mach Intell 34(4):743\u2013761","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"728_CR17","doi-asserted-by":"crossref","unstructured":"Zhang S, Benenson R, Schiele B (2017) Citypersons: A diverse dataset for pedestrian detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4457\u20134465","DOI":"10.1109\/CVPR.2017.474"},{"key":"728_CR18","doi-asserted-by":"crossref","unstructured":"Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition, vol 1, pp 886\u2013893","DOI":"10.1109\/CVPR.2005.177"},{"key":"728_CR19","doi-asserted-by":"crossref","unstructured":"Doll\u00e1r P, Tu Z, Perona P, Belongie S (2009) Integral channel features. In: Proceedings of the British machine cision conference, pp 91.1\u201391.11","DOI":"10.5244\/C.23.91"},{"key":"728_CR20","doi-asserted-by":"crossref","unstructured":"Cao J, Pang Y, Li X (2016) Pedestrian detection inspired by appearance constancy and shape symmetry. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp 1316\u20131324","DOI":"10.1109\/CVPR.2016.147"},{"key":"728_CR21","doi-asserted-by":"crossref","unstructured":"Zhang L, Lin L, Liang X, He K (2016) Is faster r-cnn doing well for pedestrian detection? In: Proceedings of the European conference on computer vision. Springer, pp 443\u2013457","DOI":"10.1007\/978-3-319-46475-6_28"},{"key":"728_CR22","doi-asserted-by":"crossref","unstructured":"Mao J, Xiao T, Jiang Y, Cao Z (2017) What can help pedestrian detection? In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp 3127\u20133136","DOI":"10.1109\/CVPR.2017.639"},{"key":"728_CR23","doi-asserted-by":"crossref","unstructured":"Brazil G, Yin X, Liu X (2017) Illuminating pedestrians via simultaneous detection and segmentation. In: Proceedings of the IEEE international conference on computer vision. pp 4950\u20134959","DOI":"10.1109\/ICCV.2017.530"},{"issue":"4","key":"728_CR24","first-page":"985","volume":"20","author":"J Li","year":"2018","unstructured":"Li J, Liang X, Shen S, Xu T, Feng J, Yan S (2018) Scale-aware fast r-cnn for pedestrian detection. IEEE Trans Multimed 20(4):985\u2013996","journal-title":"IEEE Trans Multimed"},{"key":"728_CR25","doi-asserted-by":"crossref","unstructured":"Cai Z, Fan Q, Feris RS, Vasconcelos N (2016) A unified multi-scale deep convolutional neural network for fast object detection. In: Proceedings of the European conference on computer vision. Springer, pp 354\u2013370","DOI":"10.1007\/978-3-319-46493-0_22"},{"key":"728_CR26","doi-asserted-by":"publisher","unstructured":"Yang P, Zhang G, Wang L, Xu L, Deng Q, Yang M-H (2021) A part-aware multi-scale fully convolutional network for pedestrian detection. IEEE Trans Intell Transport Syst 22(2):1125\u20131137. https:\/\/doi.org\/10.1109\/TITS.2019.2963700","DOI":"10.1109\/TITS.2019.2963700"},{"key":"728_CR27","doi-asserted-by":"crossref","unstructured":"Lin C, Lu J, Wang G, Zhou J (2018) Graininess-aware deep feature learning for pedestrian detection. In: Proceedings of the European conference on computer vision","DOI":"10.1007\/978-3-030-01240-3_45"},{"issue":"1","key":"728_CR28","doi-asserted-by":"publisher","first-page":"577","DOI":"10.1007\/s40747-020-00206-8","volume":"7","author":"C Ning","year":"2021","unstructured":"Ning C, Menglu L, Hao Y, Xueping S, Yunhong L (2021) Survey of pedestrian detection with occlusion. Complex Intell Syst 7(1):577\u2013587","journal-title":"Complex Intell Syst"},{"key":"728_CR29","doi-asserted-by":"crossref","unstructured":"Noh J, Lee S, Kim B, Kim G (2018) Improving occlusion and hard negative handling for single-stage pedestrian detectors. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 966\u2013974","DOI":"10.1109\/CVPR.2018.00107"},{"key":"728_CR30","unstructured":"Fu C-Y, Liu W, Ranga A, Tyagi A, Berg AC (2017) Dssd: deconvolutional single shot detector. arXiv preprint arXiv:1701.06659"},{"key":"728_CR31","unstructured":"Zhang T, Ye Q, Zhang B, Liu J, Zhang X, Tian Q (2020) Feature calibration network for occluded pedestrian detection. IEEE Trans Intell Transport Syst"},{"key":"728_CR32","doi-asserted-by":"crossref","unstructured":"Zhou C, Yuan J (2018) Bi-box regression for pedestrian detection and occlusion estimation. In: Proceedings of the European conference on computer vision. pp 135\u2013151","DOI":"10.1007\/978-3-030-01246-5_9"},{"key":"728_CR33","doi-asserted-by":"crossref","unstructured":"Huang X, Ge Z, Jie Z, Yoshie O (2020) Nms by representative region: Towards crowded pedestrian detection by proposal pairing. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition. pp 10747\u201310756","DOI":"10.1109\/CVPR42600.2020.01076"},{"key":"728_CR34","doi-asserted-by":"crossref","unstructured":"Pang Y, Xie J, Khan MH, Anwer RM, Khan FS, Shao L (2019) Mask-guided attention network for occluded pedestrian detection. In: Proceedings of the IEEE\/CVF international conference on computer vision. pp 4967\u20134975","DOI":"10.1109\/ICCV.2019.00507"},{"key":"728_CR35","doi-asserted-by":"crossref","unstructured":"Law H, Deng J (2018) Cornernet: Detecting objects as paired keypoints. In: Proceedings of the European conference on computer vision. pp 734\u2013750","DOI":"10.1007\/978-3-030-01264-9_45"},{"key":"728_CR36","doi-asserted-by":"crossref","unstructured":"Song T, Sun L, Xie D, Sun H, Pu S (2018) Small-scale pedestrian detection based on topological line localization and temporal feature aggregation. In: Proceedings of the European conference on computer vision. pp 536\u2013551","DOI":"10.1007\/978-3-030-01234-2_33"},{"key":"728_CR37","doi-asserted-by":"crossref","unstructured":"Tian Z, Shen C, Chen H, He T (2019) Fcos: Fully convolutional one-stage object detection. In: Proceedings of the IEEE\/CVF international conference on computer vision. pp 9627\u20139636","DOI":"10.1109\/ICCV.2019.00972"},{"key":"728_CR38","doi-asserted-by":"crossref","unstructured":"He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp 770\u2013778","DOI":"10.1109\/CVPR.2016.90"},{"key":"728_CR39","doi-asserted-by":"crossref","unstructured":"Rezatofighi H, Tsoi N, Gwak J, Sadeghian A, Reid I, Savarese S (2019) Generalized intersection over union: a metric and a loss for bounding box regression. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition. pp 658\u2013666","DOI":"10.1109\/CVPR.2019.00075"},{"key":"728_CR40","doi-asserted-by":"crossref","unstructured":"Zhang S, Benenson R, Omran M, Hosang J, Schiele B (2016) How far are we from solving pedestrian detection? In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp 1259\u20131267","DOI":"10.1109\/CVPR.2016.141"},{"key":"728_CR41","first-page":"8026","volume":"32","author":"A Paszke","year":"2019","unstructured":"Paszke A, Gross S, Massa F, Lerer A, Bradbury J, Chanan G, Killeen T, Lin Z, Gimelshein N, Antiga L et al (2019) Pytorch: an imperative style, high-performance deep learning library. Adv Neural Inf Process Syst 32:8026\u20138037","journal-title":"Adv Neural Inf Process Syst"},{"key":"728_CR42","unstructured":"Kingma DP, Ba J (2014) Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980"},{"key":"728_CR43","unstructured":"Tarvainen A, Valpola H (2017) Mean teachers are better role models: weight-averaged consistency targets improve semi-supervised deep learning results. arXiv preprint arXiv:1703.01780"},{"key":"728_CR44","doi-asserted-by":"publisher","unstructured":"Liu W, Liao S, Hu W (2020) Efficient single-stage pedestrian detector by asymptotic localization fitting and multi-scale context encoding. IEEE Trans Image Process 29:1413\u20131425. https:\/\/doi.org\/10.1109\/TIP.2019.2938877","DOI":"10.1109\/TIP.2019.2938877"},{"issue":"2","key":"728_CR45","doi-asserted-by":"publisher","first-page":"982","DOI":"10.1109\/TITS.2020.3019390","volume":"23","author":"C Zhou","year":"2022","unstructured":"Zhou C, Wu M, Lam S-K (2022) A unified multi-task learning architecture for fast and accurate pedestrian detection. IEEE Trans Intell Transport Syst 23(2):982\u2013996","journal-title":"IEEE Trans Intell Transport Syst"},{"key":"728_CR46","doi-asserted-by":"publisher","unstructured":"Jiao Y, Yao H, Xu C (2021) Pen: pose-embedding network for pedestrian detection. IEEE Trans Circuits Syst Video Technol 31(3):1150\u20131162. https:\/\/doi.org\/10.1109\/TCSVT.2020.3000223","DOI":"10.1109\/TCSVT.2020.3000223"},{"key":"728_CR47","doi-asserted-by":"crossref","unstructured":"Zhang S, Yang J, Schiele B (2018) Occluded pedestrian detection through guided attention in cnns. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp 6995\u20137003","DOI":"10.1109\/CVPR.2018.00731"}],"container-title":["Complex &amp; Intelligent Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-022-00728-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s40747-022-00728-3\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-022-00728-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,10,27]],"date-time":"2022-10-27T13:20:15Z","timestamp":1666876815000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s40747-022-00728-3"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,4,22]]},"references-count":47,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2022,12]]}},"alternative-id":["728"],"URL":"https:\/\/doi.org\/10.1007\/s40747-022-00728-3","relation":{},"ISSN":["2199-4536","2198-6053"],"issn-type":[{"type":"print","value":"2199-4536"},{"type":"electronic","value":"2198-6053"}],"subject":[],"published":{"date-parts":[[2022,4,22]]},"assertion":[{"value":"15 November 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"3 February 2022","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"22 April 2022","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}