{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,9]],"date-time":"2026-06-09T18:26:36Z","timestamp":1781029596340,"version":"3.54.1"},"reference-count":35,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2025,12,10]],"date-time":"2025-12-10T00:00:00Z","timestamp":1765324800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,12,10]],"date-time":"2025-12-10T00:00:00Z","timestamp":1765324800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100012165","name":"Key Technologies Research and Development Program","doi-asserted-by":"publisher","award":["NO.2022YFB3305700"],"award-info":[{"award-number":["NO.2022YFB3305700"]}],"id":[{"id":"10.13039\/501100012165","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Auton. Intell. Syst."],"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>Object detection serves as a challenging yet crucial task in computer vision. Despite significant advancements, modern detectors remain struggling with task alignment between localization and classification. In this paper, Global Collaborative Learning (GCL) is introduced to address these challenges from often-overlooked perspectives. First, the essence of GCL is reflected in the label assignment of the detector. Adjusting the loss function to transform samples with strong localization yet weak classification into high-quality samples in both tasks, provides more effective training signals, enabling the model to capture key consistent features. Second, the spirit of GCL is embodied in the head design. By enabling global feature interaction within the decoupled head, the approach ensures that final predictions are made more comprehensively and robustly, thereby preventing the two independent branches from converging into suboptimal solutions for their respective tasks. Extensive experiments on the challenging MS COCO and CrowdHuman datasets demonstrate that the proposed GCL method substantially enhances performance and generalization capabilities.<\/jats:p>","DOI":"10.1007\/s43684-025-00114-z","type":"journal-article","created":{"date-parts":[[2025,12,10]],"date-time":"2025-12-10T10:22:32Z","timestamp":1765362152000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Enhancing object detection through global collaborative learning"],"prefix":"10.1007","volume":"5","author":[{"given":"Weidong","family":"Zhao","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-7458-0254","authenticated-orcid":false,"given":"Jian","family":"Chen","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2097-4561","authenticated-orcid":false,"given":"Xianhui","family":"Liu","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jiahuan","family":"Liu","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2025,12,10]]},"reference":[{"key":"114_CR1","first-page":"770","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"K. He","year":"2016","unstructured":"K. He, X. Zhang, S. Ren, J. Sun, Deep residual learning for image recognition, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2016), pp. 770\u2013778"},{"key":"114_CR2","first-page":"2117","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"T.-Y. Lin","year":"2017","unstructured":"T.-Y. Lin, P. Doll\u2019ar, R. Girshick, K. He, B. Hariharan, S. Belongie, Feature pyramid networks for object detection, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2017), pp. 2117\u20132125"},{"key":"114_CR3","volume-title":"Advances in Neural Information Processing Systems","author":"A. Vaswani","year":"2017","unstructured":"A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, \u0141. Kaiser, I. Polosukhin, Attention is all you need, in Advances in Neural Information Processing Systems, vol.\u00a030 (2017)"},{"issue":"6","key":"114_CR4","doi-asserted-by":"publisher","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","volume":"39","author":"S. Ren","year":"2016","unstructured":"S. Ren, K. He, R. Girshick, J. Sun, Faster r-cnn: towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 39(6), 1137\u20131149 (2016)","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"114_CR5","doi-asserted-by":"publisher","first-page":"9626","DOI":"10.1109\/ICCV.2019.00972","volume-title":"2019 IEEE\/CVF International Conference on Computer Vision (ICCV)","author":"Z. Tian","year":"2019","unstructured":"Z. Tian, C. Shen, H. Chen, T. He, Fcos: fully convolutional one-stage object detection, in 2019 IEEE\/CVF International Conference on Computer Vision (ICCV) (2019), pp. 9626\u20139635"},{"key":"114_CR6","first-page":"9759","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"S. Zhang","year":"2020","unstructured":"S. Zhang, C. Chi, Y. Yao, Z. Lei, S.Z. Li, Bridging the gap between anchor-based and anchor-free detection via adaptive training sample selection, in Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (2020), pp. 9759\u20139768"},{"key":"114_CR7","first-page":"21002","volume":"33","author":"X. Li","year":"2020","unstructured":"X. Li, W. Wang, L. Wu, S. Chen, X. Hu, J. Li, J. Tang, J. Yang, Generalized focal loss: learning qualified and distributed bounding boxes for dense object detection. Adv. Neural Inf. Process. Syst. 33, 21002\u201321012 (2020)","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"114_CR8","first-page":"213","volume-title":"European Conference on Computer Vision","author":"N. Carion","year":"2020","unstructured":"N. Carion, F. Massa, G. Synnaeve, N. Usunier, A. Kirillov, S. Zagoruyko, End-to-end object detection with transformers, in European Conference on Computer Vision (Springer, Berlin, 2020), pp. 213\u2013229"},{"key":"114_CR9","first-page":"9387","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"S. Li","year":"2022","unstructured":"S. Li, C. He, R. Li, L. Zhang, A dual weighting label assignment scheme for object detection, in Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (2022), pp. 9387\u20139396"},{"key":"114_CR10","doi-asserted-by":"publisher","first-page":"3490","DOI":"10.1109\/ICCV48922.2021.00349","volume-title":"2021 IEEE\/CVF International Conference on Computer Vision (ICCV)","author":"C. Feng","year":"2021","unstructured":"C. Feng, Y. Zhong, Y. Gao, M.R. Scott, W. Huang, Tood: task-aligned one-stage object detection, in 2021 IEEE\/CVF International Conference on Computer Vision (ICCV) (IEEE Computer Society, 2021), pp. 3490\u20133499"},{"key":"114_CR11","unstructured":"J.-W. Ma, M. Liang, L. Chen, S. Tian, S.-L. Chen, J. Qin, X.-C. Yin, Sample weighting with hierarchical equalization loss for dense object detection. IEEE Trans. Multimed. (2023)"},{"key":"114_CR12","doi-asserted-by":"crossref","unstructured":"X. Tang, Q. Yang, X. Zhang, W. Deng, H. Wang, X. Gao, A refinement method for single-stage object detection based on progressive decoupled task alignment. IEEE Trans. Circuits Syst. Video Technol. (2023)","DOI":"10.1109\/TCSVT.2023.3323879"},{"key":"114_CR13","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2023.109878","volume":"145","author":"W. Lin","year":"2024","unstructured":"W. Lin, J. Chu, L. Leng, J. Miao, L. Wang, Feature disentanglement in one-stage object detection. Pattern Recognit. 145, 109878 (2024)","journal-title":"Pattern Recognit."},{"key":"114_CR14","doi-asserted-by":"publisher","first-page":"21","DOI":"10.1007\/978-3-319-46448-0_2","volume-title":"Computer Vision-ECCV 2016: 14th European Conference","author":"W. Liu","year":"2016","unstructured":"W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C.-Y. Fu, A.C. Berg, Ssd: single shot multibox detector, in Computer Vision-ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11\u201314, 2016, Proceedings, Part I, vol.\u00a014 (Springer, Berlin, 2016), pp. 21\u201337"},{"key":"114_CR15","doi-asserted-by":"publisher","first-page":"7389","DOI":"10.1109\/TIP.2020.3002345","volume":"29","author":"T. Kong","year":"2020","unstructured":"T. Kong, F. Sun, H. Liu, Y. Jiang, L. Li, J. Shi, Foveabox: beyond anchor-based object detection. IEEE Trans. Image Process. 29, 7389\u20137398 (2020)","journal-title":"IEEE Trans. Image Process."},{"key":"114_CR16","volume-title":"Advances in Neural Information Processing Systems","author":"X. Zhang","year":"2019","unstructured":"X. Zhang, F. Wan, C. Liu, R. Ji, Q. Ye, Freeanchor: learning to match anchors for visual object detection, in Advances in Neural Information Processing Systems, vol.\u00a032 (2019)"},{"key":"114_CR17","first-page":"10588","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"H. Li","year":"2020","unstructured":"H. Li, Z. Wu, C. Zhu, C. Xiong, R. Socher, L.S. Davis, Learning from noisy anchors for one-stage object detection, in Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (2020), pp. 10588\u201310597"},{"key":"114_CR18","first-page":"3641","volume-title":"Proceedings of the IEEE\/CVF International Conference on Computer Vision","author":"Z. Gao","year":"2021","unstructured":"Z. Gao, L. Wang, G. Wu, Mutual supervision for dense object detection, in Proceedings of the IEEE\/CVF International Conference on Computer Vision (2021), pp. 3641\u20133650"},{"key":"114_CR19","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2024.127383","volume":"577","author":"Y. Zhang","year":"2024","unstructured":"Y. Zhang, C. Luo, A dynamic label assignment strategy for one-stage detectors. Neurocomputing 577, 127383 (2024)","journal-title":"Neurocomputing"},{"key":"114_CR20","first-page":"2980","volume-title":"Proceedings of the IEEE International Conference on Computer Vision","author":"T.-Y. Lin","year":"2017","unstructured":"T.-Y. Lin, P. Goyal, R. Girshick, K. He, P. Doll\u2019ar, Focal loss for dense object detection, in Proceedings of the IEEE International Conference on Computer Vision (2017), pp. 2980\u20132988"},{"key":"114_CR21","first-page":"8514","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"H. Zhang","year":"2021","unstructured":"H. Zhang, Y. Wang, F. Dayoub, N. Sunderhauf, Var-ifocalnet: an iou-aware dense object detector, in Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (2021), pp. 8514\u20138523"},{"key":"114_CR22","first-page":"11583","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Y. Cao","year":"2020","unstructured":"Y. Cao, K. Chen, C.C. Loy, D. Lin, Prime sample attention in object detection, in Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (2020), pp. 11583\u201311591"},{"key":"114_CR23","first-page":"1440","volume-title":"Proceedings of the IEEE International Conference on Computer Vision","author":"R. Girshick","year":"2015","unstructured":"R. Girshick, Fast r-cnn, in Proceedings of the IEEE International Conference on Computer Vision (2015), pp. 1440\u20131448"},{"key":"114_CR24","first-page":"10186","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Y. Wu","year":"2020","unstructured":"Y. Wu, Y. Chen, L. Yuan, Z. Liu, L. Wang, H. Li, Y. Fu, Rethinking classification and localization for object detection, in Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (2020), pp. 10186\u201310195"},{"key":"114_CR25","first-page":"11563","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"G. Song","year":"2020","unstructured":"G. Song, Y. Liu, X. Wang, Revisiting the sibling head in object detector, in Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (2020), pp. 11563\u201311572"},{"key":"114_CR26","first-page":"764","volume-title":"Proceedings of the IEEE International Conference on Computer Vision","author":"J. Dai","year":"2017","unstructured":"J. Dai, H. Qi, Y. Xiong, Y. Li, G. Zhang, H. Hu, Y. Wei, Deformable convolutional networks, in Proceedings of the IEEE International Conference on Computer Vision (2017), pp. 764\u2013773"},{"key":"114_CR27","doi-asserted-by":"publisher","first-page":"740","DOI":"10.1007\/978-3-319-10602-1_48","volume-title":"Computer Vision-ECCV 2014: 13th European Conference","author":"T.-Y. Lin","year":"2014","unstructured":"T.-Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Doll\u2019ar, C.L. Zitnick, Microsoft coco: common objects in context, in Computer Vision-ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6\u201312, 2014, Proceedings, Part V, vol.\u00a013 (Springer, Berlin, 2014), pp. 740\u2013755"},{"key":"114_CR28","unstructured":"S. Shao, Z. Zhao, B. Li, T. Xiao, G. Yu, X. Zhang, J. Sun, Crowdhuman: a benchmark for detecting human in a crowd. arXiv preprint (2018). arXiv:1805.00123"},{"key":"114_CR29","unstructured":"K. Chen, J. Wang, J. Pang, Y. Cao, Y. Xiong, X. Li, S. Sun, W. Feng, Z. Liu, J. Xu, Z. Zhang, D. Cheng, C. Zhu, T. Cheng, Q. Zhao, B. Li, X. Lu, R. Zhu, Y. Wu, J. Dai, J. Wang, J. Shi, W. Ouyang, C.C. Loy, D. Lin, MMDetection: open mmlab detection toolbox and benchmark. arXiv preprint (2019). arXiv:1906.07155"},{"key":"114_CR30","doi-asserted-by":"publisher","first-page":"248","DOI":"10.1109\/CVPR.2009.5206848","volume-title":"2009 IEEE Conference on Computer Vision and Pattern Recognition","author":"J. Deng","year":"2009","unstructured":"J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, L. Fei-Fei, Imagenet: a large-scale hierarchical image database, in 2009 IEEE Conference on Computer Vision and Pattern Recognition (IEEE, 2009), pp. 248\u2013255"},{"key":"114_CR31","unstructured":"P. Goyal, P. Doll\u2019ar, R. Girshick, P. Noordhuis, L. Wesolowski, A. Kyrola, A. Tulloch, Y. Jia, K. He, Accurate, large minibatch sgd: training imagenet in 1 hour. arXiv preprint (2017). arXiv:1706.02677"},{"key":"114_CR32","doi-asserted-by":"publisher","first-page":"14449","DOI":"10.1109\/CVPR46437.2021.01422","volume-title":"2021 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","author":"P. Sun","year":"2021","unstructured":"P. Sun, R. Zhang, Y. Jiang, T. Kong, C. Xu, W. Zhan, M. Tomizuka, L. Li, Z. Yuan, C. Wang, P. Luo, Sparse r-cnn: end-to-end object detection with learnable proposals, in 2021 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2021), pp. 14449\u201314458"},{"key":"114_CR33","first-page":"784","volume-title":"Proceedings of the European Conference on Computer Vision (ECCV)","author":"B. Jiang","year":"2018","unstructured":"B. Jiang, R. Luo, J. Mao, T. Xiao, Y. Jiang, Acquisition of localization confidence for accurate object detection, in Proceedings of the European Conference on Computer Vision (ECCV) (2018), pp. 784\u2013799"},{"key":"114_CR34","first-page":"355","volume-title":"Computer Vision-ECCV 2020: 16th European Conference","author":"K. Kang","year":"2020","unstructured":"K. Kang, H.S. Lee, Probabilistic anchor assignment with iou prediction for object detection, in Computer Vision-ECCV 2020: 16th European Conference, Glasgow, UK, August 23\u201328, 2020, Proceedings, Part XXV, vol.\u00a016 (Springer, Berlin, 2020), pp. 355\u2013371"},{"key":"114_CR35","first-page":"7373","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"X. Dai","year":"2021","unstructured":"X. Dai, Y. Chen, B. Xiao, D. Chen, M. Liu, L. Yuan, L. Zhang, Dynamic head: unifying object detection heads with attentions, in Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (2021), pp. 7373\u20137382"}],"container-title":["Autonomous Intelligent Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s43684-025-00114-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s43684-025-00114-z\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s43684-025-00114-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,12,10]],"date-time":"2025-12-10T10:22:37Z","timestamp":1765362157000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s43684-025-00114-z"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,12,10]]},"references-count":35,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2025,12]]}},"alternative-id":["114"],"URL":"https:\/\/doi.org\/10.1007\/s43684-025-00114-z","relation":{},"ISSN":["2730-616X"],"issn-type":[{"value":"2730-616X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,12,10]]},"assertion":[{"value":"23 December 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"14 May 2025","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"7 October 2025","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"10 December 2025","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"We declare that we have no conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"29"}}