{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T17:19:09Z","timestamp":1777655949436,"version":"3.51.4"},"reference-count":46,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2025,11,21]],"date-time":"2025-11-21T00:00:00Z","timestamp":1763683200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,11,21]],"date-time":"2025-11-21T00:00:00Z","timestamp":1763683200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100002835","name":"Chalmers University of Technology","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100002835","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Machine Vision and Applications"],"published-print":{"date-parts":[[2026,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>\n                    We address multi-view pedestrian detection in a setting where labeled data is collected using a multi-camera setup different from the one used for testing. While recent multi-view pedestrian detectors perform well on the camera rig used for training, their performance declines when applied to a different setup. To facilitate seamless deployment across varied camera rigs, we propose an unsupervised domain adaptation (UDA) method that adapts the model to new rigs without requiring additional labeled data. Specifically, we leverage the mean teacher self-training framework with a novel pseudo-labeling technique tailored to multi-view pedestrian detection. This method achieves state-of-the-art performance on multiple benchmarks, including MultiviewX\n                    <jats:inline-formula>\n                      <jats:tex-math>$$\\rightarrow $$<\/jats:tex-math>\n                    <\/jats:inline-formula>\n                    Wildtrack. Unlike previous methods, our approach eliminates the need for external labeled monocular datasets, thereby reducing reliance on labeled data. Extensive evaluations demonstrate the effectiveness of our method and validate key design choices. By enabling robust adaptation across camera setups, our work enhances the practicality of multi-view pedestrian detectors and establishes a strong UDA baseline for future research.\n                  <\/jats:p>","DOI":"10.1007\/s00138-025-01764-y","type":"journal-article","created":{"date-parts":[[2025,11,21]],"date-time":"2025-11-21T13:12:59Z","timestamp":1763730779000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["MVUDA: Unsupervised Domain Adaptation for Multi-view Pedestrian Detection"],"prefix":"10.1007","volume":"37","author":[{"given":"Erik","family":"Brorsson","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Lennart","family":"Svensson","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kristofer","family":"Bengtsson","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Knut","family":"\u00c5kesson","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2025,11,21]]},"reference":[{"key":"1764_CR1","doi-asserted-by":"crossref","unstructured":"Ferryman, J., Shahrokni, A.: Pets2009: Dataset and challenge. In: 2009 Twelfth IEEE International Workshop on Performance Evaluation of Tracking and Surveillance, pp. 1\u20136 (2009). IEEE","DOI":"10.1109\/PETS-WINTER.2009.5399556"},{"key":"1764_CR2","doi-asserted-by":"crossref","unstructured":"Coates, A., Ng, A.Y.: Multi-camera object detection for robotics. In: 2010 IEEE International Conference on Robotics and Automation, pp. 412\u2013419 (2010). IEEE","DOI":"10.1109\/ROBOT.2010.5509644"},{"key":"1764_CR3","doi-asserted-by":"publisher","first-page":"855","DOI":"10.1007\/s00138-009-0212-0","volume":"21","author":"J Ren","year":"2010","unstructured":"Ren, J., Xu, M., Orwell, J., Jones, G.A.: Multi-camera video surveillance for real-time analysis and reconstruction of soccer games. Mach. Vis. Appl. 21, 855\u2013863 (2010)","journal-title":"Mach. Vis. Appl."},{"issue":"9","key":"1764_CR4","doi-asserted-by":"publisher","first-page":"5488","DOI":"10.1109\/LRA.2023.3296333","volume":"8","author":"Z Zhang","year":"2023","unstructured":"Zhang, Z., Hajieghrary, H., Dean, E., \u00c5kesson, K.: Prescient collision-free navigation of mobile robots with iterative multimodal motion prediction of dynamic obstacles. IEEE Robotics Autom. Lett. 8(9), 5488\u20135495 (2023). https:\/\/doi.org\/10.1109\/LRA.2023.3296333","journal-title":"IEEE Robotics Autom. Lett."},{"key":"1764_CR5","first-page":"1","volume-title":"Computer Vision - ECCV 2020","author":"Y Hou","year":"2020","unstructured":"Hou, Y., Zheng, L., Gould, S.: Multiview detection with feature perspective transformation. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) Computer Vision - ECCV 2020, pp. 1\u201318. Springer, Cham (2020)"},{"key":"1764_CR6","doi-asserted-by":"crossref","unstructured":"Vora, J., Dutta, S., Jain, K., Karthik, S., Gandhi, V.: Bringing generalization to deep multi-view pedestrian detection. In: Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision, pp. 110\u2013119 (2023)","DOI":"10.1109\/WACVW58289.2023.00016"},{"key":"1764_CR7","doi-asserted-by":"crossref","unstructured":"Song, L., Wu, J., Yang, M., Zhang, Q., Li, Y., Yuan, J.: Stacked homography transformations for multi-view pedestrian detection. In: Proceedings of the IEEE\/CVF International Conference on Computer Vision, pp. 6049\u20136057 (2021)","DOI":"10.1109\/ICCV48922.2021.00599"},{"key":"1764_CR8","doi-asserted-by":"crossref","unstructured":"Qiu, R., Xu, M., Yan, Y., Smith, J.S., Yang, X.: 3D random occlusion and multi-layer projection for deep multi-camera pedestrian localization. In: European Conference on Computer Vision, pp. 695\u2013710 (2022). Springer","DOI":"10.1007\/978-3-031-20080-9_40"},{"key":"1764_CR9","doi-asserted-by":"crossref","unstructured":"Engilberge, M., Shi, H., Wang, Z., Fua, P.: Two-level data augmentation for calibrated multi-view detection. In: Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision, pp. 128\u2013136 (2023)","DOI":"10.1109\/WACV56688.2023.00021"},{"key":"1764_CR10","doi-asserted-by":"crossref","unstructured":"Aung, S., Park, H., Jung, H., Cho, J.: Enhancing multi-view pedestrian detection through generalized 3D feature pulling. In: Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision, pp. 1196\u20131205 (2024)","DOI":"10.1109\/WACV57701.2024.00123"},{"key":"1764_CR11","doi-asserted-by":"crossref","unstructured":"Deng, J., Li, W., Chen, Y., Duan, L.: Unbiased mean teacher for cross-domain object detection. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 4091\u20134101 (2021)","DOI":"10.1109\/CVPR46437.2021.00408"},{"key":"1764_CR12","doi-asserted-by":"crossref","unstructured":"Hoyer, L., Dai, D., Van\u00a0Gool, L.: Daformer: Improving network architectures and training strategies for domain-adaptive semantic segmentation. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 9924\u20139935 (2022)","DOI":"10.1109\/CVPR52688.2022.00969"},{"key":"1764_CR13","doi-asserted-by":"crossref","unstructured":"Li, Y.-J., Dai, X., Ma, C.-Y., Liu, Y.-C., Chen, K., Wu, B., He, Z., Kitani, K., Vajda, P.: Cross-domain adaptive teacher for object detection. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 7581\u20137590 (2022)","DOI":"10.1109\/CVPR52688.2022.00743"},{"key":"1764_CR14","unstructured":"Tarvainen, A., Valpola, H.: Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results. Advances in neural information processing systems 30 (2017)"},{"key":"1764_CR15","doi-asserted-by":"crossref","unstructured":"Lima, J.P., Thomas, D., Uchiyama, H., Teichrieb, V.: Toward unlabeled multi-view 3D pedestrian detection by generalizable AI: techniques and performance analysis. In: 2023 36th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI), pp. 1\u20136 (2023). IEEE","DOI":"10.1109\/SIBGRAPI59091.2023.10347151"},{"key":"1764_CR16","doi-asserted-by":"crossref","unstructured":"Lima, J.P., Thomas, D., Uchiyama, H., Teichrieb, V.: Mean teacher for unsupervised domain adaptation in multi-view 3D pedestrian detection. In: 2024 37th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI), pp. 1\u20136 (2024). IEEE","DOI":"10.1109\/SIBGRAPI62404.2024.10716327"},{"issue":"2","key":"1764_CR17","doi-asserted-by":"publisher","first-page":"267","DOI":"10.1109\/TPAMI.2007.1174","volume":"30","author":"F Fleuret","year":"2007","unstructured":"Fleuret, F., Berclaz, J., Lengagne, R., Fua, P.: Multicamera people tracking with a probabilistic occupancy map. IEEE Trans. Pattern Anal. Mach. Intell. 30(2), 267\u2013282 (2007)","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"1764_CR18","doi-asserted-by":"publisher","first-page":"39","DOI":"10.1007\/s10851-010-0258-7","volume":"41","author":"A Alahi","year":"2011","unstructured":"Alahi, A., Jacques, L., Boursier, Y., Vandergheynst, P.: Sparsity driven people localization with a heterogeneous network of cameras. J. Mathem. Imaging Vision 41, 39\u201358 (2011)","journal-title":"J. Mathem. Imaging Vision"},{"issue":"5","key":"1764_CR19","doi-asserted-by":"publisher","first-page":"1760","DOI":"10.1016\/j.patcog.2014.12.004","volume":"48","author":"P Peng","year":"2015","unstructured":"Peng, P., Tian, Y., Wang, Y., Li, J., Huang, T.: Robust multiple cameras pedestrian detection with multi-view bayesian network. Pattern Recogn. 48(5), 1760\u20131772 (2015)","journal-title":"Pattern Recogn."},{"issue":"4","key":"1764_CR20","doi-asserted-by":"publisher","first-page":"61","DOI":"10.1007\/s00138-022-01323-9","volume":"33","author":"JP Lima","year":"2022","unstructured":"Lima, J.P., Roberto, R., Figueiredo, L., Sim\u00f5es, F., Thomas, D., Uchiyama, H., Teichrieb, V.: 3D pedestrian localization using multiple cameras: A generalizable approach. Mach. Vis. Appl. 33(4), 61 (2022)","journal-title":"Mach. Vis. Appl."},{"issue":"5","key":"1764_CR21","doi-asserted-by":"publisher","first-page":"1211","DOI":"10.1007\/s10115-022-01673-w","volume":"64","author":"A L\u00f3pez-Cifuentes","year":"2022","unstructured":"L\u00f3pez-Cifuentes, A., Escudero-Vi\u00f1olo, M., Besc\u00f3s, J., Carballeira, P.: Semantic-driven multi-camera pedestrian detection. Knowl. Inf. Syst. 64(5), 1211\u20131237 (2022)","journal-title":"Knowl. Inf. Syst."},{"key":"1764_CR22","doi-asserted-by":"crossref","unstructured":"Roig, G., Boix, X., Shitrit, H.B., Fua, P.: Conditional random fields for multi-camera object detection. In: 2011 International Conference on Computer Vision, pp. 563\u2013570 (2011). IEEE","DOI":"10.1109\/ICCV.2011.6126289"},{"key":"1764_CR23","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2024.110807","volume":"156","author":"R Qiu","year":"2024","unstructured":"Qiu, R., Xu, M., Yan, Y., Smith, J.S., Ling, Y.: PPM: A boolean optimizer for data association in multi-view pedestrian detection. Pattern Recogn. 156, 110807 (2024)","journal-title":"Pattern Recogn."},{"key":"1764_CR24","doi-asserted-by":"crossref","unstructured":"Chavdarova, T., Fleuret, F.: Deep multi-camera people detection. In: 2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA), pp. 848\u2013853 (2017). IEEE","DOI":"10.1109\/ICMLA.2017.00-50"},{"key":"1764_CR25","doi-asserted-by":"crossref","unstructured":"Baqu\u00e9, P., Fleuret, F., Fua, P.: Deep occlusion reasoning for multi-camera multi-target detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 271\u2013279 (2017)","DOI":"10.1109\/ICCV.2017.38"},{"key":"1764_CR26","doi-asserted-by":"crossref","unstructured":"Lee, W.-Y., Jovanov, L., Philips, W.: Multi-view target transformation for pedestrian detection. In: Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision, pp. 90\u201399 (2023)","DOI":"10.1109\/WACVW58289.2023.00014"},{"key":"1764_CR27","doi-asserted-by":"crossref","unstructured":"Teepe, T., Wolters, P., Gilg, J., Herzog, F., Rigoll, G.: EarlyBird: Early-fusion for multi-view tracking in the bird\u2019s eye view. In: Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision, pp. 102\u2013111 (2024)","DOI":"10.1109\/WACVW60836.2024.00018"},{"key":"1764_CR28","doi-asserted-by":"crossref","unstructured":"Hou, Y., Zheng, L.: Multiview detection with shadow transformer (and view-coherent data augmentation). In: Proceedings of the 29th ACM International Conference on Multimedia, pp. 1673\u20131682 (2021)","DOI":"10.1145\/3474085.3475310"},{"issue":"59","key":"1764_CR29","first-page":"1","volume":"17","author":"Y Ganin","year":"2016","unstructured":"Ganin, Y., Ustinova, E., Ajakan, H., Germain, P., Larochelle, H., Laviolette, F., March, M., Lempitsky, V.: Domain-adversarial training of neural networks. J. Mach. Learn. Res. 17(59), 1\u201335 (2016)","journal-title":"J. Mach. Learn. Res."},{"key":"1764_CR30","unstructured":"Long, M., Cao, Y., Wang, J., Jordan, M.: Learning transferable features with deep adaptation networks. In: International Conference on Machine Learning, pp. 97\u2013105 (2015). PMLR"},{"key":"1764_CR31","doi-asserted-by":"crossref","unstructured":"Saito, K., Watanabe, K., Ushiku, Y., Harada, T.: Maximum classifier discrepancy for unsupervised domain adaptation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3723\u20133732 (2018)","DOI":"10.1109\/CVPR.2018.00392"},{"key":"1764_CR32","unstructured":"Hoffman, J., Wang, D., Yu, F., Darrell, T.: Fcns in the wild: Pixel-level adversarial and constraint-based adaptation. arXiv preprint arXiv:1612.02649 (2016)"},{"key":"1764_CR33","unstructured":"Hoffman, J., Tzeng, E., Park, T., Zhu, J.-Y., Isola, P., Saenko, K., Efros, A., Darrell, T.: Cycada: Cycle-consistent adversarial domain adaptation. In: International Conference on Machine Learning, pp. 1989\u20131998 (2018). Pmlr"},{"key":"1764_CR34","doi-asserted-by":"crossref","unstructured":"Gong, R., Li, W., Chen, Y., Gool, L.V.: Dlow: Domain flow for adaptation and generalization. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 2477\u20132486 (2019)","DOI":"10.1109\/CVPR.2019.00258"},{"key":"1764_CR35","doi-asserted-by":"crossref","unstructured":"Tsai, Y.-H., Hung, W.-C., Schulter, S., Sohn, K., Yang, M.-H., Chandraker, M.: Learning to adapt structured output space for semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7472\u20137481 (2018)","DOI":"10.1109\/CVPR.2018.00780"},{"key":"1764_CR36","doi-asserted-by":"crossref","unstructured":"Cai, Q., Pan, Y., Ngo, C.-W., Tian, X., Duan, L., Yao, T.: Exploring object relation in mean teacher for cross-domain detection. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 11457\u201311466 (2019)","DOI":"10.1109\/CVPR.2019.01172"},{"key":"1764_CR37","doi-asserted-by":"crossref","unstructured":"Cao, S., Joshi, D., Gui, L.-Y., Wang, Y.-X.: Contrastive mean teacher for domain adaptive object detectors. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 23839\u201323848 (2023)","DOI":"10.1109\/CVPR52729.2023.02283"},{"key":"1764_CR38","unstructured":"Lee, D.-H., : Pseudo-Label: The simple and efficient semi-supervised learning method for deep neural networks. In: Workshop on Challenges in Representation Learning, ICML, vol. 3, p. 896 (2013). Atlanta"},{"issue":"4","key":"1764_CR39","doi-asserted-by":"publisher","first-page":"1106","DOI":"10.1007\/s11263-020-01395-y","volume":"129","author":"Z Zheng","year":"2021","unstructured":"Zheng, Z., Yang, Y.: Rectifying pseudo label learning via uncertainty estimation for domain adaptive semantic segmentation. Int. J. Comput. Vision 129(4), 1106\u20131120 (2021)","journal-title":"Int. J. Comput. Vision"},{"key":"1764_CR40","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770\u2013778 (2016)","DOI":"10.1109\/CVPR.2016.90"},{"key":"1764_CR41","doi-asserted-by":"crossref","unstructured":"Chavdarova, T., Baqu\u00e9, P., Bouquet, S., Maksai, A., Jose, C., Bagautdinov, T., Lettry, L., Fua, P., Van\u00a0Gool, L., Fleuret, F.: Wildtrack: A multi-camera HD dataset for dense unscripted pedestrian detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5030\u20135039 (2018)","DOI":"10.1109\/CVPR.2018.00528"},{"issue":"2","key":"1764_CR42","doi-asserted-by":"publisher","first-page":"319","DOI":"10.1109\/TPAMI.2008.57","volume":"31","author":"R Kasturi","year":"2008","unstructured":"Kasturi, R., Goldgof, D., Soundararajan, P., Manohar, V., Garofolo, J., Bowers, R., Boonstra, M., Korzhova, V., Zhang, J.: Framework for performance evaluation of face, text, and vehicle detection and tracking in video: Data, metrics, and protocol. IEEE Trans. Pattern Anal. Mach. Intell. 31(2), 319\u2013336 (2008)","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"1764_CR43","doi-asserted-by":"crossref","unstructured":"Smith, L.N., Topin, N.: Super-convergence: Very fast training of neural networks using large learning rates. In: Artificial Intelligence and Machine Learning for Multi-domain Operations Applications, vol. 11006, pp. 369\u2013386 (2019). SPIE","DOI":"10.1117\/12.2520589"},{"key":"1764_CR44","doi-asserted-by":"crossref","unstructured":"Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., Fei-Fei, L.: Imagenet: A large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248\u2013255 (2009). IEEE","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"1764_CR45","unstructured":"Leal-Taix\u00e9, L., Milan, A., Reid, I., Roth, S., Schindler, K.: MOTChallenge 2015: Towards a Benchmark for Multi-Target Tracking (2015). https:\/\/arxiv.org\/abs\/1504.01942"},{"key":"1764_CR46","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2024.128458","volume":"607","author":"R Qiu","year":"2024","unstructured":"Qiu, R., Xu, M., Ling, Y., Smith, J.S., Yan, Y., Wang, X.: A deep top-down framework towards generalisable multi-view pedestrian detection. Neurocomputing 607, 128458 (2024)","journal-title":"Neurocomputing"}],"container-title":["Machine Vision and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00138-025-01764-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s00138-025-01764-y","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00138-025-01764-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,1,26]],"date-time":"2026-01-26T15:07:51Z","timestamp":1769440071000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s00138-025-01764-y"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,11,21]]},"references-count":46,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2026,1]]}},"alternative-id":["1764"],"URL":"https:\/\/doi.org\/10.1007\/s00138-025-01764-y","relation":{},"ISSN":["0932-8092","1432-1769"],"issn-type":[{"value":"0932-8092","type":"print"},{"value":"1432-1769","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,11,21]]},"assertion":[{"value":"3 June 2025","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"14 September 2025","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"19 October 2025","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"21 November 2025","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors have no Conflict of interest to declare that are relevant to the content of this article.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}},{"value":"The authors declare no Conflict of interest.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}],"article-number":"6"}}