{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,16]],"date-time":"2026-06-16T12:29:01Z","timestamp":1781612941140,"version":"3.54.5"},"reference-count":53,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2024,3,22]],"date-time":"2024-03-22T00:00:00Z","timestamp":1711065600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,3,22]],"date-time":"2024-03-22T00:00:00Z","timestamp":1711065600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100016378","name":"Technische Universit\u00e4t Dortmund","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100016378","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Image Video Proc."],"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>This contribution presents the TOMIE framework (Tracking Of Multiple Industrial Entities), a framework for the continuous tracking of industrial entities (e.g., pallets, crates, barrels) over a network of, in this example, six RGB cameras. This framework makes use of multiple sensors, data pipelines, and data annotation procedures, and is described in detail in this contribution. With the vision of a fully automated tracking system for industrial entities in mind, it enables researchers to efficiently capture high-quality data in an industrial setting. Using this framework, an image dataset, the TOMIE dataset, is created, which at the same time is used to gauge the framework\u2019s validity. This dataset contains annotation files for 112,860 frames and 640,936 entity instances that are captured from a set of six cameras that perceive a large indoor space. This dataset out-scales comparable datasets by a factor of four and is made up of scenarios, drawn from industrial applications from the sector of warehousing. Three tracking algorithms, namely ByteTrack, Bot-Sort, and SiamMOT, are applied to this dataset, serving as a proof-of-concept and providing tracking results that are comparable to the state of the art.<\/jats:p>","DOI":"10.1186\/s13640-024-00623-6","type":"journal-article","created":{"date-parts":[[2024,3,22]],"date-time":"2024-03-22T14:02:25Z","timestamp":1711116145000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":8,"title":["Semi-automated computer vision-based tracking of multiple industrial entities: a framework and dataset creation approach"],"prefix":"10.1186","volume":"2024","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6907-9296","authenticated-orcid":false,"given":"J\u00e9r\u00f4me","family":"Rutinowski","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Hazem","family":"Youssef","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Sven","family":"Franke","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Irfan Fachrudin","family":"Priyanta","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Frederik","family":"Polachowski","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Moritz","family":"Roidl","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Christopher","family":"Reining","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2024,3,22]]},"reference":[{"key":"623_CR1","doi-asserted-by":"publisher","first-page":"3709","DOI":"10.3390\/s20133709","volume":"20","author":"A Frank\u00f3","year":"2020","unstructured":"A. Frank\u00f3, G. Vida, P. Varga, Reliable identification schemes for asset and production tracking in industry 4.0. Sensors 20, 3709 (2020). https:\/\/doi.org\/10.3390\/s20133709","journal-title":"Sensors"},{"key":"623_CR2","doi-asserted-by":"publisher","unstructured":"L. Anuj, M. G. Krishna, Multiple camera based multiple object tracking under occlusion: a survey, in International Conference on Innovative Mechanisms for Industry Applications (ICIMIA), pp. 432\u2013437 (2017). https:\/\/doi.org\/10.1109\/ICIMIA.2017.7975652","DOI":"10.1109\/ICIMIA.2017.7975652"},{"key":"623_CR3","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3524497","volume":"55","author":"W Liu","year":"2023","unstructured":"W. Liu, Q. Bao, Y. Sun, T. Mei, Recent advances of monocular 2D and 3D human pose estimation: a deep learning perspective. ACM Comput. Surv. 55, 1\u201341 (2023). https:\/\/doi.org\/10.1145\/3524497","journal-title":"ACM Comput. Surv."},{"key":"623_CR4","doi-asserted-by":"publisher","unstructured":"Y. Zhan, F. Li, R. Weng, W. Choi, Ray3D: ray-based 3D human pose estimation for monocular absolute 3D localization, in Computer Vision and Pattern Recognition (CVPR), pp. 13106\u201313115 (2022). https:\/\/doi.org\/10.1109\/CVPR52688.2022.01277","DOI":"10.1109\/CVPR52688.2022.01277"},{"key":"623_CR5","doi-asserted-by":"publisher","DOI":"10.1016\/j.cviu.2021.103225","author":"J Wang","year":"2021","unstructured":"J. Wang, S. Tan, X. Zhen, S. Xu, F. Zheng, Z. He, L. Shao, Deep 3D human pose estimation: a review. Comput. Vis. Image Underst. (2021). https:\/\/doi.org\/10.1016\/j.cviu.2021.103225","journal-title":"Comput. Vis. Image Underst."},{"key":"623_CR6","doi-asserted-by":"publisher","first-page":"61","DOI":"10.1016\/j.neucom.2019.11.023","volume":"381","author":"G Ciaparrone","year":"2020","unstructured":"G. Ciaparrone, F. Luque S\u00e1nchez, S. Tabik, L. Troiano, R. Tagliaferri, F. Herrera, Deep learning in video multi-object tracking: a survey. Neurocomputing 381, 61\u201388 (2020). https:\/\/doi.org\/10.1016\/j.neucom.2019.11.023","journal-title":"Neurocomputing"},{"key":"623_CR7","doi-asserted-by":"publisher","first-page":"845","DOI":"10.1007\/s11263-020-01393-0","volume":"129","author":"P Dendorfer","year":"2021","unstructured":"P. Dendorfer, A. Osep, A. Milan, K. Schindler, D. Cremers, I. Reid, S. Roth, L. Leal-Taix\u00e9, Motchalllenge: a benchmark for single-camera multiple target tracking. Int. J. Comput. Vis. 129, 845\u2013881 (2021). 381, 61\u201388 (2020). https:\/\/doi.org\/10.1007\/s11263-020-01393-0","journal-title":"Int. J. Comput. Vis."},{"key":"623_CR8","doi-asserted-by":"publisher","unstructured":"F. Yu, W. Li, Q. Li, Y. Liu, X. Shi, J. Yan, POI: Multiple object tracking with high performance detection and appearance feature, in European Conference on Computer Vision (ECCV) Workshops, pp. 36\u201342 (2016). https:\/\/doi.org\/10.1007\/978-3-319-48881-3_3","DOI":"10.1007\/978-3-319-48881-3_3"},{"key":"623_CR9","doi-asserted-by":"publisher","unstructured":"H. Kieritz, W. H\u00fcbner, M. Arens, Joint detection and online multi-object tracking, in: Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 1540\u201315408 (2018). https:\/\/doi.org\/10.1109\/CVPRW.2018.00195","DOI":"10.1109\/CVPRW.2018.00195"},{"key":"623_CR10","doi-asserted-by":"publisher","DOI":"10.3390\/s18072004","author":"D Zhao","year":"2018","unstructured":"D. Zhao, H. Fu, L. Xiao, T. Wu, B. Dai, Multi-object tracking with correlation filter for autonomous vehicle. Sensors (2018). https:\/\/doi.org\/10.3390\/s18072004","journal-title":"Sensors"},{"key":"623_CR11","doi-asserted-by":"publisher","unstructured":"C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, A. Rabinovich, Going deeper with convolutions, in Computer Vision and Pattern Recognition (CVPR), pp. 1\u20139 (2015). https:\/\/doi.org\/10.1109\/CVPR.2015.7298594","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"623_CR12","doi-asserted-by":"publisher","unstructured":"L. Leal-Taix\u00e9, A. Milan, I. Reid, S. Roth, Motchallenge 2015: Towards a benchmark for multi-target tracking (2015). https:\/\/doi.org\/10.48550\/arXiv.1504.01942","DOI":"10.48550\/arXiv.1504.01942"},{"key":"623_CR13","doi-asserted-by":"publisher","unstructured":"A. Milan, L. Leal-Taixe, I. Reid, S. Roth, K. Schindler, MOT16: a benchmark for multi-object tracking. arXiv (2016). https:\/\/doi.org\/10.48550\/arXiv.1603.00831","DOI":"10.48550\/arXiv.1603.00831"},{"key":"623_CR14","doi-asserted-by":"publisher","unstructured":"S. Tang, M. Andriluka, B. Andres, B. Schiele, Multiple people tracking by lifted multicut and person re-identification. Conference on Computer Vision and Pattern Recognition (CVPR), 3701\u20133710 (2017). https:\/\/doi.org\/10.1109\/CVPR.2017.394","DOI":"10.1109\/CVPR.2017.394"},{"key":"623_CR15","doi-asserted-by":"publisher","unstructured":"L. Chen, H. Ai, C. Shang, Z. Zhuang, B. Bai, Online multi-object tracking with convolutional neural networks. International Conference on Image Processing (ICIP), 645\u2013649 (2017). https:\/\/doi.org\/10.1109\/ICIP.2017.8296360","DOI":"10.1109\/ICIP.2017.8296360"},{"key":"623_CR16","doi-asserted-by":"crossref","unstructured":"L. Ma, S. Tang, M.J. Black, L.V. Gool, Customized multi-person tracker, in Asian Conference on Computer Vision (ACCV) (2018)","DOI":"10.1007\/978-3-030-20890-5_39"},{"key":"623_CR17","doi-asserted-by":"publisher","unstructured":"C. Ma, C. Yang, F. Yang, Y. Zhuang, Z. Zhang, H. Jia, X. Xie, Trajectory factory: Tracklet cleaving and re-connection by deep siamese bi-gru for multiple object tracking. in International Conference on Multimedia and Expo (ICME), pp. 1\u20136 (2018). https:\/\/doi.org\/10.1109\/ICME.2018.8486454","DOI":"10.1109\/ICME.2018.8486454"},{"key":"623_CR18","doi-asserted-by":"publisher","unstructured":"L. Ren, J. Lu, Z. Wang, Q. Tian, J. Zhou, Collaborative deep reinforcement learning for multi-object tracking, in European Conference on Computer Vision (ECCV), pp. 605\u2013621 (2018). https:\/\/doi.org\/10.1007\/978-3-030-01219-9_36","DOI":"10.1007\/978-3-030-01219-9_36"},{"key":"623_CR19","doi-asserted-by":"publisher","unstructured":"A. Bewley, Z. Ge, L. Ott, F. Ramos, B. Upcroft, Simple online and realtime tracking, in International Conference on Image Processing (ICIP), pp. 3464\u20133468 (2016). https:\/\/doi.org\/10.1109\/ICIP.2016.7533003","DOI":"10.1109\/ICIP.2016.7533003"},{"key":"623_CR20","doi-asserted-by":"publisher","unstructured":"N. Aharon, R. Orfaig, B.-Z. Bobrovsky, BoT-SORT: robust associations multi-pedestrian tracking. arXiv (2022). https:\/\/doi.org\/10.48550\/ARXIV.2206.14651","DOI":"10.48550\/ARXIV.2206.14651"},{"key":"623_CR21","doi-asserted-by":"publisher","unstructured":"Y. Zhang, P. Sun, Y. Jiang, D. Yu, F. Weng, Z. Yuan, P. Luo, W. Liu, X. Wang, Bytetrack: multi-object tracking by associating every detection box (2022). https:\/\/doi.org\/10.1007\/978-3-031-20047-2_1","DOI":"10.1007\/978-3-031-20047-2_1"},{"key":"623_CR22","doi-asserted-by":"publisher","unstructured":"J.H. Yoon, M.-H. Yang, J. Lim, K.-j. Yoon, Bayesian multi-object tracking using motion context from multiple objects. Winter Conference on Applications of Computer Vision (WACV), 33\u201340 (2015). https:\/\/doi.org\/10.1109\/WACV.2015.12","DOI":"10.1109\/WACV.2015.12"},{"key":"623_CR23","first-page":"745","volume":"13","author":"M Tiwari","year":"2017","unstructured":"M. Tiwari, R. Singhai, A review of detection and tracking of object from image and video sequences. Int. J. Comput. Intell. Res. 13, 745\u2013765 (2017)","journal-title":"Int. J. Comput. Intell. Res."},{"key":"623_CR24","doi-asserted-by":"publisher","unstructured":"L. Kalake, W. Wan, L. Hou, Analysis based on recent deep learning approaches applied in real-time multi-object tracking: a review 9, 32650\u201332671 (2021). https:\/\/doi.org\/10.1109\/ACCESS.2021.3060821","DOI":"10.1109\/ACCESS.2021.3060821"},{"key":"623_CR25","unstructured":"M. Bredereck, X. Jiang, M. K\u00f6rner, J. Denzler, Data association for multi-object tracking-by-detection in multi-camera networks. International Conference on Distributed Smart Cameras (ICDSC), 1\u20136 (2012)"},{"key":"623_CR26","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1016\/j.patrec.2012.07.005","volume":"34","author":"X Wang","year":"2013","unstructured":"X. Wang, Intelligent multi-camera video surveillance: a review. Pattern Recogn. Lett. 34, 3\u201319 (2013). https:\/\/doi.org\/10.1016\/j.patrec.2012.07.005","journal-title":"Pattern Recogn. Lett."},{"key":"623_CR27","doi-asserted-by":"publisher","unstructured":"S. Zhang, E. Staudt, T. Faltemier, A.K. Roy-Chowdhury, A camera network tracking (CamNeT) dataset and performance baseline, in Winter Conference on Applications of Computer Vision, pp. 365\u2013372 (2015). https:\/\/doi.org\/10.1109\/WACV.2015.55","DOI":"10.1109\/WACV.2015.55"},{"key":"623_CR28","doi-asserted-by":"publisher","unstructured":"A. Specker, D. Stadler, L. Florin, J. Beyerer, An occlusion-aware multi-target multi-camera tracking system, in Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 4168\u20134177 (2021). https:\/\/doi.org\/10.1109\/CVPRW53098.2021.00471","DOI":"10.1109\/CVPRW53098.2021.00471"},{"key":"623_CR29","doi-asserted-by":"publisher","unstructured":"A. Specker, L. Florin, M. Cormier, J. Beyerer, Improving multi-target multi-camera tracking by track refinement and completion, in Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 3198\u20133208 (2022). https:\/\/doi.org\/10.1109\/CVPRW56347.2022.00361","DOI":"10.1109\/CVPRW56347.2022.00361"},{"key":"623_CR30","doi-asserted-by":"publisher","unstructured":"C. Liu, Y. Zhang, H. Luo, J. Tang, W. Chen, X. Xu, F. Wang, H. Li, Y.-D. Shen, City-Scale multi-camera vehicle tracking guided by crossroad zones, in Computer Vision and Pattern Recognition Workshops (CVPRW), vol. 3, pp. 4124\u20134132 (2021). https:\/\/doi.org\/10.1109\/CVPRW53098.2021.00466","DOI":"10.1109\/CVPRW53098.2021.00466"},{"key":"623_CR31","doi-asserted-by":"publisher","unstructured":"S. He, H. Luo, W. Chen, M. Zhang, Y. Zhang, F. Wang, H. Li, W. Jiang, Multi-domain learning and identity mining for vehicle re-identification, in Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 582\u2013583 (2020). https:\/\/doi.org\/10.1109\/CVPRW50498.2020.00299","DOI":"10.1109\/CVPRW50498.2020.00299"},{"key":"623_CR32","unstructured":"H.-M. Hsu, T.-W. Huang, G. Wang, J. Cai, Z. Lei, J.-N. Hwang, Multi-camera tracking of vehicles based on deep features Re-ID and trajectory-based camera link models, in Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 416\u2013424 (2019)"},{"key":"623_CR33","doi-asserted-by":"publisher","unstructured":"P. Kohl, A. Specker, A. Schumann, J. Beyerer, The MTA dataset for multi-target multi-camera pedestrian tracking by weighted distance aggregation, in Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 1042\u20131043 (2020). https:\/\/doi.org\/10.1109\/CVPRW50498.2020.00529","DOI":"10.1109\/CVPRW50498.2020.00529"},{"key":"623_CR34","doi-asserted-by":"publisher","unstructured":"C. Mayershofer, D.-M. Holm, B. Molter, J. Fottner, Loco: Logistics objects in context, in International Conference on Machine Learning and Applications (ICMLA), pp. 612\u2013617 (2020). https:\/\/doi.org\/10.1109\/ICMLA51294.2020.00102","DOI":"10.1109\/ICMLA51294.2020.00102"},{"key":"623_CR35","doi-asserted-by":"publisher","unstructured":"P. Dendorfer, H. Rezatofighi, A. Milan, J.Q. Shi, D. Cremers, I.D. Reid, S. Roth, K. Schindler, L. Leal-Taix\u2019e, Mot20: a benchmark for multi object tracking in crowded scenes (2020). https:\/\/doi.org\/10.48550\/ARXIV.2003.09003","DOI":"10.48550\/ARXIV.2003.09003"},{"key":"623_CR36","doi-asserted-by":"publisher","first-page":"1231","DOI":"10.1177\/0278364913491297","volume":"32","author":"A Geiger","year":"2013","unstructured":"A. Geiger, P. Lenz, C. Stiller, R. Urtasun, Vision meets robotics: the kitti dataset. Int. J. Robot. Res. 32, 1231\u20131237 (2013). https:\/\/doi.org\/10.1177\/0278364913491297","journal-title":"Int. J. Robot. Res."},{"key":"623_CR37","doi-asserted-by":"crossref","unstructured":"T.-Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Doll\u00e1r, C.L. Zitnick, Microsoft COCO: common objects in context, in European Conference on Computer Vision (ECCV), pp. 740\u2013755 (2014)","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"623_CR38","doi-asserted-by":"publisher","DOI":"10.1155\/2008\/246309","author":"K Bernardin","year":"2008","unstructured":"K. Bernardin, R. Stiefelhagen, Evaluating multiple object tracking performance: the CLEAR MOT metrics. EURASIP J. Image Video Process. (2008). https:\/\/doi.org\/10.1155\/2008\/246309","journal-title":"EURASIP J. Image Video Process."},{"key":"623_CR39","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-020-01375-2","author":"J Luiten","year":"2020","unstructured":"J. Luiten, A. Osep, P. Dendorfer, P.H.S. Torr, A. Geiger, L. Leal-Taix\u00e9, B. Leibe, HOTA: A higher order metric for evaluating multi-object tracking. Int. J. Comput. Vis. (2020). https:\/\/doi.org\/10.1007\/s11263-020-01375-2","journal-title":"Int. J. Comput. Vis."},{"key":"623_CR40","doi-asserted-by":"publisher","unstructured":"E. Ristani, F. Solera, R. Zou, R. Cucchiara, C. Tomasi, Performance measures and a data set for multi-target, multi-camera tracking, in European Conference on Computer Vision (ECCV) Workshops, pp. 17\u201335 (2016). https:\/\/doi.org\/10.1007\/978-3-319-48881-3_2","DOI":"10.1007\/978-3-319-48881-3_2"},{"key":"623_CR41","doi-asserted-by":"publisher","unstructured":"B. Drost, M. Ulrich, P. Bergmann, P. Hartinger, C. Steger, Introducing MVTec ITODD-a dataset for 3D object recognition in industry, in Iinternational Conference on Computer Vision Workshops (ICCVW), pp. 2200\u20132208 (2017). https:\/\/doi.org\/10.1109\/ICCVW.2017.257","DOI":"10.1109\/ICCVW.2017.257"},{"key":"623_CR42","doi-asserted-by":"publisher","first-page":"341","DOI":"10.1016\/j.patrec.2019.05.011","volume":"125","author":"C Luo","year":"2019","unstructured":"C. Luo, L. Yu, E. Yang, H. Zhou, P. Ren, A benchmark image dataset for industrial tools. Pattern Recogn. Lett. 125, 341\u2013348 (2019). https:\/\/doi.org\/10.1016\/j.patrec.2019.05.011","journal-title":"Pattern Recogn. Lett."},{"key":"623_CR43","doi-asserted-by":"publisher","unstructured":"P. De Roovere, S. Moonen, N. Michiels et al., Dataset of industrial metal objects (2022). https:\/\/doi.org\/10.48550\/ARXIV.2208.04052","DOI":"10.48550\/ARXIV.2208.04052"},{"key":"623_CR44","doi-asserted-by":"publisher","unstructured":"C. AbouAkar, J. Tekli, D. Jess, M. Khoury, M. Kamradt, M. Guthe, Synthetic object recognition dataset for industries, in SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI), vol. 1, pp. 150\u2013155 (2022). https:\/\/doi.org\/10.1109\/SIBGRAPI55357.2022.9991784","DOI":"10.1109\/SIBGRAPI55357.2022.9991784"},{"key":"623_CR45","doi-asserted-by":"publisher","unstructured":"F. Niemann, C. Reining, F. MoyaRueda, N.R. Nair, J.A. Steffens, G.A. Fink, M. ten Hompel, LARa: creating a dataset for human activity recognition in logistics using semantic attributes. Sensors 20(15) (2020). https:\/\/doi.org\/10.3390\/s20154083","DOI":"10.3390\/s20154083"},{"key":"623_CR46","doi-asserted-by":"publisher","unstructured":"J. Rutinowski, T. Chilla, C. Pionzewski, C. Reining, M. ten Hompel, Towards re-identification for warehousing entities\u2014a work-in-progress study, in Emerging Technologies in Factory Automation (ETFA), pp. 501\u2013504 (2021). https:\/\/doi.org\/10.1109\/ETFA45728.2021.9613250","DOI":"10.1109\/ETFA45728.2021.9613250"},{"key":"623_CR47","doi-asserted-by":"crossref","unstructured":"J. Rutinowski, C. Pionzewski, T. Chilla, C. Reining, M. ten Hompel, Deep Learning Based Re-identification of Wooden Euro-pallets, in International Conference on Machine Learning and Applications (ICMLA) (2022)","DOI":"10.1109\/ICMLA55696.2022.00023"},{"key":"623_CR48","unstructured":"DIN: DIN 55405:2014-12, Packaging\u2014Terminology\u2014Terms and definitions (2014)"},{"key":"623_CR49","unstructured":"DIN: DIN EN 13698-1:2004-01, Pallet production specification\u2014Part 1: construction specification for 800 mm $$\\times$$ 1200 mm flat wooden pallets (2004)"},{"key":"623_CR50","doi-asserted-by":"publisher","unstructured":"L. Campagnola, E. Larson, A. Klein, D. Hoese, Siddharth, C. Rossant, A. Griffiths, N.P. Rougier, L. van Dijk, K. M\u00fchlbauer, et al., vispy\/vispy: Version 0.9.5. Zenodo (2022). https:\/\/doi.org\/10.5281\/zenodo.5974509","DOI":"10.5281\/zenodo.5974509"},{"key":"623_CR51","doi-asserted-by":"publisher","unstructured":"B. Adhikari, J. Peltom\u00e4ki, J. Puura, H. Huttunen, Faster bounding box annotation for object detection in indoor scenes. European Workshop on Visual Information Processing (EUVIP) (2018). https:\/\/doi.org\/10.1109\/EUVIP.2018.8611732","DOI":"10.1109\/EUVIP.2018.8611732"},{"key":"623_CR52","doi-asserted-by":"publisher","unstructured":"B. Shuai, A.G. Berneshawi, D. Modolo, J. Tighe, Multi-object tracking with siamese Track-RCNN (2020). https:\/\/doi.org\/10.48550\/ARXIV.2004.07786","DOI":"10.48550\/ARXIV.2004.07786"},{"key":"623_CR53","doi-asserted-by":"publisher","unstructured":"S. Zhang, F. Wang, L. Songtao, G. Zheng, YOLOX: Exceeding yolo series in 2021 (2021). https:\/\/doi.org\/10.48550\/arXiv.2107.08430","DOI":"10.48550\/arXiv.2107.08430"}],"container-title":["EURASIP Journal on Image and Video Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13640-024-00623-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13640-024-00623-6\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13640-024-00623-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,11,14]],"date-time":"2024-11-14T17:21:49Z","timestamp":1731604909000},"score":1,"resource":{"primary":{"URL":"https:\/\/jivp-eurasipjournals.springeropen.com\/articles\/10.1186\/s13640-024-00623-6"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,3,22]]},"references-count":53,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2024,12]]}},"alternative-id":["623"],"URL":"https:\/\/doi.org\/10.1186\/s13640-024-00623-6","relation":{"has-preprint":[{"id-type":"doi","id":"10.21203\/rs.3.rs-2837685\/v1","asserted-by":"object"}]},"ISSN":["1687-5281"],"issn-type":[{"value":"1687-5281","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,3,22]]},"assertion":[{"value":"25 April 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"6 March 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"22 March 2024","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"All authors confirm that there are no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"8"}}