{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,27]],"date-time":"2026-03-27T09:12:05Z","timestamp":1774602725207,"version":"3.50.1"},"reference-count":99,"publisher":"Springer Science and Business Media LLC","issue":"3","license":[{"start":{"date-parts":[[2026,2,6]],"date-time":"2026-02-06T00:00:00Z","timestamp":1770336000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2026,2,6]],"date-time":"2026-02-06T00:00:00Z","timestamp":1770336000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"The New Zealand Endeavour Smart Ideas Grant","award":["PROP-88546-ENDSI-UOA"],"award-info":[{"award-number":["PROP-88546-ENDSI-UOA"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Int J Comput Vis"],"published-print":{"date-parts":[[2026,3]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>\n                    Recent advances in deep learning have greatly enhanced the accuracy and scalability of animal re-identification by automating the extraction of subtle distinguishing features from images and videos. This enables large-scale, non-invasive monitoring of animal populations. This article proposes a segmentation pipeline and a re-identification model to identify animals without ground-truth IDs. The segmentation pipeline isolates animals from the background using bounding boxes and leverages the DINOv2 and Segment Anything Model 2 (SAM2) foundation models. For re-identification, Recurrence over Video Frames (RoVF) is introduced, a novel approach that employs a recurrent component based on the Perceiver transformer atop a DINOv2 image model, iteratively refining embeddings from video frames. The proposed methods are evaluated on video datasets of meerkats and polar bears (PolarBearVidID). The proposed segmentation model achieved high accuracy (94.36% and 97.26%) and IoU (73.14% and 92.77%) for meerkats and polar bears, respectively. RoVF outperformed frame- and video-based re-identification baselines, achieving a top-1 accuracy of 46.5% and 55% on masked test sets for meerkats and polar bears, respectively, as well as higher top-3 accuracy. These results highlight the potential of the proposed approach to reduce annotation burdens in future individual-based ecological studies. The code is available at\n                    <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" xlink:href=\"https:\/\/github.com\/Strong-AI-Lab\/RoVF-Meerkat-Reidentification\" ext-link-type=\"uri\">https:\/\/github.com\/Strong-AI-Lab\/RoVF-Meerkat-Reidentification<\/jats:ext-link>\n                    .\n                  <\/jats:p>","DOI":"10.1007\/s11263-025-02709-8","type":"journal-article","created":{"date-parts":[[2026,2,6]],"date-time":"2026-02-06T04:10:28Z","timestamp":1770351028000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Recurrence over Video Frames (RoVF) for Animal Re-identification"],"prefix":"10.1007","volume":"134","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-1713-7743","authenticated-orcid":false,"given":"Mitchell","family":"Rogers","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8652-1285","authenticated-orcid":false,"given":"Kobe","family":"Knowles","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2457-934X","authenticated-orcid":false,"given":"Ga\u00ebl","family":"Gendron","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1727-939X","authenticated-orcid":false,"given":"Shahrokh","family":"Heidari","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8463-2459","authenticated-orcid":false,"given":"Isla","family":"Duporge","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2353-1018","authenticated-orcid":false,"given":"David Arturo Soriano","family":"Valdez","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3001-6340","authenticated-orcid":false,"given":"Mihailo","family":"Azhar","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Padriac","family":"O\u2019Leary","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Simon","family":"Eyre","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7554-0971","authenticated-orcid":false,"given":"Michael","family":"Witbrock","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0235-4596","authenticated-orcid":false,"given":"Patrice","family":"Delmas","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2026,2,6]]},"reference":[{"key":"2709_CR1","doi-asserted-by":"publisher","first-page":"7131","DOI":"10.1109\/WACV57701.2024.00699","volume-title":"SeaTurtleID2022: A long-span dataset for reliable sea turtle re-identification","author":"L Adam","year":"2024","unstructured":"Adam, L., Cermak, V., Papafitsoros, K., & Picek, L. (2024). SeaTurtleID2022: A long-span dataset for reliable sea turtle re-identification (pp. 7131\u20137141). Los Alamitos, CA, USA: IEEE Computer Society. https:\/\/doi.org\/10.1109\/WACV57701.2024.00699"},{"key":"2709_CR2","doi-asserted-by":"publisher","unstructured":"Arnab, A., Dehghani, M., Heigold, G., Sun, C., Lu\u010di\u0107, M., & Schmid, C. (2021). ViViT. A Video Vision Transformer., arXiv. https:\/\/doi.org\/10.48550\/arXiv.2103.15691.","DOI":"10.48550\/arXiv.2103.15691"},{"issue":"6","key":"2709_CR3","doi-asserted-by":"publisher","first-page":"1350","DOI":"10.1644\/09-MAMM-A-425.1","volume":"91","author":"CJR Anderson","year":"2010","unstructured":"Anderson, C. J. R., Da Vitoria Lobo, N., Roth, J. D., & Waterman, J. M. (2010). Computer-aided photo-identification system with an application to polar bears based on whisker spot patterns. Journal of Mammalogy,91(6), 1350\u20131359. https:\/\/doi.org\/10.1644\/09-MAMM-A-425.1","journal-title":"Journal of Mammalogy"},{"key":"2709_CR4","doi-asserted-by":"publisher","unstructured":"Andrew, W., Greatwood, C., & Burghardt, T. (2020). Fusing animal biometrics with autonomous robotics: Drone-based search and individual id of friesian cattle (extended abstract). (pp. 38\u201343). https:\/\/doi.org\/10.1109\/WACVW50321.2020.9096949","DOI":"10.1109\/WACVW50321.2020.9096949"},{"key":"2709_CR5","doi-asserted-by":"publisher","DOI":"10.1016\/j.ecoinf.2023.102388","volume":"79","author":"VM Ara\u00fajo","year":"2024","unstructured":"Ara\u00fajo, V. M., Gambs, S., Michaud, R., Lautraite, H., Schneider, L., & Chion, C. (2024). Membership inference attack for beluga whales discrimination. Ecological Informatics,79, Article 102388. https:\/\/doi.org\/10.1016\/j.ecoinf.2023.102388","journal-title":"Ecological Informatics"},{"issue":"4","key":"2709_CR6","doi-asserted-by":"publisher","first-page":"333","DOI":"10.1111\/j.1469-7998.2007.00340.x","volume":"273","author":"CJR Anderson","year":"2007","unstructured":"Anderson, C. J. R., Roth, J. D., & Waterman, J. M. (2007). Can whisker spot patterns be used to identify individual polar bears? Journal of Zoology,273(4), 333\u2013339. https:\/\/doi.org\/10.1111\/j.1469-7998.2007.00340.x","journal-title":"Journal of Zoology"},{"key":"2709_CR7","doi-asserted-by":"publisher","unstructured":"Bolme, D. S., Beveridge, J. R., Draper, B. A., & Lui, Y. M. (2010). Visual object tracking using adaptive correlation filters. In: 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, (pp. 2544\u20132550). https:\/\/doi.org\/10.1109\/CVPR.2010.5539960","DOI":"10.1109\/CVPR.2010.5539960"},{"key":"2709_CR8","doi-asserted-by":"publisher","unstructured":"Brookes, O., Gray, S., Bennett, P., Burgess, K. V., Clark, F. E., Roberts, E., & Burghardt, T. (2022). Evaluating cognitive enrichment for zoo-housed gorillas using facial recognition. Frontiers in Veterinary Science,9,. https:\/\/doi.org\/10.3389\/fvets.2022.886720","DOI":"10.3389\/fvets.2022.886720"},{"key":"2709_CR9","doi-asserted-by":"publisher","unstructured":"Ba, J. L., Kiros, J. R., & Hinton, G. E. (2016). Layer Normalization., https:\/\/doi.org\/10.48550\/arXiv.1607.06450.","DOI":"10.48550\/arXiv.1607.06450"},{"key":"2709_CR10","doi-asserted-by":"publisher","unstructured":"Bouma, S., Pawley, M. D. M., Hupman, K., & Gilman, A. (2018). Individual common dolphin identification via metric embedding learning. In: 2018 International Conference on Image and Vision Computing New Zealand (IVCNZ), (pp. 1\u20136). https:\/\/doi.org\/10.1109\/IVCNZ.2018.8634778","DOI":"10.1109\/IVCNZ.2018.8634778"},{"key":"2709_CR11","doi-asserted-by":"publisher","DOI":"10.1016\/j.atech.2022.100135","volume":"3","author":"P Borlinghaus","year":"2023","unstructured":"Borlinghaus, P., Tausch, F., & Rettenberger, L. (2023). A purely visual Re-ID approach for bumblebees (Bombus terrestris). Smart Agricultural Technology,3, Article 100135. https:\/\/doi.org\/10.1016\/j.atech.2022.100135","journal-title":"Smart Agricultural Technology"},{"key":"2709_CR12","unstructured":"Bertasius, G., Wang, H., Torresani, L. (2021) Is space-time attention all you need for video understanding? In: ICML, vol. 2, p. 4"},{"issue":"10","key":"2709_CR13","doi-asserted-by":"publisher","first-page":"562","DOI":"10.1016\/j.tree.2010.08.002","volume":"25","author":"T Clutton-Brock","year":"2010","unstructured":"Clutton-Brock, T., & Sheldon, B. C. (2010). Individuals and populations: the role of long-term, individual-based studies of animals in ecology and evolutionary biology. Trends in Ecology & Evolution,25(10), 562\u2013573. https:\/\/doi.org\/10.1016\/j.tree.2010.08.002","journal-title":"Trends in Ecology & Evolution"},{"issue":"6","key":"2709_CR14","doi-asserted-by":"publisher","first-page":"802","DOI":"10.1111\/2041-210X.13172","volume":"10","author":"P Chakravarty","year":"2019","unstructured":"Chakravarty, P., Cozzi, G., Ozgul, A., & Aminian, K. (2019). A novel biomechanical approach for animal behaviour recognition using accelerometers. Methods in Ecology and Evolution,10(6), 802\u2013814. https:\/\/doi.org\/10.1111\/2041-210X.13172","journal-title":"Methods in Ecology and Evolution"},{"key":"2709_CR15","first-page":"386","volume-title":"Wild Mammals in Captivity: Principles and Techniques","author":"C Crockett","year":"2010","unstructured":"Crockett, C., & Ha, R. (2010). Data collection in the zoo setting, emphasizing behavior. In D. G. Kleiman, K. V. Thompson, & C. K. Baer (Eds.), Wild Mammals in Captivity: Principles and Techniques (2nd ed., pp. 386\u2013406). Chicago: University of Chicago Press.","edition":"2"},{"issue":"10","key":"2709_CR16","doi-asserted-by":"publisher","first-page":"1632","DOI":"10.1111\/2041-210X.13256","volume":"10","author":"S Christin","year":"2019","unstructured":"Christin, S., Hervet, E., & Lecomte, N. (2019). Applications for deep learning in ecology. Methods in Ecology and Evolution,10(10), 1632\u20131644. https:\/\/doi.org\/10.1111\/2041-210X.13256","journal-title":"Methods in Ecology and Evolution"},{"key":"2709_CR17","doi-asserted-by":"publisher","unstructured":"Chen, T., Kornblith, S., Norouzi, M., Hinton, G. (2020) A simple framework for contrastive learning of visual representations. In: Proceedings of the 37th International Conference on Machine Learning. ICML\u201920. https:\/\/doi.org\/10.5555\/3524938.3525087","DOI":"10.5555\/3524938.3525087"},{"key":"2709_CR18","doi-asserted-by":"publisher","unstructured":"Crall, J. P., Stewart, C. V., Berger-Wolf, T. Y., Rubenstein, D. I., & Sundaresan, S. R. (2013). HotSpotter \u2013 patterned species instance recognition. In: 2013 IEEE Workshop on Applications of Computer Vision (WACV) ,(pp. 230\u2013237). https:\/\/doi.org\/10.1109\/WACV.2013.6475023","DOI":"10.1109\/WACV.2013.6475023"},{"key":"2709_CR19","doi-asserted-by":"publisher","unstructured":"Caron, M., Touvron, H., Misra, I., J\u00e9gou, H., Mairal, J., Bojanowski, P., & Joulin, A. (2021). Emerging Properties in Self-Supervised Vision Transformers., https:\/\/doi.org\/10.48550\/arXiv.2104.14294.","DOI":"10.48550\/arXiv.2104.14294"},{"key":"2709_CR20","doi-asserted-by":"publisher","unstructured":"Cho, K., Van Merri\u00ebnboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., Schwenk, H., & Bengio, Y. (2014). Learning phrase representations using RNN encoder-decoder for statistical machine translation., https:\/\/doi.org\/10.48550\/arXiv.1406.1078","DOI":"10.48550\/arXiv.1406.1078"},{"key":"2709_CR21","doi-asserted-by":"publisher","unstructured":"Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., Gelly, S., Uszkoreit, J., & Houlsby, N. (2021). An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale., arXiv. https:\/\/doi.org\/10.48550\/arXiv.2010.11929","DOI":"10.48550\/arXiv.2010.11929"},{"issue":"3","key":"2709_CR22","doi-asserted-by":"publisher","first-page":"369","DOI":"10.1002\/rse2.195","volume":"7","author":"I Duporge","year":"2021","unstructured":"Duporge, I., Isupova, O., Reece, S., Macdonald, D. W., & Wang, T. (2021). Using very-high-resolution satellite imagery and deep learning to detect and count african elephants in heterogeneous landscapes. Remote Sensing in Ecology and Conservation,7(3), 369\u2013381. https:\/\/doi.org\/10.1002\/rse2.195","journal-title":"Remote Sensing in Ecology and Conservation"},{"key":"2709_CR23","doi-asserted-by":"publisher","unstructured":"Duporge, I., Kholiavchenko, M., Harel, R., Wolf, S., Rubenstein, D. I., Crofoot, M. C., Berger-Wolf, T., Lee, S. J., Barreau, J., Kline, J., Ramirez, M., & Stewart, C. V. (2025). Baboonland dataset: Tracking primates in the wild and automating behaviour recognition from drone videos. International Journal of Computer Vision. https:\/\/doi.org\/10.1007\/s11263-025-02493-5","DOI":"10.1007\/s11263-025-02493-5"},{"key":"2709_CR24","doi-asserted-by":"publisher","unstructured":"Fu, S., Hamilton, M., Brandt, L., Feldman, A., Zhang, Z., & Freeman, W. T. (2024). FeatUp: A Model-Agnostic Framework for Features at Any Resolution.,arXiv. https:\/\/doi.org\/10.48550\/arXiv.2403.10516.","DOI":"10.48550\/arXiv.2403.10516"},{"key":"2709_CR25","doi-asserted-by":"publisher","first-page":"51","DOI":"10.1007\/978-3-319-45886-1_5","volume-title":"Pattern Recognition","author":"A Freytag","year":"2016","unstructured":"Freytag, A., Rodner, E., Simon, M., Loos, A., K\u00fchl, H. S., & Denzler, J. (2016). Chimpanzee faces in the wild: Log-Euclidean CNNs for predicting identities and attributes of primates. In B. Rosenhahn & B. Andres (Eds.), Pattern Recognition (pp. 51\u201363). Cham: Springer. https:\/\/doi.org\/10.1007\/978-3-319-45886-1_5"},{"key":"2709_CR26","doi-asserted-by":"publisher","unstructured":"Gendron, G., Chen, Y., Rogers, M., Liu, Y., Azhar, M., Heidari, S., Valdez, D.A.S., Knowles, K., O\u2019Leary, P., Eyre, S., Witbrock, M., Dobbie, G., Liu, J., Delmas, P. (2024) Behaviour modelling of social animals via causal structure discovery and graph neural networks. In: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems. AAMAS \u201924, pp. 2276\u20132278. International Foundation for Autonomous Agents and Multiagent Systems, Richland, SC.https:\/\/doi.org\/10.5555\/3635637.3663132","DOI":"10.5555\/3635637.3663132"},{"key":"2709_CR27","doi-asserted-by":"publisher","first-page":"580","DOI":"10.1109\/CVPR.2014.81","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"R Girshick","year":"2014","unstructured":"Girshick, R., Donahue, J., Darrell, T., & Malik, J. (2014). Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 580\u2013587). https:\/\/doi.org\/10.1109\/CVPR.2014.81"},{"key":"2709_CR28","doi-asserted-by":"publisher","unstructured":"Guo, S., Xu, P., Miao, Q., Shao, G., Chapman, C.A., Chen, X., He, G., Fang, D., Zhang, H., Sun, Y., Shi, Z., & Li, B. (2020) Automatic identification of individual primates with deep learning techniques. iScience 23(8), https:\/\/doi.org\/10.1016\/j.isci.2020.101412","DOI":"10.1016\/j.isci.2020.101412"},{"key":"2709_CR29","doi-asserted-by":"publisher","unstructured":"Hoffer, E., & Ailon, N. (2015). Deep metric learning using triplet network. Similarity-Based Pattern Recognition (pp. 84\u201392). Cham: Springer. https:\/\/doi.org\/10.1007\/978-3-319-24261-3_7","DOI":"10.1007\/978-3-319-24261-3_7"},{"key":"2709_CR30","doi-asserted-by":"publisher","unstructured":"Hermans, A., Beyer, L., & Leibe, B. (2017). In Defense of the Triplet Loss for Person Re-Identification., arXiv. https:\/\/doi.org\/10.48550\/arXiv.1703.07737","DOI":"10.48550\/arXiv.1703.07737"},{"key":"2709_CR31","doi-asserted-by":"publisher","unstructured":"Hoffmann, J., Borgeaud, S., Mensch, A., Buchatskaya, E., Cai, T., Rutherford, E., Las\u00a0Casas, D., Hendricks, L.A., Welbl, J., Clark, A., Hennigan, T., Noland, E., Millican, K., Driessche, G., Damoc, B., Guy, A., Osindero, S., Simonyan, K., Elsen, E., Vinyals, O., Rae, J.W., Sifre, L. (2024) Training compute-optimal large language models. In: Proceedings of the 36th International Conference on Neural Information Processing Systems. NIPS \u201922. Curran Associates Inc., Red Hook, NY, USA. https:\/\/doi.org\/10.5555\/3600270.3602446","DOI":"10.5555\/3600270.3602446"},{"key":"2709_CR32","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1109\/ICASSP49357.2023.10094783","volume-title":"ICASSP 2023\u20132023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","author":"Z He","year":"2023","unstructured":"He, Z., Qian, J., Yan, D., Wang, C., & Xin, Y. (2023). Animal re-identification algorithm for posture diversity. ICASSP 2023\u20132023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 1\u20135). https:\/\/doi.org\/10.1109\/ICASSP49357.2023.10094783"},{"issue":"8","key":"2709_CR33","doi-asserted-by":"publisher","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","volume":"9","author":"S Hochreiter","year":"1997","unstructured":"Hochreiter, S., & Schmidhuber, J. (1997). Long short-term memory. Neural computation,9(8), 1735\u20131780.","journal-title":"Neural computation"},{"issue":"1","key":"2709_CR34","doi-asserted-by":"publisher","first-page":"10","DOI":"10.1002\/zoo.21644","volume":"41","author":"FD Huels","year":"2022","unstructured":"Huels, F. D., & Stoeger, A. S. (2022). Sentinel behavior in captive meerkats (suricata suricatta). Zoo Biology,41(1), 10\u201319. https:\/\/doi.org\/10.1002\/zoo.21644","journal-title":"Zoo Biology"},{"key":"2709_CR35","doi-asserted-by":"publisher","first-page":"770","DOI":"10.1109\/CVPR.2016.90","volume-title":"2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)","author":"K He","year":"2016","unstructured":"He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 770\u2013778). https:\/\/doi.org\/10.1109\/CVPR.2016.90"},{"key":"2709_CR36","unstructured":"Jaegle, A., Gimeno, F., Brock, A., Vinyals, O., Zisserman, A., & Carreira, J. (2021). Perceiver: General perception with iterative attention, In: International Conference on Machine Learning (pp. 4651\u20134664). PMLR"},{"key":"2709_CR37","doi-asserted-by":"publisher","unstructured":"Jiao, B., Liu, L., Gao, L., Wu, R., Lin, G., Wang, P., Zhang, Y. (2024) Toward re-identifying any animal. In: Proceedings of the 37th International Conference on Neural Information Processing Systems. NIPS \u201923. Curran Associates Inc., Red Hook, NY, USA. https:\/\/doi.org\/10.5555\/3666122.3667864","DOI":"10.5555\/3666122.3667864"},{"key":"2709_CR38","doi-asserted-by":"publisher","unstructured":"Kay, W., Carreira, J., Simonyan, K., Zhang, B., Hillier, C., Vijayanarasimhan, S., Viola, F., Green, T., Back, T., Natsev, P., et al. (2017). The kinetics human action video dataset., arXiv. https:\/\/doi.org\/10.48550\/arXiv.1705.06950","DOI":"10.48550\/arXiv.1705.06950"},{"key":"2709_CR39","doi-asserted-by":"publisher","first-page":"263","DOI":"10.1109\/ICCVW.2019.00035","volume-title":"2019 IEEE\/CVF International Conference on Computer Vision Workshop (ICCVW)","author":"M K\u00f6rschens","year":"2019","unstructured":"K\u00f6rschens, M., & Denzler, J. (2019). ELPephants: A fine-grained dataset for elephant re-identification. 2019 IEEE\/CVF International Conference on Computer Vision Workshop (ICCVW) (pp. 263\u2013270). https:\/\/doi.org\/10.1109\/ICCVW.2019.00035"},{"key":"2709_CR40","doi-asserted-by":"publisher","unstructured":"Kingma, D. P. (2014). Adam: A method for stochastic optimization., https:\/\/doi.org\/10.48550\/arXiv.1412.6980","DOI":"10.48550\/arXiv.1412.6980"},{"key":"2709_CR41","unstructured":"Krizhevsky, A., Sutskever, I., Hinton, G.E. (2012) ImageNet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, vol. 25"},{"key":"2709_CR42","doi-asserted-by":"publisher","unstructured":"Kulits, P., Wall, J., Bedetti, A., Henley, M., & Beery, S. (2021). ElephantBook: A semi-automated human-in-the-loop system for elephant re-identification (Vol. \u201921, pp. 88\u201398). New York, NY, USA: Association for Computing Machinery. https:\/\/doi.org\/10.1145\/3460112.3471947","DOI":"10.1145\/3460112.3471947"},{"issue":"1","key":"2709_CR43","doi-asserted-by":"publisher","first-page":"24","DOI":"10.1186\/s40317-017-0139-6","volume":"5","author":"TK Lameris","year":"2017","unstructured":"Lameris, T. K., & Kleyheeg, E. (2017). Reduction in adverse effects of tracking devices on waterfowl requires better measuring and reporting. Animal Biotelemetry,5(1), 24. https:\/\/doi.org\/10.1186\/s40317-017-0139-6","journal-title":"Animal Biotelemetry"},{"key":"2709_CR44","doi-asserted-by":"publisher","unstructured":"Li, S., Li, J., Tang, H., Qian, R., & Lin, W. (2020). ATRW: A benchmark for Amur tiger re-identification in the wild (Vol. MM \u201920, pp. 2590\u20132598). New York, NY, USA: Association for Computing Machinery. https:\/\/doi.org\/10.1145\/3394171.3413569","DOI":"10.1145\/3394171.3413569"},{"key":"2709_CR45","unstructured":"Loshchilov, I.(2017) Decoupled weight decay regularization. arXiv. arXiv:1711.05101"},{"key":"2709_CR46","doi-asserted-by":"publisher","unstructured":"Long, J., Shelhamer, E., & Darrell, T. (2015). Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 3431\u20133440). https:\/\/doi.org\/10.1109\/CVPR.2015.7298965","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"2709_CR47","doi-asserted-by":"publisher","unstructured":"Lei, J., Wang, L., Shen, Y., Yu, D., Berg, T., & Bansal, M. (2020) MART: Memory-augmented recurrent transformer for coherent video paragraph captioning. In: Jurafsky, D., Chai, J., Schluter, N., Tetreault, J. (eds.) Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 2603\u20132614. Association for Computational Linguistics, Online. https:\/\/doi.org\/10.18653\/v1\/2020.acl-main.233","DOI":"10.18653\/v1\/2020.acl-main.233"},{"issue":"1","key":"2709_CR48","doi-asserted-by":"publisher","first-page":"81","DOI":"10.1007\/s00265-009-0820-8","volume":"64","author":"JR Madden","year":"2009","unstructured":"Madden, J. R., Drewe, J. A., Pearce, G. P., & Clutton-Brock, T. H. (2009). The social network structure of a wild meerkat population: 2. intragroup interactions. Behavioral Ecology and Sociobiology,64(1), 81\u201395. https:\/\/doi.org\/10.1007\/s00265-009-0820-8","journal-title":"Behavioral Ecology and Sociobiology"},{"issue":"5","key":"2709_CR49","doi-asserted-by":"publisher","first-page":"863","DOI":"10.1111\/2041-210X.13577","volume":"12","author":"V Miele","year":"2021","unstructured":"Miele, V., Dussert, G., Spataro, B., Chamaill\u00e9-Jammes, S., Allain\u00e9, D., & Bonenfant, C. (2021). Revisiting animal photo-identification using deep metric learning and network analysis. Methods in Ecology and Evolution,12(5), 863\u2013873. https:\/\/doi.org\/10.1111\/2041-210X.13577","journal-title":"Methods in Ecology and Evolution"},{"key":"2709_CR50","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1109\/IVCNZ64857.2024.10794473","volume-title":"2024 39th International Conference on Image and Vision Computing New Zealand (IVCNZ)","author":"P Maddigan","year":"2024","unstructured":"Maddigan, P., Ehrhardt, O., Lensen, A., & Shaw, R. C. (2024). Re-identification of individual k\u0101k\u0101: An explainable dino-based model. 2024 39th International Conference on Image and Vision Computing New Zealand (IVCNZ) (pp. 1\u20136). https:\/\/doi.org\/10.1109\/IVCNZ64857.2024.10794473"},{"key":"2709_CR51","doi-asserted-by":"publisher","unstructured":"Minasandra, P., Grout, E.M., Brock, K., Crofoot, M.C., Demartsev, V., Gersick, A.S., Hirsch, B.T., Holekamp, K.E., Johnson-Ulrich, L., Nayak, A., Ortega, J., Roch, M.A., Strauss, E.D., & Strandburg-Peshkin, A. (2024) Behavioral sequences across multiple animal species in the wild share common structural features. bioRxiv https:\/\/doi.org\/10.1101\/2024.01.20.576411","DOI":"10.1101\/2024.01.20.576411"},{"key":"2709_CR52","doi-asserted-by":"publisher","unstructured":"Meinhardt, T., Kirillov, A., Leal-Taixe, L., & Feichtenhofer, C. (2022). TrackFormer: Multi-object tracking with transformers. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (pp. 8844\u20138854). https:\/\/doi.org\/10.1109\/CVPR52688.2022.00864","DOI":"10.1109\/CVPR52688.2022.00864"},{"key":"2709_CR53","doi-asserted-by":"publisher","unstructured":"Moskvyak, O., Maire, F., Dayoub, F., Armstrong, A. O., & Baktashmotlagh, M. (2021). Robust re-identification of manta rays from natural markings by learning pose invariant embeddings. In: 2021 Digital Image Computing: Techniques and Applications (DICTA) (pp. 1\u20138). https:\/\/doi.org\/10.1109\/DICTA52665.2021.9647359","DOI":"10.1109\/DICTA52665.2021.9647359"},{"key":"2709_CR54","doi-asserted-by":"publisher","unstructured":"Moskvyak, O., Maire, F., Dayoub, F., & Baktashmotlagh, M. (2020). Learning landmark guided embeddings for animal re-identification. 2020 IEEE Winter Applications of Computer Vision Workshops (WACVW) (pp. 12\u201319). https:\/\/doi.org\/10.1109\/WACVW50321.2020.9096932","DOI":"10.1109\/WACVW50321.2020.9096932"},{"key":"2709_CR55","doi-asserted-by":"publisher","unstructured":"Nepovinnykh, E., Eerola, T., Biard, V., Mutka, P., Niemi, M., Kunnasranta, M., & K\u00e4lvi\u00e4inen, H. (2022). SealID: Saimaa ringed seal re-identification dataset. Sensors,22(19). https:\/\/doi.org\/10.3390\/s22197602","DOI":"10.3390\/s22197602"},{"key":"2709_CR56","doi-asserted-by":"publisher","first-page":"25","DOI":"10.1109\/WACVW50321.2020.9096935","volume-title":"2020 IEEE Winter Applications of Computer Vision Workshops (WACVW)","author":"E Nepovinnykh","year":"2020","unstructured":"Nepovinnykh, E., Eerola, T., & K\u00e4lvi\u00e4inen, H. (2020). Siamese network based pelage pattern matching for ringed seal re-identification. 2020 IEEE Winter Applications of Computer Vision Workshops (WACVW) (pp. 25\u201334). https:\/\/doi.org\/10.1109\/WACVW50321.2020.9096935"},{"key":"2709_CR57","doi-asserted-by":"crossref","unstructured":"Nepovinnykh, E., Vilkman, A., Eerola, T., & K\u00e4lvi\u00e4inen, H. (2023). Re-identification of Saimaa ringed seals from image sequences. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)","DOI":"10.1007\/978-3-031-31435-3_8"},{"key":"2709_CR58","doi-asserted-by":"publisher","unstructured":"Oquab, M., Darcet, T., Moutakanni, T., Vo, H., Szafraniec, M., Khalidov, V., Fernandez, P., Haziza, D., Massa, F., El-Nouby, A., Assran, M., Ballas, N., Galuba, W., Howes, R., Huang, P.-Y., Li, S.-W., Misra, I., Rabbat, M., Sharma, V., \u2026 & Bojanowski, P. (2024). DINOv2: Learning Robust Visual Features without. Supervision., https:\/\/doi.org\/10.48550\/arXiv.2304.07193.","DOI":"10.48550\/arXiv.2304.07193"},{"issue":"5","key":"2709_CR59","doi-asserted-by":"publisher","first-page":"663","DOI":"10.1139\/z2012-033","volume":"90","author":"AM Pagano","year":"2012","unstructured":"Pagano, A. M., Durner, G. M., Amstrup, S. C., Simac, K. S., & York, G. S. (2012). Long-distance swimming by polar bears (Ursus maritimus) of the southern Beaufort Sea during years of extensive open water. Canadian Journal of Zoology,90(5), 663\u2013676. https:\/\/doi.org\/10.1139\/z2012-033","journal-title":"Canadian Journal of Zoology"},{"key":"2709_CR60","unstructured":"Paszke, A., Gross, S., Chintala, S., Chanan, G., Yang, E., DeVito, Z., Lin, Z., Desmaison, A., Antiga, L., Lerer, A. (2017) Automatic differentiation in PyTorch. In: NIPS-W"},{"key":"2709_CR61","doi-asserted-by":"publisher","DOI":"10.1016\/j.beproc.2021.104446","volume":"190","author":"XP Pacheco","year":"2021","unstructured":"Pacheco, X. P., & Madden, J. R. (2021). Does the social network structure of wild animal populations differ from that of animals in captivity? Behavioural Processes,190, Article 104446. https:\/\/doi.org\/10.1016\/j.beproc.2021.104446","journal-title":"Behavioural Processes"},{"key":"2709_CR62","doi-asserted-by":"publisher","first-page":"121434","DOI":"10.1109\/ACCESS.2023.3328341","volume":"11","author":"S-D Roh","year":"2023","unstructured":"Roh, S.-D., & Chung, K.-S. (2023). DiffusionVID: Denoising object boxes with spatio-temporal conditioning for video object detection. IEEE Access,11, 121434\u2013121444. https:\/\/doi.org\/10.1109\/ACCESS.2023.3328341","journal-title":"IEEE Access"},{"issue":"2","key":"2709_CR63","doi-asserted-by":"publisher","first-page":"179","DOI":"10.1038\/s41592-018-0295-5","volume":"16","author":"F Romero-Ferrero","year":"2019","unstructured":"Romero-Ferrero, F., Bergomi, M. G., Hinz, R. C., Heras, F. J., & Polavieja, G. G. (2019). idtracker.ai: tracking all individuals in small or large collectives of unmarked animals. Nature methods,16(2), 179\u2013182. https:\/\/doi.org\/10.1038\/s41592-018-0295-5","journal-title":"Nature methods"},{"key":"2709_CR64","doi-asserted-by":"publisher","unstructured":"Ravi, N., Gabeur, V., Hu, Y.-T., Hu, R., Ryali, C., Ma, T., Khedr, H., R\u00e4dle, R., Rolland, C., Gustafson, L., Mintun, E., Pan, J., Alwala, K. V., Carion, N., Wu, C.-Y., Girshick, R., Doll\u00e1r, P., & Feichtenhofer, C. (2024). SAM 2: Segment Anything in Images and Videos., https:\/\/doi.org\/10.48550\/arXiv.2408.00714","DOI":"10.48550\/arXiv.2408.00714"},{"key":"2709_CR65","doi-asserted-by":"publisher","unstructured":"Rogers, M., Gendron, G., Valdez, D. A. S., Azhar, M., Chen, Y., Heidari, S., Perelini, C., O\u2019Leary, P., Knowles, K., Tait, I., Eyre, S., Witbrock, M., & Delmas, P. (2023). Meerkat Behaviour Recognition Dataset., https:\/\/doi.org\/10.48550\/arXiv.2306.11326","DOI":"10.48550\/arXiv.2306.11326"},{"issue":"3","key":"2709_CR66","doi-asserted-by":"publisher","first-page":"233","DOI":"10.1207\/s15327604jaws0903_5","volume":"9","author":"MJ Renner","year":"2006","unstructured":"Renner, M. J., & Kelly, A. L. (2006). Behavioral decisions for managing social distance and aggression in captive polar bears (Ursus maritimus). Journal of Applied Animal Welfare Science,9(3), 233\u2013239. https:\/\/doi.org\/10.1207\/s15327604jaws0903_5","journal-title":"Journal of Applied Animal Welfare Science"},{"key":"2709_CR67","unstructured":"Radford, A., Kim, J. W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., Sastry, G., Askell, A., Mishkin, P., Clark, J., et al. (2021). Learning transferable visual models from natural language supervision, In: International Conference on Machine Learning (pp. 8748\u20138763). PMLR."},{"issue":"3","key":"2709_CR68","doi-asserted-by":"publisher","first-page":"421","DOI":"10.3390\/jzbg2030031","volume":"2","author":"PE Rose","year":"2021","unstructured":"Rose, P. E., & Riley, L. M. (2021). Conducting behavioural research in the zoo: A guide to ten important methods, concepts and theories. Journal of Zoological and Botanical Gardens,2(3), 421\u2013444. https:\/\/doi.org\/10.3390\/jzbg2030031","journal-title":"Journal of Zoological and Botanical Gardens"},{"key":"2709_CR69","doi-asserted-by":"publisher","unstructured":"Ravoor, P. C., & T.S.B., S. (2020). Deep learning methods for multi-species animal re-identification and tracking \u2013 a survey. Computer Science Review,38, Article 100289. https:\/\/doi.org\/10.1016\/j.cosrev.2020.100289","DOI":"10.1016\/j.cosrev.2020.100289"},{"key":"2709_CR70","unstructured":"Scott, K. (2014) Behaviour and endocrinology of meerkats in zoos. PhD thesis, University of Exeter. http:\/\/hdl.handle.net\/10871\/16393"},{"issue":"11","key":"2709_CR71","doi-asserted-by":"publisher","first-page":"12922","DOI":"10.1109\/TPAMI.2023.3243465","volume":"45","author":"J Selva","year":"2023","unstructured":"Selva, J., Johansen, A. S., Escalera, S., Nasrollahi, K., Moeslund, T. B., & Clap\u00e9s, A. (2023). Video transformers: A survey. IEEE Transactions on Pattern Analysis and Machine Intelligence,45(11), 12922\u201312943. https:\/\/doi.org\/10.1109\/TPAMI.2023.3243465","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"2709_CR72","doi-asserted-by":"publisher","unstructured":"Schroff, F., Kalenichenko, D., & Philbin, J. (2015). FaceNet: A unified embedding for face recognition and clustering. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). https:\/\/doi.org\/10.1109\/CVPR.2015.7298682","DOI":"10.1109\/CVPR.2015.7298682"},{"key":"2709_CR73","doi-asserted-by":"publisher","unstructured":"Srivastava, S., & Sharma, G. (2024). OmniVec: Learning robust representations with cross modal sharing. In: Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision. (pp. 1236\u20131248). https:\/\/doi.org\/10.1109\/WACV57701.2024.00127","DOI":"10.1109\/WACV57701.2024.00127"},{"key":"2709_CR74","doi-asserted-by":"publisher","unstructured":"Schneider, S., Taylor, G. W., & Kremer, S. C. (2020). Similarity learning networks for animal individual re-identification - beyond the capabilities of a human observer. In: 2020 IEEE Winter Applications of Computer Vision Workshops (WACVW). (pp. 44\u201352). https:\/\/doi.org\/10.1109\/WACVW50321.2020.9096925","DOI":"10.1109\/WACVW50321.2020.9096925"},{"issue":"4","key":"2709_CR75","doi-asserted-by":"publisher","first-page":"461","DOI":"10.1111\/2041-210X.13133","volume":"10","author":"S Schneider","year":"2019","unstructured":"Schneider, S., Taylor, G. W., Linquist, S., & Kremer, S. C. (2019). Past, present and future approaches using computer vision for animal re-identification from camera trap data. Methods in Ecology and Evolution,10(4), 461\u2013470. https:\/\/doi.org\/10.1111\/2041-210X.13133","journal-title":"Methods in Ecology and Evolution"},{"key":"2709_CR76","doi-asserted-by":"publisher","unstructured":"Stevens, S., Wu, J., Thompson, M. J., Campolongo, E. G., Song, C. H., Carlyn, D. E., Dong, L., Dahdul, W. M., Stewart, C., Berger-Wolf, T., et al. (2024). BioCLIP: A vision foundation model for the tree of life. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, (pp. 19412\u201319424). https:\/\/doi.org\/10.1109\/CVPR52733.2024.01836","DOI":"10.1109\/CVPR52733.2024.01836"},{"key":"2709_CR77","doi-asserted-by":"publisher","unstructured":"Simonyan, K., Zisserman, A. (2014) Two-stream convolutional networks for action recognition in videos. In: Proceedings of the 27th International Conference on Neural Information Processing Systems - Volume 1. NIPS\u201914, vol. 27, pp. 568\u2013576. https:\/\/doi.org\/10.5555\/2968826.2968890","DOI":"10.5555\/2968826.2968890"},{"key":"2709_CR78","doi-asserted-by":"publisher","unstructured":"Simonyan, K., & Zisserman, A. (2015). Very Deep Convolutional Networks for Large-Scale Image Recognition., https:\/\/doi.org\/10.48550\/arXiv.1409.1556.","DOI":"10.48550\/arXiv.1409.1556."},{"issue":"1","key":"2709_CR79","doi-asserted-by":"publisher","first-page":"792","DOI":"10.1038\/s41467-022-27980-y","volume":"13","author":"D Tuia","year":"2022","unstructured":"Tuia, D., Kellenberger, B., Beery, S., Costelloe, B. R., Zuffi, S., Risse, B., Mathis, A., Mathis, M. W., Langevelde, F., Burghardt, T., Kays, R., Klinck, H., Wikelski, M., Couzin, I. D., Horn, G., Crofoot, M. C., Stewart, C. V., & Berger-Wolf, T. (2022). Perspectives in machine learning for wildlife conservation. Nature Communications,13(1), 792. https:\/\/doi.org\/10.1038\/s41467-022-27980-y","journal-title":"Nature Communications"},{"key":"2709_CR80","doi-asserted-by":"publisher","unstructured":"Touvron, H., Martin, L., Stone, K., Albert, P., Almahairi, A., Babaei, Y., Bashlykov, N., Batra, S., Bhargava, P., Bhosale, S., et al. (2023). Llama 2: Open foundation and fine-tuned chat models., https:\/\/doi.org\/10.48550\/arXiv.2307.09288","DOI":"10.48550\/arXiv.2307.09288"},{"issue":"6","key":"2709_CR81","doi-asserted-by":"publisher","first-page":"1459","DOI":"10.1016\/j.anbehav.2012.03.018","volume":"83","author":"A Thornton","year":"2012","unstructured":"Thornton, A., & Samson, J. (2012). Innovative problem solving in wild meerkats. Animal Behaviour,83(6), 1459\u20131468. https:\/\/doi.org\/10.1016\/j.anbehav.2012.03.018","journal-title":"Animal Behaviour"},{"key":"2709_CR82","unstructured":"Tian, Y., Ye, Q., Doermann, D. (2025) YOLOv12: Attention-Centric Real-Time Object Detectors. https:\/\/arxiv.org\/abs\/2502.12524"},{"issue":"6","key":"2709_CR83","doi-asserted-by":"publisher","first-page":"0252456","DOI":"10.1371\/journal.pone.0252456","volume":"16","author":"W van de Bunte","year":"2021","unstructured":"van de Bunte, W., Koene, P., Nijkamp, H., & Sterck, E. H. M. (2021). Potential effects of GPS collars on the behaviour of two red pandas (Ailurus fulgens) in Rotterdam Zoo. PLOS ONE,16(6), 0252456. https:\/\/doi.org\/10.1371\/journal.pone.0252456","journal-title":"PLOS ONE"},{"key":"2709_CR84","doi-asserted-by":"publisher","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, L.u., Polosukhin, I. (2017) Attention is all you need. In: Guyon, I., Luxburg, U.V., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., Garnett, R. (eds.) Advances in Neural Information Processing Systems, vol. 30, pp. 6000\u20136010. Curran Associates, Inc., Red Hook, NY, USA. https:\/\/doi.org\/10.5555\/3295222.3295349","DOI":"10.5555\/3295222.3295349"},{"issue":"3","key":"2709_CR85","doi-asserted-by":"publisher","first-page":"900","DOI":"10.1093\/icb\/icab107","volume":"61","author":"M Vidal","year":"2021","unstructured":"Vidal, M., Wolf, N., Rosenberg, B., Harris, B. P., & Mathis, A. (2021). Perspectives on Individual Animal Identification from Biology and Computer Vision. Integrative and Comparative Biology,61(3), 900\u2013916. https:\/\/doi.org\/10.1093\/icb\/icab107","journal-title":"Integrative and Comparative Biology"},{"key":"2709_CR86","doi-asserted-by":"publisher","first-page":"5740","DOI":"10.1609\/aaai.v38i6.28386","volume":"38","author":"Y-H Wang","year":"2024","unstructured":"Wang, Y.-H., Hsieh, J.-W., Chen, P.-Y., Chang, M.-C., So, H.-H., & Li, X. (2024). SMILEtrack: Similarity learning for occlusion-aware multiple object tracking. Proceedings of the AAAI Conference on Artificial Intelligence,38, 5740\u20135748. https:\/\/doi.org\/10.1609\/aaai.v38i6.28386","journal-title":"Proceedings of the AAAI Conference on Artificial Intelligence"},{"key":"2709_CR87","doi-asserted-by":"publisher","first-page":"2847","DOI":"10.1609\/aaai.v37i3.25386","volume":"37","author":"W Wu","year":"2023","unstructured":"Wu, W., Sun, Z., & Ouyang, W. (2023). Revisiting classifier: Transferring vision-language models for video recognition. Proceedings of the AAAI Conference on Artificial Intelligence,37, 2847\u20132855. https:\/\/doi.org\/10.1609\/aaai.v37i3.25386","journal-title":"Proceedings of the AAAI Conference on Artificial Intelligence"},{"issue":"2","key":"2709_CR88","doi-asserted-by":"publisher","first-page":"373","DOI":"10.1111\/2041-210X.14278","volume":"15","author":"O Wahltinez","year":"2024","unstructured":"Wahltinez, O., & Wahltinez, S. J. (2024). An open-source general purpose machine learning framework for individual animal re-identification using few-shot learning. Methods in Ecology and Evolution,15(2), 373\u2013387. https:\/\/doi.org\/10.1111\/2041-210X.14278","journal-title":"Methods in Ecology and Evolution"},{"key":"2709_CR89","doi-asserted-by":"publisher","unstructured":"Wang, P., Wang, S., Lin, J., Bai, S., Zhou, X., Zhou, J., Wang, X., & Zhou, C. (2023). ONE-PEACE: Exploring one general representation model toward unlimited modalities., https:\/\/doi.org\/10.48550\/arXiv.2305.11172.","DOI":"10.48550\/arXiv.2305.11172"},{"issue":"1","key":"2709_CR90","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41467-023-38901-y","volume":"14","author":"Z Wu","year":"2023","unstructured":"Wu, Z., Zhang, C., Gu, X., Duporge, I., Hughey, L. F., Stabach, J. A., Skidmore, A. K., Hopcraft, J. G. C., Lee, S. J., Atkinson, P. M., et al. (2023). Deep learning enables satellite-based monitoring of large populations of terrestrial mammals across heterogeneous landscapes. Nature Communications,14(1), 1\u201313. https:\/\/doi.org\/10.1038\/s41467-023-38901-y","journal-title":"Nature Communications"},{"key":"2709_CR91","doi-asserted-by":"publisher","unstructured":"Xuan, H., Stylianou, A., Liu, X., Pless, R. (2020) Hard negative examples are hard, but useful. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) Computer Vision \u2013 ECCV 2020, pp. 126\u2013142. Springer, Cham. https:\/\/doi.org\/10.1007\/978-3-030-58568-6_8","DOI":"10.1007\/978-3-030-58568-6_8"},{"key":"2709_CR92","unstructured":"Xiong, R., Yang, Y., He, D., Zheng, K., Zheng, S., Xing, C., Zhang, H., Lan, Y., Wang, L., & Liu, T. (2020). On layer normalization in the transformer architecture, In: International Conference on Machine Learning (pp. 10524\u201310533). PMLR"},{"key":"2709_CR93","doi-asserted-by":"publisher","unstructured":"Yang, J., Dong, X., Liu, L., Zhang, C., Shen, J., & Yu, D. (2022). Recurring the transformer for video action recognition. In: 2022 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 14043\u201314053). https:\/\/doi.org\/10.1109\/CVPR52688.2022.01367","DOI":"10.1109\/CVPR52688.2022.01367"},{"key":"2709_CR94","doi-asserted-by":"publisher","unstructured":"Zheng, S., Chen, S., & Jin, Q. (2022). VRDFormer: End-to-end video visual relation detection with transformers. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (pp. 18836\u201318846). https:\/\/doi.org\/10.1109\/CVPR52688.2022.01827","DOI":"10.1109\/CVPR52688.2022.01827"},{"key":"2709_CR95","doi-asserted-by":"publisher","unstructured":"Zuerl, M., Dirauf, R., Koeferl, F., Steinlein, N., Sueskind, J., Zanca, D., Brehm, I., & Fersen, L.v., Eskofier, B. (2023). Polarbearvidid: A video-based re-identification benchmark dataset for polar bears. Animals,13(5). https:\/\/doi.org\/10.3390\/ani13050801","DOI":"10.3390\/ani13050801"},{"key":"2709_CR96","doi-asserted-by":"publisher","unstructured":"Zong, Z., Song, G., & Liu, Y. (2023). DETRs with collaborative hybrid assignments training. In: Proceedings of the IEEE\/CVF International Conference on Computer Vision (pp. 6748\u20136758). https:\/\/doi.org\/10.1109\/ICCV51070.2023.00621","DOI":"10.1109\/ICCV51070.2023.00621"},{"key":"2709_CR97","doi-asserted-by":"publisher","unstructured":"Zhu, X., Xiong, Y., Dai, J., Yuan, L., & Wei, Y. (2017). Deep feature flow for video recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 2349\u20132358). https:\/\/doi.org\/10.1109\/CVPR.2017.441","DOI":"10.1109\/CVPR.2017.441"},{"key":"2709_CR98","doi-asserted-by":"publisher","unstructured":"Zheng, Z., Zhao, Y., Li, A., & Yu, Q. (2022). Wild terrestrial animal re-identification based on an improved locally aware transformer with a cross-attention mechanism. Animals,12(24). https:\/\/doi.org\/10.3390\/ani12243503","DOI":"10.3390\/ani12243503"},{"key":"2709_CR99","doi-asserted-by":"publisher","unstructured":"\u010cerm\u00e1k, V., Picek, L., Adam, L., & Papafitsoros, K. (2023). WildlifeDatasets: An open-source toolkit for animal re-identification., https:\/\/doi.org\/10.48550\/arXiv.2311.09118.","DOI":"10.48550\/arXiv.2311.09118"}],"container-title":["International Journal of Computer Vision"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11263-025-02709-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11263-025-02709-8","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11263-025-02709-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,3,27]],"date-time":"2026-03-27T08:36:43Z","timestamp":1774600603000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11263-025-02709-8"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,2,6]]},"references-count":99,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2026,3]]}},"alternative-id":["2709"],"URL":"https:\/\/doi.org\/10.1007\/s11263-025-02709-8","relation":{},"ISSN":["0920-5691","1573-1405"],"issn-type":[{"value":"0920-5691","type":"print"},{"value":"1573-1405","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,2,6]]},"assertion":[{"value":"21 September 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"1 September 2025","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"6 February 2026","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing Interests"}},{"value":"Ethical approval for the meerkats study was waived due to the camera-based and non-invasive approach to data collection and re-identification. Meerkats were not manipulated, collared, or marked for the purpose of re-identification. Data acquisition was exclusively carried out at Wellington Zoo during standard opening times, without any changes to the management procedures of the animals.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethical Approval"}}],"article-number":"106"}}