{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,6]],"date-time":"2026-05-06T15:31:25Z","timestamp":1778081485559,"version":"3.51.4"},"reference-count":63,"publisher":"Springer Science and Business Media LLC","issue":"8","license":[{"start":{"date-parts":[[2024,3,4]],"date-time":"2024-03-04T00:00:00Z","timestamp":1709510400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,3,4]],"date-time":"2024-03-04T00:00:00Z","timestamp":1709510400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100000266","name":"Engineering and Physical Sciences Research Council","doi-asserted-by":"publisher","award":["EP\/S022937\/1"],"award-info":[{"award-number":["EP\/S022937\/1"]}],"id":[{"id":"10.13039\/501100000266","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Int J Comput Vis"],"published-print":{"date-parts":[[2024,8]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>We present the PanAf20K dataset, the largest and most diverse open-access annotated video dataset of great apes in their natural environment. It comprises more than 7 million frames across <jats:inline-formula><jats:alternatives><jats:tex-math>$$\\sim $$<\/jats:tex-math><mml:math xmlns:mml=\"http:\/\/www.w3.org\/1998\/Math\/MathML\">\n                  <mml:mo>\u223c<\/mml:mo>\n                <\/mml:math><\/jats:alternatives><\/jats:inline-formula>\u00a020,000 camera trap videos of chimpanzees and gorillas collected at 18 field sites in tropical Africa as part of the Pan African Programme: The Cultured Chimpanzee. The footage is accompanied by a rich set of annotations and benchmarks making it suitable for training and testing a variety of challenging and ecologically important computer vision tasks including ape detection and behaviour recognition. Furthering AI analysis of camera trap information is critical given the International Union for Conservation of Nature now lists all species in the great ape family as either Endangered or Critically Endangered. We hope the dataset can form a solid basis for engagement of the AI community to improve performance, efficiency, and result interpretation in order to support assessments of great ape presence, abundance, distribution, and behaviour and thereby aid conservation efforts. The dataset and code are available from the project website: <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"https:\/\/obrookes.github.io\/panaf.github.io\/\">PanAf20K<\/jats:ext-link><\/jats:p>","DOI":"10.1007\/s11263-024-02003-z","type":"journal-article","created":{"date-parts":[[2024,3,4]],"date-time":"2024-03-04T12:01:57Z","timestamp":1709553717000},"page":"3086-3102","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":23,"title":["PanAf20K: A Large Video Dataset for Wild Ape Detection and Behaviour Recognition"],"prefix":"10.1007","volume":"132","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6865-1844","authenticated-orcid":false,"given":"Otto","family":"Brookes","sequence":"first","affiliation":[]},{"given":"Majid","family":"Mirmehdi","sequence":"additional","affiliation":[]},{"given":"Colleen","family":"Stephens","sequence":"additional","affiliation":[]},{"given":"Samuel","family":"Angedakin","sequence":"additional","affiliation":[]},{"given":"Katherine","family":"Corogenes","sequence":"additional","affiliation":[]},{"given":"Dervla","family":"Dowd","sequence":"additional","affiliation":[]},{"given":"Paula","family":"Dieguez","sequence":"additional","affiliation":[]},{"given":"Thurston C.","family":"Hicks","sequence":"additional","affiliation":[]},{"given":"Sorrel","family":"Jones","sequence":"additional","affiliation":[]},{"given":"Kevin","family":"Lee","sequence":"additional","affiliation":[]},{"given":"Vera","family":"Leinert","sequence":"additional","affiliation":[]},{"given":"Juan","family":"Lapuente","sequence":"additional","affiliation":[]},{"given":"Maureen S.","family":"McCarthy","sequence":"additional","affiliation":[]},{"given":"Amelia","family":"Meier","sequence":"additional","affiliation":[]},{"given":"Mizuki","family":"Murai","sequence":"additional","affiliation":[]},{"given":"Emmanuelle","family":"Normand","sequence":"additional","affiliation":[]},{"given":"Virginie","family":"Vergnes","sequence":"additional","affiliation":[]},{"given":"Erin G.","family":"Wessling","sequence":"additional","affiliation":[]},{"given":"Roman M.","family":"Wittig","sequence":"additional","affiliation":[]},{"given":"Kevin","family":"Langergraber","sequence":"additional","affiliation":[]},{"given":"Nuria","family":"Maldonado","sequence":"additional","affiliation":[]},{"given":"Xinyu","family":"Yang","sequence":"additional","affiliation":[]},{"given":"Klaus","family":"Zuberb\u00fchler","sequence":"additional","affiliation":[]},{"given":"Christophe","family":"Boesch","sequence":"additional","affiliation":[]},{"given":"Mimi","family":"Arandjelovic","sequence":"additional","affiliation":[]},{"given":"Hjalmar","family":"K\u00fchl","sequence":"additional","affiliation":[]},{"given":"Tilo","family":"Burghardt","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2024,3,4]]},"reference":[{"key":"2003_CR1","doi-asserted-by":"crossref","unstructured":"Alshammari, S., Wang, Y. X., Ramanan, D., & Kong, S. (2022). Long-tailed recognition via weight balancing. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 6897\u20136907).","DOI":"10.1109\/CVPR52688.2022.00677"},{"key":"2003_CR2","doi-asserted-by":"crossref","unstructured":"Arandjelovic, M., Stephens, C. R., McCarthy, M. S., Dieguez, P., Kalan, A. K., Maldonado, N., Boesch, C., & Kuehl, H. S. (2016). Chimp &See: An online citizen science platform for large-scale, remote video camera trap annotation of chimpanzee behaviour, demography and individual identification. PeerJ Preprints.","DOI":"10.7287\/peerj.preprints.1792v1"},{"key":"2003_CR3","doi-asserted-by":"crossref","unstructured":"Bain, M., Nagrani, A., Schofield, D., Berdugo, S., Bessa, J., Owen, J., Hockings, K. J., Matsuzawa, T., Hayashi, M., Biro, D., & Carvalho, S. (2021). Automated audiovisual behavior recognition in wild primates. Science Advances,7(46), eabi4883","DOI":"10.1126\/sciadv.abi4883"},{"key":"2003_CR4","unstructured":"Beery, S., Agarwal, A., Cole, E., & Birodkar, V. (2021). The iwildcam 2021 competition dataset. arXiv preprint arXiv:2105.03494"},{"key":"2003_CR5","unstructured":"Beery, S., Morris, D., & Yang, S. (2019). Efficient pipeline for camera trap image review. arXiv preprint arXiv:1907.06772"},{"key":"2003_CR6","unstructured":"Bertasius, G., Wang, H., & Torresani, L. (2021). Is space-time attention all you need for video understanding? In Proceedings of the international conference on machine learning (ICML)."},{"key":"2003_CR7","unstructured":"Brookes, O., & Burghardt, T. (2020). A dataset and application for facial recognition of individual gorillas in zoo environments. In Workshop on the visual observation and analysis of vertebrate and insect behaviour. 2012.04689"},{"key":"2003_CR8","doi-asserted-by":"crossref","unstructured":"Brookes, O., Mirmehdi, M., K\u00fchl, H., & Burghardt, T. (2023). Triple-stream deep metric learning of great ape behavioural actions. In Proceedings of the 18th international joint conference on computer vision, imaging and computer graphics theory and applications (pp. 294\u2013302).","DOI":"10.5220\/0011798400003417"},{"key":"2003_CR9","doi-asserted-by":"crossref","unstructured":"Brust, C. A., Burghardt, T., Groenenberg, M., Kading, C., Kuhl, H. S., Manguette, M. L., & Denzler, J. (2017). Towards automated visual monitoring of individual gorillas in the wild. In Proceedings of the IEEE international conference on computer vision workshops (pp. 2820\u20132830).","DOI":"10.1109\/ICCVW.2017.333"},{"key":"2003_CR10","doi-asserted-by":"crossref","unstructured":"Cao, J., Pang, J., Weng, X., Khirodkar, R., & Kitani, K. (2023) Observation-centric sort: Rethinking sort for robust multi-object tracking. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 9686\u20139696).","DOI":"10.1109\/CVPR52729.2023.00934"},{"key":"2003_CR11","doi-asserted-by":"crossref","unstructured":"Carreira, J., & Zisserman, A. (2017). Quo vadis, action recognition? A new model and the kinetics dataset. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. pp 6299\u20136308).","DOI":"10.1109\/CVPR.2017.502"},{"issue":"9","key":"2003_CR12","doi-asserted-by":"publisher","first-page":"1663","DOI":"10.1111\/ddi.13358","volume":"27","author":"JS Carvalho","year":"2021","unstructured":"Carvalho, J. S., Graham, B., Bocksberger, G., Maisels, F., Williamson, E. A., Wich, S., Sop, T., Amarasekaran, B., Barca, B., Barrie, A., & Bergl, R. A. (2021). Predicting range shifts of African apes under global change scenarios. Diversity and Distributions, 27(9), 1663\u20131679.","journal-title":"Diversity and Distributions"},{"issue":"24","key":"2003_CR13","doi-asserted-by":"publisher","first-page":"13596","DOI":"10.1073\/pnas.1922686117","volume":"117","author":"G Ceballos","year":"2020","unstructured":"Ceballos, G., Ehrlich, P. R., & Raven, P. H. (2020). Vertebrates on the brink as indicators of biological annihilation and the sixth mass extinction. Proceedings of the National Academy of Sciences, 117(24), 13596\u201313602.","journal-title":"Proceedings of the National Academy of Sciences"},{"issue":"4\u20135","key":"2003_CR14","doi-asserted-by":"publisher","DOI":"10.1002\/ajp.23328","volume":"84","author":"J Chappell","year":"2022","unstructured":"Chappell, J., & Thorpe, S. K. (2022). The role of great ape behavioral ecology in one health: Implications for captive welfare and re-habilitation success. American Journal of Primatology, 84(4\u20135), e23328.","journal-title":"American Journal of Primatology"},{"key":"2003_CR15","doi-asserted-by":"crossref","unstructured":"Chen, J., Hu, M., Coker, D. J., Berumen, M. L., Costelloe, B., Beery, S., Rohrbach, A., & Elhoseiny, M. (2023). Mammalnet: A large-scale video benchmark for mammal recognition and behavior understanding. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 13052\u201313061).","DOI":"10.1109\/CVPR52729.2023.01254"},{"issue":"1\u20132","key":"2003_CR16","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.applanim.2011.10.010","volume":"135","author":"FE Clark","year":"2011","unstructured":"Clark, F. E. (2011). Great ape cognition and captive care: Can cognitive challenges enhance well-being? Applied Animal Behaviour Science, 135(1\u20132), 1\u201312.","journal-title":"Applied Animal Behaviour Science"},{"issue":"4","key":"2003_CR17","doi-asserted-by":"publisher","first-page":"1201","DOI":"10.1890\/11-1614.1","volume":"22","author":"TE Cox","year":"2012","unstructured":"Cox, T. E., Philippoff, J., Baumgartner, E., & Smith, C. M. (2012). Expert variability provides perspective on the strengths and weaknesses of citizen-driven intertidal monitoring program. Ecological Applications, 22(4), 1201\u20131212.","journal-title":"Ecological Applications"},{"key":"2003_CR18","doi-asserted-by":"crossref","unstructured":"Cui, Y., Jia, M., Lin, T. Y., Song, Y., & Belongie, S. (2019). Class-balanced loss based on effective number of samples. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 9268\u20139277).","DOI":"10.1109\/CVPR.2019.00949"},{"key":"2003_CR19","doi-asserted-by":"crossref","unstructured":"Cui, Y., Song, Y., Sun, C., Howard, A., & Belongie, S. (2018). Large scale fine-grained categorization and domain-specific transfer learning. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. pp 4109\u20134118).","DOI":"10.1109\/CVPR.2018.00432"},{"issue":"3","key":"2003_CR20","doi-asserted-by":"publisher","first-page":"236","DOI":"10.1093\/biosci\/biu001","volume":"64","author":"F Danielsen","year":"2014","unstructured":"Danielsen, F., Jensen, P. M., Burgess, N. D., Altamirano, R., Alviola, P. A., Andrianandrasana, H., Brashares, J. S., Burton, A. C., Coronado, I., Corpuz, N., & Enghoff, M. (2014). A multicountry assessment of tropical resource monitoring by local communities. BioScience, 64(3), 236\u2013251.","journal-title":"BioScience"},{"issue":"6","key":"2003_CR21","doi-asserted-by":"publisher","first-page":"345","DOI":"10.1016\/j.ecoinf.2011.06.003","volume":"6","author":"EH Fegraus","year":"2011","unstructured":"Fegraus, E. H., Lin, K., Ahumada, J. A., Baru, C., Chandra, S., & Youn, C. (2011). Data acquisition and management software for camera trap data: A case study from the team network. Ecological Informatics, 6(6), 345\u2013353.","journal-title":"Ecological Informatics"},{"key":"2003_CR22","doi-asserted-by":"crossref","unstructured":"Feichtenhofer, C. (2020). X3d: Expanding architectures for efficient video recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 203\u2013213).","DOI":"10.1109\/CVPR42600.2020.00028"},{"key":"2003_CR23","doi-asserted-by":"crossref","unstructured":"Freytag, A., Rodner, E., Simon, M., Loos, A., K\u00fchl, H. S., & Denzler, J. (2016). Chimpanzee faces in the wild: Log-Euclidean CNNs for predicting identities and attributes of primates. In German conference on pattern recognition (pp. 51\u201363). Springer.","DOI":"10.1007\/978-3-319-45886-1_5"},{"key":"2003_CR24","doi-asserted-by":"crossref","unstructured":"Hara, K., Kataoka, H., Satoh, & Y. (2017). Learning spatio-temporal features with 3d residual networks for action recognition. In Proceedings of the IEEE international conference on computer vision workshops (pp. 3154\u20133160).","DOI":"10.1109\/ICCVW.2017.373"},{"issue":"5","key":"2003_CR25","doi-asserted-by":"publisher","first-page":"395","DOI":"10.1017\/S0266467415000322","volume":"31","author":"B Haurez","year":"2015","unstructured":"Haurez, B., Da\u00efnou, K., Tagg, N., Petre, C. A., & Doucet, J. L. (2015). The role of great apes in seed dispersal of the tropical forest tree species Dacryodes normandii (Burseraceae) in Gabon. Journal of Tropical Ecology, 31(5), 395\u2013402.","journal-title":"Journal of Tropical Ecology"},{"key":"2003_CR26","doi-asserted-by":"publisher","DOI":"10.7717\/peerj.13510","volume":"10","author":"NA Houa","year":"2022","unstructured":"Houa, N. A., Cappelle, N., Bitty, E. A., Normand, E., Kablan, Y. A., & Boesch, C. (2022). Animal reactivity to camera traps and its effects on abundance estimate using distance sampling in the Ta\u00ef National Park, C\u00f4te d\u2019ivoire. PeerJ, 10, e13510.","journal-title":"PeerJ"},{"key":"2003_CR27","unstructured":"Kay, W., Carreira, J., Simonyan, K., Zhang, B., Hillier, C., Vijayanarasimhan, S., Viola, F., Green, T., Back, T., Natsev, P., & Suleyman, M. (2017). The kinetics human action video dataset. arXiv preprint arXiv:1705.06950"},{"key":"2003_CR28","unstructured":"Krasin, I., Duerig, T., Alldrin, N., Ferrari, V., Abu-El-Haija, S., Kuznetsova, A., Rom, H., Uijlings, J., Popov, S., Veit, A., & Belongie, S. (2017). Openimages: A public dataset for large-scale multi-label and multi-class image classification. Dataset available from https:\/\/storage.googleapis.com\/openimages\/web\/index.html"},{"key":"2003_CR29","unstructured":"Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems (pp. 1097\u20131105)."},{"key":"2003_CR30","doi-asserted-by":"crossref","unstructured":"Kuehne, H., Jhuang, H., Garrote, E., Poggio, T., Serre, T., (2011). HMDB: A large video database for human motion recognition. In 2011 International conference on computer vision (pp. 2556\u20132563). IEEE.","DOI":"10.1109\/ICCV.2011.6126543"},{"issue":"7","key":"2003_CR31","doi-asserted-by":"publisher","first-page":"432","DOI":"10.1016\/j.tree.2013.02.013","volume":"28","author":"HS K\u00fchl","year":"2013","unstructured":"K\u00fchl, H. S., & Burghardt, T. (2013). Animal biometrics: Quantifying and detecting phenotypic appearance. Trends in Ecology & Evolution, 28(7), 432\u2013441.","journal-title":"Trends in Ecology & Evolution"},{"key":"2003_CR32","doi-asserted-by":"crossref","unstructured":"Li, Y., Wu, C.Y., Fan, H., Mangalam, K., Xiong, B., Malik, J., & Feichtenhofer, C. (2022). Mvitv2: Improved multiscale vision transformers for classification and detection. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 4804\u20134814).","DOI":"10.1109\/CVPR52688.2022.00476"},{"key":"2003_CR33","doi-asserted-by":"crossref","unstructured":"Lin, T. Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., Doll\u00e1r, P., & Zitnick, C. L. (2014). Microsoft coco: Common objects in context. In Proceedings of the European conference on computer vision (pp. 740\u2013755). Springer.","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"2003_CR34","doi-asserted-by":"crossref","unstructured":"Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., Lin, S., & Guo, B. (2021). Swin transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 10012\u201310022).","DOI":"10.1109\/ICCV48922.2021.00986"},{"key":"2003_CR35","doi-asserted-by":"crossref","unstructured":"Liu, Z., Mao, H., Wu, C. Y., Feichtenhofer, C., Darrell, T., & Xie, S. (2022). A convnet for the 2020s. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 11976\u201311986).","DOI":"10.1109\/CVPR52688.2022.01167"},{"key":"2003_CR36","doi-asserted-by":"crossref","unstructured":"Liu, Z., Miao, Z., Zhan, X., Wang, J., Gong, B., & Yu, S. X. (2019). Large-scale long-tailed recognition in an open world. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 2537\u20132546).","DOI":"10.1109\/CVPR.2019.00264"},{"issue":"4","key":"2003_CR37","doi-asserted-by":"publisher","first-page":"1598","DOI":"10.1002\/ece3.7128","volume":"11","author":"MS McCarthy","year":"2021","unstructured":"McCarthy, M. S., Stephens, C., Dieguez, P., et al. (2021). Chimpanzee identification and social network construction through an online citizen science platform. Ecology and Evolution, 11(4), 1598\u20131608.","journal-title":"Ecology and Evolution"},{"key":"2003_CR38","unstructured":"Menon, A. K., Jayasumana, S., Rawat, A. S., Jain, H., Veit, A., & Kumar, S. (2020). Long-tail learning via logit adjustment. In Proceedings of the international conference on learning representations."},{"key":"2003_CR39","doi-asserted-by":"crossref","unstructured":"Ng, X. L., Ong, K. E., Zheng, Q., Ni, Y., Yeo, S. Y., & Liu, J. (2022). Animal kingdom: A large and diverse dataset for animal behavior understanding. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 19023\u201319034).","DOI":"10.1109\/CVPR52688.2022.01844"},{"issue":"2","key":"2003_CR40","doi-asserted-by":"publisher","first-page":"141","DOI":"10.1537\/ase.107.141","volume":"107","author":"T Nishida","year":"1999","unstructured":"Nishida, T., Kano, T., Goodall, J., McGrew, W. C., & Nakamura, M. (1999). Ethogram and ethnography of Mahale chimpanzees. Anthropological Science, 107(2), 141\u2013188.","journal-title":"Anthropological Science"},{"key":"2003_CR41","doi-asserted-by":"crossref","unstructured":"Parkhi, O., Vedaldi, A., & Zisserman, A. (2015). Deep face recognition. In Proceedings of the British machine vision conference. British Machine Vision Association.","DOI":"10.5244\/C.29.41"},{"key":"2003_CR42","doi-asserted-by":"crossref","unstructured":"Perrett, T., Sinha, S., Burghardt, T., Mirmehdi, M., & Damen, D. (2023). Use your head: Improving long-tail video recognition. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 2415\u20132425).","DOI":"10.1109\/CVPR52729.2023.00239"},{"key":"2003_CR43","doi-asserted-by":"crossref","unstructured":"Pollen, A. A., Kilik, U., Lowe, C. B., & Camp, J. G. (2023). Human-specific genetics: New tools to explore the molecular and cellular basis of human evolution. Nature Reviews Genetics, 1\u201325","DOI":"10.1038\/s41576-022-00568-4"},{"key":"2003_CR44","doi-asserted-by":"crossref","unstructured":"Redmon, J., & Farhadi, A. (2017). Yolo9000: Better, faster, stronger. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 7263\u20137271).","DOI":"10.1109\/CVPR.2017.690"},{"key":"2003_CR45","unstructured":"Redmon, J., & Farhadi, A. (2018). Yolov3: An incremental improvement. arXiv preprint arXiv:1804.02767"},{"key":"2003_CR46","unstructured":"Sakib, F., & Burghardt, T. (2020). Visual recognition of great ape behaviours in the wild. In Workshop on the visual observation and analysis of vertebrate and insect behaviour. 2011.10759."},{"issue":"1","key":"2003_CR47","doi-asserted-by":"publisher","first-page":"539","DOI":"10.1038\/s41467-020-20709-9","volume":"12","author":"L Samuni","year":"2021","unstructured":"Samuni, L., Crockford, C., & Wittig, R. M. (2021). Group-level cooperation in chimpanzees is shaped by strong social ties. Nature Communications, 12(1), 539.","journal-title":"Nature Communications"},{"key":"2003_CR48","doi-asserted-by":"crossref","unstructured":"Schofield, D., Nagrani, A., Zisserman, A., Hayashi, M., Matsuzawa, T., Biro, D., & Carvalho, S. (2019). Chimpanzee face recognition from videos in the wild using deep learning. Science Advances,5(9), eaaw0736.","DOI":"10.1126\/sciadv.aaw0736"},{"key":"2003_CR49","doi-asserted-by":"crossref","unstructured":"Sechidis, K., Tsoumakas, G., & Vlahavas, I. (2011). On the stratification of multi-label data. In Machine learning and knowledge discovery in databases (pp. 145\u2013158). Springer.","DOI":"10.1007\/978-3-642-23808-6_10"},{"key":"2003_CR50","unstructured":"Simonyan, K., & Zisserman, A. (2014). Two-stream convolutional networks for action recognition in videos. In Advances in neural information processing systems."},{"key":"2003_CR51","unstructured":"Soomro, K., Zamir, A. R., & Shah, M. (2012). Ucf101: A dataset of 101 human actions classes from videos in the wild. arXiv preprint arXiv:1212.0402"},{"issue":"1","key":"2003_CR52","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/sdata.2015.26","volume":"2","author":"A Swanson","year":"2015","unstructured":"Swanson, A., Kosmala, M., Lintott, C., Simpson, R., Smith, A., & Packer, C. (2015). Snapshot Serengeti, high-frequency annotated camera trap images of 40 mammalian species in an African savanna. Scientific Data, 2(1), 1\u201314.","journal-title":"Scientific Data"},{"issue":"1","key":"2003_CR53","first-page":"209","volume":"20","author":"P Szymanski","year":"2019","unstructured":"Szymanski, P., & Kajdanowicz, T. (2019). Scikit-multilearn: A scikit-based python environment for performing multi-label classification. The Journal of Machine Learning Research, 20(1), 209\u2013230.","journal-title":"The Journal of Machine Learning Research"},{"issue":"1","key":"2003_CR54","doi-asserted-by":"publisher","first-page":"coy013","DOI":"10.1093\/conphys\/coy013","volume":"6","author":"E Tarszisz","year":"2018","unstructured":"Tarszisz, E., Tomlinson, S., Harrison, M. E., Morrogh-Bernard, H. C., & Munn, A. J. (2018). An ecophysiologically informed model of seed dispersal by orangutans: Linking animal movement with gut passage across time and space. Conservation Physiology, 6(1), coy013.","journal-title":"Conservation Physiology"},{"issue":"1","key":"2003_CR55","doi-asserted-by":"publisher","first-page":"13915","DOI":"10.1038\/ncomms13915","volume":"7","author":"C Tennie","year":"2016","unstructured":"Tennie, C., Jensen, K., & Call, J. (2016). The nature of prosociality in chimpanzees. Nature Communications, 7(1), 13915.","journal-title":"Nature Communications"},{"key":"2003_CR56","doi-asserted-by":"crossref","unstructured":"Tran, D., Wang, H., Torresani, L., Ray, J., LeCun, Y., & Paluri, M. (2018). A closer look at spatiotemporal convolutions for action recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 6450\u20136459).","DOI":"10.1109\/CVPR.2018.00675"},{"issue":"1","key":"2003_CR57","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41467-022-27980-y","volume":"13","author":"D Tuia","year":"2022","unstructured":"Tuia, D., Kellenberger, B., Beery, S., Costelloe, B. R., Zuffi, S., Risse, B., Mathis, A., Mathis, M. W., van Langevelde, F., Burghardt, T., & Kays, R. (2022). Perspectives in machine learning for wildlife conservation. Nature Communications, 13(1), 1\u201315.","journal-title":"Nature Communications"},{"key":"2003_CR58","doi-asserted-by":"crossref","unstructured":"Van Horn, G., Mac Aodha, O., Song, Y., Cui, Y., Sun, C., Shepard, A., Adam, H., Perona, P., & Belongie, S. (2018). The inaturalist species classification and detection dataset. In Proceedings of the IEEE international conference on computer vision (pp. 8769\u20138778).","DOI":"10.1109\/CVPR.2018.00914"},{"key":"2003_CR59","unstructured":"Vi\u00e9, J. C., Hilton-Taylor, C., & Stuart, S. N. (2009). Wildlife in a changing world: An analysis of the 2008 IUCN Red List of threatened species. IUCN"},{"key":"2003_CR60","doi-asserted-by":"crossref","unstructured":"Yang, X., Burghardt, T., & Mirmehdi, M. (2023). Dynamic curriculum learning for great ape detection in the wild. International Journal of Computer Vision, 1\u201319","DOI":"10.1007\/s11263-023-01748-3"},{"key":"2003_CR61","doi-asserted-by":"crossref","unstructured":"Yang, X., Mirmehdi, M., & Burghardt, T. (2019). Great ape detection in challenging jungle camera trap footage via attention-based spatial and temporal feature blending. In Proceedings of the IEEE\/CVF international conference on computer vision workshops.","DOI":"10.1109\/ICCVW.2019.00034"},{"key":"2003_CR62","first-page":"510","volume-title":"Ethograms and the diversity of behaviors","author":"K Zamma","year":"2015","unstructured":"Zamma, K., & Matsusaka, T. (2015). Ethograms and the diversity of behaviors (pp. 510\u2013518). Cambridge University Press."},{"key":"2003_CR63","doi-asserted-by":"crossref","unstructured":"Zhang, H., Wang, Y., Dayoub, F., & Sunderhauf, N. (2021). Varifocalnet: An iou-aware dense object detector. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 8514\u20138523).","DOI":"10.1109\/CVPR46437.2021.00841"}],"container-title":["International Journal of Computer Vision"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11263-024-02003-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11263-024-02003-z\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11263-024-02003-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,7,11]],"date-time":"2024-07-11T14:21:52Z","timestamp":1720707712000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11263-024-02003-z"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,3,4]]},"references-count":63,"journal-issue":{"issue":"8","published-print":{"date-parts":[[2024,8]]}},"alternative-id":["2003"],"URL":"https:\/\/doi.org\/10.1007\/s11263-024-02003-z","relation":{},"ISSN":["0920-5691","1573-1405"],"issn-type":[{"value":"0920-5691","type":"print"},{"value":"1573-1405","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,3,4]]},"assertion":[{"value":"30 August 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"9 January 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"4 March 2024","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}