{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:10:33Z","timestamp":1750219833791,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":34,"publisher":"ACM","license":[{"start":{"date-parts":[[2023,8,25]],"date-time":"2023-08-25T00:00:00Z","timestamp":1692921600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Th\u00fcringer Ministerium f\u00fcr Wirtschaft, Wissenschaft und Digitale Gesellschaft"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,8,25]]},"DOI":"10.1145\/3604951.3605524","type":"proceedings-article","created":{"date-parts":[[2023,8,1]],"date-time":"2023-08-01T17:20:33Z","timestamp":1690910433000},"page":"13-18","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Drawing the Line: A Dual Evaluation Approach for Shaping Ground Truth in Image Retrieval Using Rich Visual Embeddings of Historical Images"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5344-4172","authenticated-orcid":false,"given":"David","family":"Tschirschwitz","sequence":"first","affiliation":[{"name":"Bauhaus-Universit\u00e4t Weimar, Germany"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3137-6732","authenticated-orcid":false,"given":"Franziska","family":"Klemstein","sequence":"additional","affiliation":[{"name":"Bauhaus-Universit\u00e4t Weimar, Germany"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9326-6902","authenticated-orcid":false,"given":"Henning","family":"Schmidgen","sequence":"additional","affiliation":[{"name":"Bauhaus-Universit\u00e4t Weimar, Germany"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4815-0118","authenticated-orcid":false,"given":"Volker","family":"Rodehorst","sequence":"additional","affiliation":[{"name":"Bauhaus-Universit\u00e4t Weimar, Germany"}]}],"member":"320","published-online":{"date-parts":[[2023,8,25]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"2022. Google Universal Image Embedding. https:\/\/kaggle.com\/competitions\/google-universal-image-embedding  2022. Google Universal Image Embedding. https:\/\/kaggle.com\/competitions\/google-universal-image-embedding"},{"key":"e_1_3_2_1_2_1","unstructured":"2023. The Virtual Laboratory. https:\/\/vlp-new.ur.de\/search?type=images  2023. The Virtual Laboratory. https:\/\/vlp-new.ur.de\/search?type=images"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-10590-1_38"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1007\/11744023_32"},{"key":"e_1_3_2_1_5_1","unstructured":"Jane Bromley Isabelle Guyon Yann LeCun Eduard S\u00e4ckinger and Roopak Shah. 1993. Signature Verification using a \"Siamese\" Time Delay Neural Network. In Advances in Neural Information Processing Systems Vol.\u00a06. Morgan-Kaufmann. https:\/\/proceedings.neurips.cc\/paper\/1993\/hash\/288cc0ff022877bd3df94bc9360b9c5d-Abstract.html  Jane Bromley Isabelle Guyon Yann LeCun Eduard S\u00e4ckinger and Roopak Shah. 1993. Signature Verification using a \"Siamese\" Time Delay Neural Network. In Advances in Neural Information Processing Systems Vol.\u00a06. Morgan-Kaufmann. https:\/\/proceedings.neurips.cc\/paper\/1993\/hash\/288cc0ff022877bd3df94bc9360b9c5d-Abstract.html"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01264-9_9"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.00951"},{"key":"e_1_3_2_1_8_1","first-page":"2640","volume-title":"Proceedings of the 37th International Conference on Machine Learning. PMLR, 1597\u20131607","author":"Chen Ting","year":"2020","unstructured":"Ting Chen , Simon Kornblith , Mohammad Norouzi , and Geoffrey Hinton . 2020 . A Simple Framework for Contrastive Learning of Visual Representations . In Proceedings of the 37th International Conference on Machine Learning. PMLR, 1597\u20131607 . https:\/\/proceedings.mlr.press\/v119\/chen20j.html ISSN: 2640 - 3498 . Ting Chen, Simon Kornblith, Mohammad Norouzi, and Geoffrey Hinton. 2020. A Simple Framework for Contrastive Learning of Visual Representations. In Proceedings of the 37th International Conference on Machine Learning. PMLR, 1597\u20131607. https:\/\/proceedings.mlr.press\/v119\/chen20j.html ISSN: 2640-3498."},{"key":"e_1_3_2_1_9_1","volume-title":"Deep learning for instance retrieval: A survey","author":"Chen Wei","year":"2022","unstructured":"Wei Chen , Yu Liu , Weiping Wang , Erwin\u00a0 M Bakker , Theodoros Georgiou , Paul Fieguth , Li Liu , and Michael\u00a0 S Lew . 2022. Deep learning for instance retrieval: A survey . IEEE Transactions on Pattern Analysis and Machine Intelligence ( 2022 ). Publisher : IEEE. Wei Chen, Yu Liu, Weiping Wang, Erwin\u00a0M Bakker, Theodoros Georgiou, Paul Fieguth, Li Liu, and Michael\u00a0S Lew. 2022. Deep learning for instance retrieval: A survey. IEEE Transactions on Pattern Analysis and Machine Intelligence (2022). Publisher: IEEE."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1"},{"key":"e_1_3_2_1_11_1","unstructured":"Alexey Dosovitskiy Lucas Beyer Alexander Kolesnikov Dirk Weissenborn Xiaohua Zhai Thomas Unterthiner Mostafa Dehghani Matthias Minderer Georg Heigold Sylvain Gelly Jakob Uszkoreit and Neil Houlsby. 2021. AN IMAGE IS WORTH 16X16 WORDS: TRANSFORMERS FOR IMAGE RECOGNITION AT SCALE. (2021).  Alexey Dosovitskiy Lucas Beyer Alexander Kolesnikov Dirk Weissenborn Xiaohua Zhai Thomas Unterthiner Mostafa Dehghani Matthias Minderer Georg Heigold Sylvain Gelly Jakob Uszkoreit and Neil Houlsby. 2021. AN IMAGE IS WORTH 16X16 WORDS: TRANSFORMERS FOR IMAGE RECOGNITION AT SCALE. (2021)."},{"key":"e_1_3_2_1_12_1","first-page":"2640","volume-title":"Proceedings of the 34th International Conference on Machine Learning. PMLR, 1126\u20131135","author":"Finn Chelsea","year":"2017","unstructured":"Chelsea Finn , Pieter Abbeel , and Sergey Levine . 2017 . Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks . In Proceedings of the 34th International Conference on Machine Learning. PMLR, 1126\u20131135 . https:\/\/proceedings.mlr.press\/v70\/finn17a.html ISSN: 2640 - 3498 . Chelsea Finn, Pieter Abbeel, and Sergey Levine. 2017. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks. In Proceedings of the 34th International Conference on Machine Learning. PMLR, 1126\u20131135. https:\/\/proceedings.mlr.press\/v70\/finn17a.html ISSN: 2640-3498."},{"key":"e_1_3_2_1_13_1","volume-title":"Advances in Neural Information Processing Systems, Vol.\u00a026. Curran Associates","author":"Frome Andrea","year":"2013","unstructured":"Andrea Frome , Greg\u00a0 S Corrado , Jon Shlens , Samy Bengio , Jeff Dean , Marc\u2019\u00a0Aurelio Ranzato , and Tomas Mikolov . 2013. DeViSE: A Deep Visual-Semantic Embedding Model . In Advances in Neural Information Processing Systems, Vol.\u00a026. Curran Associates , Inc .https:\/\/papers.nips.cc\/paper_files\/paper\/ 2013 \/hash\/7cce53cf90577442771720a370c3c723-Abstract.html Andrea Frome, Greg\u00a0S Corrado, Jon Shlens, Samy Bengio, Jeff Dean, Marc\u2019\u00a0Aurelio Ranzato, and Tomas Mikolov. 2013. DeViSE: A Deep Visual-Semantic Embedding Model. In Advances in Neural Information Processing Systems, Vol.\u00a026. Curran Associates, Inc.https:\/\/papers.nips.cc\/paper_files\/paper\/2013\/hash\/7cce53cf90577442771720a370c3c723-Abstract.html"},{"key":"e_1_3_2_1_14_1","volume-title":"Bootstrap your own latent-a new approach to self-supervised learning. Advances in neural information processing systems 33","author":"Grill Jean-Bastien","year":"2020","unstructured":"Jean-Bastien Grill , Florian Strub , Florent Altch\u2019e , Corentin Tallec , Pierre Richemond , Elena Buchatskaya , Carl Doersch , Bernardo Avila\u00a0Pires , Zhaohan Guo , Mohammad Gheshlaghi\u00a0Azar , 2020. Bootstrap your own latent-a new approach to self-supervised learning. Advances in neural information processing systems 33 ( 2020 ), 21271\u201321284. Jean-Bastien Grill, Florian Strub, Florent Altch\u2019e, Corentin Tallec, Pierre Richemond, Elena Buchatskaya, Carl Doersch, Bernardo Avila\u00a0Pires, Zhaohan Guo, Mohammad Gheshlaghi\u00a0Azar, 2020. Bootstrap your own latent-a new approach to self-supervised learning. Advances in neural information processing systems 33 (2020), 21271\u201321284."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00745"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.5281\/zenodo.5143773"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2010.5540039"},{"key":"e_1_3_2_1_18_1","volume-title":"Segment anything. arXiv preprint arXiv:2304.02643","author":"Kirillov Alexander","year":"2023","unstructured":"Alexander Kirillov , Eric Mintun , Nikhila Ravi , Hanzi Mao , Chloe Rolland , Laura Gustafson , Tete Xiao , Spencer Whitehead , Alexander\u00a0 C Berg , Wan-Yen Lo , 2023. Segment anything. arXiv preprint arXiv:2304.02643 ( 2023 ). Alexander Kirillov, Eric Mintun, Nikhila Ravi, Hanzi Mao, Chloe Rolland, Laura Gustafson, Tete Xiao, Spencer Whitehead, Alexander\u00a0C Berg, Wan-Yen Lo, 2023. Segment anything. arXiv preprint arXiv:2304.02643 (2023)."},{"key":"e_1_3_2_1_19_1","unstructured":"Ryan Kiros Ruslan Salakhutdinov and Richard\u00a0S. Zemel. 2014. Unifying Visual-Semantic Embeddings with Multimodal Neural Language Models. http:\/\/arxiv.org\/abs\/1411.2539 arXiv:1411.2539 [cs].  Ryan Kiros Ruslan Salakhutdinov and Richard\u00a0S. Zemel. 2014. Unifying Visual-Semantic Embeddings with Multimodal Neural Language Models. http:\/\/arxiv.org\/abs\/1411.2539 arXiv:1411.2539 [cs]."},{"volume-title":"Advances in Neural Information Processing Systems 25, F.\u00a0Pereira, C.\u00a0J.\u00a0C. Burges, L.\u00a0Bottou, and K.\u00a0Q","author":"Krizhevsky Alex","key":"e_1_3_2_1_20_1","unstructured":"Alex Krizhevsky , Ilya Sutskever , and Geoffrey\u00a0 E Hinton . 2012. ImageNet Classification with Deep Convolutional Neural Networks . In Advances in Neural Information Processing Systems 25, F.\u00a0Pereira, C.\u00a0J.\u00a0C. Burges, L.\u00a0Bottou, and K.\u00a0Q . Weinberger (Eds.). Curran Associates, Inc. , 1097\u20131105. http:\/\/papers.nips.cc\/paper\/4824-imagenet-classification-with-deep-convolutional-neural-networks.pdf Alex Krizhevsky, Ilya Sutskever, and Geoffrey\u00a0E Hinton. 2012. ImageNet Classification with Deep Convolutional Neural Networks. In Advances in Neural Information Processing Systems 25, F.\u00a0Pereira, C.\u00a0J.\u00a0C. Burges, L.\u00a0Bottou, and K.\u00a0Q. Weinberger (Eds.). Curran Associates, Inc., 1097\u20131105. http:\/\/papers.nips.cc\/paper\/4824-imagenet-classification-with-deep-convolutional-neural-networks.pdf"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1023\/B:VISI.0000029664.99615.94"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2018.2865674"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"crossref","unstructured":"Carlton\u00a0Wayne Niblack Ron Barber Will Equitz Myron\u00a0D Flickner Eduardo\u00a0H Glasman Dragutin Petkovic Peter Yanker Christos Faloutsos and Gabriel Taubin. 1993. QBIC project: querying images by content using color texture and shape. In Storage and retrieval for image and video databases Vol.\u00a01908. 173\u2013187. tex.organization: Spie.  Carlton\u00a0Wayne Niblack Ron Barber Will Equitz Myron\u00a0D Flickner Eduardo\u00a0H Glasman Dragutin Petkovic Peter Yanker Christos Faloutsos and Gabriel Taubin. 1993. QBIC project: querying images by content using color texture and shape. In Storage and retrieval for image and video databases Vol.\u00a01908. 173\u2013187. tex.organization: Spie.","DOI":"10.1117\/12.143648"},{"key":"e_1_3_2_1_24_1","unstructured":"Maxime Oquab Timoth\u00e9e Darcet Theo Moutakanni Huy\u00a0V. Vo Marc Szafraniec Vasil Khalidov Pierre Fernandez Daniel Haziza Francisco Massa Alaaeldin El-Nouby Russell Howes Po-Yao Huang Hu Xu Vasu Sharma Shang-Wen Li Wojciech Galuba Mike Rabbat Mido Assran Nicolas Ballas Gabriel Synnaeve Ishan Misra Herve Jegou Julien Mairal Patrick Labatut Armand Joulin and Piotr Bojanowski. 2023. DINOv2: Learning Robust Visual Features without Supervision.  Maxime Oquab Timoth\u00e9e Darcet Theo Moutakanni Huy\u00a0V. Vo Marc Szafraniec Vasil Khalidov Pierre Fernandez Daniel Haziza Francisco Massa Alaaeldin El-Nouby Russell Howes Po-Yao Huang Hu Xu Vasu Sharma Shang-Wen Li Wojciech Galuba Mike Rabbat Mido Assran Nicolas Ballas Gabriel Synnaeve Ishan Misra Herve Jegou Julien Mairal Patrick Labatut Armand Joulin and Piotr Bojanowski. 2023. DINOv2: Learning Robust Visual Features without Supervision."},{"key":"e_1_3_2_1_25_1","first-page":"2640","volume-title":"Proceedings of the 38th International Conference on Machine Learning. PMLR, 8748\u20138763","author":"Radford Alec","year":"2021","unstructured":"Alec Radford , Jong\u00a0Wook Kim , Chris Hallacy , Aditya Ramesh , Gabriel Goh , Sandhini Agarwal , Girish Sastry , Amanda Askell , Pamela Mishkin , Jack Clark , Gretchen Krueger , and Ilya Sutskever . 2021 . Learning Transferable Visual Models From Natural Language Supervision . In Proceedings of the 38th International Conference on Machine Learning. PMLR, 8748\u20138763 . https:\/\/proceedings.mlr.press\/v139\/radford21a.html ISSN: 2640 - 3498 . Alec Radford, Jong\u00a0Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, Gretchen Krueger, and Ilya Sutskever. 2021. Learning Transferable Visual Models From Natural Language Supervision. In Proceedings of the 38th International Conference on Machine Learning. PMLR, 8748\u20138763. https:\/\/proceedings.mlr.press\/v139\/radford21a.html ISSN: 2640-3498."},{"key":"e_1_3_2_1_26_1","volume-title":"Language models are unsupervised multitask learners. OpenAI blog 1, 8","author":"Radford Alec","year":"2019","unstructured":"Alec Radford , Jeffrey Wu , Rewon Child , David Luan , Dario Amodei , Ilya Sutskever , 2019. Language models are unsupervised multitask learners. OpenAI blog 1, 8 ( 2019 ), 9. Alec Radford, Jeffrey Wu, Rewon Child, David Luan, Dario Amodei, Ilya Sutskever, 2019. Language models are unsupervised multitask learners. OpenAI blog 1, 8 (2019), 9."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.01044"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW.2014.131"},{"key":"e_1_3_2_1_29_1","volume-title":"Thirty-sixth conference on neural information processing systems datasets and benchmarks track. https:\/\/openreview.net\/forum?id=M3Y74vmsMcY","author":"Schuhmann Christoph","year":"2022","unstructured":"Christoph Schuhmann , Romain Beaumont , Richard Vencu , Cade\u00a0 W Gordon , Ross Wightman , Mehdi Cherti , Theo Coombes , Aarush Katta , Clayton Mullis , Mitchell Wortsman , Patrick Schramowski , Srivatsa\u00a0 R Kundurthy , Katherine Crowson , Ludwig Schmidt , Robert Kaczmarczyk , and Jenia Jitsev . 2022 . LAION-5B: An open large-scale dataset for training next generation image-text models . In Thirty-sixth conference on neural information processing systems datasets and benchmarks track. https:\/\/openreview.net\/forum?id=M3Y74vmsMcY Christoph Schuhmann, Romain Beaumont, Richard Vencu, Cade\u00a0W Gordon, Ross Wightman, Mehdi Cherti, Theo Coombes, Aarush Katta, Clayton Mullis, Mitchell Wortsman, Patrick Schramowski, Srivatsa\u00a0R Kundurthy, Katherine Crowson, Ludwig Schmidt, Robert Kaczmarczyk, and Jenia Jitsev. 2022. LAION-5B: An open large-scale dataset for training next generation image-text models. In Thirty-sixth conference on neural information processing systems datasets and benchmarks track. https:\/\/openreview.net\/forum?id=M3Y74vmsMcY"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2003.1238663"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00177"},{"key":"e_1_3_2_1_32_1","volume-title":"International conference on machine learning. PMLR, 6105\u20136114","author":"Tan Mingxing","year":"2019","unstructured":"Mingxing Tan and Quoc Le . 2019 . Efficientnet: Rethinking model scaling for convolutional neural networks . In International conference on machine learning. PMLR, 6105\u20136114 . Mingxing Tan and Quoc Le. 2019. Efficientnet: Rethinking model scaling for convolutional neural networks. In International conference on machine learning. PMLR, 6105\u20136114."},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-16788-1_22"},{"key":"e_1_3_2_1_34_1","volume-title":"Advances in Neural Information Processing Systems, Vol.\u00a027. Curran Associates","author":"Yosinski Jason","year":"2014","unstructured":"Jason Yosinski , Jeff Clune , Yoshua Bengio , and Hod Lipson . 2014. How transferable are features in deep neural networks? . In Advances in Neural Information Processing Systems, Vol.\u00a027. Curran Associates , Inc .https:\/\/papers.nips.cc\/paper_files\/paper\/ 2014 \/hash\/375c71349b295fbe2dcdca9206f20a06-Abstract.html Jason Yosinski, Jeff Clune, Yoshua Bengio, and Hod Lipson. 2014. How transferable are features in deep neural networks?. In Advances in Neural Information Processing Systems, Vol.\u00a027. Curran Associates, Inc.https:\/\/papers.nips.cc\/paper_files\/paper\/2014\/hash\/375c71349b295fbe2dcdca9206f20a06-Abstract.html"}],"event":{"name":"HIP '23: 7th International Workshop on Historical Document Imaging and Processing","acronym":"HIP '23","location":"San Jose CA USA"},"container-title":["Proceedings of the 7th International Workshop on Historical Document Imaging and Processing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3604951.3605524","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3604951.3605524","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:46:38Z","timestamp":1750178798000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3604951.3605524"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,8,25]]},"references-count":34,"alternative-id":["10.1145\/3604951.3605524","10.1145\/3604951"],"URL":"https:\/\/doi.org\/10.1145\/3604951.3605524","relation":{},"subject":[],"published":{"date-parts":[[2023,8,25]]},"assertion":[{"value":"2023-08-25","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}