{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,21]],"date-time":"2025-02-21T12:43:38Z","timestamp":1740141818748,"version":"3.37.3"},"reference-count":33,"publisher":"Springer Science and Business Media LLC","issue":"3","license":[{"start":{"date-parts":[[2024,6,21]],"date-time":"2024-06-21T00:00:00Z","timestamp":1718928000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,6,21]],"date-time":"2024-06-21T00:00:00Z","timestamp":1718928000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001652","name":"Friedrich-Alexander-Universit\u00e4t Erlangen-N\u00fcrnberg","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100001652","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["IJDAR"],"published-print":{"date-parts":[[2024,9]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Transformers have emerged as the leading methods in natural language processing, computer vision, and multi-modal applications due to their ability to capture complex relationships and dependencies in data. In this study, we explore the potential of transformers as feature aggregators in the context of patch-based writer retrieval, with the objective of improving the quality of writer retrieval by effectively summarizing the relevant features from image patches. Our investigation underscores the complexity of leveraging transformers as feature aggregators in patch-based writer retrieval. While we have experimented with various model configurations, augmentations, and learning objectives, the performance of transformers in this task has room for improvement. This observation highlights the challenges in this domain and emphasizes the need for further research to enhance their effectiveness. By shedding light on the limitations of transformers in this context, our study contributes to the growing body of knowledge in the field of writer retrieval and provides valuable insights for future research and development in this area.\n<\/jats:p>","DOI":"10.1007\/s10032-024-00482-x","type":"journal-article","created":{"date-parts":[[2024,6,21]],"date-time":"2024-06-21T12:01:51Z","timestamp":1718971311000},"page":"265-274","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Evaluating learned feature aggregators for writer retrieval"],"prefix":"10.1007","volume":"27","author":[{"given":"Alexander","family":"Mattick","sequence":"first","affiliation":[]},{"given":"Martin","family":"Mayr","sequence":"additional","affiliation":[]},{"given":"Mathias","family":"Seuret","sequence":"additional","affiliation":[]},{"given":"Florian","family":"Kordon","sequence":"additional","affiliation":[]},{"given":"Fei","family":"Wu","sequence":"additional","affiliation":[]},{"given":"Vincent","family":"Christlein","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2024,6,21]]},"reference":[{"key":"482_CR1","doi-asserted-by":"crossref","unstructured":"Peer, M., Kleber, F., Sablatnig, R.: Towards writer retrieval for historical datasets. (2023)","DOI":"10.1007\/978-3-031-41676-7_24"},{"issue":"1","key":"482_CR2","doi-asserted-by":"publisher","first-page":"20220244","DOI":"10.1515\/jisys-2022-0244","volume":"32","author":"A Gattal","year":"2023","unstructured":"Gattal, A., Djeddi, C., Abbas, F., Siddiqi, I., Bouderah, B.: A new method for writer identification based on historical documents. J. Intell. Syst. 32(1), 20220244 (2023). https:\/\/doi.org\/10.1515\/jisys-2022-0244","journal-title":"J. Intell. Syst."},{"key":"482_CR3","doi-asserted-by":"crossref","unstructured":"Breci, E., Guarnera, L., Battiato, S.: A novel dataset for non-destructive inspection of handwritten documents. arXiv:2401.04448 (2024)","DOI":"10.1109\/ICASSP48485.2024.10448383"},{"key":"482_CR4","unstructured":"Khan, M.S.U.: A novel segmentation dataset for signatures on bank checks. arXiv:2104.12203 (2021)"},{"key":"482_CR5","doi-asserted-by":"crossref","unstructured":"Koepf, M., Kleber, F., Sablatnig, R.: Writer identification and writer retrieval using vision transformer for forensic documents. In: International Workshop on Document Analysis Systems (2022)","DOI":"10.1007\/978-3-031-06555-2_24"},{"key":"482_CR6","doi-asserted-by":"crossref","unstructured":"Arandjelovi\u0107, R., Gron\u00e1t, P., Torii, A., Pajdla, T., Sivic, J.: Netvlad: Cnn architecture for weakly supervised place recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5297\u20135307 (2015)","DOI":"10.1109\/CVPR.2016.572"},{"key":"482_CR7","unstructured":"Vaswani, A., Shazeer, N.M., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, L., Polosukhin, I.: Attention is all you need. In: NIPS (2017)"},{"key":"482_CR8","unstructured":"Xu, B., Wang, N., Chen, T., Li, M.: Empirical evaluation of rectified activations in convolutional network. arXiv:1505.00853 (2015)"},{"key":"482_CR9","unstructured":"Shazeer, N.M.: Glu variants improve transformer. arXiv:2002.05202 (2020)"},{"key":"482_CR10","unstructured":"Ba, J.L., Kiros, J.R., Hinton, G.E.: Layer normalization. arXiv preprint arXiv:1607.06450 (2016)"},{"key":"482_CR11","doi-asserted-by":"publisher","unstructured":"Jegou, H., Douze, M., Schmid, C., Perez, P.: Aggregating local descriptors into a compact image representation. In: 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 3304\u20133311. IEEE, San Francisco, CA, USA (2010). https:\/\/doi.org\/10.1109\/CVPR.2010.5540039","DOI":"10.1109\/CVPR.2010.5540039"},{"key":"482_CR12","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In:2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770\u2013778 (2015)","DOI":"10.1109\/CVPR.2016.90"},{"issue":"4","key":"482_CR13","doi-asserted-by":"publisher","first-page":"701","DOI":"10.1109\/TPAMI.2007.1009","volume":"29","author":"M Bulacu","year":"2007","unstructured":"Bulacu, M., Schomaker, L.: Text-independent writer identification and verification using textural and allographic features. IEEE Trans. Pattern Anal. Mach. Intell. 29(4), 701\u2013717 (2007)","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"issue":"1","key":"482_CR14","doi-asserted-by":"publisher","first-page":"162","DOI":"10.1016\/j.patcog.2011.07.005","volume":"45","author":"AA Brink","year":"2012","unstructured":"Brink, A.A., Smit, J., Bulacu, M.L., Schomaker, L.R.B.: Writer identification using directional ink-trace width measurements. Pattern Recogn. 45(1), 162\u2013171 (2012)","journal-title":"Pattern Recogn."},{"key":"482_CR15","doi-asserted-by":"crossref","unstructured":"Fiel, S., Sablatnig, R.: Writer identification and writer retrieval using the fisher vector on visual vocabularies. In: Document Analysis and Recognition (ICDAR), 2013 12th International Conference On, Washington DC, pp. 545\u2013549 (2013)","DOI":"10.1109\/ICDAR.2013.114"},{"issue":"3","key":"482_CR16","doi-asserted-by":"publisher","first-page":"526","DOI":"10.1109\/TIFS.2014.2301274","volume":"9","author":"X Wu","year":"2014","unstructured":"Wu, X., Tang, Y., Bu, W.: Offline text-independent writer identification based on scale invariant feature transform. IEEE Trans. Inf. Forensics Secur. 9(3), 526\u2013536 (2014)","journal-title":"IEEE Trans. Inf. Forensics Secur."},{"key":"482_CR17","doi-asserted-by":"crossref","unstructured":"Christlein, V., Bernecker, D., Angelopoulou, E.: Writer identification using vlad encoded contour-zernike moments. In: 2015 13th International Conference on Document Analysis and Recognition (ICDAR), vol. 2015-Novem. Nancy, pp. 906\u2013910 (2015)","DOI":"10.1109\/ICDAR.2015.7333893"},{"key":"482_CR18","doi-asserted-by":"publisher","first-page":"258","DOI":"10.1016\/j.patcog.2016.10.005","volume":"63","author":"V Christlein","year":"2017","unstructured":"Christlein, V., Bernecker, D., H\u00f6nig, F., Maier, A., Angelopoulou, E.: Writer identification using GMM supervectors and exemplar-SVMS. Pattern Recogn. 63, 258\u2013267 (2017)","journal-title":"Pattern Recogn."},{"key":"482_CR19","doi-asserted-by":"publisher","first-page":"451","DOI":"10.1016\/j.patcog.2016.09.044","volume":"63","author":"S He","year":"2017","unstructured":"He, S., Schomaker, L.: Writer identification using curvature-free features. Pattern Recogn. 63, 451\u2013464 (2017)","journal-title":"Pattern Recogn."},{"key":"482_CR20","doi-asserted-by":"crossref","unstructured":"Mohammed, H., Maergner, V., Konidaris, T., Stiehl, H.S.: Normalised local na\u00efve bayes nearest-neighbour classifier for offline writer identification. In: 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), Kyoto, pp. 1013\u20131018 (2017)","DOI":"10.1109\/ICDAR.2017.168"},{"key":"482_CR21","doi-asserted-by":"publisher","first-page":"3553","DOI":"10.1109\/TIFS.2020.2991880","volume":"15","author":"S Lai","year":"2020","unstructured":"Lai, S., Zhu, Y., Jin, L.: Encoding pathlet and sift features with bagged VLAD for historical writer identification. IEEE Trans. Inf. Forensics Secur. 15, 3553\u20133566 (2020)","journal-title":"IEEE Trans. Inf. Forensics Secur."},{"key":"482_CR22","doi-asserted-by":"crossref","unstructured":"Christlein, V., Gropp, M., Fiel, S., Maier, A.: Unsupervised feature learning for writer identification and writer retrieval. In: 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), vol. 01, pp. 991\u2013997 (2017)","DOI":"10.1109\/ICDAR.2017.165"},{"key":"482_CR23","doi-asserted-by":"publisher","first-page":"122","DOI":"10.1007\/978-3-031-21648-0_9","volume-title":"Front. Handwriting Recognit.","author":"M Peer","year":"2022","unstructured":"Peer, M., Kleber, F., Sablatnig, R.: Self-supervised vision transformers with\u00c2 data augmentation strategies using morphological operations for\u00c2 writer retrieval. In: Porwal, U., Forn\u00e9s, A., Shafait, F. (eds.) Front. Handwriting Recognit., pp. 122\u2013136. Springer, Cham (2022)"},{"key":"482_CR24","unstructured":"Zhu, Y., Yuan, H., Wang, S., Liu, J., Liu, W., Deng, C., Dou, Z., Wen, J.-r.: Large language models for information retrieval: a survey. arXiv:2308.07107 (2023)"},{"key":"482_CR25","doi-asserted-by":"publisher","first-page":"10491","DOI":"10.4249\/scholarpedia.10491","volume":"7","author":"T Lindeberg","year":"2012","unstructured":"Lindeberg, T.: Scale invariant feature transform. Scholarpedia 7, 10491 (2012)","journal-title":"Scholarpedia"},{"key":"482_CR26","doi-asserted-by":"publisher","unstructured":"Wen, Y., Zhang, K., Li, Z., Qiao, Y.: A Discriminative Feature Learning Approach for Deep Face Recognition. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) Computer Vision \u2013 ECCV 2016. Lecture Notes in Computer Science, pp. 499\u2013515. Springer International Publishing, Cham (2016). https:\/\/doi.org\/10.1007\/978-3-319-46478-7_31","DOI":"10.1007\/978-3-319-46478-7_31"},{"key":"482_CR27","unstructured":"Bardes, A., Ponce, J., LeCun, Y.: Vicreg: variance-invariance-covariance regularization for self-supervised learning. arXiv:2105.04906 (2021)"},{"key":"482_CR28","doi-asserted-by":"crossref","unstructured":"Musgrave, K., Belongie, S.J., Lim, S.-N.: A metric learning reality check. In: European Conference on Computer Vision (2020)","DOI":"10.1007\/978-3-030-58595-2_41"},{"key":"482_CR29","unstructured":"Bao, H., Dong, L., Wei, F.: Beit: Bert pre-training of image transformers. arXiv:2106.08254 (2021)"},{"key":"482_CR30","doi-asserted-by":"publisher","unstructured":"Fiel, S., Kleber, F., Diem, M., Christlein, V., Louloudis, G., Stamatopoulos, N., Gatos, B.: Scriptnet: Icdar2017 competition on historical document writer identification (historical-wi) (2018) https:\/\/doi.org\/10.5281\/zenodo.1324999","DOI":"10.5281\/zenodo.1324999"},{"key":"482_CR31","unstructured":"Dao, T., Fu, D.Y., Ermon, S., Rudra, A., R\u2019e, C.: Flashattention: Fast and memory-efficient exact attention with io-awareness. arXiv:2205.14135 (2022)"},{"key":"482_CR32","unstructured":"Chen, T., Xu, B., Zhang, C., Guestrin, C.: Training deep nets with sublinear memory cost. arXiv:1604.06174 (2016)"},{"key":"482_CR33","unstructured":"Bachlechner, T.C., Majumder, B.P., Mao, H.H., Cottrell, G., McAuley, J.: Rezero is all you need: fast convergence at large depth. In: Conference on Uncertainty in Artificial Intelligence (2020). https:\/\/api.semanticscholar.org\/CorpusID:212644626"}],"container-title":["International Journal on Document Analysis and Recognition (IJDAR)"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10032-024-00482-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10032-024-00482-x\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10032-024-00482-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,8,31]],"date-time":"2024-08-31T10:11:39Z","timestamp":1725099099000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10032-024-00482-x"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,6,21]]},"references-count":33,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2024,9]]}},"alternative-id":["482"],"URL":"https:\/\/doi.org\/10.1007\/s10032-024-00482-x","relation":{},"ISSN":["1433-2833","1433-2825"],"issn-type":[{"type":"print","value":"1433-2833"},{"type":"electronic","value":"1433-2825"}],"subject":[],"published":{"date-parts":[[2024,6,21]]},"assertion":[{"value":"14 November 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"20 May 2024","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"28 May 2024","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"21 June 2024","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}]}}