{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,5,21]],"date-time":"2025-05-21T19:29:05Z","timestamp":1747855745460},"reference-count":18,"publisher":"Oxford University Press (OUP)","issue":"1","license":[{"start":{"date-parts":[[2021,4,14]],"date-time":"2021-04-14T00:00:00Z","timestamp":1618358400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,3,23]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>For the identification of scribes and authors in handwritten documents, methods from classical linguistic analysis are combined with modern computer vision approaches to enhance the knowledge discovery process. One important finding is that it is possible to train neural networks for automatic transcription of handwritten documents and to use these transcriptions as input for statistical analysis. Furthermore, hypotheses about scribes can be tested by extracting visual handwriting features and clustering them. From a linguistic point of view, the R package stylo is a useful tool to analyse and cluster texts. Unfortunately, it only achieves a high level of accuracy with longer texts. For texts under 5000 words it is more suitable to measure their Euclidean distance based on a set of linguistic features. Both approaches, the analysis with stylo and the Euclidean distance, in combination with neural networks for automatic transcription and clustering allow for more precise statements about the relationship between texts, authors and scribes, even if the documents are under 1,000 words.<\/jats:p>","DOI":"10.1093\/llc\/fqaa004","type":"journal-article","created":{"date-parts":[[2020,1,23]],"date-time":"2020-01-23T20:09:20Z","timestamp":1579810160000},"page":"254-263","source":"Crossref","is-referenced-by-count":1,"title":["Scribe versus authorship attribution and clustering in historic Czech manuscripts: a case study with visual and linguistic features"],"prefix":"10.1093","volume":"37","author":[{"given":"Aleksej","family":"Tikhonov","sequence":"first","affiliation":[{"name":"Institut f\u00fcr Slawistik & Hungarologie, Humboldt University, Berlin, Germany (HU)"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Klaus","family":"M\u00fcller","sequence":"additional","affiliation":[{"name":"MusterFabrik Berlin, Berlin, Germany (MFB)"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2021,4,14]]},"reference":[{"key":"2022032516090088700_fqaa004-B1","volume-title":"Gottesacker-Geschichten als Ged\u00e4chtnis: eine Ethnographie zur Herrnhuter Erinnerungskultur am Beispiel von Neudietendorfer Lebensl\u00e4ufen\/Stephanie B\u00f6\u00df, Studien zur Volkskunde in Th\u00fcringen BV035193383 Band 6","author":"B\u00f6\u00df","year":"2016"},{"key":"2022032516090088700_fqaa004-B2","author":"Brink","year":"2009"},{"key":"2022032516090088700_fqaa004-B5","doi-asserted-by":"crossref","first-page":"603","DOI":"10.1093\/llc\/fqt039","article-title":"Mind your corpus: systematic errors in authorship attribution","volume":"28","author":"Eder","year":"2013","journal-title":"Literary and Linguistic Computing"},{"key":"2022032516090088700_fqaa004-B3","first-page":"167","author":"Eder","year":"2015"},{"key":"2022032516090088700_fqaa004-B4","doi-asserted-by":"crossref","first-page":"50","DOI":"10.1093\/llc\/fqv061","article-title":"Visualization in stylometry: cluster analysis using networks","volume":"32","author":"Eder","year":"2017","journal-title":"Digital Scholarship in the Humanities"},{"key":"2022032516090088700_fqaa004-B6","author":"Eder","year":"2016"},{"key":"2022032516090088700_fqaa004-B7","first-page":"369","author":"Graves","year":"2006"},{"key":"2022032516090088700_fqaa004-B9","doi-asserted-by":"crossref","DOI":"10.1017\/CBO9780511518942","volume-title":"The Authorship of Shakespeare\u2019s Plays\u202f: A Socio-Linguistic Study","author":"Hope","year":"1994"},{"key":"2022032516090088700_fqaa004-B10","first-page":"1097","volume-title":"Advances in Neural Information Processing Systems 25","author":"Krizhevsky","year":"2012"},{"key":"2022032516090088700_fqaa004-B11","volume-title":"Mit einem Geleitw. von Dietrich Meyer und einer Einf. von Ulrich Herrmann","author":"Lost","year":"2007"},{"key":"2022032516090088700_fqaa004-B12","doi-asserted-by":"crossref","first-page":"39","DOI":"10.1007\/s100320200071","article-title":"The IAM-database: an English sentence database for off-line handwriting recognition","volume":"5","author":"Marti","year":"2002","journal-title":"International Journal on Document Analysis and Recognition"},{"key":"2022032516090088700_fqaa004-B13","author":"Mettele","year":"2009"},{"key":"2022032516090088700_fqaa004-B15","author":"Meyer","year":"2007"},{"key":"2022032516090088700_fqaa004-B14","author":"Meyer","year":"2015"},{"key":"2022032516090088700_fqaa004-B16","volume-title":"Pattern Recognition and Neural Networks","author":"Ripley","year":"2007"},{"key":"2022032516090088700_fqaa004-B17","first-page":"1","article-title":"Visualizing data using t-SNE","author":"van der Maaten","year":"2008","journal-title":"Journal of Machine Learning Research,"},{"issue":"2","key":"2022032516090088700_fqaa004-B18","article-title":"Stylometry and characterisation in The Big Bang Theory","volume":"37","author":"van Zyl","year":"2016","journal-title":"Literator\u2014Journal of Literary Criticism, Comparative Linguistics and Literary Studies"},{"key":"2022032516090088700_fqaa004-B19","author":"Viehhauser","year":"2015"}],"container-title":["Digital Scholarship in the Humanities"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/dsh\/article-pdf\/37\/1\/254\/42987140\/fqaa004.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/dsh\/article-pdf\/37\/1\/254\/42987140\/fqaa004.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,3,25]],"date-time":"2022-03-25T20:24:17Z","timestamp":1648239857000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/dsh\/article\/37\/1\/254\/6226031"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,4,14]]},"references-count":18,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2021,4,14]]},"published-print":{"date-parts":[[2022,3,23]]}},"URL":"https:\/\/doi.org\/10.1093\/llc\/fqaa004","relation":{},"ISSN":["2055-7671","2055-768X"],"issn-type":[{"value":"2055-7671","type":"print"},{"value":"2055-768X","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2022,4,1]]},"published":{"date-parts":[[2021,4,14]]}}}