{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,28]],"date-time":"2026-01-28T22:05:46Z","timestamp":1769637946468,"version":"3.49.0"},"reference-count":38,"publisher":"Oxford University Press (OUP)","issue":"Supplement_2","license":[{"start":{"date-parts":[[2021,10,1]],"date-time":"2021-10-01T00:00:00Z","timestamp":1633046400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,11,5]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Stylometric analysis of medieval vernacular texts is still a significant challenge: the importance of scribal variation, be it spelling or more substantial, as well as the variants and errors introduced in the tradition, complicate the task of the would-be stylometrist, by inducing noise and perhaps even interferences in the authorship signal. Basing the analysis on the study of the copy from a single hand of several texts can partially mitigate these issues (Camps and Cafiero, 2013, Setting bounds in a homogeneous corpus: a methodological study applied to medieval literature. Revue Des Nouvelles Technologies de l\u2019information (RNTI), SHS-1, pp. 55\u201384), but the limited availability of complete diplomatic transcriptions might make this difficult. In this article, we use a workflow combining handwritten text recognition and stylometric analysis, applied to the case of the hagiographic works contained in MS BnF, fr. 412. We seek to evaluate Paul Meyer's hypothesis about the constitution of groups of hagiographic works, as well as to examine potential authorial groupings in a vastly anonymous corpus.<\/jats:p>","DOI":"10.1093\/llc\/fqab033","type":"journal-article","created":{"date-parts":[[2021,3,24]],"date-time":"2021-03-24T12:10:18Z","timestamp":1616587818000},"page":"ii49-ii71","source":"Crossref","is-referenced-by-count":5,"title":["Noisy medieval data, from digitized manuscript to stylometric analysis: Evaluating Paul Meyer\u2019s hagiographic hypothesis"],"prefix":"10.1093","volume":"36","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-0385-7037","authenticated-orcid":false,"given":"Jean-Baptiste","family":"Camps","sequence":"first","affiliation":[{"name":"\u00c9cole Nationale des Chartes, France"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1852-9204","authenticated-orcid":false,"given":"Thibault","family":"Cl\u00e9rice","sequence":"additional","affiliation":[{"name":"\u00c9cole Nationale des Chartes; Universit\u00e9 Lyon 3, France"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7843-5050","authenticated-orcid":false,"given":"Ariane","family":"Pinche","sequence":"additional","affiliation":[{"name":"\u00c9cole Nationale des Chartes; Universit\u00e9 Lyon 3, France"}]}],"member":"286","published-online":{"date-parts":[[2021,11,5]]},"reference":[{"key":"2021110520325235900_fqab033-B1","article-title":"Measuring the usefulness of function words for authorship attribution","author":"Argamon","year":"2005","journal-title":"Proceedings of the 2005 ACH\/ALLC Conference;"},{"key":"2021110520325235900_fqab033-B2","first-page":"58","article-title":"Automatic dating of medieval charters from Denmark","volume":"vol. 2364","author":"Boldsen","year":"2019","journal-title":"Proceedings of the Digital Humanities in the Nordic Countries 4th Conference, CEUR Workshop Proceedings"},{"key":"2021110520325235900_fqab033-B3","article-title":"Why Moli\u00e8re most likely did write his plays","volume-title":"Science Advances","author":"Cafiero","year":"2019"},{"key":"2021110520325235900_fqab033-B4","author":"Camps","year":"2019"},{"key":"2021110520325235900_fqab033-B5","first-page":"55","article-title":"Setting bounds in a homogeneous corpus: a methodological study applied to medieval literature","author":"Camps","year":"2013","journal-title":"Revue Des Nouvelles Technologies de l\u2019information (RNTI)"},{"key":"2021110520325235900_fqab033-B6","author":"Careri","year":"2001"},{"key":"2021110520325235900_fqab033-B7","article-title":"Evaluating deep learning methods for word segmentation of Scripta Continua texts in old French and Latin","volume":"2020","author":"Cl\u00e9rice","year":"2019","journal-title":"Journal of Data Mining and Digital Humanities"},{"key":"2021110520325235900_fqab033-B8","article-title":"Deucalion, Mod\u00e8le Ancien Francais (0.2.0)","author":"Cl\u00e9rice","year":"2019","journal-title":"Zenodo"},{"key":"2021110520325235900_fqab033-B9","article-title":"Classification of medieval documents: determining the issuer, place of issue, and decade for Old Swedish Charters. DHN 2020 Digital Humanities in the Nordic Countries: Proceedings of the Digital Humanities in the Nordic Countries 5th Conference \/ [ed] Sanita Reinsone, Inguna Skadi\u0146a, Anda Bakl\u0101ne, and J\u0101nis Daugavietis, 2020, pp. 12\u201323","author":"Dahll\u00f6f","year":"2020"},{"key":"2021110520325235900_fqab033-B10","article-title":"Wauchier de Denain, polygraphe du XIIIe si\u00e8cle, Aix-en-Provence: Presses universitaires de Provence","author":"Douchet","year":"2015"},{"key":"2021110520325235900_fqab033-B11","article-title":"Short samples in authorship attribution: a new approach. In DH","author":"Eder","year":"2017"},{"key":"2021110520325235900_fqab033-B12","article-title":"Does size matter? authorship attribution, small samples, big problem. Digital Scholarship in the Humanities, 30(2): 167\u201382. doi: 10.1093\/llc\/fqt066","author":"Eder","year":"2015"},{"issue":"4","key":"2021110520325235900_fqab033-B13","doi-asserted-by":"crossref","first-page":"603","DOI":"10.1093\/llc\/fqt039","article-title":"Mind your corpus: systematic errors in authorship attribution","volume":"28","author":"Eder","year":"2013","journal-title":"Literary and Linguistic Computing"},{"issue":"suppl_2","key":"2021110520325235900_fqab033-B14","doi-asserted-by":"crossref","first-page":"ii4","DOI":"10.1093\/llc\/fqx023","article-title":"Understanding and explaining Delta measures for authorship attribution","volume":"32","author":"Evert","year":"2017","journal-title":"Digital Scholarship in the Humanities"},{"key":"2021110520325235900_fqab033-B15","doi-asserted-by":"crossref","DOI":"10.3389\/fdigh.2018.00004","article-title":"Attributing authorship in the noisy digitized correspondence of Jacob and Wilhelm Grimm","volume":"5","author":"Franzini","year":"2018","journal-title":"Frontiers in Digital Humanities"},{"key":"2021110520325235900_fqab033-B16","doi-asserted-by":"crossref","first-page":"95","DOI":"10.1109\/DAS.2012.23","article-title":"Binarization-free text line segmentation for historical documents based on interest point clustering. In","author":"Garz","year":"2012","journal-title":"2012 10th IAPR International Workshop on Document Analysis Systems"},{"key":"2021110520325235900_fqab033-B17","first-page":"741","article-title":"Document embeddings learned on various types of N-Grams for cross-topic authorship attribution. Computing,","author":"G\u00f3mez-Adorno","year":"2018"},{"key":"2021110520325235900_fqab033-B18","author":"Ing"},{"key":"2021110520325235900_fqab033-B19","author":"Jannidis","year":"2015"},{"key":"2021110520325235900_fqab033-B20","first-page":"59","article-title":"Function words in authorship attribution. From black magic to theory? In Proceedings of the 3rd Workshop on Computational Linguistics for Literature (CLFL)","author":"Kestemont","year":"2014"},{"issue":"1","key":"2021110520325235900_fqab033-B21","doi-asserted-by":"crossref","first-page":"9","DOI":"10.1002\/asi.20961","article-title":"Computational methods in authorship attribution","volume":"60","author":"Koppel","year":"2009","journal-title":". Journal of the American Society for Information Science & Technology"},{"key":"2021110520325235900_fqab033-B22","author":"Kunstmann","year":"2009"},{"key":"2021110520325235900_fqab033-B23","author":"Manjavacas","year":"2019"},{"key":"2021110520325235900_fqab033-B24","author":"Manjavacas","year":"2019"},{"key":"2021110520325235900_fqab033-B25","first-page":"328","article-title":"L\u00e9gendes hagiographiques en fran\u00e7ais. In Histoire litt\u00e9raire de la France vol. 33. Paris, France, pp.","author":"Meyer","year":"1906"},{"issue":"1","key":"2021110520325235900_fqab033-B26","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1080\/09296174.2011.533588","article-title":"Finding the minimum document length for reliable clustering of multi-document natural language corpora","volume":"18","author":"Moisl","year":"2011","journal-title":". Journal of Quantitative Linguistics"},{"key":"2021110520325235900_fqab033-B27","author":"Olivier-Martin","year":"2018"},{"key":"2021110520325235900_fqab033-B28","author":"Perreaux","year":"2011"},{"key":"2021110520325235900_fqab033-B29","author":"Perrot","year":"1992"},{"key":"2021110520325235900_fqab033-B30","author":"Philippart","year":"1977"},{"key":"2021110520325235900_fqab033-B31","author":"Pinche","year":"2019"},{"key":"2021110520325235900_fqab033-B32","author":"Pinche"},{"key":"2021110520325235900_fqab033-B33","first-page":"93","author":"Sapkota","year":"2015"},{"issue":"2","key":"2021110520325235900_fqab033-B34","first-page":"421","article-title":"On the robustness of authorship attribution based on character n-gram features","volume":"21","author":"Stamatatos","year":"2013","journal-title":"Journal of Law and Policy"},{"key":"2021110520325235900_fqab033-B35","author":"Stutzmann","year":"2013"},{"key":"2021110520325235900_fqab033-B36","first-page":"21","author":"Stutzmann","year":"2019"},{"issue":"301","key":"2021110520325235900_fqab033-B37","doi-asserted-by":"crossref","first-page":"236","DOI":"10.1080\/01621459.1963.10500845","article-title":"Hierarchical grouping to optimize an objective function","volume":"58","author":"Ward","year":"1963","journal-title":"Journal of the American Statistical Association"},{"key":"2021110520325235900_fqab033-B38","first-page":"48","author":"Wahlberg","year":"2016"}],"container-title":["Digital Scholarship in the Humanities"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/dsh\/article-pdf\/36\/Supplement_2\/ii49\/41091132\/fqab033.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/dsh\/article-pdf\/36\/Supplement_2\/ii49\/41091132\/fqab033.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,11,5]],"date-time":"2021-11-05T20:33:23Z","timestamp":1636144403000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/dsh\/article\/36\/Supplement_2\/ii49\/6421789"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,10,1]]},"references-count":38,"journal-issue":{"issue":"Supplement_2","published-online":{"date-parts":[[2021,11,5]]},"published-print":{"date-parts":[[2021,11,5]]}},"URL":"https:\/\/doi.org\/10.1093\/llc\/fqab033","relation":{},"ISSN":["2055-7671","2055-768X"],"issn-type":[{"value":"2055-7671","type":"print"},{"value":"2055-768X","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,10,1]]},"published":{"date-parts":[[2021,10,1]]}}}