{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,11]],"date-time":"2026-06-11T21:56:03Z","timestamp":1781214963333,"version":"3.54.1"},"reference-count":261,"publisher":"MIT Press","issue":"3","license":[{"start":{"date-parts":[[2023,5,25]],"date-time":"2023-05-25T00:00:00Z","timestamp":1684972800000},"content-version":"vor","delay-in-days":144,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc-nd\/4.0\/"}],"content-domain":{"domain":["direct.mit.edu"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,9,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Ancient languages preserve the cultures and histories of the past. However, their study is fraught with difficulties, and experts must tackle a range of challenging text-based tasks, from deciphering lost languages to restoring damaged inscriptions, to determining the authorship of works of literature. Technological aids have long supported the study of ancient texts, but in recent years advances in artificial intelligence and machine learning have enabled analyses on a scale and in a detail that are reshaping the field of humanities, similarly to how microscopes and telescopes have contributed to the realm of science. This article aims to provide a comprehensive survey of published research using machine learning for the study of ancient texts written in any language, script, and medium, spanning over three and a half millennia of civilizations around the ancient world. To analyze the relevant literature, we introduce a taxonomy of tasks inspired by the steps involved in the study of ancient documents: digitization, restoration, attribution, linguistic analysis, textual criticism, translation, and decipherment. 
This work offers three major contributions: first, mapping the interdisciplinary field carved out by the synergy between the humanities and machine learning; second, highlighting how active collaboration between specialists from both fields is key to producing impactful and compelling scholarship; third, highlighting promising directions for future work in this field. Thus, this work promotes and supports the continued collaborative impetus between the humanities and machine learning.<\/jats:p>","DOI":"10.1162\/coli_a_00481","type":"journal-article","created":{"date-parts":[[2023,5,25]],"date-time":"2023-05-25T19:40:33Z","timestamp":1685043633000},"page":"703-747","update-policy":"https:\/\/doi.org\/10.1162\/mitpressjournals.corrections.policy","source":"Crossref","is-referenced-by-count":58,"title":["Machine Learning for Ancient Languages: A Survey"],"prefix":"10.1162","volume":"49","author":[{"given":"Thea","family":"Sommerschield","sequence":"first","affiliation":[{"name":"Ca\u2019 Foscari University of Venice, Department of Humanities. thea.sommerschield@unive.it"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yannis","family":"Assael","sequence":"additional","affiliation":[{"name":"Google DeepMind. yannisassael@google.com"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"John","family":"Pavlopoulos","sequence":"additional","affiliation":[{"name":"Athens University of Economics and Business. annis@aueb.gr"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Vanessa","family":"Stefanak","sequence":"additional","affiliation":[{"name":"Google DeepMind"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Andrew","family":"Senior","sequence":"additional","affiliation":[{"name":"Google DeepMind. andrewsenior@google.com"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Chris","family":"Dyer","sequence":"additional","affiliation":[{"name":"Google DeepMind. 
cdyer@google.com"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"John","family":"Bodel","sequence":"additional","affiliation":[{"name":"Brown University Classics Faculty. John_Bodel@brown.edu"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jonathan","family":"Prag","sequence":"additional","affiliation":[{"name":"University of Oxford, Faculty of Classics. jonathan.prag@merton.ox.ac.uk"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Ion","family":"Androutsopoulos","sequence":"additional","affiliation":[{"name":"Athens University of Economics and Business, Department of Informatics. ion@aueb.gr"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Nando","family":"de Freitas","sequence":"additional","affiliation":[{"name":"Google DeepMind. nandodefreitas@google.com"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"281","published-online":{"date-parts":[[2023,9,1]]},"reference":[{"key":"2023111518555903300_bib1","doi-asserted-by":"publisher","first-page":"64","DOI":"10.1109\/ASAR.2017.8067761","article-title":"WAHD: A database for writer identification of Arabic historical documents","volume-title":"International Workshop on Arabic Script Analysis and Recognition (ASAR)","author":"Abdelhaleem","year":"2017"},{"issue":"3","key":"2023111518555903300_bib2","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3460961","article-title":"Machine learning based assembly of fragments of ancient papyrus","volume":"14","author":"Abitbol","year":"2021","journal-title":"Journal on Computing and Cultural Heritage (JOCCH)"},{"issue":"4","key":"2023111518555903300_bib3","doi-asserted-by":"publisher","first-page":"283","DOI":"10.1007\/s10032-018-0312-3","article-title":"KERTAS: Dataset for automatic dating of ancient Arabic manuscripts","volume":"21","author":"Adam","year":"2018","journal-title":"International Journal on Document Analysis and Recognition 
(IJDAR)"},{"key":"2023111518555903300_bib4","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1109\/ITSS-IoE53029.2021.9615302","article-title":"Arabic poetry meter categorization using machine learning based on customized feature extraction","volume-title":"International Conference on Intelligent Technology, System and Service for Internet of Everything (ITSS-IoE)","author":"Alqasemi","year":"2021"},{"key":"2023111518555903300_bib5","doi-asserted-by":"publisher","first-page":"292","DOI":"10.1109\/IALP54817.2021.9675149","article-title":"Ancient Tibetan word segmentation based on deep learning","volume-title":"International Conference on Asian Language Processing (IALP)","author":"An","year":"2021"},{"key":"2023111518555903300_bib6","doi-asserted-by":"publisher","first-page":"186","DOI":"10.1016\/j.culher.2019.04.002","article-title":"A general methodology for identifying the writer of codices. Application to the celebrated \u201ctwins.\u201d","volume":"39","author":"Arabadjis","year":"2019","journal-title":"Journal of Cultural Heritage"},{"issue":"8","key":"2023111518555903300_bib7","doi-asserted-by":"publisher","first-page":"2278","DOI":"10.1016\/j.patcog.2013.01.019","article-title":"New mathematical and algorithmic schemes for pattern classification with application to the identification of writers of important ancient documents","volume":"46","author":"Arabadjis","year":"2013","journal-title":"Pattern Recognition"},{"issue":"3","key":"2023111518555903300_bib8","doi-asserted-by":"publisher","first-page":"173","DOI":"10.1007\/s10032-017-0289-3","article-title":"On writer identification for Arabic historical manuscripts","volume":"20","author":"Asi","year":"2017","journal-title":"International Journal on Document Analysis and Recognition (IJDAR)"},{"key":"2023111518555903300_bib9","doi-asserted-by":"publisher","first-page":"6368","DOI":"10.18653\/v1\/D19-1668","article-title":"Restoring ancient text using deep learning: A case study on Greek 
epigraphy","volume-title":"Empirical Methods in Natural Language Processing (EMNLP)","author":"Assael","year":"2019"},{"issue":"7900","key":"2023111518555903300_bib10","doi-asserted-by":"publisher","first-page":"280","DOI":"10.1038\/s41586-022-04448-z","article-title":"Restoring and attributing ancient texts using deep neural networks","volume":"603","author":"Assael","year":"2022","journal-title":"Nature"},{"key":"2023111518555903300_bib11","first-page":"111","article-title":"Data-driven choices in neural part-of-speech tagging for Latin","volume-title":"Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA)","author":"Bacon","year":"2020"},{"key":"2023111518555903300_bib12","article-title":"Latin BERT: A contextual language model for classical philology","author":"Bamman","year":"2020","journal-title":"arXiv preprint arXiv:2009.10053"},{"key":"2023111518555903300_bib13","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/1998076.1998078","article-title":"Measuring historical word sense variation","volume-title":"ACM\/IEEE Joint Conference on Digital Libraries","author":"Bamman","year":"2011"},{"key":"2023111518555903300_bib14","doi-asserted-by":"publisher","first-page":"123438","DOI":"10.1109\/ACCESS.2021.3110082","article-title":"A deep learning approach to ancient Egyptian hieroglyphs classification","volume":"9","author":"Barucci","year":"2021","journal-title":"IEEE Access"},{"issue":"8","key":"2023111518555903300_bib15","doi-asserted-by":"publisher","first-page":"1798","DOI":"10.1109\/TPAMI.2013.50","article-title":"Representation learning: A review and new perspectives","volume":"35","author":"Bengio","year":"2013","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"2023111518555903300_bib16","first-page":"194","article-title":"TwistBytes - Identification of Cuneiform languages and German dialects at VarDial 2019","volume-title":"Workshop on NLP for Similar Languages, Varieties and 
Dialects (VarDial)","author":"Benites de Azevedo e Souza","year":"2019"},{"key":"2023111518555903300_bib17","first-page":"313","article-title":"Simple effective decipherment via combinatorial optimization","volume-title":"Empirical Methods in Natural Language Processing (EMNLP)","author":"Berg-Kirkpatrick","year":"2011"},{"key":"2023111518555903300_bib18","doi-asserted-by":"publisher","first-page":"17","DOI":"10.18653\/v1\/W19-1402","article-title":"Improving Cuneiform language identification with BERT","volume-title":"Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial)","author":"Bernier-Colborne","year":"2019"},{"issue":"3","key":"2023111518555903300_bib19","article-title":"Comparative rates of text reuse in classical Latin hexameter poetry","volume":"9","author":"Bernstein","year":"2015","journal-title":"DHQ: Digital Humanities Quarterly"},{"key":"2023111518555903300_bib20","first-page":"153","article-title":"The SLT-interactions parsing system at the CoNLL 2018 shared task","volume-title":"CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies","author":"Bhat","year":"2018"},{"key":"2023111518555903300_bib21","doi-asserted-by":"publisher","first-page":"771","DOI":"10.1007\/978-3-030-49795-8_73","article-title":"Survey on Sanskrit script recognition","volume-title":"International Conference on Mobile Computing and Sustainable Informatics","author":"Bhurke","year":"2020"},{"key":"2023111518555903300_bib22","doi-asserted-by":"publisher","first-page":"53","DOI":"10.18653\/v1\/W15-3708","article-title":"Word embeddings pointing the way for Late Antiquity","volume-title":"SIGHUM Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities","author":"Bjerva","year":"2015"},{"key":"2023111518555903300_bib23","first-page":"1","article-title":"Rethinking intertextuality through a word-space and social network approach \u2013 the case of 
Cassiodorus","author":"Bjerva","year":"2016","journal-title":"Journal of Data Mining and Digital Humanities"},{"key":"2023111518555903300_bib24","volume-title":"Representation and Inference for Natural Language: A First Course in Computational Semantics","author":"Blackburn","year":"2005"},{"issue":"Jan","key":"2023111518555903300_bib25","first-page":"993","article-title":"Latent Dirichlet allocation","volume":"3","author":"Blei","year":"2003","journal-title":"Journal of Machine Learning Research"},{"key":"2023111518555903300_bib26","first-page":"101","article-title":"EpiDoc: Epigraphic documents in XML for publication and interchange","author":"Bodard","year":"2010","journal-title":"Latin on Stone: Epigraphic Research and Electronic Archives"},{"key":"2023111518555903300_bib27","doi-asserted-by":"publisher","first-page":"615","DOI":"10.1109\/ICDAR.2017.106","article-title":"Automating transliteration of cuneiform from parallel lines with sparse data","volume-title":"IAPR International Conference on Document Analysis and Recognition (ICDAR)","author":"Bogacz","year":"2017"},{"key":"2023111518555903300_bib28","doi-asserted-by":"publisher","first-page":"246","DOI":"10.1109\/ICFHR2020.2020.00053","article-title":"Period classification of 3D cuneiform tablets with geometric neural networks","volume-title":"International Conference on Frontiers in Handwriting Recognition (ICFHR)","author":"Bogacz","year":"2020"},{"issue":"2","key":"2023111518555903300_bib29","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3491239","article-title":"Digital Assyriology\u2014Advances in visual cuneiform analysis","volume":"15","author":"Bogacz","year":"2022","journal-title":"Journal on Computing and Cultural Heritage (JOCCH)"},{"key":"2023111518555903300_bib30","first-page":"171","article-title":"NLP-Cube: End-to-end raw text processing with neural networks","volume-title":"CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal 
Dependencies","author":"Boro\u015f","year":"2018"},{"issue":"11","key":"2023111518555903300_bib31","doi-asserted-by":"publisher","first-page":"4224","DOI":"10.1073\/pnas.1204678110","article-title":"Automated reconstruction of ancient languages using probabilistic models of sound change","volume":"110","author":"Bouchard-C\u00f4t\u00e9","year":"2013","journal-title":"Proceedings of the National Academy of Sciences (PNAS)"},{"key":"2023111518555903300_bib32","first-page":"82","article-title":"Data mining tools and GRID infrastructure for Assyriology text analysis (an Old-Babylonian situation studied through text analysis and data mining tools)","volume-title":"RAI - Rencontre Assyriologique Internationale - Private and State in the Ancient Near East","author":"Bracco","year":"2013"},{"key":"2023111518555903300_bib33","doi-asserted-by":"publisher","first-page":"65","DOI":"10.1007\/978-3-030-86549-8_5","article-title":"Context aware generation of cuneiform signs","volume-title":"International Conference on Document Analysis and Recognition","author":"Brandenbusch","year":"2021"},{"key":"2023111518555903300_bib34","first-page":"1877","article-title":"Language models are few-shot learners","volume":"33","author":"Brown","year":"2020","journal-title":"Advances in Neural Information Processing Systems"},{"key":"2023111518555903300_bib35","doi-asserted-by":"publisher","first-page":"95","DOI":"10.1007\/978-3-642-33290-6_11","article-title":"Increasing recall for text re-use in historical documents to support research in the Humanities","volume-title":"International Conference on Theory and Practice of Digital Libraries","author":"B\u00fcchler","year":"2012"},{"key":"2023111518555903300_bib36","doi-asserted-by":"publisher","first-page":"4900","DOI":"10.18653\/v1\/2021.naacl-main.389","article-title":"Profiling of intertextuality in Latin literature using word embeddings","volume-title":"North American Chapter of the Association for Computational Linguistics 
(NAACL)","author":"Burns","year":"2021"},{"issue":"3","key":"2023111518555903300_bib37","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/2905369","article-title":"Evaluating shape representations for Maya glyph classification","volume":"9","author":"Can","year":"2016","journal-title":"Journal on Computing and Cultural Heritage (JOCCH)"},{"key":"2023111518555903300_bib38","first-page":"119","article-title":"A gradient boosting-seq2seq system for Latin POS tagging and lemmatization","volume-title":"Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA)","author":"Celano","year":"2020"},{"issue":"1","key":"2023111518555903300_bib39","doi-asserted-by":"publisher","first-page":"393","DOI":"10.1515\/opli-2016-0020","article-title":"Part of speech tagging for ancient Greek","volume":"2","author":"Celano","year":"2016","journal-title":"Open Linguistics"},{"key":"2023111518555903300_bib40","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s11042-022-12673-x","article-title":"A deep learning based system for writer identification in handwritten Arabic historical manuscripts","author":"Chammas","year":"2022","journal-title":"Multimedia Tools and Applications"},{"key":"2023111518555903300_bib41","doi-asserted-by":"publisher","first-page":"1195","DOI":"10.1145\/3503161.3547925","article-title":"Sundial-GAN: A cascade generative adversarial networks framework for deciphering Oracle Bone inscriptions","volume-title":"ACM International Conference on Multimedia","author":"Chang","year":"2022"},{"key":"2023111518555903300_bib42","first-page":"55","article-title":"Towards better UD parsing: Deep contextualized word embeddings, ensemble, and treebank concatenation","volume-title":"CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies","author":"Che","year":"2018"},{"key":"2023111518555903300_bib43","doi-asserted-by":"publisher","first-page":"256","DOI":"10.18653\/v1\/K18-2026","article-title":"A simple 
yet effective joint training method for cross-lingual universal dependency parsing","volume-title":"CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies","author":"Chen","year":"2018"},{"key":"2023111518555903300_bib44","first-page":"52","article-title":"Integration of automatic sentence segmentation and lexical analysis of ancient Chinese based on BiLSTM-CRF model","volume-title":"Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA)","author":"Cheng","year":"2020"},{"issue":"11","key":"2023111518555903300_bib45","doi-asserted-by":"publisher","first-page":"290","DOI":"10.3390\/info9110290","article-title":"Annotating a low-resource language with LLOD technology: Sumerian morphology and syntax","volume":"9","author":"Chiarcos","year":"2018","journal-title":"Information"},{"key":"2023111518555903300_bib46","article-title":"PaLM: Scaling language modeling with pathways","author":"Chowdhery","year":"2022","journal-title":"arXiv preprint arXiv:2204.02311"},{"key":"2023111518555903300_bib47","doi-asserted-by":"publisher","first-page":"1505","DOI":"10.1109\/ICDAR.2019.00242","article-title":"ICDAR 2019 competition on image retrieval for historical handwritten documents","volume-title":"International Conference on Document Analysis and Recognition (ICDAR)","author":"Christlein","year":"2019"},{"key":"2023111518555903300_bib48","article-title":"Empirical evaluation of gated recurrent neural networks on sequence modeling","volume-title":"Advances in Neural Information Processing Systems Workshop on Deep Learning","author":"Chung","year":"2014"},{"issue":"2","key":"2023111518555903300_bib49","doi-asserted-by":"publisher","first-page":"221","DOI":"10.1093\/llc\/fqs033","article-title":"The Tesserae Project: Intertextual analysis of Latin poetry","volume":"28","author":"Coffee","year":"2012","journal-title":"Literary and Linguistic 
Computing"},{"key":"2023111518555903300_bib50","doi-asserted-by":"publisher","first-page":"383","DOI":"10.1353\/apa.2012.0010","article-title":"Intertextuality in the digital age","author":"Coffee","year":"2012","journal-title":"Transactions of the American Philological Association"},{"key":"2023111518555903300_bib51","doi-asserted-by":"publisher","first-page":"70","DOI":"10.1109\/VSMM.2014.7136691","article-title":"Computer-assisted reconstruction of virtual fragmented cuneiform tablets","volume-title":"International Conference on Virtual Systems & Multimedia (VSMM)","author":"Collins","year":"2014"},{"issue":"7","key":"2023111518555903300_bib52","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1371\/journal.pone.0269544","article-title":"Unsupervised deep learning supports reclassification of Bronze age cypriot writing system","volume":"17","author":"Corazza","year":"2022","journal-title":"PLOS ONE"},{"issue":"1","key":"2023111518555903300_bib53","doi-asserted-by":"publisher","first-page":"128","DOI":"10.2139\/ssrn.4214742","article-title":"Syllabic quantity patterns as rhythmic features for Latin authorship attribution","volume":"74","author":"Corbara","year":"2022","journal-title":"Journal of the Association for Information Science and Technology"},{"key":"2023111518555903300_bib54","doi-asserted-by":"publisher","first-page":"267","DOI":"10.1145\/3216122.3216163","article-title":"Data mining ancient script image data using convolutional neural networks","volume-title":"International Database Engineering & Applications Symposium","author":"Daggumati","year":"2018"},{"issue":"3","key":"2023111518555903300_bib55","doi-asserted-by":"publisher","first-page":"251","DOI":"10.1093\/library\/8.3.251","article-title":"The practice of handwriting identification","volume":"8","author":"Davis","year":"2007","journal-title":"Library"},{"key":"2023111518555903300_bib56","first-page":"99","article-title":"Arc-hybrid non-projective dependency parsing with a 
static-dynamic oracle","volume-title":"International Conference on Parsing Technologies (IWPT)","author":"de Lhoneux","year":"2017"},{"issue":"1","key":"2023111518555903300_bib57","doi-asserted-by":"publisher","first-page":"6","DOI":"10.3390\/rs14010006","article-title":"A generative and entropy-based registration approach for the reassembly of ancient inscriptions","volume":"14","author":"de Lima-Hernandez","year":"2021","journal-title":"Remote Sensing"},{"key":"2023111518555903300_bib58","doi-asserted-by":"publisher","first-page":"99","DOI":"10.1016\/j.engappai.2018.03.023","article-title":"Reliable writer identification in medieval manuscripts through page layout features: The \u201cAvila\u201d Bible case","volume":"72","author":"De Stefano","year":"2018","journal-title":"Engineering Applications of Artificial Intelligence"},{"issue":"11","key":"2023111518555903300_bib59","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s42452-019-1340-4","article-title":"Ancient Geez script recognition using deep learning","volume":"1","author":"Demilew","year":"2019","journal-title":"SN Applied Sciences"},{"issue":"12","key":"2023111518555903300_bib60","doi-asserted-by":"publisher","first-page":"e0243039","DOI":"10.1371\/journal.pone.0243039","article-title":"Deep learning of cuneiform sign detection with weak supervision using transliteration alignment","volume":"15","author":"Dencker","year":"2020","journal-title":"PLOS ONE"},{"key":"2023111518555903300_bib61","doi-asserted-by":"publisher","DOI":"10.1155\/2022\/3432330","article-title":"A deep learning approach for recognizing the cursive Tamil characters in palm leaf manuscripts","volume":"2022","author":"Devi","year":"2022","journal-title":"Computational Intelligence And Neuroscience"},{"key":"2023111518555903300_bib62","first-page":"4171","article-title":"BERT: Pre-training of deep bidirectional transformers for language understanding","volume-title":"North American Chapter of the Association for 
Computational Linguistics (NAACL)","author":"Devlin","year":"2019"},{"issue":"16","key":"2023111518555903300_bib63","doi-asserted-by":"publisher","first-page":"E3195\u2013E3204","DOI":"10.1073\/pnas.1611910114","article-title":"Quantitative criticism of literary relationships","volume":"114","author":"Dexter","year":"2017","journal-title":"Proceedings of the National Academy of Sciences (PNAS)"},{"key":"2023111518555903300_bib64","doi-asserted-by":"publisher","first-page":"693","DOI":"10.5220\/0006249706930702","article-title":"A digital palaeographic approach towards writer identification in the Dead Sea Scrolls","volume-title":"International Conference on Pattern Recognition Applications and Methods","author":"Dhali","year":"2017"},{"key":"2023111518555903300_bib65","doi-asserted-by":"publisher","first-page":"188","DOI":"10.18653\/v1\/W19-1420","article-title":"Investigating machine learning methods for language and dialect identification of cuneiform texts","volume-title":"Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial)","author":"Doostmohammadi","year":"2019"},{"key":"2023111518555903300_bib66","first-page":"34","article-title":"CEA LIST: Processing low-resource languages for CoNLL 2018","volume-title":"CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies","author":"Duthoo","year":"2018"},{"issue":"1","key":"2023111518555903300_bib67","doi-asserted-by":"publisher","first-page":"195","DOI":"10.33899\/csmj.2013.163436","article-title":"Cuneiform symbols recognition based on k-means and neural network","volume":"10","author":"Edan","year":"2013","journal-title":"AL-Rafidain Journal of Computer Sciences and Mathematics"},{"issue":"17","key":"2023111518555903300_bib68","doi-asserted-by":"publisher","first-page":"4664","DOI":"10.1073\/pnas.1522200113","article-title":"Algorithmic handwriting analysis of Judah\u2019s military correspondence sheds light on composition of biblical 
texts","volume":"113","author":"Faigenbaum-Golovin","year":"2016","journal-title":"Proceedings of the National Academy of Sciences (PNAS)"},{"issue":"1","key":"2023111518555903300_bib69","doi-asserted-by":"publisher","first-page":"90","DOI":"10.1109\/MBITS.2022.3197559","article-title":"Computational handwriting analysis of ancient Hebrew inscriptions\u2014A survey","volume":"2","author":"Faigenbaum-Golovin","year":"2022","journal-title":"IEEE BITS the Information Theory Magazine"},{"key":"2023111518555903300_bib70","doi-asserted-by":"publisher","first-page":"743","DOI":"10.1109\/ICFHR.2014.130","article-title":"Document writer analysis with rejection for historical Arabic manuscripts","volume-title":"International Conference on Frontiers in Handwriting Recognition","author":"Fecker","year":"2014"},{"key":"2023111518555903300_bib71","doi-asserted-by":"publisher","first-page":"3050","DOI":"10.1109\/ICPR.2014.526","article-title":"Writer identification for historical Arabic documents","volume-title":"International Conference on Pattern Recognition","author":"Fecker","year":"2014"},{"issue":"37","key":"2023111518555903300_bib72","doi-asserted-by":"publisher","first-page":"22743","DOI":"10.1073\/pnas.2003794117","article-title":"Restoration of fragmentary Babylonian texts using recurrent neural networks","volume":"117","author":"Fetaya","year":"2020","journal-title":"Proceedings of the National Academy of Sciences (PNAS)"},{"key":"2023111518555903300_bib73","doi-asserted-by":"publisher","first-page":"1377","DOI":"10.1109\/ICDAR.2017.225","article-title":"ICDAR2017 competition on historical document writer identification","volume-title":"IAPR International Conference on Document Analysis and Recognition (ICDAR)","author":"Fiel","year":"2017"},{"key":"2023111518555903300_bib74","doi-asserted-by":"publisher","first-page":"102","DOI":"10.1016\/j.patrec.2020.02.017","article-title":"Machine learning for cultural heritage: A 
survey","volume":"133","author":"Fiorucci","year":"2020","journal-title":"Pattern Recognition Letters"},{"key":"2023111518555903300_bib75","doi-asserted-by":"publisher","first-page":"263","DOI":"10.1145\/3219819.3219879","article-title":"Towards knowledge discovery from the Vatican secret archives. In Codice Ratio - episode 1: Machine transcription of the manuscripts","volume-title":"ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","author":"Firmani","year":"2018"},{"issue":"3","key":"2023111518555903300_bib76","doi-asserted-by":"publisher","first-page":"285","DOI":"10.1093\/llc\/fqr029","article-title":"Evidence of intertextuality: Investigating Paul the Deacon\u2019s Angustae Vitae","volume":"26","author":"Forstall","year":"2011","journal-title":"Literary and Linguistic Computing"},{"key":"2023111518555903300_bib77","volume-title":"Computer Vision: A Modern Approach","author":"Forsyth","year":"2011"},{"key":"2023111518555903300_bib78","doi-asserted-by":"publisher","first-page":"765","DOI":"10.1145\/2502081.2502199","article-title":"Automatic Egyptian hieroglyph recognition by retrieving images as texts","volume-title":"ACM International Conference on Multimedia","author":"Franken","year":"2013"},{"issue":"4","key":"2023111518555903300_bib79","doi-asserted-by":"publisher","first-page":"305","DOI":"10.1007\/s10044-005-0013-7","article-title":"An efficient segmentation-free approach to assist old Greek handwritten manuscript OCR","volume":"8","author":"Gatos","year":"2006","journal-title":"Pattern Analysis and Applications"},{"key":"2023111518555903300_bib80","doi-asserted-by":"publisher","first-page":"52","DOI":"10.18653\/v1\/W19-2507","article-title":"Stylometric classification of ancient Greek literary texts by genre","volume-title":"SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and 
Literature","author":"Gianitsos","year":"2019"},{"issue":"1","key":"2023111518555903300_bib81","doi-asserted-by":"publisher","first-page":"98","DOI":"10.1177\/0142064X19855583","article-title":"Dating ancient Egyptian papyri through Raman spectroscopy: Concept and application to the fragments of the Gospel of Jesus\u2019 wife and the Gospel of John","volume":"42","author":"Goler","year":"2019","journal-title":"Journal for the Study of the New Testament"},{"key":"2023111518555903300_bib82","volume-title":"Deep Learning","author":"Goodfellow","year":"2016"},{"issue":"11","key":"2023111518555903300_bib83","doi-asserted-by":"publisher","first-page":"139","DOI":"10.1145\/3422622","article-title":"Generative adversarial networks","volume":"63","author":"Goodfellow","year":"2020","journal-title":"Communications of the ACM"},{"issue":"10","key":"2023111518555903300_bib84","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1371\/journal.pone.0240511","article-title":"Reading Akkadian cuneiform using Natural Language Processing","volume":"15","author":"Gordin","year":"2020","journal-title":"PLOS ONE"},{"key":"2023111518555903300_bib85","article-title":"Learning word vectors for 157 languages","volume-title":"Language Resources and Evaluation Conference (LREC)","author":"Grave","year":"2018"},{"issue":"3","key":"2023111518555903300_bib86","doi-asserted-by":"publisher","first-page":"251","DOI":"10.1093\/llc\/fqm020","article-title":"Quantitative authorship attribution: An evaluation of techniques","volume":"22","author":"Grieve","year":"2007","journal-title":"Literary and Linguistic Computing"},{"key":"2023111518555903300_bib87","doi-asserted-by":"publisher","first-page":"121","DOI":"10.1007\/978-3-030-37191-3_7","article-title":"Classification and detection of symbols in ancient papyri","volume-title":"Visual Computing for Cultural 
Heritage","author":"Haliassos","year":"2020"},{"key":"2023111518555903300_bib88","doi-asserted-by":"publisher","first-page":"102228","DOI":"10.1016\/j.jasrep.2020.102228","article-title":"Establishing the provenance of the Nazareth Inscription: Using stable isotopes to resolve a historic controversy and trace ancient marble production","volume":"30","author":"Harper","year":"2020","journal-title":"Journal of Archaeological Science: Reports"},{"key":"2023111518555903300_bib89","doi-asserted-by":"publisher","first-page":"770","DOI":"10.1109\/CVPR.2016.90","article-title":"Deep residual learning for image recognition","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"He","year":"2016"},{"key":"2023111518555903300_bib90","doi-asserted-by":"publisher","first-page":"159","DOI":"10.1016\/j.patcog.2016.03.032","article-title":"Image-based historical manuscript dating using contour and stroke fragments","volume":"58","author":"He","year":"2016","journal-title":"Pattern Recognition"},{"key":"2023111518555903300_bib91","doi-asserted-by":"publisher","first-page":"41","DOI":"10.1007\/978-3-319-23980-4_3","article-title":"Morphological disambiguation of classical Sanskrit","volume-title":"International Workshop on Systems and Frameworks for Computational Morphology","author":"Hellwig","year":"2015"},{"key":"2023111518555903300_bib92","first-page":"288","article-title":"Detecting sentence boundaries in Sanskrit texts","volume-title":"International Conference on Computational Linguistics: Technical Papers (COLING)","author":"Hellwig","year":"2016"},{"issue":"8","key":"2023111518555903300_bib93","doi-asserted-by":"publisher","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","article-title":"Long short-term memory","volume":"9","author":"Hochreiter","year":"1997","journal-title":"Neural 
Computation"},{"issue":"3","key":"2023111518555903300_bib94","doi-asserted-by":"publisher","first-page":"111","DOI":"10.1093\/llc\/13.3.111","article-title":"The evolution of stylometry in humanities scholarship","volume":"13","author":"Holmes","year":"1998","journal-title":"Literary and Linguistic Computing"},{"key":"2023111518555903300_bib95","first-page":"4067","article-title":"Word segmentation for Akkadian cuneiform","volume-title":"Language Resources and Evaluation Conference (LREC)","author":"Homburg","year":"2016"},{"key":"2023111518555903300_bib96","doi-asserted-by":"publisher","first-page":"5456","DOI":"10.1145\/3503161.3548338","article-title":"AGTGAN: Unpaired image translation for photographic ancient character generation","volume-title":"ACM International Conference on Multimedia","author":"Huang","year":"2022"},{"key":"2023111518555903300_bib97","first-page":"15","article-title":"Classical Chinese sentence segmentation","volume-title":"CIPS-SIGHAN Joint Conference on Chinese Language Processing","author":"Huang","year":"2010"},{"issue":"1","key":"2023111518555903300_bib98","doi-asserted-by":"publisher","first-page":"106","DOI":"10.1113\/jphysiol.1962.sp006837","article-title":"Receptive fields, binocular interaction and functional architecture in the cat\u2019s visual cortex","volume":"160","author":"Hubel","year":"1962","journal-title":"The Journal of Physiology"},{"key":"2023111518555903300_bib99","doi-asserted-by":"publisher","first-page":"89","DOI":"10.18653\/v1\/W19-1409","article-title":"Language and dialect identification of cuneiform texts","volume-title":"Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial)","author":"Jauhiainen","year":"2019"},{"key":"2023111518555903300_bib100","first-page":"1","article-title":"ELMoLex: Connecting ELMo and lexicon features for dependency parsing","volume-title":"CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal 
Dependencies","author":"Jawahar","year":"2018"},{"key":"2023111518555903300_bib101","first-page":"248","article-title":"AntNLP at CoNLL 2018 shared task: A graph-based parser for universal dependency parsing","volume-title":"CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies","author":"Ji","year":"2018"},{"key":"2023111518555903300_bib102","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s11042-022-13709-y","article-title":"Text line segmentation in Indian ancient handwritten documents using faster R-CNN","author":"Jindal","year":"2022","journal-title":"Multimedia Tools and Applications"},{"key":"2023111518555903300_bib103","doi-asserted-by":"publisher","first-page":"20","DOI":"10.18653\/v1\/2021.acl-demo.3","article-title":"The Classical Language Toolkit: An NLP framework for pre-modern languages","volume-title":"Association for Computational Linguistics","author":"Johnson","year":"2021"},{"key":"2023111518555903300_bib104","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3529399.3529400","article-title":"Machine learning in textual criticism: An examination of the performance of supervised machine learning algorithms in reconstructing the text of the Greek New Testament","volume-title":"2022 7th International Conference on Machine Learning Technologies (ICMLT)","author":"Jones","year":"2022"},{"key":"2023111518555903300_bib105","first-page":"133","article-title":"Turku neural parser pipeline: An end-to-end system for the CoNLL 2018 shared task","volume-title":"CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies","author":"Kanerva","year":"2018"},{"key":"2023111518555903300_bib106","doi-asserted-by":"publisher","first-page":"4031","DOI":"10.18653\/v1\/2021.naacl-main.317","article-title":"Restoring and mining the records of the Joseon dynasty via neural language modeling and machine translation","volume-title":"North American Chapter of the Association for Computational 
Linguistics (NAACL)","author":"Kang","year":"2021"},{"key":"2023111518555903300_bib107","volume-title":"Computational pattern recognition in Linear A","author":"Karajgikar","year":"2021"},{"key":"2023111518555903300_bib108","first-page":"123","article-title":"Classifying Latin inscriptions of the Roman empire: A machine-learning approach","volume-title":"Workshop on Computational Humanities Research","author":"Ka\u0161e","year":"2021"},{"key":"2023111518555903300_bib109","doi-asserted-by":"publisher","first-page":"V\u2013V","DOI":"10.1109\/ISCAS.2003.1206399","article-title":"Hybrid neural network architecture for age identification of ancient Kannada scripts","volume-title":"International Symposium on Circuits and Systems","author":"Kashyap","year":"2003"},{"key":"2023111518555903300_bib110","first-page":"59","article-title":"Automatic semantic role labeling in ancient Greek using distributional semantic modeling","volume-title":"Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA)","author":"Keersmaekers","year":"2020"},{"key":"2023111518555903300_bib111","doi-asserted-by":"publisher","first-page":"109","DOI":"10.18653\/v1\/W19-7812","article-title":"Creating, enriching and valorizing treebanks of ancient Greek","volume-title":"International Workshop on Treebanks and Linguistic Theories (TLT)","author":"Keersmaekers","year":"2019"},{"key":"2023111518555903300_bib112","doi-asserted-by":"publisher","first-page":"86","DOI":"10.1016\/j.eswa.2016.06.029","article-title":"Authenticating the writings of Julius Caesar","volume":"63","author":"Kestemont","year":"2016","journal-title":"Expert Systems with Applications"},{"key":"2023111518555903300_bib113","first-page":"124","article-title":"Tree-stack LSTM in transition based dependency parsing","volume-title":"CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal 
Dependencies","author":"K\u0131rnap","year":"2018"},{"issue":"2","key":"2023111518555903300_bib114","first-page":"211","article-title":"The un-Platonic Menexenus: A stylometric analysis with more data","volume":"60","author":"Koentges","year":"2020","journal-title":"Greek, Roman, and Byzantine Studies"},{"key":"2023111518555903300_bib115","first-page":"1","article-title":"Measuring philosophy in the first thousand years of Greek literature","author":"K\u00f6ntges","year":"2020","journal-title":"Digital Classics Online"},{"key":"2023111518555903300_bib116","doi-asserted-by":"publisher","first-page":"40","DOI":"10.18653\/v1\/W16-0205","article-title":"Reconstructing ancient literary texts from noisy manuscripts","volume-title":"Workshop on Computational Linguistics for Literature","author":"Koppel","year":"2016"},{"issue":"1","key":"2023111518555903300_bib117","doi-asserted-by":"publisher","first-page":"9","DOI":"10.1002\/asi.20961","article-title":"Computational methods in authorship attribution","volume":"60","author":"Koppel","year":"2009","journal-title":"Journal of the American Society for Information Science and Technology"},{"issue":"1","key":"2023111518555903300_bib118","doi-asserted-by":"publisher","first-page":"178","DOI":"10.1002\/asi.22954","article-title":"Determining if two documents are written by the same author","volume":"65","author":"Koppel","year":"2014","journal-title":"Journal of the Association for Information Science and Technology"},{"key":"2023111518555903300_bib119","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s10489-022-04046-6","article-title":"Zero-shot learning based cross-lingual sentiment analysis for Sanskrit text with insufficient labeled data","author":"Kumar","year":"2022","journal-title":"Applied Intelligence"},{"key":"2023111518555903300_bib120","doi-asserted-by":"publisher","first-page":"3553","DOI":"10.1109\/TIFS.2020.2991880","article-title":"Encoding pathlet and SIFT features with bagged VLAD for historical
writer identification","volume":"15","author":"Lai","year":"2020","journal-title":"IEEE Transactions on Information Forensics and Security"},{"key":"2023111518555903300_bib121","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.emnlp-main.384","article-title":"Filling the gaps in ancient Akkadian texts: A masked language modeling approach","author":"Lazar","year":"2021","journal-title":"arXiv preprint arXiv:2109.04513"},{"issue":"7553","key":"2023111518555903300_bib122","doi-asserted-by":"publisher","first-page":"436","DOI":"10.1038\/nature14539","article-title":"Deep learning","volume":"521","author":"LeCun","year":"2015","journal-title":"Nature"},{"key":"2023111518555903300_bib123","first-page":"472","article-title":"A computational model of text reuse in ancient literary texts","volume-title":"Annual Meeting of the Association of Computational Linguistics","author":"Lee","year":"2007"},{"key":"2023111518555903300_bib124","first-page":"135","article-title":"The first international ancient Chinese word segmentation and POS tagging bakeoff: Overview of the EvaHan 2022 evaluation campaign","volume-title":"Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA)","author":"Li","year":"2022"},{"key":"2023111518555903300_bib125","doi-asserted-by":"publisher","first-page":"70874","DOI":"10.1109\/ACCESS.2018.2881280","article-title":"Capsules based Chinese word segmentation for ancient Chinese medical books","volume":"6","author":"Li","year":"2018","journal-title":"IEEE Access"},{"key":"2023111518555903300_bib126","first-page":"65","article-title":"Joint learning of POS and dependencies for multilingual universal dependency parsing","volume-title":"CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies","author":"Li","year":"2018"},{"key":"2023111518555903300_bib127","doi-asserted-by":"publisher","first-page":"72","DOI":"10.1109\/PRML56267.2022.9882261","article-title":"Research on multi-line recognition 
algorithm for Tibetan document","volume-title":"2022 3rd International Conference on Pattern Recognition and Machine Learning (PRML)","author":"Liu","year":"2022"},{"key":"2023111518555903300_bib128","doi-asserted-by":"publisher","first-page":"3146","DOI":"10.18653\/v1\/P19-1303","article-title":"Neural decipherment via minimum-cost flow: From Ugaritic to Linear B","volume-title":"Annual Meeting of the Association for Computational Linguistics","author":"Luo","year":"2019"},{"key":"2023111518555903300_bib129","doi-asserted-by":"publisher","first-page":"69","DOI":"10.1162\/tacl_a_00354","article-title":"Deciphering undersegmented ancient scripts using phonetic prior","volume":"9","author":"Luo","year":"2021","journal-title":"Transactions of the Association for Computational Linguistics"},{"key":"2023111518555903300_bib130","volume-title":"Foundations of Statistical Natural Language Processing","author":"Manning","year":"1999"},{"issue":"2","key":"2023111518555903300_bib131","doi-asserted-by":"publisher","first-page":"347","DOI":"10.1093\/llc\/fqx021","article-title":"Devising Rhesus: A strange collaboration between Aeschylus and Euripides","volume":"33","author":"Manousakis","year":"2018","journal-title":"Digital Scholarship in the Humanities"},{"issue":"5","key":"2023111518555903300_bib132","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s42979-020-00286-w","article-title":"The computerization of archaeology: Survey on artificial intelligence techniques","volume":"1","author":"Mantovan","year":"2020","journal-title":"SN Computer Science"},{"issue":"1","key":"2023111518555903300_bib133","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s41109-021-00390-7","article-title":"Historia Augusta authorship: An approach based on measurements of complex networks","volume":"6","author":"Martins","year":"2021","journal-title":"Applied Network 
Science"},{"issue":"2","key":"2023111518555903300_bib134","doi-asserted-by":"publisher","first-page":"285","DOI":"10.1007\/s10814-021-09162-4","article-title":"Archaeology and epigraphy in the digital era","volume":"30","author":"Matsumoto","year":"2022","journal-title":"Journal of Archaeological Research"},{"key":"2023111518555903300_bib135","article-title":"The challenges and prospects of the intersection of humanities and data science: A white paper from the Alan Turing Institute","author":"McGillivray","year":"2020","journal-title":"Alan Turing Institute"},{"issue":"4","key":"2023111518555903300_bib136","doi-asserted-by":"publisher","first-page":"1093","DOI":"10.1016\/j.asej.2014.04.011","article-title":"Sentiment analysis algorithms and applications: A survey","volume":"5","author":"Medhat","year":"2014","journal-title":"Ain Shams Engineering Journal"},{"key":"2023111518555903300_bib137","doi-asserted-by":"publisher","first-page":"4460","DOI":"10.18653\/v1\/2021.naacl-main.353","article-title":"Ab antiquo: Neural proto-language reconstruction","volume-title":"North American Chapter of the Association for Computational Linguistics (NAACL)","author":"Meloni","year":"2021"},{"key":"2023111518555903300_bib138","first-page":"189","article-title":"An electra model for Latin token tagging tasks","volume-title":"Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA)","author":"Mercelis","year":"2022"},{"key":"2023111518555903300_bib139","article-title":"Distributed representations of words and phrases and their compositionality","volume":"26","author":"Mikolov","year":"2013","journal-title":"Advances in Neural Information Processing Systems"},{"key":"2023111518555903300_bib140","doi-asserted-by":"publisher","first-page":"726","DOI":"10.1109\/ICDAR.2019.00121","article-title":"GRK-papyri: A dataset of Greek handwriting on papyri for the task of writer identification","volume-title":"International Conference on Document Analysis and Recognition 
(ICDAR)","author":"Mohammed","year":"2019"},{"issue":"4","key":"2023111518555903300_bib141","doi-asserted-by":"publisher","first-page":"1031","DOI":"10.1016\/S0031-3203(02)00112-7","article-title":"Visual enhancement of incised text","volume":"36","author":"Molton","year":"2003","journal-title":"Pattern Recognition"},{"key":"2023111518555903300_bib142","doi-asserted-by":"publisher","first-page":"257","DOI":"10.1163\/9789004375086_010","article-title":"Using quantitative methods for measuring inter-textual relations in cuneiform","author":"Monroe","year":"2018","journal-title":"Digital Biblical Studies"},{"key":"2023111518555903300_bib143","doi-asserted-by":"publisher","first-page":"1849","DOI":"10.18653\/v1\/D16-1190","article-title":"Non-literal text reuse in historical texts: An approach to identify reuse transformations and its application to bible reuse","volume-title":"Empirical Methods in Natural Language Processing (EMNLP)","author":"Moritz","year":"2016"},{"key":"2023111518555903300_bib144","doi-asserted-by":"publisher","first-page":"119","DOI":"10.5220\/0005035401190123","article-title":"Intelligent recognition of ancient Persian cuneiform characters","volume-title":"International Conference on Neural Computation Theory and Applications","author":"Mostofi","year":"2014"},{"key":"2023111518555903300_bib145","doi-asserted-by":"publisher","first-page":"125","DOI":"10.1109\/MIUCC55081.2022.9781784","article-title":"Hieroglyphs language translator using deep learning techniques (Scriba)","volume-title":"International Mobile, Intelligent, and Ubiquitous Computing Conference (MIUCC)","author":"Moustafa","year":"2022"},{"issue":"6","key":"2023111518555903300_bib146","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s12046-019-1126-9","article-title":"Devanagari ancient documents recognition using statistical feature extraction 
techniques","volume":"44","author":"Narang","year":"2019","journal-title":"S\u0101dhan\u0101"},{"issue":"22","key":"2023111518555903300_bib147","doi-asserted-by":"publisher","first-page":"17279","DOI":"10.1007\/s00500-020-05018-z","article-title":"On the recognition of Devanagari ancient handwritten characters using SIFT and Gabor features","volume":"24","author":"Narang","year":"2020","journal-title":"Soft Computing"},{"issue":"8","key":"2023111518555903300_bib148","doi-asserted-by":"publisher","first-page":"5517","DOI":"10.1007\/s10462-020-09827-4","article-title":"Ancient text recognition: A review","volume":"53","author":"Narang","year":"2020","journal-title":"Artificial Intelligence Review"},{"issue":"13","key":"2023111518555903300_bib149","doi-asserted-by":"publisher","first-page":"20671","DOI":"10.1007\/s11042-021-10775-6","article-title":"DeepNetDevanagari: A deep learning model for Devanagari ancient character recognition","volume":"80","author":"Narang","year":"2021","journal-title":"Multimedia Tools and Applications"},{"key":"2023111518555903300_bib150","doi-asserted-by":"publisher","first-page":"229","DOI":"10.1007\/978-3-030-71804-6_17","article-title":"Learning features for writer identification from handwriting on papyri","volume-title":"Mediterranean Conference on Pattern Recognition and Artificial Intelligence","author":"Nasir","year":"2020"},{"key":"2023111518555903300_bib151","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/K17-3014","article-title":"An improved neural network model for joint POS tagging and dependency parsing","author":"Nguyen","year":"2018","journal-title":"arXiv preprint arXiv:1807.03955"},{"key":"2023111518555903300_bib152","doi-asserted-by":"publisher","first-page":"400","DOI":"10.1007\/978-3-030-86549-8_26","article-title":"On the use of attention in deep learning based denoising method for ancient Cham inscription images","volume-title":"International Conference on Document Analysis and 
Recognition","author":"Nguyen","year":"2021"},{"issue":"2","key":"2023111518555903300_bib153","doi-asserted-by":"publisher","first-page":"179","DOI":"10.1007\/s10032-006-0031-z","article-title":"An old Greek handwritten OCR system based on an efficient segmentation-free approach","volume":"9","author":"Ntzios","year":"2007","journal-title":"International Journal on Document Analysis and Recognition (IJDAR)"},{"key":"2023111518555903300_bib154","doi-asserted-by":"publisher","first-page":"139","DOI":"10.1145\/3322905.3322930","article-title":"Stylometry of literary papyri","volume-title":"International Conference on Digital Access to Textual Cultural Heritage","author":"Ochab","year":"2019"},{"key":"2023111518555903300_bib155","doi-asserted-by":"publisher","first-page":"44","DOI":"10.1109\/ICCITechnol.2012.6285841","article-title":"Authorship attribution of ancient texts written by ten Arabic travelers using a SMO-SVM classifier","volume-title":"International Conference on Communications and Information Technology (ICCIT)","author":"Ouamour","year":"2012"},{"key":"2023111518555903300_bib156","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1109\/CITS.2013.6705713","article-title":"Authorship attribution of ancient texts written by ten Arabic travelers using character n-grams","volume-title":"International Conference on Computer, Information and Telecommunication Systems (CITS)","author":"Ouamour","year":"2013"},{"key":"2023111518555903300_bib157","doi-asserted-by":"publisher","first-page":"144","DOI":"10.1109\/CyberC.2013.31","article-title":"Authorship attribution of short historical Arabic texts based on lexical features","volume-title":"International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery","author":"Ouamour","year":"2013"},{"key":"2023111518555903300_bib158","doi-asserted-by":"publisher","first-page":"479","DOI":"10.1007\/978-3-319-99579-3_50","article-title":"A comparative survey of authorship attribution on short Arabic 
texts","volume-title":"International Conference on Speech and Computer","author":"Ouamour","year":"2018"},{"key":"2023111518555903300_bib159","doi-asserted-by":"publisher","first-page":"209","DOI":"10.18653\/v1\/W19-1423","article-title":"Experiments in cuneiform language identification","volume-title":"Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial)","author":"Paetzold","year":"2019"},{"key":"2023111518555903300_bib160","article-title":"Deep learning the Indus script","author":"Palaniappan","year":"2017","journal-title":"arXiv preprint arXiv:1702.00523"},{"key":"2023111518555903300_bib161","first-page":"1","article-title":"NER on ancient Greek with minimal annotation","volume-title":"Digital Humanities 2020","author":"Palladino","year":"2020"},{"key":"2023111518555903300_bib162","first-page":"11","article-title":"Tokenization and sentence segmentation","author":"Palmer","year":"2000","journal-title":"Handbook of Natural Language Processing"},{"issue":"8","key":"2023111518555903300_bib163","doi-asserted-by":"publisher","first-page":"1404","DOI":"10.1109\/TPAMI.2008.201","article-title":"Automatic writer identification of ancient Greek inscriptions","volume":"31","author":"Panagopoulos","year":"2008","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"2023111518555903300_bib164","doi-asserted-by":"publisher","first-page":"290","DOI":"10.1007\/978-3-031-13324-4_25","article-title":"PergaNet: A deep learning framework for automatic appearance-based analysis of ancient parchment collections","volume-title":"International Conference on Image Analysis and Processing","author":"Paolanti","year":"2022"},{"key":"2023111518555903300_bib165","doi-asserted-by":"publisher","first-page":"101","DOI":"10.1145\/3411408.3411410","article-title":"NLP for the Greek language: A brief survey","volume-title":"Hellenic Conference on Artificial 
Intelligence","author":"Papantoniou","year":"2020"},{"key":"2023111518555903300_bib166","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1109\/AIS.2010.5547045","article-title":"Handwriting automatic classification: Application to ancient Greek inscriptions","volume-title":"International Conference on Autonomous and Intelligent System","author":"Papaodysseus","year":"2010"},{"key":"2023111518555903300_bib167","doi-asserted-by":"publisher","first-page":"57","DOI":"10.1016\/j.cviu.2014.01.003","article-title":"Identifying the writer of ancient inscriptions and Byzantine codices. A novel approach","volume":"121","author":"Papaodysseus","year":"2014","journal-title":"Computer Vision and Image Understanding"},{"key":"2023111518555903300_bib168","doi-asserted-by":"publisher","DOI":"10.21203\/rs.3.rs-2272076\/v1","article-title":"Dating Greek papyri images with machine learning","author":"Paparigopoulou","year":"2022","journal-title":"ICDAR Workshop on Computational Paleography"},{"key":"2023111518555903300_bib169","doi-asserted-by":"publisher","DOI":"10.1145\/3593431","article-title":"A generative model for the Mycenaean Linear B script and its application in infilling text from ancient tablets","author":"Papavassileiou","year":"2022","journal-title":"ACM Journal on Computing and Cultural Heritage"},{"key":"2023111518555903300_bib170","first-page":"2552","article-title":"A dataset of Mycenaean Linear B sequences","volume-title":"Language Resources and Evaluation Conference","author":"Papavassiliou","year":"2020"},{"key":"2023111518555903300_bib171","doi-asserted-by":"publisher","first-page":"116617","DOI":"10.1109\/ACCESS.2020.3004879","article-title":"Ancient Korean neural machine translation","volume":"8","author":"Park","year":"2020","journal-title":"IEEE Access"},{"key":"2023111518555903300_bib172","article-title":"Priming ancient Korean neural machine translation","volume-title":"Language Resources and Evaluation Conference 
(LREC)","author":"Park","year":"2022"},{"issue":"5","key":"2023111518555903300_bib173","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1371\/journal.pone.0215775","article-title":"From invisibility to readability: Recovering the ink of Herculaneum","volume":"14","author":"Parker","year":"2019","journal-title":"PLOS ONE"},{"key":"2023111518555903300_bib174","doi-asserted-by":"publisher","first-page":"45","DOI":"10.1007\/s42803-022-00046-7","article-title":"Computational authorship analysis of the Homeric poems","volume":"4","author":"Pavlopoulos","year":"2022","journal-title":"International Journal of Digital Humanities"},{"key":"2023111518555903300_bib175","first-page":"7071","article-title":"Sentiment analysis of Homeric text: The 1st Book of Iliad","volume-title":"Language Resources and Evaluation Conference (LREC)","author":"Pavlopoulos","year":"2022"},{"key":"2023111518555903300_bib176","doi-asserted-by":"publisher","first-page":"56","DOI":"10.18653\/v1\/W19-4707","article-title":"GASC: Genre-aware semantic change for ancient Greek","volume-title":"International Workshop on Computational Approaches to Historical Language Change","author":"Perrone","year":"2019"},{"key":"2023111518555903300_bib177","doi-asserted-by":"publisher","first-page":"78","DOI":"10.1145\/3352631.3352646","article-title":"Papy-S-Net: A Siamese network to match papyrus fragments","volume-title":"International Workshop on Historical Document Imaging and Processing","author":"Pirrone","year":"2019"},{"issue":"4","key":"2023111518555903300_bib178","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1371\/journal.pone.0249769","article-title":"Artificial intelligence based writer identification generates new evidence for the unknown scribes of the Dead Sea Scrolls exemplified by the Great Isaiah Scroll (1qisaa)","volume":"16","author":"Popovi\u0107","year":"2021","journal-title":"PLOS 
ONE"},{"key":"2023111518555903300_bib179","doi-asserted-by":"publisher","first-page":"3454","DOI":"10.18653\/v1\/2020.coling-main.308","article-title":"Towards the first machine translation system for Sumerian transliterations","volume-title":"International Conference on Computational Linguistics","author":"Punia","year":"2020"},{"key":"2023111518555903300_bib180","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/K18-2016","article-title":"Universal dependency parsing from scratch","author":"Qi","year":"2019","journal-title":"arXiv preprint arXiv:1901.10457"},{"key":"2023111518555903300_bib181","doi-asserted-by":"publisher","first-page":"790","DOI":"10.1109\/ICCMC.2017.8282574","article-title":"Grantha script recognition from ancient palm leaves using histogram of orientation shape context","volume-title":"International Conference on Computing Methodologies and Communication (ICCMC)","author":"Raj","year":"2017"},{"issue":"5931","key":"2023111518555903300_bib182","doi-asserted-by":"publisher","first-page":"1165","DOI":"10.1126\/science.1170391","article-title":"Entropic evidence for linguistic structure in the Indus script","volume":"324","author":"Rao","year":"2009","journal-title":"Science"},{"issue":"33","key":"2023111518555903300_bib183","doi-asserted-by":"publisher","first-page":"13685","DOI":"10.1073\/pnas.0906237106","article-title":"A Markov model of the Indus script","volume":"106","author":"Rao","year":"2009","journal-title":"Proceedings of the National Academy of Sciences (PNAS)"},{"issue":"4","key":"2023111518555903300_bib184","doi-asserted-by":"publisher","first-page":"795","DOI":"10.1162\/coli_c_00030","article-title":"Entropy, the Indus script, and language: A reply to R. 
Sproat","volume":"36","author":"Rao","year":"2010","journal-title":"Computational Linguistics"},{"issue":"2","key":"2023111518555903300_bib185","first-page":"118","article-title":"Authorship attribution in historical and literary texts by a deep learning classifier","volume":"1","author":"Reisi","year":"2020","journal-title":"Journal of Applied Intelligent Systems and Information Sciences"},{"key":"2023111518555903300_bib186","doi-asserted-by":"publisher","first-page":"617","DOI":"10.1109\/MWSCAS47672.2021.9531798","article-title":"A hybrid capsule network-based deep learning framework for deciphering ancient scripts with scarce annotations: A case study on Phoenician epigraphy","volume-title":"IEEE International Midwest Symposium on Circuits and Systems (MWSCAS)","author":"Rizk","year":"2021"},{"key":"2023111518555903300_bib187","doi-asserted-by":"publisher","DOI":"10.1093\/actrade\/9780199567782.001.0001","volume-title":"Writing and Script: A Very Short Introduction","author":"Robinson","year":"2009"},{"key":"2023111518555903300_bib188","doi-asserted-by":"publisher","first-page":"307","DOI":"10.1484\/J.RHT.5.101260","article-title":"Towards generating a stemma of complicated manuscript traditions: Petrus Alfonsi\u2019s Dialogus","volume":"5","author":"Roelli","year":"2010","journal-title":"Revue d\u2019Histoire des Textes"},{"issue":"4","key":"2023111518555903300_bib189","doi-asserted-by":"publisher","first-page":"417","DOI":"10.1093\/llc\/fqp002","article-title":"Evaluating methods for computer-assisted stemmatology using artificial benchmark data sets","volume":"24","author":"Roos","year":"2009","journal-title":"Literary and Linguistic Computing"},{"key":"2023111518555903300_bib190","article-title":"Semi-supervised neural system for tagging, parsing and lematization","author":"Rybak","year":"2020","journal-title":"arXiv preprint arXiv:2004.12450"},{"key":"2023111518555903300_bib191","unstructured":"Sahala, Aleksi\n          . 2021. 
Contributions to Computational Assyriology. Ph.D. thesis, Helsingin yliopisto."},{"key":"2023111518555903300_bib192","article-title":"Automated phonological transcription of Akkadian cuneiform text","volume-title":"Language Resources and Evaluation Conference (LREC)","author":"Sahala","year":"2020"},{"key":"2023111518555903300_bib193","article-title":"BabyFST: Towards a finite-state based computational model of ancient Babylonian","volume-title":"Language Resources and Evaluation Conference (LREC)","author":"Sahala","year":"2020"},{"issue":"1","key":"2023111518555903300_bib194","doi-asserted-by":"publisher","first-page":"204","DOI":"10.1093\/llc\/fqu058","article-title":"The sense of a connection: Automatic tracing of intertextuality by meaning","volume":"31","author":"Scheirer","year":"2016","journal-title":"Digital Scholarship in the Humanities"},{"key":"2023111518555903300_bib195","doi-asserted-by":"publisher","first-page":"216","DOI":"10.1109\/ICFHR2020.2020.00048","article-title":"ICFHR 2020 competition on image retrieval for historical handwritten fragments","volume-title":"International Conference on Frontiers in Handwriting Recognition (ICFHR)","author":"Seuret","year":"2020"},{"key":"2023111518555903300_bib196","unstructured":"Shaus, Arie\n          . 2017. Computer Vision and Machine Learning Methods for Analyzing First Temple Period Inscriptions. Ph.D. 
thesis, Tel Aviv University."},{"key":"2023111518555903300_bib197","doi-asserted-by":"publisher","first-page":"5186","DOI":"10.18653\/v1\/2020.emnlp-main.420","article-title":"Blank language models","volume-title":"Empirical Methods in Natural Language Processing (EMNLP)","author":"Shen","year":"2020"},{"key":"2023111518555903300_bib198","doi-asserted-by":"publisher","first-page":"128","DOI":"10.18653\/v1\/2021.latechclfl-1.15","article-title":"A pilot study for BERT language modeling and morphological analysis for ancient and medieval Greek","volume-title":"SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature","author":"Singh","year":"2021"},{"key":"2023111518555903300_bib199","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/K18-2011","article-title":"82 treebanks, 34 models: Universal dependency parsing with multi-treebank models","author":"Smith","year":"2018","journal-title":"arXiv preprint arXiv:1809.02237"},{"key":"2023111518555903300_bib200","first-page":"1048","article-title":"A statistical model for lost language decipherment","volume-title":"Association for Computational Linguistics","author":"Snyder","year":"2010"},{"key":"2023111518555903300_bib201","doi-asserted-by":"crossref","first-page":"1260","DOI":"10.18653\/v1\/2022.findings-emnlp.91","article-title":"Translating Hanja historical documents to contemporary Korean and English","volume-title":"Findings of the Association for Computational Linguistics: EMNLP","author":"Son","year":"2022"},{"key":"2023111518555903300_bib202","doi-asserted-by":"publisher","first-page":"171","DOI":"10.1109\/ICSIP.2014.33","article-title":"Classification of ancient epigraphs into different periods using random forests","volume-title":"International Conference on Signal and Image 
Processing","author":"Soumya","year":"2014"},{"issue":"3","key":"2023111518555903300_bib203","doi-asserted-by":"publisher","first-page":"585","DOI":"10.1162\/coli_a_00011","article-title":"Last words: Ancient symbols, computational linguistics, and the reviewing practices of the general science journals","volume":"36","author":"Sproat","year":"2010","journal-title":"Computational Linguistics"},{"issue":"2","key":"2023111518555903300_bib204","doi-asserted-by":"publisher","first-page":"457","DOI":"10.1353\/lan.2014.0031","article-title":"A statistical comparison of written language and nonlinguistic symbol systems","volume":"90","author":"Sproat","year":"2014","journal-title":"Language"},{"key":"2023111518555903300_bib205","first-page":"105","article-title":"Overview of the EvaLatin 2020 evaluation campaign","volume-title":"Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA)","author":"Sprugnoli","year":"2020"},{"key":"2023111518555903300_bib206","first-page":"183","article-title":"Overview of the EvaLatin 2022 evaluation campaign","volume-title":"Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA)","author":"Sprugnoli","year":"2022"},{"key":"2023111518555903300_bib207","article-title":"Vir is to moderatus as mulier is to intemperans-lemma embeddings for Latin.","volume-title":"CLiC-it","author":"Sprugnoli","year":"2019"},{"issue":"3","key":"2023111518555903300_bib208","doi-asserted-by":"publisher","first-page":"538","DOI":"10.1002\/asi.21001","article-title":"A survey of modern authorship attribution methods","volume":"60","author":"Stamatatos","year":"2009","journal-title":"Journal of the American Society for Information Science and Technology"},{"key":"2023111518555903300_bib209","first-page":"130","article-title":"Voting for POS tagging of Latin texts: Using the flair of flair to better ensemble classifiers by example of Latin","volume-title":"Workshop on Language Technologies for Historical and Ancient 
Languages (LT4HALA)","author":"Stoeckel","year":"2020"},{"key":"2023111518555903300_bib210","doi-asserted-by":"publisher","DOI":"10.3389\/fdigh.2015.00005","article-title":"Digital approaches to paleography and book history: Some challenges, present and future","author":"Stokes","year":"2015","journal-title":"Frontiers in Digital Humanities"},{"issue":"1","key":"2023111518555903300_bib211","doi-asserted-by":"publisher","first-page":"239","DOI":"10.1002\/asi.23460","article-title":"Computational authorship verification method attributes a new work to a major 2nd century African author","volume":"67","author":"Stover","year":"2016","journal-title":"Journal of the Association for Information Science and Technology"},{"key":"2023111518555903300_bib212","first-page":"197","article-title":"UDPipe 2.0 prototype at CoNLL 2018 UD shared task","volume-title":"CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies","author":"Straka","year":"2018"},{"key":"2023111518555903300_bib213","first-page":"4290","article-title":"UDPipe: Trainable pipeline for processing CoNLL-U files performing tokenization, morphological analysis, POS tagging and parsing","volume-title":"Language Resources and Evaluation Conference (LREC)","author":"Straka","year":"2016"},{"key":"2023111518555903300_bib214","article-title":"UDPipe at EvaLatin 2020: Contextualized embeddings and treebank embeddings","author":"Straka","year":"2020","journal-title":"arXiv preprint arXiv:2006.03687"},{"key":"2023111518555903300_bib215","article-title":"Evaluating contextualized embeddings on 54 languages in POS tagging, lemmatization and dependency parsing","author":"Straka","year":"2019","journal-title":"arXiv preprint arXiv:1908.07448"},{"issue":"3","key":"2023111518555903300_bib216","doi-asserted-by":"publisher","first-page":"6873","DOI":"10.35940\/ijrte.C5842.098319","article-title":"Recognizing ancient characters from Tamil palm leaf manuscripts using convolution based deep 
learning","volume":"8","author":"Subramani","year":"2019","journal-title":"International Journal of Recent Technology and Engineering"},{"key":"2023111518555903300_bib217","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1109\/ICAMMAET.2017.8186731","article-title":"Feature selection for an automated ancient Tamil script classification system using machine learning techniques","volume-title":"International Conference on Algorithms, Methodology, Models and Applications in Emerging Technologies (ICAMMAET)","author":"Suganya","year":"2017"},{"key":"2023111518555903300_bib218","article-title":"Sequence to sequence learning with neural networks","volume":"27","author":"Sutskever","year":"2014","journal-title":"Advances in Neural Information Processing Systems"},{"key":"2023111518555903300_bib219","first-page":"224","article-title":"Semantic domains in Akkadian texts","volume":"2","author":"Sv\u00e4rd","year":"2018","journal-title":"CyberResearch on the Ancient Near East and Neighboring Regions. 
Case Studies on Archaeological Data, Objects, Texts, and Digital Archiving"},{"key":"2023111518555903300_bib220","doi-asserted-by":"publisher","first-page":"128","DOI":"10.1109\/eScience51609.2021.00023","article-title":"Exploring learning approaches for ancient Greek character recognition with citizen science data","volume-title":"International Conference on eScience","author":"Swindall","year":"2021"},{"key":"2023111518555903300_bib221","doi-asserted-by":"publisher","first-page":"4973","DOI":"10.24963\/ijcai.2022\/689","article-title":"Dataset augmentation in papyrology with generative models: A study of synthetic ancient Greek character images","volume-title":"International Joint Conference on Artificial Intelligence (IJCAI)","author":"Swindall","year":"2022"},{"key":"2023111518555903300_bib222","first-page":"159","article-title":"Simple tagging system with RoBERTa for ancient Chinese","volume-title":"Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA)","author":"Tang","year":"2022"},{"key":"2023111518555903300_bib223","doi-asserted-by":"publisher","first-page":"69","DOI":"10.1145\/3319921.3319958","article-title":"Authorship attribution of the Golden Lotus based on text classification methods","volume-title":"International Conference on Innovation in Artificial Intelligence","author":"Tang","year":"2019"},{"issue":"3","key":"2023111518555903300_bib224","first-page":"1","article-title":"Image and interpretation using artificial intelligence to read ancient Roman texts","volume":"7","author":"Terras","year":"2005","journal-title":"Human IT"},{"key":"2023111518555903300_bib225","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1109\/IJCNN52387.2021.9534342","article-title":"AnchiBERT: A pre-trained model for ancient Chinese language understanding and generation","volume-title":"International Joint Conference on Neural Networks 
(IJCNN)","author":"Tian","year":"2021"},{"key":"2023111518555903300_bib226","doi-asserted-by":"publisher","first-page":"99","DOI":"10.3764\/aja.113.1.99","article-title":"The study of hands on Greek inscriptions: The need for a digital approach","author":"Tracy","year":"2009","journal-title":"American Journal of Archaeology"},{"issue":"4","key":"2023111518555903300_bib227","doi-asserted-by":"publisher","first-page":"18","DOI":"10.18356\/c002fa64-en","article-title":"The itinerary of a stolen stele","volume":"2020","author":"Tsirogiannis","year":"2020","journal-title":"UNESCO Courier"},{"issue":"2","key":"2023111518555903300_bib228","doi-asserted-by":"publisher","first-page":"435","DOI":"10.1093\/llc\/fqw001","article-title":"An application of a profile-based method for authorship verification: Investigating the authenticity of Pliny the Younger\u2019s letter to Trajan concerning the Christians","volume":"32","author":"Tuccinardi","year":"2017","journal-title":"Digital Scholarship in the Humanities"},{"key":"2023111518555903300_bib229","first-page":"461","article-title":"Reconsidering the Roman workshop: Using computer vision to analyse the making of ancient inscriptions","volume":"10","author":"Tupman","year":"2021","journal-title":"Umanistica Digitale"},{"key":"2023111518555903300_bib230","first-page":"243","article-title":"Toward automatically assembling Hittite-language Cuneiform tablet fragments into larger texts","volume-title":"Annual Meeting of the Association for Computational Linguistics","author":"Tyndall","year":"2012"},{"key":"2023111518555903300_bib231","article-title":"Attention is all you need","volume-title":"Advances in Neural Information Processing Systems","author":"Vaswani","year":"2017"},{"issue":"1","key":"2023111518555903300_bib232","doi-asserted-by":"publisher","first-page":"55","DOI":"10.1163\/24523666-01000013","article-title":"The Diorisis ancient Greek corpus: Linguistics and 
literature","volume":"3","author":"Vatri","year":"2018","journal-title":"Research Data Journal for the Humanities and Social Sciences"},{"issue":"2","key":"2023111518555903300_bib233","doi-asserted-by":"publisher","first-page":"179","DOI":"10.1163\/15699846-02002001","article-title":"Lemmatization for ancient Greek: An experimental assessment of the state of the art","volume":"20","author":"Vatri","year":"2020","journal-title":"Journal of Greek Linguistics"},{"key":"2023111518555903300_bib234","first-page":"92","article-title":"IBM research at the CoNLL 2018 shared task on multilingual parsing","volume-title":"CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies","author":"Wan","year":"2018"},{"key":"2023111518555903300_bib235","doi-asserted-by":"publisher","first-page":"387","DOI":"10.1007\/978-3-319-49508-8_36","article-title":"A sentence segmentation method for ancient Chinese texts based on NNLM","volume-title":"Workshop on Chinese Lexical Semantics","author":"Wang","year":"2016"},{"key":"2023111518555903300_bib236","first-page":"178","article-title":"Glyph features matter: A multimodal solution for EvaHan in LT4HALA2022","volume-title":"Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA)","author":"Wei","year":"2022"},{"key":"2023111518555903300_bib237","first-page":"226","article-title":"Recognition and translation of ancient Brahmi letters using deep learning and NLP","volume-title":"International Conference on Advancements in Computing (ICAC)","author":"Wijerathna","year":"2019"},{"issue":"1","key":"2023111518555903300_bib238","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/sdata.2016.18","article-title":"The FAIR guiding principles for scientific data management and stewardship","volume":"3","author":"Wilkinson","year":"2016","journal-title":"Scientific Data"},{"key":"2023111518555903300_bib239","first-page":"39","article-title":"Topic modeling experiments on Hellenistic 
corpora","volume-title":"CDH@ TLT","author":"Wishart","year":"2017"},{"key":"2023111518555903300_bib240","doi-asserted-by":"publisher","DOI":"10.2307\/147248","volume-title":"The Study of Greek Inscriptions","author":"Woodhead","year":"1959"},{"key":"2023111518555903300_bib241","first-page":"193","article-title":"Transformer-based part-of-speech tagging and lemmatization for Latin","volume-title":"Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA)","author":"Wr\u00f3bel","year":"2022"},{"key":"2023111518555903300_bib242","doi-asserted-by":"publisher","first-page":"54","DOI":"10.18653\/v1\/W19-1406","article-title":"Language discrimination and transfer learning for similar languages: Experiments with feature combinations and adaptation","volume-title":"Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial)","author":"Wu","year":"2019"},{"key":"2023111518555903300_bib243","first-page":"114","article-title":"JHUBC\u2019s submission to LT4HALA EvaLatin 2020","volume-title":"Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA)","author":"Wu","year":"2020"},{"issue":"3","key":"2023111518555903300_bib244","doi-asserted-by":"publisher","first-page":"e9506","DOI":"10.1371\/journal.pone.0009506","article-title":"Statistical analysis of the Indus script using n-grams","volume":"5","author":"Yadav","year":"2010","journal-title":"PLOS ONE"},{"key":"2023111518555903300_bib245","first-page":"6071","article-title":"BERT in Plutarch\u2019s shadows","volume-title":"Empirical Methods in Natural Language Processing (EMNLP)","author":"Yamshchikov","year":"2022"},{"issue":"8","key":"2023111518555903300_bib246","doi-asserted-by":"publisher","first-page":"3855","DOI":"10.1007\/s00521-020-05216-8","article-title":"An automatic evaluation metric for ancient-modern Chinese translation","volume":"33","author":"Yang","year":"2021","journal-title":"Neural Computing and 
Applications"},{"key":"2023111518555903300_bib247","first-page":"174","article-title":"A joint framework for ancient Chinese WS and POS tagging based on adversarial ensemble learning","volume-title":"Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA)","author":"Yang","year":"2022"},{"key":"2023111518555903300_bib248","doi-asserted-by":"publisher","first-page":"1832","DOI":"10.18653\/v1\/2022.findings-naacl.140","article-title":"HUE: Pretrained model and dataset for understanding Hanja documents of ancient Korea","volume-title":"North American Chapter of the Association for Computational Linguistics (NAACL)","author":"Yoo","year":"2022"},{"key":"2023111518555903300_bib249","doi-asserted-by":"publisher","first-page":"313","DOI":"10.1007\/978-3-642-34752-8_38","article-title":"Word segmentation for text in Japanese ancient writings based on probability of character n-grams","volume-title":"International Conference on Asian Digital Libraries","author":"Yoshimura","year":"2012"},{"key":"2023111518555903300_bib250","doi-asserted-by":"publisher","first-page":"101","DOI":"10.31219\/osf.io\/8epsy","article-title":"Automatic translation alignment for ancient Greek and Latin","volume-title":"Proceedings of the Second Workshop on Language Technologies for Historical and Ancient Languages","author":"Yousef","year":"2022"},{"issue":"6","key":"2023111518555903300_bib251","first-page":"1","article-title":"Word segmentation for ancient Chinese texts based on nonparametric Bayesian models and deep learning","volume":"34","author":"Yu","year":"2020","journal-title":"Journal of Chinese Information Processing"},{"key":"2023111518555903300_bib252","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s11263-022-01665-x","article-title":"Artificial intelligence for Dunhuang cultural heritage protection: The project and the dataset","volume":"130","author":"Yu","year":"2022","journal-title":"International Journal of Computer 
Vision"},{"key":"2023111518555903300_bib253","doi-asserted-by":"publisher","first-page":"115","DOI":"10.1109\/IALP48816.2019.9037653","article-title":"A machine learning model for the dating of ancient Chinese texts","volume-title":"International Conference on Asian Language Processing (IALP)","author":"Yu","year":"2019"},{"key":"2023111518555903300_bib254","first-page":"1","article-title":"A report on the third VarDial evaluation campaign","volume-title":"Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial)","author":"Zampieri","year":"2019"},{"key":"2023111518555903300_bib255","doi-asserted-by":"publisher","first-page":"1","DOI":"10.18653\/v1\/K17-3001","article-title":"CoNLL 2017 shared task: Multilingual parsing from raw text to universal dependencies","volume-title":"CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies","author":"Zeman","year":"2017"},{"key":"2023111518555903300_bib256","doi-asserted-by":"publisher","first-page":"4482","DOI":"10.1145\/3534678.3539050","article-title":"Data-driven Oracle Bone rejoining: A dataset and practical self-supervised learning scheme","volume-title":"ACM SIGKDD Conference on Knowledge Discovery and Data Mining","author":"Zhang","year":"2022"},{"key":"2023111518555903300_bib257","first-page":"150","article-title":"BERT 4ever@ EvaHan 2022: Ancient Chinese word segmentation and part-of-speech tagging based on adversarial learning and continual pre-training","volume-title":"Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA)","author":"Zhang","year":"2022"},{"key":"2023111518555903300_bib258","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3469213.3470270","article-title":"People name recognition from ancient Chinese literature using distant supervision and deep learning","volume-title":"International Conference on Artificial Intelligence and Information 
Systems","author":"Zhang","year":"2021"},{"key":"2023111518555903300_bib259","doi-asserted-by":"publisher","first-page":"309","DOI":"10.1109\/ICDAR.2019.00057","article-title":"Oracle character recognition by nearest neighbor classification with deep metric learning","volume-title":"International Conference on Document Analysis and Recognition (ICDAR)","author":"Zhang","year":"2019"},{"key":"2023111518555903300_bib260","doi-asserted-by":"publisher","first-page":"157","DOI":"10.1007\/978-3-030-32236-6_13","article-title":"Automatic translating between ancient Chinese and contemporary Chinese with limited aligned corpora","volume-title":"CCF International Conference on Natural Language Processing and Chinese Computing","author":"Zhang","year":"2019"},{"key":"2023111518555903300_bib261","doi-asserted-by":"publisher","first-page":"33080","DOI":"10.1109\/ACCESS.2020.2972807","article-title":"Improvement of ancient Shui character recognition model based on convolutional neural network","volume":"8","author":"Zhao","year":"2020","journal-title":"IEEE Access"}],"container-title":["Computational 
Linguistics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/direct.mit.edu\/coli\/article-pdf\/49\/3\/703\/2177413\/coli_a_00481.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/direct.mit.edu\/coli\/article-pdf\/49\/3\/703\/2177413\/coli_a_00481.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,15]],"date-time":"2023-11-15T18:57:27Z","timestamp":1700074647000},"score":1,"resource":{"primary":{"URL":"https:\/\/direct.mit.edu\/coli\/article\/49\/3\/703\/116160\/Machine-Learning-for-Ancient-Languages-A-Survey"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023]]},"references-count":261,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2023,9,1]]},"published-print":{"date-parts":[[2023,9,1]]}},"URL":"https:\/\/doi.org\/10.1162\/coli_a_00481","relation":{},"ISSN":["0891-2017","1530-9312"],"issn-type":[{"value":"0891-2017","type":"print"},{"value":"1530-9312","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2023]]},"published":{"date-parts":[[2023]]}}}