{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,31]],"date-time":"2026-03-31T09:05:06Z","timestamp":1774947906749,"version":"3.50.1"},"reference-count":48,"publisher":"MIT Press","license":[{"start":{"date-parts":[[2025,7,7]],"date-time":"2025-07-07T00:00:00Z","timestamp":1751846400000},"content-version":"vor","delay-in-days":187,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["direct.mit.edu"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2025,7,3]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Large-scale sense-annotated corpora are important for a range of tasks but are hard to come by. Dictionaries that record and describe the vocabulary of a language often offer a small set of real-world example sentences for each sense of a word. However, on their own, these sentences are too few to be used as diachronic sense-annotated corpora. We propose a targeted strategy for training and evaluating generative models producing historically and semantically accurate word usages given any word, sense definition, and year triple. Our results demonstrate that fine-tuned models can generate usages with the same properties as real-world example sentences from a reference dictionary. Thus the generated usages will be suitable for training and testing computational models where large-scale sense-annotated corpora are needed but currently unavailable.<\/jats:p>","DOI":"10.1162\/tacl_a_00761","type":"journal-article","created":{"date-parts":[[2025,7,7]],"date-time":"2025-07-07T19:37:07Z","timestamp":1751917027000},"page":"690-708","update-policy":"https:\/\/doi.org\/10.1162\/mitpressjournals.corrections.policy","source":"Crossref","is-referenced-by-count":1,"title":["Sense-specific Historical Word Usage Generation"],"prefix":"10.1162","volume":"13","author":[{"given":"Pierluigi","family":"Cassotti","sequence":"first","affiliation":[{"name":"University of Gothenburg, Sweden. pierluigi.cassotti@gu.se"}]},{"given":"Nina","family":"Tahmasebi","sequence":"additional","affiliation":[{"name":"University of Gothenburg, Sweden. nina.tahmasebi@gu.se"}]}],"member":"281","published-online":{"date-parts":[[2025,7,3]]},"reference":[{"key":"2025070715370394400_bib1","article-title":"Llama 3 model card","author":"AI@Meta","year":"2024"},{"key":"2025070715370394400_bib2","first-page":"6958","article-title":"CCOHA: Clean corpus of historical American English","volume-title":"Proceedings of The 12th Language Resources and Evaluation Conference, LREC 2020","author":"Alatrash","year":"2020"},{"key":"2025070715370394400_bib3","unstructured":"Amazon Mechanical Turk. [link]."},{"key":"2025070715370394400_bib4","doi-asserted-by":"publisher","first-page":"3779","DOI":"10.24963\/ijcai.2021\/520","article-title":"Exemplification modeling: Can you give me an example, please?","volume-title":"Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, IJCAI 2021, Virtual Event \/ Montreal, Canada, 19\u201327 August 2021","author":"Barba","year":"2021"},{"key":"2025070715370394400_bib5","doi-asserted-by":"publisher","DOI":"10.1515\/9783110931600","volume-title":"Prinzipien des lexikalischen Bedeutungswandels am Beispiel der romanischen Sprachen","author":"Blank","year":"1997"},{"key":"2025070715370394400_bib6","doi-asserted-by":"publisher","first-page":"3538","DOI":"10.18653\/v1\/2024.naacl-long.194","article-title":"Low-cost generation and evaluation of dictionary example sentences","volume-title":"Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)","author":"Cai","year":"2024"},{"key":"2025070715370394400_bib7","doi-asserted-by":"publisher","first-page":"1577","DOI":"10.18653\/v1\/2023.acl-short.135","article-title":"XL-LEXEME: WiC pretrained model for cross-lingual LEXical sEMantic changE","volume-title":"Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)","author":"Cassotti","year":"2023"},{"issue":"2005","key":"2025070715370394400_bib8","doi-asserted-by":"publisher","first-page":"69","DOI":"10.20885\/informatika.vol3.iss1.art7","article-title":"A corpus of late modern english texts","volume":"29","author":"De Smet","year":"2005","journal-title":"ICAME Journal"},{"key":"2025070715370394400_bib9","article-title":"QLoRA: Efficient finetuning of quantized LLMs","volume-title":"Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, NeurIPS 2023, New Orleans, LA, USA, December 10\u201316, 2023","author":"Dettmers","year":"2023"},{"key":"2025070715370394400_bib10","article-title":"BERT: Pre-training of deep bidirectional transformers for language understanding","author":"Devlin","year":"2018","journal-title":"CoRR"},{"key":"2025070715370394400_bib11","doi-asserted-by":"publisher","first-page":"457","DOI":"10.18653\/v1\/P19-1044","article-title":"Time-out: Temporal referencing for robust modeling of lexical semantic change","volume-title":"Proceedings of the 57th Conference of the Association for Computational Linguistics, ACL 2019, Florence, Italy, July 28\u2013August 2, 2019, Volume 1: Long Papers","author":"Dubossarsky","year":"2019"},{"issue":"3","key":"2025070715370394400_bib12","doi-asserted-by":"publisher","first-page":"511","DOI":"10.1162\/COLI_a_00142","article-title":"Measuring word meaning in context","volume":"39","author":"Erk","year":"2013","journal-title":"Computational Linguistics"},{"key":"2025070715370394400_bib13","doi-asserted-by":"publisher","first-page":"5712","DOI":"10.18653\/v1\/2024.findings-acl.339","article-title":"Definition generation for lexical semantic change detection","volume-title":"Findings of the Association for Computational Linguistics ACL 2024","author":"Fedorova","year":"2024"},{"key":"2025070715370394400_bib14","doi-asserted-by":"publisher","first-page":"31","DOI":"10.1162\/tacl_a_00081","article-title":"A Bayesian model of diachronic meaning change","volume":"4","author":"Frermann","year":"2016","journal-title":"Transactions of the Association for Computational Linguistics"},{"key":"2025070715370394400_bib15","doi-asserted-by":"publisher","first-page":"266","DOI":"10.18653\/v1\/P18-2043","article-title":"Conditional generators of words definitions","volume-title":"Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, ACL 2018, Melbourne, Australia, July 15\u201320, 2018, Volume 2: Short Papers","author":"Gadetsky","year":"2018"},{"key":"2025070715370394400_bib16","doi-asserted-by":"publisher","first-page":"3130","DOI":"10.18653\/v1\/2023.acl-long.176","article-title":"Interpretable word sense representations via definition generation: The case of semantic change analysis","volume-title":"Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)","author":"Giulianelli","year":"2023"},{"key":"2025070715370394400_bib17","doi-asserted-by":"publisher","first-page":"2116","DOI":"10.18653\/v1\/D16-1229","article-title":"Cultural shift or linguistic drift? Comparing two computational measures of semantic change","volume-title":"Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing","author":"Hamilton","year":"2016"},{"key":"2025070715370394400_bib18","doi-asserted-by":"publisher","first-page":"610","DOI":"10.18653\/v1\/2022.acl-long.46","article-title":"Controllable dictionary example generation: Generating example sentences for specific targeted audiences","volume-title":"Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2022, Dublin, Ireland, May 22\u201327, 2022","author":"He","year":"2022"},{"key":"2025070715370394400_bib19","doi-asserted-by":"publisher","first-page":"738","DOI":"10.1007\/978-3-642-04174-7_53","article-title":"Using temporal language models for document dating","volume-title":"Machine Learning and Knowledge Discovery in Databases, European Conference, ECML PKDD 2009, Bled, Slovenia, September 7\u201311, 2009, Proceedings, Part II","author":"Kanhabua","year":"2009"},{"key":"2025070715370394400_bib20","article-title":"Machine-assisted mixed methods: Augmenting humanities and social sciences with artificial intelligence","author":"Karjus","year":"2023","journal-title":"arXiv preprint arXiv:2309.14379"},{"key":"2025070715370394400_bib21","doi-asserted-by":"publisher","first-page":"625","DOI":"10.1145\/2736277.2741627","article-title":"Statistically significant detection of linguistic change","volume-title":"Proceedings of the 24th International Conference on World Wide Web","author":"Kulkarni","year":"2015"},{"key":"2025070715370394400_bib22","doi-asserted-by":"publisher","first-page":"7871","DOI":"10.18653\/v1\/2020.acl-main.703","article-title":"BART: Denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension","volume-title":"Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics","author":"Lewis","year":"2020"},{"key":"2025070715370394400_bib23","doi-asserted-by":"publisher","DOI":"10.5281\/zenodo.5255228","article-title":"Dwug la: Diachronic word usage graphs for latin","author":"McGillivray","year":"2021"},{"key":"2025070715370394400_bib24","doi-asserted-by":"publisher","DOI":"10.3115\/1075527.1075662","article-title":"WORDNET: A lexical database for english","volume-title":"Speech and Natural Language: Proceedings of a Workshop Held at Harriman, New York, USA, February 23\u201326, 1992","author":"Miller","year":"1992"},{"key":"2025070715370394400_bib25","doi-asserted-by":"publisher","DOI":"10.3115\/1075671.1075742","article-title":"A semantic concordance","volume-title":"Human Language Technology: Proc. of a Workshop Held at Plainsboro, New Jersey, USA, March 21\u201324, 1993","author":"Miller","year":"1993"},{"key":"2025070715370394400_bib26","article-title":"Oxford english dictionary","volume":"3","author":"Oxford English Dictionary","year":"1989","journal-title":"Simpson, Ja & Weiner, Esc"},{"key":"2025070715370394400_bib27","unstructured":"OED API. [link]."},{"key":"2025070715370394400_bib28","unstructured":"OpenAI. [link]."},{"key":"2025070715370394400_bib29","first-page":"5759","article-title":"A short survey on sense-annotated corpora","volume-title":"Proceedings of the Twelfth Language Resources and Evaluation Conference","author":"Pasini","year":"2020"},{"key":"2025070715370394400_bib30","doi-asserted-by":"publisher","first-page":"4495","DOI":"10.18653\/v1\/2024.acl-long.246","article-title":"Analyzing semantic change through lexical replacements","volume-title":"Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)","author":"Periti","year":"2024"},{"issue":"11","key":"2025070715370394400_bib31","doi-asserted-by":"publisher","DOI":"10.1145\/3672393","article-title":"Lexical semantic change through large language models: A survey","volume":"56","author":"Periti","year":"2024","journal-title":"ACM Computing Surveys"},{"key":"2025070715370394400_bib32","doi-asserted-by":"publisher","first-page":"4262","DOI":"10.18653\/v1\/2024.naacl-long.240","article-title":"A systematic comparison of contextualized word embeddings for lexical semantic change","volume-title":"Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)","author":"Periti","year":"2024"},{"key":"2025070715370394400_bib33","doi-asserted-by":"publisher","first-page":"870","DOI":"10.18653\/v1\/S15-2147","article-title":"Semeval 2015, task 7: Diachronic text evaluation","volume-title":"Proceedings of the 9th International Workshop on Semantic Evaluation, SemEval@NAACL-HLT 2015, Denver, Colorado, USA, June 4\u20135, 2015","author":"Popescu","year":"2015"},{"key":"2025070715370394400_bib34","doi-asserted-by":"publisher","first-page":"3982","DOI":"10.18653\/v1\/D19-1410","article-title":"Sentence-BERT: Sentence embeddings using Siamese BERT-networks","volume-title":"Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)","author":"Reimers","year":"2019"},{"key":"2025070715370394400_bib35","first-page":"478","article-title":"Exponential family embeddings","volume-title":"Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016","author":"Rudolph","year":"2016"},{"key":"2025070715370394400_bib36","doi-asserted-by":"publisher","first-page":"14379","DOI":"10.18653\/v1\/2024.emnlp-main.796","article-title":"More DWUGs: Extending and evaluating word usage graph datasets in multiple languages","volume-title":"Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing","author":"Schlechtweg","year":"2024"},{"key":"2025070715370394400_bib37","doi-asserted-by":"publisher","DOI":"10.5281\/zenodo.7387261","article-title":"Dwug en: Diachronic word usage graphs for english","author":"Schlechtweg","year":"2022"},{"key":"2025070715370394400_bib38","article-title":"Simulating lexical semantic change from sense-annotated data","volume-title":"The Evolution of Language: Proceedings of the 13th International Conference (EvoLang13)","author":"Schlechtweg","year":"2020"},{"key":"2025070715370394400_bib39","doi-asserted-by":"publisher","first-page":"169","DOI":"10.18653\/v1\/N18-2027","article-title":"Diachronic usage relatedness (DURel): A framework for the annotation of lexical semantic change","volume-title":"Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT, Volume 2 (Short Papers)","author":"Schlechtweg","year":"2018"},{"key":"2025070715370394400_bib40","doi-asserted-by":"publisher","first-page":"1","DOI":"10.18653\/v1\/2020.semeval-1.1","article-title":"SemEval-2020 Task 1: Unsupervised Lexical Semantic Change Detection","volume-title":"Proceedings of the Fourteenth Workshop on Semantic Evaluation, SemEval@COLING2020","author":"Schlechtweg","year":"2020"},{"key":"2025070715370394400_bib41","doi-asserted-by":"publisher","DOI":"10.5281\/zenodo.7441645","article-title":"Dwug de: Diachronic word usage graphs for German","author":"Schlechtweg","year":"2022"},{"key":"2025070715370394400_bib42","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.emnlp-main.567","article-title":"DWUG: A large resource of diachronic word usage graphs in four languages","volume-title":"Annual Conference of the North American Chapter of the Association for Computational Linguistics, (NAACL-HLT 2021)","author":"Schlechtweg","year":"2021"},{"key":"2025070715370394400_bib43","doi-asserted-by":"publisher","first-page":"66","DOI":"10.18653\/v1\/D19-1007","article-title":"Room to Glo: A systematic comparison of semantic change detection approaches with word embeddings","volume-title":"Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)","author":"Shoemark","year":"2020"},{"key":"2025070715370394400_bib44","doi-asserted-by":"publisher","DOI":"10.5281\/zenodo.7389506","article-title":"Dwug sv: Diachronic word usage graphs for swedish","author":"Tahmasebi","year":"2022"},{"key":"2025070715370394400_bib45","doi-asserted-by":"publisher","first-page":"1605","DOI":"10.18653\/v1\/P18-1149","article-title":"Dating documents using graph convolution networks","volume-title":"Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, ACL 2018, Melbourne, Australia, July 15\u201320, 2018, Volume 1: Long Papers","author":"Vashishth","year":"2018"},{"key":"2025070715370394400_bib46","article-title":"Large language models on lexical semantic change detection: An evaluation","author":"Wang","year":"2023","journal-title":"arXiv preprint arXiv:2312.06002"},{"key":"2025070715370394400_bib47","doi-asserted-by":"publisher","first-page":"673","DOI":"10.1145\/3159652.3159703","article-title":"Dynamic word embeddings for evolving semantic discovery","volume-title":"Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining, WSDM 2018","author":"Yao","year":"2018"},{"key":"2025070715370394400_bib48","article-title":"Bertscore: Evaluating text generation with BERT","volume-title":"8th International Conference on Learning Representations, ICLR 2020, Addis Ababa, Ethiopia, April 26\u201330, 2020","author":"Zhang","year":"2020"}],"container-title":["Transactions of the Association for Computational Linguistics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/direct.mit.edu\/tacl\/article-pdf\/doi\/10.1162\/tacl_a_00761\/2535111\/tacl_a_00761.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/direct.mit.edu\/tacl\/article-pdf\/doi\/10.1162\/tacl_a_00761\/2535111\/tacl_a_00761.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,7]],"date-time":"2025-07-07T19:37:11Z","timestamp":1751917031000},"score":1,"resource":{"primary":{"URL":"https:\/\/direct.mit.edu\/tacl\/article\/doi\/10.1162\/tacl_a_00761\/131585\/Sense-specific-Historical-Word-Usage-Generation"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025]]},"references-count":48,"URL":"https:\/\/doi.org\/10.1162\/tacl_a_00761","relation":{},"ISSN":["2307-387X"],"issn-type":[{"value":"2307-387X","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2025]]},"published":{"date-parts":[[2025]]}}}