{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,5]],"date-time":"2026-06-05T04:57:49Z","timestamp":1780635469678,"version":"3.54.1"},"reference-count":93,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2024,9,2]],"date-time":"2024-09-02T00:00:00Z","timestamp":1725235200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,9,2]],"date-time":"2024-09-02T00:00:00Z","timestamp":1725235200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Intell Inf Syst"],"published-print":{"date-parts":[[2025,2]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>Lexical Simplification (LS) is the task of substituting complex words within a sentence for simpler alternatives while maintaining the sentence\u2019s original meaning. LS is the lexical component of Text Simplification (TS) systems with the aim of improving accessibility to various target populations such as individuals with low literacy or reading disabilities. Prior surveys have been published several years before the introduction of transformers, transformer-based large language models (LLMs), and prompt learning that have drastically changed the field of NLP. The high performance of these models has sparked renewed interest in LS. To reflect these recent advances, we present a comprehensive survey of papers published since 2017 on LS and its sub-tasks focusing on deep learning. Finally, we describe available benchmark datasets for the future development of LS systems.<\/jats:p>","DOI":"10.1007\/s10844-024-00882-9","type":"journal-article","created":{"date-parts":[[2024,9,2]],"date-time":"2024-09-02T05:02:00Z","timestamp":1725253320000},"page":"111-134","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":9,"title":["Deep learning approaches to lexical simplification: A survey"],"prefix":"10.1007","volume":"63","author":[{"given":"Kai","family":"North","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Tharindu","family":"Ranasinghe","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Matthew","family":"Shardlow","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Marcos","family":"Zampieri","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2024,9,2]]},"reference":[{"key":"882_CR1","doi-asserted-by":"publisher","unstructured":"Abramov, A. V., & Ivanov, V. V. (2022). Collection and evaluation of lexical complexity data for Russian language using crowdsourcing. Russian Journal of Linguistics, 26(2), 409\u2013425. https:\/\/doi.org\/10.22363\/2687-0088-30118","DOI":"10.22363\/2687-0088-30118"},{"key":"882_CR2","doi-asserted-by":"publisher","unstructured":"Abramov, A. V., Ivanov, V. V., & Solovyev, V. D. (2023). Lexical Complexity Evaluation based on Context for Russian Language. Computaci\u00f3n y Sistemas, 27(1), 127\u2013139.https:\/\/doi.org\/10.13053\/cys-27-1-4528","DOI":"10.13053\/cys-27-1-4528"},{"issue":"2","key":"882_CR3","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3442695","volume":"54","author":"SS Al-Thanyyan","year":"2021","unstructured":"Al-Thanyyan, S. S., & Azmi, A. M. (2021). Automated Text Simplification: A Survey. ACM Comput Surv, 54(2), 1\u20133. https:\/\/doi.org\/10.1145\/3442695","journal-title":"ACM Comput Surv"},{"key":"882_CR4","unstructured":"Alarc\u00f3n, R., Moreno, L., & Mart\u00ednez, P. (2021a). Exploration of Spanish Word Embeddings for Lexical Simplification. In: Proceedings of the First Workshop on Current Trends in Text Simplification (CTTS 2021), online, URL https:\/\/ceur-ws.org\/Vol-2944\/paper2.pdf"},{"key":"882_CR5","doi-asserted-by":"publisher","first-page":"58755","DOI":"10.1109\/ACCESS.2021.3072697","volume":"9","author":"R Alarc\u00f3n","year":"2021","unstructured":"Alarc\u00f3n, R., Moreno, L., & Mart\u00ednez, P. (2021). Lexical Simplification System to Improve Web Accessibility. IEEE Access, 9, 58755\u20135876. https:\/\/doi.org\/10.1109\/ACCESS.2021.3072697","journal-title":"IEEE Access"},{"key":"882_CR6","doi-asserted-by":"publisher","unstructured":"Aleksandrova, D., & Brochu\u00a0Dufour, O. (2022). RCML at TSAR-2022 Shared Task: Lexical Simplification With Modular Substitution Candidate Ranking. In: Proceedings of the Workshop on Text Simplification, Accessibility, and Readability (TSAR-2022). Association for Computational Linguistics, Abu Dhabi, United Arab Emirates (Virtual), pp 259\u201326https:\/\/doi.org\/10.18653\/v1\/2022.tsar-1.29","DOI":"10.18653\/v1\/2022.tsar-1.29"},{"key":"882_CR7","doi-asserted-by":"publisher","unstructured":"Alonzo, O., Lee, S., Maddela, M., et\u00a0al. (2022a). A Dataset of Word-Complexity Judgements from Deaf and Hard-of-Hearing Adults for Text Simplification. In: Proceedings of the Workshop on Text Simplification, Accessibility, and Readability (TSAR-2022). Association for Computational Linguistics, Abu Dhabi, United Arab Emirates (Virtual), pp 119\u2013124, https:\/\/doi.org\/10.18653\/v1\/2022.tsar-1.11","DOI":"10.18653\/v1\/2022.tsar-1.11"},{"key":"882_CR8","doi-asserted-by":"publisher","unstructured":"Alonzo, O., Trussell, J., Watkins, M., et\u00a0al. (2022b). Methods for Evaluating the Fluency of Automatically Simplified Texts with Deaf and Hard-of-Hearing Adults at Various Literacy Levels. In: Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, https:\/\/doi.org\/10.1145\/3491102.3517566","DOI":"10.1145\/3491102.3517566"},{"key":"882_CR9","unstructured":"Alu\u00edsio, S. M., & Gasperin, C. (2010). Fostering digital inclusion and accessibility: The porsimples project for simplification of portuguese texts. In: Proceedings of the NAACL HLT 2010 Young Investigators Workshop on Computational Approaches to Languages of the Americas. Association for Computational Linguistics, Los Angeles, California, pp 46\u201353, URl https:\/\/aclanthology.org\/W10-1607"},{"key":"882_CR10","doi-asserted-by":"publisher","unstructured":"Arefyev, N., Sheludko, B., Podolskiy, A., et\u00a0al. (2020). Always Keep your Target in Mind: Studying Semantics and Improving Performance of Neural Lexical Substitution. In: Proceedings of the 28th International Conference on Computational Linguistics. International Committee on Computational Linguistics, Barcelona, Spain (Online), pp 1242\u20131255, https:\/\/doi.org\/10.18653\/v1\/2020.coling-main.107","DOI":"10.18653\/v1\/2020.coling-main.107"},{"key":"882_CR11","doi-asserted-by":"publisher","unstructured":"Aumiller, D., & Gertz, M. (2022). UniHD at TSAR-2022 Shared Task: Is Compute All We Need for Lexical Simplification? In: Proceedings of the Workshop on Text Simplification, Accessibility, and Readability (TSAR-2022). Association for Computational Linguistics, Abu Dhabi, United Arab Emirates (Virtual), pp 251\u2013258, https:\/\/doi.org\/10.18653\/v1\/2022.tsar-1.28","DOI":"10.18653\/v1\/2022.tsar-1.28"},{"key":"882_CR12","unstructured":"Billami, M. B., & Fran\u00e7ois, T., & Gala, N. (2018). ReSyf: a French lexicon with ranked synonyms. In: Proceedings of the 27th International Conference on Computational Linguistics. Association for Computational Linguistics, Santa Fe, New Mexico, USA, pp 2570\u20132581, URL https:\/\/aclanthology.org\/C18-1218"},{"key":"882_CR13","doi-asserted-by":"publisher","first-page":"135","DOI":"10.1162\/tacl_a_00051","volume":"5","author":"P Bojanowski","year":"2017","unstructured":"Bojanowski, P., Grave, E., Joulin, A., et al. (2017). Enriching word vectors with subword information. Transactions of the Association for Computational Linguistics, 5, 135\u2013146. https:\/\/doi.org\/10.1162\/tacl_a_00051","journal-title":"Transactions of the Association for Computational Linguistics"},{"key":"882_CR14","unstructured":"Brown, T. B., Mann ,B., Ryder, N., et\u00a0al. (2020). Language Models Are Few-Shot Learners. In: 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada, URL https:\/\/proceedings.neurips.cc\/paper\/2020\/file\/1457c0d6bfcb4967418bfb8ac142f64a-Paper.pdf"},{"key":"882_CR15","unstructured":"Ca\u00f1ete, J., Chaperon, G., Fuentes, R., et\u00a0al. (2020). Spanish Pre-Trained BERT Model and Evaluation Data. In: Proceedings of PML4DC at the International Conference on Learning Representation (ICLR.), Virtual, URL https:\/\/arxiv.org\/abs\/2308.02976"},{"key":"882_CR16","unstructured":"Carroll, J., Minnen, G., Canning, Y., et\u00a0al. (1998). Practical Simplification of English Newspaper Text to Assist Aphasic Readers. In: Proccedings of the Fifteenth National Conference on Artificial Intelligence (AAAI-98), Madison, Wisconsin, USA, URL https:\/\/users.sussex.ac.uk\/~johnca\/papers\/aaai98.pdf"},{"key":"882_CR17","unstructured":"Clark, K., Luong, M. T., Le, Q. V., et\u00a0al. (2020). Electra: Pre-training text encoders as discriminators rather than generators. In: 8th International Conference on Learning Representations (ICLR-2020). OpenReview.net, Addis Ababa, Ethiopia, URL https:\/\/openreview.net\/forum?id=r1xMH1BtvB"},{"key":"882_CR18","doi-asserted-by":"publisher","unstructured":"Conneau, A., Khandelwal, K., Goyal, N., et\u00a0al. (2020). Unsupervised cross-lingual representation learning at scale. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Online, pp 8440\u20138https:\/\/doi.org\/10.18653\/v1\/2020.acl-main.747","DOI":"10.18653\/v1\/2020.acl-main.747"},{"key":"882_CR19","doi-asserted-by":"publisher","unstructured":"Devlin, J., Chang, M. W., Lee, K., et\u00a0al. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Minneapolis, Minnesota, pp 4171\u20134186, https:\/\/doi.org\/10.18653\/v1\/N19-1423","DOI":"10.18653\/v1\/N19-1423"},{"key":"882_CR20","unstructured":"Devlin, S., & Tait, J. (1998). The use of a psycholinguistic database in the simplification of text for aphasic readers. Linguistic Databases pp 161\u2013173"},{"key":"882_CR21","unstructured":"Ermakova, L., Bellot, P., Braslavski, P., et\u00a0al. (2021). Overview of SimpleText CLEF 2021 Workshop and Pilot Tasks. In: Proceedings of the Conference and Labs of the Evaluation Forum (LREC), Bucharest, Romania, URL https:\/\/ceur-ws.org\/Vol-2936\/paper-199.pdf"},{"key":"882_CR22","unstructured":"Fandi\u00f1o, A. G., Estap\u00e9, J. A., P\u00e1mies, M., et\u00a0al. (2022). Maria: Spanish language models. Procesamiento del Lenguaje Natural 68:39\u201360. URL https:\/\/api.semanticscholar.org\/CorpusID:252847802"},{"key":"882_CR23","unstructured":"Ferres, D., & Saggion, H. (2022). ALEXSIS: A dataset for lexical simplification in Spanish. In: Proceedings of the Conference and Labs of the Evaluation Forum (LREC), Marseille, France, pp 3582\u20133594, URL https:\/\/aclanthology.org\/2022.lrec-1.383"},{"key":"882_CR24","unstructured":"Gala, N., Tack, A., Javourey-Drevet, L., et\u00a0al. (2020). Alector: A Parallel Corpus of Simplified French Texts with Alignments of Misreadings by Poor and Dyslexic Readers. In: Proceedings of the Conference and Labs of the Evaluation Forum (LREC). European Language Resources Association, Marseille, France, pp 1353\u20131361, https:\/\/aclanthology.org\/2020.lrec-1.169"},{"key":"882_CR25","unstructured":"Gasperin, C., Specia, L., Pereira, T. F., et\u00a0al. (2009). Learning When to Simplify Sentences for Natural Text Simplification. Proceedings of ENIA https:\/\/api.semanticscholar.org\/CorpusID:14656741"},{"key":"882_CR26","doi-asserted-by":"publisher","unstructured":"Gooding, S., & Tragut, M. (2022). One Size Does Not Fit All: The Case for Personalised Word Complexity Models. In: Findings of the Association for Computational Linguistics: NAACL 2022. Association for Computational Linguistics, Seattle, United States, pp 353\u2013365, https:\/\/doi.org\/10.18653\/v1\/2022.findings-naacl.27","DOI":"10.18653\/v1\/2022.findings-naacl.27"},{"key":"882_CR27","doi-asserted-by":"crossref","unstructured":"Hampton, A. J., Nye, B. D., Pavlik, P.I., et\u00a0al. (2018). Mitigating Knowledge Decay from Instruction with Voluntary Use of an Adaptive Learning System. In: Penstein\u00a0Ros\u00e9, C,, Mart\u00ednez-Maldonado, R., Hoppe, H. U., et\u00a0al. (eds) Artificial Intelligence in Education. Springer International Publishing, Cham, pp 119\u2013133, URL https:\/\/link.springer.com\/chapter\/10.1007\/978-3-319-93846-2_23","DOI":"10.1007\/978-3-319-93846-2_23"},{"key":"882_CR28","doi-asserted-by":"crossref","unstructured":"Hartmann, N. S., Alu\u00edsio, S. M. (2020). Adapta\u00e7\u00e3o Lexical Autom\u00e1tica em Textos Informativos do Portugu\u00eas Brasileiro para o Ensino Fundamental. Linguam\u00e1tica 12(2):3\u201327. URL https:\/\/www.teses.usp.br\/teses\/disponiveis\/55\/55134\/tde-29072020-161751\/pt-br.php","DOI":"10.21814\/lm.12.2.323"},{"key":"882_CR29","doi-asserted-by":"publisher","unstructured":"Horn, C., Manduca, C., & Kauchak, D. (2014). Learning a Lexical Simplifier Using Wikipedia. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). Association for Computational Linguistics, Baltimore, Maryland, pp 458\u2013463, https:\/\/doi.org\/10.3115\/v1\/P14-2075","DOI":"10.3115\/v1\/P14-2075"},{"key":"882_CR30","doi-asserted-by":"publisher","unstructured":"Hssina, B., & Erritali, M, (2019), A Personalized Pedagogical Objectives Based on a Genetic Algorithm in an Adaptive Learning System. Procedia Computer Science, 151, 1152\u20131157. https:\/\/doi.org\/10.1016\/j.procs.2019.04.164, the 10th International Conference on Ambient Systems, Networks and Technologies (ANT 2019) \/ The 2nd International Conference on Emerging Data and Industry 4.0 (EDI40 2019) \/ Affiliated Workshops","DOI":"10.1016\/j.procs.2019.04.164"},{"key":"882_CR31","unstructured":"Jiang, A. Q., Sablayrolles, A., Mensch, A. et\u00a0al. (2023). Mistral 7B. arXiv:2310.06825"},{"key":"882_CR32","doi-asserted-by":"publisher","DOI":"10.1016\/j.caeai.2021.100017","volume":"2","author":"T Kabudi","year":"2021","unstructured":"Kabudi, T., Pappas, I., & Olsen, D. H. (2021). AI-enabled adaptive learning systems: A systematic mapping of the literature. Computers and Education: Artificial Intelligence, 2, 100017. https:\/\/doi.org\/10.1016\/j.caeai.2021.100017","journal-title":"Computers and Education: Artificial Intelligence"},{"key":"882_CR33","doi-asserted-by":"publisher","unstructured":"Kajiwara, T., & Yamamoto, K. (2015). Evaluation Dataset and System for Japanese Lexical Simplification. In: Proceedings of the ACL-IJCNLP 2015 Student Research Workshop, pp 35\u201340, https:\/\/doi.org\/10.3115\/v1\/P15-3006","DOI":"10.3115\/v1\/P15-3006"},{"key":"882_CR34","unstructured":"Kajiwara, T., Matsumoto, H., & Yamamoto, K. (2013). Selecting proper lexical paraphrase for children. In: Proceedings of ROCLING. The Association for Computational Linguistics and Chinese Language Processing (ACLCLP), Kaohsiung, Taiwan, pp 59\u201373, URL https:\/\/aclanthology.org\/O13-1007"},{"key":"882_CR35","doi-asserted-by":"publisher","unstructured":"Kodaira, T., Kajiwara, T., & Komachi, M. (2016). Controlled and Balanced Dataset for Japanese Lexical Simplification. Association for Computational Linguistics, Berlin, Germany, pp 1\u20137, https:\/\/doi.org\/10.18653\/v1\/P16-3001, URL https:\/\/aclanthology.org\/P16-3001","DOI":"10.18653\/v1\/P16-3001"},{"key":"882_CR36","unstructured":"Koptient, A., & Grabar, N. (2022). Automatic Detection of Difficulty of French Medical Sequences in Context. In: Bhatia A, Cook P, Taslimipoor S, et\u00a0al (eds) Proceedings of the Conference and Labs of the Evaluation Forum (LREC). European Language Resources Association, Marseille, France, pp 55\u201366, URL https:\/\/aclanthology.org\/2022.mwe-1.9"},{"key":"882_CR37","unstructured":"Leal, S. E., Duran, M. S., & Alu\u00edsio, S. M. (2018). A Nontrivial Sentence Corpus for the Task of Sentence Readability Assessment in Portuguese. In: Proceedings of the 28th International Conference on Computational Linguistics. Association for Computational Linguistics, Santa Fe, New Mexico, USA, pp 401\u2013413, URL https:\/\/aclanthology.org\/C18-1034"},{"key":"882_CR38","doi-asserted-by":"crossref","unstructured":"Lee, J., & Yeung, C. Y. (2018a). Automatic prediction of vocabulary knowledge for learners of chinese as a foreign language. In: 2nd International Conference on Natural Language and Speech Processing (ICNLSP), pp 1\u20134, URL https:\/\/api.semanticscholar.org\/CorpusID:46967208","DOI":"10.1109\/ICNLSP.2018.8374392"},{"key":"882_CR39","unstructured":"Lee, J., & Yeung, C. Y. (2018b). Personalizing lexical simplification. In: Proceedings of the 28th International Conference on Computational Linguistics. Association for Computational Linguistics, Santa Fe, New Mexico, USA, pp 224\u2013232, URL https:\/\/aclanthology.org\/C18-1019"},{"key":"882_CR40","doi-asserted-by":"publisher","unstructured":"Li, X., Wiechmann, D., Qiao, Y, et\u00a0al. (2022). MANTIS at TSAR-2022 Shared Task: Improved Unsupervised Lexical Simplification with Pretrained Encoders. In: Proceedings of the Workshop on Text Simplification, Accessibility, and Readability (TSAR-2022). Association for Computational Linguistics, Abu Dhabi, United Arab Emirates (Virtual), pp 243\u2013250, https:\/\/doi.org\/10.18653\/v1\/2022.tsar-1.27","DOI":"10.18653\/v1\/2022.tsar-1.27"},{"key":"882_CR41","unstructured":"Liu, Y., Ott, M., Goyal, N., et\u00a0al. (2019). Roberta: A robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692 URL https:\/\/api.semanticscholar.org\/CorpusID:198953378"},{"key":"882_CR42","doi-asserted-by":"publisher","unstructured":"Maddela, M., & Xu, W. (2018). A word-complexity lexicon and a neural readability ranking model for lexical simplification. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, Brussels, Belgium, pp 3749\u20133760, https:\/\/doi.org\/10.18653\/v1\/D18-1410, URL https:\/\/aclanthology.org\/D18-1410","DOI":"10.18653\/v1\/D18-1410"},{"key":"882_CR43","doi-asserted-by":"crossref","unstructured":"McCarthy, D., & Navigli, R. (2007). SemEval-2007 Task 10: English Lexical Substitution Task. In: Proceedings of the International Workshop on Semantic Evaluations. Association for Computational Linguistics, Prague, Czech Republic, pp 48\u201353, URL https:\/\/aclanthology.org\/S07-1009","DOI":"10.3115\/1621474.1621483"},{"key":"882_CR44","doi-asserted-by":"publisher","unstructured":"Melamud, O., Goldberger, J., & Dagan, I. (2016). context2vec: Learning Generic Context Embedding with Bidirectional LSTM. In: Proceedings of the Conference on Computational Natural Language Learning. Association for Computational Linguistics, Berlin, Germany, pp 51\u201361, https:\/\/doi.org\/10.18653\/v1\/K16-1006, URL https:\/\/aclanthology.org\/K16-1006","DOI":"10.18653\/v1\/K16-1006"},{"key":"882_CR45","unstructured":"Merejildo, B. (2021). Creaci\u00f3n de un corpus de textos universitarios en espa\u00f1ol para la identificaci\u00f3n de palabras complejas en el \u00e1rea de la simplificaci\u00f3n l\u00e9xica. Master\u2019s thesis, Universidad de Guayaquil"},{"key":"882_CR46","unstructured":"Mikolov, T., Chen, K., Corrado, G., et\u00a0al. (2013). Efficient Estimation of word Representations in Vector Space. In: Proceedings of the International Conference on Learning Representations, URL https:\/\/api.semanticscholar.org\/CorpusID:5959482"},{"key":"882_CR47","unstructured":"Minaee, S., Mikolov, T., Nikzad, N., et\u00a0al. (2024). Large Language Models: A Survey. arXiv preprint arXiv:2402.06196 abs\/2402.06196. URL https:\/\/api.semanticscholar.org\/CorpusID:267617032"},{"key":"882_CR48","unstructured":"Nishihara, D., & Kajiwara, T. (2020). Word Complexity Estimation for Japanese Lexical Simplification. In: Proceedings of the Conference and Labs of the Evaluation Forum (LREC). European Language Resources Association, Marseille, France, pp 3114\u20133120, URL https:\/\/aclanthology.org\/2020.lrec-1.381"},{"key":"882_CR49","doi-asserted-by":"publisher","unstructured":"North, K., & Zampieri, M. (2023). Features of Lexical Complexity: Insights from L1 and L2 Speakers. Frontiers in Artificial Intelligence 6(1). https:\/\/doi.org\/10.3389\/frai.2023.1236963","DOI":"10.3389\/frai.2023.1236963"},{"key":"882_CR50","doi-asserted-by":"publisher","unstructured":"North, K., Dmonte, A., Ranasinghe, T., et\u00a0al. (2022a). GMU-WLV at TSAR-2022 Shared Task: Evaluating Lexical Simplification Models. In: Proceedings of the Workshop on Text Simplification, Accessibility, and Readability (TSAR-2022). Association for Computational Linguistics, Abu Dhabi, United Arab Emirates (Virtual), pp 264\u2013270, https:\/\/doi.org\/10.18653\/v1\/2022.tsar-1.30","DOI":"10.18653\/v1\/2022.tsar-1.30"},{"issue":"9","key":"882_CR51","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3557885","volume":"55","author":"K North","year":"2022","unstructured":"North, K., Zampieri, M., & Shardlow, M. (2022). Lexical Complexity Prediction: A Survey. ACM Computing Surveys, 55(9), 1\u201342. https:\/\/doi.org\/10.1145\/3557885","journal-title":"ACM Computing Surveys"},{"key":"882_CR52","doi-asserted-by":"publisher","unstructured":"North, K., Dmonte, A., Ranasinghe, T., et\u00a0al. (2023). ALEXSIS+: Improving Substitute Generation and Selection for Lexical Simplification with Information Retrieval. In: Proceedings of the Workshop on Innovative Use of NLP for Building Educational Applications. Association for Computational Linguistics, Toronto, Canada, pp 404\u2013413, https:\/\/doi.org\/10.18653\/v1\/2023.bea-1.33, URL https:\/\/aclanthology.org\/2023.bea-1.33","DOI":"10.18653\/v1\/2023.bea-1.33"},{"key":"882_CR53","doi-asserted-by":"crossref","unstructured":"Ortiz\u00a0Zambrano, J., MontejoR\u00e1ez, A., Lino\u00a0Castillo, K. N., et\u00a0al. (2019). VYTEDU-CW: Difficult words as a barrier in the reading comprehension of university students. In: The International Conference on Advances in Emerging Trends and Technologies, pp 167\u2013176, URl https:\/\/link.springer.com\/chapter\/10.1007\/978-3-030-32022-5_16","DOI":"10.1007\/978-3-030-32022-5_16"},{"key":"882_CR54","doi-asserted-by":"crossref","unstructured":"Ortiz\u00a0Zambrano, J. A., & Montejo-R\u00e1ez, A. (2021). CLexIS2: A New Corpus for Complex Word Identification Research in Computing Studies. In: Proceedings of the International Conference on Recent Advances in Natural Language Processing. INCOMA Ltd., Held Online, pp 1075\u20131083, URL https:\/\/aclanthology.org\/2021.ranlp-1.121","DOI":"10.26615\/978-954-452-072-4_121"},{"key":"882_CR55","doi-asserted-by":"publisher","unstructured":"Paetzold, G., & Specia, L. (2016a). SemEval 2016 Task 11: Complex Word Identification. In: Proceedings of the International Workshop on Semantic Evaluations. Association for Computational Linguistics, San Diego, California, pp 560\u2013569, https:\/\/doi.org\/10.18653\/v1\/S16-1085, URL https:\/\/aclanthology.org\/S16-1085","DOI":"10.18653\/v1\/S16-1085"},{"key":"882_CR56","doi-asserted-by":"crossref","unstructured":"Paetzold, G., & Specia, L. (2017a). Lexical Simplification with Neural Ranking. In: Proceedings of the Conference of the European Chapter of the Association for Computational Linguistics. Association for Computational Linguistics, Valencia, Spain, pp 34\u201340, URL https:\/\/aclanthology.org\/E17-2006","DOI":"10.18653\/v1\/E17-2006"},{"key":"882_CR57","doi-asserted-by":"publisher","unstructured":"Paetzold, G. H., & Specia, L. (2015). LEXenstein: A Framework for Lexical Simplification. In: Proceedings of ACL-IJCNLP 2015 System Demonstrations. Association for Computational Linguistics and The Asian Federation of Natural Language Processing, Beijing, China, pp 85\u201390, https:\/\/doi.org\/10.3115\/v1\/P15-4015, URL https:\/\/aclanthology.org\/P15-4015","DOI":"10.3115\/v1\/P15-4015"},{"key":"882_CR58","unstructured":"Paetzold, G. H., & Specia, L. (2016b). Benchmarking Lexical Simplification Systems. In: Proceedings of the Conference and Labs of the Evaluation Forum (LREC). European Language Resources Association (ELRA), Portoro\u017e, Slovenia, pp 3074\u20133080, URL https:\/\/aclanthology.org\/L16-1491"},{"key":"882_CR59","doi-asserted-by":"publisher","unstructured":"Paetzold, G. H., & Specia, L. (2016c). Unsupervised lexical simplification for non-native speakers. In: Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023. Association for Computational Linguistics, Singapore, pp 9368\u20139379, https:\/\/doi.org\/10.18653\/v1\/2023.findings-emnlp.627, URL https:\/\/aclanthology.org\/2023.findings-emnlp.627","DOI":"10.18653\/v1\/2023.findings-emnlp.627"},{"issue":"1","key":"882_CR60","doi-asserted-by":"publisher","first-page":"549","DOI":"10.5555\/3207692.3207704","volume":"60","author":"GH Paetzold","year":"2017","unstructured":"Paetzold, G. H., & Specia, L. (2017). A Survey on Lexical Simplification. J Artif Int Res, 60(1), 549\u2013593. https:\/\/doi.org\/10.5555\/3207692.3207704","journal-title":"J Artif Int Res"},{"key":"882_CR61","doi-asserted-by":"publisher","first-page":"193","DOI":"10.1007\/s10844-022-00694-9","volume":"59","author":"M Peal","year":"2022","unstructured":"Peal, M., Hossain, M. S., & Chen, J. (2022). Summarizing consumer reviews. Intell. Inf Syst, 59, 193\u2013212. https:\/\/doi.org\/10.1007\/s10844-022-00694-9","journal-title":"Inf Syst"},{"key":"882_CR62","doi-asserted-by":"publisher","unstructured":"Peters, M. E., Neumann, M., Iyyer, M., et\u00a0al. (2018). Deep Contextualized Word Representations. In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers). Association for Computational Linguistics, New Orleans, Louisiana, pp 2227\u20132237, https:\/\/doi.org\/10.18653\/v1\/N18-1202, URL https:\/\/aclanthology.org\/N18-1202","DOI":"10.18653\/v1\/N18-1202"},{"key":"882_CR63","doi-asserted-by":"publisher","unstructured":"Przyby\u0142a, P., & Shardlow, M. (2020). Multi-Word Lexical Simplification. In: Proceedings of the 28th International Conference on Computational Linguistics. International Committee on Computational Linguistics, Barcelona, Spain (Online), pp 1435\u20131446, https:\/\/doi.org\/10.18653\/v1\/2020.coling-main.123, URL https:\/\/aclanthology.org\/2020.coling-main.123","DOI":"10.18653\/v1\/2020.coling-main.123"},{"key":"882_CR64","doi-asserted-by":"crossref","unstructured":"Qiang, J., Li, Y., Yi, Z., et\u00a0al. (2020). Lexical simplification with pretrained encoders. In: The Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-20), URL https:\/\/cdn.aaai.org\/ojs\/6389\/6389-13-9614-1-10-20200517.pdf","DOI":"10.1609\/aaai.v34i05.6389"},{"key":"882_CR65","doi-asserted-by":"publisher","first-page":"1819","DOI":"10.1109\/TASLP.2021.3078361","volume":"29","author":"J Qiang","year":"2021","unstructured":"Qiang, J., Lu, X., Li, Y., et al. (2021). Chinese Lexical Simplification. IEEE\/ACM Transactions on Audio, Speech, and Language Processing, 29, 1819\u20131828. https:\/\/doi.org\/10.1109\/TASLP.2021.3078361","journal-title":"IEEE\/ACM Transactions on Audio, Speech, and Language Processing"},{"key":"882_CR66","unstructured":"Rahman, M. M., Irbaz, M. S., North, K., et\u00a0al. (2024). Health Text Simplification: An Annotated Corpus for Digestive Cancer Education and Novel Strategies for Reinforcement Learning. URL https:\/\/arxiv.org\/abs\/2401.15043, 2401.15043"},{"key":"882_CR67","doi-asserted-by":"crossref","unstructured":"Rello, L., Baeza-Yates, R., Dempere-Marco, L., et\u00a0al. (2013). Frequent words improve readability and short words improve understandability for people with dyslexia. In: Human-Computer Interaction \u2013 INTERACT 2013. Springer Berlin Heidelberg, Berlin, Heidelberg, pp 203\u2013219, URL https:\/\/link.springer.com\/chapter\/10.1007\/978-3-642-40498-6_15","DOI":"10.1007\/978-3-642-40498-6_15"},{"issue":"3","key":"882_CR68","doi-asserted-by":"publisher","first-page":"705","DOI":"10.1111\/jcal.12517","volume":"37","author":"I Rets","year":"2020","unstructured":"Rets, I., & Rogaten, J. (2020). To simplify or not? Facilitating English L2 users\u2019 comprehension and processing of open educational resources in English using text simplification. Journal of Computer Assisted Learning, 37(3), 705\u2013717. https:\/\/doi.org\/10.1111\/jcal.12517","journal-title":"Journal of Computer Assisted Learning"},{"key":"882_CR69","doi-asserted-by":"crossref","unstructured":"Rolin, E., Langlois, Q., Watrin, P., et\u00a0al. (2021). FrenLyS: A Tool for the Automatic Simplification of French General Language Texts. In: Proceedings of the International Conference on Recent Advances in Natural Language Processing. INCOMA Ltd., Held Online, pp 1196\u20131205, URL https:\/\/aclanthology.org\/2021.ranlp-1.135","DOI":"10.26615\/978-954-452-072-4_135"},{"key":"882_CR70","doi-asserted-by":"publisher","unstructured":"De\u00a0la Rosa, J., & Fern\u00e1ndez, A. (2022). Zero-shot reading comprehension and reasoning for spanish with BERTIN GPT-J-6B. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). Association for Computational Linguistics, Hong Kong, China, pp 5933\u20135940,https:\/\/doi.org\/10.18653\/v1\/D19-1607, URL https:\/\/aclanthology.org\/D19-1607","DOI":"10.18653\/v1\/D19-1607"},{"key":"882_CR71","doi-asserted-by":"publisher","unstructured":"Saggion H, \u0160tajner, S., Ferr\u00e9s, D., et\u00a0al. (2022). Findings of the TSAR-2022 Shared Task on Multilingual Lexical Simplification. In: \"Proceedings of the Workshop on Text Simplification, Accessibility, and Readability (TSAR-2022). Association for Computational Linguistics, Abu Dhabi, United Arab Emirates (Virtual), pp 271\u2013283, https:\/\/doi.org\/10.18653\/v1\/2022.tsar-1.31, URL https:\/\/aclanthology.org\/2022.tsar-1.31","DOI":"10.18653\/v1\/2022.tsar-1.31"},{"key":"882_CR72","doi-asserted-by":"publisher","unstructured":"Seneviratne, S., Daskalaki, E., & Suominen, H. (2022). CILS at TSAR-2022 Shared Task: Investigating the Applicability of Lexical Substitution Methods for Lexical Simplification. In: Proceedings of the Workshop on Text Simplification, Accessibility, and Readability (TSAR-2022). Association for Computational Linguistics, Abu Dhabi, United Arab Emirates (Virtual), pp 207\u2013212, https:\/\/doi.org\/10.18653\/v1\/2022.tsar-1.21","DOI":"10.18653\/v1\/2022.tsar-1.21"},{"key":"882_CR73","unstructured":"Shardlow, M. (2013). The CW Corpus: A New Resource for Evaluating the Identification of Complex Words. In: Proceedings of the Second Workshop on Predicting and Improving Text Readability for Target Reader Populations. Association for Computational Linguistics, Sofia, Bulgaria, URL https:\/\/aclanthology.org\/W13-2908"},{"key":"882_CR74","unstructured":"Shardlow, M., Cooper, M., & Zampieri, M. (2020). CompLex \u2014 a new corpus for lexical complexity prediction from Likert Scale data. In: Proceedings of READI. European Language Resources Association, Marseille, France, pp 57\u201362, URL https:\/\/aclanthology.org\/2020.readi-1.9"},{"key":"882_CR75","doi-asserted-by":"publisher","unstructured":"Shardlow, M., Evans, R., Paetzold, G., et\u00a0al. (2021). SemEval-2021 Task 1: Lexical Complexity Prediction. In: Proceedings of SemEval, Online, pp 1\u201316, https:\/\/doi.org\/10.18653\/v1\/2021.semeval-1.1, URL https:\/\/aclanthology.org\/2021.semeval-1.1","DOI":"10.18653\/v1\/2021.semeval-1.1"},{"key":"882_CR76","unstructured":"Shardlow, M., Alva-Manchego, F., Batista-Navarro, R. T., et\u00a0al. (2024). The BEA 2024 Shared Task on the Multilingual Lexical Simplification Pipeline. In: Proceedings of the Workshop on Innovative Use of NLP for Building Educational Applications. Association for Computational Linguistics, Mexico City, Mexico, pp 571\u2013589, URL https:\/\/aclanthology.org\/2024.bea-1.51"},{"key":"882_CR77","doi-asserted-by":"crossref","unstructured":"Song, J., Hu, J., Wong, L. P., et\u00a0al. (2020). A New Context-Aware Method Based on Hybrid Ranking for Community-Oriented Lexical Simplification. In: Proceedings of the International Conference on Database Systems for Advanced Applications, URL https:\/\/api.semanticscholar.org\/CorpusID:221839918","DOI":"10.1007\/978-3-030-59413-8_7"},{"key":"882_CR78","doi-asserted-by":"publisher","unstructured":"Souza, F., Nogueira, R., & Lotufo, R. (2020). BERTimbau: pretrained BERT models for Brazilian Portuguese. In: Proceedings of the Intelligent Systems: 9th Brazilian Conference, BRACIS 2020. Springer-Verlag, Rio Grande, Brazil, p 403-417, https:\/\/doi.org\/10.1007\/978-3-030-61377-8_28, URL https:\/\/doi.org\/10.1007\/978-3-030-61377-8_28","DOI":"10.1007\/978-3-030-61377-8_28"},{"key":"882_CR79","unstructured":"Specia, L., Jauhar KSujay, & Mihalcea, R. (2012). Semeval - 2012 task 1: English lexical simplification. In: Proceedings of SemEval. Association for Computational Linguistics, Montr\u00e9al, Canada, pp 347\u2013355, URL https:\/\/aclanthology.org\/S12-1046"},{"key":"882_CR80","unstructured":"Touvron, H., Martin, L., Stone, K., et\u00a0al. (2023). Llama 2: Open foundation and fine-tuned chat models. arXiv:2307.09288 URL https:\/\/arxiv.org\/abs\/2307.09288"},{"key":"882_CR81","unstructured":"Trask, A., Michalak, P., & Liu, J. (2015). sense2vec - A Fast and Accurate Method for Word Sense Disambiguation In Neural Word Embeddings. ArXiv abs\/1511.06388. URL http:\/\/arxiv.org\/abs\/1511.06388"},{"key":"882_CR82","doi-asserted-by":"crossref","unstructured":"Troussas, C., & Virvou, M. (2020). Introduction. In: Advances in Social Networking-based Learning: Machine Learning-based User Modelling and Sentiment Analysis. Springer International Publishing, Cham, pp 1\u201316, URL https:\/\/link.springer.com\/book\/10.1007\/978-3-030-39130-0","DOI":"10.1007\/978-3-030-39130-0_1"},{"key":"882_CR83","unstructured":"Uchida, S., Takada, S., & Arase, Y. (2018). CEFR-based Lexical Simplification Dataset. In: Proceedings of the Conference and Labs of the Evaluation Forum (LREC). European Language Resources Association (ELRA), Miyazaki, Japan, URL https:\/\/aclanthology.org\/L18-1514"},{"key":"882_CR84","doi-asserted-by":"publisher","unstructured":"V\u00e1squez-Rodr\u00edguez, L., Nguyen, N., Ananiadou, S., et\u00a0al. (2022). UoM &MMU at TSAR-2022 Shared Task: Prompt Learning for Lexical Simplification. In: Proceedings of the Workshop on Text Simplification, Accessibility, and Readability (TSAR-2022). Association for Computational Linguistics, Abu Dhabi, United Arab Emirates (Virtual), pp 218\u2013224,https:\/\/doi.org\/10.18653\/v1\/2022.tsar-1.23","DOI":"10.18653\/v1\/2022.tsar-1.23"},{"key":"882_CR85","doi-asserted-by":"publisher","unstructured":"Watanabe, W. M., Junior, A. C., Uz\u00eada, V. R., et\u00a0al. (2009). Facilita: Reading assistance for low-literacy readers. In: Proceedings of the 27th ACM International Conference on Design of Communication, p 29-36, https:\/\/doi.org\/10.1145\/1621995.1622002","DOI":"10.1145\/1621995.1622002"},{"key":"882_CR86","doi-asserted-by":"publisher","unstructured":"Whistely, P. J., Mathias, S., & Poornima, G. (2022). PresiUniv at TSAR-2022 Shared Task: Generation and Ranking of Simplification Substitutes of Complex Words in Multiple Languages. In: Proceedings of the Workshop on Text Simplification, Accessibility, and Readability (TSAR-2022). Association for Computational Linguistics, Abu Dhabi, United Arab Emirates (Virtual), pp 213\u2013217, https:\/\/doi.org\/10.18653\/v1\/2022.tsar-1.22","DOI":"10.18653\/v1\/2022.tsar-1.22"},{"key":"882_CR87","doi-asserted-by":"publisher","unstructured":"Wilkens, R., Alfter, D., Cardon, R., et\u00a0al. (2022). CENTAL at TSAR-2022 Shared Task: How Does Context Impact BERT-Generated Substitutions for Lexical Simplification? In: Proceedings of the Workshop on Text Simplification, Accessibility, and Readability (TSAR-2022). Association for Computational Linguistics, Abu Dhabi, United Arab Emirates (Virtual), pp 231\u2013238, https:\/\/doi.org\/10.18653\/v1\/2022.tsar-1.25","DOI":"10.18653\/v1\/2022.tsar-1.25"},{"key":"882_CR88","doi-asserted-by":"publisher","first-page":"325","DOI":"10.1007\/s10844-022-00757-x","volume":"61","author":"F Xie","year":"2022","unstructured":"Xie, F., Chen, J., & Chen, K. (2022). Extractive text-image summarization with relation-enhanced graph attention network. Intell Inf Syst, 61, 325\u2013341. https:\/\/doi.org\/10.1007\/s10844-022-00757-x","journal-title":"Intell Inf Syst"},{"key":"882_CR89","unstructured":"Yang, Z., Dai, Z., Yang, Y., et\u00a0al. (2019). XLNet: Generalized Autoregressive Pretraining for Language Understanding. In: Proceedings of the 33rd International Conference on Neural Information Processing Systems, Red Hook, NY, USA, https:\/\/dl.acm.org\/doi\/10.5555\/3454287.3454804"},{"key":"882_CR90","unstructured":"Yeung, C. Y., & Lee, J. (2018). Personalized text retrieval for learners of Chinese as a foreign language. In: Proceedings of the 27th International Conference on Computational Linguistics. Association for Computational Linguistics, Santa Fe, New Mexico, USA, pp 3448\u20133455, URL https:\/\/aclanthology.org\/C18-1292"},{"key":"882_CR91","doi-asserted-by":"publisher","unstructured":"Yimam, S. M., Biemann, C., Malmasi, S., et\u00a0al. (2018). A Report on the Complex Word Identification Shared Task 2018. In: Proceedings of the Workshop on Innovative Use of NLP for Building Educational Applications. Association for Computational Linguistics, New Orleans, Louisiana, pp 66\u201378, https:\/\/doi.org\/10.18653\/v1\/W18-0507, URL https:\/\/aclanthology.org\/W18-0507","DOI":"10.18653\/v1\/W18-0507"},{"issue":"102351","key":"882_CR92","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.ipm.2020.102351","volume":"57","author":"F Zaman","year":"2020","unstructured":"Zaman, F., Shardlow, M., Hassan, S. U., et al. (2020). HTSS: A novel hybrid text summarisation and simplification architecture. Information Processing and Management, 57(102351), 1\u201313. https:\/\/doi.org\/10.1016\/j.ipm.2020.102351","journal-title":"Information Processing and Management"},{"key":"882_CR93","unstructured":"Zambrano, J. A. O., R\u00e1ez, A. M. (2020). Overview of ALexS 2020: First Workshop on Lexical Analysis at SEPLN. In: Proceedings of ALexS, URL https:\/\/api.semanticscholar.org\/CorpusID:225063101"}],"container-title":["Journal of Intelligent Information Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10844-024-00882-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10844-024-00882-9\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10844-024-00882-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,8]],"date-time":"2025-03-08T10:23:28Z","timestamp":1741429408000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10844-024-00882-9"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,9,2]]},"references-count":93,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2025,2]]}},"alternative-id":["882"],"URL":"https:\/\/doi.org\/10.1007\/s10844-024-00882-9","relation":{},"ISSN":["0925-9902","1573-7675"],"issn-type":[{"value":"0925-9902","type":"print"},{"value":"1573-7675","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,9,2]]},"assertion":[{"value":"1 May 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"17 August 2024","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"19 August 2024","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"2 September 2024","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest\/Competing interests"}},{"value":"N\/A.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethical Approval"}}]}}