{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,22]],"date-time":"2025-10-22T18:26:31Z","timestamp":1761157591856,"version":"3.37.0"},"reference-count":30,"publisher":"PPUFU - Portal de Peri\u00f3dicos da Universidade Federal de Uberl\u00e2ndia","license":[{"start":{"date-parts":[[2023,11,15]],"date-time":"2023-11-15T00:00:00Z","timestamp":1700006400000},"content-version":"unspecified","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc-nd\/4.0"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Dom. Ling."],"abstract":"<jats:p>In this paper, we present an experiment for complexity-level analysis of Portuguese texts from the 18th century using NLP tools. The 18th century was the time for the realization of a new world that had been built since the Renaissance, it was the period of consolidation of many of the current sciences. One of its characteristics is the presentation of scientific written records in national languages, rather than Latin, and the expressed wishes that the specialized texts could be more understandable to people of lesser erudition. As such, we intend to collaborate to identify if and how these wishes were fulfilled. To achieve this goal, we resort to an NLP supporting methodology to detect degrees of complexity of a medical work of this time period and compare it with two other works that have hypothesized lesser and greater complexities. By using NILC-Metrix, we intend to identify features of a continuum of complexity in this kind of document.<\/jats:p>","DOI":"10.14393\/dlv17a2023-53","type":"journal-article","created":{"date-parts":[[2023,12,27]],"date-time":"2023-12-27T12:08:37Z","timestamp":1703678917000},"page":"e1753","source":"Crossref","is-referenced-by-count":1,"title":["A Natural Language Processing approach to Complexity Assessment of 18th-century health literature"],"prefix":"10.14393","volume":"17","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6101-0814","authenticated-orcid":false,"given":"Leonardo","family":"Zilio","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6022-8408","authenticated-orcid":false,"given":"Maria Jos\u00e9 Bocorny","family":"Finatto","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2449-5477","authenticated-orcid":false,"given":"Renata","family":"Vieira","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5086-059X","authenticated-orcid":false,"given":"Paulo","family":"Quaresma","sequence":"additional","affiliation":[]}],"member":"5515","published-online":{"date-parts":[[2023,11,15]]},"reference":[{"key":"263286","unstructured":"ALU\u00cdSIO, S., GASPERIN, C. Fostering digital inclusion and accessibility: the Porsimples project for simplification of Portuguese texts. In: Proceedings of the NAACL-HLT 2010 Young Investigators Workshop on Computational Approaches to Languages of the Americas, 2010. p. 46\u201353."},{"key":"263287","unstructured":"BANZA, A. P., GON\u00c7ALVES, M. F. Roteiro de hist\u00f3ria da l\u00edngua portuguesa. C\u00e1tedra UNESCO, Universidade de \u00c9vora, 2018, p. 95. Available at: https:\/\/core.ac.uk\/download\/pdf\/154812031.pdf. Accessed on: 22 Jun. 2023."},{"key":"263288","unstructured":"BARBOSA, A. V. Do conhecimento da doen\u00e7a \u00e0 sua nomea\u00e7\u00e3o: uma viagem pelo tratado da conserva\u00e7\u00e3o da sa\u00fade dos povos, de Ant\u00f3nio Ribeiro Sanches. Panace@, v. 21(52), p. 37\u201348, 2020."},{"key":"263289","doi-asserted-by":"crossref","unstructured":"BERBER SARDINHA, T.; BARBARA, L. Freq\u00fc\u00eancia e uso de estrangeirismos ingleses no portugu\u00eas brasileiro: Um estudo baseado em corpus. Revista Brasileira de Lingu\u00edstica Aplicada, v. 5(1), p. 97\u2013114, 2005. DOI https:\/\/doi.org\/10.1590\/S1984-63982005000100006","DOI":"10.1590\/S1984-63982005000100006"},{"key":"263290","unstructured":"BIDERMAN, M. T. C., CARVALHO, C. S., PEDROSO, O. Meu primeiro livro de palavras: um dicion\u00e1rio ilustrado do portugu\u00eas de A a Z. \u00c1tica, 2004."},{"key":"263291","unstructured":"CASELI, H. M., PEREIRA, T. F., SPECIA, L., PARDO, T. A., GASPERIN, C., ALU\u00cdSIO, S. M. Building a brazilian portuguese parallel corpus of original and simplified texts. Advances in Computational Linguistics, Research in Computer Science, v. 41, p. 59\u201370, 2009."},{"key":"263292","unstructured":"CASTRO, I. Introdu\u00e7\u00e3o \u00e0 hist\u00f3ria do portugu\u00eas. Edi\u00e7\u00f5es Colibri, Lisboa, Portugal, 2006."},{"key":"263293","unstructured":"CUNHA, A. L. V. d. Coh-Metrix-Dementia: an\u00e1lise autom\u00e1tica de dist\u00farbios de linguagem nas dem\u00eancias utilizando Processamento de L\u00ednguas Naturais. 2015. Ph.D. thesis, Universidade de S\u00e3o Paulo, 2015."},{"key":"263294","doi-asserted-by":"crossref","unstructured":"DURY, P. ; PICTON, A. Terminologie et diachronie: vers une r\u00e9conciliation th\u00e9orique et m\u00e9thodologique? Revue fran\u00e7aise de linguistique appliqu\u00e9e, v. 14(2), p. 31\u201341, 2009. DOI https:\/\/doi.org\/10.3917\/rfla.142.0031","DOI":"10.3917\/rfla.142.0031"},{"key":"263295","doi-asserted-by":"crossref","unstructured":"FINATTO, M. J. B. Corpus-amostra portugu\u00eas do s\u00e9culo XVIII: textos antigos de medicina em atividades de ensino e pesquisa. Dom\u00ednios de Lingu@gem, Uberl\u00e2ndia 12(1), 2018. DOI https:\/\/doi.org\/10.14393\/DL33-v12n1a2018-15","DOI":"10.14393\/DL33-v12n1a2018-15"},{"key":"263296","unstructured":"FINATTO, M. J. B. Medicina em portugu\u00eas no s\u00e9culo XVIII: desafios da terminologia diacr\u00f4nica no cen\u00e1rio das humanidades digitais. Panace@, v. 21(52), p. 20\u201336, 2020."},{"key":"263297","unstructured":"FINATTO, M. J. B.; QUARESMA, P.; GON\u00c7ALVES, M.F. Portuguese corpora of the 18th century: old medicine texts for teaching and research. In: Proceedings of the Conference on Language Technologies and Digital Humanities. University of Ljubljana, 2018. p. 114\u2013120."},{"key":"263298","doi-asserted-by":"crossref","unstructured":"FURTADO, J. F. Tropical empiricism: making medical knowledge in colonial Brazil. In: Science and empire in the Atlantic world. Routledge, 2008. p. 127\u2013151. DOI https:\/\/doi.org\/10.4324\/9780203933848-8","DOI":"10.4324\/9780203933848-8"},{"key":"263299","unstructured":"GAZZOLA, M., LEAL, S. E., ALUISIO, S. M. Predi\u00e7\u00e3o da complexidade textual de recursos educacionais abertos em portugu\u00eas. In: Proceedings of the Symposium in Information and Human Language Technology - STIL. SBC, 2019."},{"key":"263300","doi-asserted-by":"crossref","unstructured":"GRAESSER, A. C.; MCNAMARA, D. S.; LOUWERSE, M. M.; CAI, Z. Coh-metrix: Analysis of text on cohesion and language. Behavior research methods, instruments, & computers, v. 36(2), p. 193\u2013202, 2004. DOI https:\/\/doi.org\/10.3758\/BF03195564","DOI":"10.3758\/BF03195564"},{"key":"263301","unstructured":"LEAL, S. E.; DURAN, M. S.; SCARTON, C. E.; HARTMANN, N. S.; ALU\u00cdSIO, S. M. NILC-Metrix: assessing the complexity of written and spoken language in Brazilian Portuguese. arXiv preprint, arXiv:2201.03445, 2021."},{"key":"263302","doi-asserted-by":"crossref","unstructured":"LISBOA, J. L.; MIRANDA, T. C.; OLIVAL, F. As Gazetas Manuscritas da Biblioteca P\u00fablica de \u00c9vora. Colibri, CIDEHUS-UE, CHC-UNL, 2002. DOI https:\/\/doi.org\/10.4000\/books.cidehus.3083","DOI":"10.4000\/books.cidehus.3083"},{"key":"263303","doi-asserted-by":"crossref","unstructured":"LOBENSTEIN-REICHMANN, A. Luther\u2019s Contribution as Bible Translator to the German Language. The Bible Translator, v. 73(3), p. 301-334, 2022. DOI https:\/\/doi.org\/10.1177\/20516770221140051","DOI":"10.1177\/20516770221140051"},{"key":"263304","unstructured":"MARTINS, T. B.; GHIRALDELO, C. M.; NUNES, M. D. G. V.; OLIVEIRA JUNIOR, O. N. D. Readability formulas applied to textbooks in Brazilian Portuguese. 1996. Technical report, ICMSC-USP, 1996."},{"key":"263305","unstructured":"MOTTA, E. \u00cdndices de complexidade textual em senten\u00e7as dos juizados especiais c\u00edveis do poder judici\u00e1rio do estado do Rio Grande do Sul. Invent\u00e1rio, v. 1(21), p. 35\u201350, 2018."},{"key":"263306","doi-asserted-by":"crossref","unstructured":"MOTTA, E. Senten\u00e7as judiciais e acessibilidade textual e terminol\u00f3gica. Dom\u00ednios de Lingu@gem, v. 15(3), p. 761\u2013813, 2021. DOI https:\/\/doi.org\/10.14393\/DL47-v15n3a2021-6","DOI":"10.14393\/DL47-v15n3a2021-6"},{"key":"263307","doi-asserted-by":"crossref","unstructured":"PIOTROWSKI, M. Natural language processing for historical texts. Synthesis lectures on human language technologies, v. 5(2), p. 1\u2013157, 2012. DOI https:\/\/doi.org\/10.2200\/S00436ED1V01Y201207HLT017","DOI":"10.2200\/S00436ED1V01Y201207HLT017"},{"key":"263308","doi-asserted-by":"crossref","unstructured":"QUARESMA, P.; FINATTO, M. J. B. Information extraction from historical texts: a case study. In: Proceedings of the Workshop on Digital Humanities and Natural Language Processing (DHandNLP). Co-located with the International Conference on the Computational Processing of Portuguese (PROPOR 2020). \u00c9vora, Portugal, 2020. p. 49\u201356. DOI https:\/\/doi.org\/10.1007\/978-3-030-41505-1","DOI":"10.1007\/978-3-030-41505-1"},{"key":"263309","unstructured":"SANTOS, I.; OLIVAL, F.; SEQUEIRA, O. Excavating the data pit: the Portuguese Parish Memories (1758) as a gold standard. In: Proceedings of the Workshop on Digital Humanities and Natural Language Processing (DHandNLP). Co-located with the International Conference on the Computational Processing of Portuguese (PROPOR 2020). \u00c9vora, Portugal, 2020. p. 69\u201375."},{"key":"263310","doi-asserted-by":"crossref","unstructured":"SANTOS, L. B. D.; DURAN, M. S.; HARTMANN, N. S.; CANDIDO, A.; PAETZOLD, G. H.; ALUISIO, S. M. A lightweight regression method to infer psycholinguistic properties for brazilian portuguese. In: International conference on text, speech, and dialogue. Springer, 2017. p. 281\u2013289. DOI https:\/\/doi.org\/10.1007\/978-3-319-64206-2_32","DOI":"10.1007\/978-3-319-64206-2_32"},{"key":"263311","unstructured":"SANTOS, R.; PEDRO, G.; LEAL, S.; VALE, O.; PARDO, T.; BONTCHEVA, K.; SCARTON, C. Measuring the impact of readability features in fake news detection. In: Proceedings of the 12th language resources and evaluation conference, 2020. p. 1404\u20131413."},{"key":"263312","unstructured":"SEMEDO, J.C. Observa\u00e7oens medicas doutrinaes de cem casos gravissimos, que em servi\u00e7o da patria, & das na\u00e7\u00f5es estranhas escreve em lingua Portugueza, & Latina Joam Curvo Semmedo. Officina de Antonio Pedrozo Galram, Lisboa, Portugal, 1707."},{"key":"263313","doi-asserted-by":"crossref","unstructured":"SOUSA, M. C. P. d. O Corpus Tycho Brahe: contribui\u00e7\u00f5es para as humanidades digitais no Brasil. Filologia e lingu\u00edstica portuguesa, v. 16(esp.), p. 53\u201393, 2014. DOI https:\/\/doi.org\/10.11606\/issn.2176-9419.v16ispep53-93","DOI":"10.11606\/issn.2176-9419.v16ispep53-93"},{"key":"263314","unstructured":"VERDELHO, T. Terminologias na l\u00edngua portuguesa: perspectiva diacr\u00f3nica. 1998. Available at: http:\/\/clp.dlc.ua.pt\/Publicacoes\/Terminologias_lingua_portuguesa.pdf. Accessed on: 22 Jun. 2023."},{"key":"263315","unstructured":"WAGNER FILHO, J. A.; WILKENS, R.; IDIART, M.; VILLAVICENCIO, A. The brWaC corpus: a new open resource for Brazilian Portuguese. In: Proceedings of the eleventh international conference on language resources and evaluation (LREC 2018). 2018."}],"container-title":["Dom\u00ednios de Lingu@gem"],"original-title":[],"link":[{"URL":"https:\/\/seer.ufu.br\/index.php\/dominiosdelinguagem\/article\/download\/69775\/37166","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/seer.ufu.br\/index.php\/dominiosdelinguagem\/article\/download\/69775\/38362","content-type":"text\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/seer.ufu.br\/index.php\/dominiosdelinguagem\/article\/download\/69775\/37166","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,2,11]],"date-time":"2025-02-11T14:20:20Z","timestamp":1739283620000},"score":1,"resource":{"primary":{"URL":"https:\/\/seer.ufu.br\/index.php\/dominiosdelinguagem\/article\/view\/69775"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,11,15]]},"references-count":30,"URL":"https:\/\/doi.org\/10.14393\/dlv17a2023-53","relation":{},"ISSN":["1980-5799"],"issn-type":[{"type":"electronic","value":"1980-5799"}],"subject":[],"published":{"date-parts":[[2023,11,15]]}}}