{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,11]],"date-time":"2026-03-11T05:40:36Z","timestamp":1773207636033,"version":"3.50.1"},"reference-count":59,"publisher":"MDPI AG","issue":"7","license":[{"start":{"date-parts":[[2025,7,1]],"date-time":"2025-07-01T00:00:00Z","timestamp":1751328000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Data"],"abstract":"<jats:p>Legislative documents are crucial to democratic societies, defining the legal framework for social life. In Brazil, legislative texts are particularly complex due to extensive technical jargon, intricate sentence structures, and frequent references to prior legislation. The country\u2019s civil law tradition and multicultural context introduce further interpretative and linguistic challenges. Moreover, the study of Brazilian Portuguese legislative texts remains underexplored, lacking legal-specific models and datasets. To address these gaps, this work proposes a data-driven approach utilizing large language models (LLMs) to analyze these documents and extract knowledge graphs (KGs). A case study was conducted using 1869proposals from the Legislative Assembly of Rio Grande do Norte (ALRN), spanning January 2019 to April 2024. The Llama 3.2 3B Instruct model was employed to extract KGs representing entities and their relationships. The findings support the method\u2019s effectiveness in producing coherent graphs faithful to the original content. Nevertheless, challenges remain in resolving entity ambiguity and achieving full relationship coverage. Additionally, readability analyses using metrics for Brazilian Portuguese revealed that ALRN proposals require superior reading skills due to their technical style. Ultimately, this study advances legal artificial intelligence by providing insights into Brazilian legislative texts and promoting transparency and accessibility through natural language processing techniques.<\/jats:p>","DOI":"10.3390\/data10070106","type":"journal-article","created":{"date-parts":[[2025,7,1]],"date-time":"2025-07-01T04:04:22Z","timestamp":1751342662000},"page":"106","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Exploring Legislative Textual Data in Brazilian Portuguese: Readability Analysis and Knowledge Graph Generation"],"prefix":"10.3390","volume":"10","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6037-9660","authenticated-orcid":false,"given":"Gisliany Lillian Alves de","family":"Oliveira","sequence":"first","affiliation":[{"name":"UFRN-PPgEEC, Postgraduate Program in Electrical and Computer Engineering, Federal University of Rio Grande do Norte, Natal 59078-970, Brazil"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8790-2546","authenticated-orcid":false,"given":"Breno Santana","family":"Santos","sequence":"additional","affiliation":[{"name":"Information System Department, Federal University of Sergipe, Itabaiana 49400-000, Brazil"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8277-7571","authenticated-orcid":false,"given":"Marianne","family":"Silva","sequence":"additional","affiliation":[{"name":"Campus Arapiraca, Federal University of Alagoas, Penedo 57200-000, Brazil"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0116-6489","authenticated-orcid":false,"given":"Ivanovitch","family":"Silva","sequence":"additional","affiliation":[{"name":"UFRN-PPgEEC, Postgraduate Program in Electrical and Computer Engineering, Federal University of Rio Grande do Norte, Natal 59078-970, Brazil"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2025,7,1]]},"reference":[{"key":"ref_1","unstructured":"Chamber of Deputies, Brazil (2025, January 02). The Legislative Branch. Available online: https:\/\/www2.camara.leg.br\/english\/papellegislativo.html."},{"key":"ref_2","unstructured":"Federal Senate, Brazil (2025, January 02). Legislative Documents and Public Access. Available online: https:\/\/www12.senado.leg.br\/institucional\/carta-de-servicos\/en\/carta-de-servicos."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Anh, D.H., Do, D.T., Tran, V., and Minh, N.L. (2023, January 18\u201320). The Impact of Large Language Modeling on Natural Language Processing in Legal Texts: A Comprehensive Survey. Proceedings of the 15th International Conference on Knowledge and Systems Engineering (KSE), Hanoi, Vietnam.","DOI":"10.1109\/KSE59128.2023.10299488"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Alves, A., Miranda, P., Mello, R., and Nascimento, A. (2023). Automatic Simplification of Legal Texts in Portuguese Using Machine Learning. Legal Knowledge and Information Systems, IOS Press.","DOI":"10.3233\/FAIA230975"},{"key":"ref_5","first-page":"171","article-title":"Named Entity Recognition: A Survey for the Portuguese Language","volume":"70","author":"Albuquerque","year":"2023","journal-title":"Proces. Leng. Nat."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Moreira Valle, L., Giacomazzi Dantas, S., Guerreiro e Silva, D., Silva Dias, U., and Monteiro Monasterio, L. (2022). RegBR: A novel Brazilian government framework to classify and analyze industry-specific regulations. PLoS ONE, 17.","DOI":"10.1371\/journal.pone.0275282"},{"key":"ref_7","unstructured":"Fitsilis, F., and Mikros, G. (2022). Smart Parliaments: Data-Driven Democracy, European Liberal Forum."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"181","DOI":"10.1016\/j.aiopen.2024.09.002","article-title":"Large language models in law: A survey","volume":"5","author":"Lai","year":"2024","journal-title":"AI Open"},{"key":"ref_9","unstructured":"Negro, A. (2021). Graph-Powered Machine Learning, Manning Publications Co."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Schneider, P., Schopf, T., Vladika, J., Galkin, M., Simperl, E., and Matthes, F. (2022, January 20\u201323). A Decade of Knowledge Graphs in Natural Language Processing: A Survey. Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), Online only.","DOI":"10.18653\/v1\/2022.aacl-main.46"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1561\/2200000096","article-title":"Graph Neural Networks for Natural Language Processing: A Survey","volume":"16","author":"Wu","year":"2023","journal-title":"Found. Trends Mach. Learn."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"494","DOI":"10.1109\/TNNLS.2021.3070843","article-title":"A Survey on Knowledge Graphs: Representation, Acquisition, and Applications","volume":"33","author":"Ji","year":"2022","journal-title":"IEEE Trans. Neural Netw. Learn. Syst."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"870","DOI":"10.1016\/j.procir.2024.07.069","article-title":"A survey of LLM-augmented knowledge graph construction and application in complex product design","volume":"128","author":"Liang","year":"2024","journal-title":"Procedia CIRP"},{"key":"ref_14","unstructured":"Alves, G., Santos, B.S., Silva, M., and Silva, I. (2025). Brazilian Portuguese Legislative Documents: A Dataset from the Legislative Assembly of Rio Grande do Norte, Universidade Federal do Rio Grande do Norte. Mendeley Data, Version 1."},{"key":"ref_15","unstructured":"Palmirani, M., Vitali, F., Van Puymbroeck, W., and Nubla Durango, F. (2022). Legal Drafting in the Era of Artificial Intelligence and Digitisation, European Commission."},{"key":"ref_16","unstructured":"Souza, E., Moriyama, G., Vit\u00f3rio, D., Carvalho, A.C.P.L.F.d., F\u00e9lix, N., Albuquerque, H.O., and Oliveira, A.L.I. (December, January 29). Assessing the Impact of Stemming Algorithms Applied to Brazilian Legislative Documents Retrieval. Proceedings of the Anais do Simp\u00f3sio Brasileiro de Tecnologia da Informa\u00e7\u00e3o e da Linguagem Humana (STIL), Online."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Albuquerque, H.O., Costa, R., Silvestre, G., Souza, E., da Silva, N.F.F., Vit\u00f3rio, D., Moriyama, G., Martins, L., Soezima, L., and Nunes, A. (2022, January 21\u201323). UlyssesNER-Br: A Corpus of Brazilian Legislative Documents for Named Entity Recognition. Proceedings of the Computational Processing of the Portuguese Language, Fortaleza, Brazil.","DOI":"10.1007\/978-3-030-98305-5_1"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Rocha, F.C., Souza, E., Vit\u00f3rio, D., Silva, N.F.F.d., Carvalho, A.C.P.L.F.d., and Oliveira, A.L.I. (2023, January 6\u201311). Avalia\u00e7\u00e3o de frameworks para Recupera\u00e7\u00e3o de Documentos Legislativos: Um Estudo de Caso na C\u00e2mara dos Deputados Brasileira. Proceedings of the Anais do Workshop de Computa\u00e7\u00e3o Aplicada em Governo Eletr\u00f4nico (WCGE), Jo\u00e3o Pessoa, Brazil.","DOI":"10.5753\/wcge.2023.229925"},{"key":"ref_19","unstructured":"Schweighofer, E. (2021). An Information Retrieval Pipeline for Legislative Documents from the Brazilian Chamber of Deputies. Legal Knowledge and Information Systems, IOS Press. Frontiers in Artificial Intelligence and Applications."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Vit\u00f3rio, D., Souza, E., Martins, L., da Silva, N.F.F., de Leon Ferreira de Carvalho, A.C.P., and Oliveira, A.L.I. (2022). Ulysses-RFSQ: A Novel Method to Improve Legal Information Retrieval Based on Relevance Feedback. Intelligent Systems, Proceedings of the 11th Brazilian Conference, Campinas, Brazil, 28 November\u20131 December 2022, Springer. Lecture Notes in Computer Science.","DOI":"10.1007\/978-3-031-21686-2_6"},{"key":"ref_21","first-page":"1257","article-title":"Building a Relevance Feedback Corpus for Legal Information Retrieval in the Real-Case Scenario of the Brazilian Chamber of Deputies","volume":"59","author":"Souza","year":"2024","journal-title":"Lang. Resour. Eval."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Maia, D.F., Silva, N.F.F., Souza, E.P.R., Nunes, A.S., Proc\u00f3pio, L.C., Sampaio, G.d.S., Dias, M.d.S., Alves, A.O., Maia, D.F., and Ribeiro, I.A. (2022). UlyssesSD-Br: Stance Detection in Brazilian Political Polls. Progress in Artificial Intelligence, Proceedings of the 21st EPIA Conference on Artificial Intelligence, Lisbon, Portugal, 31 August\u20132 September 2022, Springer.","DOI":"10.1007\/978-3-031-16474-3_8"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Silva, N.F.F.d., Silva, M.C.R., Pereira, F.S.F., Tarrega, J.P.M., Beinotti, J.V.P., Fonseca, M., Andrade, F.E.d., and de Carvalho, A.C.P.d.L.F. (December, January 29). Evaluating Topic Models in Portuguese Political Comments About Bills from Brazil\u2019s Chamber of Deputies. Proceedings of the Intelligent Systems: 10th Brazilian Conference, BRACIS 2021, Virtual Event.","DOI":"10.1007\/978-3-030-91699-2_8"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Cifuentes-Silva, F., and Labra Gayo, J.E. (2019, January 2\u20136). Legislative Document Content Extraction Based on Semantic Web Technologies. Proceedings of the Semantic Web (ESWC 2019), 16th International Conference, Portoro\u017e, Slovenia.","DOI":"10.1007\/978-3-030-21348-0_36"},{"key":"ref_25","unstructured":"Colombo, A., Bernasconi, A., and Ceri, S. (2024). Modelling Legislative Systems into Property Graphs to Enable Advanced Pattern Detection. arXiv."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"667","DOI":"10.1007\/s10506-023-09364-9","article-title":"A RDF-based graph to representing and searching parts of legal documents","volume":"32","author":"Oliveira","year":"2023","journal-title":"Artif. Intell. Law"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Bianchini, F., Calamo, M., De Luzi, F., Macr\u00ec, M., and Mecella, M. (2025). A Service-Based Pipeline for Complex Linguistic Tasks Adopting LLMs and Knowledge Graphs. Service-Oriented Computing, Proceedings of the 18th Symposium and Summer School, SummerSOC 2024, Crete, Greece, 24\u201329 June 2024, Springer.","DOI":"10.1007\/978-3-031-72578-4_8"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Colombo, A. (2024, January 21\u201325). Leveraging Knowledge Graphs and LLMs to Support and Monitor Legislative Systems. Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, Boise, ID, USA.","DOI":"10.1145\/3627673.3680268"},{"key":"ref_29","unstructured":"Gao, S., Li, Y., Ge, F., Lin, M., Yu, H., Wang, S., and Miao, Z. (July, January 30). LeGalFormer: A Graph Representation Learning and Transformer-based Approach for Legal Similar Case Retrieval. Proceedings of the International Joint Conference on Neural Networks (IJCNN), Yokohama, Japan."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Li, J., Qian, L., Liu, P., and Liu, T. (2024). Construction of Legal Knowledge Graph Based on Knowledge-Enhanced Large Language Models. Information, 15.","DOI":"10.3390\/info15110666"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Shi, J., Guo, Q., Liao, Y., Wang, Y., Chen, S., and Liang, S. (2024, January 5\u20138). Legal-LM: Knowledge Graph Enhanced Large Language Models for Law Consulting. Proceedings of the Advanced Intelligent Computing Technology and Applications. Springer Nature Singapore, Tianjin, China.","DOI":"10.1007\/978-981-97-5672-8_15"},{"key":"ref_32","unstructured":"Speer, R. (2019). ftfy, version 5.5, Zenodo."},{"key":"ref_33","unstructured":"Bird, S., Klein, E., and Loper, E. (2009). Natural Language Processing with Python, O\u2019Reilly Media, Inc."},{"key":"ref_34","unstructured":"Keraghel, I., Morbieu, S., and Nadif, M. (2024). Recent Advances in Named Entity Recognition: A Comprehensive Survey and Comparative Study. arXiv."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Zhang, L., Sun, X., Ma, X., and Hu, K. (2024). A New Entity Relationship Extraction Method for Semi-Structured Patent Documents. Electronics, 13.","DOI":"10.3390\/electronics13163144"},{"key":"ref_36","unstructured":"Bratani\u010d, T. (2023). Graph Algorithms for Data Science, Manning Publications Co."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"58","DOI":"10.1007\/s11280-024-01297-w","article-title":"LLMs for knowledge graph construction and reasoning: Recent capabilities and future opportunities","volume":"27","author":"Zhu","year":"2024","journal-title":"World Wide Web"},{"key":"ref_38","unstructured":"Negro, A., Kus, V., Futia, G., and Montagna, F. (2025). Knowledge Graphs and LLMs in Action, Manning Publications Co."},{"key":"ref_39","unstructured":"Rao, P.J., Rao, K.N., Gokuruboyina, S., and Neeraja, K. (2023, January 28\u201329). An Efficient Methodology for Identifying the Similarity Between Languages with Levenshtein Distance. Proceedings of the 6th International Conference on Communications and Cyber Physical Engineering, Hyderabad, India."},{"key":"ref_40","unstructured":"Santos, B.S., Silva, I., and Melo, E. (2021, January 17\u201319). Metodologia orientada a ci\u00eancia de dados em grafos para avalia\u00e7\u00e3o de PPGs. Proceedings of the XV Simp\u00f3sio Brasileiro de Automa\u00e7\u00e3o Inteligente (SBAI 2021), Rio Grande, Rio Grande do Sul, Brazil."},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Santos, B.S., Silva, I., and Costa, D.G. (2023). Symmetry in Scientific Collaboration Networks: A Study Using Temporal Graph Data Science and Scientometrics. Symmetry, 15.","DOI":"10.3390\/sym15030601"},{"key":"ref_42","unstructured":"Legislative Assembly of Rio Grande do Norte - ALRN (2024, December 20). Unale 2024: Director of Technology Management Presents Advances in Artificial Intelligence. Available online: https:\/\/www.al.rn.leg.br\/noticia\/31558\/unale-2024-diretor-de-gestao-tecnologica-apresenta-avancos-em-inteligencia-artificial."},{"key":"ref_43","unstructured":"Meta AI (2024, December 16). Llama 3.2 Model Card. Available online: https:\/\/huggingface.co\/meta-llama\/Llama-3.2-3B-Instruct."},{"key":"ref_44","unstructured":"Meta AI (2024, December 16). Llama 3.2: Advancing AI for Vision and Language at the Edge and Beyond. Available online: https:\/\/ai.meta.com\/blog\/llama-3-2-connect-2024-vision-edge-mobile-devices\/."},{"key":"ref_45","unstructured":"Robinson, I., Webber, J., and Eifrem, E. (2015). Graph Databases: New Opportunities for Connected Data, O\u2019Reilly Media, Inc.. [2nd ed.]."},{"key":"ref_46","unstructured":"Anthapu, R. (2022). Graph Data Processing with Cypher, Packt Publishing."},{"key":"ref_47","unstructured":"Scifo, E. (2023). Graph Data Science with Neo4j, Packt Publishing."},{"key":"ref_48","unstructured":"Martins, T.B.F., Ghiraldelo, C.M., Nunes, M.d.G.V., and Oliveira J\u00fanior, O.N.d. (1996). Readability formulas applied to textbooks in brazilian portuguese. Notas do ICMSC, ICMSC-USP. S\u00e9rie Computa\u00e7\u00e3o."},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"73","DOI":"10.1007\/s10579-023-09693-w","article-title":"NILC-Metrix: Assessing the complexity of written and spoken language in Brazilian Portuguese","volume":"58","author":"Leal","year":"2024","journal-title":"Lang. Resour. Eval."},{"key":"ref_50","unstructured":"Biderman, M.T.C. (1998). Dicion\u00e1rio Did\u00e1tico de Portugu\u00eas, Editora \u00e1tica."},{"key":"ref_51","unstructured":"McKinney, W. (2022). Python for Data Analysis: Data Wrangling with Pandas, NumPy, and Jupyter, O\u2019Reilly Media."},{"key":"ref_52","unstructured":"D\u00f6bler, M., and Gro\u03b2mann, T. (2019). Data Visualization with Python, Packt Publishing Ltd."},{"key":"ref_53","unstructured":"Anaconda Inc (2024, December 10). Anaconda: The Data Science Platform. Available online: https:\/\/www.anaconda.com."},{"key":"ref_54","unstructured":"Google Inc (2024, December 10). Google Colab: Hi, This Is the Colaboratory. Available online: https:\/\/colab.research.google.com."},{"key":"ref_55","unstructured":"Tunstall, L., von Werra, L., and Wolf, T. (2022). Natural Language Processing with Transformers, O\u2019Reilly Media, Inc."},{"key":"ref_56","unstructured":"Alves, G., and Silva, I. (2025, January 05). GitHub Repository of This Study. Available online: https:\/\/github.com\/conect2ai\/legislative-texts-rn."},{"key":"ref_57","unstructured":"Brazilian National Congress (2020). Glossary of Legislative Terms, Brazilian National Congress. [2nd ed.]. Available online: https:\/\/www.congressonacional.leg.br\/legislacao-e-publicacoes\/glossario-legislativo."},{"key":"ref_58","unstructured":"Legislative Assembly of the State of S\u00e3o Paulo (2024, December 19). Legislative Process Manual, Available online: https:\/\/www.al.sp.gov.br\/arquivos\/documentacao\/estudos-e-manuais\/manual-processo-legislativo\/manual_proclegis_2.pdf."},{"key":"ref_59","doi-asserted-by":"crossref","first-page":"4310","DOI":"10.1109\/TAI.2024.3439048","article-title":"Editorial: From Explainable Artificial Intelligence (xAI) to Understandable Artificial Intelligence (uAI)","volume":"5","author":"Abbass","year":"2024","journal-title":"IEEE Trans. Artif. Intell."}],"container-title":["Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2306-5729\/10\/7\/106\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,9]],"date-time":"2025-10-09T18:02:07Z","timestamp":1760032927000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2306-5729\/10\/7\/106"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,7,1]]},"references-count":59,"journal-issue":{"issue":"7","published-online":{"date-parts":[[2025,7]]}},"alternative-id":["data10070106"],"URL":"https:\/\/doi.org\/10.3390\/data10070106","relation":{},"ISSN":["2306-5729"],"issn-type":[{"value":"2306-5729","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,7,1]]}}}