{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,22]],"date-time":"2025-10-22T20:34:55Z","timestamp":1761165295249,"version":"build-2065373602"},"reference-count":13,"publisher":"Sociedade Brasileira de Computa\u00e7\u00e3o - SBC","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"abstract":"<jats:p>Investigamos a efetividade de m\u00e9todos supervisionados de aprendizado de m\u00e1quina na identifica\u00e7\u00e3o de indiv\u00edduos possivelmente n\u00e3o diagnosticados ou com alto risco de desenvolver Diabetes Mellitus (DM) no contexto de operadoras de sa\u00fade suplementar. O cen\u00e1rio \u00e9 desafiador: h\u00e1 apenas dados administrativos indiretos (tipo e frequ\u00eancia de exames), sem resultados cl\u00ednicos, al\u00e9m de baixa separabilidade entre classes e incerteza nos r\u00f3tulos. Avaliamos tr\u00eas classificadores (XGBoost, Random Forest e Regress\u00e3o Log\u00edstica), obtendo desempenho robusto (Macro-F1 de 90,1%). A an\u00e1lise de erros sugere que falsos positivos podem indicar casos ainda n\u00e3o diagnosticados, enquanto falsos negativos podem refletir controle cl\u00ednico inadequado.<\/jats:p>","DOI":"10.5753\/sbbd.2025.247707","type":"proceedings-article","created":{"date-parts":[[2025,10,21]],"date-time":"2025-10-21T19:26:36Z","timestamp":1761074796000},"page":"788-794","source":"Crossref","is-referenced-by-count":0,"title":["Quando os Erros Informam: Apoio ao Diagn\u00f3stico de Diabetes em Cen\u00e1rios de Alta Incerteza"],"prefix":"10.5753","author":[{"given":"Samuel Norberto","family":"Alves","sequence":"first","affiliation":[],"role":[{"role":"author","vocab":"crossref"}]},{"given":"Celso","family":"Fran\u00e7a","sequence":"additional","affiliation":[],"role":[{"role":"author","vocab":"crossref"}]},{"given":"Regina T. I.","family":"Bernal","sequence":"additional","affiliation":[],"role":[{"role":"author","vocab":"crossref"}]},{"given":"Crizian S.","family":"Gomes","sequence":"additional","affiliation":[],"role":[{"role":"author","vocab":"crossref"}]},{"given":"Oluwatoyin Joy","family":"Omole","sequence":"additional","affiliation":[],"role":[{"role":"author","vocab":"crossref"}]},{"given":"Deborah","family":"Malta","sequence":"additional","affiliation":[],"role":[{"role":"author","vocab":"crossref"}]},{"given":"Marcos Andr\u00e9","family":"Gon\u00e7alves","sequence":"additional","affiliation":[],"role":[{"role":"author","vocab":"crossref"}]},{"given":"Jussara M.","family":"Almeida","sequence":"additional","affiliation":[],"role":[{"role":"author","vocab":"crossref"}]}],"member":"3742","published-online":{"date-parts":[[2025,9,29]]},"reference":[{"key":"1","doi-asserted-by":"crossref","unstructured":"Alnowaiser, K. (2024). Improving healthcare prediction of diabetic patients using knn imputed features and tri-ensemble model. IEEE Access, 12:16783\u201316793.","DOI":"10.1109\/ACCESS.2024.3359760"},{"key":"2","unstructured":"ANS (2021). Promo\u00e7\u00e3o da sa\u00fade e preven\u00e7\u00e3o de doen\u00e7as - PROMOPREV - <a href=\"https:\/\/www.gov.br\/ans\/pt-br\/assuntos\/operadoras\/compromissos-e-interacoes-com-a-ans-1\/programas-ans-1\/promoprev\"target=\"_blank\">[link]<\/a>. Atualizado em 06\/06\/2025."},{"key":"3","doi-asserted-by":"crossref","unstructured":"Banday, M. Z., Sameer, A. S., and Nissar, S. (2020). Pathophysiology of diabetes: An overview. Avicenna journal of medicine, 10(04):174\u2013188.","DOI":"10.4103\/ajm.ajm_53_20"},{"key":"4","doi-asserted-by":"crossref","unstructured":"Cunha, W. et al. (2023). An effective, efficient, and scalable confidence-based instance selection framework for transformer-based text classification. In SIGIR, page 665\u2013674.","DOI":"10.1145\/3539618.3591638"},{"key":"5","doi-asserted-by":"crossref","unstructured":"Cunha, W., Moreo Fern\u00e1ndez, A., Esuli, A., Sebastiani, F., Rocha, L., and Gon\u00e7alves, M. A. (2025). A noise-oriented and redundancy-aware instance selection framework. ACM Trans. Inf. Syst., 43(2).","DOI":"10.1145\/3705000"},{"key":"6","unstructured":"da Cunha Paula, D. J. (2014). An\u00e1lise de custo e efetividade do tratamento de diab\u00e9ticos adultos atendidos no centro hiperdia de juiz de fora, minas gerais. Disserta\u00e7\u00e3o de mestrado, Universidade Federal de Juiz de Fora, Juiz de Fora, MG, Brasil. Aprovado em 17 de fevereiro de 2014."},{"key":"7","doi-asserted-by":"crossref","unstructured":"Dinh, A., Miertschin, S., Young, A., and Mohanty, S. D. (2019). A data-driven approach to predicting diabetes and cardiovascular disease with machine learning. BMC Medical Informatics and Decision Making, 19(1):211.","DOI":"10.1186\/s12911-019-0918-5"},{"key":"8","doi-asserted-by":"crossref","unstructured":"Ferreira, T., Fran\u00e7a, C., A. Gon\u00e7alves, M., Pagano, A., et al. (2021). Evaluating recognizing question entailment methods for a Portuguese community question-answering system about diabetes mellitus. In Proc. Int\u2019l Conf. on Recent Advances in Natural Language Processing.","DOI":"10.26615\/978-954-452-072-4_028"},{"key":"9","doi-asserted-by":"crossref","unstructured":"Fran\u00e7a, C., Lima, R. C., Andrade, C., Cunha, W., de Melo, P. O. V., Ribeiro-Neto, B., Rocha, L., Santos, R. L., Pagano, A. S., and Gon\u00e7alves, M. A. (2024). On representation learning-based methods for effective, efficient, and scalable code retrieval. Neurocomputing, 600:128172.","DOI":"10.1016\/j.neucom.2024.128172"},{"key":"10","doi-asserted-by":"crossref","unstructured":"Glechner, A., Keuchel, L., Affengruber, L., Titscher, V., Sommer, I., Matyas, N., Wagner, G., Kien, C., Klerings, I., and Gartlehner, G. (2018). Effects of lifestyle changes on adults with prediabetes: A systematic review and meta-analysis. Primary care diabetes, 12(5):393\u2013408.","DOI":"10.1016\/j.pcd.2018.07.003"},{"key":"11","doi-asserted-by":"crossref","unstructured":"Kiran, M., Xie, Y., Anjum, N., Ball, G., Pierscionek, B., and Russell, D. (2025). Machine learning and artificial intelligence in type 2 diabetes prediction: a comprehensive 33-year bibliometric and literature analysis. Frontiers in Digital Health, 7:1557467.","DOI":"10.3389\/fdgth.2025.1557467"},{"key":"12","doi-asserted-by":"crossref","unstructured":"Sledzik, R. and Zabihimayvan, M. (2022). Focal loss improves performance of high-sensitivity c-reactive protein imbalanced classification. In 2022 IEEE 35th International Symposium on Computer-Based Medical Systems (CBMS), pages 114\u2013118.","DOI":"10.1109\/CBMS55023.2022.00027"},{"key":"13","doi-asserted-by":"crossref","unstructured":"Tuppad, A. and Devi Patil, S. (2024). An efficient classification framework for type 2 diabetes incorporating feature interactions. Expert Systems with Applications, 239:122138.","DOI":"10.1016\/j.eswa.2023.122138"}],"event":{"name":"Simp\u00f3sio Brasileiro de Banco de Dados","number":"40","location":"Brasil","acronym":"SBBD 2025"},"container-title":["Anais do XL Simp\u00f3sio Brasileiro de Banco de Dados (SBBD 2025)"],"original-title":[],"link":[{"URL":"https:\/\/sol.sbc.org.br\/index.php\/sbbd\/article\/download\/37285\/37068","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/sol.sbc.org.br\/index.php\/sbbd\/article\/download\/37285\/37068","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,21]],"date-time":"2025-10-21T19:28:04Z","timestamp":1761074884000},"score":1,"resource":{"primary":{"URL":"https:\/\/sol.sbc.org.br\/index.php\/sbbd\/article\/view\/37285"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,9,29]]},"references-count":13,"URL":"https:\/\/doi.org\/10.5753\/sbbd.2025.247707","relation":{},"subject":[],"published":{"date-parts":[[2025,9,29]]}}}