{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,17]],"date-time":"2026-03-17T08:14:50Z","timestamp":1773735290883,"version":"3.50.1"},"reference-count":0,"publisher":"IOS Press","isbn-type":[{"value":"9781643685335","type":"electronic"}],"license":[{"start":{"date-parts":[[2024,8,22]],"date-time":"2024-08-22T00:00:00Z","timestamp":1724284800000},"content-version":"unspecified","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024,8,22]]},"abstract":"<jats:p>With the objective of extracting new knowledge about rare diseases from social media messages, we evaluated three models on a Named Entity Recognition (NER) task, consisting of extracting phenotypes and treatments from social media messages. We trained the three models on a dataset with social media messages about Developmental and Epileptic Encephalopathies and more common diseases. This preliminary study revealed that CamemBERT and CamemBERT-bio exhibit similar performance on social media testimonials, slightly outperforming DrBERT. It also highlighted that their performance was lower on this type of data than on structured health datasets. Limitations, including a narrow focus on NER performance and dataset-specific evaluation, call for further research to fully assess model capabilities on larger and more diverse datasets.<\/jats:p>","DOI":"10.3233\/shti240556","type":"book-chapter","created":{"date-parts":[[2024,8,23]],"date-time":"2024-08-23T09:48:40Z","timestamp":1724406520000},"source":"Crossref","is-referenced-by-count":2,"title":["Evaluation of BERT-Based Models on Patient Data from French Social Media"],"prefix":"10.3233","author":[{"ORCID":"https:\/\/orcid.org\/0009-0004-0958-6949","authenticated-orcid":false,"given":"Emma","family":"Le Priol","sequence":"first","affiliation":[{"name":"HeKA team, Inria, Inserm, France"},{"name":"Universit\u00e9 Paris-Cit\u00e9, France"},{"name":"Kap Code, Paris, France"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8019-6159","authenticated-orcid":false,"given":"Manissa","family":"Talmatkadi","sequence":"additional","affiliation":[{"name":"Kap Code, Paris, France"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2642-7726","authenticated-orcid":false,"given":"St\u00e9phane","family":"Sch\u00fcck","sequence":"additional","affiliation":[{"name":"Kap Code, Paris, France"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6297-1748","authenticated-orcid":false,"given":"Nathalie","family":"Texier","sequence":"additional","affiliation":[{"name":"Kap Code, Paris, France"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6855-4366","authenticated-orcid":false,"given":"Anita","family":"Burgun","sequence":"additional","affiliation":[{"name":"HeKA team, Inria, Inserm, France"},{"name":"Universit\u00e9 Paris-Cit\u00e9, France"},{"name":"Clinical Bioinformatics group, Institut Imagine, France"},{"name":"Department of Medical Informatics, Necker Hospital, AP-HP, France"}]}],"member":"7437","container-title":["Studies in Health Technology and Informatics","Digital Health and Informatics Innovations for Sustainable Health Care Systems"],"original-title":[],"link":[{"URL":"https:\/\/ebooks.iospress.nl\/pdf\/doi\/10.3233\/SHTI240556","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,8,23]],"date-time":"2024-08-23T09:48:41Z","timestamp":1724406521000},"score":1,"resource":{"primary":{"URL":"https:\/\/ebooks.iospress.nl\/doi\/10.3233\/SHTI240556"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,8,22]]},"ISBN":["9781643685335"],"references-count":0,"URL":"https:\/\/doi.org\/10.3233\/shti240556","relation":{},"ISSN":["0926-9630","1879-8365"],"issn-type":[{"value":"0926-9630","type":"print"},{"value":"1879-8365","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,8,22]]}}}