{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,7]],"date-time":"2026-01-07T07:48:07Z","timestamp":1767772087238,"version":"build-2065373602"},"reference-count":20,"publisher":"MDPI AG","issue":"7","license":[{"start":{"date-parts":[[2025,6,24]],"date-time":"2025-06-24T00:00:00Z","timestamp":1750723200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"5 \u00d7 1000 research funding program"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Data"],"abstract":"<jats:p>Research of Inflammatory Bowel Disease (IBD) involves integrating diverse and heterogeneous data sources, from clinical records to imaging and laboratory results, which presents significant challenges in data harmonization and exploration. These challenges are also reflected in the development of machine-learning applications, where inconsistencies in data quality, missing information, and variability in data formats can adversely affect the performance and generalizability of models. In this study, we describe the collection and curation of a comprehensive dataset focused on IBD. In addition, we present a dedicated research platform. We focus on ethical standards, data protection, and seamless integration of different data types. We also discuss the challenges encountered, as well as the insights gained during its implementation.<\/jats:p>","DOI":"10.3390\/data10070100","type":"journal-article","created":{"date-parts":[[2025,6,24]],"date-time":"2025-06-24T08:50:57Z","timestamp":1750755057000},"page":"100","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["Collecting and Analyzing IBD Clinical Data for Machine-Learning: Insights from an Italian Cohort"],"prefix":"10.3390","volume":"10","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-9651-7156","authenticated-orcid":false,"given":"Aldo","family":"Marzullo","sequence":"first","affiliation":[{"name":"IRCCS Humanitas Research Hospital-Via Manzoni 56, Rozzano, 20089 Milan, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3376-8740","authenticated-orcid":false,"given":"Victor","family":"Savevski","sequence":"additional","affiliation":[{"name":"IRCCS Humanitas Research Hospital-Via Manzoni 56, Rozzano, 20089 Milan, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0009-9084-6504","authenticated-orcid":false,"given":"Maddalena","family":"Menini","sequence":"additional","affiliation":[{"name":"IRCCS Humanitas Research Hospital-Via Manzoni 56, Rozzano, 20089 Milan, Italy"},{"name":"Department of Biomedical Sciences, Humanitas University, Via Rita Levi Montalcini 4, Pieve Emanuele, 20072 Milan, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alessandro","family":"Schilir\u00f2","sequence":"additional","affiliation":[{"name":"IRCCS Humanitas Research Hospital-Via Manzoni 56, Rozzano, 20089 Milan, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3440-4981","authenticated-orcid":false,"given":"Gianluca","family":"Franchellucci","sequence":"additional","affiliation":[{"name":"IRCCS Humanitas Research Hospital-Via Manzoni 56, Rozzano, 20089 Milan, Italy"},{"name":"Department of Biomedical Sciences, Humanitas University, Via Rita Levi Montalcini 4, Pieve Emanuele, 20072 Milan, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8543-4355","authenticated-orcid":false,"given":"Arianna","family":"Dal Buono","sequence":"additional","affiliation":[{"name":"IRCCS Humanitas Research Hospital-Via Manzoni 56, Rozzano, 20089 Milan, Italy"},{"name":"Department of Biomedical Sciences, Humanitas University, Via Rita Levi Montalcini 4, Pieve Emanuele, 20072 Milan, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0076-8549","authenticated-orcid":false,"given":"Cristina","family":"Bezzio","sequence":"additional","affiliation":[{"name":"IRCCS Humanitas Research Hospital-Via Manzoni 56, Rozzano, 20089 Milan, Italy"},{"name":"Department of Biomedical Sciences, Humanitas University, Via Rita Levi Montalcini 4, Pieve Emanuele, 20072 Milan, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7277-1053","authenticated-orcid":false,"given":"Roberto","family":"Gabbiadini","sequence":"additional","affiliation":[{"name":"IRCCS Humanitas Research Hospital-Via Manzoni 56, Rozzano, 20089 Milan, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7167-1459","authenticated-orcid":false,"given":"Cesare","family":"Hassan","sequence":"additional","affiliation":[{"name":"IRCCS Humanitas Research Hospital-Via Manzoni 56, Rozzano, 20089 Milan, Italy"},{"name":"Department of Biomedical Sciences, Humanitas University, Via Rita Levi Montalcini 4, Pieve Emanuele, 20072 Milan, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1621-6450","authenticated-orcid":false,"given":"Alessandro","family":"Repici","sequence":"additional","affiliation":[{"name":"IRCCS Humanitas Research Hospital-Via Manzoni 56, Rozzano, 20089 Milan, Italy"},{"name":"Department of Biomedical Sciences, Humanitas University, Via Rita Levi Montalcini 4, Pieve Emanuele, 20072 Milan, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1572-0118","authenticated-orcid":false,"given":"Alessandro","family":"Armuzzi","sequence":"additional","affiliation":[{"name":"IRCCS Humanitas Research Hospital-Via Manzoni 56, Rozzano, 20089 Milan, Italy"},{"name":"Department of Biomedical Sciences, Humanitas University, Via Rita Levi Montalcini 4, Pieve Emanuele, 20072 Milan, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2025,6,24]]},"reference":[{"key":"ref_1","first-page":"743","article-title":"Basic research in endoscopy","volume":"31","author":"Kozarek","year":"1999","journal-title":"Ital. J. Gastroenterol. Hepatol."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"1520","DOI":"10.1136\/gutjnl-2019-320065","article-title":"Big data in IBD: Big progress for clinical practice","volume":"69","author":"Tabib","year":"2020","journal-title":"Gut"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"260","DOI":"10.4338\/ACI-2015-09-RA-0125","article-title":"Integrating heterogeneous biomedical data for cancer research: The CARPEM infrastructure","volume":"7","author":"Rance","year":"2016","journal-title":"Appl. Clin. Inform."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Lidstr\u00f6mer, N., and Ashrafian, H. (2021). AIM in Endoscopy Procedures. Artificial Intelligence in Medicine, Springer.","DOI":"10.1007\/978-3-030-58080-3"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"1920","DOI":"10.3748\/wjg.v27.i17.1920","article-title":"Artificial intelligence applications in inflammatory bowel disease: Emerging technologies and future directions","volume":"27","author":"Gubatan","year":"2021","journal-title":"World J. Gastroenterol."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Pogorelov, K., Randel, K.R., Griwodz, C., Eskeland, S.L., de Lange, T., Johansen, D., Spampinato, C., Dang-Nguyen, D.T., Lux, M., and Schmidt, P.T. (2017, January 20\u201323). Kvasir: A multi-class image dataset for computer aided gastrointestinal disease detection. Proceedings of the 8th ACM on Multimedia Systems Conference, Taipei, Taiwan.","DOI":"10.1145\/3083187.3083212"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"2","DOI":"10.1016\/j.gie.2023.12.003","article-title":"Consensus statements on the current landscape of artificial intelligence applications in endoscopy, addressing roadblocks, and advancing artificial intelligence in gastroenterology","volume":"101","author":"Parasa","year":"2024","journal-title":"Gastrointest. Endosc."},{"key":"ref_8","first-page":"2016","article-title":"Regulation (EU) 2016\/679 of the European Parliament and of the Council","volume":"679","author":"Regulation","year":"2016","journal-title":"Regulation"},{"key":"ref_9","unstructured":"(2025, January 07). Real-World Data for the Life Sciences and Healthcare | TriNetX\u2014trinetx.com. Available online: https:\/\/trinetx.com\/."},{"key":"ref_10","unstructured":"(2025, January 07). Gut Reaction. Available online: https:\/\/bioresource.nihr.ac.uk\/centres-programmes\/ibd-bioresource\/gut-reaction\/."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"761","DOI":"10.1038\/s41597-024-03592-7","article-title":"Histopathology imaging and clinical data including remission status in pediatric inflammatory bowel disease","volume":"11","author":"Nael","year":"2024","journal-title":"Sci. Data"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Klumpp, M., Hintze, M., Immonen, M., R\u00f3denas-Rigla, F., Pilati, F., Aparicio-Mart\u00ednez, F., \u00c7elebi, D., Liebig, T., Jirstrand, M., and Urbann, O. (2021). Artificial intelligence for hospital health care: Application cases and answers to challenges in European hospitals. Healthcare, 9.","DOI":"10.3390\/healthcare9080961"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"105304","DOI":"10.1016\/j.ijmedinf.2023.105304","article-title":"Smart hospital definition: Academic and industrial perspective","volume":"182","author":"Rajaei","year":"2024","journal-title":"Int. J. Med. Inform."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Khalid, N., Qayyum, A., Bilal, M., Al-Fuqaha, A., and Qadir, J. (2023). Privacy-preserving artificial intelligence in healthcare: Techniques and applications. Comput. Biol. Med., 158.","DOI":"10.1016\/j.compbiomed.2023.106848"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Williamson, S.M., and Prybutok, V. (2024). Balancing privacy and progress: A review of privacy challenges, systemic oversight, and patient perceptions in AI-driven healthcare. Appl. Sci., 14.","DOI":"10.3390\/app14020675"},{"key":"ref_16","unstructured":"Bartoletti, I. (2019). AI in healthcare: Ethical and privacy challenges. Proceedings of the Artificial Intelligence in Medicine: 17th Conference on Artificial Intelligence in Medicine, AIME 2019, Poznan, Poland, 26\u201329 June 2019, Springer. Proceedings 17."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"100162","DOI":"10.1016\/j.fhj.2024.100162","article-title":"How should we train clinicians for artificial intelligence in healthcare?","volume":"11","author":"Misra","year":"2024","journal-title":"Future Healthc. J."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"1701","DOI":"10.1001\/jama.2024.21772","article-title":"Translating AI for the Clinician","volume":"332","author":"Patel","year":"2024","journal-title":"JAMA"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"257","DOI":"10.1038\/s41746-024-01233-2","article-title":"Privacy-preserving large language models for structured medical information retrieval","volume":"7","author":"Wiest","year":"2024","journal-title":"NPJ Digit. Med."},{"key":"ref_20","unstructured":"Compagnucci, M.C., Wilson, M.L., Fenwick, M., Forg\u00f3, N., and B\u00e4rnighausen, T. (2022). AI in EHealth: Human Autonomy, Data Governance and Privacy in Healthcare, Cambridge University Press."}],"container-title":["Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2306-5729\/10\/7\/100\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,9]],"date-time":"2025-10-09T17:57:49Z","timestamp":1760032669000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2306-5729\/10\/7\/100"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,6,24]]},"references-count":20,"journal-issue":{"issue":"7","published-online":{"date-parts":[[2025,7]]}},"alternative-id":["data10070100"],"URL":"https:\/\/doi.org\/10.3390\/data10070100","relation":{},"ISSN":["2306-5729"],"issn-type":[{"type":"electronic","value":"2306-5729"}],"subject":[],"published":{"date-parts":[[2025,6,24]]}}}