{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,24]],"date-time":"2025-09-24T00:15:28Z","timestamp":1758672928102,"version":"3.44.0"},"publisher-location":"California","reference-count":0,"publisher":"International Joint Conferences on Artificial Intelligence Organization","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025,9]]},"abstract":"<jats:p>Electrocardiogram (ECG) is widely used to diagnose cardiac conditions via deep learning methods. Although existing self-supervised learning (SSL) methods have achieved great performance in learning representation for ECG-based cardiac conditions classification, the clinical semantics can not be effectively captured. To overcome this limitation, we proposed to learn cross-modal ECG representations that contain more clinical semantics via a novel framework with \\textbf{D}eep \\textbf{E}CG-\\textbf{R}eport \\textbf{I}nteraction (\\textbf{DERI}). Specifically, we design a novel framework combining multiple alignments and mutual feature reconstructions to learn effective representation of the ECG with the clinical report, which fuses the clinical semantics of the report. An RME-Module inspired by masked modeling is proposed to improve the ECG representation learning. Furthermore, we extend ECG representation learning to report generation with a language model, which is significant for evaluating clinical semantics in the learned representations and even clinical applications. Comprehensive experiments with various settings are conducted on various datasets to show the superior performance of our DERI. Our code is released on https:\/\/github.com\/cccccj-03\/DERI.<\/jats:p>","DOI":"10.24963\/ijcai.2025\/537","type":"proceedings-article","created":{"date-parts":[[2025,9,19]],"date-time":"2025-09-19T08:10:40Z","timestamp":1758269440000},"page":"4824-4832","source":"Crossref","is-referenced-by-count":0,"title":["DERI: Cross-Modal ECG Representation Learning with Deep ECG-Report Interaction"],"prefix":"10.24963","author":[{"given":"Jian","family":"Chen","sequence":"first","affiliation":[{"name":"Shenzhen MSU-BIT University"},{"name":"Sun Yat-sen University"}]},{"given":"Xiaoru","family":"Dong","sequence":"additional","affiliation":[{"name":"The University of Hong Kong"}]},{"given":"Wei","family":"Wang","sequence":"additional","affiliation":[{"name":"Shenzhen MSU-BIT University"},{"name":"Beijing Institute of Technology"}]},{"given":"Shaorui","family":"Zhou","sequence":"additional","affiliation":[{"name":"Sun Yat-sen university"}]},{"given":"Lequan","family":"Yu","sequence":"additional","affiliation":[{"name":"The University of Hong Kong"}]},{"given":"Xiping","family":"Hu","sequence":"additional","affiliation":[{"name":"Shenzhen MSU-BIT University"},{"name":"Beijing Institute of Technology"}]}],"member":"10584","event":{"number":"34","sponsor":["International Joint Conferences on Artificial Intelligence Organization (IJCAI)"],"acronym":"IJCAI-2025","name":"Thirty-Fourth International Joint Conference on Artificial Intelligence {IJCAI-25}","start":{"date-parts":[[2025,8,16]]},"theme":"Artificial Intelligence","location":"Montreal, Canada","end":{"date-parts":[[2025,8,22]]}},"container-title":["Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence"],"original-title":[],"deposited":{"date-parts":[[2025,9,23]],"date-time":"2025-09-23T11:34:18Z","timestamp":1758627258000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.ijcai.org\/proceedings\/2025\/537"}},"subtitle":[],"proceedings-subject":"Artificial Intelligence Research Articles","short-title":[],"issued":{"date-parts":[[2025,9]]},"references-count":0,"URL":"https:\/\/doi.org\/10.24963\/ijcai.2025\/537","relation":{},"subject":[],"published":{"date-parts":[[2025,9]]}}}