{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,31]],"date-time":"2026-03-31T05:26:57Z","timestamp":1774934817943,"version":"3.50.1"},"reference-count":48,"publisher":"World Scientific Pub Co Pte Ltd","issue":"11","funder":[{"name":"Sichuan University and Yibin Municipal People\u2019s Government University and City strategic cooperation special fund","award":["2020CDYB-29"],"award-info":[{"award-number":["2020CDYB-29"]}]},{"name":"Science and Technology plan transfer payment project of Sichuan province","award":["2021ZYSF007"],"award-info":[{"award-number":["2021ZYSF007"]}]},{"name":"Key Research and Development Program of Science and Technology Department of Sichuan Province","award":["2020YFS0575"],"award-info":[{"award-number":["2020YFS0575"]}]},{"name":"Key Research and Development Program of Science and Technology Department of Sichuan Province","award":["2021KJT0012"],"award-info":[{"award-number":["2021KJT0012"]}]},{"name":"Key Research and Development Program of Science and Technology Department of Sichuan Province","award":["2021YFS0067"],"award-info":[{"award-number":["2021YFS0067"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Neur. Syst."],"published-print":{"date-parts":[[2022,11]]},"abstract":"<jats:p> Depression is a common mental disease that has a tendency to develop at a younger age. Early detection of depression with psychological intervention may effectively prevent youth suicide. The establishment of the computer-aided model may be efficient for early detection. However, the existing methods of automatic detection for depression mostly rely on unimodal data. Clinical research shows that patients with depression have specificity in speech, text, expression, and other modal data. Multimodal machine learning is emerging but not yet widely used for the detection of psychiatric disorders. The problem of existing multimodal detection models is that only global or local information is considered in feature fusion, which leads to the low accuracy of the depression detection model. Therefore, this study constructs an automatic detection model based on multimodal machine learning for adolescent depression. The proposed method first extracted four features from audio and text globally and locally; then construct a coarse-grained fusion model and fine-grained fusion model base on these four features; and fuse the coarse-grained and the fine-grained fusion model finally. Experiments on the real-world dataset demonstrate that the proposed method could improve the accuracy of depression detection automatically. <\/jats:p>","DOI":"10.1142\/s0129065722500459","type":"journal-article","created":{"date-parts":[[2022,7,9]],"date-time":"2022-07-09T03:26:15Z","timestamp":1657337175000},"source":"Crossref","is-referenced-by-count":14,"title":["Adolescent Depression Detection Model Based on Multimodal Data of Interview Audio and Text"],"prefix":"10.1142","volume":"32","author":[{"given":"Lei","family":"Zhang","sequence":"first","affiliation":[{"name":"College of Computer Science, Sichuan University, No. 24 South Section 1, Yihuan Road, Chengdu, Sichuan 610065, P. R. China"},{"name":"West China Hospital of Sichuan University, 37 Guoxue Lane, Chengdu City, Sichuan, Province, P. R. China"}]},{"given":"Yuanxiao","family":"Fan","sequence":"additional","affiliation":[{"name":"College of Computer Science, Sichuan University, No. 24 South Section 1, Yihuan Road, Chengdu, Sichuan 610065, P. R. China"},{"name":"West China Hospital of Sichuan University, 37 Guoxue Lane, Chengdu City, Sichuan, Province, P. R. China"}]},{"given":"Jingwen","family":"Jiang","sequence":"additional","affiliation":[{"name":"College of Computer Science, Sichuan University, No. 24 South Section 1, Yihuan Road, Chengdu, Sichuan 610065, P. R. China"},{"name":"West China Hospital of Sichuan University, 37 Guoxue Lane, Chengdu City, Sichuan, Province, P. R. China"}]},{"given":"Yuchen","family":"Li","sequence":"additional","affiliation":[{"name":"College of Computer Science, Sichuan University, No. 24 South Section 1, Yihuan Road, Chengdu, Sichuan 610065, P. R. China"},{"name":"West China Hospital of Sichuan University, 37 Guoxue Lane, Chengdu City, Sichuan, Province, P. R. China"}]},{"given":"Wei","family":"Zhang","sequence":"additional","affiliation":[{"name":"College of Computer Science, Sichuan University, No. 24 South Section 1, Yihuan Road, Chengdu, Sichuan 610065, P. R. China"},{"name":"West China Hospital of Sichuan University, 37 Guoxue Lane, Chengdu City, Sichuan, Province, P. R. China"}]}],"member":"219","published-online":{"date-parts":[[2022,8,26]]},"reference":[{"key":"S0129065722500459BIB002","doi-asserted-by":"crossref","first-page":"361","DOI":"10.1016\/S0002-7138(09)60939-0","volume":"21","author":"Carlson G. A.","year":"1982","journal-title":"J. Am. Acad. Child Adolesc. Psychiatry"},{"key":"S0129065722500459BIB003","doi-asserted-by":"crossref","first-page":"32","DOI":"10.1007\/s004060050015","volume":"248","author":"Harrington R.","year":"1998","journal-title":"Eur. Arch. Psychiatry Clin. Neurosci."},{"key":"S0129065722500459BIB004","first-page":"195","volume":"12","author":"Xia L.","year":"2021","journal-title":"Front. Psychiatry"},{"key":"S0129065722500459BIB005","doi-asserted-by":"publisher","DOI":"10.1016\/j.neuroimage.2011.09.069"},{"key":"S0129065722500459BIB006","first-page":"301","volume":"56","author":"Choi E.","year":"2016","journal-title":"Proc. Mach. Learn. Res."},{"key":"S0129065722500459BIB007","doi-asserted-by":"publisher","DOI":"10.1038\/s41591-018-0316-z"},{"key":"S0129065722500459BIB008","doi-asserted-by":"publisher","DOI":"10.1016\/j.cmpb.2018.04.012"},{"key":"S0129065722500459BIB009","doi-asserted-by":"crossref","first-page":"10","DOI":"10.1016\/j.specom.2015.03.004","volume":"71","author":"Cummins N.","year":"2015","journal-title":"Speech Commun."},{"key":"S0129065722500459BIB010","doi-asserted-by":"publisher","DOI":"10.1016\/j.ijpsycho.2012.05.001"},{"key":"S0129065722500459BIB011","doi-asserted-by":"publisher","DOI":"10.1177\/1550059413480504"},{"key":"S0129065722500459BIB012","doi-asserted-by":"publisher","DOI":"10.1159\/000381950"},{"key":"S0129065722500459BIB013","doi-asserted-by":"publisher","DOI":"10.1159\/000438457"},{"key":"S0129065722500459BIB014","doi-asserted-by":"publisher","DOI":"10.1016\/j.bbr.2015.10.036"},{"key":"S0129065722500459BIB015","doi-asserted-by":"crossref","first-page":"661213","DOI":"10.3389\/fpsyt.2021.661213","volume":"12","author":"Wang Y.","year":"2021","journal-title":"Front. Psychiatry"},{"key":"S0129065722500459BIB016","doi-asserted-by":"crossref","first-page":"228","DOI":"10.1016\/j.jvcir.2018.11.003","volume":"57","author":"Wang Q.","year":"2018","journal-title":"J. Vis. Commun. Image Represent."},{"key":"S0129065722500459BIB017","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1016\/j.specom.2015.09.003","volume":"75","author":"Cummins N.","year":"2015","journal-title":"Speech Commun."},{"issue":"1","key":"S0129065722500459BIB018","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1037\/cou0000440","volume":"68","author":"Shapira N.","year":"2021","journal-title":"J. Couns. Psychol."},{"key":"S0129065722500459BIB019","first-page":"47","volume":"347","author":"Hollien H.","year":"1980","journal-title":"Forens. Psychol. Psychiatry"},{"key":"S0129065722500459BIB020","doi-asserted-by":"crossref","first-page":"105","DOI":"10.24193\/jebp.2017.1.7","volume":"17","author":"Trifu R. N.","year":"2017","journal-title":"J. Evid.-Based Psychother."},{"key":"S0129065722500459BIB021","doi-asserted-by":"crossref","first-page":"1530","DOI":"10.1109\/TBME.2004.827544","volume":"51","author":"Ozdas A.","year":"2004","journal-title":"IEEE. Trans. Biomed. Eng."},{"key":"S0129065722500459BIB022","doi-asserted-by":"crossref","first-page":"96","DOI":"10.1109\/TBME.2007.900562","volume":"55","author":"Moore II E.","year":"2007","journal-title":"IEEE. Trans. Biomed. Eng."},{"key":"S0129065722500459BIB023","doi-asserted-by":"crossref","first-page":"574","DOI":"10.1109\/TBME.2010.2091640","volume":"58","author":"Low L.-S. A.","year":"2010","journal-title":"IEEE. Trans. Biomed. Eng."},{"key":"S0129065722500459BIB024","first-page":"2997","volume-title":"12th Annual Conf. Int. Speech Communication Association","author":"Cummins N.","year":"2011"},{"key":"S0129065722500459BIB025","doi-asserted-by":"crossref","first-page":"7542","DOI":"10.1109\/ICASSP.2013.6639129","volume-title":"2013 IEEE Int. Conf. Acoustics, Speech and Signal Processing","author":"Cummins N.","year":"2013"},{"key":"S0129065722500459BIB026","doi-asserted-by":"crossref","first-page":"4613","DOI":"10.1109\/ICASSP.2012.6288946","volume-title":"2012 IEEE Int. Conf. Acoustics, Speech and Signal Processing (ICASSP)","author":"Ooi K. E. B.","year":"2012"},{"key":"S0129065722500459BIB027","first-page":"847","volume-title":"Interspeech","author":"Scherer S.","year":"2013"},{"key":"S0129065722500459BIB028","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1145\/2988257.2988267","volume-title":"Proc. 6th Int. Workshop on Audio\/Visual Emotion Challenge","author":"Ma X.","year":"2016"},{"key":"S0129065722500459BIB029","doi-asserted-by":"publisher","DOI":"10.1145\/2661806.2661818"},{"key":"S0129065722500459BIB030","first-page":"128","volume-title":"7th Int. AAAI Conf. Weblogs and Social Media","author":"Choudhury M. D.","year":"2013"},{"key":"S0129065722500459BIB031","doi-asserted-by":"crossref","first-page":"217","DOI":"10.1109\/TAFFC.2014.2315623","volume":"5","author":"Nguyen T.","year":"2014","journal-title":"IEEE Trans. Affect. Comput."},{"key":"S0129065722500459BIB032","doi-asserted-by":"publisher","DOI":"10.1007\/s13042-017-0697-1"},{"key":"S0129065722500459BIB033","doi-asserted-by":"crossref","first-page":"588","DOI":"10.1109\/TKDE.2018.2885515","volume":"32","author":"Trotzek M.","year":"2018","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"S0129065722500459BIB034","doi-asserted-by":"crossref","first-page":"32035","DOI":"10.1088\/1742-6596\/1237\/3\/032035","volume":"1237","author":"Yang C.","year":"2019","journal-title":"J. Phys. Conf. Ser."},{"key":"S0129065722500459BIB035","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1109\/MCI.2020.2998234","volume":"15","author":"Qureshi S. A.","year":"2020","journal-title":"IEEE Comput. Intell. Mag."},{"key":"S0129065722500459BIB036","doi-asserted-by":"crossref","first-page":"423","DOI":"10.1109\/TPAMI.2018.2798607","volume":"41","author":"Baltruaitis T.","year":"2018","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"S0129065722500459BIB037","first-page":"1","volume-title":"2009 3rd Int. Conf. Affective Computing and Intelligent Interaction and Workshops","author":"Cohn J. F.","year":"2009"},{"key":"S0129065722500459BIB038","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1145\/2988257.2988263","volume-title":"Proc. 6th Int. Workshop on Audio\/Visual Emotion Challenge","author":"Williamson J. R.","year":"2016"},{"key":"S0129065722500459BIB039","first-page":"1716","volume-title":"Interspeech","author":"Hanai T. A.","year":"2018"},{"key":"S0129065722500459BIB040","first-page":"1","author":"Aloshban N.","year":"2021","journal-title":"Cognit. Comput."},{"key":"S0129065722500459BIB041","doi-asserted-by":"crossref","first-page":"904","DOI":"10.1016\/j.jad.2021.08.090","volume":"295","author":"Ye J.","year":"2021","journal-title":"J. Affect. Disord."},{"key":"S0129065722500459BIB042","first-page":"1459","volume-title":"Proc. 18th ACM Int. Conf. Multimedia","author":"Eyben F.","year":"2010"},{"key":"S0129065722500459BIB043","first-page":"148","volume-title":"Proc. Annual Conf. Int. Speech Communication Association, INTERSPEECH","author":"Schuller B.","year":"2013"},{"key":"S0129065722500459BIB044","doi-asserted-by":"crossref","first-page":"478","DOI":"10.1145\/3123266.3123371","volume-title":"Proc. 25th ACM Int. Conf. Multimedia","author":"Cummins N.","year":"2017"},{"key":"S0129065722500459BIB045","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1145\/3267935.3267948","volume-title":"Proc. Joint Workshop of the 4th Workshop on Affective Social Multimedia Computing and first Multi-Modal Affective Computing of Large-Scale Multimedia Data","author":"Zhao Z.","year":"2018"},{"key":"S0129065722500459BIB046","first-page":"3683","volume-title":"Interspeech","author":"Ma X.","year":"2018"},{"key":"S0129065722500459BIB047","doi-asserted-by":"crossref","first-page":"2666","DOI":"10.1109\/ICASSP.2018.8462219","volume-title":"2018 IEEE Int. Conf. Acoustics, Speech and Signal Processing (ICASSP)","author":"Guo L.","year":"2018"},{"key":"S0129065722500459BIB048","first-page":"1097","volume":"25","author":"Krizhevsky A.","year":"2012","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"S0129065722500459BIB052","doi-asserted-by":"crossref","first-page":"112","DOI":"10.1109\/SLT.2018.8639583","volume-title":"2018 IEEE Spoken Language Technology Workshop (SLT)","author":"Yoon S.","year":"2018"}],"container-title":["International Journal of Neural Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0129065722500459","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,11,26]],"date-time":"2022-11-26T07:39:59Z","timestamp":1669448399000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/10.1142\/S0129065722500459"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,8,26]]},"references-count":48,"journal-issue":{"issue":"11","published-print":{"date-parts":[[2022,11]]}},"alternative-id":["10.1142\/S0129065722500459"],"URL":"https:\/\/doi.org\/10.1142\/s0129065722500459","relation":{},"ISSN":["0129-0657","1793-6462"],"issn-type":[{"value":"0129-0657","type":"print"},{"value":"1793-6462","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,8,26]]},"article-number":"2250045"}}