{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,6]],"date-time":"2026-06-06T16:27:11Z","timestamp":1780763231321,"version":"3.54.1"},"publisher-location":"New York, NY, USA","reference-count":29,"publisher":"ACM","license":[{"start":{"date-parts":[[2017,10,23]],"date-time":"2017-10-23T00:00:00Z","timestamp":1508716800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Shaanxi Provincial International Science and Technology Collaboration Project","award":["2017KW-ZD-14"],"award-info":[{"award-number":["2017KW-ZD-14"]}]},{"name":"VUB Interdisciplinary Research Program","award":["EMO-App project"],"award-info":[{"award-number":["EMO-App project"]}]},{"name":"National Natural Science Foundation of China","award":["61273265"],"award-info":[{"award-number":["61273265"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2017,10,23]]},"DOI":"10.1145\/3133944.3133948","type":"proceedings-article","created":{"date-parts":[[2017,10,20]],"date-time":"2017-10-20T19:24:32Z","timestamp":1508527472000},"page":"53-59","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":134,"title":["Multimodal Measurement of Depression Using Deep Learning Models"],"prefix":"10.1145","author":[{"given":"Le","family":"Yang","sequence":"first","affiliation":[{"name":"Northwestern Polytechnical University, Xi'an, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Dongmei","family":"Jiang","sequence":"additional","affiliation":[{"name":"Northwestern Polytechnical University, Xi'an, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xiaohan","family":"Xia","sequence":"additional","affiliation":[{"name":"Northwestern Polytechnical University, Xi'an, 
China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Ercheng","family":"Pei","sequence":"additional","affiliation":[{"name":"Northwestern Polytechnical University, Xi'an, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Meshia C\u00e9dric","family":"Oveneke","sequence":"additional","affiliation":[{"name":"Vrije Universiteit Brussels, Brussels, Belgium"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Hichem","family":"Sahli","sequence":"additional","affiliation":[{"name":"Vrije Universiteit Brussels & Interuniversity Microelectronics Centre, Brussels, Belgium"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2017,10,23]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. IEEE, 8022--8026","author":"Alghowinem Sharifa","year":"2013","unstructured":"Sharifa Alghowinem, Roland Goecke, Michael Wagner, Julien Epps, Tom Gedeon, Michael Breakspear, and Gordon Parker. 2013. A comparative study of different classifiers for detecting depression from spontaneous speech. In Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. 
IEEE, 8022--8026."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACII.2009.5349358"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2011-750"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/TAFFC.2015.2457417"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/3133944.3133953"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/2993148.2997632"},{"key":"e_1_3_2_1_7_1","volume-title":"World Health Organization, et al","author":"Friedli Lynne","year":"2009","unstructured":"Lynne Friedli, World Health Organization, et al. 2009. Mental health, resilience and inequalities. (2009)."},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.81"},{"key":"e_1_3_2_1_9_1","unstructured":"Jonathan Gratch, Ron Artstein, Gale M. Lucas, Giota Stratou, Stefan Scherer, Angela Nazarian, Rachel Wood, Jill Boberg, David DeVault, Stacy Marsella, et al. 2014. The Distress Analysis Interview Corpus of human and computer interviews. In LREC. 3123--3128."},{"key":"e_1_3_2_1_10_1","volume-title":"Caffe: Convolutional Architecture for Fast Feature Embedding. arXiv preprint arXiv:1408.5093","author":"Jia Yangqing","year":"2014","unstructured":"Yangqing Jia, Evan Shelhamer, Jeff Donahue, Sergey Karayev, Jonathan Long, Ross Girshick, Sergio Guadarrama, and Trevor Darrell. 
2014. Caffe: Convolutional Architecture for Fast Feature Embedding. arXiv preprint arXiv:1408.5093 (2014)."},{"key":"e_1_3_2_1_11_1","volume-title":"Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. IEEE, 3687--3691","author":"Kim Yelin","year":"2013","unstructured":"Yelin Kim, Honglak Lee, and Emily Mower Provost. 2013. Deep learning for robust feature generation in audiovisual emotion recognition. In Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. IEEE, 3687--3691."},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10772-011-9125-1"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/ASRU.2013.6707732"},{"key":"e_1_3_2_1_14_1","volume-title":"Proceedings of the 31st International Conference on Machine Learning (ICML-14)","author":"Le Quoc","year":"2014","unstructured":"Quoc Le and Tomas Mikolov. 2014. Distributed representations of sentences and documents. In Proceedings of the 31st International Conference on Machine Learning (ICML-14). 
1188--1196."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACII.2013.58"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/2988257.2988267"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2016.07.233"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/2661806.2661818"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/SLT.2016.7846256"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/2988257.2988266"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298682"},{"key":"e_1_3_2_1_22_1","volume-title":"The INTERSPEECH 2012 speaker trait challenge. In INTERSPEECH 2012, Conference of the International Speech Communication Association.","author":"Schuller Bj\u00f6rn","year":"2012","unstructured":"Bj\u00f6rn Schuller, Stefan Steidl, Anton Batliner, Elmar N\u00f6th, Alessandro Vinciarelli, Felix Burkhardt, Rob van Son, Felix Weninger, Florian Eyben, and Tobias Bocklet. 2012. The INTERSPEECH 2012 speaker trait challenge. In INTERSPEECH 2012, Conference of the International Speech Communication Association."},{"key":"e_1_3_2_1_23_1","volume-title":"The INTERSPEECH 2014 computational paralinguistics challenge: cognitive & physical load. In INTERSPEECH. 427--431","author":"Schuller Bj\u00f6rn W.","year":"2014","unstructured":"Bj\u00f6rn W. 
Schuller, Stefan Steidl, Anton Batliner, Julien Epps, Florian Eyben, Fabien Ringeval, Erik Marchi, and Yue Zhang. 2014. The INTERSPEECH 2014 computational paralinguistics challenge: cognitive & physical load. In INTERSPEECH. 427--431."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/2988257.2988263"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/2661806.2661809"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/2993148.2997630"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/2988257.2988269"},{"key":"e_1_3_2_1_29_1","first-page":"1","article-title":"Automated Depression Diagnosis based on Deep Networks to Encode Facial Appearance and Dynamics","volume":"99","author":"Zhu Yu","year":"1949","unstructured":"Yu Zhu, Yuanyuan Shang, Zhuhong Shao, and Guodong Guo. 1949. Automated Depression Diagnosis based on Deep Networks to Encode Facial Appearance and Dynamics. 
IEEE Transactions on Affective Computing PP, 99 (1949), 1--1.","journal-title":"IEEE Transactions on Affective Computing PP"}],"event":{"name":"MM '17: ACM Multimedia Conference","location":"Mountain View California USA","acronym":"MM '17","sponsor":["SIGMM ACM Special Interest Group on Multimedia"]},"container-title":["Proceedings of the 7th Annual Workshop on Audio\/Visual Emotion Challenge"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3133944.3133948","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3133944.3133948","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T02:13:26Z","timestamp":1750212806000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3133944.3133948"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,10,23]]},"references-count":29,"alternative-id":["10.1145\/3133944.3133948","10.1145\/3133944"],"URL":"https:\/\/doi.org\/10.1145\/3133944.3133948","relation":{},"subject":[],"published":{"date-parts":[[2017,10,23]]},"assertion":[{"value":"2017-10-23","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}