{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,21]],"date-time":"2026-05-21T10:20:29Z","timestamp":1779358829830,"version":"3.51.4"},"publisher-location":"New York, NY, USA","reference-count":45,"publisher":"ACM","license":[{"start":{"date-parts":[[2019,10,15]],"date-time":"2019-10-15T00:00:00Z","timestamp":1571097600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2019,10,15]]},"DOI":"10.1145\/3347320.3357697","type":"proceedings-article","created":{"date-parts":[[2019,10,24]],"date-time":"2019-10-24T19:04:48Z","timestamp":1571943888000},"page":"81-88","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":120,"title":["Multi-level Attention Network using Text, Audio and Video for Depression Prediction"],"prefix":"10.1145","author":[{"given":"Anupama","family":"Ray","sequence":"first","affiliation":[{"name":"IBM Research, Bangalore, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Siddharth","family":"Kumar","sequence":"additional","affiliation":[{"name":"IIIT Sricity, Sricity, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Rutvik","family":"Reddy","sequence":"additional","affiliation":[{"name":"IIIT Sricity, Sricity, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Prerana","family":"Mukherjee","sequence":"additional","affiliation":[{"name":"IIIT Sricity, Sricity, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ritu","family":"Garg","sequence":"additional","affiliation":[{"name":"IEEE Member, Bangalore, 
India"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2019,10,15]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"crossref","unstructured":"Sharifa Alghowinem Roland Goecke Julien Epps Michael Wagner and Jeffrey F Cohn. 2016a. Cross-Cultural Depression Recognition from Vocal Biomarkers.. In INTERSPEECH . 1943--1947.  Sharifa Alghowinem Roland Goecke Julien Epps Michael Wagner and Jeffrey F Cohn. 2016a. Cross-Cultural Depression Recognition from Vocal Biomarkers.. In INTERSPEECH . 1943--1947.","DOI":"10.21437\/Interspeech.2016-1339"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/TAFFC.2016.2634527"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/INFOCOMTECH.2018.8722410"},{"key":"e_1_3_2_1_4_1","volume-title":"2016 IEEE Winter Conference on Applications of Computer Vision (WACV). IEEE, 1--10","author":"Baltru\u0161aitis Tadas","year":"2016","unstructured":"Tadas Baltru\u0161aitis, Peter Robinson , and Louis-Philippe Morency . 2016 . Openface: an open source facial behavior analysis toolkit . In 2016 IEEE Winter Conference on Applications of Computer Vision (WACV). IEEE, 1--10 . Tadas Baltru\u0161aitis, Peter Robinson, and Louis-Philippe Morency. 2016. Openface: an open source facial behavior analysis toolkit. In 2016 IEEE Winter Conference on Applications of Computer Vision (WACV). IEEE, 1--10."},{"key":"e_1_3_2_1_5_1","volume-title":"Robust unsupervised arousal rating: A rule-based framework with knowledge-inspired vocal features","author":"Bone Daniel","year":"2014","unstructured":"Daniel Bone , Chi-Chun Lee , and Shrikanth Narayanan . 2014. Robust unsupervised arousal rating: A rule-based framework with knowledge-inspired vocal features . IEEE transactions on affective computing , Vol. 5 , 2 ( 2014 ), 201--213. Daniel Bone, Chi-Chun Lee, and Shrikanth Narayanan. 2014. Robust unsupervised arousal rating: A rule-based framework with knowledge-inspired vocal features. 
IEEE transactions on affective computing , Vol. 5, 2 (2014), 201--213."},{"key":"e_1_3_2_1_6_1","volume-title":"Accuracy of general practitioner unassisted detection of depression. (4","author":"Meadows G","year":"2014","unstructured":"Meadows G Carey M, Jones K. 2014. Accuracy of general practitioner unassisted detection of depression. (4 2014 ). Meadows G Carey M, Jones K. 2014. Accuracy of general practitioner unassisted detection of depression. (4 2014)."},{"key":"e_1_3_2_1_7_1","volume-title":"A content analysis of depression-related tweets. Computers in human behavior","author":"Cavazos-Rehg Patricia A","year":"2016","unstructured":"Patricia A Cavazos-Rehg , Melissa J Krauss , Shaina Sowles , Sarah Connolly , Carlos Rosas , Meghana Bharadwaj , and Laura J Bierut . 2016. A content analysis of depression-related tweets. Computers in human behavior , Vol. 54 ( 2016 ), 351--357. Patricia A Cavazos-Rehg, Melissa J Krauss, Shaina Sowles, Sarah Connolly, Carlos Rosas, Meghana Bharadwaj, and Laura J Bierut. 2016. A content analysis of depression-related tweets. Computers in human behavior , Vol. 54 (2016), 351--357."},{"key":"e_1_3_2_1_8_1","volume-title":"Universal Sentence Encoder. CoRR","author":"Cer Daniel","year":"2018","unstructured":"Daniel Cer , Yinfei Yang , Sheng-yi Kong, Nan Hua , Nicole Limtiaco , Rhomni St. John , Noah Constant , Mario Guajardo-Cespedes , Steve Yuan , Chris Tar , Yun-Hsuan Sung , Brian Strope , and Ray Kurzweil . 2018. Universal Sentence Encoder. CoRR , Vol. abs\/ 1803 .11175 ( 2018 ). http:\/\/arxiv.org\/abs\/1803.11175 Daniel Cer, Yinfei Yang, Sheng-yi Kong, Nan Hua, Nicole Limtiaco, Rhomni St. John, Noah Constant, Mario Guajardo-Cespedes, Steve Yuan, Chris Tar, Yun-Hsuan Sung, Brian Strope, and Ray Kurzweil. 2018. Universal Sentence Encoder. CoRR , Vol. abs\/1803.11175 (2018). 
http:\/\/arxiv.org\/abs\/1803.11175"},{"key":"e_1_3_2_1_9_1","volume-title":"2009 3rd International Conference on Affective Computing and Intelligent Interaction and Workshops .","author":"Cohn J. F.","unstructured":"J. F. Cohn , T. S. Kruez , I. Matthews , Y. Yang , M. H. Nguyen , M. T. Padilla , F. Zhou , and F. De la Torre. 2009. Detecting depression from facial actions and vocal prosody . In 2009 3rd International Conference on Affective Computing and Intelligent Interaction and Workshops . J. F. Cohn, T. S. Kruez, I. Matthews, Y. Yang , M. H. Nguyen, M. T. Padilla , F. Zhou, and F. De la Torre. 2009. Detecting depression from facial actions and vocal prosody. In 2009 3rd International Conference on Affective Computing and Intelligent Interaction and Workshops ."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.specom.2015.03.004"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.specom.2015.09.003"},{"key":"e_1_3_2_1_12_1","volume-title":"Meta-analysis of emotion recognition deficits in major depressive disorder. Psychological medicine","author":"Dalili MN","year":"2015","unstructured":"MN Dalili , IS Penton-Voak , CJ Harmer , and MR Munaf\u00f2 . 2015. Meta-analysis of emotion recognition deficits in major depressive disorder. Psychological medicine , Vol. 45 , 6 ( 2015 ), 1135--1144. MN Dalili, IS Penton-Voak, CJ Harmer, and MR Munaf\u00f2. 2015. Meta-analysis of emotion recognition deficits in major depressive disorder. Psychological medicine , Vol. 
45, 6 (2015), 1135--1144."},{"key":"e_1_3_2_1_13_1","volume-title":"Proceedings of the 2014 International Conference on Autonomous Agents and Multi-agent Systems (AAMAS '14)","author":"DeVault David","year":"2014","unstructured":"David DeVault , Ron Artstein , Grace Benn , Teresa Dey , Ed Fast , Alesia Gainer , Kallirroi Georgila , Jon Gratch , Arno Hartholt , Margaux Lhommet , Gale Lucas , Stacy Marsella , Fabrizio Morbini , Angela Nazarian , Stefan Scherer , Giota Stratou , Apar Suri , David Traum , Rachel Wood , Yuyu Xu , Albert Rizzo , and Louis-Philippe Morency . 2014 . SimSensei Kiosk: A Virtual Human Interviewer for Healthcare Decision Support . In Proceedings of the 2014 International Conference on Autonomous Agents and Multi-agent Systems (AAMAS '14) . International Foundation for Autonomous Agents and Multiagent Systems. David DeVault, Ron Artstein, Grace Benn, Teresa Dey, Ed Fast, Alesia Gainer, Kallirroi Georgila, Jon Gratch, Arno Hartholt, Margaux Lhommet, Gale Lucas, Stacy Marsella, Fabrizio Morbini, Angela Nazarian, Stefan Scherer, Giota Stratou, Apar Suri, David Traum, Rachel Wood, Yuyu Xu, Albert Rizzo, and Louis-Philippe Morency. 2014. SimSensei Kiosk: A Virtual Human Interviewer for Healthcare Decision Support. In Proceedings of the 2014 International Conference on Autonomous Agents and Multi-agent Systems (AAMAS '14). International Foundation for Autonomous Agents and Multiagent Systems."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/2818346.2830596"},{"key":"e_1_3_2_1_15_1","volume-title":"The Geneva Minimalistic Acoustic Parameter Set (GeMAPS) for Voice Research and Affective Computing","author":"Eyben Florian","year":"2016","unstructured":"Florian Eyben , Klaus Scherer , Bj\u00f6rn Schuller , Johan Sundberg , Elisabeth Andr\u00e9 , Carlos Busso , Laurence Devillers , Julien Epps , Petri Laukka , Shrikanth Narayanan , and Khiet Phuong Truong . 2016. 
The Geneva Minimalistic Acoustic Parameter Set (GeMAPS) for Voice Research and Affective Computing . IEEE transactions on affective computing , Vol. 7 (4 2016 ), 190--202. Florian Eyben, Klaus Scherer, Bj\u00f6rn Schuller, Johan Sundberg, Elisabeth Andr\u00e9, Carlos Busso, Laurence Devillers, Julien Epps, Petri Laukka, Shrikanth Narayanan, and Khiet Phuong Truong. 2016. The Geneva Minimalistic Acoustic Parameter Set (GeMAPS) for Voice Research and Affective Computing. IEEE transactions on affective computing , Vol. 7 (4 2016), 190--202."},{"key":"e_1_3_2_1_16_1","volume-title":"Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (2013)","author":"Eyben Florian","year":"2013","unstructured":"Florian Eyben , F Weninger , and Bj\u00f6rn Schuller . 2013 . Affect recognition in real-life acoustic conditions - A new perspective on feature selection . Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (2013) , 2044--2048. Florian Eyben, F Weninger, and Bj\u00f6rn Schuller. 2013. Affect recognition in real-life acoustic conditions - A new perspective on feature selection. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (2013), 2044--2048."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"crossref","unstructured":"Fabien Ringeval and Bj\u00f6rn Schuller and Michel Valstar and Nicholas Cummins and Roddy Cowie and Leili Tavabi and Maximilian Schmitt and Sina Alisamir and Shahin Amiriparian and Eva-Maria Messner and Siyang Song and Shuo Lui and Ziping Zhao and Adria Mallol-Ragolta and Zhao Ren and Maja Pantic. 2019. AVEC 2019 Workshop and Challenge: State-of-Mind Depression with AI and Cross-Cultural Affect Recognition. 
In Proceedings of the 9th International Workshop on Audio\/Visual Emotion Challenge AVEC'19 co-located with the 27th ACM International Conference on Multimedia MM 2019 Fabien Ringeval Bj\u00f6rn Schuller Michel Valstar Nicholas Cummins Roddy Cowie and Maja Pantic (Eds.). ACM Nice France.  Fabien Ringeval and Bj\u00f6rn Schuller and Michel Valstar and Nicholas Cummins and Roddy Cowie and Leili Tavabi and Maximilian Schmitt and Sina Alisamir and Shahin Amiriparian and Eva-Maria Messner and Siyang Song and Shuo Lui and Ziping Zhao and Adria Mallol-Ragolta and Zhao Ren and Maja Pantic. 2019. AVEC 2019 Workshop and Challenge: State-of-Mind Depression with AI and Cross-Cultural Affect Recognition. In Proceedings of the 9th International Workshop on Audio\/Visual Emotion Challenge AVEC'19 co-located with the 27th ACM International Conference on Multimedia MM 2019 Fabien Ringeval Bj\u00f6rn Schuller Michel Valstar Nicholas Cummins Roddy Cowie and Maja Pantic (Eds.). ACM Nice France.","DOI":"10.1145\/3347320.3357688"},{"key":"e_1_3_2_1_18_1","unstructured":"Jonathan Gratch Ron Artstein Gale Lucas Giota Stratou Stefan Scherer Angela Nazarian Rachel Wood Jill Boberg David DeVault Stacy Marsella David Traum Albert Rizzo and L P. Morency. 2014a. The Distress Analysis Interview Corpus of human and computer interviews.  Jonathan Gratch Ron Artstein Gale Lucas Giota Stratou Stefan Scherer Angela Nazarian Rachel Wood Jill Boberg David DeVault Stacy Marsella David Traum Albert Rizzo and L P. Morency. 2014a. The Distress Analysis Interview Corpus of human and computer interviews."},{"key":"e_1_3_2_1_19_1","volume-title":"The distress analysis interview corpus of human and computer interviews","author":"Gratch Jonathan","year":"2014","unstructured":"Jonathan Gratch , Ron Artstein , Gale M Lucas , Giota Stratou , Stefan Scherer , Angela Nazarian , Rachel Wood , Jill Boberg , David DeVault , Stacy Marsella , et al. 2014 b. The distress analysis interview corpus of human and computer interviews.. In LREC . 3123--3128. 
Jonathan Gratch, Ron Artstein, Gale M Lucas, Giota Stratou, Stefan Scherer, Angela Nazarian, Rachel Wood, Jill Boberg, David DeVault, Stacy Marsella, et al. 2014b. The distress analysis interview corpus of human and computer interviews.. In LREC . 3123--3128."},{"key":"e_1_3_2_1_20_1","volume-title":"Attention-deficit hyperactivity disorder and children's emotion dysregulation: A meta-analysis. Clinical psychology review","author":"Graziano Paulo A","year":"2016","unstructured":"Paulo A Graziano and Alexis Garcia . 2016. Attention-deficit hyperactivity disorder and children's emotion dysregulation: A meta-analysis. Clinical psychology review , Vol. 46 ( 2016 ), 106--123. Paulo A Graziano and Alexis Garcia. 2016. Attention-deficit hyperactivity disorder and children's emotion dysregulation: A meta-analysis. Clinical psychology review , Vol. 46 (2016), 106--123."},{"key":"e_1_3_2_1_21_1","volume-title":"Providing Appropriate Social Support to Prevention of Depression for Highly Anxious Sufferers","author":"Hao Fei","year":"2019","unstructured":"Fei Hao , Guangyao Pang , Yulei Wu , Zhongling Pi , Lirong Xia , and Geyong Min . 2019. Providing Appropriate Social Support to Prevention of Depression for Highly Anxious Sufferers . IEEE Transactions on Computational Social Systems ( 2019 ). Fei Hao, Guangyao Pang, Yulei Wu, Zhongling Pi, Lirong Xia, and Geyong Min. 2019. Providing Appropriate Social Support to Prevention of Depression for Highly Anxious Sufferers. IEEE Transactions on Computational Social Systems (2019)."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICTC.2017.8190959"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"crossref","unstructured":"Jia Jia. 2018. Mental Health Computing via Harvesting Social Media Data.. In IJCAI. 5677--5681.  Jia Jia. 2018. Mental Health Computing via Harvesting Social Media Data.. In IJCAI. 
5677--5681.","DOI":"10.24963\/ijcai.2018\/808"},{"key":"e_1_3_2_1_24_1","article-title":"The PHQ-8 as a Measure of Current Depression in the General Population","volume":"114","author":"Kroenke Kurt","year":"2008","unstructured":"Kurt Kroenke , Tara Strine , Robert L Spitzer , Janet Williams , Joyce T Berry , and Ali Mokdad . 2008 . The PHQ-8 as a Measure of Current Depression in the General Population . Journal of affective disorders , Vol. 114 (09 2008), 163--73. Kurt Kroenke, Tara Strine, Robert L Spitzer, Janet Williams, Joyce T Berry, and Ali Mokdad. 2008. The PHQ-8 as a Measure of Current Depression in the General Population. Journal of affective disorders , Vol. 114 (09 2008), 163--73.","journal-title":"Journal of affective disorders"},{"key":"e_1_3_2_1_25_1","volume-title":"Context-aware Deep Learning for Multi-modal Depression Detection. In ICASSP 2019--2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 3946--3950","author":"Lam Genevieve","year":"2019","unstructured":"Genevieve Lam , Huang Dongyan , and Weisi Lin . 2019 . Context-aware Deep Learning for Multi-modal Depression Detection. In ICASSP 2019--2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 3946--3950 . Genevieve Lam, Huang Dongyan, and Weisi Lin. 2019. Context-aware Deep Learning for Multi-modal Depression Detection. In ICASSP 2019--2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 3946--3950."},{"key":"e_1_3_2_1_26_1","volume-title":"Meta-analysis of face processing event-related potentials in schizophrenia. Biological psychiatry","author":"McCleery Amanda","year":"2015","unstructured":"Amanda McCleery , Junghee Lee , Aditi Joshi , Jonathan K Wynn , Gerhard S Hellemann , and Michael F Green . 2015. Meta-analysis of face processing event-related potentials in schizophrenia. Biological psychiatry , Vol. 77 , 2 ( 2015 ), 116--126. 
Amanda McCleery, Junghee Lee, Aditi Joshi, Jonathan K Wynn, Gerhard S Hellemann, and Michael F Green. 2015. Meta-analysis of face processing event-related potentials in schizophrenia. Biological psychiatry , Vol. 77, 2 (2015), 116--126."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2016.7472788"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W18-0602"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P17-1081"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2015.01.095"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2016.0055"},{"key":"e_1_3_2_1_32_1","volume-title":"The Verbal and Non Verbal Signals of Depression--Combining Acoustics, Text and Visuals for Estimating Depression Level. arXiv preprint arXiv:1904.07656","author":"Qureshi Syed Arbaaz","year":"2019","unstructured":"Syed Arbaaz Qureshi , Mohammed Hasanuzzaman , Sriparna Saha , and Ga\u00ebl Dias . 2019. The Verbal and Non Verbal Signals of Depression--Combining Acoustics, Text and Visuals for Estimating Depression Level. arXiv preprint arXiv:1904.07656 ( 2019 ). Syed Arbaaz Qureshi, Mohammed Hasanuzzaman, Sriparna Saha, and Ga\u00ebl Dias. 2019. The Verbal and Non Verbal Signals of Depression--Combining Acoustics, Text and Visuals for Estimating Depression Level. arXiv preprint arXiv:1904.07656 (2019)."},{"key":"e_1_3_2_1_33_1","unstructured":"Fabien Ringeval Bj\u00f6rn Schuller Michel Valstar Shashank Jaiswal Erik Marchi Denis Lalanne Roddy Cowie and Maja Pantic. 2015. AV  Fabien Ringeval Bj\u00f6rn Schuller Michel Valstar Shashank Jaiswal Erik Marchi Denis Lalanne Roddy Cowie and Maja Pantic. 2015. 
AV"},{"key":"e_1_3_2_1_34_1","volume-title":"Proceedings of the 5th International Workshop on Audio\/Visual Emotion Challenge (AVEC '15)","author":"EC","year":"2015","unstructured":"EC 2015 : The First Affect Recognition Challenge Bridging Across Audio, Video, and Physiological Data . In Proceedings of the 5th International Workshop on Audio\/Visual Emotion Challenge (AVEC '15) . EC 2015: The First Affect Recognition Challenge Bridging Across Audio, Video, and Physiological Data. In Proceedings of the 5th International Workshop on Audio\/Visual Emotion Challenge (AVEC '15)."},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.imavis.2014.06.001"},{"key":"e_1_3_2_1_36_1","volume-title":"openXBOW - Introducing the Passau Open-Source Crossmodal Bag-of-Words Toolkit. CoRR","author":"Schmitt Maximilian","year":"2016","unstructured":"Maximilian Schmitt and Bj\u00f6rn W. Schuller . 2016. openXBOW - Introducing the Passau Open-Source Crossmodal Bag-of-Words Toolkit. CoRR , Vol. abs\/1605.06778 ( 2016 ). arxiv: 1605.06778 http:\/\/arxiv.org\/abs\/1605.06778 Maximilian Schmitt and Bj\u00f6rn W. Schuller. 2016. openXBOW - Introducing the Passau Open-Source Crossmodal Bag-of-Words Toolkit. CoRR , Vol. abs\/1605.06778 (2016). arxiv: 1605.06778 http:\/\/arxiv.org\/abs\/1605.06778"},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"crossref","unstructured":"Bj\u00f6rn W. Schuller Anton Batliner Dino Seppi Stefan Steidl Thurid Vogt Johannes Wagner Laurence Devillers Laurence Vidrascu Noam Amir Lo\u00efc Kessous and Vered Aharonson. 2007. The relevance of feature type for the automatic classification of emotional user states: low level descriptors and functionals. In INTERSPEECH .  Bj\u00f6rn W. Schuller Anton Batliner Dino Seppi Stefan Steidl Thurid Vogt Johannes Wagner Laurence Devillers Laurence Vidrascu Noam Amir Lo\u00efc Kessous and Vered Aharonson. 2007. 
The relevance of feature type for the automatic classification of emotional user states: low level descriptors and functionals. In INTERSPEECH .","DOI":"10.21437\/Interspeech.2007-612"},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"crossref","unstructured":"Guangyao Shen Jia Jia Liqiang Nie Fuli Feng Cunjun Zhang Tianrui Hu Tat-Seng Chua and Wenwu Zhu. 2017. Depression Detection via Harvesting Social Media: A Multimodal Dictionary Learning Solution.. In IJCAI. 3838--3844.  Guangyao Shen Jia Jia Liqiang Nie Fuli Feng Cunjun Zhang Tianrui Hu Tat-Seng Chua and Wenwu Zhu. 2017. Depression Detection via Harvesting Social Media: A Multimodal Dictionary Learning Solution.. In IJCAI. 3838--3844.","DOI":"10.24963\/ijcai.2017\/536"},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"crossref","unstructured":"Brian Stasak Julien Epps Nicholas Cummins and Roland Goecke. 2016. An Investigation of Emotional Speech in Depression Classification.. In Interspeech . 485--489.  Brian Stasak Julien Epps Nicholas Cummins and Roland Goecke. 2016. An Investigation of Emotional Speech in Depression Classification.. In Interspeech . 485--489.","DOI":"10.21437\/Interspeech.2016-867"},{"key":"e_1_3_2_1_40_1","volume-title":"Inverse boosting pruning trees for depression detection on Twitter","author":"Tong Lei","year":"2019","unstructured":"Lei Tong , Qianni Zhang , Abdul Sadka , Ling Li , Huiyu Zhou , et al. 2019 . Inverse boosting pruning trees for depression detection on Twitter . arXiv preprint arXiv:1906.00398 (2019). Lei Tong, Qianni Zhang, Abdul Sadka, Ling Li, Huiyu Zhou, et al. 2019. Inverse boosting pruning trees for depression detection on Twitter. arXiv preprint arXiv:1906.00398 (2019)."},{"key":"e_1_3_2_1_41_1","volume-title":"AVEC 2016 - Depression, Mood, and Emotion Recognition Workshop and Challenge. CoRR","volume":"1605","author":"Valstar Michel F.","year":"2016","unstructured":"Michel F. Valstar , Jonathan Gratch , Bj\u00f6rn W. 
Schuller , Fabien Ringeval , Denis Lalanne , Mercedes Torres , Stefan Scherer , Giota Stratou , Roddy Cowie , and Maja Pantic . 2016 . AVEC 2016 - Depression, Mood, and Emotion Recognition Workshop and Challenge. CoRR , Vol. abs\/1605.01600 (2016). http:\/\/arxiv.org\/abs\/1605.01600 Michel F. Valstar, Jonathan Gratch, Bj\u00f6rn W. Schuller, Fabien Ringeval, Denis Lalanne, Mercedes Torres, Stefan Scherer, Giota Stratou, Roddy Cowie, and Maja Pantic. 2016. AVEC 2016 - Depression, Mood, and Emotion Recognition Workshop and Challenge. CoRR , Vol. abs\/1605.01600 (2016). http:\/\/arxiv.org\/abs\/1605.01600"},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/FG.2017.107"},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.5555\/3295222.3295349"},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1145\/2988257.2988263"},{"key":"e_1_3_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.374"}],"event":{"name":"MM '19: The 27th ACM International Conference on Multimedia","location":"Nice France","acronym":"MM '19","sponsor":["SIGMM ACM Special Interest Group on Multimedia"]},"container-title":["Proceedings of the 9th International on Audio\/Visual Emotion Challenge and 
Workshop"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3347320.3357697","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3347320.3357697","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T19:05:46Z","timestamp":1750273546000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3347320.3357697"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,10,15]]},"references-count":45,"alternative-id":["10.1145\/3347320.3357697","10.1145\/3347320"],"URL":"https:\/\/doi.org\/10.1145\/3347320.3357697","relation":{},"subject":[],"published":{"date-parts":[[2019,10,15]]},"assertion":[{"value":"2019-10-15","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}