{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,3]],"date-time":"2026-03-03T15:59:55Z","timestamp":1772553595081,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":49,"publisher":"ACM","license":[{"start":{"date-parts":[[2014,11,7]],"date-time":"2014-11-07T00:00:00Z","timestamp":1415318400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000185","name":"Defense Advanced Research Projects Agency","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100000185","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2014,11,7]]},"DOI":"10.1145\/2661806.2661810","type":"proceedings-article","created":{"date-parts":[[2014,11,3]],"date-time":"2014-11-03T14:41:51Z","timestamp":1415025711000},"page":"33-40","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":80,"title":["Multimodal Prediction of Affective Dimensions and Depression in Human-Computer Interactions"],"prefix":"10.1145","author":[{"given":"Rahul","family":"Gupta","sequence":"first","affiliation":[{"name":"University of Southern California, Los Angeles, CA, USA"}]},{"given":"Nikolaos","family":"Malandrakis","sequence":"additional","affiliation":[{"name":"University of Southern California, Los Angeles, CA, 
USA"}]},{"given":"Bo","family":"Xiao","sequence":"additional","affiliation":[{"name":"University of Southern California, Los Angeles, CA, USA"}]},{"given":"Tanaya","family":"Guha","sequence":"additional","affiliation":[{"name":"University of Southern California, Los Angeles, CA, USA"}]},{"given":"Maarten","family":"Van Segbroeck","sequence":"additional","affiliation":[{"name":"University of Southern California, Los Angeles, CA, USA"}]},{"given":"Matthew","family":"Black","sequence":"additional","affiliation":[{"name":"University of Southern California, Los Angeles, CA, USA"}]},{"given":"Alexandros","family":"Potamianos","sequence":"additional","affiliation":[{"name":"National Technical University of Athens, Athens, Greece"}]},{"given":"Shrikanth","family":"Narayanan","sequence":"additional","affiliation":[{"name":"University of Southern California, Los Angeles, CA, USA"}]}],"member":"320","published-online":{"date-parts":[[2014,11,7]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"Gnu aspell. http:\/\/www.aspell.net."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACII.2013.53"},{"key":"e_1_3_2_1_3_1","volume-title":"Diagnostic and statistical manual of mental disorders","author":"American Psychiatric Association","unstructured":"American Psychiatric Association. Diagnostic and statistical manual of mental disorders. 5th edition, VA: American Psychiatric Publishing.","edition":"5"},{"key":"e_1_3_2_1_4_1","first-page":"21","volume-title":"Proc. Int. Workshop UrbanSense","author":"Annavaram Murali","year":"2008","unstructured":"Murali Annavaram, Nenad Medvidovic, Urbashi Mitra, Shrikanth S. Narayanan, Gaurav Sukhatme, Zhaoshi Meng, Shi Qiu, Rohit Kumar, Gautam Thatte, and Donna Spruijt-Metz. Multimodal sensing for pediatric obesity applications. In Proc. Int. Workshop UrbanSense, pages 21--25, Raleigh, NC, November 2008."},
{"key":"e_1_3_2_1_5_1","unstructured":"Anxiety and Depression Association of America. Depression January 2014. http:\/\/www.adaa.org\/understanding-anxiety\/depression."},{"key":"e_1_3_2_1_6_1","volume-title":"Comparison of beck depression inventories-ia and-ii in psychiatric outpatients. J. of personality assessment, 67(3):588--597","author":"Beck Aaron T.","year":"1996","unstructured":"Aaron T. Beck, Robert A Steer, Roberta Ball, and William F. Ranieri. Comparison of beck depression inventories-ia and-ii in psychiatric outpatients. J. of personality assessment, 67(3):588--597, 1996."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.specom.2011.12.003"},{"key":"e_1_3_2_1_8_1","volume-title":"The psychologist as an interlocutor in autism spectrum disorder assessment: Insights from a study of spontaneous prosody. J. of Speech, Language, and Hearing Research","author":"Bone Daniel","year":"2013","unstructured":"Daniel Bone, Matthew Black, Chi-Chun Lee, Marian Williams, Pat Levitt, Sungbok Lee, and Shrikanth S. Narayanan. The psychologist as an interlocutor in autism spectrum disorder assessment: Insights from a study of spontaneous prosody. J. of Speech, Language, and Hearing Research, 2013."},
{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2013.6638349"},{"key":"e_1_3_2_1_10_1","volume-title":"Brisbane","author":"Cox M","year":"2013","unstructured":"M Cox, J Nuevo-Chiquero, JM Saragih, and S Lucey. Csiro face analysis sdk. Brisbane, Australia, 2013."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/1873951.1874246"},{"key":"e_1_3_2_1_12_1","volume-title":"Burden of depressive disorders by country, sex, age, and year: Findings from the global burden of disease study","author":"Ferrari A. J.","year":"2010","unstructured":"A. J. Ferrari, F. J. Charlson, R. E. Norman, S. B. Patten, G. Freedman, C. J. L. Murray, and H. A Whiteford. Burden of depressive disorders by country, sex, age, and year: Findings from the global burden of disease study 2010. Public Library of Science Medicine, 10(11), 2013."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/10.846676"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0191-8869(01)00103-9"},{"key":"e_1_3_2_1_15_1","volume-title":"Image and Vision Computing","author":"Girard J.","year":"2013","unstructured":"J. Girard, J. Cohn, M. H. Mahoor, S. M. Mavadati, Z. Hammal, and D. P. Rosenwald. Nonverbal social withdrawal in depression: Evidence from manual and automatic analysis. In Image and Vision Computing, 2013."},
{"key":"e_1_3_2_1_16_1","volume-title":"Proc. Interspeech","author":"Gupta R.","year":"2013","unstructured":"R. Gupta, K. Audhkhasi, S. Lee, and S. S. Narayanan. Speech paralinguistic event detection using probabilistic time-series smoothing and masking. In Proc. Interspeech, 2013."},{"key":"e_1_3_2_1_17_1","volume-title":"Proc. InterSpeech","author":"Gupta Rahul","year":"2014","unstructured":"Rahul Gupta, Panayiotis G. Georgiou, David Atkins, and Shrikanth S. Narayanan. Predicting client's inclination towards target behavior change in motivational interviewing and investigating the role of laughter. In Proc. InterSpeech, September 2014."},{"issue":"4","key":"e_1_3_2_1_18_1","first-page":"S92","article-title":"The benefits of early and appropriate treatment","volume":"13","author":"Halfin A.","year":"2007","unstructured":"A. Halfin. Depression: The benefits of early and appropriate treatment. American J. of Managed Care, 13(4):S92--S97, 2007.","journal-title":"American J. 
of Managed Care"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jneumeth.2011.06.023"},{"key":"e_1_3_2_1_20_1","volume-title":"Proc. Int. Conf. on Pattern Recognition Applications and Methods","author":"K\u00e1chele M.","year":"2014","unstructured":"M. K\u00e1chele, M. Glodek, D. Zharkov, S. Meudt, and F. Schwenker. Fusion of audio-visual features using hierarchical classifier systems for the recognition of affective states and the state of depression. In Proc. Int. Conf. on Pattern Recognition Applications and Methods, 2014."},{"key":"e_1_3_2_1_21_1","first-page":"185","article-title":"Detection and prediction of clinical depression. Mental Health Informatics","volume":"491","author":"Lech M.","year":"2014","unstructured":"M. Lech, L.-S. Low, and K. E. Ooi. Detection and prediction of clinical depression. Mental Health Informatics, Studies in Computational Intelligence, 491:185--199, 2014.","journal-title":"Studies in Computational Intelligence"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.csl.2012.06.006"},{"key":"e_1_3_2_1_23_1","volume-title":"Proc. ICASSP","author":"Low L.-S. A.","year":"2010","unstructured":"L.-S. A. Low, N. C. Maddage, M. Lech, L. Sheeber, and N. Allen. Influence of acoustic low-level descriptors in the detection of clinical depression in adolescents. In Proc. ICASSP, 2010."},
{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASL.2013.2277931"},{"key":"e_1_3_2_1_25_1","volume-title":"Recognition of facial affect in depression. Perceptual and motor skills, 61 (1):13--14","author":"Mandal M.","year":"1985","unstructured":"M. Mandal and B. Bhattacharya. Recognition of facial affect in depression. Perceptual and motor skills, 61 (1):13--14, 1985."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/T-AFFC.2011.40"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.imavis.2012.08.018"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/TBME.2007.900562"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.biopsych.2012.03.015"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/JPROC.2012.2236291"},{"key":"e_1_3_2_1_31_1","unstructured":"National Institute of Mental Health. Depression January 2014. http:\/\/www.nimh.nih.gov\/health\/topics\/depression\/index.shtml."},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2002.1017623"},{"key":"e_1_3_2_1_33_1","volume-title":"Image Analysis for Multimedia Interactive Services","author":"Ooi K. E. B.","year":"2011","unstructured":"K. E. B. Ooi, L. S. A. Low, M. Lech, and N. B. Allen. Prediction of clinical depression in adolescents using facial image analysis. In Image Analysis for Multimedia Interactive Services, 2011."},
{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-010-0380-4"},{"key":"e_1_3_2_1_35_1","first-page":"44","volume-title":"Proc. Int. Conf. on New Methods in Language Processing","volume":"12","author":"Schmid H.","year":"1994","unstructured":"H. Schmid. Probabilistic part-of-speech tagging using decision trees. In Proc. Int. Conf. on New Methods in Language Processing, volume 12, pages 44--49, 1994."},{"key":"e_1_3_2_1_36_1","volume-title":"The INTERSPEECH 2014 computational paralinguistics challenge: Cognitive & physical load. In Proc. Interspeech","author":"Schuller B.","year":"2014","unstructured":"B. Schuller, S. Steidl, A. Batliner, J. Epps, F. Eyben, F. Ringeval, E. Marchi, and Y. Zhang. The INTERSPEECH 2014 computational paralinguistics challenge: Cognitive & physical load. In Proc. Interspeech, Singapore, Singapore, 2014."},{"key":"e_1_3_2_1_37_1","first-page":"593","volume-title":"Proc. CVPR 1994","author":"Shi Jianbo","year":"1994","unstructured":"Jianbo Shi and Carlo Tomasi. Good features to track. In Proc. CVPR 1994, pages 593--600. IEEE, 1994."},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/2661806.2661807"},{"key":"e_1_3_2_1_39_1","volume-title":"Proc. Interspeech","author":"Segbroeck M. Van","year":"2014","unstructured":"M. Van Segbroeck, R. Travadi, Colin Vaz, Jangwon Kim, Matthew P. Black, Alexandros Potamianos, and S. S. Narayanan. Classification of cognitive load from speech using an i-vector framework. In Proc. Interspeech, 2014."},
{"key":"e_1_3_2_1_40_1","volume-title":"Proc. Interspeech","author":"Segbroeck Maarten Van","year":"2013","unstructured":"Maarten Van Segbroeck, Andreas Tsiartas, and Shrikanth S. Narayanan. A robust frontend for VAD: Exploiting contextual, discriminative and spectral cues of human voice. In Proc. Interspeech, 2013."},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/T-AFFC.2011.27"},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.3758\/BRM.41.2.534"},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jneumeth.2007.09.030"},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1037\/h0036706"},{"key":"e_1_3_2_1_45_1","volume-title":"Verbal Behavior: Adaptation and Psychopathology","author":"Weintraub W.","year":"1981","unstructured":"W. Weintraub. Verbal Behavior: Adaptation and Psychopathology. New York: Springer, 1981."},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1145\/2512530.2512531"},{"key":"e_1_3_2_1_47_1","volume-title":"Proc. Interspeech","author":"Xiao B.","year":"2013","unstructured":"B. Xiao, P. G. Georgiou, Z. E. Imel, D. Atkins, and S. S. Narayanan. Modeling therapist empathy and vocal entrainment in drug addiction counseling. In Proc. Interspeech, 2013."},
{"key":"e_1_3_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2007.1110"},{"key":"e_1_3_2_1_49_1","volume-title":"Proc. Workshop on Semantics and Pragmatics of Dialogue","author":"Zhou Y.","year":"2013","unstructured":"Y. Zhou, S. Scherer, D. Devault, J. Gratch, G. Stratou, L.-P. Morency, and J. Cassell. Multimodal prediction of psychological disorder: Learning nonverbal commonality in adjacency pairs. In Proc. Workshop on Semantics and Pragmatics of Dialogue, 2013."}],"event":{"name":"MM '14: 2014 ACM Multimedia Conference","location":"Orlando Florida USA","acronym":"MM '14","sponsor":["SIGMM ACM Special Interest Group on Multimedia"]},"container-title":["Proceedings of the 4th International Workshop on Audio\/Visual Emotion 
Challenge"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2661806.2661810","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2661806.2661810","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T06:12:55Z","timestamp":1750227175000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2661806.2661810"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,11,7]]},"references-count":49,"alternative-id":["10.1145\/2661806.2661810","10.1145\/2661806"],"URL":"https:\/\/doi.org\/10.1145\/2661806.2661810","relation":{},"subject":[],"published":{"date-parts":[[2014,11,7]]},"assertion":[{"value":"2014-11-07","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}