{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2022,12,30]],"date-time":"2022-12-30T20:54:52Z","timestamp":1672433692065},"reference-count":41,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2012,3,3]],"date-time":"2012-03-03T00:00:00Z","timestamp":1330732800000},"content-version":"unspecified","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/2.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J AUDIO SPEECH MUSIC PROC."],"published-print":{"date-parts":[[2012,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>A novel approach for robust dialogue act detection in a spoken dialogue system is proposed. Shallow representation named partial sentence trees are employed to represent automatic speech recognition outputs. Parsing results of partial sentences can be decomposed into derivation rules, which turn out to be salient features for dialogue act detection. Data-driven dialogue acts are learned via an unsupervised learning algorithm called spectral clustering, in a vector space whose axes correspond to derivation rules. The proposed method is evaluated in a Mandarin spoken dialogue system for tourist-information services. Combined with information obtained from the automatic speech recognition module and from a Markov model on dialogue act sequence, the proposed method achieves a detection accuracy of 85.1%, which is significantly better than the baseline performance of 62.3% using a na\u00efve Bayes classifier. Furthermore, the average number of turns per dialogue session also decreases significantly with the improved detection accuracy.<\/jats:p>","DOI":"10.1186\/1687-4722-2012-13","type":"journal-article","created":{"date-parts":[[2012,3,5]],"date-time":"2012-03-05T16:58:37Z","timestamp":1330966717000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":4,"title":["Robust dialogue act detection based on partial sentence tree, derivation rule, and spectral clustering algorithm"],"prefix":"10.1186","volume":"2012","author":[{"given":"Chia-Ping","family":"Chen","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chung-Hsien","family":"Wu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wei-Bin","family":"Liang","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2012,3,3]]},"reference":[{"key":"48_CR1","first-page":"564","volume-title":"Handbook of Standards and Resources for Spoken Language Systems","author":"N Fraser","year":"1997","unstructured":"Fraser N: Handbook of Standards and Resources for Spoken Language Systems. Volume chap. 6. Edited by: Gibbon D, Moore R, Winski R. Mouton de Gruyter, Berlin; 1997:564-564."},{"key":"48_CR2","doi-asserted-by":"publisher","first-page":"91","DOI":"10.3115\/116580.116612","volume-title":"Proc the workshop on Speech and Natural Language","author":"PJ Price","year":"1990","unstructured":"Price PJ: Evaluation of spoken language systems: the ATIS domain. In Proc the workshop on Speech and Natural Language. Hidden Valley, Pennsylvania; 1990:91-95."},{"key":"48_CR3","doi-asserted-by":"publisher","first-page":"113","DOI":"10.1016\/S0167-6393(97)00040-X","volume":"23","author":"A Gorin","year":"1997","unstructured":"Gorin A, Riccardi G, Wright JH: How may i help you? Speech Commun 1997, 23: 113-127. 10.1016\/S0167-6393(97)00040-X","journal-title":"Speech Commun"},{"key":"48_CR4","doi-asserted-by":"crossref","first-page":"211","DOI":"10.21437\/Interspeech.2008-66","volume-title":"Proc INTERSPEECH-2008","author":"C Hori","year":"2008","unstructured":"Hori C, Ohtake K, Misu T, Kashioka H, Nakamura S: Dialog management using weighted finite-state transducers. In Proc INTERSPEECH-2008. Brisbane, Australia; 2008:211-214."},{"key":"48_CR5","first-page":"1","volume-title":"Proc International Symposium on Chinese Spoken Language Processing","author":"J Liu","year":"2008","unstructured":"Liu J, Xu Y, Seneff S, Zue V: CITYBROWSER II: a multimodal restaurant guide in Mandarin. In Proc International Symposium on Chinese Spoken Language Processing. Kunming, China; 2008:1-4."},{"key":"48_CR6","doi-asserted-by":"publisher","first-page":"61","DOI":"10.1016\/j.specom.2009.08.007","volume":"52","author":"T Misu","year":"2010","unstructured":"Misu T, Kawahara T: Bayes risk-based dialogue management for document retrieval system with speech interface. Speech Commun 2010, 52: 61-71. 10.1016\/j.specom.2009.08.007","journal-title":"Speech Commun"},{"key":"48_CR7","volume-title":"The Artificial Linguistic Internet Computer Entity (A. L. I. C. E.)","author":"R Wallace","year":"2001","unstructured":"Wallace R:The Artificial Linguistic Internet Computer Entity (A. L. I. C. E.). 2001. [http:\/\/www.alicebot.org]"},{"key":"48_CR8","doi-asserted-by":"publisher","first-page":"520","DOI":"10.1145\/302979.303150","volume-title":"Proc the SIGCHI Conference on Human Factors in Computing Systems: the CHI is the limit","author":"J Cassell","year":"1999","unstructured":"Cassell J, Bickmore T, Billinghurst M, Campbell L, Chang K, Vilhj\u00e1lmsson H, Yan H: Embodiment in conversational interfaces: rea. In Proc the SIGCHI Conference on Human Factors in Computing Systems: the CHI is the limit. Pittsburgh, Pennsylvania; 1999:520-527."},{"issue":"5","key":"48_CR9","doi-asserted-by":"publisher","first-page":"1574","DOI":"10.1109\/TASL.2006.878267","volume":"14","author":"JF Yeh","year":"2006","unstructured":"Yeh JF, Wu CH: Edit Disfluency detection and correction using a cleanup language model and an alignment model. IEEE Trans Speech Audio Process 2006, 14(5):1574-1583.","journal-title":"IEEE Trans Speech Audio Process"},{"key":"48_CR10","volume-title":"ACM Trans Asian Lang Inf Process","author":"CH Wu","year":"2010","unstructured":"Wu CH, Liang WB, Yeh JF: Interruption point detection of spontaneous speech using inter-syllable boundary-based prosodic features. ACM Trans Asian Lang Inf Process 2010. 10, 6:16:21"},{"key":"48_CR11","first-page":"439","volume-title":"Proc 41st Annual Meeting on Association for Computational Linguistics (ACL)","author":"R Levy","year":"2003","unstructured":"Levy R, Manning C: Is it harder to parse Chinese, or the Chinese Treebank? In Proc 41st Annual Meeting on Association for Computational Linguistics (ACL). Sapporo, Japan; 2003:439-446."},{"key":"48_CR12","first-page":"1043","volume-title":"Proc INTERSPEECH","author":"CH Liu","year":"2009","unstructured":"Liu CH, Wu CH: Semantic role labeling with discriminative feature selection for spoken language understanding. In Proc INTERSPEECH. Brighton, United Kingdom; 2009:1043-1046."},{"key":"48_CR13","first-page":"85","volume-title":"Proc Annual Conference of the North American Chapter of the Association for Computational Linguistics-Human Language Technologies","author":"B Coppola","year":"2009","unstructured":"Coppola B, Moschitti A, Riccardi G: Shallow semantic parsing for spoken language understanding. In Proc Annual Conference of the North American Chapter of the Association for Computational Linguistics-Human Language Technologies. Boulder, Colorado; 2009:85-88."},{"key":"48_CR14","first-page":"1403","volume-title":"Proc International Conference on Spoken Language Processing","author":"H Wright","year":"1998","unstructured":"Wright H: Automatic utterance type detection using suprasegmental features. In Proc International Conference on Spoken Language Processing. Volume 4. Sydney, Australia; 1998:1403-1406."},{"issue":"6","key":"48_CR15","doi-asserted-by":"publisher","first-page":"558","DOI":"10.1109\/89.725322","volume":"6","author":"T Kawahara","year":"1998","unstructured":"Kawahara T, Lee CH, Juang BH: Flexible speech understanding based on combined key-phrase detection and verification. IEEE Trans Speech Audio Process 1998, 6(6):558-568. 10.1109\/89.725322","journal-title":"IEEE Trans Speech Audio Process"},{"key":"48_CR16","first-page":"19","volume":"3","author":"H Bunt","year":"1994","unstructured":"Bunt H: Context and dialogue control. THINK Quarterly 1994, 3: 19-31.","journal-title":"THINK Quarterly"},{"key":"48_CR17","first-page":"162","volume-title":"Proc Annual Meeting of the Association for Computational Linguistics","author":"R Prasad","year":"2002","unstructured":"Prasad R, Walker M: Training a dialogue act tagger for human-human and human-computer travel dialogues. In Proc Annual Meeting of the Association for Computational Linguistics. Volume 2. Philadelphia, Pennsylvania; 2002:162-173."},{"key":"48_CR18","volume-title":"How to Do Things with Words","author":"JL Austin","year":"1962","unstructured":"Austin JL: How to Do Things with Words. Edited by: Urmson JO, Sbis\u00e1 M. Harvard University Press, Cambridge, MA; 1962."},{"issue":"3","key":"48_CR19","doi-asserted-by":"publisher","first-page":"339","DOI":"10.1162\/089120100561737","volume":"26","author":"A Stolcke","year":"2000","unstructured":"Stolcke A, Ries K, Coccaro N, Shriberg E, Bates R, Jurafsky D, Taylor P, Martin R: Dialogue act modeling for automatic tagging and recognition of conversational speech. Comput Linguist 2000, 26(3):339-373. 10.1162\/089120100561737","journal-title":"Comput Linguist"},{"key":"48_CR20","first-page":"19","volume-title":"Proc IEEE Workshop on Spoken Language Technologies","author":"G Tur","year":"2010","unstructured":"Tur G, Hakkani-T\u00fcr D, Heck L: What is left to be understood in ATIS. In Proc IEEE Workshop on Spoken Language Technologies. Berkeley, California; 2010:19-24."},{"key":"48_CR21","volume-title":"Proc the 4th SIGdial Workshop on Discourse and Dialogue","author":"L Levin","year":"2003","unstructured":"Levin L, Langley C, Donna Gates AL, Wallace D, Peterson K: Domain specific speech acts for spoken language translation. In Proc the 4th SIGdial Workshop on Discourse and Dialogue. Sapparo, Japan; 2003."},{"key":"48_CR22","first-page":"495","volume-title":"Proc Conference on Speech and Computer","author":"S Grau","year":"2004","unstructured":"Grau S, Sanchis E, Castro MJ, Vilar D: Dialogue act classification using a Bayesian approach. In Proc Conference on Speech and Computer. St Petersberg; 2004:495-499."},{"key":"48_CR23","doi-asserted-by":"crossref","first-page":"79","DOI":"10.3115\/1628960.1628976","volume-title":"Pro the ACL Student Research Workshop, Association for Computational Linguistics","author":"E Ivanovic","year":"2005","unstructured":"Ivanovic E: Dialogue Act Tagging for Instant Messaging Chat Sessions. In Pro the ACL Student Research Workshop, Association for Computational Linguistics. Ann Arbor, Michigan; 2005:79-84."},{"key":"48_CR24","first-page":"641","volume-title":"Proc EUROSPEECH-2003","author":"S Seneff","year":"2003","unstructured":"Seneff S, Wang C, Hazen TJ: Automatic induction of N -gram language models from a natural language grammar. In Proc EUROSPEECH-2003. Geneva, Swiss; 2003:641-644."},{"key":"48_CR25","doi-asserted-by":"crossref","first-page":"3034","DOI":"10.21437\/Interspeech.2010-53","volume-title":"Proc INTERSPEECH-2010","author":"S Hara","year":"2010","unstructured":"Hara S, Kitaoka N, Takeda K: Automatic detection of task-incompleted dialog for spoken dialog system based on dialog act N-gram. In Proc INTERSPEECH-2010. Makuhari, Japan; 2010:3034-3037."},{"key":"48_CR26","first-page":"497","volume-title":"Proc IEEE International Conference on Acoustics, Speech, and Signal Processing","author":"K Ries","year":"1999","unstructured":"Ries K: Hmm and Neural network based speech act detection. In Proc IEEE International Conference on Acoustics, Speech, and Signal Processing. Volume 1. Phoenix, Arizona; 1999:497-500."},{"issue":"3","key":"48_CR27","doi-asserted-by":"publisher","first-page":"330","DOI":"10.1109\/TSA.2005.845820","volume":"13","author":"CH Wu","year":"2005","unstructured":"Wu CH, Yan GL: Speech act modeling and verification of spontaneous speech with disfluency in a spoken dialogue system. IEEE Trans Speech Audio Process 2005, 13(3):330-344.","journal-title":"IEEE Trans Speech Audio Process"},{"key":"48_CR28","doi-asserted-by":"publisher","first-page":"88","DOI":"10.3115\/1118121.1118134","volume-title":"Proc the 3rd SIGdial workshop on Discourse and dialogue","author":"S Keizer","year":"2002","unstructured":"Keizer S, Nijholt A: Dialogue act recognition with Bayesian networks for Dutch dialogues. In Proc the 3rd SIGdial workshop on Discourse and dialogue. Volume 2. Philadelphia, Pennsylvania; 2002:88-94."},{"key":"48_CR29","doi-asserted-by":"crossref","first-page":"268","DOI":"10.21437\/Interspeech.2009-92","volume-title":"Proc INTERSPEECH","author":"C Hori","year":"2009","unstructured":"Hori C, Ohtake K, Misu T, Kashioka H, Nakamura S: Recent advances in WFST-based dialog system. In Proc INTERSPEECH. Brighton, United Kingdom; 2009:268-271."},{"key":"48_CR30","first-page":"4793","volume-title":"Proc IEEE International Conference on Acoustics Speech and Signal Processing","author":"C Hori","year":"2009","unstructured":"Hori C, Ohtake K, Misu T, Kashioka H, Nakamura S: Statistical dialog management applied to WFST-based dialog systems. In Proc IEEE International Conference on Acoustics Speech and Signal Processing. Taipei, Taiwan; 2009:4793-4796."},{"key":"48_CR31","first-page":"2123","volume-title":"Proc LREC2010","author":"K Ohtake","year":"2010","unstructured":"Ohtake K, Misu T, Hori C, Kashioka H, Nakamura S: Dialogue acts annotation for NICT Kyoto tour dialogue corpus to construct statistical dialogue systems. In Proc LREC2010. Valletta, Malta; 2010:2123-2130."},{"key":"48_CR32","doi-asserted-by":"publisher","first-page":"393","DOI":"10.1016\/j.csl.2006.06.008","volume":"21","author":"JD Williams","year":"2007","unstructured":"Williams JD, Young S: Partially observable Markov decision processes for spoken dialog systems. Comput Speech Lang 2007, 21: 393-422. 10.1016\/j.csl.2006.06.008","journal-title":"Comput Speech Lang"},{"key":"48_CR33","first-page":"1","volume-title":"Proc INTERSPEECH2010","author":"S Young","year":"2010","unstructured":"Young S: Still talking to machines (cognitively speaking). In Proc INTERSPEECH2010. Makuhari, Japan; 2010:1-10."},{"key":"48_CR34","doi-asserted-by":"crossref","unstructured":"Williams JD, Young S: Scaling POMDPs for spoken dialog management. IEEE Trans Acoustic Speech Lang Process 15(7):2116-2129.","DOI":"10.1109\/TASL.2007.902050"},{"issue":"1-2","key":"48_CR35","doi-asserted-by":"publisher","first-page":"71","DOI":"10.1016\/j.specom.2004.02.003","volume":"43","author":"CH Wu","year":"2004","unstructured":"Wu CH, Chen YJ: Recovery from false rejection using statistical partial pattern trees for sentence verification. Speech Commun 2004, 43(1-2):71-88. 10.1016\/j.specom.2004.02.003","journal-title":"Speech Commun"},{"key":"48_CR36","volume-title":"An Introduction to Mathematical Statistics and Its Applications","author":"RJ Larsen","year":"2000","unstructured":"Larsen RJ, Marx ML: An Introduction to Mathematical Statistics and Its Applications. 3rd edition. Prentice Hall, Lebanon, Indiana, USA; 2000. ISBN: 0139223037","edition":"3"},{"key":"48_CR37","volume-title":"Speech and Language Processing","author":"D Jurafsky","year":"2009","unstructured":"Jurafsky D, Martin JH: Speech and Language Processing. 2nd edition. Pearson Prentice Hall, New Jersey; 2009.","edition":"2"},{"issue":"4","key":"48_CR38","doi-asserted-by":"publisher","first-page":"395","DOI":"10.1007\/s11222-007-9033-z","volume":"17","author":"U von Luxburg","year":"2007","unstructured":"von Luxburg U: A tutorial on spectral clustering. Stat Comput 2007, 17(4):395-416. 10.1007\/s11222-007-9033-z","journal-title":"Stat Comput"},{"key":"48_CR39","volume-title":"The HTK Book Version 3.4","author":"SJ Young","year":"2006","unstructured":"Young SJ, Kershaw D, Odell J, Ollason D, Valtchev V, Woodland P: The HTK Book Version 3.4. Cambridge University Press, Cambridge; 2006."},{"key":"48_CR40","first-page":"901","volume-title":"Proc International Conference on Spoken Language Processing","author":"A Stolcke","year":"2002","unstructured":"Stolcke A: SRILM - an extensible language modeling toolkit. In Proc International Conference on Spoken Language Processing. Denver, Colorado; 2002:901-904."},{"key":"48_CR41","first-page":"207","volume-title":"ACM SIGMOD","author":"R Agrawal","year":"1993","unstructured":"Agrawal R, Imielinski T, Swami AN: Mining association rules between sets of items in large databases. In ACM SIGMOD. Washington, D.C; 1993:207-216."}],"container-title":["EURASIP Journal on Audio, Speech, and Music Processing"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/1687-4722-2012-13.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/1687-4722-2012-13\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1687-4722-2012-13.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T18:38:21Z","timestamp":1630521501000},"score":1,"resource":{"primary":{"URL":"https:\/\/asmp-eurasipjournals.springeropen.com\/articles\/10.1186\/1687-4722-2012-13"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,3,3]]},"references-count":41,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2012,12]]}},"alternative-id":["48"],"URL":"https:\/\/doi.org\/10.1186\/1687-4722-2012-13","relation":{},"ISSN":["1687-4722"],"issn-type":[{"value":"1687-4722","type":"electronic"}],"subject":[],"published":{"date-parts":[[2012,3,3]]},"assertion":[{"value":"10 December 2011","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"3 March 2012","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"3 March 2012","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"13"}}