{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:29:18Z","timestamp":1750307358597,"version":"3.41.0"},"reference-count":37,"publisher":"Association for Computing Machinery (ACM)","issue":"3","license":[{"start":{"date-parts":[[2011,5,1]],"date-time":"2011-05-01T00:00:00Z","timestamp":1304208000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Speech Lang. Process."],"published-print":{"date-parts":[[2011,5]]},"abstract":"<jats:p>This article presents a user model for user simulation and a system state representation in spoken decision support dialogue systems. When selecting from a group of alternatives, users apply different decision-making criteria with different priorities. At the beginning of the dialogue, however, users often do not have a definite goal or criteria in which they place value, thus they can learn about new features while interacting with the system and accordingly create new criteria. In this article, we present a user model and dialogue state representation that accommodate these patterns by considering the user's knowledge and preferences. To estimate the parameters used in the user model, we implemented a trial sightseeing guidance system, collected dialogue data, and trained a user simulator. Since the user parameters are not observable from the system, the dialogue is modeled as a partially observable Markov decision process (POMDP), and a dialogue state representation was introduced based on the model. We then optimized its dialogue strategy so that users can make better choices. The dialogue strategy is evaluated using a user simulator trained from a large number of dialogues collected using a trial dialogue system.<\/jats:p>","DOI":"10.1145\/1966407.1966415","type":"journal-article","created":{"date-parts":[[2011,6,6]],"date-time":"2011-06-06T11:51:38Z","timestamp":1307361098000},"page":"1-18","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":6,"title":["Modeling spoken decision support dialogue and optimization of its dialogue strategy"],"prefix":"10.1145","volume":"7","author":[{"given":"Teruhisa","family":"Misu","sequence":"first","affiliation":[{"name":"National Institute of Information and Communications Technology, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Komei","family":"Sugiura","sequence":"additional","affiliation":[{"name":"National Institute of Information and Communications Technology, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tatsuya","family":"Kawahara","sequence":"additional","affiliation":[{"name":"National Institute of Information and Communications Technology, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kiyonori","family":"Ohtake","sequence":"additional","affiliation":[{"name":"National Institute of Information and Communications Technology, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chiori","family":"Hori","sequence":"additional","affiliation":[{"name":"National Institute of Information and Communications Technology, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hideki","family":"Kashioka","sequence":"additional","affiliation":[{"name":"National Institute of Information and Communications Technology, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hisashi","family":"Kawai","sequence":"additional","affiliation":[{"name":"National Institute of Information and Communications Technology, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Satoshi","family":"Nakamura","sequence":"additional","affiliation":[{"name":"National Institute of Information and Communications Technology, Japan"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2011,6,6]]},"reference":[{"volume-title":"Proceedings of the Spoken Laguage Technology Workshop (SLT). 170--173","author":"Bohus D.","key":"e_1_2_1_1_1","unstructured":"Bohus , D. , Langner , B. , Raux , A. , Black , A. , Eskenazi , M. , and Rudnicky , A . 2006. Online supervised learning of non-understanding recovery policies . In Proceedings of the Spoken Laguage Technology Workshop (SLT). 170--173 . Bohus, D., Langner, B., Raux, A., Black, A., Eskenazi, M., and Rudnicky, A. 2006. Online supervised learning of non-understanding recovery policies. In Proceedings of the Spoken Laguage Technology Workshop (SLT). 170--173."},{"volume-title":"Proceedings of the 14th Annual Conference on Uncertainty in Artificial Intelligence. 43--52","author":"Breese J.","key":"e_1_2_1_2_1","unstructured":"Breese , J. , Heckerman , D. , and Kadie , C . 1998. Empirical analysis of predictive algorithms for collaborative filtering . In Proceedings of the 14th Annual Conference on Uncertainty in Artificial Intelligence. 43--52 . Breese, J., Heckerman, D., and Kadie, C. 1998. Empirical analysis of predictive algorithms for collaborative filtering. In Proceedings of the 14th Annual Conference on Uncertainty in Artificial Intelligence. 43--52."},{"volume-title":"Proceedings of the Eurospeech.","author":"Dohsaka K.","key":"e_1_2_1_3_1","unstructured":"Dohsaka , K. , Yasuda , N. , and Aikawa , K . 2003. Efficient spoken dialogue control depending on the speech recognition rate and system's database . In Proceedings of the Eurospeech. Dohsaka, K., Yasuda, N., and Aikawa, K. 2003. Efficient spoken dialogue control depending on the speech recognition rate and system's database. In Proceedings of the Eurospeech."},{"volume-title":"Proceedings of the Acoustic Society of Japan Fall Meeting (in Japanese). 221--222","author":"Itoh G.","key":"e_1_2_1_4_1","unstructured":"Itoh , G. , Ashikari , Y. , Jitsuhiro , T. , and Nakamura , S . 2004. Summary and evaluation of speech recognition integrated environment ATRASR . In Proceedings of the Acoustic Society of Japan Fall Meeting (in Japanese). 221--222 . Itoh, G., Ashikari, Y., Jitsuhiro, T., and Nakamura, S. 2004. Summary and evaluation of speech recognition integrated environment ATRASR. In Proceedings of the Acoustic Society of Japan Fall Meeting (in Japanese). 221--222."},{"volume-title":"Proceedings of Interspeech. 1696--1696","author":"Kawahara T.","key":"e_1_2_1_5_1","unstructured":"Kawahara , T. , Toyokura , M. , Misu , T. , and Hori , C . 2008. Detection of feeling through back-channels in spoken dialogue . In Proceedings of Interspeech. 1696--1696 . Kawahara, T., Toyokura, M., Misu, T., and Hori, C. 2008. Detection of feeling through back-channels in spoken dialogue. In Proceedings of Interspeech. 1696--1696."},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11257-004-5659-0"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0167-6393(01)00048-6"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.1999.758172"},{"key":"e_1_2_1_9_1","volume-title":"Proceedings of SEMdial.","author":"Lemon O.","year":"2008","unstructured":"Lemon , O. 2008 . Adaptive natural language generation in dialogue using reinforcement learning . In Proceedings of SEMdial. Lemon, O. 2008. Adaptive natural language generation in dialogue using reinforcement learning. In Proceedings of SEMdial."},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/89.817450"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/1121949.1121979"},{"volume-title":"Proceedings of the 18th National Conference on Artificial Intelligence. 187--192","author":"Melville P.","key":"e_1_2_1_12_1","unstructured":"Melville , P. , Mooney , R. , and Nagarajan , R . 2002. Content-boosted collaborative filtering for improved recommendations . In Proceedings of the 18th National Conference on Artificial Intelligence. 187--192 . Melville, P., Mooney, R., and Nagarajan, R. 2002. Content-boosted collaborative filtering for improved recommendations. In Proceedings of the 18th National Conference on Artificial Intelligence. 187--192."},{"volume-title":"Proceedings of the 1st International Workshop on Spoken Dialogue Systems Technology (IWSDS).","author":"Minami Y.","key":"e_1_2_1_13_1","unstructured":"Minami , Y. , Mori , A. , Meguro , T. , Higashinaka , R. , Dohsaka , K. , and Maeda , E . 2009. Dialogue control algorithm for ambient intelligence based on partially observable markov decision processes . In Proceedings of the 1st International Workshop on Spoken Dialogue Systems Technology (IWSDS). Minami, Y., Mori, A., Meguro, T., Higashinaka, R., Dohsaka, K., and Maeda, E. 2009. Dialogue control algorithm for ambient intelligence based on partially observable markov decision processes. In Proceedings of the 1st International Workshop on Spoken Dialogue Systems Technology (IWSDS)."},{"volume-title":"Proceedings of Interspeech. 9--12","author":"Misu T.","key":"e_1_2_1_14_1","unstructured":"Misu , T. and Kawahara , T . 2006. A bootstrapping approach for developing language model of new spoken dialogue systems by selecting Web Texts . In Proceedings of Interspeech. 9--12 . Misu, T. and Kawahara, T. 2006. A bootstrapping approach for developing language model of new spoken dialogue systems by selecting Web Texts. In Proceedings of Interspeech. 9--12."},{"volume-title":"Proceedings of Interspeech.","author":"Misu T.","key":"e_1_2_1_15_1","unstructured":"Misu , T. , Ohtake , K. , Hori , C. , Kashioka , H. , and Nakamura , S . 2009. Annotating communicative function and semantic content in dialogue act for construction of consulting dialogue systems . In Proceedings of Interspeech. Misu, T., Ohtake, K., Hori, C., Kashioka, H., and Nakamura, S. 2009. Annotating communicative function and semantic content in dialogue act for construction of consulting dialogue systems. In Proceedings of Interspeech."},{"volume-title":"Proceedings of the 7th Workshop on Asian Language Resources. 32--39","author":"Ohtake K.","key":"e_1_2_1_16_1","unstructured":"Ohtake , K. , Misu , T. , Hori , C. , Kashioka , H. , and Nakamura , S . 2009. Annotating dialogue acts to construct dialogue systems for consulting . In Proceedings of the 7th Workshop on Asian Language Resources. 32--39 . Ohtake, K., Misu, T., Hori, C., Kashioka, H., and Nakamura, S. 2009. Annotating dialogue acts to construct dialogue systems for consulting. In Proceedings of the 7th Workshop on Asian Language Resources. 32--39."},{"volume-title":"Proceedings of SIGDIAL. 1--10","author":"Paksima T.","key":"e_1_2_1_17_1","unstructured":"Paksima , T. , Georgila , K. , and Moore , J . 2009. Evaluating the effectiveness of information presentation in a full end-to-end dialogue system . In Proceedings of SIGDIAL. 1--10 . Paksima, T., Georgila, K., and Moore, J. 2009. Evaluating the effectiveness of information presentation in a full end-to-end dialogue system. In Proceedings of SIGDIAL. 1--10."},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2007.11.026"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSA.2005.855836"},{"volume-title":"Proceedings of the 1st International Workshop on Spoken Dialogue Systems Technology (IWSDS).","author":"Pietquin O.","key":"e_1_2_1_20_1","unstructured":"Pietquin , O. , Rossignol , S. , and Ianotto , M . 2009. Training Bayesian networks for realistic man-machine spoken dialogue simulation . In Proceedings of the 1st International Workshop on Spoken Dialogue Systems Technology (IWSDS). Pietquin, O., Rossignol, S., and Ianotto, M. 2009. Training Bayesian networks for realistic man-machine spoken dialogue simulation. In Proceedings of the 1st International Workshop on Spoken Dialogue Systems Technology (IWSDS)."},{"key":"e_1_2_1_21_1","volume-title":"Proceedings of Interspeech.","author":"Pinault F.","year":"2009","unstructured":"Pinault , F. , Lefevre , F. , and de Mori , R. 2009 . Feature-based summary space for stochastic dialogue modeling with hierarchical semantic frames . In Proceedings of Interspeech. Pinault, F., Lefevre, F., and de Mori, R. 2009. Feature-based summary space for stochastic dialogue modeling with hierarchical semantic frames. In Proceedings of Interspeech."},{"volume-title":"Proceedings of ACL\/HLT. 479--487","author":"Polifroni J.","key":"e_1_2_1_22_1","unstructured":"Polifroni , J. and Walker , M . 2008. Intensional summaries as cooperative responses in dialogue: Automation and evaluation . In Proceedings of ACL\/HLT. 479--487 . Polifroni, J. and Walker, M. 2008. Intensional summaries as cooperative responses in dialogue: Automation and evaluation. In Proceedings of ACL\/HLT. 479--487."},{"volume-title":"Proceedings of ICSLP.","author":"Potamianos A.","key":"e_1_2_1_23_1","unstructured":"Potamianos , A. , Ammicht , E. , and Kuo , H . 2000. Dialogue management in the Bell Labs Communicator System . In Proceedings of ICSLP. Potamianos, A., Ammicht, E., and Kuo, H. 2000. Dialogue management in the Bell Labs Communicator System. In Proceedings of ICSLP."},{"volume-title":"Proceedings of Interspeech.","author":"Raux A.","key":"e_1_2_1_24_1","unstructured":"Raux , A. , Langner , B. , Black , A. , and Eskenazi , M . 2005. Let's go public&excl; Taking a spoken dialog system to the real world . In Proceedings of Interspeech. Raux, A., Langner, B., Black, A., and Eskenazi, M. 2005. Let's go public&excl; Taking a spoken dialog system to the real world. In Proceedings of Interspeech."},{"volume-title":"Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics (EACL).","author":"Rieser V.","key":"e_1_2_1_25_1","unstructured":"Rieser , V. and Lemon , O . 2009. Natural language generation as planning under uncertainty for spoken dialogue systems . In Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics (EACL). Rieser, V. and Lemon, O. 2009. Natural language generation as planning under uncertainty for spoken dialogue systems. In Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics (EACL)."},{"key":"e_1_2_1_26_1","volume-title":"Proceedings of ICSLP.","volume":"2","author":"Rudnicky A.","year":"2000","unstructured":"Rudnicky , A. , Bennett , C. , Black , A. , Chotomongcol , A. , Lenzo , K. , Oh , A. , and Singh . 2000 . Tasks and domain specific modelling in the Carnegie Mellon Communicator System . In Proceedings of ICSLP. Vol. 2 . Rudnicky, A., Bennett, C., Black, A., Chotomongcol, A., Lenzo, K., Oh, A., and Singh. 2000. Tasks and domain specific modelling in the Carnegie Mellon Communicator System. In Proceedings of ICSLP. Vol. 2."},{"volume-title":"Priority Setting, Resource Allocation","author":"Saaty T.","key":"e_1_2_1_27_1","unstructured":"Saaty , T. 1980. The Analytic Hierarchy Process: Planning , Priority Setting, Resource Allocation . McGraw-Hill . Saaty, T. 1980. The Analytic Hierarchy Process: Planning, Priority Setting, Resource Allocation. McGraw-Hill."},{"volume-title":"Proceedings of HLT\/NAACL.","author":"Schatzmann J.","key":"e_1_2_1_28_1","unstructured":"Schatzmann , J. , Thomson , B. , Weilhammer , K. , Ye , H. , and Young , S . 2007. Agenda-based user simulation for bootstrapping a POMDP dialogue system . In Proceedings of HLT\/NAACL. Schatzmann, J., Thomson, B., Weilhammer, K., Ye, H., and Young, S. 2007. Agenda-based user simulation for bootstrapping a POMDP dialogue system. In Proceedings of HLT\/NAACL."},{"volume-title":"Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU). 9--13","author":"Schatzmann J.","key":"e_1_2_1_29_1","unstructured":"Schatzmann , J. , Thomson , B. , and Young , S . 2007. Error simulation for training statistical dialogue systems . In Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU). 9--13 . Schatzmann, J., Thomson, B., and Young, S. 2007. Error simulation for training statistical dialogue systems. In Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU). 9--13."},{"volume-title":"Proceedings of ANLP-NAACL, Satellite Workshop.","author":"Seneff S.","key":"e_1_2_1_30_1","unstructured":"Seneff , S. and Polifroni , J . 2000. Dialogue management in the Mercury flight reservation system . In Proceedings of ANLP-NAACL, Satellite Workshop. Seneff, S. and Polifroni, J. 2000. Dialogue management in the Mercury flight reservation system. In Proceedings of ANLP-NAACL, Satellite Workshop."},{"volume-title":"Proceedings of the ESCA Workshop on Interactive Dialogue in Multi-Modal Systems.","author":"Sturm J.","key":"e_1_2_1_31_1","unstructured":"Sturm , J. , Os , E. , and Boves , L . 1999. Issues in spoken dialogue systems: Experiences with the Dutch ARISE system . In Proceedings of the ESCA Workshop on Interactive Dialogue in Multi-Modal Systems. Sturm, J., Os, E., and Boves, L. 1999. Issues in spoken dialogue systems: Experiences with the Dutch ARISE system. In Proceedings of the ESCA Workshop on Interactive Dialogue in Multi-Modal Systems."},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.5555\/551283"},{"volume-title":"Proceedings of ICASSP. 4937--4940","author":"Thomson B.","key":"e_1_2_1_33_1","unstructured":"Thomson , B. , Schatzmann , J. , and Young , S . 2008. Bayesian update of dialogue state for robust dialogue systems . In Proceedings of ICASSP. 4937--4940 . Thomson, B., Schatzmann, J., and Young, S. 2008. Bayesian update of dialogue state for robust dialogue systems. In Proceedings of ICASSP. 4937--4940."},{"key":"e_1_2_1_34_1","unstructured":"Thrun S. 2000. Monte Carlo POMDPs. In Advances in Neural Information Processing Systems 12. 1064--1070.  Thrun S. 2000. Monte Carlo POMDPs. In Advances in Neural Information Processing Systems 12. 1064--1070."},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.3115\/976909.979652"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASL.2007.902050"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/89.817460"}],"container-title":["ACM Transactions on Speech and Language Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1966407.1966415","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1966407.1966415","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T11:22:26Z","timestamp":1750245746000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1966407.1966415"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,5]]},"references-count":37,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2011,5]]}},"alternative-id":["10.1145\/1966407.1966415"],"URL":"https:\/\/doi.org\/10.1145\/1966407.1966415","relation":{},"ISSN":["1550-4875","1550-4883"],"issn-type":[{"type":"print","value":"1550-4875"},{"type":"electronic","value":"1550-4883"}],"subject":[],"published":{"date-parts":[[2011,5]]},"assertion":[{"value":"2010-07-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2010-12-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2011-06-06","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}