{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2022,5,6]],"date-time":"2022-05-06T00:12:11Z","timestamp":1651795931428},"reference-count":11,"publisher":"IGI Global","issue":"2","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2017,4]]},"abstract":"<jats:p>In real world applications of speech recognition, recognition errors are inevitable, and manual correction is necessary. This paper presents an approach for the refinement of Mandarin speech recognition result by exploiting user feedback. An interface incorporating character-based candidate lists and feedback-driven updating of the candidate lists is introduced. For dynamic updating of candidate lists, a novel method based on lattice modification and rescoring is proposed. By adding words with similar pronunciations to the candidates next to the corrected character into the lattice and then performing rescoring on the modified lattice, the proposed method can improve the accuracy of the candidate lists even if the correct characters are not in the original lattice, with much lower computational cost than that of the speech re-recognition methods. 
Experimental results show that the proposed method can reduce 24.03% of user inputs and improve average candidate rank by 25.31%.<\/jats:p>","DOI":"10.4018\/ijapuc.2017040104","type":"journal-article","created":{"date-parts":[[2017,5,31]],"date-time":"2017-05-31T17:52:07Z","timestamp":1496253127000},"page":"55-64","source":"Crossref","is-referenced-by-count":0,"title":["Feedback-Driven Refinement of Mandarin Speech Recognition Result based on Lattice Modification and Rescoring"],"prefix":"10.4018","volume":"9","author":[{"given":"Xiangdong","family":"Wang","sequence":"first","affiliation":[{"name":"Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China"}]},{"given":"Yang","family":"Yang","sequence":"additional","affiliation":[{"name":"Jiangsu Enterprise Information Operation Center, China Telecom Corporation Limited, Nanjing, China"}]},{"given":"Hong","family":"Liu","sequence":"additional","affiliation":[{"name":"Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China"}]},{"given":"Yueliang","family":"Qian","sequence":"additional","affiliation":[{"name":"Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China"}]},{"given":"Duan","family":"Jia","sequence":"additional","affiliation":[{"name":"Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China"}]}],"member":"2432","reference":[{"key":"IJAPUC.2017040104-0","first-page":"113","article-title":"Real-Time Correction of Closed-Captions.","author":"P.Cardinal","year":"2007","journal-title":"Proceedings of the ACL \u201807 Demo and Poster"},{"key":"IJAPUC.2017040104-1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2011.5947450"},{"key":"IJAPUC.2017040104-2","first-page":"583","article-title":"Candidate Generation for Interactive Chinese Speech Recognition.","author":"X.Li","year":"2009","journal-title":"Proc. 
Joint Conferences on Pervasive Computing (JCPC \u201809)"},{"issue":"2","key":"IJAPUC.2017040104-3","first-page":"31","article-title":"Efficient Speech-recognition Error Correction for More Usable Speech-to-text Input.","volume":"11","author":"Y.Nakashima","year":"2009","journal-title":"NTT DOCOMO Technical Journal"},{"key":"IJAPUC.2017040104-4","article-title":"Computer Assisted Speech Transcription System for Efficient Speech Archive.","author":"H.Nanjo","year":"2006","journal-title":"Proceedings of the 9th Western Pacific Acoustics Conference"},{"key":"IJAPUC.2017040104-5","first-page":"133","article-title":"Speech Repair: Quick Error Correction Just by Using Selection Operation for Speech Input Interfaces.","volume":"2005","author":"J.Ogata","year":"2005","journal-title":"Proc. Eurospeech"},{"key":"IJAPUC.2017040104-6","doi-asserted-by":"crossref","unstructured":"Oviatt, S., Cohen, P., et al. (2000). Designing the User Interface for Multimodal Speech and Pen-Based Gesture Applications: State-of-the-Art Systems and Future Research Directions. Human-Computer Interaction, 15(4), pp. 263-322.","DOI":"10.1207\/S15327051HCI1504_1"},{"key":"IJAPUC.2017040104-7","doi-asserted-by":"publisher","DOI":"10.1145\/1891903.1891943"},{"key":"IJAPUC.2017040104-8","unstructured":"Senay, G. & Linares, G., et al. (2010). Transcriber Driving Strategies for Transcription aid System, LREC 2010."},{"key":"IJAPUC.2017040104-9","first-page":"1698","article-title":"Intelligently Aiding Human-Guided Correction of Speech Recognition.","author":"K.Vertanen","year":"2011","journal-title":"Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence (AAAI \u201810)"},{"key":"IJAPUC.2017040104-10","first-page":"853","article-title":"Improved Confusion Network Algorithm and Shortest Path Search from word lattice.","author":"J.Xue","year":"2005","journal-title":"Proc. 
ICASSP \u201805"}],"container-title":["International Journal of Advanced Pervasive and Ubiquitous Computing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.igi-global.com\/viewtitle.aspx?TitleId=182527","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,5,5]],"date-time":"2022-05-05T23:53:58Z","timestamp":1651794838000},"score":1,"resource":{"primary":{"URL":"http:\/\/services.igi-global.com\/resolvedoi\/resolve.aspx?doi=10.4018\/IJAPUC.2017040104"}},"subtitle":[""],"short-title":[],"issued":{"date-parts":[[2017,4]]},"references-count":11,"journal-issue":{"issue":"2"},"URL":"https:\/\/doi.org\/10.4018\/ijapuc.2017040104","relation":{},"ISSN":["1937-965X","1937-9668"],"issn-type":[{"value":"1937-965X","type":"print"},{"value":"1937-9668","type":"electronic"}],"subject":[],"published":{"date-parts":[[2017,4]]}}}