{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2023,10,23]],"date-time":"2023-10-23T08:10:43Z","timestamp":1698048643298},"reference-count":16,"publisher":"Wiley","issue":"4","license":[{"start":{"date-parts":[[2007,9,5]],"date-time":"2007-09-05T00:00:00Z","timestamp":1188950400000},"content-version":"vor","delay-in-days":7187,"URL":"http:\/\/onlinelibrary.wiley.com\/termsAndConditions#vor"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Systems & Computers in Japan"],"published-print":{"date-parts":[[1988,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>This paper discusses the selection of candidates in speech recognition based on the phoneme recognition. The method is based on the result of phoneme recognition for the part of speech input, for which the segmentation is performed with a high reliability. Using the information concerning the order of the phonemes or phoneme chains, and the information concerning the top and tail phonemes, the candidates are selected. Since only the part for which the segmentation can be performed with a high reliability is used, the candidate reduction has a great effect for the clearly uttered speech, and vice versa. Consequently, the method has the feature that the recognition rate is degraded less by the candidate selection.<\/jats:p><jats:p>First, the proposed selection method is introduced into the word recognition. The candidate selection is applied to all words in the dictionary. A recognition experiment was performed for the cases of the word dictionary composed of 643 city names, with 100 city names uttered by 50 examinees as the input. As a result, the word candidates were reduced to 16 percent, maintaining almost the same recognition performance as in the case without candidate reduction.<\/jats:p><jats:p>Next, the proposed candidate selection is introduced into the phrase recognition. 
In the method, the location of the phoneme to be rejected is estimated in the candidate selection in the derivation of hypothesis, and based on that result, the syntax tree is back\u2010tracked. An experiment was performed for 235 phrases uttered by 2 examinees. As a result, the phrase candidates were reduced to 21 percent, compared with the case without candidate selection.<\/jats:p>","DOI":"10.1002\/scj.4690190402","type":"journal-article","created":{"date-parts":[[2009,11,19]],"date-time":"2009-11-19T21:12:32Z","timestamp":1258665152000},"page":"11-22","source":"Crossref","is-referenced-by-count":0,"title":["Reduction of Word and Minimal Phrase Candidates for Speech Recognition Based on Phoneme Recognition"],"prefix":"10.1002","volume":"19","author":[{"given":"Sho\u2010Ichi","family":"Matsunaga","sequence":"first","affiliation":[]},{"given":"Masaki","family":"Kohda","sequence":"additional","affiliation":[]}],"member":"311","published-online":{"date-parts":[[2007,9,5]]},"reference":[{"key":"e_1_2_1_2_2","first-page":"551","article-title":"Large\u2010vocabulary isolated word recognition with pre\u2010selection","volume":"3","author":"Sugamura","year":"1981","journal-title":"Proc. Acoust. Soc. Jap."},{"key":"e_1_2_1_3_2","first-page":"150","article-title":"Speaker\u2010independent word recognition with large vocabulary using preselection and nonlinear spectral matching, Tech. Rep.","volume":"83","author":"Miwa J.","year":"1983","journal-title":"Acoust. Soc. Jap."},{"key":"e_1_2_1_4_2","first-page":"1061","article-title":"A hierarchical decision approach to large vocabulary discrete utterance recognition, I.E.E.E. Trans. Acoust., Speech","volume":"32","author":"Kaneko T.","year":"1983","journal-title":"Signal Processing"},{"key":"e_1_2_1_5_2","first-page":"546","article-title":"A vector\u2010quantization\u2010based preprocessor for speaker\u2010independent isolated word recognition, I.E.E.E. Trans. Acoust., Speech","volume":"33","author":"Pan K. 
C.","year":"1985","journal-title":"Signal Processing"},{"key":"e_1_2_1_6_2","first-page":"103","article-title":"A large\u2010vocabulary spoken word recognition by more than one pre\u2010selection, Tech. Rep.","volume":"84","author":"Sawai H.","year":"1984","journal-title":"Acoust. Soc. Jap."},{"key":"e_1_2_1_7_2","unstructured":"S. Nakagawa Utsumi and T. Sakai Pre\u2010comparison using global and local features of spoken word Tech. Rep. Acoust. Soc. Jap. S78\u201322 (1978)."},{"key":"e_1_2_1_8_2","unstructured":"D. P. Huttenlocher and V. W. Zue A model of lexical access from partial phonetic information Proc. ICASSP 26.4 (March 1984)."},{"issue":"8","key":"e_1_2_1_9_2","first-page":"869","article-title":"Vocabulary reduction effect by specifying phoneme sequence in words, Trans. (D) I.E.C.E.","volume":"67","author":"Itabashi S.","year":"1984","journal-title":"Japan"},{"issue":"6","key":"e_1_2_1_10_2","first-page":"1304","article-title":"Phrase speech recognition for large vocabulary, Trans. (D) I.E.C.E.","volume":"68","author":"Kobayashi T.","year":"1985","journal-title":"Japan"},{"key":"e_1_2_1_11_2","first-page":"99","article-title":"A continuous speech recognition system with phrase as input unit","volume":"85","author":"Ando N.","year":"1985","journal-title":"Tech. Rep. Acoust. Soc. Jap."},{"key":"e_1_2_1_12_2","unstructured":"K. Shikaano Phoneme level processing in the top\u2010down and bottom\u2010up speech recognition Proc. Acoust. Soc. Jap. pp. 87\u201388 (Sept. 1983)."},{"issue":"6","key":"e_1_2_1_13_2","first-page":"693","article-title":"Spoken word recognition based on top\u2010down phoneme recognition, Trans. (D) I.E.C.E.","volume":"67","author":"Aikawa K.","year":"1984","journal-title":"Japan"},{"issue":"5","key":"e_1_2_1_14_2","first-page":"409","article-title":"LPC peak weighted matching measure, Trans. 
(A) I.E.C.E.","volume":"64","author":"Sugiyama M.","year":"1982","journal-title":"Japan"},{"issue":"9","key":"e_1_2_1_15_2","first-page":"1641","article-title":"Speech recognition based on top\u2010down and bottom\u2010up phoneme recognition, Trans. (D) I.E.C.E.","volume":"68","author":"Matsunaga S.","year":"1985","journal-title":"Japan"},{"key":"e_1_2_1_16_2","volume-title":"Fundamentals of Speech Signal Processing","author":"Saito S.","year":"1981"},{"key":"e_1_2_1_17_2","first-page":"481","volume-title":"The Art of Computer Programming, Vol. 3, Sorting and Searching","author":"Knuth D. E.","year":"1973"}],"container-title":["Systems and Computers in Japan"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/api.wiley.com\/onlinelibrary\/tdm\/v1\/articles\/10.1002%2Fscj.4690190402","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/api.wiley.com\/onlinelibrary\/tdm\/v1\/articles\/10.1002%2Fscj.4690190402","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/pdf\/10.1002\/scj.4690190402","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,10,22]],"date-time":"2023-10-22T07:59:34Z","timestamp":1697961574000},"score":1,"resource":{"primary":{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/10.1002\/scj.4690190402"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[1988,1]]},"references-count":16,"journal-issue":{"issue":"4","published-print":{"date-parts":[[1988,1]]}},"alternative-id":["10.1002\/scj.4690190402"],"URL":"https:\/\/doi.org\/10.1002\/scj.4690190402","archive":["Portico"],"relation":{},"ISSN":["0882-1666","1520-684X"],"issn-type":[{"value":"0882-1666","type":"print"},{"value":"1520-684X","type":"electronic"}],"subject":[],"published":{"date-parts":[[1988,1]]}}}