{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,1,18]],"date-time":"2025-01-18T13:40:22Z","timestamp":1737207622212,"version":"3.33.0"},"reference-count":19,"publisher":"Wiley","issue":"7","license":[{"start":{"date-parts":[[2007,3,21]],"date-time":"2007-03-21T00:00:00Z","timestamp":1174435200000},"content-version":"vor","delay-in-days":7749,"URL":"http:\/\/onlinelibrary.wiley.com\/termsAndConditions#vor"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Systems &amp;amp; Computers in Japan"],"published-print":{"date-parts":[[1986,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>This paper discusses a speech recognition system which integrates the top\u2010down and bottom\u2010up phoneme recognitions. The system is based on the recognition of phonemes, where the top\u2010down and bottom\u2010up processings are combined using a table called a blackboard. In top\u2010down processing, the segmentation and the scoring are performed for each phoneme in the total speech interval, and in the bottom\u2010up processing, only for the interval in which the phoneme segmentation can be performed with certainty. By this scheme, the two recognition processings cooperate, while maintaining their independence. In the proposed system, the linguistic processing and the acoustic processing are structured hierarchically. The two parts are combined through the blackboard, avoiding duplicated processings in the same environment. To evaluate the constructed system, a spoken word recognition experiment with the word dictionaries composed of 100 or 643 city names, and the continuous speech recognition experiment for 235 minimal phrases uttered by two examinees were performed. It was observed as a result that the recognition performance by the traditional top\u2010down processing is almost maintained, while the processing time is decreased to one\u2010half or one\u2010third in word recognition and less than one\u2010fourth in minimal phrase recognition.<\/jats:p>","DOI":"10.1002\/scj.4690170711","type":"journal-article","created":{"date-parts":[[2007,7,7]],"date-time":"2007-07-07T12:28:28Z","timestamp":1183811308000},"page":"95-106","source":"Crossref","is-referenced-by-count":1,"title":["Speech recognition based on top\u2010down and bottom\u2010up phoneme recognition"],"prefix":"10.1002","volume":"17","author":[{"given":"Sho\u2010Ichi","family":"Matsunaga","sequence":"first","affiliation":[]},{"given":"Kiyohiro","family":"Shikano","sequence":"additional","affiliation":[]}],"member":"311","published-online":{"date-parts":[[2007,3,21]]},"reference":[{"key":"e_1_2_1_2_2","first-page":"10","article-title":"Isolated word recognition for large vocabularies","volume":"61","author":"Rabiner L. R.","year":"1982","journal-title":"BSTJ"},{"key":"e_1_2_1_3_2","first-page":"5","article-title":"A hierarchical decision approach to large\u2010vocabulary discrete utterance recognition","volume":"31","author":"Kaneko T.","year":"1983","journal-title":"I.E.E.E. Trans. Acoust., Speech, Signal Processing"},{"key":"e_1_2_1_4_2","first-page":"6","article-title":"Two\u2010level DP\u2010matching a dynamic programming\u2010based pattern matching algorithm for connected word recognition","volume":"27","author":"Sakoe H.","year":"1979","journal-title":"I.E.E.E. Trans. Acoust., Speech Signal Processing"},{"key":"e_1_2_1_5_2","first-page":"2","article-title":"A level building dynamic time warping algorithm for connected word recognition","volume":"39","author":"Myers C. S.","year":"1981","journal-title":"I.E.E.E. Trans. Acoust., Speech, Signal Processing"},{"key":"e_1_2_1_6_2","doi-asserted-by":"publisher","DOI":"10.1121\/1.388090"},{"key":"e_1_2_1_7_2","first-page":"1","article-title":"Organization of the Hearsay II speech understanding system","volume":"23","author":"Lesser V. R.","year":"1975","journal-title":"I.E.E.E. Trans. Acoust., Speech, Signal Processing"},{"key":"e_1_2_1_8_2","unstructured":"W. A.Woodset al. Speech understanding systems\u2010final report BBN Tech. Rep. 3438(1976)."},{"key":"e_1_2_1_9_2","doi-asserted-by":"crossref","unstructured":"R. A.Coleet al. Feature\u2010based speaker\u2010independent recognition of isolated English letters Proc. 1983 ICASSP pp.731\u2013733(Apr. 1983).","DOI":"10.1109\/ICASSP.1983.1172077"},{"key":"e_1_2_1_10_2","first-page":"3","article-title":"Demisyllable based isolated word recognition system","volume":"31","author":"Rosenberg A. E.","year":"1983","journal-title":"I.E.E.E. Trans. Acoust., Speech, Signal Processing"},{"key":"e_1_2_1_11_2","article-title":"Spoken word recognition based on phoneme recognition","volume":"83","author":"Makino S.","year":"1983","journal-title":"Tech. Rep. Speech"},{"key":"e_1_2_1_12_2","article-title":"Speech recognition system integrating top\u2010down and bottom\u2010up processings","volume":"5","author":"Shitano K.","year":"1983","journal-title":"Proc. Inf. Proc. Soc. Jap."},{"key":"e_1_2_1_13_2","first-page":"5","article-title":"LPC peak weighted spectral matching measure","volume":"64","author":"Sugiyama M.","year":"1981","journal-title":"Trans. (A) I.E.C.E., Japan"},{"key":"e_1_2_1_14_2","first-page":"1","article-title":"Phoneme level processing in the top\u2010down and bottom\u2010up speech recognition","volume":"2","author":"Shitano K.","year":"1983","journal-title":"Proc. Acoust. Soc. Jap."},{"key":"e_1_2_1_15_2","first-page":"6","article-title":"Spoken word recognition based on top\u2010down phoneme recognition","volume":"67","author":"Aikawa K.","year":"1984","journal-title":"Trans. (D), I.E.C.E., Japan"},{"volume-title":"Shim\u2010Meikai Japanese dictionary","year":"1974","author":"Kaneda H.","key":"e_1_2_1_16_2"},{"key":"e_1_2_1_17_2","first-page":"481","volume-title":"The Art of Programming","author":"Knuth D. E.","year":"1973"},{"key":"e_1_2_1_18_2","first-page":"2","article-title":"A top\u2010down linguistic processing model for spoken minimal phrase recognition","volume":"2","author":"Matsunaga S.","year":"1983","journal-title":"Proc. Acoust. Soc. Jap."},{"key":"e_1_2_1_19_2","first-page":"1","article-title":"Speech recognition using blackboard in RTN","volume":"1","author":"Mastsunaga S.","year":"1983","journal-title":"Proc. Acoust. Soc. Jap."},{"volume-title":"Fundamentals of Speech Signal Processing","year":"1981","author":"Saito S.","key":"e_1_2_1_20_2"}],"container-title":["Systems and Computers in Japan"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/api.wiley.com\/onlinelibrary\/tdm\/v1\/articles\/10.1002%2Fscj.4690170711","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/pdf\/10.1002\/scj.4690170711","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,1,18]],"date-time":"2025-01-18T13:06:04Z","timestamp":1737205564000},"score":1,"resource":{"primary":{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/10.1002\/scj.4690170711"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[1986,1]]},"references-count":19,"journal-issue":{"issue":"7","published-print":{"date-parts":[[1986,1]]}},"alternative-id":["10.1002\/scj.4690170711"],"URL":"https:\/\/doi.org\/10.1002\/scj.4690170711","archive":["Portico"],"relation":{},"ISSN":["0882-1666","1520-684X"],"issn-type":[{"type":"print","value":"0882-1666"},{"type":"electronic","value":"1520-684X"}],"subject":[],"published":{"date-parts":[[1986,1]]}}}