{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,1,31]],"date-time":"2025-01-31T23:10:22Z","timestamp":1738365022471,"version":"3.35.0"},"reference-count":15,"publisher":"Wiley","issue":"1","license":[{"start":{"date-parts":[[2008,9,12]],"date-time":"2008-09-12T00:00:00Z","timestamp":1221177600000},"content-version":"vor","delay-in-days":6829,"URL":"http:\/\/onlinelibrary.wiley.com\/termsAndConditions#vor"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Trans Emerging Tel Tech"],"published-print":{"date-parts":[[1990,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>This paper gives an introduction to the use of Hidden Markov Models and Information Theory principles for speech recognition. The use of these techniques has led to powerful large vocabulary speech recognition systems developed in the recent years. The use of Hidden Markov Models for stochastic modeling of speech and the basic modules of a Hidden Markov Model based speech recognition system are explained. It is shown how the speech waveform can be transformed into acoustic features and into prototypes which characterize the acoustic state space of the given speaker. The phonemes of the language can be represented by Hidden Markov Models. The statistical parameters of these Markov models can be obtained during a training session. During recognition the input to the linguistic decoder is the acoustic label string resulting from the speech signal of a spoken sentence. The decoder picks the sentence which maximizes the probability of the word sequence of this sentence if the given label stream is seen. 
The paper concludes with a more detailed explanation of the acoustic processor of the speech recognition system and some proposed algorithms for its improvement which can lead to a better system performance.<\/jats:p>","DOI":"10.1002\/ett.4460010108","type":"journal-article","created":{"date-parts":[[2008,9,12]],"date-time":"2008-09-12T12:51:55Z","timestamp":1221223915000},"page":"37-42","source":"Crossref","is-referenced-by-count":3,"title":["Large vocabulary hidden markov model based speech recognition"],"prefix":"10.1002","volume":"1","author":[{"given":"Gerhard","family":"Rigoll","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"311","published-online":{"date-parts":[[2008,9,12]]},"reference":[{"key":"e_1_2_1_2_2","doi-asserted-by":"publisher","DOI":"10.1109\/PROC.1976.10159"},{"key":"e_1_2_1_3_2","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.1983.4767370"},{"key":"e_1_2_1_4_2","first-page":"549","volume-title":"Handbook of Statistics","author":"Jelinek F.","year":"1982"},{"key":"e_1_2_1_5_2","doi-asserted-by":"publisher","DOI":"10.1109\/PROC.1985.13343"},{"key":"e_1_2_1_6_2","doi-asserted-by":"crossref","unstructured":"A.Averbuch et al.:An IBM\u2010PC based large\u2010vocabulary isolated\u2010utterance speech recognizer. Proc. IEEE\u2010ICASSP Tokyo 1986 p.53\u201356.","DOI":"10.1109\/ICASSP.1986.1169169"},{"key":"e_1_2_1_7_2","doi-asserted-by":"crossref","unstructured":"A.Averbuch et al.:Experiments with the Tangora 20 000 word speech recognizer. Proc. IEEE\u2010ICASSP Dallas 1987 p.701\u2013704.","DOI":"10.1109\/ICASSP.1987.1169870"},{"key":"e_1_2_1_8_2","doi-asserted-by":"crossref","unstructured":"L. R.Bahl P. F.Brown P. V.de Souza R. L.Mercer M. A.Picheny:Acoustic Markov Models used in the tangora speech recognition system. Proc. IEEE\u2010ICASSP New York 1988 p.497\u2013500.","DOI":"10.1109\/ICASSP.1988.196628"},{"key":"e_1_2_1_9_2","doi-asserted-by":"crossref","unstructured":"P. 
S.Gopalakrishnan D.Kanevsky A.Nadas D.Nahamoo M. A.Picheny:Decoder selection based on cross\u2010entropies. Proc. IEEE\u2010ICASSP. New York 1988 p.20\u201323.","DOI":"10.1109\/ICASSP.1988.196499"},{"key":"e_1_2_1_10_2","doi-asserted-by":"crossref","unstructured":"G.Rigoll:Experiments with the acoustic processor of a Hidden Markov Model based large vocabulary speech recognition system. Proc. 4th Int. Symposium on Biological and Artificial Intelligence Systems Trento Italy 1988. p.533\u2013545.","DOI":"10.1007\/978-94-009-3117-6_26"},{"key":"e_1_2_1_11_2","doi-asserted-by":"publisher","DOI":"10.1121\/1.2022857"},{"key":"e_1_2_1_12_2","doi-asserted-by":"publisher","DOI":"10.1147\/rd.136.0675"},{"key":"e_1_2_1_13_2","doi-asserted-by":"crossref","unstructured":"A.Nadas R. L.Mercer L. R.Bahl R.Bakis P. S.Cohen A. G.Cole F.Jelinek B.Lewis:Continuous speech recognition with automatically selected acoustic prototypes obtained by either bootstrapping or clustering Proc. IEEE\u2010ICASSP Atlanta 1981 p.1153\u20131155.","DOI":"10.1109\/ICASSP.1981.1171177"},{"key":"e_1_2_1_14_2","doi-asserted-by":"crossref","unstructured":"L. R.Bahl P. F.Brown P. V.de Souza R. L.Mercer:Maximum mutual information estimation of Hidden Markov Model parameters for speech recognition. Proc. IEEE\u2010ICASSP Tokyo 1988 p.49\u201352.","DOI":"10.1109\/ICASSP.1986.1169179"},{"key":"e_1_2_1_15_2","doi-asserted-by":"publisher","DOI":"10.1002\/j.1538-7305.1983.tb03114.x"},{"volume-title":"Digital processing of speech signals","year":"1978","author":"Rabiner L. 
R.","key":"e_1_2_1_16_2"}],"container-title":["European Transactions on Telecommunications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/api.wiley.com\/onlinelibrary\/tdm\/v1\/articles\/10.1002%2Fett.4460010108","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/pdf\/10.1002\/ett.4460010108","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,1,31]],"date-time":"2025-01-31T22:33:32Z","timestamp":1738362812000},"score":1,"resource":{"primary":{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/10.1002\/ett.4460010108"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[1990,1]]},"references-count":15,"journal-issue":{"issue":"1","published-print":{"date-parts":[[1990,1]]}},"alternative-id":["10.1002\/ett.4460010108"],"URL":"https:\/\/doi.org\/10.1002\/ett.4460010108","archive":["Portico"],"relation":{},"ISSN":["1124-318X","1541-8251"],"issn-type":[{"type":"print","value":"1124-318X"},{"type":"electronic","value":"1541-8251"}],"subject":[],"published":{"date-parts":[[1990,1]]}}}