{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,18]],"date-time":"2025-11-18T09:33:19Z","timestamp":1763458399494,"version":"3.45.0"},"reference-count":50,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2007,12,1]],"date-time":"2007-12-01T00:00:00Z","timestamp":1196467200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000185","name":"Defense Advanced Research Projects Agency","doi-asserted-by":"publisher","award":["H0011-06-C-0023"],"award-info":[{"award-number":["H0011-06-C-0023"]}],"id":[{"id":"10.13039\/100000185","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Speech Lang. Process."],"published-print":{"date-parts":[[2007,12]]},"abstract":"<jats:p>\n                    We explore the use of morph-based language models in large-vocabulary continuous-speech recognition systems across four so-called morphologically rich languages: Finnish, Estonian, Turkish, and Egyptian Colloquial Arabic. The morphs are subword units discovered in an unsupervised, data-driven way using the\n                    <jats:italic toggle=\"yes\">Morfessor<\/jats:italic>\n                    algorithm. By estimating\n                    <jats:italic toggle=\"yes\">n<\/jats:italic>\n                    -gram language models over sequences of morphs instead of words, the quality of the language model is improved through better vocabulary coverage and reduced data sparsity. Standard word models suffer from high out-of-vocabulary (OOV) rates, whereas the morph models can recognize previously unseen word forms by concatenating morphs. It is shown that the morph models do perform fairly well on OOVs without compromising the recognition accuracy on in-vocabulary words. The Arabic experiment constitutes the only exception since here the standard word model outperforms the morph model. Differences in the datasets and the amount of data are discussed as a plausible explanation.\n                  <\/jats:p>","DOI":"10.1145\/1322391.1322394","type":"journal-article","created":{"date-parts":[[2008,1,3]],"date-time":"2008-01-03T13:20:10Z","timestamp":1199366410000},"page":"1-29","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":53,"title":["Morph-based speech recognition and modeling of out-of-vocabulary words across languages"],"prefix":"10.1145","volume":"5","author":[{"given":"Mathias","family":"Creutz","sequence":"first","affiliation":[{"name":"Helsinki University of Technology, TKK, Finland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Teemu","family":"Hirsim\u00e4ki","sequence":"additional","affiliation":[{"name":"Helsinki University of Technology, TKK, Finland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mikko","family":"Kurimo","sequence":"additional","affiliation":[{"name":"Helsinki University of Technology, TKK, Finland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Antti","family":"Puurula","sequence":"additional","affiliation":[{"name":"Helsinki University of Technology, TKK, Finland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Janne","family":"Pylkk\u00f6nen","sequence":"additional","affiliation":[{"name":"Helsinki University of Technology, TKK, Finland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Vesa","family":"Siivola","sequence":"additional","affiliation":[{"name":"Helsinki University of Technology, TKK, Finland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Matti","family":"Varjokallio","sequence":"additional","affiliation":[{"name":"Helsinki University of Technology, TKK, Finland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ebru","family":"Arisoy","sequence":"additional","affiliation":[{"name":"Bo\u01e7azi\u00e7i University, Istanbul"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Murat","family":"Sara\u00e7lar","sequence":"additional","affiliation":[{"name":"Bo\u01e7azi\u00e7i University, Istanbul"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Andreas","family":"Stolcke","sequence":"additional","affiliation":[{"name":"SRI International, Menlo Park International Computer Science Institute, Berkeley"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2007,12,12]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.sigpro.2005.12.002"},{"volume-title":"Proceedings of Interspeech (ICSLP'06)","author":"Arisoy E.","key":"e_1_2_1_2_1","unstructured":"Arisoy, E. and Sara\u00e7lar, M. 2006. Lattice extension and rescoring based approaches for LVCSR of turkish. In Proceedings of Interspeech (ICSLP'06). 1025--1028."},{"volume-title":"Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech)","author":"Bazzi I.","key":"e_1_2_1_3_1","unstructured":"Bazzi, I. and Glass, J. 2001. Learning units for domain-independent out-of-vocabulary word modelling. In Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech). Aalborg, Denmark."},{"volume-title":"Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP)","author":"Bazzi I.","key":"e_1_2_1_4_1","unstructured":"Bazzi, I. and Glass, J. R. 2000. Modeling out-of-vocabulary words for robust speech recognition. In Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP). Beijing, China."},{"volume-title":"Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP)","author":"Bazzi I.","key":"e_1_2_1_5_1","unstructured":"Bazzi, I. and Glass, J. R. 2002. A multi-class approach for modelling out-of-vocabulary words. In Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP). Denver, CO."},{"key":"e_1_2_1_6_1","first-page":"1165","article-title":"Compound words in large-vocabulary German speech recognition systems. In Proceedings of ICSLP '96. Philadelphia","volume":"2","author":"Berton A.","year":"1996","unstructured":"Berton, A., Fetter, P., and Regel-Brietzmann, P. 1996. Compound words in large-vocabulary German speech recognition systems. In Proceedings of ICSLP '96. Philadelphia, PA. Vol. 2, 1165--1168.","journal-title":"PA."},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.3115\/1073483.1073485"},{"volume-title":"Proceedings of Interspeech'05","author":"Bisani M.","key":"e_1_2_1_8_1","unstructured":"Bisani, M. and Ney, H. 2005. Open vocabulary speech recognition with flat hybrid models. In Proceedings of Interspeech'05. Lisbon, Portugal."},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1007541817488"},{"volume-title":"Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech). 487--489","author":"Byrne W.","key":"e_1_2_1_10_1","unstructured":"Byrne, W., Haji\u010d, J., Ircing, P., Jelinek, F., Khudanpur, S., Krbec, P., and Psutka, J. 2001. On large vocabulary continuous speech recognition of highly inflectional language---Czech. In Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech). 487--489."},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1006\/csla.1999.0128"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.3115\/1075096.1075132"},{"key":"e_1_2_1_13_1","unstructured":"Creutz M. 2006. Induction of the morphology of natural language: Unsupervised morpheme segmentation with application to automatic speech recognition. Ph.D. thesis Dissertations in Computer and Information Science Report D13 Helsinki University of Technology. http:\/\/lib.tkk.fi\/Diss\/2006\/isbn9512282119\/."},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.3115\/1118647.1118650"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.5555\/1622153.1622159"},{"volume-title":"Proceedings of the International and Interdisciplinary Conference on Adaptive Knowledge Representation and Reasoning (AKRR'05)","author":"Creutz M.","key":"e_1_2_1_16_1","unstructured":"Creutz, M. and Lagus, K. 2005a. Inducing the morphological lexicon of a natural language from unannotated text. In Proceedings of the International and Interdisciplinary Conference on Adaptive Knowledge Representation and Reasoning (AKRR'05). Espoo, Finland. 106--113."},{"key":"e_1_2_1_17_1","unstructured":"Creutz M. and Lagus K. 2005b. Unsupervised morpheme segmentation and morphology induction from text corpora using Morfessor 1.0. Tech. rep. A81 Publications in Computer and Information Science Helsinki University of Technology."},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/1187415.1187418"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.5555\/924901"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.21437\/Eurospeech.2003-111"},{"volume-title":"Proceedings of the 4th International Conference on Spoken Language Processing (ICSLP)","author":"Gallwitz F.","key":"e_1_2_1_21_1","unstructured":"Gallwitz, F., N\u00f6th, E., and Niemann, H. 1996. A category based approach for recognition of out-of-vocabulary words. In Proceedings of the 4th International Conference on Spoken Language Processing (ICSLP). Philadelphia, PA."},{"key":"e_1_2_1_22_1","volume-title":"Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).","volume":"2","author":"Geutner P.","unstructured":"Geutner, P., Finke, M., and Scheytt, P. 1998. Adaptive vocabularies for transcribing multilingual broadcast news. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Vol. 2. 925--928."},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1162\/089120101750300490"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.3115\/1220175.1220260"},{"volume-title":"Proceedings of the 8th European Conference on Speech Communication and Technology (Eurospeech). 1165--1168","author":"Hacioglu K.","key":"e_1_2_1_25_1","unstructured":"Hacioglu, K., Pellom, B., Ciloglu, T., Ozturk, O., Kurimo, M., and Creutz, M. 2003. On lexicon creation for Turkish LVCSR. In Proceedings of the 8th European Conference on Speech Communication and Technology (Eurospeech). 1165--1168."},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.2307\/411036"},{"volume-title":"Morpheme boundaries within words: Report on a computer test. Transform. Discourse Anal. Papers 73. (Reprinted 1970 in Papers in Structural and Transformational Linguistics","author":"Harris Z. S.","key":"e_1_2_1_27_1","unstructured":"Harris, Z. S. 1967. Morpheme boundaries within words: Report on a computer test. Transform. Discourse Anal. Papers 73. (Reprinted 1970 in Papers in Structural and Transformational Linguistics, Reidel Publishing Company, Dordrecht, Holland.)"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.csl.2005.07.002"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.csl.2005.10.001"},{"volume-title":"Proceedings of the 6th European Conference on Speech Communication and Technology (Eurospeech)","author":"Klakow D.","key":"e_1_2_1_30_1","unstructured":"Klakow, D., Rose, G., and Aubert, X. 1999. OOV-detection in large vocabulary system using automatically defined word-fragments as fillers. In Proceedings of the 6th European Conference on Speech Communication and Technology (Eurospeech). Budapest, Hungary. 49--52."},{"volume-title":"Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech). 69--72","author":"Kneissler J.","key":"e_1_2_1_31_1","unstructured":"Kneissler, J. and Klakow, D. 2001. Speech recognition for huge vocabularies by using optimized sub-word units. In Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech). 69--72."},{"volume-title":"Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP'95)","author":"Kneser R.","key":"e_1_2_1_32_1","unstructured":"Kneser, R. and Ney, H. 1995. Improved backing-off for m-gram language modeling. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP'95). 181--184."},{"volume-title":"Proceedings of the PASCAL Challenge Workshop on Unsupervised Segmentation of Words into Morphemes","author":"Kurimo M.","key":"e_1_2_1_33_1","unstructured":"Kurimo, M., Creutz, M., Varjokallio, M., Arisoy, E., and Sara\u00e7lar, M. 2006a. Unsupervised segmentation of words into morphemes---Challenge 2005, an introduction and evaluation report. In Proceedings of the PASCAL Challenge Workshop on Unsupervised Segmentation of Words into Morphemes. Venice, Italy."},{"key":"e_1_2_1_34_1","volume-title":"Proceedings of Interspeech","author":"Kurimo M.","year":"2006","unstructured":"Kurimo, M., Creutz, M., Varjokallio, M., Arisoy, E., and Sara\u00e7lar, M. 2006b. Unsupervised segmentation of words into morphemes---Morpho Challenge 2005, Application to automatic speech recognition. In Proceedings of Interspeech 2006. Pittsburgh, PA."},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.3115\/1220835.1220897"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0167-6393(02)00031-6"},{"volume-title":"Proceedings of ICSLP.","author":"Larson M.","key":"e_1_2_1_37_1","unstructured":"Larson, M., Willett, D., Koehler, J., and Rigoll, G. 2000. Compound splitting and lexical unit recombination for improved performance of a speech recognition system for German parliamentary speeches. In Proceedings of ICSLP."},{"key":"e_1_2_1_38_1","unstructured":"Mohri M. and Riley M. D. 2002. DCD library Speech recognition decoder library. AT&T Labs Research. http:\/\/www.research.att.com\/sw\/tools\/dcd\/."},{"volume-title":"Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","author":"Mou X.","key":"e_1_2_1_39_1","unstructured":"Mou, X. and Zue, V. 2001. Sub-lexical modelling using a finite-state transducer framework. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Salt Lake City, UT."},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.21437\/Eurospeech.2003-105"},{"key":"e_1_2_1_41_1","volume-title":"Proceedings of the 2nd Baltic Conference on Human Language Technologies (HLT'05)","author":"Pylkk\u00f6nen J.","year":"2005","unstructured":"Pylkk\u00f6nen, J. 2005. An efficient one-pass decoder for Finnish large vocabulary continuous speech recognition. In Proceedings of the 2nd Baltic Conference on Human Language Technologies (HLT'05). 167--172."},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.5555\/534247"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.3115\/1117601.1117615"},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.3115\/1073336.1073360"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.5555\/1610075.1610130"},{"volume-title":"Proceedings of Interspeech'05","author":"Siivola V.","key":"e_1_2_1_46_1","unstructured":"Siivola, V. and Pellom, B. 2005. Growing an n-gram model. In Proceedings of Interspeech'05. Lisbon, Portugal, 1309--1312."},{"key":"e_1_2_1_47_1","volume-title":"Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop","author":"Stolcke A.","year":"1998","unstructured":"Stolcke, A. 1998. Entropy-based pruning of backoff language models. In Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop. Lansdowne, VA. 270--274."},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.21437\/ICSLP.2002-303"},{"volume-title":"Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP). 170--173","author":"Whittaker E.","key":"e_1_2_1_49_1","unstructured":"Whittaker, E. and Woodland, P. 2000. Particle-based language modelling. In Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP). 170--173."},{"key":"e_1_2_1_50_1","unstructured":"Young S. Ollason D. Valtchev V. and Woodland P. 2002. The HTK Book (for version 3.2 of HTK). University of Cambridge Cambridge UK."}],"container-title":["ACM Transactions on Speech and Language Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1322391.1322394","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1322391.1322394","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1322391.1322394","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,11,18]],"date-time":"2025-11-18T09:22:19Z","timestamp":1763457739000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1322391.1322394"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,12]]},"references-count":50,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2007,12]]}},"alternative-id":["10.1145\/1322391.1322394"],"URL":"https:\/\/doi.org\/10.1145\/1322391.1322394","relation":{},"ISSN":["1550-4875","1550-4883"],"issn-type":[{"type":"print","value":"1550-4875"},{"type":"electronic","value":"1550-4883"}],"subject":[],"published":{"date-parts":[[2007,12]]},"assertion":[{"value":"2006-10-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2007-06-01","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2007-12-12","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}