{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,18]],"date-time":"2025-11-18T09:33:26Z","timestamp":1763458406166,"version":"3.45.0"},"reference-count":58,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2007,12,1]],"date-time":"2007-12-01T00:00:00Z","timestamp":1196467200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000185","name":"Defense Advanced Research Projects Agency","doi-asserted-by":"publisher","award":["MDA972-02-C-0038"],"award-info":[{"award-number":["MDA972-02-C-0038"]}],"id":[{"id":"10.13039\/100000185","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004965","name":"Sixth Framework Programme","doi-asserted-by":"publisher","award":["FP6-506811"],"award-info":[{"award-number":["FP6-506811"]}],"id":[{"id":"10.13039\/501100004965","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Speech Lang. Process."],"published-print":{"date-parts":[[2007,12]]},"abstract":"<jats:p>This article describes a methodology for collecting text from the Web to match a target sublanguage both in style (register) and topic. Unlike other work that estimates n-gram statistics from page counts, the approach here is to select and filter documents, which provides more control over the type of material contributing to the n-gram counts. The data can be used in a variety of ways; here, the different sources are combined in two types of mixture models. Focusing on conversational speech where data collection can be quite costly, experiments demonstrate the positive impact of Web collections on several tasks with varying amounts of data, including Mandarin and English telephone conversations and English meetings and lectures.<\/jats:p>","DOI":"10.1145\/1322391.1322392","type":"journal-article","created":{"date-parts":[[2008,1,3]],"date-time":"2008-01-03T13:20:10Z","timestamp":1199366410000},"page":"1-25","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":25,"title":["Web resources for language modeling in conversational speech recognition"],"prefix":"10.1145","volume":"5","author":[{"given":"Ivan","family":"Bulyko","sequence":"first","affiliation":[{"name":"BBN Technologies, Cambridge, MA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mari","family":"Ostendorf","sequence":"additional","affiliation":[{"name":"University of Washington"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Manhung","family":"Siu","sequence":"additional","affiliation":[{"name":"Hong Kong University of Science and Technology"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tim","family":"Ng","sequence":"additional","affiliation":[{"name":"Hong Kong University of Science and Technology"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Andreas","family":"Stolcke","sequence":"additional","affiliation":[{"name":"SRI International and the International Computer Science Institute"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"\u00d6zg\u00fcr","family":"\u00c7etin","sequence":"additional","affiliation":[{"name":"International Computer Science Institute"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2007,12,12]]},"reference":[{"volume-title":"Proceedings of Interspeech. 1873--1876","author":"Akbacak M.","key":"e_1_2_1_1_1","unstructured":"Akbacak, M., Gao, Y., Gu, L., and Kuo, H.-K. 2005. Rapid transition to new spoken dialog domains: Language model training using knowledge from previous domain applications and web text resources. In Proceedings of Interspeech. 1873--1876."},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.3115\/1072133.1072204"},{"key":"e_1_2_1_3_1","volume-title":"Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP).","author":"Bellegarda J.","year":"1998","unstructured":"Bellegarda, J. 1998. Exploiting both local and global constraints for multispan statistical language modeling. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP). vol. II, 677--680."},{"volume-title":"Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP).","author":"Berger A.","key":"e_1_2_1_4_1","unstructured":"Berger, A. and Miller, R. 1998. Just-in-time language modeling. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP). vol. II, 705--708."},{"volume-title":"Proceedings of the Eurospeech. 1755--1758","author":"Bessling S.","key":"e_1_2_1_5_1","unstructured":"Bessling, S. and Meier, H. 1995. Language model speaker adaptation. In Proceedings of the Eurospeech. 1755--1758."},{"volume-title":"Variation Across Speech and Writing","author":"Biber D.","key":"e_1_2_1_6_1","unstructured":"Biber, D. 1988. Variation Across Speech and Writing. Cambridge University Press."},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.5555\/972470.972472"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.5555\/1168644"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.3115\/1073483.1073486"},{"key":"e_1_2_1_10_1","unstructured":"\u00c7etin O. and Stolcke A. 2005. Language modeling in the ICSI-SRI Spring 2005 Meeting speech recognition evaluation system. Tech. rep. tr-05-06 International Computer Science Institute."},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1006\/csla.1999.0128"},{"volume-title":"Proceedings of Eurospeech. 1597--1600","author":"Cieri C.","key":"e_1_2_1_12_1","unstructured":"Cieri, C., Miller, D., and Walker, K. 2003. From Switchboard to Fisher: Telephone collection protocols, their uses and yields. In Proceedings of Eurospeech. 1597--1600."},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.5555\/839274.839354"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1111\/j.2517-6161.1977.tb01600.x"},{"volume-title":"Proceedings of the Association for Computational Linguistics (ACL).","author":"Duh K.","key":"e_1_2_1_15_1","unstructured":"Duh, K. and Kirchhoff, K. 2005. Pos tagging of dialectal Arabic: A minimally supervised approach. In Proceedings of the Association for Computational Linguistics (ACL)."},{"key":"e_1_2_1_16_1","volume-title":"Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP).","volume":"1","author":"Evermann G.","unstructured":"Evermann, G., Chan, H., Gales, M., Hain, T., Liu, X., Mrva, D., Wang, L., and Woodland, P. 2004a. Development of the 2003 CU-HTK conversational telephone speech transcription system. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP). vol. 1, 249--252."},{"volume-title":"Proceedings of the NIST RT-04F Rich Transcription Workshop.","author":"Evermann G.","key":"e_1_2_1_17_1","unstructured":"Evermann, G., Chan, H., Gales, M., Jia, B., Liu, X., Mrva, D., Sim, K., Wang, L., Woodland, P., and Yu, K. 2004b. Development of the 2004 CU-HTK English CTS system using more than 2000 hours of data. In Proceedings of the NIST RT-04F Rich Transcription Workshop."},{"volume-title":"Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP).","author":"Gao Y.","key":"e_1_2_1_18_1","unstructured":"Gao, Y., Gu, L., and Kuo, H.-K. 2005. Portability challenges in developing interactive dialogue systems. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP). vol. V, 1017--1020."},{"key":"e_1_2_1_19_1","volume-title":"Proceedings of the Conference on Empirical Methods in Natural Language Processing. L. Lee and D. Harman, Eds. 167--202","author":"Gildea D.","year":"2001","unstructured":"Gildea, D. 2001. Corpus variation and parser performance. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. L. Lee and D. Harman, Eds. 167--202."},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.1992.225858"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1006\/csla.2001.0174"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1007\/11677482_30"},{"volume-title":"Proceedings of the NIST RT-04F Rich Transcription Workshop.","author":"Hwang M.","key":"e_1_2_1_23_1","unstructured":"Hwang, M., Lei, X., Ng, T., Ostendorf, M., Stolcke, A., Wang, W., Zheng, J., and Gadde, V. 2004. Porting Decipher from English to Mandarin. In Proceedings of the NIST RT-04F Rich Transcription Workshop."},{"key":"e_1_2_1_24_1","volume-title":"et al","author":"Hwang M.-Y.","year":"1996","unstructured":"Hwang, M.-Y. et al. 1996. Predicting unseen triphones with senones. IEEE Trans. Speech Audio Process. 4. 412--419."},{"volume-title":"Proceedings of the International Conference on Spoken Language Processing (ICSLP). 236--239","author":"Iyer R.","key":"e_1_2_1_25_1","unstructured":"Iyer, R. and Ostendorf, M. 1996. Modeling long range dependencies in languages. In Proceedings of the International Conference on Spoken Language Processing (ICSLP). 236--239."},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.21437\/Eurospeech.1997-524"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1006\/csla.1999.0124"},{"volume-title":"IEEE Workshop on Speech Recognition and Understanding Proceedings. 254--261","author":"Iyer R.","key":"e_1_2_1_28_1","unstructured":"Iyer, R., Ostendorf, M., and Meteer, M. 1997. Analyzing and predicting language model improvements. In IEEE Workshop on Speech Recognition and Understanding Proceedings. 254--261."},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1162\/089120103322711604"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1162\/089120103322711569"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2000.862077"},{"volume-title":"Proceedings of Interspeech. 1657--1660","author":"Lamel L.","key":"e_1_2_1_32_1","unstructured":"Lamel, L., Adda, G., Bilinski, E., and Gauvain, J. L. 2005. Transcribing lectures and seminars. In Proceedings of Interspeech. 1657--1660."},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/1075389.1075392"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/564376.564403"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.1996.540314"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.1999.758182"},{"key":"e_1_2_1_37_1","volume-title":"Proceedings of Eurospeech.","volume":"3","author":"Martin S.","unstructured":"Martin, S., Liermann, J., and Ney, H. 1997. Adaptive topic-dependent language modeling using word-based varigrams. In Proceedings of Eurospeech. vol. 3. 3, 1447--1450."},{"key":"e_1_2_1_38_1","volume-title":"Meetings about meetings: Research at ICSI on speech in multiparty conversations. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP).","volume":"4","author":"Morgan N.","unstructured":"Morgan, N., Baron, D., Bhagat, S., Carvey, H., Dhillon, R., Edwards, J., Gelbart, D., Janin, A., Krupski, A., Peskin, B., Pfau, T., Shriberg, E., Stolcke, A., and Wooters, C. 2003. Meetings about meetings: Research at ICSI on speech in multiparty conversations. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP). Vol. 4, 740--743."},{"volume-title":"Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP). 89--593","author":"Ng T.","key":"e_1_2_1_39_1","unstructured":"Ng, T., Ostendorf, M., Hwang, M.-Y., Siu, M., Bulyko, I., and Lei, X. 2005. Web-data augmented language models for Mandarin conversational speech recognition. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP). 89--593."},{"key":"e_1_2_1_40_1","volume-title":"Proceedings of Empirical Methods in Natural Language Processing Conference. 133--141","author":"Ratnaparkhi A.","year":"1996","unstructured":"Ratnaparkhi, A. 1996. A maximum entropy part-of-speech tagger. In Proceedings of Empirical Methods in Natural Language Processing Conference. 133--141."},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.21437\/Eurospeech.1997-526"},{"key":"e_1_2_1_42_1","volume-title":"Proceedings of ARPA Spoken Language Technology Workshop. 66--69","author":"Rudnicky A.","year":"1995","unstructured":"Rudnicky, A. 1995. Language modeling with limited domain data. In Proceedings of ARPA Spoken Language Technology Workshop. 66--69."},{"volume-title":"Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP).","author":"Sarikaya R.","key":"e_1_2_1_43_1","unstructured":"Sarikaya, R., Gravano, A., and Gao, Y. 2005. Rapid language model development using external resources for new spoken dialog domains. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP). Vol. I, 573--576."},{"key":"e_1_2_1_44_1","volume-title":"Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP).","volume":"2","author":"Scheytt P.","unstructured":"Scheytt, P., Geutner, P., and Waibel, A. 1998. Serbo-Croatian LVCSR on the dictation and broadcast news domain. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP). vol. 2, 897--900."},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSA.2004.825666"},{"volume-title":"Proceedings of Interspeech. 1293--1296","author":"Sethy A.","key":"e_1_2_1_46_1","unstructured":"Sethy, A., Georgiou, P., and Narayanan, S. 2005. Building topic-specific language models from webdata using competitive models. In Proceedings of Interspeech. 1293--1296."},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1006\/csla.2001.0169"},{"key":"e_1_2_1_48_1","volume-title":"Proceedings of DARPA Broadcast News Transcription and Understanding Workshop. 270--274","author":"Stolcke A.","year":"1998","unstructured":"Stolcke, A. 1998. Entropy-based pruning of backoff language models. In Proceedings of DARPA Broadcast News Transcription and Understanding Workshop. 270--274."},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.21437\/ICSLP.2002-303"},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1007\/11677482_39"},{"volume-title":"NIST RT-03 Workshop.","author":"Stolcke A.","key":"e_1_2_1_51_1","unstructured":"Stolcke, A. et al. 2003. Speech-to-text research at SRI-ICSI-UW. NIST RT-03 Workshop."},{"volume-title":"Proceedings of Eurospeech. 245--248","author":"Venkataraman A.","key":"e_1_2_1_52_1","unstructured":"Venkataraman, A. and Wang, W. 2003. Techniques for effective vocabulary selection. In Proceedings of Eurospeech. 245--248."},{"volume-title":"Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP).","author":"Wang W.","key":"e_1_2_1_53_1","unstructured":"Wang, W., Stolcke, A., and Harper, M. 2004. The use of a linguistically motivated language model in conversational speech recognition. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP). vol. I, 261--264."},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.21437\/Eurospeech.1993-495"},{"volume-title":"Proceedings of Interspeech. 741--744","author":"Xu P.","key":"e_1_2_1_55_1","unstructured":"Xu, P. and Mangu, L. 2005. Using random forest language models in the IBM RT-04 CTS system. In Proceedings of Interspeech. 741--744."},{"key":"e_1_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.5555\/645526.657137"},{"volume-title":"Proceedings of Interspeech. 2141--2144","author":"Zhu Q.","key":"e_1_2_1_57_1","unstructured":"Zhu, Q., Stolcke, A., Chen, B., and Morgan, N. 2005. Using mlp features in SRI's conversational speech recognition system. In Proceedings of Interspeech. 2141--2144."},{"key":"e_1_2_1_58_1","first-page":"533","article-title":"Improving trigram language modeling with the World Wide Web","author":"Zhu X.","year":"2001","unstructured":"Zhu, X. and Rosenfeld, R. 2001. Improving trigram language modeling with the World Wide Web. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP). I:533--536.","journal-title":"Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP)."}],"container-title":["ACM Transactions on Speech and Language Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1322391.1322392","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1322391.1322392","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1322391.1322392","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,11,18]],"date-time":"2025-11-18T09:22:25Z","timestamp":1763457745000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1322391.1322392"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,12]]},"references-count":58,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2007,12]]}},"alternative-id":["10.1145\/1322391.1322392"],"URL":"https:\/\/doi.org\/10.1145\/1322391.1322392","relation":{},"ISSN":["1550-4875","1550-4883"],"issn-type":[{"type":"print","value":"1550-4875"},{"type":"electronic","value":"1550-4883"}],"subject":[],"published":{"date-parts":[[2007,12]]},"assertion":[{"value":"2005-11-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2007-08-01","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2007-12-12","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}