{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,9]],"date-time":"2026-01-09T20:28:07Z","timestamp":1767990487605,"version":"3.49.0"},"publisher-location":"Berlin, Heidelberg","reference-count":26,"publisher":"Springer Berlin Heidelberg","isbn-type":[{"value":"9783540012740","type":"print"},{"value":"9783540366188","type":"electronic"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2003]]},"DOI":"10.1007\/3-540-36618-0_24","type":"book-chapter","created":{"date-parts":[[2007,7,16]],"date-time":"2007-07-16T11:49:02Z","timestamp":1184586542000},"page":"335-350","source":"Crossref","is-referenced-by-count":79,"title":["Combining Naive Bayes and n-Gram Language Models for Text Classification"],"prefix":"10.1007","author":[{"given":"Fuchun","family":"Peng","sequence":"first","affiliation":[]},{"given":"Dale","family":"Schuurmans","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2003,4,15]]},"reference":[{"key":"24_CR1","unstructured":"T. Bell, J. Cleary and I. Witten. (1990). Text Compression. Prentice Hall."},{"key":"24_CR2","unstructured":"S. Chen and J. Goodman. (1998). An Empirical Study of Smoothing Techniques for Language Modeling. Technical report, TR-10-98, Harvard University."},{"key":"24_CR3","unstructured":"W. Cavnar, J. Trenkle. (1994). N-Gram-Based Text Categorization. In Proceedings of SDAIR-94."},{"key":"24_CR4","doi-asserted-by":"publisher","first-page":"103","DOI":"10.1023\/A:1007413511361","volume":"29","author":"P. Domingos","year":"1997","unstructured":"P. Domingos and M. Pazzani. (1997). Beyond Independence: Conditions for the Optimality of the Simple Bayesian Classifier. Machine Learning, 29, 103\u2013130","journal-title":"Machine Learning"},{"key":"24_CR5","volume-title":"Pattern Classification and Scene Analysis","author":"R. Duda","year":"1973","unstructured":"R. 
Duda and P. Hart. (1973). Pattern Classification and Scene Analysis. Wiley, NY."},{"key":"24_CR6","unstructured":"S. Eyheramendy, D. Lewis and D. Madigan. (2003). On the Naive Bayes Model for Text Categorization. To appear in Artificial Intelligence & Statistics 2003."},{"key":"24_CR7","doi-asserted-by":"publisher","first-page":"131","DOI":"10.1023\/A:1007465528199","volume":"29","author":"N. Friedman","year":"1997","unstructured":"N. Friedman, D. Geiger, and M. Goldszmidt. (1997). Bayesian Network Classifiers. In Machine Learning 29:131\u2013163.","journal-title":"Machine Learning"},{"key":"24_CR8","unstructured":"J. He, A. Tan, and C. Tan. (2000). A Comparative Study on Chinese Text Categorization Methods. In Proceedings of PRICAI\u20192000 International Workshop on Text and Web Mining, p24\u201335."},{"key":"24_CR9","unstructured":"D. Hiemstra. (2001). Using Language Models for Information Retrieval. Ph.D. Thesis, Centre for Telematics and Information Technology, University of Twente."},{"key":"24_CR10","unstructured":"E. Keogh and M. Pazzani. (1999). Learning Augmented Bayesian Classifiers: A Comparison of Distribution-based and Classification-based Approaches. In Artificial Intelligence & Statistics 1999"},{"issue":"8","key":"24_CR11","doi-asserted-by":"publisher","first-page":"709","DOI":"10.1002\/(SICI)1097-4571(1999)50:8<709::AID-ASI8>3.0.CO;2-V","volume":"50","author":"K. Kwok","year":"1999","unstructured":"K. Kwok. (1999). Employing Multiple Representations for Chinese Information Retrieval, JASIS, 50(8), 709\u2013723.","journal-title":"JASIS"},{"key":"24_CR12","doi-asserted-by":"crossref","unstructured":"D. Lewis. (1998). Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval. In Proceedings ECML-98.","DOI":"10.1007\/BFb0026666"},{"key":"24_CR13","volume-title":"Foundations of Statistical Natural Language Processing","author":"C. Manning","year":"1999","unstructured":"C. Manning, and H. Sch\u00fctze. (1999). 
Foundations of Statistical Natural Language Processing, MIT Press, Cambridge, Massachusetts."},{"key":"24_CR14","unstructured":"A. McCallum and K. Nigam. (1998). A Comparison of Event Models for Naive Bayes Text Classification. In Proceedings of AAAI-98 Workshop on \u201cLearning for Text Categorization\u201d, AAAI Press."},{"issue":"1","key":"24_CR15","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1006\/csla.1994.1001","volume":"8","author":"H. Ney","year":"1994","unstructured":"H. Ney, U. Essen, and R. Kneser. (1994). On Structuring Probabilistic Dependencies in Stochastic Language Modeling. In Comput. Speech and Lang., 8(1), 1\u201328.","journal-title":"Comput. Speech and Lang."},{"key":"24_CR16","doi-asserted-by":"publisher","first-page":"313","DOI":"10.1023\/A:1007369909943","volume":"27","author":"M. Pazzani","year":"1997","unstructured":"M. Pazzani and D. Billsus. (1997). Learning and Revising User Profiles: The identification of interesting web sites. Machine Learning, 27, 313\u2013331.","journal-title":"Machine Learning"},{"key":"24_CR17","doi-asserted-by":"crossref","unstructured":"J. Ponte, W. Croft. (1998). A Language Modeling Approach to Information Retrieval. In Proceedings of SIGIR1998, 275\u2013281.","DOI":"10.1145\/290941.291008"},{"key":"24_CR18","unstructured":"J. Rennie. (2001). Improving Multi-class Text Classification with Naive Bayes. Master\u2019s Thesis. M. I. T. AI Technical Report AITR-2001-004. 2001."},{"key":"24_CR19","unstructured":"I. Rish. (2001). An Empirical Study of the Naive Bayes Classifier. In Proceedings of IJCAI-01 Workshop on Empirical Methods in Artificial Intelligence."},{"key":"24_CR20","doi-asserted-by":"publisher","first-page":"129","DOI":"10.1002\/asi.4630270302","volume":"27","author":"S. Robertson","year":"1976","unstructured":"S. Robertson and K. Sparck Jones. (1976). Relevance Weighting of Search Terms. JASIS, 27, 129\u2013146.","journal-title":"JASIS"},{"key":"24_CR21","unstructured":"S. Scott and S. 
Matwin. (1999). Feature Engineering for Text Classification. In Proceedings of ICML\u201999, pp. 379\u2013388."},{"issue":"1","key":"24_CR22","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/505282.505283","volume":"34","author":"F. Sebastiani","year":"2002","unstructured":"F. Sebastiani. (2002). Machine Learning in Automated Text Categorization. ACM Computing Surveys, 34(1):1\u201347, 2002.","journal-title":"ACM Computing Surveys"},{"issue":"4","key":"24_CR23","doi-asserted-by":"publisher","first-page":"471","DOI":"10.1162\/089120100750105920","volume":"26","author":"E. Stamatatos","year":"2000","unstructured":"E. Stamatatos, N. Fakotakis and G. Kokkinakis. (2000). Automatic Text Categorization in Terms of Genre and Author. Comput. Ling., 26(4), pp. 471\u2013495.","journal-title":"Comput. Ling."},{"key":"24_CR24","unstructured":"W. Teahan and D. Harper. (2001). Using Compression-Based Language Models for Text Categorization. In Proceedings of Workshop on LMIR."},{"key":"24_CR25","doi-asserted-by":"crossref","unstructured":"A. Turpin and A. Moffat. (1999). Statistical Phrases for Vector-Space Information Retrieval. Proceedings of SIGIR 1999, pp. 309\u2013310.","DOI":"10.1145\/312624.312741"},{"issue":"1\/2","key":"24_CR26","doi-asserted-by":"publisher","first-page":"67","DOI":"10.1023\/A:1009982220290","volume":"1","author":"Y. Yang","year":"1999","unstructured":"Y. Yang. (1999). An Evaluation of Statistical Approaches to Text Categorization. Information Retrieval, Vol. 1, No. 1\/2, pp. 
67\u201388.","journal-title":"Information Retrieval"}],"container-title":["Lecture Notes in Computer Science","Advances in Information Retrieval"],"original-title":[],"link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/3-540-36618-0_24","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,2,17]],"date-time":"2019-02-17T13:38:29Z","timestamp":1550410709000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/3-540-36618-0_24"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2003]]},"ISBN":["9783540012740","9783540366188"],"references-count":26,"URL":"https:\/\/doi.org\/10.1007\/3-540-36618-0_24","relation":{},"ISSN":["0302-9743"],"issn-type":[{"value":"0302-9743","type":"print"}],"subject":[],"published":{"date-parts":[[2003]]}}}