{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,20]],"date-time":"2026-03-20T20:03:34Z","timestamp":1774037014232,"version":"3.50.1"},"publisher-location":"Berlin, Heidelberg","reference-count":33,"publisher":"Springer Berlin Heidelberg","isbn-type":[{"value":"9783642311369","type":"print"},{"value":"9783642311376","type":"electronic"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2012]]},"DOI":"10.1007\/978-3-642-31137-6_47","type":"book-chapter","created":{"date-parts":[[2012,6,18]],"date-time":"2012-06-18T09:17:55Z","timestamp":1340011075000},"page":"618-630","source":"Crossref","is-referenced-by-count":1,"title":["Evaluation of Normalization Techniques in Text Classification for Portuguese"],"prefix":"10.1007","author":[{"given":"Merley","family":"da Silva Conrado","sequence":"first","affiliation":[]},{"given":"V\u00edctor Antonio","family":"Laguna Guti\u00e9rrez","sequence":"additional","affiliation":[]},{"given":"Solange Oliveira","family":"Rezende","sequence":"additional","affiliation":[]}],"member":"297","reference":[{"key":"47_CR1","series-title":"Lecture Notes in Artificial Intelligence","doi-asserted-by":"publisher","first-page":"693","DOI":"10.1007\/11595014_67","volume-title":"Progress in Artificial Intelligence","author":"R.V. Alvares","year":"2005","unstructured":"Alvares, R.V., Garcia, A.C.B., Ferraz, I.: STEMBR: A Stemming Algorithm for the Brazilian Portuguese Language. In: Bento, C., Cardoso, A., Dias, G. (eds.) EPIA 2005. LNCS (LNAI), vol.\u00a03808, pp. 693\u2013701. Springer, Heidelberg (2005)"},{"key":"47_CR2","first-page":"201","volume-title":"Linguistically-motivated Information Retrieval","author":"A. Arampatzis","year":"2000","unstructured":"Arampatzis, A., van der Weide, T., Koster, C., van Bommel, P.: Linguistically-motivated Information Retrieval, pp. 201\u2013222. Marcel Dekker, NY (2000)"},{"key":"47_CR3","unstructured":"Aranha, C.N.: Uma Abordagem de Pr\u00e9-Processamento Autom\u00e1tico para Minera\u00e7\u00e3o de Textos em Portugu\u00eas: sob o Enfoque da Intelig\u00eancia Computacional. PhD thesis, Departamento de Engenharia El\u00e9trica - PUC - Rio de Janeiro (2007)"},{"key":"47_CR4","unstructured":"Bekkerman, R., Allan, J.: Using bigrams in text categorization. Technical Report IR-408, Center of Intelligent Information Retrieval, UMass Amherst (2004)"},{"key":"47_CR5","unstructured":"Brill, E.: Transformation-based error-driven learning of natural language: A case study in part of speech tagging. Computational Linguistics, 543\u2013565 (1995)"},{"key":"47_CR6","unstructured":"Conrado, M.S.: O efeito do uso de diferentes formas de gera\u00e7\u00e3o de termos na compreensibilidade e representatividade dos termos em cole\u00e7\u00f5es textuais na L\u00edngua Portuguesa. Master\u2019s thesis, Instituto de Ci\u00eancias Matem\u00e1ticas e de Computa\u00e7\u00e3o - USP, S\u00e3o Carlos, SP (2009)"},{"key":"47_CR7","unstructured":"Conrado, M.S., Marcacini, R.M., Moura, M.F., Rezende, S.O.: O efeito do uso de diferentes formas de gera\u00e7\u00e3o de termos na compreensibilidade e representatividade dos termos em cole\u00e7\u00f5es textuais na L\u00edngua Portuguesa. In: Proceedings of II Web and Text Intelligence - 7th Brazilian Symposium in Information and Human Language Technology, S\u00e3o Carlos, SP (2009)"},{"key":"47_CR8","unstructured":"das Nunes, M.G.V.: The design of a lexicon for brazilian portuguese: Lessons learned and perspectives. In: Proceedings of the II Workshop on Computational Processing of Written and Spoken Portuguese, Curitiba, pp. 61\u201370 (1996)"},{"issue":"1","key":"47_CR9","first-page":"1","volume":"7","author":"J. Dem\u0161ar","year":"2006","unstructured":"Dem\u0161ar, J.: Statistical comparison of classifiers over multiple data sets. Journal of Machine Learning Research\u00a07(1), 1\u201330 (2006)","journal-title":"Journal of Machine Learning Research"},{"key":"47_CR10","unstructured":"Ebecken, N.F.F., Lopes, M.C.S., de Arag\u00e3o, M.C.: Minera\u00e7\u00e3o de Textos. In: Rezende, S.O. (ed.) Sistemas Inteligentes: Fundamentos e Aplica\u00e7\u00f5es, 1st edn., Manole, ch. 13, pp. 337\u2013364 (2003)"},{"key":"47_CR11","unstructured":"Gonzalez, M.\u00a0A.\u00a0I.: Termos e Relacionamentos em Evid\u00eancia na Recupera\u00e7\u00e3o de Informa\u00e7\u00e3o. PhD thesis, Instituto de Inform\u00e1tica - UFRGS, Porto Alegre (2005)"},{"key":"47_CR12","series-title":"Lecture Notes in Artificial Intelligence","doi-asserted-by":"publisher","first-page":"100","DOI":"10.1007\/11751984_11","volume-title":"Computational Processing of the Portuguese Language","author":"M.A.I. Gonzalez","year":"2006","unstructured":"Gonzalez, M.A.I., de Lima, V.L.S., de Lima, J.V.: Tools for Nominalization: An Alternative for Lexical Normalization. In: Vieira, R., Quaresma, P., Nunes, M.d.G.V., Mamede, N.J., Oliveira, C., Dias, M.C. (eds.) PROPOR 2006. LNCS (LNAI), vol.\u00a03960, pp. 100\u2013109. Springer, Heidelberg (2006)"},{"key":"47_CR13","unstructured":"Braga, \u00cd.A., Monard, M.C., Matsubara, E.T.: Combining unigrams and bigrams in semi-supervised text classification. In: 14th Portuguese Conference on Artificial Intelligence - New Trends in Artificial Intelligence, Aveiro, Portugal, pp. 489\u2013500 (2009)"},{"key":"47_CR14","doi-asserted-by":"crossref","unstructured":"Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA data mining software: An update. In: Explorations of Special Interest Group on Knowledge Discovery and Data Mining, vol.\u00a011, pp. 10\u201318 (2009)","DOI":"10.1145\/1656274.1656278"},{"issue":"3","key":"47_CR15","doi-asserted-by":"publisher","first-page":"637","DOI":"10.1162\/089976601300014493","volume":"13","author":"S.S. Keerthi","year":"2001","unstructured":"Keerthi, S.S., Shevade, S.K., Bhattacharyya, C., Murthy, K.R.K.: Improvements to platt\u2019s smo algorithm for svm classifier design. Neural Comput.\u00a013(3), 637\u2013649 (2001)","journal-title":"Neural Comput."},{"key":"47_CR16","doi-asserted-by":"crossref","unstructured":"Manning, C.D., Raghavan, P., Sch\u00fctze, H.: Introduction to Information Retrieval. Cambridge University Press (2008)","DOI":"10.1017\/CBO9780511809071"},{"key":"47_CR17","doi-asserted-by":"crossref","unstructured":"Manning, C.D., Raghavan, P., Sch\u00fctze, H.: Language models for information retrieval. In: An Introduction to Information Retrieval, ch. 12. Cambridge University Press (2008)","DOI":"10.1017\/CBO9780511809071"},{"key":"47_CR18","doi-asserted-by":"crossref","unstructured":"Maziero, E.G., del Rosario Castro Jorge, M.L., Pardo, T.A.S.: Identifying multidocument relations. In: Proceedings of 7th International Workshop on Natural Language Processing and Cognitive Science, Funchal\/Madeira, Portugal, vol.\u00a01, pp. 60\u201369 (2010)","DOI":"10.5220\/0003028800600069"},{"key":"47_CR19","unstructured":"Mccallum, A., Nigam, K.: A comparison of event models for naive bayes text classification. In: AAAI Magazine - Workshop on \u2019Learning for Text Categorization, pp. 1\u20138 (1998)"},{"key":"47_CR20","unstructured":"Miner, G., Elder, J., Hill, T., Nisbet, R., Delen, D.: Practical Text Mining and Statistical Analysis for Non-structured Text Data Applications. Elsevier Science (2012)"},{"issue":"3","key":"47_CR21","doi-asserted-by":"publisher","first-page":"908","DOI":"10.1016\/j.asoc.2006.04.002","volume":"7","author":"V. Mitra","year":"2007","unstructured":"Mitra, V., Wang, C.-J., Banerjee, S.: Text classification: A least square support vector machine approach. Appl. Soft Comput.\u00a07(3), 908\u2013914 (2007)","journal-title":"Appl. Soft Comput."},{"key":"47_CR22","doi-asserted-by":"publisher","first-page":"3699","DOI":"10.4028\/www.scientific.net\/AMR.403-408.3699","volume":"403-408","author":"V. Nuipian","year":"2011","unstructured":"Nuipian, V., Meesad, P., Boonrawd, P.: Improve abstract data with feature selection for classification techniques. Advanced Materials Research\u00a0403-408, 3699\u20133703 (2011)","journal-title":"Advanced Materials Research"},{"key":"47_CR23","doi-asserted-by":"crossref","unstructured":"Orengo, V.M., Huyck, C.: A stemming algorithm for portuguese language. In: Proceedings of Eigth Symposium on String Processing and Information Retrieval, Chile, pp. 186\u2013193 (2001)","DOI":"10.1109\/SPIRE.2001.989755"},{"key":"47_CR24","volume-title":"C4.5: programs for machine learning","author":"J.R. Quinlan","year":"1993","unstructured":"Quinlan, J.R.: C4.5: programs for machine learning. Morgan Kaufmann Publishers Inc., San Francisco (1993)"},{"key":"47_CR25","unstructured":"Ratnaparkhi, A.: A maximum entropy model for part-of-speech tagging. In: Proceedings of the Empirical Methods in Natural Language Processing Conference, pp. 491\u2013497. University of Pennsylvania (1996)"},{"key":"47_CR26","unstructured":"Read, J., Webster, J., Fang, A.C.: In: Proceedings of the 24th Pacific Asia Conference on Language, Information and Computation, Sendai, Japan"},{"key":"47_CR27","series-title":"Lecture Notes in Artificial Intelligence","doi-asserted-by":"publisher","first-page":"543","DOI":"10.1007\/978-3-540-85110-3_44","volume-title":"Intelligent Computer Mathematics","author":"R. \u0158eh\u016f\u0159ek","year":"2008","unstructured":"\u0158eh\u016f\u0159ek, R., Sojka, P.: Automated Classification and Categorization of\u00a0Mathematical\u00a0Knowledge. In: Autexier, S., Campbell, J., Rubio, J., Sorge, V., Suzuki, M., Wiedijk, F. (eds.) AISC 2008, Calculemus 2008, and MKM 2008. LNCS (LNAI), vol.\u00a05144, pp. 543\u2013557. Springer, Heidelberg (2008)"},{"key":"47_CR28","unstructured":"Schmid, H.: Probabilistic part-of-speech tagging using decision trees. In: Proceedings of International Conference on New Methods in Language Processing, pp. 44\u201349 (1994)"},{"issue":"1","key":"47_CR29","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/505282.505283","volume":"34","author":"F. Sebastiani","year":"2002","unstructured":"Sebastiani, F.: Machine learning in automated text categorization. ACM Computing Surveys\u00a034(1), 1\u201347 (2002)","journal-title":"ACM Computing Surveys"},{"key":"47_CR30","unstructured":"Silic, A., Chauchat, J.-H., Basic, B.-D., Morin, A.: N-grams and morphological normalization in text classification: A comparison on a croatian-english parallel corpus. In: Neves, J., Santos, M.-F., Machado, J. (eds.) 13th Portuguese Conference on Artificial Intelligence, Guimaraes, Portugal"},{"key":"47_CR31","volume-title":"Statistical Methods","author":"G.W. Snedecor","year":"1967","unstructured":"Snedecor, G.W., Cochran, W.G.: Statistical Methods, 6th edn. Iowa State University Press, Ames (1967)","edition":"6"},{"key":"47_CR32","unstructured":"Soares, M.V., Prati, R.C., Monard, M.C.: PreTexT II: Descri\u00e7\u00e3o da reestrutura\u00e7\u00e3o da ferramenta de pr\u00e9-processamento de textos. Technical Report 333, Instituto de Ci\u00eancias Matem\u00e1ticas e de Computa\u00e7\u00e3o - USP, S\u00e3o Carlos, SP (2008)"},{"key":"47_CR33","doi-asserted-by":"publisher","first-page":"1016","DOI":"10.1145\/1390156.1390284","volume-title":"Proceedings of the 25th International Conference on Machine Learning","author":"J. Su","year":"2008","unstructured":"Su, J., Zhang, H., Ling, C.X., Matwin, S.: Discriminative parameter learning for bayesian networks. In: Proceedings of the 25th International Conference on Machine Learning, pp. 1016\u20131023. ACM, New York (2008)"}],"container-title":["Lecture Notes in Computer Science","Computational Science and Its Applications \u2013 ICCSA 2012"],"original-title":[],"link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/978-3-642-31137-6_47","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,4,1]],"date-time":"2025-04-01T16:15:56Z","timestamp":1743524156000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/978-3-642-31137-6_47"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012]]},"ISBN":["9783642311369","9783642311376"],"references-count":33,"URL":"https:\/\/doi.org\/10.1007\/978-3-642-31137-6_47","relation":{},"ISSN":["0302-9743","1611-3349"],"issn-type":[{"value":"0302-9743","type":"print"},{"value":"1611-3349","type":"electronic"}],"subject":[],"published":{"date-parts":[[2012]]}}}