{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,12]],"date-time":"2025-10-12T04:34:16Z","timestamp":1760243656060,"version":"build-2065373602"},"reference-count":38,"publisher":"MDPI AG","issue":"1","license":[{"start":{"date-parts":[[2013,11,28]],"date-time":"2013-11-28T00:00:00Z","timestamp":1385596800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/3.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Informatics"],"abstract":"<jats:p>Numerous initiatives have allowed users to share knowledge or opinions using collaborative platforms. In most cases, the users provide a textual description of their knowledge, following very limited or no constraints. Here, we tackle the classification of documents written in such an environment. As a use case, our study is made in the context of text mining evaluation campaign material, related to the classification of cooking recipes tagged by users from a collaborative website. This context makes some of the corpus specificities difficult to model for machine-learning-based systems and keyword or lexical-based systems. In particular, different authors might have different opinions on how to classify a given document. The systems presented hereafter were submitted to the D\u00b4Efi Fouille de Textes 2013 evaluation campaign, where they obtained the best overall results, ranking first on task 1 and second on task 2. In this paper, we explain our approach for building relevant and effective systems dealing with such a corpus.<\/jats:p>","DOI":"10.3390\/informatics1010032","type":"journal-article","created":{"date-parts":[[2013,11,28]],"date-time":"2013-11-28T12:06:47Z","timestamp":1385640407000},"page":"32-51","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":8,"title":["Using Collaborative Tagging for Text Classification: From Text Classification to Opinion Mining"],"prefix":"10.3390","volume":"1","author":[{"given":"Eric","family":"Charton","sequence":"first","affiliation":[{"name":"Ecole Polytechnique de Montr\u00e9al, Montr\u00e9al, QC H3T 1J4, Canada"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Marie-Jean","family":"Meurs","sequence":"additional","affiliation":[{"name":"Centre for Structural and Functional Genomics, Concordia University, Montr\u00e9al,QC H4B 1R6, Canada"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ludovic","family":"Jean-Louis","sequence":"additional","affiliation":[{"name":"Ecole Polytechnique de Montr\u00e9al, Montr\u00e9al, QC H3T 1J4, Canada"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Michel","family":"Gagnon","sequence":"additional","affiliation":[{"name":"Ecole Polytechnique de Montr\u00e9al, Montr\u00e9al, QC H3T 1J4, Canada"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2013,11,28]]},"reference":[{"key":"ref_1","first-page":"1","article-title":"Folksonomies\u2014Cooperative classification and communication through shared metadata","volume":"47","author":"Mathes","year":"2004","journal-title":"Comput. Med. Commun."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"291","DOI":"10.1108\/00242530610667558","article-title":"Collaborative tagging as a knowledge organisation and resource discovery tool","volume":"55","author":"Macgregor","year":"2006","journal-title":"Libr. Rev."},{"key":"ref_3","unstructured":"Grouin, C., Zweigenbaum, P., and Paroubek, P. (2013, January 17\u201321). DEFT 2013 se met \u00e0 table: Pr\u00e9sentation du d\u00e9fi et r\u00e9sultats. Proceedings of the Neuvi\u00e8me D\u00c9fi Fouille de Textes, Les Sables d\u2019Olonne, France."},{"key":"ref_4","unstructured":"Sebastiani, F. (2005). Text Mining and Its Applications to Intelligence, CRM and Knowledge Management, WIT Press."},{"key":"ref_5","unstructured":"Voss, J. Collaborative Thesaurus Tagging the Wikipedia Way. Available online at http:\/\/arxiv.org\/abs\/cs\/0604036."},{"key":"ref_6","unstructured":"Charton, E., and Torres-Moreno, J. (2010, January 17\u201323). NLGbAse: A Free Linguistic Resource for Natural Language Processing Systems. Proceedings of the International Conference on Language Resources and Evaluation (LREC 2010), Valetta, Malta."},{"key":"ref_7","unstructured":"Zhang, Z., Webster, P., Uren, V., Varga, A., and Ciravegna, F. (, January 21\u201327). Automatically Extracting Procedural Knowledge from Instructional Texts using Natural Language Processing. Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC2012), Istanbul, Turkey."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Schumacher, P., Minor, M., Walter, K., and Bergmann, R. (2012, January 16\u201320). Extraction of Procedural Knowledge from the Web. Proceedings of the International World Wide Web Conference 2012 (WWW2012), Lyon, France.","DOI":"10.1145\/2187980.2188194"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Schein, A., and Popescul, A. (2002, January 11\u201315). Methods and Metrics for Cold-Start Recommendations. Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Tampere, Finland.","DOI":"10.1145\/564376.564421"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Dave, K., Lawrence, S., and Pennock, D. (2003, January 20\u201324). Mining the Peanut Gallery: Opinion Extraction and Semantic Classification of Product Reviews. Proceedings of the 12th International World Wide Web Conference (WWW2003), Budapest, Hungary.","DOI":"10.1145\/775152.775226"},{"key":"ref_11","unstructured":"Grouin, C., Berthelin, J.B., Ayari, S.E., Heitz, T., Hurault-Plantet, M., and Jardino, M. (2007, January 3). Pr\u00e9sentation de DEFT 2007. Proceedings of the plate-forme of the Association Fran\u00e7aise pour l\u2019Intelligence Articielle, D\u00c9fi Fouille de Textes, Grenoble, France."},{"key":"ref_12","first-page":"91","article-title":"Opinion mining and sentiment analysis","volume":"1","author":"Pang","year":"2008","journal-title":"Found. Trends Inf. Retr."},{"key":"ref_13","first-page":"297","article-title":"Good News or Bad News? Let the Market Decide","volume":"Volume 20","author":"Koppel","year":"2006","journal-title":"Computing Attitude and Affect in Text: Theory and Application, The Information Retrieval Series"},{"key":"ref_14","unstructured":"Wu, F., and Huberman, B. Social Structure and Opinion Formation. Available online at http:\/\/arxiv.org\/abs\/cond-mat\/0407252."},{"key":"ref_15","unstructured":"Yummly. Available online at http:\/\/www.yummly.com."},{"key":"ref_16","unstructured":"BBC Food. Available online at http:\/\/www.bbc.co.uk\/food\/recipes."},{"key":"ref_17","unstructured":"BBC Good Food. Available online at http:\/\/www.bbcgoodfood.com."},{"key":"ref_18","unstructured":"Allrecipes. Available online at http:\/\/allrecipes.com."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Wang, L., Li, Q., Li, N., Dong, G., and Yang, Y. (2008, January 21\u201325). Substructure Similarity Measurement in Chinese Recipes. Proceedings of the 17th International World Wide Web Conference (WWW2008), Beijing, China.","DOI":"10.1145\/1367497.1367629"},{"key":"ref_20","unstructured":"Wang, L., Li, Q., Li, Y., and Meng, X. (2006, January 1\u20133). Dish Master: An Intelligent and Adaptive Manager for a Web-based Recipe Database System. Proceedings of the Second International Conference on Semantics, Knowledge and Grid, 2006 (SKG \u201906), Guilin, China."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Blat\u00e1k, J., Mr\u00e1kov\u00e1, E., and Popel\u00ednsk\u00fd, L. (2004, January 21\u201326). Fragments and Text Categorization. Proceedings of the ACL 2004 Interactive Poster and Demonstration Sessions (ACLdemo2004), Barcelona, Spain.","DOI":"10.3115\/1219044.1219078"},{"key":"ref_22","unstructured":"Charton, E., Jean-Louis, L., Meurs, M.J., and Gagnon, M. (2013, January 17\u201321). Trois Recettes d\u2019Apprentissage Automatique pour un Syst\u00e8me d\u2019Extraction d\u2019Information et de Classification de Recettes de Cuisines. Proceedings of the 20\u00e8me Conf\u00e9rence sur le Traitement Automatique du Langage Naturel, Neuvi\u00e8me D\u00c9fi Fouille de Textes, Les Sables d\u2019Olonne, France."},{"key":"ref_23","unstructured":"Marmiton. Available online at http:\/\/www.marmiton.org."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Manning, C.D., Raghavan, P., and Sch\u00fctze, H. (2008). Introduction to Information Retrieval, Cambridge University Press.","DOI":"10.1017\/CBO9780511809071"},{"key":"ref_25","unstructured":"Hall, M.A. (1999). Correlation-Based Feature Selection for Machine Learning. [Ph.D. Thesis, The University of Waikato]."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"161","DOI":"10.1007\/s10994-005-0466-3","article-title":"Logistic model trees","volume":"59","author":"Landwehr","year":"2005","journal-title":"Mach. Learn."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"241","DOI":"10.1016\/0004-3702(86)90072-X","article-title":"Fusion, propagation, and structuring in belief networks","volume":"29","author":"Pearl","year":"1986","journal-title":"Artif. Intel."},{"key":"ref_28","unstructured":"Pearl, J. (1998). Bayesian Networks, MIT Press."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Vapnik, V. (1995). The Nature of Statistical Learning Theory, Springer-Verlag.","DOI":"10.1007\/978-1-4757-2440-0"},{"key":"ref_30","unstructured":"Quinlan, J.R. (1993). C4.5: Programs for Machine Learning, Morgan Kaufmann."},{"key":"ref_31","unstructured":"Charton, E., and Acuna-Agost, R. Quel mod\u00e8le pour d\u00e9tecter une opinion? Trois propositions pour g\u00e9n\u00e9raliser l\u2019extraction d\u2019une id\u00e9e dans un corpus. Proceedings of the Plate-Gorme of the Association Fran\u00e7aise pour l\u2019Intelligence Articielle."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1007\/BF00994110","article-title":"A Bayesian method for the induction of probabilistic networks from data","volume":"9","author":"Cooper","year":"1992","journal-title":"Mach. Learn."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"10","DOI":"10.1145\/1656274.1656278","article-title":"The WEKA data mining software: An update","volume":"11","author":"Hall","year":"2009","journal-title":"ACM SIGKDD Explor. Newsl."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"253","DOI":"10.1023\/A:1013912006537","article-title":"Logistic regression, AdaBoost and Bregman distances","volume":"48","author":"Collins","year":"2002","journal-title":"Mach. Learn."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Sumner, M., Frank, E., and Hall, M. (2005, January 3\u20137). Speeding up Logistic Model Tree Induction. Proceedings of the 9th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD2005), Porto, Portugal.","DOI":"10.1007\/11564126_72"},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"415","DOI":"10.1109\/72.991427","article-title":"A comparison of methods for multiclass support vector machines","volume":"13","author":"Hsu","year":"2002","journal-title":"IEEE Trans. Neural Netw."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1145\/1961189.1961199","article-title":"LIBSVM: A library for support vector machines","volume":"2","author":"Chang","year":"2011","journal-title":"ACM Trans. Intell. Syst. Technol."},{"key":"ref_38","unstructured":"El-Manzalawy, Y., and Honavar, V. WLSVM: Integrating LibSVM into WEKA Environment. Available online at http:\/\/www.cs.iastate.edu\/yasser\/wlsvm."}],"container-title":["Informatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2227-9709\/1\/1\/32\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T21:50:59Z","timestamp":1760219459000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2227-9709\/1\/1\/32"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2013,11,28]]},"references-count":38,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2014,6]]}},"alternative-id":["informatics1010032"],"URL":"https:\/\/doi.org\/10.3390\/informatics1010032","relation":{},"ISSN":["2227-9709"],"issn-type":[{"type":"electronic","value":"2227-9709"}],"subject":[],"published":{"date-parts":[[2013,11,28]]}}}