{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,6,8]],"date-time":"2024-06-08T06:00:35Z","timestamp":1717826435740},"reference-count":24,"publisher":"IGI Global","issue":"1","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2016,1,1]]},"abstract":"<p>Day after day the cases of plagiarism increase and become a crucial problem in the modern world caused by the quantity of textual information available in the web. Data mining becomes the foundation for many different domains as one of its chores is the text categorization, which can be used in order to resolve the impediment of automatic plagiarism detection. This article is devoted to a new approach for combating plagiarism named MML (Multi-agents Machine learning system) and is composed of three modules: data preparation and digitalization, using n-gram character or bag of words as methods for the text representation; TF*IDF as weighting to calculate the importance of each term in the corpus in order to transform each document to a vector; and learning and voting phase using three supervised learning algorithms (decision tree c4.5, na\u00efve Bayes and support vector machine).<\/p>","DOI":"10.4018\/ijats.2016010101","type":"journal-article","created":{"date-parts":[[2017,11,30]],"date-time":"2017-11-30T17:56:29Z","timestamp":1512064589000},"page":"1-17","source":"Crossref","is-referenced-by-count":2,"title":["Multi-Agents Machine Learning (MML) System for Plagiarism Detection"],"prefix":"10.4018","volume":"8","author":[{"given":"Hadj Ahmed","family":"Bouarara","sequence":"first","affiliation":[{"name":"Department of Computer Science, Dr. Tahar Moulay University, Saida, Algeria"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"2432","reference":[{"key":"IJATS.2016010101-0","unstructured":"Basile, C. (2009). A plagiarism detection procedure in three steps: selection, matches and squares. In Proceeding of the SEPLN \u201909 pan 09 3rd workshop and 1st international competition on plagiarism, San Sebastian, Spain (pp. 19-23). IEEE."},{"key":"IJATS.2016010101-1","first-page":"19","article-title":"A plagiarism detection procedure in three steps: Selection, matches and \u201csquares\u201d.","author":"C.Basile","year":"2009","journal-title":"Proc. of SEPLN"},{"key":"IJATS.2016010101-2","doi-asserted-by":"publisher","DOI":"10.1007\/s10579-009-9112-1"},{"issue":"2","key":"IJATS.2016010101-3","article-title":"Plagiarism Detection using Sequential Pattern Mining.","volume":"5","author":"A.El-Matarawy","year":"2013","journal-title":"International Journal of Applied Information Systems"},{"key":"IJATS.2016010101-4","first-page":"173","article-title":"A novel genetic algorithm for automatic clustering.","author":"G.Gautam","year":"2004","journal-title":"Pattern Recognition Letters"},{"key":"IJATS.2016010101-5","unstructured":"Ghosh, A., Bhaskar, P., Pal, S., & Bandyopadhyay, S. (2011). Rule Based Plagiarism Detection using Information Retrieval. In Proceedings of the CLEF 2011 Conference on Multilingual and Multimodal Information Access Evaluation (pp. 19-22). Amsterdam."},{"key":"IJATS.2016010101-6","doi-asserted-by":"publisher","DOI":"10.1016\/0305-0548(93)E0023-M"},{"key":"IJATS.2016010101-7","unstructured":"Grozea, C., Gehl, C., & Popescu, M. (2009, September). ENCOPLOT: Pairwise sequence matching in linear time applied to plagiarism detection. In Proceedings of the 3rd PAN Workshop. Uncovering Plagiarism, Authorship and Social Software Misuse (p. 10)."},{"key":"IJATS.2016010101-8","doi-asserted-by":"publisher","DOI":"10.4018\/jalr.2012070101"},{"key":"IJATS.2016010101-9","doi-asserted-by":"publisher","DOI":"10.4018\/jaec.2013040104"},{"key":"IJATS.2016010101-10","unstructured":"Jalam, R. (2003). Apprentissage automatique et cat\u00e9gorisation de textes multilingues [Doctoral dissertation]. Universit\u00e9 Lumi\u00e8re-Lyon."},{"key":"IJATS.2016010101-11","first-page":"50","article-title":"Semantic similarity based on corpus statistics and lexical taxonomy.","author":"J. J.Jiang","year":"1997","journal-title":"International Conference Research on Computational Linguistics"},{"key":"IJATS.2016010101-12","first-page":"24","article-title":"Finding plagiarism by evaluating document similarities.","volume":"Vol. 9","author":"J.Kasprzak","year":"2009","journal-title":"Proc. of SEPLN"},{"key":"IJATS.2016010101-13","unstructured":"Kasprzak, J., Brandejs, M., & Miroslav, K. (2009). Finding Plagiarism by Evaluating Document Similarities. In Proceeding of the SEPLN \u201909 pan 09 3rd workshop and 1st international competition on plagiarism, San Sebastian, Spain (pp. 24-28). Adventure Works Press."},{"key":"IJATS.2016010101-14","unstructured":"Meyer, Z. E., Sven, & Benno, S. (2008). Intrinsic plagiarism detection. In Proceedings of the European Conference on Information Retrieval (ECIR \u201906) (pp. 565\u2013569). Springer."},{"key":"IJATS.2016010101-15","unstructured":"P, B., Ralf, S., & I, C. (2003). Automatic Identification of Document Translations in Large Multilingual Document Collections. In Proceedings of the International Conference Recent Advances in Natural Language Processing (pp. 401-408)."},{"key":"IJATS.2016010101-16","unstructured":"Palkovskii, Y., Belov, A., & Muzyka, I. (2011). Using WordNet-based semantic similarity measurement in External Plagiarism Detection. In Proceedings of CLEF 2011 Conference on Multilingual and Multimodal Information Access Evaluation (pp. 19-22). Amsterdam: IEEE."},{"key":"IJATS.2016010101-17","unstructured":"Sayad, D. S. (2010). decision_tree. Retrieved November 20, 2013, from http:\/\/www.saedsayad.com\/"},{"issue":"3","key":"IJATS.2016010101-18","article-title":"Semantic plagiarism detection system using ontology mapping.","volume":"3","author":"K. C.Shet","year":"2012","journal-title":"Advances in Computers"},{"key":"IJATS.2016010101-19","unstructured":"Stamatatos, E. (2009). Intrinsic Plagiarism Detection Using Character n-gram Profiles. In Proceeding of the SEPLN \u201909 pan 09 3rd workshop and 1st international competition on plagiarism, San Sebastian, Spain (pp. 38-46). Springer."},{"key":"IJATS.2016010101-20","first-page":"527","article-title":"Principles of Hash-Based Text Retrieval.","author":"B.Stein","year":"2007","journal-title":"30th Annual International ACM SIGIR Conference"},{"key":"IJATS.2016010101-21","first-page":"37","article-title":"Using Syntactic Information to Identify Plagiarism.","author":"O.Uzuner","year":"2005","journal-title":"Proceedings of the 2nd Workshop on Building Educational Applications Using NLP"},{"key":"IJATS.2016010101-22","unstructured":"Weimar, B.-U. (2009, September 10). webis groupe weimer. Retrieved November 9, 2013, from http:\/\/www.webis.de\/research\/events\/pan-09"},{"key":"IJATS.2016010101-23","unstructured":"Zechner, M., Muhr, M., & Kern, R. (2009). external and intrinsic plagiarism detection using vector space model. In Proceedings of SEPLN, San Sebastian, Spain (Vol. 32, pp. 47-55). IEEE."}],"container-title":["International Journal of Agent Technologies and Systems"],"original-title":[],"language":"ng","link":[{"URL":"https:\/\/www.igi-global.com\/viewtitle.aspx?TitleId=193955","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,6,2]],"date-time":"2022-06-02T01:22:52Z","timestamp":1654132972000},"score":1,"resource":{"primary":{"URL":"https:\/\/services.igi-global.com\/resolvedoi\/resolve.aspx?doi=10.4018\/IJATS.2016010101"}},"subtitle":[""],"short-title":[],"issued":{"date-parts":[[2016,1,1]]},"references-count":24,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2016,1]]}},"URL":"https:\/\/doi.org\/10.4018\/ijats.2016010101","relation":{},"ISSN":["1943-0744","1943-0752"],"issn-type":[{"value":"1943-0744","type":"print"},{"value":"1943-0752","type":"electronic"}],"subject":[],"published":{"date-parts":[[2016,1,1]]}}}