{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,2]],"date-time":"2026-05-02T01:34:34Z","timestamp":1777685674161,"version":"3.51.4"},"reference-count":34,"publisher":"SAGE Publications","issue":"3","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["HIS"],"published-print":{"date-parts":[[2020,9,28]]},"abstract":"<jats:p>With the growth of the content found throughout the Web, every information can be plagiarized. Plagiarism is the process of using the ideas of another without naming the source. Consequently, plagiarism detection is necessary but complicated as it is often facing significant challenges given the large amount of material on the World-wide-web and the limited access to a substantial part of them. In this paper, we present a novel plagiarism detection method for French documents. The proposed method combines the intrinsic and extrinsic aspects for plagiarism detection. We achieved good results with both approaches. For the extrinsic method, we achieved an accuracy of 62% for the first tests of the method. As for the intrinsic, we achieved an F-score of 0.328.<\/jats:p>","DOI":"10.3233\/his-200284","type":"journal-article","created":{"date-parts":[[2020,7,3]],"date-time":"2020-07-03T13:25:08Z","timestamp":1593782708000},"page":"163-175","source":"Crossref","is-referenced-by-count":2,"title":["Hybrid plagiarism detection method for French language"],"prefix":"10.1177","volume":"16","author":[{"given":"Maryam","family":"Elamine","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Seifeddine","family":"Mechti","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Lamia Hadrich","family":"Belguith","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"179","reference":[{"key":"10.3233\/HIS-200284_ref1","unstructured":"A. Magooda, A.Y. Mahgoub, M. Rashwan, M.B. Fayek and H. Raafat, RDI System for Extrinsic Plagiarism Detection (RDI_RED) \u2013 Working Notes for PANAraPlagDet at FIRE 2015, in: FIRE 2015 Working Notes Papers, 2015."},{"issue":"4","key":"10.3233\/HIS-200284_ref2","first-page":"245","article-title":"Moss: A system for detecting software plagiarism","volume":"23","author":"Aiken","year":"2015","journal-title":"University of California-Berkeley"},{"key":"10.3233\/HIS-200284_ref3","doi-asserted-by":"crossref","unstructured":"A. Polydouri, G. Siolas and A. Stafylopatis, Intrinsic plagiarism detection with feature-rich imbalanced dataset learning, in: International Conference on Engineering Applications of Neural Networks, EANN 2017: Engineering Applications of Neural Networks, 2017, pp. 99\u2013110.","DOI":"10.1007\/978-3-319-65172-9_9"},{"key":"10.3233\/HIS-200284_ref4","unstructured":"A.V. Belyy and M.A. Dubova, Framework for Russian plagiarism detection using sentence embedding similarity and negative sampling, in: Proceedings of the International Conference Dialogue 2018, Computational Linguistics and Intellectual Technologies, 2018, pp. 96\u2013109."},{"key":"10.3233\/HIS-200284_ref5","doi-asserted-by":"crossref","unstructured":"A.M. Uzun and S. Kilis, Investigating antecedents of plagiarism using extended theory of planned behavior, Computers & Education 144 (2020).","DOI":"10.1016\/j.compedu.2019.103700"},{"issue":"2","key":"10.3233\/HIS-200284_ref6","first-page":"36","article-title":"Plagiarism: A misplaced emphasis","volume":"3","author":"Martin","year":"1994","journal-title":"Journal of Information Ethics"},{"key":"10.3233\/HIS-200284_ref7","doi-asserted-by":"crossref","unstructured":"B. Stein, S.M. Eissen and M. Potthast, Strategies for Retrieving Plagiarized Documents, in: SIGIR \u201907: Proceedings of the 30\ud835\udc61\u210e Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007, pp. 825\u2013826.","DOI":"10.1145\/1277741.1277928"},{"issue":"1","key":"10.3233\/HIS-200284_ref8","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1007\/s10579-010-9115-y","article-title":"Intrinsic plagiarism analysis","volume":"45","author":"Stein","year":"2011","journal-title":"Language Resources and Evaluation"},{"key":"10.3233\/HIS-200284_ref9","unstructured":"C. Grozea, C. Gehl and M. Popescu, ENCOPLOT: Pairwise sequence matching in linear time applied to plagiarism detection, in: Proceedings of the 3\ud835\udc5f\ud835\udc51 PAN@CLEF. Uncovering Plagiarism, Authorship and Social Software Misuse, 2009, pp.\u00a010\u201318."},{"key":"10.3233\/HIS-200284_ref10","unstructured":"D. Kara\u015b, M. \u015apiewak and P. Sobecki, OPI-JSA at CLEF 2017: Author Clustering and Style Breach Detection, in: Proceedings of the 9\ud835\udc61\u210e PAN@CLEF Competition, 2017."},{"key":"10.3233\/HIS-200284_ref11","doi-asserted-by":"crossref","unstructured":"D. Sakamoto and K. Tsuda, A Detection Method for Plagiarism Reports of Students, in: 23\ud835\udc5f\ud835\udc51 International Conference on Knowledge-Based and Intelligent Information & Engineering Systems, Procedia Computer Science, 159 (2019), 1329\u20131338.","DOI":"10.1016\/j.procs.2019.09.303"},{"key":"10.3233\/HIS-200284_ref12","unstructured":"D. Zlatkova, D. Kopev, K. Mitov, A. Atanasov, M. Hardalov, I. Koychev and P. Nakov, An Ensemble-Rich Multi-Aspect Approach for Robust Style Change Detection Notebook for PAN at CLEF-2018, in: L. Cappellato, N. Ferro, J.-Y. Nie and L. Soulier, eds, CLEF 2018 Evaluation Labs and Workshop Working Notes Papers, Avignon, France, 2018."},{"key":"10.3233\/HIS-200284_ref14","doi-asserted-by":"crossref","unstructured":"G. Oberreuter, G. L\u2019Huilier, S. Rios and J.D. Vel\u00e1squez, Approaches for intrinsic and external plagiarism detection, in: Proceedings of the 3\ud835\udc5f\ud835\udc51 PAN@Conference and Labs of the Evaluation Forum (CLEF), Netherlands, 2011.","DOI":"10.1007\/978-3-642-23863-5_2"},{"issue":"9","key":"10.3233\/HIS-200284_ref15","doi-asserted-by":"crossref","first-page":"3756","DOI":"10.1016\/j.eswa.2012.12.082","article-title":"Text mining applied to plagiarism detection: The use of words for detecting deviations in the writing style","volume":"40","author":"Oberreuter","year":"2013","journal-title":"Expert Systems with Applications"},{"key":"10.3233\/HIS-200284_ref16","doi-asserted-by":"crossref","first-page":"661","DOI":"10.3233\/IDA-183985","article-title":"On the use of word embedding for cross language plagiarism detection","volume":"23","author":"Asghari","year":"2019","journal-title":"Intelligent Data Analysis"},{"key":"10.3233\/HIS-200284_ref17","doi-asserted-by":"crossref","unstructured":"I. Ben Salem, P. Rosso and S. Chikhi, Intrinsic plagiarism detection using n-gram classes, in: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Association for Computational Linguistics, 2014, pp. 1459\u20131464.","DOI":"10.3115\/v1\/D14-1153"},{"issue":"3","key":"10.3233\/HIS-200284_ref18","doi-asserted-by":"crossref","first-page":"363","DOI":"10.1007\/s10579-019-09444-w","article-title":"On the use of character n-grams as the only intrinsic evidence of plagiarism","volume":"53","author":"Ben Salem","year":"2019","journal-title":"Language Resources and Evaluation"},{"key":"10.3233\/HIS-200284_ref20","unstructured":"L. Quoc and T. Mikolov, Distributed representations of sentences and documents, in: Proceedings of the 31\ud835\udc60\ud835\udc61 International Conference on Machine Learning, PMLR 32(2), 2014, pp. 1188\u20131196."},{"key":"10.3233\/HIS-200284_ref21","doi-asserted-by":"crossref","first-page":"700","DOI":"10.1016\/j.future.2017.11.023","article-title":"An integrated approach for intrinsic plagiarism detection","volume":"96","author":"Al-Sallal","year":"2019","journal-title":"Future Generation Computer Systems"},{"key":"10.3233\/HIS-200284_ref22","doi-asserted-by":"crossref","unstructured":"M. Elamine, S. Mechti and L. Belguith, An unsupervised method for detecting style breaches in a document, in: IEEE\/ACS 16th International Conference on Computer Systems and Applications (AICCSA), 2019.","DOI":"10.1109\/AICCSA47632.2019.9035264"},{"key":"10.3233\/HIS-200284_ref23","doi-asserted-by":"crossref","unstructured":"M. Elamine, F. Bougares, S. Mechti and L. Belguith, Extrinsic plagiarism detection for French language with word embeddings, in: 19\ud835\udc61\u210e International Conference on Intelligent Systems Design and Applications ISDA, 2019.","DOI":"10.1007\/978-3-030-49342-4_21"},{"issue":"2","key":"10.3233\/HIS-200284_ref25","first-page":"80","article-title":"APlag: A plagiarism checker for arabic texts","volume":"10","author":"El Bachir","year":"2014","journal-title":"I.J. Information Technology and Computer Science"},{"key":"10.3233\/HIS-200284_ref26","unstructured":"M. Kuznetsov, A. Motrenko, R. Kuznetsova and V. Strijov, Methods for intrinsic plagiarism and author diarization, in: K. Balog, L. Cappellato, N. Ferro and C. Macdonald, eds, CLEF 2016 Evaluation Labs and Workshop, 2016, pp. 912\u2013919."},{"key":"10.3233\/HIS-200284_ref27","unstructured":"M. Potthast, B. Stein, A. Barron-Cedeno and P. Rosso, An evaluation framework for plagiarism detection, in: Proceedings of the 23\ud835\udc5f\ud835\udc51 International Conference on Computational Linguistics (Coling), 2010, pp. 997\u20131005."},{"issue":"7","key":"10.3233\/HIS-200284_ref28","first-page":"46","article-title":"Grey wolf optimizer","volume":"69","author":"Seyedali","year":"2014","journal-title":"Journal of Advances in Engineering Software"},{"key":"10.3233\/HIS-200284_ref29","doi-asserted-by":"crossref","unstructured":"M. Zaher, A. Shehab, M. Elhoseny and F.F. Farahat, Unsupervised model for detecting plagiarism in internet-based handwritten arabic documents, Journal of Organizational and End User Computing 32(2) (2020), Article 3, 25 pages.","DOI":"10.4018\/JOEUC.2020040103"},{"key":"10.3233\/HIS-200284_ref31","doi-asserted-by":"crossref","unstructured":"N. Meuschke, V. Stange, M. Schubotz and B. Gipp, HyPlag: A Hybrid Approach to Academic Plagiarism Detection, in: Proceedings of the 41\ud835\udc60\ud835\udc61 International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018, pp. 1321\u20131324.","DOI":"10.1145\/3209978.3210177"},{"key":"10.3233\/HIS-200284_ref33","doi-asserted-by":"crossref","unstructured":"Q. Zhang and A. Youssef, An approach to math-similarity search, in: Intelligent Computer Mathematics, CICM 2014: Intelligent Computer Mathematics, 2014, pp. 404\u2013418.","DOI":"10.1007\/978-3-319-08434-3_29"},{"issue":"15","key":"10.3233\/HIS-200284_ref34","first-page":"1815","article-title":"Representation learning for plagiarism detection","volume":"199","author":"Menon","year":"2018","journal-title":"International Journal of Pure and Applied Mathematics"},{"issue":"4","key":"10.3233\/HIS-200284_ref36","first-page":"322","article-title":"Automatic plagiarism detection using similarity analysis","volume":"9","author":"Hariharan","year":"2012","journal-title":"The International Arab Journal of Information Technology"},{"key":"10.3233\/HIS-200284_ref37","unstructured":"T. Folt\u00fdnek, N. Meuschke and B. Gipp, Academic plagiarism detection: A systematic literature review, ACM Computing Surveys 52(6) (2019), Article 112, 42 pages."},{"issue":"12","key":"10.3233\/HIS-200284_ref38","first-page":"3226","article-title":"Identifying document-level text plagiarism: A two phase approach","volume":"12","author":"Gupta","year":"2017","journal-title":"Journal of Engineering Science and Technology"},{"key":"10.3233\/HIS-200284_ref39","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1016\/j.eswa.2016.12.022","article-title":"Detection of idea plagiarism using syntax-semantic concept extractions with genetic algorithm","volume":"73","author":"Gupta","year":"2017","journal-title":"Journal of Expert Systems with Applications"},{"key":"10.3233\/HIS-200284_ref40","first-page":"1054","article-title":"Cross-lingual latent semantic analysis","volume":"48","author":"Cox","year":"2008","journal-title":"Journal of the Australian New Zealand Industrial and Applied Mathematics"}],"container-title":["International Journal of Hybrid Intelligent Systems"],"original-title":[],"link":[{"URL":"https:\/\/content.iospress.com\/download?id=10.3233\/HIS-200284","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T08:52:49Z","timestamp":1777452769000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/full\/10.3233\/HIS-200284"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,9,28]]},"references-count":34,"journal-issue":{"issue":"3"},"URL":"https:\/\/doi.org\/10.3233\/his-200284","relation":{},"ISSN":["1448-5869","1875-8819"],"issn-type":[{"value":"1448-5869","type":"print"},{"value":"1875-8819","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,9,28]]}}}