{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,13]],"date-time":"2026-02-13T12:44:37Z","timestamp":1770986677447,"version":"3.50.1"},"reference-count":41,"publisher":"SAGE Publications","issue":"3","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["IDA"],"published-print":{"date-parts":[[2019,4,29]]},"DOI":"10.3233\/ida-183985","type":"journal-article","created":{"date-parts":[[2019,5,14]],"date-time":"2019-05-14T14:54:55Z","timestamp":1557845695000},"page":"661-680","source":"Crossref","is-referenced-by-count":11,"title":["On the use of word embedding for cross language plagiarism detection"],"prefix":"10.1177","volume":"23","author":[{"given":"Habibollah","family":"Asghari","sequence":"first","affiliation":[{"name":"School of Electrical and Computer Engineering, University of Tehran, Iran"}]},{"given":"Omid","family":"Fatemi","sequence":"additional","affiliation":[{"name":"School of Electrical and Computer Engineering, University of Tehran, Iran"}]},{"given":"Salar","family":"Mohtaj","sequence":"additional","affiliation":[{"name":"ICT Research Institute of ACECR, Tehran, Iran"}]},{"given":"Heshaam","family":"Faili","sequence":"additional","affiliation":[{"name":"School of Electrical and Computer Engineering, University of Tehran, Iran"}]},{"given":"Paolo","family":"Rosso","sequence":"additional","affiliation":[{"name":"Universitat Politecnica de Valencia, Spain"}]}],"member":"179","reference":[{"key":"10.3233\/IDA-183985_ref1","unstructured":"H. Asghari, K. Khoshnava, O. Fatemi and H. Faili, Developing bilingual plagiarism detection corpus using sentence aligned parallel corpus: Notebook for {PAN} at {CLEF} 2015, In L. Cappellato, N. Ferro, G.J.F. Jones and E. 
SanJuan, editors, Working Notes of {CLEF} 2015 \u2013 Conference and Labs of the Evaluation forum, Toulouse, France, September 8\u201311, 2015, volume 1391 of {CEUR} Workshop Proceedings, CEUR-WS.org, 2015."},{"key":"10.3233\/IDA-183985_ref2","first-page":"135","article-title":"Algorithms and corpora for persian plagiarism detection: overview of {PAN} at {FIRE} 2016","author":"Asghari","year":"2016","journal-title":"Working notes of {FIRE} 2016 \u2013 Forum for Information Retrieval Evaluation, December 7\u201310, 2016"},{"key":"10.3233\/IDA-183985_ref3","first-page":"597","article-title":"Paraphrasing with bilingual parallel corpora","author":"Bannard","year":"2005","journal-title":"{ACL} 2005, 43rd Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, 25\u201330 June 2005"},{"key":"10.3233\/IDA-183985_ref4","unstructured":"A. Barr\u00f3n-Cede\u00f1o, M. Potthast, P. Rosso and B. Stein, Corpus and evaluation measures for automatic plagiarism detection, In N. Calzolari, K. Choukri, B. Maegaard, J. Mariani, J. Odijk, S. Piperidis, M. Rosner and D. Tapias, editors, Proceedings of the International Conference on Language Resources and Evaluation, {LREC} 2010, 17\u201323 May 2010, Valletta, Malta. 
European Language Resources Association, 2010."},{"key":"10.3233\/IDA-183985_ref5","first-page":"37","article-title":"Plagiarism detection across distant language pairs","author":"Barr\u00f3n-Cede\u00f1o","year":"2010","journal-title":"{COLING} 2010, 23rd International Conference on Computational Linguistics, Proceedings of the Conference, 23\u201327 August 2010, Beijing, China"},{"key":"10.3233\/IDA-183985_ref6","first-page":"59","article-title":"PAN@FIRE: Overview of the cross-language !ndian text re-use detection competition","author":"Barr\u00f3n-Cede\u00f1o","year":"2011","journal-title":"Multilingual Information Access in South Asian Languages \u2013 Second International Workshop, {FIRE} 2010, Gandhinagar, India, February 19\u201321, 2010 and Third International Workshop, {FIRE} 2011, Bombay, India, December 2\u20134, 2011, Revised Selected Papers"},{"key":"10.3233\/IDA-183985_ref7","unstructured":"A. Barr\u00f3n-Cede\u00f1o, P. Rosso, D. Pinto and A. Juan, On cross-lingual plagiarism analysis using a statistical model, In B. Stein, E. Stamatatos and M. Koppel, editors, Proceedings of the ECAI\u201908 Workshop on Uncovering Plagiarism, Authorship and Social Software Misuse, Patras, Greece, July 22, 2008, volume 377 of {CEUR} Workshop Proceedings. CEUR-WS.org, 2008."},{"key":"10.3233\/IDA-183985_ref8","first-page":"502","article-title":"Paraphrase substitution for recognizing textual entailment","author":"Bosma","year":"2006","journal-title":"Evaluation of Multilingual and Multi-modal Information Retrieval, 7th Workshop of the Cross-Language Evaluation Forum, {CLEF} 2006, Alicante, Spain, September 20\u201322, 2006, Revised Selected Papers"},{"key":"10.3233\/IDA-183985_ref9","first-page":"83","article-title":"Multilingual Plagiarism Detection","author":"Ceska","year":"2008","journal-title":"Artificial Intelligence: Methodology, Systems, and Applications, 13th International Conference, {AIMSA} 2008, Varna, Bulgaria, September 4\u20136, 2008. 
Proceedings"},{"issue":"4","key":"10.3233\/IDA-183985_ref10","doi-asserted-by":"crossref","first-page":"14:1","DOI":"10.1145\/1644879.1644881","article-title":"Arabic natural language processing: challenges and solutions","volume":"8","author":"Farghaly","year":"2009","journal-title":"{ACM} Trans. Asian Lang. Inf. Process."},{"key":"10.3233\/IDA-183985_ref11","unstructured":"J. Ferrero, F. Agn\u00e8s, L. Besacier and D. Schwab, A multilingual, multi-style and multi-granularity dataset for cross-language textual similarity detection, In N. Calzolari, K. Choukri, T. Declerck, S. Goggi, M. Grobelnik, B. Maegaard, J. Mariani, H. Mazo, A. Moreno, J. Odijk and S. Piperidis, editors, Proceedings of the Tenth International Conference on Language Resources and Evaluation {LREC} 2016, Portoro\u017e, Slovenia, May 23\u201328, 2016, European Language Resources Association {(ELRA)}, 2016."},{"key":"10.3233\/IDA-183985_ref12","first-page":"109","article-title":"CompiLIG at SemEval-2017 Task 1: Cross-language plagiarism detection methods for semantic textual similarity","author":"Ferrero","year":"2017","journal-title":"Proceedings of the 11th International Workshop on Semantic Evaluation, SemEval@ACL 2017, Vancouver, Canada, August 3\u20134, 2017"},{"key":"10.3233\/IDA-183985_ref13","first-page":"227","article-title":"Knowledge graphs as context models: Improving the detection of cross-language plagiarism with paraphrasing","author":"Franco-Salvador","year":"2013","journal-title":"Bridging Between Information Retrieval and Databases \u2013 {PROMISE} Winter School 2013, Bressanone, Italy, February 4\u20138, 2013. 
Revised Tutorial Lectures"},{"key":"10.3233\/IDA-183985_ref14","doi-asserted-by":"crossref","first-page":"87","DOI":"10.1016\/j.knosys.2016.08.004","article-title":"Cross-language plagiarism detection over continuous-space-and knowledge graph-based representations of language","volume":"111","author":"Franco-Salvador","year":"2016","journal-title":"Knowl.-Based Syst."},{"issue":"4","key":"10.3233\/IDA-183985_ref15","doi-asserted-by":"crossref","first-page":"550","DOI":"10.1016\/j.ipm.2015.12.004","article-title":"A systematic study of knowledge graph analysis for cross-language plagiarism detection","volume":"52","author":"Franco-Salvador","year":"2016","journal-title":"Inf. Process. Manage."},{"key":"10.3233\/IDA-183985_ref16","first-page":"154","article-title":"A deep learning approach to persian plagiarism detection","author":"Gharavi","year":"2016","journal-title":"Working notes of FIRE 2016 \u2013 Forum for Information Retrieval Evaluation, Kolkata, India, December 7\u201310, 2016"},{"key":"10.3233\/IDA-183985_ref17","first-page":"748","article-title":"BilBOWA: Fast bilingual distributed representations without word alignments","author":"Gouws","year":"2015","journal-title":"Proceedings of the 32nd International Conference on Machine Learning, {ICML} 2015, Lille, France, 6\u201311 July 2015"},{"key":"10.3233\/IDA-183985_ref18","first-page":"67","article-title":"Cross-language high similarity search using a conceptual thesaurus","author":"Gupta","year":"2012","journal-title":"Information Access Evaluation. Multilinguality, Multimodality, and Visual Analytics \u2013 Third International Conference of the {CLEF} Initiative, {CLEF} 2012, Rome, Italy, September 17\u201320, 2012. 
Proceedings"},{"key":"10.3233\/IDA-183985_ref19","first-page":"79","article-title":"Mapping hindi-english text re-use document pairs","author":"Gupta","year":"2011","journal-title":"Multilingual Information Access in South Asian Languages\u00a0\u2013 Second International Workshop, {FIRE} 2010, Gandhinagar, India, February 19\u201321, 2010 and Third International Workshop, {FIRE} 2011, Bombay, India, December 2\u20134, 2011, Revised Selected Papers"},{"key":"10.3233\/IDA-183985_ref20","doi-asserted-by":"crossref","unstructured":"C.K. Kent and N. Salim, Web based cross language plagiarism detection, CoRR, abs\/0912.3, 2009.","DOI":"10.1109\/CIMSiM.2010.10"},{"issue":"1-2","key":"10.3233\/IDA-183985_ref21","doi-asserted-by":"crossref","first-page":"73","DOI":"10.1023\/B:INRT.0000009441.78971.be","article-title":"Character N-gram tokenization for european language text retrieval","volume":"7","author":"McNamee","year":"2004","journal-title":"Inf. Retr."},{"key":"10.3233\/IDA-183985_ref22","unstructured":"T. Mikolov, K. Chen, G. Corrado and J. Dean, Efficient estimation of word representations in vector space, CoRR, abs\/1301.3, 2013."},{"key":"10.3233\/IDA-183985_ref23","unstructured":"S. Mohtaj, B. Roshanfekr, A. Zafarian and H. Asghari, Parsivar: A language processing toolkit for persian, In N. Calzolari, K. Choukri, C. Cieri, T. Declerck, S. Goggi, K. Hasida, H. Isahara, B. Maegaard, J. Mariani, H. Mazo, A. Moreno, J. Odijk, S. Piperidis and T. 
Tokunaga, editors, Proceedings of the Eleventh International Conference on Language Resources and Evaluation, LREC 2018, Miyazaki, Japan, May 7\u201312, 2018, European Language Resources Association ELRA, 2018."},{"key":"10.3233\/IDA-183985_ref24","first-page":"216","article-title":"BabelNet: Building a very large multilingual semantic network","author":"Navigli","year":"2010","journal-title":"{ACL} 2010, Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, July 11\u201316, 2010, Uppsala, Sweden"},{"key":"10.3233\/IDA-183985_ref25","unstructured":"R.M.A. Nawab, M. Stevenson and P.D. Clough, University of Sheffield \u2013 Lab Report for {PAN} at {CLEF} 2010, In M. Braschler, D. Harman and E. Pianta, editors, {CLEF} 2010 LABs and Workshops, Notebook Papers, 22\u201323 September 2010, Padua, Italy, volume 1176 of {CEUR} Workshop Proceedings, CEUR-WS.org, 2010."},{"key":"10.3233\/IDA-183985_ref26","unstructured":"G. Oberreuter, G. L\u2019Huillier, S.A. Rios and J.D. Vel\u00e1squez, Approaches for intrinsic and external plagiarism detection\u00a0\u2013 Notebook for {PAN} at {CLEF} 2011, In V. Petras, P. Forner and P.D. Clough, editors, {CLEF} 2011 Labs and Workshop, Notebook Papers, 19\u201322 September 2011, Amsterdam, The Netherlands, volume 1177 of {CEUR} Workshop Proceedings, CEUR-WS.org, 2011."},{"key":"10.3233\/IDA-183985_ref27","first-page":"15","article-title":"A new approach for cross-language plagiarism analysis","author":"Pereira","year":"2010","journal-title":"Multilingual and Multimodal Information Access Evaluation, International Conference of the Cross-Language Evaluation Forum, {CLEF} 2010, Padua, Italy, September 20\u201323, 2010. Proceedings"},{"issue":"1","key":"10.3233\/IDA-183985_ref28","doi-asserted-by":"crossref","first-page":"51","DOI":"10.1016\/j.jalgor.2009.02.005","article-title":"A statistical approach to crosslingual natural language tasks","volume":"64","author":"Pinto","year":"2009","journal-title":"J. 
Algorithms"},{"key":"10.3233\/IDA-183985_ref29","unstructured":"M. Potthast, A. Barr\u00f3n-Cede\u00f1o, A. Eiselt, B. Stein and P. Rosso, Overview of the 2nd international competition on plagiarism detection, In M. Braschler, D. Harman and E. Pianta, editors, {CLEF} 2010 LABs and Workshops, Notebook Papers, 22\u201323 September 2010, Padua, Italy, volume 1176 of {CEUR} Workshop Proceedings, CEUR-WS.org, 2010."},{"issue":"1","key":"10.3233\/IDA-183985_ref30","doi-asserted-by":"crossref","first-page":"45","DOI":"10.1007\/s10579-009-9114-z","article-title":"Cross-language plagiarism detection","volume":"45","author":"Potthast","year":"2011","journal-title":"Language Resources and Evaluation"},{"key":"10.3233\/IDA-183985_ref31","unstructured":"M. Potthast, A. Eiselt, A. Barr\u00f3n-Cede\u00f1o, B. Stein and P. Rosso, Overview of the 3rd international competition on plagiarism detection, In V. Petras, P. Forner and P.D. Clough, editors, {CLEF} 2011 Labs and Workshop, Notebook Papers, 19\u201322 September 2011, Amsterdam, The Netherlands, volume 1177 of {CEUR} Workshop Proceedings. CEUR-WS.org, 2011."},{"key":"10.3233\/IDA-183985_ref32","unstructured":"M. Potthast, S. Goering, P. Rosso and B. Stein, Towards data submissions for shared tasks: First experiences for the task of text alignment, In L. Cappellato, N. Ferro, G.J.F. Jones and E. SanJuan, editors, Working Notes of {CLEF} 2015\u00a0\u2013 Conference and Labs of the Evaluation forum, Toulouse, France, September 8\u201311, 2015, volume 1391 of {CEUR} Workshop Proceedings, CEUR-WS.org, 2015."},{"key":"10.3233\/IDA-183985_ref33","doi-asserted-by":"crossref","first-page":"522","DOI":"10.1007\/978-3-540-78646-7_51","article-title":"A wikipedia-based multilingual retrieval model","author":"Potthast","year":"2008","journal-title":"Advances in Information Retrieval, 30th European Conference on {IR} Research, {ECIR} 2008, Glasgow, UK, March 30\u2013April 3, 2008. 
Proceedings"},{"key":"10.3233\/IDA-183985_ref34","first-page":"997","article-title":"An evaluation framework for plagiarism detection","author":"Potthast","year":"2010","journal-title":"{COLING} 2010, 23rd International Conference on Computational Linguistics, Posters Volume, 23\u201327 August 2010, Beijing, China"},{"key":"10.3233\/IDA-183985_ref35","unstructured":"B. Pouliquen, R. Steinberger and C. Ignat, Automatic identification of document translations in large multilingual document collections, CoRR, abs\/cs\/060, 2006."},{"key":"10.3233\/IDA-183985_ref36","first-page":"233","article-title":"Automatic 3-language cross-language information retrieval with latent semantic indexing","author":"Rehder","year":"1997","journal-title":"Proceedings of The Sixth Text REtrieval Conference, {TREC} 1997, Gaithersburg, Maryland, USA, November 19\u201321, 1997"},{"key":"10.3233\/IDA-183985_ref37","unstructured":"B. Stein, E. Stamatatos and M. Koppel, Proceedings of the ECAI\u201908 Workshop on Uncovering Plagiarism, Authorship and Social Software Misuse, Patras, Greece, July 22, 2008, volume 377 of {CEUR} Workshop Proceedings, CEUR-WS.org, 2008."},{"key":"10.3233\/IDA-183985_ref38","first-page":"825","article-title":"Strategies for retrieving plagiarized documents","author":"Stein","year":"2007","journal-title":"{SIGIR} 2007: Proceedings of the 30th Annual International {ACM}{SIGIR} Conference on Research and Development in Information Retrieval, Amsterdam, The Netherlands, July 23\u201327, 2007"},{"key":"10.3233\/IDA-183985_ref39","unstructured":"J. Wieting, M. Bansal, K. Gimpel and K. Livescu, Towards universal paraphrastic sentence embeddings, CoRR, abs\/1511.0, 2015."},{"key":"10.3233\/IDA-183985_ref40","unstructured":"V. Zarrabi, J. Rafiei, K. Khoshnava, H. Asghari and S. Mohtaj, Evaluation of text reuse corpora for text alignment task of plagiarism detection, In L. Cappellato, N. Ferro, G.J.F. Jones and E. 
SanJuan, editors, Working Notes of {CLEF} 2015\u00a0\u2013 Conference and Labs of the Evaluation forum, Toulouse, France, September 8\u201311, 2015, volume 1391 of {CEUR} Workshop Proceedings, CEUR-WS.org, 2015."},{"key":"10.3233\/IDA-183985_ref41","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1016\/j.knosys.2013.06.018","article-title":"Methods for cross-language plagiarism detection","volume":"50","author":"Barr\u00f3n-Cede\u00f1o","year":"2013","journal-title":"Knowledge-Based Systems"}],"container-title":["Intelligent Data Analysis"],"original-title":[],"link":[{"URL":"https:\/\/content.iospress.com\/download?id=10.3233\/IDA-183985","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,11]],"date-time":"2025-03-11T06:44:19Z","timestamp":1741675459000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/full\/10.3233\/IDA-183985"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,4,29]]},"references-count":41,"journal-issue":{"issue":"3"},"URL":"https:\/\/doi.org\/10.3233\/ida-183985","relation":{},"ISSN":["1088-467X","1571-4128"],"issn-type":[{"value":"1088-467X","type":"print"},{"value":"1571-4128","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,4,29]]}}}