{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,18]],"date-time":"2026-06-18T14:19:05Z","timestamp":1781792345406,"version":"3.54.5"},"reference-count":19,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2004,12,1]],"date-time":"2004-12-01T00:00:00Z","timestamp":1101859200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["J. Educ. Resour. Comput."],"published-print":{"date-parts":[[2004,12]]},"abstract":"<jats:p>With the increasing levels of access to higher education in the United Kingdom, larger class sizes make it unrealistic for tutors to be expected to identify instances of peer-to-peer plagiarism by eye and so automated solutions to the problem are required. This document details a novel algorithm for comparison of suspect documents at a sentence level and has been implemented as a component of plagiarism detection software for detecting similarities in both natural language documents and comments within program source-code. The algorithm is capable of detecting sophisticated obfuscation (such as paraphrasing, reordering, merging, and splitting sentences) as well as direct copying. The implemented algorithm has also been used to successfully detect plagiarism on real assignments at the university. The software has been evaluated by comparison with other plagiarism detection tools.<\/jats:p>","DOI":"10.1145\/1086339.1086341","type":"journal-article","created":{"date-parts":[[2005,11,7]],"date-time":"2005-11-07T16:00:45Z","timestamp":1131379245000},"page":"2","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":27,"title":["Sentence-based natural language plagiarism detection"],"prefix":"10.1145","volume":"4","author":[{"given":"Daniel R.","family":"White","sequence":"first","affiliation":[{"name":"University of Warwick, Coventry, United Kingdom"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Mike S.","family":"Joy","sequence":"additional","affiliation":[{"name":"University of Warwick, Coventry, United Kingdom"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2004,12]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"Plagiarism: A good practice guide. Tech. rep., Joint Information Services Committee. Available: http:\/\/www.jisc.ac.uk\/index.cfm?name=project_plag_practise (Accessed 27th","author":"Carroll J.","year":"2001","unstructured":"Carroll , J. and Appleton , J . 2001 . Plagiarism: A good practice guide. Tech. rep., Joint Information Services Committee. Available: http:\/\/www.jisc.ac.uk\/index.cfm?name=project_plag_practise (Accessed 27th January 2004). Carroll, J. and Appleton, J. 2001. Plagiarism: A good practice guide. Tech. rep., Joint Information Services Committee. Available: http:\/\/www.jisc.ac.uk\/index.cfm?name=project_plag_practise (Accessed 27th January 2004)."},{"key":"e_1_2_1_2_1","volume-title":"Joint Information Services Committee. Available: http:\/\/www.jisc.ac.uk\/index.cfm?name=project_plag_pilot (Accessed 20th","author":"Chester G.","year":"2005","unstructured":"Chester , G. 2001. Pilot of free-text detection software. Tech. rep ., Joint Information Services Committee. Available: http:\/\/www.jisc.ac.uk\/index.cfm?name=project_plag_pilot (Accessed 20th April 2005 ). Chester, G. 2001. Pilot of free-text detection software. Tech. rep., Joint Information Services Committee. Available: http:\/\/www.jisc.ac.uk\/index.cfm?name=project_plag_pilot (Accessed 20th April 2005)."},{"key":"e_1_2_1_3_1","volume-title":"5th International Conference of Information Visualisation (IV","author":"Culwin F.","year":"2001","unstructured":"Culwin , F. and Lancaster , T . 2001. Visualising intra-corpal plagiarism . In 5th International Conference of Information Visualisation (IV 2001 ). London, England. Culwin, F. and Lancaster, T. 2001. Visualising intra-corpal plagiarism. In 5th International Conference of Information Visualisation (IV 2001). London, England."},{"key":"e_1_2_1_4_1","unstructured":"Culwin F. and Lancaster T. 2004. Plagiarism prevention and detection. online. Available: http:\/\/cise.lsbu.ac.uk (Accessed 20th April 2005).  Culwin F. and Lancaster T. 2004. Plagiarism prevention and detection. online. Available: http:\/\/cise.lsbu.ac.uk (Accessed 20th April 2005)."},{"key":"e_1_2_1_5_1","volume-title":"2nd Annual Conference of the LTSN Centre for Information and Computer Sciences","author":"Culwin F.","unstructured":"Culwin , F. , Macleod , A. , and Lancaster , T . 2001. Source-code plagiarism in uk he computing schools . In 2nd Annual Conference of the LTSN Centre for Information and Computer Sciences . University of North London, England. Culwin, F., Macleod, A., and Lancaster, T. 2001. Source-code plagiarism in uk he computing schools. In 2nd Annual Conference of the LTSN Centre for Information and Computer Sciences. University of North London, England."},{"key":"e_1_2_1_6_1","volume-title":"Hodge defends higher education target. online. Available: http:\/\/education. guardian.co.uk\/print\/0,3858,4582592-108229,00.html (Accessed 20th","author":"Curtis P.","year":"2005","unstructured":"Curtis , P. 2003. Hodge defends higher education target. online. Available: http:\/\/education. guardian.co.uk\/print\/0,3858,4582592-108229,00.html (Accessed 20th April 2005 ). Curtis, P. 2003. Hodge defends higher education target. online. Available: http:\/\/education. guardian.co.uk\/print\/0,3858,4582592-108229,00.html (Accessed 20th April 2005)."},{"key":"e_1_2_1_7_1","volume-title":"Crisis on Campus: Confronting Academic Misconduct","author":"Decoo W.","unstructured":"Decoo , W. 2002. Crisis on Campus: Confronting Academic Misconduct . The MIT Press , Cambridge, MA . Decoo, W. 2002. Crisis on Campus: Confronting Academic Misconduct. The MIT Press, Cambridge, MA."},{"key":"e_1_2_1_8_1","volume-title":"Proceedings of the 25th Australasian Conference on Computer Science. Australian Computer Society, Inc., 59--64","author":"Finkel R. A.","unstructured":"Finkel , R. A. , Zaslavsky , A. , Monostori , K. , and Schmidt , H . 2002. Signature extraction for overlap detection in documents . In Proceedings of the 25th Australasian Conference on Computer Science. Australian Computer Society, Inc., 59--64 . Finkel, R. A., Zaslavsky, A., Monostori, K., and Schmidt, H. 2002. Signature extraction for overlap detection in documents. In Proceedings of the 25th Australasian Conference on Computer Science. Australian Computer Society, Inc., 59--64."},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1002\/asi.10170"},{"key":"e_1_2_1_10_1","volume-title":"Jisc service---solutions for a new era in education. online. Available: http:\/\/www.submit.ac.uk (Accessed 20th","author":"Paradigms","year":"2005","unstructured":"i Paradigms . 2005. Jisc service---solutions for a new era in education. online. Available: http:\/\/www.submit.ac.uk (Accessed 20th April 2005 ). iParadigms. 2005. Jisc service---solutions for a new era in education. online. Available: http:\/\/www.submit.ac.uk (Accessed 20th April 2005)."},{"key":"e_1_2_1_11_1","unstructured":"Joy M. S. and Luck M. 1998. Computer Based Assessment (Vol. 2): Case Studies in Science and Computing. SEED Publications University of Plymouth United Kingdom. The BOSS system for on-line submission and assessment of computing assignments 39--44.  Joy M. S. and Luck M. 1998. Computer Based Assessment (Vol. 2): Case Studies in Science and Computing. SEED Publications University of Plymouth United Kingdom. The BOSS system for on-line submission and assessment of computing assignments 39--44."},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/13.762946"},{"key":"e_1_2_1_13_1","volume-title":"Proceedings of the 6th Australian Document Computing Symposium (ADCS","author":"Monostori K.","year":"2001","unstructured":"Monostori , K. , Zaslavsky , A. , and Bia , A . 2001. Using the matchdetectreveal system for comparative analysis of texts . In Proceedings of the 6th Australian Document Computing Symposium (ADCS 2001 ). 51--58. Monostori, K., Zaslavsky, A., and Bia, A. 2001. Using the matchdetectreveal system for comparative analysis of texts. In Proceedings of the 6th Australian Document Computing Symposium (ADCS 2001). 51--58."},{"key":"e_1_2_1_14_1","volume-title":"International Conference on Computational Science (ICCS","author":"Monostori K.","year":"2002","unstructured":"Monostori , K. , Finkel , R. A. , Zaslavsky , A. , Hodasz , G. , and Pataki , M . 2002. Comparison of overlap detection techniques . In International Conference on Computational Science (ICCS 2002 ). Amsterdam, The Netherlands, 51--60. Monostori, K., Finkel, R. A., Zaslavsky, A., Hodasz, G., and Pataki, M. 2002. Comparison of overlap detection techniques. In International Conference on Computational Science (ICCS 2002). Amsterdam, The Netherlands, 51--60."},{"key":"e_1_2_1_15_1","first-page":"11","article-title":"Finding plagiarisms among a set of programs with jplag","volume":"8","author":"Prechelt L.","year":"2002","unstructured":"Prechelt , L. , Malpohl , G. , and Phillipsen , M. 2002 . Finding plagiarisms among a set of programs with jplag . Journal of Universal Computer Science 8 , 11 . Prechelt, L., Malpohl, G., and Phillipsen, M. 2002. Finding plagiarisms among a set of programs with jplag. Journal of Universal Computer Science 8, 11.","journal-title":"Journal of Universal Computer Science"},{"key":"e_1_2_1_16_1","volume-title":"IEEE Symposium on Information Visualisation","author":"Ribler R. L.","unstructured":"Ribler , R. L. and Abrams , M . 2000. Using visualization to detect plagiarism in computer science classes . In IEEE Symposium on Information Visualisation . Salt Lake City, Utah, 173--177. Ribler, R. L. and Abrams, M. 2000. Using visualization to detect plagiarism in computer science classes. In IEEE Symposium on Information Visualisation. Salt Lake City, Utah, 173--177."},{"key":"e_1_2_1_17_1","volume-title":"Managing Gigabytes: Compressing and Indexing Documents and Images","author":"Witten I. H.","year":"1999","unstructured":"Witten , I. H. , Moffat , A. , and Bell , T. C . 1999 . Managing Gigabytes: Compressing and Indexing Documents and Images , 2 nd Edn. Morgan Kaufmann , San Francisco , California. Witten, I. H., Moffat, A., and Bell, T. C. 1999. Managing Gigabytes: Compressing and Indexing Documents and Images, 2nd Edn. Morgan Kaufmann, San Francisco, California.","edition":"2"},{"key":"e_1_2_1_18_1","unstructured":"Woolls D. 2003. Private correspondence.  Woolls D. 2003. Private correspondence."},{"key":"e_1_2_1_19_1","volume-title":"Welcome to the home of powerful text analysis tools. online. Available: http:\/\/www.copycatchgold.com\/ (Accessed 20th","author":"Woolls D.","year":"2005","unstructured":"Woolls , D. 2004. Welcome to the home of powerful text analysis tools. online. Available: http:\/\/www.copycatchgold.com\/ (Accessed 20th April 2005 ). Woolls, D. 2004. Welcome to the home of powerful text analysis tools. online. Available: http:\/\/www.copycatchgold.com\/ (Accessed 20th April 2005)."}],"container-title":["Journal on Educational Resources in Computing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1086339.1086341","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1086339.1086341","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T16:08:12Z","timestamp":1750262892000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1086339.1086341"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2004,12]]},"references-count":19,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2004,12]]}},"alternative-id":["10.1145\/1086339.1086341"],"URL":"https:\/\/doi.org\/10.1145\/1086339.1086341","relation":{},"ISSN":["1531-4278","1531-4278"],"issn-type":[{"value":"1531-4278","type":"print"},{"value":"1531-4278","type":"electronic"}],"subject":[],"published":{"date-parts":[[2004,12]]},"assertion":[{"value":"2004-12-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}