{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,25]],"date-time":"2026-02-25T22:30:45Z","timestamp":1772058645940,"version":"3.50.1"},"reference-count":32,"publisher":"PeerJ","license":[{"start":{"date-parts":[[2020,3,23]],"date-time":"2020-03-23T00:00:00Z","timestamp":1584921600000},"content-version":"unspecified","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"abstract":"<jats:p>The vast volume of documents available in legal databases demands effective information retrieval approaches which take into consideration the intricacies of the legal domain. Relevant document retrieval is the backbone of the legal domain. The concept of relevance in the legal domain is very complex and multi-faceted. In this work, we propose a novel approach of concept based similarity estimation among court judgments. We use a graph-based method, to identify prominent concepts present in a judgment and extract sentences representative of these concepts. The sentences and concepts so mined are used to express\/visualize likeness among concepts between a pair of documents from different perspectives. We also propose to aggregate the different levels of matching so obtained into one measure quantifying the level of similarity between a judgment pair. We employ the ordered weighted average (OWA) family of aggregation operators for obtaining the similarity value. The experimental results suggest that the proposed approach of concept based similarity is effective in the extraction of relevant legal documents and performs better than other competing techniques. Additionally, the proposed two-level abstraction of similarity enables informative visualization for deeper insights into case relevance.<\/jats:p>","DOI":"10.7717\/peerj-cs.262","type":"journal-article","created":{"date-parts":[[2020,3,23]],"date-time":"2020-03-23T08:31:26Z","timestamp":1584952286000},"page":"e262","source":"Crossref","is-referenced-by-count":18,"title":["Legal document similarity: a multi-criteria decision-making perspective"],"prefix":"10.7717","volume":"6","author":[{"given":"Rupali S.","family":"Wagh","sequence":"first","affiliation":[{"name":"Department of Computer Science, JAIN Deemed to be University, Bangalore, Karnataka, India"}]},{"given":"Deepa","family":"Anand","sequence":"additional","affiliation":[{"name":"Department of Information Science and Engineering, CMR Institute of Technology, Bangalore, Karnataka, India"}]}],"member":"4443","published-online":{"date-parts":[[2020,3,23]]},"reference":[{"issue":"3","key":"10.7717\/peerj-cs.262\/ref-1","doi-asserted-by":"publisher","first-page":"215","DOI":"10.1007\/s10506-012-9131-x","article-title":"A history of AI and Law in 50 papers: 25 years of the international conference on AI and Law","volume":"20","author":"Bench-Capon","year":"2012","journal-title":"Artificial Intelligence and Law"},{"issue":"1","key":"10.7717\/peerj-cs.262\/ref-2","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1007\/s10506-017-9193-x","article-title":"Data-centric and logic-based models for automated legal problem solving","volume":"25","author":"Branting","year":"2017","journal-title":"Artificial Intelligence and Law"},{"key":"10.7717\/peerj-cs.262\/ref-3","article-title":"Compound interdependences in MOP","author":"Carlsson","year":"1996"},{"issue":"3","key":"10.7717\/peerj-cs.262\/ref-4","doi-asserted-by":"publisher","first-page":"213","DOI":"10.3233\/SW-160224","article-title":"Semantic web for the legal domain: the next step","volume":"7","author":"Casanovas","year":"2016","journal-title":"Semantic Web"},{"key":"10.7717\/peerj-cs.262\/ref-5","author":"Community detection\u00a0for Networkx\u2019s\u00a0Documentation","year":"2010"},{"issue":"5","key":"10.7717\/peerj-cs.262\/ref-6","doi-asserted-by":"crossref","first-page":"833","DOI":"10.1109\/TKDE.2018.2849727","article-title":"A survey on network embedding","volume":"31","author":"Cui","year":"2018","journal-title":"IEEE Transactions on Knowledge and Data Engineering"},{"key":"10.7717\/peerj-cs.262\/ref-7","doi-asserted-by":"crossref","DOI":"10.1109\/ISDA.2011.6121636","article-title":"Generalized louvain method for community detection in large networks","author":"De\u00a0Meo","year":"2011"},{"key":"10.7717\/peerj-cs.262\/ref-8","article-title":"models.word2vec Word2vec Embedding","author":"Gensim","year":"2014"},{"key":"10.7717\/peerj-cs.262\/ref-9","doi-asserted-by":"publisher","first-page":"78","DOI":"10.1016\/j.knosys.2018.03.022","article-title":"Graph embedding techniques, applications, and performance: a survey","volume":"151","author":"Goyal","year":"2018","journal-title":"Knowledge-Based Systems"},{"key":"10.7717\/peerj-cs.262\/ref-10","doi-asserted-by":"crossref","DOI":"10.1145\/2939672.2939754","article-title":"node2vec: scalable feature learning for networks","author":"Grover","year":"2016"},{"issue":"2","key":"10.7717\/peerj-cs.262\/ref-11","doi-asserted-by":"crossref","first-page":"243","DOI":"10.1093\/comnet\/cnx029","article-title":"Network analysis in the legal domain: a complex model for European Union legal sources","volume":"6","author":"Koniaris","year":"2017","journal-title":"Journal of Complex Networks"},{"key":"10.7717\/peerj-cs.262\/ref-12","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-642-37134-9_9","article-title":"Finding similar legal judgements under common law system","author":"Kumar","year":"2013"},{"key":"10.7717\/peerj-cs.262\/ref-13","doi-asserted-by":"crossref","DOI":"10.18653\/v1\/W16-1609","article-title":"An empirical evaluation of doc2vec with practical insights into document embedding generation","author":"Lau","year":"2016"},{"key":"10.7717\/peerj-cs.262\/ref-14","article-title":"Distributed representations of sentences and documents","author":"Le","year":"2014"},{"key":"10.7717\/peerj-cs.262\/ref-15","article-title":"Matching long text documents via graph convolutional networks","author":"Liu","year":"2018"},{"key":"10.7717\/peerj-cs.262\/ref-16","doi-asserted-by":"crossref","DOI":"10.1145\/3140107.3140119","article-title":"Measuring similarity among legal court case documents","author":"Mandal","year":"2017"},{"key":"10.7717\/peerj-cs.262\/ref-17","first-page":"63","article-title":"Overview of the FIRE 2017 IRLeD track: information retrieval from legal documents","author":"Mandal","year":"2017","journal-title":"FIRE (Working Notes)"},{"issue":"1","key":"10.7717\/peerj-cs.262\/ref-18","doi-asserted-by":"publisher","first-page":"100","DOI":"10.1017\/S1351324909005129","article-title":"Introduction to information retrieval","volume":"16","author":"Manning","year":"2010","journal-title":"Natural Language Engineering"},{"key":"10.7717\/peerj-cs.262\/ref-19","article-title":"Efficient estimation of word representations in vector space","author":"Mikolov","year":"2013"},{"key":"10.7717\/peerj-cs.262\/ref-20","article-title":"Mixing dirichlet topic models and word embeddings to make lda2vec","author":"Moody","year":"2016"},{"key":"10.7717\/peerj-cs.262\/ref-21","article-title":"Similarity analysis of court judgements using association rule mining on case citation data","author":"Nair"},{"issue":"2\u20133","key":"10.7717\/peerj-cs.262\/ref-22","doi-asserted-by":"publisher","first-page":"111","DOI":"10.1007\/s10791-017-9321-y","article-title":"Neural information retrieval: at the end of the early years","volume":"21","author":"Onal","year":"2018","journal-title":"Information Retrieval Journal"},{"key":"10.7717\/peerj-cs.262\/ref-23","first-page":"242","article-title":"Using tf-idf to determine word relevance in document queries","author":"Ramos","year":"2003"},{"issue":"2","key":"10.7717\/peerj-cs.262\/ref-24","doi-asserted-by":"publisher","first-page":"101","DOI":"10.1007\/s10506-009-9075-y","article-title":"Improving legal information retrieval using an ontological framework","volume":"17","author":"Saravanan","year":"2009","journal-title":"Artificial Intelligence and Law"},{"issue":"2","key":"10.7717\/peerj-cs.262\/ref-25","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/2542214.2542215","article-title":"A graph analytical approach for topic detection","volume":"13","author":"Sayyadi","year":"2013","journal-title":"ACM Transactions on Internet Technology (TOIT)"},{"key":"10.7717\/peerj-cs.262\/ref-26","doi-asserted-by":"crossref","DOI":"10.1007\/11941439_114","article-title":"Beyond accuracy, F-score and ROC: a family of discriminant measures for performance evaluation","volume-title":"Australasian joint conference on artificial intelligence","author":"Sokolova","year":"2006"},{"key":"10.7717\/peerj-cs.262\/ref-27","article-title":"Legal document retrieval using document vector embeddings and deep learning","author":"Sugathadasa","year":"2018"},{"key":"10.7717\/peerj-cs.262\/ref-28","article-title":"Citation analysis and beyond: in search of indicators measuring case law importance","volume-title":"JURIX","volume":"250","author":"Van\u00a0Opijnen","year":"2012"},{"key":"10.7717\/peerj-cs.262\/ref-29","doi-asserted-by":"crossref","DOI":"10.1145\/3086512.3086531","article-title":"Experimenting word embeddings in assisting legal review","author":"Vo","year":"2017"},{"key":"10.7717\/peerj-cs.262\/ref-30","first-page":"1","article-title":"Application of citation network analysis for improved similarity index estimation of legal case documents: a study","author":"Wagh","year":"2017"},{"issue":"2","key":"10.7717\/peerj-cs.262\/ref-31","doi-asserted-by":"publisher","first-page":"133","DOI":"10.1016\/S0165-0114(02)00223-3","article-title":"Fuzzy logic methods in recommender systems","volume":"136","author":"Yager","year":"2003","journal-title":"Fuzzy Sets and Systems"},{"key":"10.7717\/peerj-cs.262\/ref-32","doi-asserted-by":"publisher","first-page":"248","DOI":"10.1016\/j.procs.2017.03.087","article-title":"A graph-based approach of automatic keyphrase extraction","volume":"107","author":"Ying","year":"2017","journal-title":"Procedia Computer Science"}],"container-title":["PeerJ Computer Science"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/peerj.com\/articles\/cs-262.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/peerj.com\/articles\/cs-262.xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/peerj.com\/articles\/cs-262.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/peerj.com\/articles\/cs-262.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2020,3,23]],"date-time":"2020-03-23T08:31:35Z","timestamp":1584952295000},"score":1,"resource":{"primary":{"URL":"https:\/\/peerj.com\/articles\/cs-262"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,3,23]]},"references-count":32,"alternative-id":["10.7717\/peerj-cs.262"],"URL":"https:\/\/doi.org\/10.7717\/peerj-cs.262","archive":["CLOCKSS","LOCKSS","Portico"],"relation":{},"ISSN":["2376-5992"],"issn-type":[{"value":"2376-5992","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,3,23]]},"article-number":"e262"}}