{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,10]],"date-time":"2026-04-10T14:43:33Z","timestamp":1775832213803,"version":"3.50.1"},"reference-count":37,"publisher":"Springer Science and Business Media LLC","issue":"2","license":[{"start":{"date-parts":[[2018,10,26]],"date-time":"2018-10-26T00:00:00Z","timestamp":1540512000000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Artif Intell Law"],"published-print":{"date-parts":[[2019,6]]},"DOI":"10.1007\/s10506-018-9236-y","type":"journal-article","created":{"date-parts":[[2018,10,26]],"date-time":"2018-10-26T02:18:17Z","timestamp":1540520297000},"page":"199-225","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":24,"title":["Unsupervised and supervised text similarity systems for automated identification of national implementing measures of European directives"],"prefix":"10.1007","volume":"27","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7124-4799","authenticated-orcid":false,"given":"Rohan","family":"Nanda","sequence":"first","affiliation":[]},{"given":"Giovanni","family":"Siragusa","sequence":"additional","affiliation":[]},{"given":"Luigi","family":"Di Caro","sequence":"additional","affiliation":[]},{"given":"Guido","family":"Boella","sequence":"additional","affiliation":[]},{"given":"Lorenzo","family":"Grossio","sequence":"additional","affiliation":[]},{"given":"Marco","family":"Gerbaudo","sequence":"additional","affiliation":[]},{"given":"Francesco","family":"Costamagna","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2018,10,26]]},"reference":[{"key":"9236_CR1","unstructured":"Abadi M, Barham P, Chen J, Chen Z, Davis A, Dean J, Devin M, Ghemawat S, Irving G, Isard M et al (2016) Tensorflow: a system for large-scale machine learning. In: OSDI, vol 16, pp 265\u2013283"},{"issue":"4","key":"9236_CR2","doi-asserted-by":"publisher","first-page":"325","DOI":"10.3233\/AO-170174","volume":"2","author":"G Ajani","year":"2017","unstructured":"Ajani G, Boella G, Di Caro L, Robaldo L, Humphreys L, Praduroux S, Rossi P, Violato A (2017) The European legal taxonomy syllabus: a multi-lingual, multi-level ontology framework to untangle the web of European legal terminology. Appl Ontol 2(4):325\u2013375","journal-title":"Appl Ontol"},{"key":"9236_CR3","doi-asserted-by":"publisher","first-page":"e93","DOI":"10.7717\/peerj-cs.93","volume":"2","author":"N Aletras","year":"2016","unstructured":"Aletras N, Tsarapatsanis D, Preo\u0163iuc-Pietro D, Lampos V (2016) Predicting judicial decisions of the European court of human rights: a natural language processing perspective. PeerJ Comput Sci 2:e93","journal-title":"PeerJ Comput Sci"},{"key":"9236_CR4","unstructured":"Bergamaschi S, Po L (2014) Comparing lda and lsa topic models for content-based movie recommendation systems. In: International conference on web information systems and technologies. Springer, pp 247\u2013263"},{"key":"9236_CR5","doi-asserted-by":"crossref","unstructured":"Bird S, Loper E (2004) Nltk: the natural language toolkit. In: Proceedings of the ACL 2004 on interactive poster and demonstration sessions. Association for Computational Linguistics, p\u00a031","DOI":"10.3115\/1219044.1219075"},{"issue":"Jan","key":"9236_CR6","first-page":"993","volume":"3","author":"DM Blei","year":"2003","unstructured":"Blei DM, Ng AY, Jordan MI (2003) Latent dirichlet allocation. J Mach Learn Res 3(Jan):993\u20131022","journal-title":"J Mach Learn Res"},{"key":"9236_CR7","unstructured":"Boella G, Di\u00a0Caro L, Humphreys L, Robaldo L, van\u00a0der Torre L (2012) Nlp challenges for eunomos, a tool to build and manage legal knowledge. In: Language resources and evaluation (LREC). pp 3672\u20133678"},{"key":"9236_CR8","first-page":"218","volume-title":"Semantic relation extraction from legislative text using generalized syntactic dependencies and support vector machines","author":"G Boella","year":"2013","unstructured":"Boella G, Di Caro L, Robaldo L (2013) Semantic relation extraction from legislative text using generalized syntactic dependencies and support vector machines. Springer, Berlin, pp 218\u2013225"},{"key":"9236_CR9","doi-asserted-by":"publisher","first-page":"245","DOI":"10.1007\/s10506-016-9184-3","volume":"24","author":"G Boella","year":"2016","unstructured":"Boella G, Di Caro L, Humphreys L, Robaldo L, Rossi R, van der Torre L (2016) Eunomos, a legal document and knowledge management system for the web to provide relevant, reliable and up-to-date information on the law. Artif Intell Law 24:245\u2013283","journal-title":"Artif Intell Law"},{"key":"9236_CR10","unstructured":"Bojanowski P, Grave E, Joulin A, Mikolov T (2016) Enriching word vectors with subword information. arXiv preprint arXiv:1607.04606"},{"key":"9236_CR11","doi-asserted-by":"crossref","unstructured":"Cardellino C, Teruel M, Alemany LA, Villata S (2017) A low-cost, high-coverage legal named entity recognizer, classifier and linker. In: Proceedings of the 16th edition of the international conference on artificial intelligence and law. ACM, pp 9\u201318","DOI":"10.1145\/3086512.3086514"},{"key":"9236_CR12","unstructured":"Ciavarini Azzi G (2000) The slow march of european legislation: the implementation of directives. In: European integration after Amsterdam: institutional dynamics and prospects for democracy"},{"issue":"3","key":"9236_CR13","doi-asserted-by":"publisher","first-page":"379","DOI":"10.1109\/TC.2011.223","volume":"61","author":"G Cosma","year":"2012","unstructured":"Cosma G, Joy M (2012) An approach to source-code plagiarism detection and investigation using latent semantic analysis. IEEE Trans Comput 61(3):379\u2013394","journal-title":"IEEE Trans Comput"},{"issue":"6","key":"9236_CR14","doi-asserted-by":"publisher","first-page":"391","DOI":"10.1002\/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9","volume":"41","author":"S Deerwester","year":"1990","unstructured":"Deerwester S, Dumais ST, Furnas GW, Landauer TK, Harshman R (1990) Indexing by latent semantic analysis. J Am Soc Inf Sci 41(6):391","journal-title":"J Am Soc Inf Sci"},{"key":"9236_CR15","unstructured":"Eliantonio M, Ballesteros M, Rostane M, Petrovic D (2013) Tools for ensuring implementation and application of eu law and evaluation of their effectiveness. Technical reports on European Parliament"},{"issue":"5","key":"9236_CR16","doi-asserted-by":"publisher","first-page":"403","DOI":"10.1007\/BF02163027","volume":"14","author":"GH Golub","year":"1970","unstructured":"Golub GH, Reinsch C (1970) Singular value decomposition and least squares solutions. Numer Math 14(5):403\u2013420","journal-title":"Numer Math"},{"key":"9236_CR17","volume-title":"Statistical meta-analysis with applications","author":"J Hartung","year":"2011","unstructured":"Hartung J, Knapp G, Sinha B (2011) Statistical meta-analysis with applications, vol 738. Wiley, Hoboken"},{"key":"9236_CR18","doi-asserted-by":"crossref","unstructured":"Hong L, Davison BD (2010) Empirical study of topic modeling in twitter. In: Proceedings of the first workshop on social media analytics. ACM, pp 80\u201388","DOI":"10.1145\/1964858.1964870"},{"key":"9236_CR19","doi-asserted-by":"crossref","unstructured":"Humphreys L, Santos C, Di\u00a0Caro L, Boella G, Van Der\u00a0Torre L, Robaldo L (2015) Mapping recitals to normative provisions in eu legislation to assist legal interpretation. In: JURIX. pp 41\u201349","DOI":"10.3233\/978-1-61499-609-5-41"},{"key":"9236_CR20","doi-asserted-by":"crossref","unstructured":"Joachims T (1998) Text categorization with support vector machines: learning with many relevant features. In: European conference on machine learning. Springer, pp 137\u2013142","DOI":"10.1007\/BFb0026683"},{"key":"9236_CR21","doi-asserted-by":"crossref","unstructured":"Kenter T, De Rijke M (2015) Short text similarity with word embeddings. In: Proceedings of the 24th ACM international on conference on information and knowledge management. ACM, pp 1411\u20131420","DOI":"10.1145\/2806416.2806475"},{"key":"9236_CR22","unstructured":"Le Q, Mikolov T (2014) Distributed representations of sentences and documents. In: International conference on machine learning. pp 1188\u20131196"},{"issue":"Nov","key":"9236_CR23","first-page":"2579","volume":"9","author":"LVD Maaten","year":"2008","unstructured":"Maaten LVD, Hinton G (2008) Visualizing data using t-SNE. J Mach Learn Res 9(Nov):2579\u20132605","journal-title":"J Mach Learn Res"},{"issue":"2","key":"9236_CR24","doi-asserted-by":"publisher","first-page":"289","DOI":"10.1007\/s11192-009-0046-6","volume":"82","author":"T Magerman","year":"2010","unstructured":"Magerman T, Van Looy B, Song X (2010) Exploring the feasibility and accuracy of latent semantic analysis based text mining techniques to detect similarity between patent documents and scientific publications. Scientometrics 82(2):289\u2013306","journal-title":"Scientometrics"},{"key":"9236_CR25","doi-asserted-by":"crossref","unstructured":"Mandal A, Chaki R, Saha S, Ghosh K, Pal A, Ghosh S (2017) Measuring similarity among legal court case documents. In: Proceedings of the 10th annual ACM India compute conference, Compute \u201917. ACM, New York, pp 1\u20139","DOI":"10.1145\/3140107.3140119"},{"issue":"3","key":"9236_CR26","doi-asserted-by":"publisher","first-page":"276","DOI":"10.11613\/BM.2012.031","volume":"22","author":"ML McHugh","year":"2012","unstructured":"McHugh ML (2012) Interrater reliability: the kappa statistic. Biochem Med 22(3):276\u2013282","journal-title":"Biochem Med"},{"key":"9236_CR27","unstructured":"Mikolov T, Chen K, Corrado G, Dean J (2013) Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781"},{"key":"9236_CR28","doi-asserted-by":"crossref","unstructured":"Nanda R, Di\u00a0Caro L, Boella G (2016) A text similarity approach for automated transposition detection of European union directives. In: 29th International conference on legal knowledge and information systems, JURIX 2016, vol 294. IOS Press, pp 143\u2013148","DOI":"10.3233\/978-1-61499-726-9-143"},{"key":"9236_CR29","doi-asserted-by":"crossref","unstructured":"Nanda R, Di\u00a0Caro L, Boella G, Konstantinov H, Tyankov T, Traykov D, Hristov H, Costamagna F, Humphreys L, Robaldo L, et al (2017) A unifying similarity measure for automated identification of national implementations of European union directives. In: Proceedings of the 16th edition of the international conference on articial intelligence and law. ACM, pp 149\u2013158","DOI":"10.1145\/3086512.3086527"},{"key":"9236_CR30","doi-asserted-by":"crossref","unstructured":"Nanda R, Siragusa G, Caro LD, Theobald M, Boella G, Robaldo L, Costamagna F (2017) Concept recognition in European and national law. In: Legal knowledge and information systems\u2014JURIX 2017: the thirtieth annual conference, Luxembourg, 13\u201315 December 2017, pp 193\u2013198","DOI":"10.3233\/978-1-61499-838-9-193"},{"key":"9236_CR31","first-page":"2825","volume":"12","author":"F Pedregosa","year":"2011","unstructured":"Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay E (2011) Scikit-learn: machine learning in Python. J Mach Learn Res 12:2825\u20132830","journal-title":"J Mach Learn Res"},{"key":"9236_CR32","unstructured":"\u0158eh\u016f\u0159ek R, Sojka P (2010) Software framework for topic modelling with Large Corpora. In: Proceedings of the LREC 2010 workshop on new challenges for NLP frameworks, ELRA, Valletta, Malta, pp 45\u201350. http:\/\/is.muni.cz\/publication\/884893\/en"},{"issue":"5","key":"9236_CR33","doi-asserted-by":"publisher","first-page":"373","DOI":"10.1016\/j.jcss.2009.10.009","volume":"76","author":"L Robaldo","year":"2010","unstructured":"Robaldo L (2010) Interpretation and inference with maximal referential terms. J Comput Syst Sci 76(5):373\u2013388","journal-title":"J Comput Syst Sci"},{"issue":"2","key":"9236_CR34","doi-asserted-by":"publisher","first-page":"233","DOI":"10.1007\/s10849-010-9131-8","volume":"20","author":"L Robaldo","year":"2011","unstructured":"Robaldo L (2011) Distributivity, collectivity, and cumulativity in terms of (in)dependence and maximality. J Log Lang Inf 20(2):233\u2013271","journal-title":"J Log Lang Inf"},{"key":"9236_CR35","doi-asserted-by":"publisher","first-page":"2471","DOI":"10.1093\/logcom\/exx009","volume":"27","author":"L Robaldo","year":"2017","unstructured":"Robaldo L, Sun X (2017) Reified input\/output logic: combining input\/output logic and reification to represent norms coming from existing legislation. J Log Comput 27:2471\u20132503","journal-title":"J Log Comput"},{"key":"9236_CR36","doi-asserted-by":"crossref","unstructured":"Robaldo L, Caselli T, Russo I, Grella M (2011) From Italian text to timeml document via dependency parsing. In: Computational linguistics and intelligent text processing\u201412th international conference, CICLing 2011, Tokyo, Japan, 2011, pp 177\u2013187","DOI":"10.1007\/978-3-642-19437-5_14"},{"issue":"1","key":"9236_CR37","doi-asserted-by":"publisher","first-page":"11","DOI":"10.1108\/eb026526","volume":"28","author":"K Sparck Jones","year":"1972","unstructured":"Sparck Jones K (1972) A statistical interpretation of term specificity and its application in retrieval. J Doc 28(1):11\u201321","journal-title":"J Doc"}],"container-title":["Artificial Intelligence and Law"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10506-018-9236-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s10506-018-9236-y\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10506-018-9236-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,3]],"date-time":"2026-04-03T21:54:05Z","timestamp":1775253245000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s10506-018-9236-y"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,10,26]]},"references-count":37,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2019,6]]}},"alternative-id":["9236"],"URL":"https:\/\/doi.org\/10.1007\/s10506-018-9236-y","relation":{},"ISSN":["0924-8463","1572-8382"],"issn-type":[{"value":"0924-8463","type":"print"},{"value":"1572-8382","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,10,26]]},"assertion":[{"value":"26 October 2018","order":1,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}