{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,4]],"date-time":"2026-06-04T06:18:14Z","timestamp":1780553894867,"version":"3.54.1"},"reference-count":326,"publisher":"Springer Science and Business Media LLC","issue":"4","license":[{"start":{"date-parts":[[2023,11,20]],"date-time":"2023-11-20T00:00:00Z","timestamp":1700438400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,11,20]],"date-time":"2023-11-20T00:00:00Z","timestamp":1700438400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100007069","name":"Universit\u00e0 della Calabria","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100007069","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Artif Intell Law"],"published-print":{"date-parts":[[2024,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Transformer-based language models (TLMs) have widely been recognized to be a cutting-edge technology for the successful development of deep-learning-based solutions to problems and applications that require natural language processing and understanding. Like for other textual domains, TLMs have indeed pushed the state-of-the-art of AI approaches for many tasks of interest in the legal domain. Despite the first Transformer model being proposed about six years ago, there has been a rapid progress of this technology at an unprecedented rate, whereby BERT and related models represent a major reference, also in the legal domain. This article provides the first systematic overview of TLM-based methods for AI-driven problems and tasks in the legal sphere. A major goal is to highlight research advances in this field so as to understand, on the one hand, how the Transformers have contributed to the success of AI in supporting legal processes, and on the other hand, what are the current limitations and opportunities for further research development.<\/jats:p>","DOI":"10.1007\/s10506-023-09374-7","type":"journal-article","created":{"date-parts":[[2023,11,20]],"date-time":"2023-11-20T09:02:31Z","timestamp":1700470951000},"page":"863-1010","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":41,"title":["Bringing order into the realm of Transformer-based language models for artificial intelligence and law"],"prefix":"10.1007","volume":"32","author":[{"given":"Candida M.","family":"Greco","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8142-503X","authenticated-orcid":false,"given":"Andrea","family":"Tagarelli","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2023,11,20]]},"reference":[{"key":"9374_CR1","doi-asserted-by":"crossref","unstructured":"Aguiar A, Silveira R, Pinheiro V, Furtado V, Neto JA (2021) Text classification in legal documents extracted from lawsuits in Brazilian courts. In: Proceedings of the Brazilian conference on intelligent systems (BRACIS), pp 586\u2013600","DOI":"10.1007\/978-3-030-91699-2_40"},{"key":"9374_CR2","doi-asserted-by":"crossref","unstructured":"Ahmad WU, Chi J, Le T, Norton T, Tian Y, Chang K (2021) Intent classification and slot filling for privacy policies. In: Proceedings of the annual meeting of the association for computational linguistics and the international joint conference on natural language processing (ACL\/IJCNLP). Association for Computational Linguistics, pp 4402\u20134417","DOI":"10.18653\/v1\/2021.acl-long.340"},{"key":"9374_CR3","doi-asserted-by":"crossref","unstructured":"Ahmad WU, Chi J, Tian Y, Chang K (2020) PolicyQA: a reading comprehension dataset for privacy policies. In: Findings of the Association for Computational Linguistics: EMNLP, findings of ACL. Association for Computational Linguistics, pp 743\u2013749","DOI":"10.18653\/v1\/2020.findings-emnlp.66"},{"key":"9374_CR4","doi-asserted-by":"crossref","unstructured":"Alberts H, Ipek A, Lucas R, Wozny P (2020) COLIEE 2020: legal information retrieval and entailment with legal embeddings and boosting. In: New frontiers in artificial intelligence\u2014JSAI-isAI 2020 workshops, JURISIN, LENLS 2020 Workshops, volume 12758 of lecture notes in computer science. Springer, pp 211\u2013225","DOI":"10.1007\/978-3-030-79942-7_14"},{"key":"9374_CR5","doi-asserted-by":"crossref","unstructured":"Aletras N, Tsarapatsanis D, Preotiuc-Pietro D, Lampos V (2016) Predicting judicial decisions of the European Court of Human Rights: a natural language processing perspective. PeerJ Comput Sci 2:e93","DOI":"10.7717\/peerj-cs.93"},{"key":"9374_CR6","doi-asserted-by":"crossref","unstructured":"Allan J, Harman D, Kanoulas E, Li D, Gysel CV, Voorhees, EM (2017) TREC 2017 common core track overview. In: Proceedings of the Text REtrieval conference (TREC), volume 500-324 of NIST Special Publication. National Institute of Standards and Technology (NIST)","DOI":"10.6028\/NIST.SP.500-324.core-overview"},{"key":"9374_CR7","unstructured":"Althammer S, Askari A, Verberne S, Hanbury A (2021) DoSSIER@COLIEE 2021: leveraging dense retrieval and summarization-based re-ranking for case law retrieval. CoRR, arXiv:2108.03937"},{"key":"9374_CR8","doi-asserted-by":"crossref","unstructured":"Amos R, Acar G, Lucherini E, Kshirsagar M, Narayanan A, Mayer JR (2021) Privacy policies over time: curation and analysis of a million-document dataset. In: Proceedings of the ACM web conference (WWW). ACM, pp 2165\u20132176","DOI":"10.1145\/3442381.3450048"},{"key":"9374_CR9","unstructured":"Antoun W, Baly F, Hajj HM (2020) AraBERT: Transformer-based model for Arabic language understanding. CoRR, arXiv:2003.00104"},{"key":"9374_CR10","doi-asserted-by":"crossref","unstructured":"Artetxe M, Bhosale S, Goyal N, Mihaylov T, Ott M, Shleifer S, Lin XV, Du J, Iyer S, Pasunuru R, Anantharaman G, Li X, Chen S, Akin H, Baines M, Martin L, Zhou X, Koura PS, O\u2019Horo B, Wang J, Zettlemoyer L, Diab MT, Kozareva Z, Stoyanov V (2022) Efficient large scale language modeling with mixtures of experts. In: Proceedings of the conference on empirical methods in natural language processing (EMNLP). Association for Computational Linguistics, pp 11699\u201311732","DOI":"10.18653\/v1\/2022.emnlp-main.804"},{"key":"9374_CR11","doi-asserted-by":"crossref","unstructured":"Artetxe, M., Ruder, S., and Yogatama, D. (2020). On the cross-lingual transferability of monolingual representations. In Proc. of the Annual Meeting of the Association for Computational Linguistics (ACL), pages 4623\u20134637. Association for Computational Linguistics","DOI":"10.18653\/v1\/2020.acl-main.421"},{"key":"9374_CR12","unstructured":"Askari A, Verberne S (2021) Combining lexical and neural retrieval with Longformer-based summarization for effective case law retrieval. In: Proceedings of the international conference on design of experimental search & information retrieval systems, volume 2950 of CEUR workshop proceedings. CEUR-WS.org, pp 162\u2013170"},{"key":"9374_CR13","unstructured":"Athinaios K (2020) Named entity recognition using a novel linguistic model for greek legal corpora based on BERT model. BS Thesis, School of Science, Department of Informatics and Telecommunications"},{"key":"9374_CR14","doi-asserted-by":"crossref","unstructured":"Aumiller D, Almasian S, Lackner S, Gertz M (2021) Structural text segmentation of legal documents. In: Proceedings of the international conference on artificial intelligence and law (ICAIL). ACM, pp 2\u201311","DOI":"10.1145\/3462757.3466085"},{"key":"9374_CR15","doi-asserted-by":"crossref","unstructured":"Aumiller D, Chouhan A, Gertz M (2022) EUR-Lex-Sum: a multi- and cross-lingual dataset for long-form summarization in the legal domain. In: Proceedings of the conference on empirical methods in natural language processing (EMNLP). Association for Computational Linguistics, pp 7626\u20137639","DOI":"10.18653\/v1\/2022.emnlp-main.519"},{"key":"9374_CR16","doi-asserted-by":"crossref","unstructured":"Avram A, Pais VF, Tufis DI (2021) PyEuroVoc: a tool for multilingual legal document classification with EuroVoc descriptors. In: Proceedings of the international conference on recent advances in natural language processing (RANLP), pp 92\u2013101","DOI":"10.26615\/978-954-452-072-4_012"},{"key":"9374_CR17","doi-asserted-by":"crossref","unstructured":"Aydemir A, de\u00a0Castro\u00a0Souza P, Gelfman A (2020) Using BERT and TF-IDF to predict entailment in law-based queries. In: New Frontiers in artificial intelligence\u2014JSAI-isAI 2020 workshops, JURISIN, LENLS 2020 workshops, volume 12758 of lecture notes in computer science. Springer, pp 286\u2013293","DOI":"10.1007\/978-3-030-79942-7_19"},{"key":"9374_CR18","unstructured":"Bahdanau D, Cho K, Bengio Y (2015) Neural machine translation by jointly learning to align and translate. In: Proceedings of the international conference on learning representations (ICLR)"},{"key":"9374_CR19","unstructured":"Bao H, Dong L, Wei F, Wang W, Yang N, Liu X, Wang Y, Gao J, Piao S, Zhou M, Hon H (2020) UniLMv2: pseudo-masked language models for unified language model pre-training. In: Proceedings of the international conference on machine learning (ICML), pp 642\u2013652"},{"key":"9374_CR20","unstructured":"Barrios F, L\u00f3pez F, Argerich, L, Wachenchauzer R (2016) Variations of the similarity function of TextRank for automated summarization. CoRR, arXiv:1602.03606"},{"key":"9374_CR21","doi-asserted-by":"crossref","unstructured":"Baudis P, Sediv\u00fd J (2015) Modeling of the question answering task in the YodaQA system. In: Proceedings of the international conference of the CLEF Association: experimental IR meets multilinguality, multimodality, and interaction, volume 9283 of lecture notes in computer science. Springer, pp 222\u2013228","DOI":"10.1007\/978-3-319-24027-5_20"},{"key":"9374_CR22","unstructured":"Beltagy I, Peters ME, Cohan A (2020) Longformer: the long-document Transformer. CoRR, arXiv:2004.05150"},{"key":"9374_CR23","unstructured":"Berant J, Chou A, Frostig R, Liang P (2013) Semantic parsing on freebase from question-answer pairs. In: Proceedings of the conference on empirical methods in natural language processing (EMNLP). ACL, pp 1533\u20131544"},{"key":"9374_CR24","doi-asserted-by":"crossref","unstructured":"Bhattacharya P, Ghosh K, Pal A, Ghosh S (2020a) Hier-spcnet: a legal statute hierarchy-based heterogeneous network for computing legal case document similarity. In: Proceedings of the ACM SIGIR conference on research and development in information retrieval (SIGIR). ACM, pp 1657\u20131660","DOI":"10.1145\/3397271.3401191"},{"key":"9374_CR25","doi-asserted-by":"crossref","unstructured":"Bhattacharya P, Ghosh K, Pal A, Ghosh S (2020b) Hier-SPCNet: a legal statute hierarchy-based heterogeneous network for computing legal case document similarity. In: Proceedings of the ACM SIGIR conference on research and development in information retrieval (SIGIR). ACM, pp 1657\u20131660","DOI":"10.1145\/3397271.3401191"},{"key":"9374_CR26","doi-asserted-by":"crossref","unstructured":"Bhattacharya P, Hiware K, Rajgaria S, Pochhi N, Ghosh K, Ghosh S (2019a) A comparative study of summarization algorithms applied to legal case judgments. In: Proceedings of the European conference on IR research (ECIR), volume 11437 of lecture notes in computer science. Springer, pp 413\u2013428","DOI":"10.1007\/978-3-030-15712-8_27"},{"key":"9374_CR27","unstructured":"Bhattacharya P, Paul S, Ghosh K, Ghosh S, Wyner A (2019b) Identification of rhetorical roles of sentences in Indian legal judgments. In: Proceedings of the international conference on legal knowledge and information systems (JURIX)"},{"key":"9374_CR28","doi-asserted-by":"crossref","unstructured":"Bhattacharya P, Paul S, Ghosh K, Ghosh S, Wyner A (2021) DeepRhole: deep learning for rhetorical role labeling of sentences in legal case documents. Artif Intell Law 6:66","DOI":"10.1007\/s10506-021-09304-5"},{"issue":"2","key":"9374_CR29","doi-asserted-by":"publisher","first-page":"149","DOI":"10.1007\/s10506-020-09270-4","volume":"29","author":"A Bibal","year":"2021","unstructured":"Bibal A, Lognoul M, de Streel A, Fr\u00e9nay B (2021) Legal requirements on explainability in machine learning. Artif Intell Law 29(2):149\u2013169","journal-title":"Artif Intell Law"},{"key":"9374_CR30","doi-asserted-by":"crossref","unstructured":"Black S, Biderman S, Hallahan E, Anthony Q, Gao L, Golding L, He H, Leahy C, McDonell K, Phang J, Pieler M, Prashanth US, Purohit S, Reynolds L, Tow J, Wang B, Weinbach S (2022) Gpt-neox-20b: an open-source autoregressive language model. CoRR, arXiv:2204.06745","DOI":"10.18653\/v1\/2022.bigscience-1.9"},{"key":"9374_CR31","doi-asserted-by":"crossref","unstructured":"Blair-Stanek A, Holzenberger N, Durme BV (2023) Can GPT-3 perform statutory reasoning? CoRR, arXiv:2302.06100","DOI":"10.1145\/3594536.3595163"},{"key":"9374_CR32","doi-asserted-by":"publisher","first-page":"135","DOI":"10.1162\/tacl_a_00051","volume":"5","author":"P Bojanowski","year":"2017","unstructured":"Bojanowski P, Grave E, Joulin A, Mikolov T (2017) Enriching word vectors with subword information. Trans Assoc Comput Linguist 5:135\u2013146","journal-title":"Trans Assoc Comput Linguist"},{"key":"9374_CR33","doi-asserted-by":"crossref","unstructured":"Bowman SR, Angeli G, Potts C, Manning CD (2015) A large annotated corpus for learning natural language inference. In: Proceedings of the conference on empirical methods in natural language processing (EMNLP). The Association for Computational Linguistics, pp 632\u2013642","DOI":"10.18653\/v1\/D15-1075"},{"key":"9374_CR34","unstructured":"Brack A, Hoppe A, Buscherm\u00f6hle P, Ewerth R (2021) Sequential sentence classification in research papers using cross-domain multi-task learning. CoRR, arXiv:2102.06008"},{"issue":"2","key":"9374_CR35","doi-asserted-by":"publisher","first-page":"213","DOI":"10.1007\/s10506-020-09273-1","volume":"29","author":"LK Branting","year":"2021","unstructured":"Branting LK, Pfeifer C, Brown B, Ferro L, Aberdeen JS, Weiss B, Pfaff M, Liao B (2021) Scalable and explainable legal prediction. Artif Intell Law 29(2):213\u2013238","journal-title":"Artif Intell Law"},{"key":"9374_CR36","unstructured":"Brown TB, Mann B, Ryder N, Subbiah M, Kaplan J, Dhariwal P, Neelakantan A, Shyam P, Sastry G, Askell A, Agarwal S, Herbert-Voss A, Krueger G, Henighan T, Child R, Ramesh A, Ziegler DM, Wu J, Winter C, Hesse C, Chen M, Sigler E, Litwin M, Gray S, Chess B, Clark J, Berner C, McCandlish S, Radford A, Sutskever I, Amodei D (2020a) Language models are few-shot learners. In: Proceedings of the annual conference on neural information processing systems (NeurIPS)"},{"key":"9374_CR37","unstructured":"Brown TB, Mann B, Ryder N, Subbiah M, Kaplan J, Dhariwal P, Neelakantan A, Shyam P, Sastry G, Askell A, Agarwal S, Herbert-Voss A, Krueger G, Henighan T, Child R, Ramesh A, Ziegler DM, Wu J, Winter C, Hesse C, Chen M, Sigler E, Litwin M, Gray S, Chess B, Clark J, Berner C, McCandlish S, Radford A, Sutskever I, Amodei D (2020b) Language models are few-shot learners. In: Proceedings of the annual conference on neural information processing systems (NeurIPS)"},{"issue":"2","key":"9374_CR38","doi-asserted-by":"publisher","first-page":"88","DOI":"10.2478\/popets-2021-0019","volume":"2021","author":"D Bui","year":"2021","unstructured":"Bui D, Shin KG, Choi J, Shin J (2021) Automated extraction and presentation of data practices in privacy policies. Proc Privacy Enhanc Technol 2021(2):88\u2013110","journal-title":"Proc Privacy Enhanc Technol"},{"issue":"2","key":"9374_CR39","first-page":"57","volume":"10","author":"K Burton","year":"2017","unstructured":"Burton K (2017) \u201cThink Like a Lawyer\u2019\u2019 using a legal reasoning grid and criterion-referenced assessment rubric on IRAC (Issue, Rule, Application, Conclusion). J Learn Des 10(2):57\u201368","journal-title":"J Learn Des"},{"issue":"161","key":"9374_CR40","first-page":"66","volume":"112","author":"PD Callister","year":"2020","unstructured":"Callister PD (2020) Law, artificial intelligence, and natural language processing: a funny thing happened on the way to my search results. Law Libr J 112(161):66","journal-title":"Law Libr J"},{"key":"9374_CR41","doi-asserted-by":"crossref","unstructured":"Chalkidis I, Androutsopoulos I, Aletras N (2019a) Neural legal judgment prediction in English. In: Proceedings of the annual meeting of the Association for Computational Linguistics (ACL). Association for Computational Linguistics, pp 4317\u20134323","DOI":"10.18653\/v1\/P19-1424"},{"key":"9374_CR42","unstructured":"Chalkidis I, Dai X, Fergadiotis M, Malakasiotis P, Elliott D (2022a) An exploration of hierarchical attention transformers for efficient long document classification. CoRR, arXiv:2210.05529"},{"key":"9374_CR43","doi-asserted-by":"crossref","unstructured":"Chalkidis I, Fergadiotis M, Androutsopoulos I (2021a) MultiEURLEX\u2014a multi-lingual and multi-label legal document classification dataset for zero-shot cross-lingual transfer. In: Proceedings of the conference on empirical methods in natural language processing (EMNLP). Association for Computational Linguistics, pp 6974\u20136996","DOI":"10.18653\/v1\/2021.emnlp-main.559"},{"key":"9374_CR44","doi-asserted-by":"crossref","unstructured":"Chalkidis I, Fergadiotis M, Kotitsas S, Malakasiotis P, Aletras N, Androutsopoulos I (2020a) An empirical study on large-scale multi-label text classification including few and zero-shot labels. In: Proceedings of the conference on empirical methods in natural language processing (EMNLP). Association for Computational Linguistics, pp 7503\u20137515","DOI":"10.18653\/v1\/2020.emnlp-main.607"},{"key":"9374_CR45","doi-asserted-by":"crossref","unstructured":"Chalkidis I, Fergadiotis M, Malakasiotis P, Aletras N, Androutsopoulos I (2020b) LEGAL-BERT: the muppets straight out of law school. CoRR, arXiv:2010.02559","DOI":"10.18653\/v1\/2020.findings-emnlp.261"},{"key":"9374_CR46","doi-asserted-by":"crossref","unstructured":"Chalkidis I, Fergadiotis M, Malakasiotis P, Androutsopoulos I (2019b) Large-scale multi-label text classification on EU legislation. In: Proceedings of the annual meeting of the Association for Computational Linguistics (ACL). Association for Computational Linguistics, pp 6314\u20136322","DOI":"10.18653\/v1\/P19-1636"},{"key":"9374_CR47","doi-asserted-by":"crossref","unstructured":"Chalkidis I, Fergadiotis M, Manginas N, Katakalou E, Malakasiotis P (2021b) Regulatory Compliance through Doc2Doc information retrieval: a case study in EU\/UK legislation where text similarity has limitations. In: Proceedings of the conference of the European chapter of the Association for Computational Linguistics (EACL). Association for Computational Linguistics, pp 3498\u20133511","DOI":"10.18653\/v1\/2021.eacl-main.305"},{"key":"9374_CR48","doi-asserted-by":"crossref","unstructured":"Chalkidis I, Fergadiotis M, Tsarapatsanis D, Aletras N, Androutsopoulos I, Malakasiotis P (2021c) Paragraph-level rationale extraction through regularization: a case study on European Court of human rights cases. In: Proceedings of the conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT). Association for Computational Linguistics, pp 226\u2013241","DOI":"10.18653\/v1\/2021.naacl-main.22"},{"key":"9374_CR49","doi-asserted-by":"crossref","unstructured":"Chalkidis I, Jana A, Hartung D II, Meon JB, Androutsopoulos I, Katz DM, Aletras N (2022b) LexGLUE: a benchmark dataset for legal language understanding in English. In: Proceedings of the annual meeting of the Association for Computational Linguistics (ACL). Association for Computational Linguistics, pp 4310\u20134330","DOI":"10.18653\/v1\/2022.acl-long.297"},{"key":"9374_CR50","doi-asserted-by":"crossref","unstructured":"Chalkidis I, Nikolaou C, Soursos P, Koubarakis M (2017) Modeling and querying greek legislation using semantic web technologies. In: Proceedings of the international conference on the semantic web (ESWC), volume 10249 of lecture notes in computer science, pp 591\u2013606","DOI":"10.1007\/978-3-319-58068-5_36"},{"key":"9374_CR51","doi-asserted-by":"crossref","unstructured":"Chalkidis I, Pasini T, Zhang S, Tomada L, Schwemer SF, S\u00f8gaard A (2022c) Fairlex: a multilingual benchmark for evaluating fairness in legal text processing. In: Proceedings of the annual meeting of the Association for Computational Linguistics (ACL). Association for Computational Linguistics, pp 4389\u20134406","DOI":"10.18653\/v1\/2022.acl-long.301"},{"key":"9374_CR52","doi-asserted-by":"crossref","unstructured":"Chan B, Schweter S, M\u00f6ller T (2020) German\u2019s next language model. In: Proceedings of the International conference on computational linguistics (COLING). International Committee on Computational Linguistics, pp 6788\u20136796","DOI":"10.18653\/v1\/2020.coling-main.598"},{"key":"9374_CR53","unstructured":"Chen T, He T, Benesty M, Khotilovich V, Tang Y, Cho H, Chen K et\u00a0al (2015) Xgboost: extreme gradient boosting. R package version 0.4-2, 1(4):1\u20134"},{"key":"9374_CR54","doi-asserted-by":"crossref","unstructured":"Chi J, Ahmad WU, Tian Y, Chang K (2023) PLUE: language understanding evaluation benchmark for privacy policies in English. In: Proceedings of the annual meeting of the Association for Computational Linguistics (ACL). Association for Computational Linguistics, pp 352\u2013365","DOI":"10.18653\/v1\/2023.acl-short.31"},{"key":"9374_CR55","first-page":"66","volume":"6","author":"JH Choi","year":"2023","unstructured":"Choi JH, Hickman KE, Monahan A, Schwarcz D (2023) ChatGPT goes to law school. J Legal Educ 6:66","journal-title":"J Legal Educ"},{"key":"9374_CR56","unstructured":"Clark K, Luong M, Le QV, Manning CD (2020) ELECTRA: pre-training text encoders as discriminators rather than generators. In: Proceedings of the international conference on learning representations (ICLR)"},{"key":"9374_CR57","doi-asserted-by":"crossref","unstructured":"Cohan A, Beltagy I, King D, Dalvi B, Weld DS (2019) Pretrained language models for sequential sentence classification. In: Proceedings of the conference on empirical methods in natural language processing and the international joint conference on natural language processing (EMNLP-IJCNLP). Association for Computational Linguistics, pp 3691\u20133697","DOI":"10.18653\/v1\/D19-1383"},{"key":"9374_CR58","doi-asserted-by":"crossref","unstructured":"Conneau A, Khandelwal K, Goyal N, Chaudhary V, Wenzek G, Guzm\u00e1n F, Grave E, Ott M, Zettlemoyer L, Stoyanov V (2020) Unsupervised cross-lingual representation learning at scale. In: Proceedings of the annual meeting of the Association for Computational Linguistics (ACL). Association for Computational Linguistics, pp 8440\u20138451","DOI":"10.18653\/v1\/2020.acl-main.747"},{"key":"9374_CR59","unstructured":"Conneau A, Lample G (2019) Cross-lingual language model pretraining. In: Proceedings of the annual conference on neural information processing systems (NeurIPS), pp 7057\u20137067"},{"key":"9374_CR60","doi-asserted-by":"crossref","unstructured":"Conneau A, Rinott R, Lample G, Williams A, Bowman SR, Schwenk H, Stoyanov V (2018) XNLI: evaluating cross-lingual sentence representations. In: Proceedings of the conference on empirical methods in natural language processing (EMNLP). Association for Computational Linguistics, pp 2475\u20132485","DOI":"10.18653\/v1\/D18-1269"},{"key":"9374_CR61","doi-asserted-by":"crossref","unstructured":"Cui Y, Che W, Liu T, Qin B, Wang S, Hu G (2020) Revisiting pre-trained models for Chinese natural language processing. In: Findings of the Association for Computational Linguistics: EMNLP. Association for Computational Linguistics, pp 657\u2013668","DOI":"10.18653\/v1\/2020.findings-emnlp.58"},{"key":"9374_CR62","unstructured":"Cui Y, Yang Z, Liu T (2022) PERT: pre-training BERT with permuted language model. CoRR, arXiv:2203.06906"},{"issue":"1","key":"9374_CR63","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1007\/s10506-020-09261-5","volume":"29","author":"F Dadgostari","year":"2021","unstructured":"Dadgostari F, Guim M, Beling P, Livermore MA, Rockmore D (2021) Modeling law search as prediction. Artif Intell Law 29(1):3\u201334","journal-title":"Artif Intell Law"},{"key":"9374_CR64","doi-asserted-by":"crossref","unstructured":"Dadu T, Pant K (2020) Team rouges at SemEval-2020 Task 12: cross-lingual inductive transfer to detect offensive language. In: Proceedings of the fourteenth workshop on semantic evaluation, SemEval@COLING 2020. International Committee for Computational Linguistics, pp 2183\u20132189","DOI":"10.18653\/v1\/2020.semeval-1.290"},{"key":"9374_CR65","doi-asserted-by":"crossref","unstructured":"Dai Z, Yang Z, Yang Y, Carbonell JG, Le QV, Salakhutdinov R (2019) Transformer-XL: attentive language models beyond a fixed-length context. In: Proceedings of the annual meeting of the association for computational linguistics (ACL). Association for Computational Linguistics, pp 2978\u20132988","DOI":"10.18653\/v1\/P19-1285"},{"key":"9374_CR66","unstructured":"de\u00a0Vries W, van Cranenburgh A, Bisazza A, Caselli T, van Noord G, Nissim M (2019) Bertje: a dutch BERT model. CoRR, arXiv:1912.09582"},{"key":"9374_CR67","doi-asserted-by":"crossref","unstructured":"Delobelle P, Winters T, Berendt B (2020) RobBERT: a Dutch RoBERTa-based language model. In: Findings of the Association for Computational Linguistics: EMNLP. Association for Computational Linguistics, pp 3255\u20133265","DOI":"10.18653\/v1\/2020.findings-emnlp.292"},{"key":"9374_CR68","doi-asserted-by":"crossref","unstructured":"Deroy A, Bhattacharya P, Ghosh K, Ghosh S (2021) An analytical study of algorithmic and expert summaries of legal cases. In: Proceedings of the conference on legal knowledge and information systems (JURIX), volume 346 of frontiers in artificial intelligence and applications. IOS Press, pp 90\u201399","DOI":"10.3233\/FAIA210322"},{"key":"9374_CR69","unstructured":"Devlin J, Chang M, Lee K, Toutanova K (2019) BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the conference of the North American chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT), pp 4171\u20134186"},{"key":"9374_CR70","unstructured":"Doddapaneni S, Ramesh G, Kunchukuttan A, Kumar P, Khapra MM (2021) A primer on pretrained multilingual language models. CoRR, arXiv:2107.00676"},{"key":"9374_CR71","unstructured":"Dong L, Yang N, Wang W, Wei F, Liu X, Wang Y, Gao J, Zhou M, Hon H (2019) Unified language model pre-training for natural language understanding and generation. In: Proceedings of the annual conference on neural information processing systems (NeurIPS), pp 13042\u201313054"},{"key":"9374_CR72","doi-asserted-by":"crossref","unstructured":"Dong Q, Niu S (2021) Legal judgment prediction via relational learning. In: Proceedings of the ACM SIGIR conference on research and development in information retrieval (SIGIR). ACM, pp 983\u2013992","DOI":"10.1145\/3404835.3462931"},{"key":"9374_CR73","doi-asserted-by":"crossref","unstructured":"Douka S, Abdine H, Vazirgiannis M, Hamdani RE, Amariles DR (2021) JuriBERT: a masked-language model adaptation for French legal text. In: Proceedings of the 2021 workshop on natural legal language processing (NLLP). ACL, pp 95\u2013101","DOI":"10.18653\/v1\/2021.nllp-1.9"},{"key":"9374_CR74","doi-asserted-by":"crossref","unstructured":"Drawzeski K, Galassi A, Jablonowska A, Lagioia F, Lippi M, Micklitz HW, Sartor G, Tagiuri G, Torroni P (2021) A corpus for multilingual analysis of online terms of service. In: Proceedings of the 2021 workshop on natural legal language processing (NLLP). Association for Computational Linguistics, pp 1\u20138","DOI":"10.18653\/v1\/2021.nllp-1.1"},{"key":"9374_CR75","doi-asserted-by":"crossref","unstructured":"Dumitrescu SD, Avram A, Pyysalo S (2020) The birth of Romanian BERT. In: Findings of the Association for Computational Linguistics: EMNLP. Association for Computational Linguistics, pp 4324\u20134328","DOI":"10.18653\/v1\/2020.findings-emnlp.387"},{"key":"9374_CR76","unstructured":"Eddine MK, Tixier AJ, Vazirgiannis M (2021) BARThez: a skilled pretrained french sequence-to-sequence model. In: Proceedings of the conference on empirical methods in natural language processing (EMNLP). Association for Computational Linguistics, pp 9369\u20139390"},{"key":"9374_CR77","unstructured":"Elwany E, Moore D, Oberoi G (2019) BERT goes to law school: quantifying the competitive advantage of access to large legal corpora in contract understanding. CoRR, arXiv:1911.00473"},{"key":"9374_CR78","doi-asserted-by":"crossref","unstructured":"Engstrom DF, Ho DE, Sharkey CM, Cu\u00e9llar M-F (2020) Government by algorithm: artificial intelligence in federal administrative agencies. NYU School of Law, Public Law Research Paper, pp 20\u201354","DOI":"10.2139\/ssrn.3551505"},{"key":"9374_CR79","first-page":"11","volume":"66","author":"A Farzindar","year":"2004","unstructured":"Farzindar A, Lapalme G (2004) Letsum, an automatic legal text summarizing system. Jurix 66:11\u201318","journal-title":"Jurix"},{"key":"9374_CR80","doi-asserted-by":"crossref","unstructured":"Feij\u00f3 D, Moreira V (2018) RulingBR: a summarization dataset for legal texts. In: Proceedings of the conference on computational processing of the Portuguese language (PROPOR), volume 11122 of lecture notes in computer science. Springer, pp 255\u2013264","DOI":"10.1007\/978-3-319-99722-3_26"},{"key":"9374_CR81","doi-asserted-by":"crossref","unstructured":"Feij\u00f3 D, Moreira V (2019) Summarizing legal rulings: comparative experiments. In: Proceedings of the international conference on recent advances in natural language processing (RANLP), pp 313\u2013322","DOI":"10.26615\/978-954-452-056-4_036"},{"key":"9374_CR82","doi-asserted-by":"crossref","unstructured":"Feng Y, Li C, Ng V (2022) Legal judgment prediction via event extraction with constraints. In: Proceedings of the annual meeting of the Association for Computational Linguistics (ACL). Association for Computational Linguistics, pp 648\u2013664","DOI":"10.18653\/v1\/2022.acl-long.48"},{"issue":"2","key":"9374_CR83","doi-asserted-by":"publisher","first-page":"147","DOI":"10.1007\/s10506-022-09309-8","volume":"30","author":"E Francesconi","year":"2022","unstructured":"Francesconi E (2022) The winter, the summer and the summer dream of artificial intelligence in law. Artif Intell Law 30(2):147\u2013161","journal-title":"Artif Intell Law"},{"key":"9374_CR84","doi-asserted-by":"crossref","unstructured":"Fu X, Zhang J, Meng Z, King I (2020) MAGNN: metapath aggregated graph neural network for heterogeneous graph embedding. In: Proceedings of the ACM Web conference (WWW). ACM, pp 2331\u20132341","DOI":"10.1145\/3366423.3380297"},{"key":"9374_CR85","unstructured":"Furniturewala S, Jain R, Kumari V, Sharma Y (2021) Legal text classification and summarization using transformers and joint text features. In: Working notes of FIRE 2021\u2014forum for information retrieval evaluation, volume 3159 of CEUR workshop proceedings. CEUR-WS.org, pp 541\u2013546"},{"key":"9374_CR86","unstructured":"Gain B, Bandyopadhyay D, Saikh T, Ekbal A (2019) IITP@COLIEE 2019: legal information retrieval using BM25 and BERT. In: Proceedings of COLIEE 2019 workshop: competition on legal information extraction\/entailment"},{"key":"9374_CR87","doi-asserted-by":"crossref","unstructured":"Gan L, Kuang K, Yang Y, Wu F (2021) Judgment prediction via injecting legal knowledge into neural networks. In: Proceedings of the AAAI conference on artificial intelligence (AAAI), the conference on innovative applications of artificial intelligence (IAAI), the AAAI symposium on educational advances in artificial intelligence (EAAI). AAAI Press, pp 12866\u201312874","DOI":"10.1609\/aaai.v35i14.17522"},{"key":"9374_CR88","unstructured":"Gao J, Ning H, Han Z, Kong L, Qi H (2020) Legal text classification model based on text statistical features and deep semantic features. In: Working notes of FIRE 2020\u2014forum for information retrieval evaluation, volume 2826 of CEUR workshop proceedings. CEUR-WS.org, pp 35\u201341"},{"key":"9374_CR89","unstructured":"Gao L, Biderman S, Black S, Golding L, Hoppe T, Foster C, Phang J, He H, Thite A, Nabeshima N, Presser S, Leahy C (2021) The pile: an 800gb dataset of diverse text for language modeling. CoRR, arXiv:2101.00027"},{"key":"9374_CR90","doi-asserted-by":"crossref","unstructured":"Garneau N, Gaumond E, Lamontagne L, D\u00e9ziel P (2021) Criminelbart: a French Canadian legal language model specialized in criminal law. In: Proceedings of the international conference on artificial intelligence and law (ICAIL). ACM, pp 256\u2013257","DOI":"10.1145\/3462757.3466147"},{"key":"9374_CR91","doi-asserted-by":"crossref","unstructured":"Goebel R, Kano Y, Kim M-Y, Rabelo J, Satoh K, Yoshioka M (eds) (2021) Proceedings of the eighth international competition on legal information extraction\/entailment (COLIEE 2021)","DOI":"10.1007\/s12626-022-00108-w"},{"key":"9374_CR92","doi-asserted-by":"crossref","unstructured":"Goyal N, Du J, Ott M, Anantharaman G, Conneau, A (2021) Larger-scale transformers for multilingual masked language modeling. In: Proceedings of the workshop on representation learning for NLP, RepL4NLP@ACL-IJCNLP 2021. Association for Computational Linguistics, pp 29\u201333","DOI":"10.18653\/v1\/2021.repl4nlp-1.4"},{"key":"9374_CR93","doi-asserted-by":"crossref","unstructured":"Greco CM, Tagarelli A, Zumpano E (2022) A comparison of Transformer-based language models on NLP benchmarks. In: Proceedings of the international conference on applications of natural language to information systems (NLDB), volume 13286 of lecture notes in computer science. Springer, pp 490\u2013501","DOI":"10.1007\/978-3-031-08473-7_45"},{"key":"9374_CR94","unstructured":"Guillou P (2020) GPorTuguese-2 (Portuguese GPT-2 small): a language model for Portuguese text generation (and more NLP tasks...). Technical report"},{"key":"9374_CR95","doi-asserted-by":"crossref","unstructured":"Guo J, Fan Y, Ai Q, Croft WB (2016) A deep relevance matching model for ad-hoc retrieval. In: Proceedings of the ACM conference on information and knowledge management (CIKM). ACM, pp 55\u201364","DOI":"10.1145\/2983323.2983769"},{"key":"9374_CR96","doi-asserted-by":"crossref","unstructured":"Gururangan S, Marasovic A, Swayamdipta S, Lo K, Beltagy I, Downey D, Smith NA (2020) Don\u2019t stop pretraining: adapt language models to domains and tasks. In: Proceedings of the annual meeting of the Association for Computational Linguistics (ACL). Association for Computational Linguistics, pp 8342\u20138360","DOI":"10.18653\/v1\/2020.acl-main.740"},{"key":"9374_CR97","unstructured":"Guti\u00e9rrez-Fandi\u00f1o A, Armengol-Estap\u00e9 J, Gonzalez-Agirre A, Villegas M (2021a) Spanish legalese language model and corpora. CoRR, arXiv:2110.12201"},{"key":"9374_CR98","unstructured":"Guti\u00e9rrez-Fandi\u00f1o A, Armengol-Estap\u00e9 J, P\u00e0mies M, Llop-Palao J, Silveira-Ocampo J, Carrino CP, Gonzalez-Agirre A, Armentano-Oller C, Penagos CR, Villegas M (2021b) Spanish language models CoRR, arXiv:2107.07253"},{"key":"9374_CR99","first-page":"66","volume":"6","author":"J He","year":"2019","unstructured":"He J, Wang L, Liu L, Feng J, Wu H (2019) Long document classification from local word glimpses via recurrent attention learning. IEEE Access 6:66","journal-title":"IEEE Access"},{"key":"9374_CR100","unstructured":"He P, Gao J, Chen W (2023) Debertav3: Improving deberta using electra-style pre-training with gradient-disentangled embedding sharing. In: Proceedings of the international conference on learning representations (ICLR)"},{"key":"9374_CR101","unstructured":"He P, Liu X, Gao J, Chen W (2021) DeBERTa: decoding-enhanced BERT with disentangled attention. In: Proceedings of the international conference on learning representations (ICLR)"},{"key":"9374_CR102","unstructured":"Henderson P, Krass MS, Zheng L, Guha N, Manning CD, Jurafsky D, Ho DE (2022) Pile of law: learning responsible data filtering from the law and a 256\u00a0GB open-source legal dataset. In: Proceedings of the annual conference on neural information processing systems (NeurIPS)"},{"key":"9374_CR103","unstructured":"Hendrycks D, Burns C, Chen A, Ball S (2021) CUAD: an expert-annotated NLP dataset for legal contract review. In: Proceedings of the annual conference on neural information processing systems (NeurIPS)"},{"key":"9374_CR104","unstructured":"Hermann KM, Kocisk\u00fd T, Grefenstette E, Espeholt L, Kay W, Suleyman M, Blunsom P (2015) Teaching machines to read and comprehend. In: Proceedings of the annual conference on neural information processing systems (NeurIPS), pp 1693\u20131701"},{"key":"9374_CR105","unstructured":"Holzenberger N, Blair-Stanek A, Durme BV (2020) A dataset for statutory reasoning in tax law entailment and question answering. In: Proceedings of the 2020 workshop on natural legal language processing (NLLP), volume 2645 of CEUR workshop proceedings, pp 31\u201338"},{"key":"9374_CR106","doi-asserted-by":"crossref","unstructured":"Holzenberger N, Durme BV (2021) Factoring statutory reasoning as language understanding challenges. In: Proceedings of the annual meeting of the Association for Computational Linguistics and the international joint conference on natural language processing (ACL\/IJCNLP)","DOI":"10.18653\/v1\/2021.acl-long.213"},{"key":"9374_CR107","doi-asserted-by":"crossref","unstructured":"Hong J, Chong D, Manning CD (2021) Learning from limited labels for long legal dialogue. In: Proceedings of the 2021 workshop on natural legal language processing (NLLP). ACL, pp 190\u2013204","DOI":"10.18653\/v1\/2021.nllp-1.20"},{"key":"9374_CR108","unstructured":"Houlsby N, Giurgiu A, Jastrzebski S, Morrone B, de\u00a0Laroussilhe Q, Gesmundo A, Attariyan M, Gelly S (2019) Parameter-efficient transfer learning for NLP. In: Proceedings of the international conference on machine learning (ICML), volume\u00a097 of proceedings of machine learning research. PMLR, pp 2790\u20132799"},{"key":"9374_CR109","unstructured":"Howe JST, Khang LH, Chai IE (2019) Legal area classification: a comparative study of text classifiers on Singapore supreme court judgments. CoRR, arXiv:1904.06470"},{"key":"9374_CR110","unstructured":"Hu J, Ruder S, Siddhant A, Neubig G, Firat O, Johnson M (2020) XTREME: a massively multilingual multi-task benchmark for evaluating cross-lingual generalization. CoRR, arXiv:2003.11080"},{"key":"9374_CR111","doi-asserted-by":"publisher","first-page":"166","DOI":"10.1016\/j.neucom.2021.07.013","volume":"460","author":"Y Huang","year":"2021","unstructured":"Huang Y, Yu Z, Guo J, Xiang Y, Xian Y (2021) Element graph-augmented abstractive summarization for legal public opinion news with graph transformer. Neurocomputing 460:166\u2013180","journal-title":"Neurocomputing"},{"key":"9374_CR112","doi-asserted-by":"crossref","unstructured":"Hudzina J, Madan K, Chinnappa D, Harmouche J, Bretz H, Vold A, Schilder F (2020) Information extraction\/entailment of common law and civil code. In: New frontiers in artificial intelligence-JSAI-isAI 2020 workshops, JURISIN, LENLS 2020 Workshops, volume 12758 of lecture notes in computer science. Springer, pp 254\u2013268","DOI":"10.1007\/978-3-030-79942-7_17"},{"key":"9374_CR113","doi-asserted-by":"crossref","unstructured":"Hui K, Yates A, Berberich K, de\u00a0Melo G (2017) PACRR: a position-aware neural IR model for relevance matching. In: Proceedings of the conference on empirical methods in natural language processing (EMNLP). Association for Computational Linguistics, pp 1049\u20131058","DOI":"10.18653\/v1\/D17-1110"},{"key":"9374_CR114","doi-asserted-by":"crossref","unstructured":"II MJB, Katz DM (2022) GPT takes the bar exam. CoRR, arXiv:2212.14402","DOI":"10.2139\/ssrn.4389233"},{"key":"9374_CR115","unstructured":"Jain D, Borah MD, Biswas A (2021) Summarization of Indian legal judgement documents via ensembling of contextual embedding based MLP models. In: Working notes of FIRE 2021\u2014forum for information retrieval evaluation, volume 3159 of CEUR workshop proceedings. CEUR-WS.org, pp 553\u2013561"},{"key":"9374_CR116","doi-asserted-by":"publisher","first-page":"64","DOI":"10.1162\/tacl_a_00300","volume":"8","author":"M Joshi","year":"2020","unstructured":"Joshi M, Chen D, Liu Y, Weld DS, Zettlemoyer L, Levy O (2020) Spanbert: improving pre-training by representing and predicting spans. Trans Assoc Comput Linguist 8:64\u201377","journal-title":"Trans Assoc Comput Linguist"},{"key":"9374_CR117","doi-asserted-by":"crossref","unstructured":"Joshi M, Choi E, Weld DS, Zettlemoyer L (2017) TriviaQA: a large scale distantly supervised challenge dataset for reading comprehension. In: Proceedings of the annual meeting of the Association for Computational Linguistics (ACL). Association for Computational Linguistics, pp 1601\u20131611","DOI":"10.18653\/v1\/P17-1147"},{"key":"9374_CR118","doi-asserted-by":"crossref","unstructured":"Joulin A, Grave E, Bojanowski P, Mikolov T (2017) Bag of tricks for efficient text classification. In: Proceedings of the conference of the European chapter of the Association for Computational Linguistics (EACL). Association for Computational Linguistics, pp 427\u2013431","DOI":"10.18653\/v1\/E17-2068"},{"key":"9374_CR119","unstructured":"Kalamkar P, Tiwari A, Agarwal A, Karn S, Gupta S, Raghavan V, Modi A (2022) Corpus for automatic structuring of legal documents. In: Proceedings of the language resources and evaluation conference (LREC). European Language Resources Association, pp 4420\u20134429"},{"key":"9374_CR120","doi-asserted-by":"crossref","unstructured":"Karpukhin V, Oguz B, Min S, Lewis PSH, Wu L, Edunov S, Chen D, Yih W (2020) Dense passage retrieval for open-domain question answering. In: Proceedings of the conference on empirical methods in natural language processing (EMNLP). Association for Computational Linguistics, pp 6769\u20136781","DOI":"10.18653\/v1\/2020.emnlp-main.550"},{"key":"9374_CR121","doi-asserted-by":"crossref","unstructured":"Khazaeli S, Punuru J, Morris C, Sharma S, Staub B, Cole M, Chiu-Webster S, Sakalley D (2021) A free format legal question answering system. In: Proceedings of the 2021 workshop on natural legal language processing (NLLP). ACL, pp 107\u2014113","DOI":"10.18653\/v1\/2021.nllp-1.11"},{"key":"9374_CR122","doi-asserted-by":"crossref","unstructured":"Kim M-Y, Rabelo J, Goebel R (2021) BM25 and Transformer-based Legal Information Extraction and Entailment. In: Proceedings of the eighth international competition on legal information extraction\/entailment (COLIEE 2021)","DOI":"10.1007\/s12626-022-00105-z"},{"key":"9374_CR123","doi-asserted-by":"crossref","unstructured":"Klaus S, Hecke RV, Naini KD, Altingovde IS, Bernab\u00e9-Moreno J, Herrera-Viedma E (2022) Summarizing legal regulatory documents using transformers. In: Proceedings of the ACM SIGIR conference on research and development in information retrieval (SIGIR). ACM, pp 2426\u20132430","DOI":"10.1145\/3477495.3531872"},{"key":"9374_CR124","unstructured":"Kojima T, Gu SS, Reid M, Matsuo Y, Iwasawa Y (2022) Large language models are zero-shot reasoners. In: Proceedings of the annual conference on neural information processing systems (NeurIPS)"},{"key":"9374_CR125","doi-asserted-by":"crossref","unstructured":"Koreeda Y, Manning CD (2021) Contractnli: a dataset for document-level natural language inference for contracts. In: Findings of the Association for Computational Linguistics: EMNLP 2021. Association for Computational Linguistics, pp 1907\u20131919","DOI":"10.18653\/v1\/2021.findings-emnlp.164"},{"key":"9374_CR126","doi-asserted-by":"crossref","unstructured":"Koutsikakis J, Chalkidis I, Malakasiotis P, Androutsopoulos I (2020) GREEK-BERT: the Greeks visiting Sesame Street. In: Proceedings of the Hellenic conference on artificial intelligence. ACM, pp 110\u2013117","DOI":"10.1145\/3411408.3411440"},{"key":"9374_CR127","doi-asserted-by":"crossref","unstructured":"Kruiper R, Konstas I, Gray A, Sadeghineko F, Watson R, Kumar B (2021) SPAR.txt, a cheap Shallow Parsing approach for Regulatory texts. In: Proceedings of the 2021 workshop on natural legal language processing (NLLP). ACL, pp 129\u2014143","DOI":"10.18653\/v1\/2021.nllp-1.14"},{"key":"9374_CR128","doi-asserted-by":"crossref","unstructured":"Kudo T Richardson J (2018) Sentencepiece: a simple and language independent subword tokenizer and detokenizer for neural text processing. In: Proceedings of the conference on empirical methods in natural language processing (EMNLP): system demonstrations. Association for Computational Linguistics, pp 66\u201371","DOI":"10.18653\/v1\/D18-2012"},{"key":"9374_CR129","first-page":"452","volume":"7","author":"T Kwiatkowski","year":"2019","unstructured":"Kwiatkowski T, Palomaki J, Redfield O, Collins M, Parikh AP, Alberti C, Epstein D, Polosukhin I, Devlin J, Lee K, Toutanova K, Jones L, Kelcey M, Chang M, Dai AM, Uszkoreit J, Le Q, Petrov S (2019) Natural questions: a benchmark for question answering research. Trans Assoc Comput Linguist 7:452\u2013466","journal-title":"Trans Assoc Comput Linguist"},{"key":"9374_CR130","doi-asserted-by":"publisher","DOI":"10.7717\/peerj-cs.904","volume":"8","author":"A Lage-Freitas","year":"2022","unstructured":"Lage-Freitas A, Allende-Cid H, Santana O, de Oliveira-Lage L (2022) Predicting Brazilian court decisions. PeerJ Comput Sci 8:e904","journal-title":"PeerJ Comput Sci"},{"key":"9374_CR131","doi-asserted-by":"crossref","unstructured":"Lai G, Xie Q, Liu H, Yang Y, Hovy EH (2017) RACE: large-scale reading comprehension dataset from examinations. In: Proceedings of the conference on empirical methods in natural language processing (EMNLP). Association for Computational Linguistics, pp 785\u2013794","DOI":"10.18653\/v1\/D17-1082"},{"key":"9374_CR132","unstructured":"Lam JT, Liang D, Dahan S, Zulkernine FH (2020) The gap between deep learning and law: predicting employment notice. In: Proceedings of the 2020 workshop on natural legal language processing (NLLP), volume 2645 of CEUR workshop proceedings, pp 52\u201356"},{"key":"9374_CR133","doi-asserted-by":"crossref","unstructured":"Lan Z, Chen M, Goodman S, Gimpel K, Sharma P, Soricut R (2020) ALBERT: a lite BERT for self-supervised learning of language representations. In: Proceedings of the international conference on learning representations (ICLR)","DOI":"10.1109\/SLT48900.2021.9383575"},{"key":"9374_CR134","unstructured":"Le H, Vial L, Frej J, Segonne V, Coavoux M, Lecouteux B, Allauzen A, Crabb\u00e9 B, Besacier L, Schwab D (2020) FlauBERT: unsupervised language model pre-training for French. In: Proceedings of the language resources and evaluation conference (LREC). European Language Resources Association, pp 2479\u20132490"},{"key":"9374_CR135","doi-asserted-by":"crossref","unstructured":"Lei T, Barzilay R, Jaakkola TS (2016) Rationalizing neural predictions. In: Proceedings of the conference on empirical methods in natural language processing (EMNLP). The Association for Computational Linguistics, pp 107\u2013117","DOI":"10.18653\/v1\/D16-1011"},{"key":"9374_CR136","unstructured":"Leivaditi S, Rossi J, Kanoulas E (2020) A benchmark for lease contract review. CoRR, arXiv:2010.10386"},{"key":"9374_CR137","doi-asserted-by":"crossref","unstructured":"Lewis M, Liu Y, Goyal N, Ghazvininejad M, Mohamed A, Levy O, Stoyanov V, Zettlemoyer L (2020) BART: denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension. In: Proceedings of the annual meeting of the Association for Computational Linguistics (ACL). Association for Computational Linguistics, pp 7871\u20137880","DOI":"10.18653\/v1\/2020.acl-main.703"},{"key":"9374_CR138","doi-asserted-by":"crossref","unstructured":"Li D, Yang K, Zhang L, Yin D, Peng D (2021a) CLASS: a novel method for Chinese legal judgments summarization. In: Proceedings of the international conference on computer science and application engineering. ACM, pp 86:1\u201386:5","DOI":"10.1145\/3487075.3487161"},{"key":"9374_CR139","unstructured":"Li J, Monroe W, Jurafsky D (2016) Understanding neural networks through representation erasure. CoRR, arXiv:1612.08220"},{"key":"9374_CR140","unstructured":"Li J, Zhao X, Liu J, Wen J, Yang M (2021b) SIAT@COLIEE-2021: combining statistics recall and semantic ranking for legal case retrieval and entailment. In: Proceedings of the eighth international competition on legal information extraction\/entailment (COLIEE 2021), pp 31\u201337"},{"key":"9374_CR141","doi-asserted-by":"crossref","unstructured":"Li L, Dai Y, Tang D, Feng Z, Zhou C, Qiu X, Xu Z, Shi S (2022) MarkBERT: marking word boundaries improves Chinese BERT. CoRR, arXiv:2203.06378","DOI":"10.1007\/978-3-031-44693-1_26"},{"key":"9374_CR142","unstructured":"Licari D, Comand\u00e8 G (2022) ITALIAN-LEGAL-BERT: a pre-trained Transformer language model for Italian law. In: Proceedings of the KM4LAW Workshop with the 23rd international conference on knowledge engineering and knowledge management, volume 3256 of CEUR workshop proceedings. CEUR"},{"key":"9374_CR143","doi-asserted-by":"crossref","unstructured":"Limsopatham N (2021) Effectively leveraging BERT for legal document classification. In: Proceedings of the 2021 workshop on natural legal language processing (NLLP). ACL, pp 210\u2014216","DOI":"10.18653\/v1\/2021.nllp-1.22"},{"key":"9374_CR144","unstructured":"Lin C-Y (2004) ROUGE: a package for automatic evaluation of summaries. In: Text summarization branches out. Association for Computational Linguistics, pp 74\u201381"},{"issue":"1","key":"9374_CR145","doi-asserted-by":"publisher","first-page":"145","DOI":"10.1109\/18.61115","volume":"37","author":"J Lin","year":"1991","unstructured":"Lin J (1991) Divergence measures based on the Shannon entropy. IEEE Trans Inf Theory 37(1):145\u2013151","journal-title":"IEEE Trans Inf Theory"},{"key":"9374_CR146","doi-asserted-by":"crossref","unstructured":"Lin XV, Mihaylov T, Artetxe M, Wang T, Chen S, Simig D, Ott M, Goyal N, Bhosale S, Du J, Pasunuru R, Shleifer S, Koura PS, Chaudhary V, O\u2019Horo B, Wang J, Zettlemoyer L, Kozareva Z, Diab M, Stoyanov V, Li X (2022) Few-shot learning with multilingual generative language models. In: Proceedings of the conference on empirical methods in natural language processing (EMNLP). Association for Computational Linguistics, pp 9019\u20139052","DOI":"10.18653\/v1\/2022.emnlp-main.616"},{"issue":"2","key":"9374_CR147","doi-asserted-by":"publisher","first-page":"117","DOI":"10.1007\/s10506-019-09243-2","volume":"27","author":"M Lippi","year":"2019","unstructured":"Lippi M, Palka P, Contissa G, Lagioia F, Micklitz H, Sartor G, Torroni P (2019) CLAUDETTE: an automated detector of potentially unfair clauses in online terms of service. Artif Intell Law 27(2):117\u2013139","journal-title":"Artif Intell Law"},{"key":"9374_CR148","doi-asserted-by":"crossref","unstructured":"Liu C, Chen K (2019) Extracting the gist of Chinese judgments of the supreme court. In: Proceedings of the international conference on artificial intelligence and law (ICAIL). ACM, pp 73\u201382","DOI":"10.1145\/3322640.3326715"},{"key":"9374_CR149","doi-asserted-by":"crossref","unstructured":"Liu L, Zhang W, Liu J, Shi W, Huang Y (2021a) Interpretable charge prediction for legal cases based on interdependent legal information. In: Proceedings of the joint conference on neural networks (IJCNN). IEEE, pp 1\u20138","DOI":"10.1109\/IJCNN52387.2021.9533902"},{"key":"9374_CR150","doi-asserted-by":"crossref","unstructured":"Liu X, Yin D, Feng Y, Wu Y, Zhao D (2021b) Everything has a cause: leveraging causal inference in legal text analysis. In: Proceedings of the conference of the North American chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT). Association for Computational Linguistics, pp 1928\u20131941","DOI":"10.18653\/v1\/2021.naacl-main.155"},{"key":"9374_CR151","doi-asserted-by":"publisher","first-page":"726","DOI":"10.1162\/tacl_a_00343","volume":"8","author":"Y Liu","year":"2020","unstructured":"Liu Y, Gu J, Goyal N, Li X, Edunov S, Ghazvininejad M, Lewis M, Zettlemoyer L (2020) Multilingual denoising pre-training for neural machine translation. Trans Assoc Comput Linguist 8:726\u2013742","journal-title":"Trans Assoc Comput Linguist"},{"key":"9374_CR152","doi-asserted-by":"crossref","unstructured":"Liu Y, Lapata M (2019a) Text summarization with pretrained encoders. In: Proceedings of the conference on empirical methods in natural language processing and the international joint conference on natural language processing (EMNLP-IJCNLP). Association for Computational Linguistics, pp 3728\u20133738","DOI":"10.18653\/v1\/D19-1387"},{"key":"9374_CR153","doi-asserted-by":"crossref","unstructured":"Liu Y, Lapata M (2019b) Text summarization with pretrained encoders. In: Proceedings of the conference on empirical methods in natural language processing and the international joint conference on natural language processing (EMNLP-IJCNLP). Association for Computational Linguistics, pp 3728\u20133738","DOI":"10.18653\/v1\/D19-1387"},{"key":"9374_CR154","unstructured":"Liu Y, Ott M, Goyal N, Du J, Joshi M, Chen D, Levy O, Lewis M, Zettlemoyer L, Stoyanov V (2019) Roberta: a robustly optimized BERT pretraining approach. CoRR, arXiv:1907.11692"},{"key":"9374_CR155","unstructured":"Locke D, Zuccon G (2022) Case law retrieval: problems, methods, challenges and evaluations in the last 20 years. CoRR, arXiv:2202.07209"},{"key":"9374_CR156","doi-asserted-by":"crossref","unstructured":"Louis A, Spanakis G (2022) A statutory article retrieval dataset in French. In: Proceedings of the annual meeting of the Association for Computational Linguistics (ACL). Association for Computational Linguistics, pp 6789\u20136803","DOI":"10.18653\/v1\/2022.acl-long.468"},{"key":"9374_CR157","doi-asserted-by":"crossref","unstructured":"Louis A, van Dijck G, Spanakis G (2023) Finding the law: enhancing statutory article retrieval via graph neural networks. In: Proceedings of the conference of the European chapter of the Association for Computational Linguistics (EACL). Association for Computational Linguistics, pp 2753\u20132768","DOI":"10.18653\/v1\/2023.eacl-main.203"},{"issue":"1","key":"9374_CR158","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2021.102780","volume":"59","author":"Y Lyu","year":"2022","unstructured":"Lyu Y, Wang Z, Ren Z, Ren P, Chen Z, Liu X, Li Y, Li H, Song H (2022) Improving legal judgment prediction through reinforced criminal element extraction. Inf Process Manag 59(1):102780","journal-title":"Inf Process Manag"},{"key":"9374_CR159","unstructured":"Ma Y, Shao Y, Liu B, Liu Y, Zhang M, Ma S (2021) Retrieving legal cases from a large-scale candidate corpus. In: Proceedings of the eighth international competition on legal information extraction\/entailment (COLIEE 2021), pp 38\u201342"},{"key":"9374_CR160","doi-asserted-by":"crossref","unstructured":"MacAvaney S, Yates A, Cohan A, Goharian N (2019) CEDR: contextualized embeddings for document ranking. In: Proceedings of the ACM SIGIR conference on research and development in information retrieval (SIGIR). ACM, pp 1101\u20131104","DOI":"10.1145\/3331184.3331317"},{"key":"9374_CR161","unstructured":"Mahari RZ (2021) AutoLAW: augmented legal reasoning through legal precedent prediction. In: Proceedings of the 2021 workshop on natural legal language processing (NLLP)"},{"key":"9374_CR162","unstructured":"Majumder SB. Das D (2020) Rhetorical role labelling for legal judgements using ROBERTA. In: Working notes of FIRE 2020\u2014forum for information retrieval evaluation, volume 2826 of CEUR workshop proceedings. CEUR-WS.org, pp 22\u201325"},{"key":"9374_CR163","doi-asserted-by":"crossref","unstructured":"Malik V, Sanjay R, Nigam SK, Ghosh K, Guha SK, Bhattacharya A, Modi A (2021) ILDC for CJPE: Indian legal documents corpus for court judgment prediction and explanation. In: Proceedings of the annual meeting of the Association for Computational Linguistics and the international joint conference on natural language processing (ACL\/IJCNLP). Association for Computational Linguistics, pp 4046\u20134062","DOI":"10.18653\/v1\/2021.acl-long.313"},{"key":"9374_CR164","doi-asserted-by":"crossref","unstructured":"Mamakas D, Tsotsi P, Androutsopoulos I, Chalkidis I (2022) Processing long legal documents with pre-trained transformers: modding LegalBERT and longformer. CoRR, arXiv:2211.00974","DOI":"10.18653\/v1\/2022.nllp-1.11"},{"key":"9374_CR165","doi-asserted-by":"crossref","unstructured":"Manor L, Li JJ (2019) Plain English summarization of contracts. CoRR, arXiv:1906.00424","DOI":"10.18653\/v1\/W19-2201"},{"key":"9374_CR166","doi-asserted-by":"crossref","unstructured":"Martin L, M\u00fcller B, Su\u00e1rez PJO, Dupont Y, Romary L, de\u00a0la Clergerie \u00c9, Seddah D, Sagot B (2020) CamemBERT: a Tasty French language model. In: Proceedings of the annual meeting of the Association for Computational Linguistics (ACL). Association for Computational Linguistics, pp 7203\u20137219","DOI":"10.18653\/v1\/2020.acl-main.645"},{"issue":"3","key":"9374_CR167","doi-asserted-by":"publisher","first-page":"359","DOI":"10.1007\/s10506-021-09297-1","volume":"30","author":"GD Martino","year":"2022","unstructured":"Martino GD, Pio G, Ceci M (2022) PRILJ: an efficient two-step method based on embedding and clustering for the identification of regularities in legal case judgments. Artif Intell Law 30(3):359\u2013390","journal-title":"Artif Intell Law"},{"key":"9374_CR168","doi-asserted-by":"crossref","unstructured":"Masala M, Iacob RCA, Uban AS, Cidota M, Velicu H, Rebedea T, Popescu M (2021) jurBERT: a Romanian BERT model for legal judgement prediction. In: Proceedings of the 2021 workshop on natural legal language processing (NLLP). ACL, pp 86\u201394","DOI":"10.18653\/v1\/2021.nllp-1.8"},{"key":"9374_CR169","doi-asserted-by":"crossref","unstructured":"Masala M, Ruseti S, Dascalu M (2020) RoBERT\u2014a Romanian BERT model. In: Proceedings of the international conference on computational linguistics (COLING). International Committee on Computational Linguistics, pp 6626\u20136637","DOI":"10.18653\/v1\/2020.coling-main.581"},{"key":"9374_CR170","unstructured":"Mialon G, Dess\u00ec R., Lomeli M, Nalmpantis C, Pasunuru R, Raileanu R, Rozi\u00e8re B, Schick T, Dwivedi-Yu J, Celikyilmaz A, Grave E, LeCun Y, Scialom T (2023) Augmented language models: a survey. CoRR, arXiv:2302.07842"},{"key":"9374_CR171","unstructured":"Mihalcea R, Tarau P (2004) Textrank: bringing order into text. In: Proceedings of the conference on empirical methods in natural language processing (EMNLP). ACL, pp 404\u2013411"},{"key":"9374_CR172","unstructured":"Mikolov T, Chen K, Corrado G, Dean J (2013) Efficient estimation of word representations in vector space. In: Proceedings of the Workshops of the international conference on learning representations (ICLR)"},{"key":"9374_CR173","doi-asserted-by":"crossref","unstructured":"Mistica M, Lau JH, Merrifield B, Fazio K, Baldwin T (2021) Semi-automatic triage of requests for free legal assistance. In: Proceedings of the 2021 workshop on natural legal language processing (NLLP). ACL, pp 217\u2013227","DOI":"10.18653\/v1\/2021.nllp-1.23"},{"key":"9374_CR174","unstructured":"Miyato T, Dai AM, Goodfellow IJ (2017) Adversarial training methods for semi-supervised text classification. In: Proceedings of the international conference on learning representations (ICLR)"},{"key":"9374_CR175","doi-asserted-by":"crossref","unstructured":"Morris C, Ritzert M, Fey M, Hamilton WL, Lenssen JE., Rattan G, Grohe M (2019) Weisfeiler and leman go neural: higher-order graph neural networks. In: Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), the conference on Innovative Applications of Artificial Intelligence (IAAI), the AAAI symposium on Educational Advances in Artificial Intelligence (EAAI). AAAI Press, pp 4602\u20134609","DOI":"10.1609\/aaai.v33i01.33014602"},{"key":"9374_CR176","doi-asserted-by":"crossref","unstructured":"Nallapati R, Zhai F, Zhou B (2017) Summarunner: a recurrent neural network based sequence model for extractive summarization of documents. In: Proceedings of the AAAI conference on Artificial Intelligence (AAAI). AAAI Press, pp 3075\u20133081","DOI":"10.1609\/aaai.v31i1.10958"},{"key":"9374_CR177","doi-asserted-by":"crossref","unstructured":"Nallapati R, Zhou B, dos Santos CN, G\u00fcl\u00e7ehre \u00c7, Xiang B (2016) Abstractive text summarization using sequence-to-sequence rnns and beyond. In: Proceedings of the SIGNLL conference on computational natural language learning (CoNLL). ACL, pp 280\u2013290","DOI":"10.18653\/v1\/K16-1028"},{"key":"9374_CR178","doi-asserted-by":"crossref","unstructured":"Narayan S, Cohen SB, Lapata M (2018) Don\u2019t give me the details, just the summary! topic-aware convolutional neural networks for extreme summarization. In: Proceedings of the conference on empirical methods in natural language processing (EMNLP). Association for Computational Linguistics, pp 1797\u20131807","DOI":"10.18653\/v1\/D18-1206"},{"key":"9374_CR179","doi-asserted-by":"crossref","unstructured":"Naseri S, Foley J, Allan J (2018) Umass at TREC 2018: car, common core and news tracks. In: Proceedings of the text REtrieval conference (TREC), volume 500-331 of NIST Special Publication. National Institute of Standards and Technology (NIST)","DOI":"10.6028\/NIST.SP.500-331.core-UMass"},{"key":"9374_CR180","unstructured":"Nguyen H (2023) A Brief Report on LawGPT 1.0: a virtual legal assistant based on GPT-3. CoRR, arXiv:2302.05729"},{"key":"9374_CR181","unstructured":"Nguyen H, Nguyen L (2021) Sublanguage: a serious issue affects pretrained models in legal domain. CoRR, arXiv:2104.07782"},{"key":"9374_CR182","unstructured":"Nguyen H, Nguyen PM, Vuong T, Bui QM, Nguyen CM, Dang TB, Tran V, Nguyen ML, Satoh K (2021a) JNLP team: deep learning approaches for legal processing tasks in COLIEE 2021. In: Proceedings of the eighth international competition on legal information extraction\/entailment (COLIEE 2021), pp 46\u201353"},{"key":"9374_CR183","unstructured":"Nguyen H, Tran V, Nguyen PM, Vuong T, Bui QM, Nguyen CM, Dang TB, Nguyen ML, Satoh K (2021b) Paralaw nets\u2014cross-lingual sentence-level pretraining for legal text processing. CoRR, arXiv:2106.13403"},{"key":"9374_CR184","unstructured":"Nguyen H, Vuong HT, Nguyen PM, Dang TB, Bui QM, Sinh VT, Nguyen CM, Tran VD, Satoh K, Nguyen ML (2020) JNLP team: deep learning for legal processing in COLIEE 2020. In: Proceedings of the 14th international workshop on Juris-Informatics (JURISIN2020). The Japanese Society of Artificial Intelligence, pp 195\u2013208"},{"key":"9374_CR185","unstructured":"Nguyen T, Rosenberg M, Song X, Gao J, Tiwary S, Majumder R, Deng L (2016) MS MARCO: a human generated machine reading comprehension dataset. In: Proceedings of the workshop on cognitive computation: integrating neural and symbolic approaches, co-located with the annual conference on neural information processing systems (NIPS), volume 1773 of CEUR workshop proceedings. CEUR-WS.org"},{"key":"9374_CR186","doi-asserted-by":"crossref","unstructured":"Niklaus J, Chalkidis I, St\u00fcrmer M (2021) Swiss-judgment-prediction: a multilingual legal judgment prediction benchmark. In: Proceedings of the 2021 workshop on natural legal language processing (NLLP). ACL, pp 19\u201335","DOI":"10.18653\/v1\/2021.nllp-1.3"},{"key":"9374_CR187","unstructured":"Niklaus J, St\u00fcrmer M, Chalkidis I (2022) An empirical study on cross-x transfer for legal judgment prediction. In :Proceedings of the conference of the Asia-Pacific chapter of the Association for Computational Linguistics and the international joint conference on natural language processing (AACL\/IJCNLP). Association for Computational Linguistics, pp 32\u201346"},{"key":"9374_CR188","doi-asserted-by":"crossref","unstructured":"Nogueira R, Jiang Z, Pradeep R., Lin J (2020) Document ranking with a pretrained sequence-to-sequence model. In: Findings of the Association for Computational Linguistics: EMNLP, volume EMNLP 2020 of findings of ACL. Association for Computational Linguistics, pp 708\u2013718","DOI":"10.18653\/v1\/2020.findings-emnlp.63"},{"key":"9374_CR189","doi-asserted-by":"crossref","unstructured":"Oliver Wendell\u00a0Holmes J (1897) The path of the law. Harv Law Rev 10:457\u2013478","DOI":"10.2307\/1322028"},{"key":"9374_CR190","doi-asserted-by":"crossref","unstructured":"Ostendorff M, Ash E, Ruas T, Gipp B, Schneider JM, Rehm G (2021) Evaluating document representations for content-based legal literature recommendations. In: Proceedings of the international conference on artificial intelligence and law (ICAIL). ACM, pp 109\u2013118","DOI":"10.1145\/3462757.3466073"},{"key":"9374_CR191","doi-asserted-by":"crossref","unstructured":"Ostendorff M, Blume T, Ostendorff S (2020) Towards an open platform for legal information. In: Proceedings of the ACM\/IEEE joint conference on digital libraries (JCDL), pp 385\u2013388","DOI":"10.1145\/3383583.3398616"},{"key":"9374_CR192","unstructured":"Ouyang L, Wu J, Jiang X, Almeida D, Wainwright CL, Mishkin P, Zhang C, Agarwal S, Slama K, Ray A, Schulman J, Hilton J, Kelton F, Miller L, Simens M, Askell A, Welinder P, Christiano PF, Leike J, Lowe R (2022) Training language models to follow instructions with human feedback. In: Proceedings of the annual conference on neural information processing systems (NeurIPS)"},{"key":"9374_CR193","doi-asserted-by":"crossref","unstructured":"Pagliardini M, Gupta P, Jaggi M (2018) Unsupervised learning of sentence embeddings using compositional n-gram features. In: Proceedings of the conference of the North American chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT). Association for Computational Linguistics, pp 528\u2013540","DOI":"10.18653\/v1\/N18-1049"},{"key":"9374_CR194","doi-asserted-by":"crossref","unstructured":"Pais V, Mitrofan M, Gasan CL, Coneschi V, Ianov A (2021) Named entity recognition in the Romanian legal domain. In: Proceedings of the 2021 Workshop on Natural Legal Language Processing (NLLP). Association for Computational Linguistics, pp 9\u201318","DOI":"10.18653\/v1\/2021.nllp-1.2"},{"key":"9374_CR195","doi-asserted-by":"crossref","unstructured":"Pant K, Dadu T (2020) Cross-lingual inductive transfer to detect offensive language. CoRR, arXiv:2007.03771","DOI":"10.18653\/v1\/2020.semeval-1.290"},{"key":"9374_CR196","doi-asserted-by":"crossref","unstructured":"Papaloukas C, Chalkidis I, Athinaios K, Pantazi D, Koubarakis M (2021) Multi-granular legal topic classification on greek legislation. In: Proceedings of the 2021 workshop on natural legal language processing (NLLP). ACL, pp 63\u201375","DOI":"10.18653\/v1\/2021.nllp-1.6"},{"key":"9374_CR197","doi-asserted-by":"crossref","unstructured":"Paperno D, Kruszewski G, Lazaridou A, Pham QN, Bernardi R, Pezzelle S, Baroni M, Boleda G, Fern\u00e1ndez R (2016) The LAMBADA dataset: word prediction requiring a broad discourse context. In: Proceedings of the annual meeting of the Association for Computational Linguistics (ACL). Association for Computer Linguistics","DOI":"10.18653\/v1\/P16-1144"},{"key":"9374_CR198","doi-asserted-by":"crossref","unstructured":"Paul S, Goyal P, Ghosh S (2022a) Lesicin: a heterogeneous graph-based approach for automatic legal statute identification from Indian legal documents. In: Proceedings of the AAAI conference on artificial intelligence (AAAI), the conference on Innovative Applications of Artificial Intelligence (IAAI), the AAAI symposium on Educational Advances in Artificial Intelligence (EAAI). AAAI Press, pp 11139\u201311146","DOI":"10.1609\/aaai.v36i10.21363"},{"key":"9374_CR199","unstructured":"Paul S, Mandal A, Goyal P, Ghosh S (2022b) Pre-training Transformers on Indian legal text. CoRR, arXiv:2209.06049"},{"key":"9374_CR200","doi-asserted-by":"crossref","unstructured":"Pennington J, Socher R, Manning CD (2014) Glove: global vectors for word representation. In: Proceedings of the conference on empirical methods in natural language processing (EMNLP). ACL, pp 1532\u20131543","DOI":"10.3115\/v1\/D14-1162"},{"key":"9374_CR201","unstructured":"P\u00e9rez JM, Furman DA, Alemany LA, Luque FM (2022) RoBERTuito: a pre-trained language model for social media text in Spanish. In: Proceedings of the language resources and evaluation conference (LREC). European Language Resources Association, pp 7235\u20137243"},{"key":"9374_CR202","doi-asserted-by":"crossref","unstructured":"Pfeiffer J, Vulic I, Gurevych I, Ruder S (2020) MAD-X: an adapter-based framework for multi-task cross-lingual transfer. In: Proceedings of the conference on empirical methods in natural language processing (EMNLP). Association for Computational Linguistics, pp 7654\u20137673","DOI":"10.18653\/v1\/2020.emnlp-main.617"},{"key":"9374_CR203","unstructured":"Polignano M, Basile P, de\u00a0Gemmis M, Semeraro G, Basile V (2019) AlBERTo: Italian BERT language understanding model for NLP challenging tasks based on tweets. In: Proceedings of the sixth Italian conference on computational linguistics, volume 2481 of CEUR workshop proceedings. CEUR-WS.org"},{"key":"9374_CR204","doi-asserted-by":"crossref","unstructured":"Polo FM, Mendon\u00e7a GCF, Parreira KCJ, de\u00a0Godoy\u00a0Gianvechio L, Cordeiro P, Ferreira JB., de\u00a0Lima LMP, do\u00a0Amaral\u00a0Maia AC., Vicente R (2021) LegalNLP\u2014natural language processing methods for the Brazilian legal language. CoRR, arXiv:2110.15709","DOI":"10.5753\/eniac.2021.18301"},{"key":"9374_CR205","unstructured":"Polsley S, Jhunjhunwala P, Huang R (2016) CaseSummarizer: a system for automated summarization of legal texts. In: Proceedings of the international conference on computational linguistics (COLING): system demonstrations. ACL, pp 258\u2013262"},{"key":"9374_CR206","doi-asserted-by":"crossref","unstructured":"Ponte JM, Croft WB (1998) A language modeling approach to information retrieval. In: Proceedings of the ACM SIGIR conference on research and development in information retrieval (SIGIR). ACM, pp 275\u2013281","DOI":"10.1145\/290941.291008"},{"key":"9374_CR207","unstructured":"Pradhan S, Moschitti A, Xue N, Uryupina O, Zhang Y. (2012) CoNLL\u20142012 shared task: modeling multilingual unrestricted coreference in OntoNotes. In: Proceedings of the joint conference on empirical methods in natural language processing and computational natural language learning (EMNLP-CoNLL): modeling multilingual unrestricted coreference in OntoNotes. ACL, pp 1\u201340"},{"key":"9374_CR208","unstructured":"Pyysalo S, Kanerva J, Virtanen A, Ginter F (2021) WikiBERT models: deep transfer learning for many languages. In: Proceedings of the Nordic conference on computational linguistics (NoDaLiDa), pp 1\u201310"},{"key":"9374_CR209","doi-asserted-by":"publisher","DOI":"10.1016\/j.is.2021.101822","volume":"106","author":"A Quemy","year":"2022","unstructured":"Quemy A, Wrembel R (2022) ECHR-OD: on building an integrated open repository of legal documents for machine learning applications. Inf Syst 106:101822","journal-title":"Inf Syst"},{"key":"9374_CR210","doi-asserted-by":"crossref","unstructured":"Rabelo J, Goebel R, Kim M-Y, Kano Y, Yoshioka M, Satoh K (2022) Overview and discussion of the competition on legal information extraction\/entailment (COLIEE) 2021. The Review of Socionetwork Strategies","DOI":"10.1007\/s12626-022-00105-z"},{"key":"9374_CR211","doi-asserted-by":"crossref","unstructured":"Rabelo J, Kim M, Goebel R (2019a) Combining similarity and transformer methods for case law entailment. In: Proceedings of the international conference on artificial intelligence and law (ICAIL). ACM, pp 290\u2013296","DOI":"10.1145\/3322640.3326741"},{"key":"9374_CR212","doi-asserted-by":"crossref","unstructured":"Rabelo J, Kim M, Goebel R (2020) The application of text entailment techniques in COLIEE 2020. In: New frontiers in artificial intelligence\u2014JSAI-isAI 2020 Workshops, JURISIN, LENLS 2020 workshops, volume 12758 of lecture notes in computer science. Springer, pp 240\u2013253","DOI":"10.1007\/978-3-030-79942-7_16"},{"key":"9374_CR213","doi-asserted-by":"crossref","unstructured":"Rabelo J, Kim M-Y, Goebel R, Yoshioka M, Kano Y, Satoh K (2019) A summary of the COLIEE 2019 competition. New frontiers in artificial intelligence: JSAI-isAI international workshops, JURISIN, AI-Biz, LENLS, Kansei-AI, pp 34\u201349","DOI":"10.1007\/978-3-030-58790-1_3"},{"key":"9374_CR214","unstructured":"Radford A, Narasimhan K, Salimans T, Sutskever I (2018) Improving language understanding by generative pre-training. OpenAI blog"},{"key":"9374_CR215","unstructured":"Radford A, Wu J, Child R, Luan D, Amodei D, Sutskever I (2019) Language models are unsupervised multitask learners. OpenAI blog"},{"key":"9374_CR216","unstructured":"Raffel C, Shazeer N, Roberts A, Lee K, Narang S, Matena M, Zhou Y, Li W, Liu PJ (2020) Exploring the limits of transfer learning with a unified text-to-text transformer. J Mach Learn Res 21:140:1\u2013140:67"},{"key":"9374_CR217","doi-asserted-by":"crossref","unstructured":"Rajpurkar P, Jia R, Liang P (2018) Know what you don\u2019t know: unanswerable questions for squad. In: Proceedings of the annual meeting of the Association for Computational Linguistics (ACL). Association for Computational Linguistics, pp 784\u2013789","DOI":"10.18653\/v1\/P18-2124"},{"key":"9374_CR218","doi-asserted-by":"crossref","unstructured":"Rajpurkar P, Zhang J, Lopyrev K, Liang P (2016) Squad: 100,000+ questions for machine comprehension of text. In: Proceedings of the conference on empirical methods in natural language processing (EMNLP). Association for Computational Linguistics, pp 2383\u20132392","DOI":"10.18653\/v1\/D16-1264"},{"key":"9374_CR219","doi-asserted-by":"crossref","unstructured":"Ranaldi L, Fallucchi F, Santilli A, Zanzotto FM (2021) KERMITviz: visualizing neural network activations on syntactic trees. In: Proceedings of the international conference on metadata and semantic research (MTSR), revised selected papers, volume 1537 of communications in computer and information science. Springer, pp 139\u2013147","DOI":"10.1007\/978-3-030-98876-0_12"},{"key":"9374_CR220","doi-asserted-by":"crossref","unstructured":"Ravichander A, Black AW, Wilson S, Norton TB, Sadeh NM (2019) Question answering for privacy policies: combining computational and legal perspectives. In: Proceedings of the conference on empirical methods in natural language processing and the international joint conference on natural language processing (EMNLP-IJCNLP). Association for Computational Linguistics, pp 4946\u20134957","DOI":"10.18653\/v1\/D19-1500"},{"key":"9374_CR221","doi-asserted-by":"crossref","unstructured":"Reimers N, Gurevych I (2019) Sentence-BERT: sentence embeddings using Siamese BERT-networks. In: Proceedings of the conference on empirical methods in natural language processing and the international joint conference on natural language processing (EMNLP-IJCNLP), pp 3980\u20133990","DOI":"10.18653\/v1\/D19-1410"},{"key":"9374_CR222","doi-asserted-by":"crossref","unstructured":"Ribeiro MT, Singh S, Guestrin C (2016) \u201cWhy Should I Trust You?\u201d: explaining the predictions of any classifier. In: Proceedings of the ACM SIGKDD international conference on knowledge discovery and data mining. ACM, pp 1135\u20131144","DOI":"10.1145\/2939672.2939778"},{"key":"9374_CR223","doi-asserted-by":"publisher","first-page":"842","DOI":"10.1162\/tacl_a_00349","volume":"8","author":"A Rogers","year":"2020","unstructured":"Rogers A, Kovaleva O, Rumshisky A (2020) A primer in BERTology: what we know about how BERT works. Trans Assoc Comput Linguist 8:842\u2013866","journal-title":"Trans Assoc Comput Linguist"},{"key":"9374_CR224","doi-asserted-by":"crossref","unstructured":"Rosa GM, Rodrigues RC, de\u00a0Alencar\u00a0Lotufo R, Nogueira R (2021) To tune or not to tune?: zero-shot models for legal case entailment. In: Proceedings of the international conference on artificial intelligence and law (ICAIL). ACM, pp 295\u2013300","DOI":"10.1145\/3462757.3466103"},{"key":"9374_CR225","unstructured":"Rossi J Kanoulas E (2019) Legal information retrieval with generalized language models: ILPS participation to COLIEE 2019. In: Proc of COLIEE 2019 workshop: competition on legal information extraction\/entailment"},{"key":"9374_CR226","doi-asserted-by":"crossref","unstructured":"Sakata W, Shibata T, Tanaka R, Kurohashi S (2019) FAQ retrieval using query-question similarity and bert-based query-answer relevance. In: Proceedings of the ACM SIGIR conference on research and development in information retrieval (SIGIR). ACM, pp 1113\u20131116","DOI":"10.1145\/3331184.3331326"},{"key":"9374_CR227","doi-asserted-by":"crossref","unstructured":"Sala\u00fcn O, Langlais P, Lou A, Westermann H, Benyekhlef K (2020) Analysis and multilabel classification of Quebec court decisions in the domain of housing law. In: Proceedings of the international conference on applications of natural language to information systems (NLDB), volume 12089 of lecture notes in computer science. Springer, pp 135\u2013143","DOI":"10.1007\/978-3-030-51310-8_13"},{"key":"9374_CR228","unstructured":"Samy D, Arenas-Garc\u00eda J, P\u00e9rez-Fern\u00e1ndez D (2020) Legal-ES: a set of large scale resources for Spanish legal text processing. In: Proceedings of the workshop on language technologies for Government and Public Administration (LT4Gov@LREC 2020). European Language Resources Association, pp 32\u201336"},{"key":"9374_CR229","doi-asserted-by":"crossref","unstructured":"Sanchez L, He J, Manotumruksa J, Albakour D, Martinez M, Lipani A (2020) Easing legal news monitoring with learning to rank and BERT. In: Proceedings of the European conference on IR research (ECIR), volume 12036 of lecture notes in computer science. Springer, pp 336\u2013343","DOI":"10.1007\/978-3-030-45442-5_42"},{"key":"9374_CR230","unstructured":"Sang EFTK (2002) Introduction to the CoNLL-2002 shared task: language-independent named entity recognition. In: Proceedings of the conference on natural language learning (CoNLL). ACL"},{"key":"9374_CR231","doi-asserted-by":"crossref","unstructured":"Sang EFTK, Meulder FD (2003) Introduction to the CoNLL-2003 shared task: language-independent named entity recognition. In: Proceedings of the conference on natural language learning (CoNLL). ACL, pp 142\u2013147","DOI":"10.3115\/1119176.1119195"},{"key":"9374_CR232","unstructured":"Sanh V, Debut L, Chaumond J, Wolf T (2019) Distilbert, a distilled version of BERT: smaller, faster, cheaper and lighter. CoRR, arXiv:1910.01108"},{"key":"9374_CR233","doi-asserted-by":"publisher","DOI":"10.1016\/j.is.2021.101967","volume":"106","author":"C Sansone","year":"2022","unstructured":"Sansone C, Sperl\u00ed G (2022) Legal information retrieval systems: state-of-the-art and open issues. Inf Syst 106:101967","journal-title":"Inf Syst"},{"key":"9374_CR234","doi-asserted-by":"crossref","unstructured":"Santosh TYSS, Xu S, Ichim O, Grabmair M (2022) Deconfounding legal judgment prediction for European court of human rights cases towards better alignment with experts. In: Proceedings of the conference on empirical methods in natural language processing (EMNLP). Association for Computational Linguistics, pp 1120\u20131138","DOI":"10.18653\/v1\/2022.emnlp-main.74"},{"key":"9374_CR235","doi-asserted-by":"crossref","unstructured":"Sarkar, R., Ojha, A.\u00a0K., Megaro, J., Mariano, J., Herard, V., and McCrae, J.\u00a0P. (2021). Few-shot and Zero-shot Approaches to Legal Text Classification: A Case Study in the Financial Sector. In Proc. of the 2021 Workshop on Natural Legal Language Processing (NLLP), pages 102\u2014106. ACL","DOI":"10.18653\/v1\/2021.nllp-1.10"},{"key":"9374_CR236","doi-asserted-by":"crossref","unstructured":"Savelka J, Ashley KD (2021) Discovering explanatory sentences in legal case decisions using pre-trained language models. In: Findings of the Association for Computational Linguistics: EMNLP 2021. Association for Computational Linguistics, pp 4273\u20134283","DOI":"10.18653\/v1\/2021.findings-emnlp.361"},{"issue":"2","key":"9374_CR237","doi-asserted-by":"publisher","first-page":"245","DOI":"10.1007\/s10506-021-09293-5","volume":"30","author":"J Savelka","year":"2022","unstructured":"Savelka J, Ashley KD (2022) Legal information retrieval for understanding statutory terms. Artif Intell Law 30(2):245\u2013289","journal-title":"Artif Intell Law"},{"key":"9374_CR238","unstructured":"Savelka J, Westermann H, Benyekhlef K (2020) Cross-domain generalization and knowledge transfer in transformers trained on legal data. In: Proceedings of the fourth workshop on automated semantic analysis of information in legal text (ASAIL) in conjunction with the international conference on legal knowledge and information systems (JURIX), volume 2764 of CEUR workshop proceedings. CEUR-WS.org"},{"key":"9374_CR239","doi-asserted-by":"crossref","unstructured":"Savelka J, Westermann H, Benyekhlef K, Alexander CS, Grant JC, Amariles DR, Hamdani RE, Mee\u00f9s S, Troussel AC, Araszkiewicz M, Ashley KD, Ashley A, Branting K, Falduti M, Grabmair M, Harasta J, Novotn\u00e1 T, Tippett E, Johnson S (2021) Lex Rosetta: transfer of predictive models across languages, jurisdictions, and legal domains. In: Proceedings of the international conference on artificial intelligence and law (ICAIL). ACM, pp 129\u2013138","DOI":"10.1145\/3462757.3466149"},{"key":"9374_CR240","unstructured":"Schilder F, Chinnappa D, Madan K, Harmouche J, Vold A, Bretz H, Hudzina J (2021) A Pentapus grapples with legal reasoning. In: Proceedings of the eighth international competition on legal information extraction\/entailment (COLIEE 2021), pp 60\u201368"},{"key":"9374_CR241","unstructured":"Schulman J, Wolski F, Dhariwal P, Radford A, Klimov O (2017) Proximal policy optimization algorithms. CoRR, arXiv:1707.06347"},{"key":"9374_CR242","doi-asserted-by":"crossref","unstructured":"See A, Liu PJ, Manning CD (2017) Get to the point: summarization with pointer-generator networks. In: Proceedings of the annual meeting of the Association for Computational Linguistics (ACL). Association for Computational Linguistics, pp 1073\u20131083","DOI":"10.18653\/v1\/P17-1099"},{"key":"9374_CR243","unstructured":"Seker A, Bandel E, Bareket D, Brusilovsky I, Greenfeld RS, Tsarfaty R (2021) AlephBERT: a hebrew large pre-trained language model to start-off your hebrew NLP application with. CoRR, arXiv:2104.04052"},{"key":"9374_CR244","doi-asserted-by":"crossref","unstructured":"Sennrich R, Haddow B, Birch A (2016) Neural machine translation of rare words with subword units. In: Proceedings of the annual meeting of the Association for Computational Linguistics (ACL). Association for Computational Linguistics","DOI":"10.18653\/v1\/P16-1162"},{"key":"9374_CR245","doi-asserted-by":"crossref","unstructured":"Serras FR, Finger M (2022) verBERT: automating Brazilian case law document multi-label categorization using BERT. CoRR, arXiv:2203.06224","DOI":"10.5753\/stil.2021.17803"},{"key":"9374_CR246","doi-asserted-by":"crossref","unstructured":"Shaghaghian S, Feng LY, Jafarpour B, Pogrebnyakov N (2020) Customizing contextualized language models for legal document reviews. In: Proceedings of the IEEE international conf. on big data (BigData). IEEE, pp 2139\u20132148","DOI":"10.1109\/BigData50022.2020.9378201"},{"key":"9374_CR247","unstructured":"Shaheen Z, Wohlgenannt G, Filtz E (2020) Large scale legal text classification using transformer models. CoRR, arXiv:2010.12871"},{"key":"9374_CR248","doi-asserted-by":"crossref","unstructured":"Shaheen Z, Wohlgenannt G, Muromtsev D (2021) Zero-shot cross-lingual transfer in legal domain using transformer models. CoRR, arXiv:2111.14192","DOI":"10.1109\/CSCI54926.2021.00145"},{"key":"9374_CR249","doi-asserted-by":"crossref","unstructured":"Shao H, Chen Y, Huang S (2020a) BERT-based ensemble model for statute law retrieval and legal information entailment. In: New frontiers in artificial intelligence\u2014JSAI-isAI 2020 workshops, JURISIN, LENLS 2020 workshops, volume 12758 of lecture notes in computer science. Springer, pp 226\u2013239","DOI":"10.1007\/978-3-030-79942-7_15"},{"key":"9374_CR250","unstructured":"Shao Y, Liu B, Mao J, Liu Y, Zhang M, Ma S (2020b) Thuir@coliee-2020: leveraging semantic understanding and exact matching for legal case retrieval and entailment. CoRR, arXiv:2012.13102"},{"key":"9374_CR251","doi-asserted-by":"crossref","unstructured":"Shao Y, Mao J, Liu Y, Ma W, Satoh K, Zhang M, Ma S (2020c) BERT-PLI: modeling paragraph-level interactions for legal case retrieval. In: Proceedings of the international joint conference on artificial intelligence (IJCAI), pp 3501\u20133507","DOI":"10.24963\/ijcai.2020\/484"},{"key":"9374_CR252","unstructured":"Shen Z, Lo K, Yu L, Dahlberg N, Schlanger M, Downey D (2022) Multi-LexSum: real-world summaries of civil rights lawsuits at multiple granularities. In: Proceedings of the annual conference on neural information processing systems (NeurIPS)"},{"key":"9374_CR253","unstructured":"Shliazhko O, Fenogenova A, Tikhonova M, Mikhailov V, Kozlova A, Shavrina T (2022) mgpt: few-shot learners go multilingual. CoRR, arXiv:2204.07580"},{"key":"9374_CR254","unstructured":"Shukla A, Bhattacharya P, Poddar S, Mukherjee R, Ghosh K, Goyal P, Ghosh S (2022) Legal case document summarization: extractive and abstractive methods and their evaluation. In: Proceedings of the conference of the Asia-Pacific chapter of the Association for Computational Linguistics and the international joint conference on natural language processing (AACL\/IJCNLP)"},{"key":"9374_CR255","unstructured":"Simeri A, Tagarelli A (2023) Exploring domain and task adaptation of LamBERTa models for article retrieval on the Italian Civil Code. In: Proceedings of the conference on information and research science connecting to digital and library science (IRCDL), volume 3365 of CEUR workshop proceedings. CEUR-WS.org, pp 130\u2013143"},{"key":"9374_CR256","doi-asserted-by":"publisher","first-page":"75835","DOI":"10.1109\/ACCESS.2022.3190408","volume":"10","author":"D Song","year":"2022","unstructured":"Song D, Gao S, He B, Schilder F (2022) On the effectiveness of pre-trained language models for legal natural language processing: an empirical study. IEEE Access 10:75835\u201375858","journal-title":"IEEE Access"},{"key":"9374_CR257","doi-asserted-by":"crossref","unstructured":"Song F, Croft WB (1999) A general language model for information retrieval. In: Proceedings of the ACM conference on information and knowledge management (CIKM). ACM, pp 316\u2013321","DOI":"10.1145\/319950.320022"},{"key":"9374_CR258","unstructured":"Song K, Tan X, Qin T, Lu J, Liu T (2019) MASS: masked sequence to sequence pre-training for language generation. In: Proceedings of the international conference on machine learning (ICML), pp 5926\u20135936"},{"key":"9374_CR259","doi-asserted-by":"crossref","unstructured":"Souza F, Nogueira R, de\u00a0Alencar\u00a0Lotufo R (2020) BERTimbau: pretrained BERT models for Brazilian Portuguese. In: Proceedings of the Brazilian conferenec on intelligent systems (BRACIS), volume 12319 of lecture notes in computer science. Springer, pp 403\u2013417","DOI":"10.1007\/978-3-030-61377-8_28"},{"key":"9374_CR260","unstructured":"Steinberger R, Pouliquen B, Widiger A, Ignat C, Erjavec T, Tufis D, Varga D (2006) The JRC-Acquis: a multilingual aligned parallel corpus with 20+ languages. In: Proceedings of the international conference on language resources and evaluation (LREC). European Language Resources Association (ELRA), pp 2142\u20132147"},{"key":"9374_CR261","doi-asserted-by":"crossref","unstructured":"Strickson B, Iglesia BDL (2020) Legal judgement prediction for UK courts. In: Proceedings of the international conference on information science and systems (ICISS). ACM, pp 204\u2013209","DOI":"10.1145\/3388176.3388183"},{"key":"9374_CR262","unstructured":"Su J (2021) T5 pegasus\u2014zhuiyiai. Technical report"},{"key":"9374_CR263","unstructured":"Su J, Lu Y, Pan S, Wen B, Liu Y (2021) Roformer: enhanced transformer with rotary position embedding. CoRR, arXiv:2104.09864"},{"key":"9374_CR264","doi-asserted-by":"crossref","unstructured":"Sun M, Guo Z, Deng X (2021) Intelligent bert-bilstm-crf based legal case entity recognition method. In: Proceedings of the ACM turing award celebration conference. ACM, pp 186\u2013191","DOI":"10.1145\/3472634.3474069"},{"key":"9374_CR265","unstructured":"Sundararajan M, Taly A, Yan Q (2017) Axiomatic attribution for deep networks. In: Proceedings of the international conference on machine learning (ICML), volume\u00a070 of proceedings of machine learning research. PMLR, pp 3319\u20133328"},{"key":"9374_CR266","unstructured":"Surden H (2019) Artificial intelligence and law: an overview. 35 GA. ST. U. L. REV., 1305"},{"issue":"3","key":"9374_CR267","doi-asserted-by":"publisher","first-page":"417","DOI":"10.1007\/s10506-021-09301-8","volume":"30","author":"A Tagarelli","year":"2022","unstructured":"Tagarelli A, Simeri A (2022) Unsupervised law article mining based on deep pre-trained language representation models with application to the Italian civil code. Artif Intell Law 30(3):417\u2013473","journal-title":"Artif Intell Law"},{"key":"9374_CR268","doi-asserted-by":"crossref","unstructured":"Tang L Clematide S (2021) Searching for legal documents at paragraph level: automating label generation and use of an extended attention mask for boosting neural models of semantic similarity. In: Proceedings of the 2021 workshop on natural legal language processing (NLLP). ACL, pp 114\u2013122","DOI":"10.18653\/v1\/2021.nllp-1.12"},{"key":"9374_CR269","doi-asserted-by":"crossref","unstructured":"Tarasconi F, Botros M, Caserio M, Sportelli G, Giacalone G, Uttini C, Vignati L, Zanetta F (2020) Natural language processing applications in case-law text publishing. In: Proceedings of the conference on legal knowledge and information systems (JURIX), volume 334 of frontiers in artificial intelligence and applications. IOS Press, pp 154\u2013163","DOI":"10.3233\/FAIA200859"},{"key":"9374_CR270","unstructured":"Trautmann D, Petrova A, Schilder F (2022) Legal prompt engineering for multilingual legal judgement prediction. CoRR, arXiv:2212.02199"},{"key":"9374_CR271","doi-asserted-by":"crossref","unstructured":"Trias F, Wang H, Jaume S, Idreos S (2021) Named entity recognition in historic legal text: a transformer and state machine ensemble method. In: Proceedings of the 2021 workshop on natural legal language processing (NLLP). ACL, pp 172\u2013179","DOI":"10.18653\/v1\/2021.nllp-1.18"},{"issue":"2","key":"9374_CR272","doi-asserted-by":"publisher","first-page":"245","DOI":"10.1007\/s10115-013-0706-y","volume":"42","author":"I Triguero","year":"2015","unstructured":"Triguero I, Garc\u00eda S, Herrera F (2015) Self-labeled techniques for semi-supervised learning: taxonomy, software and empirical study. Knowl Inf Syst 42(2):245\u2013284","journal-title":"Knowl Inf Syst"},{"key":"9374_CR273","doi-asserted-by":"crossref","unstructured":"Trivedi H, Kwon H, Khot T, Sabharwal A, Balasubramanian N (2019) Repurposing entailment for multi-hop question answering tasks. In: Proceedings of the conference of the North American chapter of the Association for Computational Linguistics: human language technologies (NAACL-HLT). Association for Computational Linguistics, pp 2948\u20132958","DOI":"10.18653\/v1\/N19-1302"},{"key":"9374_CR274","doi-asserted-by":"crossref","unstructured":"Tsarapatsanis D, Aletras N (2021) On the ethical limits of natural language processing on legal text. In: Findings of the Association for Computational Linguistics: ACL\/IJCNLP 2021. Association for Computational Linguistics, pp 3590\u20133599","DOI":"10.18653\/v1\/2021.findings-acl.314"},{"key":"9374_CR275","unstructured":"Tuggener D, von D\u00e4niken P, Peetz T, Cieliebak M (2020) LEDGAR: a large-scale multi-label corpus for text classification of legal provisions in contracts. In: Proceedings of the international conference on language resources and evaluation (LREC). European Language Resources Association, pp 1235\u20131241"},{"key":"9374_CR276","doi-asserted-by":"crossref","unstructured":"Tziafas G, de\u00a0Saint-Phalle E, de\u00a0Vries W, Egger C, Caselli T (2021) A multilingual approach to identify and classify exceptional measures against COVID-19. In: Proceedings of the 2021 workshop on natural legal language processing (NLLP). ACL, pp 46\u201362","DOI":"10.18653\/v1\/2021.nllp-1.5"},{"key":"9374_CR277","doi-asserted-by":"crossref","unstructured":"Urchs S, Mitrovic J, Granitzer M (2021) Design and implementation of German legal decision corpora. In: Proceedings of the international conference on agents and artificial intelligence (ICAART), pp 515\u2013521","DOI":"10.5220\/0010187305150521"},{"key":"9374_CR278","unstructured":"Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser L, Polosukhin I (2017) Attention is all you need. In: Proceedings of the annual conference on neural information processing systems (NeurIPS), pp 5998\u20136008"},{"issue":"10","key":"9374_CR279","doi-asserted-by":"publisher","first-page":"107","DOI":"10.13053\/rcs-147-10-9","volume":"147","author":"S Verma","year":"2018","unstructured":"Verma S, Nidhi V (2018) Extractive summarization using deep learning. Res Comput Sci 147(10):107\u2013117","journal-title":"Res Comput Sci"},{"key":"9374_CR280","doi-asserted-by":"crossref","unstructured":"Vig J (2019) A multiscale visualization of attention in the transformer model. In: Proceedings of the annual meeting of the Association for Computational Linguistics (ACL): system demonstrations","DOI":"10.18653\/v1\/P19-3007"},{"key":"9374_CR281","doi-asserted-by":"crossref","unstructured":"Vold A, Conrad JG (2021) Using transformers to improve answer retrieval for legal questions. In: Proceedings of the international conference on artificial intelligence and law (ICAIL). ACM, pp 245\u2013249","DOI":"10.1145\/3462757.3466102"},{"key":"9374_CR282","doi-asserted-by":"crossref","unstructured":"Voorhees EM (2004) Overview of the TREC 2004 robust track. In: Proceedings of the Text REtrieval Conference (TREC), volume 500-261 of NIST special publication. National Institute of Standards and Technology (NIST)","DOI":"10.6028\/NIST.SP.500-261.robust-overview"},{"issue":"3","key":"9374_CR283","doi-asserted-by":"publisher","first-page":"601","DOI":"10.1007\/s10506-022-09319-6","volume":"31","author":"T Vuong","year":"2023","unstructured":"Vuong T, Bui QM, Nguyen H, Nguyen T, Tran V, Phan X, Satoh K, Le NM (2023) SM-BERT-CR: a deep learning approach for case law retrieval with supporting model. Artif Intell Law 31(3):601\u2013628","journal-title":"Artif Intell Law"},{"key":"9374_CR284","unstructured":"Walker VR, Pillaipakkamnatt K, Davidson AM, Linares M, Pesce DJ (2019) Automatic classification of rhetorical roles for sentences: comparing rule-based scripts with machine learning. In: Proceedings of the third workshop on automated semantic analysis of information in legal texts, co-located with the international conference on artificial intelligence and law (ICAIL), volume 2385 of CEUR Workshop Proceedings. CEUR-WS.org"},{"key":"9374_CR285","unstructured":"Wang A, Pruksachatkun Y, Nangia N, Singh A, Michael J, Hill F, Levy O, Bowman SR (2019) SuperGLUE: a stickier benchmark for general-purpose language understanding systems. In: Proceedings of the annual conference on neural information processing systems (NeurIPS), pp 3261\u20133275"},{"key":"9374_CR286","doi-asserted-by":"crossref","unstructured":"Wang A, Singh A, Michael J, Hill F, Levy O, Bowman SR (2018) GLUE: a multi-task benchmark and analysis platform for natural language understanding. In: Proceedings of the workshop on analyzing and interpreting neural networks for NLP, BlackboxNLP@EMNLP 2018. Association for Computational Linguistics, pp 353\u2013355","DOI":"10.18653\/v1\/W18-5446"},{"key":"9374_CR287","doi-asserted-by":"crossref","unstructured":"Wang Q, Zhao K, Amor R, Liu B, Wang R (2022) D2GCLF: document-to-graph classifier for legal document classification. In: Findings of the Association for Computational Linguistics: NAACL. Association for Computational Linguistics, pp 2208\u20132221","DOI":"10.18653\/v1\/2022.findings-naacl.170"},{"key":"9374_CR288","doi-asserted-by":"crossref","unstructured":"Wang S, Khabsa M, Ma H (2020a) To pretrain or not to pretrain: examining the benefits of pretraining on resource rich tasks. In: Proceedings of the annual meeting of the Association for Computational Linguistics (ACL). Association for Computational Linguistics, pp 2209\u20132213","DOI":"10.18653\/v1\/2020.acl-main.200"},{"key":"9374_CR289","doi-asserted-by":"crossref","unstructured":"Wang SH, Scardigli A, Tang L, Chen W, Levkin D, Chen A, Ball S, Woodside T, Zhang O, Hendrycks D (2023) MAUD: an expert-annotated legal NLP dataset for merger agreement understanding. CoRR, arXiv:2301.00876","DOI":"10.18653\/v1\/2023.emnlp-main.1019"},{"key":"9374_CR290","doi-asserted-by":"crossref","unstructured":"Wang W, Wei F, Dong L, Bao H, Yang N, Zhou M (2020b) MiniLM: deep self-attention distillation for task-agnostic compression of pre-trained transformers. In: Proceedings of the annual conference on neural information processing systems (NeurIPS)","DOI":"10.18653\/v1\/2021.findings-acl.188"},{"key":"9374_CR291","unstructured":"Wang Y, Xiao C, Ma S, Zhong H, Tu C, Zhang T, Liu Z, Sun M (2021) Equality before the law: legal judgment consistency analysis for fairness. CoRR, arXiv:2103.13868"},{"issue":"1","key":"9374_CR292","doi-asserted-by":"publisher","first-page":"197","DOI":"10.1007\/s12626-022-00101-3","volume":"16","author":"S Wehnert","year":"2022","unstructured":"Wehnert S, Dureja S, Kutty L, Sudhi V, Luca EWD (2022) Applying BERT embeddings to predict legal textual entailment. Rev. Socionetwork Strateg. 16(1):197\u2013219","journal-title":"Rev. Socionetwork Strateg."},{"key":"9374_CR293","doi-asserted-by":"crossref","unstructured":"Wehnert S, Sudhi V, Dureja S, Kutty L, Shahania S, Luca EWD (2021) Legal norm retrieval with variations of the BERT model combined with TF-IDF vectorization. In: Proceedings of the international conference on artificial intelligence and law (ICAIL). ACM","DOI":"10.1145\/3462757.3466104"},{"key":"9374_CR294","doi-asserted-by":"publisher","first-page":"287","DOI":"10.1162\/tacl_a_00021","volume":"6","author":"J Welbl","year":"2018","unstructured":"Welbl J, Stenetorp P, Riedel S (2018) Constructing datasets for multi-hop reading comprehension across documents. Trans Assoc Comput Linguist 6:287\u2013302","journal-title":"Trans Assoc Comput Linguist"},{"key":"9374_CR295","doi-asserted-by":"crossref","unstructured":"Westermann H, Savelka J, Benyekhlef K (2020) Paragraph similarity scoring and fine-tuned BERT for legal information retrieval and entailment. In: New frontiers in artificial intelligence\u2014SAI-isAI 2020 workshops, JURISIN, LENLS 2020 Workshops, volume 12758 of lecture notes in computer science. Springer, pp 269\u2013285","DOI":"10.1007\/978-3-030-79942-7_18"},{"key":"9374_CR296","doi-asserted-by":"crossref","unstructured":"Wilson S, Schaub F, Dara AA, Liu F, Cherivirala S, Leon PG, Andersen MS, Zimmeck S, Sathyendra KM, Russell NC, Norton TB, Hovy EH, Reidenberg JR, Sadeh NM (2016) The creation and analysis of a website privacy policy corpus. In: Proceedings of the annual meeting of the Association for Computational Linguistics (ACL). Association for Computer Linguistics","DOI":"10.18653\/v1\/P16-1126"},{"key":"9374_CR297","doi-asserted-by":"crossref","unstructured":"Wrzalik M, Krechel D (2021) GerDaLIR: a German dataset for legal information retrieval. In: Proceedings of the 2021 workshop on natural legal language processing (NLLP). ACL, pp 123\u2013128","DOI":"10.18653\/v1\/2021.nllp-1.13"},{"key":"9374_CR298","unstructured":"Wu Y, Schuster M, Chen Z, Le QV, Norouzi M, Macherey W, Krikun M, Cao Y, Gao Q, Macherey K, Klingner J, Shah A, Johnson M, Liu X, Kaiser L, Gouws S, Kato Y, Kudo T, Kazawa H, Stevens K, Kurian G, Patil N, Wang W, Young C, Smith J, Riesa J, Rudnick A, Vinyals O, Corrado G, Hughes M, Dean J (2016) Google\u2019s neural machine translation system: bridging the gap between human and machine translation. CoRR, arXiv:1609.08144"},{"key":"9374_CR299","doi-asserted-by":"publisher","first-page":"79","DOI":"10.1016\/j.aiopen.2021.06.003","volume":"2","author":"C Xiao","year":"2021","unstructured":"Xiao C, Hu X, Liu Z, Tu C, Sun M (2021) Lawformer: a pre-trained language model for Chinese legal long documents. AI Open 2:79\u201384","journal-title":"AI Open"},{"key":"9374_CR300","unstructured":"Xiao C, Zhong H, Guo Z, Tu C, Liu Z, Sun M, Feng Y, Han X, Hu Z, Wang H, Xu J (2018) CAIL2018: a large-scale legal dataset for judgment prediction. CoRR, arXiv:1807.02478"},{"key":"9374_CR301","unstructured":"Xiao C, Zhong H, Guo Z, Tu C, Liu Z, Sun M, Zhang T, Han X, Hu Z, Wang H, Xu J (2019) CAIL2019-SCM: a dataset of similar case matching in legal domain. CoRR, arXiv:1911.08962"},{"key":"9374_CR302","doi-asserted-by":"crossref","unstructured":"Xiao W, Beltagy I, Carenini G, Cohan A (2022) PRIMERA: pyramid-based masked sentence pre-training for multi-document summarization. In: Proceedings of the annual meeting of the Association for Computational Linguistics (ACL). Association for Computational Linguistics, pp 5245\u20135263","DOI":"10.18653\/v1\/2022.acl-long.360"},{"key":"9374_CR303","doi-asserted-by":"crossref","unstructured":"Xiong C, Callan J, Liu T (2017) Word-entity duet representations for document ranking. In: Proceedings of the ACM SIGIR conference on research and development in information retrieval (SIGIR). ACM, pp 763\u2013772","DOI":"10.1145\/3077136.3080768"},{"key":"9374_CR304","doi-asserted-by":"crossref","unstructured":"Xue L, Constant N, Roberts A, Kale M, Al-Rfou R, Siddhant A, Barua A, Raffel C (2021) mT5: a massively multilingual pre-trained text-to-text transformer. In: Proceedings of the conference of the North American chapter of the Association for Computational Linguistics: human language technologies (NAACL-HLT). Association for Computational Linguistics, pp 483\u2013498","DOI":"10.18653\/v1\/2021.naacl-main.41"},{"key":"9374_CR305","unstructured":"Yamada H, Tokunaga T (2019) A performance study on fine-tuned large language models in the Legal Case Entailment Task. In: Proceedings of COLIEE 2019 workshop: competition on legal information extraction\/entailment"},{"key":"9374_CR306","unstructured":"Yang J, Jin H, Tang R, Han X, Feng Q, Jiang H, Yin B, Hu X (2023) Harnessing the power of LLMs in practice: a survey on ChatGPT and beyond. CoRR, arXiv:2304.13712v2"},{"key":"9374_CR307","unstructured":"Yang Z, Dai Z, Yang Y, Carbonell JG, Salakhutdinov R, Le QV (2019) XLNet: generalized autoregressive pretraining for language understanding. In: Proceedings of the annual conference on neural information processing systems (NeurIPS), pp 5754\u20135764"},{"key":"9374_CR308","doi-asserted-by":"crossref","unstructured":"Yang Z, Yang D, Dyer C, He X, Smola AJ, Hovy EH (2016) Hierarchical attention networks for document classification. In: Proceedings of the conferenec of the North American chapter of the Association for Computational Linguistics: human language technologies (NAACL-HLT). Association for Computational Linguistics, pp 1480\u20131489","DOI":"10.18653\/v1\/N16-1174"},{"key":"9374_CR309","doi-asserted-by":"crossref","unstructured":"Yoon J, Junaid M, Ali S, Lee J (2022) Abstractive summarization of Korean legal cases using pre-trained language models. In: Proceedings of the international conference on ubiquitous information management and communication (IMCOM). IEEE, pp 1\u20137","DOI":"10.1109\/IMCOM53663.2022.9721808"},{"key":"9374_CR310","doi-asserted-by":"crossref","unstructured":"Yoshioka M, Aoki Y, Suzuki Y (2021a) BERT-based ensemble methods with data augmentation for legal textual entailment in COLIEE statute law task. In: Proceedings of the international conference on artificial intelligence and law (ICAIL). ACM, pp 278\u2013284","DOI":"10.1145\/3462757.3466105"},{"key":"9374_CR311","doi-asserted-by":"crossref","unstructured":"Yoshioka M, Suzuki Y, Aoki Y (2021b) BERT-based ensemble methods for information retrieval and legal textual entailment in COLIEE statute law task. In: Proceedings of the eighth international competition on legal information extraction\/entailment (COLIEE 2021), pp 78\u201383","DOI":"10.1145\/3462757.3466105"},{"key":"9374_CR312","unstructured":"Yu F, Quartey L, Schilder F (2022a) Legal prompting: teaching a language model to think like a lawyer. CoRR, arXiv:2212.01326"},{"key":"9374_CR313","doi-asserted-by":"crossref","unstructured":"Yu W, Sun Z, Xu J, Dong Z, Chen X, Xu H, Wen J (2022b) Explainable legal case matching via inverse optimal transport-based rationale extraction. In: Proceedings of the ACM SIGIR conference on research and development in information retrieval (SIGIR). ACM, pp 657\u2013668","DOI":"10.1145\/3477495.3531974"},{"key":"9374_CR314","unstructured":"Yun S, Jeong M, Kim R, Kang J, Kim HJ (2019) Graph transformer networks. In: Proceedings of the annual conference on neural information processing systems (NeurIPS), pp 11960\u201311970"},{"key":"9374_CR315","unstructured":"Zaheer M, Guruganesh G, Dubey KA, Ainslie J, Alberti C, Onta\u00f1\u00f3n S, Pham P, Ravula A, Wang Q, Yang L, Ahmed A (2020) Big Bird: transformers for longer sequences. In: Proceedings of the annual conference on neural information processing systems (NeurIPS)"},{"key":"9374_CR316","doi-asserted-by":"crossref","unstructured":"Zanzotto FM, Santilli A, Ranaldi L, Onorati D, Tommasino P, Fallucchi F (2020) KERMIT: complementing transformer architectures with encoders of explicit syntactic interpretations. In: Proceedings of the conference on empirical methods in natural language processing (EMNLP). Association for Computational Linguistics, pp 256\u2013267","DOI":"10.18653\/v1\/2020.emnlp-main.18"},{"key":"9374_CR317","doi-asserted-by":"crossref","unstructured":"Zeiler MD, Fergus R (2014) Visualizing and understanding convolutional networks. In: Proceedings of the European conference on computer vision (ECCV), volume 8689 of lecture notes in computer science. Springer, pp 818\u2013833","DOI":"10.1007\/978-3-319-10590-1_53"},{"key":"9374_CR318","doi-asserted-by":"crossref","unstructured":"Zellers R, Bisk Y, Schwartz R, Choi Y (2018) SWAG: a large-scale adversarial dataset for grounded commonsense inference. In: Proceedings of the conference on empirical methods in natural language processing (EMNLP). Association for Computational Linguistics, pp 93\u2013104","DOI":"10.18653\/v1\/D18-1009"},{"key":"9374_CR319","doi-asserted-by":"crossref","unstructured":"Zhang B, Xiong D, Su J (2018) Accelerating neural transformer via an average attention network. In: Proceedings of the annual meeting of the Association for Computational Linguistics (ACL). Association for Computational Linguistics, pp 1789\u20131798","DOI":"10.18653\/v1\/P18-1166"},{"key":"9374_CR320","unstructured":"Zhang J, Zhao Y, Saleh M, Liu PJ (2020) PEGASUS: pre-training with extracted gap-sentences for abstractive summarization. In: Proceedings of the international conference on machine learning (ICML), pp 11328\u201311339"},{"key":"9374_CR321","doi-asserted-by":"crossref","unstructured":"Zheng L, Guha N, Anderson BR, Henderson P, Ho DE (2021) When does pretraining help? Assessing self-supervised learning for law and the CaseHOLD dataset of 53,000+ legal holdings. In: Proceedings of the international conference on artificial intelligence and law (ICAIL). ACM, pp 159\u2013168","DOI":"10.1145\/3462757.3466088"},{"key":"9374_CR322","doi-asserted-by":"crossref","unstructured":"Zhong H, Guo Z, Tu C, Xiao C, Liu Z, Sun M (2018) Legal judgment prediction via topological learning. In: Proceedings of the conference on empirical methods in natural language processing (EMNLP). Association for Computational Linguistics, pp 3540\u20133549","DOI":"10.18653\/v1\/D18-1390"},{"key":"9374_CR323","doi-asserted-by":"crossref","unstructured":"Zhong H, Xiao C, Tu C, Zhang T, Liu Z, Sun M (2020) How does NLP benefit legal system: a summary of legal artificial intelligence. In: Proceedings of the annual meeting of the Association for Computational Linguistics. Association for Computational Linguistics, pp 5218\u20135230","DOI":"10.18653\/v1\/2020.acl-main.466"},{"key":"9374_CR324","unstructured":"Zhong H, Zhang Z, Liu Z, Sun M (2019a) Open Chinese language pre-trained model zoo. Technical report"},{"key":"9374_CR325","doi-asserted-by":"crossref","unstructured":"Zhong L, Zhong Z, Zhao Z, Wang S, Ashley KD, Grabmair M (2019b) Automatic summarization of legal decisions using iterative masking of predictive sentences. In: Proceedings of the international conference on artificial intelligence and law (ICAIL). ACM, pp 163\u2013172","DOI":"10.1145\/3322640.3326728"},{"issue":"3","key":"9374_CR326","doi-asserted-by":"publisher","first-page":"66","DOI":"10.2478\/popets-2019-0037","volume":"2019","author":"S Zimmeck","year":"2019","unstructured":"Zimmeck S, Story P, Smullen D, Ravichander A, Wang Z, Reidenberg JR, Russell NC, Sadeh NM (2019) MAPS: scaling privacy compliance analysis to a million apps. Proc Privacy Enhanc Technol 2019(3):66\u201386","journal-title":"Proc Privacy Enhanc Technol"}],"container-title":["Artificial Intelligence and Law"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10506-023-09374-7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10506-023-09374-7\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10506-023-09374-7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,10,23]],"date-time":"2024-10-23T12:12:29Z","timestamp":1729685549000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10506-023-09374-7"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,11,20]]},"references-count":326,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2024,12]]}},"alternative-id":["9374"],"URL":"https:\/\/doi.org\/10.1007\/s10506-023-09374-7","relation":{},"ISSN":["0924-8463","1572-8382"],"issn-type":[{"value":"0924-8463","type":"print"},{"value":"1572-8382","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,11,20]]},"assertion":[{"value":"15 September 2023","order":1,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"20 November 2023","order":2,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}