About

NLLP Workshop 2025 took place on 8 November 2025, co-located with the EMNLP 2025 conference.

The workshop proceedings are available here.

The recording of the workshop is available here.

Sponsors

Program

All times are in the CST time zone.

09:00 - 10:40 Session 1
09:00 - 09:20 Workshop opening
09:20 - 09:25 LeMAJ (Legal LLM-as-a-Judge): Bridging Legal Reasoning and LLM Evaluation
Joseph Enguehard, Morgane Van Ermengem, Kate Atkinson, Sujeong Cha, Arijit Ghosh Chowdhury, Prashanth Kallur Ramaswamy, Jeremy Roghair, Hannah R Marlowe, Carina Suzana Negreanu, Kitty Boxall, Diana Mincu
09:25 - 09:30 PILOT-Bench: A Benchmark for Legal Reasoning in the Patent Domain with IRAC-Aligned Classification Tasks
Yehoon Jang, Chaewon Lee, Hyun-seok Min, Sungchul Choi
09:30 - 09:35 Contemporary LLMs struggle with extracting formal legal arguments
Lena Held, Ivan Habernal
09:35 - 09:40 Evaluating the Role of Verifiers in Test-Time Scaling for Legal Reasoning Tasks
Davide Romano, Jonathan Richard Schwarz, Daniele Giofrè
09:40 - 09:45 GreekBarBench: A Challenging Benchmark for Free-Text Legal Reasoning and Citations
Odysseas S. Chlapanis, Dimitris Galanis, Nikolaos Aletras, Ion Androutsopoulos
09:45 - 09:50 Evaluating Test-Time Scaling LLMs for Legal Reasoning: OpenAI o1, DeepSeek-R1, and Beyond
Yinghao Hu, Yaoyao Yu, Leilei Gan, Bin Wei, Kun Kuang, Fei Wu
09:50 - 09:55 Are LLMs Court-Ready? Evaluating Frontier Models on Indian Legal Reasoning
Kush Juvekar, Arghya Bhattacharya, Sai Khadloya, Utkarsh Saxena
09:55 - 10:15 Joint Q&A
10:15 - 10:20 The Automated but Risky Game: Modeling Agent-to-Agent Negotiations and Transactions in Consumer Markets
Shenzhe Zhu, Jiao Sun, Yi Nian, Tobin South, Alex Pentland, Jiaxin Pei
10:20 - 10:25 LegalSim: Multi-Agent Simulation of Legal Systems for Discovering Procedural Exploits
Sanket Badhe
10:25 - 10:30 Label-Free Distinctiveness: Building a Continuous Trademark Scale via Synthetic Anchors
Huihui Xu, Kevin D. Ashley
10:30 - 10:40 Joint Q&A
10:40 - 11:10 Break
11:10 - 12:30 Session 2
11:10 - 11:15 Validate Your Authority: Benchmarking LLMs on Multi-Label Precedent Treatment Classification
M. Mikail Demir, M Abdullah Canbaz
11:15 - 11:20 ContractEval: Benchmarking LLMs for Clause-Level Legal Risk Identification in Commercial Contracts
Shuang Liu, Zelong Li, Ruoyun Ma, Haiyan Zhao, Mengnan Du
11:20 - 11:25 Efficient Prompt Optimisation for Legal Text Classification with Proxy Prompt Evaluator
Hyunji Lee, Kevin Chenhao Li, Matthias Grabmair, Shanshan Xu
11:25 - 11:30 Not ready for the bench: LLM legal interpretation is unstable and uncalibrated to human judgments
Abhishek Purushothama, Junghyun Min, Brandon Waldon, Nathan Schneider
11:30 - 11:45 Joint Q&A
11:45 - 11:50 GReX: A Graph Neural Network-Based Rerank-then-Expand Method for Detecting Conflicts Among Legal Articles in Korean Criminal Law
Seonho An, Young-Yik Rhim, Min-Soo Kim
11:50 - 11:55 Tracing Definitions: Lessons from Alliance Contracts in the Biopharmaceutical Industry
Maximilian Kreutner, Doerte Leusmann, Florian Lemmerich, Carolin Haeussler
11:55 - 12:00 CourtNav: Voice-Guided, Anchor-Accurate Navigation of Long Legal Documents in Courtrooms
Sai Khadloya, Kush Juvekar, Arghya Bhattacharya, Utkarsh Saxena
12:00 - 12:05 NyayGraph: A Knowledge Graph Enhanced Approach for Legal Statute Identification in Indian Law using Large Language Models
Siddharth Shukla, Tanuj Tyagi, Abhay Singh Bisht, Ashish Sharma, Basant Agarwal
12:05 - 12:10 Beyond the Haystack: Sensitivity to Context in Legal Reference Recall
Eric Xia, Karthik Srikumar, Keshav Karthik, Advaith Renjith, Ashwinee Panda
12:10 - 12:30 Joint Q&A
12:30 - 14:00 Lunch & In-Person Poster Session (Lunch provided)
14:00 - 15:30 Session 3
14:00 - 14:05 A Framework to Retrieve Relevant Laws for Will Execution
Md Asiful Islam, Alice Saebom Kwak, Derek Bambauer, Clayton T Morrison, Mihai Surdeanu
14:05 - 14:10 Grounded Answers from Multi-Passage Regulations: Learning-to-Rank for Regulatory RAG
Tuba Gokhan, Ted Briscoe
14:10 - 14:15 GuRE: Generative Query REwriter for Legal Passage Retrieval
Daehui Kim, Deokhyung Kang, Jonghwi Kim, Sangwon Ryu, Gary Lee
14:15 - 14:20 Towards Reliable Retrieval in RAG Systems for Large Legal Datasets
Markus Reuter, Tobias Lingenberg, Ruta Liepina, Francesca Lagioia, Marco Lippi, Giovanni Sartor, Andrea Passerini, Burcu Sayin
14:20 - 14:25 KoLEG: On-the-Fly Korean Legal Knowledge Editing with Continuous Retrieval
Jaehyung Seo, Dahyun Jung, Jaewook Lee, Yongchan Chun, Dongjun Kim, Hwijung Ryu, Donghoon Shin, Heuiseok Lim
14:25 - 14:45 Joint Q&A
14:45 - 14:50 Labor Lex: A New Portuguese Corpus and Pipeline for Information Extraction in Brazilian Legal Texts
Pedro Vitor Quinta de Castro, Nadia Felix Felipe da Silva
14:50 - 14:55 Aligning LLMs for Thai Legal Question Answering with Efficient Semantic-Similarity Rewards
Pawitsapak Akarajaradwong, Chompakorn Chaksangchaichot, Pirat Pothavorn, Ekapol Chuangsuwanich, Attapol Rutherford, Sarana Nutanong
14:55 - 15:00 ClaimGen-CN: A Large-scale Chinese Dataset for Legal Claim Generation
Siying Zhou, Yiquan Wu, Hui Chen, Xueyu Hu, Kun Kuang, Adam Jatowt, Chunyan Zheng, Fei Wu
15:00 - 15:05 Translating Tax Law to Code with LLMs: A Benchmark and Evaluation Framework
Gabriele Lorenzo, Aldo Pietromatera, Nils Holzenberger
15:05 - 15:10 Linking Transparency and Accountability: Analysing The Connection Between TikTok's Terms of Service and Moderation Decisions
Leonard Eßer, Gerasimos Spanakis
15:10 - 15:30 Joint Q&A
15:30 - 16:00 Break
16:00 - 17:30 Session 4
16:00 - 16:05 Modeling Motivated Reasoning in Law: Evaluating Strategic Role Conditioning in LLM Summarization
Eunjung Cho, Alexander Miserlis Hoyle, Yoan Hermstrüwer
16:05 - 16:10 Domain Adapted Text Summarization with Self-Generated Guidelines
Andrianos Michail, Bartosz Rudnikowicz, Pavlos Fragkogiannis, Cristina Kadar
16:10 - 16:15 Risks and Limits of Automatic Consolidation of Statutes
Max Prior, Adrian Hof, Niklas Wais, Matthias Grabmair
16:15 - 16:20 Unlocking Legal Knowledge: A Multilingual Dataset for Judicial Summarization in Switzerland
Luca Rolshoven, Vishvaksenan Rasiah, Srinanda Brügger Bose, Sarah Hostettler, Lara Burkhalter, Matthias Stürmer, Joel Niklaus
16:20 - 16:25 Extract-Explain-Abstract: A Rhetorical Role-Driven Domain-Specific Summarisation Framework for Indian Legal Documents
Veer Chheda, Aaditya Uday Ghaisas, Avantika Sankhe, Dr. Narendra Shekokar
16:25 - 16:45 Joint Q&A
16:45 - 16:50 Copyright Infringement by Large Language Models in the EU: Misalignment, Safeguards, and the Path Forward
Noah Scharrenberg, Chang Sun
16:50 - 16:55 Machine Unlearning of Personally Identifiable Information in Large Language Models
Dan Parii, Thomas van Osch, Chang Sun
16:55 - 17:00 Evaluating LLM-Generated Legal Explanations for Regulatory Compliance in Social Media Influencer Marketing
Haoyang Gui, Thales Bertaglia, Taylor Annabell, Catalina Goanta, Tjomme Dooper, Gerasimos Spanakis
17:00 - 17:05 Nine Ways to Break Copyright Law and Why Our LLM Won’t: A Fair Use Aligned Generation Framework
Aakash Sen Sharma, Debdeep Sanyal, Priyansh Srivastava, Sundar Athreya H, Shirish Karande, Mohan Kankanhalli, Murari Mandal
17:05 - 17:20 Joint Q&A
17:20 - 17:30 Best Presentation Award and Closing

Organizing Committee

Program Committee

  • Sallam Abualhaija - University of Luxembourg (Luxembourg)
  • Tomaso Agnoloni - Institute of Legal Information Theory and Technologies (Italy)
  • Ion Androutsopoulos - Athens University of Economics and Business (Greece)
  • Tom Ault - Bloomberg (US)
  • Jaap Baaij - Utrecht University (The Netherlands)
  • Ilayda Balaban - Utrecht University (The Netherlands)
  • Breck Baldwin - Columbia University (US)
  • Claire Barale - University of Edinburgh (UK)
  • Thales Bertaglia - Maastricht University (The Netherlands)
  • Floris Bex - Utrecht University (The Netherlands)
  • Luca Cagliero - Politecnico di Torino (Italy)
  • Jiahong Chen - University of Sheffield (UK)
  • Odysseas Spyridon Chlapanis - Athens University of Economics and Business (Greece)
  • Ashish Chouhan - Heidelberg University & SRH Hochschule Heidelberg (Germany)
  • Bram Duivenvoorde - Utrecht University (The Netherlands)
  • Dominik Dworniczak - University of Zurich (Switzerland)
  • Arthur Dyevre - KU Leuven (Belgium)
  • Dimitrios Galanis - Athena Research Center (Greece)
  • Piyush Ghai - Relativity (US)
  • Ivan Habernal - Technical University of Darmstadt (Germany)
  • Ben Hagag - Darrow (Israel)
  • Nils Holzenberger - Johns Hopkins University (US)
  • Abe Hou - Johns Hopkins University (US)
  • Abderrahmane Issam - Maastricht University (The Netherlands)
  • Constantinos Karouzos - University of Sheffield (UK)
  • Aykut Koç - Bilkent University (Turkey)
  • Hellen van der Kroef - Maastricht University (The Netherlands)
  • Alice Kwak - University of Arizona (US)
  • Tong Liang - Dynosaur Tech (US)
  • Ruta Liepina - University of Bologna (Italy)
  • Chu Luo - Queen's University (Canada)
  • Megan Ma - Stanford Law School (US)
  • Pawel Maka - Maastricht University (The Netherlands)
  • Adam Meyers - New York University (US)
  • Jelena Mitrović - University of Passau (Germany) & Institute for AI R&D of Serbia (Serbia)
  • Rohan Nanda - Maastricht University (The Netherlands)
  • Joel Niklaus - University of Bern (Switzerland)
  • Henrik Palmer Olsen - University of Copenhagen (Denmark)
  • Katsikouli Panagiota - University of Copenhagen (Denmark)
  • Ioannis Panagis - University of Copenhagen (Denmark)
  • Anu Pradhan - Bloomberg (US)
  • Paulo Quaresma - University of Evora (Portugal)
  • Vageesh Saxena - Maastricht University (The Netherlands)
  • Yusuf Can Semerci - Maastricht University (The Netherlands)
  • Gil Semo - Darrow (Israel)
  • Madhavan Seshadri - Bloomberg (US)
  • Samyak Sheth - Maastricht University (The Netherlands)
  • Dan Simonson - BlackBoiler LLC (US)
  • Christoph Sorge - Universität des Saarlandes (Germany)
  • Jerrold Soh - Singapore Management University (Singapore)
  • Alexandru Sotropa - VU Amsterdam & eMAG (The Netherlands/Romania)
  • Ieva Staliunaite - University of Cambridge (UK)
  • T.Y.S.S. Santosh - Amazon (US)
  • Dimitrios Tsarapatsanis - University of York (UK)
  • Kalpana Tyagi - Maastricht University (The Netherlands)
  • Gijs Van Dijck - Maastricht University (The Netherlands)
  • Jianqiang Wang - University at Buffalo (US)
  • Hannes Westermann - Maastricht University (The Netherlands)
  • ShanShan Xu - Technical University of Munich (Germany)
  • Huiyin Xue - University of Sheffield (UK)
  • Marcos Zampieri - Rochester Institute of Technology (US)
  • Frederike Zufall - Max Planck Institute for Research on Collective Goods (Germany)
  • Miri Zilka - University of Cambridge (UK)