About
NLLP Workshop 2025 took place on 8 November 2025, co-located with the EMNLP 2025 conference.
The workshop proceedings are available here.
The recording of the workshop is available here:
Sponsors
Program
All times are in the CST time zone.
| 09:00 - 10:40 | Session 1 |
| 09:00 - 09:20 | Workshop opening |
| 09:20 - 09:25 | LeMAJ (Legal LLM-as-a-Judge): Bridging Legal Reasoning and LLM Evaluation Joseph Enguehard, Morgane Van Ermengem, Kate Atkinson, Sujeong Cha, Arijit Ghosh Chowdhury, Prashanth Kallur Ramaswamy, Jeremy Roghair, Hannah R Marlowe, Carina Suzana Negreanu, Kitty Boxall, Diana Mincu |
| 09:25 - 09:30 | PILOT-Bench: A Benchmark for Legal Reasoning in the Patent Domain with IRAC-Aligned Classification Tasks Yehoon Jang, Chaewon Lee, Hyun-seok Min, Sungchul Choi |
| 09:30 - 09:35 | Contemporary LLMs struggle with extracting formal legal arguments Lena Held, Ivan Habernal |
| 09:35 - 09:40 | Evaluating the Role of Verifiers in Test-Time Scaling for Legal Reasoning Tasks Davide Romano, Jonathan Richard Schwarz, Daniele Giofrè |
| 09:40 - 09:45 | GreekBarBench: A Challenging Benchmark for Free-Text Legal Reasoning and Citations Odysseas S. Chlapanis, Dimitris Galanis, Nikolaos Aletras, Ion Androutsopoulos |
| 09:45 - 09:50 | Evaluating Test-Time Scaling LLMs for Legal Reasoning: OpenAI o1, DeepSeek-R1, and Beyond Yinghao Hu, Yaoyao Yu, Leilei Gan, Bin Wei, Kun Kuang, Fei Wu |
| 09:50 - 09:55 | Are LLMs Court-Ready? Evaluating Frontier Models on Indian Legal Reasoning Kush Juvekar, Arghya Bhattacharya, Sai Khadloya, Utkarsh Saxena |
| 09:55 - 10:15 | Joint Q&A |
| 10:15 - 10:20 | The Automated but Risky Game: Modeling Agent-to-Agent Negotiations and Transactions in Consumer Markets Shenzhe Zhu, Jiao Sun, Yi Nian, Tobin South, Alex Pentland, Jiaxin Pei |
| 10:20 - 10:25 | LegalSim: Multi-Agent Simulation of Legal Systems for Discovering Procedural Exploits Sanket Badhe |
| 10:25 - 10:30 | Label-Free Distinctiveness: Building a Continuous Trademark Scale via Synthetic Anchors Huihui Xu, Kevin D. Ashley |
| 10:30 - 10:40 | Joint Q&A |
| 10:40 - 11:10 | Break |
| 11:10 - 12:30 | Session 2 |
| 11:10 - 11:15 | Validate Your Authority: Benchmarking LLMs on Multi-Label Precedent Treatment Classification M. Mikail Demir, M Abdullah Canbaz |
| 11:15 - 11:20 | ContractEval: Benchmarking LLMs for Clause-Level Legal Risk Identification in Commercial Contracts Shuang Liu, Zelong Li, Ruoyun Ma, Haiyan Zhao, Mengnan Du |
| 11:20 - 11:25 | Efficient Prompt Optimisation for Legal Text Classification with Proxy Prompt Evaluator Hyunji Lee, Kevin Chenhao Li, Matthias Grabmair, Shanshan Xu |
| 11:25 - 11:30 | Not ready for the bench: LLM legal interpretation is unstable and uncalibrated to human judgments Abhishek Purushothama, Junghyun Min, Brandon Waldon, Nathan Schneider |
| 11:30 - 11:45 | Joint Q&A |
| 11:45 - 11:50 | GReX: A Graph Neural Network-Based Rerank-then-Expand Method for Detecting Conflicts Among Legal Articles in Korean Criminal Law Seonho An, Young-Yik Rhim, Min-Soo Kim |
| 11:50 - 11:55 | Tracing Definitions: Lessons from Alliance Contracts in the Biopharmaceutical Industry Maximilian Kreutner, Doerte Leusmann, Florian Lemmerich, Carolin Haeussler |
| 11:55 - 12:00 | CourtNav: Voice-Guided, Anchor-Accurate Navigation of Long Legal Documents in Courtrooms Sai Khadloya, Kush Juvekar, Arghya Bhattacharya, Utkarsh Saxena |
| 12:00 - 12:05 | NyayGraph: A Knowledge Graph Enhanced Approach for Legal Statute Identification in Indian Law using Large Language Models Siddharth Shukla, Tanuj Tyagi, Abhay Singh Bisht, Ashish Sharma, Basant Agarwal |
| 12:05 - 12:10 | Beyond the Haystack: Sensitivity to Context in Legal Reference Recall Eric Xia, Karthik Srikumar, Keshav Karthik, Advaith Renjith, Ashwinee Panda |
| 12:10 - 12:30 | Joint Q&A |
| 12:30 - 14:00 | Lunch & In-Person Poster Session (Lunch provided) |
| 14:00 - 15:30 | Session 3 |
| 14:00 - 14:05 | A Framework to Retrieve Relevant Laws for Will Execution Md Asiful Islam, Alice Saebom Kwak, Derek Bambauer, Clayton T Morrison, Mihai Surdeanu |
| 14:05 - 14:10 | Grounded Answers from Multi-Passage Regulations: Learning-to-Rank for Regulatory RAG Tuba Gokhan, Ted Briscoe |
| 14:10 - 14:15 | GuRE:Generative Query REwriter for Legal Passage Retrieval Daehui Kim, Deokhyung Kang, Jonghwi Kim, Sangwon Ryu, Gary Lee |
| 14:15 - 14:20 | Towards Reliable Retrieval in RAG Systems for Large Legal Datasets Markus Reuter, Tobias Lingenberg, Ruta Liepina, Francesca Lagioia, Marco Lippi, Giovanni Sartor, Andrea Passerini, Burcu Sayin |
| 14:20 - 14:25 | KoLEG: On-the-Fly Korean Legal Knowledge Editing with Continuous Retrieval Jaehyung Seo, Dahyun Jung, Jaewook Lee, Yongchan Chun, Dongjun Kim, Hwijung Ryu, Donghoon Shin, Heuiseok Lim |
| 14:25 - 14:45 | Joint Q&A |
| 14:45 - 14:50 | Labor Lex: A New Portuguese Corpus and Pipeline for Information Extraction in Brazilian Legal Texts Pedro Vitor Quinta de Castro, Nadia Felix Felipe da Silva |
| 14:50 - 14:55 | Aligning LLMs for Thai Legal Question Answering with Efficient Semantic-Similarity Rewards Pawitsapak Akarajaradwong, Chompakorn Chaksangchaichot, Pirat Pothavorn, Ekapol Chuangsuwanich, Attapol Rutherford, Sarana Nutanong |
| 14:55 - 15:00 | ClaimGen-CN: A Large-scale Chinese Dataset for Legal Claim Generation Siying Zhou, Yiquan Wu, Hui Chen, Xueyu Hu, Kun Kuang, Adam Jatowt, Chunyan Zheng, Fei Wu |
| 15:00 - 15:05 | Translating Tax Law to Code with LLMs: A Benchmark and Evaluation Framework Gabriele Lorenzo, Aldo Pietromatera, Nils Holzenberger |
| 15:05 - 15:10 | Linking Transparency and Accountability: Analysing The Connection Between TikTok's Terms of Service and Moderation Decisions Leonard Eßer, Gerasimos Spanakis |
| 15:10 - 15:30 | Joint Q&A |
| 15:30 - 16:00 | Break |
| 16:00 - 17:30 | Session 4 |
| 16:00 - 16:05 | Modeling Motivated Reasoning in Law: Evaluating Strategic Role Conditioning in LLM Summarization Eunjung Cho, Alexander Miserlis Hoyle, Yoan Hermstrüwer |
| 16:05 - 16:10 | Domain Adapted Text Summarization with Self-Generated Guidelines Andrianos Michail, Bartosz Rudnikowicz, Pavlos Fragkogiannis, Cristina Kadar |
| 16:10 - 16:15 | Risks and Limits of Automatic Consolidation of Statutes Max Prior, Adrian Hof, Niklas Wais, Matthias Grabmair |
| 16:15 - 16:20 | Unlocking Legal Knowledge: A Multilingual Dataset for Judicial Summarization in Switzerland Luca Rolshoven, Vishvaksenan Rasiah, Srinanda Brügger Bose, Sarah Hostettler, Lara Burkhalter, Matthias Stürmer, Joel Niklaus |
| 16:20 - 16:25 | Extract-Explain-Abstract: A Rhetorical Role-Driven Domain-Specific Summarisation Framework for Indian Legal Documents Veer Chheda, Aaditya Uday Ghaisas, Avantika Sankhe, Dr. Narendra Shekokar |
| 16:25 - 16:45 | Joint Q&A |
| 16:45 - 16:50 | Copyright Infringement by Large Language Models in the EU: Misalignment, Safeguards, and the Path Forward Noah Scharrenberg, Chang Sun |
| 16:50 - 16:55 | Machine Unlearning of Personally Identifiable Information in Large Language Models Dan Parii, Thomas van Osch, Chang Sun |
| 16:55 - 17:00 | Evaluating LLM-Generated Legal Explanations for Regulatory Compliance in Social Media Influencer Marketing Haoyang Gui, Thales Bertaglia, Taylor Annabell, Catalina Goanta, Tjomme Dooper, Gerasimos Spanakis |
| 17:00 - 17:05 | Nine Ways to Break Copyright Law and Why Our LLM Won’t: A Fair Use Aligned Generation Framework Aakash Sen Sharma, Debdeep Sanyal, Priyansh Srivastava, Sundar Athreya H, Shirish Karande, Mohan Kankanhalli, Murari Mandal |
| 17:05 - 17:20 | Joint Q&A |
| 17:20 - 17:30 | Best Presentation Award and Closing |
Organizing Committee
- Nikolaos Aletras - University of Sheffield (UK)
- Leslie Barrett - Bloomberg Law (US)
- Ilias Chalkidis - University of Copenhagen (Denmark)
- Catalina Goanta - Utrecht University (The Netherlands)
- Daniel Preotiuc-Pietro - Bloomberg (US)
- Gerasimos (Jerry) Spanakis - Maastricht University (The Netherlands)
Program Committee
- Sallam Abualhaija - University of Luxembourg (Luxembourg)
- Tomaso Agnoloni - Institute of Legal Information Theory and Technologies (Italy)
- Ion Androutsopoulos - Athens University of Economics and Business (Greece)
- Tom Ault - Bloomberg (US)
- Jaap Baaij - Utrecht University (The Netherlands)
- Ilayda Balaban - Utrecht University (The Netherlands)
- Breck Baldwin - Columbia University (US)
- Claire Barale - University of Edinburgh (UK)
- Thales Bertaglia - Maastricht University (The Netherlands)
- Floris Bex - Utrecht University (The Netherlands)
- Luca Cagliero - Politecnico di Torino (Italy)
- Jiahong Chen - University of Sheffield (UK)
- Odysseas Spyridon Chlapanis - Athens University of Economics and Business (Greece)
- Ashish Chouhan - Heidelberg University & SRH Hochschule Heidelberg (Germany)
- Bram Duivenvoorde - Utrecht University (The Netherlands)
- Dominik Dworniczak - University of Zurich (Switzerland)
- Arthur Dyevre - KU Leuven (Belgium)
- Dimitrios Galanis - Athena Research Center (Greece)
- Piyush Ghai - Relativity (US)
- Ivan Habernal - Technical University of Darmstadt (Germany)
- Ben Hagag - Darrow (Israel)
- Nils Holzenberger - Johns Hopkins University (US)
- Abe Hou - Johns Hopkins University (US)
- Abderrahmane Issam - Maastricht University (The Netherlands)
- Constantinos Karouzos - University of Sheffield (UK)
- Aykut Koç - Bilkent University (Turkey)
- Hellen van der Kroef - Maastricht University (The Netherlands)
- Alice Kwak - University of Arizona (US)
- Tong Liang - Dynosaur Tech (US)
- Ruta Liepina - University of Bologna (Italy)
- Chu Luo - Queen's University (Canada)
- Megan Ma - Stanford Law School (US)
- Pawel Maka - Maastricht University (The Netherlands)
- Adam Meyers - New York University (US)
- Jelena Mitrović - University of Passau (Germany) & Institute for AI R&D of Serbia (Serbia)
- Rohan Nanda - Maastricht University (The Netherlands)
- Joel Niklaus - University of Bern (Switzerland)
- Henrik Palmer Olsen - University of Copenhagen (Denmark)
- Panagiota Katsikouli - University of Copenhagen (Denmark)
- Ioannis Panagis - University of Copenhagen (Denmark)
- Anu Pradhan - Bloomberg (US)
- Paulo Quaresma - University of Evora (Portugal)
- Vageesh Saxena - Maastricht University (The Netherlands)
- Yusuf Can Semerci - Maastricht University (The Netherlands)
- Gil Semo - Darrow (Israel)
- Madhavan Seshadri - Bloomberg (US)
- Samyak Sheth - Maastricht University (The Netherlands)
- Dan Simonson - BlackBoiler LLC (US)
- Christoph Sorge - Universität des Saarlandes (Germany)
- Jerrold Soh - Singapore Management University (Singapore)
- Alexandru Sotropa - VU Amsterdam & eMAG (The Netherlands/Romania)
- Ieva Staliunaite - University of Cambridge (UK)
- T.Y.S.S. Santosh - Amazon (US)
- Dimitrios Tsarapatsanis - University of York (UK)
- Kalpana Tyagi - Maastricht University (The Netherlands)
- Gijs Van Dijck - Maastricht University (The Netherlands)
- Jianqiang Wang - University at Buffalo (US)
- Hannes Westermann - Maastricht University (The Netherlands)
- ShanShan Xu - Technical University of Munich (Germany)
- Huiyin Xue - University of Sheffield (UK)
- Marcos Zampieri - Rochester Institute of Technology (US)
- Frederike Zufall - Max Planck Institute for Research on Collective Goods (Germany)
- Miri Zilka - University of Cambridge (UK)

