Program

Workshop schedule

All times are in CST time zone

09:00 - 10:40Session 1
09:00 - 09:20Workshop opening
09:20 - 09:25LeMAJ (Legal LLM-as-a-Judge): Bridging Legal Reasoning and LLM Evaluation
Joseph Enguehard, Morgane Van Ermengem, Kate Atkinson, Sujeong Cha, Arijit Ghosh Chowdhury, Prashanth Kallur Ramaswamy, Jeremy Roghair, Hannah R Marlowe, Carina Suzana Negreanu, Kitty Boxall, Diana Mincu
09:25 - 09:30PILOT-Bench: A Benchmark for Legal Reasoning in the Patent Domain with IRAC-Aligned Classification Tasks
Yehoon Jang, Chaewon Lee, Hyun-seok Min, Sungchul Choi
09:30 - 09:35Contemporary LLMs struggle with extracting formal legal arguments
Lena Held, Ivan Habernal
09:35 - 09:40Evaluating the Role of Verifiers in Test-Time Scaling for Legal Reasoning Tasks
Davide Romano, Jonathan Richard Schwarz, Daniele Giofrè
09:40 - 09:45GreekBarBench: A Challenging Benchmark for Free-Text Legal Reasoning and Citations
Odysseas S. Chlapanis, Dimitris Galanis, Nikolaos Aletras, Ion Androutsopoulos
09:45 - 09:50Evaluating Test-Time Scaling LLMs for Legal Reasoning: OpenAI o1, DeepSeek-R1, and Beyond
Yinghao Hu, Yaoyao Yu, Leilei Gan, Bin Wei, Kun Kuang, Fei Wu
09:50 - 09:55Are LLMs Court-Ready? Evaluating Frontier Models on Indian Legal Reasoning
Kush Juvekar, Arghya Bhattacharya, Sai Khadloya, Utkarsh Saxena
09:55 - 10:15Joint Q&A
10:15 - 10:20The Automated but Risky Game: Modeling Agent-to-Agent Negotiations and Transactions in Consumer Markets
Shenzhe Zhu, Jiao Sun, Yi Nian, Tobin South, Alex Pentland, Jiaxin Pei
10:20 - 10:25LegalSim: Multi-Agent Simulation of Legal Systems for Discovering Procedural Exploits
Sanket Badhe
10:25 - 10:30Label-Free Distinctiveness: Building a Continuous Trademark Scale via Synthetic Anchors
Huihui Xu, Kevin D. Ashley
10:30 - 10:40Joint Q&A
10:40 - 11:10Break
11:10 - 12:30Session 2
11:10 - 11:15 Validate Your Authority: Benchmarking LLMs on Multi-Label Precedent Treatment Classification
M. Mikail Demir, M Abdullah Canbaz
11:15 - 11:20 ContractEval: Benchmarking LLMs for Clause-Level Legal Risk Identification in Commercial Contracts
Shuang Liu, Zelong Li, Ruoyun Ma, Haiyan Zhao, Mengnan Du
11:20 - 11:25 Efficient Prompt Optimisation for Legal Text Classification with Proxy Prompt Evaluator
Hyunji Lee, Kevin Chenhao Li, Matthias Grabmair, Shanshan Xu
11:25 - 11:30 Not ready for the bench: LLM legal interpretation is unstable and uncalibrated to human judgments
Abhishek Purushothama, Junghyun Min, Brandon Waldon, Nathan Schneider
11:30 - 11:45Joint Q&A
11:45 - 11:50 GReX: A Graph Neural Network-Based Rerank-then-Expand Method for Detecting Conflicts Among Legal Articles in Korean Criminal Law
Seonho An, Young-Yik Rhim, Min-Soo Kim
11:50 - 11:55 Tracing Definitions: Lessons from Alliance Contracts in the Biopharmaceutical Industry
Maximilian Kreutner, Doerte Leusmann, Florian Lemmerich, Carolin Haeussler
11:55 - 12:00 CourtNav: Voice-Guided, Anchor-Accurate Navigation of Long Legal Documents in Courtrooms
Sai Khadloya, Kush Juvekar, Arghya Bhattacharya, Utkarsh Saxena
11:55 - 12:00 NyayGraph: A Knowledge Graph Enhanced Approach for Legal Statute Identification in Indian Law using Large Language Models
Siddharth Shukla, Tanuj Tyagi, Abhay Singh Bisht, Ashish Sharma, Basant Agarwal
11:55 - 12:00 Beyond the Haystack: Sensitivity to Context in Legal Reference Recall
Eric Xia, Karthik Srikumar, Keshav Karthik, Advaith Renjith, Ashwinee Panda
12:10 - 12:30Joint Q&A
12:30 - 14:00Lunch & In-Person Poster Session (Lunch provided)
14:00 - 15:30Session 3
14:00 - 14:05 A Framework to Retrieve Relevant Laws for Will Execution
Md Asiful Islam, Alice Saebom Kwak, Derek Bambauer, Clayton T Morrison, Mihai Surdeanu
14:05 - 14:10 Grounded Answers from Multi-Passage Regulations: Learning-to-Rank for Regulatory RAG
Tuba Gokhan, Ted Briscoe
14:10 - 14:15 GuRE:Generative Query REwriter for Legal Passage Retrieval
Daehui Kim, Deokhyung Kang, Jonghwi Kim, Sangwon Ryu, Gary Lee
14:15 - 14:20 Towards Reliable Retrieval in RAG Systems for Large Legal Datasets
Markus Reuter, Tobias Lingenberg, Ruta Liepina, Francesca Lagioia, Marco Lippi, Giovanni Sartor, Andrea Passerini, Burcu Sayin
14:20 - 14:25 KoLEG: On-the-Fly Korean Legal Knowledge Editing with Continuous Retrieval
Jaehyung Seo, Dahyun Jung, Jaewook Lee, Yongchan Chun, Dongjun Kim, Hwijung Ryu, Donghoon Shin, Heuiseok Lim
14:25 - 14:45Joint Q&A
14:45 - 14:50 Labor Lex: A New Portuguese Corpus and Pipeline for Information Extraction in Brazilian Legal Texts
Pedro Vitor Quinta de Castro, Nadia Felix Felipe da Silva
14:50 - 14:55 Aligning LLMs for Thai Legal Question Answering with Efficient Semantic-Similarity Rewards
Pawitsapak Akarajaradwong, Chompakorn Chaksangchaichot, Pirat Pothavorn, Ekapol Chuangsuwanich, Attapol Rutherford, Sarana Nutanong
14:55 - 15:00 ClaimGen-CN: A Large-scale Chinese Dataset for Legal Claim Generation
Siying Zhou, Yiquan Wu, Hui Chen, Xueyu Hu, Kun Kuang, Adam Jatowt, Chunyan Zheng, Fei Wu
15:00 - 15:05 Translating Tax Law to Code with LLMs: A Benchmark and Evaluation Framework
Gabriele Lorenzo, Aldo Pietromatera, Nils Holzenberger
15:05 - 15:10 Linking Transparency and Accountability: Analysing The Connection Between TikTok's Terms of Service and Moderation Decisions
Leonard Eßer, Gerasimos Spanakis
15:10 - 15:30Joint Q&A
15:30 - 16:00Break
16:00 - 17:30Session 4
16:00 - 16:05Modeling Motivated Reasoning in Law: Evaluating Strategic Role Conditioning in LLM Summarization
Eunjung Cho, Alexander Miserlis Hoyle, Yoan Hermstrüwer
16:05 - 16:10Domain Adapted Text Summarization with Self-Generated Guidelines
Andrianos Michail, Bartosz Rudnikowicz, Pavlos Fragkogiannis, Cristina Kadar
16:10 - 16:15Risks and Limits of Automatic Consolidation of Statutes
Max Prior, Adrian Hof, Niklas Wais, Matthias Grabmair
16:15 - 16:20Unlocking Legal Knowledge: A Multilingual Dataset for Judicial Summarization in Switzerland
Luca Rolshoven, Vishvaksenan Rasiah, Srinanda Brügger Bose, Sarah Hostettler, Lara Burkhalter, Matthias Stürmer, Joel Niklaus
16:20 - 16:25Extract-Explain-Abstract: A Rhetorical Role-Driven Domain-Specific Summarisation Framework for Indian Legal Documents
Veer Chheda, Aaditya Uday Ghaisas, Avantika Sankhe, Dr. Narendra Shekokar
16:25 - 16:45Joint Q&A
16:45 - 16:50Copyright Infringement by Large Language Models in the EU: Misalignment, Safeguards, and the Path Forward
Noah Scharrenberg, Chang Sun
16:50 - 16:55Machine Unlearning of Personally Identifiable Information in Large Language Models
Dan Parii, Thomas van Osch, Chang Sun
16:55 - 17:00Evaluating LLM-Generated Legal Explanations for Regulatory Compliance in Social Media Influencer Marketing
Haoyang Gui, Thales Bertaglia, Taylor Annabell, Catalina Goanta, Tjomme Dooper, Gerasimos Spanakis
17:00 - 17:05Nine Ways to Break Copyright Law and Why Our LLM Won’t: A Fair Use Aligned Generation Framework
Aakash Sen Sharma, Debdeep Sanyal, Priyansh Srivastava, Sundar Athreya H, Shirish Karande, Mohan Kankanhalli, Murari Mandal
17:05 - 17:20Joint Q&A
17:20 - 17:30Best Presentation Award and Closing