All times are in CST time zone
| 09:00 - 10:40 | Session 1 |
| 09:00 - 09:20 | Workshop opening |
| 09:20 - 09:25 | LeMAJ (Legal LLM-as-a-Judge): Bridging Legal Reasoning and LLM Evaluation Joseph Enguehard, Morgane Van Ermengem, Kate Atkinson, Sujeong Cha, Arijit Ghosh Chowdhury, Prashanth Kallur Ramaswamy, Jeremy Roghair, Hannah R Marlowe, Carina Suzana Negreanu, Kitty Boxall, Diana Mincu |
| 09:25 - 09:30 | PILOT-Bench: A Benchmark for Legal Reasoning in the Patent Domain with IRAC-Aligned Classification Tasks Yehoon Jang, Chaewon Lee, Hyun-seok Min, Sungchul Choi |
| 09:30 - 09:35 | Contemporary LLMs struggle with extracting formal legal arguments Lena Held, Ivan Habernal |
| 09:35 - 09:40 | Evaluating the Role of Verifiers in Test-Time Scaling for Legal Reasoning Tasks Davide Romano, Jonathan Richard Schwarz, Daniele Giofrè |
| 09:40 - 09:45 | GreekBarBench: A Challenging Benchmark for Free-Text Legal Reasoning and Citations Odysseas S. Chlapanis, Dimitris Galanis, Nikolaos Aletras, Ion Androutsopoulos |
| 09:45 - 09:50 | Evaluating Test-Time Scaling LLMs for Legal Reasoning: OpenAI o1, DeepSeek-R1, and Beyond Yinghao Hu, Yaoyao Yu, Leilei Gan, Bin Wei, Kun Kuang, Fei Wu |
| 09:50 - 09:55 | Are LLMs Court-Ready? Evaluating Frontier Models on Indian Legal Reasoning Kush Juvekar, Arghya Bhattacharya, Sai Khadloya, Utkarsh Saxena |
| 09:55 - 10:15 | Joint Q&A |
| 10:15 - 10:20 | The Automated but Risky Game: Modeling Agent-to-Agent Negotiations and Transactions in Consumer Markets Shenzhe Zhu, Jiao Sun, Yi Nian, Tobin South, Alex Pentland, Jiaxin Pei |
| 10:20 - 10:25 | LegalSim: Multi-Agent Simulation of Legal Systems for Discovering Procedural Exploits Sanket Badhe |
| 10:25 - 10:30 | Label-Free Distinctiveness: Building a Continuous Trademark Scale via Synthetic Anchors Huihui Xu, Kevin D. Ashley |
| 10:30 - 10:40 | Joint Q&A |
| 10:40 - 11:10 | Break |
| 11:10 - 12:30 | Session 2 |
| 11:10 - 11:15 |
Validate Your Authority: Benchmarking LLMs on Multi-Label Precedent Treatment Classification M. Mikail Demir, M Abdullah Canbaz |
| 11:15 - 11:20 |
ContractEval: Benchmarking LLMs for Clause-Level Legal Risk Identification in Commercial Contracts
Shuang Liu, Zelong Li, Ruoyun Ma, Haiyan Zhao, Mengnan Du |
| 11:20 - 11:25 |
Efficient Prompt Optimisation for Legal Text Classification with Proxy Prompt Evaluator
Hyunji Lee, Kevin Chenhao Li, Matthias Grabmair, Shanshan Xu |
| 11:25 - 11:30 |
Not ready for the bench: LLM legal interpretation is unstable and uncalibrated to human judgments
Abhishek Purushothama, Junghyun Min, Brandon Waldon, Nathan Schneider |
| 11:30 - 11:45 | Joint Q&A |
| 11:45 - 11:50 |
GReX: A Graph Neural Network-Based Rerank-then-Expand Method for Detecting Conflicts Among Legal Articles in Korean Criminal Law
Seonho An, Young-Yik Rhim, Min-Soo Kim |
| 11:50 - 11:55 |
Tracing Definitions: Lessons from Alliance Contracts in the Biopharmaceutical Industry
Maximilian Kreutner, Doerte Leusmann, Florian Lemmerich, Carolin Haeussler |
| 11:55 - 12:00 |
CourtNav: Voice-Guided, Anchor-Accurate Navigation of Long Legal Documents in Courtrooms
Sai Khadloya, Kush Juvekar, Arghya Bhattacharya, Utkarsh Saxena |
| 11:55 - 12:00 |
NyayGraph: A Knowledge Graph Enhanced Approach for Legal Statute Identification in Indian Law using Large Language Models
Siddharth Shukla, Tanuj Tyagi, Abhay Singh Bisht, Ashish Sharma, Basant Agarwal |
| 11:55 - 12:00 |
Beyond the Haystack: Sensitivity to Context in Legal Reference Recall
Eric Xia, Karthik Srikumar, Keshav Karthik, Advaith Renjith, Ashwinee Panda |
| 12:10 - 12:30 | Joint Q&A |
| 12:30 - 14:00 | Lunch & In-Person Poster Session (Lunch provided) |
| 14:00 - 15:30 | Session 3 |
| 14:00 - 14:05 |
A Framework to Retrieve Relevant Laws for Will Execution
Md Asiful Islam, Alice Saebom Kwak, Derek Bambauer, Clayton T Morrison, Mihai Surdeanu |
| 14:05 - 14:10 |
Grounded Answers from Multi-Passage Regulations: Learning-to-Rank for Regulatory RAG
Tuba Gokhan, Ted Briscoe |
| 14:10 - 14:15 |
GuRE:Generative Query REwriter for Legal Passage Retrieval
Daehui Kim, Deokhyung Kang, Jonghwi Kim, Sangwon Ryu, Gary Lee |
| 14:15 - 14:20 |
Towards Reliable Retrieval in RAG Systems for Large Legal Datasets
Markus Reuter, Tobias Lingenberg, Ruta Liepina, Francesca Lagioia, Marco Lippi, Giovanni Sartor, Andrea Passerini, Burcu Sayin |
| 14:20 - 14:25 |
KoLEG: On-the-Fly Korean Legal Knowledge Editing with Continuous Retrieval
Jaehyung Seo, Dahyun Jung, Jaewook Lee, Yongchan Chun, Dongjun Kim, Hwijung Ryu, Donghoon Shin, Heuiseok Lim |
| 14:25 - 14:45 | Joint Q&A |
| 14:45 - 14:50 |
Labor Lex: A New Portuguese Corpus and Pipeline for Information Extraction in Brazilian Legal Texts
Pedro Vitor Quinta de Castro, Nadia Felix Felipe da Silva |
| 14:50 - 14:55 |
Aligning LLMs for Thai Legal Question Answering with Efficient Semantic-Similarity Rewards
Pawitsapak Akarajaradwong, Chompakorn Chaksangchaichot, Pirat Pothavorn, Ekapol Chuangsuwanich, Attapol Rutherford, Sarana Nutanong |
| 14:55 - 15:00 |
ClaimGen-CN: A Large-scale Chinese Dataset for Legal Claim Generation
Siying Zhou, Yiquan Wu, Hui Chen, Xueyu Hu, Kun Kuang, Adam Jatowt, Chunyan Zheng, Fei Wu |
| 15:00 - 15:05 |
Translating Tax Law to Code with LLMs: A Benchmark and Evaluation Framework
Gabriele Lorenzo, Aldo Pietromatera, Nils Holzenberger |
| 15:05 - 15:10 |
Linking Transparency and Accountability: Analysing The Connection Between TikTok's Terms of Service and Moderation Decisions
Leonard Eßer, Gerasimos Spanakis |
| 15:10 - 15:30 | Joint Q&A |
| 15:30 - 16:00 | Break |
| 16:00 - 17:30 | Session 4 |
| 16:00 - 16:05 | Modeling Motivated Reasoning in Law: Evaluating Strategic Role Conditioning in LLM Summarization Eunjung Cho, Alexander Miserlis Hoyle, Yoan Hermstrüwer |
| 16:05 - 16:10 | Domain Adapted Text Summarization with Self-Generated Guidelines Andrianos Michail, Bartosz Rudnikowicz, Pavlos Fragkogiannis, Cristina Kadar |
| 16:10 - 16:15 | Risks and Limits of Automatic Consolidation of Statutes Max Prior, Adrian Hof, Niklas Wais, Matthias Grabmair |
| 16:15 - 16:20 | Unlocking Legal Knowledge: A Multilingual Dataset for Judicial Summarization in Switzerland Luca Rolshoven, Vishvaksenan Rasiah, Srinanda Brügger Bose, Sarah Hostettler, Lara Burkhalter, Matthias Stürmer, Joel Niklaus |
| 16:20 - 16:25 | Extract-Explain-Abstract: A Rhetorical Role-Driven Domain-Specific Summarisation Framework for Indian Legal Documents Veer Chheda, Aaditya Uday Ghaisas, Avantika Sankhe, Dr. Narendra Shekokar |
| 16:25 - 16:45 | Joint Q&A |
| 16:45 - 16:50 | Copyright Infringement by Large Language Models in the EU: Misalignment, Safeguards, and the Path Forward Noah Scharrenberg, Chang Sun |
| 16:50 - 16:55 | Machine Unlearning of Personally Identifiable Information in Large Language Models Dan Parii, Thomas van Osch, Chang Sun |
| 16:55 - 17:00 | Evaluating LLM-Generated Legal Explanations for Regulatory Compliance in Social Media Influencer Marketing Haoyang Gui, Thales Bertaglia, Taylor Annabell, Catalina Goanta, Tjomme Dooper, Gerasimos Spanakis |
| 17:00 - 17:05 | Nine Ways to Break Copyright Law and Why Our LLM Won’t: A Fair Use Aligned Generation Framework Aakash Sen Sharma, Debdeep Sanyal, Priyansh Srivastava, Sundar Athreya H, Shirish Karande, Mohan Kankanhalli, Murari Mandal |
| 17:05 - 17:20 | Joint Q&A |
| 17:20 - 17:30 | Best Presentation Award and Closing |