Score: 2

AnnoCaseLaw: A Richly-Annotated Dataset For Benchmarking Explainable Legal Judgment Prediction

Published: February 28, 2025 | arXiv ID: 2503.00128v1

By: Magnus Sesodia, Alina Petrova, John Armour, and more

Potential Business Impact:

Enables AI systems to predict court decisions from case facts, supporting legal triage and explainable decision-support tools.

Business Areas:
Legal Tech, Professional Services

Legal systems worldwide continue to struggle with overwhelming caseloads, limited judicial resources, and growing complexities in legal proceedings. Artificial intelligence (AI) offers a promising solution, with Legal Judgment Prediction (LJP) -- the practice of predicting a court's decision from the case facts -- emerging as a key research area. However, existing datasets often formulate the task of LJP unrealistically, not reflecting its true difficulty. They also lack high-quality annotation essential for legal reasoning and explainability. To address these shortcomings, we introduce AnnoCaseLaw, a first-of-its-kind dataset of 471 meticulously annotated U.S. Appeals Court negligence cases. Each case is enriched with comprehensive, expert-labeled annotations that highlight key components of judicial decision making, along with relevant legal concepts. Our dataset lays the groundwork for more human-aligned, explainable LJP models. We define three legally relevant tasks: (1) judgment prediction; (2) concept identification; and (3) automated case annotation, and establish a performance baseline using industry-leading large language models (LLMs). Our results demonstrate that LJP remains a formidable task, with application of legal precedent proving particularly difficult. Code and data are available at https://github.com/anonymouspolar1/annocaselaw.
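To make Task 1 (judgment prediction) concrete, the Python sketch below shows what a zero-shot LLM baseline over a dataset like this could look like. The file name (`cases.json`), the `facts` and `judgment` field names, and the binary affirmed/reversed label space are illustrative assumptions, not the repository's confirmed schema; the model name is likewise just a placeholder for whichever LLM is benchmarked.

```python
# Minimal sketch of a zero-shot judgment-prediction baseline.
# Assumes a hypothetical JSON layout for the annotated cases; the
# actual AnnoCaseLaw files and field names may differ.
import json
from openai import OpenAI  # pip install openai

client = OpenAI()  # expects OPENAI_API_KEY in the environment

def predict_judgment(case_facts: str) -> str:
    """Ask an LLM to predict the appellate outcome from the case facts."""
    prompt = (
        "You are given the facts of a U.S. Appeals Court negligence case.\n"
        "Predict the court's judgment. Answer with exactly one word: "
        "'affirmed' or 'reversed'.\n\nFacts:\n" + case_facts
    )
    response = client.chat.completions.create(
        model="gpt-4o",  # placeholder; the paper evaluates several LLMs
        messages=[{"role": "user", "content": prompt}],
        temperature=0,  # deterministic output for benchmarking
    )
    return response.choices[0].message.content.strip().lower()

# Hypothetical usage: cases.json is an assumed filename, with each entry
# holding the case facts and the gold judgment label.
with open("cases.json") as f:
    cases = json.load(f)

correct = sum(predict_judgment(c["facts"]) == c["judgment"] for c in cases)
print(f"Accuracy: {correct / len(cases):.2%}")
```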

Repos / Data Links
https://github.com/anonymouspolar1/annocaselaw

Page Count
15 pages

Category
Computer Science: Computation and Language