Mining Legal Arguments to Study Judicial Formalism
By: Tomáš Koref , Lena Held , Mahammad Namazov and more
Potential Business Impact:
Helps computers understand why judges make decisions.
Courts must justify their decisions, but systematically analyzing judicial reasoning at scale remains difficult. This study refutes claims about formalistic judging in Central and Eastern Europe (CEE) by developing automated methods to detect and classify judicial reasoning in Czech Supreme Courts' decisions using state-of-the-art natural language processing methods. We create the MADON dataset of 272 decisions from two Czech Supreme Courts with expert annotations of 9,183 paragraphs with eight argument types and holistic formalism labels for supervised training and evaluation. Using a corpus of 300k Czech court decisions, we adapt transformer LLMs for Czech legal domain by continued pretraining and experiment with methods to address dataset imbalance including asymmetric loss and class weighting. The best models successfully detect argumentative paragraphs (82.6\% macro-F1), classify traditional types of legal argument (77.5\% macro-F1), and classify decisions as formalistic/non-formalistic (83.2\% macro-F1). Our three-stage pipeline combining ModernBERT, Llama 3.1, and traditional feature-based machine learning achieves promising results for decision classification while reducing computational costs and increasing explainability. Empirically, we challenge prevailing narratives about CEE formalism. This work shows that legal argument mining enables reliable judicial philosophy classification and shows the potential of legal argument mining for other important tasks in computational legal studies. Our methodology is easily replicable across jurisdictions, and our entire pipeline, datasets, guidelines, models, and source codes are available at https://github.com/trusthlt/madon.
Similar Papers
Towards Trustworthy Legal AI through LLM Agents and Formal Reasoning
Artificial Intelligence
Makes AI judge cases fairly and explain why.
What Are the Facts? Automated Extraction of Court-Established Facts from Criminal-Court Opinions
Computation and Language
Helps computers understand crime details from court papers.
From Legal Texts to Defeasible Deontic Logic via LLMs: A Study in Automated Semantic Analysis
Computation and Language
Helps computers understand and organize laws.