Scaling Legal AI: Benchmarking Mamba and Transformers for Statutory Classification and Case Law Retrieval
By: Anuraj Maurya
Potential Business Impact:
Helps computers understand long legal texts faster.
The rapid growth of statutory corpora and judicial decisions requires scalable legal AI systems capable of classification and retrieval over extremely long contexts. Transformer-based architectures (e.g., Longformer, DeBERTa) dominate current legal NLP benchmarks but struggle with quadratic attention costs, limiting efficiency and scalability. In this work, we present the first comprehensive benchmarking of Mamba, a state-space model (SSM) with linear-time selective mechanisms, against leading transformer models for statutory classification and case law retrieval. We evaluate models on open-source legal corpora including LexGLUE, EUR-Lex, and ILDC, covering statutory tagging, judicial outcome prediction, and case retrieval tasks. Metrics include accuracy, recall at k, mean reciprocal rank (MRR), and normalized discounted cumulative gain (nDCG), alongside throughput measured in tokens per second and maximum context length. Results show that Mamba's linear scaling enables processing of legal documents several times longer than transformers, while maintaining or surpassing retrieval and classification performance. This study introduces a new legal NLP benchmark suite for long-context modeling, along with open-source code and datasets to support reproducibility. Our findings highlight trade-offs between state-space models and transformers, providing guidance for deploying scalable legal AI in statutory analysis, judicial decision support, and policy research.
Similar Papers
MLMA: Towards Multilingual with Mamba Based Architectures
Computation and Language
Lets computers understand many languages spoken.
M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models
Machine Learning (CS)
Makes math problem-solving computers faster and smarter.
MLMA: Towards Multilingual ASR With Mamba-based Architectures
Computation and Language
Lets computers understand many languages spoken.