JBE-QA: Japanese Bar Exam QA Dataset for Assessing Legal Domain Knowledge
By: Zhihan Cao, Fumihito Nishino, Hiroaki Yamada, and more
Potential Business Impact:
Tests whether computers understand Japanese law.
We introduce JBE-QA, a Japanese Bar Exam question-answering dataset for evaluating large language models' legal knowledge. Derived from the multiple-choice (tanto-shiki) section of the Japanese bar exam (2015-2024), JBE-QA provides the first comprehensive benchmark for Japanese legal-domain evaluation of LLMs. It covers the Civil Code, the Penal Code, and the Constitution, extending beyond the Civil Code focus of prior Japanese resources. Each question is decomposed into independent true/false judgments with structured contextual fields. The dataset contains 3,464 items with balanced labels. We evaluate 26 LLMs, including proprietary, open-weight, Japanese-specialised, and reasoning models. Our results show that proprietary models with reasoning enabled perform best, and that Constitution questions are generally easier than Civil Code or Penal Code questions.
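Since each exam question is decomposed into independent true/false judgments, evaluation reduces to binary-classification accuracy over the items. A minimal sketch of what that looks like, assuming illustrative field names (`year`, `law`, `statement`, `label`) that are not necessarily the dataset's actual schema:

```python
# Hypothetical JBE-QA-style items: each bar-exam question becomes one or
# more independent true/false judgments. Field names are assumptions for
# illustration, not the published schema.
items = [
    {"year": 2015, "law": "Civil Code",
     "statement": "A contract requires offer and acceptance.", "label": True},
    {"year": 2020, "law": "Constitution",
     "statement": "The Diet consists of a single chamber.", "label": False},
]

def evaluate(predict, items):
    """Accuracy of a true/false predictor over the items."""
    correct = sum(predict(item) == item["label"] for item in items)
    return correct / len(items)

# A trivial baseline that always answers True; with balanced labels,
# such a baseline scores around 50%.
always_true = lambda item: True
print(evaluate(always_true, items))
```

With balanced labels, 50% accuracy is the chance floor, so a model's margin above it directly reflects legal-domain knowledge.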
Similar Papers
MizanQA: Benchmarking Large Language Models on Moroccan Legal Question Answering
Computation and Language
Tests computers on Arabic law questions.
MultiWikiQA: A Reading Comprehension Benchmark in 300+ Languages
Computation and Language
Helps computers understand text in many languages.