Score: 1

FEANEL: A Benchmark for Fine-Grained Error Analysis in K-12 English Writing

Published: November 28, 2025 | arXiv ID: 2511.22883v1

By: Jingheng Ye, Shen Wang, Jiaqi Chen, and more

Potential Business Impact:

Helps AI grade student writing more accurately.

Business Areas:
Natural Language Processing, Artificial Intelligence, Data and Analytics, Software

Large Language Models (LLMs) have transformed artificial intelligence, offering profound opportunities for educational applications. However, their ability to provide fine-grained educational feedback on K-12 English writing remains underexplored. In this paper, we challenge the error analysis and pedagogical skills of LLMs by introducing the task of fine-grained error analysis for English learners and presenting the Fine-grained Error ANalysis for English Learners (FEANEL) Benchmark. The benchmark comprises 1,000 essays written by elementary and secondary school students, together with a well-developed English writing error taxonomy. Each error is annotated by language education experts with its type, severity, and explanatory feedback, following a part-of-speech-based taxonomy the experts co-developed. We evaluate state-of-the-art LLMs on the FEANEL Benchmark to probe their error analysis and pedagogical abilities. Experimental results reveal significant gaps in current LLMs' ability to perform fine-grained error analysis, highlighting the need for methods specifically designed for educational applications.
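
To make the described annotation scheme concrete, the sketch below shows one plausible way a single FEANEL-style error record could be represented in code. The field names, severity labels, and example values are illustrative assumptions, not the paper's actual data schema.

    from dataclasses import dataclass

    @dataclass
    class ErrorAnnotation:
        # Hypothetical record for one expert-annotated error; field names are
        # illustrative and not taken from the FEANEL release.
        span: str        # the erroneous text in the student essay
        error_type: str  # category from the part-of-speech-based taxonomy
        severity: str    # expert-judged severity, e.g. "minor" or "major"
        feedback: str    # explanatory feedback written by the annotator

    # Example usage with made-up values:
    example = ErrorAnnotation(
        span="He go to school yesterday",
        error_type="verb: tense",
        severity="major",
        feedback="Use the past tense 'went' because the action happened yesterday.",
    )

In an evaluation like the one the paper describes, an LLM's output could be parsed into records of this shape and compared field by field against the expert annotations.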

Country of Origin
🇨🇳 China

Page Count
19 pages

Category
Computer Science:
Computation and Language