mucAI at BAREC Shared Task 2025: Towards Uncertainty Aware Arabic Readability Assessment
By: Ahmed Abdou
Potential Business Impact:
Helps grade Arabic text difficulty more accurately.
We present a simple, model-agnostic post-processing technique for fine-grained Arabic readability classification in the BAREC 2025 Shared Task (19 ordinal levels). Our method applies conformal prediction to generate prediction sets with coverage guarantees, then computes weighted averages using softmax-renormalized probabilities over the conformal sets. This uncertainty-aware decoding improves Quadratic Weighted Kappa (QWK) by reducing high-penalty misclassifications to nearer levels. Our approach shows consistent QWK improvements of 1-3 points across different base models. In the strict track, our submission achieves QWK scores of 84.9\%(test) and 85.7\% (blind test) for sentence level, and 73.3\% for document level. For Arabic educational assessment, this enables human reviewers to focus on a handful of plausible levels, combining statistical guarantees with practical usability.
Similar Papers
!MSA at BAREC Shared Task 2025: Ensembling Arabic Transformers for Readability Assessment
Computation and Language
Helps computers understand hard Arabic text better.
QARI-OCR: High-Fidelity Arabic Text Recognition through Multimodal Large Language Model Adaptation
CV and Pattern Recognition
Reads messy Arabic text better than before.
Beyond the Score: Uncertainty-Calibrated LLMs for Automated Essay Assessment
Computation and Language
Helps computers grade essays with confidence.