
Generalizable and Efficient Automated Scoring with a Knowledge-Distilled Multi-Task Mixture-of-Experts

Published: November 18, 2025 | arXiv ID: 2511.17601v1

By: Luyang Fang, Tao Wang, Ping Ma and more

Potential Business Impact:

One AI grades many writing tasks efficiently.

Business Areas:

MOOC Education, Software

Automated scoring of written constructed responses typically relies on separate models per task, straining computational resources, storage, and maintenance in real-world education settings. We propose UniMoE-Guided, a knowledge-distilled multi-task Mixture-of-Experts (MoE) approach that transfers expertise from multiple task-specific large models (teachers) into a single compact, deployable model (student). The student combines (i) a shared encoder for cross-task representations, (ii) a gated MoE block that balances shared and task-specific processing, and (iii) lightweight task heads. Trained with both ground-truth labels and teacher guidance, the student matches strong task-specific models while being far more efficient to train, store, and deploy. Beyond efficiency, the MoE layer improves transfer and generalization: experts develop reusable skills that boost cross-task performance and enable rapid adaptation to new tasks with minimal additions and tuning. On nine NGSS-aligned science-reasoning tasks (seven for training/evaluation and two held out for adaptation), UniMoE-Guided attains performance comparable to per-task models while using $\sim$6$\times$ less storage than maintaining separate students, and $87\times$ less than the 20B-parameter teacher. The method offers a practical path toward scalable, reliable, and resource-efficient automated scoring for classroom and large-scale assessment systems.
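The abstract names a gated MoE block and training on both ground-truth labels and teacher guidance, but gives no formulas. The sketch below is one common way such pieces are written, not the authors' implementation: it assumes a dense softmax gate over expert outputs and a standard temperature-scaled distillation loss (cross-entropy on the label plus KL divergence to the teacher). The names `moe_mix`, `distillation_loss`, `alpha`, and `temperature` are hypothetical and do not come from the paper.

```python
import math


def log_softmax(logits, temperature=1.0):
    """Numerically stable log-softmax with an optional temperature."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    lse = m + math.log(sum(math.exp(x - m) for x in scaled))
    return [x - lse for x in scaled]


def moe_mix(gate_logits, expert_outputs):
    """Dense gated mixture (assumed form): weight each expert's output
    vector by its softmax gate probability and sum the results."""
    weights = [math.exp(v) for v in log_softmax(gate_logits)]
    dim = len(expert_outputs[0])
    return [
        sum(w * out[i] for w, out in zip(weights, expert_outputs))
        for i in range(dim)
    ]


def distillation_loss(student_logits, teacher_logits, label,
                      alpha=0.5, temperature=2.0):
    """Assumed objective: alpha * CE(label) + (1 - alpha) * T^2 * KL(teacher || student).

    alpha and temperature are illustrative hyperparameters, not values
    reported in the paper.
    """
    # Hard-label term: cross-entropy against the ground-truth class.
    hard = -log_softmax(student_logits)[label]
    # Soft-label term: KL divergence between temperature-softened distributions.
    log_t = log_softmax(teacher_logits, temperature)
    log_s = log_softmax(student_logits, temperature)
    soft = sum(math.exp(lt) * (lt - ls) for lt, ls in zip(log_t, log_s))
    return alpha * hard + (1 - alpha) * temperature ** 2 * soft


if __name__ == "__main__":
    # Two experts with equal gate logits: the mixture is their average.
    print(moe_mix([0.0, 0.0], [[1.0, 2.0], [3.0, 4.0]]))
    # Student disagrees with the teacher, so both loss terms contribute.
    print(distillation_loss([0.2, 1.5, -0.3], [0.0, 3.0, -1.0], label=1))
```

With `alpha=1.0` the loss reduces to ordinary supervised cross-entropy; with `alpha=0.0` the student learns only from the teacher's softened predictions. A per-task scoring head would supply `student_logits`, and the matching task-specific teacher would supply `teacher_logits`.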

MoSE: Skill-by-Skill Mixture-of-Experts Learning for Embodied Autonomous Machines

Artificial Intelligence

Robots learn tasks faster, like humans do.

10 Jul 2025

91%

A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications

Machine Learning (CS)

Makes smart computer programs use less power.

10 Mar 2025

91%

MoMoE: A Mixture of Expert Agent Model for Financial Sentiment Analysis

Computational Engineering, Finance, and Science

Makes AI smarter by letting many AI parts work together.

17 Nov 2025


Country of Origin

🇺🇸 United States

Repos / Data Links

github.com

Page Count

11 pages
