Score: 2

YOLO-Master: MOE-Accelerated with Specialized Transformers for Enhanced Real-time Detection

Published: December 29, 2025 | arXiv ID: 2512.23273v1

By: Xu Lin , Jinlong Peng , Zhenye Gan and more

BigTech Affiliations: Tencent

Potential Business Impact:

Lets computers see objects better by thinking smarter.

Business Areas:

Image Recognition Data and Analytics, Software

Existing Real-Time Object Detection (RTOD) methods commonly adopt YOLO-like architectures for their favorable trade-off between accuracy and speed. However, these models rely on static dense computation that applies uniform processing to all inputs, misallocating representational capacity and computational resources such as over-allocating on trivial scenes while under-serving complex ones. This mismatch results in both computational redundancy and suboptimal detection performance. To overcome this limitation, we propose YOLO-Master, a novel YOLO-like framework that introduces instance-conditional adaptive computation for RTOD. This is achieved through a Efficient Sparse Mixture-of-Experts (ES-MoE) block that dynamically allocates computational resources to each input according to its scene complexity. At its core, a lightweight dynamic routing network guides expert specialization during training through a diversity enhancing objective, encouraging complementary expertise among experts. Additionally, the routing network adaptively learns to activate only the most relevant experts, thereby improving detection performance while minimizing computational overhead during inference. Comprehensive experiments on five large-scale benchmarks demonstrate the superiority of YOLO-Master. On MS COCO, our model achieves 42.4% AP with 1.62ms latency, outperforming YOLOv13-N by +0.8% mAP and 17.8% faster inference. Notably, the gains are most pronounced on challenging dense scenes, while the model preserves efficiency on typical inputs and maintains real-time inference speed. Code will be available.

YOLO-ROC: A High-Precision and Ultra-Lightweight Model for Real-Time Road Damage Detection

CV and Pattern Recognition

Finds road holes and cracks faster and smaller.

31 Jul 2025 0

90%

YOLO Meets Mixture-of-Experts: Adaptive Expert Routing for Robust Object Detection

CV and Pattern Recognition

Makes computer vision models smarter at finding objects.

17 Nov 2025 0

90%

YOLO Meets Mixture-of-Experts: Adaptive Expert Routing for Robust Object Detection

CV and Pattern Recognition

Makes computer vision models smarter at spotting things.

17 Nov 2025 0

View PDF Login to Bookmark

Country of Origin

🇨🇳 🇸🇬 Singapore, China

Page Count

11 pages

YOLO-Master: MOE-Accelerated with Specialized Transformers for Enhanced Real-time Detection

Lets computers see objects better by thinking smarter.

Technical Abstract

YOLO-ROC: A High-Precision and Ultra-Lightweight Model for Real-Time Road Damage Detection

YOLO Meets Mixture-of-Experts: Adaptive Expert Routing for Robust Object Detection

YOLO Meets Mixture-of-Experts: Adaptive Expert Routing for Robust Object Detection