MTL-KD: Multi-Task Learning Via Knowledge Distillation for Generalizable Neural Vehicle Routing Solver
By: Yuepeng Zheng, Fu Luo, Zhenkun Wang, and more
Potential Business Impact:
Helps delivery trucks find the best routes faster.
Multi-Task Learning (MTL) in Neural Combinatorial Optimization (NCO) is a promising approach to train a unified model capable of solving multiple Vehicle Routing Problem (VRP) variants. However, existing Reinforcement Learning (RL)-based multi-task methods can only train light decoder models on small-scale problems, exhibiting limited generalization ability when solving large-scale problems. To overcome this limitation, this work introduces a novel multi-task learning method driven by knowledge distillation (MTL-KD), which enables the efficient training of heavy decoder models with strong generalization ability. The proposed MTL-KD method transfers policy knowledge from multiple distinct RL-based single-task models to a single heavy decoder model, facilitating label-free training and effectively improving the model's generalization ability across diverse tasks. In addition, we introduce a flexible inference strategy termed Random Reordering Re-Construction (R3C), which is specifically adapted for diverse VRP tasks and further boosts the performance of the multi-task model. Experimental results on 6 seen and 10 unseen VRP variants with up to 1000 nodes indicate that our proposed method consistently achieves superior performance on both uniform and real-world benchmarks, demonstrating robust generalization abilities.
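To make the core idea concrete, below is a minimal sketch (not the authors' code) of the label-free distillation step the abstract describes: several RL-pretrained single-task teacher policies provide soft next-node targets, and one shared heavy-decoder student is trained to match them across tasks. The `rollout` and `score_trajectory` methods and the per-task batch layout are illustrative assumptions, not the paper's actual interfaces.

```python
# Minimal multi-teacher knowledge-distillation step for a neural VRP solver.
# Assumes each teacher and the student map (instance batch, partial tour)
# to per-node logits; these interfaces are hypothetical placeholders.
import torch
import torch.nn.functional as F

def distillation_step(student, teachers, batches_by_task, optimizer, temperature=1.0):
    """One gradient step: average the KD loss over all VRP task variants."""
    optimizer.zero_grad()
    total_loss = 0.0
    for task, batch in batches_by_task.items():
        teacher = teachers[task]
        with torch.no_grad():
            # Teacher decodes its own tour and yields soft next-node targets,
            # so no optimal-solution labels are needed (label-free training).
            tours, teacher_logits = teacher.rollout(batch)            # hypothetical helper
            soft_targets = F.softmax(teacher_logits / temperature, dim=-1)
        # Student re-scores the teacher's decoding trajectory step by step.
        student_logits = student.score_trajectory(batch, tours, task=task)
        log_probs = F.log_softmax(student_logits / temperature, dim=-1)
        # KL divergence between teacher and student next-node distributions.
        total_loss = total_loss + F.kl_div(
            log_probs, soft_targets, reduction="batchmean") * temperature ** 2
    loss = total_loss / len(batches_by_task)
    loss.backward()
    optimizer.step()
    return loss.item()
```

Because every task contributes a KL term against its own teacher, the single student absorbs the policies of all teachers at once, which is how the paper obtains one multi-task model without RL training of the heavy decoder itself.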
Similar Papers
Improving Generalization of Neural Combinatorial Optimization for Vehicle Routing Problems via Test-Time Projection Learning
Machine Learning (CS)
Makes delivery routes work for huge cities.
RMT-KD: Random Matrix Theoretic Causal Knowledge Distillation
Machine Learning (CS)
Makes big AI models smaller and faster.
KDRL: Post-Training Reasoning LLMs via Unified Knowledge Distillation and Reinforcement Learning
Machine Learning (CS)
Teaches computers to think better and faster.