A Unified Deep Reinforcement Learning Approach for Close Enough Traveling Salesman Problem

Published: October 3, 2025 | arXiv ID: 2510.03065v1

By: Mingfeng Fan, Jiaqi Cheng, Yaoxin Wu and more

Potential Business Impact:

Helps delivery robots find best routes faster.

Business Areas:

Smart Cities, Real Estate

In recent years, deep reinforcement learning (DRL) has gained traction for solving the NP-hard traveling salesman problem (TSP). However, limited attention has been given to the close-enough TSP (CETSP), primarily due to the challenge introduced by its neighborhood-based visitation criterion, wherein a node is considered visited if the agent enters a compact neighborhood around it. In this work, we formulate a Markov decision process (MDP) for CETSP using a discretization scheme and propose a novel unified dual-decoder DRL (UD3RL) framework that separates decision-making into node selection and waypoint determination. Specifically, an adapted encoder is employed for effective feature extraction, followed by a node-decoder and a loc-decoder to handle the two sub-tasks, respectively. A k-nearest neighbors subgraph interaction strategy is further introduced to enhance spatial reasoning during location decoding. Furthermore, we customize the REINFORCE algorithm to train UD3RL as a unified model capable of generalizing across different problem sizes and varying neighborhood radius types (i.e., constant and random radii). Experimental results show that UD3RL outperforms conventional methods in both solution quality and runtime, while exhibiting strong generalization across problem scales, spatial distributions, and radius ranges, as well as robustness to dynamic environments.
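The two-step decision described in the abstract (choose the next node, then choose a waypoint inside that node's neighborhood) can be illustrated with a small sketch. Everything below is an assumption made for illustration: the abstract does not specify the discretization, so `candidate_waypoints` simply places `k` evenly spaced points on the boundary of a circular neighborhood, and the hypothetical `greedy_cetsp` uses nearest-neighbor rules where UD3RL would use its learned node-decoder and loc-decoder.

```python
import math


def candidate_waypoints(center, radius, k=8):
    """Discretize a circular neighborhood into k boundary points (assumed scheme)."""
    cx, cy = center
    return [
        (cx + radius * math.cos(2 * math.pi * i / k),
         cy + radius * math.sin(2 * math.pi * i / k))
        for i in range(k)
    ]


def greedy_cetsp(depot, nodes, radii, k=8):
    """Greedy stand-in for the two sub-tasks; not the learned policy from the paper.

    Simplification: the agent always moves to a boundary candidate, even if it
    already stands inside the chosen node's neighborhood.
    """
    position = depot
    unvisited = set(range(len(nodes)))
    tour = [depot]
    length = 0.0
    while unvisited:
        # Sub-task 1 (node selection): nearest unvisited node center.
        node = min(unvisited, key=lambda i: math.dist(position, nodes[i]))
        # Sub-task 2 (waypoint determination): closest candidate of that node.
        waypoint = min(candidate_waypoints(nodes[node], radii[node], k),
                       key=lambda p: math.dist(position, p))
        length += math.dist(position, waypoint)
        tour.append(waypoint)
        position = waypoint
        unvisited.remove(node)
    # Close the tour by returning to the depot.
    length += math.dist(position, depot)
    tour.append(depot)
    return tour, length
```

Calling `greedy_cetsp((0.0, 0.0), [(2.0, 0.0), (0.0, 3.0)], [1.0, 1.0], k=4)` returns a closed tour that touches each neighborhood once rather than each node center, which is the visitation criterion that distinguishes CETSP from the standard TSP.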

An End-to-End Deep Reinforcement Learning Approach for Solving the Traveling Salesman Problem with Drones

Machine Learning (CS)

Helps delivery trucks and drones find best routes.

7 Nov 2025

GELD: A Unified Neural Model for Efficiently Solving Traveling Salesman Problems Across Different Scales

Artificial Intelligence

Finds shortest routes for many stops quickly.

7 Jun 2025

Efficient Environment Design for Multi-Robot Navigation via Continuous Control

Robotics

Robots learn to navigate fields faster, safer.

17 Aug 2025

Country of Origin

🇸🇬 🇨🇳 Singapore, China

Page Count

12 pages
