Score: 0

Task-Aware Morphology Optimization of Planar Manipulators via Reinforcement Learning

Published: November 16, 2025 | arXiv ID: 2511.12650v1

By: Arvind Kumar Mishra, Sohom Chakrabarty

Potential Business Impact:

Teaches robots to build themselves better.

Business Areas:

Robotics Hardware, Science and Engineering, Software

In this work, Yoshikawa's manipulability index is used to investigate reinforcement learning (RL) as a framework for morphology optimization in planar robotic manipulators. A 2R manipulator tracking a circular end-effector path is first examined because this case has a known analytical optimum: equal link lengths and the second joint orthogonal to the first. This serves as a validation step to test whether RL can rediscover the optimum using reward feedback alone, without access to the manipulability expression or the Jacobian. Three RL algorithms (SAC, DDPG, and PPO) are compared with grid search and black-box optimizers, with morphology represented by a single action parameter phi that maps to the link lengths. All methods converge to the analytical solution, showing that numerical recovery of the optimum is possible without supplying analytical structure. Most morphology design tasks have no closed-form solutions, and grid or heuristic search becomes expensive as dimensionality increases. RL is therefore explored as a scalable alternative. The formulation used for the circular path is extended to elliptical and rectangular paths by expanding the action space to the full morphology vector (L1, L2, theta2). In these non-analytical settings, RL continues to converge reliably, whereas grid and black-box methods require far larger evaluation budgets. These results indicate that RL is effective for both recovering known optima and solving morphology optimization problems without analytical solutions.

Model-Based Lookahead Reinforcement Learning for in-hand manipulation

Robotics

Robots learn to better grab and move objects.

10 Oct 2025 0

88%

Collaborative Loco-Manipulation for Pick-and-Place Tasks with Dynamic Reward Curriculum

Robotics

Robots learn to pick up and move things together.

16 Sep 2025 0

88%

RoboMoRe: LLM-based Robot Co-design via Joint Optimization of Morphology and Reward

Robotics

Designs robots that move and look better.

30 May 2025 1

View PDF Login to Bookmark

Country of Origin

🇮🇳 India

Page Count

10 pages

Task-Aware Morphology Optimization of Planar Manipulators via Reinforcement Learning

Teaches robots to build themselves better.

Technical Abstract

Model-Based Lookahead Reinforcement Learning for in-hand manipulation

Collaborative Loco-Manipulation for Pick-and-Place Tasks with Dynamic Reward Curriculum

RoboMoRe: LLM-based Robot Co-design via Joint Optimization of Morphology and Reward