Score: 2

TUC-PPO: Team Utility-Constrained Proximal Policy Optimization for Spatial Public Goods Games

Published: July 3, 2025 | arXiv ID: 2507.02675v1

By: Zhaoqilin Yang , Xin Wang , Ruichen Zhang and more

Potential Business Impact:

Teaches robots to work together for better results.

We introduce Team Utility-Constrained Proximal Policy Optimization (TUC-PPO), a new deep reinforcement learning framework. It extends Proximal Policy Optimization (PPO) by integrating team welfare objectives specifically for spatial public goods games. Unlike conventional approaches where cooperation emerges indirectly from individual rewards, TUC-PPO instead optimizes a bi-level objective integrating policy gradients and team utility constraints. Consequently, all policy updates explicitly incorporate collective payoff thresholds. The framework preserves PPO's policy gradient core while incorporating constrained optimization through adaptive Lagrangian multipliers. Therefore, decentralized agents dynamically balance selfish and cooperative incentives. The comparative analysis demonstrates superior performance of this constrained deep reinforcement learning approach compared to unmodified PPO and evolutionary game theory baselines. It achieves faster convergence to cooperative equilibria and greater stability against invasion by defectors. The framework formally integrates team objectives into policy updates. This work advances multi-agent deep reinforcement learning for social dilemmas while providing new computational tools for evolutionary game theory research.

PPO-ACT: Proximal Policy Optimization with Adversarial Curriculum Transfer for Spatial Public Goods Games

CS and Game Theory

Teaches computers to work together better.

7 May 2025 1

87%

GRPO-GCC: Enhancing Cooperation in Spatial Public Goods Games via Group Relative Policy Optimization with Global Cooperation Constraint

Multiagent Systems

Teaches computers to work together better.

7 Oct 2025 1

87%

Decentralized Distributed Proximal Policy Optimization (DD-PPO) for High Performance Computing Scheduling on Multi-User Systems

Distributed, Parallel, and Cluster Computing

Makes supercomputers run jobs faster and better.

6 May 2025 0

View PDF Login to Bookmark

Country of Origin

🇸🇬 🇨🇳 China, Singapore

Repos / Data Links

github.com

Page Count

33 pages

TUC-PPO: Team Utility-Constrained Proximal Policy Optimization for Spatial Public Goods Games

Teaches robots to work together for better results.

Technical Abstract

PPO-ACT: Proximal Policy Optimization with Adversarial Curriculum Transfer for Spatial Public Goods Games

GRPO-GCC: Enhancing Cooperation in Spatial Public Goods Games via Group Relative Policy Optimization with Global Cooperation Constraint

Decentralized Distributed Proximal Policy Optimization (DD-PPO) for High Performance Computing Scheduling on Multi-User Systems