Score: 2

COMPEER: Controllable Empathetic Reinforcement Reasoning for Emotional Support Conversation

Published: August 13, 2025 | arXiv ID: 2508.09521v1

By: Yunxiao Wang , Meng Liu , Wenqi Liu and more

BigTech Affiliations: Kuaishou

Potential Business Impact:

Helps computers give better emotional support.

Emotional support conversations are crucial for promoting emotional well-being, yet current models often lack deep empathetic reasoning grounded in psychological principles. To address this, we propose controllable empathetic reasoning, which combines natural language reasoning with structured psychological steps. We construct a fine-grained dataset annotated with reasoning correctness and response preferences to enable this capability. To further enhance training, we employ reinforcement learning with a unified process-outcome reward model that delivers precise feedback. To mitigate response repetitiveness from entropy collapse, we introduce personality-based dialogue rewriting and a redundancy-aware reward reweighting strategy. Our approach significantly improves model's emotional support ability, advancing the development of empathetic, human-like support systems.

EMO-Reasoning: Benchmarking Emotional Reasoning Capabilities in Spoken Dialogue Systems

Computation and Language

Helps computers understand feelings in talking.

25 Aug 2025 1

90%

EMO-Reasoning: Benchmarking Emotional Reasoning Capabilities in Spoken Dialogue Systems

Computation and Language

Makes talking computers understand feelings better.

25 Aug 2025 1

89%

Empathy-R1: A Chain-of-Empathy and Reinforcement Learning Framework for Long-Form Mental Health Support

Computation and Language

Helps AI understand feelings for better mental health help.

18 Sep 2025 0

View PDF Login to Bookmark

Country of Origin

🇨🇳 China

Page Count

9 pages

COMPEER: Controllable Empathetic Reinforcement Reasoning for Emotional Support Conversation

Helps computers give better emotional support.

Technical Abstract

EMO-Reasoning: Benchmarking Emotional Reasoning Capabilities in Spoken Dialogue Systems

EMO-Reasoning: Benchmarking Emotional Reasoning Capabilities in Spoken Dialogue Systems

Empathy-R1: A Chain-of-Empathy and Reinforcement Learning Framework for Long-Form Mental Health Support