Score: 1

RoleRMBench & RoleRM: Towards Reward Modeling for Profile-Based Role Play in Dialogue Systems

Published: December 11, 2025 | arXiv ID: 2512.10575v1

By: Hang Ding, Qiming Feng, Dongqi Liu, and more

BigTech Affiliations: Tencent

Potential Business Impact:

Makes AI chatbots better at staying in character when role-playing personas, enabling more convincing character-driven dialogue products.

Business Areas:
Simulation Software

Reward modeling has become a cornerstone of aligning large language models (LLMs) with human preferences. Yet, when extended to subjective and open-ended domains such as role play, existing reward models exhibit severe degradation, struggling to capture nuanced and persona-grounded human judgments. To address this gap, we introduce RoleRMBench, the first systematic benchmark for reward modeling in role-playing dialogue, covering seven fine-grained capabilities from narrative management to role consistency and engagement. Evaluation on RoleRMBench reveals large and consistent gaps between general-purpose reward models and human judgment, particularly in narrative and stylistic dimensions. We further propose RoleRM, a reward model trained with Continuous Implicit Preferences (CIP), which reformulates subjective evaluation as continuous consistent pairwise supervision under multiple structuring strategies. Comprehensive experiments show that RoleRM surpasses strong open- and closed-source reward models by over 24% on average, demonstrating substantial gains in narrative coherence and stylistic fidelity. Our findings highlight the importance of continuous preference representation and annotation consistency, establishing a foundation for subjective alignment in human-centered dialogue systems.
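The abstract does not spell out how Continuous Implicit Preferences (CIP) is optimized, only that it recasts subjective evaluation as continuous, consistent pairwise supervision. As a rough illustration of the underlying idea, the sketch below shows a generic Bradley-Terry style pairwise reward-model loss in PyTorch; the `RewardHead` module, the 768-dimensional pooled embeddings, and the toy training step are illustrative assumptions, not the paper's implementation.

```python
# Minimal, self-contained sketch of pairwise (Bradley-Terry) reward-model
# training. This is NOT the paper's CIP objective; the head, dimensions, and
# loss below are illustrative assumptions only.
import torch
import torch.nn as nn


class RewardHead(nn.Module):
    """Maps a pooled dialogue-response embedding to a scalar reward."""

    def __init__(self, hidden_dim: int = 768):
        super().__init__()
        self.scorer = nn.Linear(hidden_dim, 1)

    def forward(self, pooled_embedding: torch.Tensor) -> torch.Tensor:
        return self.scorer(pooled_embedding).squeeze(-1)


def pairwise_preference_loss(reward_chosen: torch.Tensor,
                             reward_rejected: torch.Tensor) -> torch.Tensor:
    """Bradley-Terry style loss: pushes the preferred response's reward
    above the dispreferred one's."""
    return -torch.nn.functional.logsigmoid(reward_chosen - reward_rejected).mean()


if __name__ == "__main__":
    torch.manual_seed(0)
    model = RewardHead(hidden_dim=768)
    optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)

    # Stand-ins for encoder outputs of (context + chosen) and (context + rejected).
    chosen_emb = torch.randn(8, 768)
    rejected_emb = torch.randn(8, 768)

    optimizer.zero_grad()
    loss = pairwise_preference_loss(model(chosen_emb), model(rejected_emb))
    loss.backward()
    optimizer.step()
    print(f"pairwise loss: {loss.item():.4f}")
```

In the paper's setting, the two embeddings would presumably come from encoding a role-play dialogue context paired with a preferred and a dispreferred in-character response; CIP's continuous preference representation and annotation-consistency machinery would sit on top of this basic pairwise objective.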

Country of Origin
🇨🇳 China

Page Count
23 pages

Category
Computer Science:
Computation and Language