Score: 1

REG4Rec: Reasoning-Enhanced Generative Model for Large-Scale Recommendation Systems

Published: August 21, 2025 | arXiv ID: 2508.15308v1

By: Haibo Xing , Hao Deng , Yucheng Mao and more

BigTech Affiliations: Alibaba

Potential Business Impact:

Suggests better items by thinking through choices.

Business Areas:

Semantic Search Internet Services

Sequential recommendation aims to predict a user's next action in large-scale recommender systems. While traditional methods often suffer from insufficient information interaction, recent generative recommendation models partially address this issue by directly generating item predictions. To better capture user intents, recent studies have introduced a reasoning process into generative recommendation, significantly improving recommendation performance. However, these approaches are constrained by the singularity of item semantic representations, facing challenges such as limited diversity in reasoning pathways and insufficient reliability in the reasoning process. To tackle these issues, we introduce REG4Rec, a reasoning-enhanced generative model that constructs multiple dynamic semantic reasoning paths alongside a self-reflection process, ensuring high-confidence recommendations. Specifically, REG4Rec utilizes an MoE-based parallel quantization codebook (MPQ) to generate multiple unordered semantic tokens for each item, thereby constructing a larger-scale diverse reasoning space. Furthermore, to enhance the reliability of reasoning, we propose a training reasoning enhancement stage, which includes Preference Alignment for Reasoning (PARS) and a Multi-Step Reward Augmentation (MSRA) strategy. PARS uses reward functions tailored for recommendation to enhance reasoning and reflection, while MSRA introduces future multi-step actions to improve overall generalization. During inference, Consistency-Oriented Self-Reflection for Pruning (CORP) is proposed to discard inconsistent reasoning paths, preventing the propagation of erroneous reasoning. Lastly, we develop an efficient offline training strategy for large-scale recommendation. Experiments on real-world datasets and online evaluations show that REG4Rec delivers outstanding performance and substantial practical value.

REG4Rec: Reasoning-Enhanced Generative Model for Large-Scale Recommendation Systems

Information Retrieval

Suggests better things you might like.

21 Aug 2025 1

90%

MR.Rec: Synergizing Memory and Reasoning for Personalized Recommendation Assistant with LLMs

Information Retrieval

Helps websites guess what you want to buy.

16 Oct 2025 1

90%

OneRec-Think: In-Text Reasoning for Generative Recommendation

Information Retrieval

Helps apps understand you better to keep you engaged.

13 Oct 2025 3

View PDF Login to Bookmark

Country of Origin

🇨🇳 China

Page Count

11 pages

REG4Rec: Reasoning-Enhanced Generative Model for Large-Scale Recommendation Systems

Suggests better items by thinking through choices.

Technical Abstract

REG4Rec: Reasoning-Enhanced Generative Model for Large-Scale Recommendation Systems

MR.Rec: Synergizing Memory and Reasoning for Personalized Recommendation Assistant with LLMs

OneRec-Think: In-Text Reasoning for Generative Recommendation