Training Proactive and Personalized LLM Agents
By: Weiwei Sun, Xuhui Zhou, Weihua Du and more
Potential Business Impact:
AI learns to ask questions and help better.
While existing work focuses primarily on task success, we argue that effective real-world agents require optimizing three dimensions: productivity (task completion), proactivity (asking essential questions), and personalization (adapting to diverse user preferences). We introduce UserVille, an interactive environment with LLM-based user simulators enabling diverse, configurable user preferences. Leveraging UserVille, we introduce PPP, a multi-objective reinforcement learning approach that jointly optimizes all three dimensions: Productivity, Proactivity, and Personalization. Experiments on software engineering and deep research tasks show that agents trained with PPP achieve substantial improvements over strong baselines such as GPT-5 (+21.6 on average), demonstrating the ability to ask strategic clarifying questions, adapt to unseen user preferences, and improve task success through better interaction. This work demonstrates that explicitly optimizing for user-centered interaction is critical for building practical and effective AI agents.
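The abstract does not spell out how PPP combines its three objectives, so the following is only a minimal sketch of one common way to do it: a weighted sum of per-episode signals. The names `EpisodeSignals` and `combined_reward`, the penalty-style scoring, and the weights are all hypothetical illustrations, not the authors' formulation.

```python
from dataclasses import dataclass


@dataclass
class EpisodeSignals:
    """Hypothetical per-episode signals an environment might report."""
    task_success: float          # productivity: 1.0 if the task was solved, else 0.0
    unnecessary_questions: int   # proactivity: questions the user did not need to be asked
    preference_violations: int   # personalization: times a stated user preference was ignored


def combined_reward(s: EpisodeSignals,
                    w_prod: float = 1.0,
                    w_proact: float = 0.5,
                    w_pers: float = 0.5) -> float:
    """Weighted sum of the three objectives (assumed form, not from the paper)."""
    productivity = s.task_success
    proactivity = -float(s.unnecessary_questions)      # penalize low-value questions
    personalization = -float(s.preference_violations)  # penalize ignored preferences
    return w_prod * productivity + w_proact * proactivity + w_pers * personalization


if __name__ == "__main__":
    clean = EpisodeSignals(task_success=1.0, unnecessary_questions=0, preference_violations=0)
    noisy = EpisodeSignals(task_success=1.0, unnecessary_questions=2, preference_violations=1)
    print(combined_reward(clean))
    print(combined_reward(noisy))
```

Under this assumed scoring, two episodes that both solve the task can still receive different rewards depending on how the agent interacted with the user, which is the kind of trade-off a joint objective is meant to expose.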
Similar Papers
Towards Proactive Personalization through Profile Customization for Individual Users in Dialogues
Computation and Language
Teaches computers to learn what you like over time.
ProAgent: Harnessing On-Demand Sensory Contexts for Proactive LLM Agent Systems
Artificial Intelligence
Helps smart glasses help you before you ask.
Enabling Personalized Long-term Interactions in LLM-based Agents through Persistent Memory and User Profiles
Artificial Intelligence
AI remembers you for better conversations.