Personas within Parameters: Fine-Tuning Small Language Models with Low-Rank Adapters to Mimic User Behaviors
By: Himanshu Thakur, Eshani Agrawal, Smruthi Mukund
Potential Business Impact:
Helps apps learn what you like faster.
A long-standing challenge in developing accurate recommendation models is simulating user behavior, mainly due to the complex and stochastic nature of user interactions. Towards this, one promising line of work has been the use of Large Language Models (LLMs) for simulating user behavior. However, aligning these general-purpose large pre-trained models with user preferences necessitates: (i) effectively and continously parsing large-scale tabular user-item interaction data, (ii) overcoming pre-training-induced inductive biases to accurately learn user specific knowledge, and (iii) achieving the former two at scale for millions of users. While most previous works have focused on complex methods to prompt an LLM or fine-tune it on tabular interaction datasets, our approach shifts the focus to extracting robust textual user representations using a frozen LLM and simulating cost-effective, resource-efficient user agents powered by fine-tuned Small Language Models (SLMs). Further, we showcase a method for training multiple low-rank adapters for groups of users or \textit{persona}, striking an optimal balance between scalability and performance of user behavior agents. Our experiments provide compelling empirical evidence of the efficacy of our methods, demonstrating that user agents developed using our approach have the potential to bridge the gap between offline metrics and real-world performance of recommender systems.
Similar Papers
Customer-R1: Personalized Simulation of Human Behaviors via RL-based LLM Agent in Online Shopping
Computation and Language
Helps online stores act like you.
Less but Better: Parameter-Efficient Fine-Tuning of Large Language Models for Personality Detection
Computation and Language
Helps computers guess your personality faster.
Towards Explainable Temporal User Profiling with LLMs
Information Retrieval
Explains why you get certain movie suggestions.