Score: 0

Bridging Collaborative Filtering and Large Language Models with Dynamic Alignment, Multimodal Fusion and Evidence-grounded Explanations

Published: October 2, 2025 | arXiv ID: 2510.01606v1

By: Bo Ma , LuYao Liu , Simon Lau and more

Potential Business Impact:

Shows you things you'll like, even if they change.

Business Areas:

Natural Language Processing Artificial Intelligence, Data and Analytics, Software

Recent research has explored using Large Language Models for recommendation tasks by transforming user interaction histories and item metadata into text prompts, then having the LLM produce rankings or recommendations. A promising approach involves connecting collaborative filtering knowledge to LLM representations through compact adapter networks, which avoids expensive fine-tuning while preserving the strengths of both components. Yet several challenges persist in practice: collaborative filtering models often use static snapshots that miss rapidly changing user preferences; many real-world items contain rich visual and audio content beyond textual descriptions; and current systems struggle to provide trustworthy explanations backed by concrete evidence. Our work introduces \model{}, a framework that tackles these limitations through three key innovations. We develop an online adaptation mechanism that continuously incorporates new user interactions through lightweight modules, avoiding the need to retrain large models. We create a unified representation that seamlessly combines collaborative signals with visual and audio features, handling cases where some modalities may be unavailable. Finally, we design an explanation system that grounds recommendations in specific collaborative patterns and item attributes, producing natural language rationales users can verify. Our approach maintains the efficiency of frozen base models while adding minimal computational overhead, making it practical for real-world deployment.

MLLMRec: Exploring the Potential of Multimodal Large Language Models in Recommender Systems

Information Retrieval

Suggests better movies and products you'll like.

21 Aug 2025 2

90%

End-to-End Personalization: Unifying Recommender Systems with Large Language Models

Information Retrieval

Suggests movies you'll love, explains why.

2 Aug 2025 0

90%

Describe What You See with Multimodal Large Language Models to Enhance Video Recommendations

Information Retrieval

Helps video apps understand what you *really* like.

13 Aug 2025 2

View PDF Login to Bookmark

Page Count

6 pages

Bridging Collaborative Filtering and Large Language Models with Dynamic Alignment, Multimodal Fusion and Evidence-grounded Explanations

Shows you things you'll like, even if they change.

Technical Abstract

MLLMRec: Exploring the Potential of Multimodal Large Language Models in Recommender Systems

End-to-End Personalization: Unifying Recommender Systems with Large Language Models

Describe What You See with Multimodal Large Language Models to Enhance Video Recommendations