Score: 1

Latent Geometry of Taste: Scalable Low-Rank Matrix Factorization

Published: January 6, 2026 | arXiv ID: 2601.03466v1

By: Joshua Salako

Potential Business Impact:

Finds movies you'll like, even if new.

Business Areas:
Personalization Commerce and Shopping

Scalability and data sparsity remain critical bottlenecks for collaborative filtering on massive interaction datasets. This work investigates the latent geometry of user preferences using the MovieLens 32M dataset, implementing a high-performance, parallelized Alternating Least Squares (ALS) framework. Through extensive hyperparameter optimization, we demonstrate that constrained low-rank models significantly outperform higher dimensional counterparts in generalization, achieving an optimal balance between Root Mean Square Error (RMSE) and ranking precision. We visualize the learned embedding space to reveal the unsupervised emergence of semantic genre clusters, confirming that the model captures deep structural relationships solely from interaction data. Finally, we validate the system's practical utility in a cold-start scenario, introducing a tunable scoring parameter to manage the trade-off between popularity bias and personalized affinity effectively. The codebase for this research can be found here: https://github.com/joshsalako/recommender.git

Repos / Data Links

Page Count
9 pages

Category
Computer Science:
CV and Pattern Recognition