Score: 1

The Layout Is the Model: On Action-Item Coupling in Generative Recommendation

Published: October 19, 2025 | arXiv ID: 2510.16804v1

By: Xiaokai Wei , Jiajun Wu , Daiyao Yi and more

BigTech Affiliations: Roblox

Potential Business Impact:

Makes online suggestions smarter and faster.

Business Areas:

Personalization Commerce and Shopping

Generative Recommendation (GR) models treat a user's interaction history as a sequence to be autoregressively predicted. When both items and actions (e.g., watch time, purchase, comment) are modeled, the layout-the ordering and visibility of item/action tokens-critically determines what information the model can use and how it generalizes. We present a unified study of token layouts for GR grounded in first principles: (P1) maximize item/action signal in both input/output space, (P2) preserve the conditioning relationship "action given item" and (P3) no information leakage. While interleaved layout (where item and action occupy separate tokens) naturally satisfies these principles, it also bloats sequence length with larger training/inference cost. On the non-interleaved front, we design a novel and effective approach, Lagged Action Conditioning (LAC), which appears strange on the surface but aligns well with the design principles to yield strong accuracy. Comprehensive experiments on public datasets and large-scale production logs evaluate different layout options and empirically verifies the design principles. Our proposed non-interleaved method, LAC, achieves competitive or superior quality at substantially lower FLOPs than interleaving. Our findings offer actionable guidance for assembling GR systems that are both accurate and efficient.

Align$^3$GR: Unified Multi-Level Alignment for LLM-based Generative Recommendation

Information Retrieval

Helps online stores show you better stuff.

14 Nov 2025 2

85%

Align$^3$GR: Unified Multi-Level Alignment for LLM-based Generative Recommendation

Information Retrieval

Recommends better things by understanding you more.

14 Nov 2025 2

85%

A Simple yet Effective Layout Token in Large Language Models for Document Understanding

CV and Pattern Recognition

Helps computers understand documents better.

24 Mar 2025 1

View PDF Login to Bookmark

Country of Origin

🇺🇸 United States

Page Count

10 pages

The Layout Is the Model: On Action-Item Coupling in Generative Recommendation

Makes online suggestions smarter and faster.

Technical Abstract

Align$^3$GR: Unified Multi-Level Alignment for LLM-based Generative Recommendation

Align$^3$GR: Unified Multi-Level Alignment for LLM-based Generative Recommendation

A Simple yet Effective Layout Token in Large Language Models for Document Understanding