HyPerAlign: Interpretable Personalized LLM Alignment via Hypothesis Generation
By: Cristina Garbacea, Chenhao Tan
Potential Business Impact:
Makes AI write like you, not everyone.
Alignment algorithms are widely used to align large language models (LLMs) to human users based on preference annotations. Typically these (often divergent) preferences are aggregated over a diverse set of users, resulting in fine-tuned models that are aligned to the "average-user" preference. However, current models are used by individual users in very specific contexts and situations, which calls for user-dependent preference control. In this work we address the problem of personalizing LLM outputs to their users: we aim to generate customized responses tailored to specific individuals rather than generic outputs that emulate the collective voices of diverse populations. We propose HyPerAlign, an interpretable and sample-efficient hypothesis-driven personalization approach for LLMs. Given few-shot examples written by a particular user, we first infer hypotheses about their communication strategies, personality, and writing style, then prompt LLMs with these hypotheses and user-specific attributes to generate customized outputs. We conduct experiments on two personalization tasks, authorship attribution and deliberative alignment, with datasets from diverse domains (news articles, blog posts, emails, jailbreaking benchmarks). Results demonstrate the superiority of hypothesis-driven personalization over preference-based fine-tuning methods. For authorship attribution, HyPerAlign generations achieve consistently high win rates (commonly $> 90\%$) against state-of-the-art preference fine-tuning approaches across diverse user profiles and LLMs. For deliberative alignment, the helpfulness of LLMs improves by up to $70\%$ on average. Overall, HyPerAlign offers an interpretable and sample-efficient strategy for personalizing LLMs to individual users.
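The abstract describes a two-stage pipeline: first infer hypotheses about a user's communication style from a few writing samples, then condition generation on those hypotheses. The Python sketch below shows one way such a pipeline might be wired together; the prompt wording, the `gpt-4o-mini` model name, and the use of the `openai` client are illustrative assumptions, not the paper's actual prompts or implementation.

```python
# Illustrative sketch of a hypothesis-driven personalization pipeline.
# The prompt templates and model choice are assumptions for illustration,
# not the prompts used in the HyPerAlign paper.
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment
MODEL = "gpt-4o-mini"  # placeholder model name

def complete(prompt: str) -> str:
    """Single-turn chat completion helper."""
    response = client.chat.completions.create(
        model=MODEL,
        messages=[{"role": "user", "content": prompt}],
    )
    return response.choices[0].message.content or ""

def infer_hypotheses(user_samples: list[str]) -> str:
    """Stage 1: infer hypotheses about the user's style from a handful
    of texts they wrote (the sample-efficient step)."""
    samples = "\n\n---\n\n".join(user_samples)
    prompt = (
        "Below are texts written by one author. Formulate concise, "
        "testable hypotheses about their communication strategies, "
        "personality, and writing style.\n\n" + samples
    )
    return complete(prompt)

def personalized_answer(hypotheses: str, task: str) -> str:
    """Stage 2: condition generation on the inferred hypotheses so the
    output is tailored to this user rather than an 'average' one."""
    prompt = (
        "You are writing for a specific user. Hypotheses about their "
        f"style:\n{hypotheses}\n\nRespond to the following request in "
        f"that user's voice:\n{task}"
    )
    return complete(prompt)

if __name__ == "__main__":
    few_shot = ["First email the user wrote ...", "A blog post by the user ..."]
    hyps = infer_hypotheses(few_shot)
    print(personalized_answer(hyps, "Draft a short update email to my team."))
```

Because the inferred hypotheses are plain natural-language statements, they can be read, audited, and edited by the user, which is what makes this style of personalization interpretable compared to opaque preference fine-tuning.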
Similar Papers
A Survey on Personalized Alignment -- The Missing Piece for Large Language Models in Real-World Applications
Computation and Language
Teaches AI to be helpful and kind, your way.
From 1,000,000 Users to Every User: Scaling Up Personalized Preference for User-level Alignment
Computation and Language
Teaches AI to understand what *you* want.
A Survey on Personalized and Pluralistic Preference Alignment in Large Language Models
Computation and Language
Makes AI understand what you like best.