Score: 2

Learning to Select In-Context Demonstration Preferred by Large Language Model

Published: May 26, 2025 | arXiv ID: 2505.19966v1

By: Zheng Zhang , Shaocheng Lan , Lei Song and more

BigTech Affiliations: Microsoft

Potential Business Impact:

Helps AI learn better by picking good examples.

Business Areas:
Semantic Search Internet Services

In-context learning (ICL) enables large language models (LLMs) to adapt to new tasks during inference using only a few demonstrations. However, ICL performance is highly dependent on the selection of these demonstrations. Recent work explores retrieval-based methods for selecting query-specific demonstrations, but these approaches often rely on surrogate objectives such as metric learning, failing to directly optimize ICL performance. Consequently, they struggle to identify truly beneficial demonstrations. Moreover, their discriminative retrieval paradigm is ineffective when the candidate pool lacks sufficient high-quality demonstrations. To address these challenges, we propose GenICL, a novel generative preference learning framework that leverages LLM feedback to directly optimize demonstration selection for ICL. Experiments on 19 datasets across 11 task categories demonstrate that GenICL achieves superior performance than existing methods in selecting the most effective demonstrations, leading to better ICL performance.

Country of Origin
🇨🇳 🇺🇸 China, United States

Page Count
16 pages

Category
Computer Science:
Machine Learning (CS)