Score: 1

An Empirical Study on Prompt Compression for Large Language Models

Published: April 24, 2025 | arXiv ID: 2505.00019v1

By: Zheng Zhang , Jinyi Li , Yihuai Lan and more

Potential Business Impact:

Shortens computer instructions, saves money and time.

Business Areas:

Natural Language Processing Artificial Intelligence, Data and Analytics, Software

Prompt engineering enables Large Language Models (LLMs) to perform a variety of tasks. However, lengthy prompts significantly increase computational complexity and economic costs. To address this issue, we study six prompt compression methods for LLMs, aiming to reduce prompt length while maintaining LLM response quality. In this paper, we present a comprehensive analysis covering aspects such as generation performance, model hallucinations, efficacy in multimodal tasks, word omission analysis, and more. We evaluate these methods across 13 datasets, including news, scientific articles, commonsense QA, math QA, long-context QA, and VQA datasets. Our experiments reveal that prompt compression has a greater impact on LLM performance in long contexts compared to short ones. In the Longbench evaluation, moderate compression even enhances LLM performance. Our code and data is available at https://github.com/3DAgentWorld/Toolkit-for-Prompt-Compression.

Understanding and Improving Information Preservation in Prompt Compression for LLMs

Computation and Language

Makes AI understand long instructions better.

24 Mar 2025 3

91%

The Future of MLLM Prompting is Adaptive: A Comprehensive Experimental Evaluation of Prompt Engineering Methods for Robust Multimodal Performance

Artificial Intelligence

Teaches AI to understand pictures and words better.

14 Apr 2025 0

91%

Revisiting Prompt Engineering: A Comprehensive Evaluation for LLM-based Personalized Recommendation

Information Retrieval

Helps computers suggest things you'll like.

17 Jul 2025 2

View PDF Login to Bookmark

Repos / Data Links

github.com github.com github.com

Page Count

21 pages

An Empirical Study on Prompt Compression for Large Language Models

Shortens computer instructions, saves money and time.

Technical Abstract

Understanding and Improving Information Preservation in Prompt Compression for LLMs

The Future of MLLM Prompting is Adaptive: A Comprehensive Experimental Evaluation of Prompt Engineering Methods for Robust Multimodal Performance

Revisiting Prompt Engineering: A Comprehensive Evaluation for LLM-based Personalized Recommendation