PromptPilot: Improving Human-AI Collaboration Through LLM-Enhanced Prompt Engineering
By: Niklas Gutheil , Valentin Mayer , Leopold Müller and more
Potential Business Impact:
Helps people get better answers from AI.
Effective prompt engineering is critical to realizing the promised productivity gains of large language models (LLMs) in knowledge-intensive tasks. Yet, many users struggle to craft prompts that yield high-quality outputs, limiting the practical benefits of LLMs. Existing approaches, such as prompt handbooks or automated optimization pipelines, either require substantial effort, expert knowledge, or lack interactive guidance. To address this gap, we design and evaluate PromptPilot, an interactive prompting assistant grounded in four empirically derived design objectives for LLM-enhanced prompt engineering. We conducted a randomized controlled experiment with 80 participants completing three realistic, work-related writing tasks. Participants supported by PromptPilot achieved significantly higher performance (median: 78.3 vs. 61.7; p = .045, d = 0.56), and reported enhanced efficiency, ease-of-use, and autonomy during interaction. These findings empirically validate the effectiveness of our proposed design objectives, establishing LLM-enhanced prompt engineering as a viable technique for improving human-AI collaboration.
Similar Papers
Revisiting Prompt Engineering: A Comprehensive Evaluation for LLM-based Personalized Recommendation
Information Retrieval
Helps computers suggest things you'll like.
Prompt Engineering and the Effectiveness of Large Language Models in Enhancing Human Productivity
Human-Computer Interaction
Clear instructions make AI work better.
Prompt engineering does not universally improve Large Language Model performance across clinical decision-making tasks
Computation and Language
Helps doctors make better patient diagnoses and treatments.