Score: 1

Experimental Analysis of Productive Interaction Strategy with ChatGPT: User Study on Function and Project-level Code Generation Tasks

Published: August 6, 2025 | arXiv ID: 2508.04125v1

By: Sangwon Hyun , Hyunjun Kim , Jinhyuk Jang and more

Potential Business Impact:

Helps computers write better code, faster.

The application of Large Language Models (LLMs) is growing in the productive completion of Software Engineering tasks. Yet, studies investigating the productive prompting techniques often employed a limited problem space, primarily focusing on well-known prompting patterns and mainly targeting function-level SE practices. We identify significant gaps in real-world workflows that involve complexities beyond class-level (e.g., multi-class dependencies) and different features that can impact Human-LLM Interactions (HLIs) processes in code generation. To address these issues, we designed an experiment that comprehensively analyzed the HLI features regarding the code generation productivity. Our study presents two project-level benchmark tasks, extending beyond function-level evaluations. We conducted a user study with 36 participants from diverse backgrounds, asking them to solve the assigned tasks by interacting with the GPT assistant using specific prompting patterns. We also examined the participants' experience and their behavioral features during interactions by analyzing screen recordings and GPT chat logs. Our statistical and empirical investigation revealed (1) that three out of 15 HLI features significantly impacted the productivity in code generation; (2) five primary guidelines for enhancing productivity for HLI processes; and (3) a taxonomy of 29 runtime and logic errors that can occur during HLI processes, along with suggested mitigation plans.

From Prompting to Partnering: Personalization Features for Human-LLM Interactions

Human-Computer Interaction

Makes AI easier to use and understand.

2 Mar 2025 0

91%

Uncovering Systematic Failures of LLMs in Verifying Code Against Natural Language Specifications

Software Engineering

Computers can't always tell if code matches instructions.

17 Aug 2025 0

91%

Developer-LLM Conversations: An Empirical Study of Interactions and Generated Code Quality

Software Engineering

Helps computers write better code by fixing mistakes.

12 Sep 2025 0

View PDF Login to Bookmark

Country of Origin

🇦🇺 🇰🇷 Korea, Republic of, Australia

Page Count

30 pages

Experimental Analysis of Productive Interaction Strategy with ChatGPT: User Study on Function and Project-level Code Generation Tasks

Helps computers write better code, faster.

Technical Abstract

From Prompting to Partnering: Personalization Features for Human-LLM Interactions

Uncovering Systematic Failures of LLMs in Verifying Code Against Natural Language Specifications

Developer-LLM Conversations: An Empirical Study of Interactions and Generated Code Quality