Non-programmers Assessing AI-Generated Code: A Case Study of Business Users Analyzing Data

Published: August 8, 2025 | arXiv ID: 2508.06484v1

By: Yuvraj Virk, Dongyu Liu

Potential Business Impact:

Business professionals cannot reliably verify AI-generated data analyses, putting AI-assisted business decisions at risk.

Non-technical end-users increasingly rely on AI code generation to perform technical tasks like data analysis. However, large language models (LLMs) remain unreliable, and it is unclear whether end-users can effectively identify model errors, especially in realistic and domain-specific scenarios. We surveyed marketing and sales professionals to assess their ability to critically evaluate LLM-generated analyses of marketing data. Participants were shown natural language explanations of the AI's code, were repeatedly informed that the AI often makes mistakes, and were explicitly prompted to identify them. Yet participants frequently failed to detect critical flaws that could compromise decision-making, many of which required no technical knowledge to recognize. To investigate why, we reformatted AI responses into clearly delineated steps and provided alternative approaches for each decision to support critical evaluation. While these changes had a positive effect, participants often struggled to reason through the AI's steps and alternatives. Our findings suggest that business professionals cannot reliably verify AI-generated data analyses on their own; we explore the reasons why to inform future designs. As non-programmers adopt code-generating AI for technical tasks, unreliable AI and insufficient human oversight pose risks of unsafe or low-quality decisions.
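
To make the claim concrete, here is a minimal, hypothetical Python sketch (invented for illustration; it is not taken from the paper's study materials) of the kind of flaw the authors describe: an error that distorts a business metric yet requires no programming knowledge to recognize once stated plainly, such as averaging campaign conversion rates without weighting by traffic.

```python
# Hypothetical example (not from the paper): a plausible LLM-generated
# marketing analysis whose flaw a non-programmer could catch if surfaced.
import pandas as pd

# Invented toy data: two campaigns of very different sizes.
df = pd.DataFrame({
    "campaign":    ["A", "A", "B"],
    "visits":      [10_000, 10_000, 100],
    "conversions": [200, 300, 50],
})

per_campaign = df.groupby("campaign").sum(numeric_only=True)
per_campaign["rate"] = per_campaign["conversions"] / per_campaign["visits"]

# Flawed step: a plain average of per-campaign rates treats a
# 100-visit campaign the same as a 20,000-visit one.
naive = per_campaign["rate"].mean()

# Sound step: weight by traffic (total conversions / total visits).
weighted = df["conversions"].sum() / df["visits"].sum()

print(f"naive average of rates: {naive:.2%}")    # 26.25%
print(f"traffic-weighted rate:  {weighted:.2%}") #  2.74%
```

The flaw here ("small campaigns count as much as large ones") is stated entirely in business terms, which illustrates the paper's point that many of the errors participants missed required no technical knowledge to recognize.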

Country of Origin
🇺🇸 United States

Page Count
6 pages

Category
Computer Science: Human-Computer Interaction