Toward Explaining Large Language Models in Software Engineering Tasks
By: Antonio Vitale, Khai-Nguyen Nguyen, Denys Poshyvanyk, and more
Recent progress in Large Language Models (LLMs) has substantially advanced the automation of software engineering (SE) tasks, enabling complex activities such as code generation and code summarization. However, the black-box nature of LLMs remains a major barrier to their adoption in high-stakes and safety-critical domains, where explainability and transparency are vital for trust, accountability, and effective human supervision. Despite increasing interest in explainable AI for software engineering, existing methods lack domain-specific explanations aligned with how practitioners reason about SE artifacts. To address this gap, we introduce FeatureSHAP, the first fully automated, model-agnostic explainability framework tailored to software engineering tasks. Based on Shapley values, FeatureSHAP attributes model outputs to high-level input features through systematic input perturbation and task-specific similarity comparisons, while remaining compatible with both open-source and proprietary LLMs. We evaluate FeatureSHAP on two bi-modal SE tasks: code generation and code summarization. The results show that FeatureSHAP assigns less importance to irrelevant input features and produces explanations with higher fidelity than baseline methods. A practitioner survey involving 37 participants shows that FeatureSHAP helps practitioners better interpret model outputs and make more informed decisions. Overall, FeatureSHAP represents a meaningful step toward practical explainable AI in software engineering. FeatureSHAP is available at https://github.com/deviserlab/FeatureSHAP.
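The abstract describes the core mechanism at a high level: perturb high-level input features, query the model on each perturbed input, compare each output against the output on the unperturbed input using a task-specific similarity function, and aggregate the comparisons into per-feature Shapley values. The sketch below illustrates that general recipe with Monte Carlo (permutation-sampling) Shapley estimation. It is not the paper's implementation; every name in it (shapley_attribution, render_input, similarity, n_permutations) is a hypothetical placeholder, and the authors' actual code is in the repository linked above.

```python
import random
from typing import Callable, Dict, FrozenSet, List

def shapley_attribution(
    features: List[str],                       # high-level input features (e.g., parts of a prompt)
    render_input: Callable[[List[str]], str],  # rebuilds a model input from the kept features
    model: Callable[[str], str],               # black-box LLM call (open-source or API-based)
    similarity: Callable[[str, str], float],   # task-specific output similarity (hypothetical choice)
    n_permutations: int = 50,
    seed: int = 0,
) -> Dict[str, float]:
    """Monte Carlo estimate of Shapley values over high-level input features.

    A coalition S of features is valued by the similarity between the model's
    output on the full input and its output on the input rebuilt from S alone.
    """
    rng = random.Random(seed)
    reference = model(render_input(features))  # output on the unperturbed input
    value_cache: Dict[FrozenSet[str], float] = {}

    def coalition_value(subset: FrozenSet[str]) -> float:
        # Cache coalition values so repeated coalitions cost no extra LLM calls.
        if subset not in value_cache:
            kept = [f for f in features if f in subset]
            value_cache[subset] = similarity(reference, model(render_input(kept)))
        return value_cache[subset]

    phi = {f: 0.0 for f in features}
    for _ in range(n_permutations):
        order = features[:]
        rng.shuffle(order)
        coalition: FrozenSet[str] = frozenset()
        prev = coalition_value(coalition)      # baseline: all features removed
        for f in order:
            coalition = coalition | {f}
            curr = coalition_value(coalition)
            phi[f] += curr - prev              # marginal contribution of f in this permutation
            prev = curr
    return {f: v / n_permutations for f, v in phi.items()}
```

Exact Shapley values require 2^n model queries for n features, which is impractical against an LLM, so a sampling scheme plus the coalition cache keeps the query budget tractable. For the similarity function, plausible task-specific choices would be a code-aware metric such as CodeBLEU for code generation and a text or embedding similarity for code summarization, though the paper's specific choices are not stated in the abstract.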
Similar Papers
Utilizing Large Language Models for Machine Learning Explainability
Machine Learning (CS)
Explores using LLMs to generate explanations for machine learning models.
ContextualSHAP: Enhancing SHAP Explanations Through Contextual Language Generation
Artificial Intelligence
Augments SHAP explanations with natural-language context.
From Black Box to Transparency: Enhancing Automated Interpreting Assessment with Explainable AI in College Classrooms
Computation and Language
Applies explainable AI to automated assessment of interpreting quality.