Model-Agnostic Correctness Assessment for LLM-Generated Code via Dynamic Internal Representation Selection
By: Thanh Trong Vu, Tuan-Dung Bui, Thu-Trang Nguyen, and more
Potential Business Impact:
Checks whether LLM-generated computer code works correctly.
Large Language Models (LLMs) have demonstrated impressive capabilities in code generation and are increasingly integrated into the software development process. However, ensuring the correctness of LLM-generated code remains a critical concern. Prior work has shown that the internal representations of LLMs encode meaningful signals for assessing code correctness. Nevertheless, existing methods rely on representations from pre-selected, fixed layers and token positions, which can limit their generalizability across diverse model architectures and tasks. In this work, we introduce AUTOPROBE, a novel model-agnostic approach that dynamically selects the most informative internal representations for code correctness assessment. AUTOPROBE employs an attention-based mechanism to learn importance scores for hidden states, enabling it to focus on the most relevant features. These weighted representations are then aggregated and passed to a probing classifier to predict code correctness across multiple dimensions, including compilability, functionality, and security. To evaluate the performance of AUTOPROBE, we conduct extensive experiments across multiple benchmarks and code LLMs. Our experimental results show that AUTOPROBE consistently outperforms the baselines. For security assessment, AUTOPROBE surpasses the state-of-the-art white-box approach by 18%. For compilability and functionality assessment, AUTOPROBE demonstrates the highest robustness to code complexity, outperforming the other approaches by up to 19% and 111%, respectively. These findings highlight that dynamically selecting important internal signals enables AUTOPROBE to serve as a robust and generalizable solution for assessing the correctness of code generated by various LLMs.
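The abstract describes an attention-based mechanism that scores hidden states and feeds the weighted aggregate to a probing classifier. The sketch below is an illustrative PyTorch implementation of that general idea, not the authors' released code; the class name `AttentionProbe`, the flattening of layer/token hidden states into one axis, and the single linear scorer are assumptions made for illustration.

```python
# Minimal sketch (assumed, not the authors' implementation) of an attention-based
# probe over LLM hidden states: score every (layer, token) hidden state, pool the
# states with softmax weights, and classify code correctness from the pooled vector.
import torch
import torch.nn as nn

class AttentionProbe(nn.Module):
    def __init__(self, hidden_dim: int, num_labels: int = 2):
        super().__init__()
        # Produces one importance score per hidden state position.
        self.scorer = nn.Linear(hidden_dim, 1)
        # Maps the aggregated representation to correctness logits.
        self.classifier = nn.Linear(hidden_dim, num_labels)

    def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:
        # hidden_states: (batch, num_positions, hidden_dim), where num_positions
        # flattens all layers and token positions collected from the code LLM.
        scores = self.scorer(hidden_states).squeeze(-1)        # (batch, num_positions)
        weights = torch.softmax(scores, dim=-1).unsqueeze(-1)  # (batch, num_positions, 1)
        pooled = (weights * hidden_states).sum(dim=1)          # (batch, hidden_dim)
        return self.classifier(pooled)                         # (batch, num_labels)

# Example usage: hidden states would typically be collected from a code LLM
# (e.g. with output_hidden_states=True), stacked across layers and tokens, then
# the probe trained with cross-entropy on labels such as compilable/functional/secure.
probe = AttentionProbe(hidden_dim=4096, num_labels=2)
dummy_states = torch.randn(8, 32 * 64, 4096)  # batch of 8, 32 layers x 64 tokens
logits = probe(dummy_states)
```

The design choice mirrored here is that the softmax weights let the probe learn which layers and token positions carry the correctness signal, rather than hard-coding a single layer or the final token.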
Similar Papers
UCoder: Unsupervised Code Generation by Internal Probing of Large Language Models
Computation and Language
Teaches computers to write code without examples.
Mechanistic Interpretability of Code Correctness in LLMs via Sparse Autoencoders
Software Engineering
Finds AI mistakes in computer code.
Empirical Evaluation of Large Language Models in Automated Program Repair
Software Engineering
Fixes computer code errors faster and better.