
Rethinking Cognitive Complexity for Unit Tests: Toward a Readability-Aware Metric Grounded in Developer Perception

Published: June 7, 2025 | arXiv ID: 2506.06764v2

By: Wendkûuni C. Ouédraogo, Yinghua Li, Xueqi Dang and more

Potential Business Impact:

Makes computer tests easier to understand.

Business Areas:

Usability Testing, Data and Analytics, Design

Automatically generated unit tests, whether from search-based tools like EvoSuite or from LLMs, vary significantly in structure and readability. Yet most evaluations rely on metrics like Cyclomatic Complexity and Cognitive Complexity, which were designed for functional code rather than test code. Recent studies have shown that SonarSource's Cognitive Complexity metric assigns near-zero scores to LLM-generated tests, yet its behavior on EvoSuite-generated tests and its applicability to test-specific code structures remain unexplored. We introduce CCTR, a Test-Aware Cognitive Complexity metric tailored for unit tests. CCTR integrates structural and semantic features such as assertion density, annotation roles, and test composition patterns, dimensions that traditional complexity models ignore but that are critical for understanding test code. We evaluate 15,750 test suites generated by EvoSuite, GPT-4o, and Mistral Large-1024 across 350 classes from Defects4J and SF110. Results show that CCTR effectively discriminates between structured and fragmented test suites, producing interpretable scores that better reflect developer-perceived effort. By bridging structural analysis and test readability, CCTR provides a foundation for more reliable evaluation and improvement of generated tests. We publicly release all data, prompts, and evaluation scripts to support replication.

Unveiling Hybrid Cyclomatic Complexity: A Comprehensive Analysis and Evaluation as an Integral Feature in Automatic Defect Prediction Models

Software Engineering

Finds bugs in computer programs faster.

1 Apr 2025 0

86%

Beyond Surface Similarity: Evaluating LLM-Based Test Refactorings with Structural and Semantic Awareness

Software Engineering

Measures how well AI improves computer code.

7 Jun 2025 2

85%

Test Case Generation from Bug Reports via Large Language Models: A Cognitive Layered Evaluation Framework

Software Engineering

Helps computers write better code tests.

6 Oct 2025 0


Country of Origin

🇱🇺 🇸🇬 🇹🇷 Singapore, Turkey, Luxembourg

Page Count

6 pages
