Score: 3

SWE-Mirror: Scaling Issue-Resolving Datasets by Mirroring Issues Across Repositories

Published: September 10, 2025 | arXiv ID: 2509.08724v1

By: Junhao Wang , Daoguang Zan , Shulin Xin and more

BigTech Affiliations: ByteDance

Potential Business Impact:

Helps computers learn to fix software problems.

Business Areas:

Simulation Software

Creating large-scale verifiable training datasets for issue-resolving tasks is a critical yet notoriously difficult challenge. Existing methods on automating the Gym environment setup process for real-world issues suffer from low success rates and high overhead. Meanwhile, synthesizing new tasks within existing Gym environments leaves the vast pool of authentic, human-reported problems untapped. To maximize the utilization of existing Gym environments and also the rich data of issue-resolving history on GitHub, we introduce SWE-Mirror, a pipeline that distills a real-world issue's semantic essence, mirrors it into another repository with a configured Gym environment, and re-animates it as a verifiable issue-resolving task. SWE-Mirror reuses existing Gym environments along with the vast pool of issue-resolving history hosted on GitHub to construct a large-scale dataset of mirrored authentic and verifiable tasks. Applying SWE-Mirror to 40 repositories across 4 languages, we have curated a dataset with 60,671 issue-resolving tasks and demonstrated the value of our dataset by training and evaluating coding agents at various scale. Post-training experiments show that models trained with the dataset exhibit improvements in issue-resolving capabilities. Furthermore, by extending the dataset size to over 12,000 high-quality trajectories, we established a new state-of-the-art (SOTA) among Qwen2.5-Coder-Instruct based LLMs on the OpenHands agent framework, which increases the resolve rate on SWE-Bench-Verified by +21.8% for the 7B model and +46.0% for the 32B model and validates the effectiveness of our approach.

SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks

Software Engineering

Helps computers learn to fix software bugs faster.

12 Jun 2025 1

89%

SWE-Synth: Synthesizing Verifiable Bug-Fix Data to Enable Large Language Models in Resolving Real-World Bugs

Software Engineering

Fixes computer code automatically and better.

20 Apr 2025 2

89%

SWE-smith: Scaling Data for Software Engineering Agents

Software Engineering

Creates lots of practice problems for AI to fix code.

30 Apr 2025 2

View PDF Login to Bookmark

Country of Origin

🇨🇳 China

Repos / Data Links

github.com github.com github.com

Page Count

21 pages

SWE-Mirror: Scaling Issue-Resolving Datasets by Mirroring Issues Across Repositories

Helps computers learn to fix software problems.

Technical Abstract

SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks

SWE-Synth: Synthesizing Verifiable Bug-Fix Data to Enable Large Language Models in Resolving Real-World Bugs

SWE-smith: Scaling Data for Software Engineering Agents