Score: 0

Learning Dependency Models for Subset Repair

Published: December 20, 2025 | arXiv ID: 2512.18204v1

By: Haoda Li , Jiahui Chen , Yu Sun and more

Inconsistent values are commonly encountered in real-world applications, which can negatively impact data analysis and decision-making. While existing research primarily focuses on identifying the smallest removal set to resolve inconsistencies, recent studies have shown that multiple minimum removal sets may exist, making it difficult to make further decisions. While some approaches use the most frequent values as the guidance for the subset repair, this strategy has been criticized for its potential to inaccurately identify errors. To address these issues, we consider the dependencies between attribute values to determine a more appropriate subset repair. Our main contributions include (1) formalizing the optimal subset repair problem with attribute dependencies and analyzing its computational hardness; (2) computing the exact solution using integer linear programming; (3) developing an approximate algorithm with performance guarantees based on cliques and LP relaxation; and (4) designing a probabilistic approach with an approximation bound for efficiency. Experimental results on real-world datasets validate the effectiveness of our methods in both subset repair performance and downstream applications.

Efficient Query Repair for Aggregate Constraints

Databases

Fixes search results to meet special rules.

2 Nov 2025 2

84%

A Rule-Based Approach to Specifying Preferences over Conflicting Facts and Querying Inconsistent Knowledge Bases

Logic in Computer Science

Fixes wrong information in computer brains.

11 Aug 2025 1

84%

An Empirical Study of Sample Selection Strategies for Large Language Model Repair

Machine Learning (CS)

Makes AI less mean and more helpful.

23 Oct 2025 0

View PDF Login to Bookmark

Learning Dependency Models for Subset Repair

Technical Abstract

Efficient Query Repair for Aggregate Constraints

A Rule-Based Approach to Specifying Preferences over Conflicting Facts and Querying Inconsistent Knowledge Bases

An Empirical Study of Sample Selection Strategies for Large Language Model Repair