SIEVE: Towards Verifiable Certification for Code-datasets
By: Fatou Ndiaye Mbodji , El-hacen Diallo , Jordan Samhi and more
Potential Business Impact:
Makes computer code trustworthy and reliable.
Code agents and empirical software engineering rely on public code datasets, yet these datasets lack verifiable quality guarantees. Static 'dataset cards' inform, but they are neither auditable nor do they offer statistical guarantees, making it difficult to attest to dataset quality. Teams build isolated, ad-hoc cleaning pipelines. This fragments effort and raises cost. We present SIEVE, a community-driven framework. It turns per-property checks into Confidence Cards-machine-readable, verifiable certificates with anytime-valid statistical bounds. We outline a research plan to bring SIEVE to maturity, replacing narrative cards with anytime-verifiable certification. This shift is expected to lower quality-assurance costs and increase trust in code-datasets.
Similar Papers
Scalable Enforcement of Fine Grained Access Control Policies in Relational Database Management Systems
Databases
Makes computer databases check rules faster.
Does SWE-Bench-Verified Test Agent Ability or Model Memory?
Software Engineering
Tests show AI might cheat on computer problem tests.
Does SWE-Bench-Verified Test Agent Ability or Model Memory?
Software Engineering
Models might cheat on tests, not solve real problems.