Unveiling Perceptual Artifacts: A Fine-Grained Benchmark for Interpretable AI-Generated Image Detection
By: Yao Xiao , Weiyan Chen , Jiahao Chen and more
Potential Business Impact:
Shows how fake pictures are made, pixel by pixel.
Current AI-Generated Image (AIGI) detection approaches predominantly rely on binary classification to distinguish real from synthetic images, often lacking interpretable or convincing evidence to substantiate their decisions. This limitation stems from existing AIGI detection benchmarks, which, despite featuring a broad collection of synthetic images, remain restricted in their coverage of artifact diversity and lack detailed, localized annotations. To bridge this gap, we introduce a fine-grained benchmark towards eXplainable AI-Generated image Detection, named X-AIGD, which provides pixel-level, categorized annotations of perceptual artifacts, spanning low-level distortions, high-level semantics, and cognitive-level counterfactuals. These comprehensive annotations facilitate fine-grained interpretability evaluation and deeper insight into model decision-making processes. Our extensive investigation using X-AIGD provides several key insights: (1) Existing AIGI detectors demonstrate negligible reliance on perceptual artifacts, even at the most basic distortion level. (2) While AIGI detectors can be trained to identify specific artifacts, they still substantially base their judgment on uninterpretable features. (3) Explicitly aligning model attention with artifact regions can increase the interpretability and generalization of detectors. The data and code are available at: https://github.com/Coxy7/X-AIGD.
Similar Papers
Is Artificial Intelligence Generated Image Detection a Solved Problem?
CV and Pattern Recognition
Finds fake pictures made by computers.
Exploration of Reproducible Generated Image Detection
CV and Pattern Recognition
Finds fake AI pictures better, even new ones.
Task-Model Alignment: A Simple Path to Generalizable AI-Generated Image Detection
CV and Pattern Recognition
Finds fake pictures by checking words and details.