EfficientIML: Efficient High-Resolution Image Manipulation Localization
By: Jinhan Li , Haoyang He , Lei Xie and more
Potential Business Impact:
Finds fake pictures made by AI.
With imaging devices delivering ever-higher resolutions and the emerging diffusion-based forgery methods, current detectors trained only on traditional datasets (with splicing, copy-moving and object removal forgeries) lack exposure to this new manipulation type. To address this, we propose a novel high-resolution SIF dataset of 1200+ diffusion-generated manipulations with semantically extracted masks. However, this also imposes a challenge on existing methods, as they face significant computational resource constraints due to their prohibitive computational complexities. Therefore, we propose a novel EfficientIML model with a lightweight, three-stage EfficientRWKV backbone. EfficientRWKV's hybrid state-space and attention network captures global context and local details in parallel, while a multi-scale supervision strategy enforces consistency across hierarchical predictions. Extensive evaluations on our dataset and standard benchmarks demonstrate that our approach outperforms ViT-based and other SOTA lightweight baselines in localization performance, FLOPs and inference speed, underscoring its suitability for real-time forensic applications.
Similar Papers
From Passive Perception to Active Memory: A Weakly Supervised Image Manipulation Localization Framework Driven by Coarse-Grained Annotations
CV and Pattern Recognition
Finds fake parts in pictures with less work.
Webly-Supervised Image Manipulation Localization via Category-Aware Auto-Annotation
CV and Pattern Recognition
Finds fake pictures online using web images.
Training-Free In-Context Forensic Chain for Image Manipulation Detection and Localization
CV and Pattern Recognition
Find fake pictures without training.