Score: 1

EfficientIML: Efficient High-Resolution Image Manipulation Localization

Published: September 10, 2025 | arXiv ID: 2509.08583v1

By: Jinhan Li , Haoyang He , Lei Xie and more

Potential Business Impact:

Finds fake pictures made by AI.

Business Areas:
Image Recognition Data and Analytics, Software

With imaging devices delivering ever-higher resolutions and the emerging diffusion-based forgery methods, current detectors trained only on traditional datasets (with splicing, copy-moving and object removal forgeries) lack exposure to this new manipulation type. To address this, we propose a novel high-resolution SIF dataset of 1200+ diffusion-generated manipulations with semantically extracted masks. However, this also imposes a challenge on existing methods, as they face significant computational resource constraints due to their prohibitive computational complexities. Therefore, we propose a novel EfficientIML model with a lightweight, three-stage EfficientRWKV backbone. EfficientRWKV's hybrid state-space and attention network captures global context and local details in parallel, while a multi-scale supervision strategy enforces consistency across hierarchical predictions. Extensive evaluations on our dataset and standard benchmarks demonstrate that our approach outperforms ViT-based and other SOTA lightweight baselines in localization performance, FLOPs and inference speed, underscoring its suitability for real-time forensic applications.

Page Count
7 pages

Category
Computer Science:
CV and Pattern Recognition