Score: 0

EGD-YOLO: A Lightweight Multimodal Framework for Robust Drone-Bird Discrimination via Ghost-Enhanced YOLOv8n and EMA Attention under Adverse Condition

Published: October 12, 2025 | arXiv ID: 2510.10765v1

By: Sudipto Sarkar , Mohammad Asif Hasan , Khondokar Ashik Shahriar and more

Potential Business Impact:

Spots birds and drones from pictures faster.

Business Areas:
Image Recognition Data and Analytics, Software

Identifying drones and birds correctly is essential for keeping the skies safe and improving security systems. Using the VIP CUP 2025 dataset, which provides both RGB and infrared (IR) images, this study presents EGD-YOLOv8n, a new lightweight yet powerful model for object detection. The model improves how image features are captured and understood, making detection more accurate and efficient. It uses smart design changes and attention layers to focus on important details while reducing the amount of computation needed. A special detection head helps the model adapt to objects of different shapes and sizes. We trained three versions: one using RGB images, one using IR images, and one combining both. The combined model achieved the best accuracy and reliability while running fast enough for real-time use on common GPUs.

Page Count
6 pages

Category
Computer Science:
CV and Pattern Recognition