Score: 1

MMDEW: Multipurpose Multiclass Density Estimation in the Wild

Published: October 2, 2025 | arXiv ID: 2510.02213v1

By: Villanelle O'Reilly , Jonathan Cox , Georgios Leontidis and more

Potential Business Impact:

Counts many things in crowded pictures.

Business Areas:
Image Recognition Data and Analytics, Software

Density map estimation can be used to estimate object counts in dense and occluded scenes where discrete counting-by-detection methods fail. We propose a multicategory counting framework that leverages a Twins pyramid vision-transformer backbone and a specialised multi-class counting head built on a state-of-the-art multiscale decoding approach. A two-task design adds a segmentation-based Category Focus Module, suppressing inter-category cross-talk at training time. Training and evaluation on the VisDrone and iSAID benchmarks demonstrates superior performance versus prior multicategory crowd-counting approaches (33%, 43% and 64% reduction to MAE), and the comparison with YOLOv11 underscores the necessity of crowd counting methods in dense scenes. The method's regional loss opens up multi-class crowd counting to new domains, demonstrated through the application to a biodiversity monitoring dataset, highlighting its capacity to inform conservation efforts and enable scalable ecological insights.

Country of Origin
🇬🇧 United Kingdom

Page Count
9 pages

Category
Computer Science:
CV and Pattern Recognition