MMDEW: Multipurpose Multiclass Density Estimation in the Wild
By: Villanelle O'Reilly , Jonathan Cox , Georgios Leontidis and more
Potential Business Impact:
Counts many things in crowded pictures.
Density map estimation can be used to estimate object counts in dense and occluded scenes where discrete counting-by-detection methods fail. We propose a multicategory counting framework that leverages a Twins pyramid vision-transformer backbone and a specialised multi-class counting head built on a state-of-the-art multiscale decoding approach. A two-task design adds a segmentation-based Category Focus Module, suppressing inter-category cross-talk at training time. Training and evaluation on the VisDrone and iSAID benchmarks demonstrates superior performance versus prior multicategory crowd-counting approaches (33%, 43% and 64% reduction to MAE), and the comparison with YOLOv11 underscores the necessity of crowd counting methods in dense scenes. The method's regional loss opens up multi-class crowd counting to new domains, demonstrated through the application to a biodiversity monitoring dataset, highlighting its capacity to inform conservation efforts and enable scalable ecological insights.
Similar Papers
FusionCounting: Robust visible-infrared image fusion guided by crowd counting via multi-task learning
CV and Pattern Recognition
Helps cameras count people better in crowded places.
Density Estimation and Crowd Counting
CV and Pattern Recognition
Counts people in videos more accurately and faster.
Count2Density: Crowd Density Estimation without Location-level Annotations
CV and Pattern Recognition
Counts people in pictures without marking each one.