WSCF-MVCC: Weakly-supervised Calibration-free Multi-view Crowd Counting
By: Bin Li, Daijie Chen, Qi Zhang
Potential Business Impact:
Counts people in crowds across multiple camera views without needing camera calibration or per-person annotations.
Multi-view crowd counting can effectively mitigate the occlusion issues that commonly arise in single-image crowd counting. Existing deep-learning multi-view crowd counting methods project images from different camera views onto a common space to obtain ground-plane density maps, which requires abundant, costly crowd annotations and camera calibrations. Calibration-free methods have therefore been proposed that require neither camera calibrations nor scene-level crowd annotations. However, existing calibration-free methods still require expensive image-level crowd annotations to train the single-view counting module. In this paper, we propose a weakly-supervised calibration-free multi-view crowd counting method (WSCF-MVCC) that uses the crowd count directly as supervision for the single-view counting module, rather than density maps constructed from crowd annotations. In addition, a self-supervised ranking loss that leverages multi-scale priors is used to enhance the model's perceptual ability without additional annotation cost. Moreover, the proposed model leverages semantic information to achieve more accurate view matching and, consequently, a more precise scene-level crowd count estimate. The proposed method outperforms state-of-the-art methods on three widely used multi-view counting datasets under weakly supervised settings, indicating that it is more suitable for practical deployment than calibrated methods. Code is available at https://github.com/zqyq/Weakly-MVCC.
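The abstract does not spell out the loss functions, so the following is only a minimal sketch of how count-level supervision and a multi-scale ranking prior could be combined. It assumes a common ranking formulation in which a crop nested inside a larger crop should not be assigned a higher count than the larger crop. The function names, the margin, and the weighting are hypothetical and are not taken from the released code.

```python
def ranking_loss(nested_counts, margin=0.0):
    """Hinge-style ranking penalty over predicted counts of nested crops.

    nested_counts is ordered from the largest crop to the smallest one.
    Assumption: a smaller crop contained in a larger crop cannot hold
    more people, so any pair that violates this ordering is penalized.
    """
    loss = 0.0
    for larger, smaller in zip(nested_counts, nested_counts[1:]):
        loss += max(0.0, smaller - larger + margin)
    return loss


def count_loss(predicted_count, true_count):
    """Absolute error between the predicted and annotated crowd count."""
    return abs(predicted_count - true_count)


def total_loss(predicted_count, true_count, nested_counts, weight=1.0):
    """Count supervision plus a weighted ranking term (weight is hypothetical)."""
    return count_loss(predicted_count, true_count) + weight * ranking_loss(nested_counts)


if __name__ == "__main__":
    # Predictions for a full image and two progressively smaller nested crops.
    print(ranking_loss([10.0, 7.0, 3.0]))              # ordering respected
    print(ranking_loss([5.0, 8.0, 2.0]))               # first pair violates the prior
    print(total_loss(12.0, 10.0, [12.0, 13.5, 4.0]))
```

The ranking term needs no labels, since the ordering constraint follows from how the crops are nested; only the count term uses the annotated total.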
Similar Papers
FusionCounting: Robust visible-infrared image fusion guided by crowd counting via multi-task learning
CV and Pattern Recognition
Helps cameras count people better in crowded places.
Density Estimation and Crowd Counting
CV and Pattern Recognition
Counts people in videos more accurately and faster.