Abstract
Estimation errors caused by perspective distortions are a long-standing problem in the domain of crowd counting. In this paper, we propose a novel loss function to allow filters in convolutional neural networks to learn features that are adaptive to the scale and perspective variation of individuals in crowd images. By exploring the crowd count error from regions close to the vanishing point of a perspective distorted image, we are able to penalize under-estimations. This is useful to train a network that is robust against perspective distortion for accurate density estimation. The proposed method is scene-independent and can be applied effectively to crowd scene with a variety of physical layout. Extensive comparative evaluations demonstrate that our proposed method achieves significant improvement over the state-of-the-art approaches on the challenging ShanghaiTech and UCF-QNRF datasets.
Original language | English |
---|---|
Title of host publication | Proceedings of the MVA 2019, International Conference on Machine Vision Applications |
Editors | Hitoshi Habe, Tony Tung |
Place of Publication | Piscataway NJ USA |
Publisher | IEEE, Institute of Electrical and Electronics Engineers |
Pages | 366-371 |
Number of pages | 6 |
ISBN (Electronic) | 9784901122184 |
ISBN (Print) | 9781728109251 |
DOIs | |
Publication status | Published - 2019 |
Event | Machine Vision Applications 2019 - Tokyo, Japan Duration: 27 May 2019 → 31 May 2019 Conference number: 16th https://ieeexplore.ieee.org/xpl/conhome/8750992/proceeding (Proceedings) |
Conference
Conference | Machine Vision Applications 2019 |
---|---|
Abbreviated title | MVA 2019 |
Country/Territory | Japan |
City | Tokyo |
Period | 27/05/19 → 31/05/19 |
Internet address |