Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/27119
Title: MFP-Net: Multi-scale feature pyramid network for crowd counting
Authors: Lei, T
Zhang, D
Wang, R
Li, S
Zhang, W
Nandi, AK
Keywords: optical, image and video signal processing;computer vision and image processing techniques;neural nets
Issue Date: 9-Jun-2021
Publisher: Institution of Engineering and Technology (IET)
Citation: Lei, T. et al. (2021) 'MFP-Net: Multi-scale feature pyramid network for crowd counting', IET Image Processing, 15 (14), pp. 3522 - 3533. doi: 10.1049/ipr2.12230.
Abstract: Copyright © 2021 The Authors.. Although deep learning has been widely used for dense crowd counting, it still faces two challenges. Firstly, the popular network models are sensitive to scale variance of human head, human occlusions, and complex background due to repeated utilization of vanilla convolution kernels. Secondly, the vanilla feature fusion often depends on summation or concatenation, which ignores the correlation of different features leading to information redundancy and low robustness to background noise. To address these issues, a multi-scale feature pyramid network (MFP-Net) for dense crowd counting is proposed in this paper. The proposed MFP-Net makes two contributions. Firstly, the feature pyramid fusion module is designed that adopts rich convolutions with different depths and scales, not only to expand the receptive field, but also to improve the inference speed of models by using parallel group convolution. Secondly, a feature attention-aware module is added in the feature fusion stage. The module can achieve local and global information fusion by capturing the importance of the spatial and channel domains to improve model robustness. The proposed MFP-Net is evaluated on five publicly available datasets, and experiments show that the MFP-Net not only provides better crowd counting results than comparative models, but also requires fewer parameters.
URI: https://bura.brunel.ac.uk/handle/2438/27119
DOI: https://doi.org/10.1049/ipr2.12230
ISSN: 1751-9659
Other Identifiers: ORCID iDs: Tao Lei https://orcid.org/0000-0002-2104-9298; Asoke K. Nandi https://orcid.org/0000-0001-6248-2875
Appears in Collections:Dept of Electronic and Electrical Engineering Research Papers

Files in This Item:
File Description SizeFormat 
FullText.pdfCopyright © 2021 The Authors. IET Image Processing published by John Wiley & Sons Ltd on behalf of The Institution of Engineering and Technology. This is an open access article under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits use, distribution and reproduction in any medium, provided the original work is properly cited.3.83 MBAdobe PDFView/Open


This item is licensed under a Creative Commons License Creative Commons