U-ASD Net: supervised crowd counting based on semantic segmentation and adaptive scenario discovery

Crowd counting considers one of the most significant and challenging issues in computer vision and deep learning communities, whose applications are being utilized for various tasks. While this issue is well studied, it remains an open challenge to manage perspective distortions and scale variations...

Full description

Saved in:
Bibliographic Details
Main Authors: Hafeezallah, Adel, Al-Dhamari, Ahlam, Abu-Bakar, Syed Abd. Rahman
Format: Article
Language:English
Published: Institute of Electrical and Electronics Engineers Inc. 2021
Subjects:
Online Access:http://eprints.utm.my/id/eprint/93961/1/SyedAbdRahman2021_UASDNetSupervisedCrowdCounting.pdf
http://eprints.utm.my/id/eprint/93961/
http://dx.doi.org/10.1109/ACCESS.2021.3112174
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utm.93961
record_format eprints
spelling my.utm.939612022-02-28T13:27:00Z http://eprints.utm.my/id/eprint/93961/ U-ASD Net: supervised crowd counting based on semantic segmentation and adaptive scenario discovery Hafeezallah, Adel Al-Dhamari, Ahlam Abu-Bakar, Syed Abd. Rahman TK Electrical engineering. Electronics Nuclear engineering Crowd counting considers one of the most significant and challenging issues in computer vision and deep learning communities, whose applications are being utilized for various tasks. While this issue is well studied, it remains an open challenge to manage perspective distortions and scale variations. How well these problems are resolved has a huge impact on predicting a high-quality crowd density map. In this study, a hybrid and modified deep neural network (U-ASD Net), based on U-Net and adaptive scenario discovery (ASD), is proposed to get precise and effective crowd counting. The U part is produced by replacing the nearest upsampling in the encoder of U-Net with max-unpooling. This modification provides a better crowd counting performance by capturing more spatial information. The max-unpooling layers upsample the feature maps based on the max locations held from the downsampling process. The ASD part is constructed with three light pathways, two of which have been learned to reflect various densities of the crowd and define the appropriate geometric configuration employing various sizes of the receptive field. The third pathway is an adaptation path, which implicitly discovers and models complex scenarios to recalibrate pathway-wise responses adaptively. ASD has no additional branches to avoid increasing the complexity. The designed model is end-to-end trainable. This integration provides an effective model to count crowds in both dense and sparse datasets. It also predicts an elevated quality density map with a high structural similarity index and a high peak signal-to-noise ratio. Several comprehensive experiments on four popular datasets for crowd counting have been carried out to demonstrate the proposed method's promising performance compared to other state-of-the-art approaches. Furthermore, a new dataset with its manual annotations, called Haramain with three different scenes and different densities, is introduced and used for evaluating the U-ASD Net. Institute of Electrical and Electronics Engineers Inc. 2021-09 Article PeerReviewed application/pdf en http://eprints.utm.my/id/eprint/93961/1/SyedAbdRahman2021_UASDNetSupervisedCrowdCounting.pdf Hafeezallah, Adel and Al-Dhamari, Ahlam and Abu-Bakar, Syed Abd. Rahman (2021) U-ASD Net: supervised crowd counting based on semantic segmentation and adaptive scenario discovery. IEEE Access, 9 . pp. 127444-127459. ISSN 2169-3536 http://dx.doi.org/10.1109/ACCESS.2021.3112174 DOI:10.1109/ACCESS.2021.3112174
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
language English
topic TK Electrical engineering. Electronics Nuclear engineering
spellingShingle TK Electrical engineering. Electronics Nuclear engineering
Hafeezallah, Adel
Al-Dhamari, Ahlam
Abu-Bakar, Syed Abd. Rahman
U-ASD Net: supervised crowd counting based on semantic segmentation and adaptive scenario discovery
description Crowd counting considers one of the most significant and challenging issues in computer vision and deep learning communities, whose applications are being utilized for various tasks. While this issue is well studied, it remains an open challenge to manage perspective distortions and scale variations. How well these problems are resolved has a huge impact on predicting a high-quality crowd density map. In this study, a hybrid and modified deep neural network (U-ASD Net), based on U-Net and adaptive scenario discovery (ASD), is proposed to get precise and effective crowd counting. The U part is produced by replacing the nearest upsampling in the encoder of U-Net with max-unpooling. This modification provides a better crowd counting performance by capturing more spatial information. The max-unpooling layers upsample the feature maps based on the max locations held from the downsampling process. The ASD part is constructed with three light pathways, two of which have been learned to reflect various densities of the crowd and define the appropriate geometric configuration employing various sizes of the receptive field. The third pathway is an adaptation path, which implicitly discovers and models complex scenarios to recalibrate pathway-wise responses adaptively. ASD has no additional branches to avoid increasing the complexity. The designed model is end-to-end trainable. This integration provides an effective model to count crowds in both dense and sparse datasets. It also predicts an elevated quality density map with a high structural similarity index and a high peak signal-to-noise ratio. Several comprehensive experiments on four popular datasets for crowd counting have been carried out to demonstrate the proposed method's promising performance compared to other state-of-the-art approaches. Furthermore, a new dataset with its manual annotations, called Haramain with three different scenes and different densities, is introduced and used for evaluating the U-ASD Net.
format Article
author Hafeezallah, Adel
Al-Dhamari, Ahlam
Abu-Bakar, Syed Abd. Rahman
author_facet Hafeezallah, Adel
Al-Dhamari, Ahlam
Abu-Bakar, Syed Abd. Rahman
author_sort Hafeezallah, Adel
title U-ASD Net: supervised crowd counting based on semantic segmentation and adaptive scenario discovery
title_short U-ASD Net: supervised crowd counting based on semantic segmentation and adaptive scenario discovery
title_full U-ASD Net: supervised crowd counting based on semantic segmentation and adaptive scenario discovery
title_fullStr U-ASD Net: supervised crowd counting based on semantic segmentation and adaptive scenario discovery
title_full_unstemmed U-ASD Net: supervised crowd counting based on semantic segmentation and adaptive scenario discovery
title_sort u-asd net: supervised crowd counting based on semantic segmentation and adaptive scenario discovery
publisher Institute of Electrical and Electronics Engineers Inc.
publishDate 2021
url http://eprints.utm.my/id/eprint/93961/1/SyedAbdRahman2021_UASDNetSupervisedCrowdCounting.pdf
http://eprints.utm.my/id/eprint/93961/
http://dx.doi.org/10.1109/ACCESS.2021.3112174
_version_ 1726791460108369920
score 13.160551