Staff View: Scene Recognition and Classification Model Based on Human Pre-attentive Visual Attention

Scene recognition is considered one of the most important functions of human vision. In computer vision, the scene recognition problem is likewise significant. Scene recognition, or classification, is the process of organizing images and predicting the class category of a sce...

Bibliographic Details
Main Author:	Ahmad Ridzuan, Kudus
Format:	Thesis
Language:	English
Published:	Universiti Malaysia Sarawak (UNIMAS) 2021
Subjects:	BF Psychology; QA76 Computer software; T201 Patents. Trademarks
Online Access:	http://ir.unimas.my/id/eprint/34925/3/Ahmad%20Ridzuan%20Kudus%20ft.pdf http://ir.unimas.my/id/eprint/34925/

id	my.unimas.ir.34925
record_format	eprints
spelling	my.unimas.ir.34925 2024-08-20T06:44:48Z http://ir.unimas.my/id/eprint/34925/ Scene Recognition and Classification Model Based on Human Pre-attentive Visual Attention Ahmad Ridzuan, Kudus BF Psychology QA76 Computer software T201 Patents. Trademarks Scene recognition is considered one of the most important functions of human vision. In computer vision, the scene recognition problem is likewise significant. Scene recognition, or classification, is the process of organizing images and predicting the class category of a scene image. Humans can classify scenes accurately and effortlessly within a short period of time. Building on this observation, this study proposes a novel scene classification model based on human pre-attentive visual attention, which uses one of the earliest saliency models to generate a set of high-quality regions that potentially contain salient objects. An experimental study was performed to investigate the efficiency of the Saliency Toolbox on natural indoor scene images as its parameters are varied. At the end of this experiment, acceptable parameter scales were finalized for the use of the Saliency Toolbox in the proposed scene classification model. The proposed model comprises three main operations: (i) salient region proposal generation, (ii) feature extraction and concatenation, and (iii) classification. The proposed model was trained and tested on the MIT Indoor 67 dataset. An experiment and a benchmarking test were conducted on the proposed model. The results of the experiment show that providing more salient regions provides more meaningful details of an input image. The benchmarking result shows that the saliency model used in this study is capable of generating high-quality, informative salient regions that lead to good classification accuracy.
The proposed model achieves a higher average accuracy than a standard model that classifies based on the whole image. This indicates the advantage of using deep features of local salient objects over global deep features. Two experiments were also conducted to evaluate human performance on scene classification under various visual input conditions. The accuracy of human classification on complete scene images shown for a brief period of time in Experiment 1 is compared with the accuracy obtained by the proposed scene classification model. The accuracy of human classification in Experiment 1 is also compared with that obtained by humans in Experiment 2, where classification performance is tested on cropped salient regions. Evaluation of the results from these experiments shows that the proposed model has not reached human-level performance. Using only object features to differentiate between scenes is not enough to match human classification accuracy. Scene background and layout, relationships between objects, and human memory are other factors that affect human classification performance. These other scene attributes need to be taken into account in the recognition and classification of scene images in further study. Universiti Malaysia Sarawak (UNIMAS) 2021-03-16 Thesis NonPeerReviewed text en http://ir.unimas.my/id/eprint/34925/3/Ahmad%20Ridzuan%20Kudus%20ft.pdf Ahmad Ridzuan, Kudus (2021) Scene Recognition and Classification Model Based on Human Pre-attentive Visual Attention. Masters thesis, Universiti Malaysia Sarawak.
institution	Universiti Malaysia Sarawak
building	Centre for Academic Information Services (CAIS)
collection	Institutional Repository
continent	Asia
country	Malaysia
content_provider	Universiti Malaysia Sarawak
content_source	UNIMAS Institutional Repository
url_provider	http://ir.unimas.my/
language	English
topic	BF Psychology QA76 Computer software T201 Patents. Trademarks
spellingShingle	BF Psychology QA76 Computer software T201 Patents. Trademarks Ahmad Ridzuan, Kudus Scene Recognition and Classification Model Based on Human Pre-attentive Visual Attention
description	Scene recognition is considered one of the most important functions of human vision. In computer vision, the scene recognition problem is likewise significant. Scene recognition, or classification, is the process of organizing images and predicting the class category of a scene image. Humans can classify scenes accurately and effortlessly within a short period of time. Building on this observation, this study proposes a novel scene classification model based on human pre-attentive visual attention, which uses one of the earliest saliency models to generate a set of high-quality regions that potentially contain salient objects. An experimental study was performed to investigate the efficiency of the Saliency Toolbox on natural indoor scene images as its parameters are varied. At the end of this experiment, acceptable parameter scales were finalized for the use of the Saliency Toolbox in the proposed scene classification model. The proposed model comprises three main operations: (i) salient region proposal generation, (ii) feature extraction and concatenation, and (iii) classification. The proposed model was trained and tested on the MIT Indoor 67 dataset. An experiment and a benchmarking test were conducted on the proposed model. The results of the experiment show that providing more salient regions provides more meaningful details of an input image. The benchmarking result shows that the saliency model used in this study is capable of generating high-quality, informative salient regions that lead to good classification accuracy. The proposed model achieves a higher average accuracy than a standard model that classifies based on the whole image. This indicates the advantage of using deep features of local salient objects over global deep features.
Two experiments were also conducted to evaluate human performance on scene classification under various visual input conditions. The accuracy of human classification on complete scene images shown for a brief period of time in Experiment 1 is compared with the accuracy obtained by the proposed scene classification model. The accuracy of human classification in Experiment 1 is also compared with that obtained by humans in Experiment 2, where classification performance is tested on cropped salient regions. Evaluation of the results from these experiments shows that the proposed model has not reached human-level performance. Using only object features to differentiate between scenes is not enough to match human classification accuracy. Scene background and layout, relationships between objects, and human memory are other factors that affect human classification performance. These other scene attributes need to be taken into account in the recognition and classification of scene images in further study.
format	Thesis
author	Ahmad Ridzuan, Kudus
author_facet	Ahmad Ridzuan, Kudus
author_sort	Ahmad Ridzuan, Kudus
title	Scene Recognition and Classification Model Based on Human Pre-attentive Visual Attention
title_short	Scene Recognition and Classification Model Based on Human Pre-attentive Visual Attention
title_full	Scene Recognition and Classification Model Based on Human Pre-attentive Visual Attention
title_fullStr	Scene Recognition and Classification Model Based on Human Pre-attentive Visual Attention
title_full_unstemmed	Scene Recognition and Classification Model Based on Human Pre-attentive Visual Attention
title_sort	scene recognition and classification model based on human pre-attentive visual attention
publisher	Universiti Malaysia Sarawak (UNIMAS)
publishDate	2021
url	http://ir.unimas.my/id/eprint/34925/3/Ahmad%20Ridzuan%20Kudus%20ft.pdf http://ir.unimas.my/id/eprint/34925/
_version_	1808981485311492096
score	13.211869

Similar Items