Deep learning framework for hierarchical-based object identification and description
Humans have the capability to quickly classify, identify, and describe objects in the surrounding environment. The Deep learning (DL) and Computer Vision (CV) approaches allow computers to gain high-level understanding from images or videos. Although DL-based CV approaches have achieved various succ...
Saved in:
Main Author: | |
---|---|
Format: | Thesis |
Language: | English English English |
Published: |
2024
|
Subjects: | |
Online Access: | https://etd.uum.edu.my/11215/1/permission%20to%20deposit-allow%20embargo%2014%20months-s903276.pdf https://etd.uum.edu.my/11215/2/s903276_01.pdf https://etd.uum.edu.my/11215/3/s903276_02.pdf https://etd.uum.edu.my/11215/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.uum.etd.11215 |
---|---|
record_format |
eprints |
spelling |
my.uum.etd.112152024-07-10T00:32:38Z https://etd.uum.edu.my/11215/ Deep learning framework for hierarchical-based object identification and description Alamro, Loai C. A. T58.5-58.64 Information technology T Technology (General) Humans have the capability to quickly classify, identify, and describe objects in the surrounding environment. The Deep learning (DL) and Computer Vision (CV) approaches allow computers to gain high-level understanding from images or videos. Although DL-based CV approaches have achieved various successes in object recognition and identification, these approaches cannot adapt like humans. Existing approaches exhibit setbacks due to their inability to identify objects that are beyond training samples when deployed in real-world applications. Hence, DL approaches lack of global generalization, hierarchical learning, and correlation learning. In addition, DL-based CV approaches only depend on extracting high-level features which are not semantic to recognize parts or subsets of an object. In this study, a new DL framework called Human Identification and Description Framework (HIDF) is developed. The HIDF aims to overcome global generalization, hierarchical and correlation learning limitations by describing an object when it cannot be initially identified. The HIDF components include five phases: 1) a feature extraction network which extracts suitable feature levels for object identification and classification tasks, 2) identification network which specifies the identity of the object, 3) a multi-output classification network for hierarchical-based object classification, 4) object description algorithm which generates sentences describing object characteristics, and 5) shunt connections which regulate prediction direction of HIDF according to the desired task (identification or classification). HIDF achieved 99.27% identification accuracy and 98.61% classification accuracy on benchmark datasets for human identification and human head attribute classification. The framework performance was compared with other state-of-the-art networks based on accuracy, precision, recall, and F1-score measures. The experimental results have shown that HIDF is able to overcome global generalization, hierarchical learning and correlation learning limitations by tracing the hierarchy and correlation of the object. Therefore, HIDF can be employed in CV applications, including but not limited to visually impaired aids, and robot guidance. 2024 Thesis NonPeerReviewed text en https://etd.uum.edu.my/11215/1/permission%20to%20deposit-allow%20embargo%2014%20months-s903276.pdf text en https://etd.uum.edu.my/11215/2/s903276_01.pdf text en https://etd.uum.edu.my/11215/3/s903276_02.pdf Alamro, Loai C. A. (2024) Deep learning framework for hierarchical-based object identification and description. Doctoral thesis, Universiti Utara Malaysia. |
institution |
Universiti Utara Malaysia |
building |
UUM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Utara Malaysia |
content_source |
UUM Electronic Theses |
url_provider |
http://etd.uum.edu.my/ |
language |
English English English |
topic |
T58.5-58.64 Information technology T Technology (General) |
spellingShingle |
T58.5-58.64 Information technology T Technology (General) Alamro, Loai C. A. Deep learning framework for hierarchical-based object identification and description |
description |
Humans have the capability to quickly classify, identify, and describe objects in the surrounding environment. The Deep learning (DL) and Computer Vision (CV) approaches allow computers to gain high-level understanding from images or videos. Although DL-based CV approaches have achieved various successes in object recognition and identification, these approaches cannot adapt like humans. Existing approaches exhibit setbacks due to their inability to identify objects that are beyond training samples when deployed in real-world applications. Hence, DL approaches lack of global generalization, hierarchical learning, and correlation learning. In addition, DL-based CV approaches only depend on extracting high-level features which are not semantic to recognize parts or subsets of an object. In this study, a new DL framework called Human Identification and Description Framework (HIDF) is developed. The HIDF aims to overcome global generalization, hierarchical and correlation learning limitations by describing an object when it cannot be initially identified. The HIDF components include five phases: 1) a feature extraction network which extracts suitable feature levels for object identification and classification tasks, 2) identification network which specifies the identity of the object, 3) a multi-output classification network for hierarchical-based object classification, 4) object description algorithm which generates sentences describing object characteristics, and 5) shunt connections which regulate prediction direction of HIDF according to the desired task (identification or classification). HIDF achieved 99.27% identification accuracy and 98.61% classification accuracy on benchmark datasets for human identification and human head attribute classification. The framework performance was compared with other state-of-the-art networks based on accuracy, precision, recall, and F1-score measures. The experimental results have shown that HIDF is able to overcome global generalization, hierarchical learning and correlation learning limitations by tracing the hierarchy and correlation of the object. Therefore, HIDF can be employed in CV applications, including but not limited to visually impaired aids, and robot guidance. |
format |
Thesis |
author |
Alamro, Loai C. A. |
author_facet |
Alamro, Loai C. A. |
author_sort |
Alamro, Loai C. A. |
title |
Deep learning framework for hierarchical-based object identification and description |
title_short |
Deep learning framework for hierarchical-based object identification and description |
title_full |
Deep learning framework for hierarchical-based object identification and description |
title_fullStr |
Deep learning framework for hierarchical-based object identification and description |
title_full_unstemmed |
Deep learning framework for hierarchical-based object identification and description |
title_sort |
deep learning framework for hierarchical-based object identification and description |
publishDate |
2024 |
url |
https://etd.uum.edu.my/11215/1/permission%20to%20deposit-allow%20embargo%2014%20months-s903276.pdf https://etd.uum.edu.my/11215/2/s903276_01.pdf https://etd.uum.edu.my/11215/3/s903276_02.pdf https://etd.uum.edu.my/11215/ |
_version_ |
1805892960236077056 |
score |
13.211869 |