Improved attentive pairwise interaction (API-Net) for fine-grained image classification

Fine-grained classification is a challenging problem as one has to deal with a similar class of objects but with various types of variations. For more elaboration, they are almost similar and have subtle differences, and are confusing. In this study, aircraft will be the fine-grained object to be fo...

Full description

Saved in:
Bibliographic Details
Main Authors: Yet, Ong Zu, Rassem, Taha H., Rahman, Md. Arafatur, M. M., Rahman
Format: Conference or Workshop Item
Language:English
Published: Institute of Electrical and Electronics Engineers Inc. 2021
Subjects:
Online Access:http://umpir.ump.edu.my/id/eprint/35448/2/Improved%20attentive%20pairwise%20interaction%20%28API-Net%29%20for%20fine-grained%20image%20classification_Abs.pdf
http://umpir.ump.edu.my/id/eprint/35448/
https://doi.org/10.1109/ETCCE54784.2021.9689910
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Fine-grained classification is a challenging problem as one has to deal with a similar class of objects but with various types of variations. For more elaboration, they are almost similar and have subtle differences, and are confusing. In this study, aircraft will be the fine-grained object to be focused on. Aircraft which has almost similar shapes and patterns can be hardly recognized even for humans, especially those who haven not gone through any training. In recent years, a lot of proposed methods addressed to solve the difficulties in fine-grained problems by learning contrastive clues from an image. This study aims to increase the accuracy of the Attentive Pairwise Interaction Network (API-Net) by introducing data augmentation into the network structure. Some of the previous studies proved that data augmentation does help improve a network. So, this study is going to modify the API-Net with different data augmentation settings. In this study, various settings have been introduced to the API-Net. Several experiments had been done with a simple modification where a portion of the train dataset's images will randomly convert into greyscale images. These settings are, only brightness & contrast 0.2, only grayscale 0.3, only grayscale 0.5, brightness & contrast 0.2 with grayscale 0.3, and brightness & contrast 0.2 with grayscale 0.5. As a result, the proposed modification achieved with 92.74% with brightness & contrast 0.2, 92.80% on brightness & contrast 0.2 with grayscale 0.5, and 92.86% on brightness & contrast 0.2 with grayscale 0.3. While grayscale 0.3 alone achieve 93.25% and grayscale 0.5 alone achieve 93.46% compared with the original results which reached 92.77%.