Either crop or pad the input volume: What is beneficial for Convolutional Neural Network?

Convolutional Neural Network (CNN) is the most popular method of deep learning in the machine learning field. Training a CNN has always been a demanding task compared to other machine learning paradigms, and this is due to its big space of hyper-parameters such as convolutional kernel size, number o...

Full description

Saved in:
Bibliographic Details
Main Authors: Al-Saggaf, U.M., Botalb, A., Moinuddin, M., Alfakeh, S.A., Ali, S.S.A., Boon, T.T.
Format: Conference or Workshop Item
Published: Institute of Electrical and Electronics Engineers Inc. 2021
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85124170608&doi=10.1109%2fICIAS49414.2021.9642661&partnerID=40&md5=fe97d0334bee476e8e7e0756fc5b0943
http://eprints.utp.edu.my/29179/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Convolutional Neural Network (CNN) is the most popular method of deep learning in the machine learning field. Training a CNN has always been a demanding task compared to other machine learning paradigms, and this is due to its big space of hyper-parameters such as convolutional kernel size, number of strides, number of layers, pooling window size, etc. What makes the CNN's huge hyper-parameters space optimization harder is that there is no universal robust theory supporting it, and any work flow proposed so far in literature is based on heuristics that are just rules of thumb and only depend on the dataset and problem at hand. In this work, it is empirically illustrated that the performance of a CNN is not linked only with the choice of the right hyper-parameters, but also linked to how some of the CNN operations are implemented. More specifically, the CNN performance is contrasted for two different implementations: cropping and padding the input volume. The results state that padding the input volume achieves higher accuracy and takes less time in training compared with cropping method. © 2021 IEEE.