Convolutional neural networks with fused layers applied to face recognition

Bibliographic Details
Main Authors: Ahmad Radzi, Syafeeza; Hani, Mohamed Khalil; Liew, Shan Sung; Bakhteri, Rabia
Format: Article
Language: English
Published: World Scientific Publishing 2015
Online Access: http://eprints.utem.edu.my/id/eprint/18946/2/CNNs%20with%20fused%20layers%20applied%20to%20face%20recognition.pdf
http://eprints.utem.edu.my/id/eprint/18946/
http://www.worldscientific.com/action/showMultipleAbstracts?mailPageTitle=Search&href=%2Faction%2FdoSearch%3FpubType%3D%26AllField%3DConvolutional%2BNeural%2BNetworks%2BWith%2BFused%2BConvolution%252FSubsampling%2BLayers%2BApplied%2BTo%2BFace%2BRecognitio
Description
Summary: In this paper, we propose an effective convolutional neural network (CNN) model for the problem of face recognition. The proposed CNN architecture applies fused convolution/subsampling layers that result in a simpler model with fewer network parameters; that is, a smaller number of neurons, trainable parameters, and connections. In addition, it does not require any of the complex or costly image preprocessing steps that are typical in existing face recognition systems. In this work, we enhance the stochastic diagonal Levenberg–Marquardt algorithm, a second-order back-propagation algorithm, to obtain faster network convergence and better generalization ability. Experimental work completed on the ORL database shows that a recognition accuracy of 100% is achieved, with the network converging within 15 epochs. The average processing time of the proposed CNN face recognition solution, executed on a 2.5 GHz Intel i5 quad-core processor, is 3 s per epoch, with a recognition speed of less than 0.003 s. These results show that the proposed CNN model is a computationally efficient architecture that exhibits faster processing and learning times and also produces higher recognition accuracy, outperforming other existing work on face recognizers based on neural networks.
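
The summary does not say how the convolution and subsampling layers are fused. One common reading, assumed here and not confirmed by this record, is that a convolution followed by a separate subsampling step is replaced by a single convolution with stride 2. The minimal pure-Python sketch below illustrates that idea on a toy input. The function names, the 8x8 input, and the 3x3 kernel are all hypothetical, and subsampling is modeled as plain decimation so that the two paths give identical results; an averaging-based subsampling layer would not match exactly.

```python
def conv2d_valid(image, kernel, stride=1):
    """Valid 2-D cross-correlation of a 2-D list with a square kernel."""
    k = len(kernel)
    out_h = (len(image) - k) // stride + 1
    out_w = (len(image[0]) - k) // stride + 1
    return [
        [
            sum(
                image[r * stride + i][c * stride + j] * kernel[i][j]
                for i in range(k)
                for j in range(k)
            )
            for c in range(out_w)
        ]
        for r in range(out_h)
    ]


def subsample(feature_map, factor=2):
    """Keep every `factor`-th row and column (plain decimation, an assumption)."""
    return [row[::factor] for row in feature_map[::factor]]


def fused_conv_subsample(image, kernel, stride=2):
    """Hypothetical fused layer: one strided convolution instead of two stages."""
    return conv2d_valid(image, kernel, stride=stride)


# Toy 8x8 input and 3x3 kernel; the values are illustrative only.
image = [[(r * 8 + c) % 5 for c in range(8)] for r in range(8)]
kernel = [[1, 0, -1], [2, 0, -2], [1, 0, -1]]

# Two-stage path: stride-1 convolution, then decimation by 2.
two_stage = subsample(conv2d_valid(image, kernel), factor=2)

# Fused path: a single convolution evaluated only at every second position.
fused = fused_conv_subsample(image, kernel, stride=2)

print(len(fused), len(fused[0]))
print(fused == two_stage)
```

Under these assumptions the fused path evaluates the kernel only at the positions the subsampling step would have kept, so it produces the same feature map while skipping the discarded outputs and the intermediate layer. That is consistent with the summary's claim of fewer neurons and connections, though the paper's actual layer design may differ.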