Staff View: Effects of approximation in computation on the accuracy and performance of deep neural network inference

Effects of approximation in computation on the accuracy and performance of deep neural network inference

Recently, deep learning is at the forefront of the state-of-the-art machine learning algorithms and has shown excellent results in a variety of applications such as medical field, consumer as well as autonomous vehicles. Convolutional Neural Network (CNN) - is the leading deep learning architecture...

Full description

Saved in:

Bibliographic Details
Main Authors:	Hui, Nee Ow1, Sheikh, Usman Ullah, Mohd. Mokji, Musa
Format:	Conference or Workshop Item
Language:	English
Published:	2020
Subjects:	TK Electrical engineering. Electronics Nuclear engineering
Online Access:	http://eprints.utm.my/id/eprint/92804/1/UsmanUllahSheikh2020_EffectsofApproximationinComputationontheAccuracyandPerformance.pdf http://eprints.utm.my/id/eprint/92804/ http://dx.doi.org/10.1088/1757-899X/884/1/012083
Tags:	Add Tag No Tags, Be the first to tag this record!

id	my.utm.92804
record_format	eprints
spelling	my.utm.928042021-10-28T10:14:01Z http://eprints.utm.my/id/eprint/92804/ Effects of approximation in computation on the accuracy and performance of deep neural network inference Hui, Nee Ow1 Sheikh, Usman Ullah Mohd. Mokji, Musa TK Electrical engineering. Electronics Nuclear engineering Recently, deep learning is at the forefront of the state-of-the-art machine learning algorithms and has shown excellent results in a variety of applications such as medical field, consumer as well as autonomous vehicles. Convolutional Neural Network (CNN) - is the leading deep learning architecture that is mostly applied. However, huge dataset is needed to train with complex architecture to achieve precise learning. Inference can be performed when given a ready CNN model and its weight file to another user. Inference takes time with precise weights and huge dataset. To overcome this problem, and enhance the inference system, approximation computation will be applying in terms of weight for changed of decimal place. The smaller size of the dataset is used in the inference process to reduce the inference time. MobileNetV2 architecture is used with the new weight for inference. Also, open source libraries such as TensorFlow, Keras and python is used. GPU (NVIDIA GeForce GTX 1060 6GB 64 Bit) is used as training and inference platform. Inference time is shortened, and the accuracy of performance for new weights compare with the precise weight only has a small gap which still has a great performance for classification. This work has proved that with 4 decimal places is able to obtain the same accuracy for inference when compared to benchmark with 9 decimal places. Inference time for 4 decimal places is also less than benchmark time. 2020 Conference or Workshop Item PeerReviewed application/pdf en http://eprints.utm.my/id/eprint/92804/1/UsmanUllahSheikh2020_EffectsofApproximationinComputationontheAccuracyandPerformance.pdf Hui, Nee Ow1 and Sheikh, Usman Ullah and Mohd. Mokji, Musa (2020) Effects of approximation in computation on the accuracy and performance of deep neural network inference. In: 2019 Sustainable and Integrated Engineering International Conference, SIE 2019, 8 - 9 December 2019, Putrajaya, Malaysia. http://dx.doi.org/10.1088/1757-899X/884/1/012083
institution	Universiti Teknologi Malaysia
building	UTM Library
collection	Institutional Repository
continent	Asia
country	Malaysia
content_provider	Universiti Teknologi Malaysia
content_source	UTM Institutional Repository
url_provider	http://eprints.utm.my/
language	English
topic	TK Electrical engineering. Electronics Nuclear engineering
spellingShingle	TK Electrical engineering. Electronics Nuclear engineering Hui, Nee Ow1 Sheikh, Usman Ullah Mohd. Mokji, Musa Effects of approximation in computation on the accuracy and performance of deep neural network inference
description	Recently, deep learning is at the forefront of the state-of-the-art machine learning algorithms and has shown excellent results in a variety of applications such as medical field, consumer as well as autonomous vehicles. Convolutional Neural Network (CNN) - is the leading deep learning architecture that is mostly applied. However, huge dataset is needed to train with complex architecture to achieve precise learning. Inference can be performed when given a ready CNN model and its weight file to another user. Inference takes time with precise weights and huge dataset. To overcome this problem, and enhance the inference system, approximation computation will be applying in terms of weight for changed of decimal place. The smaller size of the dataset is used in the inference process to reduce the inference time. MobileNetV2 architecture is used with the new weight for inference. Also, open source libraries such as TensorFlow, Keras and python is used. GPU (NVIDIA GeForce GTX 1060 6GB 64 Bit) is used as training and inference platform. Inference time is shortened, and the accuracy of performance for new weights compare with the precise weight only has a small gap which still has a great performance for classification. This work has proved that with 4 decimal places is able to obtain the same accuracy for inference when compared to benchmark with 9 decimal places. Inference time for 4 decimal places is also less than benchmark time.
format	Conference or Workshop Item
author	Hui, Nee Ow1 Sheikh, Usman Ullah Mohd. Mokji, Musa
author_facet	Hui, Nee Ow1 Sheikh, Usman Ullah Mohd. Mokji, Musa
author_sort	Hui, Nee Ow1
title	Effects of approximation in computation on the accuracy and performance of deep neural network inference
title_short	Effects of approximation in computation on the accuracy and performance of deep neural network inference
title_full	Effects of approximation in computation on the accuracy and performance of deep neural network inference
title_fullStr	Effects of approximation in computation on the accuracy and performance of deep neural network inference
title_full_unstemmed	Effects of approximation in computation on the accuracy and performance of deep neural network inference
title_sort	effects of approximation in computation on the accuracy and performance of deep neural network inference
publishDate	2020
url	http://eprints.utm.my/id/eprint/92804/1/UsmanUllahSheikh2020_EffectsofApproximationinComputationontheAccuracyandPerformance.pdf http://eprints.utm.my/id/eprint/92804/ http://dx.doi.org/10.1088/1757-899X/884/1/012083
_version_	1715189692333293568
score	13.159267

Effects of approximation in computation on the accuracy and performance of deep neural network inference

Similar Items