Benchmarking different deep regression models for predicting image rotation angle and robot’s end effector’s position

Deep visual regression models have an important role to find how much the learning model fits the relationship between the visual data (images) and the predicted continuous output. Recently, deep visual regression has been utilized in different applications such as age prediction, digital holography...

Full description

Saved in:
Bibliographic Details
Main Authors: AlDahoul, Nouar, Mohd Suhaimi, Nur Farahana
Format: Conference or Workshop Item
Language:English
English
Published: Institute of Electrical and Electronics Engineers Inc. 2019
Subjects:
Online Access:http://irep.iium.edu.my/78088/1/78088_Benchmarking%20different%20deep%20regression%20models_complete_new.pdf
http://irep.iium.edu.my/78088/2/78088_Benchmarking%20different%20deep%20regression%20models_scopus.pdf
http://irep.iium.edu.my/78088/
https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8952047
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Deep visual regression models have an important role to find how much the learning model fits the relationship between the visual data (images) and the predicted continuous output. Recently, deep visual regression has been utilized in different applications such as age prediction, digital holography, and head-pose estimation. Deep learning has recently been cutting-edge research. Most of the research papers have focused on utilizing deep learning in classification tasks. There is still a lack of research that use deep learning for regression. This paper utilizes different deep learning models for two regression tasks. The first one is the prediction of the image rotation angle. The second task is to predict the position of the robot’s end-effector in 2D space. Efficient features were learned or extracted in order to perform good regression. The paper demonstrates and compares various models such as a local Receptive Field-Extreme Learning Machine (LRF-ELM), Hierarchical ELM, Supervised Convolutional Neural Network (CNN), and pre-trained CNN such as AlexNet. Each model was trained to learn or extract features and map them to specific continuous output. The results show that all models gave good performance in terms of RMSE and accuracy. H-ELM was found to outperform other models in term of training speed.