FUSION SPARSE AND SHAPING REWARD FUNCTION IN SOFT ACTOR-CRITIC DEEP REINFORCEMENT LEARNING FOR MOBILE ROBOT NAVIGATION

Nowadays, the advancement in autonomous robots is the latest influenced by the development of a world surrounded by new technologies. Deep Reinforcement Learning (DRL) allows systems to operate automatically, so the robot will learn the next movement based on the interaction with the environment. Mo...

Full description

Saved in:
Bibliographic Details
Main Authors: Abu Bakar, Mohamad Hafiz, Shamsudin, Abu Ubaidah, Soomro, Zubair Adil, Tadokoro, Satoshi, Salaan, C. J
Format: Article
Language:English
Published: UTM 2024
Subjects:
Online Access:http://eprints.uthm.edu.my/11759/1/J17155_fee1ae5af82031f5ee1f764afb29b8db.pdf
http://eprints.uthm.edu.my/11759/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.uthm.eprints.11759
record_format eprints
spelling my.uthm.eprints.117592025-01-10T08:05:38Z http://eprints.uthm.edu.my/11759/ FUSION SPARSE AND SHAPING REWARD FUNCTION IN SOFT ACTOR-CRITIC DEEP REINFORCEMENT LEARNING FOR MOBILE ROBOT NAVIGATION Abu Bakar, Mohamad Hafiz Shamsudin, Abu Ubaidah Soomro, Zubair Adil Tadokoro, Satoshi Salaan, C. J TJ Mechanical engineering and machinery Nowadays, the advancement in autonomous robots is the latest influenced by the development of a world surrounded by new technologies. Deep Reinforcement Learning (DRL) allows systems to operate automatically, so the robot will learn the next movement based on the interaction with the environment. Moreover, since robots require continuous action, Soft Actor Critic Deep Reinforcement Learning (SAC DRL) is considered the latest DRL approach solution. SAC is used because its ability to control continuous action to produce more accurate movements. SAC fundamental is robust against unpredictability, but some weaknesses have been identified, particularly in the exploration process for accuracy learning with faster maturity. To address this issue, the study identified a solution using a reward function appropriate for the system to guide in the learning process. This research proposes several types of reward functions based on sparse and shaping reward in SAC method to investigate the effectiveness of mobile robot learning. Finally, the experiment shows that using fusion sparse and shaping rewards in the SAC DRL successfully navigates to the target position and can also increase accuracy based on the average error result of 4.99%. UTM 2024 Article PeerReviewed text en http://eprints.uthm.edu.my/11759/1/J17155_fee1ae5af82031f5ee1f764afb29b8db.pdf Abu Bakar, Mohamad Hafiz and Shamsudin, Abu Ubaidah and Soomro, Zubair Adil and Tadokoro, Satoshi and Salaan, C. J (2024) FUSION SPARSE AND SHAPING REWARD FUNCTION IN SOFT ACTOR-CRITIC DEEP REINFORCEMENT LEARNING FOR MOBILE ROBOT NAVIGATION. Jurnal Teknologi, 86 (2). pp. 1-13. ISSN 2180–3722
institution Universiti Tun Hussein Onn Malaysia
building UTHM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Tun Hussein Onn Malaysia
content_source UTHM Institutional Repository
url_provider http://eprints.uthm.edu.my/
language English
topic TJ Mechanical engineering and machinery
spellingShingle TJ Mechanical engineering and machinery
Abu Bakar, Mohamad Hafiz
Shamsudin, Abu Ubaidah
Soomro, Zubair Adil
Tadokoro, Satoshi
Salaan, C. J
FUSION SPARSE AND SHAPING REWARD FUNCTION IN SOFT ACTOR-CRITIC DEEP REINFORCEMENT LEARNING FOR MOBILE ROBOT NAVIGATION
description Nowadays, the advancement in autonomous robots is the latest influenced by the development of a world surrounded by new technologies. Deep Reinforcement Learning (DRL) allows systems to operate automatically, so the robot will learn the next movement based on the interaction with the environment. Moreover, since robots require continuous action, Soft Actor Critic Deep Reinforcement Learning (SAC DRL) is considered the latest DRL approach solution. SAC is used because its ability to control continuous action to produce more accurate movements. SAC fundamental is robust against unpredictability, but some weaknesses have been identified, particularly in the exploration process for accuracy learning with faster maturity. To address this issue, the study identified a solution using a reward function appropriate for the system to guide in the learning process. This research proposes several types of reward functions based on sparse and shaping reward in SAC method to investigate the effectiveness of mobile robot learning. Finally, the experiment shows that using fusion sparse and shaping rewards in the SAC DRL successfully navigates to the target position and can also increase accuracy based on the average error result of 4.99%.
format Article
author Abu Bakar, Mohamad Hafiz
Shamsudin, Abu Ubaidah
Soomro, Zubair Adil
Tadokoro, Satoshi
Salaan, C. J
author_facet Abu Bakar, Mohamad Hafiz
Shamsudin, Abu Ubaidah
Soomro, Zubair Adil
Tadokoro, Satoshi
Salaan, C. J
author_sort Abu Bakar, Mohamad Hafiz
title FUSION SPARSE AND SHAPING REWARD FUNCTION IN SOFT ACTOR-CRITIC DEEP REINFORCEMENT LEARNING FOR MOBILE ROBOT NAVIGATION
title_short FUSION SPARSE AND SHAPING REWARD FUNCTION IN SOFT ACTOR-CRITIC DEEP REINFORCEMENT LEARNING FOR MOBILE ROBOT NAVIGATION
title_full FUSION SPARSE AND SHAPING REWARD FUNCTION IN SOFT ACTOR-CRITIC DEEP REINFORCEMENT LEARNING FOR MOBILE ROBOT NAVIGATION
title_fullStr FUSION SPARSE AND SHAPING REWARD FUNCTION IN SOFT ACTOR-CRITIC DEEP REINFORCEMENT LEARNING FOR MOBILE ROBOT NAVIGATION
title_full_unstemmed FUSION SPARSE AND SHAPING REWARD FUNCTION IN SOFT ACTOR-CRITIC DEEP REINFORCEMENT LEARNING FOR MOBILE ROBOT NAVIGATION
title_sort fusion sparse and shaping reward function in soft actor-critic deep reinforcement learning for mobile robot navigation
publisher UTM
publishDate 2024
url http://eprints.uthm.edu.my/11759/1/J17155_fee1ae5af82031f5ee1f764afb29b8db.pdf
http://eprints.uthm.edu.my/11759/
_version_ 1821004327478624256
score 13.235796