Long-Horizon Manipulation by a Single-arm Robot via Sub-goal Network based Hierarchical Reinforcement Learning

Jin Gyun Jeong, Ji Heon Oh, Hwanseok Jung, Jin Hyuk Lee, Ismael Nicolas Espinoza Jaramillo, Channabasava Chola, Won Hee Lee, Tae Seong Kim

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this work, we present an approach of long-horizon intelligence that utilizes Sub-goal network based hierarchical reinforcement learning (HRL) for long-horizon tasks by a single-arm robot. Long-horizon (LH) tasks are complicated due to their longer complex sequences and the large number of environmental variables. We attempt to solve the LH learning problem by the Sub-goal network based HRL. The proposed approach is tested in both simulation and hardware environments by a LH task of opening a drawer, grasping and relocating an object, and closing a drawer. Our Sub-goal network based HRL achieves a success rate of 90.3% in completing the LH tasks. Whereas the conventional deep reinforcement learning solution could not complete the LH task.

Original languageEnglish
Title of host publicationICBET 2023 - Proceedings of 2023 13th International Conference on Biomedical Engineering and Technology
PublisherAssociation for Computing Machinery
Pages88-92
Number of pages5
ISBN (Electronic)9798400707438
DOIs
Publication statusPublished - 15 Jun 2023
Event13th International Conference on Biomedical Engineering and Technology, ICBET 2023 - Tokyo, Japan
Duration: 15 Jun 202318 Jun 2023

Publication series

NameACM International Conference Proceeding Series

Conference

Conference13th International Conference on Biomedical Engineering and Technology, ICBET 2023
Country/TerritoryJapan
CityTokyo
Period15/06/2318/06/23

Bibliographical note

Publisher Copyright:
© 2023 ACM.

Keywords

  • Hierarchical Reinforcement Learning
  • Long-horizon
  • Robot Manipulation

Fingerprint

Dive into the research topics of 'Long-Horizon Manipulation by a Single-arm Robot via Sub-goal Network based Hierarchical Reinforcement Learning'. Together they form a unique fingerprint.

Cite this