Path Planning for Multi-Arm Manipulators Using Deep Reinforcement Learning: Soft Actor–Critic with Hindsight Experience Replay

Evan Prianto; MyeongSeop Kim; Jae-Han Park; Ji-Hun Bae; Jung-Su Kim

doi:10.3390/s20205911

Sensors ◽

10.3390/s20205911 ◽

2020 ◽

Vol 20 (20) ◽

pp. 5911

Author(s):

Evan Prianto ◽

MyeongSeop Kim ◽

Jae-Han Park ◽

Ji-Hun Bae ◽

Jung-Su Kim

Keyword(s):

Reinforcement Learning ◽

Path Planning ◽

Configuration Space ◽

Entropy Term ◽

High Dimensional Problem ◽

Experience Replay ◽

Planning Algorithm ◽

Simulation And Experiment ◽

Efficient Exploration ◽

Path Planning Algorithm

Path planning for multi-arm manipulators is a complicated, high-dimensional problem, so it is not easy to generate paths quickly and effectively for arbitrarily given start and goal locations of the end effector. For deep reinforcement learning-based path planning in particular, the high dimensionality makes it difficult for existing methods to explore efficiently, which is crucial for successful training. The recently proposed soft actor–critic (SAC) is well known for its good exploration ability, which comes from the entropy term in its objective function. Motivated by this, this paper proposes a SAC-based path planning algorithm. Hindsight experience replay (HER) is also employed for sample efficiency, and configuration space augmentation is used to deal with the complicated configuration space of the multi-arm system. Both simulation and experimental results are given to show the effectiveness of the proposed algorithm, and a comparison demonstrates that the proposed method outperforms existing methods.
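
The role of hindsight experience replay mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `Transition` record, the `sparse_reward` function, its tolerance, and the "final state as goal" relabeling strategy are all assumptions made for illustration. The idea shown is that an episode that missed its goal is stored a second time with the goal replaced by the state it actually reached, so the sparse reward becomes informative.

```python
from dataclasses import dataclass, replace
from typing import List, Tuple

Vec = Tuple[float, ...]


@dataclass(frozen=True)
class Transition:
    """Hypothetical goal-conditioned transition record (an assumption)."""
    state: Vec
    action: Vec
    next_state: Vec
    goal: Vec
    reward: float


def sparse_reward(achieved: Vec, goal: Vec, tol: float = 1e-2) -> float:
    """Assumed sparse reward: 0 within `tol` of the goal, otherwise -1."""
    dist = sum((a - g) ** 2 for a, g in zip(achieved, goal)) ** 0.5
    return 0.0 if dist <= tol else -1.0


def her_relabel_final(episode: List[Transition]) -> List[Transition]:
    """Copy the episode with the last reached state substituted as the goal.

    Rewards are recomputed against the substituted goal, so the final
    transition of the copy always counts as a success.
    """
    if not episode:
        return []
    new_goal = episode[-1].next_state
    return [
        replace(t, goal=new_goal, reward=sparse_reward(t.next_state, new_goal))
        for t in episode
    ]


if __name__ == "__main__":
    goal = (1.0, 1.0)
    states = [(0.0, 0.0), (0.1, 0.0), (0.2, 0.0), (0.3, 0.0)]
    episode = [
        Transition(s, (0.1, 0.0), s_next, goal, sparse_reward(s_next, goal))
        for s, s_next in zip(states, states[1:])
    ]
    for t in her_relabel_final(episode):
        print(t.goal, t.reward)
```

In a replay buffer, both the original and the relabeled transitions would typically be stored; the off-policy learner (SAC in the paper) then samples from the combined set.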

UCAV Path Planning Algorithm Based on Deep Reinforcement Learning

Lecture Notes in Computer Science - Image and Graphics ◽

10.1007/978-3-030-34110-7_59 ◽

2019 ◽

pp. 702-714

Author(s):

Kaiyuan Zheng ◽

Jingpeng Gao ◽

Liangxi Shen

Keyword(s):

Reinforcement Learning ◽

Path Planning ◽

Planning Algorithm ◽

Path Planning Algorithm

Wind farm water area path planning algorithm based on A* and reinforcement learning

2019 5th International Conference on Transportation Information and Safety (ICTIS) ◽

10.1109/ictis.2019.8883718 ◽

2019 ◽

Author(s):

Tianqi Zha ◽

Lei Xie ◽

Jiliang Chang

Keyword(s):

Reinforcement Learning ◽

Path Planning ◽

Wind Farm ◽

Water Area ◽

Planning Algorithm ◽

Path Planning Algorithm

Path Planning for Origami Carton Folding With a Multi-Fingered Robotic System

Volume 8: 31st Mechanisms and Robotics Conference, Parts A and B ◽

10.1115/detc2007-35542 ◽

2007 ◽

Author(s):

Wei Yao ◽

Jian S. Dai

Keyword(s):

Path Planning ◽

Configuration Space ◽

Robotic System ◽

Test Rig ◽

Packaging System ◽

Mechanism Model ◽

Planning Algorithm ◽

Path Planning Algorithm ◽

Equivalent Mechanism ◽

Robotic Fingers

This paper investigates an algorithm for origami carton folding with a multi-fingered robotic carton-packaging system. The equivalent mechanism structure of origami cartons is developed by modeling carton boards as links and creases as revolute joints. The trajectories of carton folding are analyzed with the mechanism model. In particular, the vertex of the carton is identified as a spherical linkage. A path planning algorithm is then generated based on the trajectory, which is passed on to the tip of a five-bar robotic finger, and the finger configuration space is identified. A test rig with two robotic fingers was developed to demonstrate the principle.

An Automated Assembly of 3D Point Clouds using Coupling Matching and Path Planning Algorithm by Reinforcement Learning

Proceedings of the International Seminar of Science and Applied Technology (ISSAT 2020) ◽

10.2991/aer.k.201221.061 ◽

2020 ◽

Author(s):

Dianthika Puteri Andini ◽

Muhammad Yusuf Fadhlan

Keyword(s):

Reinforcement Learning ◽

Path Planning ◽

Point Clouds ◽

Automated Assembly ◽

3D Point Clouds ◽

Planning Algorithm ◽

Path Planning Algorithm

Research on Path Planning Algorithm for Mobile Robot Based on Improved Reinforcement Learning

Intelligent Computing Theories and Application - Lecture Notes in Computer Science ◽

10.1007/978-3-030-84529-2_50 ◽

2021 ◽

pp. 592-604

Author(s):

Junwei Liu ◽

Aihua Zhang ◽

Yang Zhang

Keyword(s):

Reinforcement Learning ◽

Path Planning ◽

Mobile Robot ◽

Planning Algorithm ◽

Path Planning Algorithm

Path Planning Algorithm for Two-dimensional Raster Maps based on Deep Reinforcement Learning

10.1109/bigcom53800.2021.00026 ◽

2021 ◽

Author(s):

Jie Li ◽

Yuhan Zhang ◽

Jiaqi Tang ◽

Xianjie Liu ◽

Abdulhamid Ibrahim

Keyword(s):

Reinforcement Learning ◽

Path Planning ◽

Two Dimensional ◽

Raster Maps ◽

Planning Algorithm ◽

Path Planning Algorithm

An Efficient Path-Planning Algorithm for a Robotic Manipulator by Automatic Selection Search of Indispensable Regions in Its Configuration Space

Journal of Robotics and Mechatronics ◽

10.20965/jrm.1992.p0378 ◽

1992 ◽

Vol 4 (5) ◽

pp. 378-385

Author(s):

Hiroshi Noborio ◽

Motohiko Watanabe ◽

Takeshi Fujii

Keyword(s):

Path Planning ◽

Motion Planning ◽

Configuration Space ◽

Robotic Manipulator ◽

Joint Space ◽

Practical Applications ◽

Continuous Sequence ◽

Angle Difference ◽

Planning Algorithm ◽

Path Planning Algorithm

In this paper, we propose a feasible motion planning algorithm for a robotic manipulator operating among obstacles. The algorithm quickly selects a feasible sequence of collision-free motions while adaptively expanding a graph in the implicit configuration joint-space. In the configuration graph, each arc represents an angle difference of a manipulator joint; an arc sequence therefore represents a continuous sequence of robot motions, which the manipulator can execute without collision. Furthermore, the algorithm expands the configuration graph only in those regions of the implicit configuration joint-space that are cluttered and that are needed to select a collision-free sequence between the initial and target positions/orientations. The algorithm thus keeps the configuration graph small and quickly selects from it a collision-free sequence whose shape is simple enough to move the manipulator in practical applications.
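
The graph structure described in this abstract, where each arc is an angle difference on one joint, can be sketched as a search over a discretized joint space. This is only an illustration under stated assumptions: it uses plain breadth-first search with lazy node expansion rather than the authors' selective expansion of indispensable regions, and the names `plan_joint_path`, `in_collision`, and `limits` are hypothetical.

```python
from collections import deque
from typing import Callable, Dict, List, Optional, Sequence, Tuple

# A configuration is a tuple of joint angles in integer multiples of a step.
Config = Tuple[int, ...]


def plan_joint_path(
    start: Config,
    goal: Config,
    in_collision: Callable[[Config], bool],
    limits: Sequence[Tuple[int, int]],
) -> Optional[List[Config]]:
    """Breadth-first search in joint space; nodes are created only when reached.

    Each arc changes exactly one joint by one step, so a returned path is a
    continuous sequence of single-joint motions. Returns None if no path exists.
    """
    parent: Dict[Config, Optional[Config]] = {start: None}
    queue = deque([start])
    while queue:
        current = queue.popleft()
        if current == goal:
            path: List[Config] = []
            node: Optional[Config] = current
            while node is not None:
                path.append(node)
                node = parent[node]
            return path[::-1]
        for joint in range(len(current)):
            for delta in (-1, 1):
                value = current[joint] + delta
                low, high = limits[joint]
                if not low <= value <= high:
                    continue  # outside the joint limits
                neighbor = current[:joint] + (value,) + current[joint + 1:]
                if neighbor in parent or in_collision(neighbor):
                    continue  # already visited or colliding
                parent[neighbor] = current
                queue.append(neighbor)
    return None


if __name__ == "__main__":
    # Assumed toy obstacle: joint 0 at step 2 collides unless joint 1 is at step 4.
    def wall(q: Config) -> bool:
        return q[0] == 2 and q[1] < 4

    print(plan_joint_path((0, 0), (4, 0), wall, [(0, 4), (0, 4)]))
```

Because nodes are generated on demand, the graph stays implicit, which mirrors the "implicit configuration joint-space" wording; restricting expansion to cluttered regions, as the paper proposes, would further reduce the number of nodes visited.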

Fuzzy Greedy RRT Path Planning Algorithm in a Complex Configuration Space

International Journal of Control Automation and Systems ◽

10.1007/s12555-018-0037-6 ◽

2018 ◽

Vol 16 (6) ◽

pp. 3026-3035 ◽

Cited By ~ 9

Author(s):

Ehsan Taheri ◽

Mohammad Hossein Ferdowsi ◽

Mohammad Danesh

Keyword(s):

Path Planning ◽

Configuration Space ◽

Complex Configuration ◽

Planning Algorithm ◽

Path Planning Algorithm

A multi-robot path-planning algorithm for autonomous navigation using meta-reinforcement learning based on transfer learning

Applied Soft Computing ◽

10.1016/j.asoc.2021.107605 ◽

2021 ◽

pp. 107605

Author(s):

Shuhuan Wen ◽

Zeteng Wen ◽

Di Zhang ◽

Hong Zhang ◽

Tao Wang

Keyword(s):

Reinforcement Learning ◽

Path Planning ◽

Transfer Learning ◽

Autonomous Navigation ◽

Robot Path Planning ◽

Planning Algorithm ◽

Path Planning Algorithm ◽

Robot Path ◽

Multi Robot

Unmanned Aerial Vehicle Path Planning Algorithm Based on Deep Reinforcement Learning in Large-Scale and Dynamic Environments

IEEE Access ◽

10.1109/access.2021.3057485 ◽

2021 ◽

Vol 9 ◽

pp. 24884-24900

Author(s):

Ronglei Xie ◽

Zhijun Meng ◽

Lifeng Wang ◽

Haochen Li ◽

Kaipeng Wang ◽

...

Keyword(s):

Reinforcement Learning ◽

Path Planning ◽

Unmanned Aerial Vehicle ◽

Large Scale ◽

Dynamic Environments ◽

Planning Algorithm ◽

Aerial Vehicle ◽

Vehicle Path ◽

Path Planning Algorithm
