A Pre-Trained Fuzzy Reinforcement Learning Method for the Pursuing Satellite in a One-to-One Game in Space

Xiao Wang; Peng Shi; Yushan Zhao; Yue Sun

doi:10.3390/s20082253

Sensors ◽

10.3390/s20082253 ◽

2020 ◽

Vol 20 (8) ◽

pp. 2253

Author(s):

Xiao Wang ◽

Peng Shi ◽

Yushan Zhao ◽

Yue Sun

Keyword(s):

Reinforcement Learning ◽

Gradient Descent ◽

Fuzzy Inference ◽

Learning Algorithm ◽

Control Policy ◽

Descent Method ◽

Gradient Descent Method ◽

One To One ◽

Inference Systems ◽

First Time

To help the pursuer find an advantageous control policy in a one-to-one game in space, this paper proposes a pre-trained fuzzy reinforcement learning algorithm that runs separately in the x, y, and z channels. Whereas previous algorithms were applied to ground games, this is the first time reinforcement learning has been introduced to help a pursuer in space optimize its control policy. The known part of the environment is used to pre-train the pursuer's consequent set before learning. An actor-critic framework is built in each motion channel of the pursuer, and the consequent set is updated by the gradient descent method in the fuzzy inference systems. Numerical experiments validate the effectiveness of the proposed algorithm in improving the pursuer's game ability.

QoS-Controlling Soft Handoff Based on Simple Step Control and a Fuzzy Inference System With the Gradient Descent Method

IEEE Transactions on Vehicular Technology ◽

10.1109/tvt.2004.825755 ◽

2004 ◽

Vol 53 (3) ◽

pp. 820-834 ◽

Cited By ~ 7

Author(s):

B. Homnan ◽

W. Benjapolakul

Keyword(s):

Fuzzy Inference System ◽

Gradient Descent ◽

Fuzzy Inference ◽

Descent Method ◽

Gradient Descent Method ◽

Soft Handoff ◽

Inference System ◽

Simple Step ◽

Step Control

Adaptive Natural Gradient Method for Learning of Stochastic Neural Networks in Mini-Batch Mode

Applied Sciences ◽

10.3390/app9214568 ◽

2019 ◽

Vol 9 (21) ◽

pp. 4568

Author(s):

Hyeyoung Park ◽

Kwanyong Lee

Keyword(s):

Neural Networks ◽

Gradient Descent ◽

Learning Algorithm ◽

Descent Method ◽

Benchmark Problems ◽

Stochastic Neural Networks ◽

Gradient Descent Method ◽

Natural Gradient ◽

Convergence Properties ◽

Data Set

The gradient descent method is an essential algorithm for training neural networks. Among the many variants of gradient descent developed to accelerate learning, natural gradient learning is based on the information geometry of the stochastic neuromanifold and is known to have ideal convergence properties. Despite its theoretical advantages, the pure natural gradient has limitations that prevent its practical use: obtaining its explicit value requires knowing the true probability distribution of the input variables and inverting an n × n matrix, where n is the number of parameters. Although an adaptive estimation of the natural gradient has been proposed as a solution, it was originally developed for the online learning mode, which is computationally inefficient for large data sets. In this paper, we propose a novel adaptive natural gradient estimation for the mini-batch learning mode, which is commonly adopted for big data analysis. For two representative stochastic neural network models, we present explicit parameter update rules and the learning algorithm. Through experiments on three benchmark problems, we confirm that the proposed method has convergence properties superior to those of conventional methods.
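
For orientation, the two practical obstacles named in the abstract above (the true input distribution and a matrix inverse) can be read off the standard textbook form of the natural gradient update. The notation here is an assumption for illustration, not taken from the paper: θ is the parameter vector, η the learning rate, L the loss, and G the Fisher information matrix. This is the plain natural gradient, not the adaptive mini-batch estimator the paper proposes.

```latex
\theta_{t+1} = \theta_t - \eta\, G(\theta_t)^{-1}\, \nabla_\theta L(\theta_t),
\qquad
G(\theta) = \mathbb{E}_{x \sim p(x)}\, \mathbb{E}_{y \sim p(y \mid x;\theta)}
\left[ \nabla_\theta \log p(y \mid x;\theta)\,
       \nabla_\theta \log p(y \mid x;\theta)^{\top} \right]
```

The outer expectation is taken over the true input distribution p(x), and G is a square matrix with one row and one column per parameter, so it must be inverted at that size.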

SIRMs (Single Input Rule Modules) Connected Fuzzy Inference Model

Journal of Advanced Computational Intelligence and Intelligent Informatics ◽

10.20965/jaciii.1997.p0023 ◽

1997 ◽

Vol 1 (1) ◽

pp. 23-30 ◽

Cited By ~ 24

Author(s):

Naoyoshi Yubazaki ◽

Jianqiang Yi ◽

Kaoru Hirota

Keyword(s):

Gradient Descent ◽

Fuzzy Inference ◽

Fuzzy Rule ◽

Descent Method ◽

Gradient Descent Method ◽

Control Performance ◽

Nonlinear Functions ◽

Inference Model ◽

Proposed Model ◽

Single Input

A new fuzzy inference model, the SIRMs (Single Input Rule Modules) Connected Fuzzy Inference Model, is proposed for multi-input fuzzy control. For each input item, an importance degree is defined and a single-input fuzzy rule module is constructed. The importance degrees control the roles of the input items in the system. The model output is obtained by summing the products of the importance degree and the fuzzy inference result of each SIRM. The proposed model needs very few rules and parameters, and the rules can be designed much more easily. The new model is first applied to typical second-order lag systems. The simulation results show that the proposed model can substantially improve control performance compared with the conventional fuzzy inference model. A tuning algorithm based on the gradient descent method is then given and used to adjust the parameters of the proposed model for identifying 4-input 1-output nonlinear functions. The identification results indicate that the proposed model can also identify nonlinear systems.
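
The output rule described in the abstract above (a sum over modules of importance degree times that module's inference result) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the triangular membership functions, the weighted-average inference inside each module, and the names `tri`, `sirm_infer`, and `sirms_output` are all hypothetical choices made for the example.

```python
from typing import List, Sequence, Tuple

# One rule of a single-input module: ((a, b, c) triangle, consequent value).
Rule = Tuple[Tuple[float, float, float], float]


def tri(x: float, a: float, b: float, c: float) -> float:
    """Triangular membership: 1 at b, falling to 0 at a and c (assumed shape)."""
    if x <= a or x >= c:
        return 0.0
    if x <= b:
        return (x - a) / (b - a)
    return (c - x) / (c - b)


def sirm_infer(x: float, rules: Sequence[Rule]) -> float:
    """Inference result of one single-input rule module.

    Assumption: weighted average of rule consequents by membership degree.
    Returns 0.0 when no rule fires.
    """
    degrees: List[float] = [tri(x, *mf) for mf, _ in rules]
    total = sum(degrees)
    if total == 0.0:
        return 0.0
    return sum(d * consequent for d, (_, consequent) in zip(degrees, rules)) / total


def sirms_output(
    inputs: Sequence[float],
    modules: Sequence[Sequence[Rule]],
    importance: Sequence[float],
) -> float:
    """Model output: sum of importance degree times each module's result."""
    return sum(
        w * sirm_infer(x, rules)
        for x, rules, w in zip(inputs, modules, importance)
    )


if __name__ == "__main__":
    # Two inputs sharing the same two-rule module, with illustrative weights.
    module: List[Rule] = [((-1.0, 0.0, 1.0), 0.0), ((0.0, 1.0, 2.0), 1.0)]
    print(sirms_output([0.5, 1.0], [module, module], [0.6, 0.4]))
```

Because each module sees only one input, the rule count grows with the sum of the per-input rule counts rather than their product, which is consistent with the abstract's claim that the model needs very few rules.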

Designing fuzzy inference system based on improved gradient descent method

Journal of Systems Engineering and Electronics ◽

10.1016/s1004-4132(07)60027-9 ◽

2006 ◽

Vol 17 (4) ◽

pp. 853-857 ◽

Cited By ~ 3

Author(s):

Zhang Liquan ◽

Shao Cheng

Keyword(s):

Fuzzy Inference System ◽

Gradient Descent ◽

Fuzzy Inference ◽

Descent Method ◽

Gradient Descent Method ◽

Inference System

Building Recurrent Neural Networks to Implement Multiple Attractor Dynamics Using the Gradient Descent Method

Advances in Artificial Neural Systems ◽

10.1155/2009/846040 ◽

2009 ◽

Vol 2009 ◽

pp. 1-11 ◽

Cited By ~ 3

Author(s):

Jun Namikawa ◽

Jun Tani

Keyword(s):

Neural Network ◽

Network Model ◽

Recurrent Neural Network ◽

Gradient Descent ◽

Learning Algorithm ◽

Transition Function ◽

Descent Method ◽

Van Der Pol Oscillator ◽

Multiple Time ◽

Gradient Descent Method

The present paper proposes a recurrent neural network model and learning algorithm that can acquire the ability to generate desired multiple sequences. The network model is a dynamical system in which the transition function is a contraction mapping, and the learning algorithm is based on the gradient descent method. We show a numerical simulation in which a recurrent neural network obtains a multiple periodic attractor consisting of five Lissajous curves, or a Van der Pol oscillator with twelve different parameters. The present analysis clarifies that the model contains many stable regions as attractors, and multiple time series can be embedded into these regions by using the present learning method.

A learning algorithm for tuning fuzzy rules based on the gradient descent method

Proceedings of IEEE 5th International Fuzzy Systems ◽

10.1109/fuzzy.1996.551719 ◽

2002 ◽

Cited By ~ 28

Author(s):

Y. Shi ◽

M. Mizumoto ◽

N. Yubazaki ◽

M. Otani

Keyword(s):

Gradient Descent ◽

Learning Algorithm ◽

Descent Method ◽

Fuzzy Rules ◽

Gradient Descent Method

An Estimation Algorithm of Attitude and Heading Under Homogenous Field Based on Improved Gradient Descent Method

2020 27th Saint Petersburg International Conference on Integrated Navigation Systems (ICINS) ◽

10.23919/icins43215.2020.9133763 ◽

2020 ◽

Author(s):

Xiao-Kang Yang ◽

Gong-Min Yan ◽

Si-Hai Li

Keyword(s):

Gradient Descent ◽

Estimation Algorithm ◽

Descent Method ◽

Gradient Descent Method

Optimization Technique for Phase-Only Computer-Generated Holograms Based on Gradient Descent Method

Proceedings of the International Display Workshops ◽

10.36463/idw.2019.1014 ◽

2019 ◽

pp. 1014

Author(s):

Shujian Liu ◽

Yuki Nagahama ◽

Yasuhiro Takaki

Keyword(s):

Gradient Descent ◽

Optimization Technique ◽

Descent Method ◽

Gradient Descent Method ◽

Computer Generated Holograms

Image restoration algorithm based on primal-dual hybrid gradient descent method

Journal of Computer Applications ◽

10.3724/sp.j.1087.2009.00987 ◽

2009 ◽

Vol 29 (4) ◽

pp. 987-989

Author(s):

Hui ZHANG ◽

Li-zhi CHENG ◽

Zai-xin ZHAO

Keyword(s):

Image Restoration ◽

Gradient Descent ◽

Descent Method ◽

Gradient Descent Method ◽

Restoration Algorithm ◽

Primal Dual

Iteration Complexity of a Block Coordinate Gradient Descent Method for Convex Optimization

SIAM Journal on Optimization ◽

10.1137/140964795 ◽

2015 ◽

Vol 25 (3) ◽

pp. 1298-1313 ◽

Cited By ~ 1

Author(s):

Xiaoqin Hua ◽

Nobuo Yamashita

Keyword(s):

Convex Optimization ◽

Gradient Descent ◽

Descent Method ◽

Gradient Descent Method ◽

Iteration Complexity ◽

Coordinate Gradient Descent
