Reinforcement Learning of Multi-Link Robot with Fuzzy ART Neural Networks for State-Space Segmentation

Masayuki NUNOBIKI; Koichi OKUDA; Syunsuke MAEDA

doi:10.2493/jspe.71.141

An adaptive state space segmentation for reinforcement learning using fuzzy-art neural network

The 2004 47th Midwest Symposium on Circuits and Systems, 2004. MWSCAS '04. ◽

10.1109/mwscas.2004.1354305 ◽

2004 ◽

Cited By ~ 3

Author(s):

T. Kamio ◽

S. Soga ◽

H. Fujisaka ◽

K. Mitsubori

Keyword(s):

Neural Network ◽

Reinforcement Learning ◽

State Space ◽

Fuzzy Art

Download Full-text

Fuzzy ART Neural Network Model for Automated Detection of Freeway Incidents

Transportation Research Record Journal of the Transportation Research Board ◽

10.3141/1634-07 ◽

1998 ◽

Vol 1634 (1) ◽

pp. 56-63 ◽

Cited By ~ 13

Author(s):

Sherif S. Ishak ◽

Haitham M. Al-Deek

Keyword(s):

Neural Networks ◽

False Alarm ◽

False Alarm Rate ◽

Back Propagation ◽

Incident Detection ◽

Traffic Patterns ◽

Traffic Pattern ◽

Detection Algorithms ◽

Fuzzy Art ◽

Freeway Incidents

Pattern recognition techniques such as artificial neural networks continue to offer potential solutions to many of the existing problems associated with freeway incident-detection algorithms. This study focuses on the application of Fuzzy ART neural networks to incident detection on freeways. Unlike back-propagation models, Fuzzy ART is capable of fast, stable learning of recognition categories. It is an incremental approach that has the potential for on-line implementation. Fuzzy ART is trained with traffic patterns that are represented by 30-s loop-detector data of occupancy, speed, or a combination of both. Traffic patterns observed at the incident time and location are mapped to a group of categories. Each incident category maps incidents with similar traffic pattern characteristics, which are affected by the type and severity of the incident and the prevailing traffic conditions. Detection rate and false alarm rate are used to measure the performance of the Fuzzy ART algorithm. To reduce the false alarm rate that results from occasional misclassification of traffic patterns, a persistence time period of 3 min was arbitrarily selected. The algorithm performance improves when the temporal size of traffic patterns increases from one to two 30-s periods for all traffic parameters. An interesting finding is that the speed patterns produced better results than did the occupancy patterns. However, when combined, occupancy–speed patterns produced the best results. When compared with California algorithms 7 and 8, the Fuzzy ART model produced better performance.

Download Full-text

Location- and Person-Independent Activity Recognition with WiFi, Deep Neural Networks, and Reinforcement Learning

ACM Transactions on Internet of Things ◽

10.1145/3424739 ◽

2021 ◽

Vol 2 (1) ◽

pp. 1-25

Author(s):

Yongsen Ma ◽

Sheheryar Arshad ◽

Swetha Muniraju ◽

Eric Torkildson ◽

Enrico Rantala ◽

...

Keyword(s):

Neural Network ◽

Neural Networks ◽

Reinforcement Learning ◽

Activity Recognition ◽

Deep Neural Networks ◽

State Machine ◽

Recognition Algorithm ◽

The State ◽

Neural Architecture ◽

Learning Agent

In recent years, Channel State Information (CSI) measured by WiFi is widely used for human activity recognition. In this article, we propose a deep learning design for location- and person-independent activity recognition with WiFi. The proposed design consists of three Deep Neural Networks (DNNs): a 2D Convolutional Neural Network (CNN) as the recognition algorithm, a 1D CNN as the state machine, and a reinforcement learning agent for neural architecture search. The recognition algorithm learns location- and person-independent features from different perspectives of CSI data. The state machine learns temporal dependency information from history classification results. The reinforcement learning agent optimizes the neural architecture of the recognition algorithm using a Recurrent Neural Network (RNN) with Long Short-Term Memory (LSTM). The proposed design is evaluated in a lab environment with different WiFi device locations, antenna orientations, sitting/standing/walking locations/orientations, and multiple persons. The proposed design has 97% average accuracy when testing devices and persons are not seen during training. The proposed design is also evaluated by two public datasets with accuracy of 80% and 83%. The proposed design needs very little human efforts for ground truth labeling, feature engineering, signal processing, and tuning of learning parameters and hyperparameters.

Download Full-text

Cascade Attribute Network: Decomposing Reinforcement Learning Control Policies using Hierarchical Neural Networks

IFAC-PapersOnLine ◽

10.1016/j.ifacol.2020.12.2317 ◽

2020 ◽

Vol 53 (2) ◽

pp. 8181-8186

Author(s):

Haonan Chang ◽

Zhuo Xu ◽

Masayoshi Tomizuka

Keyword(s):

Neural Networks ◽

Reinforcement Learning ◽

Learning Control ◽

Control Policies ◽

Hierarchical Neural Networks

Download Full-text

Reinforcement learning versus swarm intelligence for autonomous multi-HAPS coordination

SN Applied Sciences ◽

10.1007/s42452-021-04658-6 ◽

2021 ◽

Vol 3 (6) ◽

Author(s):

Ogbonnaya Anicho ◽

Philip B. Charlesworth ◽

Gurvinder S. Baicher ◽

Atulya K. Nagar

Keyword(s):

Reinforcement Learning ◽

State Space ◽

Swarm Intelligence ◽

Performance Indicators ◽

Convergence Rates ◽

Tuning Parameters ◽

Continuous State Space ◽

Continuous State ◽

User Coverage ◽

Better Than

AbstractThis work analyses the performance of Reinforcement Learning (RL) versus Swarm Intelligence (SI) for coordinating multiple unmanned High Altitude Platform Stations (HAPS) for communications area coverage. It builds upon previous work which looked at various elements of both algorithms. The main aim of this paper is to address the continuous state-space challenge within this work by using partitioning to manage the high dimensionality problem. This enabled comparing the performance of the classical cases of both RL and SI establishing a baseline for future comparisons of improved versions. From previous work, SI was observed to perform better across various key performance indicators. However, after tuning parameters and empirically choosing suitable partitioning ratio for the RL state space, it was observed that the SI algorithm still maintained superior coordination capability by achieving higher mean overall user coverage (about 20% better than the RL algorithm), in addition to faster convergence rates. Though the RL technique showed better average peak user coverage, the unpredictable coverage dip was a key weakness, making SI a more suitable algorithm within the context of this work.

Download Full-text

Cancer Diagnosis Based on Combination of Artificial Neural Networks and Reinforcement Learning

2020 6th Iranian Conference on Signal Processing and Intelligent Systems (ICSPIS) ◽

10.1109/icspis51611.2020.9349530 ◽

2020 ◽

Author(s):

Amir Toranj Simin ◽

Seyed Mohsen Ghorabi Baygi ◽

Amin Noori

Keyword(s):

Neural Networks ◽

Artificial Neural Networks ◽

Reinforcement Learning ◽

Cancer Diagnosis ◽

Artificial Neural

Download Full-text

R3L: Connecting Deep Reinforcement Learning To Recurrent Neural Networks For Image Denoising Via Residual Recovery

10.1109/icip42928.2021.9506323 ◽

2021 ◽

Author(s):

Rongkai Zhang ◽

Jiang Zhu ◽

Zhiyuan Zha ◽

Justin Dauwels ◽

Bihan Wen

Keyword(s):

Neural Networks ◽

Reinforcement Learning ◽

Image Denoising ◽

Recurrent Neural Networks

Download Full-text

Design of sensor and actuator multi model fault detection and isolation system using state space neural networks

Journal of Physics Conference Series ◽

10.1088/1742-6596/659/1/012034 ◽

2015 ◽

Vol 659 ◽

pp. 012034 ◽

Cited By ~ 2

Author(s):

Andrzej Czajkowski

Keyword(s):

Neural Networks ◽

Fault Detection ◽

State Space ◽

Fault Detection And Isolation ◽

Isolation System

Download Full-text

Parameter estimation of state space models by recurrent neural networks

IEE Proceedings - Control Theory and Applications ◽

10.1049/ip-cta:19951733 ◽

1995 ◽

Vol 142 (2) ◽

pp. 114-118 ◽

Cited By ~ 16

Author(s):

J.R. Raol

Keyword(s):

Neural Networks ◽

Parameter Estimation ◽

State Space ◽

Recurrent Neural Networks ◽

State Space Models

Download Full-text

Cognitive Control Using Adaptive RBF Neural Networks and Reinforcement Learning for Networked Control System Subject to Time-Varying Delay and Packet Losses

Arabian Journal for Science and Engineering ◽

10.1007/s13369-021-05752-y ◽

2021 ◽

Author(s):

Shuti Wang ◽

Xunhe Yin ◽

Peng Li ◽

Yanxin Zhang ◽

Xin Wang ◽

...

Keyword(s):

Neural Networks ◽

Control System ◽

Reinforcement Learning ◽

Cognitive Control ◽

Networked Control System ◽

Time Varying ◽

Rbf Neural Networks ◽

Packet Losses ◽

Time Varying Delay ◽

Varying Delay

Download Full-text