Hybrid training procedure applied to recurrent neural networks

1996 ◽  
Author(s):  
Xavier Loiseau ◽  
Jan Sendler
2021 ◽  
Vol 54 (4) ◽  
pp. 1-38
Author(s):  
Varsha S. Lalapura ◽  
J. Amudha ◽  
Hariram Selvamurugan Satheesh

Recurrent Neural Networks (RNNs) are pervasive in artificial intelligence applications such as speech recognition, predictive healthcare, and creative art. Although they deliver highly accurate solutions, they are notoriously difficult to train. Meanwhile, the current expansion of the IoT demands that intelligent models be deployed at the edge, even as model sizes and network architectures keep growing. Design efforts aimed at greater accuracy have thus had the inverse effect on portability to edge devices, which operate under real-time constraints on memory, latency, and energy. This article provides detailed insight into the compression techniques widely disseminated in the deep learning regime, which have become key to mapping powerful RNNs onto resource-constrained devices. While compression of RNNs is the main focus of the survey, it also highlights the challenges encountered during training, since the training procedure directly influences both model performance and compressibility. Recent advances that address these training challenges are discussed along with their strengths and drawbacks. In short, the survey covers the three-step process of architecture selection, efficient training, and compression suitable for a resource-constrained environment. It thus serves as a comprehensive guide a developer can adapt when pairing a time-series problem with an RNN solution at the edge.
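To make the survey's subject concrete, below is a minimal, hypothetical sketch of magnitude-based weight pruning, one family of compression techniques such surveys cover for shrinking RNNs to fit edge devices. The matrix shape, sparsity level, and function name are illustrative assumptions, not prescriptions from the article.

```python
# Hypothetical sketch of magnitude-based pruning for an RNN weight matrix.
# Shapes and the 90% sparsity target are assumptions for illustration only.
import numpy as np

def magnitude_prune(weights: np.ndarray, sparsity: float) -> np.ndarray:
    """Zero out the smallest-magnitude weights until `sparsity` fraction is zero."""
    flat = np.abs(weights).ravel()
    k = int(sparsity * flat.size)
    if k == 0:
        return weights.copy()
    # k-th smallest magnitude becomes the pruning threshold
    threshold = np.partition(flat, k - 1)[k - 1]
    mask = np.abs(weights) > threshold
    return weights * mask

# Toy recurrent (hidden-to-hidden) weight matrix of an RNN cell, hidden size 128 (assumed).
rng = np.random.default_rng(0)
W_hh = rng.normal(scale=0.1, size=(128, 128))

W_pruned = magnitude_prune(W_hh, sparsity=0.9)
print(f"nonzero weights: {np.count_nonzero(W_pruned)} / {W_pruned.size}")
```

The resulting sparse matrix can then be stored in a compressed format (e.g., CSR) so that memory and multiply-accumulate counts shrink roughly in proportion to the sparsity.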


2021 ◽  
Author(s):  
Forough Hassanibesheli ◽  
Niklas Boers ◽  
Jürgen Kurths

Most forecasting schemes in the geosciences, and in particular for predicting weather and climate indices such as the El Niño Southern Oscillation (ENSO), rely on process-based numerical models [1]. Although statistical modelling [2] and prediction approaches also have a long history, more recently, different machine learning techniques have been used to predict climatic time series. One supervised machine learning algorithm suited to temporal and sequential data processing and prediction is the recurrent neural network (RNN) [3]. In this study we develop an RNN-based method that (1) can learn the dynamics of a stochastic time series without requiring a huge amount of training data, and (2) has a comparatively simple structure and an efficient training procedure. Since this algorithm is suitable for investigating complex nonlinear time series such as climate time series, we apply it to different ENSO indices. We demonstrate that our model can capture key features of the complex system dynamics underlying ENSO variability, and that it can forecast ENSO accurately at longer lead times than other recent studies [4].

References:

[1] P. Bauer, A. Thorpe, and G. Brunet, "The quiet revolution of numerical weather prediction," Nature, vol. 525, no. 7567, pp. 47–55, 2015.
[2] D. Kondrashov, S. Kravtsov, A. W. Robertson, and M. Ghil, "A hierarchy of data-based ENSO models," Journal of Climate, vol. 18, no. 21, pp. 4425–4444, 2005.
[3] L. R. Medsker and L. Jain, "Recurrent neural networks," Design and Applications, vol. 5, 2001.
[4] Y.-G. Ham, J.-H. Kim, and J.-J. Luo, "Deep learning for multi-year ENSO forecasts," Nature, vol. 573, no. 7775, pp. 568–572, 2019.
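For readers unfamiliar with RNN-based time-series forecasting, the sketch below shows the general shape of such a pipeline: a small LSTM trained on a synthetic noisy oscillation standing in for an ENSO index. This is an illustration under stated assumptions, not the authors' model; the architecture, window length, and training settings are invented for the example.

```python
# Illustrative only: a tiny LSTM forecaster on a synthetic "climate index".
# Hidden size, window length, and optimizer settings are assumptions.
import torch
import torch.nn as nn

torch.manual_seed(0)

# Synthetic index: a slow oscillation plus stochastic noise (stand-in for ENSO).
t = torch.arange(0, 2000, dtype=torch.float32)
series = torch.sin(2 * torch.pi * t / 120) + 0.2 * torch.randn_like(t)

window = 60  # predict the next value from the past 60 steps (assumed)
X = torch.stack([series[i:i + window] for i in range(len(series) - window)])
y = series[window:]

class Forecaster(nn.Module):
    def __init__(self, hidden: int = 32):
        super().__init__()
        self.lstm = nn.LSTM(input_size=1, hidden_size=hidden, batch_first=True)
        self.head = nn.Linear(hidden, 1)

    def forward(self, x):                       # x: (batch, window)
        out, _ = self.lstm(x.unsqueeze(-1))     # (batch, window, hidden)
        return self.head(out[:, -1]).squeeze(-1)  # last hidden state -> scalar

model = Forecaster()
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.MSELoss()

for epoch in range(5):                          # short full-batch demo loop
    opt.zero_grad()
    loss = loss_fn(model(X), y)
    loss.backward()
    opt.step()
    print(f"epoch {epoch}: mse={loss.item():.4f}")
```

Multi-step (longer lead time) forecasts would feed the model's own predictions back in as inputs, which is where learning the underlying dynamics, rather than memorizing the series, matters.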


2020 ◽  
Author(s):  
Dean Sumner ◽  
Jiazhen He ◽  
Amol Thakkar ◽  
Ola Engkvist ◽  
Esben Jannik Bjerrum

SMILES randomization, a form of data augmentation, has previously been shown to increase the performance of deep learning models compared to non-augmented baselines. Here, we propose a novel data augmentation method we call "Levenshtein augmentation", which considers local SMILES sub-sequence similarity between reactants and their respective products when creating training pairs. The performance of Levenshtein augmentation was tested using two state-of-the-art models: a transformer and a sequence-to-sequence recurrent neural network with attention. Levenshtein augmentation demonstrated improved performance over both non-augmented data and conventional SMILES-randomization augmentation when used to train the baseline models. Furthermore, Levenshtein augmentation seemingly results in what we define as "attentional gain": an enhancement in the pattern-recognition capabilities of the underlying network with respect to molecular motifs.
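The augmentation is named after the Levenshtein (edit) distance between SMILES strings. Below is a minimal sketch of that distance applied to a hypothetical reactant/product pair; how the paper turns this similarity score into training pairs is not reproduced here, and the example molecules are illustrative.

```python
# Sketch of the Levenshtein (edit) distance the augmentation builds on.
# The reactant/product SMILES below are illustrative, not from the paper.
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning `a` into `b`."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            curr.append(min(prev[j] + 1,                  # deletion
                            curr[j - 1] + 1,              # insertion
                            prev[j - 1] + (ca != cb)))    # substitution
        prev = curr
    return prev[-1]

reactant = "CCO"    # ethanol (illustrative)
product = "CC=O"    # acetaldehyde (illustrative)
print(levenshtein(reactant, product))  # -> 1 (one inserted character)
```

Pairs whose SMILES are only a few edits apart share long identical sub-sequences, which is the local similarity the augmentation exploits when constructing reactant-product training pairs.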


Author(s):  
Faisal Ladhak ◽  
Ankur Gandhe ◽  
Markus Dreyer ◽  
Lambert Mathias ◽  
Ariya Rastrow ◽  
...  
