Neuroevolution of a Modular Memory-Augmented Neural Network for Deep Memory Problems

Shauharda Khadka; Jen Jen Chung; Kagan Tumer

doi:10.1162/evco_a_00239

Neuroevolution of a Modular Memory-Augmented Neural Network for Deep Memory Problems

Evolutionary Computation ◽

10.1162/evco_a_00239 ◽

2019 ◽

Vol 27 (4) ◽

pp. 639-664 ◽

Cited By ~ 2

Author(s):

Shauharda Khadka ◽

Jen Jen Chung ◽

Kagan Tumer

Keyword(s):

Neural Network ◽

Gradient Descent ◽

Short Term Memory ◽

Extended Period ◽

New Class ◽

Neural Architecture ◽

Memory Block ◽

Memory Problems ◽

Long Short Term Memory ◽

Gated Recurrent Units

We present Modular Memory Units (MMUs), a new class of memory-augmented neural network. MMU builds on the gated neural architecture of Gated Recurrent Units (GRUs) and Long Short Term Memory (LSTMs), to incorporate an external memory block, similar to a Neural Turing Machine (NTM). MMU interacts with the memory block using independent read and write gates that serve to decouple the memory from the central feedforward operation. This allows for regimented memory access and update, giving our network the ability to choose when to read from memory, update it, or simply ignore it. This capacity to act in detachment allows the network to shield the memory from noise and other distractions, while simultaneously using it to effectively retain and propagate information over an extended period of time. We train MMU using both neuroevolution and gradient descent, and perform experiments on two deep memory benchmarks. Results demonstrate that MMU performs significantly faster and more accurately than traditional LSTM-based methods, and is robust to dramatic increases in the sequence depth of these memory benchmarks.

Download Full-text

Crop Yield Estimation Using Deep Learning Based on Climate Big Data and Irrigation Scheduling

Energies ◽

10.3390/en14113004 ◽

2021 ◽

Vol 14 (11) ◽

pp. 3004

Author(s):

Khadijeh Alibabaei ◽

Pedro D. Gaspar ◽

Tânia M. Lima

Keyword(s):

Neural Network ◽

Deep Learning ◽

Energy Consumption ◽

Short Term Memory ◽

Irrigation Scheduling ◽

Climate Data ◽

Short Term ◽

Term Memory ◽

Long Short Term Memory ◽

Gated Recurrent Units

Deep learning has already been successfully used in the development of decision support systems in various domains. Therefore, there is an incentive to apply it in other important domains such as agriculture. Fertilizers, electricity, chemicals, human labor, and water are the components of total energy consumption in agriculture. Yield estimates are critical for food security, crop management, irrigation scheduling, and estimating labor requirements for harvesting and storage. Therefore, estimating product yield can reduce energy consumption. Two deep learning models, Long Short-Term Memory and Gated Recurrent Units, have been developed for the analysis of time-series data such as agricultural datasets. In this paper, the capabilities of these models and their extensions, called Bidirectional Long Short-Term Memory and Bidirectional Gated Recurrent Units, to predict end-of-season yields are investigated. The models use historical data, including climate data, irrigation scheduling, and soil water content, to estimate end-of-season yield. The application of this technique was tested for tomato and potato yields at a site in Portugal. The Bidirectional Long Short-Term memory outperformed the Gated Recurrent Units network, the Long Short-Term Memory, and the Bidirectional Gated Recurrent Units network on the validation dataset. The model was able to capture the nonlinear relationship between irrigation amount, climate data, and soil water content and predict yield with an MSE of 0.017 to 0.039. The performance of the Bidirectional Long Short-Term Memory in the test was compared with the most commonly used deep learning method, the Convolutional Neural Network, and machine learning methods including a Multi-Layer Perceptrons model and Random Forest Regression. The Bidirectional Long Short-Term Memory outperformed the other models with an R2 score between 0.97 and 0.99. The results show that analyzing agricultural data with the Long Short-Term Memory model improves the performance of the model in terms of accuracy. The Convolutional Neural Network model achieved the second-best performance. Therefore, the deep learning model has a remarkable ability to predict the yield at the end of the season.

Download Full-text

SACNN: Self-attentive Convolutional Neural Network Model for Natural Language Inference

ACM Transactions on Asian and Low-Resource Language Information Processing ◽

10.1145/3426884 ◽

2021 ◽

Vol 20 (3) ◽

pp. 1-16

Author(s):

Waris Quamer ◽

Praphula Kumar Jain ◽

Arpit Rai ◽

Vijayalakshmi Saravanan ◽

Rajendra Pamula ◽

...

Keyword(s):

Neural Network ◽

Natural Language ◽

Short Term Memory ◽

Local Context ◽

Short Term ◽

Proposed Model ◽

Long Short Term Memory ◽

Complex Relationships ◽

Gated Recurrent Units

Inference has been central problem for understanding and reasoning in artificial intelligence. Especially, Natural Language Inference is an interesting problem that has attracted the attention of many researchers. Natural language inference intends to predict whether a hypothesis sentence can be inferred from the premise sentence. Most prior works rely on a simplistic association between the premise and hypothesis sentence pairs, which is not sufficient for learning complex relationships between them. The strategy also fails to exploit local context information fully. Long Short Term Memory (LSTM) or gated recurrent units networks (GRU) are not effective in modeling long-term dependencies, and their schemes are far more complex as compared to Convolutional Neural Networks (CNN). To address this problem of long-term dependency, and to involve context for modeling better representation of a sentence, in this article, a general Self-Attentive Convolution Neural Network (SACNN) is presented for natural language inference and sentence pair modeling tasks. The proposed model uses CNNs to integrate mutual interactions between sentences, and each sentence with their counterparts is taken into consideration for the formulation of their representation. Moreover, the self-attention mechanism helps fully exploit the context semantics and long-term dependencies within a sentence. Experimental results proved that SACNN was able to outperform strong baselines and achieved an accuracy of 89.7% on the stanford natural language inference (SNLI) dataset.

Download Full-text

Estimation of municipal solid waste amount based on one-dimension convolutional neural network and long short-term memory with attention mechanism model: A case study of Shanghai

The Science of The Total Environment ◽

10.1016/j.scitotenv.2021.148088 ◽

2021 ◽

Vol 791 ◽

pp. 148088

Author(s):

Kunsen Lin ◽

Youcai Zhao ◽

Lu Tian ◽

Chunlong Zhao ◽

Meilan Zhang ◽

...

Keyword(s):

Neural Network ◽

Municipal Solid Waste ◽

Convolutional Neural Network ◽

Short Term Memory ◽

One Dimension ◽

Short Term ◽

Term Memory ◽

Mechanism Model ◽

Long Short Term Memory

Download Full-text

Electricity Consumption Forecasting Based on a Bidirectional Long-Short-Term Memory Artificial Neural Network

Sustainability ◽

10.3390/su13010104 ◽

2020 ◽

Vol 13 (1) ◽

pp. 104

Author(s):

Dana-Mihaela Petroșanu ◽

Alexandru Pîrjan

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Short Term Memory ◽

Electricity Consumption ◽

Short Term ◽

Term Memory ◽

Storage Room ◽

Long Short Term Memory ◽

Commercial Center ◽

Business And Management

The accurate forecasting of the hourly month-ahead electricity consumption represents a very important aspect for non-household electricity consumers and system operators, and at the same time represents a key factor in what regards energy efficiency and achieving sustainable economic, business, and management operations. In this context, we have devised, developed, and validated within the paper an hourly month ahead electricity consumption forecasting method. This method is based on a bidirectional long-short-term memory (BiLSTM) artificial neural network (ANN) enhanced with a multiple simultaneously decreasing delays approach coupled with function fitting neural networks (FITNETs). The developed method targets the hourly month-ahead total electricity consumption at the level of a commercial center-type consumer and for the hourly month ahead consumption of its refrigerator storage room. The developed approach offers excellent forecasting results, highlighted by the validation stage’s results along with the registered performance metrics, namely 0.0495 for the root mean square error (RMSE) performance metric for the total hourly month-ahead electricity consumption and 0.0284 for the refrigerator storage room. We aimed for and managed to attain an hourly month-ahead consumed electricity prediction without experiencing a significant drop in the forecasting accuracy that usually tends to occur after the first two weeks, therefore achieving a reliable method that satisfies the contractor’s needs, being able to enhance his/her activity from the economic, business, and management perspectives. Even if the devised, developed, and validated forecasting solution for the hourly consumption targets a commercial center-type consumer, based on its accuracy, this solution can also represent a useful tool for other non-household electricity consumers due to its generalization capability.

Download Full-text

An Approach for Rainfall Prediction Using Long Short Term Memory Neural Network

2020 IEEE 5th International Conference on Computing Communication and Automation (ICCCA) ◽

10.1109/iccca49541.2020.9250809 ◽

2020 ◽

Author(s):

Anjali Samad ◽

Bhagyanidhi ◽

Vaibhav Gautam ◽

Piyush Jain ◽

Sangeeta ◽

...

Keyword(s):

Neural Network ◽

Short Term Memory ◽

Short Term ◽

Term Memory ◽

Rainfall Prediction ◽

Long Short Term Memory

Download Full-text

An innovative method for axial pressure evaluation in smart rubber bearing based on bidirectional long-short term memory neural network

Measurement ◽

10.1016/j.measurement.2021.109653 ◽

2021 ◽

pp. 109653

Author(s):

Zeng Yi ◽

Pan Peng ◽

He Zhizhou ◽

Shen Zhouyang

Keyword(s):

Neural Network ◽

Short Term Memory ◽

Axial Pressure ◽

Short Term ◽

Term Memory ◽

Innovative Method ◽

Rubber Bearing ◽

Long Short Term Memory

Download Full-text

Artifact Detection in Chronically Recorded Local Field Potentials using Long-Short Term Memory Neural Network

2020 IEEE 14th International Conference on Application of Information and Communication Technologies (AICT) ◽

10.1109/aict50176.2020.9368638 ◽

2020 ◽

Author(s):

Marcos Fabietti ◽

Mufti Mahmud ◽

Ahmad Lotfi ◽

Alberto Averna ◽

David Guggenmos ◽

...

Keyword(s):

Neural Network ◽

Local Field ◽

Short Term Memory ◽

Local Field Potentials ◽

Field Potentials ◽

Short Term ◽

Artifact Detection ◽

Term Memory ◽

Long Short Term Memory

Download Full-text

Automatic respiratory event scoring in obstructive sleep apnea using a long short-term memory neural network

IEEE Journal of Biomedical and Health Informatics ◽

10.1109/jbhi.2021.3064694 ◽

2021 ◽

pp. 1-1

Author(s):

Sami Nikkonen ◽

Henri Korkalainen ◽

Akseli Leino ◽

Sami Myllymaa ◽

Brett Duce ◽

...

Keyword(s):

Neural Network ◽

Obstructive Sleep Apnea ◽

Sleep Apnea ◽

Short Term Memory ◽

Short Term ◽

Term Memory ◽

Obstructive Sleep ◽

Respiratory Event ◽

Long Short Term Memory

Download Full-text

An improved SPEI drought forecasting approach using the long short-term memory neural network

Journal of Environmental Management ◽

10.1016/j.jenvman.2021.111979 ◽

2021 ◽

Vol 283 ◽

pp. 111979

Author(s):

Abhirup Dikshit ◽

Biswajeet Pradhan ◽

Alfredo Huete

Keyword(s):

Neural Network ◽

Short Term Memory ◽

Drought Forecasting ◽

Short Term ◽

Term Memory ◽

Long Short Term Memory

Download Full-text

Landslide Prediction Using Long Short Term Memory(LSTM)Neural Network on time series data in Pakistan

2021 International Conference on Artificial Intelligence (ICAI) ◽

10.1109/icai52203.2021.9445236 ◽

2021 ◽

Author(s):

Mehreen Mubashar ◽

Gul Muhammad Khan ◽

Ramla Khan

Keyword(s):

Neural Network ◽

Time Series ◽

Time Series Data ◽

Short Term Memory ◽

Series Data ◽

Short Term ◽

Landslide Prediction ◽

Term Memory ◽

Long Short Term Memory

Download Full-text