Strategies for Building Artificial Neural Network Models

2000 ◽  
Author(s):  
R. L. Mahajan

Abstract An artificial neural network (ANN) is a massively parallel, dynamic system of processing elements, neurons, which are connected in complicated patterns to allow for a variety of interactions among the inputs to produce the desired output. It has the ability to learn directly from example data rather than by following the programmed rules based on a knowledge base. There is virtually no limit to what an ANN can predict or decipher, so long as it has been trained properly through examples which encompass the entire range of desired predictions. This paper provides an overview of such strategies needed to build accurate ANN models. Following a general introduction to artificial neural networks, the paper will describe different techniques to build and train ANN models. Step-by-step procedures will be described to demonstrate the mechanics of building neural network models, with particular emphasis on feedforward neural networks using back-propagation learning algorithm. The network structure and pre-processing of data are two significant aspects of ANN model building. The former has a significant influence on the predictive capability of the network [1]. Several studies have addressed the issue of optimal network structure. Kim and May [2] use statistical experimental design to determine an optimal network for a specific application. Bhat and McAvoy [3] propose a stripping algorithm, starting with a large network and then reducing the network complexity by removing unnecessary weights/nodes. This ‘complex-to-simple’ procedure requires heavy and tedious computation. Villiers and Bernard [4] conclude that although there is no significant difference between the optimal performance of one or two hidden layer networks, single layer networks do better classification on average. Marwah et al. [5] advocate a simple-to-complex methodology in which the training starts with the simplest ANN structure. The complexity of the structure is incrementally stepped-up till an acceptable learning performance is obtained. Preprocessing of data can lead to substantial improvements in the training process. Kown et al. [6] propose a data pre-processing algorithm for a highly skewed data set. Marwah et al. [5] propose two different strategies for dealing with the data. For applications with a significant amount of historical data, smart select methodology is proposed that ensures equal weighted distribution of the data over the range of the input parameters. For applications, where there is scarcity of data or where the experiments are expensive to perform, a statistical design of experiments approach is suggested. In either case, it is shown that dividing the data into training, testing and validation ensures an accurate ANN model that has excellent predictive capabilities. The paper also describes recently developed concepts of physical-neural network models and model transfer techniques. In the former, an ANN model is built on the data generated through the ‘first-principles’ analytical or numerical model of the process under consideration. It is shown that such a model, termed as a physical-neural network model has the accuracy of the first-principles model but yet is orders of magnitude faster to execute. In recognition of the fact that such a model has all the approximations that are generally inherent in physical models for many complex processes, model transfer techniques have been developed [6] that allow economical development of accurate process equipment models. Examples from thermally-based materials processing will be described to illustrate the application of the basic concepts involved.

Author(s):  
Joarder Kamruzzaman ◽  
Ruhul Sarker

The primary aim of this chapter is to present an overview of the artificial neural network basics and operation, architectures, and the major algorithms used for training the neural network models. As can be seen in subsequent chapters, neural networks have made many useful contributions to solve theoretical and practical problems in finance and manufacturing areas. The secondary aim here is therefore to provide a brief review of artificial neural network applications in finance and manufacturing areas.


2002 ◽  
pp. 220-235 ◽  
Author(s):  
Paul Lajbcygier

The pricing of options on futures is compared using conventional models and artificial neural networks. This work demonstrates superior pricing accuracy using the artificial neural networks in an important subset of the input parameter set.


2012 ◽  
Author(s):  
Khairiyah Mohd. Yusof ◽  
Fakhri Karray ◽  
Peter L. Douglas

This paper discusses the development of artificial neural network (ANN) models for a crude oil distillation column. Since the model is developed for real time optimisation (RTO) applications they are steady state, multivariable models. Training and testing data used to develop the models were generated from a reconciled steady-state model simulated in a process simulator. The radial basis function networks (RBFN), a type of feedforward ANN model, were able to model the crude tower very well, with the root mean square error for the prediction of each variable less than 1%. Grouping related output variables in a network model was found to give better predictions than lumping all the variables in a single model; this also allowed the overall complex, multivariable model to be simplified into smaller models that are more manageable. In addition, the RBFN models were also able to satisfactorily perform range and dimensional extrapolation, which is necessary for models that are used in RTO.


2019 ◽  
Vol 295 ◽  
pp. 01004 ◽  
Author(s):  
Elias Farah ◽  
Amani Abdallah ◽  
Isam Shahrour

This paper presents an application of Artificial Neural Network models (ANN) to predict the water consumption at two scales: i) District Metered Area (DMA) located in the Scientific Campus of Lille University and ii) End user representing a restaurant inside this DMA. Data are collected from Automated Meter Readers (AMRs) that measure in near real-time the water consumption. The models are trained at both daily and hourly time intervals using historical values and the variation between the hour and the type of days. The paper shows that the ANN-based models can well predict the water consumption including peak values.


Author(s):  
Agus Saptoro ◽  
Moses O. Tadé ◽  
Hari Vuthaluru

Abstract This paper proposes a method, namely MDKS (Kennard-Stone algorithm based on Mahalanobis distance), to divide the data into training and testing subsets for developing artificial neural network (ANN) models. This method is a modified version of the Kennard-Stone (KS) algorithm. With this method, better data splitting, in terms of data representation and enhanced performance of developed ANN models, can be achieved. Compared with standard KS algorithm and another improved KS algorithm (data division based on joint x - y distances (SPXY) method), the proposed method has also shown a better performance. Therefore, the proposed technique can be used as an advantageous alternative to other existing methods of data splitting for developing ANN models. Care should be taken when dealing with large amount of dataset since they may increase the computational load for MDKS due to its variance-covariance matrix calculations.


2020 ◽  
Author(s):  
Jan-Joris Devogelaer ◽  
Hugo Meekes ◽  
Paul Tinnemans ◽  
Elias Vlieg ◽  
Rene de Gelder

<div>A significant amount of attention has been given to the design and synthesis of cocrystals by both industry and academia because of its potential to change a molecule’s physicochemical properties. This paper reports on the application of a data-driven cocrystal prediction method, based on two types of artificial neural network models and cocrystal data present in the Cambridge Structural Database. The models accept pairs of coformers and predict whether a cocrystal is likely to form.</div>


2020 ◽  
Vol 87 (8) ◽  
Author(s):  
Xin Liu ◽  
Fei Tao ◽  
Haodong Du ◽  
Wenbin Yu ◽  
Kailai Xu

Abstract Artificial neural network (ANN) models are used to learn the nonlinear constitutive laws based on indirectly measurable data. The real input and output of the ANN model are derived from indirect data using a mechanical system, which is composed of several subsystems including the ANN model. As the ANN model is coupled with other subsystems, the input of the ANN model needs to be determined during the training. This approach integrates measurable data, mechanics, and ANN models so that the ANN models can be trained without direct data which is usually not available from experiments. Two examples are provided as an illustration of the proposed approach. The first example uses two-dimensional (2D) finite element (FE) analysis to train an ANN model to learn the nonlinear in-plane shear constitutive law. The second example applies a continuum damage model to train an ANN model to learn the damage accumulation law. The results show that the trained ANN models achieve great accuracy based on the proposed approach.


Author(s):  
Joarder Kamruzzaman ◽  
Ruhul A. Sarker

The primary aim of this chapter is to present an overview of the artificial neural network basics and operation, architectures, and the major algorithms used for training the neural network models. As can be seen in subsequent chapters, neural networks have made many useful contributions to solve theoretical and practical problems in finance and manufacturing areas. The secondary aim here is therefore to provide a brief review of artificial neural network applications in finance and manufacturing areas.


2007 ◽  
Vol 362 (1479) ◽  
pp. 421-430 ◽  
Author(s):  
Sami Merilaita

In this paper, I investigate the use of artificial neural networks in the study of prey coloration. I briefly review the anti-predator functions of prey coloration and describe both in general terms and with help of two studies as specific examples the use of neural network models in the research on prey coloration. The first example investigates the effect of visual complexity of background on evolution of camouflage. The second example deals with the evolutionary choice of defence strategy, crypsis or aposematism. I conclude that visual information processing by predators is central in evolution of prey coloration. Therefore, the capability to process patterns as well as to imitate aspects of predator's information processing and responses to visual information makes neural networks a well-suited modelling approach for the study of prey coloration. In addition, their suitability for evolutionary simulations is an advantage when complex or dynamic interactions are modelled. Since not all behaviours of neural network models are necessarily biologically relevant, it is important to validate a neural network model with empirical data. Bringing together knowledge about neural networks with knowledge about topics of prey coloration would provide a potential way to deepen our understanding of the specific appearances of prey coloration.


Mathematics ◽  
2021 ◽  
Vol 9 (4) ◽  
pp. 356 ◽  
Author(s):  
Svajone Bekesiene ◽  
Ieva Meidute-Kavaliauskiene ◽  
Vaida Vasiliauskiene

This study considers the usage of multilinear regression and artificial neural network modelling to forecast ozone concentrations with regard to weather-related indicators (wind speed, wind direction, relative humidity and temperature). Initial data were obtained by measuring the meteorological parameters using the PC Radio Weather Station. Ozone concentrations near high-voltage lines were measured using RS1003 and at a 220 m distance using ML9811. Neural network models such as the multilayer perceptron and radial basis function neural networks were constructed. The prognostic capacities of the designed models were assessed by comparing the result data by way of the square of the coefficient of multiple correlations (R2) and mean square error (MSE) values. The number of hidden neurons was optimised by decreasing an error function that recorded the number of units in the hidden layers to the precision of the expanded networks. The neural software IBM SPSS 26v was used for artificial neural network (ANN) modelling. The study demonstrated that the linear regression modelling approach was lacking in its capacity to predict the investigated ozone concentrations by used parameters, whereas the use of an ANN offered more precise outcomes. The conducted tests’ results established the strength of the designed artificial neural network models with irrelevant differences between detected and forecasted data.


Sign in / Sign up

Export Citation Format

Share Document