Constructing Multilayer Feedforward Neural Networks to Approximate Nonlinear Functions in Engineering Mechanics Applications

Jin-Song Pei; Eric C. Mai

doi:10.1115/1.2957600

Constructing Multilayer Feedforward Neural Networks to Approximate Nonlinear Functions in Engineering Mechanics Applications

Journal of Applied Mechanics ◽

10.1115/1.2957600 ◽

2008 ◽

Vol 75 (6) ◽

Cited By ~ 11

Author(s):

Jin-Song Pei ◽

Eric C. Mai

Keyword(s):

Neural Network ◽

Neural Networks ◽

Network Architecture ◽

Feedforward Neural Networks ◽

Nonlinear Functions ◽

Neural Network Architecture ◽

New Approach ◽

Restoring Forces ◽

Multilayer Feedforward Neural Network ◽

Engineering Mechanics

This paper presents a major step in the development and validation of a systematic prototype-based methodology for designing multilayer feedforward neural networks to model nonlinearities common in engineering mechanics. The applications of this work include (but are not limited to) system identification of nonlinear dynamic systems and neural-network-based damage detection. In this and previous studies (Pei, J. S., 2001, “Parametric and Nonparametric Identification of Nonlinear Systems,” Ph.D. thesis, Columbia University; Pei, J. S., and Smyth, A. W., 2006, “A New Approach to Design Multilayer Feedforward Neural Network Architecture in Modeling Nonlinear Restoring Forces. Part I: Formulation,” J. Eng. Mech., 132(12), pp. 1290–1300; Pei, J. S., and Smyth, A. W., 2006, “A New Approach to Design Multilayer Feedforward Neural Network Architecture in Modeling Nonlinear Restoring Forces. Part II: Applications,” J. Eng. Mech., 132(12), pp. 1301–1312; Pei, J. S., Wright, J. P., and Smyth, A. W., 2005, “Mapping Polynomial Fitting Into Feedforward Neural Networks for Modeling Nonlinear Dynamic Systems and Beyond,” Comput. Methods Appl. Mech. Eng., 194(42–44), pp. 4481–4505), the authors do not presume to provide a universal method to approximate any arbitrary function. Rather the focus is given to the development of a procedure which will consistently lead to successful approximations of nonlinear functions within the specified field. This is done by examining the dominant features of the function to be approximated and exploiting the strength of the sigmoidal basis function. As a result, a greater efficiency and understanding of both neural network architecture (e.g., the number of hidden nodes) as well as weight and bias values is achieved. Through the use of illuminating mathematical insights and a large number of training examples, this study demonstrates the simplicity, power, and versatility of the proposed prototype-based initialization methodology. A clear procedure for initializing neural networks to model various nonlinear functions commonly seen in engineering mechanics is provided. The proposed methodology is compared with the widely used Nguyen–Widrow initialization to demonstrate its robustness and efficiency in the specified applications. Future work is also identified.

Download Full-text

New Approach to Designing Multilayer Feedforward Neural Network Architecture for Modeling Nonlinear Restoring Forces. II: Applications

Journal of Engineering Mechanics ◽

10.1061/(asce)0733-9399(2006)132:12(1301) ◽

2006 ◽

Vol 132 (12) ◽

pp. 1301-1312 ◽

Cited By ~ 14

Author(s):

Jin-Song Pei ◽

Andrew W. Smyth

Keyword(s):

Neural Network ◽

Network Architecture ◽

Feedforward Neural Network ◽

Neural Network Architecture ◽

New Approach ◽

Restoring Forces ◽

Multilayer Feedforward Neural Network

Download Full-text

New Approach to Designing Multilayer Feedforward Neural Network Architecture for Modeling Nonlinear Restoring Forces. I: Formulation

Journal of Engineering Mechanics ◽

10.1061/(asce)0733-9399(2006)132:12(1290) ◽

2006 ◽

Vol 132 (12) ◽

pp. 1290-1300 ◽

Cited By ~ 22

Author(s):

Jin-Song Pei ◽

Andrew W. Smyth

Keyword(s):

Neural Network ◽

Network Architecture ◽

Feedforward Neural Network ◽

Neural Network Architecture ◽

New Approach ◽

Restoring Forces ◽

Multilayer Feedforward Neural Network

Download Full-text

SCORING MODELING BASED ON NEURAL NETWORKS FOR DETERMINING A BANK BORROWER'S RATING

Economy of Ukraine ◽

10.15407/economyukr.2020.10.054 ◽

2020 ◽

Vol 2020 (10) ◽

pp. 54-62

Author(s):

Oleksii VASYLIEV ◽

Keyword(s):

Neural Network ◽

Neural Networks ◽

Network Architecture ◽

Statistical Data ◽

Activation Function ◽

Decision Making Process ◽

Neural Network Architecture ◽

Acceptable Accuracy ◽

The Neural Network ◽

Sigmoid Activation Function

The problem of applying neural networks to calculate ratings used in banking in the decision-making process on granting or not granting loans to borrowers is considered. The task is to determine the rating function of the borrower based on a set of statistical data on the effectiveness of loans provided by the bank. When constructing a regression model to calculate the rating function, it is necessary to know its general form. If so, the task is to calculate the parameters that are included in the expression for the rating function. In contrast to this approach, in the case of using neural networks, there is no need to specify the general form for the rating function. Instead, certain neural network architecture is chosen and parameters are calculated for it on the basis of statistical data. Importantly, the same neural network architecture can be used to process different sets of statistical data. The disadvantages of using neural networks include the need to calculate a large number of parameters. There is also no universal algorithm that would determine the optimal neural network architecture. As an example of the use of neural networks to determine the borrower's rating, a model system is considered, in which the borrower's rating is determined by a known non-analytical rating function. A neural network with two inner layers, which contain, respectively, three and two neurons and have a sigmoid activation function, is used for modeling. It is shown that the use of the neural network allows restoring the borrower's rating function with quite acceptable accuracy.

Download Full-text

Reynolds averaged turbulence modelling using deep neural networks with embedded invariance

Journal of Fluid Mechanics ◽

10.1017/jfm.2016.615 ◽

2016 ◽

Vol 807 ◽

pp. 155-166 ◽

Cited By ~ 274

Author(s):

Julia Ling ◽

Andrew Kurzawski ◽

Jeremy Templeton

Keyword(s):

Neural Network ◽

Neural Networks ◽

Reynolds Stress ◽

Network Architecture ◽

Eddy Viscosity ◽

Deep Neural Networks ◽

Test Cases ◽

Neural Network Architecture ◽

Stress Anisotropy ◽

Anisotropy Tensor

There exists significant demand for improved Reynolds-averaged Navier–Stokes (RANS) turbulence models that are informed by and can represent a richer set of turbulence physics. This paper presents a method of using deep neural networks to learn a model for the Reynolds stress anisotropy tensor from high-fidelity simulation data. A novel neural network architecture is proposed which uses a multiplicative layer with an invariant tensor basis to embed Galilean invariance into the predicted anisotropy tensor. It is demonstrated that this neural network architecture provides improved prediction accuracy compared with a generic neural network architecture that does not embed this invariance property. The Reynolds stress anisotropy predictions of this invariant neural network are propagated through to the velocity field for two test cases. For both test cases, significant improvement versus baseline RANS linear eddy viscosity and nonlinear eddy viscosity models is demonstrated.

Download Full-text

Towards Heterogeneous Multi-Agent Reinforcement Learning with Graph Neural Networks

10.5753/eniac.2020.12161 ◽

2020 ◽

Author(s):

Douglas Meneghetti ◽

Reinaldo Bianchi

Keyword(s):

Neural Network ◽

Neural Networks ◽

Network Architecture ◽

Communication Channels ◽

Neural Network Architecture ◽

Graph Representations ◽

Labeled Graph ◽

Multiple Agent ◽

Multi Agent ◽

Graph Neural Networks

This work proposes a neural network architecture that learns policies for multiple agent classes in a heterogeneous multi-agent reinforcement setting. The proposed network uses directed labeled graph representations for states, encodes feature vectors of different sizes for different entity classes, uses relational graph convolution layers to model different communication channels between entity types and learns distinct policies for different agent classes, sharing parameters wherever possible. Results have shown that specializing the communication channels between entity classes is a promising step to achieve higher performance in environments composed of heterogeneous entities.

Download Full-text

Comparative Performance Analysis of Neural Network Real-Time Object Detections in Different Implementations

EPJ Web of Conferences ◽

10.1051/epjconf/202022602020 ◽

2020 ◽

Vol 226 ◽

pp. 02020

Author(s):

Alexey V. Stadnik ◽

Pavel S. Sazhin ◽

Slavomir Hnatic

Keyword(s):

Neural Network ◽

Neural Networks ◽

Computer Vision ◽

Performance Analysis ◽

Object Detection ◽

Real Time ◽

Network Architecture ◽

Neural Network Architecture ◽

Comparative Performance

The performance of neural networks is one of the most important topics in the field of computer vision. In this work, we analyze the speed of object detection using the well-known YOLOv3 neural network architecture in different frameworks under different hardware requirements. We obtain results, which allow us to formulate preliminary qualitative conclusions about the feasibility of various hardware scenarios to solve tasks in real-time environments.

Download Full-text

Optimizing the Simplicial-Map Neural Network Architecture

Journal of Imaging ◽

10.3390/jimaging7090173 ◽

2021 ◽

Vol 7 (9) ◽

pp. 173

Author(s):

Eduardo Paluzo-Hidalgo ◽

Rocio Gonzalez-Diaz ◽

Miguel A. Gutiérrez-Naranjo ◽

Jónathan Heras

Keyword(s):

Neural Network ◽

Neural Networks ◽

Network Architecture ◽

Simplicial Complexes ◽

Original Network ◽

Neural Network Architecture ◽

Simplicial Map ◽

Classification Tool ◽

Universal Approximators

Simplicial-map neural networks are a recent neural network architecture induced by simplicial maps defined between simplicial complexes. It has been proved that simplicial-map neural networks are universal approximators and that they can be refined to be robust to adversarial attacks. In this paper, the refinement toward robustness is optimized by reducing the number of simplices (i.e., nodes) needed. We have shown experimentally that such a refined neural network is equivalent to the original network as a classification tool but requires much less storage.

Download Full-text

Identifikasi Penyakit Diabetes Millitus Menggunakan Jaringan Syaraf Tiruan Dengan Metode Perambatan-Balik (Backpropagation)

10.31224/osf.io/bgs42 ◽

2018 ◽

Author(s):

Sutedi Sutedi

Keyword(s):

Neural Network ◽

Neural Networks ◽

Network Architecture ◽

Neural Network Architecture ◽

Accuracy Rate ◽

Layer 2 ◽

Input Layer ◽

Hidden Layer ◽

Diabetes Melitus

Diabetes Melitus (DM) is dangerous disease that affect many of the variouslayer of work society. This disease is not easy to accurately recognized by thegeneral society. So we need to develop a system that can identify accurately. Systemis built using neural networks with backpropagation methods and the functionactivation sigmoid. Neural network architecture using 8 input layer, 2 output layerand 5 hidden layer. The results show that this methods succesfully clasifies datadiabetics and non diabetics with near 100% accuracy rate.

Download Full-text

Baby Cry Detection in Domestic Environment using Convolutional Neural Networks

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.g5260.059720 ◽

2020 ◽

Vol 9 (7) ◽

pp. 793-795

Keyword(s):

Neural Network ◽

Neural Networks ◽

Convolutional Neural Networks ◽

Network Architecture ◽

Web Application ◽

Well Being ◽

Emotion Detection ◽

Neural Network Architecture ◽

Domestic Environment ◽

The Neural Network

In this paper we will identify a cry signals of infants and the explanation behind the screams below 0-6 months of segment age. Detection of baby cry signals is essential for the pre-processing of various applications involving crial analysis for baby caregivers, such as emotion detection. Since cry signals hold baby well-being information and can be understood to an extent by experienced parents and experts. We train and validate the neural network architecture for baby cry detection and also test the fastAI with the neural network. Trained neural networks will provide a model and this model can predict the reason behind the cry sound. Only the cry sounds are recognized, and alert the user automatically. Created a web application by responding and detecting different emotions including hunger, tired, discomfort, bellypain.

Download Full-text

Performance of Multi-Layer Feedforward Neural Networks to Predict Liver Transplantation Outcome

Methods of Information in Medicine ◽

10.1055/s-0038-1634637 ◽

1996 ◽

Vol 35 (01) ◽

pp. 12-18 ◽

Cited By ~ 18

Author(s):

M. Subotin ◽

W. Marsh ◽

J. McMichael ◽

J. J. Fung ◽

I. Dvorchik

Keyword(s):

Neural Network ◽

Neural Networks ◽

Liver Transplantation ◽

Missing Data ◽

Network Performance ◽

Missing Values ◽

Feedforward Neural Networks ◽

Linear Scaling ◽

Data Set ◽

New Approach

AbstractA novel multisolutional clustering and quantization (MCO) algorithm has been developed that provides a flexible way to preprocess data. It was tested whether it would impact the neural network’s performance favorably and whether the employment of the proposed algorithm would enable neural networks to handle missing data. This was assessed by comparing the performance of neural networks using a well-documented data set to predict outcome following liver transplantation. This new approach to data preprocessing leads to a statistically significant improvement in network performance when compared to simple linear scaling. The obtained results also showed that coding missing data as zeroes in combination with the MCO algorithm, leads to a significant improvement in neural network performance on a data set containing missing values in 59.4% of cases when compared to replacement of missing values with either series means or medians.

Download Full-text