A Model-Guided Neural Network for the Prediction of Creep Behavior Under in-Service Conditions

Hui Hong; Zhenwei Cai; Han Wang; Weizhe Wang; Yingzheng Liu

doi:10.1115/1.4047281

A Model-Guided Neural Network for the Prediction of Creep Behavior Under in-Service Conditions

Journal of Engineering for Gas Turbines and Power ◽

10.1115/1.4047281 ◽

2020 ◽

Vol 142 (7) ◽

Author(s):

Hui Hong ◽

Zhenwei Cai ◽

Han Wang ◽

Weizhe Wang ◽

Yingzheng Liu

Keyword(s):

Neural Network ◽

Creep Behavior ◽

Activation Function ◽

Turbine Rotor ◽

Primary Creep ◽

Training Data ◽

Secondary Creep ◽

The Neural Network ◽

Stress Fluctuation ◽

Service Conditions

Abstract Data-driven neural network methods have been widely applied for the prediction of stress–strain behavior, but have proven ill-suited for the extrapolation of time-dependent creep behavior. To overcome this problem, we embedded a physics-based model into feedforward neural networks (FFNNs) to construct a model-guided neural network (MGNN). We proposed a new initialization method for the weights in the model, based on selecting the appropriate physics-based model and activation function, and the resulting MGNN was used for predicting the creep behavior of blade-grooves in a steam turbine rotor under in-service conditions. We compared the performance of the MGNN with baseline methods, namely MGNN0, a FFNN, and a nonlinear autoregressive network with exogenous inputs (network). The results showed that the physics-based model and the neural network in the MGNN complemented each other: the model provided physical relationships to guide the neural network, and the neural network provided stress-fluctuation-tracking for the model. This functionality enabled primary creep behavior to be used as training data for the MGNN, enabling it to predict secondary creep behavior.

Download Full-text

Analysis of Fin-Tube Evaporator Performance With Limited Experimental Data Using Artificial Neural Networks

10.1115/imece2000-1466 ◽

2000 ◽

Author(s):

Arturo Pacheco-Vega ◽

Mihir Sen ◽

Rodney L. McClain

Keyword(s):

Neural Network ◽

Heat Rate ◽

Network Models ◽

Activation Function ◽

Operating Conditions ◽

Training Data ◽

Neural Network Models ◽

The Neural Network ◽

Artificial Neural ◽

Fin Tube

Abstract In the current study we consider the problem of accuracy in heat rate estimations from artificial neural network models of heat exchangers used for refrigeration applications. The network configuration is of the feedforward type with a sigmoid activation function and a backpropagation algorithm. Limited experimental measurements from a manufacturer are used to show the capability of the neural network technique in modeling the heat transfer in these systems. Results from this exercise show that a well-trained network correlates the data with errors of the same order as the uncertainty of the measurements. It is also shown that the number and distribution of the training data are linked to the performance of the network when estimating the heat rates under different operating conditions, and that networks trained from few tests may give large errors. A methodology based on the cross-validation technique is presented to find regions where not enough data are available to construct a reliable neural network. The results from three tests show that the proposed methodology gives an upper bound of the estimated error in the heat rates.

Download Full-text

Prediction of stress in fillet portion of spur gears using artificial neural networks

Artificial intelligence for engineering design analysis and manufacturing ◽

10.1017/s0890060408000036 ◽

2007 ◽

Vol 22 (1) ◽

pp. 41-51

Author(s):

M.S. Shunmugam ◽

N. Siva Prasad

Keyword(s):

Neural Network ◽

Gear Tooth ◽

Spur Gear ◽

Activation Function ◽

Training Data ◽

Spur Gears ◽

Pressure Angle ◽

The Neural Network ◽

Artificial Neural ◽

Taguchi’S Design Of Experiments

AbstractA fillet curve is provided at the root of the spur gear tooth, as stresses are high in this portion. The fillet curve may be a trochoid or an arc of suitable size as specified by designer. The fillet stress is influenced by the fillet geometry as well as the number of teeth, modules, and the pressure angle of the gear. Because the relationship is nonlinear and complex, an artificial neural network and a backpropagation algorithm are used in the present work to predict the fillet stresses. Training data are obtained from finite element simulations that are greatly reduced using Taguchi's design of experiments. Each simulation takes around 30 min. The 4-5-1 network and a sigmoid activation function are chosen. TRAINLM function is used for training the network with a learning rate parameter of 0.01 and a momentum constant of 0.8. The neural network is able to predict the fillet stresses in 0.03 s with reasonable accuracy for spur gears having 25–125 teeth, a 1–5 mm module, a 0.05–0.45 mm fillet radius, and a 15°–25° pressure angle.

Download Full-text

SCORING MODELING BASED ON NEURAL NETWORKS FOR DETERMINING A BANK BORROWER'S RATING

Economy of Ukraine ◽

10.15407/economyukr.2020.10.054 ◽

2020 ◽

Vol 2020 (10) ◽

pp. 54-62

Author(s):

Oleksii VASYLIEV ◽

Keyword(s):

Neural Network ◽

Neural Networks ◽

Network Architecture ◽

Statistical Data ◽

Activation Function ◽

Decision Making Process ◽

Neural Network Architecture ◽

Acceptable Accuracy ◽

The Neural Network ◽

Sigmoid Activation Function

The problem of applying neural networks to calculate ratings used in banking in the decision-making process on granting or not granting loans to borrowers is considered. The task is to determine the rating function of the borrower based on a set of statistical data on the effectiveness of loans provided by the bank. When constructing a regression model to calculate the rating function, it is necessary to know its general form. If so, the task is to calculate the parameters that are included in the expression for the rating function. In contrast to this approach, in the case of using neural networks, there is no need to specify the general form for the rating function. Instead, certain neural network architecture is chosen and parameters are calculated for it on the basis of statistical data. Importantly, the same neural network architecture can be used to process different sets of statistical data. The disadvantages of using neural networks include the need to calculate a large number of parameters. There is also no universal algorithm that would determine the optimal neural network architecture. As an example of the use of neural networks to determine the borrower's rating, a model system is considered, in which the borrower's rating is determined by a known non-analytical rating function. A neural network with two inner layers, which contain, respectively, three and two neurons and have a sigmoid activation function, is used for modeling. It is shown that the use of the neural network allows restoring the borrower's rating function with quite acceptable accuracy.

Download Full-text

Rethinking the Random Cropping Data Augmentation Method Used in the Training of CNN-Based SAR Image Ship Detector

Remote Sensing ◽

10.3390/rs13010034 ◽

2020 ◽

Vol 13 (1) ◽

pp. 34

Author(s):

Rong Yang ◽

Robert Wang ◽

Yunkai Deng ◽

Xiaoxue Jia ◽

Heng Zhang

Keyword(s):

Neural Network ◽

Data Augmentation ◽

Back Propagation ◽

Detection Performance ◽

Training Data ◽

Sar Image ◽

Optical Images ◽

The Neural Network ◽

Effective Training ◽

Standard Configuration

The random cropping data augmentation method is widely used to train convolutional neural network (CNN)-based target detectors to detect targets in optical images (e.g., COCO datasets). It can expand the scale of the dataset dozens of times while consuming only a small amount of calculations when training the neural network detector. In addition, random cropping can also greatly enhance the spatial robustness of the model, because it can make the same target appear in different positions of the sample image. Nowadays, random cropping and random flipping have become the standard configuration for those tasks with limited training data, which makes it natural to introduce them into the training of CNN-based synthetic aperture radar (SAR) image ship detectors. However, in this paper, we show that the introduction of traditional random cropping methods directly in the training of the CNN-based SAR image ship detector may generate a lot of noise in the gradient during back propagation, which hurts the detection performance. In order to eliminate the noise in the training gradient, a simple and effective training method based on feature map mask is proposed. Experiments prove that the proposed method can effectively eliminate the gradient noise introduced by random cropping and significantly improve the detection performance under a variety of evaluation indicators without increasing inference cost.

Download Full-text

Application of ANN Back-Propagation for Fracture Design Parameters in Extra-Low Cycle Rotating Bending Fatigue

Materials Science Forum ◽

10.4028/www.scientific.net/msf.610-613.450 ◽

2009 ◽

Vol 610-613 ◽

pp. 450-453

Author(s):

Hong Yan Duan ◽

You Tang Li ◽

Jin Zhang ◽

Gui Ping He

Keyword(s):

Neural Network ◽

Neural Network Model ◽

Back Propagation ◽

Training Data ◽

Design Parameters ◽

Ann Model ◽

Bending Fatigue ◽

The Neural Network ◽

Rotating Bending Fatigue ◽

Rotating Bending

The fracture problems of ecomaterial (aluminum alloyed cast iron) under extra-low cycle rotating bending fatigue loading were studied using artificial neural networks (ANN) in this paper. The training data were used in the formation of training set of ANN. The ANN model exhibited excellent in results comparison with the experimental results. It was concluded that predicted fracture design parameters by the trained neural network model seem more reasonable compared to approximate methods. It is possible to claim that, ANN is fairly promising prediction technique if properly used. Training ANN model was introduced at first. And then the Training data for the development of the neural network model was obtained from the experiments. The input parameters, notch depth, the presetting deflection and tip radius of the notch, and the output parameters, the cycle times of fracture were used during the network training. The neural network architecture is designed. The ANN model was developed using back propagation architecture with three layers jump connections, where every layer was connected or linked to every previous layer. The number of hidden neurons was determined according to special formula. The performance of system is summarized at last. In order to facilitate the comparisons of predicted values, the error evaluation and mean relative error are obtained. The result show that the training model has good performance, and the experimental data and predicted data from ANN are in good coherence.

Download Full-text

Bootstrapping a Neural Morphological Generator from Morphological Analyzer Output for Inuktitut

10.33011/computel.v2i.455 ◽

2019 ◽

Vol 2 (1) ◽

Author(s):

Jeffrey Micher

Keyword(s):

Neural Network ◽

Training Data ◽

Data Set ◽

Set Size ◽

The Neural Network ◽

Surface Character ◽

Finite State ◽

Character Sequences ◽

Finite State Transducer

We present a method for building a morphological generator from the output of an existing analyzer for Inuktitut, in the absence of a two-way finite state transducer which would normally provide this functionality. We make use of a sequence to sequence neural network which “translates” underlying Inuktitut morpheme sequences into surface character sequences. The neural network uses only the previous and the following morphemes as context. We report a morpheme accuracy of approximately 86%. We are able to increase this accuracy slightly by passing deep morphemes directly to output for unknown morphemes. We do not see significant improvement when increasing training data set size, and postulate possible causes for this.

Download Full-text

Signal processing algorithm for neural networks with integrodifferential splines as an activation function and its particular case of image classification

Highly available systems ◽

10.18127/j20729472-202102-02 ◽

2021 ◽

Author(s):

T.K. Biryukova

Keyword(s):

Neural Network ◽

Neural Networks ◽

Image Classification ◽

Activation Function ◽

Experimental Comparison ◽

Training Time ◽

Operation Speed ◽

The Neural Network ◽

Linear Algebraic Equations ◽

Network Operation

Classic neural networks suppose trainable parameters to include just weights of neurons. This paper proposes parabolic integrodifferential splines (ID-splines), developed by author, as a new kind of activation function (AF) for neural networks, where ID-splines coefficients are also trainable parameters. Parameters of ID-spline AF together with weights of neurons are vary during the training in order to minimize the loss function thus reducing the training time and increasing the operation speed of the neural network. The newly developed algorithm enables software implementation of the ID-spline AF as a tool for neural networks construction, training and operation. It is proposed to use the same ID-spline AF for neurons in the same layer, but different for different layers. In this case, the parameters of the ID-spline AF for a particular layer change during the training process independently of the activation functions (AFs) of other network layers. In order to comply with the continuity condition for the derivative of the parabolic ID-spline on the interval (x x0, n) , its parameters fi (i= 0,...,n) should be calculated using the tridiagonal system of linear algebraic equations: To solve the system it is necessary to use two more equations arising from the boundary conditions for specific problems. For exam- ple the values of the grid function (if they are known) in the points (x x0, n) may be used for solving the system above: f f x0 = ( 0) , f f xn = ( n) . The parameters Iii+1 (i= 0,...,n−1 ) are used as trainable parameters of neural networks. The grid boundaries and spacing of the nodes of ID-spline AF are best chosen experimentally. The optimal selection of grid nodes allows improving the quality of results produced by the neural network. The formula for a parabolic ID-spline is such that the complexity of the calculations does not depend on whether the grid of nodes is uniform or non-uniform. An experimental comparison of the results of image classification from the popular FashionMNIST dataset by convolutional neural 0, x< 0 networks with the ID-spline AFs and the well-known ReLUx( ) =AF was carried out. The results reveal that the usage x x, ≥ 0 of the ID-spline AFs provides better accuracy of neural network operation than the ReLU AF. The training time for two convolutional layers network with two ID-spline AFs is just about 2 times longer than with two instances of ReLU AF. Doubling of the training time due to complexity of the ID-spline formula is the acceptable price for significantly better accuracy of the network. Wherein the difference of an operation speed of the networks with ID-spline and ReLU AFs will be negligible. The use of trainable ID-spline AFs makes it possible to simplify the architecture of neural networks without losing their efficiency. The modification of the well-known neural networks (ResNet etc.) by replacing traditional AFs with ID-spline AFs is a promising approach to increase the neural network operation accuracy. In a majority of cases, such a substitution does not require to train the network from scratch because it allows to use pre-trained on large datasets neuron weights supplied by standard software libraries for neural network construction thus substantially shortening training time.

Download Full-text

Deep Convolutional Neural Network Phase Unwrapping for Fringe Projection 3D Imaging

Sensors ◽

10.3390/s20133691 ◽

2020 ◽

Vol 20 (13) ◽

pp. 3691

Author(s):

Jian Liang ◽

Junchao Zhang ◽

Jianbo Shao ◽

Bofan Song ◽

Baoli Yao ◽

...

Keyword(s):

Neural Network ◽

3D Imaging ◽

Phase Unwrapping ◽

Training Data ◽

Fringe Projection ◽

Second Step ◽

Network Configuration ◽

Simulation Data ◽

The Neural Network ◽

Key Features

Phase unwrapping is a very important step in fringe projection 3D imaging. In this paper, we propose a new neural network for accurate phase unwrapping to address the special needs in fringe projection 3D imaging. Instead of labeling the wrapped phase with integers directly, a two-step training process with the same network configuration is proposed. In the first step, the network (network I) is trained to label only four key features in the wrapped phase. In the second step, another network with same configuration (network II) is trained to label the wrapped phase segments. The advantages are that the dimension of the wrapped phase can be much larger from that of the training data, and the phase with serious Gaussian noise can be correctly unwrapped. We demonstrate the performance and key features of the neural network trained with the simulation data for the experimental data.

Download Full-text

Smartphone Based Classification System for Indoor Navigation

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.775.436 ◽

2015 ◽

Vol 775 ◽

pp. 436-440

Author(s):

Yazan Aljeroudi ◽

Ari Legowo ◽

Erwin Sulaeman

Keyword(s):

Neural Network ◽

Classification System ◽

Indoor Environment ◽

Activation Function ◽

Indoor Navigation ◽

The Neural Network ◽

Evaluation Measure ◽

Tuning Process ◽

Learning Machine ◽

Accuracy Rates

This paper introduces a smartphone-based classification system for an indoor environment of a walking person. The system relies only on smartphone inertial data and it can be considered as a smartphone-based aiding system for an indoor navigation. In addition, it does not need pre-installing of wireless network in the environment or heavily tuning process before the navigation run. Therefore, this system can be used as an aiding block where the person wants to localize himself in an indoor environment starting from known navigations point. This system categorizes person navigation in indoor environment into three types of classes: walking straight, turning right, and turning left. There is an ELM (Extreme Learning Machine)-Based neural network for deciding the class of the current navigation action. The evaluation measure shows that the best performance is obtained with the Radial Basis Function (RBF) as the activation function of the neural network. Also, the obtained accuracy rates up to 95%.

Download Full-text

Machine Learned Prediction of Reaction Template Applicability for Data-Driven Retrosynthetic Predictions of Energetic Materials

10.26434/chemrxiv.12046623.v1 ◽

2020 ◽

Cited By ~ 1

Author(s):

Michael Fortunato ◽

Connor W. Coley ◽

Brian Barnes ◽

Klavs F. Jensen

Keyword(s):

Neural Network ◽

Energetic Materials ◽

Energetic Material ◽

Training Data ◽

Data Generation ◽

Material Synthesis ◽

Subgraph Matching ◽

Planning Models ◽

The Neural Network ◽

Patent Literature

State of the art computer-aided synthesis planning models are naturally biased toward commonly reported chemical reactions, thus reducing the usefulness of those models for the unusual chemistry relevant to shock physics. To address this problem, a neural network was trained to recognize reaction template applicability for small organic molecules to supplement the rare reaction examples of relevance to energetic materials. The training data for the neural network was generated by brute force determination of template subgraph matching for product molecules from a database of reactions in U.S. patent literature. This data generation strategy successfully augmented the information about template applicability for rare reaction mechanisms in the reaction database. The increased ability to recognize rare reaction templates was demonstrated for reaction templates of interest for energetic material synthesis such as heterocycle ring formation.<div><br></div><div>The following article has been submitted to by the 21st Biennial APS Conference on Shock Compression of Condensed Matter. After it is published, it will be found at https://publishing.aip.org/resources/librarians/products/journals/.</div>

Download Full-text