Efficient Deep Learning for Gradient-Enhanced Stress Dependent Damage Model

Applied Sciences ◽

10.3390/app10072556 ◽

2020 ◽

Vol 10 (7) ◽

pp. 2556

Author(s):

Xiaoying Zhuang ◽

L. C. Nguyen ◽

Hung Nguyen-Xuan ◽

Naif Alajlan ◽

Timon Rabczuk

Keyword(s):

Deep Learning ◽

Network Architecture ◽

Damage Model ◽

Activation Function ◽

Neural Network Architecture ◽

Three Point Bending ◽

Material Parameters ◽

Applied Forces ◽

Stress Dependent ◽

Shear Bending

This manuscript introduces a computational approach to micro-damage problems that uses deep learning to predict load-deflection curves. The location of the applied forces, the dimensions of the specimen and the material parameters serve as inputs. The micro-damage is modelled with a gradient-enhanced damage model, which ensures the well-posedness of the boundary value problem and yields mesh-independent results in computational methods such as FEM. We employ the Adam optimizer and the rectified linear unit (ReLU) activation function for training and investigate the deep neural network architecture. The performance of our approach is demonstrated through numerical examples including the three-point bending specimen, shear bending on an L-shaped specimen and different failure mechanisms.
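As a rough illustration of the two training ingredients named in this abstract, the sketch below implements the standard Adam update rule and the ReLU function in plain Python. It is not the authors' code: the function names, the hyperparameter defaults and the toy quadratic objective are assumptions chosen only to keep the example self-contained.

```python
import math


def relu(x: float) -> float:
    """Rectified linear unit: passes positive values, clamps negatives to 0."""
    return max(0.0, x)


def adam_minimize(grad, x0, lr=0.1, beta1=0.9, beta2=0.999, eps=1e-8, steps=500):
    """Minimize a scalar function given its gradient, using the Adam update.

    All defaults are illustrative assumptions, not values from the paper.
    """
    x, m, v = x0, 0.0, 0.0
    for t in range(1, steps + 1):
        g = grad(x)
        m = beta1 * m + (1.0 - beta1) * g        # first-moment estimate
        v = beta2 * v + (1.0 - beta2) * g * g    # second-moment estimate
        m_hat = m / (1.0 - beta1 ** t)           # bias correction
        v_hat = v / (1.0 - beta2 ** t)
        x -= lr * m_hat / (math.sqrt(v_hat) + eps)
    return x


# Toy objective (hypothetical): f(x) = (x - 3)^2, with gradient 2 * (x - 3).
x_min = adam_minimize(lambda x: 2.0 * (x - 3.0), x0=0.0)
print(x_min)  # should end up close to 3.0
```

In a real network the same update is applied to every weight independently, with the gradient obtained by backpropagation.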

SCORING MODELING BASED ON NEURAL NETWORKS FOR DETERMINING A BANK BORROWER'S RATING

Economy of Ukraine ◽

10.15407/economyukr.2020.10.054 ◽

2020 ◽

Vol 2020 (10) ◽

pp. 54-62

Author(s):

Oleksii VASYLIEV ◽

Keyword(s):

Neural Network ◽

Neural Networks ◽

Network Architecture ◽

Statistical Data ◽

Activation Function ◽

Decision Making Process ◽

Neural Network Architecture ◽

Acceptable Accuracy ◽

The Neural Network ◽

Sigmoid Activation Function

The problem of applying neural networks to calculate the ratings that banks use when deciding whether to grant loans to borrowers is considered. The task is to determine the rating function of the borrower based on a set of statistical data on the effectiveness of loans provided by the bank. When constructing a regression model to calculate the rating function, it is necessary to know its general form. If the form is known, the task is to calculate the parameters that are included in the expression for the rating function. In contrast to this approach, when neural networks are used there is no need to specify the general form of the rating function. Instead, a certain neural network architecture is chosen and its parameters are calculated on the basis of statistical data. Importantly, the same neural network architecture can be used to process different sets of statistical data. The disadvantages of using neural networks include the need to calculate a large number of parameters. There is also no universal algorithm that would determine the optimal neural network architecture. As an example of the use of neural networks to determine the borrower's rating, a model system is considered in which the borrower's rating is determined by a known non-analytical rating function. A neural network with two inner layers, which contain three and two neurons respectively and have a sigmoid activation function, is used for modeling. It is shown that the use of the neural network allows restoring the borrower's rating function with quite acceptable accuracy.
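The architecture described in this abstract (two inner layers of three and two neurons with sigmoid activations) can be sketched as a plain-Python forward pass. The abstract does not state the number of input features, the output layer or the training procedure, so the four input features, the single sigmoid output neuron, the random initialization and all function names below are assumptions made for illustration only.

```python
import math
import random


def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))


def init_layer(n_in, n_out, rng):
    """Random weights in [-1, 1] and zero biases (illustrative choice)."""
    weights = [[rng.uniform(-1.0, 1.0) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def dense(inputs, layer):
    """Fully connected layer followed by a sigmoid activation."""
    weights, biases = layer
    return [sigmoid(sum(w * x for w, x in zip(row, inputs)) + b)
            for row, b in zip(weights, biases)]


def build_network(n_features, seed=0):
    rng = random.Random(seed)
    # Inner layers of 3 and 2 neurons as in the abstract; the 1-neuron
    # output layer is an assumption.
    return [init_layer(n_features, 3, rng),
            init_layer(3, 2, rng),
            init_layer(2, 1, rng)]


def rating(network, features):
    activations = features
    for layer in network:
        activations = dense(activations, layer)
    return activations[0]


def count_parameters(network):
    return sum(len(w) * len(w[0]) + len(b) for w, b in network)


net = build_network(n_features=4)            # 4 features is hypothetical
print(count_parameters(net))                 # 26 weights and biases
print(rating(net, [0.2, 0.5, 0.1, 0.9]))     # untrained score in (0, 1)
```

Even this very small network already has 26 parameters to fit, which illustrates the disadvantage the abstract mentions.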

optNet-50: An Optimized Residual Neural Network Architecture of Deep Learning for Driver's Distraction

2020 IEEE 23rd International Multitopic Conference (INMIC) ◽

10.1109/inmic50486.2020.9318087 ◽

2020 ◽

Author(s):

Tahir Abbas ◽

Syed Farooq Ali ◽

Aadil Zia Khan ◽

Irfan Kareem

Keyword(s):

Neural Network ◽

Deep Learning ◽

Network Architecture ◽

Neural Network Architecture

Deep learning of stochastic contagion dynamics on complex networks

10.21203/rs.3.rs-36564/v1 ◽

2020 ◽

Author(s):

Charles Murphy ◽

Edward Laurence ◽

Antoine Allard

Keyword(s):

Deep Learning ◽

Network Architecture ◽

Time Series Data ◽

Training Data ◽

Series Data ◽

Neural Network Architecture ◽

Complementary Approach ◽

Quantitative Accuracy ◽

Simplifying Assumptions ◽

Effective Models

Forecasting the evolution of contagion dynamics is still an open problem to which mechanistic models only offer a partial answer. To remain mathematically and/or computationally tractable, these models must rely on simplifying assumptions, thereby limiting the quantitative accuracy of their predictions and the complexity of the dynamics they can model. Here, we propose a complementary approach based on deep learning where the effective local mechanisms governing a dynamic are learned automatically from time series data. Our graph neural network architecture makes very few assumptions about the dynamics, and we demonstrate its accuracy using stochastic contagion dynamics of increasing complexity on static and temporal networks. By allowing simulations on arbitrary network structures, our approach makes it possible to explore the properties of the learned dynamics beyond the training data. Our results demonstrate how deep learning offers a new and complementary perspective to build effective models of contagion dynamics on networks.

Modelling Peri-Perceptual Brain Processes in a Deep Learning Spiking Neural Network Architecture

Scientific Reports ◽

10.1038/s41598-018-27169-8 ◽

2018 ◽

Vol 8 (1) ◽

Cited By ~ 16

Author(s):

Zohreh Gholami Doborjeh ◽

Nikola Kasabov ◽

Maryam Gholami Doborjeh ◽

Alexander Sumich

Keyword(s):

Neural Network ◽

Deep Learning ◽

Network Architecture ◽

Spiking Neural Network ◽

Neural Network Architecture

The contribution of hippocampal subfields to the progression of neurodegeneration

10.1101/2020.05.06.081034 ◽

2020 ◽

Cited By ~ 2

Author(s):

Kichang Kwak ◽

Marc Niethammer ◽

Kelly S. Giovanello ◽

Martin Styner ◽

Eran Dayan ◽

...

Keyword(s):

Deep Learning ◽

Network Architecture ◽

Molecular Layer ◽

Total Sample ◽

Neural Network Architecture ◽

Hippocampal Subfields ◽

Validation Set ◽

Hippocampal Structure ◽

Deep Learning Model ◽

Variable Progression

Mild cognitive impairment (MCI) is often considered the precursor of Alzheimer’s disease. However, MCI is associated with substantially variable progression rates, which are not well understood. Attempts to identify the mechanisms that underlie MCI progression have often focused on the hippocampus, but have mostly overlooked its intricate structure and subdivisions. Here, we utilized deep learning to delineate the contribution of hippocampal subfields to MCI progression using a total sample of 1157 subjects (349 in the training set, 427 in a validation set and 381 in the testing set). We propose a dense convolutional neural network architecture that differentiates stable and progressive MCI based on hippocampal morphometry. The proposed deep learning model predicted MCI progression with an accuracy of 75.85%. A novel implementation of occlusion analysis revealed marked differences in the contribution of hippocampal subfields to the performance of the model, with presubiculum, CA1, subiculum, and molecular layer showing the most central role. Moreover, the analysis reveals that 10.5% of the volume of the hippocampus was redundant in the differentiation between stable and progressive MCI. Our predictive model uncovers pronounced differences in the contribution of hippocampal subfields to the progression of MCI. The results may reflect the sparing of hippocampal structure in individuals with a slower progression of neurodegeneration.
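The general idea behind occlusion analysis can be sketched in a few lines of Python: mask one region of the input at a time and record how much the model output changes. The abstract does not describe the authors' novel implementation, so this is only the generic technique; the toy linear model, the region names and the zero baseline are hypothetical stand-ins.

```python
def occlusion_importance(predict, sample, regions, baseline=0.0):
    """Score each region by the drop in model output when it is masked."""
    reference = predict(sample)
    importance = {}
    for name, indices in regions.items():
        occluded = list(sample)
        for i in indices:
            occluded[i] = baseline          # mask this region only
        importance[name] = reference - predict(occluded)
    return importance


# Hypothetical stand-in for a trained model: a fixed linear scorer.
def toy_predict(features):
    weights = [0.5, 0.5, 0.0, 0.1]
    return sum(w * x for w, x in zip(weights, features))


sample = [1.0, 1.0, 1.0, 1.0]
regions = {"region_a": [0, 1], "region_b": [2], "region_c": [3]}
scores = occlusion_importance(toy_predict, sample, regions)
print(sorted(scores, key=scores.get, reverse=True))
```

In this toy setup, masking `region_b` leaves the output unchanged, which is the kind of signal that would mark a region as redundant for the prediction.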

Designing deep neural networks for continual learning in an open world

10.21248/gups.62487 ◽

2021 ◽

Author(s):

Martin Mundt

Keyword(s):

Neural Network ◽

Machine Learning ◽

Deep Learning ◽

Network Architecture ◽

Neural Network Training ◽

Neural Network Architecture ◽

Neural Architecture ◽

Network Training ◽

Classification Tasks ◽

Continual Learning

Deep learning with neural networks seems to have largely replaced the traditional design of computer vision systems. Automated methods that learn a plethora of parameters are now preferred over the previously practiced selection of explicit mathematical operators for a specific task. The entailed promise is that practitioners no longer need to take care of every individual step, but can instead focus on gathering large amounts of data for neural network training. As a consequence, both a shift in mindset towards big datasets and a wave of conceivable applications based exclusively on deep learning can be observed. This PhD dissertation aims to uncover some of the deep learning aspects that are overlooked or mentioned only implicitly, highlight unmentioned assumptions, and finally introduce methods to address the respective immediate weaknesses. In the author’s humble opinion, these prevalent shortcomings can be tied to the fact that the steps involved in the machine learning workflow are frequently decoupled. Success is predominantly measured with accuracy measures designed for evaluation on static benchmark test sets. Individual machine learning workflow components are assessed in isolation with respect to available data, choice of neural network architecture, and a particular learning algorithm, rather than viewing the machine learning system as a whole in the context of a particular application. Correspondingly, in this dissertation, three key challenges have been identified: 1. Choice and flexibility of a neural network architecture. 2. Identification and rejection of unseen unknown data to avoid false predictions. 3. Continual learning without forgetting of already learned information. These latter challenges were already crucial topics in older literature, yet seem to require a renaissance in modern deep learning literature.
Initially, it may appear that they pose independent research questions; however, the thesis posits that the aspects are intertwined and require a joint perspective in machine learning based systems. In summary, the essential question is thus how to pick a suitable neural network architecture for a specific task, how to recognize which data inputs belong to this context and which ones originate from potential other tasks, and ultimately how to continuously include such identified novel data in neural network training over time without overwriting existing knowledge. Thus, the central emphasis of this dissertation is to build on top of existing deep learning strengths, yet also acknowledge the mentioned weaknesses, in an effort to establish a deeper understanding of interdependencies and synergies towards the development of unified solution mechanisms. For this purpose, the main portion of the thesis is in cumulative form. The respective publications can be grouped according to the three challenges outlined above. Correspondingly, chapter 1 focuses on the choice and extendability of neural network architectures, analyzed in the context of popular image classification tasks. An algorithm to automatically determine neural network layer width is introduced and is first contrasted with static architectures found in the literature. The importance of neural architecture design is then further showcased on a real-world application of defect detection in concrete bridges. Chapter 2 comprises the complementary ensuing questions of how to identify unknown concepts and subsequently incorporate them into continual learning. A joint central mechanism is proposed that distinguishes unseen concepts from what is known in classification tasks, while enabling consecutive training without forgetting or revisiting older classes. Once more, the role of the chosen neural network architecture is quantitatively reassessed.
Finally, chapter 3 culminates in an overarching view, where the developed parts are connected. Here, an extensive survey further serves to embed the gained insights in the broader literature landscape and emphasizes the importance of a common frame of thought. The approach ultimately presented thus reflects the overall thesis’ contribution to advancing neural network based machine learning towards a unified solution that ties together the choice of neural architecture with the ability to learn continually and the capability to automatically separate known from unknown data.

Human Perception Based Color Image Segmentation

Computer Engineering and Applications Journal ◽

10.18495/comengapp.v2i3.34 ◽

2013 ◽

Vol 2 (3) ◽

Author(s):

Neeta Pradeep Gargote ◽

Savitha Devaraj ◽

Shravani Shahapure

Keyword(s):

Neural Network ◽

Image Segmentation ◽

Network Architecture ◽

Color Image ◽

Color Space ◽

Human Perception ◽

Activation Function ◽

Color Image Segmentation ◽

Neural Network Architecture ◽

HSI Color Space

Color image segmentation is probably the most important task in image analysis and understanding. A novel Human Perception Based Color Image Segmentation System is presented in this paper. This system uses a neural network architecture whose neurons use a multisigmoid activation function. The multisigmoid activation function is the key to segmentation. The number of steps, i.e., thresholds, in the multisigmoid function depends on the number of clusters in the image. The threshold values for detecting the clusters and their labels are found automatically from the first-order derivative of the histograms of saturation and intensity in the HSI color space. Here the main use of the neural network is to detect the number of objects in an image automatically. It labels the objects with their mean colors. The algorithm is found to be reliable and works satisfactorily on different kinds of color images.
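One common way to build a staircase-shaped activation of the kind described here is to sum several shifted sigmoids, one per threshold. The abstract does not give the authors' exact formula, so the form below, the steepness value and the example thresholds are assumptions used only to show how thresholds map to cluster labels.

```python
import math


def multisigmoid(x, thresholds, steepness=50.0):
    """Sum of shifted sigmoids: rises by about 1 at each threshold.

    This particular form is an assumption, not the paper's definition.
    """
    return sum(1.0 / (1.0 + math.exp(-steepness * (x - t))) for t in thresholds)


def cluster_label(x, thresholds, steepness=50.0):
    """Round the staircase output to the nearest integer cluster index."""
    return round(multisigmoid(x, thresholds, steepness))


# Hypothetical thresholds on a normalized intensity axis in [0, 1];
# two thresholds give three clusters.
thresholds = [0.3, 0.7]
labels = [cluster_label(x, thresholds) for x in (0.1, 0.5, 0.9)]
print(labels)  # [0, 1, 2]
```

With K thresholds the function has K steps and therefore separates K + 1 clusters, which matches the abstract's statement that the number of steps depends on the number of clusters.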

Application of deep learning methods to predict ionosphere parameters in real time

E3S Web of Conferences ◽

10.1051/e3sconf/202019602007 ◽

2020 ◽

Vol 196 ◽

pp. 02007

Author(s):

Vladimir Mochalov ◽

Anastasia Mochalova

Keyword(s):

Neural Network ◽

Deep Learning ◽

Real Time ◽

Network Architecture ◽

Short Term Memory ◽

Neural Network Architecture ◽

Short Term ◽

Learning Methods ◽

Term Memory ◽

Long Short Term Memory

In this paper, the previously obtained results on the recognition of ionograms using deep learning are extended to predict the parameters of the ionosphere. After the ionospheric parameters have been identified on the ionogram using deep learning in real time, we can predict the parameters for some time ahead on the basis of the newly obtained data. Examples of predicting the ionosphere parameters using long short-term memory (LSTM), an artificial recurrent neural network architecture, are given. The place of the block for predicting the parameters of the ionosphere in the system for analyzing ionospheric data using deep learning methods is shown.

Deep learning based cone beam CT reconstruction framework using a cascaded neural network architecture (Conference Presentation)

Medical Imaging 2018: Physics of Medical Imaging ◽

10.1117/12.2293916 ◽

2018 ◽

Cited By ~ 1

Author(s):

Yinsheng Li ◽

Guang-Hong Chen

Keyword(s):

Neural Network ◽

Deep Learning ◽

Network Architecture ◽

Cone Beam CT ◽

Cone Beam ◽

CT Reconstruction ◽

Neural Network Architecture

Detection of Suicide Ideation in Social Media Forums Using Deep Learning

Algorithms ◽

10.3390/a13010007 ◽

2019 ◽

Vol 13 (1) ◽

pp. 7 ◽

Cited By ~ 4

Author(s):

Michael Mesfin Tadesse ◽

Hongfei Lin ◽

Bo Xu ◽

Liang Yang

Keyword(s):

Social Media ◽

Deep Learning ◽

Network Architecture ◽

Suicide Ideation ◽

Suicide Risk Assessment ◽

Neural Network Architecture ◽

Combined Model ◽

Ongoing Work ◽

Classification Tasks ◽

Learning Architectures

Suicide ideation expressed in social media has an impact on language usage. Many at-risk individuals use social forum platforms to discuss their problems or get access to information on similar tasks. The key objective of our study is to present ongoing work on the automatic recognition of suicidal posts. We address the early detection of suicide ideation through deep learning and machine learning-based classification approaches applied to Reddit social media. For this purpose, we employ an LSTM-CNN combined model and evaluate it against other classification models. Our experiment shows that the combined neural network architecture with word embedding techniques can achieve the best relevance classification results. Additionally, our results support the strength and ability of deep learning architectures to build an effective model for suicide risk assessment in various text classification tasks.
