Convergence analysis of the deep neural networks based globalized dual heuristic programming

Deep neural networks have been used in various machine learning applications and achieved tremendous empirical successes. However, training deep neural networks is a challenging task. Many alternatives have been proposed in place of end-to-end back-propagation. Layer-wise training is one of them, which trains a single layer at a time, rather than trains the whole layers simultaneously. In this paper, we study a layer-wise training using a block coordinate gradient descent (BCGD) for deep linear networks. We establish a general convergence analysis of BCGD and found the optimal learning rate, which results in the fastest decrease in the loss. We identify the effects of depth, width, and initialization. When the orthogonal-like initialization is employed, we show that the width of intermediate layers plays no role in gradient-based training beyond a certain threshold. Besides, we found that the use of deep networks could drastically accelerate convergence when it is compared to those of a depth 1 network, even when the computational cost is considered. Numerical examples are provided to justify our theoretical findings and demonstrate the performance of layer-wise training by BCGD.

Download Full-text

Deep Neural Networks Algorithms for Stochastic Control Problems on Finite Horizon: Convergence Analysis

SIAM Journal on Numerical Analysis ◽

10.1137/20m1316640 ◽

2021 ◽

Vol 59 (1) ◽

pp. 525-557

Author(s):

Côme Huré ◽

Huyên Pham ◽

Achref Bachouch ◽

Nicolas Langrené

Keyword(s):

Neural Networks ◽

Convergence Analysis ◽

Stochastic Control ◽

Deep Neural Networks ◽

Finite Horizon ◽

Control Problems ◽

Stochastic Control Problems

Download Full-text

Convergence Analysis of PSO for Hyper-Parameter Selection in Deep Neural Networks

Advances on P2P, Parallel, Grid, Cloud and Internet Computing - Lecture Notes on Data Engineering and Communications Technologies ◽

10.1007/978-3-319-69835-9_27 ◽

2017 ◽

pp. 284-295 ◽

Cited By ~ 1

Author(s):

Jakub Nalepa ◽

Pablo Ribalta Lorenzo

Keyword(s):

Neural Networks ◽

Convergence Analysis ◽

Deep Neural Networks ◽

Parameter Selection

Download Full-text

Deep neural networks trained with heavier data augmentation learn features closer to representations in hIT

10.32470/ccn.2018.1046-0 ◽

2018 ◽

Cited By ~ 1

Author(s):

Alex Hernández-García ◽

Johannes Mehrer ◽

Nikolaus Kriegeskorte ◽

Peter König ◽

Tim C. Kietzmann

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Data Augmentation

Download Full-text

Representation of adversarial images in deep neural networks and the human brain

10.32470/ccn.2018.1066-0 ◽

2018 ◽

Author(s):

Chi Zhang ◽

Xiaohan Duan ◽

Ruyuan Zhang ◽

Li Tong

Keyword(s):

Neural Networks ◽

Human Brain ◽

Deep Neural Networks

Download Full-text

Study on Intelligent Security Camera Systems for the Automated Detection of Nighttime Snatching Incidents using Deep Neural Networks

IEEJ Transactions on Industry Applications ◽

10.1541/ieejias.136.727 ◽

2016 ◽

Vol 136 (10) ◽

pp. 727-734 ◽

Cited By ~ 2

Author(s):

Itaru Nagayama ◽

Akira Miyahara ◽

Koichi Shimabukuro

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Automated Detection ◽

Camera Systems ◽

Intelligent Security

Download Full-text

Memory Requirement Reduction of Deep Neural Networks for Field Programmable Gate Arrays Using Low-Bit Quantization of Parameters

2020 28th European Signal Processing Conference (EUSIPCO) ◽

10.23919/eusipco47968.2020.9287739 ◽

2021 ◽

Author(s):

Niccolo Nicodemo ◽

Gaurav Naithani ◽

Konstantinos Drossos ◽

Tuomas Virtanen ◽

Roberto Saletti

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Field Programmable Gate Arrays ◽

Memory Requirement ◽

Gate Arrays ◽

Field Programmable ◽

Programmable Gate Arrays

Download Full-text

The Use of Deep Neural Networks to Predict Post-liver Transplant Mortality

10.26226/morressier.58eb975cd462b80290b4f9e3 ◽

2017 ◽

Author(s):

Brent Ershoff

Keyword(s):

Neural Networks ◽

Liver Transplant ◽

Deep Neural Networks

Download Full-text

Financial Market Prediction and Improving the Performance Based on Large-scale Exogenous Variables and Deep Neural Networks

Korean Institute of Smart Media ◽

10.30693/smj.2020.9.4.26 ◽

2020 ◽

Vol 9 (4) ◽

pp. 26-35

Author(s):

Sung Gil Cheon ◽

Ju Hong Lee ◽

Bum Ghi Choi ◽

Jae Won Song

Keyword(s):

Neural Networks ◽

Financial Market ◽

Large Scale ◽

Deep Neural Networks ◽

Exogenous Variables

Download Full-text

Faculty Opinions recommendation of Comparison of deep neural networks to spatio-temporal cortical dynamics of human visual object recognition reveals hierarchical correspondence.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.726413891.793534418 ◽

2017 ◽

Author(s):

Odelia Schwartz

Keyword(s):

Neural Networks ◽

Object Recognition ◽

Deep Neural Networks ◽

Visual Object ◽

Visual Object Recognition ◽

Cortical Dynamics ◽

Spatio Temporal

Download Full-text