algorithmic stability
Recently Published Documents

TOTAL DOCUMENTS: 19 (FIVE YEARS: 4)
H-INDEX: 5 (FIVE YEARS: 0)

Author(s): Puyu Wang, Liang Wu, Yunwen Lei

Randomized coordinate descent (RCD) is a popular optimization algorithm with wide applications in machine learning, which has motivated extensive theoretical analysis of its convergence behavior. In contrast, there is no work studying how the models trained by RCD generalize to test examples. In this paper, we initiate the generalization analysis of RCD by leveraging the powerful tool of algorithmic stability. We establish argument stability bounds of RCD for both convex and strongly convex objectives, from which we develop optimal generalization bounds by showing how to early-stop the algorithm to trade off estimation and optimization errors. Our analysis shows that RCD enjoys better stability than stochastic gradient descent.
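For readers unfamiliar with the algorithm under study, the following is a minimal illustrative sketch (not the paper's code or analysis): plain randomized coordinate descent on a ridge-regularized least-squares objective, where the iteration budget T is an assumed stand-in for the early-stopping rule discussed above.

```python
# Illustrative sketch only: randomized coordinate descent (RCD) on a
# ridge-regularized least-squares objective with an early-stopping budget T.
import numpy as np

def rcd_least_squares(X, y, lam=0.1, T=1000, seed=0):
    """Minimize (1/2n)||Xw - y||^2 + (lam/2)||w||^2 one coordinate at a time."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    w = np.zeros(d)
    # Per-coordinate Lipschitz constants of the gradient: L_j = ||X[:, j]||^2 / n + lam
    L = (X ** 2).sum(axis=0) / n + lam
    for _ in range(T):                      # T acts as the early-stopping budget
        j = rng.integers(d)                 # pick a coordinate uniformly at random
        grad_j = X[:, j] @ (X @ w - y) / n + lam * w[j]
        w[j] -= grad_j / L[j]               # coordinate step with step size 1/L_j
    return w

# Usage: w_hat = rcd_least_squares(np.random.randn(200, 10), np.random.randn(200))
```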


Author(s): Waleed Mustafa, Yunwen Lei, Antoine Ledent, Marius Kloft

In machine learning we often encounter structured output prediction problems (SOPPs), i.e., problems where the output space admits a rich internal structure. Application domains where SOPPs naturally occur include natural language processing, speech recognition, and computer vision. Typical SOPPs have an extremely large label set, which grows exponentially as a function of the size of the output. Existing generalization analysis implies generalization bounds with at least a square-root dependency on the cardinality d of the label set, which can be vacuous in practice. In this paper, we significantly improve the state of the art by developing novel high-probability bounds with a logarithmic dependency on d. Furthermore, we leverage the lens of algorithmic stability to develop generalization bounds in expectation without any dependency on d. Our results therefore build a solid theoretical foundation for learning in large-scale SOPPs. Finally, we extend our results to learning with weakly dependent data.
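The contrast between the dependency regimes mentioned above can be written schematically as follows; the complexity term C and the exact rates are placeholders for illustration, not the paper's stated bounds.

```latex
% Schematic only: illustrative dependence on the label-set cardinality d,
% with n the sample size and C an unspecified complexity term (placeholders).
\[
\underbrace{\mathcal{O}\!\Big(\sqrt{d}\,\cdot\tfrac{C}{\sqrt{n}}\Big)}_{\text{prior bounds}}
\qquad\text{vs.}\qquad
\underbrace{\mathcal{O}\!\Big(\log d\,\cdot\tfrac{C}{\sqrt{n}}\Big)}_{\text{high-probability bounds here}}
\qquad\text{vs.}\qquad
\underbrace{\mathcal{O}\!\Big(\tfrac{C}{\sqrt{n}}\Big)}_{\text{in expectation, via stability}}
\]
```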


2021, pp. STOC16-377-STOC16-405
Author(s): Raef Bassily, Kobbi Nissim, Adam Smith, Thomas Steinke, Uri Stemmer, ...

Author(s): R.A. Klestov, A.V. Klyuev, V.Yu. Stolbov, ...

The paper investigates how splitting the data for training a neural network into training and test sets in various proportions, and how the quality of the data distribution and of its annotation, affect the resulting neural network model. Specifically, it studies the algorithmic stability of training a deep neural network in problems of recognizing the microstructure of materials; studying the stability of the learning process makes it possible to estimate the performance of a neural network model on incomplete data distorted by up to 10%. Purpose. To study the stability of the learning process of a neural network in the classification of microstructures of functional materials. Materials and methods. Artificial neural networks are the main instrument of the study; several types of deep convolutional networks are used, such as VGG and ResNet, trained with an improved backpropagation method. The studied model is the frozen state of the neural network after a fixed number of training epochs. The data excluded from training was drawn at random for each class, in five different distributions. Results. The learning process of the neural network was investigated in experiments in which the amount of input data was gradually reduced in steps of 2 percent and the resulting distortion of the classification results was measured. The model was found to lose stability at a deviation of 10 percent. Conclusion. The results mean that, beyond this quantitative or qualitative deviation in the training or test set, the results obtained by training the network can hardly be trusted. Although the results apply to a particular case, i.e., microstructure recognition with ResNet-152, the authors propose a simpler technique for studying the stability of deep neural networks based on analyzing the test set rather than the training set.
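A minimal sketch of this kind of data-reduction experiment, assuming a PyTorch/torchvision setup: the 2% step, the 10% range, and ResNet-152 come from the abstract, while the dataset objects, hyperparameters, and helper names are hypothetical, and the per-class stratified exclusion described in the abstract is simplified to uniform random exclusion.

```python
# Sketch only, not the authors' code: probe training stability by retraining on
# progressively reduced subsets (2% steps) and tracking how far test accuracy
# drifts from the full-data baseline. `full_train_set` and `test_loader` are
# assumed to be provided elsewhere.
import random
import torch
from torch.utils.data import DataLoader, Subset
from torchvision.models import resnet152  # older torchvision: pretrained=False instead of weights=None

def train_and_eval(train_set, test_loader, num_classes, epochs=10, device="cuda"):
    model = resnet152(weights=None, num_classes=num_classes).to(device)
    opt = torch.optim.SGD(model.parameters(), lr=0.01, momentum=0.9)
    loss_fn = torch.nn.CrossEntropyLoss()
    loader = DataLoader(train_set, batch_size=32, shuffle=True)
    model.train()
    for _ in range(epochs):
        for x, y in loader:
            opt.zero_grad()
            loss_fn(model(x.to(device)), y.to(device)).backward()
            opt.step()
    model.eval()
    correct = total = 0
    with torch.no_grad():
        for x, y in test_loader:
            correct += (model(x.to(device)).argmax(1).cpu() == y).sum().item()
            total += y.numel()
    return correct / total

def stability_curve(full_train_set, test_loader, num_classes, max_drop=0.10, step=0.02):
    baseline = train_and_eval(full_train_set, test_loader, num_classes)
    n = len(full_train_set)
    curve = []
    for k in range(1, int(round(max_drop / step)) + 1):
        drop = k * step
        keep = random.sample(range(n), int(n * (1 - drop)))  # random exclusion of `drop` of the data
        acc = train_and_eval(Subset(full_train_set, keep), test_loader, num_classes)
        curve.append((drop, baseline - acc))   # deviation from the full-data baseline
    return curve
```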


2020, pp. 18-28
Author(s): Andrei Kliuev, Roman Klestov, Valerii Stolbov

The paper investigates the algorithmic stability of training a deep neural network in problems of recognizing the microstructure of materials. It is shown that at an 8% quantitative deviation in the base test set, the trained network loses stability. This means that with such a quantitative or qualitative deviation in the training or test set, the results obtained with the trained network can hardly be trusted. Although the results of this study apply to a particular case, i.e., microstructure recognition with ResNet-152, the authors propose a cheaper method for studying stability based on analyzing the test set rather than the training set.
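One plausible, hedged reading of such a test-set-based probe (the abstract does not spell out the procedure, and the helper name and interface below are hypothetical): evaluate a single frozen model on several randomly reduced test sets and use the spread of accuracies as a stability indicator, avoiding repeated retraining.

```python
# Sketch only: an assumed interpretation of a "cheaper" test-set-based stability probe.
# Reuse one trained (frozen) model and measure how its accuracy fluctuates across
# randomly reduced test sets; large spread suggests an unstable evaluation.
import random
from statistics import mean, pstdev

def test_set_stability(model_accuracy_fn, test_indices, drop=0.08, trials=5, seed=0):
    """model_accuracy_fn(indices) -> accuracy of the frozen model on that subset."""
    rng = random.Random(seed)
    n = len(test_indices)
    accs = []
    for _ in range(trials):
        keep = rng.sample(test_indices, int(n * (1 - drop)))  # drop e.g. 8% of the test set
        accs.append(model_accuracy_fn(keep))
    return mean(accs), pstdev(accs)  # mean accuracy and its spread across trials
```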


Author(s): Yasutaka Furusho, Kazushi Ikeda

Deep neural networks (DNNs) have the same structure as the neocognitron proposed in 1979 but much better performance, because DNNs incorporate many heuristic techniques such as pre-training, dropout, skip connections, batch normalization (BN), and stochastic depth. However, the reason why these techniques improve performance is not fully understood. Recently, two tools for theoretical analysis have been proposed. One evaluates the generalization gap, defined as the difference between the expected loss and the empirical loss, by calculating the algorithmic stability; the other evaluates the convergence rate by calculating the eigenvalues of the Fisher information matrix of DNNs. This overview paper briefly introduces these tools and shows their usefulness by explaining why skip connections and BN improve performance.
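For reference, the first tool rests on the classical relation between uniform stability and the generalization gap; the definitions below are the standard ones, not the paper's specific statements.

```latex
% Standard definitions: A is the learning algorithm, S = (z_1,...,z_n) a sample,
% S^{(i)} the sample with the i-th point replaced, and l the loss function.
\[
\mathrm{gen}(A,S) \;=\;
\underbrace{\mathbb{E}_{z}\big[\ell(A(S),z)\big]}_{\text{expected loss}}
\;-\;
\underbrace{\frac{1}{n}\sum_{i=1}^{n}\ell(A(S),z_i)}_{\text{empirical loss}},
\]
\[
\sup_{S,\,S^{(i)},\,z}\big|\ell(A(S),z)-\ell(A(S^{(i)}),z)\big|\;\le\;\beta
\quad\Longrightarrow\quad
\big|\mathbb{E}_S[\mathrm{gen}(A,S)]\big|\;\le\;\beta .
\]
```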


Author(s): Raef Bassily, Kobbi Nissim, Adam Smith, Thomas Steinke, Uri Stemmer, ...
