A Method for Optimal Division of Data Sets for Use in Neural Networks

Generation of geometric interpolations of building types with deep variational autoencoders

Design Science ◽

10.1017/dsj.2020.31 ◽

2020 ◽

Vol 6 ◽

Author(s):

Jaime de Miguel Rodríguez ◽

Maria Eugenia Villafañe ◽

Luka Piškorec ◽

Fernando Sancho Caparrini

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Large Data ◽

Learning Model ◽

Large Data Sets ◽

Data Sets ◽

Connectivity Map ◽

Data Set ◽

3D Objects ◽

Machine Learning Model

Abstract This work presents a methodology for the generation of novel 3D objects resembling wireframes of building types. These result from the reconstruction of interpolated locations within the learnt distribution of variational autoencoders (VAEs), a deep generative machine learning model based on neural networks. The data set used features a scheme for geometry representation based on a ‘connectivity map’ that is especially suited to express the wireframe objects that compose it. Additionally, the input samples are generated through ‘parametric augmentation’, a strategy proposed in this study that creates coherent variations among data by enabling a set of parameters to alter representative features on a given building type. In the experiments that are described in this paper, more than 150 k input samples belonging to two building types have been processed during the training of a VAE model. The main contribution of this paper has been to explore parametric augmentation for the generation of large data sets of 3D geometries, showcasing its problems and limitations in the context of neural networks and VAEs. Results show that the generation of interpolated hybrid geometries is a challenging task. Despite the difficulty of the endeavour, promising advances are presented.

Download Full-text

Empirical modeling of very large data sets using neural networks

Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium ◽

10.1109/ijcnn.2000.859413 ◽

2000 ◽

Cited By ~ 5

Author(s):

A.J. Owens

Keyword(s):

Neural Networks ◽

Large Data ◽

Empirical Modeling ◽

Large Data Sets ◽

Data Sets

Download Full-text

Uncertainty-Aware Deep Classifiers Using Generative Models

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i04.6015 ◽

2020 ◽

Vol 34 (04) ◽

pp. 5620-5627 ◽

Cited By ~ 1

Author(s):

Murat Sensoy ◽

Lance Kaplan ◽

Federico Cerutti ◽

Maryam Saleki

Keyword(s):

Neural Networks ◽

Epistemic Uncertainty ◽

Feature Space ◽

Generative Models ◽

Detection Methods ◽

Generative Adversarial Networks ◽

Data Sets ◽

Bayesian Approaches ◽

Data Set ◽

Auxiliary Data

Deep neural networks are often ignorant about what they do not know and overconfident when they make uninformed predictions. Some recent approaches quantify classification uncertainty directly by training the model to output high uncertainty for the data samples close to class boundaries or from the outside of the training distribution. These approaches use an auxiliary data set during training to represent out-of-distribution samples. However, selection or creation of such an auxiliary data set is non-trivial, especially for high dimensional data such as images. In this work we develop a novel neural network model that is able to express both aleatoric and epistemic uncertainty to distinguish decision boundary and out-of-distribution regions of the feature space. To this end, variational autoencoders and generative adversarial networks are incorporated to automatically generate out-of-distribution exemplars for training. Through extensive analysis, we demonstrate that the proposed approach provides better estimates of uncertainty for in- and out-of-distribution samples, and adversarial examples on well-known data sets against state-of-the-art approaches including recent Bayesian approaches for neural networks and anomaly detection methods.

Download Full-text

SEGMENTATION OF ULTRASONIC IMAGES WITH NEURAL NETWORKS

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001491000351 ◽

1991 ◽

Vol 05 (04) ◽

pp. 619-628 ◽

Cited By ~ 9

Author(s):

RONALD H. SILVERMAN

Keyword(s):

Neural Networks ◽

Image Processing ◽

Back Propagation ◽

Three Dimensional ◽

Competitive Learning ◽

Data Sets ◽

Image Structure ◽

Ultrasonic Images ◽

Traditional Approaches

Neural networks differ from traditional approaches to image processing in terms of their ability to adapt to regularities in image structure and to self-organize so as to implement directed transformations. Biomedical ultrasonic images are often degraded in quality by noise and other factors, making enhancement techniques particularly important. This paper describes use of back propagation and competitive learning for enhancement and segmentation of ultrasonic images of the eye. Of particular interest is the extension or these technique to segmentation of three-dimensional data sets, where simple thresholding and gradient operations are not entirely successful.

Download Full-text

Reduced Representation by Neural Networks with Restricted Receptive Fields

Neural Computation ◽

10.1162/neco.1995.7.3.507 ◽

1995 ◽

Vol 7 (3) ◽

pp. 507-517 ◽

Cited By ~ 1

Author(s):

Marco Idiart ◽

Barry Berk ◽

L. F. Abbott

Keyword(s):

Neural Networks ◽

Biological Networks ◽

Input Data ◽

Hebbian Learning ◽

Receptive Fields ◽

Principal Component ◽

Data Sets ◽

Reduced Representation ◽

Learning Rules ◽

Reduced Representations

Model neural networks can perform dimensional reductions of input data sets using correlation-based learning rules to adjust their weights. Simple Hebbian learning rules lead to an optimal reduction at the single unit level but result in highly redundant network representations. More complex rules designed to reduce or remove this redundancy can develop optimal principal component representations, but they are not very compelling from a biological perspective. Neurons in biological networks have restricted receptive fields limiting their access to the input data space. We find that, within this restricted receptive field architecture, simple correlation-based learning rules can produce surprisingly efficient reduced representations. When noise is present, the size of the receptive fields can be optimally tuned to maximize the accuracy of reconstructions of input data from a reduced representation.

Download Full-text

Symbiotic coevolution of artificial neural networks and training data sets

Lecture Notes in Computer Science - Parallel Problem Solving from Nature — PPSN V ◽

10.1007/bfb0056893 ◽

1998 ◽

pp. 511-520 ◽

Cited By ~ 9

Author(s):

Helmut A. Mayer

Keyword(s):

Neural Networks ◽

Artificial Neural Networks ◽

Training Data ◽

Data Sets ◽

Artificial Neural ◽

And Training

Download Full-text

Simple Convolutional-Based Models: Are They Learning the Task or the Data?

Neural Computation ◽

10.1162/neco_a_01446 ◽

2021 ◽

pp. 1-17

Author(s):

Luis Sa-Couto ◽

Andreas Wichert

Keyword(s):

Neural Networks ◽

Pattern Recognition ◽

Training Data ◽

Model Complexity ◽

Data Sets ◽

Simple Task ◽

Data Set ◽

Knowing That ◽

Handwritten Digit ◽

End To End

Abstract Convolutional neural networks (CNNs) evolved from Fukushima's neocognitron model, which is based on the ideas of Hubel and Wiesel about the early stages of the visual cortex. Unlike other branches of neocognitron-based models, the typical CNN is based on end-to-end supervised learning by backpropagation and removes the focus from built-in invariance mechanisms, using pooling not as a way to tolerate small shifts but as a regularization tool that decreases model complexity. These properties of end-to-end supervision and flexibility of structure allow the typical CNN to become highly tuned to the training data, leading to extremely high accuracies on typical visual pattern recognition data sets. However, in this work, we hypothesize that there is a flip side to this capability, a hidden overfitting. More concretely, a supervised, backpropagation based CNN will outperform a neocognitron/map transformation cascade (MTCCXC) when trained and tested inside the same data set. Yet if we take both models trained and test them on the same task but on another data set (without retraining), the overfitting appears. Other neocognitron descendants like the What-Where model go in a different direction. In these models, learning remains unsupervised, but more structure is added to capture invariance to typical changes. Knowing that, we further hypothesize that if we repeat the same experiments with this model, the lack of supervision may make it worse than the typical CNN inside the same data set, but the added structure will make it generalize even better to another one. To put our hypothesis to the test, we choose the simple task of handwritten digit classification and take two well-known data sets of it: MNIST and ETL-1. To try to make the two data sets as similar as possible, we experiment with several types of preprocessing. However, regardless of the type in question, the results align exactly with expectation.

Download Full-text

Convolutional Neural Networks for Scientific Images and Other Large Data Sets

10.1007/978-3-030-70388-2_6 ◽

2021 ◽

pp. 149-172

Author(s):

Ryan G. McClarren

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Large Data ◽

Large Data Sets ◽

Data Sets ◽

Scientific Images

Download Full-text

SVM-Based Deep Stacking Networks

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33015273 ◽

2019 ◽

Vol 33 ◽

pp. 5273-5280 ◽

Cited By ~ 2

Author(s):

Jingyuan Wang ◽

Kai Feng ◽

Junjie Wu

Keyword(s):

Neural Networks ◽

High Performance ◽

Optimal Solution ◽

Data Sets ◽

Complex Data ◽

Layer By Layer ◽

Deep Network ◽

Local View ◽

Linear Svm ◽

Network Approaches

The deep network model, with the majority built on neural networks, has been proved to be a powerful framework to represent complex data for high performance machine learning. In recent years, more and more studies turn to nonneural network approaches to build diverse deep structures, and the Deep Stacking Network (DSN) model is one of such approaches that uses stacked easy-to-learn blocks to build a parameter-training-parallelizable deep network. In this paper, we propose a novel SVM-based Deep Stacking Network (SVM-DSN), which uses the DSN architecture to organize linear SVM classifiers for deep learning. A BP-like layer tuning scheme is also proposed to ensure holistic and local optimizations of stacked SVMs simultaneously. Some good math properties of SVM, such as the convex optimization, is introduced into the DSN framework by our model. From a global view, SVM-DSN can iteratively extract data representations layer by layer as a deep neural network but with parallelizability, and from a local view, each stacked SVM can converge to its optimal solution and obtain the support vectors, which compared with neural networks could lead to interesting improvements in anti-saturation and interpretability. Experimental results on both image and text data sets demonstrate the excellent performances of SVM-DSN compared with some competitive benchmark models.

Download Full-text

Biomedical Data Mining Using RBF Neural Networks

Encyclopedia of Data Warehousing and Mining ◽

10.4018/978-1-59140-557-3.ch021 ◽

2011 ◽

pp. 106-111

Author(s):

Fang Chu ◽

Lipo Wang

Keyword(s):

Neural Networks ◽

Microarray Data ◽

High Accuracy ◽

Accurate Diagnosis ◽

Data Sets ◽

Biomedical Data ◽

Rbf Neural Networks ◽

Proper Treatment ◽

Pattern Recognition Problem ◽

Very High

Accurate diagnosis of cancers is of great importance for doctors to choose a proper treatment. Furthermore, it also plays a key role in the searching for the pathology of cancers and drug discovery. Recently, this problem attracts great attention in the context of microarray technology. Here, we apply radial basis function (RBF) neural networks to this pattern recognition problem. Our experimental results in some well-known microarray data sets indicate that our method can obtain very high accuracy with a small number of genes.

Download Full-text