Co-Training for Visual Object Recognition Based on Self-Supervised Models Using a Cross-Entropy Regularization

Gabriel Díaz; Billy Peralta; Luis Caro; Orietta Nicolis

doi:10.3390/e23040423

Entropy ◽

10.3390/e23040423 ◽

2021 ◽

Vol 23 (4) ◽

pp. 423

Author(s):

Gabriel Díaz ◽

Billy Peralta ◽

Luis Caro ◽

Orietta Nicolis

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Object Recognition ◽

Training Model ◽

Cross Entropy ◽

Visual Object ◽

Visual Object Recognition ◽

Visual Objects ◽

Learning Techniques ◽

Proposed Model

Automatic recognition of visual objects using a deep learning approach has been successfully applied to multiple areas. However, deep learning techniques require a large amount of labeled data, which is usually expensive to obtain. An alternative is to use semi-supervised models, such as co-training, where multiple complementary views are combined using a small amount of labeled data. A simple way to associate views with visual objects is through the application of a degree of rotation or a type of filter. In this work, we propose a co-training model for visual object recognition using deep neural networks by adding layers of self-supervised neural networks as intermediate inputs to the views, where the views are diversified through the cross-entropy regularization of their outputs. Since the model merges the concepts of co-training and self-supervised learning by considering the differentiation of outputs, we call it Differential Self-Supervised Co-Training (DSSCo-Training). This paper presents experiments applying the DSSCo-Training model to well-known image datasets such as MNIST, CIFAR-100, and SVHN. The results indicate that the proposed model is competitive with state-of-the-art models and shows an average relative improvement of 5% in accuracy for several datasets, despite its greater simplicity with respect to more recent approaches.
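The idea of diversifying two views through a cross-entropy term on their outputs can be illustrated with a minimal sketch. The abstract does not give the loss, so everything here is an assumption made for illustration: the function names, the weight `lam`, and the choice to subtract the cross-entropy between the two views' predictions from the supervised loss. It is not the authors' formulation.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(p, q, eps=1e-12):
    """H(p, q) = -sum_i p_i * log(q_i)."""
    return -sum(pi * math.log(qi + eps) for pi, qi in zip(p, q))

def one_hot(label, n):
    return [1.0 if i == label else 0.0 for i in range(n)]

def co_training_loss(logits_a, logits_b, label, lam=0.1):
    """Supervised loss of two views minus a diversity bonus.

    ASSUMPTION: the diversity term is the cross-entropy between the two
    views' predicted distributions, subtracted with weight `lam` so that
    differing outputs lower the loss. The paper may define it differently.
    """
    pa, pb = softmax(logits_a), softmax(logits_b)
    target = one_hot(label, len(pa))
    supervised = cross_entropy(target, pa) + cross_entropy(target, pb)
    diversity = cross_entropy(pa, pb)
    return supervised - lam * diversity

# Two views that disagree on a 3-class example whose true label is 0.
print(co_training_loss([2.0, 0.0, 0.0], [0.0, 2.0, 0.0], label=0))
```

With `lam=0.0` the function reduces to the sum of the two supervised cross-entropies, which makes the effect of the regularization weight easy to inspect in isolation.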

Faculty Opinions recommendation of Comparison of deep neural networks to spatio-temporal cortical dynamics of human visual object recognition reveals hierarchical correspondence.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.726413891.793534418 ◽

2017 ◽

Author(s):

Odelia Schwartz

Keyword(s):

Neural Networks ◽

Object Recognition ◽

Deep Neural Networks ◽

Visual Object ◽

Visual Object Recognition ◽

Cortical Dynamics ◽

Spatio Temporal

Stock Pattern Classification from Charts using Deep Learning Algorithms

Academic Perspective Procedia ◽

10.33793/acperpro.03.01.89 ◽

2020 ◽

Vol 3 (1) ◽

pp. 445-454

Author(s):

Celal Buğra Kaya ◽

Alperen Yılmaz ◽

Gizem Nur Uzun ◽

Zeynep Hilal Kilimci

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Convolutional Neural Networks ◽

Pattern Classification ◽

Short Term Memory ◽

Stock Exchange ◽

Learning Techniques ◽

Proposed Model ◽

Istanbul Stock Exchange

Pattern classification concerns the automatic discovery of regularities in a dataset through various learning techniques, so that objects can be assigned to a set of categories or classes. This study evaluates deep learning methodologies for the classification of stock patterns. To classify patterns obtained from stock charts, convolutional neural networks (CNNs), recurrent neural networks (RNNs), and long short-term memory networks (LSTMs) are employed. To demonstrate the efficiency of the proposed model in categorizing patterns, a hand-crafted image dataset is constructed from stock charts of the Istanbul Stock Exchange and the NASDAQ Stock Exchange. Experimental results show that convolutional neural networks achieve superior classification success in recognizing patterns compared to the other deep learning methodologies.

Occluded Visual Object Recognition Using Deep Conditional Generative Adversarial Nets and Feedforward Convolutional Neural Networks

2020 International Conference on Machine Vision and Image Processing (MVIP) ◽

10.1109/mvip49855.2020.9116887 ◽

2020 ◽

Author(s):

Vahid Reza Khazaie ◽

Alireza AkhavanPour ◽

Reza Ebrahimpour

Keyword(s):

Neural Networks ◽

Object Recognition ◽

Convolutional Neural Networks ◽

Visual Object ◽

Visual Object Recognition

Android Smartphone Based Visual Object Recognition for Visually Impaired Using Deep Learning

2018 International Conference on Communication and Signal Processing (ICCSP) ◽

10.1109/iccsp.2018.8524493 ◽

2018 ◽

Cited By ~ 3

Author(s):

Neel Parikh ◽

Ishita Shah ◽

Safvan Vahora

Keyword(s):

Deep Learning ◽

Object Recognition ◽

Visually Impaired ◽

Visual Object ◽

Visual Object Recognition

Fusing bottom-up and top-down pathways in neural networks for visual object recognition

The 2010 International Joint Conference on Neural Networks (IJCNN) ◽

10.1109/ijcnn.2010.5596497 ◽

2010 ◽

Cited By ~ 3

Author(s):

Yuhua Zheng ◽

Yan Meng ◽

Yaochu Jin

Keyword(s):

Neural Networks ◽

Object Recognition ◽

Visual Object ◽

Visual Object Recognition ◽

Top Down ◽

Bottom Up

Comparison of deep neural networks to spatio-temporal cortical dynamics of human visual object recognition reveals hierarchical correspondence

Scientific Reports ◽

10.1038/srep27755 ◽

2016 ◽

Vol 6 (1) ◽

Cited By ~ 233

Author(s):

Radoslaw Martin Cichy ◽

Aditya Khosla ◽

Dimitrios Pantazis ◽

Antonio Torralba ◽

Aude Oliva

Keyword(s):

Neural Networks ◽

Object Recognition ◽

Deep Neural Networks ◽

Visual Object ◽

Visual Object Recognition ◽

Cortical Dynamics ◽

Spatio Temporal

Optimization of FireNet for Liver Lesion Classification

Electronics ◽

10.3390/electronics9081237 ◽

2020 ◽

Vol 9 (8) ◽

pp. 1237

Author(s):

Gedeon Kashala Kabe ◽

Yuqing Song ◽

Zhe Liu

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Liver Lesion ◽

Superior Performance ◽

Visual Object ◽

Visual Object Recognition ◽

Residual Function ◽

Learning Techniques ◽

Model Size

In recent years, deep learning techniques, and in particular convolutional neural networks (CNNs), have demonstrated superior performance in image classification and visual object recognition. In this work, we propose a classification of four types of liver lesions, namely, hepatocellular carcinoma, metastases, hemangiomas, and healthy tissues, using convolutional neural networks with a succinct model called FireNet. We improved classification speed and decreased the model size and the number of parameters by using Fire modules from SqueezeNet. We added bypass connections around the Fire modules to learn a residual function between input and output and to address the vanishing gradient problem. We propose a new Particle Swarm Optimization (NPSO) to optimize the network parameters in order to further boost the performance of the proposed FireNet. The experimental results show that FireNet has 9.5 times fewer parameters than GoogLeNet, 51.6 times fewer than AlexNet, and 75.8 times fewer than ResNet. The model size of FireNet is 16.6 times smaller than GoogLeNet, 75 times smaller than AlexNet, and 76.6 times smaller than ResNet. The final accuracy of our proposed FireNet model was 89.2%.
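To make the parameter savings of a Fire module concrete, the following sketch counts weights and biases for one Fire module (a 1x1 squeeze layer feeding parallel 1x1 and 3x3 expand layers, as in SqueezeNet) and for a plain 3x3 convolution with the same input and output channels. The function names and the channel sizes in the example are illustrative assumptions; they are not taken from the FireNet paper, and the ratios reported in the abstract refer to whole networks, not to a single module.

```python
def conv_params(in_ch, out_ch, kernel):
    """Weights plus biases of a kernel x kernel convolution."""
    return in_ch * out_ch * kernel * kernel + out_ch

def fire_module_params(in_ch, squeeze, expand1x1, expand3x3):
    """Parameter count of one Fire module.

    squeeze:   number of 1x1 filters in the squeeze layer
    expand1x1: number of 1x1 filters in the expand layer
    expand3x3: number of 3x3 filters in the expand layer
    """
    return (
        conv_params(in_ch, squeeze, 1)
        + conv_params(squeeze, expand1x1, 1)
        + conv_params(squeeze, expand3x3, 3)
    )

# Illustrative sizes (assumed): 96 input channels, 16 squeeze filters,
# 64 + 64 expand filters, giving 128 output channels.
fire = fire_module_params(96, 16, 64, 64)
plain = conv_params(96, 128, 3)
print(fire, plain, round(plain / fire, 2))
```

A simple bypass connection adds the module input to its output and introduces no extra parameters, but it requires the input and output channel counts to match.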

Modeling Neurodegeneration in silico With Deep Learning

Frontiers in Neuroinformatics ◽

10.3389/fninf.2021.748370 ◽

2021 ◽

Vol 15 ◽

Author(s):

Anup Tuladhar ◽

Jasmine A. Moore ◽

Zahinoor Ismail ◽

Nils D. Forkert

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Object Recognition ◽

Language Processing ◽

Neural Plasticity ◽

In Silico ◽

Cortical Atrophy ◽

Visual Object ◽

Deep Convolutional Neural Networks ◽

The Brain

Deep neural networks, inspired by information processing in the brain, can achieve human-like performance for various tasks. However, research efforts to use these networks as models of the brain have so far focused primarily on modeling healthy brain function. In this work, we propose a paradigm for modeling neural diseases in silico with deep learning and demonstrate its use in modeling posterior cortical atrophy (PCA), an atypical form of Alzheimer’s disease affecting the visual cortex. We simulated PCA in deep convolutional neural networks (DCNNs) trained for visual object recognition by randomly injuring connections between artificial neurons. Results showed that injured networks progressively lost their object recognition capability. Simulated PCA impacted learned representations hierarchically, as networks lost object-level representations before category-level representations. Incorporating this paradigm in computational neuroscience will be essential for developing in silico models of the brain and neurological diseases. The paradigm can be expanded to incorporate elements of neural plasticity and extended to other cognitive domains such as motor control, auditory cognition, language processing, and decision making.
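Randomly injuring connections can be sketched as zeroing a random subset of weights in a layer. The abstract does not specify the injury procedure, so the function name `injure_connections`, the choice to set injured weights to exactly zero, and the `fraction` and `seed` parameters are all assumptions for illustration only.

```python
import random

def injure_connections(weights, fraction, seed=0):
    """Return a copy of a 2-D weight matrix with a random fraction zeroed.

    ASSUMPTION: an injured connection is modeled as a weight set to 0.0.
    The input matrix is left unchanged.
    """
    rng = random.Random(seed)
    positions = [(r, c) for r, row in enumerate(weights) for c in range(len(row))]
    n_injured = round(fraction * len(positions))
    injured = set(rng.sample(positions, n_injured))
    return [
        [0.0 if (r, c) in injured else w for c, w in enumerate(row)]
        for r, row in enumerate(weights)
    ]

# A toy 4 x 5 layer with every connection intact.
layer = [[1.0] * 5 for _ in range(4)]
damaged = injure_connections(layer, fraction=0.25, seed=1)
print(sum(w == 0.0 for row in damaged for w in row), "of 20 connections injured")
```

Calling the function with increasing values of `fraction` and re-evaluating the network after each call would give one simple way to trace how recognition degrades as damage accumulates.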

Deep Convolutional Neural Networks Based on Image Data Augmentation for Visual Object Recognition

Intelligent Data Engineering and Automated Learning – IDEAL 2019 - Lecture Notes in Computer Science ◽

10.1007/978-3-030-33607-3_51 ◽

2019 ◽

pp. 476-485

Author(s):

Khaoula Jayech

Keyword(s):

Neural Networks ◽

Object Recognition ◽

Convolutional Neural Networks ◽

Data Augmentation ◽

Image Data ◽

Visual Object ◽

Visual Object Recognition ◽

Deep Convolutional Neural Networks

Recurrent convolutional neural networks: a better model of biological object recognition

10.1101/133330 ◽

2017 ◽

Cited By ~ 3

Author(s):

Courtney J. Spoerer ◽

Patrick McClure ◽

Nikolaus Kriegeskorte

Keyword(s):

Neural Networks ◽

Object Recognition ◽

Convolutional Neural Networks ◽

Recurrent Neural Networks ◽

Feedforward Control ◽

Recognition Performance ◽

Feedforward Neural Networks ◽

Visual Object ◽

Visual Object Recognition ◽

Feedback Connections

Feedforward neural networks provide the dominant model of how the brain performs visual object recognition. However, these networks lack the lateral and feedback connections, and the resulting recurrent neuronal dynamics, of the ventral visual pathway in the human and nonhuman primate brain. Here we investigate recurrent convolutional neural networks with bottom-up (B), lateral (L), and top-down (T) connections. Combining these types of connections yields four architectures (B, BT, BL, and BLT), which we systematically test and compare. We hypothesized that recurrent dynamics might improve recognition performance in the challenging scenario of partial occlusion. We introduce two novel occluded object recognition tasks to test the efficacy of the models, digit clutter (where multiple target digits occlude one another) and digit debris (where target digits are occluded by digit fragments). We find that recurrent neural networks outperform feedforward control models (approximately matched in parametric complexity) at recognising objects, both in the absence of occlusion and in all occlusion conditions. Recurrent networks were also found to be more robust to the inclusion of additive Gaussian noise. Recurrent neural networks are better in two respects: (1) they are more neurobiologically realistic than their feedforward counterparts; (2) they are better in terms of their ability to recognise objects, especially under challenging conditions. This work shows that computer vision can benefit from using recurrent convolutional architectures and suggests that the ubiquitous recurrent connections in biological brains are essential for task performance.
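The four connection schemes (B, BT, BL, BLT) can be illustrated with a toy network unrolled over time. This is a deliberately reduced sketch, not the model from the paper: each "layer" is a single scalar unit rather than a convolutional feature map, and the function `run_blt` and its weight arguments are hypothetical names introduced here. Setting the lateral and top-down weights to zero recovers the purely bottom-up (B) case.

```python
def relu(x):
    return max(0.0, x)

def run_blt(x, steps, b=(1.0, 1.0), l=(0.0, 0.0), t=0.0):
    """Unroll a two-layer scalar network for `steps` time steps.

    b: bottom-up weights (input -> layer 1, layer 1 -> layer 2)
    l: lateral weights (each layer's own state at the previous step)
    t: top-down weight (layer 2 at the previous step -> layer 1)
    Returns the layer-2 activation at every time step.
    """
    h1, h2 = 0.0, 0.0
    outputs = []
    for _ in range(steps):
        new_h1 = relu(b[0] * x + l[0] * h1 + t * h2)
        new_h2 = relu(b[1] * new_h1 + l[1] * h2)
        h1, h2 = new_h1, new_h2
        outputs.append(h2)
    return outputs

print("B:  ", run_blt(1.0, 3))
print("BL: ", run_blt(1.0, 3, l=(0.5, 0.0)))
print("BT: ", run_blt(1.0, 3, t=0.5))
print("BLT:", run_blt(1.0, 3, l=(0.5, 0.0), t=0.5))
```

In the B case the output is identical at every step, while any lateral or top-down weight makes the response depend on earlier time steps, which is the recurrent dynamics the abstract refers to.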
