A machine learning method for generation of a neural network architecture: a continuous ID3 algorithm

K.J. Cios; N. Liu

doi:10.1109/72.125869

Determination of thermodynamic state variables of liquids from its microscopic structure using an artificial neural network

Soft Matter ◽

10.1039/d0sm02127j ◽

2020 ◽

Author(s):

Ulices Que-Salinas ◽

Pedro Ezequiel Ramirez-Gonzalez ◽

Alexis Torres-Carbajal

Keyword(s):

Neural Network ◽

Machine Learning ◽

Artificial Neural Network ◽

Distribution Function ◽

Microscopic Structure ◽

Machine Learning Method ◽

Learning Method ◽

State Variables ◽

Thermodynamic State

In this work we implement a machine learning method to predict the thermodynamic state of a liquid using only its microscopic structure provided by the radial distribution function (RDF). The...

Download Full-text

Designing deep neural networks for continual learning in an open world

10.21248/gups.62487 ◽

2021 ◽

Author(s):

◽

Martin Mundt

Keyword(s):

Neural Network ◽

Machine Learning ◽

Deep Learning ◽

Network Architecture ◽

Neural Network Training ◽

Neural Network Architecture ◽

Neural Architecture ◽

Network Training ◽

Classification Tasks ◽

Continual Learning

Deep learning with neural networks seems to have largely replaced traditional design of computer vision systems. Automated methods to learn a plethora of parameters are now used in favor of previously practiced selection of explicit mathematical operators for a specific task. The entailed promise is that practitioners no longer need to take care of every individual step, but rather focus on gathering big amounts of data for neural network training. As a consequence, both a shift in mindset towards a focus on big datasets, as well as a wave of conceivable applications based exclusively on deep learning can be observed. This PhD dissertation aims to uncover some of the only implicitly mentioned or overlooked deep learning aspects, highlight unmentioned assumptions, and finally introduce methods to address respective immediate weaknesses. In the author’s humble opinion, these prevalent shortcomings can be tied to the fact that the involved steps in the machine learning workflow are frequently decoupled. Success is predominantly measured based on accuracy measures designed for evaluation with static benchmark test sets. Individual machine learning workflow components are assessed in isolation with respect to available data, choice of neural network architecture, and a particular learning algorithm, rather than viewing the machine learning system as a whole in context of a particular application. Correspondingly, in this dissertation, three key challenges have been identified: 1. Choice and flexibility of a neural network architecture. 2. Identification and rejection of unseen unknown data to avoid false predictions. 3. Continual learning without forgetting of already learned information. These latter challenges have already been crucial topics in older literature, alas, seem to require a renaissance in modern deep learning literature. Initially, it may appear that they pose independent research questions, however, the thesis posits that the aspects are intertwined and require a joint perspective in machine learning based systems. In summary, the essential question is thus how to pick a suitable neural network architecture for a specific task, how to recognize which data inputs belong to this context, which ones originate from potential other tasks, and ultimately how to continuously include such identified novel data in neural network training over time without overwriting existing knowledge. Thus, the central emphasis of this dissertation is to build on top of existing deep learning strengths, yet also acknowledge mentioned weaknesses, in an effort to establish a deeper understanding of interdependencies and synergies towards the development of unified solution mechanisms. For this purpose, the main portion of the thesis is in cumulative form. The respective publications can be grouped according to the three challenges outlined above. Correspondingly, chapter 1 is focused on choice and extendability of neural network architectures, analyzed in context of popular image classification tasks. An algorithm to automatically determine neural network layer width is introduced and is first contrasted with static architectures found in the literature. The importance of neural architecture design is then further showcased on a real-world application of defect detection in concrete bridges. Chapter 2 is comprised of the complementary ensuing questions of how to identify unknown concepts and subsequently incorporate them into continual learning. A joint central mechanism to distinguish unseen concepts from what is known in classification tasks, while enabling consecutive training without forgetting or revisiting older classes, is proposed. Once more, the role of the chosen neural network architecture is quantitatively reassessed. Finally, chapter 3 culminates in an overarching view, where developed parts are connected. Here, an extensive survey further serves the purpose to embed the gained insights in the broader literature landscape and emphasizes the importance of a common frame of thought. The ultimately presented approach thus reflects the overall thesis’ contribution to advance neural network based machine learning towards a unified solution that ties together choice of neural architecture with the ability to learn continually and the capability to automatically separate known from unknown data.

Download Full-text

A Novel Multi Hidden Layer Convolutional Neural Network for Content Based Image Retrieval

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.c4771.029320 ◽

2020 ◽

Vol 9 (3) ◽

pp. 365-370

Keyword(s):

Neural Network ◽

Machine Learning ◽

Image Retrieval ◽

Convolutional Neural Network ◽

Network Architecture ◽

Learning Algorithm ◽

Machine Learning Algorithms ◽

Content Based Image Retrieval ◽

Neural Network Architecture ◽

Query Image

The applications of a content-based image retrieval system in fields such as multimedia, security, medicine, and entertainment, have been implemented on a huge real-time database by using a convolutional neural network architecture. In general, thus far, content-based image retrieval systems have been implemented with machine learning algorithms. A machine learning algorithm is applicable to a limited database because of the few feature extraction hidden layers between the input and the output layers. The proposed convolutional neural network architecture was successfully implemented using 128 convolutional layers, pooling layers, rectifier linear unit (ReLu), and fully connected layers. A convolutional neural network architecture yields better results of its ability to extract features from an image. The Euclidean distance metric is used for calculating the similarity between the query image and the database images. It is implemented using the COREL database. The proposed system is successfully evaluated using precision, recall, and F-score. The performance of the proposed method is evaluated using the precision and recall.

Download Full-text

Optimizing neural network techniques in classifying Fermi-LAT gamma-ray sources

Monthly Notices of the Royal Astronomical Society ◽

10.1093/mnras/stz2920 ◽

2019 ◽

Vol 490 (4) ◽

pp. 4770-4777 ◽

Cited By ~ 3

Author(s):

M Kovačević ◽

G Chiaro ◽

S Cutini ◽

G Tosti

Keyword(s):

Neural Network ◽

Machine Learning ◽

Gamma Ray ◽

Classification Performance ◽

Previous Method ◽

Machine Learning Method ◽

Learning Method ◽

Large Area ◽

Γ Ray ◽

Automatic Technique

ABSTRACT Machine learning is an automatic technique that is revolutionizing scientific research, with innovative applications and wide use in astrophysics. The aim of this study was to develop an optimized version of an Artificial Neural Network machine learning method for classifying blazar candidates of uncertain type detected by the Fermi Large Area Telescope γ-ray instrument. The final result of this study increased the classification performance by about 80 ${{\ \rm per\ cent}}$ with respect to previous method, leaving only 15 unclassified blazars out of 573 blazar candidates of uncertain type listed in the LAT 4-year Source Catalog.

Download Full-text

An Useful Review on Optical Character Recognition for Smart Era Generation

Multimedia and Sensory Input for Augmented, Mixed, and Virtual Reality - Advances in Computational Intelligence and Robotics ◽

10.4018/978-1-7998-4703-8.ch001 ◽

2021 ◽

pp. 1-41

Author(s):

Abhishek Das ◽

Mihir Narayan Mohanty

Keyword(s):

Neural Network ◽

Machine Learning ◽

Image Processing ◽

Character Recognition ◽

Optical Character Recognition ◽

Deep Neural Network ◽

Machine Learning Method ◽

Learning Method ◽

Result Section ◽

Optical Character

In this chapter, the authors have reviewed on optical character recognition. The study belongs to both typed characters and handwritten character recognition. Online and offline character recognition are two modes of data acquisition in the field of OCR and are also studied. As deep learning is the emerging machine learning method in the field of image processing, the authors have described the method and its application of earlier works. From the study of the recurrent neural network (RNN), a special class of deep neural network is proposed for the recognition purpose. Further, convolutional neural network (CNN) is combined with RNN to check its performance. For this piece of work, Odia numerals and characters are taken as input and well recognized. The efficacy of the proposed method is explained in the result section.

Download Full-text

Combination of Convolutional Neural Network Architecture and its Learning Method for Rotation‐Invariant Handwritten Digit Recognition

IEEJ Transactions on Electrical and Electronic Engineering ◽

10.1002/tee.23278 ◽

2020 ◽

Vol 16 (1) ◽

pp. 161-163

Author(s):

Kazuya Urazoe ◽

Nobutaka Kuroki ◽

Tetsuya Hirose ◽

Masahiro Numa

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Network Architecture ◽

Learning Method ◽

Neural Network Architecture ◽

Rotation Invariant ◽

Handwritten Digit Recognition ◽

Digit Recognition ◽

Handwritten Digit

Download Full-text

ThriftyNets: Convolutional Neural Networks with Tiny Parameter Budget

IoT ◽

10.3390/iot2020012 ◽

2021 ◽

Vol 2 (2) ◽

pp. 222-235

Author(s):

Guillaume Coiffier ◽

Ghouthi Boukli Hacene ◽

Vincent Gripon

Keyword(s):

Neural Network ◽

Machine Learning ◽

Neural Networks ◽

Convolutional Neural Network ◽

Spatial Resolution ◽

Network Architecture ◽

Deep Neural Networks ◽

State Of The Art ◽

Feature Maps ◽

Neural Network Architecture

Deep Neural Networks are state-of-the-art in a large number of challenges in machine learning. However, to reach the best performance they require a huge pool of parameters. Indeed, typical deep convolutional architectures present an increasing number of feature maps as we go deeper in the network, whereas spatial resolution of inputs is decreased through downsampling operations. This means that most of the parameters lay in the final layers, while a large portion of the computations are performed by a small fraction of the total parameters in the first layers. In an effort to use every parameter of a network at its maximum, we propose a new convolutional neural network architecture, called ThriftyNet. In ThriftyNet, only one convolutional layer is defined and used recursively, leading to a maximal parameter factorization. In complement, normalization, non-linearities, downsamplings and shortcut ensure sufficient expressivity of the model. ThriftyNet achieves competitive performance on a tiny parameters budget, exceeding 91% accuracy on CIFAR-10 with less than 40 k parameters in total, 74.3% on CIFAR-100 with less than 600 k parameters, and 67.1% On ImageNet ILSVRC 2012 with no more than 4.15 M parameters. However, the proposed method typically requires more computations than existing counterparts.

Download Full-text

A self-supervised, physics-aware, Bayesian neural network architecture for modelling galaxy emission-line kinematics

Monthly Notices of the Royal Astronomical Society ◽

10.1093/mnras/stab427 ◽

2021 ◽

Author(s):

James M Dawson ◽

Timothy A Davis ◽

Edward L Gomez ◽

Justus Schock

Keyword(s):

Neural Network ◽

Machine Learning ◽

Emission Line ◽

Network Architecture ◽

Large Datasets ◽

Physical Parameters ◽

Bayesian Neural Network ◽

Neural Network Architecture ◽

Bayesian Monte Carlo ◽

Kinematic Modelling

Abstract In the upcoming decades large facilities, such as the SKA, will provide resolved observations of the kinematics of millions of galaxies. In order to assist in the timely exploitation of these vast datasets we blackexplore the use of a self-supervised, physics aware neural network capable of Bayesian kinematic modelling of galaxies. We demonstrate the network’s ability to model the kinematics of cold gas in galaxies with an emphasis on recovering physical parameters and accompanying modelling errors. The model is able to recover rotation curves, inclinations and disc scale lengths for both CO and H i data which match well with those found in the literature. The model is also able to provide modelling errors over learned parameters thanks to the application of quasi-Bayesian Monte-Carlo dropout. This work shows the promising use of machine learning, and in particular self-supervised neural networks, in the context of kinematically modelling galaxies. This work represents the first steps in applying such models for kinematic fitting and we propose that variants of our model would seem especially suitable for enabling emission-line science from upcoming surveys with e.g. the SKA, allowing fast exploitation of these large datasets.

Download Full-text

USE OF DEEP MACHINE LEARNING METHODS OF ARTIFICIAL NEURAL NETWORKS FOR DESIGNING ALGORITHMS OF ELECTROMYOGRAPHY SIGNAL RECOGNITION IN BIONIC PROSTHESIS

Issues of radio electronics ◽

10.21778/2218-5453-2019-5-64-75 ◽

2019 ◽

pp. 64-75

Author(s):

A. A. Yarygin ◽

B. H. Aytbaev ◽

A. Yu. Kanyshev ◽

E. A. Alekseeva

Keyword(s):

Neural Network ◽

Machine Learning ◽

Network Architecture ◽

Skin Surface ◽

Signal Recognition ◽

End User ◽

Neural Network Architecture ◽

Activity Data ◽

The Moment ◽

Electromyography Signals

For sterling application of scientific and engineered achievements in field of bionic prosthesis it’s required to provide comfortable and natural human‑prosthesis interface for an end‑user. In this article we are looking into ways and methods of analysis of the signal collected through electromyography activity of muscles on the skin surface. Such signal is nonstationary and unstable by its nature, dependent on various factors. sEMG based interface has several unsolved problem at the moment, such as insufficient accuracy of recognition and noticeable delay caused by signal recognition and processing. Article is dedicated to application of deep machine learning required to provide decent recognition of electromyography signals. In the course of the research hardware was developed to register muscle activity. Data collecting system and algorithms of gesture recognition have been designed as well. In conclusion decent results were achieved by using convolutional neural network, with two‑dimensional input, since data stream has obvious translational orientation. In the future, modification of neural network architecture, learning algorithms and experiments with structure of data are planned.

Download Full-text

ANALISIS TINGKAT KEBERHASILAN CRYOTERAPY MENGGUNAKAN NEURAL NETWORK

Jurnal Pilar Nusa Mandiri ◽

10.33480/pilar.v15i2.599 ◽

2019 ◽

Vol 15 (2) ◽

pp. 141-148

Author(s):

Sri Rahayu ◽

Fitra Septia Nugraha ◽

Muhammad Ja’far Shidiq

Keyword(s):

Neural Network ◽

Machine Learning ◽

Side Effects ◽

20Th Century ◽

High Accuracy ◽

Machine Learning Method ◽

Learning Method ◽

Good Classification ◽

Auc Value

Human health is very important to always pay attention especially after someone has been declared suffering from an illness that can inhibit positive activities. One of the most feared diseases of the 20th century is cancer. This disease requires treatment that is quite expensive. Alternative treatments are cryotherapy or ice therapy. But cryotherapy also has side effects, it is necessary to do research on its success by taking into account certain conditions of the parameters. So the purpose of this study is to analyze the success of cryotherapy so that the dataset can be used as one of the benchmarks for the success of the cryotherapy tratment method. The method used in this study is the machine learning method of Neural Network with 500 training cycles, learning rate of 0,003 and momentum 0,9 which results in a good classification of obtaining quite high accuracy of 87,78% and AUC value of 0,955.

Download Full-text