Investigation of the One-Armed Bandit by Neural Network

Author(s):  
Frank C. Lin ◽  
Xiaojiang Ju
Keyword(s):  
Processes ◽  
2021 ◽  
Vol 9 (6) ◽  
pp. 919
Author(s):  
Wanlu Jiang ◽  
Chenyang Wang ◽  
Jiayun Zou ◽  
Shuqing Zhang

The field of mechanical fault diagnosis has entered the era of “big data”. However, existing diagnostic algorithms, relying on artificial feature extraction and expert knowledge are of poor extraction ability and lack self-adaptability in the mass data. In the fault diagnosis of rotating machinery, due to the accidental occurrence of equipment faults, the proportion of fault samples is small, the samples are imbalanced, and available data are scarce, which leads to the low accuracy rate of the intelligent diagnosis model trained to identify the equipment state. To solve the above problems, an end-to-end diagnosis model is first proposed, which is an intelligent fault diagnosis method based on one-dimensional convolutional neural network (1D-CNN). That is to say, the original vibration signal is directly input into the model for identification. After that, through combining the convolutional neural network with the generative adversarial networks, a data expansion method based on the one-dimensional deep convolutional generative adversarial networks (1D-DCGAN) is constructed to generate small sample size fault samples and construct the balanced data set. Meanwhile, in order to solve the problem that the network is difficult to optimize, gradient penalty and Wasserstein distance are introduced. Through the test of bearing database and hydraulic pump, it shows that the one-dimensional convolution operation has strong feature extraction ability for vibration signals. The proposed method is very accurate for fault diagnosis of the two kinds of equipment, and high-quality expansion of the original data can be achieved.


Author(s):  
Valerii Dmitrienko ◽  
Sergey Leonov ◽  
Mykola Mezentsev

The idea of ​​Belknap's four-valued logic is that modern computers should function normally not only with the true values ​​of the input information, but also under the conditions of inconsistency and incompleteness of true failures. Belknap's logic introduces four true values: T (true - true), F (false - false), N (none - nobody, nothing, none), B (both - the two, not only the one but also the other).  For ease of work with these true values, the following designations are introduced: (1, 0, n, b). Belknap's logic can be used to obtain estimates of proximity measures for discrete objects, for which the functions Jaccard and Needhem, Russel and Rao, Sokal and Michener, Hamming, etc. are used. In this case, it becomes possible to assess the proximity, recognition and classification of objects in conditions of uncertainty when the true values ​​are taken from the set (1, 0, n, b). Based on the architecture of the Hamming neural network, neural networks have been developed that allow calculating the distances between objects described using true values ​​(1, 0, n, b). Keywords: four-valued Belknap logic, Belknap computer, proximity assessment, recognition and classification, proximity function, neural network.


Energies ◽  
2022 ◽  
Vol 15 (2) ◽  
pp. 588
Author(s):  
Felipe Leite Coelho da Silva ◽  
Kleyton da Costa ◽  
Paulo Canas Rodrigues ◽  
Rodrigo Salas ◽  
Javier Linkolk López-Gonzales

Forecasting the industry’s electricity consumption is essential for energy planning in a given country or region. Thus, this study aims to apply time-series forecasting models (statistical approach and artificial neural network approach) to the industrial electricity consumption in the Brazilian system. For the statistical approach, the Holt–Winters, SARIMA, Dynamic Linear Model, and TBATS (Trigonometric Box–Cox transform, ARMA errors, Trend, and Seasonal components) models were considered. For the approach of artificial neural networks, the NNAR (neural network autoregression) and MLP (multilayer perceptron) models were considered. The results indicate that the MLP model was the one that obtained the best forecasting performance for the electricity consumption of the Brazilian industry under analysis.


2020 ◽  
Vol 15 ◽  
pp. 258
Author(s):  
S. Athanasopoulos ◽  
E. Mavrommatis ◽  
K. A. Gernoth ◽  
J. W. Clark

We evaluate the location of the proton drip line in the regions 31≤Z≤49 and 73≤Z≤91 based on the one- and two-proton separation energies predicted by our latest Hybrid Mass Model. The latter is constructed by complementing the mass-excess values ΔM predicted by the Finite Range Droplet Model (FRDM) of Moeller et al. with a neural network model trained to predict the differences ΔMexp − ΔMFRDM between these values and the experimental mass-excess values published in the 2003 Atomic Mass Evaluation AME03.


Author(s):  
Tahani Aljohani ◽  
Alexandra I. Cristea

Massive Open Online Courses (MOOCs) have become universal learning resources, and the COVID-19 pandemic is rendering these platforms even more necessary. In this paper, we seek to improve Learner Profiling (LP), i.e. estimating the demographic characteristics of learners in MOOC platforms. We have focused on examining models which show promise elsewhere, but were never examined in the LP area (deep learning models) based on effective textual representations. As LP characteristics, we predict here the employment status of learners. We compare sequential and parallel ensemble deep learning architectures based on Convolutional Neural Networks and Recurrent Neural Networks, obtaining an average high accuracy of 96.3% for our best method. Next, we predict the gender of learners based on syntactic knowledge from the text. We compare different tree-structured Long-Short-Term Memory models (as state-of-the-art candidates) and provide our novel version of a Bi-directional composition function for existing architectures. In addition, we evaluate 18 different combinations of word-level encoding and sentence-level encoding functions. Based on these results, we show that our Bi-directional model outperforms all other models and the highest accuracy result among our models is the one based on the combination of FeedForward Neural Network and the Stack-augmented Parser-Interpreter Neural Network (82.60% prediction accuracy). We argue that our prediction models recommended for both demographics characteristics examined in this study can achieve high accuracy. This is additionally also the first time a sound methodological approach toward improving accuracy for learner demographics classification on MOOCs was proposed.


2016 ◽  
pp. 368-395
Author(s):  
Eliano Pessa

The Artificial Neural Network (ANN) models gained a wide popularity owing to a number of claimed advantages such as biological plausibility, tolerance with respect to errors or noise in the input data, learning ability allowing an adaptability to environmental constraints. Notwithstanding the fact that most of these advantages are not typical only of ANNs, engineers, psychologists and neuroscientists made an extended use of ANN models in a large number of scientific investigations. In most cases, however, these models have been introduced in order to provide optimization tools more useful than the ones commonly used by traditional Optimization Theory. Unfortunately, just the successful performance of ANN models in optimization tasks produced a widespread neglect of the true – and important – objectives pursued by the first promoters of these models. These objectives can be shortly summarized by the manifesto of connectionist psychology, stating that mental processes are nothing but macroscopic phenomena, emergent from the cooperative interaction of a large number of microscopic knowledge units. This statement – wholly in line with the goal of statistical mechanics – can be readily extended to other processes, beyond the mental ones, including social, economic, and, in general, organizational ones. Therefore this chapter has been designed in order to answer a number of related questions, such as: are the ANN models able to grant for the occurrence of this sort of emergence? How can the occurrence of this emergence be empirically detected? How can the emergence produced by ANN models be controlled? In which sense the ANN emergence could offer a new paradigm for the explanation of macroscopic phenomena? Answering these questions induces to focus the chapter on less popular ANNs, such as the recurrent ones, while neglecting more popular models, such as perceptrons, and on less used units, such as spiking neurons, rather than on McCulloch-Pitts neurons. Moreover, the chapter must mention a number of strategies of emergence detection, useful for researchers performing computer simulations of ANN behaviours. Among these strategies it is possible to quote the reduction of ANN models to continuous models, such as the neural field models or the neural mass models, the recourse to the methods of Network Theory and the employment of techniques borrowed by Statistical Physics, like the one based on the Renormalization Group. Of course, owing to space (and mathematical expertise) requirements, most mathematical details of the proposed arguments are neglected, and, to gain more information, the reader is deferred to the quoted literature.


2020 ◽  
Vol 12 (1) ◽  
pp. 39-55
Author(s):  
Hadj Ahmed Bouarara

In recent years, surveillance video has become a familiar phenomenon because it gives us a feeling of greater security, but we are continuously filmed and our privacy is greatly affected. This work deals with the development of a private video surveillance system (PVSS) using regression residual convolutional neural network (RR-CNN) with the goal to propose a new security policy to ensure the privacy of no-dangerous person and prevent crime. The goal is to best meet the interests of all parties: the one who films and the one who is filmed.


Proceedings ◽  
2019 ◽  
Vol 21 (1) ◽  
pp. 26 ◽  
Author(s):  
Anxo Tato ◽  
Carlos Mosquera

Spatial Modulation (SM) is a candidate modulation scheme for beyond 5G communications systems due to its reduced hardware complexity and good trade-off between energy and spectral efficiency. This paper proposes two Machine Learning based solutions for easing the implementation of adaptive SM systems. On the one hand, a shallow neural network is shown to be an accurate and simple method for obtaining the capacity of SM. On the other hand, a deep neural network is proposed to select the coding rate in practical adaptive SM systems.


2000 ◽  
Vol 31 (4) ◽  
pp. 137-140
Author(s):  
Amine Bensaid ◽  
Bouchra Bouqata ◽  
Ralph Palliam

There are numerous methods for estimating forward interest rates as well as many studies testing the accuracy of these methods. The approach proposed in this study is similar to the one in previous works in two respects: firstly, a Monte Carlo simulation is used instead of empirical data to circumvent empirical difficulties: and secondly, in this study, accuracy is measured by estimating the forward rates rather than by exploring bond prices. This is more consistent with user objectives. The method presented here departs from the others in that it uses a Recurrent Artificial Neural Network (RANN) as an alternative technique for forecasting forward interest rates. Its performance is compared to that of a recursive method which has produced some of the best results in previous studies for forecasting forward interest rates.


2019 ◽  
Vol 9 (1) ◽  
Author(s):  
Gaoyang Li ◽  
Kazuhiro Watanabe ◽  
Hitomi Anzai ◽  
Xiaorui Song ◽  
Aike Qiao ◽  
...  

Abstract Owing to the diversity of pulse-wave morphology, pulse-based diagnosis is difficult, especially pulse-wave-pattern classification (PWPC). A powerful method for PWPC is a convolutional neural network (CNN). It outperforms conventional methods in pattern classification due to extracting informative abstraction and features. For previous PWPC criteria, the relationship between pulse and disease types is not clear. In order to improve the clinical practicability, there is a need for a CNN model to find the one-to-one correspondence between pulse pattern and disease categories. In this study, five cardiovascular diseases (CVD) and complications were extracted from medical records as classification criteria to build pulse data set 1. Four physiological parameters closely related to the selected diseases were also extracted as classification criteria to build data set 2. An optimized CNN model with stronger feature extraction capability for pulse signals was proposed, which achieved PWPC with 95% accuracy in data set 1 and 89% accuracy in data set 2. It demonstrated that pulse waves are the result of multiple physiological parameters. There are limitations when using a single physiological parameter to characterise the overall pulse pattern. The proposed CNN model can achieve high accuracy of PWPC while using CVD and complication categories as classification criteria.


Sign in / Sign up

Export Citation Format

Share Document