scholarly journals RGB-D Object Recognition Using Multi-Modal Deep Neural Network and DS Evidence Theory

Sensors ◽  
2019 ◽  
Vol 19 (3) ◽  
pp. 529 ◽  
Author(s):  
Hui Zeng ◽  
Bin Yang ◽  
Xiuqing Wang ◽  
Jiwei Liu ◽  
Dongmei Fu

With the development of low-cost RGB-D (Red Green Blue-Depth) sensors, RGB-D object recognition has attracted more and more researchers’ attention in recent years. The deep learning technique has become popular in the field of image analysis and has achieved competitive results. To make full use of the effective identification information in the RGB and depth images, we propose a multi-modal deep neural network and a DS (Dempster Shafer) evidence theory based RGB-D object recognition method. First, the RGB and depth images are preprocessed and two convolutional neural networks are trained, respectively. Next, we perform multi-modal feature learning using the proposed quadruplet samples based objective function to fine-tune the network parameters. Then, two probability classification results are obtained using two sigmoid SVMs (Support Vector Machines) with the learned RGB and depth features. Finally, the DS evidence theory based decision fusion method is used for integrating the two classification results. Compared with other RGB-D object recognition methods, our proposed method adopts two fusion strategies: Multi-modal feature learning and DS decision fusion. Both the discriminative information of each modality and the correlation information between the two modalities are exploited. Extensive experimental results have validated the effectiveness of the proposed method.

2021 ◽  
pp. 1-14
Author(s):  
Sachin Sharma ◽  
Vineet Kumar ◽  
K.P.S. Rana

Generally, the process industry is affected by unwanted fluctuations in control loops arising due to external interference, components with inherent nonlinearities or aggressively tuned controllers. These oscillations lead to production of substandard products and thus affect the overall profitability of a plant. Hence, timely detection of oscillations is desired for ensuring safety and profitability of the plant. In order to achieve this, a control loop oscillation detection and quantification algorithm using Prony method of infinite impulse response (IIR) filter design and deep neural network (DNN) has been presented in this work. Denominator polynomial coefficients of the obtained IIR filter using Prony method were used as the feature vector for DNN. Further, DNN is used to confirm the existence of oscillations in the process control loop data. Furthermore, amplitude and frequency of oscillations are also estimated with the help of cross-correlation values, computed between the original signal and estimated error signal. Experimental results confirm that the presented algorithm is capable of detecting the presence of single or multiple oscillations in the control loop data. The proposed algorithm is also able to estimate the frequency and amplitude of detected oscillations with high accuracy. The Proposed method is also compared with support vector machine (SVM) and empirical mode decomposition (EMD) based approach and it is found that proposed method is faster and more accurate than the later.


Kybernetes ◽  
2019 ◽  
Vol 49 (9) ◽  
pp. 2335-2348 ◽  
Author(s):  
Milad Yousefi ◽  
Moslem Yousefi ◽  
Masood Fathi ◽  
Flavio S. Fogliatto

Purpose This study aims to investigate the factors affecting daily demand in an emergency department (ED) and to provide a forecasting tool in a public hospital for horizons of up to seven days. Design/methodology/approach In this study, first, the important factors to influence the demand in EDs were extracted from literature then the relevant factors to the study are selected. Then, a deep neural network is applied to constructing a reliable predictor. Findings Although many statistical approaches have been proposed for tackling this issue, better forecasts are viable by using the abilities of machine learning algorithms. Results indicate that the proposed approach outperforms statistical alternatives available in the literature such as multiple linear regression, autoregressive integrated moving average, support vector regression, generalized linear models, generalized estimating equations, seasonal ARIMA and combined ARIMA and linear regression. Research limitations/implications The authors applied this study in a single ED to forecast patient visits. Applying the same method in different EDs may give a better understanding of the performance of the model to the authors. The same approach can be applied in any other demand forecasting after some minor modifications. Originality/value To the best of the knowledge, this is the first study to propose the use of long short-term memory for constructing a predictor of the number of patient visits in EDs.


2018 ◽  
Vol 2018 ◽  
pp. 1-12 ◽  
Author(s):  
Abebe Belay Adege ◽  
Hsin-Piao Lin ◽  
Getaneh Berie Tarekegn ◽  
Yirga Yayeh Munaye ◽  
Lei Yen

Indoor and outdoor positioning lets to offer universal location services in industry and academia. Wi-Fi and Global Positioning System (GPS) are the promising technologies for indoor and outdoor positioning, respectively. However, Wi-Fi-based positioning is less accurate due to the vigorous changes of environments and shadowing effects. GPS-based positioning is also characterized by much cost, highly susceptible to the physical layouts of equipment, power-hungry, and sensitive to occlusion. In this paper, we propose a hybrid of support vector machine (SVM) and deep neural network (DNN) to develop scalable and accurate positioning in Wi-Fi-based indoor and outdoor environments. In the positioning processes, we primarily construct real datasets from indoor and outdoor Wi-Fi-based environments. Secondly, we apply linear discriminate analysis (LDA) to construct a projected vector that uses to reduce features without affecting information contents. Thirdly, we construct a model for positioning through the integration of SVM and DNN. Fourthly, we use online datasets from unknown locations and check the missed radio signal strength (RSS) values using the feed-forward neural network (FFNN) algorithm to fill the missed values. Fifthly, we project the online data through an LDA-based projected vector. Finally, we test the positioning accuracies and scalabilities of a model created from a hybrid of SVM and DNN. The whole processes are implemented using Python 3.6 programming language in the TensorFlow framework. The proposed method provides accurate and scalable positioning services in different scenarios. The results also show that our proposed approach can provide scalable positioning, and 100% of the estimation accuracies are with errors less than 1 m and 1.9 m for indoor and outdoor positioning, respectively.


2019 ◽  
Vol 5 (5) ◽  
pp. eaav7903 ◽  
Author(s):  
Khaled Nasr ◽  
Pooja Viswanathan ◽  
Andreas Nieder

Humans and animals have a “number sense,” an innate capability to intuitively assess the number of visual items in a set, its numerosity. This capability implies that mechanisms to extract numerosity indwell the brain’s visual system, which is primarily concerned with visual object recognition. Here, we show that network units tuned to abstract numerosity, and therefore reminiscent of real number neurons, spontaneously emerge in a biologically inspired deep neural network that was merely trained on visual object recognition. These numerosity-tuned units underlay the network’s number discrimination performance that showed all the characteristics of human and animal number discriminations as predicted by the Weber-Fechner law. These findings explain the spontaneous emergence of the number sense based on mechanisms inherent to the visual system.


Proceedings ◽  
2019 ◽  
Vol 42 (1) ◽  
pp. 15
Author(s):  
Manuel Gil-Martín ◽  
Marcos Sánchez-Hernández ◽  
Rubén San-Segundo

Deep learning techniques are being widely applied to Human Activity Recognition (HAR). This paper describes the implementation and evaluation of a HAR system for daily life activities using the accelerometer of an iPhone 6S. This system is based on a deep neural network including convolutional layers for feature extraction from accelerations and fully-connected layers for classification. Different transformations have been applied to the acceleration signals in order to find the appropriate input data to the deep neural network. This study has used acceleration recordings from the MotionSense dataset, where 24 subjects performed 6 activities: walking downstairs, walking upstairs, sitting, standing, walking and jogging. The evaluation has been performed using a subject-wise cross-validation: recordings from the same subject do not appear in training and testing sets at the same time. The proposed system has obtained a 9% improvement in accuracy compared to the baseline system based on Support Vector Machines. The best results have been obtained using raw data as input to a deep neural network composed of two convolutional and two max-pooling layers with decreasing kernel sizes. Results suggest that using the module of the Fourier transform as inputs provides better results when classifying only between dynamic activities.


Sign in / Sign up

Export Citation Format

Share Document