Data Mining and Neural Networks: The Impact of Data Representation

Author(s):  
Fadzilah Siraj ◽  
Ehab A. Omer A. Omer ◽  
Md. Rajib
Sensors ◽  
2020 ◽  
Vol 20 (6) ◽  
pp. 1579
Author(s):  
Dongqi Wang ◽  
Qinghua Meng ◽  
Dongming Chen ◽  
Hupo Zhang ◽  
Lisheng Xu

Automatic detection of arrhythmia is of great significance for the early prevention and diagnosis of cardiovascular disease. Traditional feature engineering methods based on expert knowledge lack the ability to abstract and represent multidimensional, multi-view information, so traditional pattern recognition research on arrhythmia detection has not achieved satisfactory results. Recently, with the rise of deep learning, automatic feature extraction from ECG data using deep neural networks has been widely discussed. To exploit the complementary strengths of different schemes, we propose an arrhythmia detection method based on a multi-resolution representation (MRR) of ECG signals. The method uses four different up-to-date deep neural networks as four channel models for learning ECG vector representations. The deep-learning-based representations, together with hand-crafted ECG features, form the MRR, which is the input to the downstream classification strategy. Multi-label classification experiments on a large ECG dataset show that the proposed method achieves an F1 score of 0.9238, which is 1.31%, 0.62%, 1.18% and 0.6% higher than the four channel models, respectively. From an architectural perspective, the proposed method is highly scalable and can serve as an example for arrhythmia recognition.
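A minimal sketch of how such a representation might be assembled, assuming each channel model emits one fixed-length vector per ECG record and that fusion is plain concatenation (the abstract does not say how the vectors are combined). The name `build_mrr` and the toy values are hypothetical.

```python
from typing import List, Sequence


def build_mrr(channel_vectors: Sequence[Sequence[float]],
              handcrafted: Sequence[float]) -> List[float]:
    """Concatenate per-channel learned vectors with hand-crafted features.

    Assumption: plain concatenation; the paper may fuse features differently.
    """
    mrr: List[float] = []
    for vec in channel_vectors:
        mrr.extend(vec)          # learned representation from one channel model
    mrr.extend(handcrafted)      # expert-designed ECG features go last
    return mrr


# Toy example: four hypothetical channel outputs plus two hand-crafted features.
channels = [[0.1, 0.2], [0.3], [0.4, 0.5], [0.6]]
features = [72.0, 0.8]
mrr_vector = build_mrr(channels, features)
```

The resulting vector would then be passed to whatever downstream classifier is chosen; adding a fifth channel model only lengthens the vector, which is one way to read the scalability claim.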


2020 ◽  
Vol 6 (1) ◽  
Author(s):  
Malte Seemann ◽  
Lennart Bargsten ◽  
Alexander Schlaefer

Deep learning methods produce promising results when applied to a wide range of medical imaging tasks, including segmentation of the artery lumen in computed tomography angiography (CTA) data. However, to perform adequately, neural networks have to be trained on large amounts of high-quality annotated data. In medical imaging, annotations are not only scarce but also often not entirely reliable. To tackle both challenges, we developed a two-step approach for generating realistic synthetic CTA data for data augmentation. In the first step, moderately realistic images are generated in a purely numerical fashion. In the second step, these images are improved by applying neural domain adaptation. We evaluated the impact of synthetic data on lumen segmentation with convolutional neural networks (CNNs) by comparing the resulting performance. Improvements of up to 5% in Dice coefficient and 20% in Hausdorff distance provide a proof of concept that the proposed augmentation procedure can enhance deep-learning-based segmentation of the artery lumen in CTA images.
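For readers unfamiliar with the two metrics named above, the following sketch shows their standard definitions on toy inputs. It assumes flattened binary masks for Dice and small 2D point sets for the Hausdorff distance; the function names are illustrative and the paper's own evaluation code is not shown in the abstract.

```python
import math
from typing import Sequence, Tuple

Point = Tuple[float, float]


def dice_coefficient(pred: Sequence[int], truth: Sequence[int]) -> float:
    """Dice = 2|A n B| / (|A| + |B|) for flattened binary masks."""
    if len(pred) != len(truth):
        raise ValueError("masks must have the same length")
    intersection = sum(1 for p, t in zip(pred, truth) if p and t)
    total = sum(1 for p in pred if p) + sum(1 for t in truth if t)
    # Convention (an assumption here): two empty masks count as a perfect match.
    return 1.0 if total == 0 else 2.0 * intersection / total


def hausdorff_distance(a: Sequence[Point], b: Sequence[Point]) -> float:
    """Symmetric Hausdorff distance between two non-empty point sets."""
    if not a or not b:
        raise ValueError("point sets must be non-empty")

    def directed(src: Sequence[Point], dst: Sequence[Point]) -> float:
        # Largest distance from any point in src to its nearest point in dst.
        return max(min(math.dist(p, q) for q in dst) for p in src)

    return max(directed(a, b), directed(b, a))
```

Dice rewards overlap (higher is better), while the Hausdorff distance penalises the single worst boundary error (lower is better), so the two figures are not directly comparable.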


Author(s):  
Krzysztof Jurczuk ◽  
Marcin Czajkowski ◽  
Marek Kretowski

This paper concerns the evolutionary induction of decision trees (DT) for large-scale data. Such a global approach is one of the alternatives to top-down inducers. It searches for the tree structure and the tests simultaneously, and thus improves the prediction performance and size of the resulting classifiers in many situations. However, as a population-based and iterative approach, it can be too computationally demanding to apply directly to big data mining. The paper demonstrates that this barrier can be overcome by smart distributed/parallel processing. Moreover, we ask whether the global approach can truly compete with greedy systems on large-scale data. For this purpose, we propose a novel multi-GPU approach. It combines knowledge of global DT induction and evolutionary algorithm parallelization with efficient use of GPU memory and computing resources. The searches for the tree structure and the tests are performed simultaneously on a CPU, while the fitness calculations are delegated to GPUs. A data-parallel decomposition strategy and the CUDA framework are applied. Experimental validation is performed on both artificial and real-life datasets. In both cases, the obtained acceleration is very satisfactory. The solution is able to process even billions of instances in a few hours on a single workstation equipped with 4 GPUs. The impact of data characteristics (size and dimension) on the convergence and speedup of the evolutionary search is also shown. When the number of GPUs grows, nearly linear scalability is observed, which suggests that data size boundaries for evolutionary DT mining are fading.
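The data-parallel decomposition described above can be illustrated with a small CPU-only sketch. Here worker threads stand in for GPUs, the fitness is reduced to a misclassification count, and the nested-tuple tree encoding is a hypothetical choice made for brevity; none of these details come from the paper, which uses CUDA.

```python
from concurrent.futures import ThreadPoolExecutor


def predict(tree, x):
    """Walk a tree encoded as (feature, threshold, left, right) or a leaf label."""
    while isinstance(tree, tuple):
        feature, threshold, left, right = tree
        tree = left if x[feature] <= threshold else right
    return tree


def chunk_errors(tree, chunk):
    """Partial fitness term: misclassified instances within one data chunk."""
    return sum(1 for x, y in chunk if predict(tree, x) != y)


def parallel_error_count(tree, data, n_workers=4):
    """Split the data into chunks, score each chunk in a worker, sum the results.

    Assumption: threads are only a stand-in for the GPUs used in the paper.
    """
    size = max(1, -(-len(data) // n_workers))  # ceiling division
    chunks = [data[i:i + size] for i in range(0, len(data), size)]
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        return sum(pool.map(lambda c: chunk_errors(tree, c), chunks))


# One candidate tree from the population: split on feature 0 at 0.5.
toy_tree = (0, 0.5, "a", "b")
toy_data = [([0.1], "a"), ([0.9], "b"), ([0.2], "b"), ([0.7], "a")]
errors = parallel_error_count(toy_tree, toy_data, n_workers=2)
```

Because each chunk is scored independently and only a single count is returned per worker, adding workers mainly shrinks the chunk size, which is consistent with the near-linear scaling the abstract reports.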


Sensors ◽  
2021 ◽  
Vol 21 (3) ◽  
pp. 676
Author(s):  
Andrej Zgank

Acoustic monitoring of animal activity is becoming a necessary tool in agriculture, including beekeeping. It can assist in the control of beehives in remote locations, since bee swarm activity can be classified from audio signals. This paper proposes an IoT-based acoustic swarm classification approach using deep neural networks. Audio recordings were obtained from the Open Source Beehive project, and Mel-frequency cepstral coefficient features were extracted from the audio signal. The lossless WAV and lossy MP3 audio formats were compared for IoT-based solutions, and the impact of the deep neural network parameters on the classification results was analyzed. The best overall classification accuracy with uncompressed audio was 94.09%, but MP3 compression degraded the DNN accuracy by over 10%. The evaluation of the proposed approach showed improved results compared with the previous system based on hidden Markov models.
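Mel-frequency cepstral coefficients rest on a mel-scale filterbank. The sketch below shows only that first ingredient, using one widely used Hz-to-mel formula; the abstract does not state which variant, how many filters, or what frequency range the author used, so the values here are assumptions.

```python
import math
from typing import List


def hz_to_mel(freq_hz: float) -> float:
    """Common mel-scale mapping: 2595 * log10(1 + f / 700)."""
    return 2595.0 * math.log10(1.0 + freq_hz / 700.0)


def mel_to_hz(mel: float) -> float:
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_center_frequencies(n_filters: int, f_min: float, f_max: float) -> List[float]:
    """Centre frequencies (Hz) of n_filters bands equally spaced on the mel scale."""
    lo, hi = hz_to_mel(f_min), hz_to_mel(f_max)
    step = (hi - lo) / (n_filters + 1)
    return [mel_to_hz(lo + step * (i + 1)) for i in range(n_filters)]


# Assumed settings: 26 filters over 0-8000 Hz (i.e. 16 kHz sampled audio).
centers = mel_center_frequencies(26, 0.0, 8000.0)
```

The centres are dense at low frequencies and sparse at high ones; the full MFCC pipeline would then apply these filters to a power spectrum, take logarithms, and apply a discrete cosine transform.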


SAGE Open ◽  
2021 ◽  
Vol 11 (3) ◽  
pp. 215824402110326
Author(s):  
Koffi Dumor ◽  
Li Yao ◽  
Jean-Paul Ainam ◽  
Edem Koffi Amouzou ◽  
Williams Ayivi

Recent research suggests that China’s Belt and Road Initiative (BRI) would improve bilateral trade between China and its partners. This article uses detailed bilateral export data from 1990 to 2017 to investigate the impact of China’s BRI on its trade partners using neural network analysis techniques and structural gravity model estimations. Our main findings suggest that the BRI countries would raise exports by a modest 5.053%. This indicates that export and network upgrades should be considered from economic and policy perspectives. The results also show that the neural network approach is more robust than the structural gravity framework.


2019 ◽  
Vol 43 (6) ◽  
pp. 632-654
Author(s):  
Daidai Shen ◽  
Jean-Claude Thill ◽  
Jiuwen Sun

In this article, the socioeconomic determinants of urban population in China are empirically investigated with a theoretical equilibrium model for city size. While much of the research on urban size focuses on the impact of agglomeration economies based on “optimal city size” theory, we eschew that model because of its theoretical paradox in the real world and turn instead toward an intermediate solution proposed by Camagni, Capello, and Caragliu. This equilibrium model is estimated on a sample of 111 prefectural cities in China with multiple regression and artificial neural networks. Empirical results show that the model explains the variance in the data very well, and all the determinants have significant impacts on Chinese city sizes. Although the sample cities have reached their equilibrium sizes as a whole, there is a substantially unbalanced distribution of population within the urban system, with a strong contingent of cities that are either squarely too large or too small.


2017 ◽  
Vol 29 (10) ◽  
pp. e12528 ◽  
Author(s):  
M. van den Top ◽  
F.-Y. Zhao ◽  
R. Viriyapong ◽  
N. J. Michael ◽  
A. C. Munder ◽  
...  
