IoT-Based Bee Swarm Activity Acoustic Classification Using Deep Neural Networks

Andrej Zgank

doi:10.3390/s21030676

Sensors ◽

10.3390/s21030676 ◽

2021 ◽

Vol 21 (3) ◽

pp. 676

Author(s):

Andrej Zgank

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Markov Models ◽

Audio Signal ◽

Audio Signals ◽

Mel Frequency Cepstral Coefficients ◽

Animal Activity ◽

The Impact ◽

Acoustic Classification ◽

Swarm Activity

Acoustic monitoring of animal activity is becoming a necessary tool in agriculture, including beekeeping. It can assist in the control of beehives in remote locations. Such approaches make it possible to classify bee swarm activity from audio signals. This paper proposes an IoT-based acoustic swarm classification system using deep neural networks (DNNs). Audio recordings were obtained from the Open Source Beehives Project. Mel-frequency cepstral coefficient features were extracted from the audio signal. The lossless WAV and lossy MP3 audio formats were compared for IoT-based solutions, and the impact of the deep neural network parameters on the classification results was analyzed. The best overall classification accuracy with uncompressed audio was 94.09%, but MP3 compression degraded the DNN accuracy by over 10%. The proposed system showed improved results compared with the previous hidden Markov model system.
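The mel-frequency features mentioned in this abstract rest on a perceptual frequency warping. The sketch below shows one widely used form of the mel scale and how filter centre frequencies can be spaced evenly on it. The constants 2595 and 700, the filter count and the 0 to 8000 Hz range are illustrative assumptions; the abstract does not state which variant or settings the paper used.

```python
import math


def hz_to_mel(f_hz):
    """Convert a frequency in Hz to mels (common 2595 * log10 form, assumed here)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(m):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def mel_center_frequencies(n_filters, f_min, f_max):
    """Centre frequencies (Hz) of n_filters filters spaced evenly on the mel scale."""
    m_min, m_max = hz_to_mel(f_min), hz_to_mel(f_max)
    step = (m_max - m_min) / (n_filters + 1)
    return [mel_to_hz(m_min + step * (i + 1)) for i in range(n_filters)]


# Hypothetical settings: 10 filters between 0 Hz and 8000 Hz.
centers = mel_center_frequencies(n_filters=10, f_min=0.0, f_max=8000.0)
```

Because the spacing is uniform in mels, the centres sit close together at low frequencies and spread out at high frequencies, which is the property MFCC front ends rely on.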

Bee Swarm Activity Acoustic Classification for an IoT-Based Farm Service

Sensors ◽

10.3390/s20010021 ◽

2019 ◽

Vol 20 (1) ◽

pp. 21 ◽

Cited By ~ 6

Author(s):

Andrej Zgank

Keyword(s):

Markov Models ◽

Predictive Coding ◽

Gaussian Mixture Models ◽

Audio Signal ◽

Gaussian Mixture ◽

Classification Performance ◽

Learning Approaches ◽

Linear Predictive Coding ◽

Mel Frequency Cepstral Coefficients ◽

Acoustic Classification

Beekeeping is a widespread and traditional field of agriculture in which Internet of Things (IoT)-based solutions and machine learning approaches can significantly ease and improve beehive management. A particularly important activity is bee swarming. A beehive monitoring system can be applied in digital farming to alert the user via a service when swarming begins, which requires a response. This paper proposes an IoT-based bee activity acoustic classification system. The audio data needed for acoustic training was collected from the Open Source Beehives Project. The input audio signal was converted into feature vectors using Mel-Frequency Cepstral Coefficients (with cepstral mean normalization) and Linear Predictive Coding. The influence of acoustic background noise and of the denoising procedure was evaluated in an additional step. Different Hidden Markov Model and Gaussian Mixture Model topologies were developed for acoustic modeling, with the objective of determining the most suitable one for the proposed IoT-based solution. The evaluation was carried out with a separate test set, classifying sound between the normal and swarming conditions in a beehive. The results showed that good acoustic classification performance can be achieved with the proposed system.
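Cepstral mean normalization, named in this abstract, can be illustrated in a few lines: the mean of each coefficient over all frames of a recording is subtracted from that coefficient. This is a minimal sketch on made-up two-coefficient frames, not the paper's feature pipeline; the function name and the toy values are assumptions.

```python
def cepstral_mean_normalization(frames):
    """Subtract the per-coefficient mean over time from every frame.

    frames: list of equal-length lists, one list of cepstral coefficients per frame.
    """
    n_frames = len(frames)
    n_coeffs = len(frames[0])
    means = [sum(f[k] for f in frames) / n_frames for k in range(n_coeffs)]
    return [[f[k] - means[k] for k in range(n_coeffs)] for f in frames]


# Toy data: three frames with two coefficients each (illustrative values only).
toy = [[1.0, 10.0], [2.0, 12.0], [3.0, 14.0]]
normalized = cepstral_mean_normalization(toy)

# The same frames with a constant offset added to every frame.
shifted = [[c0 + 0.5, c1 - 3.0] for c0, c1 in toy]
normalized_shifted = cepstral_mean_normalization(shifted)
```

A constant offset in the cepstral domain, such as one introduced by a fixed microphone or channel response, cancels out, so `normalized` and `normalized_shifted` are identical in this example.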

Impact of Low Resolution on Image Recognition with Deep Neural Networks: An Experimental Study

International Journal of Applied Mathematics and Computer Science ◽

10.2478/amcs-2018-0056 ◽

2018 ◽

Vol 28 (4) ◽

pp. 735-744 ◽

Cited By ~ 9

Author(s):

Michał Koziarski ◽

Bogusław Cyganek

Keyword(s):

Neural Networks ◽

Image Recognition ◽

Classification Accuracy ◽

Deep Neural Networks ◽

Dynamic Range ◽

Super Resolution ◽

Image Resolution ◽

Quality Data ◽

Low Resolution ◽

The Impact

Due to the advances made in recent years, methods based on deep neural networks have been able to achieve a state-of-the-art performance in various computer vision problems. In some tasks, such as image recognition, neural-based approaches have even been able to surpass human performance. However, the benchmarks on which neural networks achieve these impressive results usually consist of fairly high quality data. On the other hand, in practical applications we are often faced with images of low quality, affected by factors such as low resolution, presence of noise or a small dynamic range. It is unclear how resilient deep neural networks are to the presence of such factors. In this paper we experimentally evaluate the impact of low resolution on the classification accuracy of several notable neural architectures of recent years. Furthermore, we examine the possibility of improving neural networks’ performance in the task of low resolution image recognition by applying super-resolution prior to classification. The results of our experiments indicate that contemporary neural architectures remain significantly affected by low image resolution. By applying super-resolution prior to classification we were able to alleviate this issue to a large extent as long as the resolution of the images did not decrease too severely. However, in the case of very low resolution images the classification accuracy remained considerably affected.
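The kind of degradation studied in this abstract can be mimicked on a toy image. The sketch below lowers resolution by block averaging and then restores the original size with nearest-neighbour upscaling. Both choices are assumptions made for illustration: the abstract does not say how the images were downscaled, and nearest-neighbour upscaling is only a naive stand-in, not a super-resolution method.

```python
def downscale(image, factor):
    """Reduce resolution by averaging non-overlapping factor x factor blocks."""
    h, w = len(image), len(image[0])
    out = []
    for r in range(0, h - h % factor, factor):
        row = []
        for c in range(0, w - w % factor, factor):
            block = [image[r + i][c + j] for i in range(factor) for j in range(factor)]
            row.append(sum(block) / len(block))
        out.append(row)
    return out


def upscale_nearest(image, factor):
    """Restore the original size by repeating every pixel factor times in each direction."""
    return [[v for v in row for _ in range(factor)] for row in image for _ in range(factor)]


# Toy 4x4 grayscale image (illustrative values only).
original = [
    [0, 0, 4, 4],
    [0, 0, 4, 4],
    [8, 8, 2, 6],
    [8, 8, 6, 2],
]
low_res = downscale(original, 2)
restored = upscale_nearest(low_res, 2)

# Mean absolute pixel error between the original and the restored image.
error = sum(
    abs(a - b) for row_a, row_b in zip(original, restored) for a, b in zip(row_a, row_b)
) / 16
```

Flat regions survive the round trip unchanged, while the textured bottom-right block is replaced by its average. Fine detail of this kind is what a classifier no longer sees at low resolution.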

A Comparison of Audio Signal Preprocessing Methods for Deep Neural Networks on Music Tagging

2018 26th European Signal Processing Conference (EUSIPCO) ◽

10.23919/eusipco.2018.8553106 ◽

2018 ◽

Cited By ~ 7

Author(s):

Keunwoo Choi ◽

Gyorgy Fazekas ◽

Mark Sandler ◽

Kyunghyun Cho

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Audio Signal ◽

Signal Preprocessing ◽

Music Tagging

The Impact of Architecture on the Deep Neural Networks Training

2019 12th International Conference on Human System Interaction (HSI) ◽

10.1109/hsi47298.2019.8942622 ◽

2019 ◽

Author(s):

Pawel Rozycki ◽

Janusz Kolbusz ◽

Aleksander Malinowski ◽

Bogdan Wilamowski

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

The Impact

LSTM Deep Neural Networks Postfiltering for Enhancing Synthetic Voices

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s021800141860008x ◽

2017 ◽

Vol 32 (01) ◽

pp. 1860008 ◽

Cited By ~ 8

Author(s):

Marvin Coto-Jiménez ◽

John Goddard-Close

Keyword(s):

Neural Networks ◽

Speech Synthesis ◽

Deep Neural Networks ◽

Short Term Memory ◽

Markov Models ◽

Natural Speech ◽

Objective Measures ◽

Recent Developments ◽

Small Footprint ◽

Synthetic Voices

Recent developments in speech synthesis have produced systems capable of producing speech which closely resembles natural speech, and researchers now strive to create models that more accurately mimic human voices. One such development is the incorporation of multiple linguistic styles in various languages and accents. Speech synthesis based on Hidden Markov Models (HMM) is of great interest to researchers, due to its ability to produce sophisticated features with a small footprint. Despite some progress, its quality has not yet reached the level of the current predominant unit-selection approaches, which select and concatenate recordings of real speech, and work has been conducted to try to improve HMM-based systems. In this paper, we present an application of long short-term memory (LSTM) deep neural networks as a postfiltering step in HMM-based speech synthesis. Our motivation stems from a similar desire to obtain characteristics which are closer to those of natural speech. The paper analyzes four types of postfilters obtained using five voices, which range from a single postfilter to enhance all the parameters, to a multi-stream proposal which separately enhances groups of parameters. The different proposals are evaluated using three objective measures and are statistically compared to determine any significance between them. The results described in the paper indicate that HMM-based voices can be enhanced using this approach, especially for the multi-stream postfilters on the considered objective measures.

Sparsely Connected and Disjointly Trained Deep Neural Networks for Low Resource Behavioral Annotation: Acoustic Classification in Couples’ Therapy

10.21437/interspeech.2016-1217 ◽

2016 ◽

Cited By ~ 4

Author(s):

Haoqi Li ◽

Brian Baucom ◽

Panayiotis Georgiou

Keyword(s):

Neural Networks ◽

Couples Therapy ◽

Deep Neural Networks ◽

Low Resource ◽

Acoustic Classification

Recognizing emotion in speech and text using Deep Neural Networks and Mel-Frequency Cepstral Coefficients

Strad Research ◽

10.37896/sr8.5/039 ◽

2021 ◽

Vol 8 (5) ◽

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Mel Frequency Cepstral Coefficients ◽

Cepstral Coefficients

Acoustic classification using semi-supervised Deep Neural Networks and stochastic entropy-regularization over nearest-neighbor graphs

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp.2017.7952653 ◽

2017 ◽

Cited By ~ 4

Author(s):

Sunil Thulasidasan ◽

Jeffrey Bilmes

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Nearest Neighbor ◽

Acoustic Classification

Align, then memorise: the dynamics of learning with feedback alignment

Journal of Physics A Mathematical and Theoretical ◽

10.1088/1751-8121/ac411b ◽

2021 ◽

Author(s):

Maria Refinetti ◽

Stéphane d'Ascoli ◽

Ruben Ohana ◽

Sebastian Goldt

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

State Of The Art ◽

Simple Explanation ◽

Low Loss ◽

Convolutional Networks ◽

Linear Networks ◽

Alignment Algorithms ◽

Direct Feedback ◽

The Impact

Direct Feedback Alignment (DFA) is emerging as an efficient and biologically plausible alternative to backpropagation for training deep neural networks. Despite relying on random feedback weights for the backward pass, DFA successfully trains state-of-the-art models such as Transformers. On the other hand, it notoriously fails to train convolutional networks. An understanding of the inner workings of DFA to explain these diverging results remains elusive. Here, we propose a theory of feedback alignment algorithms. We first show that learning in shallow networks proceeds in two steps: an alignment phase, where the model adapts its weights to align the approximate gradient with the true gradient of the loss function, is followed by a memorisation phase, where the model focuses on fitting the data. This two-step process has a degeneracy breaking effect: out of all the low-loss solutions in the landscape, a network trained with DFA naturally converges to the solution which maximises gradient alignment. We also identify a key quantity underlying alignment in deep linear networks: the conditioning of the alignment matrices. The latter enables a detailed understanding of the impact of data structure on alignment, and suggests a simple explanation for the well-known failure of DFA to train convolutional neural networks. Numerical experiments on MNIST and CIFAR10 clearly demonstrate degeneracy breaking in deep non-linear networks and show that the align-then-memorize process occurs sequentially from the bottom layers of the network to the top.
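The difference between backpropagation and the random feedback pass described in this abstract can be sketched for a single hidden layer. Backpropagation sends the output error back through the transposed output weights, whereas DFA sends it through a fixed random matrix. The network sizes, the tanh activation, the squared-error loss and all names below are assumptions chosen for illustration, not the paper's experimental setup; with only one hidden layer, DFA and plain feedback alignment coincide.

```python
import math
import random


def matvec(m, v):
    """Matrix-vector product for a list-of-rows matrix."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


def transpose(m):
    return [list(col) for col in zip(*m)]


def hidden_delta(feedback, error, hidden):
    """Project the output error to the hidden layer through `feedback`,
    then multiply by the tanh derivative (1 - h^2)."""
    projected = matvec(feedback, error)
    return [p * (1.0 - h * h) for p, h in zip(projected, hidden)]


def cosine(u, v):
    """Cosine similarity, used here as the gradient alignment measure."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


rng = random.Random(0)
n_in, n_hidden, n_out = 3, 4, 2  # hypothetical layer sizes
W1 = [[rng.gauss(0, 0.5) for _ in range(n_in)] for _ in range(n_hidden)]
W2 = [[rng.gauss(0, 0.5) for _ in range(n_hidden)] for _ in range(n_out)]
B = [[rng.gauss(0, 0.5) for _ in range(n_out)] for _ in range(n_hidden)]  # fixed random feedback

# Forward pass on one made-up sample with loss 0.5 * ||output - target||^2.
x, target = [1.0, -0.5, 0.25], [0.5, -1.0]
hidden = [math.tanh(a) for a in matvec(W1, x)]
output = matvec(W2, hidden)
error = [y - t for y, t in zip(output, target)]

delta_bp = hidden_delta(transpose(W2), error, hidden)  # backpropagation: true gradient signal
delta_dfa = hidden_delta(B, error, hidden)             # DFA: random feedback weights
alignment = cosine(delta_bp, delta_dfa)                # +1 means perfectly aligned
```

In either scheme the update for the first-layer weight `W1[i][j]` is proportional to `delta[i] * x[j]`. The `alignment` value corresponds to the kind of quantity the abstract describes as growing during the alignment phase, when the approximate gradient lines up with the true one.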

Deep Neural Networks for Shimmer Approximation in Synthesized Audio Signal

Communications in Computer and Information Science - Computer Science – CACIC 2017 ◽

10.1007/978-3-319-75214-3_1 ◽

2018 ◽

pp. 3-12

Author(s):

Mario Alejandro García ◽

Eduardo Atilio Destéfanis

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Audio Signal
