Deep Learning-Based Portable Device for Audio Distress Signal Recognition in Urban Areas

2020
Vol 10 (21)
pp. 7448
Author(s):  
Jorge Felipe Gaviria ◽  
Alejandra Escalante-Perez ◽  
Juan Camilo Castiblanco ◽  
Nicolas Vergara ◽  
Valentina Parra-Garces ◽  
...  

Real-time automatic identification of audio distress signals in urban areas is a task that, in a smart city, can improve response times in emergency alert systems. The main challenge in this problem lies in finding a model that can accurately recognize this type of signal in the presence of background noise while allowing real-time processing. In this paper, we present the design of a portable, low-cost device for accurate audio distress signal recognition in real urban scenarios based on deep learning models. As real audio distress recordings in urban areas have not been collected and made publicly available so far, we first constructed a database of audios recorded in urban areas with a low-cost microphone. Using this database, we trained a deep multi-headed 2D convolutional neural network that processes temporal and frequency features to accurately recognize audio distress signals in noisy environments, with a significant performance improvement over other methods from the literature. We then deployed and assessed the trained convolutional neural network on a Raspberry Pi that, along with the low-cost microphone, constitutes a device for accurate real-time audio recognition. Source code and database are publicly available.
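As a hedged illustration of the kind of inputs a multi-headed network like this might consume, the sketch below computes a magnitude spectrogram (the frequency head's input) and simple per-frame temporal features (the temporal head's input) with NumPy. The frame length, hop size, and choice of features are assumptions for illustration, not the paper's exact pipeline.

```python
import numpy as np

def frame_signal(x, frame_len=512, hop=256):
    """Split a 1-D audio signal into overlapping frames."""
    n = 1 + (len(x) - frame_len) // hop
    return np.stack([x[i * hop:i * hop + frame_len] for i in range(n)])

def features(x, frame_len=512, hop=256):
    """Frequency head input: magnitude spectrogram of Hann-windowed frames.
    Temporal head input: per-frame RMS energy and zero-crossing rate."""
    frames = frame_signal(x, frame_len, hop)
    spec = np.abs(np.fft.rfft(frames * np.hanning(frame_len), axis=1))
    rms = np.sqrt(np.mean(frames ** 2, axis=1))
    zcr = np.mean(np.abs(np.diff(np.sign(frames), axis=1)) > 0, axis=1)
    return spec, np.stack([rms, zcr], axis=1)

# 1 s of synthetic audio at 16 kHz: a pure 1 kHz tone
sr = 16000
t = np.arange(sr) / sr
spec, temporal = features(np.sin(2 * np.pi * 1000 * t))
```

Each head then processes its own representation before the branches are merged for classification.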

2019
Vol 34 (11)
pp. 4924-4931
Author(s):  
Daichi Kitaguchi ◽  
Nobuyoshi Takeshita ◽  
Hiroki Matsuzaki ◽  
Hiroaki Takano ◽  
Yohei Owada ◽  
...  

Author(s):  
M A Isayev ◽  
D A Savelyev

The paper compares different convolutional neural networks, which are at the core of the most current solutions in the computer vision area. The study benchmarks these state-of-the-art solutions by criteria such as mAP (mean average precision) and FPS (frames per second) to assess real-time usability. Conclusions are drawn on the best convolutional neural network model and the deep learning methods used in each particular solution.
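As a reminder of what one of these benchmark criteria measures, the following sketch computes average precision for a single class from a ranked detection list, with mAP as the mean over classes. This is the basic ranked-list form; real benchmarks such as COCO additionally average over IoU thresholds, which is omitted here, and the scores and labels are made up.

```python
import numpy as np

def average_precision(scores, labels):
    """AP for one class: mean of the precision values observed at each
    true-positive rank. `scores` are detector confidences; `labels` mark
    each detection as correct (1) or wrong (0)."""
    order = np.argsort(scores)[::-1]           # rank by confidence
    hits = np.asarray(labels)[order]
    cum_tp = np.cumsum(hits)
    precision = cum_tp / (np.arange(len(hits)) + 1)
    return precision[hits == 1].mean() if hits.any() else 0.0

# mAP = mean of per-class APs (two illustrative classes)
ap_cat = average_precision([0.9, 0.8, 0.3], [1, 0, 1])
ap_dog = average_precision([0.7, 0.6], [1, 1])
m_ap = (ap_cat + ap_dog) / 2
```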


2021
Vol 7
pp. e402
Author(s):  
Zaid Saeb Sabri ◽  
Zhiyong Li

Smart surveillance systems are used to monitor specific areas, such as homes, buildings, and borders, and these systems can effectively detect any threats. In this work, we investigate the design of low-cost multiunit surveillance systems that can control numerous surveillance cameras to track multiple objects (i.e., people, cars, and guns) and promptly detect human activity in real time using low computational systems, such as compact or single board computers. Deep learning techniques are employed to detect certain objects to surveil homes/buildings and recognize suspicious and vital events to ensure that the system can alarm officers of relevant events, such as stranger intrusions, the presence of guns, suspicious movements, and identified fugitives. The proposed model is tested on two computational systems, specifically, a single board computer (Raspberry Pi) with the Raspbian OS and a compact computer (Intel NUC) with the Windows OS. In both systems, we employ components, such as a camera to stream real-time video and an ultrasonic sensor to alarm personnel of threats when movement is detected in restricted areas or near walls. The system program is coded in Python, and a convolutional neural network (CNN) is used to perform recognition. The program is optimized by using a foreground object detection algorithm to improve recognition in terms of both accuracy and speed. The saliency algorithm is used to slice certain required objects from scenes, such as humans, cars, and airplanes. In this regard, two saliency algorithms, based on local and global patch saliency detection, are considered. We develop a system that combines the two saliency approaches and recognizes the features extracted using these saliency techniques with a convolutional neural network. The field results demonstrate a significant improvement in detection, ranging between 34% and 99.9% for different situations.
The low percentage is related to the presence of unclear objects or activities that are different from those involving humans. However, even in the case of low accuracy, recognition and threat identification are performed with an accuracy of 100% in approximately 0.7 s, even when using computer systems with relatively weak hardware specifications, such as a single board computer (Raspberry Pi). These results prove that the proposed system can be practically used to design a low-cost and intelligent security and tracking system.
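The global patch-saliency step can be illustrated with the classic spectral-residual method: whiten the log-amplitude spectrum, keep the phase, and invert to get a map of "anomalous" regions. This NumPy version is an illustrative stand-in, not the authors' exact algorithm, and it omits the final Gaussian smoothing usually applied to the map.

```python
import numpy as np

def spectral_residual_saliency(img):
    """Global saliency map: subtract a locally smoothed log-amplitude
    spectrum from the original, keep the phase, and invert the FFT."""
    f = np.fft.fft2(img)
    log_amp = np.log1p(np.abs(f))          # log1p avoids log(0)
    phase = np.angle(f)
    # 3x3 mean smoothing of the log spectrum via circular rolls
    smooth = sum(np.roll(np.roll(log_amp, i, 0), j, 1)
                 for i in (-1, 0, 1) for j in (-1, 0, 1)) / 9.0
    residual = log_amp - smooth
    sal = np.abs(np.fft.ifft2(np.exp(residual + 1j * phase))) ** 2
    return sal / sal.max()

# a single bright patch on a flat background should light up the map
img = np.zeros((64, 64))
img[30:34, 30:34] = 1.0
sal = spectral_residual_saliency(img)
```

The salient region can then be sliced out and passed to the CNN for recognition.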


Author(s):  
S Gopi Naik

Abstract: The plan is to establish an integrated system that can manage high-quality visual information and also detect weapons quickly and efficiently. This is achieved by integrating ARM-based computer vision and optimization algorithms with deep neural networks able to detect the presence of a threat. The whole system is connected to a Raspberry Pi module, which captures the live broadcast and evaluates it using a deep convolutional neural network. Because object identification is tightly coupled with real-time video and image analysis, approaches that build sophisticated ensembles combining various low-level picture features with high-level information from object detection and scenario classifiers quickly reach a performance plateau. Deep learning models, which can learn semantic, high-level, deeper features, have been developed to overcome the issues present in such optimization algorithms. This paper reviews deep learning-based object detection frameworks that use convolutional neural network layers for a better understanding of object detection. The Mobile-Net SSD model differs in network design, training methods, and optimization functions, among other things. Weapon detection has reduced the crime rate in suspicious areas; even so, security remains a major concern in human life. The Raspberry Pi module and computer vision have been extensively used in the detection and monitoring of weapons, and with the growing demands of human safety, privacy, and live broadcasting systems that can detect and analyse images, automated surveillance of suspicious areas is becoming indispensable in intelligence work. This process uses a Mobile-Net SSD algorithm to achieve automatic weapon and object detection. Keywords: Computer Vision, Weapon and Object Detection, Raspberry Pi Camera, RTSP, SMTP, Mobile-Net SSD, CNN, Artificial Intelligence.
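One step shared by Mobile-Net SSD and similar detectors is non-maximum suppression, which prunes overlapping candidate boxes before a detection (e.g., a weapon) is reported. The sketch below is a generic greedy NMS in NumPy, not code from this system; the boxes, scores, and IoU threshold are illustrative.

```python
import numpy as np

def box_area(b):
    """Area of [x1, y1, x2, y2] boxes (rows of a 2-D array)."""
    return (b[:, 2] - b[:, 0]) * (b[:, 3] - b[:, 1])

def nms(boxes, scores, iou_thresh=0.5):
    """Greedy non-maximum suppression: keep the highest-scoring box,
    drop lower-scoring boxes that overlap it above `iou_thresh`."""
    boxes = np.asarray(boxes, dtype=float)
    order = np.argsort(scores)[::-1]
    keep = []
    while order.size:
        i = order[0]
        keep.append(int(i))
        rest = order[1:]
        x1 = np.maximum(boxes[i, 0], boxes[rest, 0])
        y1 = np.maximum(boxes[i, 1], boxes[rest, 1])
        x2 = np.minimum(boxes[i, 2], boxes[rest, 2])
        y2 = np.minimum(boxes[i, 3], boxes[rest, 3])
        inter = np.clip(x2 - x1, 0, None) * np.clip(y2 - y1, 0, None)
        iou = inter / (box_area(boxes[i:i + 1])[0] + box_area(boxes[rest]) - inter)
        order = rest[iou <= iou_thresh]
    return keep

# two heavily overlapping boxes and one separate box
kept = nms([[0, 0, 10, 10], [1, 1, 11, 11], [50, 50, 60, 60]],
           [0.9, 0.8, 0.7])
```

Here the second box overlaps the first (IoU ≈ 0.68) and is suppressed, so only the first and third survive.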


BMJ Open
2021
Vol 11 (3)
pp. e045120
Author(s):  
Robert Arntfield ◽  
Blake VanBerlo ◽  
Thamer Alaifan ◽  
Nathan Phelps ◽  
Matthew White ◽  
...  

Objectives: Lung ultrasound (LUS) is a portable, low-cost respiratory imaging tool but is challenged by user dependence and lack of diagnostic specificity. It is unknown whether the advantages of LUS implementation could be paired with deep learning (DL) techniques to match or exceed human-level diagnostic specificity among similar-appearing, pathological LUS images. Design: A convolutional neural network (CNN) was trained on LUS images with B lines of different aetiologies. CNN diagnostic performance, as validated using a 10% data holdback set, was compared with surveyed LUS-competent physicians. Setting: Two tertiary Canadian hospitals. Participants: 612 LUS videos (121 381 frames) of B lines from 243 distinct patients with either (1) COVID-19 (COVID), (2) non-COVID acute respiratory distress syndrome (NCOVID) or (3) hydrostatic pulmonary edema (HPE). Results: The trained CNN performance on the independent dataset showed an ability to discriminate between COVID (area under the receiver operating characteristic curve (AUC) 1.0), NCOVID (AUC 0.934) and HPE (AUC 1.0) pathologies. This was significantly better than physician ability (AUCs of 0.697, 0.704, 0.967 for the COVID, NCOVID and HPE classes, respectively), p<0.01. Conclusions: A DL model can distinguish similar-appearing LUS pathology, including COVID-19, that cannot be distinguished by humans. The performance gap between humans and the model suggests that subvisible biomarkers within ultrasound images could exist, and multicentre research is merited.
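The AUC values reported here have a simple probabilistic reading, sketched below via the Mann-Whitney formulation: the chance that a randomly chosen positive case is scored above a randomly chosen negative one. The scores and labels are made-up illustrations, not data from the study.

```python
def auc(scores, labels):
    """AUC as the Mann-Whitney U statistic: fraction of (positive, negative)
    pairs where the positive outscores the negative (ties count half)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# perfect separation -> AUC 1.0, as reported for the COVID and HPE classes
perfect = auc([0.9, 0.8, 0.2, 0.1], [1, 1, 0, 0])
# one mis-ranked pair out of four -> AUC 0.75
partial = auc([0.9, 0.4, 0.6, 0.1], [1, 1, 0, 0])
```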


2020
Vol 10 (2)
pp. 5466-5469
Author(s):  
S. N. Truong

In this paper, a ternary neural network with complementary binary arrays is proposed for representing the signed synaptic weights. The proposed ternary neural network is deployed on a low-cost Raspberry Pi board embedded system for speech and image recognition. In conventional neural networks, the signed synaptic weights of –1, 0, and 1 are represented by 8-bit integers. To reduce the amount of memory required for signed synaptic weights, the signed values were represented by a complementary binary array. For binary inputs, the multiplication of two binary numbers is replaced by the bit-wise AND operation to speed up the performance of the neural network. Regarding image recognition, the MNIST dataset was used for training and testing of the proposed neural network; the recognition rate was as high as 94%. The proposed ternary neural network was also applied to real-time object recognition, where the recognition rate for 10 simple objects captured from the camera was 89%. The complementary binary array for representing the signed synaptic weights reduces the memory required for storing the model's parameters and internal parameters by 75%. The proposed ternary neural network is 4.2, 2.7, and 2.4 times faster than the conventional ternary neural network for MNIST image recognition, speech command recognition, and real-time object recognition, respectively.
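A minimal sketch of the complementary-binary-array idea, assuming the straightforward encoding of +1 and –1 into two bit planes: each signed weight then takes 2 bits instead of 8 (the 75% saving), and for binary inputs the signed dot product reduces to two bit-wise ANDs and a popcount. The exact storage layout in the paper may differ.

```python
import numpy as np

def encode_ternary(w):
    """Encode signed weights {-1, 0, +1} as two complementary bit arrays
    (w_pos marks +1 entries, w_neg marks -1 entries): 2 bits per weight
    instead of an 8-bit integer."""
    w = np.asarray(w)
    return (w == 1).astype(np.uint8), (w == -1).astype(np.uint8)

def ternary_dot(x, w_pos, w_neg):
    """For a binary input vector x, multiplication becomes bit-wise AND:
    dot(w, x) = popcount(x & w_pos) - popcount(x & w_neg)."""
    x = np.asarray(x, dtype=np.uint8)
    return int((x & w_pos).sum()) - int((x & w_neg).sum())

w = [1, -1, 0, 1, -1]      # signed synaptic weights
x = [1, 1, 0, 1, 0]        # binary input
wp, wn = encode_ternary(w)
result = ternary_dot(x, wp, wn)   # equals the ordinary integer dot product
```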


2019
Vol 8 (4)
pp. 2236-2239

This paper presents face detection using a deep neural network built on a deep learning framework. Older models such as the Haar-cascade method detect faces with reasonable results, but there is some uncertainty in their accuracy, so this system uses a recent deep neural network model embedded in the latest OpenCV, loaded with pre-trained weight files. With this model, better accuracy in face detection can be achieved, which can serve further purposes such as autofocus in cameras and counting the number of people. The model detects faces accurately and paves the way for better recognition systems that can be used in many face biometric applications. For this purpose, a low-cost Raspberry Pi computer board and a camera sensor are used.


2021
Vol 328
pp. 02009
Author(s):  
Koko Joni ◽  
Anita Ima Rahmawati ◽  
Hanifudin Sukri

The world is currently being hit by the COVID-19 virus. In this New Normal era, a rule is enforced that everyone must wear a mask wherever they are. Checking masks and body temperature is still done manually, by human observation, which allows for inaccuracies in observing and checking temperature. This problem occurred at Trunojoyo Madura University, which still uses a manual mask and body-temperature checking system. So, to improve accuracy and reduce the risk of infection for checking officers, a tool was created to detect masks and temperature automatically. This study uses a camera, an MLX90614 temperature sensor, and a proximity sensor with a Raspberry Pi. The research uses a machine learning system with the deep learning Convolutional Neural Network (CNN) Single Shot Detector (SSD) method. From this study, mask detection achieved a success percentage of 93.4% and an error percentage of 6.6% over the entire test, with an average detection time of 2.63 seconds; the average time of the whole system is 3.8 seconds. There was a delay during detection due to the heavy computational load on the system, so further research should use a mini PC with better performance.
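The check itself can be sketched as a small decision function combining the mask detector's confidence, the MLX90614 temperature reading, and the proximity sensor. The thresholds and return strings below are assumptions for illustration only; the study does not specify its exact cut-offs.

```python
# Hypothetical thresholds -- assumed for illustration, not from the study.
FEVER_C = 37.5          # common fever cut-off in degrees Celsius
MASK_CONFIDENCE = 0.5   # SSD score threshold for "mask detected"

def entry_decision(mask_score, object_temp_c, person_present):
    """Allow entry only when the proximity sensor reports a person,
    the mask detector is confident, and temperature is below the cut-off."""
    if not person_present:
        return "idle"                       # nothing in front of the sensor
    if mask_score < MASK_CONFIDENCE:
        return "deny: no mask detected"
    if object_temp_c >= FEVER_C:
        return "deny: high temperature"
    return "allow"

print(entry_decision(0.93, 36.6, True))     # prints "allow"
```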


2021
Vol 3
Author(s):  
James Ren Lee ◽  
Linda Wang ◽  
Alexander Wong

While recent advances in deep learning have led to significant improvements in facial expression classification (FEC), a major challenge that remains a bottleneck for the widespread deployment of such systems is their high architectural and computational complexity. This is especially challenging given the operational requirements of various FEC applications, such as safety, marketing, learning, and assistive living, where real-time operation on low-cost embedded devices is desired. Motivated by this need for a compact, low-latency, yet accurate system capable of performing FEC in real time on low-cost embedded devices, this study proposes EmotionNet Nano, an efficient deep convolutional neural network created through a human-machine collaborative design strategy, where human experience is combined with machine meticulousness and speed in order to craft a deep neural network design catered toward real-time embedded usage. To the best of the authors' knowledge, this is the very first deep neural network architecture for facial expression recognition leveraging machine-driven design exploration in its design process, and it exhibits unique architectural characteristics such as high architectural heterogeneity and selective long-range connectivity not seen in previous FEC network architectures. Two different variants of EmotionNet Nano are presented, each with a different trade-off between architectural and computational complexity and accuracy. Experimental results using the CK+ facial expression benchmark dataset demonstrate that the proposed EmotionNet Nano networks achieved accuracy comparable to state-of-the-art FEC networks, while requiring significantly fewer parameters.
Furthermore, we demonstrate that the proposed EmotionNet Nano networks achieved real-time inference speeds (e.g., >25 FPS and >70 FPS at 15 and 30 W, respectively) and high energy efficiency (e.g., >1.7 images/sec/watt at 15 W) on an ARM embedded processor, thus further illustrating the efficacy of EmotionNet Nano for deployment on embedded devices.


2018
Author(s):  
Shuntaro Watanabe ◽  
Kazuaki Sumi ◽  
Takeshi Ise

Abstract: Classifying and mapping vegetation are very important tasks in environmental science and natural resource management. However, these tasks are not easy because conventional methods such as field surveys are highly labor intensive. Automatic identification of target objects from visual data is one of the most promising ways to reduce the costs of vegetation mapping. Although deep learning has recently become a new solution for image recognition and classification, detection of ambiguous objects such as vegetation is, in general, still considered difficult. In this paper, we investigated the potential of adapting the chopped picture method, a recently described protocol for deep learning, to detect plant communities in Google Earth images. We selected bamboo forests as the target and obtained Google Earth images from three regions in Japan. By applying a deep convolutional neural network, the model successfully learned the features of bamboo forests in Google Earth images, and the best trained model correctly detected 97% of the targets. Our results show that identification accuracy strongly depends on the image resolution and the quality of training data. They also highlight that deep learning and the chopped picture method can become a powerful tool for highly accurate automated detection and mapping of vegetation.
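The chopped picture method dices a large scene into small squares that are then classified independently by the CNN. The sketch below shows only that dicing step in NumPy; the tile size and stride are chosen for illustration rather than taken from the paper.

```python
import numpy as np

def chop(image, tile=56, stride=28):
    """Chop a large image into small, possibly overlapping square tiles.
    Each tile is classified on its own (e.g., bamboo / not bamboo), and the
    per-tile labels are reassembled into a coverage map."""
    h, w = image.shape[:2]
    tiles = [image[y:y + tile, x:x + tile]
             for y in range(0, h - tile + 1, stride)
             for x in range(0, w - tile + 1, stride)]
    return np.stack(tiles)

# a 224x224 RGB scene with 56-px tiles at 28-px stride -> a 7x7 grid of tiles
tiles = chop(np.zeros((224, 224, 3)), tile=56, stride=28)
```

Overlapping tiles (stride smaller than the tile size) give the resulting map a finer effective resolution at the cost of more classifier calls.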

