Movement Tube Detection Network Integrating 3D CNN and Object Detection Framework to Detect Fall

Song Zou; Weidong Min; Lingfeng Liu; Qi Wang; Xiang Zhou

doi:10.3390/electronics10080898

Movement Tube Detection Network Integrating 3D CNN and Object Detection Framework to Detect Fall

Electronics ◽

10.3390/electronics10080898 ◽

2021 ◽

Vol 10 (8) ◽

pp. 898

Author(s):

Song Zou ◽

Weidong Min ◽

Lingfeng Liu ◽

Qi Wang ◽

Xiang Zhou

Keyword(s):

Neural Network ◽

Object Detection ◽

Large Scale ◽

Video Clip ◽

Fall Detection ◽

Detection Methods ◽

Time Range ◽

Temporal Dimension ◽

Spatio Temporal ◽

3D Cnn

Unlike most of the existing neural network-based fall detection methods, which only detect fall at the time range, the algorithm proposed in this paper detect fall in both spatial and temporal dimension. A movement tube detection network integrating 3D CNN and object detection framework such as SSD is proposed to detect human fall with constrained movement tubes. The constrained movement tube, which encapsulates the person with a sequence of bounding boxes, has the merits of encapsulating the person closely and avoiding peripheral interference. A 3D convolutional neural network is used to encode the motion and appearance features of a video clip, which are fed into the tube anchors generation layer, softmax classification, and movement tube regression layer. The movement tube regression layer fine tunes the tube anchors to the constrained movement tubes. A large-scale spatio-temporal (LSST) fall dataset is constructed using self-collected data to evaluate the fall detection in both spatial and temporal dimensions. LSST has three characteristics of large scale, annotation, and posture and viewpoint diversities. Furthermore, the comparative experiments on a public dataset demonstrate that the proposed algorithm achieved sensitivity, specificity an accuracy of 100%, 97.04%, and 97.23%, respectively, outperforms the existing methods.

Download Full-text

Learning spatio-temporal visual features by a large scale neural network model

Neuroscience Research ◽

10.1016/j.neures.2010.07.1925 ◽

2010 ◽

Vol 68 ◽

pp. e434

Author(s):

Naoto Yukinawa ◽

Shin Ishii

Keyword(s):

Neural Network ◽

Network Model ◽

Neural Network Model ◽

Large Scale ◽

Visual Features ◽

Spatio Temporal

Download Full-text

A Convolutional Neural Network to Perform Object Detection and Identification in Visual Large-Scale Data

Big Data ◽

10.1089/big.2019.0093 ◽

2020 ◽

Author(s):

Riadh Ayachi ◽

Yahia Said ◽

Mohamed Atri

Keyword(s):

Neural Network ◽

Object Detection ◽

Convolutional Neural Network ◽

Large Scale ◽

Large Scale Data ◽

Detection And Identification ◽

Scale Data

Download Full-text

The Analysis of Faults Detection Software Based on Improved Neural Network Algorithm

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.602-605.2044 ◽

2014 ◽

Vol 602-605 ◽

pp. 2044-2047

Author(s):

Miao Yan ◽

Zhi Bao Liu

Keyword(s):

Neural Network ◽

Large Scale ◽

Detection Methods ◽

Detection Accuracy ◽

Improved Method ◽

Software Faults ◽

Component Software ◽

The Neural Network ◽

Neural Network Algorithm ◽

Detection Software

The large-scale software is consisted of the components which are quite different. The detection accuracy of the traditional faults detection methods for the large-scale component software is not satisfactory. This paper proposes a large-scale software faults detection methods based on improved neural network combining the features of the large-scale software by computing the stable probability and building the neural network faults detection models. The proposed method can analyze the serial faults of the large-scale software to determine the positions of the faults. The experiment and simulation results show that the improved method for large-scale software fault detection can greatly improve the accuracy.

Download Full-text

An Incremental Class-Learning Approach with Acoustic Novelty Detection for Acoustic Event Recognition

Sensors ◽

10.3390/s21196622 ◽

2021 ◽

Vol 21 (19) ◽

pp. 6622

Author(s):

Barış Bayram ◽

Gökhan İnce

Keyword(s):

Neural Network ◽

Acoustic Signal ◽

Large Scale ◽

Short Term Memory ◽

Novelty Detection ◽

Event Recognition ◽

Acoustic Event ◽

Audio Features ◽

Spatio Temporal ◽

Self Learning

Acoustic scene analysis (ASA) relies on the dynamic sensing and understanding of stationary and non-stationary sounds from various events, background noises and human actions with objects. However, the spatio-temporal nature of the sound signals may not be stationary, and novel events may exist that eventually deteriorate the performance of the analysis. In this study, a self-learning-based ASA for acoustic event recognition (AER) is presented to detect and incrementally learn novel acoustic events by tackling catastrophic forgetting. The proposed ASA framework comprises six elements: (1) raw acoustic signal pre-processing, (2) low-level and deep audio feature extraction, (3) acoustic novelty detection (AND), (4) acoustic signal augmentations, (5) incremental class-learning (ICL) (of the audio features of the novel events) and (6) AER. The self-learning on different types of audio features extracted from the acoustic signals of various events occurs without human supervision. For the extraction of deep audio representations, in addition to visual geometry group (VGG) and residual neural network (ResNet), time-delay neural network (TDNN) and TDNN based long short-term memory (TDNN–LSTM) networks are pre-trained using a large-scale audio dataset, Google AudioSet. The performances of ICL with AND using Mel-spectrograms, and deep features with TDNNs, VGG, and ResNet from the Mel-spectrograms are validated on benchmark audio datasets such as ESC-10, ESC-50, UrbanSound8K (US8K), and an audio dataset collected by the authors in a real domestic environment.

Download Full-text

A Lightweight Keypoint-Based Oriented Object Detection of Remote Sensing Images

Remote Sensing ◽

10.3390/rs13132459 ◽

2021 ◽

Vol 13 (13) ◽

pp. 2459

Author(s):

Yangyang Li ◽

Heting Mao ◽

Ruijiao Liu ◽

Xuan Pei ◽

Licheng Jiao ◽

...

Keyword(s):

Remote Sensing ◽

Object Detection ◽

Large Scale ◽

Detection Methods ◽

Gaussian Kernel ◽

Remote Sensing Images ◽

Computational Overhead ◽

Comparable Performance ◽

Bounding Boxes ◽

Oriented Object

Object detection in remote sensing images has been widely used in military and civilian fields and is a challenging task due to the complex background, large-scale variation, and dense arrangement in arbitrary orientations of objects. In addition, existing object detection methods rely on the increasingly deeper network, which increases a lot of computational overhead and parameters, and is unfavorable to deployment on the edge devices. In this paper, we proposed a lightweight keypoint-based oriented object detector for remote sensing images. First, we propose a semantic transfer block (STB) when merging shallow and deep features, which reduces noise and restores the semantic information. Then, the proposed adaptive Gaussian kernel (AGK) is adapted to objects of different scales, and further improves detection performance. Finally, we propose the distillation loss associated with object detection to obtain a lightweight student network. Experiments on the HRSC2016 and UCAS-AOD datasets show that the proposed method adapts to different scale objects, obtains accurate bounding boxes, and reduces the influence of complex backgrounds. The comparison with mainstream methods proves that our method has comparable performance under lightweight.

Download Full-text

A Comparative Analysis of Object Detection Metrics with a Companion Open-Source Toolkit

Electronics ◽

10.3390/electronics10030279 ◽

2021 ◽

Vol 10 (3) ◽

pp. 279

Author(s):

Rafael Padilla ◽

Wesley L. Passos ◽

Thadeu L. B. Dias ◽

Sergio L. Netto ◽

Eduardo A. B. da Silva

Keyword(s):

Comparative Analysis ◽

Object Detection ◽

Open Source ◽

Performance Metrics ◽

Ground Truth ◽

Detection Methods ◽

Detection Algorithms ◽

Spatio Temporal ◽

Bounding Boxes ◽

Temporal Overlap

Recent outstanding results of supervised object detection in competitions and challenges are often associated with specific metrics and datasets. The evaluation of such methods applied in different contexts have increased the demand for annotated datasets. Annotation tools represent the location and size of objects in distinct formats, leading to a lack of consensus on the representation. Such a scenario often complicates the comparison of object detection methods. This work alleviates this problem along the following lines: (i) It provides an overview of the most relevant evaluation methods used in object detection competitions, highlighting their peculiarities, differences, and advantages; (ii) it examines the most used annotation formats, showing how different implementations may influence the assessment results; and (iii) it provides a novel open-source toolkit supporting different annotation formats and 15 performance metrics, making it easy for researchers to evaluate the performance of their detection algorithms in most known datasets. In addition, this work proposes a new metric, also included in the toolkit, for evaluating object detection in videos that is based on the spatio-temporal overlap between the ground-truth and detected bounding boxes.

Download Full-text

Managing spatio-temporal complexity in Hopfield neural network simulations for large-scale static optimization

Mathematics and Computers in Simulation ◽

10.1016/j.matcom.2003.09.023 ◽

2004 ◽

Vol 64 (2) ◽

pp. 279-293 ◽

Cited By ~ 9

Author(s):

Gursel Serpen

Keyword(s):

Neural Network ◽

Large Scale ◽

Hopfield Neural Network ◽

Static Optimization ◽

Temporal Complexity ◽

Network Simulations ◽

Spatio Temporal

Download Full-text

Assessment of CNN-Based Methods for Individual Tree Detection on Images Captured by RGB Cameras Attached to UAVs

Sensors ◽

10.3390/s19163595 ◽

2019 ◽

Vol 19 (16) ◽

pp. 3595 ◽

Cited By ~ 19

Author(s):

Anderson Aparecido dos Santos ◽

José Marcato Junior ◽

Márcio Santos Araújo ◽

David Robledo Di Martini ◽

Everton Castelão Tetila ◽

...

Keyword(s):

Neural Network ◽

Object Detection ◽

Convolutional Neural Network ◽

Spatial Resolution ◽

Tree Species ◽

Remote Sensing Data ◽

Target Object ◽

Detection Methods ◽

Individual Tree ◽

Art Object

Detection and classification of tree species from remote sensing data were performed using mainly multispectral and hyperspectral images and Light Detection And Ranging (LiDAR) data. Despite the comparatively lower cost and higher spatial resolution, few studies focused on images captured by Red-Green-Blue (RGB) sensors. Besides, the recent years have witnessed an impressive progress of deep learning methods for object detection. Motivated by this scenario, we proposed and evaluated the usage of Convolutional Neural Network (CNN)-based methods combined with Unmanned Aerial Vehicle (UAV) high spatial resolution RGB imagery for the detection of law protected tree species. Three state-of-the-art object detection methods were evaluated: Faster Region-based Convolutional Neural Network (Faster R-CNN), YOLOv3 and RetinaNet. A dataset was built to assess the selected methods, comprising 392 RBG images captured from August 2018 to February 2019, over a forested urban area in midwest Brazil. The target object is an important tree species threatened by extinction known as Dipteryx alata Vogel (Fabaceae). The experimental analysis delivered average precision around 92% with an associated processing times below 30 miliseconds.

Download Full-text

Multi-Scale Feature Integrated Attention-Based Rotation Network for Object Detection in VHR Aerial Images

Sensors ◽

10.3390/s20061686 ◽

2020 ◽

Vol 20 (6) ◽

pp. 1686 ◽

Cited By ~ 3

Author(s):

Feng Yang ◽

Wentong Li ◽

Haiwei Hu ◽

Wanyi Li ◽

Peng Wang

Keyword(s):

Object Detection ◽

Large Scale ◽

Ground Truth ◽

Classification Performance ◽

Aerial Images ◽

Detection Methods ◽

Robust Detection ◽

Scale Feature ◽

Multi Scale ◽

Bounding Boxes

Accurate and robust detection of multi-class objects in very high resolution (VHR) aerial images has been playing a significant role in many real-world applications. The traditional detection methods have made remarkable progresses with horizontal bounding boxes (HBBs) due to CNNs. However, HBB detection methods still exhibit limitations including the missed detection and the redundant detection regions, especially for densely-distributed and strip-like objects. Besides, large scale variations and diverse background also bring in many challenges. Aiming to address these problems, an effective region-based object detection framework named Multi-scale Feature Integration Attention Rotation Network (MFIAR-Net) is proposed for aerial images with oriented bounding boxes (OBBs), which promotes the integration of the inherent multi-scale pyramid features to generate a discriminative feature map. Meanwhile, the double-path feature attention network supervised by the mask information of ground truth is introduced to guide the network to focus on object regions and suppress the irrelevant noise. To boost the rotation regression and classification performance, we present a robust Rotation Detection Network, which can generate efficient OBB representation. Extensive experiments and comprehensive evaluations on two publicly available datasets demonstrate the effectiveness of the proposed framework.

Download Full-text

SmokeNet: Satellite Smoke Scene Detection Using Convolutional Neural Network with Spatial and Channel-Wise Attention

Remote Sensing ◽

10.3390/rs11141702 ◽

2019 ◽

Vol 11 (14) ◽

pp. 1702 ◽

Cited By ~ 8

Author(s):

Rui Ba ◽

Chen Chen ◽

Jing Yuan ◽

Weiguo Song ◽

Siuming Lo

Keyword(s):

Neural Network ◽

Satellite Imagery ◽

Large Scale ◽

State Of The Art ◽

Kappa Coefficient ◽

Feature Representation ◽

Detection Methods ◽

Smoke Detection ◽

Training Images ◽

Moderate Resolution Imaging Spectroradiometer

A variety of environmental analysis applications have been advanced by the use of satellite remote sensing. Smoke detection based on satellite imagery is imperative for wildfire detection and monitoring. However, the commonly used smoke detection methods mainly focus on smoke discrimination from a few specific classes, which reduces their applicability in different regions of various classes. To this end, in this paper, we present a new large-scale satellite imagery smoke detection benchmark based on Moderate Resolution Imaging Spectroradiometer (MODIS) data, namely USTC_SmokeRS, consisting of 6225 satellite images from six classes (i.e., cloud, dust, haze, land, seaside, and smoke) and covering various areas/regions over the world. To build a baseline for smoke detection in satellite imagery, we evaluate several state-of-the-art deep learning-based image classification models. Moreover, we propose a new convolution neural network (CNN) model, SmokeNet, which incorporates spatial and channel-wise attention in CNN to enhance feature representation for scene classification. The experimental results of our method using different proportions (16%, 32%, 48%, and 64%) of training images reveal that our model outperforms other approaches with higher accuracy and Kappa coefficient. Specifically, the proposed SmokeNet model trained with 64% training images achieves the best accuracy of 92.75% and Kappa coefficient of 0.9130. The model trained with 16% training images can also improve the classification accuracy and Kappa coefficient by at least 4.99% and 0.06, respectively, over the state-of-the-art models.

Download Full-text