Real-Time Continuous Action Recognition Using Pose Contexts With Depth Sensors

IEEE Access ◽  
2018 ◽  
Vol 6 ◽  
pp. 51708-51720 ◽  
Author(s):  
Hejun Wu ◽  
Zhenye Huang ◽  
Biao Hu ◽  
Zhi Yu ◽  
Xiying Li ◽  
...  
2021 ◽  
Vol 11 (11) ◽  
pp. 4940
Author(s):  
Jinsoo Kim ◽  
Jeongho Cho

Research on video data faces the difficulty of extracting not only spatial but also temporal features, and human action recognition (HAR) is a representative field that applies convolutional neural networks (CNNs) to video data. Action recognition performance has improved, but owing to model complexity, some limitations on real-time operation persist. Therefore, a lightweight CNN-based single-stream HAR model that can operate in real time is proposed. The proposed model extracts spatial feature maps by applying a CNN to the images that compose the video and uses the frame change rate of sequential images as temporal information. The spatial feature maps are weighted-averaged by frame change, transformed into spatiotemporal features, and input to multilayer perceptrons, which have lower complexity than other HAR models; thus, the method is well suited to a single embedded system connected to CCTV. Evaluation of action recognition accuracy and data processing speed on the challenging UCF-101 action recognition benchmark showed higher accuracy than a HAR model using long short-term memory when only a small number of video frames is available, and the fast data processing speed confirmed the feasibility of real-time operation. In addition, the proposed weighted-mean-based HAR model was verified on a Jetson Nano to confirm its applicability to low-cost GPU-based embedded systems.
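The weighting step described in this abstract can be sketched roughly as follows. This is a minimal NumPy illustration, not the authors' implementation: the function names are invented, and using the mean absolute pixel difference between consecutive frames as the "frame change rate" is an assumption.

```python
import numpy as np

def frame_change_weights(frames):
    # frames: list of (H, W) grayscale frames.
    # Assumed frame-change measure: mean absolute difference
    # between consecutive frames, normalized to sum to 1.
    diffs = [np.abs(frames[i] - frames[i - 1]).mean() for i in range(1, len(frames))]
    diffs = np.array([diffs[0]] + diffs)  # reuse first diff for frame 0
    return diffs / diffs.sum()

def weighted_mean_feature(feature_maps, weights):
    # feature_maps: (T, D) per-frame CNN feature vectors; weights: (T,).
    # Weighted average collapses the time axis into one
    # spatiotemporal feature vector for the MLP classifier.
    return (feature_maps * weights[:, None]).sum(axis=0)
```

The appeal of this scheme is that the only temporal computation is a per-frame scalar, so the time axis adds almost no cost on an embedded GPU.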


2021 ◽  
Vol 20 (3) ◽  
pp. 1-22
Author(s):  
David Langerman ◽  
Alan George

High-resolution, low-latency applications in computer vision are ubiquitous in today’s world of mixed-reality devices. These innovations provide a platform that can leverage the improving technology of depth sensors and embedded accelerators to enable higher-resolution, lower-latency processing of 3D scenes using depth-upsampling algorithms. This research demonstrates that filter-based upsampling algorithms are feasible for mixed-reality applications on low-power hardware accelerators. The authors parallelized and evaluated a depth-upsampling algorithm on two different devices: a reconfigurable-logic FPGA embedded within a low-power SoC, and a fixed-logic embedded graphics processing unit. Both accelerators are shown to meet the real-time requirement of 11 ms latency for mixed-reality applications.
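A common filter-based depth-upsampling approach is joint bilateral upsampling, where a high-resolution guide image steers the interpolation of a low-resolution depth map. The sketch below is a generic illustration of that family of filters, not the specific algorithm or accelerator kernel evaluated in this paper; all parameter names (`sigma_s`, `sigma_r`, `radius`) are assumptions.

```python
import numpy as np

def joint_bilateral_upsample(depth_lo, guide_hi, scale, sigma_s=1.0, sigma_r=0.1, radius=1):
    # depth_lo: (h, w) low-res depth; guide_hi: (H, W) high-res intensity guide.
    # Each high-res pixel averages nearby low-res depth samples, weighted by
    # spatial distance (sigma_s) and guide-intensity similarity (sigma_r).
    H, W = guide_hi.shape
    out = np.zeros((H, W))
    for y in range(H):
        for x in range(W):
            yl, xl = y / scale, x / scale  # position in low-res coordinates
            num = den = 0.0
            for dy in range(-radius, radius + 1):
                for dx in range(-radius, radius + 1):
                    py, px = int(round(yl)) + dy, int(round(xl)) + dx
                    if 0 <= py < depth_lo.shape[0] and 0 <= px < depth_lo.shape[1]:
                        gy, gx = min(py * scale, H - 1), min(px * scale, W - 1)
                        ws = np.exp(-((py - yl) ** 2 + (px - xl) ** 2) / (2 * sigma_s ** 2))
                        wr = np.exp(-((guide_hi[y, x] - guide_hi[gy, gx]) ** 2) / (2 * sigma_r ** 2))
                        num += ws * wr * depth_lo[py, px]
                        den += ws * wr
            out[y, x] = num / den
    return out
```

The per-pixel independence of this filter is what makes it attractive for FPGA and GPU parallelization: every output pixel can be computed concurrently.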


Author(s):  
Mingqin Liu ◽  
Xiaoguang Zhang ◽  
Guiyun Xu

Recognizing a continuous image sequence is more difficult than recognizing a single image, because both the classification of continuous image sequences and the recognition of action boundaries must be very accurate. Hence, a sequence-alignment-based method for action segmentation and classification is proposed, which reconstructs a template sequence by estimating the mean action of a class category and computes the distance between a single image and the template sequence by sparse coding within Dynamic Time Warping. The proposed method is compared with the methods of Kulkarni et al. [Continuous action recognition based on sequence alignment, Int. J. Comput. Vis., pp. 1–26] and Hoai et al. [Joint segmentation and classification of human actions in video, IEEE Conf. Computer Vision and Pattern Recognition, 2008, pp. 108–119] on the accuracy of both continuous and isolated recognition, and it clearly outperforms the other methods. When applied to continuous gesture classification, it not only recognizes gesture categories more quickly and accurately, but also solves continuous action recognition in video more realistically than other existing methods.
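The core matching step, comparing an observed sequence against a per-class mean template with Dynamic Time Warping, can be sketched as follows. This is the standard DTW recursion with a Euclidean local cost as a stand-in; the paper's actual local distance uses sparse coding, and the function names here are illustrative.

```python
import numpy as np

def dtw_distance(seq, template):
    # seq: (n, d) observed features; template: (m, d) class mean template.
    # D[i, j] is the minimal cumulative cost of aligning seq[:i] with template[:j].
    n, m = len(seq), len(template)
    D = np.full((n + 1, m + 1), np.inf)
    D[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = np.linalg.norm(seq[i - 1] - template[j - 1])  # local cost
            D[i, j] = cost + min(D[i - 1, j], D[i, j - 1], D[i - 1, j - 1])
    return D[n, m]

def classify(seq, templates):
    # templates: dict mapping class label -> mean template sequence.
    # Predict the class whose template aligns with the lowest DTW cost.
    return min(templates, key=lambda c: dtw_distance(seq, templates[c]))
```

Because DTW tolerates non-linear time stretching, a single mean template per class can match executions of the same gesture performed at different speeds.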

