Improving Real-Time Hand Gesture Recognition with Semantic Segmentation

Sensors ◽  
2021 ◽  
Vol 21 (2) ◽  
pp. 356
Author(s):  
Gibran Benitez-Garcia ◽  
Lidia Prudente-Tixteco ◽  
Luis Carlos Castro-Madrid ◽  
Rocio Toscano-Medina ◽  
Jesus Olivares-Mercado ◽  
...  

Hand gesture recognition (HGR) plays a central role in human–computer interaction, covering a wide range of applications in the automotive sector, consumer electronics, home automation, and others. In recent years, accurate and efficient deep learning models have been proposed for real-time applications. However, the most accurate approaches tend to employ multiple modalities derived from RGB input frames, such as optical flow, which limits real-time performance due to the heavy extra computational cost. In this paper, we avoid the optical flow computation by proposing a real-time hand gesture recognition method based on RGB frames combined with hand segmentation masks. We employ a lightweight semantic segmentation method (FASSD-Net) to boost the accuracy of two efficient HGR methods: Temporal Segment Networks (TSN) and Temporal Shift Modules (TSM). We demonstrate the efficiency of the proposal on our IPN Hand dataset, which includes thirteen different gestures focused on interaction with touchless screens. The experimental results show that our approach significantly surpasses the accuracy of the original TSN and TSM algorithms while maintaining real-time performance.
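The two ideas in this abstract can be illustrated with a minimal, framework-free sketch: TSN-style sparse sampling of frames, and replacing the optical-flow modality with a segmentation mask stacked as an extra input channel. Function names and data layouts here are illustrative assumptions, not the authors' implementation.

```python
def tsn_sample_indices(num_frames, num_segments):
    """TSN-style sparse sampling: split the clip into equal-length
    segments and take the centre frame of each segment."""
    seg = num_frames / num_segments
    return [int(seg * i + seg / 2) for i in range(num_segments)]

def attach_mask(rgb_frame, mask):
    """Append the hand-segmentation mask as a fourth channel, so the
    HGR backbone consumes RGB + mask instead of RGB + optical flow.
    rgb_frame: list of (r, g, b) pixel tuples; mask: list of values."""
    return [pixel + (m,) for pixel, m in zip(rgb_frame, mask)]
```

For a 30-frame clip and 3 segments, `tsn_sample_indices` picks frames 5, 15, and 25; the sampled frames would then be mask-augmented before entering the network.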

2020 ◽  
Vol 17 (4) ◽  
pp. 497-506
Author(s):  
Sunil Patel ◽  
Ramji Makwana

Automatic classification of dynamic hand gestures is challenging due to the large diversity within each gesture class, low resolution, and the fact that gestures are performed with the fingers. These challenges have drawn many researchers to the area. Recently, deep neural networks have been used for implicit feature extraction, with a softmax layer for classification. In this paper, we propose a method based on a two-dimensional convolutional neural network that performs detection and classification of hand gestures simultaneously from multimodal Red, Green, Blue, Depth (RGBD) and optical-flow data, and passes the resulting features to a Long Short-Term Memory (LSTM) recurrent network for frame-to-frame probability generation, with a Connectionist Temporal Classification (CTC) network for loss calculation. We compute optical flow from the Red, Green, Blue (RGB) data to capture the motion information present in the video. The CTC model efficiently evaluates all possible alignments of a hand gesture via dynamic programming and checks frame-to-frame consistency of visual similarity in the unsegmented input stream. The CTC network finds the most probable frame sequence for a gesture class; the frame with the highest probability value is selected from the CTC network by max decoding. The entire network is trained end-to-end with the CTC loss for gesture recognition. We evaluate on the challenging Vision for Intelligent Vehicles and Applications (VIVA) dataset for dynamic hand gesture recognition, captured with RGB and depth data. On the VIVA dataset, our proposed hand gesture recognition technique outperforms competing state-of-the-art algorithms, achieving an accuracy of 86%.
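The "max decoding" step the abstract describes is standard CTC greedy decoding: take the argmax class per frame, collapse consecutive repeats, and drop the blank symbol. A minimal stdlib sketch (the blank index and the probability layout are assumptions for illustration):

```python
BLANK = 0  # assumed index of the CTC blank symbol

def ctc_greedy_decode(frame_probs):
    """Greedy (max) CTC decoding: per-frame argmax, then collapse
    consecutive repeats and remove blanks.
    frame_probs: list of per-frame class-probability lists."""
    path = [max(range(len(p)), key=p.__getitem__) for p in frame_probs]
    decoded, prev = [], None
    for label in path:
        if label != prev and label != BLANK:
            decoded.append(label)
        prev = label
    return decoded
```

For the per-frame argmax path `[1, 1, 0, 1]`, the repeats collapse to `1`, the blank `0` is dropped, and the second `1` survives because a blank separated it, yielding `[1, 1]`.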


2021 ◽  
Vol 2021 (1) ◽  
Author(s):  
Samy Bakheet ◽  
Ayoub Al-Hamadi

Robust vision-based hand pose estimation is highly sought after but remains a challenging task, due to its inherent difficulty, partially caused by self-occlusion among the fingers. In this paper, an innovative framework for real-time static hand gesture recognition is introduced, based on an optimized shape representation built from multiple shape cues. The framework incorporates a specific module for hand pose estimation based on depth map data, where the hand silhouette is first extracted from the extremely detailed and accurate depth map captured by a time-of-flight (ToF) depth sensor. A hybrid multi-modal descriptor that integrates multiple affine-invariant boundary-based and region-based features is created from the hand silhouette to obtain a reliable and representative description of individual gestures. Finally, an ensemble of one-vs.-all support vector machines (SVMs) is independently trained on each of these learned feature representations to perform gesture classification. When evaluated on a publicly available dataset incorporating a relatively large and diverse collection of egocentric hand gestures, the approach yields encouraging results that compare very favorably with those reported in the literature, while maintaining real-time operation.
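The final classification stage described here, an ensemble of one-vs.-all SVMs, reduces at inference time to scoring the shape descriptor with each class's decision function and picking the highest score. A minimal sketch under that assumption (the scorers stand in for trained SVM decision functions):

```python
def ovr_classify(feature, scorers):
    """One-vs.-all ensemble inference: every gesture label owns a
    decision function; return the label with the highest score.
    scorers: dict mapping label -> callable(feature) -> float."""
    return max(scorers, key=lambda label: scorers[label](feature))
```

In practice each scorer would be a trained SVM's signed distance to its separating hyperplane; here simple lambdas illustrate the voting rule.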


2012 ◽  
Vol 6 ◽  
pp. 98-107 ◽  
Author(s):  
Amit Gupta ◽  
Vijay Kumar Sehrawat ◽  
Mamta Khosla

2021 ◽  
Vol 102 ◽  
pp. 04009
Author(s):  
Naoto Ageishi ◽  
Fukuchi Tomohide ◽  
Abderazek Ben Abdallah

Hand gestures are a form of nonverbal communication in which visible bodily actions convey important messages. Recently, hand gesture recognition has received significant attention from the research community for various applications, including advanced driver assistance systems, prosthetics, and robotic control. Accurate and fast classification of hand gestures is therefore required. In this research, we created a deep neural network as the first step toward developing a real-time, camera-only hand gesture recognition system that does not rely on electroencephalogram (EEG) signals. We present the system software architecture in a fair amount of detail. The proposed system was able to recognize hand signs with an accuracy of 97.31%.
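A deep network like the one described typically ends in a softmax layer that turns the per-class logits for a hand sign into probabilities, with the argmax taken as the prediction. A numerically stable stdlib sketch of that final step (the logits are illustrative):

```python
import math

def softmax(logits):
    """Numerically stable softmax: shift by the max logit before
    exponentiating, then normalize to a probability distribution."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]
```

The predicted gesture class is then `probs.index(max(probs))`; subtracting the max logit first avoids overflow for large activations without changing the result.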

