Dynamic Gesture Recognition Based on MEMP Network

Xinyu Zhang; Xiaoqiang Li

doi:10.3390/fi11040091

Dynamic Gesture Recognition Based on MEMP Network

Future Internet ◽

10.3390/fi11040091 ◽

2019 ◽

Vol 11 (4) ◽

pp. 91 ◽

Cited By ~ 4

Author(s):

Xinyu Zhang ◽

Xiaoqiang Li

Keyword(s):

Neural Network ◽

Gesture Recognition ◽

Research Direction ◽

High Accuracy ◽

Data Sets ◽

Language Recognition ◽

Sign Language Recognition ◽

Identification Rate ◽

Multiple Prediction ◽

3D Cnn

In recent years, gesture recognition has been used in many fields, such as games, robotics and sign language recognition. Human computer interaction (HCI) has been significantly improved by the development of gesture recognition, and now gesture recognition in video is an important research direction. Because each kind of neural network structure has its limitation, we proposed a neural network with alternate fusion of 3D CNN and ConvLSTM, which we called the Multiple extraction and Multiple prediction (MEMP) network. The main feature of the MEMP network is to extract and predict the temporal and spatial feature information of gesture video multiple times, which enables us to obtain a high accuracy rate. In the experimental part, three data sets (LSA64, SKIG and Chalearn 2016) are used to verify the performance of network. Our approach achieved high accuracy on those data sets. In the LSA64, the network achieved an identification rate of 99.063%. In SKIG, this network obtained the recognition rates of 97.01% and 99.02% in the RGB part and the rgb-depth part. In Chalearn 2016, the network achieved 74.57% and 78.85% recognition rates in RGB part and rgb-depth part respectively.

Download Full-text

Indian Sign Language Recognition through Hybrid ConvNet-LSTM Networks

EMITTER International Journal of Engineering Technology ◽

10.24003/emitter.v9i1.613 ◽

2021 ◽

Vol 9 (1) ◽

pp. 182-203

Author(s):

Muthu Mariappan H ◽

Dr Gomathi V

Keyword(s):

Neural Network ◽

Computer Vision ◽

Real Time ◽

Sign Language ◽

Gesture Recognition ◽

Language Translation ◽

Video Gaming ◽

Language Recognition ◽

Sign Language Recognition ◽

Indian Sign Language

Dynamic hand gesture recognition is a challenging task of Human-Computer Interaction (HCI) and Computer Vision. The potential application areas of gesture recognition include sign language translation, video gaming, video surveillance, robotics, and gesture-controlled home appliances. In the proposed research, gesture recognition is applied to recognize sign language words from real-time videos. Classifying the actions from video sequences requires both spatial and temporal features. The proposed system handles the former by the Convolutional Neural Network (CNN), which is the core of several computer vision solutions and the latter by the Recurrent Neural Network (RNN), which is more efficient in handling the sequences of movements. Thus, the real-time Indian sign language (ISL) recognition system is developed using the hybrid CNN-RNN architecture. The system is trained with the proposed CasTalk-ISL dataset. The ultimate purpose of the presented research is to deploy a real-time sign language translator to break the hurdles present in the communication between hearing-impaired people and normal people. The developed system achieves 95.99% top-1 accuracy and 99.46% top-3 accuracy on the test dataset. The obtained results outperform the existing approaches using various deep models on different datasets.

Download Full-text

A Real-Time American Sign Language Recognition System using Convolutional Neural Network for Real Datasets

TEM Journal ◽

10.18421/tem93-14 ◽

2020 ◽

pp. 937-943

Author(s):

Rasha Amer Kadhim ◽

Muntadher Khamees

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

American Sign Language ◽

Real Time ◽

Sign Language ◽

Recognition System ◽

High Accuracy ◽

American Sign ◽

Language Recognition ◽

Sign Language Recognition

In this paper, a real-time ASL recognition system was built with a ConvNet algorithm using real colouring images from a PC camera. The model is the first ASL recognition model to categorize a total of 26 letters, including (J & Z), with two new classes for space and delete, which was explored with new datasets. It was built to contain a wide diversity of attributes like different lightings, skin tones, backgrounds, and a wide variety of situations. The experimental results achieved a high accuracy of about 98.53% for the training and 98.84% for the validation. As well, the system displayed a high accuracy for all the datasets when new test data, which had not been used in the training, were introduced.

Download Full-text

Software Implementation of Gesture Recognition Algorithm Using Computer Vision

Advances in Cyber-Physical Systems ◽

10.23939/acps2021.01.021 ◽

2021 ◽

Vol 6 (1) ◽

pp. 21-26

Author(s):

Vladyslav Kotyk ◽

◽

Oksana Lashko

Keyword(s):

Neural Network ◽

Artificial Intelligence ◽

Machine Learning ◽

Computer Vision ◽

Gesture Recognition ◽

Recognition Algorithm ◽

Software Implementation ◽

Speech Impairments ◽

Language Recognition ◽

Sign Language Recognition

This paper examines the main methods and principles of image formation, display of the sign language recognition algorithm using computer vision to improve communication between people with hearing and speech impairments. This algorithm allows to effectively recognize gestures and display information in the form of labels. A system that includes the main modules for implementing this algorithm has been designed. The modules include the implementation of perception, transformation and image processing, the creation of a neural network using artificial intelligence tools to train a model for predicting input gesture labels. The aim of this work is to create a full-fledged program for implementing a real-time gesture recognition algorithm using computer vision and machine learning.

Download Full-text

Technological Aids for Deaf and Mute in Modern World

Recent Patents on Engineering ◽

10.2174/1872212114999201116214802 ◽

2020 ◽

Vol 14 ◽

Author(s):

Vasu Mehra ◽

Dhiraj Pandey ◽

Aayush Rastogi ◽

Aditya Singh ◽

Harsh Preet Singh

Keyword(s):

Sign Language ◽

Gesture Recognition ◽

Recognition System ◽

Modern World ◽

Language Recognition ◽

Sign Language Recognition ◽

Background Elimination ◽

Sign Recognition ◽

Technological Advances ◽

Better Than

Background:: People suffering from hearing and speaking disabilities have a few ways of communicating with other people. One of these is to communicate through the use of sign language. Objective:: Developing a system for sign language recognition becomes essential for deaf as well as a mute person. The recognition system acts as a translator between a disabled and an able person. This eliminates the hindrances in exchange of ideas. Most of the existing systems are very poorly designed with limited support for the needs of their day to day facilities. Methods:: The proposed system embedded with gesture recognition capability has been introduced here which extracts signs from a video sequence and displays them on screen. On the other hand, a speech to text as well as text to speech system is also introduced to further facilitate the grieved people. To get the best out of human computer relationship, the proposed solution consists of various cutting-edge technologies and Machine Learning based sign recognition models which have been trained by using Tensor Flow and Keras library. Result:: The proposed architecture works better than several gesture recognition techniques like background elimination and conversion to HSV because of sharply defined image provided to the model for classification. The results of testing indicate reliable recognition systems with high accuracy that includes most of the essential and necessary features for any deaf and dumb person in his/her day to day tasks. Conclusion:: It’s the need of current technological advances to develop reliable solutions which can be deployed to assist deaf and dumb people to adjust to normal life. Instead of focusing on a standalone technology, a plethora of them have been introduced in this proposed work. Proposed Sign Recognition System is based on feature extraction and classification. The trained model helps in identification of different gestures.

Download Full-text

SIBI Sign Language Recognition Using Convolutional Neural Network Combined with Transfer Learning and non-trainable Parameters

Procedia Computer Science ◽

10.1016/j.procs.2020.12.011 ◽

2021 ◽

Vol 179 ◽

pp. 72-80

Author(s):

Suharjito ◽

Narada Thiracitta ◽

Herman Gunawan

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Sign Language ◽

Transfer Learning ◽

Language Recognition ◽

Sign Language Recognition

Download Full-text

Thai Sign Language Recognition: an Application of Deep Neural Network

2021 Joint International Conference on Digital Arts, Media and Technology with ECTI Northern Section Conference on Electrical, Electronics, Computer and Telecommunication Engineering ◽

10.1109/ectidamtncon51128.2021.9425711 ◽

2021 ◽

Author(s):

Anusorn Chaikaew ◽

Kritsana Somkuan ◽

Thidalak Yuyen

Keyword(s):

Neural Network ◽

Sign Language ◽

Deep Neural Network ◽

Language Recognition ◽

Sign Language Recognition

Download Full-text

Sign language recognition with recurrent neural network using human keypoint detection

Proceedings of the 2018 Conference on Research in Adaptive and Convergent Systems - RACS '18 ◽

10.1145/3264746.3264805 ◽

2018 ◽

Cited By ~ 4

Author(s):

Sang-Ki Ko ◽

Jae Gi Son ◽

Hyedong Jung

Keyword(s):

Neural Network ◽

Sign Language ◽

Recurrent Neural Network ◽

Language Recognition ◽

Sign Language Recognition ◽

Keypoint Detection

Download Full-text

Devices analysis and artificial neural network parameters for sign language recognition

2017 CHILEAN Conference on Electrical, Electronics Engineering, Information and Communication Technologies (CHILECON) ◽

10.1109/chilecon.2017.8229734 ◽

2017 ◽

Author(s):

Brunna Silva ◽

Wesley Calixto ◽

Geovanne Furriel

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Sign Language ◽

Language Recognition ◽

Sign Language Recognition ◽

Network Parameters ◽

Artificial Neural

Download Full-text

Pose Invariant Hand Gesture Recognition using Two Stream Transfer Learning Architecture

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.f9058.109119 ◽

2019 ◽

Vol 9 (1) ◽

pp. 1771-1777

Keyword(s):

Machine Learning ◽

Transfer Learning ◽

Gesture Recognition ◽

Classification Accuracy ◽

Decision Fusion ◽

Hand Gesture Recognition ◽

Machine Learning Techniques ◽

Hand Gesture ◽

Language Recognition ◽

Sign Language Recognition

The hand gesture detection problem is one of the most prominent problems in machine learning and computer vision applications. Many machine learning techniques have been employed to solve the hand gesture recognition. These techniques find applications in sign language recognition, virtual reality, human machine interaction, autonomous vehicles, driver assistive systems etc. In this paper, the goal is to design a system to correctly identify hand gestures from a dataset of hundreds of hand gesture images. In order to incorporate this, decision fusion based system using the transfer learning architectures is proposed to achieve the said task. Two pretrained models namely ‘MobileNet’ and ‘Inception V3’ are used for this purpose. To find the region of interest (ROI) in the image, YOLO (You Only Look Once) architecture is used which also decides the type of model. Edge map images and the spatial images are trained using two separate versions of the MobileNet based transfer learning architecture and then the final probabilities are combined to decide upon the hand sign of the image. The simulation results using classification accuracy indicate the superiority of the approach of this paper against the already researched approaches using different quantitative techniques such as classification accuracy.

Download Full-text