scholarly journals A New Gesture Recognition System Using Weighted Dynamic Time Warping and Symbolic Aggregation Approximation Methods on Skeleton Data

2017 ◽  
Vol 17 (1) ◽  
pp. 117-123
Author(s):  
Rafet DURGUT ◽  
İsmail KURNAZ
2018 ◽  
Vol 2018 ◽  
pp. 1-9 ◽  
Author(s):  
Hyo-Rim Choi ◽  
TaeYong Kim

We propose a modified dynamic time warping (DTW) algorithm that compares gesture-position sequences based on the direction of the gestural movement. Standard DTW does not specifically consider the two-dimensional characteristic of the user’s movement. Therefore, in gesture recognition, the sequence comparison by standard DTW needs to be improved. The proposed gesture-recognition system compares the sequences of the input gesture’s position with gesture positions saved in the database and selects the most similar gesture by filtering out unrelated gestures. The suggested algorithm uses the cosine similarity of the movement direction at each moment to calculate the difference and reflects the characteristics of the gesture movement by using the ratio of the Euclidean distance and the proportional distance to the calculated difference. Selective spline interpolation assists in solving the issue of recognition-decline at instances of gestures. Through experiments with public databases (MSRC-12 and G3D), the suggested algorithm revealed an improved performance on both databases compared to other methods.


2019 ◽  
Vol 9 (3) ◽  
pp. 528 ◽  
Author(s):  
Fahn Chin-Shyurng ◽  
Shih-En Lee ◽  
Meng-Luen Wu

Gesture recognition is a human−computer interaction method, which is widely used for educational, medical, and entertainment purposes. Humans also use gestures to communicate with each other, and musical conducting uses gestures in this way. In musical conducting, conductors wave their hands to control the speed and strength of the music played. However, beginners may have a limited comprehension of the gestures and might not be able to properly follow the ensembles. Therefore, this paper proposes a real-time musical conducting gesture recognition system to help music players improve their performance. We used a single-depth camera to capture image inputs and establish a real-time dynamic gesture recognition system. The Kinect software development kit created a skeleton model by capturing the palm position. Different palm gestures were collected to develop training templates for musical conducting. The dynamic time warping algorithm was applied to recognize the different conducting gestures at various conducting speeds, thereby achieving real-time dynamic musical conducting gesture recognition. In the experiment, we used 5600 examples of three basic types of musical conducting gestures, including seven capturing angles and five performing speeds for evaluation. The experimental result showed that the average accuracy was 89.17% in 30 frames per second.


Author(s):  
Santosh KC ◽  
Cholwich Nattee

Handwriting Recognition Technology has been improving much under the purview of pattern recognition and image processing since a few decades. This paper focuses on the comprehensive survey on on-line handwriting recognition system along with the real application by taking Nepali natural handwriting (a real example of one of the cursive handwritings). The survey mainly includes pre-processing, feature vector and similarity measures in between the non-linear 2D sequences of coordinates, and their effective applications. A very highlighting topic "Dynamic Time Warping Algorithm'' (DTW) is introduced, which has been popular in determining the distance between two non-linear sequences ranging from handwriting to speech recognition. Besides these contemporary research issues/areas, stroke number and order free Nepalese natural handwritten recognition system is presented in the second step. Writing one's own style brings unevenness in writing units, which is the most difficult part to classify. Writing units reveal number, shape, size, order of stroke, and speed in writing. Variation in the number of strokes, their order, shapes and sizes, tilting angles and similarities among characters from one another are the important factors, which are to be considered in classification for Nepali. This paper utilizes structural properties of those alphanumeric characters, which have variable writing units. It uses a string of pen tip's positions and tangent angles of every consecutive point as a feature vector sequence of a stroke. We constructed a prototype recognizer that uses the DTW algorithm to align handwritten strokes with stored strokes' templates and determine their similarity. Separate system is trained for original and preprocessed writing samples and achieved recognition rates of 85.87% and 88.59% respectively. This introduces novel real time handwriting recognition on Nepalese alphanumeric characters, which are independent of number of strokes, as well as their order. Key Words: Handwriting Recognition System; Pre-processing; Feature Vector; Dynamic Time Warping; Agglomerating Hierarchical Clustering; Nepali. DOI: 10.3126/kuset.v5i1.2845 Kathmandu University Journal of Science, Engineering and Technology Vol.5, No.1, January 2009, pp 31-55


AVITEC ◽  
2019 ◽  
Vol 1 (1) ◽  
Author(s):  
Noor Fita Indri Prayoga

Voice is one of  way to communicate and express yourself. Speaker recognition is a process carried out by a device to recognize the speaker through the voice. This study designed a speaker recognition system that was able to identify speakers based on what was said by using dynamic time warping (DTW) method based in matlab. To design a speaker recognition system begins with the process of reference data and test data. Both processes have the same process, which starts with sound recording, preprocessing, and feature extraction. In this system, the Fast Fourier Transform (FFT) method is used to extract the features. The results of the feature extraction process from the two data will be compared using the DTW method. Calculations using DTW that produce the smallest value will be determined as the output. The test results show that the system can identify the voice with the best level of recognition accuracy of 90%, and the average recognition accuracy of 80%. The results were obtained from 50 tests, carried out by 5 people consisting of 3 men and 2 women, each speaker said a predetermined word


Sensors ◽  
2019 ◽  
Vol 19 (5) ◽  
pp. 1007 ◽  
Author(s):  
James Rwigema ◽  
Hyo-Rim Choi ◽  
TaeYong Kim

In this research, we present a differential evolution approach to optimize the weights of dynamic time warping for multi-sensory based gesture recognition. Mainly, we aimed to develop a robust gesture recognition method that can be used in various environments. Both a wearable inertial sensor and a depth camera (Kinect Sensor) were used as heterogeneous sensors to verify and collect the data. The proposed approach was used for the calculation of optimal weight values and different characteristic features of heterogeneous sensor data, while having different effects during gesture recognition. In this research, we studied 27 different actions to analyze the data. As finding the optimal value of the data from numerous sensors became more complex, a differential evolution approach was used during the fusion and optimization of the data. To verify the performance accuracy of the presented method in this study, a University of Texas at Dallas Multimodal Human Action Datasets (UTD-MHAD) from previous research was used. However, the average recognition rates presented by previous research using respective methods were still low, due to the complexity in the calculation of the optimal values of the acquired data from sensors, as well as the installation environment. Our contribution was based on a method that enabled us to adjust the number of depth cameras and combine this data with inertial sensors (multi-sensors in this study). We applied a differential evolution approach to calculate the optimal values of the added weights. The proposed method achieved an accuracy 10% higher than the previous research results using the same database, indicating a much improved accuracy rate of motion recognition.


Sign in / Sign up

Export Citation Format

Share Document