Grasping Hand Pose Estimation from RGB Images Using Digital Human Model by Convolutional Neural Network

Proceedings of 3DBODY.TECH 2018 - 9th International Conference and Exhibition on 3D Body Scanning and Processing Technologies, Lugano, Switzerland, 16-17 Oct. 2018 ◽

10.15221/18.154 ◽

2018 ◽

Cited By ~ 1

Author(s):

Kentaro INO ◽

Naoto IENAGA ◽

Yuta SUGIURA ◽

Hideo SAITO ◽

Natsuki MIYATA ◽

...

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Pose Estimation ◽

Human Model ◽

Hand Pose Estimation ◽

Digital Human Model ◽

Rgb Images ◽

Digital Human ◽

Hand Pose

Download Full-text

Hand Pose Estimation from RGB Images Based on Deep Learning: A Survey

2021 IEEE 7th International Conference on Virtual Reality (ICVR) ◽

10.1109/icvr51878.2021.9483815 ◽

2021 ◽

Author(s):

Yang Liu ◽

Jie Jiang ◽

Jiahao Sun

Keyword(s):

Deep Learning ◽

Pose Estimation ◽

Hand Pose Estimation ◽

Rgb Images ◽

Hand Pose

Download Full-text

Real-Time Energy Efficient Hand Pose Estimation: A Case Study

Sensors ◽

10.3390/s20102828 ◽

2020 ◽

Vol 20 (10) ◽

pp. 2828

Author(s):

Mhd Rashed Al Koutayni ◽

Vladimir Rybalkin ◽

Jameel Malik ◽

Ahmed Elhayek ◽

Christian Weis ◽

...

Keyword(s):

Neural Network ◽

Real Time ◽

Pose Estimation ◽

Energy Efficient ◽

Graphics Processing Units ◽

Estimation Algorithm ◽

High Energy ◽

Estimation Methods ◽

Hand Pose Estimation ◽

Hand Pose

The estimation of human hand pose has become the basis for many vital applications where the user depends mainly on the hand pose as a system input. Virtual reality (VR) headset, shadow dexterous hand and in-air signature verification are a few examples of applications that require to track the hand movements in real-time. The state-of-the-art 3D hand pose estimation methods are based on the Convolutional Neural Network (CNN). These methods are implemented on Graphics Processing Units (GPUs) mainly due to their extensive computational requirements. However, GPUs are not suitable for the practical application scenarios, where the low power consumption is crucial. Furthermore, the difficulty of embedding a bulky GPU into a small device prevents the portability of such applications on mobile devices. The goal of this work is to provide an energy efficient solution for an existing depth camera based hand pose estimation algorithm. First, we compress the deep neural network model by applying the dynamic quantization techniques on different layers to achieve maximum compression without compromising accuracy. Afterwards, we design a custom hardware architecture. For our device we selected the FPGA as a target platform because FPGAs provide high energy efficiency and can be integrated in portable devices. Our solution implemented on Xilinx UltraScale+ MPSoC FPGA is 4.2× faster and 577.3× more energy efficient than the original implementation of the hand pose estimation algorithm on NVIDIA GeForce GTX 1070.

Download Full-text

Weakly-Supervised 3D Hand Pose Estimation from Monocular RGB Images

Computer Vision – ECCV 2018 - Lecture Notes in Computer Science ◽

10.1007/978-3-030-01231-1_41 ◽

2018 ◽

pp. 678-694 ◽

Cited By ~ 47

Author(s):

Yujun Cai ◽

Liuhao Ge ◽

Jianfei Cai ◽

Junsong Yuan

Keyword(s):

Pose Estimation ◽

Hand Pose Estimation ◽

Rgb Images ◽

Weakly Supervised ◽

Hand Pose

Download Full-text

AN ACTION-TUNED NEURAL NETWORK ARCHITECTURE FOR HAND POSE ESTIMATION

Proceedings of the International Conference on Fuzzy Computation and 2nd International Conference on Neural Computation ◽

10.5220/0003086403580363 ◽

2010 ◽

Keyword(s):

Neural Network ◽

Pose Estimation ◽

Network Architecture ◽

Neural Network Architecture ◽

Hand Pose Estimation ◽

Hand Pose

Download Full-text

CrossFuNet: RGB and Depth Cross-Fusion Network for Hand Pose Estimation

Sensors ◽

10.3390/s21186095 ◽

2021 ◽

Vol 21 (18) ◽

pp. 6095

Author(s):

Xiaojing Sun ◽

Bin Wang ◽

Longxiang Huang ◽

Qian Zhang ◽

Sulei Zhu ◽

...

Keyword(s):

Pose Estimation ◽

Depth Map ◽

Depth Information ◽

Feature Maps ◽

Hand Pose Estimation ◽

Depth Sensors ◽

Key Points ◽

Rgb Images ◽

Public Datasets ◽

Hand Pose

Despite recent successes in hand pose estimation from RGB images or depth maps, inherent challenges remain. RGB-based methods suffer from heavy self-occlusions and depth ambiguity. Depth sensors rely heavily on distance and can only be used indoors, thus there are many limitations to the practical application of depth-based methods. The aforementioned challenges have inspired us to combine the two modalities to offset the shortcomings of the other. In this paper, we propose a novel RGB and depth information fusion network to improve the accuracy of 3D hand pose estimation, which is called CrossFuNet. Specifically, the RGB image and the paired depth map are input into two different subnetworks, respectively. The feature maps are fused in the fusion module in which we propose a completely new approach to combine the information from the two modalities. Then, the common method is used to regress the 3D key-points by heatmaps. We validate our model on two public datasets and the results reveal that our model outperforms the state-of-the-art methods.

Download Full-text

InterNet+: A Light Network for Hand Pose Estimation

Sensors ◽

10.3390/s21206747 ◽

2021 ◽

Vol 21 (20) ◽

pp. 6747

Author(s):

Yang Liu ◽

Jie Jiang ◽

Jiahao Sun ◽

Xianghan Wang

Keyword(s):

Performance Improvement ◽

Pose Estimation ◽

Network Architecture ◽

Activation Function ◽

Depth Information ◽

Global Features ◽

Hand Pose Estimation ◽

Feature Extractor ◽

Rgb Images ◽

Hand Pose

Hand pose estimation from RGB images has always been a difficult task, owing to the incompleteness of the depth information. Moon et al. improved the accuracy of hand pose estimation by using a new network, InterNet, through their unique design. Still, the network still has potential for improvement. Based on the architecture of MobileNet v3 and MoGA, we redesigned a feature extractor that introduced the latest achievements in the field of computer vision, such as the ACON activation function and the new attention mechanism module, etc. Using these modules effectively with our network, architecture can better extract global features from an RGB image of the hand, leading to a greater performance improvement compared to InterNet and other similar networks.

Download Full-text

3A1-V03 Hand pose estimation using orientation histograms(Digital Human)

The Proceedings of JSME annual Conference on Robotics and Mechatronics (Robomec) ◽

10.1299/jsmermd.2014._3a1-v03_1 ◽

2014 ◽

Vol 2014 (0) ◽

pp. _3A1-V03_1-_3A1-V03_2

Author(s):

Hayato MURAI ◽

Mitsunori TADA ◽

Gakuto MASUYAMA ◽

Kazunori UMEDA

Keyword(s):

Pose Estimation ◽

Hand Pose Estimation ◽

Digital Human ◽

Hand Pose

Download Full-text

A 4.45 ms Low-Latency 3D Point-Cloud-Based Neural Network Processor for Hand Pose Estimation in Immersive Wearable Devices

2020 IEEE Symposium on VLSI Circuits ◽

10.1109/vlsicircuits18222.2020.9162895 ◽

2020 ◽

Cited By ~ 1

Author(s):

Dongseok Im ◽

Sanghoon Kang ◽

Donghyeon Han ◽

Sungpill Choi ◽

Hoi-Jun Yoo

Keyword(s):

Neural Network ◽

Pose Estimation ◽

Point Cloud ◽

Wearable Devices ◽

Network Processor ◽

Low Latency ◽

3D Point Cloud ◽

Hand Pose Estimation ◽

Hand Pose

Download Full-text

Multi-Sensor Motion Fusion Using Deep Neural Network Learning

International Journal of Multimedia Data Engineering and Management ◽

10.4018/ijmdem.2017100101 ◽

2017 ◽

Vol 8 (4) ◽

pp. 1-18 ◽

Cited By ~ 1

Author(s):

Xinyao Sun ◽

Anup Basu ◽

Irene Cheng

Keyword(s):

Neural Network ◽

Pose Estimation ◽

Rapid Development ◽

Neural Network Learning ◽

Input Devices ◽

Hand Pose Estimation ◽

Temporal Features ◽

Deep Recurrent Neural Network ◽

Real Time Application ◽

Hand Pose

Hand pose estimation for a continuous sequence has been an important topic not only in computer vision but also human-computer-interaction. Exploring the feasibility to use hand gestures to replace input devices, e.g., mouse, keyboard, joy-stick and touch screen, has attracted increasing attention from academic and industrial researchers. The fast advancement of hand pose estimation techniques is complemented by the rapid development of smart sensors technology such as Kinect and Leap. We introduce a hand pose estimation multi-sensor system. Two tracking models are proposed based on Deep (Recurrent) Neural Network (DRNN) architecture. Data captured from different sensors are analyzed and fused to produce an optimal hand pose sequence. Experimental results show that our models outperform previous methods with better accuracy, meeting real-time application requirement. Performance comparisons between DNN and DRNN, spatial and spatial-temporal features, and single- and dual- sensors, are also presented.

Download Full-text

Hierarchical neural network for hand pose estimation

Signal Processing Image Communication ◽

10.1016/j.image.2020.115909 ◽

2020 ◽

Vol 87 ◽

pp. 115909

Author(s):

Zheng Chen ◽

Kuo Du ◽

Yi Sun ◽

Xiangbo Lin ◽

Xiaohong Ma

Keyword(s):

Neural Network ◽

Pose Estimation ◽

Hand Pose Estimation ◽

Hierarchical Neural Network ◽

Hand Pose

Download Full-text