scholarly journals Head Pose Estimation through Keypoints Matching between Reconstructed 3D Face Model and 2D Image

Sensors ◽  
2021 ◽  
Vol 21 (5) ◽  
pp. 1841
Author(s):  
Leyuan Liu ◽  
Zeran Ke ◽  
Jiao Huo ◽  
Jingying Chen

Mainstream methods treat head pose estimation as a supervised classification/regression problem, whose performance heavily depends on the accuracy of ground-truth labels of training data. However, it is rather difficult to obtain accurate head pose labels in practice, due to the lack of effective equipment and reasonable approaches for head pose labeling. In this paper, we propose a method which does not need to be trained with head pose labels, but matches the keypoints between a reconstructed 3D face model and the 2D input image, for head pose estimation. The proposed head pose estimation method consists of two components: the 3D face reconstruction and the 3D–2D matching keypoints. At the 3D face reconstruction phase, a personalized 3D face model is reconstructed from the input head image using convolutional neural networks, which are jointly optimized by an asymmetric Euclidean loss and a keypoint loss. At the 3D–2D keypoints matching phase, an iterative optimization algorithm is proposed to match the keypoints between the reconstructed 3D face model and the 2D input image efficiently under the constraint of perspective transformation. The proposed method is extensively evaluated on five widely used head pose estimation datasets, including Pointing’04, BIWI, AFLW2000, Multi-PIE, and Pandora. The experimental results demonstrate that the proposed method achieves excellent cross-dataset performance and surpasses most of the existing state-of-the-art approaches, with average MAEs of 4.78∘ on Pointing’04, 6.83∘ on BIWI, 7.05∘ on AFLW2000, 5.47∘ on Multi-PIE, and 5.06∘ on Pandora, although the model of the proposed method is not trained on any of these five datasets.

2014 ◽  
Vol 519-520 ◽  
pp. 693-696
Author(s):  
Jian Ming Liu ◽  
Ji Guo Zeng

Estimating the head pose is still a unique challenge for computer vision system. Previous methods at solving this problem have often proposed solutions formulated in a classification setting. In this paper, we formulate pose estimation as a regression problem to achieve robustness. We propose to use gradient orientation histograms based random regression forests for the task. Firstly, each sample image is divided into overlapped patches, and direction-sensitive features of patches are extracted. Then we train a random regression forest on these patches. Experiments are carried out on public available database, and the result shows that the proposed algorithm outperforms some other approaches in both accuracy and computational efficiency.


2009 ◽  
Vol 94 (2) ◽  
pp. 179-195 ◽  
Author(s):  
Xun Gong ◽  
Guoyin Wang ◽  
Lili Xiong

Author(s):  
Ahmet Firintepe ◽  
Mohamed Selim ◽  
Alain Pagani ◽  
Didier Stricker

Sign in / Sign up

Export Citation Format

Share Document