Feature Level Fusion for Bimodal Facial Action Unit Recognition

Author(s):  
Zibo Meng ◽  
Shizhong Han ◽  
Min Chen ◽  
Yan Tong

Recognizing facial actions is challenging, especially when they are accompanied by speech. Instead of employing information solely from the visual channel, this work aims to exploit information from both the visual and audio channels in recognizing speech-related facial action units (AUs). Two feature-level fusion methods are proposed. The first is based on handcrafted visual features; the second uses visual features learned by a deep convolutional neural network (CNN). In both methods, features are extracted independently from the visual and audio channels and then aligned to handle the difference in time scales and the time shift between the two signals. The temporally aligned features are integrated via feature-level fusion for AU recognition. Experimental results on a new audiovisual AU-coded dataset demonstrate that both fusion methods outperform their visual-only counterparts in recognizing speech-related AUs. The improvement is more pronounced when the facial images are occluded, since occlusions do not affect the audio channel.
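
For illustration, the sketch below shows what feature-level (early) fusion with temporal alignment might look like in practice. It is a minimal sketch, not the authors' implementation: the linear-interpolation alignment, the feature dimensions, and the LinearSVC classifier are all assumptions introduced here.

```python
# Minimal sketch of feature-level (early) fusion for audiovisual AU
# recognition. Alignment method, feature extractors, and classifier
# are illustrative assumptions, not the paper's exact pipeline.
import numpy as np
from sklearn.svm import LinearSVC

def align_audio_to_video(audio_feats, audio_times, video_times):
    """Resample audio features onto the video frame timestamps by
    linear interpolation, handling the two signals' different rates
    (and any constant time shift folded into the timestamps)."""
    aligned = np.empty((len(video_times), audio_feats.shape[1]))
    for j in range(audio_feats.shape[1]):
        aligned[:, j] = np.interp(video_times, audio_times, audio_feats[:, j])
    return aligned

def fuse(video_feats, audio_feats, audio_times, video_times):
    """Concatenate per-frame visual features with temporally
    aligned audio features (feature-level fusion)."""
    audio_aligned = align_audio_to_video(audio_feats, audio_times, video_times)
    return np.hstack([video_feats, audio_aligned])

def train_au_classifier(video_feats, audio_feats, audio_times,
                        video_times, labels):
    """Hypothetical usage: one binary classifier per speech-related AU.
    video_feats: (n_frames, d_v) CNN or handcrafted descriptors
    audio_feats: (n_windows, d_a) e.g. spectral features at a higher rate
    labels: (n_frames,) 0/1 activation of a single AU"""
    X = fuse(video_feats, audio_feats, audio_times, video_times)
    clf = LinearSVC()  # placeholder classifier
    clf.fit(X, labels)
    return clf
```

In this setup, occluding the face degrades only the visual columns of the fused vector, which is consistent with the abstract's observation that the audio channel is unaffected by facial occlusions.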


Sensors ◽  
2021 ◽  
Vol 21 (12) ◽  
pp. 4222
Author(s):  
Shushi Namba ◽  
Wataru Sato ◽  
Masaki Osumi ◽  
Koh Shimokawa

In the field of affective computing, achieving accurate automatic detection of facial movements is an important issue, and great progress has already been made. However, a systematic evaluation of the systems now available against a dynamic facial database remains an unmet need. This study compared the performance of three systems (FaceReader, OpenFace, AFARtoolbox) that detect facial movements corresponding to action units (AUs) derived from the Facial Action Coding System. All three systems detected the presence of AUs in the dynamic facial database at above-chance levels. Moreover, OpenFace and AFAR yielded higher area under the receiver operating characteristic curve (AUC) values than FaceReader. In addition, several confusion biases between facial components (e.g., AU12 and AU14) were observed for each automated AU detection system, and the static mode was superior to the dynamic mode for analyzing the posed facial database. These findings characterize the prediction patterns of each system and provide guidance for research on facial expressions.
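
As a concrete illustration of the comparison described above, the sketch below computes per-AU AUC values for several detection systems. It is a minimal sketch under stated assumptions: the dictionary layout, the random placeholder scores, and the usage data are hypothetical, and scikit-learn's roc_auc_score stands in for whatever AUC computation the study used.

```python
# Minimal sketch: comparing per-AU detectors by area under the ROC
# curve (AUC). Data structures and scores are hypothetical placeholders.
import numpy as np
from sklearn.metrics import roc_auc_score

def compare_systems(y_true, system_scores):
    """y_true: dict AU -> (n,) binary ground-truth labels.
    system_scores: dict system -> dict AU -> (n,) continuous scores.
    Returns dict system -> dict AU -> AUC."""
    return {
        system: {au: roc_auc_score(y_true[au], scores)
                 for au, scores in per_au.items()}
        for system, per_au in system_scores.items()
    }

# Hypothetical usage with random scores for two AUs:
rng = np.random.default_rng(0)
truth = {"AU12": rng.integers(0, 2, 100), "AU14": rng.integers(0, 2, 100)}
scores = {"OpenFace": {au: rng.random(100) for au in truth},
          "FaceReader": {au: rng.random(100) for au in truth}}
print(compare_systems(truth, scores))
```

With real detector outputs in place of the random scores, a per-AU table of AUC values like this is what would support the paper's system ranking, and inspecting off-diagonal errors between AUs (e.g., AU12 vs. AU14) would surface the confusion biases it reports.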

