Detection of Frames With Single Complete Signs of Finger Alphabet in Video Sequence Using Visual Attention Approach

Júlia Kučerová; Jaroslav Polec

doi:10.2478/jee-2015-0022

Detection of Frames With Single Complete Signs of Finger Alphabet in Video Sequence Using Visual Attention Approach

Journal of Electrical Engineering ◽

10.2478/jee-2015-0022 ◽

2015 ◽

Vol 66 (3) ◽

pp. 142-148

Author(s):

Júlia Kučerová ◽

Jaroslav Polec

Keyword(s):

Visual Attention ◽

Visual Information ◽

Video Quality ◽

Human Communication ◽

Video Frames ◽

Language Communication ◽

Salient Regions ◽

Video Quality Metrics ◽

Key Frame Selection ◽

Local Extreme

Abstract Visual information is very important in human communication. It is used in any type of sign language communication, and in non-verbal communication of the entire population, as well. Therefore, visual information is crucial for communication of hearing impaired people. Video is the most common way to capture this type of information and it is very important to correctly process it. In this paper we propose a method for finding video frames representing single sign in the finger alphabet. The single sign is identified using standard video quality metrics. The calculations of the metrics are performed only within a region, which is determined by combination of object tracking and salient regions detection method based on human visual attention. For key frame selection, combination of sliding system for finding local extreme and adaptive threshold based on local averaging and variation is used. Proposed method is effective and achieves significantly better results in comparison with other commonly used methods.

Download Full-text

Incorporating visual attention models into video quality metrics

10.1117/12.2039780 ◽

2014 ◽

Cited By ~ 2

Author(s):

Welington Y. L. Akamine ◽

Mylène C. Q. Farias

Keyword(s):

Visual Attention ◽

Video Quality ◽

Quality Metrics ◽

Video Quality Metrics

Download Full-text

Investigating the Multimodal Nature of Human Communication

Journal of Psychophysiology ◽

10.1027/0269-8803.23.2.63 ◽

2009 ◽

Vol 23 (2) ◽

pp. 63-76 ◽

Cited By ~ 41

Author(s):

Silke Paulmann ◽

Sarah Jessen ◽

Sonja A. Kotz

Keyword(s):

Visual Information ◽

Empirical Studies ◽

Physical Property ◽

Event Related Potentials ◽

Accurate Information ◽

Human Communication ◽

Channel Condition ◽

Prosodic Cues ◽

Early Processing ◽

Related Potentials

The multimodal nature of human communication has been well established. Yet few empirical studies have systematically examined the widely held belief that this form of perception is facilitated in comparison to unimodal or bimodal perception. In the current experiment we first explored the processing of unimodally presented facial expressions. Furthermore, auditory (prosodic and/or lexical-semantic) information was presented together with the visual information to investigate the processing of bimodal (facial and prosodic cues) and multimodal (facial, lexic, and prosodic cues) human communication. Participants engaged in an identity identification task, while event-related potentials (ERPs) were being recorded to examine early processing mechanisms as reflected in the P200 and N300 component. While the former component has repeatedly been linked to physical property stimulus processing, the latter has been linked to more evaluative “meaning-related” processing. A direct relationship between P200 and N300 amplitude and the number of information channels present was found. The multimodal-channel condition elicited the smallest amplitude in the P200 and N300 components, followed by an increased amplitude in each component for the bimodal-channel condition. The largest amplitude was observed for the unimodal condition. These data suggest that multimodal information induces clear facilitation in comparison to unimodal or bimodal information. The advantage of multimodal perception as reflected in the P200 and N300 components may thus reflect one of the mechanisms allowing for fast and accurate information processing in human communication.

Download Full-text

Combining Video Quality Metrics To Select Perceptually Accurate Resolution In A Wide Quality Range: A Case Study

10.1109/icip42928.2021.9506310 ◽

2021 ◽

Author(s):

Madhukar Bhat ◽

Jean-Marc Thiesse ◽

Patrick Le Callet

Keyword(s):

Video Quality ◽

Quality Metrics ◽

Video Quality Metrics

Download Full-text

Stereoscopic video quality assessment based on visual attention and just-noticeable difference models

Signal Image and Video Processing ◽

10.1007/s11760-015-0802-4 ◽

2015 ◽

Vol 10 (4) ◽

pp. 737-744 ◽

Cited By ~ 23

Author(s):

Feng Qi ◽

Debin Zhao ◽

Xiaopeng Fan ◽

Tingting Jiang

Keyword(s):

Visual Attention ◽

Quality Assessment ◽

Video Quality ◽

Just Noticeable Difference ◽

Video Quality Assessment ◽

Stereoscopic Video

Download Full-text

Procedure for mapping of objective video quality metrics to subjective MOS scale of VP 9 compression standard for Full HD resolution

2016 International Conference on Information and Digital Technologies (IDT) ◽

10.1109/dt.2016.7557188 ◽

2016 ◽

Author(s):

Miroslav Uhrina ◽

Juraj Bienik ◽

Martin Vaculik

Keyword(s):

Video Quality ◽

Quality Metrics ◽

Video Quality Metrics

Download Full-text

Comparison of H.264/AVC, H.264 with AIF, and AVS based on different video quality metrics

2009 International Conference on Telecommunications ◽

10.1109/ictel.2009.5158642 ◽

2009 ◽

Cited By ~ 5

Author(s):

S. Pasqualini ◽

F. Fioretti ◽

A. Andreoli ◽

P. Pierleoni

Keyword(s):

Video Quality ◽

Quality Metrics ◽

Video Quality Metrics

Download Full-text

Developmental changes in infants’ and children’s attention to faces and salient regions vary across and within video stimuli

10.31234/osf.io/mecz4 ◽

2019 ◽

Author(s):

John Michael Franchak ◽

Kellan Kadooka

Keyword(s):

Visual Attention ◽

Alternative Explanation ◽

Time Windows ◽

Dynamic Scenes ◽

Video Clips ◽

Age Related ◽

Global Age ◽

Salient Regions ◽

Socially Relevant ◽

Shift Hypothesis

Visual attention in complex, dynamic scenes is attracted to locations that contain socially-relevant features, such as faces, and to areas that are visually salient. Previous work suggests that there is a global shift over development such that observers increasingly attend to faces with age. However, no prior work has tested whether this shift is truly global, that is, consistent across and within stimuli despite variations in content. To test the global shift hypothesis, we recorded eye movements of 89 children (6 months to 10 years) and adults while they viewed seven video clips. We measured the extent to which each participant attended to faces and to salient areas for each video. There was no evidence of global age-related changes in attention: Neither feature showed consistent increases or decreases with age. Moreover, windowed analyses within each stimulus video revealed moment-to-moment variations in the relation between age and each visual feature. For some time windows, adults looked more often at both feature types compared to infants and children. However, for other time windows the pattern was reversed—younger participants looked more at faces and salient locations. Lack of consistent directional effects provides strong evidence against the global shift hypothesis. We suggest an alternative explanation: Over development, observers increasingly prioritize when and where to look by learning to track which features are relevant within a scene. Implications for the development of visual attention and children’s understanding of screen-based media are discussed.

Download Full-text

No Reference and Reduced Reference Video Quality Metrics for End to End QoS Monitoring

IEICE Transactions on Communications ◽

10.1093/ietcom/e89-b.2.289 ◽

2006 ◽

Vol E89-B (2) ◽

pp. 289-296 ◽

Cited By ~ 14

Author(s):

P. LE CALLET

Keyword(s):

Video Quality ◽

Quality Metrics ◽

Video Quality Metrics ◽

Qos Monitoring ◽

End To End

Download Full-text

Video steganography based on DCT psychovisual and object motion

Bulletin of Electrical Engineering and Informatics ◽

10.11591/eei.v9i3.1859 ◽

2020 ◽

Vol 9 (3) ◽

pp. 1015-1023 ◽

Cited By ~ 1

Author(s):

Muhammad Fuad ◽

Ferda Ernawan

Keyword(s):

Video Quality ◽

Object Motion ◽

Multimedia Data ◽

Secret Message ◽

Hiding Capacity ◽

Video Steganography ◽

Limited Bandwidth ◽

Video Frames ◽

Dct Coefficients ◽

Video Reconstruction

Steganography is a technique of concealing the message in multimedia data. Multimedia data, such as videos are often compressed to reduce the storage for limited bandwidth. The video provides additional hidden-space in the object motion of image sequences. This research proposes a video steganography scheme based on object motion and DCT-psychovisual for concealing the message. The proposed hiding technique embeds a secret message along the object motion of the video frames. Motion analysis is used to determine the embedding regions. The proposed scheme selects six DCT coefficients in the middle frequency using DCT-psychovisual effects of hiding messages. A message is embedded by modifying middle DCT coefficients using the proposed algorithm. The middle frequencies have a large hiding capacity and it relatively does not give significant effect to the video reconstruction. The performance of the proposed video steganography is evaluated in terms of video quality and robustness against MPEG compression. The experimental results produce minimum distortion of the video quality. Our scheme produces a robust of hiding messages against MPEG-4 compression with average NC value of 0.94. The proposed video steganography achieves less perceptual distortion to human eyes and it's resistant against reducing video storage.

Download Full-text

A performance analysis of objective video quality metrics for digital video watermarking

2010 3rd International Conference on Computer Science and Information Technology ◽

10.1109/iccsit.2010.5564962 ◽

2010 ◽

Cited By ~ 9

Author(s):

Manish K Thakur ◽

Vikas Saxena ◽

J. P. Gupta

Keyword(s):

Performance Analysis ◽

Digital Video ◽

Video Quality ◽

Quality Metrics ◽

Video Watermarking ◽

Digital Video Watermarking ◽

Video Quality Metrics ◽

A Performance

Download Full-text