An audio-video summarization scheme based on audio and video analysis

Author(s):  
M. Furini ◽  
V. Ghini

2012 ◽  
Vol 263-266 ◽  
pp. 2364-2368
Author(s):  
Dong Lin Ma ◽  
Xi Jun Zhang ◽  
Qian Mi

In this paper, a video summarization representation algorithm in the compressed domain is proposed. In particular, Rough Set (RS) theory is introduced into the video analysis to improve efficiency. Firstly, DCT coefficients and DC coefficients are extracted from the video image sequences, and an Information System is constructed from the DC coefficients. Then the Information System is reduced using the attribute reduction theory of RS, and the representation of each video frame is obtained from the reduced DC coefficients. Finally, we obtain the reduced Information System, i.e. the Core of the Information System. Since the Core contains all the information in the video sequence while discarding redundant video frames, it can be viewed as an effective summarization representation. Experimental results indicate that the algorithm can efficiently generate a set of frames representative of the video sequence and enjoys the following advantages: only a subset of video frames is considered during video analysis, which reduces the computational complexity, and the resulting video summarization representation is more principled than previous methods.
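The abstract's pipeline (extract per-frame DC coefficients, build an information system, reduce it to a core of non-redundant frames) can be sketched roughly as follows. This is an illustrative stand-in, not the authors' implementation: the greedy mean-absolute-difference test merely approximates rough-set reduction, and the `threshold` parameter is an assumption.

```python
import numpy as np

def summarize_frames(dc_coeffs, threshold=10.0):
    """Select a reduced 'core' of frames from per-frame DC coefficient vectors.

    dc_coeffs: array of shape (n_frames, n_blocks) holding the DC term of each
    DCT block, as extracted in the compressed domain. A frame is kept only if
    its DC vector differs from every frame already kept by more than
    `threshold` (mean absolute difference), which mimics discarding redundant
    rows from the information system.
    """
    core = []
    for i, frame in enumerate(dc_coeffs):
        if all(np.abs(frame - dc_coeffs[j]).mean() > threshold for j in core):
            core.append(i)
    return core

# Toy example: three near-identical frames followed by a scene change.
frames = np.array([[100.0, 100.0],
                   [101.0,  99.0],
                   [100.0, 101.0],
                   [ 10.0, 200.0]])
print(summarize_frames(frames, threshold=10.0))  # → [0, 3]
```

On this toy input, the two near-duplicates of frame 0 are banished as redundant, and only the first frame of each visually distinct segment survives into the core.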


2008 ◽  
Vol 179 (4S) ◽  
pp. 658-659
Author(s):  
Edan Y Shapiro ◽  
Sero Andonian ◽  
Casey A Seideman ◽  
Marcelo J Sette ◽  
Benjamin R Lee ◽  
...  

Author(s):  
Hrishikesh Bhaumik ◽  
Siddhartha Bhattacharyya ◽  
Susanta Chakraborty

Over the past decade, research in the field of Content-Based Video Retrieval Systems (CBVRS) has attracted much attention, as it encompasses the processing of all the other media types, i.e. text, image, and audio. Video summarization is one of its most important applications, as it potentially enables efficient and faster browsing of large video collections. A concise version of a video is often required due to constraints on viewing time, storage, communication bandwidth, and power. Thus, the task of video summarization is to effectively extract the most important portions of the video without sacrificing its semantic information. The results of video summarization can be used in many CBVRS applications such as semantic indexing, video surveillance, copied-video detection, etc. However, the quality of the summarization task depends on two basic aspects: content coverage and redundancy removal. These two aspects are both important and in tension with each other. This chapter aims to provide an insight into the state-of-the-art approaches used in this booming field of research.


2012 ◽  
Vol 490-495 ◽  
pp. 465-469
Author(s):  
Xiang Wei Li ◽  
Yu Xiu Kang ◽  
Gang Zheng

Based on Rough Set (RS) theory, a novel and effective video summarization representation is proposed for video analysis in the compressed domain. Firstly, DCT coefficients and DC coefficients are extracted from the original video image sequences, and an Information System is constructed from the DC coefficients. Then the Information System is reduced using the attribute reduction theory of RS, and the representation of each video frame is obtained from the reduced DC coefficients. Finally, the reduced Information System is obtained. Since the Core of the Information System contains all the major information in the video sequence while discarding redundant video frames, it can be considered an efficient summarization representation. Compared to conventional algorithms, this algorithm enjoys the following advantages: (1) only a subset of video frames is considered during video analysis, which reduces the computational complexity; (2) the resulting video summarization representation is more principled and efficient than previous methods; (3) by varying the number of reduced frames, the algorithm can extract a hierarchical, dynamic video summarization representation.
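Advantage (3), the hierarchical summarization, can be illustrated by re-running the reduction at several redundancy thresholds. This is again a hedged sketch under stated assumptions: the `core_frames` helper is a greedy stand-in for RS attribute reduction, and the threshold values are hypothetical, not taken from the paper.

```python
import numpy as np

def core_frames(dc_coeffs, threshold):
    """Greedy redundancy removal over per-frame DC vectors (RS-reduction stand-in)."""
    kept = []
    for i, frame in enumerate(dc_coeffs):
        if all(np.abs(frame - dc_coeffs[j]).mean() > threshold for j in kept):
            kept.append(i)
    return kept

def hierarchical_summary(dc_coeffs, thresholds=(5.0, 50.0)):
    """One summary per threshold: larger thresholds discard more frames,
    yielding progressively coarser levels of the hierarchy."""
    return {t: core_frames(dc_coeffs, t) for t in thresholds}

# Toy sequence: a slow drift (frames 0-1), then two scene changes.
frames = np.array([[0.0, 0.0], [3.0, 3.0], [30.0, 30.0], [100.0, 100.0]])
levels = hierarchical_summary(frames)
print(levels)  # fine level keeps [0, 2, 3]; coarse level keeps only [0, 3]
```

Sweeping the threshold thus maps directly onto the paper's idea of controlling the reduced frame number to obtain a multi-level dynamic summary.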


2021 ◽  
Vol 11 (11) ◽  
pp. 5260
Author(s):  
Theodoros Psallidas ◽  
Panagiotis Koromilas ◽  
Theodoros Giannakopoulos ◽  
Evaggelos Spyrou

The exponential growth of user-generated content has increased the need for efficient video summarization schemes. However, most approaches underestimate the power of aural features, and they are designed to work mainly on commercial/professional videos. In this work, we present an approach that uses both aural and visual features in order to create video summaries from user-generated videos. Our approach produces dynamic video summaries, that is, summaries comprising the most “important” parts of the original video, arranged so as to preserve their temporal order. We use supervised knowledge from both of the aforementioned modalities and train a binary classifier that learns to recognize the important parts of videos. Moreover, we present a novel user-generated dataset which contains videos from several categories. Every 1 s segment of each video in our dataset has been annotated by more than three annotators as being important or not. We evaluate our approach using several classification strategies based on audio, video, and fused features. Our experimental results illustrate the potential of our approach.
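A minimal sketch of the fused-feature pipeline described above: per-segment audio and visual features are concatenated (early fusion) and fed to a binary classifier. The feature shapes are assumed, and the nearest-centroid model is a hypothetical stand-in for whatever classifier the authors trained, chosen only to keep the example self-contained.

```python
import numpy as np

def fuse(audio_feats, visual_feats):
    """Early fusion: concatenate per-segment audio and visual feature vectors."""
    return np.concatenate([audio_feats, visual_feats], axis=1)

class CentroidClassifier:
    """Label each 1 s segment 'important' (1) or not (0) by the nearer class centroid."""
    def fit(self, X, y):
        self.c0 = X[y == 0].mean(axis=0)
        self.c1 = X[y == 1].mean(axis=0)
        return self

    def predict(self, X):
        d0 = np.linalg.norm(X - self.c0, axis=1)
        d1 = np.linalg.norm(X - self.c1, axis=1)
        return (d1 < d0).astype(int)

# Toy data: four 1 s segments, each with 2 audio + 2 visual features.
audio  = np.array([[0.1, 0.2], [0.9, 0.8], [0.2, 0.1], [0.8, 0.9]])
visual = np.array([[0.0, 0.1], [1.0, 0.9], [0.1, 0.0], [0.9, 1.0]])
labels = np.array([0, 1, 0, 1])  # e.g. majority vote of the annotators

X = fuse(audio, visual)
clf = CentroidClassifier().fit(X, labels)
print(clf.predict(X))  # → [0 1 0 1]
```

The dynamic summary would then be assembled by keeping the segments predicted as 1, in their original temporal order.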

