Large-Scale Video Retrieval via Deep Local Convolutional Features

2020, Vol 2020, pp. 1-8
Author(s): Chen Zhang, Bin Hu, Yucong Suo, Zhiqiang Zou, Yimu Ji

In this paper, we study the challenge of image-to-video retrieval, which uses a query image to search for relevant frames in a large collection of videos. A novel framework based on convolutional neural networks (CNNs) is proposed to perform large-scale video retrieval with low storage cost and high search efficiency. Our framework consists of a key-frame extraction algorithm and a feature aggregation strategy. Specifically, the key-frame extraction algorithm uses clustering to remove redundant information from the video data, greatly reducing storage cost. The feature aggregation strategy encodes deep local convolutional features by average pooling, followed by coarse-to-fine retrieval, which allows rapid search in a large-scale video database. Results from extensive experiments on two publicly available datasets demonstrate that the proposed method achieves superior efficiency as well as accuracy over other state-of-the-art visual search methods.
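
As a rough illustration of the pipeline described above (not the authors' code), the sketch below average-pools per-frame CNN feature maps into global descriptors and then clusters those descriptors, keeping the frame nearest each centroid as a key frame. Array shapes, parameter values, and function names are assumptions for illustration.

```python
# Minimal sketch: average-pool deep local convolutional features into one descriptor
# per frame, then cluster frames and keep the frame closest to each cluster centroid.
import numpy as np
from sklearn.cluster import KMeans

def aggregate_local_features(feature_map):
    """Average-pool a CNN feature map of shape (H, W, C) into a C-dim descriptor."""
    desc = feature_map.mean(axis=(0, 1))
    return desc / (np.linalg.norm(desc) + 1e-12)  # L2-normalise for cosine search

def extract_key_frames(frame_descriptors, n_key_frames=10):
    """Cluster per-frame descriptors (an (N, C) array) and keep one frame per cluster."""
    km = KMeans(n_clusters=n_key_frames, n_init=10, random_state=0)
    labels = km.fit_predict(frame_descriptors)
    key_frame_ids = []
    for c in range(n_key_frames):
        members = np.where(labels == c)[0]
        dists = np.linalg.norm(frame_descriptors[members] - km.cluster_centers_[c], axis=1)
        key_frame_ids.append(int(members[np.argmin(dists)]))
    return sorted(key_frame_ids)
```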

Complexity, 2021, Vol 2021, pp. 1-11
Author(s): Zhi-guang Jiang, Xiao-tian Shi

The intelligent transportation system under the big data environment is the development direction of the future transportation system. It integrates advanced information technology, data communication and transmission technology, electronic sensing technology, control technology, and computer technology, and applies them across the entire ground transportation management system to establish a real-time, accurate, and efficient comprehensive transportation management system that works on a large scale and in all directions. Intelligent video analysis is an important part of smart transportation. To improve the accuracy and time efficiency of video retrieval and recognition schemes, this article first proposes a segmentation and key-frame extraction method for video behavior recognition, which uses a multi-time-scale dual-stream network to extract video features and thereby improves the accuracy and efficiency of video behavior detection. On this basis, an improved vehicle detection algorithm based on Faster R-CNN is proposed: the feature extraction layer of the Faster R-CNN network is improved using the residual network principle, and a dilated (atrous) convolution is added to filter out redundant features of high-resolution video images, mitigating the missed-detection problem of the original algorithm. The experimental results show that the key-frame extraction technology combined with the optimized Faster R-CNN model greatly improves detection accuracy and reduces the missed-detection rate to a satisfactory level.
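
The following PyTorch sketch shows the kind of modification the abstract describes: a residual block whose first 3x3 convolution is dilated (atrous), enlarging the receptive field so redundant high-resolution detail can be suppressed before the detection head. It is illustrative only, not the paper's actual Faster R-CNN definition, and the channel count and dilation rate are assumptions.

```python
# Illustrative residual block with a dilated 3x3 convolution (not the paper's network).
import torch
import torch.nn as nn

class DilatedResidualBlock(nn.Module):
    def __init__(self, channels, dilation=2):
        super().__init__()
        # dilation enlarges the receptive field without extra parameters or downsampling
        self.conv1 = nn.Conv2d(channels, channels, 3, padding=dilation,
                               dilation=dilation, bias=False)
        self.bn1 = nn.BatchNorm2d(channels)
        self.conv2 = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
        self.bn2 = nn.BatchNorm2d(channels)
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        out = self.relu(self.bn1(self.conv1(x)))
        out = self.bn2(self.conv2(out))
        return self.relu(out + x)  # residual (identity) connection

# Example: refine a hypothetical backbone feature map
features = torch.randn(1, 256, 64, 64)
refined = DilatedResidualBlock(256)(features)
```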


2021, Vol 2021, pp. 1-9
Author(s): Xiaoping Guo

Traditional text-annotation-based video retrieval labels videos manually with text, which is inefficient, highly subjective, and generally unable to describe the meaning of videos accurately. Traditional content-based video retrieval uses convolutional neural networks to extract the underlying feature information of images to build indexes and achieves similarity retrieval of video feature vectors according to a chosen similarity measure. In this paper, by studying the characteristics of sports videos, we propose a histogram-difference method based on transfer learning and a four-step method based on block matching for detecting abrupt cuts and fades in video shots, respectively. With adaptive thresholding, regions with large frame-difference changes are marked as candidate shot regions, and the shot boundaries are then determined by the cut-detection algorithm. Combining the characteristics of sports video, this paper also proposes a key-frame extraction method based on clustering and optical-flow analysis and compares it experimentally with the traditional clustering method; the algorithm effectively removes redundant frames, and the extracted key frames are more representative. Extensive experiments show that the keyword fuzzy-search algorithm proposed in this paper, based on an improved deep neural network and ontology semantic expansion, achieves more desirable retrieval performance. It is feasible to use this method for low-level video feature extraction, annotation, and keyword search, and one of its outstanding features is that it can quickly and effectively retrieve the desired video from a large number of Internet video resources, reducing the false-detection and miss rates while improving fidelity, which basically meets people's daily needs.
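
A minimal sketch of adaptive-threshold histogram-difference cut detection, one ingredient of the shot-segmentation step described above; the bin count, greyscale assumption, and mean-plus-k-standard-deviations threshold rule are illustrative choices, not the paper's exact settings.

```python
# Detect abrupt shot boundaries by thresholding frame-to-frame histogram differences.
import numpy as np

def histogram(frame, bins=32):
    """Normalised grey-level histogram of a frame given as a 2-D uint8 array."""
    h, _ = np.histogram(frame, bins=bins, range=(0, 256))
    return h / max(h.sum(), 1)

def detect_cuts(frames, k=3.0):
    """Mark frame i as a cut if its histogram difference exceeds mean + k * std."""
    diffs = np.array([np.abs(histogram(frames[i]) - histogram(frames[i - 1])).sum()
                      for i in range(1, len(frames))])
    threshold = diffs.mean() + k * diffs.std()  # adaptive threshold
    return [i + 1 for i, d in enumerate(diffs) if d > threshold]
```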


2011, Vol 10 (03), pp. 247-259
Author(s): Dianting Liu, Mei-Ling Shyu, Chao Chen, Shu-Ching Chen

As a consequence of the popularity of home video recorders and the surge of Web 2.0, the growing volume of video has made the management and integration of the information in videos an urgent and important issue in video retrieval. Key frames, as a high-quality summary of videos, play an important role in video browsing, searching, categorisation, and indexing. An effective set of key frames should include the major objects and events of the video sequence and contain minimal content redundancy. In this paper, an innovative key frame extraction method is proposed to select representative key frames for a video. By analysing the differences between frames and utilising a clustering technique, a set of key frame candidates (KFCs) is first selected at the shot level, and then the information within a video shot and between video shots is used to filter the candidate set and generate the final set of key frames. Experimental results on the TRECVID 2007 video dataset demonstrate the effectiveness of the proposed key frame extraction method in terms of the percentage of extracted key frames and the retrieval precision.
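
A hedged sketch of the two-stage idea: within each shot, candidates are picked where frame-to-frame differences are largest, and across shots near-duplicate candidates are discarded so the final set carries minimal redundancy. The function names, the cosine-similarity criterion, and the thresholds are assumptions for illustration, not the authors' implementation.

```python
# Two-stage key-frame selection sketch: shot-level candidates, then cross-shot filtering.
import numpy as np

def shot_candidates(frame_descs, top_k=3):
    """Within one shot, keep the frames with the largest change from their predecessor."""
    diffs = np.linalg.norm(np.diff(frame_descs, axis=0), axis=1)
    order = np.argsort(diffs)[::-1][:top_k]
    return [int(i) + 1 for i in order]

def filter_candidates(candidate_descs, sim_threshold=0.9):
    """Across shots, drop candidates too similar (cosine) to key frames already kept."""
    normed = [d / (np.linalg.norm(d) + 1e-12) for d in candidate_descs]
    kept = []
    for idx, d in enumerate(normed):
        if all(float(d @ normed[j]) < sim_threshold for j in kept):
            kept.append(idx)
    return kept
```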


Author(s): Kimiaki Shirahama, Kuniaki Uehara

This paper examines video retrieval based on the Query-By-Example (QBE) approach, where shots relevant to a query are retrieved from large-scale video data based on their similarity to example shots. This involves two crucial problems. The first is that similarity in features does not necessarily imply similarity in semantic content. The second is the expensive computational cost of computing the similarity of a huge number of shots to the example shots. The authors have developed a method that can filter out a large number of shots irrelevant to a query, based on a video ontology, a knowledge base about the concepts displayed in a shot. The method utilizes various concept relationships (e.g., generalization/specialization, sibling, part-of, and co-occurrence) defined in the video ontology. In addition, although the video ontology assumes that shots are accurately annotated with concepts, accurate annotation is difficult due to the diversity of forms and appearances of the concepts. Dempster-Shafer theory is therefore used to account for the uncertainty in determining the relevance of a shot from its inaccurate annotation. Experimental results on TRECVID 2009 video data validate the effectiveness of the method.
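
The snippet below illustrates the two mechanisms named in the abstract, with made-up concept links and mass values: expanding query concepts through ontology relationships, and combining two uncertain annotation sources with Dempster's rule over the frame of discernment {relevant, irrelevant}. It is a sketch of the general technique, not the authors' system.

```python
# Ontology-based concept expansion plus Dempster's rule of combination (illustrative).
def related_concepts(query_concepts, ontology):
    """Expand query concepts with generalisation/sibling/part-of/co-occurrence links."""
    expanded = set(query_concepts)
    for c in query_concepts:
        expanded.update(ontology.get(c, []))
    return expanded

def dempster_combine(m1, m2):
    """Combine two mass functions given as dicts over 'R', 'I', and the whole frame 'RI'."""
    combined, conflict = {"R": 0.0, "I": 0.0, "RI": 0.0}, 0.0
    for a, ma in m1.items():
        for b, mb in m2.items():
            inter = set(a) & set(b)
            if not inter:
                conflict += ma * mb  # mass assigned to the empty set
            else:
                key = "".join(sorted(inter, reverse=True))
                combined[key] += ma * mb
    return {k: v / (1.0 - conflict) for k, v in combined.items()}

# Hypothetical evidence for one shot from a concept detector and the ontology
m_detector = {"R": 0.6, "I": 0.1, "RI": 0.3}
m_ontology = {"R": 0.5, "I": 0.2, "RI": 0.3}
print(dempster_combine(m_detector, m_ontology))
```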


2015, Vol 3 (3), pp. 1-13
Author(s): Hiroki Nomiya, Atsushi Morikuni, Teruhisa Hochin

A lifelog video retrieval framework is proposed for better utilization of the large amount of lifelog video data. The proposed method retrieves emotional scenes, such as scenes in which a person in the video is smiling, on the assumption that an important event is likely to occur in most emotional scenes. An emotional scene is detected on the basis of facial expression recognition using a wide variety of facial features. The authors adopt an unsupervised learning approach called ensemble clustering to recognize the facial expressions, because supervised learning approaches require sufficient training data, which makes them quite troublesome to apply to large-scale video databases. The retrieval performance of the proposed method is evaluated through an emotional scene detection experiment from the viewpoints of accuracy and efficiency. In addition, a prototype retrieval system is implemented based on the proposed emotional scene detection method.
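
For concreteness, here is a sketch of co-association-based ensemble (consensus) clustering over facial-feature vectors, the kind of unsupervised approach the abstract relies on; it is not the authors' implementation, and the cluster count, number of runs, and linkage choice are illustrative. Frames whose faces fall in a "smile-like" consensus cluster would then be flagged as emotional scenes.

```python
# Consensus clustering sketch: many k-means runs build a co-association matrix,
# which is then clustered hierarchically using 1 - co-association as a distance.
import numpy as np
from sklearn.cluster import KMeans, AgglomerativeClustering

def ensemble_clustering(features, n_clusters=4, n_runs=20, seed=0):
    """features: (N, D) array of facial-feature vectors; returns consensus labels."""
    rng = np.random.default_rng(seed)
    n = len(features)
    coassoc = np.zeros((n, n))
    for _ in range(n_runs):
        labels = KMeans(n_clusters=n_clusters, n_init=1,
                        random_state=int(rng.integers(1_000_000))).fit_predict(features)
        coassoc += (labels[:, None] == labels[None, :])
    coassoc /= n_runs
    # 'metric' was named 'affinity' in older scikit-learn versions
    final = AgglomerativeClustering(n_clusters=n_clusters, metric="precomputed",
                                    linkage="average").fit_predict(1.0 - coassoc)
    return final
```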

