A Novel Image Retrieval Based on a Combination of Local and Global Histograms of Visual Words

Mathematical Problems in Engineering ◽

10.1155/2016/8217250 ◽

2016 ◽

Vol 2016 ◽

pp. 1-12 ◽

Cited By ~ 32

Author(s):

Zahid Mehmood ◽

Syed Muhammad Anwar ◽

Nouman Ali ◽

Hafiz Adnan Habib ◽

Muhammad Rashid

Keyword(s):

Image Retrieval ◽

Spatial Information ◽

Image Representation ◽

Ground Truth ◽

Spatial Layout ◽

Bag Of Visual Words ◽

Significant Information ◽

Visual Words ◽

Ground Truth Image ◽

Similar Images

Content-based image retrieval (CBIR) provides a sustainable solution to retrieve similar images from an image archive. In the last few years, the Bag-of-Visual-Words (BoVW) model gained attention and significantly improved the performance of image retrieval. In the standard BoVW model, an image is represented as an orderless global histogram of visual words by ignoring the spatial layout. The spatial layout of an image carries significant information that can enhance the performance of CBIR. In this paper, we are presenting a novel image representation that is based on a combination of local and global histograms of visual words. The global histogram of visual words is constructed over the whole image, while the local histogram of visual words is constructed over the local rectangular region of the image. The local histogram contains the spatial information about the salient objects. Extensive experiments and comparisons conducted on Corel-A, Caltech-256, and Ground Truth image datasets demonstrate that the proposed image representation increases the performance of image retrieval.

Download Full-text

A Combination of Spatial Pyramid and Inverted Index for Large-Scale Image Retrieval

Computer Vision ◽

10.4018/978-1-5225-5204-8.ch054 ◽

2018 ◽

pp. 1307-1321

Author(s):

Vinh-Tiep Nguyen ◽

Thanh Duc Ngo ◽

Minh-Triet Tran ◽

Duy-Dinh Le ◽

Duc Anh Duong

Keyword(s):

Image Retrieval ◽

Large Scale ◽

Spatial Information ◽

Real Life ◽

Inverted Index ◽

Bag Of Words ◽

Visual Words ◽

Benchmark Datasets ◽

Large Scale Image Retrieval ◽

Inverted Indexing

Large-scale image retrieval has been shown remarkable potential in real-life applications. The standard approach is based on Inverted Indexing, given images are represented using Bag-of-Words model. However, one major limitation of both Inverted Index and Bag-of-Words presentation is that they ignore spatial information of visual words in image presentation and comparison. As a result, retrieval accuracy is decreased. In this paper, the authors investigate an approach to integrate spatial information into Inverted Index to improve accuracy while maintaining short retrieval time. Experiments conducted on several benchmark datasets (Oxford Building 5K, Oxford Building 5K+100K and Paris 6K) demonstrate the effectiveness of our proposed approach.

Download Full-text

A Combination of Spatial Pyramid and Inverted Index for Large-Scale Image Retrieval

International Journal of Multimedia Data Engineering and Management ◽

10.4018/ijmdem.2015040103 ◽

2015 ◽

Vol 6 (2) ◽

pp. 37-51 ◽

Cited By ~ 2

Author(s):

Vinh-Tiep Nguyen ◽

Thanh Duc Ngo ◽

Minh-Triet Tran ◽

Duy-Dinh Le ◽

Duc Anh Duong

Keyword(s):

Image Retrieval ◽

Large Scale ◽

Spatial Information ◽

Real Life ◽

Inverted Index ◽

Bag Of Words ◽

Visual Words ◽

Benchmark Datasets ◽

Large Scale Image Retrieval ◽

Inverted Indexing

Download Full-text

A Novel Discriminating and Relative Global Spatial Image Representation with Applications in CBIR

Applied Sciences ◽

10.3390/app8112242 ◽

2018 ◽

Vol 8 (11) ◽

pp. 2242 ◽

Cited By ~ 16

Author(s):

Bushra Zafar ◽

Rehan Ashraf ◽

Nouman Ali ◽

Muhammad Iqbal ◽

Muhammad Sajid ◽

...

Keyword(s):

Image Classification ◽

Spatial Information ◽

Image Representation ◽

Feature Space ◽

Research Problem ◽

Visual Words ◽

Spatial Image ◽

User Query ◽

Novel Approach ◽

Image Representations

The requirement for effective image search, which motivates the use of Content-Based Image Retrieval (CBIR) and the search of similar multimedia contents on the basis of user query, remains an open research problem for computer vision applications. The application domains for Bag of Visual Words (BoVW) based image representations are object recognition, image classification and content-based image analysis. Interest point detectors are quantized in the feature space and the final histogram or image signature do not retain any detail about co-occurrences of features in the 2D image space. This spatial information is crucial, as it adversely affects the performance of an image classification-based model. The most notable contribution in this context is Spatial Pyramid Matching (SPM), which captures the absolute spatial distribution of visual words. However, SPM is sensitive to image transformations such as rotation, flipping and translation. When images are not well-aligned, SPM may lose its discriminative power. This paper introduces a novel approach to encoding the relative spatial information for histogram-based representation of the BoVW model. This is established by computing the global geometric relationship between pairs of identical visual words with respect to the centroid of an image. The proposed research is evaluated by using five different datasets. Comprehensive experiments demonstrate the robustness of the proposed image representation as compared to the state-of-the-art methods in terms of precision and recall values.

Download Full-text

A document expansion framework for tag-based image retrieval

Aslib Journal of Information Management ◽

10.1108/ajim-05-2017-0133 ◽

2018 ◽

Vol 70 (1) ◽

pp. 47-65 ◽

Cited By ~ 1

Author(s):

Wei Lu ◽

Heng Ding ◽

Jiepu Jiang

Keyword(s):

Image Retrieval ◽

Nearest Neighbor ◽

Image Representation ◽

Image Features ◽

Mobile Platforms ◽

Content Type ◽

Expansion Strategy ◽

Rank List ◽

Core Components ◽

Similar Images

Purpose The purpose of this paper is to utilize document expansion techniques for improving image representation and retrieval. This paper proposes a concise framework for tag-based image retrieval (TBIR). Design/methodology/approach The proposed approach includes three core components: a strategy of selecting expansion (similar) images from the whole corpus (e.g. cluster-based or nearest neighbor-based); a technique for assessing image similarity, which is adopted for selecting expansion images (text, image, or mixed); and a model for matching the expanded image representation with the search query (merging or separate). Findings The results show that applying the proposed method yields significant improvements in effectiveness, and the method obtains better performance on the top of the rank and makes a great improvement on some topics with zero score in baseline. Moreover, nearest neighbor-based expansion strategy outperforms the cluster-based expansion strategy, and using image features for selecting expansion images is better than using text features in most cases, and the separate method for calculating the augmented probability P(q|RD) is able to erase the negative influences of error images in RD. Research limitations/implications Despite these methods only outperform on the top of the rank instead of the entire rank list, TBIR on mobile platforms still can benefit from this approach. Originality/value Unlike former studies addressing the sparsity, vocabulary mismatch, and tag relatedness in TBIR individually, the approach proposed by this paper addresses all these issues with a single document expansion framework. It is a comprehensive investigation of document expansion techniques in TBIR.

Download Full-text

Robust image retrieval using bag of visual words with fuzzy codebooks and fuzzy assignment

Proceedings of the 12th International Conference on Knowledge Management and Knowledge Technologies - i-KNOW '12 ◽

10.1145/2362456.2362498 ◽

2012 ◽

Author(s):

Marian Kogler ◽

Mathias Lux

Keyword(s):

Image Retrieval ◽

Bag Of Visual Words ◽

Visual Words ◽

Robust Image

Download Full-text

A Generalized BoVW Model for Content-Based Image Retrieval

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.496-500.1817 ◽

2014 ◽

Vol 496-500 ◽

pp. 1817-1820

Author(s):

Wang Ming Xu ◽

Hang Yang ◽

Kang Ling Fang ◽

Xin Hai Liu

Keyword(s):

Image Retrieval ◽

Sparse Coding ◽

Spectral Clustering ◽

Clustering Algorithms ◽

Bag Of Visual Words ◽

Clustering Methods ◽

Visual Words ◽

Feature Encoding ◽

Visual Dictionary ◽

Computer Vision Applications

BoVW (Bag of Visual Words) Model has attracted much attention for many computer vision applications in which an image is represented by a histogram of visual words. Two of its critical steps are to construct a visual dictionary and to quantize each local feature to its nearest visual word in the dictionary. In this paper, we present the framework of a generalized BoVW (GBoVW) Model in which feature quantization can be replaced by sparse coding based feature encoding. We also propose to use spectral clustering to construct a visual dictionary to overcome the shortcomings of K-Means based clustering algorithms. Image retrieval experiments on ZuBud database indicate that GBoVW Model improves BoVW Model and the visual dictionary generated by spectral clustering achieves better performance than that by K-Means based clustering methods.

Download Full-text

Remote sensing image retrieval method based on improved fast search and find of density peaks—bag of visual words model

Journal of Applied Remote Sensing ◽

10.1117/1.jrs.13.046511 ◽

2019 ◽

Vol 13 (04) ◽

pp. 1

Author(s):

Xiao Zhao ◽

Yanyan Xu

Keyword(s):

Remote Sensing ◽

Image Retrieval ◽

Remote Sensing Image ◽

Bag Of Visual Words ◽

Retrieval Method ◽

Visual Words ◽

Fast Search ◽

Density Peaks

Download Full-text

Segmentation Free Word Spotting for Handwritten Documents Using Bag of Visual Words Based on Co-HOG Descriptor

International Journal of Information Retrieval Research ◽

10.4018/ijirr.2019040105 ◽

2019 ◽

Vol 9 (2) ◽

pp. 49-65

Author(s):

Thontadari C. ◽

Prabhakar C. J.

Keyword(s):

Visual Information ◽

Spatial Information ◽

Spatial Location ◽

Visual Word ◽

Bag Of Visual Words ◽

Word Spotting ◽

Handwritten Documents ◽

Visual Words ◽

Handwritten Document ◽

Free Word

In this article, the authors propose a segmentation-free word spotting in handwritten document images using a Bag of Visual Words (BoVW) framework based on the co-occurrence histogram of oriented gradient (Co-HOG) descriptor. Initially, the handwritten document is represented using visual word vectors which are obtained based on the frequency of occurrence of Co-HOG descriptor within local patches of the document. The visual word representation vector does not consider their spatial location and spatial information helps to determine a location exclusively with visual information when the different location can be perceived as the same. Hence, to add spatial distribution information of visual words into the unstructured BoVW framework, the authors adopted spatial pyramid matching (SPM) technique. The performance of the proposed method evaluated using popular datasets and it is confirmed that the authors' method outperforms existing segmentation free word spotting techniques.

Download Full-text

Enhanced bag of visual words representations for content based image retrieval: a comparative study

Artificial Intelligence Review ◽

10.1007/s10462-019-09715-6 ◽

2019 ◽

Vol 53 (3) ◽

pp. 1615-1653

Author(s):

K. S. Arun ◽

V. K. Govindan ◽

S. D. Madhu Kumar

Keyword(s):

Comparative Study ◽

Image Retrieval ◽

Content Based Image Retrieval ◽

Bag Of Visual Words ◽

Visual Words

Download Full-text

Bag-of-Visual Words based Improved Image Retrieval Algorithm for Vision Indoor Positioning

2020 IEEE 91st Vehicular Technology Conference (VTC2020-Spring) ◽

10.1109/vtc2020-spring48590.2020.9129412 ◽

2020 ◽

Author(s):

Shuang Jia ◽

Lin Ma ◽

Xuezhi Tan ◽

Danyang Qin

Keyword(s):

Image Retrieval ◽

Indoor Positioning ◽

Retrieval Algorithm ◽

Bag Of Visual Words ◽

Visual Words

Download Full-text