On Building a Universal and Compact Visual Vocabulary

2013 ◽  
Vol 2013 ◽  
pp. 1-8 ◽  
Author(s):  
Jian Hou ◽  
Wei-Xue Liu ◽  
Xu E ◽  
Hamid Reza Karimi

Bag-of-visual-words has been shown to be a powerful image representation and has attained great success in many computer vision and pattern recognition applications. Usually, for a given dataset, researchers choose to build a specific visual vocabulary from that dataset, and the problem of deriving a universal visual vocabulary is rarely addressed. Based on previous work on classification performance with respect to visual vocabulary size, we arrive at the hypothesis that a universal visual vocabulary can be obtained by taking into account the extent of similarity among the keypoints represented by one visual word. We then propose a similarity-threshold-based clustering method to calculate the optimal vocabulary size, where the universal similarity threshold can be obtained empirically. With the optimal vocabulary size, the optimal visual vocabularies of limited size built from three datasets are shown to be exchangeable and therefore universal. This result indicates that a universal and compact visual vocabulary can be built from a dataset that is not too small. Our work narrows the gap between bag-of-visual-words and bag-of-words, where a relatively fixed vocabulary can be used across different text datasets.
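The similarity-threshold idea can be illustrated with a greedy, leader-style clustering pass: a keypoint joins an existing visual word only if it lies within the threshold of that word's center; otherwise it founds a new word, and the number of words that emerges serves as the vocabulary-size estimate. This is a minimal sketch of the idea, not the authors' exact algorithm; the threshold value and the 2-D toy descriptors are illustrative assumptions.

```python
import math
import random

def leader_cluster(points, threshold):
    """Greedy similarity-threshold clustering: each point joins the first
    existing center within `threshold`, otherwise it founds a new cluster.
    The resulting number of clusters estimates the vocabulary size."""
    centers = []
    for p in points:
        for c in centers:
            if math.dist(p, c) <= threshold:
                break  # similar enough to an existing visual word
        else:
            centers.append(p)  # no word is similar enough; open a new one
    return centers

random.seed(0)
# Toy 2-D "descriptors": two tight blobs far apart.
blob_a = [(random.gauss(0, 0.1), random.gauss(0, 0.1)) for _ in range(50)]
blob_b = [(random.gauss(5, 0.1), random.gauss(5, 0.1)) for _ in range(50)]
centers = leader_cluster(blob_a + blob_b, threshold=1.0)
print(len(centers))  # -> 2: one visual word per blob
```

A tighter threshold splits the blobs into more words, so the empirically chosen threshold directly controls the vocabulary size.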

2018 ◽  
Vol 45 (1) ◽  
pp. 117-135 ◽  
Author(s):  
Amna Sarwar ◽  
Zahid Mehmood ◽  
Tanzila Saba ◽  
Khurram Ashfaq Qazi ◽  
Ahmed Adnan ◽  
...  

Advancements in multimedia technologies have led to the growth of image databases. Retrieving images from such databases using the visual attributes of the images is a challenging task due to the close visual appearance among these attributes, which also introduces the issue of the semantic gap. In this article, we propose a novel method based on the bag-of-words (BoW) model, which performs visual word integration of the local intensity order pattern (LIOP) feature and the local binary pattern variance (LBPV) feature to reduce the semantic gap and enhance the performance of content-based image retrieval (CBIR). The proposed method uses the LIOP and LBPV features to build two smaller visual vocabularies (one from each feature), which are integrated to build a larger visual vocabulary that contains complementary features of both descriptors. For efficient CBIR, the smaller vocabulary improves recall, while the larger vocabulary improves the precision, or accuracy, of the CBIR. A comparative analysis of the proposed method is performed on three image databases, namely WANG-1K, WANG-1.5K, and Holidays. The experimental analysis on these image databases demonstrates its robust performance compared with recent CBIR methods.
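The integration step described above amounts to quantizing each feature type against its own vocabulary and concatenating the resulting histograms into one larger representation. The sketch below assumes tiny stand-in vocabularies and descriptors in place of the real LIOP and LBPV features.

```python
import math

def quantize(descriptors, vocabulary):
    """Map each descriptor to its nearest visual word; return the
    word-count histogram (one bin per word in the vocabulary)."""
    hist = [0] * len(vocabulary)
    for d in descriptors:
        nearest = min(range(len(vocabulary)),
                      key=lambda i: math.dist(d, vocabulary[i]))
        hist[nearest] += 1
    return hist

# Hypothetical small vocabularies for two descriptor types (stand-ins
# for the LIOP and LBPV vocabularies in the paper).
vocab_liop = [(0.0, 0.0), (1.0, 1.0)]
vocab_lbpv = [(0.0,), (0.5,), (1.0,)]

liop_desc = [(0.1, 0.1), (0.9, 1.0), (1.1, 0.8)]
lbpv_desc = [(0.45,), (0.05,)]

# "Integration" is the concatenation of the two per-vocabulary
# histograms into one larger, complementary representation.
integrated = quantize(liop_desc, vocab_liop) + quantize(lbpv_desc, vocab_lbpv)
print(integrated)  # -> [1, 2, 1, 1, 0]
```

The integrated histogram has one bin per word of either vocabulary, so its length is the sum of the two vocabulary sizes.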


2019 ◽  
Vol 2019 ◽  
pp. 1-11 ◽  
Author(s):  
Hui Huang ◽  
Yan Ma

The Bag-of-Words (BoW) model is a well-known image categorization technique. However, in the conventional BoW model, neither the vocabulary size nor the visual words can be determined automatically. To overcome these problems, a hybrid clustering approach that combines improved hierarchical clustering with a K-means algorithm is proposed. We present a cluster validity index for the hierarchical clustering algorithm to adaptively determine when the algorithm should terminate and the optimal number of clusters. Furthermore, we improve the max-min distance method to optimize the initial cluster centers. The optimal number of clusters and the initial cluster centers are fed into K-means, and finally the vocabulary size and visual words are obtained. The proposed approach is extensively evaluated on two visual datasets. The experimental results show that the proposed method outperforms the conventional BoW model in categorization accuracy and demonstrate the feasibility and effectiveness of our approach.
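The max-min distance method mentioned above (the baseline the paper improves on) can be sketched as follows: start from one point, then repeatedly add the point whose minimum distance to the already-chosen centers is largest, spreading the initial K-means centers across the data. The toy points are illustrative assumptions.

```python
import math

def max_min_init(points, k):
    """Max-min distance seeding for K-means: greedily pick centers that
    maximize the minimum distance to the centers chosen so far."""
    centers = [points[0]]  # seed with the first point
    while len(centers) < k:
        next_pt = max(points,
                      key=lambda p: min(math.dist(p, c) for c in centers))
        centers.append(next_pt)
    return centers

points = [(0, 0), (0.2, 0), (5, 5), (5.1, 5), (10, 0)]
seeds = max_min_init(points, 3)
print(seeds)  # -> [(0, 0), (10, 0), (5, 5)]
```

Because each new seed is as far as possible from the existing ones, no two initial centers land in the same dense region, which is exactly what random initialization cannot guarantee.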


2021 ◽  
Vol 24 (2) ◽  
pp. 78-86
Author(s):  
Zainab N. Sultani ◽  
Ban N. Dhannoon

Image classification is acknowledged as one of the most critical and challenging tasks in computer vision. The bag of visual words (BoVW) model has proven very efficient for image classification tasks since it can effectively represent distinctive image features in vector space. In this paper, BoVW using the Scale-Invariant Feature Transform (SIFT) and Oriented FAST and Rotated BRIEF (ORB) descriptors is adapted for image classification. We propose a novel image classification system using local feature information obtained from both the SIFT and ORB local feature descriptors. As a result, the constructed SO-BoVW model presents highly discriminative features, enhancing classification performance. Experiments on the Caltech-101 and Flowers datasets prove the effectiveness of the proposed method.
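Once the fused SIFT+ORB histograms exist, classification can be as simple as nearest-neighbour matching by cosine similarity. The sketch below is a minimal stand-in for the classifier stage, assuming already-fused toy histograms and hypothetical labels; the paper's own classifier may differ.

```python
import math

def cosine(a, b):
    """Cosine similarity between two histograms."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.hypot(*a) * math.hypot(*b))

def classify_1nn(query, train):
    """1-nearest-neighbour over BoVW histograms by cosine similarity."""
    return max(train, key=lambda t: cosine(query, t[0]))[1]

# Hypothetical fused SIFT+ORB histograms with illustrative labels.
train = [([5, 0, 1, 0], "flower"), ([0, 4, 0, 3], "face")]
print(classify_1nn([4, 1, 1, 0], train))  # -> flower
```

Cosine similarity is a common choice for BoVW histograms because it compares word proportions rather than absolute counts, which vary with the number of keypoints per image.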


2014 ◽  
Vol 2014 ◽  
pp. 1-7
Author(s):  
Wei-Xue Liu ◽  
Jian Hou ◽  
Hamid Reza Karimi

A codebook is an effective image representation method. Obtained by clustering local image descriptors, a codebook has been shown to be a distinctive image feature and is widely applied in object classification. In almost all existing work on codebooks, building the visual vocabulary follows a basic routine: extracting local image descriptors and clustering them with a user-designated number of clusters. The problem with this routine is that building a codebook for each individual dataset is not efficient. To address this problem, we investigate the influence of vocabulary size on classification performance and vocabulary universality with the kNN classifier. Experimental results indicate that, provided the vocabulary size is large enough, the vocabularies built from different datasets are exchangeable and universal.
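The "basic routine" criticized above reduces to clustering the pooled descriptors with a user-designated number of clusters and keeping the centers as the codebook. A toy Lloyd's K-means sketch of that step, under the assumption of tiny 2-D descriptors:

```python
import math
import random

def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's K-means with a user-designated k: the clustering step
    of the standard codebook-building routine (toy sketch)."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)  # initial centers drawn from the data
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            i = min(range(k), key=lambda j: math.dist(p, centers[j]))
            groups[i].append(p)  # assign to the nearest center
        # Recompute each center as the mean of its group (keep old center
        # if a group went empty).
        centers = [
            tuple(sum(x) / len(g) for x in zip(*g)) if g else centers[i]
            for i, g in enumerate(groups)
        ]
    return centers

pts = [(0, 0), (0.1, 0.2), (4, 4), (4.2, 3.9)]
codebook = kmeans(pts, k=2)
print(sorted(codebook))  # two centers, one per descriptor blob
```

Repeating this routine per dataset is the inefficiency the paper targets: with a large enough k, one such codebook could serve several datasets.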


2020 ◽  
Vol 7 (2) ◽  
pp. 349
Author(s):  
Budiman Baso ◽  
Nanik Suciati

The variety of motifs in East Nusa Tenggara (NTT) tenun, such as flora, fauna, and geometric patterns, is a unique characteristic that can distinguish the region of origin and the type of the tenun. In this study, a Content-Based Image Retrieval (CBIR) system is implemented for tenun images, so that users can search for tenun images in a database using a query image, based on the visual features contained in the image. Often the query image entered by the user varies in scale, rotation, and lighting, so a feature extraction method is needed that can accommodate these variations. The tenun image retrieval system in this study uses the Bag of Visual Words (BoVW) model built from image keypoints extracted with the Speeded Up Robust Features (SURF) method. The BoVW vocabulary is built using K-Means to produce the visual vocabulary from the keypoints of all training images. The BoVW representation is expected to handle scale and rotation variations in the images, while lighting variations are addressed by improving image quality with Contrast Limited Adaptive Histogram Equalization (CLAHE). The experiment compares the performance of BoVW representations built using SURF features with those built using Maximally Stable Extremal Regions (MSER) for tenun image retrieval. The results show that SURF achieves an average accuracy of 89.86% with a computation time of 9.94 seconds, whereas MSER achieves an average accuracy of 84.04% with a computation time of 1.95 seconds.


2014 ◽  
Vol 8 (5) ◽  
pp. 310-318 ◽  
Author(s):  
Mohammad Mehdi Farhangi ◽  
Mohsen Soryani ◽  
Mahmood Fathy

2011 ◽  
Vol 8 (3) ◽  
pp. 931-951 ◽  
Author(s):  
Xinghao Jiang ◽  
Tanfeng Sun ◽  
Fu Guanglei

Local features have proved effective in image/video semantic analysis. The BoVW (bag of visual words) scheme clusters local features to form a visual vocabulary comprising a number of words, where each word is the center of one feature cluster. The vocabulary is used to recognize image semantics. In this paper, a new scheme to construct a semantic-binding hierarchical visual vocabulary is proposed. Some attributes and relationships of the semantic nodes in the model are discussed. The hierarchical semantic model is used to organize multi-scale semantics into a level-by-level structure. Experiments are performed on the LabelMe dataset; the performance of our scheme is evaluated and compared with the traditional BoVW scheme, and the experimental results demonstrate the efficiency and flexibility of our scheme.
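The level-by-level structure described above can be pictured as a tree in which coarse semantic nodes own the finer visual words beneath them, so that every word is bound to a path of semantics. A minimal sketch, assuming an illustrative two-level hierarchy whose node names and word ids are invented, not the paper's:

```python
# Hypothetical two-level semantic hierarchy: coarse semantic nodes own
# the fine visual words (integer ids) beneath them.
hierarchy = {
    "outdoor": {"tree_bark": [0, 3], "sky_patch": [1]},
    "indoor": {"desk_edge": [2], "screen_glow": [4]},
}

def semantic_path(word_id, tree):
    """Return the level-by-level path (coarse -> fine) that binds a
    visual word id to its semantic nodes, or None if unbound."""
    for coarse, fine_nodes in tree.items():
        for fine, word_ids in fine_nodes.items():
            if word_id in word_ids:
                return [coarse, fine]
    return None

print(semantic_path(3, hierarchy))  # -> ['outdoor', 'tree_bark']
```

Binding each word to such a path is what lets recognition report semantics at multiple scales from a single vocabulary lookup.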


2016 ◽  
Vol 30 (3) ◽  
pp. 403-412 ◽  
Author(s):  
Khaled F. Hussain ◽  
Ghada S. Moussa

A large increase in the number and types of vehicles has occurred due to population growth. This fact brings the need for efficient vehicle classification systems that can be used in traffic surveillance and intelligent transportation systems. In this study, a multi-type vehicle classification system based on Random Neural Networks (RNNs) and Bag-of-Visual-Words (BOVW) is developed. A 10-fold cross-validation technique is used, with a large dataset, to assess the proposed approach. Moreover, the BOVW–RNN classification performance is compared with that of LIVCS, a vehicle classification system based on RNNs. The results reveal that the BOVW–RNN classification system produces more reliable and accurate classification results than LIVCS. The main contribution of this paper is that the developed system can serve as a framework for many vehicle classification systems.

