Image Clustering Using Active-Constraint Semi-Supervised Affinity Propagation

Author(s):  
Qi Lei ◽  
Jun Liu ◽  
Min Wu ◽  
Jie Wang ◽  
...  

Image clustering is an effective way to explore and analyze large quantities of image data. The HSV color space is particularly advantageous for image feature extraction because it yields relatively prominent feature vectors. The objective of this study is to develop an image clustering method using the active-constraint semi-supervised affinity propagation (ACSSAP) algorithm. The algorithm adds supervision to the affinity propagation (AP) clustering algorithm through pairwise constraints and uses active learning to guide the clustering. Active learning of pairwise constraints leads to an adjustment of the similarity matrix in AP at each iteration. In the experiments, the advantage of the HSV space is analyzed, and the ACSSAP algorithm is evaluated on data sets of different sizes in comparison with other algorithms. The results demonstrate that ACSSAP outperforms the compared algorithms.
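The adjustment of the similarity matrix by pairwise constraints can be sketched as follows. The abstract does not give the exact rule, so this assumes a common convention in constrained AP: a must-link pair is set to the maximum similarity (0 for negative squared distances) and a cannot-link pair to a large negative value. The function names and the `low` parameter are hypothetical.

```python
def negative_squared_distance(a, b):
    """Similarity commonly used with AP: the closer the points, the nearer to 0."""
    return -sum((x - y) ** 2 for x, y in zip(a, b))


def build_similarity(points):
    """Dense similarity matrix for a small list of feature vectors."""
    n = len(points)
    return [[negative_squared_distance(points[i], points[j]) for j in range(n)]
            for i in range(n)]


def apply_pairwise_constraints(S, must_link, cannot_link, low=-1e9):
    """Return a copy of S adjusted by constraints (assumed rule, not the paper's)."""
    adjusted = [row[:] for row in S]
    for i, j in must_link:
        adjusted[i][j] = adjusted[j][i] = 0.0   # treat the pair as maximally similar
    for i, j in cannot_link:
        adjusted[i][j] = adjusted[j][i] = low   # make the pair effectively incompatible
    return adjusted


points = [(0.0, 0.0), (1.0, 0.0), (5.0, 5.0)]
S = build_similarity(points)
S_adjusted = apply_pairwise_constraints(S, must_link=[(0, 2)], cannot_link=[(0, 1)])
```

In an active-learning loop, the pairs passed in would be the ones an oracle was queried about, and the adjusted matrix would be fed back to AP for the next iteration.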

Author(s):  
Z. Wang ◽  
P. Liu ◽  
T. Cui

In recent years, fire recognition based on image features has become a hot topic in fire monitoring. However, owing to the complexity of the forest environment, the accuracy of forest fire recognition based on image features is low. To address this, this paper proposes a feature extraction algorithm based on the YCrCb color space and K-means clustering. First, the paper prepares and analyzes the color characteristics of a large number of forest fire image samples. Using the K-means clustering algorithm, a forest flame model is obtained by comparing two commonly used color spaces, and the suspected flame area is identified and extracted. The experimental results show that the extraction accuracy of the flame area based on the YCrCb color model is higher than that based on the HSI color model, and that the approach can be applied to forest fire identification in different scenes and is feasible in practice.
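A minimal sketch of the two ingredients named above: converting pixels to YCrCb, then K-means on the chroma channels. The conversion uses the standard full-range ITU-R BT.601 coefficients. The sample pixels, the choice to cluster on (Cr, Cb) only, and the initial centers are illustrative assumptions, not the paper's flame model.

```python
def rgb_to_ycrcb(r, g, b):
    """Convert 8-bit RGB to full-range YCrCb (ITU-R BT.601 coefficients)."""
    y = 0.299 * r + 0.587 * g + 0.114 * b
    cr = 128 + 0.5 * r - 0.418688 * g - 0.081312 * b
    cb = 128 - 0.168736 * r - 0.331264 * g + 0.5 * b
    return y, cr, cb


def nearest(point, centers):
    """Index of the center closest to point (squared Euclidean distance)."""
    return min(range(len(centers)),
               key=lambda c: sum((a - b) ** 2 for a, b in zip(point, centers[c])))


def kmeans(points, centers, iterations=20):
    """Plain Lloyd iterations from the given initial centers."""
    centers = list(centers)
    for _ in range(iterations):
        groups = [[] for _ in centers]
        for p in points:
            groups[nearest(p, centers)].append(p)
        # Move each center to the mean of its group; keep it if the group is empty.
        centers = [tuple(sum(axis) / len(g) for axis in zip(*g)) if g else centers[i]
                   for i, g in enumerate(groups)]
    return centers, [nearest(p, centers) for p in points]


# Hypothetical samples: three flame-like pixels, then three vegetation-like pixels.
pixels = [(255, 120, 0), (250, 100, 10), (240, 140, 20),
          (30, 120, 40), (20, 100, 30), (40, 140, 50)]
features = [rgb_to_ycrcb(*p)[1:] for p in pixels]   # keep (Cr, Cb), drop luma
centers, labels = kmeans(features, [features[0], features[3]])
```

Flame-like pixels have a high Cr and low Cb, so they separate from vegetation in the chroma plane regardless of brightness, which is one plausible reason a YCrCb model is attractive here.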


2018 ◽  
Vol 2018 ◽  
pp. 1-9 ◽  
Author(s):  
Zhihan Liu ◽  
Yi Jia ◽  
Xiaolu Zhu

Car sharing is a type of car rental service in which consumers rent cars for short periods of time, often charged by the hour. Analysis of urban traffic big data is important for determining depot locations for a car-sharing system. Taxi OD (origin-destination) data is a typical urban traffic dataset. Its volume is so large that traditional data processing applications do not work well. This paper presents an optimization method that determines depot locations by clustering taxi OD points with the AP (affinity propagation) clustering algorithm. Based on an analysis of the characteristics of AP clustering, the algorithm is optimized hierarchically using administrative region segmentation. Because the similarity matrix of taxi OD points is sparse, the input parameters of AP clustering are adapted accordingly. In the case study, we use OD pair information from Beijing’s taxi GPS trajectory data. The number and locations of depots are determined by clustering the OD points with the optimized AP clustering. We describe the experimental results of our approach and compare it with the standard K-means method using quantitative and stationarity indices. Experiments on the real datasets show that the proposed method for determining car-sharing depots outperforms standard K-means.
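For readers unfamiliar with AP, the sketch below shows the standard message-passing updates (responsibilities and availabilities with damping) on a dense similarity matrix. The main input parameter, the preference, sits on the diagonal of the matrix. This is plain AP, not the hierarchical, region-based variant the abstract describes; the sample points and the preference value are assumptions.

```python
def similarity_with_preference(points, preference):
    """Negative squared distances off the diagonal, the preference on it."""
    n = len(points)
    return [[preference if i == k
             else -sum((a - b) ** 2 for a, b in zip(points[i], points[k]))
             for k in range(n)] for i in range(n)]


def affinity_propagation(S, damping=0.5, iterations=200):
    """Standard AP message passing; returns exemplar indices and per-point labels."""
    n = len(S)
    R = [[0.0] * n for _ in range(n)]   # responsibilities r(i, k)
    A = [[0.0] * n for _ in range(n)]   # availabilities a(i, k)
    for _ in range(iterations):
        for i in range(n):
            for k in range(n):
                # Strongest competing candidate exemplar for point i.
                rival = max(A[i][j] + S[i][j] for j in range(n) if j != k)
                R[i][k] = damping * R[i][k] + (1 - damping) * (S[i][k] - rival)
        for k in range(n):
            support = [max(0.0, R[i][k]) for i in range(n)]
            total = sum(support) - support[k]   # positive support from points other than k
            for i in range(n):
                if i == k:
                    new = total
                else:
                    new = min(0.0, R[k][k] + total - support[i])
                A[i][k] = damping * A[i][k] + (1 - damping) * new
    score = [R[k][k] + A[k][k] for k in range(n)]
    exemplars = [k for k in range(n) if score[k] > 0]
    if not exemplars:                   # fallback so every point gets a label
        exemplars = [max(range(n), key=lambda k: score[k])]
    labels = [i if i in exemplars else max(exemplars, key=lambda k: S[i][k])
              for i in range(n)]
    return exemplars, labels


# Hypothetical OD points forming two loose groups.
points = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0), (10.0, 0.0), (11.0, 0.0), (12.0, 0.0)]
S = similarity_with_preference(points, preference=-81.0)
exemplars, labels = affinity_propagation(S)
```

A lower (more negative) preference generally yields fewer exemplars, which is why adapting it matters when the number of depots is the quantity of interest. This naive version costs O(n^3) per sweep and is meant only for small illustrations.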


Minerals ◽  
2021 ◽  
Vol 11 (10) ◽  
pp. 1092
Author(s):  
Yuansheng Jiang ◽  
Ying Guo ◽  
Yufei Zhou ◽  
Xiang Li ◽  
Simin Liu

Chrysoprase is a popular gemstone with consumers because of its charming apple green colour but a scientific classification of its colour has not yet been achieved. In this research, we determined the most effective background of the Munsell Chart for chrysoprase colour grading under a 6504 K fluorescent lamp and applied an affinity propagation (AP) clustering algorithm to the colour grading of coloured gems for the first time. Forty gem-quality chrysoprase samples from Australia were studied using a UV-VIS spectrophotometer and Munsell neutral grey backgrounds. The results determined the effects of a Munsell neutral grey background on the observed colour. It was found that the Munsell N9.5 background was the most effective for colour grading in this case. The observed chrysoprase colours were classified into five groups: Fancy Light, Fancy, Fancy Intense, Fancy Deep and Fancy Dark. The feasibility of the colour grading scheme was verified using the colour difference formula DE2000.


Author(s):  
Novendri Isra Asriny ◽  
Muhammad Muhajir ◽  
Devi Andrian

There has been a significant increase in the number of part-time workers in the last 3 years. Data collected from Sakernas BPS showed that the number of part-time workers was 125,443,748 in the second period of 2016. This number increased in the same period of 2017, 2018, and 2019, to 128,062,746, 131,005,641, and 133,560,880 workers, respectively. Over these 3 years, East Java province has had the highest number of part-time workers who use the internet. This research aims to determine the number of part-time workers who use the internet by using k-affinity propagation (K-AP) clustering. The method builds on affinity propagation (AP), which is used to produce the optimal cluster points (exemplars). Three clusters were used to determine the sum of the smallest value ratio. The results showed that clusters 1, 2, and 3 have 3, 23, and 5 members in the Bondowoso, Jombang, and Surabaya districts, respectively.


2021 ◽  
Vol 13 (16) ◽  
pp. 8957
Author(s):  
Yajun Zhang ◽  
Jie Deng ◽  
Kangkang Zhu ◽  
Yongqiang Tao ◽  
Xiaolin Liu ◽  
...  

With the escalating tension between the growing demand for electric buses and the limited resources cities have for deploying charging infrastructure, it is a great challenge for decision-makers to plan the locations and decide on the expansion sequence of electric charging stations. Because location decisions for charging stations have long-term impacts on the deployment of electric buses and the layout of city traffic networks, a comprehensive framework that both plans the locations and decides on the expansion of electric bus charging stations should be developed. In practice, construction or renovation of a charging station is limited by various factors, such as land resources, capital investment, and power grid load. Thus, it is necessary to develop an evaluation structure that combines these factors to provide integrated decision support for the location of bus charging stations. Against this background, this paper develops a gridded affinity propagation (AP) clustering algorithm that combines the strengths of the AP clustering algorithm and the map gridding rule to find the optimal candidate locations for electric bus charging stations by considering multiple factors such as land cost and traffic conditions. Based on the resulting candidate stations, an expansion sequence is proposed. In particular, a sequential expansion rule for planning the charging stations is proposed that considers the development trends of the charging demand. To verify the performance of the gridded AP clustering and the effectiveness of the proposed sequential expansion rule, an empirical investigation of Guiyang City, the capital of Guizhou province in China, is conducted. The results demonstrate that the proposed framework helps find optimal locations for electric bus charging stations and that the expansion sequence of these locations can be decided with less capital investment pressure. This research shows that the combination of gridded AP clustering and the proposed sequential expansion rule can systematically solve the problem of finding the optimal locations and deciding on the best expansion sequence for electric bus charging stations, which indicates that the proposed structure is practical and would benefit the government's long-term investment in electric bus station deployment.
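One way to read the "map gridding rule" is as a pre-aggregation step: demand points are snapped to square grid cells, and each occupied cell contributes one weighted candidate to the clustering. The sketch below shows only that aggregation step, under this assumption; the cell size, the coordinates, and the function names are hypothetical, and the abstract does not specify the actual rule.

```python
import math
from collections import defaultdict


def grid_cell(x, y, cell_size):
    """Integer (column, row) index of the square cell containing (x, y)."""
    return (math.floor(x / cell_size), math.floor(y / cell_size))


def aggregate_by_grid(points, cell_size):
    """Map each occupied cell to (centroid_x, centroid_y, point_count)."""
    cells = defaultdict(list)
    for x, y in points:
        cells[grid_cell(x, y, cell_size)].append((x, y))
    return {
        cell: (sum(p[0] for p in members) / len(members),
               sum(p[1] for p in members) / len(members),
               len(members))
        for cell, members in cells.items()
    }


# Hypothetical charging-demand locations in arbitrary map units.
demand_points = [(0.2, 0.3), (0.8, 0.9), (1.5, 0.1), (-0.5, 0.5)]
candidates = aggregate_by_grid(demand_points, cell_size=1.0)
```

Clustering the cell centroids instead of the raw points shrinks the similarity matrix, and per-cell attributes such as land cost could then be attached to each candidate.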


2015 ◽  
Vol 2015 ◽  
pp. 1-8 ◽  
Author(s):  
XiuLi Zhao ◽  
WeiXiang Xu

The affinity propagation (AP) algorithm, a novel clustering method, does not require users to specify the initial cluster centers in advance: it regards all data points equally as potential exemplars (cluster centers) and forms clusters purely from the degree of similarity among the data points. In many cases, however, a data set contains regions of different density, which means that the data are not distributed homogeneously. In such situations the AP algorithm cannot group the data points into ideal clusters. In this paper, we propose an extended AP clustering algorithm to deal with this problem. Our method has two steps: first, the data set is partitioned into several data density types according to the nearest-neighbor distance of each data point; then, the AP clustering method is applied separately to the data points of each density type. Two experiments are carried out to evaluate the performance of our algorithm: one uses an artificial data set and the other uses a real seismic data set. The experimental results show that our algorithm obtains groups more accurately than OPTICS and the original AP clustering algorithm.
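The first step, partitioning by nearest-neighbor distance, might look like the sketch below. It assumes the simplest case of two density types split by a single threshold; the abstract speaks of "several" types and does not say how the boundaries are chosen, so the threshold and the sample data are hypothetical.

```python
import math


def nearest_neighbor_distances(points):
    """Distance from each point to its closest other point (brute force)."""
    return [min(math.dist(p, q) for j, q in enumerate(points) if j != i)
            for i, p in enumerate(points)]


def partition_by_density(points, threshold):
    """Split points into a dense and a sparse group by nearest-neighbor distance."""
    distances = nearest_neighbor_distances(points)
    dense = [p for p, d in zip(points, distances) if d <= threshold]
    sparse = [p for p, d in zip(points, distances) if d > threshold]
    return dense, sparse


# Hypothetical data: a tight cluster plus a few widely spaced points.
points = [(0.0, 0.0), (0.1, 0.0), (0.2, 0.0), (0.1, 0.1),
          (10.0, 10.0), (15.0, 10.0), (10.0, 15.0)]
dense, sparse = partition_by_density(points, threshold=1.0)
```

Each group would then be clustered on its own, so that a single preference value no longer has to suit both the tight and the spread-out regions.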


2011 ◽  
Vol 2 (1) ◽  
Author(s):  
Vina Chovan Epifania ◽  
Eko Sediyono

Abstract. Image File Searching Based on Color Domination. One characteristic of an image that can be used in the image searching process is its color composition. Color is a trait that people easily see in a picture. Using color as a search parameter makes it easier to find images stored in computer memory. Color images have RGB values that can be computed and converted into the HSL color space model. The HSL model is easy to use because its values can be expressed as percentages, so each pixel of an image can be grouped and named, which yields the dominant color values of the image. With these values, an image search can be done quickly simply by supplying them to an image file retrieval system. This article discusses the use of the HSL color space model to facilitate searching for a digital image in a digital image data warehouse. Tests of the resulting application show that searching is faster when using colors specified by the user. A remaining obstacle is that searches using the 15 available basic colors with a color dominance threshold of 33% did not find the sought images. This is because the dominant color in most images has a dominance value below 33%. Keywords: RGB, HSL, image searching
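The pixel-naming and dominance idea can be sketched with Python's standard `colorsys` module. The 33% threshold comes from the abstract; the nine-name palette and the lightness and saturation cut-offs below are illustrative assumptions, not the article's 15 basic colors.

```python
import colorsys
from collections import Counter

# Hypothetical palette: six hue bins of 60 degrees plus three achromatic names.
HUE_NAMES = ["red", "yellow", "green", "cyan", "blue", "magenta"]


def color_name(r, g, b):
    """Name an 8-bit RGB pixel from its HSL values."""
    h, l, s = colorsys.rgb_to_hls(r / 255, g / 255, b / 255)   # note the H, L, S order
    if l < 0.1:
        return "black"
    if l > 0.9:
        return "white"
    if s < 0.15:
        return "gray"
    # Shift by 30 degrees so each bin is centered on its primary or secondary hue.
    return HUE_NAMES[int((h * 360 + 30) // 60) % 6]


def dominant_colors(pixels, threshold=0.33):
    """Return {name: share} for every color whose share reaches the threshold."""
    counts = Counter(color_name(*p) for p in pixels)
    total = len(pixels)
    return {name: n / total for name, n in counts.items() if n / total >= threshold}


mostly_red = [(255, 0, 0)] * 5 + [(0, 0, 255)] * 3 + [(0, 255, 0)] * 2
evenly_mixed = [(255, 0, 0), (0, 255, 0), (0, 0, 255), (255, 255, 0)]
print(dominant_colors(mostly_red))     # {'red': 0.5}
print(dominant_colors(evenly_mixed))   # {}
```

The second call illustrates the obstacle reported above: when no single color covers at least 33% of the pixels, a search keyed on a dominant color returns nothing for that image.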


2020 ◽  
Vol 15 ◽  
pp. 155892502097832
Author(s):  
Jiaqin Zhang ◽  
Jingan Wang ◽  
Le Xing ◽  
Hui’e Liang

As a precious cultural heritage of the Chinese nation, traditional costumes are in urgent need of scientific research and protection. In particular, studies on costume silhouettes are scarce, because the need to protect cultural relics and the strong subjectivity of manual measurement limit the accuracy of quantitative research. This paper presents an automatic measurement method for traditional Chinese costume dimensions based on fuzzy C-means (FCM) clustering and silhouette feature point location. The method consists of six steps: (1) costume image acquisition; (2) costume image preprocessing; (3) color space transformation; (4) object clustering segmentation; (5) costume silhouette feature point location; and (6) costume measurement. First, the relative total variation model was used to obtain environmental robustness and adaptability to costume color. Second, the FCM clustering algorithm was used to segment the image and extract the outer silhouette of the costume. Finally, automatic measurement of the costume silhouette was achieved by locating its feature points. The experimental results demonstrated that the proposed method could effectively segment the outer silhouette of a costume image and locate the feature points of the silhouette. The measurement accuracy could meet the requirements of industrial application, thus providing value for both costume culture research and industrial application.
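The FCM step can be illustrated with the standard update rules on one-dimensional intensities: each value gets a soft membership in every cluster, and each center is the membership-weighted mean. The fuzzifier `m = 2`, the initial centers, and the sample intensities are assumptions for illustration; the paper's actual features and settings are not given in the abstract.

```python
def fcm_memberships(values, centers, m=2.0, eps=1e-12):
    """u[j][i]: degree to which values[j] belongs to centers[i]; each row sums to 1."""
    exponent = 2.0 / (m - 1.0)
    rows = []
    for x in values:
        dists = [abs(x - c) + eps for c in centers]   # eps avoids division by zero
        rows.append([1.0 / sum((d_i / d_k) ** exponent for d_k in dists)
                     for d_i in dists])
    return rows


def fuzzy_c_means(values, centers, m=2.0, iterations=50):
    """Alternate membership and center updates for a fixed number of iterations."""
    centers = list(centers)
    for _ in range(iterations):
        u = fcm_memberships(values, centers, m)
        centers = [
            sum((row[i] ** m) * x for row, x in zip(u, values))
            / sum(row[i] ** m for row in u)
            for i in range(len(centers))
        ]
    return centers, fcm_memberships(values, centers, m)


# Hypothetical gray levels: dark costume pixels and bright background pixels.
intensities = [10.0, 12.0, 11.0, 200.0, 205.0, 198.0]
centers, memberships = fuzzy_c_means(intensities, centers=[0.0, 255.0])
# Hard segmentation: assign each pixel to its highest-membership cluster.
segments = [row.index(max(row)) for row in memberships]
```

Unlike K-means, pixels near a boundary keep partial membership in both clusters, which can make the extracted silhouette less sensitive to noise at the costume edge.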


2021 ◽  
Vol 15 (6) ◽  
pp. 1-18
Author(s):  
Kai Liu ◽  
Xiangyu Li ◽  
Zhihui Zhu ◽  
Lodewijk Brand ◽  
Hua Wang

Nonnegative Matrix Factorization (NMF) is broadly used to determine class membership in a variety of clustering applications. From movie recommendations and image clustering to visual feature extraction, NMF has been applied to a large number of knowledge discovery and data mining problems. Traditional optimization methods, such as the Multiplicative Updating Algorithm (MUA), solve the NMF problem by utilizing an auxiliary function to ensure that the objective monotonically decreases. Although the objective in MUA converges, there exists no proof that the learned matrix factors converge as well. Without this rigorous analysis, the clustering performance and stability of NMF algorithms cannot be guaranteed. To address this knowledge gap, in this article we study the factor-bounded NMF problem and provide a solution algorithm with convergence proven by rigorous mathematical analysis, which ensures that both the objective and the matrix factors converge. In addition, we show the relationship between MUA and our solution, followed by an analysis of the convergence of MUA. Experiments on both toy data and real-world datasets validate the correctness of our proposed method and its utility as an effective clustering algorithm.
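For reference, the classical multiplicative updates for the squared-error objective can be sketched in a few lines. This is the traditional MUA baseline mentioned above, not the factor-bounded algorithm the article proposes. The toy matrix, the rank, the random initialization, and the small `eps` added to the denominators are assumptions.

```python
import random


def matmul(A, B):
    """Product of two matrices stored as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]


def transpose(A):
    return [list(col) for col in zip(*A)]


def squared_error(V, W, H):
    """Squared Frobenius norm of V - WH, the objective being minimized."""
    WH = matmul(W, H)
    return sum((v - wh) ** 2 for rv, rwh in zip(V, WH) for v, wh in zip(rv, rwh))


def multiplicative_update(V, W, H, eps=1e-12):
    """One sweep: H <- H * (W^T V) / (W^T W H), then W <- W * (V H^T) / (W H H^T)."""
    Wt = transpose(W)
    num, den = matmul(Wt, V), matmul(matmul(Wt, W), H)
    H = [[h * n / (d + eps) for h, n, d in zip(rh, rn, rd)]
         for rh, rn, rd in zip(H, num, den)]
    Ht = transpose(H)
    num, den = matmul(V, Ht), matmul(W, matmul(H, Ht))
    W = [[w * n / (d + eps) for w, n, d in zip(rw, rn, rd)]
         for rw, rn, rd in zip(W, num, den)]
    return W, H


def nmf(V, rank, iterations=200, seed=0):
    """Run the updates from a positive random start; also return the error history."""
    rng = random.Random(seed)
    W = [[rng.random() + 0.1 for _ in range(rank)] for _ in V]
    H = [[rng.random() + 0.1 for _ in V[0]] for _ in range(rank)]
    errors = [squared_error(V, W, H)]
    for _ in range(iterations):
        W, H = multiplicative_update(V, W, H)
        errors.append(squared_error(V, W, H))
    return W, H, errors


# Toy data with two obvious row groups.
V = [[5.0, 4.0, 0.0, 0.0], [4.0, 5.0, 0.0, 0.0],
     [0.0, 0.0, 3.0, 4.0], [0.0, 0.0, 4.0, 3.0]]
W, H, errors = nmf(V, rank=2)
# Cluster assignment: the column of W with the largest weight for each row of V.
row_clusters = [row.index(max(row)) for row in W]
```

The error history is non-increasing up to rounding, which is the objective-level guarantee the abstract refers to; nothing in these updates by itself shows that `W` and `H` settle to fixed matrices, which is the gap the article addresses.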

