A Hybrid Strategy for Clustering Data Mining Documents

Penerapan Data Mining Untuk Prediksi Penjualan Mobil Menggunakan Metode K-Means Clustering

Jurnal Nasional Komputasi dan Teknologi Informasi (JNKTI) ◽

10.32672/jnkti.v3i3.2428 ◽

2020 ◽

Vol 3 (3) ◽

pp. 187-201

Author(s):

Sufajar Butsianto ◽

Nindi Tya Mayangwulan

Keyword(s):

Data Mining ◽

Clustering Data ◽

Cluster 2

Penggunaan mobil di Indonesia setiap tahunnya selalu meningkat dan membuat perusahaan otomotif berlomba-lomba dalam peningkatan penjualannya. Tujuan dari penelitian ini untuk mengelompokan data penjualan kedalam sebuah cluster dengan metode Data Mining Algoritma K-Means Clustering. Data Penjualan nantinya akan dikelompokan berdasarkan kemiripan data tersebut sehingga data dengan karakteristik yang sama akan berada dalam satu cluster. Atribut yang digunakan adalah brand dan penjualan. Cluster yang terbentuk setelah dilakukan proses K-Means Clustering terbagi menjadi tiga cluster yaitu Cluster 0 jumlah anggota 235 dengan presentase 26% dikategorikan Laris, Cluster 1 jumlah anggota 604 dengan presentase 67% dikategorikan Kurang Laris, dan Cluster 2 jumlah angota 61 dengan presentase 7% dikategorikan Paling Laris, dari proses clustering diatas dapat diperoleh validasi DBI (Davies Bouldin Index) dengan nilai 0,341

Download Full-text

K-MEANS CLUSTERING ALGORITHM FOR SERVICE DATA ANALYSIS BASED ON CUSTOMERS COMBINATION

Unes journal of Information System ◽

10.31933/ujis.3.1.001-007.2018 ◽

2018 ◽

Vol 3 (1) ◽

pp. 001

Author(s):

Zulhendra Zulhendra ◽

Gunadi Widi Nurcahyo ◽

Julius Santony

Keyword(s):

Data Mining ◽

Data Analysis ◽

Clustering Algorithm ◽

Customer Complaints ◽

Using Data ◽

Clustering Data ◽

Service Data ◽

Selection Of

In this study using Data Mining, namely K-Means Clustering. Data Mining can be used in searching for a large enough data analysis that aims to enable Indocomputer to know and classify service data based on customer complaints using Weka Software. In this study using the algorithm K-Means Clustering to predict or classify complaints about hardware damage on Payakumbuh Indocomputer. And can find out the data of Laptop brands most do service on Indocomputer Payakumbuh as one of the recommendations to consumers for the selection of Laptops.

Download Full-text

IMPLEMENTASI DATA MINING UNTUK MENENTUKAN TINGKAT PENJUALAN PAKET DATA TELKOMSEL MENGGUNAKAN METODE K-MEANS CLUSTERING

Jurnal Ilmiah Teknologi dan Rekayasa ◽

10.35760/tr.2020.25i1.2677 ◽

2020 ◽

Vol 25 (1) ◽

pp. 76-88

Author(s):

Suhandio Handoko ◽

Fauziah Fauziah ◽

Endah Tri Esti Handayani

Keyword(s):

Data Mining ◽

Clustering Data

Perkembangan industri telekomunikasi saat ini sangat pesat karena telekomunikasi sudah menjadi kebutuhan utama bagi masyarakat sehingga banyak perusahaan yang bergerak di industry telekomunikasi. Banyaknya industry Telekomunikasi menuntut para pengembang untuk menemukan strategi atau suatu pola yang dapat meningkatkan penjualan dan pemasaran produk, salah satu strateginya adalah dengan memanfaatkan data transaksi. Paket data merupakan produk dibidang telekomunikasi. Proses Clustering saat ini masih di lakukan secara manual sehingga membutuhkan waktu, proses perhitungan dan ketelitian yang tinggi. Pada penelitian ini dibuat aplikasi berbasis website dengan tujuan untuk mempermudah Clustering data sehingga dapat digunakan sebagai referensi dalam perencanaan promosi produk telkomsel ke berbagai daerah. Metode yang digunakan untuk mengatasi permasalahan tersebut yaitu metode Clustering dengan menggunakan Algoritma K-Means. Algoritma K-Means merupakan algoritma pengelompokkan sejumlah data menjadi menjadi kelompok-kelompok data tertentu. Pada penelitian ini data penjualan dikelompokkan menjadi 3 yaitu data penjualan rendah, data penjualan sedang dan data penjualan tinggi. Pengujian clustering dengan algoritma K-Means pada aplikasi terhadap data transaksi penjualan paket telkomsel diperoleh persentase kesesuaian yaitu 100% dibandingkan dengan clustering manual.

Download Full-text

Clustering Fasilitas Kesehatan Berdasarkan Kecamatan Di Karawang Dengan Algoritma K-Means

BINA INSANI ICT JOURNAL ◽

10.51211/biict.v8i1.1488 ◽

2021 ◽

Vol 8 (1) ◽

pp. 83

Author(s):

Bagus Muhammad Islami ◽

Cepy Sukmayadi ◽

Tesa Nur Padilah

Keyword(s):

Data Mining ◽

Developing Countries ◽

Data Processing ◽

Health Problems ◽

Health Facilities ◽

Microsoft Excel ◽

Two Factors ◽

Clustering Data ◽

Index Value ◽

Cluster 2

Abstrak: Masalah kesehatan yang ada di dalam masyarakat terutama di negara- negara berkembang seperti Indonesia dipengaruhi oleh dua faktor yaitu aspek fisik dan aspek non fisik. Berdasarkan data yang diperoleh dari karawangkab.bps.go.id data dibagi menjadi 3 cluster yaitu sedikit, sedang dan terbanyak. Algoritma yang digunakan adalah K-Means cluster yang diimplementsikan menggunakan Microsoft Excel dan Rapidminer Studio. Hasil pengolahan data fasilitas kesehatan di karawang menghasilkan 3 cluster dengan cluster 1 yang mempunyai fasilitas kesehatan sedikit sebanyak 23 kecamatan, cluster 2 yang mempunyai fasilitas kesehatan sedang sebanyak 5 kecamatan dan cluster 3 yang mempunyai fasilitas kesehatan terbanyak terdapat 2 kecamatan. Kinerja yang dihasilkan dari algoritma K-means menghasilkan nilai Davies Boildin Index sebesar 0,109. Kata kunci: clustering, data mining, fasilitas kesehatan, K-Means. Abstract: Health problems that exist in society, especially in developing countries like Indonesia, are built by two factors, namely physical and non-physical aspects. Based on data obtained from karawangkab.bps.go.id the data is divided into 3 clusters, namely the least, medium and the most. The algorithm used is the K-Means cluster which is implemented using Microsoft Excel and Rapidminer Studio. The results of data processing of health facilities in Karawang produce 3 clusters with cluster 1 which has 23 sub-districts of health facilities, cluster 2 which has medium health facilities as many as 5 districts and cluster 3 which has the most health facilities in 2 districts. The performance resulting from the K-means algorithm results in a Davies Boildin Index value of 0.109. Keywords: clustering, data mining, health facilities, K-Means.

Download Full-text

An analysis on the impact of fluoride in human health (dental) using clustering data mining technique

International Conference on Pattern Recognition, Informatics and Medical Engineering (PRIME-2012) ◽

10.1109/icprime.2012.6208374 ◽

2012 ◽

Cited By ~ 7

Author(s):

T. Balasubramanian ◽

R. Umarani

Keyword(s):

Data Mining ◽

Human Health ◽

Data Mining Technique ◽

Mining Technique ◽

Clustering Data ◽

The Impact

Download Full-text

Ensemble Clustering Data Mining and Databases

Encyclopedia of Information Science and Technology, Fourth Edition ◽

10.4018/978-1-5225-2255-3.ch170 ◽

2018 ◽

pp. 1962-1973

Author(s):

Slawomir T. Wierzchon

Keyword(s):

Data Mining ◽

Data Structure ◽

Em Algorithm ◽

Normal Distribution ◽

Clustering Algorithms ◽

Consensus Clustering ◽

New Directions ◽

Consensus Procedure ◽

Basic Approaches ◽

Clustering Data

Standard clustering algorithms employ fixed assumptions about data structure. For instance, the k-means algorithm is applicable for spherical and linearly separable data clouds. When the data come from multidimensional normal distribution – so-called EM algorithm can be applied. But in practice the assumptions underlying given set of observations are too complex to fit into a single assumption. We can split these assumptions into manageable hypothesis justifying the use of particular clustering algorithms. Then we must aggregate partial results into a meaningful description of our data. The consensus clustering do this task. In this article we clarify the idea of consensus clustering, and we present conceptual frames for such a compound analysis. Next the basic approaches to implement consensus procedure are given. Finally, some new directions in this field are mentioned.

Download Full-text

A Multiagent System (MAS) for the Generation of Initial Centroids for kmeans Clustering Data Mining Algorithm based on Actual Sample Datapoints

Journal of Next Generation Information Technology ◽

10.4156/jnit.vol1.issue2.8 ◽

2010 ◽

Vol 1 (2) ◽

pp. 85-95

Author(s):

Dost Muhammad Khan ◽

Nawaz Mohamudally

Keyword(s):

Data Mining ◽

Multiagent System ◽

Data Mining Algorithm ◽

Actual Sample ◽

Mining Algorithm ◽

Clustering Data

Download Full-text

Data mining source code to facilitate program comprehension: experiments on clustering data retrieved from C++programs

Proceedings. 12th IEEE International Workshop on Program Comprehension, 2004. ◽

10.1109/wpc.2004.1311063 ◽

2004 ◽

Cited By ~ 6

Author(s):

Y. Kanellopoulos ◽

C. Tjortjis

Keyword(s):

Data Mining ◽

Source Code ◽

Program Comprehension ◽

C Programs ◽

Clustering Data

Download Full-text

Integrating Clustering Data Mining into the Multidimensional Modeling of Data Warehouses with UML Profiles

Data Warehousing and Knowledge Discovery - Lecture Notes in Computer Science ◽

10.1007/978-3-540-74553-2_18 ◽

2007 ◽

pp. 199-208 ◽

Cited By ~ 8

Author(s):

Jose Zubcoff ◽

Jesús Pardillo ◽

Juan Trujillo

Keyword(s):

Data Mining ◽

Data Warehouses ◽

Multidimensional Modeling ◽

Uml Profiles ◽

Clustering Data

Download Full-text

Improving Multiobjective Multidisciplinary Optimization With a Data Mining-Based Hybrid Method

Volume 2B: 41st Design Automation Conference ◽

10.1115/detc2015-47361 ◽

2015 ◽

Author(s):

Hongyi Xu ◽

Ching-Hung Chuang ◽

Ren-Jye Yang

Keyword(s):

Data Mining ◽

Bias Correction ◽

Multidisciplinary Design Optimization ◽

Multidisciplinary Optimization ◽

Side Impact ◽

Computational Time ◽

Hybrid Strategy ◽

Clustering And Classification ◽

Product Design Process ◽

Optimization Search

Multiobjective, multidisciplinary design optimization (MDO) of complex system is challenging due to the long computational time needed for evaluating new designs’ performances. Heuristic optimization algorithms are widely employed to overcome the local optimums, but the inherent randomness of such algorithms leads to another disadvantage: the need for a large number of design evaluations. To accelerate the product design process, a data mining-based hybrid strategy is developed to improve the search efficiency. Based on the historical information of the optimization search, clustering and classification techniques are employed to detect low quality designs and repetitive designs, and which are then replaced by promising designs. In addition, the metamodel with bias correction is integrated into the proposed strategy to further increase the probability of finding promising design regions within a limited number of design evaluations. Two case studies, one mathematical benchmark problem and one vehicle side impact design problem, are conducted to demonstrate the effectiveness of the proposed method in improving the searching efficiency.

Download Full-text