Bibliographic Details
| Title: |
Variance-k-means++: A deterministic centroid initialization method based on variance for enhanced clustering stability. |
| Authors: |
Widodo1,2 widodo@unj.ac.id, Ramadhan, Jiel Vayyad1,2 jielvayad261102@gmail.com, Duskarnaen, Muhammad Ficky1 duskarnaen@unj.ac.id, Fauziastuti, Via Tuhamah1 viatuhamah@unj.ac.id, Pondayu, Chelsea Zaomi1,2 chelseazaomi9@gmail.com, Septianda, Mada Rekadarma1,2 mada.septianda@gmail.com |
| Source: |
International Journal of Electrical & Computer Engineering (2088-8708). Jun2026, Vol. 16 Issue 3, p1434-1448. 14p. |
| Subjects: |
K-means clustering, Centroid, Clustering algorithms, Optimization algorithms, Deviation (Statistics), Deterministic algorithms |
| Abstract: |
K-means++ is developed to improve the performance of k-means when choosing a starting centroid. However, both algorithms in clustering still select an initial centroid randomly. Randomly selecting initial centroids has the potential to produce unstable clusters. This paper proposes a deterministic centroid initialization method called variance-k-means++, which utilizes statistical properties--mean and variance--to generate pseudo-centroids and derive initial centroids. The method aims to improve clustering stability and reduce the number of iterations. For the initial study, we used low-dimensional data to conduct the experiment series. Then, we employed two baseline methods for benchmarking, k-means and k-means++. The results show that variance-k-means++ outperformed the baseline method on average. Evaluating in Davies-Bouldin index (DBI) and convergence analysis, we obtained DBI values at 0.756 and 0,771 for vertical and horizontal variance k-means++ with Iris dataset. At the same time, baseline methods have 0.802 and 0.830 for k-means++ and k-means, respectively. In convergence analysis, the results are 5.158 for vertical and 5.474 for horizontal, while baseline methods are 9.000 and 8.842. The primary contribution of this study lies in its achievement of minimizing the number of iterations while enhancing cluster stability. [ABSTRACT FROM AUTHOR] |
|
Copyright of International Journal of Electrical & Computer Engineering (2088-8708) is the property of Institute of Advanced Engineering & Science and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.) |
| Database: |
Engineering Source |