top of page

Published Paper

1751-5858

International Journal of Intelligent Information and Database Systems

International Journal of Intelligent Information and Database Systems

K-means**–a fast and efficient K-means algorithms.

Nguyen Cuong Duc, and Trong Hai Duong.

DOI:
​Keywords:

Dương Trọng Hải

Hải Dương

Abstract

K-means often converges to a local optimum. In improved versions of K-means, k-means++ is well-known for achieving a rather optimum solution with its cluster initialisation strategy and high computational efficiency. Incremental K-means is recognised for its converging to the empirically global optimum but having a high complexity due to its stepping of the number of clusters K. The paper introduces K-means** with a doubling strategy on K. Additional techniques, including only doubling big enough clusters, stepping K for the last few values and searching on other candidates for the last K, are used to help K-means** have a complexity of O(K logK), which is lower than the complexity of incremental K-means, and still converge to empirically global optimum. On a set of synthesis and real datasets, K-means** archive the minimum results in almost of test cases. K-means** is much faster than incremental K-means and comparable with the speed of k-means++.

Nguyen Cuong Duc, and Trong Hai Duong. (2018), "K-means**–a fast and efficient K-means algorithms.", International Journal of Intelligent Information and Database Systems, 11, (1), pp. 27-45, DOI: 10.1504/IJIIDS.2018.091595

bottom of page