An Optimized Approach for k-means Clustering

Warning: Creating default object from empty value in /var/www/html/sandbox.ijcaonline.org/public_html/modules/mod_mostread/helper.php on line 79

Warning: Creating default object from empty value in /var/www/html/sandbox.ijcaonline.org/public_html/modules/mod_mostread/helper.php on line 79

Warning: Creating default object from empty value in /var/www/html/sandbox.ijcaonline.org/public_html/modules/mod_mostread/helper.php on line 79

Warning: Creating default object from empty value in /var/www/html/sandbox.ijcaonline.org/public_html/modules/mod_mostread/helper.php on line 79

Warning: Creating default object from empty value in /var/www/html/sandbox.ijcaonline.org/public_html/modules/mod_mostread/helper.php on line 79

Home Proceedings Qshine Number 1An Optimized Approach for k-means Clustering

Call for Paper - May 2015 Edition

IJCA solicits original research papers for the May 2015 Edition. Last date of manuscript submission is April 20, 2015. Read More

An Optimized Approach for k-means Clustering

IJCA Proceedings on 9th International ICST Conference on Heterogeneous Networking for Quality, Reliability, Security and Robustness 2013

QShine

Year of Publication: 2013

Authors:

Sadhana Tiwari

Tanu Solanki

Sadhana Tiwari and Tanu Solanki. Article: An Optimized Approach for k-means Clustering. IJCA Proceedings on 9th International ICST Conference on Heterogeneous Networking for Quality, Reliability, Security and Robustness 2013 QShine:5-7, December 2013. Full text available. BibTeX

@article{key:article,
	author = {Sadhana Tiwari and Tanu Solanki},
	title = {Article: An Optimized Approach for k-means Clustering},
	journal = {IJCA Proceedings on 9th International ICST Conference on Heterogeneous Networking for Quality, Reliability, Security and Robustness 2013},
	year = {2013},
	volume = {QShine},
	pages = {5-7},
	month = {December},
	note = {Full text available}
}

Abstract

Cluster analysis method is one of the most analytical methods of data mining. The method will directly influence the result of clustering. This paper discusses the standard of k-mean clustering and analyzes the shortcomings of standard k-means such as k-means algorithm calculates distance of each data point from each cluster centre. Calculating this distance in each iteration makes the algorithm of low efficiency. This paper introduces an optimized algorithm which solves this problem. This is done by introducing a simple data structure to store some information in every iteration and used this information in next iteration. The introduced algorithm does not require calculating the distance of each data point from each cluster centre in each iteration due to which running time of algorithm is saved. Experimental results show that the improved algorithm can efficiently improve the speed of clustering and accuracy by reducing the computational complexity of standard k-means algorithm.

References

T. Kanungo, D. M. Mount, N. Netanyahu, C. Piatko, R. Silverman, and A. Y. Wu, "An efficient k-means clustering algorithm: Analysis and implementation" IEEE Transaction Pattern Analysis and Machine Intelligence, 2002.
Bruce A. Maxwell, Frederic L. Pryor, Casey Smith, "Cluster analysis in cross-cultural research" World Cultures 13(1): 22-38, 2002.
Kiri Wagstaff and Claire Cardie Department of computer science, Cornell University, USA "Constrained k- means algorithm with background knowledge".
Thomas H. Cormen, Charles E. Leiserson, and Ronald L. Rivest, Introduction to Algorithms, Prentice Hall, 1990.
Anil K. Jain, M. N. Murty, P. J. Flynn, "Data Clustering: A Review," ACM Computing Surveys, 31(3): 264-323 (1999).
Anil K. Jain and Richard C. Dubes, Algorithms for Clustering Data, Prentice Hall (1988).
Ahmet Alken, Department of Electrical and Electronics Engineering, KSU, Turkey, "Use of K-means clustering in migraine detection by using EEG records under flash stimulation" International Journal of the Physical Sciences Vol. 6(4), pp. 641-650, 18 February, 2011

Index Terms

Computer Science

Data Mining

Keywords

Cluster Analysis K-means Clustering Kd-tree Lloyd's Algorithm Standard K-means Algorithm Constrained K-means Algorithm.

Most Read Research Articles

An Optimized Approach for k-means Clustering

Abstract

References

Index Terms

Keywords